SOTAVerified

Large Language Model

Papers

Showing 201-225 of 6097 papers

Title | Status | Hype
WIP: Large Language Model-Enhanced Smart Tutor for Undergraduate Circuit Analysis | - | 0
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements | Code | 1
MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning | - | 0
CAF-I: A Collaborative Multi-Agent Framework for Enhanced Irony Detection with Large Language Models | - | 0
SPBA: Utilizing Speech Large Language Model for Backdoor Attacks on Speech Classification Models | - | 0
Your Agent Can Defend Itself against Backdoor Attacks | - | 0
SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills | - | 0
LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | - | 0
Safe and Economical UAV Trajectory Planning in Low-Altitude Airspace: A Hybrid DRL-LLM Approach with Compliance Awareness | - | 0
DeepForm: Reasoning Large Language Model for Communication System Formulation | - | 0
JavelinGuard: Low-Cost Transformer Architectures for LLM Security | - | 0
LLM Unlearning Should Be Form-Independent | - | 0
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems | Code | 3
Event-Priori-Based Vision-Language Model for Efficient Visual Understanding | - | 0
Language-Grounded Hierarchical Planning and Execution with Multi-Robot 3D Scene Graphs | - | 0
MiniCPM4: Ultra-Efficient LLMs on End Devices | Code | 9
SpatialLM: Training Large Language Models for Structured Indoor Modeling | - | 0
Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations | - | 0
How Benchmark Prediction from Fewer Data Misses the Mark | Code | 0
An Intelligent Fault Self-Healing Mechanism for Cloud AI Systems via Integration of Large Language Models and Deep Reinforcement Learning | - | 0
Statistical Hypothesis Testing for Auditing Robustness in Language Models | - | 0
DeepVideo-R1: Video Reinforcement Fine-Tuning via Difficulty-aware Regressive GRPO | - | 0
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions | Code | 2
QA-LIGN: Aligning LLMs through Constitutionally Decomposed QA | - | 0
Cognitive Weave: Synthesizing Abstracted Knowledge with a Spatio-Temporal Resonance Graph | Code | 0

No leaderboard results yet.