SOTAVerified

Large Language Model

Papers

Showing 276300 of 6097 papers

TitleStatusHype
Reasoning-Table: Exploring Reinforcement Learning for Table ReasoningCode2
Compiler Optimization via LLM Reasoning for Efficient Model ServingCode2
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual FusionCode2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language ModelsCode2
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning EngineeringCode2
cadrille: Multi-modal CAD Reconstruction with Online Reinforcement LearningCode2
Zero-Shot Vision Encoder Grafting via LLM SurrogatesCode2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization AlgorithmsCode2
WINA: Weight Informed Neuron Activation for Accelerating Large Language Model InferenceCode2
Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel DecodingCode2
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
LifelongAgentBench: Evaluating LLM Agents as Lifelong LearnersCode2
Demystifying and Enhancing the Efficiency of Large Language Model Based Search AgentsCode2
Large Language Model Psychometrics: A Systematic Review of Evaluation, Validation, and EnhancementCode2
YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language ModelsCode2
MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning EngineeringCode2
DynamicRAG: Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented GenerationCode2
GuidedQuant: Large Language Model Quantization via Exploiting End Loss GuidanceCode2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D GenerationCode2
MemEngine: A Unified and Modular Library for Developing Advanced Memory of LLM-based AgentsCode2
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single TransformerCode2
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language modelCode2
SegEarth-R1: Geospatial Pixel Reasoning via Large Language ModelCode2
GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video SegmentationCode2
Show:102550
← PrevPage 12 of 244Next →

No leaderboard results yet.