SOTAVerified

Language Modeling

Papers

Showing 10011025 of 14182 papers

TitleStatusHype
Generating Benchmarks for Factuality Evaluation of Language ModelsCode2
VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-TuningCode2
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language ModelCode2
Generalized Interpolating Discrete DiffusionCode2
Generative Modeling for Mathematical DiscoveryCode2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement LearningCode2
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking CapabilitiesCode2
Virgo: A Preliminary Exploration on Reproducing o1-like MLLMCode2
Contrastive Decoding: Open-ended Text Generation as OptimizationCode2
Contrastive Search Is What You Need For Neural Text GenerationCode2
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning AbilitiesCode2
Generative Pre-trained Speech Language Model with Efficient Hierarchical TransformerCode2
Continuous Diffusion Model for Language ModelingCode2
ARAGOG: Advanced RAG Output GradingCode2
Forgetting Transformer: Softmax Attention with a Forget GateCode2
WaferLLM: Large Language Model Inference at Wafer ScaleCode2
Watch Every Step! LLM Agent Learning via Iterative Step-Level Process RefinementCode2
AgentSociety Challenge: Designing LLM Agents for User Modeling and Recommendation on Web PlatformsCode2
Formal Mathematics Statement Curriculum LearningCode2
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context ExamplesCode2
AgentSims: An Open-Source Sandbox for Large Language Model EvaluationCode2
FLAME: Financial Large-Language Model Assessment and Metrics EvaluationCode2
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language ModelsCode2
FIRST: Faster Improved Listwise Reranking with Single Token DecodingCode2
FLAIR: VLM with Fine-grained Language-informed Image RepresentationsCode2
Show:102550
← PrevPage 41 of 568Next →

No leaderboard results yet.