SOTAVerified

Language Modeling

Papers

Showing 10511075 of 14182 papers

TitleStatusHype
GPTailor: Large Language Model Pruning Through Layer Cutting and StitchingCode1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
Sampling from Your Language Model One Byte at a TimeCode1
RMIT-ADM+S at the SIGIR 2025 LiveRAG ChallengeCode1
SeqPE: Transformer with Sequential Position EncodingCode1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation TasksCode1
Diffusion Sequence Models for Enhanced Protein Representation and GenerationCode1
Towards Universal Offline Black-Box Optimization via Learning Language Model EmbeddingsCode1
SAFE: Finding Sparse and Flat Minima to Improve PruningCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language ModelCode1
POSS: Position Specialist Generates Better Draft for Speculative DecodingCode1
Period-LLM: Extending the Periodic Capability of Multimodal Large Language ModelCode1
Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series ForecastingCode1
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression RecognitionCode1
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality EvaluationCode1
ChatCFD: an End-to-End CFD Agent with Domain-specific Structured ThinkingCode1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language ModelsCode1
REAL-Prover: Retrieval Augmented Lean Prover for Mathematical ReasoningCode1
Pretraining Language Models to Ponder in Continuous SpaceCode1
Unifying Multimodal Large Language Model Capabilities and Modalities via Model MergingCode1
REARANK: Reasoning Re-ranking Agent via Reinforcement LearningCode1
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM CompressionCode1
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI WorldCode1
Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem SolvingCode1
Show:102550
← PrevPage 43 of 568Next →

No leaderboard results yet.