SOTAVerified

Language Modeling

Papers

Showing 1051-1100 of 14182 papers

Title | Status | Hype
GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Code | 1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Code | 1
RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Code | 1
Sampling from Your Language Model One Byte at a Time | Code | 1
SeqPE: Transformer with Sequential Position Encoding | Code | 1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Code | 1
Diffusion Sequence Models for Enhanced Protein Representation and Generation | Code | 1
Towards Universal Offline Black-Box Optimization via Learning Language Model Embeddings | Code | 1
SAFE: Finding Sparse and Flat Minima to Improve Pruning | Code | 1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Code | 1
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Code | 1
POSS: Position Specialist Generates Better Draft for Speculative Decoding | Code | 1
Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | Code | 1
Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | Code | 1
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | Code | 1
VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation | Code | 1
ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | Code | 1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | Code | 1
Pretraining Language Models to Ponder in Continuous Space | Code | 1
REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | Code | 1
Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | Code | 1
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | Code | 1
Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | Code | 1
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World | Code | 1
Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving | Code | 1
Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms | Code | 1
Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | Code | 1
RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning | Code | 1
ChemMLLM: Chemical Multimodal Large Language Model | Code | 1
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | Code | 1
Speculative Decoding Reimagined for Multimodal Large Language Models | Code | 1
U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | Code | 1
R3: Robust Rubric-Agnostic Reward Models | Code | 1
3D Visual Illusion Depth Estimation | Code | 1
Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | Code | 1
Unifying Segment Anything in Microscopy with Multimodal Large Language Model | Code | 1
Multi-Token Prediction Needs Registers | Code | 1
ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | Code | 1
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | Code | 1
Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | Code | 1
Symbolic Regression with Multimodal Large Language Models and Kolmogorov Arnold Networks | Code | 1
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | Code | 1
CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | Code | 1
WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | Code | 1
Visual Test-time Scaling for GUI Agent Grounding | Code | 1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Code | 1
Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding | Code | 1
PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Code | 1
LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Code | 1
LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement | Code | 1
Page 22 of 284

No leaderboard results yet.