SOTAVerified

Language Modeling

Papers

Showing 1326-1350 of 14182 papers

Title | Status | Hype
MobA: Multifaceted Memory-Enhanced Adaptive Planning for Efficient Mobile Task Automation | Code | 1
FIRE: Fact-checking with Iterative Retrieval and Verification | Code | 1
MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems | Code | 1
VividMed: Vision Language Model with Versatile Visual Grounding for Medicine | Code | 1
HerO at AVeriTeC: The Herd of Open Large Language Models for Verifying Real-World Claims | Code | 1
CREAM: Consistency Regularized Self-Rewarding Language Models | Code | 1
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models | Code | 1
TopoLM: brain-like spatio-functional organization in a topographic language model | Code | 1
FVEval: Understanding Language Model Capabilities in Formal Verification of Digital Hardware | Code | 1
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses | Code | 1
HARDMath: A Benchmark Dataset for Challenging Problems in Applied Mathematics | Code | 1
EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Code | 1
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning | Code | 1
Retraining-Free Merging of Sparse MoE via Hierarchical Clustering | Code | 1
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces | Code | 1
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents | Code | 1
Parameter-Efficient Fine-Tuning of State Space Models | Code | 1
Do Unlearning Methods Remove Information from Language Model Weights? | Code | 1
Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1
Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining | Code | 1
AgroGPT: Efficient Agricultural Vision-Language Model with Expert Tuning | Code | 1
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting | Code | 1
AuditWen:An Open-Source Large Language Model for Audit | Code | 1
Simplicity Prevails: Rethinking Negative Preference Optimization for LLM Unlearning | Code | 1
Training-free Diffusion Model Alignment with Sampling Demons | Code | 1

No leaderboard results yet.