SOTAVerified

Large Language Model

Papers

Showing 10761100 of 6097 papers

TitleStatusHype
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy PreservationCode1
LRSCLIP: A Vision-Language Foundation Model for Aligning Remote Sensing Image with Longer TextCode1
LLM-SR: Scientific Equation Discovery via Programming with Large Language ModelsCode1
LLMVA-GEBC: Large Language Model with Video Adapter for Generic Event Boundary CaptioningCode1
DOMINO: A Dual-System for Multi-step Visual Language ReasoningCode1
LLM Self Defense: By Self Examination, LLMs Know They Are Being TrickedCode1
LLMZip: Lossless Text Compression using Large Language ModelsCode1
LLMs and the Abstraction and Reasoning Corpus: Successes, Failures, and the Importance of Object-based RepresentationsCode1
LLM-Rubric: A Multidimensional, Calibrated Approach to Automated Evaluation of Natural Language TextsCode1
AuditWen:An Open-Source Large Language Model for AuditCode1
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
Do Large Language Model Benchmarks Test Reliability?Code1
LLMs Can Simulate Standardized Patients via Agent CoevolutionCode1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward ModelingCode1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical ReasoningCode1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing ConstraintsCode1
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language CorrectionsCode1
DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt EngineerCode1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language ModelCode1
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty AgentsCode1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future DirectionsCode1
DualAD: Dual-Layer Planning for Reasoning in Autonomous DrivingCode1
LMEye: An Interactive Perception Network for Large Language ModelsCode1
Lshan-1.0 Technical ReportCode1
AttributionBench: How Hard is Automatic Attribution Evaluation?Code1
Show:102550
← PrevPage 44 of 244Next →

No leaderboard results yet.