SOTAVerified

Large Language Model

Papers

Showing 1201–1210 of 6097 papers

Title | Status | Hype
DSGBench: A Diverse Strategic Game Benchmark for Evaluating LLM-based Agents in Complex Decision-Making Environments | Code | 0
No Free Labels: Limitations of LLM-as-a-Judge Without Human Grounding | - | 0
Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance | - | 0
SpecServe: Efficient and SLO-Aware Large Language Model Serving with Adaptive Speculative Decoding | - | 0
R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcement Learning | Code | 5
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval | Code | 2
This Is Your Doge, If It Please You: Exploring Deception and Robustness in Mixture of LLMs | Code | 0
LLM-based Iterative Approach to Metamodeling in Automotive | - | 0
DETQUS: Decomposition-Enhanced Transformers for QUery-focused Summarization | - | 0
GEMA-Score: Granular Explainable Multi-Agent Score for Radiology Report Evaluation | Code | 0

No leaderboard results yet.