SOTAVerified

Large Language Model

Papers

Showing 11761200 of 6097 papers

TitleStatusHype
Dissecting Human and LLM PreferencesCode1
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language CorrectionsCode1
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language ModelsCode1
ClusterLLM: Large Language Models as a Guide for Text ClusteringCode1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward PassesCode1
CoLLM: A Large Language Model for Composed Image RetrievalCode1
CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal ControlCode1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image SequencesCode1
CloudEval-YAML: A Practical Benchmark for Cloud Configuration GenerationCode1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resourcesCode1
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and CollaborationCode1
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language ModelCode1
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language ModelsCode1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQLCode1
C-LLM: Learn to Check Chinese Spelling Errors Character by CharacterCode1
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment AnalysisCode1
Common Sense Enhanced Knowledge-based Recommendation with Large Language ModelCode1
Empowering Large Language Model for Continual Video Question Answering with Collaborative PromptingCode1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language ModelsCode1
Detecting Hallucinations in Large Language Model Generation: A Token Probability ApproachCode1
Can Large Language Models Understand Molecules?Code1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and GenerationCode1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic SurgeryCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal DataCode1
Show:102550
← PrevPage 48 of 244Next →

No leaderboard results yet.