SOTAVerified

Large Language Model

Papers

Showing 1151–1200 of 6097 papers

Title | Status | Hype
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Code | 1
DOMINO: A Dual-System for Multi-step Visual Language Reasoning | Code | 1
Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time | Code | 1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving | Code | 1
MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception | Code | 1
MechAgents: Large language model multi-agent collaborations can solve mechanics problems, generate new data, and integrate knowledge | Code | 1
DrugAssist: A Large Language Model for Molecule Optimization | Code | 1
Do Large Language Model Benchmarks Test Reliability? | Code | 1
MedFILIP: Medical Fine-grained Language-Image Pre-training | Code | 1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation | Code | 1
A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Code | 1
Measuring General Intelligence with Generated Games | Code | 1
DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling | Code | 1
Code4Struct: Code Generation for Few-Shot Event Structure Prediction | Code | 1
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration | Code | 1
Matching Patients to Clinical Trials with Large Language Models | Code | 1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Code | 1
DiveR-CT: Diversity-enhanced Red Teaming Large Language Model Assistants with Relaxing Constraints | Code | 1
Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces | Code | 1
Dynamic Updates for Language Adaptation in Visual-Language Tracking | Code | 1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning | Code | 1
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems | Code | 1
Collaborative Large Language Model for Recommender Systems | Code | 1
Collaborative Retrieval for Large Language Model-based Conversational Recommender Systems | Code | 1
Distillation Matters: Empowering Sequential Recommenders to Match the Performance of Large Language Model | Code | 1
Dissecting Human and LLM Preferences | Code | 1
Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Code | 1
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models | Code | 1
ClusterLLM: Large Language Models as a Guide for Text Clustering | Code | 1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes | Code | 1
CoLLM: A Large Language Model for Composed Image Retrieval | Code | 1
CoLLMLight: Cooperative Large Language Model Agents for Network-Wide Traffic Signal Control | Code | 1
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Code | 1
CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation | Code | 1
Development and bilingual evaluation of Japanese medical large language model within reasonably low computational resources | Code | 1
MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Code | 1
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model | Code | 1
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models | Code | 1
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL | Code | 1
C-LLM: Learn to Check Chinese Spelling Errors Character by Character | Code | 1
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis | Code | 1
Common Sense Enhanced Knowledge-based Recommendation with Large Language Model | Code | 1
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting | Code | 1
Empowering Many, Biasing a Few: Generalist Credit Scoring through Large Language Models | Code | 1
Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach | Code | 1
Can Large Language Models Understand Molecules? | Code | 1
M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | Code | 1
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Code | 1
DesCo: Learning Object Recognition with Rich Language Descriptions | Code | 1
LUMA: A Benchmark Dataset for Learning from Uncertain and Multimodal Data | Code | 1