SOTAVerified

Large Language Model

Papers

Showing 12761300 of 6097 papers

TitleStatusHype
CityBench: Evaluating the Capabilities of Large Language Models for Urban TasksCode1
Detecting Hallucinations in Large Language Model Generation: A Token Probability ApproachCode1
MMIDR: Teaching Large Language Model to Interpret Multimodal Misinformation via Knowledge DistillationCode1
Multi-Modal Classifiers for Open-Vocabulary Object DetectionCode1
CITER: Collaborative Inference for Efficient Large Language Model Decoding with Token-Level RoutingCode1
Citekit: A Modular Toolkit for Large Language Model Citation GenerationCode1
Explaining Relationships Between Scientific DocumentsCode1
CIPHER: Cybersecurity Intelligent Penetration-testing Helper for Ethical ResearcherCode1
Aligning LLM Agents by Learning Latent Preference from User EditsCode1
Democratizing Reasoning Ability: Tailored Learning from Large Language ModelCode1
Controllable Dialogue Simulation with In-Context LearningCode1
DesCo: Learning Object Recognition with Rich Language DescriptionsCode1
PoisonBench: Assessing Large Language Model Vulnerability to Data PoisoningCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial OptimizationCode1
ASSISTGUI: Task-Oriented Desktop Graphical User Interface AutomationCode1
GenerateCT: Text-Conditional Generation of 3D Chest CT VolumesCode1
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree SearchCode1
PRD: Peer Rank and Discussion Improve Large Language Model based EvaluationsCode1
MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model FrameworkCode1
Working Memory Capacity of ChatGPT: An Empirical StudyCode1
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image AnalysisCode1
GIST: Generating Image-Specific Text for Fine-grained Object ClassificationCode1
Dataflow Analysis-Inspired Deep Learning for Efficient Vulnerability DetectionCode1
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human PreferencesCode1
Show:102550
← PrevPage 52 of 244Next →

No leaderboard results yet.