SOTAVerified

Large Language Model

Papers

Showing 351-375 of 6097 papers

| Title | Status | Hype |
| --- | --- | --- |
| LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts | Code | 2 |
| You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects | Code | 2 |
| Towards a Multimodal Large Language Model with Pixel-Level Insight for Biomedicine | Code | 2 |
| Granite Guardian | Code | 2 |
| LinVT: Empower Your Image-level Large Language Model to Understand Videos | Code | 2 |
| MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension | Code | 2 |
| OpenAD: Open-World Autonomous Driving Benchmark for 3D Object Detection | Code | 2 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Code | 2 |
| Large Language Model with Region-guided Referring and Grounding for CT Report Generation | Code | 2 |
| ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data | Code | 2 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Code | 2 |
| LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation | Code | 2 |
| Squeezed Attention: Accelerating Long Context Length LLM Inference | Code | 2 |
| StoryTeller: Improving Long Video Description through Global Audio-Visual Character Identification | Code | 2 |
| The Super Weight in Large Language Models | Code | 2 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Code | 2 |
| Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives | Code | 2 |
| V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference Optimization | Code | 2 |
| RAGViz: Diagnose and Visualize Retrieval-Augmented Generation | Code | 2 |
| Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge Graphs | Code | 2 |
| Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance | Code | 2 |
| Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Code | 2 |
| Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models | Code | 2 |
| SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation | Code | 2 |
| On the Role of Attention Heads in Large Language Model Safety | Code | 2 |
Page 15 of 244
