SOTAVerified

Large Language Model

Papers

Showing 351400 of 6097 papers

TitleStatusHype
LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input ContextsCode2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary ProjectsCode2
Towards a Multimodal Large Language Model with Pixel-Level Insight for BiomedicineCode2
Granite GuardianCode2
LinVT: Empower Your Image-level Large Language Model to Understand VideosCode2
HyperSeg: Towards Universal Visual Segmentation with Large Language ModelCode2
OpenAD: Open-World Autonomous Driving Benchmark for 3D Object DetectionCode2
MotionLLaMA: A Unified Framework for Motion Synthesis and ComprehensionCode2
Large Language Model with Region-guided Referring and Grounding for CT Report GenerationCode2
ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow DataCode2
BianCang: A Traditional Chinese Medicine Large Language ModelCode2
Squeezed Attention: Accelerating Long Context Length LLM InferenceCode2
LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language InterpretationCode2
StoryTeller: Improving Long Video Description through Global Audio-Visual Character IdentificationCode2
The Super Weight in Large Language ModelsCode2
LLM-PySC2: Starcraft II learning environment for Large Language ModelsCode2
Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and AlternativesCode2
V-DPO: Mitigating Hallucination in Large Vision Language Models via Vision-Guided Direct Preference OptimizationCode2
RAGViz: Diagnose and Visualize Retrieval-Augmented GenerationCode2
Plan-on-Graph: Self-Correcting Adaptive Planning of Large Language Model on Knowledge GraphsCode2
Protecting Privacy in Multimodal Large Language Models with MLLMU-BenchCode2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM GuidanceCode2
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language ModelsCode2
SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent EvaluationCode2
On the Role of Attention Heads in Large Language Model SafetyCode2
WeatherDG: LLM-assisted Diffusion Model for Procedural Weather Generation in Domain-Generalized Semantic SegmentationCode2
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse SamplingCode2
Mitigating Modality Prior-Induced Hallucinations in Multimodal Large Language Models via Deciphering Attention CausalityCode2
GenSim: A General Social Simulation Platform with Large Language Model based AgentsCode2
Robin3D: Improving 3D Large Language Model via Robust Instruction TuningCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet UpcyclingCode2
MaskLLM: Learnable Semi-Structured Sparsity for Large Language ModelsCode2
Control Industrial Automation System with Large Language Model AgentsCode2
Empirical Asset Pricing with Large Language Model AgentsCode2
Small Language Models: Survey, Measurements, and InsightsCode2
EEGUnity: Open-Source Tool in Facilitating Unified EEG Datasets Towards Large-Scale EEG ModelCode2
Archon: An Architecture Search Framework for Inference-Time TechniquesCode2
Diabetica: Adapting Large Language Model to Enhance Multiple Medical Tasks in Diabetes Care and ManagementCode2
AutoVerus: Automated Proof Generation for Rust CodeCode2
Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making FrameworkCode2
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model InitializationCode2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model ReasoningCode2
Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile InstructionsCode2
HESSO: Towards Automatic Efficient and User Friendly Any Neural Network Training and PruningCode2
LifeGPT: Topology-Agnostic Generative Pretrained Transformer Model for Cellular AutomataCode2
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression SegmentationCode2
Efficient LLM Scheduling by Learning to RankCode2
LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks YetCode2
Show:102550
← PrevPage 8 of 122Next →

No leaderboard results yet.