SOTAVerified

Large Language Model

Papers

Showing 626650 of 6097 papers

TitleStatusHype
LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMsCode1
Dataset Distillation via Vision-Language Category PrototypeCode1
Where, What, Why: Towards Explainable Driver Attention PredictionCode1
Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and GrounderCode1
GPTailor: Large Language Model Pruning Through Layer Cutting and StitchingCode1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and DiagnosisCode1
Evolving Prompts In-Context: An Open-ended, Self-replicating PerspectiveCode1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For DrivingCode1
The Condition Number as a Scale-Invariant Proxy for Information Encoding in Neural UnitsCode1
Probe before You Talk: Towards Black-box Defense against Backdoor Unalignment for Large Language ModelsCode1
LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling ResearchCode1
RMIT-ADM+S at the SIGIR 2025 LiveRAG ChallengeCode1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation TasksCode1
A Benchmark for Generalizing Across Diverse Team Strategies in Competitive PokémonCode1
Consistent Paths Lead to Truth: Self-Rewarding Reinforcement Learning for LLM ReasoningCode1
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image AnalysisCode1
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
Eigenspectrum Analysis of Neural Networks without Aspect Ratio BiasCode1
DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference AccelerationCode1
Agentomics-ML: Autonomous Machine Learning Experimentation Agent for Genomic and Transcriptomic DataCode1
OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language ModelCode1
POSS: Position Specialist Generates Better Draft for Speculative DecodingCode1
RewardAnything: Generalizable Principle-Following Reward ModelsCode1
DefenderBench: A Toolkit for Evaluating Language Agents in Cybersecurity EnvironmentsCode1
Period-LLM: Extending the Periodic Capability of Multimodal Large Language ModelCode1
Show:102550
← PrevPage 26 of 244Next →

No leaderboard results yet.