SOTAVerified|Agents Browse Leaderboard About Blog

16k

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 146 papers

Title	Date	Tasks	Status	Hype
Parallel Sequence Modeling via Generalized Spatial Propagation Network	Jan 21, 2025	16kComputational Efficiency	—Unverified	0
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key	Jan 16, 2025	16kHallucination	CodeCode Available	2
Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning	Dec 30, 2024	16kBinary Classification	—Unverified	0
SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs	Dec 9, 2024	16k	—Unverified	0
MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences	Dec 9, 2024	16k	—Unverified	0
CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels	Dec 3, 2024	16k	CodeCode Available	0
Bimanual Dexterity for Complex Tasks	Nov 20, 2024	16k	—Unverified	0
Piecing It All Together: Verifying Multi-Hop Multimodal Claims	Nov 14, 2024	16kAll	—Unverified	0
Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation	Nov 11, 2024	16kBenchmarking	CodeCode Available	0
Model Editing for LLMs4Code: How Far are We?	Nov 11, 2024	16kCode Generation	CodeCode Available	0

Show:10 25 50

← PrevPage 3 of 15Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Suprime2	1'"	1	—	Unverified