SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5851–5875 of 474278 papers

Title | Status | Hype
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model | Code | 2
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Code | 2
RI3D: Few-Shot Gaussian Splatting With Repair and Inpainting Diffusion Priors | Code | 2
EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing | Code | 2
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer | Code | 2
A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 | Code | 2
Unlocking Generalization Power in LiDAR Point Cloud Registration | Code | 2
3D Student Splatting and Scooping | Code | 2
Bayesian Prompt Flow Learning for Zero-Shot Anomaly Detection | Code | 2
VMBench: A Benchmark for Perception-Aligned Video Motion Generation | Code | 2
GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding | Code | 2
RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing | Code | 2
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models | Code | 2
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness | Code | 2
Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations | Code | 2
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video | Code | 2
KNighter: Transforming Static Analysis with LLM-Synthesized Checkers | Code | 2
PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop | Code | 2
Manify: A Python Library for Learning Non-Euclidean Representations | Code | 2
Teaching LMMs for Image Quality Scoring and Interpreting | Code | 2
Exploring the best way for UAV visual localization under Low-altitude Multi-view Observation Condition: a Benchmark | Code | 2
CombatVLA: An Efficient Vision-Language-Action Model for Combat Tasks in 3D Action Role-Playing Games | Code | 2
Alias-Free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space | Code | 2
Efficient Alignment of Unconditioned Action Prior for Language-conditioned Pick and Place in Clutter | Code | 2
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning | Code | 2
Page 235 of 18972