SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 66266650 of 474278 papers

TitleStatusHype
GPD-1: Generative Pre-training for DrivingCode2
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical ModalitiesCode2
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline DataCode2
Pix2Poly: A Sequence Prediction Method for End-to-end Polygonal Building Footprint Extraction from Remote Sensing ImageryCode2
MAGE: A Multi-Agent Engine for Automated RTL Code GenerationCode2
DriveMM: All-in-One Large Multimodal Model for Autonomous DrivingCode2
Exploring What Why and How: A Multifaceted Benchmark for Causation Understanding of Video AnomalyCode2
From an Image to a Scene: Learning to Imagine the World from a Million 360 VideosCode2
FlashRNN: Optimizing Traditional RNNs on Modern HardwareCode2
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative ModelsCode2
Granite GuardianCode2
Maya: An Instruction Finetuned Multilingual Multimodal ModelCode2
Video Motion Transfer with Diffusion TransformersCode2
Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular VideoCode2
Driv3R: Learning Dense 4D Reconstruction for Autonomous DrivingCode2
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark DatasetCode2
Tactile DreamFusion: Exploiting Tactile Sensing for 3D GenerationCode2
ProcessBench: Identifying Process Errors in Mathematical ReasoningCode2
ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement TasksCode2
Proactive Agents for Multi-Turn Text-to-Image Generation Under UncertaintyCode2
Retrieving Semantics from the Deep: an RAG Solution for Gesture SynthesisCode2
How to Merge Your Multimodal Models Over Time?Code2
Splatter-360: Generalizable 360^ Gaussian Splatting for Wide-baseline Panoramic ImagesCode2
Bridging the Divide: Reconsidering Softmax and Linear AttentionCode2
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any GranularityCode2
Show:102550
← PrevPage 266 of 18972Next →