SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 14511475 of 659983 papers

TitleStatusHype
VideoChat-Flash: Hierarchical Compression for Long-Context Video ModelingCode4
OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and ReasoningCode4
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference OptimizationCode4
Training Software Engineering Agents and Verifiers with SWE-GymCode4
MINIMA: Modality Invariant Image MatchingCode4
The Thousand Brains Project: A New Paradigm for Sensorimotor IntelligenceCode4
LLM4AD: A Platform for Algorithm Design with Large Language ModelCode4
Dora: Sampling and Benchmarking for 3D Shape Variational Auto-EncodersCode4
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous DrivingCode4
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from DemonstrationCode4
Dimension Reduction with Locally Adjusted GraphsCode4
SocialED: A Python Library for Social Event DetectionCode4
Autoregressive Video Generation without Vector QuantizationCode4
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall SpacesCode4
Neural general circulation models optimized to predict satellite-based precipitation observationsCode4
SepLLM: Accelerate Large Language Models by Compressing One Segment into One SeparatorCode4
DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid SpacesCode4
Towards Effective, Efficient and Unsupervised Social Event Detection in the Hyperbolic SpaceCode4
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned EncodersCode4
Video Seal: Open and Efficient Video WatermarkingCode4
Hidden Biases of End-to-End Driving DatasetsCode4
MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental LearningCode4
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow ModelsCode4
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse ViewpointsCode4
SAT: Dynamic Spatial Aptitude Training for Multimodal Language ModelsCode4
Show:102550
← PrevPage 59 of 26400Next →