SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 20012025 of 661570 papers

TitleStatusHype
TerraTorch: The Geospatial Foundation Models ToolkitCode4
Video-R1: Reinforcing Video Reasoning in MLLMsCode4
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion RefinementCode4
SpatialTrackerV2: 3D Point Tracking Made EasyCode4
Proactive Detection of Voice Cloning with Localized WatermarkingCode4
Eliciting Latent Predictions from Transformers with the Tuned LensCode4
REFINE: Inversion-Free Backdoor Defense via Model ReprogrammingCode4
Relationships are Complicated! An Analysis of Relationships Between Datasets on the WebCode4
Benchmarking Graphormer on Large-Scale Molecular Modeling DatasetsCode4
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic AlignmentCode4
SWE-Search: Enhancing Software Agents with Monte Carlo Tree Search and Iterative RefinementCode4
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal FormalizationCode4
Recurrent Partial Kernel Network for Efficient Optical Flow EstimationCode4
DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to RealityCode4
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
Are Transformers Effective for Time Series Forecasting?Code4
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning ModelsCode4
Repurposing Diffusion-Based Image Generators for Monocular Depth EstimationCode4
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-ShotCode4
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild VideosCode4
TableGPT2: A Large Multimodal Model with Tabular Data IntegrationCode4
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from DemonstrationCode4
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized SoundsCode4
MovieChat+: Question-aware Sparse Memory for Long Video Question AnsweringCode4
Show:102550
← PrevPage 81 of 26463Next →