SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 53515375 of 661570 papers

TitleStatusHype
Temporal Query Network for Efficient Multivariate Time Series ForecastingCode2
RBF++: Quantifying and Optimizing Reasoning Boundaries across Measurable and Unmeasurable Capabilities for Chain-of-Thought ReasoningCode2
CPRet: A Dataset, Benchmark, and Model for Retrieval in Competitive ProgrammingCode2
μPC: Scaling Predictive Coding to 100+ Layer NetworksCode2
MMAR: A Challenging Benchmark for Deep Reasoning in Speech, Audio, Music, and Their MixCode2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement LearningCode2
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language ModelsCode2
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly DetectionCode2
4Hammer: a board-game reinforcement learning environment for the hour long time frameCode2
Neurosymbolic Diffusion ModelsCode2
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated VideosCode2
Hybrid 3D-4D Gaussian Splatting for Fast Dynamic Scene RepresentationCode2
Optimizing Anytime Reasoning via Budget Relative Policy OptimizationCode2
Rethinking Features-Fused-Pyramid-Neck for Object DetectionCode2
AdaptThink: Reasoning Models Can Learn When to ThinkCode2
CSC-SQL: Corrective Self-Consistency in Text-to-SQL via Reinforcement LearningCode2
DD-Ranking: Rethinking the Evaluation of Dataset DistillationCode2
Learnware of Language Models: Specialized Small Language Models Can Do BigCode2
Degradation-Aware Feature Perturbation for All-in-One Image RestorationCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
Panda: A pretrained forecast model for universal representation of chaotic dynamicsCode2
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained OptimizationCode2
VideoRFT: Incentivizing Video Reasoning Capability in MLLMs via Reinforced Fine-TuningCode2
SLOT: Sample-specific Language Model Optimization at Test-timeCode2
GlobalGeoTree: A Multi-Granular Vision-Language Dataset for Global Tree Species ClassificationCode2
Show:102550
← PrevPage 215 of 26463Next →