SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 33013325 of 177340 papers

TitleStatusHype
Baichuan-Audio: A Unified Framework for End-to-End Speech InteractionCode3
CrossOver: 3D Scene Cross-Modal AlignmentCode3
Harnessing Multiple Large Language Models: A Survey on LLM EnsembleCode3
BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life PredictionCode3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous DrivingCode3
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question AnsweringCode3
Falcon: A Remote Sensing Vision-Language Foundation ModelCode3
A Survey on Latent ReasoningCode3
Vision-Speech Models: Teaching Speech Models to Converse about ImagesCode3
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal ConsistencyCode3
Vision-to-Music Generation: A SurveyCode3
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and BeyondCode3
AI2Agent: An End-to-End Framework for Deploying AI Projects as Autonomous AgentsCode3
Perception-R1: Pioneering Perception Policy with Reinforcement LearningCode3
Learning to Reason under Off-Policy GuidanceCode3
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented GenerationCode3
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based ReasoningCode3
Causal-learn: Causal Discovery in PythonCode3
Memory Layers at ScaleCode3
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert CacheCode3
Addressing the Abstraction and Reasoning Corpus via Procedural Example GenerationCode3
A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge GraphsCode3
Emergent World Models and Latent Variable Estimation in Chess-Playing Language ModelsCode3
GiT: Towards Generalist Vision Transformer through Universal Language InterfaceCode3
Champion Solution for the WSDM2023 Toloka VQA ChallengeCode3
Show:102550
← PrevPage 133 of 7094Next →