SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 2526–2550 of 661,570 papers

Title | Status | Hype
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization | Code | 3
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Code | 3
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up | Code | 3
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude Mobility | Code | 3
LLMs can see and hear without any training | Code | 3
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Code | 3
PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Code | 3
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models | Code | 3
Improved Denoising Diffusion Probabilistic Models | Code | 3
Pareto Front Approximation for Multi-Objective Session-Based Recommender Systems | Code | 3
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Code | 3
Stonefish: Supporting Machine Learning Research in Marine Robotics | Code | 3
Soundwave: Less is More for Speech-Text Alignment in LLMs | Code | 3
Slamming: Training a Speech Language Model on One GPU in a Day | Code | 3
AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha Decay | Code | 3
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMs | Code | 3
Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction | Code | 3
CrossOver: 3D Scene Cross-Modal Alignment | Code | 3
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble | Code | 3
BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction | Code | 3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Code | 3
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering | Code | 3
Falcon: A Remote Sensing Vision-Language Foundation Model | Code | 3
A Survey on Latent Reasoning | Code | 3
Vision-Speech Models: Teaching Speech Models to Converse about Images | Code | 3
Page 102 of 26,463