SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 32763300 of 177340 papers

TitleStatusHype
SonicSim: A customizable simulation platform for speech processing in moving sound source scenariosCode3
Multi-Level Speaker Representation for Target Speaker ExtractionCode3
PDL: A Declarative Prompt Programming LanguageCode3
Llama Scope: Extracting Millions of Features from Llama-3.1-8B with Sparse AutoencodersCode3
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context TrainingCode3
OSDFace: One-Step Diffusion Model for Face RestorationCode3
CityWalker: Learning Embodied Urban Navigation from Web-Scale VideosCode3
Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment BenchmarkingCode3
Prithvi-EO-2.0: A Versatile Multi-Temporal Foundation Model for Earth Observation ApplicationsCode3
Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual LocalizationCode3
Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection GuidanceCode3
CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers UpCode3
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude MobilityCode3
LLMs can see and hear without any trainingCode3
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMsCode3
PETR: Position Embedding Transformation for Multi-View 3D Object DetectionCode3
EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language ModelsCode3
Improved Denoising Diffusion Probabilistic ModelsCode3
Pareto Front Approximation for Multi-Objective Session-Based Recommender SystemsCode3
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem ProvingCode3
Stonefish: Supporting Machine Learning Research in Marine RoboticsCode3
Soundwave: Less is More for Speech-Text Alignment in LLMsCode3
Slamming: Training a Speech Language Model on One GPU in a DayCode3
AlphaAgent: LLM-Driven Alpha Mining with Regularized Exploration to Counteract Alpha DecayCode3
Emergent Misalignment: Narrow finetuning can produce broadly misaligned LLMsCode3
Show:102550
← PrevPage 132 of 7094Next →