SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2776–2800 of 661,570 papers

Title | Status | Hype
Flow Q-Learning | Code | 3
One Diffusion Step to Real-World Super-Resolution via Flow Trajectory Distillation | Code | 3
mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Code | 3
GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation | Code | 3
Safety at Scale: A Comprehensive Survey of Large Model Safety | Code | 3
Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective | Code | 3
M+: Extending MemoryLLM with Scalable Long-Term Memory | Code | 3
MambaGlue: Fast and Robust Local Feature Matching With Mamba | Code | 3
OneForecast: A Universal Framework for Global and Regional Weather Forecasting | Code | 3
Test-Time Training Scaling Laws for Chemical Exploration in Drug Design | Code | 3
Decoding-based Regression | Code | 3
Partially Rewriting a Transformer in Natural Language | Code | 3
Rethinking Early Stopping: Refine, Then Calibrate | Code | 3
LLMs can see and hear without any training | Code | 3
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models | Code | 3
Molecular Fingerprints Are Strong Models for Peptide Function Prediction | Code | 3
Sparser, Better, Faster, Stronger: Sparsity Detection for Efficient Automatic Differentiation | Code | 3
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting | Code | 3
DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation | Code | 3
Deformable Beta Splatting | Code | 3
Parametric Retrieval Augmented Generation | Code | 3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Code | 3
MedAgentBench: A Realistic Virtual EHR Environment to Benchmark Medical LLM Agents | Code | 3
One-Prompt-One-Story: Free-Lunch Consistent Text-to-Image Generation Using a Single Prompt | Code | 3
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Code | 3