SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 25012550 of 177339 papers

TitleStatusHype
A Common Interface for Automatic DifferentiationCode3
GameGen-X: Interactive Open-world Game Video GenerationCode3
Valley2: Exploring Multimodal Models with Scalable Vision-Language DesignCode3
Measuring AI Ability to Complete Long TasksCode3
Computation-Efficient Era: A Comprehensive Survey of State Space Models in Medical Image AnalysisCode3
InterpretML: A Unified Framework for Machine Learning InterpretabilityCode3
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation ModelsCode3
AlphaMath Almost Zero: Process Supervision without ProcessCode3
Impromptu VLA: Open Weights and Open Data for Driving Vision-Language-Action ModelsCode3
Normalizing Flows are Capable Generative ModelsCode3
3D Photography using Context-aware Layered Depth InpaintingCode3
SMILE: Zero-Shot Sparse Mixture of Low-Rank Experts Construction From Pre-Trained Foundation ModelsCode3
Focused Transformer: Contrastive Training for Context ScalingCode3
Deep Neural Networks for Encrypted Inference with TFHECode3
Foundations of Large Language ModelsCode3
MobileMamba: Lightweight Multi-Receptive Visual Mamba NetworkCode3
Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed TomographyCode3
EMCAD: Efficient Multi-scale Convolutional Attention Decoding for Medical Image SegmentationCode3
Expanding Language-Image Pretrained Models for General Video RecognitionCode3
WavChat: A Survey of Spoken Dialogue ModelsCode3
PirateNets: Physics-informed Deep Learning with Residual Adaptive NetworksCode3
FastViT: A Fast Hybrid Vision Transformer using Structural ReparameterizationCode3
Style Aligned Image Generation via Shared AttentionCode3
Camera Calibration via Circular Patterns: A Comprehensive Framework with Measurement Uncertainty and Unbiased Projection ModelCode3
emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose EstimationCode3
Separable Self-attention for Mobile Vision TransformersCode3
Safety at Scale: A Comprehensive Survey of Large Model SafetyCode3
ESRGAN: Enhanced Super-Resolution Generative Adversarial NetworksCode3
CBGBench: Fill in the Blank of Protein-Molecule Complex Binding GraphCode3
A Declarative System for Optimizing AI WorkloadsCode3
MaskGWM: A Generalizable Driving World Model with Video Mask ReconstructionCode3
How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary DetectionCode3
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative ModelsCode3
DETRs with Collaborative Hybrid Assignments TrainingCode3
Video-RAG: Visually-aligned Retrieval-Augmented Long Video ComprehensionCode3
SparseTSF: Modeling Long-term Time Series Forecasting with 1k ParametersCode3
SelfCodeAlign: Self-Alignment for Code GenerationCode3
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM ModelCode3
Scientific Large Language Models: A Survey on Biological & Chemical DomainsCode3
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence GenerationCode3
Language Models Are Greedy Reasoners: A Systematic Formal Analysis of Chain-of-ThoughtCode3
FramePainter: Endowing Interactive Image Editing with Video Diffusion PriorsCode3
Dopamine: A Research Framework for Deep Reinforcement LearningCode3
ModelScope Text-to-Video Technical ReportCode3
DocAgent: A Multi-Agent System for Automated Code Documentation GenerationCode3
Geometric-aware Pretraining for Vision-centric 3D Object DetectionCode3
Physics-Informed Diffusion ModelsCode3
An end-to-end strategy for recovering a free-form potential from a snapshot of stellar coordinatesCode3
MELODI: Exploring Memory Compression for Long ContextsCode3
Accelerating Production LLMs with Combined Token/Embedding SpeculatorsCode3
Show:102550
← PrevPage 51 of 3547Next →