SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 15261550 of 659983 papers

TitleStatusHype
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt OptimizationCode4
LLMs-as-Judges: A Comprehensive Survey on LLM-based Evaluation MethodsCode4
Multi-head Temporal Latent AttentionCode4
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning DatasetCode4
A Survey on Video Diffusion ModelsCode4
MEDITRON-70B: Scaling Medical Pretraining for Large Language ModelsCode4
Deep Residual Learning for Image RecognitionCode4
Multi-label Cluster Discrimination for Visual Representation LearningCode4
Craw4LLM: Efficient Web Crawling for LLM PretrainingCode4
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language ModelsCode4
MiMo-VL Technical ReportCode4
LightGlue: Local Feature Matching at Light SpeedCode4
Catastrophic Forgetting in Deep Learning: A Comprehensive TaxonomyCode4
FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow ModelsCode4
Deepfake Generation and Detection: A Benchmark and SurveyCode4
Easi3R: Estimating Disentangled Motion from DUSt3R Without TrainingCode4
Pytorch-Wildlife: A Collaborative Deep Learning Framework for ConservationCode4
Agent Q: Advanced Reasoning and Learning for Autonomous AI AgentsCode4
InceptionNeXt: When Inception Meets ConvNeXtCode4
Neural Network DiffusionCode4
BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and MiningCode4
Hierarchically Coherent Multivariate Mixture NetworksCode4
Self-Supervised Prompt OptimizationCode4
Mamba-FETrack: Frame-Event Tracking via State Space ModelCode4
Accelerating Data Processing and Benchmarking of AI Models for PathologyCode4
Show:102550
← PrevPage 62 of 26400Next →