SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1045110475 of 177340 papers

TitleStatusHype
Abstractive Summarization of Spoken andWritten Instructions with BERTCode2
Controlling Length in Image CaptioningCode2
An Inverse Scaling Law for CLIP TrainingCode2
Recipro-CAM: Fast gradient-free visual explanations for convolutional neural networksCode2
Focal Loss for Dense Object DetectionCode2
A Synthetic Dataset for Personal Attribute InferenceCode2
Scalable Benchmarking and Robust Learning for Noise-Free Ego-Motion and 3D Reconstruction from Noisy VideoCode2
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
FlowDiffuser: Advancing Optical Flow Estimation with Diffusion ModelsCode2
ARCLE: The Abstraction and Reasoning Corpus Learning Environment for Reinforcement LearningCode2
Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-Syncing DeepFakesCode2
Consistent-Teacher: Towards Reducing Inconsistent Pseudo-targets in Semi-supervised Object DetectionCode2
Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at ScaleCode2
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question AnsweringCode2
Swin Transformer: Hierarchical Vision Transformer using Shifted WindowsCode2
Self-Supervised Contrastive Pre-Training For Time Series via Time-Frequency ConsistencyCode2
FinTSB: A Comprehensive and Practical Benchmark for Financial Time Series ForecastingCode2
HGSFusion: Radar-Camera Fusion with Hybrid Generation and Synchronization for 3D Object DetectionCode2
ABodyBuilder3: Improved and scalable antibody structure predictionsCode2
TrustRAG: Enhancing Robustness and Trustworthiness in RAGCode2
Scaling Language-Image Pre-training via MaskingCode2
LtU-ILI: An All-in-One Framework for Implicit Inference in Astrophysics and CosmologyCode2
TODS: An Automated Time Series Outlier Detection SystemCode2
LLMs in the Imaginarium: Tool Learning through Simulated Trial and ErrorCode2
A Survey of Machine UnlearningCode2
Show:102550
← PrevPage 419 of 7094Next →