SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 42014225 of 177340 papers

TitleStatusHype
FlashFace: Human Image Personalization with High-fidelity Identity PreservationCode3
TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane NetworksCode3
CountGD: Multi-Modal Open-World CountingCode3
AudioSR: Versatile Audio Super-resolution at ScaleCode3
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual PretrainingCode3
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different ScenesCode3
CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial DomainsCode3
GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI AgentsCode3
Hierarchical Text-Conditional Image Generation with CLIP LatentsCode3
Self-QA: Unsupervised Knowledge Guided Language Model AlignmentCode3
Self-Discover: Large Language Models Self-Compose Reasoning StructuresCode3
Common Sense Reasoning for Deepfake DetectionCode3
Mosaic: An Architecture for Scalable & Interoperable Data ViewsCode3
The Unreasonable Ineffectiveness of the Deeper LayersCode3
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?Code3
Difference-in-Differences Estimation with Spatial SpilloversCode3
Prompting Is Programming: A Query Language for Large Language ModelsCode3
Scaling Instruction-Finetuned Language ModelsCode3
MagicDrive: Street View Generation with Diverse 3D Geometry ControlCode3
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video DiffusionCode3
A Survey on Causal Discovery Methods for I.I.D. and Time Series DataCode3
FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather ForecastingCode3
The Forward-Forward Algorithm: Some Preliminary InvestigationsCode3
Benchmarking Automatic Machine Learning FrameworksCode3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at ScaleCode3
Show:102550
← PrevPage 169 of 7094Next →