SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4,201-4,250 of 177,340 papers

Title | Status | Hype
TripNet: Learning Large-scale High-fidelity 3D Car Aerodynamics with Triplane Networks | Code | 3
CountGD: Multi-Modal Open-World Counting | Code | 3
AudioSR: Versatile Audio Super-resolution at Scale | Code | 3
UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining | Code | 3
Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dependent Concepts under Different Scenes | Code | 3
CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains | Code | 3
GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents | Code | 3
Hierarchical Text-Conditional Image Generation with CLIP Latents | Code | 3
Self-QA: Unsupervised Knowledge Guided Language Model Alignment | Code | 3
Self-Discover: Large Language Models Self-Compose Reasoning Structures | Code | 3
Common Sense Reasoning for Deepfake Detection | Code | 3
Mosaic: An Architecture for Scalable & Interoperable Data Views | Code | 3
The Unreasonable Ineffectiveness of the Deeper Layers | Code | 3
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? | Code | 3
Difference-in-Differences Estimation with Spatial Spillovers | Code | 3
Prompting Is Programming: A Query Language for Large Language Models | Code | 3
Scaling Instruction-Finetuned Language Models | Code | 3
MagicDrive: Street View Generation with Diverse 3D Geometry Control | Code | 3
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion | Code | 3
A Survey on Causal Discovery Methods for I.I.D. and Time Series Data | Code | 3
FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting | Code | 3
The Forward-Forward Algorithm: Some Preliminary Investigations | Code | 3
Benchmarking Automatic Machine Learning Frameworks | Code | 3
OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale | Code | 3
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations | Code | 3
INP-Former++: Advancing Universal Anomaly Detection via Intrinsic Normal Prototypes and Residual Learning | Code | 3
Koopman-Based Surrogate Modelling of Turbulent Rayleigh-Bénard Convection | Code | 3
Local motion phases for learning multi-contact character movements | Code | 3
Traj-LIO: A Resilient Multi-LiDAR Multi-IMU State Estimator Through Sparse Gaussian Process | Code | 3
On the Error Analysis of 3D Gaussian Splatting and an Optimal Projection Strategy | Code | 3
T2V-Turbo-v2: Enhancing Video Generation Model Post-Training through Data, Reward, and Conditional Guidance Design | Code | 3
Transolver++: An Accurate Neural Solver for PDEs on Million-Scale Geometries | Code | 3
Vidu4D: Single Generated Video to High-Fidelity 4D Reconstruction with Dynamic Gaussian Surfels | Code | 3
Rectified Flow: A Marginal Preserving Approach to Optimal Transport | Code | 3
TabArena: A Living Benchmark for Machine Learning on Tabular Data | Code | 3
SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting | Code | 3
Grad: Guided Relation Diffusion Generation for Graph Augmentation in Graph Fraud Detection | Code | 3
ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization | Code | 3
Large Language Model based Long-tail Query Rewriting in Taobao Search | Code | 3
SealQA: Raising the Bar for Reasoning in Search-Augmented Language Models | Code | 3
MuLan: Adapting Multilingual Diffusion Models for Hundreds of Languages with Negligible Cost | Code | 3
View Selection for 3D Captioning via Diffusion Ranking | Code | 3
Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates | Code | 3
ART: Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation | Code | 3
Lossless and Near-Lossless Compression for Foundation Models | Code | 3
StarWhisper Telescope: Agent-Based Observation Assistant System to Approach AI Astrophysicist | Code | 3
CTNet: A Convolutional Transformer Network for EEG-Based Motor Imagery Classification | Code | 3
Affordable AI Assistants with Knowledge Graph of Thoughts | Code | 3
The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark | Code | 3
DDT: Decoupled Diffusion Transformer | Code | 3