SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 70267050 of 474278 papers

TitleStatusHype
Double Difference Earthquake Location with Graph Neural NetworksCode2
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error PriorsCode2
Distill Visual Chart Reasoning Ability from LLMs to MLLMsCode2
Unleashing Reasoning Capability of LLMs via Scalable Question Synthesis from ScratchCode2
Probabilistic Language-Image Pre-TrainingCode2
Retrieval-Augmented Diffusion Models for Time Series ForecastingCode2
PixelGaussian: Generalizable 3D Gaussian Reconstruction from Arbitrary ViewsCode2
Real-time 3D-aware Portrait Video RelightingCode2
LoRANN: Low-Rank Matrix Factorization for Approximate Nearest Neighbor SearchCode2
Context is Key: A Benchmark for Forecasting with Essential Textual InformationCode2
Moving Object Segmentation in Point Cloud Data using Hidden Markov ModelsCode2
MMAU: A Massive Multi-Task Audio Understanding and Reasoning BenchmarkCode2
Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based ApproachCode2
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question AnsweringCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language ModelsCode2
Rawsamble: Overlapping and Assembling Raw Nanopore Signals using a Hash-based Seeding MechanismCode2
TabDPT: Scaling Tabular Foundation ModelsCode2
Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language ModelsCode2
Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth StudyCode2
An Intelligent Agentic System for Complex Image Restoration ProblemsCode2
One-Step Diffusion Distillation through Score Implicit MatchingCode2
Literature Meets Data: A Synergistic Approach to Hypothesis GenerationCode2
Improving Causal Reasoning in Large Language Models: A SurveyCode2
Frontiers in Intelligent ColonoscopyCode2
Show:102550
← PrevPage 282 of 18972Next →