SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 45514575 of 177340 papers

TitleStatusHype
Are EEG-to-Text Models Working?Code3
Verdict: A Library for Scaling Judge-Time ComputeCode3
Compact 3D Scene Representation via Self-Organizing Gaussian GridsCode3
StyleGAN-Human: A Data-Centric Odyssey of Human GenerationCode3
TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context LearningCode3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in PythonCode3
CLOVER: Cross-Layer Orthogonal Vectors Pruning and Fine-TuningCode3
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language ModelsCode3
MaxViT: Multi-Axis Vision TransformerCode3
A Survey of Large Language Models for GraphsCode3
SLEDGE: Synthesizing Driving Environments with Generative Models and Rule-Based TrafficCode3
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage ScenariosCode3
Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language ModelsCode3
Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge SurveyCode3
Panza: Design and Analysis of a Fully-Local Personalized Text Writing AssistantCode3
TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian SplattingCode3
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model LeaderboardsCode3
RFUAV: A Benchmark Dataset for Unmanned Aerial Vehicle Detection and IdentificationCode3
Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted ConceptsCode3
Detect Anything 3D in the WildCode3
Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather ForecastCode3
Sequence-Augmented SE(3)-Flow Matching For Conditional Protein Backbone GenerationCode3
Unlimiformer: Long-Range Transformers with Unlimited Length InputCode3
emotion2vec: Self-Supervised Pre-Training for Speech Emotion RepresentationCode3
The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and OptimizationCode3
Show:102550
← PrevPage 183 of 7094Next →