SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 30263050 of 661570 papers

TitleStatusHype
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language ModelsCode3
SyncTalk: The Devil is in the Synchronization for Talking Head SynthesisCode3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive SurveyCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud LearningCode3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian SplattingCode3
GraphStorm: all-in-one graph machine learning framework for industry applicationsCode3
TokenPacker: Efficient Visual Projector for Multimodal LLMCode3
WeatherMesh-3: Fast and accurate operational global weather forecastingCode3
NdLinear Is All You Need for Representation LearningCode3
Bake off redux: a review and experimental evaluation of recent time series classification algorithmsCode3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic RepresentationCode3
CameraHMR: Aligning People with PerspectiveCode3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Rainbow: Combining Improvements in Deep Reinforcement LearningCode3
Mambular: A Sequential Model for Tabular Deep LearningCode3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference OptimizationCode3
WHAC: World-grounded Humans and CamerasCode3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic EvaluationsCode3
Generative AI Act II: Test Time Scaling Drives Cognition EngineeringCode3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language ModelsCode3
Cognify: Supercharging Gen-AI Workflows With Hierarchical AutotuningCode3
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AICode3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI AgentsCode3
Show:102550
← PrevPage 122 of 26463Next →