SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 3001-3050 of 659,983 papers

Title | Status | Hype
RobustSAM: Segment Anything Robustly on Degraded Images | Code | 3
Centaur: a foundation model of human cognition | Code | 3
ForestColl: Throughput-Optimal Collective Communications on Heterogeneous Network Fabrics | Code | 3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Code | 3
DreamPhysics: Learning Physics-Based 3D Dynamics with Video Diffusion Priors | Code | 3
Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks | Code | 3
Hydra-MDP: End-to-end Multimodal Planning with Multi-target Hydra-Distillation | Code | 3
Model-based Asynchronous Hyperparameter and Neural Architecture Search | Code | 3
ContextCite: Attributing Model Generation to Context | Code | 3
Evaluation of the MACE Force Field Architecture: from Medicinal Chemistry to Materials Science | Code | 3
Language Model Inversion | Code | 3
Evalverse: Unified and Accessible Library for Large Language Model Evaluation | Code | 3
DBA-Fusion: Tightly Integrating Deep Dense Visual Bundle Adjustment with Multiple Sensors for Large-Scale Localization and Mapping | Code | 3
GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Code | 3
Improved motif-scaffolding with SE(3) flow matching | Code | 3
GPU-accelerated Evolutionary Multiobjective Optimization Using Tensorized RVEA | Code | 3
SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression | Code | 3
OmniPred: Language Models as Universal Regressors | Code | 3
Deep OC-SORT: Multi-Pedestrian Tracking by Adaptive Re-Identification | Code | 3
ReEvo: Large Language Models as Hyper-Heuristics with Reflective Evolution | Code | 3
ADBench: Anomaly Detection Benchmark | Code | 3
94% on CIFAR-10 in 3.29 Seconds on a Single GPU | Code | 3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction | Code | 3
On the Trajectory Regularity of ODE-based Diffusion Sampling | Code | 3
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series Forecasting | Code | 3
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models | Code | 3
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Code | 3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive Survey | Code | 3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model | Code | 3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud Learning | Code | 3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting | Code | 3
GraphStorm: all-in-one graph machine learning framework for industry applications | Code | 3
TokenPacker: Efficient Visual Projector for Multimodal LLM | Code | 3
WeatherMesh-3: Fast and accurate operational global weather forecasting | Code | 3
NdLinear Is All You Need for Representation Learning | Code | 3
Bake off redux: a review and experimental evaluation of recent time series classification algorithms | Code | 3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic Representation | Code | 3
CameraHMR: Aligning People with Perspective | Code | 3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Code | 3
DEFOM-Stereo: Depth Foundation Model Based Stereo Matching | Code | 3
Rainbow: Combining Improvements in Deep Reinforcement Learning | Code | 3
Mambular: A Sequential Model for Tabular Deep Learning | Code | 3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization | Code | 3
WHAC: World-grounded Humans and Cameras | Code | 3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations | Code | 3
Generative AI Act II: Test Time Scaling Drives Cognition Engineering | Code | 3
ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models | Code | 3
Cognify: Supercharging Gen-AI Workflows With Hierarchical Autotuning | Code | 3
Unitxt: Flexible, Shareable and Reusable Data Preparation and Evaluation for Generative AI | Code | 3
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents | Code | 3