SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 26-50 of 474,278 papers

Title | Status | Hype
Rethinking Language Model Scaling under Transferable Hypersphere Optimization | | 0
Adaptive Block-Scaled Data Types | | 0
HandX: Scaling Bimanual Motion and Interaction Generation | | 0
Gen-Searcher: Reinforcing Agentic Search for Image Generation | | 0
NeiGAD: Augmenting Graph Anomaly Detection via Spectral Neighbor Information | | 0
LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models | | 0
Kernel-Smith: A Unified Recipe for Evolutionary Kernel Optimization | | 0
Courtroom-Style Multi-Agent Debate with Progressive RAG and Role-Switching for Controversial Claim Verification | | 0
GraphWalker: Agentic Knowledge Graph Question Answering via Synthetic Trajectory Curriculum | | 0
ORSIFlow: Saliency-Guided Rectified Flow for Optical Remote Sensing Salient Object Detection | | 0
ELViS: Efficient Visual Similarity from Local Descriptors that Generalizes Across Domains | | 0
Industrial3D: A Terrestrial LiDAR Point Cloud Dataset and CrossParadigm Benchmark for Industrial Infrastructure | | 0
WAFT-Stereo: Warping-Alone Field Transforms for Stereo Matching | | 0
Sommelier: Scalable Open Multi-turn Audio Pre-processing for Full-duplex Speech Language Models | | 0
CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence | | 0
ImagenWorld: Stress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks | | 0
RSR-core: A High-Performance Engine for Low-Bit Matrix-Vector Multiplication | | 0
KV Cache Quantization for Self-Forcing Video Generation: A 33-Method Empirical Study | | 0
Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs | | 0
Streamlined Open-Vocabulary Human-Object Interaction Detection | | 0
Q-BIOLAT: Binary Latent Protein Fitness Landscapes for QUBO-Based Optimization | | 0
OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery | | 0
PRBench: End-to-end Paper Reproduction in Physics Research | | 0
RHO: Robust Holistic OSM-Based Metric Cross-View Geo-Localization | | 0
GS3LAM: Gaussian Semantic Splatting SLAM | | 0