SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 71267150 of 474278 papers

TitleStatusHype
AssayMatch: Learning to Select Data for Molecular Activity ModelsCode0
A Spatial Semantics and Continuity Perception Attention for Remote Sensing Water Body Change DetectionCode0
Clustered Error Correction with Grouped 4D Gaussian SplattingCode0
VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame InterpolationCode0
Labels Matter More Than Models: Quantifying the Benefit of Supervised Time Series Anomaly DetectionCode0
EvoVLA: Self-Evolving Vision-Language-Action ModelCode0
FlipVQA-Miner: Cross-Page Visual Question-Answer Mining from TextbooksCode0
SwiTrack: Tri-State Switch for Cross-Modal Object TrackingCode0
WWE-UIE: A Wavelet & White Balance Efficient Network for Underwater Image EnhancementCode0
ChangeDINO: DINOv3-Driven Building Change Detection in Optical Remote Sensing ImageryCode0
VideoSeg-R1:Reasoning Video Object Segmentation via Reinforcement LearningCode0
Medverse: A Universal Model for Full-Resolution 3D Medical Image Segmentation, Transformation and EnhancementCode0
QueryGym: A Toolkit for Reproducible LLM-Based Query ReformulationCode0
BioBench: A Blueprint to Move Beyond ImageNet for Scientific ML BenchmarksCode0
SpectralTrain: A Universal Framework for Hyperspectral Image ClassificationCode0
Q-MLLM: Vector Quantization for Robust Multimodal Large Language Model SecurityCode0
Learning-Enhanced Observer for Linear Time-Invariant Systems with Parametric UncertaintyCode0
C^2-Cite: Contextual-Aware Citation Generation for Attributed Large Language ModelsCode0
SceneEdited: A City-Scale Benchmark for 3D HD Map Updating via Image-Guided Change DetectionCode0
Investigating Hallucination in Conversations for Low Resource Languages0
FLARE: Adaptive Multi-Dimensional Reputation for Robust Client Reliability in Federated LearningCode0
Computer-Use Agents as Judges for Generative User InterfaceCode0
IndicGEC: Powerful Models, or a Measurement Mirage?Code0
CKDA: Cross-modality Knowledge Disentanglement and Alignment for Visible-Infrared Lifelong Person Re-identificationCode0
Step-Audio-EditX Technical Report0
Show:102550
← PrevPage 286 of 18972Next →