SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 55265550 of 474278 papers

TitleStatusHype
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective ResamplingCode2
Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU SimulationCode2
An All-Atom Generative Model for Designing Protein ComplexesCode2
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic TasksCode2
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and ResultsCode2
Enhancing Person-to-Person Virtual Try-On with Multi-Garment Virtual Try-OffCode2
Real-time High-fidelity Gaussian Human Avatars with Position-based Interpolation of Spatially Distributed MLPsCode2
Digital Twin Generation from Visual Data: A SurveyCode2
NoisyRollout: Reinforcing Visual Reasoning with Data AugmentationCode2
Embodied-R: Collaborative Framework for Activating Embodied Spatial Reasoning in Foundation Models via Reinforcement LearningCode2
Sleep-time Compute: Beyond Inference Scaling at Test-timeCode2
Representation Learning for Tabular Data: A Comprehensive SurveyCode2
Logits DeConfusion with CLIP for Few-Shot LearningCode2
MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer DevicesCode2
An Efficient and Mixed Heterogeneous Model for Image RestorationCode2
Enhancing Autonomous Driving Systems with On-Board Deployed Large Language ModelsCode2
3DAffordSplat: Efficient Affordance Reasoning with 3D GaussiansCode2
TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics DataCode2
Multi-scale convolutional transformer network for motor imagery brain-computer interfaceCode2
Autoregressive Distillation of Diffusion TransformersCode2
Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-ResolutionCode2
HypoBench: Towards Systematic and Principled Benchmarking for Hypothesis GenerationCode2
FLOWR: Flow Matching for Structure-Aware De Novo, Interaction- and Fragment-Based Ligand GenerationCode2
LLaVA-ReID: Selective Multi-image Questioner for Interactive Person Re-IdentificationCode2
Software package for simulations using the coarse-grained CALVADOS modelCode2
Show:102550
← PrevPage 222 of 18972Next →