SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 49264950 of 661570 papers

TitleStatusHype
Cross-modal learning for plankton recognitionCode0
DISCOVER: A Solver for Distributional Counterfactual ExplanationsCode0
Fast-HaMeR: Boosting Hand Mesh Reconstruction using Knowledge DistillationCode0
HeBA: Heterogeneous Bottleneck Adapters for Robust Vision-Language ModelsCode0
Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language ModelsCode0
Learning to Present: Inverse Specification Rewards for Agentic Slide GenerationCode0
Draft and Refine with Visual ExpertsCode0
SO-Bench: A Structural Output Evaluation of Multimodal LLMsCode0
DesertFormer: Transformer-Based Semantic Segmentation for Off-Road Desert Terrain Classification in Autonomous Navigation SystemsCode0
Uni-DAD: Unified Distillation and Adaptation of Diffusion Models for Few-step Few-shot Image GenerationCode0
HyPER-GAN: Hybrid Patch-Based Image-to-Image Translation for Real-Time Photorealism EnhancementCode0
VLOD-TTA: Test-Time Adaptation of Vision-Language Object DetectorsCode0
GeoBridge: A Semantic-Anchored Multi-View Foundation Model Bridging Images and Text for Geo-LocalizationCode0
REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation ModelsCode0
Demystifing Video Reasoning1
InCoder-32B: Code Foundation Model for Industrial Scenarios1
SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models2
M^3: Dense Matching Meets Multi-View Foundation Models for Monocular Gaussian Splatting SLAM1
SegviGen: Repurposing 3D Generative Model for Part Segmentation2
Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models2
LenghuSky-8: An 8-Year All-Sky Cloud Dataset with Star-Aware Masks and Alt-Az Calibration for Segmentation and Nowcasting0
interwhen: A Generalizable Framework for Verifiable Reasoning with Test-time MonitorsCode0
SA-CycleGAN-2.5D: Self-Attention CycleGAN with Tri-Planar Context for Multi-Site MRI Harmonization0
Bootstrapping Embeddings for Low Resource Languages0
NeuronSpark: A Spiking Neural Network Language Model with Selective State Space Dynamics0
Show:102550
← PrevPage 198 of 26463Next →