SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 77767800 of 474278 papers

TitleStatusHype
Decoupling Augmentation Bias in Prompt Learning for Vision-Language ModelsCode0
TripleWin: Fixed-Point Equilibrium Pricing for Data-Model Coupled MarketsCode0
miniF2F-Lean Revisited: Reviewing Limitations and Charting a Path ForwardCode0
CudaForge: An Agent Framework with Hardware Feedback for CUDA Kernel OptimizationCode0
Computational Imaging Meets LLMs: Zero-Shot IDH Mutation Prediction in Brain GliomasCode0
An Augmentation Overlap Theory of Contrastive LearningCode0
C3-Diff: Super-resolving Spatial Transcriptomics via Cross-modal Cross-content Contrastive Diffusion ModellingCode0
DiffSwap++: 3D Latent-Controlled Diffusion for Identity-Preserving Face SwappingCode0
Seeing Across Time and Views: Multi-Temporal Cross-View Learning for Robust Video Person Re-IdentificationCode0
Object Detection as an Optional Basis: A Graph Matching Network for Cross-View UAV LocalizationCode0
GS-Verse: Mesh-based Gaussian Splatting for Physics-aware Interaction in Virtual Reality0
LTD-Bench: Evaluating Large Language Models by Letting Them Draw0
KAO: Kernel-Adaptive Optimization in Diffusion for Satellite Image0
CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents0
TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System0
Activation Transport Operators0
SmartWilds: Multimodal Wildlife Monitoring Dataset0
A Foundation Model for Brain MRI with Dynamic Modality IntegrationCode0
LAWCAT: Efficient Distillation from Quadratic to Linear Attention with Convolution across Tokens for Long Context ModelingCode0
Uniworld-V2: Reinforce Image Editing with Diffusion Negative-aware Finetuning and MLLM Implicit Feedback0
SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers0
Genie Envisioner: A Unified World Foundation Platform for Robotic Manipulation0
FlowRL: Matching Reward Distributions for LLM Reasoning0
MultiSoundGen: Video-to-Audio Generation for Multi-Event Scenarios via SlowFast Contrastive Audio-Visual Pretraining and Direct Preference Optimization0
Revisiting Long-context Modeling from Context Denoising Perspective0
Show:102550
← PrevPage 312 of 18972Next →