SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 8126-8150 of 474278 papers

Title | Status | Hype
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning | - | 0
LimRank: Less is More for Reasoning-Intensive Information Reranking | - | 0
Lookahead Anchoring: Preserving Character Identity in Audio-Driven Human Animation | - | 0
Track, Inpaint, Resplat: Subject-driven 3D and 4D Generation with Progressive Texture Infilling | - | 0
Variational Masked Diffusion Models | - | 0
In Search of the Unknown Unknowns: A Multi-Metric Distance Ensemble for Out of Distribution Anomaly Detection in Astronomical Surveys | Code | 0
Pre-trained knowledge elevates large language models beyond traditional chemical reaction optimizers | - | 0
ReconViaGen: Towards Accurate Multi-view 3D Object Reconstruction via Generation | - | 0
PlanarGS: High-Fidelity Indoor 3D Gaussian Splatting Guided by Vision-Language Planar Priors | - | 0
OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM | - | 0
Evaluation of Vision-LLMs in Surveillance Video | Code | 0
T2ICount: Enhancing Cross-modal Understanding for Zero-Shot Counting | Code | 0
One Stone with Two Birds: A Null-Text-Null Frequency-Aware Diffusion Models for Text-Guided Image Inpainting | Code | 0
ESCA: Contextualizing Embodied Agents via Scene-Graph Generation | Code | 0
ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem | Code | 0
Learning Reconfigurable Representations for Multimodal Federated Learning with Missing Data | Code | 0
Bi-Encoder Contrastive Learning for Fingerprint and Iris Biometrics | Code | 0
LLM Meets Diffusion: A Hybrid Framework for Crystal Material Generation | Code | 0
Seq-DeepIPC: Sequential Sensing for End-to-End Control in Legged Robot Navigation | Code | 0
Flexing in 73 Languages: A Single Small Model for Multilingual Inflection | Code | 0
SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations | Code | 0
DecoDINO: 3D Human-Scene Contact Prediction with Semantic Classification | Code | 0
PAHQ: Accelerating Automated Circuit Discovery through Mixed-Precision Inference Optimization | Code | 0
A Deep Latent Factor Graph Clustering with Fairness-Utility Trade-off Perspective | Code | 0
BBOPlace-Bench: Benchmarking Black-Box Optimization for Chip Placement | Code | 0