SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 6576–6600 of 474278 papers

Title | Status | Hype
Benchmarking Scientific Understanding and Reasoning for Video Generation using VideoScience-Bench | Code | 0
BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection | Code | 0
AutoBrep: Autoregressive B-Rep Generation with Unified Topology and Geometry | Code | 0
The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models | Code | 0
Enhancing Job Matching: Occupation, Skill and Qualification Linking with the ESCO and EQF taxonomies | Code | 0
MasHeNe: A Benchmark for Head and Neck CT Mass Segmentation using Window-Enhanced Mamba with Frequency-Domain Integration | Code | 0
Nav-R^2 Dual-Relation Reasoning for Generalizable Open-Vocabulary Object-Goal Navigation | Code | 0
StructuredDNA: A Bio-Physical Framework for Energy-Aware Transformer Routing | Code | 0
Emergent Extreme-View Geometry in 3D Foundation Models |  | 0
Influence Functions for Efficient Data Selection in Reasoning |  | 0
The Station: An Open-World Environment for AI-Driven Discovery |  | 0
Learning Robust Social Strategies with Large Language Models |  | 0
Soft Adaptive Policy Optimization |  | 0
iMontage: Unified, Versatile, Highly Dynamic Many-to-many Image Generation |  | 0
DualCamCtrl: Dual-Branch Diffusion Model for Geometry-Aware Camera-Controlled Video Generation |  | 0
FlashVGGT: Efficient and Scalable Visual Geometry Transformers with Compressed Descriptor Attention |  | 0
StreamGaze: Gaze-Guided Temporal Reasoning and Proactive Understanding in Streaming Videos |  | 0
Envision: Benchmarking Unified Understanding & Generation for Causal World Process Insights |  | 0
CauSight: Learning to Supersense for Visual Causal Discovery | Code | 0
KM-ViPE: Online Tightly Coupled Vision-Language-Geometry Fusion for Open-Vocabulary Semantic SLAM |  | 0
Rectifying LLM Thought from Lens of Optimization |  | 0
Artemis: Structured Visual Reasoning for Perception Policy Learning |  | 0
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models |  | 0
Apertus: Democratizing Open and Compliant LLMs for Global Language Environments |  | 0
Adaptive Pruning for Increased Robustness and Reduced Computational Overhead in Gaussian Process Accelerated Saddle Point Searches |  | 0
Page 264 of 18972