SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8,101-8,125 of 474,278 papers

Title | Status | Hype
Squrve: A Unified and Modular Framework for Complex Real-World Text-to-SQL Tasks | Code | 0
Mixture-of-Experts Meets In-Context Reinforcement Learning | Code | 0
MH-GIN: Multi-scale Heterogeneous Graph-based Imputation Network for AIS Data (Extended Version) | Code | 0
InstanceAssemble: Layout-Aware Image Generation via Instance Assembling Attention | Code | 0
PanicToCalm: A Proactive Counseling Agent for Panic Attacks | Code | 0
FRBNet: Revisiting Low-Light Vision through Frequency-Domain Radial Basis Network | Code | 0
GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research | Code | 0
Diffusion Adaptive Text Embedding for Text-to-Image Diffusion Models | Code | 0
Compositional Image Synthesis with Inference-Time Scaling | Code | 0
Enhancing Vision-Language Models for Autonomous Driving through Task-Specific Prompting and Spatial Reasoning | Code | 0
The Underappreciated Power of Vision Models for Graph Structural Understanding | | 0
Magentic Marketplace: An Open-Source Environment for Studying Agentic Markets | | 0
StreetMath: Study of LLMs' Approximation Behaviors | Code | 0
Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method | Code | 0
Identity-Preserving Text-to-Video Generation Guided by Simple yet Effective Spatial-Temporal Decoupled Representations | Code | 0
Less is More: Local Intrinsic Dimensions of Contextual Language Models | | 0
OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model | | 0
AlignCAT: Visual-Linguistic Alignment of Category and Attribute for Weakly Supervised Visual Grounding | Code | 0
RotaTouille: Rotation Equivariant Deep Learning for Contours | | 0
ClaimGen-CN: A Large-scale Chinese Dataset for Legal Claim Generation | | 0
Reconstruction Alignment Improves Unified Multimodal Models | | 0
BTL-UI: Blink-Think-Link Reasoning Model for GUI Agent | | 0
Topology Sculptor, Shape Refiner: Discrete Diffusion Model for High-Fidelity 3D Meshes Generation | Code | 0
Code Aesthetics with Agentic Reward Feedback | | 0
Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences | | 0