SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 45764600 of 661570 papers

TitleStatusHype
WildCap: Facial Albedo Capture in the Wild via Hybrid Inverse Rendering0
LANCE: Low Rank Activation Compression for Efficient On-Device Continual Learning0
Representing Beauty: Towards a Participatory but Objective Latent Aesthetics0
When a Robot is More Capable than a Human: Learning from Constrained Demonstrators0
Distributional Consistency Loss: Beyond Pointwise Data Terms in Inverse Problems0
Strategic Costs of Perceived Bias in Fair Selection0
Evontree: Ontology Rule-Guided Self-Evolution of Large Language Models0
S2WMamba: A Wavelet-Assisted Mamba-Based Dual-Branch Network For Pansharpening0
Analyzing Planner Design Trade-offs for MAPF under ADG-based Realistic Execution0
On Geometric Understanding and Learned Priors in Feed-forward 3D Reconstruction Models0
Toward Better Temporal Structures for Geopolitical Events Forecasting0
A Novel Patch-Based TDA Approach for Computed Tomography Imaging0
DiG: Differential Grounding for Enhancing Fine-Grained Perception in Multimodal Large Language Model0
Diffusion-DRF: Free, Rich, and Differentiable Reward for Video Diffusion Fine-Tuning0
Large Language Models Approach Expert Pedagogical Quality in Math Tutoring but Differ in Instructional and Linguistic Profiles0
Few-Shot Video Object Segmentation in X-Ray Angiography Using Local Matching and Spatio-Temporal Consistency LossCode0
SentGraph: Hierarchical Sentence Graph for Multi-hop Retrieval-Augmented Question Answering0
Aletheia: What Makes RLVR For Code Verifiers Tick?0
VisTIRA: Closing the Image-Text Modality Gap in Visual Math Reasoning via Structured Tool Integration0
Think3D: Thinking with Space for Spatial ReasoningCode0
Building a Correct-by-Design Lakehouse. Data Contracts, Versioning, and Transactional Pipelines for Humans and Agents0
LogicSkills: A Structured Benchmark for Formal Reasoning in Large Language Models0
Fluids You Can Trust: Property-Preserving Operator Learning for Incompressible Flows0
Synergizing Understanding and Generation with Interleaved Analyzing-Drafting Thinking0
Efficient Continual Learning in Language Models via Thalamically Routed Cortical Columns0
Show:102550
← PrevPage 184 of 26463Next →