SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,984 papers248,105 code links4,818 tasks

Papers

Showing 16261650 of 659984 papers

TitleStatusHype
Utility-Guided Agent Orchestration for Efficient LLM Tool Use0
Revealing Domain-Spatiality Patterns for Configuration Tuning: Domain Knowledge Meets Fitness Landscapes0
Infinite-dimensional spherical-radial decomposition for probabilistic functions, with application to constrained optimal control and Gaussian process regression0
PanORama: Multiview Consistent Panoptic Segmentation in Operating Rooms0
Span-Level Machine Translation Meta-Evaluation0
Translation from the Information Bottleneck Perspective: an Efficiency Analysis of Spatial Prepositions in Bitexts0
SegVGGT: Joint 3D Reconstruction and Instance Segmentation from Multi-View Images0
SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia0
Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents0
LIORNet: Self-Supervised LiDAR Snow Removal Framework for Autonomous Driving under Adverse Weather Conditions0
Timestep-Aware Block Masking for Efficient Diffusion Model Inference0
Hybrid topic modelling for computational close reading: Mapping narrative themes in Pushkin's Evgenij Onegin0
TAPAS: Efficient Two-Server Asymmetric Private Aggregation Beyond Prio(+)0
Structural Controllability of Large-Scale Hypergraphs0
Cov2Pose: Leveraging Spatial Covariance for Direct Manifold-aware 6-DoF Object Pose Estimation0
Channel Prediction-Based Physical Layer Authentication under Consecutive Spoofing Attacks0
2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction0
Diffusion-Based Makeup Transfer with Facial Region-Aware Makeup Features0
Graph2TS: Structure-Controlled Time Series Generation via Quantile-Graph VAEs0
Model-Driven Learning-Based Physical Layer Authentication for Mobile Wi-Fi Devices0
Promoting Critical Thinking With Domain-Specific Generative AI Provocations0
X-World: Controllable Ego-Centric Multi-Camera World Models for Scalable End-to-End Driving0
Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States0
Evaluating Test-Time Adaptation For Facial Expression Recognition Under Natural Cross-Dataset Distribution Shifts0
When Contextual Inference Fails: Cancelability in Interactive Instruction Following0
Show:102550
← PrevPage 66 of 26400Next →