SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 71767200 of 474278 papers

TitleStatusHype
Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph GenerationCode0
A Data-driven ML Approach for Maximizing Performance in LLM-Adapter ServingCode0
ReFactX: Scalable Reasoning with Reliable Facts via Constrained GenerationCode0
On the Alignment of Large Language Models with Global Human OpinionCode0
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse-Linear AttentionCode0
IWR-Bench: Can LVLMs reconstruct interactive webpage from a user interaction video?Code0
A Denoising Framework for Real-World Ultra-Low-Dose Lung CT Images Based on an Image Purification StrategyCode0
Wonder3D++: Cross-domain Diffusion for High-fidelity 3D Generation from a Single ImageCode0
UniUltra: Interactive Parameter-Efficient SAM2 for Universal Ultrasound SegmentationCode0
CASPER: Cross-modal Alignment of Spatial and single-cell Profiles for Expression RecoveryCode0
Exponential Lasso: robust sparse penalization under heavy-tailed noise and outliers with exponential-type lossCode0
Controlling False Positives in Image Segmentation via Conformal PredictionCode0
Convergence and Sketching-Based Efficient Computation of Neural Tangent Kernel Weights in Physics-Based LossCode0
SLAM-AGS: Slide-Label Aware Multi-Task Pretraining Using Adaptive Gradient Surgery in Computational CytologyCode0
Rethinking Saliency Maps: A Cognitive Human Aligned Taxonomy and Evaluation Framework for Explanations0
EulerESG: Automating ESG Disclosure Analysis with LLMsCode0
GPS: General Per-Sample PrompterCode0
Energy-based Autoregressive Generation for Neural Population DynamicsCode0
HSMix: Hard and Soft Mixing Data Augmentation for Medical Image SegmentationCode0
Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model CapabilitiesCode0
SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error CorrectionCode0
MI9: An Integrated Runtime Governance Framework for Agentic AI0
Spatial Policy: Guiding Visuomotor Robotic Manipulation with Spatial-Aware Modeling and Reasoning0
MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling0
Count The Notes: Histogram-Based Supervision for Automatic Music Transcription0
Show:102550
← PrevPage 288 of 18972Next →