SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 38263850 of 177340 papers

TitleStatusHype
Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token RecyclingCode3
Description Boosting for Zero-Shot Entity and Relation ClassificationCode3
LibCity: A Unified Library Towards Efficient and Comprehensive Urban Spatial-Temporal PredictionCode3
Bird-Eye Transformers for Text Generation ModelsCode3
Lightplane: Highly-Scalable Components for Neural 3D FieldsCode3
Apollo: Band-sequence Modeling for High-Quality Audio RestorationCode3
ExTrans: Multilingual Deep Reasoning Translation via Exemplar-Enhanced Reinforcement LearningCode3
Image Quality Assessment for Magnetic Resonance ImagingCode3
RoadBEV: Road Surface Reconstruction in Bird's Eye ViewCode3
MetaSpatial: Reinforcing 3D Spatial Reasoning in VLMs for the MetaverseCode3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning TasksCode3
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory ModelCode3
UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language InterfaceCode3
PeriodWave: Multi-Period Flow Matching for High-Fidelity Waveform GenerationCode3
RoMa: Robust Dense Feature MatchingCode3
Cinemo: Consistent and Controllable Image Animation with Motion Diffusion ModelsCode3
ViTamin: Designing Scalable Vision Models in the Vision-Language EraCode3
Accelerating High-Fidelity Waveform Generation via Adversarial Flow Matching OptimizationCode3
Deep Learning for Multivariate Time Series Imputation: A SurveyCode3
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object InteractionsCode3
PathoTune: Adapting Visual Foundation Model to Pathological SpecialistsCode3
SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RLCode3
Bench: Extending Long Context Evaluation Beyond 100K TokensCode3
CRITERIA: a New Benchmarking Paradigm for Evaluating Trajectory Prediction Models for Autonomous DrivingCode3
MMSearch-R1: Incentivizing LMMs to SearchCode3
Show:102550
← PrevPage 154 of 7094Next →