SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2,651–2,675 of 661,570 papers

Title | Status | Hype
A Review of Large Language Models and Autonomous Agents in Chemistry | Code | 3
Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle | Code | 3
Accelerating Diffusion Transformers with Token-wise Feature Caching | Code | 3
One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion | Code | 3
skscope: Fast Sparsity-Constrained Optimization in Python | Code | 3
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding | Code | 3
Repeat After Me: Transformers are Better than State Space Models at Copying | Code | 3
SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition | Code | 3
Towards Universal Soccer Video Understanding | Code | 3
Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance | Code | 3
Temporal Graph Analysis with TGX | Code | 3
From Individual to Society: A Survey on Social Simulation Driven by Large Language Model-based Agents | Code | 3
Enhancing Visual Grounding for GUI Agents via Self-Evolutionary Reinforcement Learning | Code | 3
Halton Scheduler For Masked Generative Image Transformer | Code | 3
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance | Code | 3
iNatAg: Multi-Class Classification Models Enabled by a Large-Scale Benchmark Dataset with 4.7M Images of 2,959 Crop and Weed Species | Code | 3
Q-Bench+: A Benchmark for Multi-modal Foundation Models on Low-level Vision from Single Images to Pairs | Code | 3
SemDeDup: Data-efficient learning at web-scale through semantic deduplication | Code | 3
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Code | 3
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models | Code | 3
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation | Code | 3
DynaMem: Online Dynamic Spatio-Semantic Memory for Open World Mobile Manipulation | Code | 3
Universal Language Model Fine-tuning for Text Classification | Code | 3
pfl-research: simulation framework for accelerating research in Private Federated Learning | Code | 3
8-bit Optimizers via Block-wise Quantization | Code | 3
Page 107 of 26,463