SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 49014925 of 661570 papers

TitleStatusHype
Spanning the Visual Analogy Space with a Weight Basis of LoRAs2
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents2
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories2
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents2
Experiential Reinforcement Learning2
Endless Terminals: Scaling RL Environments for Terminal Agents2
Latent Denoising Makes Good Tokenizers2
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference2
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model2
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics2
FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation2
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions2
ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation2
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs2
Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation2
ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation2
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories2
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression2
The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder2
CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion2
Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment2
Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey2
Olaf-World: Orienting Latent Actions for Video World Modeling2
Bolmo: Byteifying the Next Generation of Language Models2
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE2
Show:102550
← PrevPage 197 of 26463Next →