SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 49014950 of 661570 papers

TitleStatusHype
Spanning the Visual Analogy Space with a Weight Basis of LoRAs2
AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents2
AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories2
REDSearcher: A Scalable and Cost-Efficient Framework for Long-Horizon Search Agents2
Experiential Reinforcement Learning2
Endless Terminals: Scaling RL Environments for Terminal Agents2
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference2
VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model2
Latent Denoising Makes Good Tokenizers2
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics2
FISHER: A Foundation Model for Multi-Modal Industrial Signal Comprehensive Representation2
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions2
Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs2
ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation2
Unveiling Implicit Advantage Symmetry: Why GRPO Struggles with Exploration and Difficulty Adaptation2
ABot-N0: Technical Report on the VLA Foundation Model for Versatile Embodied Navigation2
CLI-Gym: Scalable CLI Task Generation via Agentic Environment Inversion2
DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories2
Accelerating Streaming Video Large Language Models via Hierarchical Token Compression2
The Million-Label NER: Breaking Scale Barriers with GLiNER bi-encoder2
Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey2
Olaf-World: Orienting Latent Actions for Video World Modeling2
Evolving Interactive Diagnostic Agents in a Virtual Clinical Environment2
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger2
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems2
Bolmo: Byteifying the Next Generation of Language Models2
ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation2
How to Correctly Report LLM-as-a-Judge Evaluations2
MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE2
InternSVG: Towards Unified SVG Tasks with Multimodal Large Language Models2
PISCO: Precise Video Instance Insertion with Sparse Control2
RAP: 3D Rasterization Augmented End-to-End Planning2
Learning to Continually Learn via Meta-learning Agentic Memory Designs2
Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics2
RealPDEBench: A Benchmark for Complex Physical Systems with Real-World Data2
compar:IA: The French Government's LLM arena to collect French-language human prompts and preference data2
FloodDiffusion: Tailored Diffusion Forcing for Streaming Motion Generation2
Learning a Generative Meta-Model of LLM Activations2
EEG Foundation Models: Progresses, Benchmarking, and Open Problems2
Sparse Video Generation Propels Real-World Beyond-the-View Vision-Language Navigation2
Context Forcing: Consistent Autoregressive Video Generation with Long Context2
Theory of Space: Can Foundation Models Construct Spatial Beliefs through Active Exploration?2
Rethinking the Trust Region in LLM Reinforcement Learning2
Infinite-World: Scaling Interactive World Models to 1000-Frame Horizons via Pose-Free Hierarchical Memory2
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis2
Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models2
SERA: Soft-Verified Efficient Repository Agents2
A Survey on Efficient Vision-Language-Action Models2
RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents2
Mind-Brush: Integrating Agentic Cognitive Search and Reasoning into Image Generation2
Show:102550
← PrevPage 99 of 13232Next →