SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 45014525 of 661570 papers

TitleStatusHype
Meta-Reinforcement Learning with Self-Reflection for Agentic SearchCode0
Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLM Reward ModelsCode0
Towards Motion-aware Referring Image SegmentationCode0
UniSAFE: A Comprehensive Benchmark for Safety Evaluation of Unified Multimodal ModelsCode0
Procedural Generation of Algorithm Discovery Tasks in Machine LearningCode0
Fine-Grained Post-Training Quantization for Large Vision Language Models with Quantization-Aware Integrated GradientsCode0
Self-Attention And Beyond the Infinite: Towards Linear Transformers with Infinite Self-AttentionCode0
Learning to See and Act: Task-Aware Virtual View Exploration for Robotic Manipulation1
Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery4
Complementary Reinforcement Learning1
Stereo World Model: Camera-Guided Stereo Video Generation1
Tree Search for LLM Agent Reinforcement Learning3
Generative Refocusing: Flexible Defocus Control from a Single Image3
Learning Goal-Oriented Vision-and-Language Navigation with Self-Improving Demonstrations at Scale1
FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models0
Parameter-Efficient Modality-Balanced Symmetric Fusion for Multimodal Remote Sensing Semantic SegmentationCode0
Deep Learning Multi-Horizon Irradiance Nowcasting: A Comparative Evaluation of Three Methods for Leveraging Sky Images0
PI-Mamba: Linear-Time Protein Backbone Generation via Spectrally Initialized Flow Matching0
The Cognitive Divergence: AI Context Windows, Human Attention Decline, and the Delegation Feedback Loop0
Agentic AI for Human Resources: LLM-Driven Candidate Assessment0
On the Carbon Footprint of Economic Research in the Age of Generative AI0
EMPD: An Event-based Multimodal Physiological Dataset for Remote Pulse Wave Detection0
Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores0
Scaling Attention via Feature Sparsity0
Latent Semantic Manifolds in Large Language Models0
Show:102550
← PrevPage 181 of 26463Next →