SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 62516300 of 661570 papers

TitleStatusHype
Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving0
CIPHER: Culvert Inspection through Pairwise Frame Selection and High-Efficiency Reconstruction0
Unified Text-Image-to-Video Generation: A Training-Free Approach to Flexible Visual Conditioning0
Preconditioned Test-Time Adaptation for Out-of-Distribution Debiasing in Narrative Generation0
OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation0
The Phenomenology of Hallucinations0
Sampling as Bandits: Evaluation-Efficient Design for Black-Box Densities0
Masked Representation Modeling for Domain-Adaptive Segmentation0
Revisiting Vision Language Foundations for No-Reference Image Quality Assessment0
UniPrototype: Humn-Robot Skill Learning with Uniform Prototypes0
Multi-View Camera System for Variant-Aware Autonomous Vehicle Inspection and Defect Detection0
Understanding Sensitivity of Differential Attention through the Lens of Adversarial Robustness0
TsLLM: Augmenting LLMs for General Time Series Understanding and Prediction0
Eliciting Chain-of-Thought Reasoning for Time Series Analysis using Reinforcement Learning0
Purrception: Variational Flow Matching for Vector-Quantized Image Generation0
Transfer Learning with Distance Covariance for Random Forest: Error Bounds and an EHR Application0
PRISM: Enhancing Protein Inverse Folding through Fine-Grained Retrieval on Structure-Sequence Multimodal Representations0
Justitia: Fair and Efficient Scheduling of Task-parallel LLM Agents with Selective Pampering0
VISTA: Verification In Sequential Turn-based Assessment0
Privacy-Preserving Explainable AIoT Application via SHAP Entropy Regularization0
IDALC: A Semi-Supervised Framework for Intent Detection and Active Learning based Correction0
Decoupled Action Expert: Confining Task Knowledge to the Conditioning Pathway0
UniFlow: Zero-Shot LiDAR Scene Flow for Autonomous Vehicles0
Uncertainty Quantification and Data Efficiency in AI: An Information-Theoretic Perspective0
ShaRP: SHAllow-LayeR Pruning for Efficient Video Large Language Models0
Composing Concepts from Images and Videos via Concept-prompt Binding2
SigMA: Path Signatures and Multi-head Attention for Learning Parameters in fBm-driven SDEs0
ARMFlow: AutoRegressive MeanFlow for Online 3D Human Reaction Generation0
GMODiff: One-Step Gain Map Refinement with Diffusion Priors for HDR Reconstruction0
On the Existence and Behavior of Secondary Attention SinksCode0
Diversity or Precision? A Deep Dive into Next Token Prediction0
V-CORE: Temporally Consistent Video Understanding for Video-LLM0
VIBEVOICE-ASR Technical Report0
DECEIVE-AFC: Adversarial Claim Attacks against Search-Enabled LLM-based Fact-Checking Systems0
Seg-MoE: Multi-Resolution Segment-wise Mixture-of-Experts for Time Series Forecasting Transformers0
Low-Dimensional and Transversely Curved Optimization Dynamics in Grokking0
Early-Warning Signals of Grokking via Loss-Landscape Geometry0
Evaluating Four FPGA-accelerated Space Use Cases based on Neural Network Algorithms for On-board Inference0
When LoRA Betrays: Backdooring Text-to-Image Models by Masquerading as Benign Adapters0
Induction Meets Biology: Mechanisms of Repeat Detection in Protein Language Models0
A Gauge Theory of Superposition: Toward a Sheaf-Theoretic Atlas of Neural Representations0
Do Mixed-Vendor Multi-Agent LLMs Improve Clinical Diagnosis?0
Aura: Universal Multi-dimensional Exogenous Integration for Aviation Time Series0
AgrI Challenge: A Data-Centric AI Competition for Cross-Team Validation in Agricultural Vision0
AutoControl Arena: Synthesizing Executable Test Environments for Frontier AI Risk Evaluation0
DyQ-VLA: Temporal-Dynamic-Aware Quantization for Embodied Vision-Language-Action Models0
Learning Adaptive LLM Decoding0
Robust Regularized Policy Iteration under Transition Uncertainty0
PDE-SSM: A Spectral State Space Approach to Spatial Mixing in Diffusion Transformers0
SVD Contextual Sparsity Predictors for Fast LLM Inference0
Show:102550
← PrevPage 126 of 13232Next →