SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1115111200 of 661570 papers

TitleStatusHype
VANGUARD: Vehicle-Anchored Ground Sample Distance Estimation for UAVs in GPS-Denied Environments0
A multi-center analysis of deep learning methods for video polyp detection and segmentation0
IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning0
Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models0
Dual Diffusion Models for Multi-modal Guided 3D Avatar Generation0
World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings0
AILS-NTUA at SemEval-2026 Task 12: Graph-Based Retrieval and Reflective Prompting for Abductive Event Reasoning0
SPRINT: Semi-supervised Prototypical Representation for Few-Shot Class-Incremental Tabular Learning0
Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images0
Algorithmic Compliance and Regulatory Loss in Digital Assets0
What Does Flow Matching Bring To TD Learning?0
SpotIt+: Verification-based Text-to-SQL Evaluation with Database Constraints0
Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection0
ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors1
Underrepresented in Foundation Model Pretraining Data? A One-Shot ProbeCode0
RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation0
FocusGraph: Graph-Structured Frame Selection for Embodied Long Video Question Answering0
A Constrained RL Approach for Cost-Efficient Delivery of Latency-Sensitive Applications0
Efficient Refusal Ablation in LLM through Optimal Transport0
RoboCasa365: A Large-Scale Simulation Framework for Training and Benchmarking Generalist Robots0
Dissecting Quantization Error: A Concentration-Alignment Perspective0
Robust Unscented Kalman Filtering via Recurrent Meta-Adaptation of Sigma-Point Weights0
Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks0
τ-Knowledge: Evaluating Conversational Agents over Unstructured Knowledge0
Turning Trust to Transactions: Tracking Affiliate Marketing and FTC Compliance in YouTube's Influencer Economy0
Accurate and Efficient Hybrid-Ensemble Atmospheric Data Assimilation in Latent Space with Uncertainty Quantification0
EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models0
Honest and Reliable Evaluation and Expert Equivalence Testing of Automated Neonatal Seizure Detection0
Multi-Agent Reinforcement Learning in Intelligent Transportation Systems: A Comprehensive Survey0
TSPC: A Two-Stage Phoneme-Centric Architecture for code-switching Vietnamese-English Speech Recognition0
Continuous Space-Time Video Super-Resolution with 3D Fourier Fields0
Pretraining Large Language Models with NVFP47
PrefDisco: Benchmarking Proactive Personalized Reasoning0
Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences0
RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring0
Observer-Actor: Active Vision Imitation Learning with Sparse-View Gaussian Splatting0
The Convergence of Schema-Guided Dialogue Systems and the Model Context Protocol0
Zatom-1: A Multimodal Flow Foundation Model for 3D Molecules and Materials0
Fine-grained Soundscape Control for Augmented Hearing0
Latent Particle World Models: Self-supervised Object-centric Stochastic Dynamics Modeling2
Towards Explainable Deep Learning for Ship Trajectory Prediction in Inland Waterways0
Dictionary Based Pattern Entropy for Causal Direction Discovery0
From Spark to Fire: Modeling and Mitigating Error Cascades in LLM-Based Multi-Agent Collaboration0
Standing on the Shoulders of Giants: Rethinking EEG Foundation Model Pretraining via Multi-Teacher Distillation0
Recognition of Daily Activities through Multi-Modal Deep Learning: A Video, Pose, and Object-Aware Approach for Ambient Assisted Living0
Progressive Refinement Regulation for Accelerating Diffusion Language Model Decoding0
Augmenting representations with scientific papers0
The Volterra signature0
Invariant Causal Routing for Governing Social Norms in Online Market Economies0
An LLM-Guided Query-Aware Inference System for GNN Models on Large Knowledge Graphs0
Show:102550
← PrevPage 224 of 13232Next →