SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 80018050 of 661570 papers

TitleStatusHype
A Systematic Study of Pseudo-Relevance Feedback with LLMs0
Algorithmic Capture, Computational Complexity, and Inductive Bias of Infinite Transformers0
Instruction set for the representation of graphs0
V2M-Zero: Zero-Pair Time-Aligned Video-to-Music Generation0
Agentar-Fin-OCR0
Text-Trained LLMs Can Zero-Shot Extrapolate PDE Dynamics, Revealing a Three-Stage In-Context Learning Mechanism0
Domain Feature Collapse: Implications for Out-of-Distribution Detection and Solutions0
MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images0
Grow with the Flow: 4D Reconstruction of Growing Plants with Gaussian Flow Fields0
Learning to Unscramble: Simplifying Symbolic Expressions via Self-Supervised Oracle Trajectories0
On the Value of Tokeniser Pretraining in Physics Foundation Models0
Human-Aware Robot Behaviour in Self-Driving Labs0
Scaling Reasoning Efficiently via Relaxed On-Policy Distillation0
Fingerprinting Concepts in Data Streams with Supervised and Unsupervised Meta-Information0
RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation0
Task-Conditioned Routing Signatures in Sparse Mixture-of-Experts Transformers0
A Learning-Based Superposition Operator for Non-Renewal Arrival Processes in Queueing Networks0
Monitoring and Prediction of Mood in Elderly People during Daily Life Activities0
Catalogue Grounded Multimodal Attribution for Museum Video under Resource and Regulatory Constraints0
High-resolution weather-guided surrogate modeling for data-efficient cross-location building energy prediction0
Procedural Fairness via Group Counterfactual Explanation0
Co-Diffusion: An Affinity-Aware Two-Stage Latent Diffusion Framework for Generalizable Drug-Target Affinity Prediction0
Efficient Approximation to Analytic and L^p functions by Height-Augmented ReLU Networks0
Beyond Barren Plateaus: A Scalable Quantum Convolutional Architecture for High-Fidelity Image Classification0
Attention Gathers, MLPs Compose: A Causal Analysis of an Action-Outcome Circuit in VideoViT0
GGPT: Geometry Grounded Point Transformer0
DeReason: A Difficulty-Aware Curriculum Improves Decoupled SFT-then-RL Training for General Reasoning0
Evidential learning driven Breast Tumor Segmentation with Stage-divided Vision-Language Interaction0
Security-by-Design for LLM-Based Code Generation: Leveraging Internal Representations for Concept-Driven Steering Mechanisms0
Senna-2: Aligning VLM and End-to-End Driving Policy for Consistent Decision Making and Planning0
Frequency-Modulated Visual Restoration for Matryoshka Large Multimodal Models0
Markovian Generation Chains in Large Language Models0
Trustworthy predictive distributions for rare events via diagnostic transport maps0
Cough activity detection for automatic tuberculosis screening0
A Unified Latent Space Disentanglement VAE Framework with Robust Disentanglement Effectiveness Evaluation0
A Standardized Framework For Evaluating Gene Expression Generative Models0
Mind the Sim2Real Gap in User Simulation for Agentic Tasks0
A Machine Learning-Enhanced Hopf-Cole Formulation for Nonlinear Gas Flow in Porous Media0
Artificial Intelligence for Sentiment Analysis of Persian Poetry0
Towards Automated Initial Probe Placement in Transthoracic Teleultrasound Using Human Mesh and Skeleton Recovery0
Enhancing Value Alignment of LLMs with Multi-agent system and Combinatorial Fusion0
Similarity-as-Evidence: Calibrating Overconfident VLMs for Interpretable and Label-Efficient Medical Active Learning0
A Minimal Agent for Automated Theorem Proving0
Bayesian Optimization of Partially Known Systems using Hybrid Models0
PEEM: Prompt Engineering Evaluation Metrics for Interpretable Joint Evaluation of Prompts and Responses0
Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search0
Resource-constrained Amazons chess decision framework integrating large language models and graph attention0
6ABOS: An Open-Source Atmospheric Correction Framework for the EnMAP Hyperspectral Mission Based on 6S0
Gradient Dynamics of Attention: How Cross-Entropy Sculpts Bayesian Manifolds0
Adaptive Event Stream Slicing for Open-Vocabulary Event-Based Object Detection via Vision-Language Knowledge Distillation0
Show:102550
← PrevPage 161 of 13232Next →