SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1050110550 of 661570 papers

TitleStatusHype
EgoCogNav: Cognition-aware Human Egocentric Navigation0
Cyber Threat Intelligence for Artificial Intelligence Systems0
Interpretable Multimodal Gesture Recognition for Drone and Mobile Robot Teleoperation via Log-Likelihood Ratio Fusion0
Deep Learning-Driven Friendly Jamming for Secure Multicarrier ISAC Under Channel Uncertainty0
Recursive Inference Machines for Neural Reasoning0
DEBISS: a Corpus of Individual, Semi-structured and Spoken Debates0
Probabilistic Dreaming for World Models0
Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning0
Linguistic trajectories of bipolar disorder on social media0
AILS-NTUA at SemEval-2026 Task 10: Agentic LLMs for Psycholinguistic Marker Extraction and Conspiracy Endorsement Detection0
K-Gen: A Multimodal Language-Conditioned Approach for Interpretable Keypoint-Guided Trajectory Generation0
Llama-Mimi: Exploring the Limits of Flattened Speech Language Modeling0
Conversational Speech Reveals Structural Robustness Failures in SpeechLLM Backbones0
EA-Swin: An Embedding-Agnostic Swin Transformer for AI-Generated Video Detection0
Improving Text-to-Image Generation with Intrinsic Self-Confidence Rewards0
Learning Optimal Distributionally Robust Individualized Treatment Rules Integrating Multi-Source Data0
Autonomous Algorithm Discovery for Ptychography via Evolutionary LLM Reasoning0
TML-Bench: Benchmark for Data Science Agents on Tabular ML TasksCode0
SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning0
Fully Automatic Data Labeling for Ultrasound Screen Detection0
FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications0
Uncertainty-aware Blood Glucose Prediction from Continuous Glucose Monitoring Data0
WaterSIC: information-theoretically (near) optimal linear layer quantization0
VisionPangu: A Compact and Fine-Grained Multimodal Assistant with 1.7B Parameters0
How Quantization Shapes Bias in Large Language Models0
Topology Structure Optimization of Reservoirs Using GLMY Homology0
Motion-Aware Animatable Gaussian Avatars DeblurringCode0
Some Super-approximation Rates of ReLU Neural Networks for Korobov Functions0
Quadrotor Navigation using Reinforcement Learning with Privileged Information0
BACE-RUL: A Bi-directional Adversarial Network with Covariate Encoding for Machine Remaining Useful Life Prediction0
AutoV: Loss-Oriented Ranking for Visual Prompt Retrieval in LVLMs0
Learning Physical Systems: Symplectification via Gauge Fixing in Dirac Structures0
Parameter Stress Analysis in Reinforcement Learning: Applying Synaptic Filtering to Policy Networks0
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining0
Overtone: Cyclic Patch Modulation for Clean, Efficient, and Flexible Physics Emulators0
Kernel Based Maximum Entropy Inverse Reinforcement Learning for Mean-Field Games0
SAMPO-Path: Segmentation Intent-Aligned Preference Optimization for Pathology Foundation Model Segmentation0
In-Training Defenses against Emergent Misalignment in Language Models0
Vevo2: A Unified and Controllable Framework for Speech and Singing Voice Generation0
Complexity-Regularized Proximal Policy Optimization0
New Insights into Optimal Alignment of Acoustic and Linguistic Representations for Knowledge Transfer in ASR0
AttnBoost: Retail Supply Chain Sales Insights via Gradient Boosting Perspective0
BabyHuBERT: Multilingual Self-Supervised Learning for Segmenting Speakers in Child-Centered Long-Form Recordings0
Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks0
Noise-to-Notes: Diffusion-based Generation and Refinement for Automatic Drum Transcription0
Towards Understanding Subliminal Learning: When and How Hidden Biases Transfer0
BridgeDrive: Diffusion Bridge Policy for Closed-Loop Trajectory Planning in Autonomous DrivingCode0
OPPO: Accelerating PPO-based RLHF via Pipeline Overlap0
Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs1
MachaGrasp: Morphology-Aware Cross-Embodiment Dexterous Hand Articulation Generation for Grasping0
Show:102550
← PrevPage 211 of 13232Next →