SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 8,751–8,800 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors | | 1 |
| Pri4R: Learning World Dynamics for Vision-Language-Action Models with Privileged 4D Representation | | 0 |
| Cognitively Layered Data Synthesis for Domain Adaptation of LLMs to Space Situational Awareness | | 0 |
| CTRL Your Shift: Clustered Transfer Residual Learning for Many Small Datasets | | 0 |
| AI Meets Mathematics Education: A Case Study on Supporting an Instructor in a Large Mathematics Class with Context-Aware AI | | 0 |
| AlpsBench: An LLM Personalization Benchmark for Real-Dialogue Memorization and Preference Alignment | | 0 |
| Rheos: Modelling Continuous Motion Dynamics in Hierarchical 3D Scene Graphs | | 0 |
| Joint Return and Risk Modeling with Deep Neural Networks for Portfolio Construction | | 0 |
| Speculating Experts Accelerates Inference for Mixture-of-Experts | Code | 0 |
| Neural Dynamics Self-Attention for Spiking Transformers | | 0 |
| Towards Differentiating Between Failures and Domain Shifts in Industrial Data Streams | | 0 |
| DynaTrust: Defending Multi-Agent Systems Against Sleeper Agents via Dynamic Trust Graphs | | 0 |
| Disentangling Prompt Dependence to Evaluate Segmentation Reliability in Gynecological MRI | | 0 |
| Patient-Level Multimodal Question Answering from Multi-Site Auscultation Recordings | | 0 |
| From Refusal Tokens to Refusal Control: Discovering and Steering Category-Specific Refusal Directions | | 0 |
| Graph2Video: Leveraging Video Models to Model Dynamic Graph Evolution | | 0 |
| WaveComm: Lightweight Communication for Collaborative Perception via Wavelet Feature Distillation | | 0 |
| Real-Time Monocular Scene Analysis for UAV in Outdoor Environments | | 0 |
| Agentic LLM Workflow for MR Spectroscopy Volume-of-Interest Placements in Brain Tumors | | 0 |
| The ARC of Progress towards AGI: A Living Survey of Abstraction and Reasoning | | 0 |
| Bi-CamoDiffusion: A Boundary-informed Diffusion Approach for Camouflaged Object Detection | | 0 |
| Learning When to Trust in Contextual Bandits | | 0 |
| Int3DNet: Scene-Motion Cross Attention Network for 3D Intention Prediction in Mixed Reality | | 0 |
| BrainCast: A Spatio-Temporal Forecasting Model for Whole-Brain fMRI Time Series Prediction | | 0 |
| Multimodal Deep Learning for Dynamic and Static Neuroimaging: Integrating MRI and fMRI for Alzheimer Disease Analysis | | 0 |
| IAML: Illumination-Aware Mirror Loss for Progressive Learning in Low-Light Image Enhancement Auto-encoders | | 0 |
| GraphVLM: Benchmarking Vision Language Models for Multimodal Graph Learning | Code | 0 |
| Thinking in Uncertainty: Mitigating Hallucinations in MLRMs with Latent Entropy-Aware Decoding | | 2 |
| FineRMoE: Dimension Expansion for Finer-Grained Expert with Its Upcycling Approach | | 1 |
| The Conundrum of Trustworthy Research on Attacking Personally Identifiable Information Removal Techniques | | 0 |
| Quantization of Ricci Curvature in Information Geometry | | 0 |
| ConFu: Contemplate the Future for Better Speculative Sampling | | 0 |
| Revisiting Sharpness-Aware Minimization: A More Faithful and Effective Implementation | | 0 |
| Where Do Flow Semantics Reside? A Protocol-Native Tabular Pretraining Paradigm for Encrypted Traffic Classification | | 0 |
| OmniGuide: Universal Guidance Fields for Enhancing Generalist Robot Policies | | 0 |
| Training Language Models via Neural Cellular Automata | | 0 |
| Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents | | 0 |
| Multi-Agent Memory from a Computer Architecture Perspective: Visions and Challenges Ahead | | 0 |
| SBOMs into Agentic AIBOMs: Schema Extensions, Agentic Orchestration, and Reproducibility Evaluation | | 0 |
| Hybrid Quantum-Classical Encoding for Accurate Residue-Level pKa Prediction | | 0 |
| Cluster-Aware Attention-Based Deep Reinforcement Learning for Pickup and Delivery Problems | | 0 |
| InFusionLayer: a CFA-based ensemble tool to generate new classifiers for learning and modeling | Code | 0 |
| DualTurn: Learning Turn-Taking from Dual-Channel Generative Speech Pretraining | | 0 |
| One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States | | 0 |
| PSTNet: Physically-Structured Turbulence Network | | 0 |
| Slumbering to Precision: Enhancing Artificial Neural Network Calibration Through Sleep-like Processes | | 0 |
| DyLLM: Efficient Diffusion LLM Inference via Saliency-based Token Selection and Partial Attention | | 0 |
| Wasserstein Gradient Flows for Scalable and Regularized Barycenter Computation | | 0 |
| Thickening-to-Thinning: Reward Shaping via Human-Inspired Learning Dynamics for LLM Reasoning | | 0 |
| Hinge Regression Tree: A Newton Method for Oblique Regression Tree Splitting | | 0 |
Page 176 of 13,232