SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7851–7900 of 661570 papers

Title | Status | Hype
SeDa: A Unified System for Dataset Discovery and Multi-Entity Augmented Semantic Exploration | | 0
Alignment as Iatrogenesis: Pastoral Power, Collective Pathology, and the Structural Limits of Monolingual Safety Evaluation | | 0
A New Modeling to Feature Selection Based on the Fuzzy Rough Set Theory in Normal and Optimistic States on Hybrid Information Systems | | 0
PathoScribe: Transforming Pathology Data into a Living Library with a Unified LLM-Driven Framework for Semantic Retrieval and Clinical Integration | | 0
PlayWorld: Learning Robot World Models from Autonomous Play | | 0
VIVID-Med: LLM-Supervised Structured Pretraining for Deployable Medical ViTs | | 0
Transformer-Based Multi-Region Segmentation and Radiomic Analysis of HR-pQCT Imaging for Osteoporosis Classification | | 0
Agentic AI as a Network Control-Plane Intelligence Layer for Federated Learning over 6G | | 0
Curveball Steering: The Right Direction To Steer Isn't Always Linear | | 0
SPAARS: Safer RL Policy Alignment through Abstract Exploration and Refined Exploitation of Action Space | | 0
Streaming Autoregressive Video Generation via Diagonal Distillation | | 2
A Saccade-inspired Approach to Image Classification using Vision Transformer Attention Maps | | 0
MM-tau-p^2: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings | | 0
Fusing Semantic, Lexical, and Domain Perspectives for Recipe Similarity Estimation | | 0
AutoViVQA: A Large-Scale Automatically Constructed Dataset for Vietnamese Visual Question Answering | | 0
ENIGMA-360: An Ego-Exo Dataset for Human Behavior Understanding in Industrial Scenarios | | 0
Ego: Embedding-Guided Personalization of Vision-Language Models | | 0
LCA: Local Classifier Alignment for Continual Learning | | 0
MA-EgoQA: Question Answering over Egocentric Videos from Multiple Embodied Agents | | 1
Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums | | 0
Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation | | 0
Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning | | 0
Large language models can disambiguate opioid slang on social media | | 0
PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner | | 0
Fuel Gauge: Estimating Chain-of-Thought Length Ahead of Time in Large Multimodal Models | | 0
Overcoming Visual Clutter in Vision Language Action Models via Concept-Gated Visual Distillation | | 0
On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits | | 0
Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck | | 0
Utility Function is All You Need: LLM-based Congestion Control | | 0
Designing Service Systems from Textual Evidence | | 0
HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation | | 0
One Token, Two Fates: A Unified Framework via Vision Token Manipulation Against MLLMs Hallucination | | 0
Dynamic Knowledge Fusion for Multi-Domain Dialogue State Tracking | | 0
Beyond Interleaving: Causal Attention Reformulations for Generative Recommender Systems | | 0
GeoSense: Internalizing Geometric Necessity Perception for Multimodal Reasoning | | 0
Speech Codec Probing from Semantic and Phonetic Perspectives | | 0
Few-Shot Adaptation to Non-Stationary Environments via Latent Trend Embedding for Robotics | | 0
Graph-GRPO: Training Graph Flow Models with Reinforcement Learning | | 0
Reactive Writers: How Co-Writing with AI Changes How We Engage with Ideas | | 0
Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning | | 0
Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design | | 0
Motion Forcing: A Decoupled Framework for Robust Video Generation in Motion Dynamics | | 0
Effective Dataset Distillation for Spatio-Temporal Forecasting with Bi-dimensional Compression | | 0
Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks | | 0
Fighting Hallucinations with Counterfactuals: Diffusion-Guided Perturbations for LVLM Hallucination Suppression | | 0
Domain-Adaptive Health Indicator Learning with Degradation-Stage Synchronized Sampling and Cross-Domain Autoencoder | | 0
AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory | | 0
The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training | | 0
FAR-Dex: Few-shot Data Augmentation and Adaptive Residual Policy Refinement for Dexterous Manipulation | | 0
Spatio-Temporal Forecasting of Retaining Wall Deformation: Mitigating Error Accumulation via Multi-Resolution ConvLSTM Stacking Ensemble | | 0
Page 158 of 13232