SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1095111000 of 661570 papers

TitleStatusHype
Measuring the Fragility of Trust: Devising Credibility Index via Explanation Stability (CIES) for Business Decision Support Systems0
Layer by layer, module by module: Choose both for optimal OOD probing of ViT0
PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration0
Loop Closure via Maximal Cliques in 3D LiDAR-Based SLAM0
Video-based Locomotion Analysis for Fish Health Monitoring0
Model Change for Description Logic Concepts0
Thermodynamic Response Functions in Singular Bayesian Models0
FuseDiff: Symmetry-Preserving Joint Diffusion for Dual-Target Structure-Based Drug Design0
Keeping the Evidence Chain: Semantic Evidence Allocation for Training-Free Token Pruning in Video Temporal Grounding0
Uni-LVC: A Unified Method for Intra- and Inter-Mode Learned Video Compression0
POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation0
Let's Talk, Not Type: An Oral-First Multi-Agent Architecture for Guaraní0
Distant Object Localisation from Noisy Image Segmentation Sequences0
OSPO: Object-Centric Self-Improving Preference Optimization for Text-to-Image Generation0
Quantitative convergence of trained single layer neural networks to Gaussian processes0
Reinforcement Learning for Power-Flow Network Analysis0
From Phase Grounding to Intelligent Surgical Narratives0
Feature Resemblance: On the Theoretical Understanding of Analogical Reasoning in Transformers0
Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry0
A Behaviour-Aware Federated Forecasting Framework for Distributed Stand-Alone Wind Turbines0
Random Dot Product Graphs as Dynamical Systems: Limitations and Opportunities0
MI-DETR: A Strong Baseline for Moving Infrared Small Target Detection with Bio-Inspired Motion IntegrationCode0
The Consensus Trap: Dissecting Subjectivity and the "Ground Truth" Illusion in Data Annotation0
AILS-NTUA at SemEval-2026 Task 3: Efficient Dimensional Aspect-Based Sentiment Analysis0
Pessimistic Auxiliary Policy for Offline Reinforcement Learning0
Curriculum Learning for Efficient Chain-of-Thought Distillation via Structure-Aware Masking and GRPO0
EVMbench: Evaluating AI Agents on Smart Contract Security0
Legal interpretation and AI: from expert systems to argumentation and LLMs0
Asymptotic Behavior of Multi--Task Learning: Implicit Regularization and Double Descent Effects0
Interpretable Motion Artificat Detection in structural Brain MRI0
Lifelong Language-Conditioned Robotic Manipulation Learning0
LHM-Humanoid: Learning a Unified Policy for Long-Horizon Humanoid Whole-Body Loco-Manipulation in Diverse Messy Environments0
The unreasonable effectiveness of pattern matching0
Machine Learning for analysis of Multiple Sclerosis cross-tissue bulk and single-cell transcriptomics data0
The Cascade Equivalence Hypothesis: When Do Speech LLMs Behave Like ASRLLM Pipelines?0
Evaluating and Correcting Human Annotation Bias in Dynamic Micro-Expression RecognitionCode0
Optimizing Multi-Modality Trackers via Significance-Regularized TuningCode0
EgoTraj-Bench: Towards Robust Trajectory Prediction Under Ego-view Noisy ObservationsCode0
Detecting Hallucinations in Authentic LLM-Human InteractionsCode0
TerraCodec: Compressing Optical Earth Observation DataCode0
RePo: Language Models with Context Re-PositioningCode0
Yuan3.0 Ultra: A Trillion-Parameter Enterprise-Oriented MoE LLMCode0
Agentic Very Long Video UnderstandingCode0
PerfGuard: A Performance-Aware Agent for Visual Content GenerationCode0
Dr. Seg: Revisiting GRPO Training for Visual Large Language Models through Perception-Oriented DesignCode0
TumorFlow: Physics-Guided Longitudinal MRI Synthesis of Glioblastoma GrowthCode0
Toward Real-world Infrared Image Super-Resolution: A Unified Autoregressive Framework and Benchmark DatasetCode0
Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual RepresentationCode0
Locality-Attending Vision TransformerCode0
MPCEval: A Benchmark for Multi-Party Conversation GenerationCode0
Show:102550
← PrevPage 220 of 13232Next →