SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1620116250 of 474278 papers

TitleStatusHype
A strengthened bound on the number of states required to characterize maximum parsimony distance0
LLM-Powered CPI Prediction Inference with Online Text Time SeriesCode0
Generalized Gaussian Entropy Model for Point Cloud Attribute Compression with Dynamic Likelihood Intervals0
Geometry Reduced Order Modeling (GROM) with application to modeling of glymphatic functionCode0
An Interpretable Two-Stage Feature Decomposition Method for Deep Learning-based SAR ATR0
Intelligent Travel Activity Monitoring: Generalized Distributed Acoustic Sensing Approaches0
Deep Semantic Segmentation for Multi-Source Localization Using Angle of Arrival Measurements0
Knockoffs Inference under Privacy Constraints0
Assessing the Quality of Denoising Diffusion Models in Wasserstein Distance: Noisy Score and Optimal BoundsCode0
A Cytology Dataset for Early Detection of Oral Squamous Cell CarcinomaCode0
Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math ReasoningCode2
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical ReasoningCode2
SkillBlender: Towards Versatile Humanoid Whole-Body Loco-Manipulation via Skill BlendingCode2
Empirical Quantification of Spurious Correlations in Malware Detection0
A Study on Speech Assessment with Visual Cues0
Natural Language Guided Ligand-Binding Protein Design0
Learning Obfuscations Of LLM Embedding Sequences: Stained Glass Transform0
SLED: A Speculative LLM Decoding Framework for Efficient Edge Serving0
Adv-BMT: Bidirectional Motion Transformer for Safety-Critical Traffic Scenario Generation0
Fine-Tuning Large Audio-Language Models with LoRA for Precise Temporal Localization of Prolonged Exposure Therapy Elements0
Multi-Agent Language Models: Advancing Cooperation, Coordination, and Adaptation0
Time-Unified Diffusion Policy with Action Discrimination for Robotic Manipulation0
DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects0
Estimating the Number of Components in Panel Data Finite Mixture Regression Models with an Application to Production Function Heterogeneity0
Diffusion index forecasts under weaker loadings: PCA, ridge regression, and random projections0
You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks0
Neutral theory of cooperators0
Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri WomenCode0
Alice and the Caterpillar: A more descriptive null model for assessing data mining resultsCode0
Metritocracy: Representative Metrics for Lite Benchmarks0
Efficient Prediction of SO(3)-Equivariant Hamiltonian Matrices via SO(2) Local Frames0
Probability-One Optimization of Generalized Rayleigh Quotient Sum For Multi-Source Generalized Total Least-Squares0
Model Predictive Control-Based Optimal Energy Management of Autonomous Electric Vehicles Under Cold Temperatures0
CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation0
Simulation-trained conditional normalizing flows for likelihood approximation: a case study in stress regulation kinetics in yeastCode0
SAGE: Exploring the Boundaries of Unsafe Concept Domain with Semantic-Augment ErasingCode0
The COVID-19 Inflation Weighting in IsraelCode0
Beyond Nash Equilibrium: Bounded Rationality of LLMs and humans in Strategic Decision-making0
OWSM-Biasing: Contextualizing Open Whisper-Style Speech Models for Automatic Speech Recognition with Dynamic Vocabulary0
Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models0
Optimization and Control Technologies for Renewable-Dominated Hydrogen-Blended Integrated Gas-Electricity System: A Review0
Integer-Clustering Optimization of Hydrogen and Battery EV Fleets Considering DERs0
BemaGANv2: A Tutorial and Comparative Survey of GAN-based Vocoders for Long-Term Audio GenerationCode1
CoLMbo: Speaker Language Model for Descriptive ProfilingCode0
Tightly-Coupled LiDAR-IMU-Leg Odometry with Online Learned Leg Kinematics Incorporating Foot Tactile InformationCode2
Attention, Please! Revisiting Attentive Probing for Masked Image ModelingCode1
Towards Open Foundation Language Model and Corpus for Macedonian: A Low-Resource Language0
eFlesh: Highly customizable Magnetic Touch Sensing using Cut-Cell Microstructures0
From Judgment to Interference: Early Stopping LLM Harmful Outputs via Streaming Content Monitoring0
Synergizing Reinforcement Learning and Genetic Algorithms for Neural Combinatorial Optimization0
Show:102550
← PrevPage 325 of 9486Next →