SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 28512900 of 659983 papers

TitleStatusHype
Beyond Outliers: A Data-Free Layer-wise Mixed-Precision Quantization Approach Driven by Numerical and Structural Dual-Sensitivity0
OnlineHMR: Video-based Online World-Grounded Human Mesh Recovery0
WebPII: Benchmarking Visual PII Detection for Computer-Use Agents0
Public Profile Matters: A Scalable Integrated Approach to Recommend Citations in the Wild0
PJB: A Reasoning-Aware Benchmark for Person-Job Retrieval0
Motion-Adaptive Temporal Attention for Lightweight Video Generation with Stable Diffusion0
Causal Representation Learning on High-Dimensional Data: Benchmarks, Reproducibility, and Evaluation Metrics0
TimeAPN: Adaptive Amplitude-Phase Non-Stationarity Normalization for Time Series Forecasting0
VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation0
FACE-net: Factual Calibration and Emotion Augmentation for Retrieval-enhanced Emotional Video Captioning0
Text-to-Stage: Spatial Layouts from Long-form Narratives0
UAV-CB: A Complex-Background RGB-T Dataset and Local Frequency Bridge Network for UAV Detection0
QuantFL: Sustainable Federated Learning for Edge IoT via Pre-Trained Model Quantisation0
EI: Early Intervention for Multimodal Imaging based Disease Recognition0
UniSem: Generalizable Semantic 3D Reconstruction from Sparse Unposed Images0
PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation0
KineVLA: Towards Kinematics-Aware Vision-Language-Action Models with Bi-Level Action Decomposition0
Per-Domain Generalizing Policies: On Learning Efficient and Robust Q-Value Functions (Extended Version with Technical Appendix)0
Deep Learning-Based Airway Segmentation in Systemic Lupus Erythematosus Patients with Interstitial Lung Disease (SLE-ILD): A Comparative High-Resolution CT Analysis0
CLeAN: Continual Learning Adaptive Normalization in Dynamic Environments0
Conditional Inverse Learning of Time-Varying Reproduction Numbers Inference0
Identifying Latent Actions and Dynamics from Offline Data via Demonstrator Diversity0
Edit-As-Act: Goal-Regressive Planning for Open-Vocabulary 3D Indoor Scene Editing0
AdaMuS: Adaptive Multi-view Sparsity Learning for Dimensionally Unbalanced Data0
Physics-Aware Machine Learning for Seismic and Volcanic Signal Interpretation0
Attention Sinks Induce Gradient Sinks0
Benchmarking Reinforcement Learning via Stochastic Converse Optimality: Generating Systems with Known Optimal Policies0
Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain0
Adaptive Guidance for Retrieval-Augmented Masked Diffusion Models0
From Virtual Environments to Real-World Trials: Emerging Trends in Autonomous Driving0
Federated Distributional Reinforcement Learning with Distributional Critic Regularization0
Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation0
SARE: Sample-wise Adaptive Reasoning for Training-free Fine-grained Visual Recognition0
TAPESTRY: From Geometry to Appearance via Consistent Turntable Videos0
Event-Centric Human Value Understanding in News-Domain Texts: An Actor-Conditioned, Multi-Granularity Benchmark0
Omni-3DEdit: Generalized Versatile 3D Editing in One-Pass0
RHYME-XT: A Neural Operator for Spatiotemporal Control Systems0
ShapleyLaw: A Game-Theoretic Approach to Multilingual Scaling Laws0
ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation0
IndicSafe: A Benchmark for Evaluating Multilingual LLM Safety in South Asia0
Only relative ranks matter in weight-clustered large language models0
Multi-Armed Sequential Hypothesis Testing by Betting0
CARE: Covariance-Aware and Rank-Enhanced Decomposition for Enabling Multi-Head Latent Attention0
Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training0
AHOY! Animatable Humans under Occlusion from YouTube Videos with Gaussian Splatting and Video Diffusion Priors0
Versatile Editing of Video Content, Actions, and Dynamics without Training0
ScheduleMe: Multi-Agent Calendar Assistant0
TRiMS: Real-Time Tracking of Minimal Sufficient Length for Efficient Reasoning via RL0
A Deep Surrogate Model for Robust and Generalizable Long-Term Blast Wave Prediction0
Unlearnable phases of matter0
Show:102550
← PrevPage 58 of 13200Next →