SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 77517800 of 661570 papers

TitleStatusHype
AI-Enhanced Spatial Cellular Traffic Demand Prediction with Contextual Clustering and Error Correction for 5G/6G Planning0
Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments0
Ergodicity in reinforcement learning0
Towards Intelligent Spectrum Management: Spectrum Demand Estimation Using Graph Neural Networks0
An Extreme Multi-label Text Classification (XMTC) Library Dataset: What if we took "Use of Practical AI in Digital Libraries" seriously?0
Comparative Analysis of Modern Machine Learning Models for Retail Sales Forecasting0
X-WIN: Building Chest Radiograph World Model via Predictive Sensing0
Solving adversarial examples requires solving exponential misalignment0
Fish Audio S2 Technical ReportCode0
Beyond Scalars: Evaluating and Understanding LLM Reasoning via Geometric Progress and Stability0
Multi-Person Pose Estimation Evaluation Using Optimal Transportation and Improved Pose Matching0
Frames2Residual: Spatiotemporal Decoupling for Self-Supervised Video Denoising0
Layer Consistency Matters: Elegant Latent Transition Discrepancy for Generalizable Synthetic Image DetectionCode0
Self-Scaled Broyden Family of Quasi-Newton Methods in JAXCode0
RandMark: On Random Watermarking of Visual Foundation Models0
UltrasoundAgents: Hierarchical Multi-Agent Evidence-Chain Reasoning for Breast Ultrasound Diagnosis0
Federated Learning-driven Beam Management in LEO 6G Non-Terrestrial Networks0
Neural Field Thermal Tomography: A Differentiable Physics Framework for Non-Destructive Evaluation0
Uni-ASR: Unified LLM-Based Architecture for Non-Streaming and Streaming Automatic Speech Recognition0
Higher-Order Modular Attention: Fusing Pairwise and Triadic Interactions for Protein Sequences0
Maximum Risk Minimization with Random Forests0
SiliconMind-V1: Multi-Agent Distillation and Debug-Reasoning Workflows for Verilog Code Generation0
Group Resonance Network: Learnable Prototypes and Multi-Subject Resonance for EEG Emotion Recognition0
FreeFly-Thinking : Aligning Chain-of-Thought Reasoning with Continuous UAV Navigation0
Equitable Multi-Task Learning for AI-RANs0
Pixel Motion Diffusion is What We Need for Robot Control0
Zero-Shot Transferable Solution Method for Parametric Optimal Control Problems0
Global Minimizers of Sigmoid Contrastive Loss0
MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis0
DSER: Spectral Epipolar Representation for Efficient Light Field Depth Estimation0
Silhouette-Driven Instance-Weighted k-means0
Boosting Cross-problem Generalization in Diffusion-Based Neural Combinatorial Solver via Inference Time Adaptation0
Leveraging Spatial Context for Positive Pair Sampling in Histopathology Image Representation Learning0
Training with Pseudo-Code for Instruction Following0
IntrinsicWeather: Controllable Weather Editing in Intrinsic Space0
The Yokai Learning Environment: Tracking Beliefs Over Space and Time0
Order Optimal Regret Bounds for Sharpe Ratio Optimization under Thompson Sampling0
Chain-of-Thought Compression Should Not Be Blind: V-Skip for Efficient Multimodal Reasoning via Dual-Path Anchoring0
Universal Dynamics with Globally Controlled Analog Quantum Simulators0
Tensor Train Completion from Fiberwise Observations Along a Single Mode0
Empirical PAC-Bayes Bounds for Markov Chains0
GDR-learners: Orthogonal Learning of Generative Models for Potential Outcomes0
One-Prompt Strikes Back: Sparse Mixture of Experts for Prompt-based Continual Learning0
Overlap-Adaptive Regularization for Conditional Average Treatment Effect Estimation0
Geopolitics, Geoeconomics, and Sovereign Risk: Different Shocks, Different Channels0
MonitorVLM:A Vision Language Framework for Safety Violation Detection in Mining Operations0
A Systematic Evaluation of Self-Supervised Learning for Label-Efficient Sleep Staging with Wearable EEG0
Autoencoding-Free Context Compression for LLMs via Contextual Semantic AnchorsCode0
Assessing the Political Fairness of Multilingual LLMs: A Case Study based on a 21-way Multiparallel EuroParl Dataset0
Absolute indices for determining compactness, separability and number of clusters0
Show:102550
← PrevPage 156 of 13232Next →