SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1130111350 of 661570 papers

TitleStatusHype
Buzz, Choose, Forget: A Meta-Bandit Framework for Bee-Like Decision Making0
Leveraging Taxonomy Similarity for Next Activity Prediction in Patient Treatment0
Beyond Accuracy: What Matters in Designing Well-Behaved Image Classification Models?0
RLJP: Legal Judgment Prediction via First-Order Logic Rule-enhanced with Large Language Models0
An Approximation Theory Perspective on Machine Learning0
ObfusQAte: A Proposed Framework to Evaluate LLM Robustness on Obfuscated Factual Question Answering0
Robust Adversarial Quantification via Conflict-Aware Evidential Deep Learning0
EgoWorld: Translating Exocentric View to Egocentric View using Rich Exocentric Observations0
Partial Weakly-Supervised Oriented Object Detection0
Talking Trees: Reasoning-Assisted Induction of Decision Trees for Tabular Data0
From Ambiguity to Accuracy: The Transformative Effect of Coreference Resolution on Retrieval-Augmented Generation systems0
Knowing When to Quit: Probabilistic Early Exits for Speech Separation0
Function Induction and Task Generalization: An Interpretability Study with Off-by-One Addition0
Finite-Dimensional Gaussian Approximation for Deep Neural Networks: Universality in Random Weights0
From Privacy to Trust in the Agentic Era: A Taxonomy of Challenges in Trustworthy Federated Learning Through the Lens of Trust Report 2.00
Self-Supervised Inductive Logic Programming0
When Relevance Meets Novelty: Dual-Stable Periodic Optimization for Serendipitous Recommendation0
WebDS: An End-to-End Benchmark for Web-based Data Science0
ToolVQA: A Dataset for Multi-step Reasoning VQA with External Tools0
SEVADE: Self-Evolving Multi-Agent Analysis with Decoupled Evaluation for Hallucination-Resistant Irony Detection0
GaitSnippet: Gait Recognition Beyond Unordered Sets and Ordered Sequences0
Stochastic Self-Guidance for Training-Free Enhancement of Diffusion Models2
Subsampling Factorization Machine Annealing0
Adaptive Quantized Planetary Crater Detection System for Autonomous Space Exploration0
An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software0
Deep Hierarchical Learning with Nested Subspace Networks for Large Language Models0
Bridging Computational Social Science and Deep Learning: Cultural Dissemination-Inspired Graph Neural Networks0
Raw-JPEG Adapter: Efficient Raw Image Compression with JPEG0
Best-of- -- Asymptotic Performance of Test-Time LLM Ensembling0
Towards Personalized Deep Research: Benchmarks and Evaluations0
Weakly Supervised Concept Learning with Class-Level Priors for Interpretable Medical Diagnosis0
Learning Explicit Single-Cell Dynamics Using ODE Representations0
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics0
Annotation-Efficient Universal Honesty Alignment0
Kaleido: Open-Sourced Multi-Subject Reference Video Generation Model0
Citation Failure: Definition, Analysis and Efficient Mitigation0
Measurement-Consistent Langevin Corrector for Stabilizing Latent Diffusion Inverse Problem Solvers0
Can a Small Model Learn to Look Before It Leaps? Dynamic Learning and Proactive Correction for Hallucination Detection0
Categorical Emotions or Appraisals - Which Emotion Model Explains Argument Convincingness Better?0
AudAgent: Automated Auditing of Privacy Policy Compliance in AI Agents0
DecNefSimulator: A Modular, Interpretable Framework for Decoded Neurofeedback Simulation Using Generative Models0
Better audio representations are more brain-like: linking model-brain alignment with performance in downstream auditory tasks0
MatPedia: A Universal Generative Foundation for High-Fidelity Material Synthesis0
Freezing of Gait Prediction using Proactive Agent that Learns from Selected Experience and DDQN Algorithm0
What Triggers my Model? Contrastive Explanations Inform Gender Choices by Translation Models0
Learning under Distributional Drift: Prequential Reproducibility as an Intrinsic Statistical Resource0
OASI: Objective-Aware Surrogate Initialization for Multi-Objective Bayesian Optimization in TinyML Keyword Spotting0
Online Robust Reinforcement Learning with General Function Approximation0
Deterministic Coreset for Lp Subspace0
AI Skills Improve Job Prospects: Causal Evidence from a Hiring Experiment0
Show:102550
← PrevPage 227 of 13232Next →