SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 17451–17500 of 474278 papers

| Title | Status | Hype |
| --- | --- | --- |
| On Efficient Estimation of Distributional Treatment Effects under Covariate-Adaptive Randomization | Code | 0 |
| Information Bargaining: Bilateral Commitment in Bayesian Persuasion | Code | 0 |
| Benchmarking Misuse Mitigation Against Covert Adversaries | Code | 0 |
| Antithetic Noise in Diffusion Models |  | 0 |
| Robust sensor fusion against on-vehicle sensor staleness |  | 0 |
| Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM |  | 0 |
| Trajectory Entropy: Modeling Game State Stability from Multimodality Trajectory Prediction |  | 0 |
| Unintended Harms of Value-Aligned LLMs: Psychological and Empirical Insights | Code | 0 |
| Splat and Replace: 3D Reconstruction with Repetitive Elements |  | 0 |
| ScriptDoctor: Automatic Generation of PuzzleScript Games via Large Language Models and Tree Search |  | 0 |
| Proactive Assistant Dialogue Generation from Streaming Egocentric Videos |  | 0 |
| DeepFake Doctor: Diagnosing and Treating Audio-Video Fake Detection |  | 0 |
| Hierarchical and Collaborative LLM-Based Control for Multi-UAV Motion and Communication in Integrated Terrestrial and Non-Terrestrial Networks |  | 0 |
| Improving choice model specification using reinforcement learning |  | 0 |
| Saffron-1: Towards an Inference Scaling Paradigm for LLM Safety Assurance | Code | 0 |
| Variational Inference for Quantum HyperNetworks |  | 0 |
| Training-Free Query Optimization via LLM-Based Plan Similarity |  | 0 |
| TADA: Training-free Attribution and Out-of-Domain Detection of Audio Deepfakes | Code | 0 |
| Unlocking Chemical Insights: Superior Molecular Representations from Intermediate Encoder Layers | Code | 0 |
| Recommender systems, stigmergy, and the tyranny of popularity |  | 0 |
| Small Models, Big Support: A Local LLM Framework for Teacher-Centric Content Creation and Assessment using RAG and CAG |  | 0 |
| Evaluating AI-Powered Learning Assistants in Engineering Higher Education: Student Engagement, Ethical Challenges, and Policy Implications |  | 0 |
| The Geometry of Extended Kalman Filters on Manifolds with Affine Connection |  | 0 |
| Machine learning for in-situ composition mapping in a self-driving magnetron sputtering system |  | 0 |
| Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning | Code | 1 |
| DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation | Code | 1 |
| Revealing hidden correlations from complex spatial distributions: Adjacent Correlation Analysis | Code | 1 |
| Mapping correlations and coherence: adjacency-based approach to data visualization and regularity discovery | Code | 1 |
| Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows | Code | 0 |
| When Better Features Mean Greater Risks: The Performance-Privacy Trade-Off in Contrastive Learning | Code | 0 |
| Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques |  | 0 |
| Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning |  | 0 |
| Scalable unsupervised feature selection via weight stability | Code | 0 |
| The Optimization Paradox in Clinical AI Multi-Agent Systems | Code | 0 |
| SDS-Net: Shallow-Deep Synergism-detection Network for infrared small target detection | Code | 1 |
| Domain Adaptation in Agricultural Image Analysis: A Comprehensive Review from Shallow Models to Deep Learning |  | 0 |
| RecGPT: A Foundation Model for Sequential Recommendation | Code | 2 |
| Membership Inference Attacks for Unseen Classes |  | 0 |
| DynamicMind: A Tri-Mode Thinking System for Large Language Models |  | 0 |
| Textile Analysis for Recycling Automation using Transfer Learning and Zero-Shot Foundation Models |  | 0 |
| Securing Traffic Sign Recognition Systems in Autonomous Vehicles |  | 0 |
| Graph Persistence goes Spectral |  | 0 |
| Large Language Models Can Be a Viable Substitute for Expert Political Surveys When a Shock Disrupts Traditional Measurement Approaches |  | 0 |
| Distribution-Level AirComp for Wireless Federated Learning under Data Scarcity and Heterogeneity |  | 0 |
| Multi-Modal Multi-Task Federated Foundation Models for Next-Generation Extended Reality Systems: Towards Privacy-Preserving Distributed Intelligence in AR/VR/MR |  | 0 |
| Future of Work with AI Agents: Auditing Automation and Augmentation Potential across the U.S. Workforce |  | 0 |
| When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation | Code | 3 |
| Few Labels are all you need: A Weakly Supervised Framework for Appliance Localization in Smart-Meter Series | Code | 0 |
| Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models | Code | 0 |
| Unleashing the Potential of Consistency Learning for Detecting and Grounding Multi-Modal Media Manipulation | Code | 1 |
Page 350 of 9486