SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1420114250 of 474278 papers

TitleStatusHype
Lost in Translation? Converting RegExes for Log Parsing into Dynatrace Pattern Language0
Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders0
MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility ApplicationsCode1
PEVLM: Parallel Encoding for Vision-Language Models0
Spotting Out-of-Character Behavior: Atomic-Level Evaluation of Persona Fidelity in Open-Ended GenerationCode0
KnowRL: Exploring Knowledgeable Reinforcement Learning for FactualityCode1
Video Compression for Spatiotemporal Earth System DataCode2
Towards an Introspective Dynamic Model of Globally Distributed Computing Infrastructures0
EvDetMAV: Generalized MAV Detection from Moving Event CamerasCode1
Diffusion-based Task-oriented Semantic Communications with Model Inversion Attack0
Can One Safety Loop Guard Them All? Agentic Guard Rails for Federated Computing0
MILAAP: Mobile Link Allocation via Attention-based Prediction0
MAIZX: A Carbon-Aware Framework for Optimizing Cloud Computing Emissions0
A Principled Path to Fitted Distributional Evaluation0
Controlled Retrieval-augmented Context Evaluation for Long-form RAG0
Sampling Matters in Explanations: Towards Trustworthy Attribution Analysis Building Block in Visual Models through Maximizing Explanation Certainty0
Higher-Order Neuromorphic Ising Machines -- Autoencoders and Fowler-Nordheim Annealers are all you need for Scalability0
HARPT: A Corpus for Analyzing Consumers' Trust and Privacy Concerns in Mobile Health Apps0
SoK: Can Synthetic Images Replace Real Data? A Survey of Utility and Privacy of Synthetic Image Generation0
Machine Learning with Privacy for Protected Attributes0
Private Model Personalization Revisited0
SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing ImagesCode1
Consensus-Driven Uncertainty for Robotic Grasping based on RGB PerceptionCode0
Self-Supervised Multimodal NeRF for Autonomous DrivingCode1
PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket ConditioningCode2
CoVE: Compressed Vocabulary Expansion Makes Better LLM-based Recommender SystemsCode0
EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson RegressionCode1
One Prototype Is Enough: Single-Prototype Activation for Interpretable Image ClassificationCode0
Identifying Physically Realizable Triggers for Backdoored Face Recognition Networks0
WebGuard++:Interpretable Malicious URL Detection via Bidirectional Fusion of HTML Subgraphs and Multi-Scale Convolutional BERT0
Network Structures as an Attack Surface: Topology-Based Privacy Leakage in Federated Learning0
KnowML: Improving Generalization of ML-NIDS with Attack Knowledge Graphs0
PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty0
Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy0
From Reproduction to Replication: Evaluating Research Agents with Progressive Code MaskingCode0
Assessing Risk of Stealing Proprietary Models for Medical Imaging TasksCode0
Machine-Learning-Assisted Photonic Device Development: A Multiscale Approach from Theory to Characterization0
Fast and Distributed Equivariant Graph Neural Networks by Virtual Node LearningCode1
ToSA: Token Merging with Spatial AwarenessCode0
An ab initio foundation model of wavefunctions that accurately describes chemical bond breakingCode2
Quantum Neural Networks for Propensity Score Estimation and Survival Analysis in Observational Biomedical Studies0
Elucidated Rolling Diffusion Models for Probabilistic Weather ForecastingCode1
Context Attribution with Multi-Armed Bandit Optimization0
DiaLLMs: EHR Enhanced Clinical Conversational System for Clinical Test Recommendation and Diagnosis Prediction0
Persona-Assigned Large Language Models Exhibit Human-Like Motivated ReasoningCode0
Achieving Trustworthy Real-Time Decision Support Systems with Low-Latency Interpretable AI Models0
The Most Important Features in Generalized Additive Models Might Be Groups of Features0
Robotics Under Construction: Challenges on Job Sites0
Position: Machine Learning Conferences Should Establish a "Refutations and Critiques" Track0
MNN-AECS: Energy Optimization for LLM Decoding on Mobile Devices via Adaptive Core Selection0
Show:102550
← PrevPage 285 of 9486Next →