SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 20,151–20,200 of 474,278 papers

Title | Status | Hype
FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design | Code | 1
UniTalk: Towards Universal Active Speaker Detection in Real World Scenarios | Code | 1
CSI-Bench: A Large-Scale In-the-Wild Dataset for Multi-task WiFi Sensing | Code | 1
Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection | Code | 1
Synonymous Variational Inference for Perceptual Image Compression | Code | 0
HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined in HDDL with OpenAI Gym | Code | 0
Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference | - | 0
Multipath cycleGAN for harmonization of paired and unpaired low-dose lung computed tomography reconstruction kernels | - | 0
Data-Driven Control of Continuous-Time LTI Systems via Non-Minimal Realizations | Code | 0
A Human-Centric Approach to Explainable AI for Personalized Education | Code | 0
Improving Respiratory Sound Classification with Architecture-Agnostic Knowledge Distillation from Ensembles | Code | 0
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation | - | 0
ARiSE: Auto-Regressive Multi-Channel Speech Enhancement | - | 0
Operator-Splitting Methods for Neuromorphic Circuit Simulation | - | 0
Subspecialty-Specific Foundation Model for Intelligent Gastrointestinal Pathology | - | 0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis | - | 0
A Synthetic Business Cycle Approach to Counterfactual Analysis with Nonstationary Macroeconomic Data | - | 0
Causal Inference for Experiments with Latent Outcomes: Key Results and Their Implications for Design and Analysis | - | 0
Risk-Sensitive Conformal Prediction for Catheter Placement Detection in Chest X-rays | - | 0
Algorithm Unrolling-based Denoising of Multimodal Graph Signals | - | 0
Online Fair Division for Personalized 2-Value Instances | - | 0
Target Localization with Coprime Multistatic MIMO Radar via Coupled Canonical Polyadic Decomposition Based on Joint Eigenvalue Decomposition | - | 0
SimProcess: High Fidelity Simulation of Noisy ICS Physical Processes | Code | 0
ChatPD: An LLM-driven Paper-Dataset Networking System | Code | 0
On data usage and predictive behavior of data-driven predictive control with 1-norm regularization | Code | 0
ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge | Code | 1
GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git | Code | 0
Voice Adaptation for Swiss German | - | 0
LiDAR Based Semantic Perception for Forklifts in Outdoor Environments | - | 0
Surf2CT: Cascaded 3D Flow Matching Models for Torso 3D CT Synthesis from Skin Surface | - | 0
Visual Cues Support Robust Turn-taking Prediction in Noise | Code | 0
On the performance of machine-learning-assisted Monte Carlo in sampling from simple statistical physics models | Code | 0
A memristive model of spatio-temporal excitability | - | 0
Practical Adversarial Attacks on Stochastic Bandits via Fake Data Injection | - | 0
AudioTurbo: Fast Text-to-Audio Generation with Rectified Diffusion | - | 0
Articulatory modeling of the S-shaped F2 trajectories observed in Öhman's spectrographic analysis of VCV syllables | - | 0
Aspects of density approximation by tensor trains | - | 0
Zero-Shot Vision Encoder Grafting via LLM Surrogates | Code | 2
Spatial Knowledge Graph-Guided Multimodal Synthesis | - | 0
Evaluation of LLMs in Speech is Often Flawed: Test Set Contamination in Large Language Models for Speech Recognition | - | 0
MAMBO-NET: Multi-Causal Aware Modeling Backdoor-Intervention Optimization for Medical Image Segmentation Network | - | 0
Reference-Guided Identity Preserving Face Restoration | - | 0
Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese | Code | 0
B-XAIC Dataset: Benchmarking Explainable AI for Graph Neural Networks Using Chemical Data | Code | 0
SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem | Code | 1
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control | Code | 2
Hybrid Batch Normalisation: Resolving the Dilemma of Batch Normalisation in Federated Learning | Code | 1
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global Illumination | Code | 4
ReSCORE: Label-free Iterative Retriever Training for Multi-hop Question Answering with Relevance-Consistency Supervision | - | 0
Evaluation of LLMs in Medical Text Summarization: The Role of Vocabulary Adaptation in High OOV Settings | Code | 0
Page 404 of 9486