SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 18851-18900 of 474278 papers

Title | Status | Hype
Accelerating Diffusion LLMs via Adaptive Parallel Decoding | - | 0
Evaluating Robot Policies in a World Model | - | 0
Using Diffusion Ensembles to Estimate Uncertainty for End-to-End Autonomous Driving | - | 0
Diffusion Graph Neural Networks for Robustness in Olfaction Sensors and Datasets | - | 0
LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | - | 0
IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling | - | 0
Quantifying and Reducing Speaker Heterogeneity within the Common Voice Corpus for Phonetic Analysis | - | 0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | - | 0
Learning to Upsample and Upmix Audio in the Latent Domain | - | 0
LID Models are Actually Accent Classifiers: Implications and Solutions for LID on Accented Speech | - | 0
Quality Assessment of Noisy and Enhanced Speech with Limited Data: UWB-NTIS System for VoiceMOS 2024 and Beyond | - | 0
No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction | - | 0
CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning | - | 0
Bi-Level optimization for parameter estimation of differential equations using interpolation | Code | 0
MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | Code | 2
A Foundation Model for Non-Destructive Defect Identification from Vibrational Spectra | Code | 0
Reinforcement Learning for Hanabi | - | 0
Towards Temporally Explainable Dysarthric Speech Clarity Assessment | Code | 0
SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models | Code | 1
DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition | - | 0
Position: Olfaction Standardization is Essential for the Advancement of Embodied Artificial Intelligence | - | 0
XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark | Code | 0
Thinking Out of the Box: Hybrid SAT Solving by Unconstrained Continuous Optimization | - | 0
The iNaturalist Sounds Dataset | - | 0
Length Aware Speech Translation for Video Dubbing | - | 0
Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs | Code | 1
An LLM Agent for Functional Bug Detection in Network Protocols | Code | 1
AVROBUSTBENCH: Benchmarking the Robustness of Audio-Visual Recognition Models at Test-Time | Code | 1
Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication Training | Code | 0
Synergizing LLMs with Global Label Propagation for Multimodal Fake News Detection | Code | 1
PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge | - | 0
Channel-Imposed Fusion: A Simple yet Effective Method for Medical Time Series Classification | - | 0
The Security Threat of Compressed Projectors in Large Vision-Language Models | - | 0
SST: Self-training with Self-adaptive Thresholding for Semi-supervised Learning | - | 0
A Systematic Review of Metaheuristics-Based and Machine Learning-Driven Intrusion Detection Systems in IoT | - | 0
Video Signature: In-generation Watermarking for Latent Video Diffusion Models | - | 0
Blockchain-Enabled Privacy-Preserving Second-Order Federated Edge Learning in Personalized Healthcare | - | 0
Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment | - | 0
Deep-Learning-Driven Prefetching for Far Memory | - | 0
Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client Selection | - | 0
Learning with Calibration: Exploring Test-Time Computing of Spatio-Temporal Forecasting | - | 0
MIRROR: Cognitive Inner Monologue Between Conversational Turns for Persistent Reflection and Reasoning in Conversational LLMs | Code | 1
Do Language Models Mirror Human Confidence? Exploring Psychological Insights to Address Overconfidence in LLMs | Code | 0
Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual Tracking | Code | 0
PackHero: A Scalable Graph-based Approach for Efficient Packer Identification | Code | 0
Federated learning framework for collaborative remaining useful life prognostics: an aircraft engine case study | Code | 0
Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | Code | 0
COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning | - | 0
G2S: A General-to-Specific Learning Framework for Temporal Knowledge Graph Forecasting with Large Language Models | Code | 0
ChemReservoir -- An Open-Source Framework for Chemically-Inspired Reservoir Computing | Code | 0
Page 378 of 9486