SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1870118750 of 474278 papers

TitleStatusHype
Uni-LoRA: One Vector is All You Need0
HouseTS: A Large-Scale, Multimodal Spatiotemporal U.S. Housing Dataset0
3D Skeleton-Based Action Recognition: A Review0
Dynamic Chunking and Selection for Reading Comprehension of Ultra-Long Context in Large Language ModelsCode0
AceVFI: A Comprehensive Survey of Advances in Video Frame InterpolationCode2
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video Diffusion TransformersCode9
A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy0
Localized Forest Fire Risk Prediction: A Department-Aware Approach for Operational Decision Support0
GigaAM: Efficient Self-Supervised Learner for Speech RecognitionCode4
HASRD: Hierarchical Acoustic and Semantic Representation Disentanglement0
Bridging Subjective and Objective QoE: Operator-Level Aggregation Using LLM-Based Comment Analysis and Network MOS Comparison0
Language-Guided Multi-Agent Learning in Simulations: A Unified Framework and Evaluation0
CountingFruit: Real-Time 3D Fruit Counting with Language-Guided Semantic Gaussian Splatting0
Camera Trajectory Generation: A Comprehensive Survey of Methods, Metrics, and Future Directions0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG0
Test Automation for Interactive Scenarios via Promptable Traffic Simulation0
OG-VLA: 3D-Aware Vision Language Action Model via Orthographic Image Generation0
DriveMind: A Dual-VLM based Reinforcement Learning Framework for Autonomous Driving0
NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Prediction0
Source Tracing of Synthetic Speech Systems Through Paralinguistic Pre-Trained Representations0
Towards Fusion of Neural Audio Codec-based Representations with Spectral for Heart Murmur Classification via Bandit-based Cross-Attention Mechanism0
PARROT: Synergizing Mamba and Attention-based SSL Pre-Trained Models via Parallel Branch Hadamard Optimal Transport for Speech Emotion Recognition0
From Words to Waves: Analyzing Concept Formation in Speech and Text-Based Foundation Models0
Learning More with Less: Self-Supervised Approaches for Low-Resource Speech Emotion Recognition0
PseudoVC: Improving One-shot Voice Conversion with Pseudo Paired Data0
Leveraging Large Language Models for Sarcastic Speech Annotation in Sarcasm Detection0
General-purpose audio representation learning for real-world sound scenes0
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching0
CLAP-ART: Automated Audio Captioning with Semantic-rich Audio Representation Tokenizer0
Legal Compliance Evaluation of Smart Contracts Generated By Large Language Models0
Action Dependency Graphs for Globally Optimal Coordinated Reinforcement Learning0
FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual FusionCode2
How Programming Concepts and Neurons Are Shared in Code Language ModelsCode0
In-the-wild Audio Spatialization with Flexible Text-guided LocalizationCode0
HADA: Human-AI Agent Decision Alignment Architecture0
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents0
Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and AssessmentCode0
Behavioral Augmentation of UML Class Diagrams: An Empirical Study of Large Language Models for Method GenerationCode0
Speech Unlearning0
Counterfactual Activation Editing for Post-hoc Prosody and Mispronunciation Correction in TTS Models0
Choices and their Provenance: Explaining Stable Solutions of Abstract Argumentation Frameworks0
Multiverse Through Deepfakes: The MultiFakeVerse Dataset of Person-Centric Visual and Conceptual ManipulationsCode0
What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-trainingCode0
Enhancing Speech Instruction Understanding and Disambiguation in Robotics via Speech Prosody0
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement0
Towards Predicting Any Human Trajectory In Context0
Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching0
HMPC-assisted Adversarial Inverse Reinforcement Learning for Smart Home Energy Management0
Beyond Attention: Learning Spatio-Temporal Dynamics with Emergent Interpretable Topologies0
Crowdsourcing MUSHRA Tests in the Age of Generative Speech Technologies: A Comparative Analysis of Subjective and Objective Testing MethodsCode1
Show:102550
← PrevPage 375 of 9486Next →