| Accelerating Diffusion LLMs via Adaptive Parallel Decoding | May 31, 2025 | | —Unverified | 0 |
| Evaluating Robot Policies in a World Model | May 31, 2025 | modelVideo Generation | —Unverified | 0 |
| Using Diffusion Ensembles to Estimate Uncertainty for End-to-End Autonomous Driving | May 31, 2025 | Autonomous DrivingCARLA longest6 | —Unverified | 0 |
| Diffusion Graph Neural Networks for Robustness in Olfaction Sensors and Datasets | May 31, 2025 | | —Unverified | 0 |
| LoHoVLA: A Unified Vision-Language-Action Model for Long-Horizon Embodied Tasks | May 31, 2025 | Task PlanningVision-Language-Action | —Unverified | 0 |
| IMPACT: Iterative Mask-based Parallel Decoding for Text-to-Audio Generation with Diffusion Modeling | May 31, 2025 | AudioCapsAudio Generation | —Unverified | 0 |
| Quantifying and Reducing Speaker Heterogeneity within the Common Voice Corpus for Phonetic Analysis | May 31, 2025 | Diversity | —Unverified | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Upsample and Upmix Audio in the Latent Domain | May 31, 2025 | Audio CompressionBandwidth Extension | —Unverified | 0 |
| LID Models are Actually Accent Classifiers: Implications and Solutions for LID on Accented Speech | May 31, 2025 | Chunking | —Unverified | 0 |
| Quality Assessment of Noisy and Enhanced Speech with Limited Data: UWB-NTIS System for VoiceMOS 2024 and Beyond | May 31, 2025 | Prediction | —Unverified | 0 |
| No Audiogram: Leveraging Existing Scores for Personalized Speech Intelligibility Prediction | May 31, 2025 | Predictionspeech-recognition | —Unverified | 0 |
| CodeSense: a Real-World Benchmark and Dataset for Code Semantic Reasoning | May 31, 2025 | In-Context Learning | —Unverified | 0 |
| Bi-Level optimization for parameter estimation of differential equations using interpolation | May 31, 2025 | Model Discoveryparameter estimation | CodeCode Available | 0 |
| MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| A Foundation Model for Non-Destructive Defect Identification from Vibrational Spectra | May 31, 2025 | | CodeCode Available | 0 |
| Reinforcement Learning for Hanabi | May 31, 2025 | Card GamesDeep Reinforcement Learning | —Unverified | 0 |
| Towards Temporally Explainable Dysarthric Speech Clarity Assessment | May 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| SEED: A Benchmark Dataset for Sequential Facial Attribute Editing with Diffusion Models | May 31, 2025 | AttributeFacial Editing | CodeCode Available | 1 |
| DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition | May 31, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Position: Olfaction Standardization is Essential for the Advancement of Embodied Artificial Intelligence | May 31, 2025 | EthicsNavigate | —Unverified | 0 |
| XMAD-Bench: Cross-Domain Multilingual Audio Deepfake Benchmark | May 31, 2025 | Audio GenerationFace Swapping | CodeCode Available | 0 |
| Thinking Out of the Box: Hybrid SAT Solving by Unconstrained Continuous Optimization | May 31, 2025 | Combinatorial Optimization | —Unverified | 0 |
| The iNaturalist Sounds Dataset | May 31, 2025 | Benchmarking | —Unverified | 0 |
| Length Aware Speech Translation for Video Dubbing | May 31, 2025 | Translation | —Unverified | 0 |
| Reasoning Like an Economist: Post-Training on Economic Problems Induces Strategic Generalization in LLMs | May 31, 2025 | | CodeCode Available | 1 |
| An LLM Agent for Functional Bug Detection in Network Protocols | May 31, 2025 | | CodeCode Available | 1 |
| AVROBUSTBENCH: Benchmarking the Robustness of Audio-Visual Recognition Models at Test-Time | May 31, 2025 | BenchmarkingTest-time Adaptation | CodeCode Available | 1 |
| Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication Training | May 31, 2025 | Dialogue Generation | CodeCode Available | 0 |
| Synergizing LLMs with Global Label Propagation for Multimodal Fake News Detection | May 31, 2025 | Fake News Detection | CodeCode Available | 1 |
| PointODE: Lightweight Point Cloud Learning with Neural Ordinary Differential Equations on Edge | May 31, 2025 | CPU | —Unverified | 0 |
| Channel-Imposed Fusion: A Simple yet Effective Method for Medical Time Series Classification | May 31, 2025 | ClassificationEEG | —Unverified | 0 |
| The Security Threat of Compressed Projectors in Large Vision-Language Models | May 31, 2025 | Computational Efficiency | —Unverified | 0 |
| SST: Self-training with Self-adaptive Thresholding for Semi-supervised Learning | May 31, 2025 | | —Unverified | 0 |
| A Systematic Review of Metaheuristics-Based and Machine Learning-Driven Intrusion Detection Systems in IoT | May 31, 2025 | feature selectionIntrusion Detection | —Unverified | 0 |
| Video Signature: In-generation Watermarking for Latent Video Diffusion Models | May 31, 2025 | DecoderVideo Generation | —Unverified | 0 |
| Blockchain-Enabled Privacy-Preserving Second-Order Federated Edge Learning in Personalized Healthcare | May 31, 2025 | Federated LearningPrivacy Preserving | —Unverified | 0 |
| Machine vs Machine: Using AI to Tackle Generative AI Threats in Assessment | May 31, 2025 | Specificity | —Unverified | 0 |
| Deep-Learning-Driven Prefetching for Far Memory | May 31, 2025 | Deep Learning | —Unverified | 0 |
| Assortment of Attention Heads: Accelerating Federated PEFT with Head Pruning and Strategic Client Selection | May 31, 2025 | Federated Learningparameter-efficient fine-tuning | —Unverified | 0 |
| Learning with Calibration: Exploring Test-Time Computing of Spatio-Temporal Forecasting | May 31, 2025 | Spatio-Temporal Forecasting | —Unverified | 0 |
| MIRROR: Cognitive Inner Monologue Between Conversational Turns for Persistent Reflection and Reasoning in Conversational LLMs | May 31, 2025 | | CodeCode Available | 1 |
| Do Language Models Mirror Human Confidence? Exploring Psychological Insights to Address Overconfidence in LLMs | May 31, 2025 | MMLU | CodeCode Available | 0 |
| Towards Effective and Efficient Adversarial Defense with Diffusion Models for Robust Visual Tracking | May 31, 2025 | Adversarial DefenseDenoising | CodeCode Available | 0 |
| PackHero: A Scalable Graph-based Approach for Efficient Packer Identification | May 31, 2025 | Graph Matching | CodeCode Available | 0 |
| Federated learning framework for collaborative remaining useful life prognostics: an aircraft engine case study | May 31, 2025 | Federated Learning | CodeCode Available | 0 |
| Translate With Care: Addressing Gender Bias, Neutrality, and Reasoning in Large Language Model Translations | May 31, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| COGNATE: Acceleration of Sparse Tensor Programs on Emerging Hardware using Transfer Learning | May 31, 2025 | Transfer Learning | —Unverified | 0 |
| G2S: A General-to-Specific Learning Framework for Temporal Knowledge Graph Forecasting with Large Language Models | May 31, 2025 | In-Context LearningKnowledge Graphs | CodeCode Available | 0 |
| ChemReservoir -- An Open-Source Framework for Chemically-Inspired Reservoir Computing | May 31, 2025 | | CodeCode Available | 0 |