| Psi-Sampler: Initial Particle Sampling for SMC-Based Inference-Time Reward Alignment in Score Models | Jun 2, 2025 | Denoising, Image Generation | —Unverified | 0 |
| Bayes optimal learning of attention-indexed models | Jun 2, 2025 | Deep Attention | Code Available | 0 |
| On-device Streaming Discrete Speech Units | Jun 2, 2025 | | Code Available | 0 |
| Variational Adaptive Noise and Dropout towards Stable Recurrent Neural Networks | Jun 2, 2025 | Imitation Learning, Learning Theory | —Unverified | 0 |
| Riemannian Time Warping: Multiple Sequence Alignment in Curved Spaces | Jun 2, 2025 | Multiple Sequence Alignment, speech-recognition | —Unverified | 0 |
| Stochastically Dominant Peer Prediction | Jun 2, 2025 | Fairness, Prediction | —Unverified | 0 |
| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Trade-offs in Data Memorization via Strong Data Processing Inequalities | Jun 2, 2025 | Binary Classification, Memorization | —Unverified | 0 |
| Self-supervised Latent Space Optimization with Nebula Variational Coding | Jun 2, 2025 | Form, Metric Learning | —Unverified | 0 |
| Quantitative Error Feedback for Quantization Noise Reduction of Filtering over Graphs | Jun 2, 2025 | Quantization | —Unverified | 0 |
| An Empirical Study of Group Conformity in Multi-Agent Systems | Jun 2, 2025 | Diversity | —Unverified | 0 |
| MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs | Jun 2, 2025 | Instruction Following, Text Generation | —Unverified | 0 |
| Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach | Jun 2, 2025 | Retrieval | —Unverified | 0 |
| Learning Sparsity for Effective and Efficient Music Performance Question Answering | Jun 2, 2025 | Audio-visual Question Answering, Question Answering | —Unverified | 0 |
| A Data-Based Architecture for Flight Test without Test Points | Jun 2, 2025 | Prediction | —Unverified | 0 |
| FreqPolicy: Frequency Autoregressive Visuomotor Policy with Continuous Tokens | Jun 2, 2025 | Computational Efficiency | —Unverified | 0 |
| Sparse Imagination for Efficient Visual World Model Planning | Jun 2, 2025 | Computational Efficiency, Decision Making | —Unverified | 0 |
| Generating Diverse Challenging Terrains for Legged Robots Using Quality-Diversity Algorithm | Jun 2, 2025 | Diversity | —Unverified | 0 |
| Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion | Jun 2, 2025 | Action Detection, Activity Detection | —Unverified | 0 |
| Are Mamba-based Audio Foundation Models the Best Fit for Non-Verbal Emotion Recognition? | Jun 2, 2025 | Emotion Recognition, Mamba | —Unverified | 0 |
| Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric | Jun 2, 2025 | Automatic Speech Recognition, speech-recognition | —Unverified | 0 |
| Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion | Jun 2, 2025 | Face Swapping, Metric Learning | —Unverified | 0 |
| SALF-MOS: Speaker Agnostic Latent Features Downsampled for MOS Prediction | Jun 2, 2025 | Speech Synthesis, text-to-speech | —Unverified | 0 |
| Analyzing the Importance of Blank for CTC-Based Knowledge Distillation | Jun 2, 2025 | Automatic Speech Recognition, Knowledge Distillation | —Unverified | 0 |
| Data-assimilated model-informed reinforcement learning | Jun 2, 2025 | model, reinforcement-learning | —Unverified | 0 |
| Hybrid SIS Dynamics for Demand Modeling of Frequently Updated Products | Jun 2, 2025 | parameter estimation | —Unverified | 0 |
| Bregman Centroid Guided Cross-Entropy Method | Jun 2, 2025 | Diversity, Model-based Reinforcement Learning | —Unverified | 0 |
| Update-Aware Robust Optimal Model Predictive Control for Nonlinear Systems | Jun 2, 2025 | Model Predictive Control | —Unverified | 0 |
| A Vertical Approach to Designing and Managing Sustainable Heterogeneous Edge Data Centers | Jun 2, 2025 | Scheduling | —Unverified | 0 |
| Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees | Jun 2, 2025 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 |
| Prediction of the Conditional Probability Densities of Time Interval Extrema with Application to Risk-Sensitive Scheduling | Jun 2, 2025 | Scheduling | —Unverified | 0 |
| Probing Quantum Spin Systems with Kolmogorov-Arnold Neural Network Quantum States | Jun 2, 2025 | Kolmogorov-Arnold Networks | —Unverified | 0 |
| NepTrain and NepTrainKit: Automated Active Learning and Visualization Toolkit for Neuroevolution Potentials | Jun 2, 2025 | Active Learning, Computational Efficiency | —Unverified | 0 |
| Overcoming Data Scarcity in Scanning Tunnelling Microscopy Image Segmentation | Jun 2, 2025 | Few-Shot Learning, Image Segmentation | —Unverified | 0 |
| GSCodec Studio: A Modular Framework for Gaussian Splat Compression | Jun 2, 2025 | Benchmarking | Code Available | 2 |
| SEMNAV: A Semantic Segmentation-Driven Approach to Visual Semantic Navigation | Jun 2, 2025 | Domain Adaptation, Navigate | Code Available | 1 |
| Two-Stage Learning of Stabilizing Neural Controllers via Zubov Sampling and Iterative Domain Expansion | Jun 2, 2025 | | Code Available | 0 |
| Lessons Learned from the URGENT 2024 Speech Enhancement Challenge | Jun 2, 2025 | Speech Enhancement | Code Available | 0 |
| Trajectory First: A Curriculum for Discovering Diverse Policies | Jun 2, 2025 | Diversity, Reinforcement Learning (RL) | —Unverified | 0 |
| Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| DualMap: Online Open-Vocabulary Semantic Mapping for Natural Language Navigation in Dynamic Changing Scenes | Jun 2, 2025 | Natural Language Queries, Navigate | Code Available | 2 |
| MUDI: A Multimodal Biomedical Dataset for Understanding Pharmacodynamic Drug-Drug Interactions | Jun 2, 2025 | | Code Available | 0 |
| Learning collective variables that preserve transition rates | Jun 2, 2025 | | Code Available | 0 |
| Provably Safe Reinforcement Learning from Analytic Gradients | Jun 2, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| WoMAP: World Models For Embodied Open-Vocabulary Object Localization | Jun 2, 2025 | Active Object Localization, Efficient Exploration | —Unverified | 0 |
| Captivity-Escape Games as a Means for Safety in Online Motion Generation | Jun 2, 2025 | Motion Generation, Motion Planning | —Unverified | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | Jun 2, 2025 | Attribute, Speech Extraction | —Unverified | 0 |
| Investigating the Reasonable Effectiveness of Speaker Pre-Trained Models and their Synergistic Power for SingMOS Prediction | Jun 2, 2025 | Speaker Recognition | —Unverified | 0 |
| WCTC-Biasing: Retraining-free Contextual Biasing ASR with Wildcard CTC-based Keyword Spotting and Inter-layer Biasing | Jun 2, 2025 | Keyword Spotting, speech-recognition | —Unverified | 0 |
| Greening AI-enabled Systems with Software Engineering: A Research Agenda for Environmentally Sustainable AI Practices | Jun 2, 2025 | Benchmarking | —Unverified | 0 |