| Interactive OT Gym: A Reinforcement Learning-Based Interactive Optical tweezer (OT)-Driven Microrobotics Simulation Platform | May 27, 2025 | Reinforcement Learning (RL) | —Unverified | 0 |
| Towards Human-Like Trajectory Prediction for Autonomous Driving: A Behavior-Centric Approach | May 27, 2025 | Autonomous DrivingPrediction | —Unverified | 0 |
| PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems | May 27, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Topological Deep Learning for Speech Data | May 27, 2025 | Deep LearningPhoneme Recognition | —Unverified | 0 |
| Model as Loss: A Self-Consistent Training Paradigm | May 27, 2025 | DecoderSpeech Enhancement | —Unverified | 0 |
| Leveraging LLM and Self-Supervised Training Models for Speech Recognition in Chinese Dialects: A Comparative Analysis | May 27, 2025 | Accented Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| Study of Lightweight Transformer Architectures for Single-Channel Speech Enhancement | May 27, 2025 | Speech Enhancement | —Unverified | 0 |
| Text-Queried Audio Source Separation via Hierarchical Modeling | May 27, 2025 | Audio Source SeparationNatural Language Queries | —Unverified | 0 |
| MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection | May 27, 2025 | Triplet | —Unverified | 0 |
| Dub-S2ST: Textless Speech-to-Speech Translation for Seamless Dubbing | May 27, 2025 | Speech-to-Speech TranslationTranslation | —Unverified | 0 |
| Foundation Model Hidden Representations for Heart Rate Estimation from Auscultation | May 27, 2025 | Heart rate estimation | —Unverified | 0 |
| Phir Hera Fairy: An English Fairytaler is a Strong Faker of Fluent Speech in Low-Resource Indian Languages | May 27, 2025 | Synthetic Data GenerationVoice Cloning | —Unverified | 0 |
| PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts | May 27, 2025 | DiversityRhythm | —Unverified | 0 |
| An LLM-as-Judge Metric for Bridging the Gap with Human Evaluation in SE Tasks | May 27, 2025 | Code GenerationCode Summarization | —Unverified | 0 |
| Can Agents Fix Agent Issues? | May 27, 2025 | | —Unverified | 0 |
| Active Learning-Enhanced Dual Control for Angle-Only Initial Relative Orbit Determination | May 27, 2025 | Active LearningState Estimation | —Unverified | 0 |
| Analysis of Joint Radar and Communication in Disaster Scenarios | May 27, 2025 | Disaster ResponseManagement | —Unverified | 0 |
| Transfer learning for multifidelity simulation-based inference in cosmology | May 27, 2025 | Density Estimationparameter estimation | —Unverified | 0 |
| Iterative Corpus Refinement for Materials Property Prediction Based on Scientific Texts | May 27, 2025 | Property Prediction | —Unverified | 0 |
| Identifying Heart Attack Risk in Vulnerable Population: A Machine Learning Approach | May 27, 2025 | Hybrid Machine Learning | —Unverified | 0 |
| Hybrid Machine Learning and Mathematical Modeling for Tumor Dynamics Prediction: Comparing SPIONs against mNP-FDG | May 27, 2025 | Hybrid Machine Learning | —Unverified | 0 |
| Leveraging Diffusion Models for Parameterized Quantum Circuit Generation | May 27, 2025 | Computational EfficiencyDenoising | —Unverified | 0 |
| STA-Risk: A Deep Dive of Spatio-Temporal Asymmetries for Breast Cancer Risk Prediction | May 27, 2025 | Prediction | —Unverified | 0 |
| Highly Efficient Non-Separable Transforms for Next Generation Video Coding | May 27, 2025 | Video Compression | —Unverified | 0 |
| Optimizing Deep Learning for Skin Cancer Classification: A Computationally Efficient CNN with Minimal Accuracy Trade-Off | May 27, 2025 | Cancer ClassificationMedical Image Analysis | —Unverified | 0 |
| Prostate Cancer Screening with Artificial Intelligence-Enhanced Micro-Ultrasound: A Comparative Study with Traditional Methods | May 27, 2025 | DiagnosticSensitivity | —Unverified | 0 |
| Boosting Adversarial Transferability via High-Frequency Augmentation and Hierarchical-Gradient Fusion | May 27, 2025 | Adversarial Attack | —Unverified | 0 |
| Theoretical Bounds for Optimized Doppler-Based Motion Detection in UHF-RFID Readers | May 27, 2025 | Motion DetectionTAG | —Unverified | 0 |
| Gauss-Ramanujan Functions: Constructions, Properties, and Applications in Communications and Signal Processing | May 27, 2025 | Benchmarking | —Unverified | 0 |
| CiUAV: A Multi-Task 3D Indoor Localization System for UAVs based on Channel State Information | May 27, 2025 | Indoor Localization | —Unverified | 0 |
| Dynamic Resource Allocation in Distributed MIMO-LEO Satellite Networks | May 27, 2025 | Scheduling | —Unverified | 0 |
| A Unified RCS Modeling of Typical Targets for 3GPP ISAC Channel Standardization and Experimental Analysis | May 27, 2025 | Integrated sensing and communicationISAC | —Unverified | 0 |
| Something's Fishy In The Data Lake: A Critical Re-evaluation of Table Union Search Benchmarks | May 27, 2025 | Representation Learning | —Unverified | 0 |
| GGBond: Growing Graph-Based AI-Agent Society for Socially-Aware Recommender Simulation | May 27, 2025 | AI AgentPersonality Alignment | —Unverified | 0 |
| SageAttention2++: A More Efficient Implementation of SageAttention2 | May 27, 2025 | QuantizationVideo Generation | CodeCode Available | 7 |
| Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits | May 27, 2025 | GPU | —Unverified | 0 |
| FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation | May 27, 2025 | BenchmarkingDecision Making | CodeCode Available | 1 |
| Enhancing Wearable Tap Water Audio Detection through Subclass Annotation in the HD-Epic Dataset | May 27, 2025 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 0 |
| Evaluating LLM Adaptation to Sociodemographic Factors: User Profile vs. Dialogue History | May 27, 2025 | | CodeCode Available | 0 |
| Label Leakage in Federated Inertial-based Human Activity Recognition | May 27, 2025 | Activity RecognitionFederated Learning | CodeCode Available | 0 |
| DeepConvContext: A Multi-Scale Approach to Timeseries Classification in Human Activity Recognition | May 27, 2025 | Action LocalizationActivity Recognition | CodeCode Available | 0 |
| Broad Spectrum Structure Discovery in Large-Scale Higher-Order Networks | May 27, 2025 | Link Prediction | CodeCode Available | 0 |
| DiMoSR: Feature Modulation via Multi-Branch Dilated Convolutions for Efficient Image Super-Resolution | May 27, 2025 | Computational EfficiencyImage Super-Resolution | CodeCode Available | 1 |
| Cardiac Digital Twins at Scale from MRI: Open Tools and Representative Models from ~55000 UK Biobank Participants | May 27, 2025 | Prognosis | CodeCode Available | 0 |
| Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation | May 27, 2025 | Large Language ModelLogical Reasoning | CodeCode Available | 1 |
| Music Source Restoration | May 27, 2025 | Music Source Separation | CodeCode Available | 1 |
| Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies | May 27, 2025 | Protein DesignReinforcement Learning (RL) | —Unverified | 0 |
| Scaling and Prompting for Improved End-to-End Spoken Grammatical Error Correction | May 27, 2025 | Grammatical Error Correction | —Unverified | 0 |
| RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving | May 27, 2025 | Code Generation | CodeCode Available | 0 |
| Wavelet Flow For Extragalactic Foreground Simulations | May 27, 2025 | | CodeCode Available | 0 |