| CapSpeech: Enabling Downstream Applications in Style-Captioned Text-to-Speech | Jun 3, 2025 | Speech Synthesistext-to-speech | —Unverified | 0 |
| Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss | Jun 3, 2025 | Automatic Lyrics TranscriptionAutomatic Speech Recognition | —Unverified | 0 |
| Automated Web Application Testing: End-to-End Test Case Generation with Large Language Models and Screen Transition Graphs | Jun 3, 2025 | FormScript Generation | —Unverified | 0 |
| Ensemble-MIX: Enhancing Sample Efficiency in Multi-Agent RL Using Ensemble Methods | Jun 3, 2025 | Ensemble LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Target Sensing Performance in Disaster-Specific ISAC Networks | Jun 3, 2025 | Integrated sensing and communicationISAC | —Unverified | 0 |
| Quantized Dissipative Uncertain Model for Fractional T_S Fuzzy systems with Time_Varying Delays Under Networked Control System | Jun 3, 2025 | Quantization | —Unverified | 0 |
| Recursive Privacy-Preserving Estimation Over Markov Fading Channels | Jun 3, 2025 | Privacy PreservingState Estimation | —Unverified | 0 |
| Unit Commitment with Cost-Oriented Temporal Resolution | Jun 3, 2025 | Bilevel OptimizationClustering | —Unverified | 0 |
| Dynamic Epsilon Scheduling: A Multi-Factor Adaptive Perturbation Budget for Adversarial Training | Jun 3, 2025 | Adversarial RobustnessScheduling | —Unverified | 0 |
| Grasp2Grasp: Vision-Based Dexterous Grasp Translation via Schrödinger Bridges | Jun 3, 2025 | Translation | —Unverified | 0 |
| Adversarial Attacks on Robotic Vision Language Action Models | Jun 3, 2025 | Vision-Language-Action | CodeCode Available | 1 |
| Solving the Pod Repositioning Problem with Deep Reinforced Adaptive Large Neighborhood Search | Jun 3, 2025 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| ThinkTank: A Framework for Generalizing Domain-Specific AI Agent Systems into Universal Collaborative Intelligence Platforms | Jun 3, 2025 | AI AgentRetrieval-augmented Generation | CodeCode Available | 1 |
| NetPress: Dynamically Generated LLM Benchmarks for Network Applications | Jun 3, 2025 | Benchmarking | CodeCode Available | 1 |
| Accelerating Model-Based Reinforcement Learning using Non-Linear Trajectory Optimization | Jun 3, 2025 | Model-based Reinforcement Learning | —Unverified | 0 |
| IP-Dialog: Evaluating Implicit Personalization in Dialogue Systems with Synthetic Data | Jun 3, 2025 | AttributeSynthetic Data Generation | —Unverified | 0 |
| The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing | Jun 3, 2025 | Feature ImportanceSentence | CodeCode Available | 0 |
| Towards Source Attribution of Singing Voice Deepfake with Multimodal Foundation Models | Jun 3, 2025 | Face Swapping | CodeCode Available | 0 |
| Grounded Vision-Language Interpreter for Integrated Task and Motion Planning | Jun 3, 2025 | Motion PlanningTask and Motion Planning | —Unverified | 0 |
| Adaptive Differential Denoising for Respiratory Sounds Classification | Jun 3, 2025 | Audio ClassificationClassification | CodeCode Available | 1 |
| Prompt-Unseen-Emotion: Zero-shot Expressive Speech Synthesis with Prompt-LLM Contextual Knowledge for Mixed Emotions | Jun 3, 2025 | Expressive Speech SynthesisPrompt Learning | —Unverified | 0 |
| Learned Controllers for Agile Quadrotors in Pursuit-Evasion Games | Jun 3, 2025 | Continual LearningReinforcement Learning (RL) | —Unverified | 0 |
| Rodrigues Network for Learning Robot Actions | Jun 3, 2025 | Imitation LearningInductive Bias | —Unverified | 0 |
| How do Pre-Trained Models Support Software Engineering? An Empirical Study in Hugging Face | Jun 3, 2025 | Code GenerationText Generation | —Unverified | 0 |
| Backpressure-based Mean-field Type Game for Scheduling in Multi-Hop Wireless Sensor Networks | Jun 3, 2025 | Scheduling | —Unverified | 0 |
| IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation | Jun 3, 2025 | 3D geometryVideo Generation | —Unverified | 0 |
| Overcoming Data Scarcity in Multi-Dialectal Arabic ASR via Whisper Fine-Tuning | Jun 3, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Structural Vibration Monitoring with Diffractive Optical Processors | Jun 3, 2025 | Autonomous NavigationStructural Health Monitoring | —Unverified | 0 |
| Axiomatics of Restricted Choices by Linear Orders of Sets with Minimum as Fallback | Jun 3, 2025 | Abstract Argumentation | —Unverified | 0 |
| On the influence of language similarity in non-target speaker verification trials | Jun 3, 2025 | Speaker Verification | —Unverified | 0 |
| Adaptive Graph Pruning for Multi-Agent Communication | Jun 3, 2025 | Code GenerationLarge Language Model | CodeCode Available | 0 |
| MotionRAG-Diff: A Retrieval-Augmented Diffusion Framework for Long-Term Music-to-Dance Generation | Jun 3, 2025 | Contrastive LearningMotion Synthesis | —Unverified | 0 |
| Rethinking Machine Unlearning in Image Generation Models | Jun 3, 2025 | BenchmarkingImage Generation | CodeCode Available | 1 |
| TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression | Jun 3, 2025 | | CodeCode Available | 1 |
| EgoVLM: Policy Optimization for Egocentric Video Understanding | Jun 3, 2025 | EgoSchemaQuestion Answering | CodeCode Available | 0 |
| Towards Explicit Geometry-Reflectance Collaboration for Generalized LiDAR Segmentation in Adverse Weather | Jun 3, 2025 | LIDAR Semantic SegmentationSemantic Segmentation | —Unverified | 0 |
| InterRVOS: Interaction-aware Referring Video Object Segmentation | Jun 3, 2025 | 8kObject | —Unverified | 0 |
| Multi Layered Autonomy and AI Ecologies in Robotic Art Installations | Jun 3, 2025 | Ethics | —Unverified | 0 |
| Beyond Text Compression: Evaluating Tokenizers Across Scales | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence | Jun 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons | Jun 3, 2025 | Atari GamesDecision Making | —Unverified | 0 |
| Enriching Location Representation with Detailed Semantic Information | Jun 3, 2025 | Contrastive Learning | —Unverified | 0 |
| ATAG: AI-Agent Application Threat Assessment with Attack Graphs | Jun 3, 2025 | AI Agent | —Unverified | 0 |
| Spatial Association Between Near-Misses and Accident Blackspots in Sydney, Australia: A Getis-Ord G_i^* Analysis | Jun 3, 2025 | Feature Importance | —Unverified | 0 |
| Corrigibility as a Singular Target: A Vision for Inherently Reliable Foundation Models | Jun 3, 2025 | Synthetic Data Generation | —Unverified | 0 |
| TestAgent: An Adaptive and Intelligent Expert for Human Assessment | Jun 3, 2025 | Large Language ModelQuestion Selection | —Unverified | 0 |
| Data Leakage and Deceptive Performance: A Critical Examination of Credit Card Fraud Detection Methodologies | Jun 3, 2025 | Fraud Detection | —Unverified | 0 |
| Universal Reusability in Recommender Systems: The Case for Dataset- and Task-Independent Frameworks | Jun 3, 2025 | Feature EngineeringModel Selection | —Unverified | 0 |
| A Learned Cost Model-based Cross-engine Optimizer for SQL Workloads | Jun 3, 2025 | Multi-Task Learning | —Unverified | 0 |
| Rethinking Dynamic Networks and Heterogeneous Computing with Automatic Parallelization | Jun 3, 2025 | Cloud Computing | —Unverified | 0 |