| KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction | May 29, 2025 | Question Answering | Code Available | 3 |
| AnchorAttention: Difference-Aware Sparse Attention with Stripe Granularity | May 29, 2025 | | Code Available | 1 |
| ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork | May 29, 2025 | | Code Available | 0 |
| Implicit Inversion turns CLIP into a Decoder | May 29, 2025 | Decoder, Image Generation | Code Available | 0 |
| ZPressor: Bottleneck-Aware Compression for Scalable Feed-Forward 3DGS | May 29, 2025 | 3DGS, GPU | Code Available | 2 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion Generation, Video Generation | Code Available | 1 |
| Multi-Sourced Compositional Generalization in Visual Question Answering | May 29, 2025 | Question Answering, Visual Question Answering | Code Available | 0 |
| Spectrotemporal Modulation: Efficient and Interpretable Feature Representation for Classifying Speech, Music, and Environmental Sounds | May 29, 2025 | Audio Classification | Code Available | 0 |
| Advancing Image Super-resolution Techniques in Remote Sensing: A Comprehensive Survey | May 29, 2025 | Image Super-Resolution, Super-Resolution | —Unverified | 0 |
| How Animals Dance (When You're Not Looking) | May 29, 2025 | Image Generation | —Unverified | 0 |
| Cross-Task Experiential Learning on LLM-based Multi-Agent Collaboration | May 29, 2025 | Large Language Model | —Unverified | 0 |
| Unsupervised Transcript-assisted Video Summarization and Highlight Detection | May 29, 2025 | Highlight Detection, Reinforcement Learning (RL) | —Unverified | 0 |
| Weakly-supervised Localization of Manipulated Image Regions Using Multi-resolution Learned Features | May 29, 2025 | Bayesian Inference, Image Manipulation | —Unverified | 0 |
| Knowledge Distillation for Reservoir-based Classifier: Human Activity Recognition | May 29, 2025 | Activity Recognition, Edge-computing | —Unverified | 0 |
| DINGO: Constrained Inference for Diffusion LLMs | May 29, 2025 | Math | —Unverified | 0 |
| Knowledge Insulating Vision-Language-Action Models: Train Fast, Run Fast, Generalize Better | May 29, 2025 | continuous-control, Continuous Control | —Unverified | 0 |
| Mobi-π: Mobilizing Your Robot Learning Policy | May 29, 2025 | Novel View Synthesis | —Unverified | 0 |
| Autoregressive Meta-Actions for Unified Controllable Trajectory Generation | May 29, 2025 | Autonomous Driving, Trajectory Prediction | —Unverified | 0 |
| VLM-RRT: Vision Language Model Guided RRT Search for Autonomous UAV Navigation | May 29, 2025 | Disaster Response, Language Modeling | —Unverified | 0 |
| Trajectory Generator Matching for Time Series | May 29, 2025 | Time Series, Time Series Generation | —Unverified | 0 |
| LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization | May 29, 2025 | Descriptive, Vector Graphics | —Unverified | 0 |
| AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning | May 29, 2025 | Geometry Problem Solving, Mathematical Reasoning | —Unverified | 0 |
| Be.FM: Open Foundation Models for Human Behavior | May 29, 2025 | Decision Making | —Unverified | 0 |
| MCTSr-Zero: Self-Reflective Psychological Counseling Dialogues Generation via Principles and Adaptive Exploration | May 29, 2025 | | —Unverified | 0 |
| MCP Safety Training: Learning to Refuse Falsely Benign MCP Exploits using Improved Preference Alignment | May 29, 2025 | RAG, Retrieval | —Unverified | 0 |
| A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy | May 29, 2025 | Decision Making | —Unverified | 0 |
| Disrupting Vision-Language Model-Driven Navigation Services via Adversarial Object Fusion | May 29, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| Verify-in-the-Graph: Entity Disambiguation Enhancement for Complex Claim Verification with Interactive Graph Representation | May 29, 2025 | Claim Verification, Entity Disambiguation | —Unverified | 0 |
| Exposing the Impact of GenAI for Cybercrime: An Investigation into the Dark Side | May 29, 2025 | Experimental Design, Time Series Analysis | —Unverified | 0 |
| TailorSQL: An NL2SQL System Tailored to Your Query Workload | May 29, 2025 | Large Language Model, Translation | —Unverified | 0 |
| Accelerated Training of Federated Learning via Second-Order Methods | May 29, 2025 | Federated Learning, Second-order methods | —Unverified | 0 |
| Quality assessment of 3D human animation: Subjective and objective evaluation | May 29, 2025 | Human Animation | —Unverified | 0 |
| Sustainable Carbon-Aware and Water-Efficient LLM Scheduling in Geo-Distributed Cloud Datacenters | May 29, 2025 | Scheduling | —Unverified | 0 |
| DOPPLER: Dual-Policy Learning for Device Assignment in Asynchronous Dataflow Graphs | May 29, 2025 | Scheduling | —Unverified | 0 |
| Identity resolution of software metadata using Large Language Models | May 29, 2025 | Fairness | —Unverified | 0 |
| TrackVLA: Embodied Visual Tracking in the Wild | May 29, 2025 | Language Modeling, Language Modelling | —Unverified | 0 |
| System Identification for Virtual Sensor-Based Model Predictive Control: Application to a 2-DoF Direct-Drive Robotic Arm | May 29, 2025 | Model Predictive Control | —Unverified | 0 |
| Learning coordinated badminton skills for legged manipulators | May 29, 2025 | Navigate, reinforcement-learning | —Unverified | 0 |
| ZeroSep: Separate Anything in Audio with Zero Training | May 29, 2025 | Audio Source Separation, Denoising | —Unverified | 0 |
| MGE-LDM: Joint Latent Diffusion for Simultaneous Music Generation and Source Extraction | May 29, 2025 | Imputation, Music Generation | —Unverified | 0 |
| Bridging the Gap Between Semantic and User Preference Spaces for Multi-modal Music Representation Learning | May 29, 2025 | Collaborative Filtering, Contrastive Learning | —Unverified | 0 |
| Towards Robust Overlapping Speech Detection: A Speaker-Aware Progressive Approach Using WavLM | May 29, 2025 | Action Detection, Activity Detection | —Unverified | 0 |
| Patient Domain Supervised Contrastive Learning for Lung Sound Classification Using Mobile Phone | May 29, 2025 | Contrastive Learning, Diagnostic | —Unverified | 0 |
| Afterburner: Reinforcement Learning Facilitates Self-Improving Code Efficiency Optimization | May 29, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| Latent Representations for Control Design with Provable Stability and Safety Guarantees | May 29, 2025 | Collision Avoidance | —Unverified | 0 |
| Interspeech 2025 URGENT Speech Enhancement Challenge | May 29, 2025 | Diversity, Speech Enhancement | —Unverified | 0 |
| GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion | May 29, 2025 | Depth Estimation, Image to Video Generation | —Unverified | 0 |
| Joint Phase Shift Optimization and Precoder Selection for RIS-Assisted 5G NR MIMO Systems | May 29, 2025 | Benchmarking | —Unverified | 0 |
| Global optimization of graph acquisition functions for neural architecture search | May 29, 2025 | Bayesian Optimization, global-optimization | —Unverified | 0 |
| Grower-in-the-Loop Interactive Reinforcement Learning for Greenhouse Climate Control | May 29, 2025 | Reinforcement Learning (RL) | —Unverified | 0 |