| An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments | Jul 14, 2025 | Speech-to-Texttext-to-speech | —Unverified | 0 |
| Warehouse Spatial Question Answering with LLM Agent | Jul 14, 2025 | Question AnsweringSpatial Reasoning | CodeCode Available | 1 |
| WhisperKit: On-device Real-time ASR with Billion-Scale Transformers | Jul 14, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Mixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation | Jul 14, 2025 | | —Unverified | 0 |
| WildFX: A DAW-Powered Pipeline for In-the-Wild Audio FX Graph Modeling | Jul 14, 2025 | Music Generation | CodeCode Available | 1 |
| Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination | Jul 14, 2025 | MathMathematical Reasoning | CodeCode Available | 1 |
| Iceberg: Enhancing HLS Modeling with Synthetic Data | Jul 14, 2025 | Data AugmentationHigh-Level Synthesis | CodeCode Available | 0 |
| REST: Stress Testing Large Reasoning Models by Asking Multiple Problems at Once | Jul 14, 2025 | | CodeCode Available | 1 |
| A Simple Approximate Bayesian Inference Neural Surrogate for Stochastic Petri Net Models | Jul 14, 2025 | Bayesian InferenceEpidemiology | CodeCode Available | 0 |
| MLAR: Multi-layer Large Language Model-based Robotic Process Automation Applicant Tracking | Jul 14, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Wavelet-Enhanced Neural ODE and Graph Attention for Interpretable Energy Forecasting | Jul 14, 2025 | Graph AttentionTime Series Prediction | —Unverified | 0 |
| Scene-Aware Conversational ADAS with Generative AI for Real-Time Driver Assistance | Jul 14, 2025 | Autonomous Driving | —Unverified | 0 |
| On Gradual Semantics for Assumption-Based Argumentation | Jul 14, 2025 | | CodeCode Available | 0 |
| Text-Visual Semantic Constrained AI-Generated Image Quality Assessment | Jul 14, 2025 | Image DescriptionImage Quality Assessment | CodeCode Available | 1 |
| Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures | Jul 14, 2025 | Camera Pose EstimationPose Estimation | —Unverified | 0 |
| Overcoming catastrophic forgetting in neural networks | Jul 14, 2025 | Continual LearningL2 Regularization | —Unverified | 0 |
| Bridging Robustness and Generalization Against Word Substitution Attacks in NLP via the Growth Bound Matrix Approach | Jul 14, 2025 | Adversarial DefenseAdversarial Robustness | CodeCode Available | 0 |
| LifelongPR: Lifelong knowledge fusion for point cloud place recognition based on replay and prompt learning | Jul 14, 2025 | Autonomous DrivingContinual Learning | CodeCode Available | 0 |
| IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution | Jul 14, 2025 | Image Super-ResolutionSuper-Resolution | CodeCode Available | 1 |
| VoTranhAbyssCoreMicro and PoliticalCore: A Unified Framework for Simulating Complex Economic and Political Dynamics | Jul 14, 2025 | | CodeCode Available | 0 |
| Predictive Modeling: BIM Command Recommendation Based on Large-scale Usage Logs | Jul 13, 2025 | | CodeCode Available | 0 |
| TinyTroupe: An LLM-powered Multiagent Persona Simulation Toolkit | Jul 13, 2025 | | CodeCode Available | 0 |
| DRPCA-Net: Make Robust PCA Great Again for Infrared Small Target Detection | Jul 13, 2025 | | CodeCode Available | 0 |
| Auto-Regressively Generating Multi-View Consistent Images | Jul 13, 2025 | | CodeCode Available | 0 |
| SeqCSIST: Sequential Closely-Spaced Infrared Small Target Unmixing | Jul 13, 2025 | | CodeCode Available | 0 |
| EyeSeg: An Uncertainty-Aware Eye Segmentation Framework for AR/VR | Jul 13, 2025 | | CodeCode Available | 0 |
| Hear-Your-Click: Interactive Object-Specific Video-to-Audio Generation | Jul 13, 2025 | | CodeCode Available | 0 |
| ViSP: A PPO-Driven Framework for Sarcasm Generation with Contrastive Learning | Jul 13, 2025 | | CodeCode Available | 0 |
| When Schrödinger Bridge Meets Real-World Image Dehazing with Unpaired Training | Jul 13, 2025 | | CodeCode Available | 0 |
| Generative Cognitive Diagnosis | Jul 13, 2025 | | CodeCode Available | 0 |
| Efficient Multi-Person Motion Prediction by Lightweight Spatial and Temporal Interactions | Jul 13, 2025 | | CodeCode Available | 0 |
| Inter2Former: Dynamic Hybrid Attention for Efficient High-Precision Interactive | Jul 13, 2025 | CPUInteractive Segmentation | —Unverified | 0 |
| Landmark Detection for Medical Images using a General-purpose Segmentation Model | Jul 13, 2025 | Anatomical Landmark DetectionDiagnostic | —Unverified | 0 |
| Memory-Augmented SAM2 for Training-Free Surgical Video Segmentation | Jul 13, 2025 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Federated Learning with Graph-Based Aggregation for Traffic Forecasting | Jul 13, 2025 | Federated LearningGraph Learning | —Unverified | 0 |
| Lightweight Federated Learning over Wireless Edge Networks | Jul 13, 2025 | Bayesian OptimizationFederated Learning | —Unverified | 0 |
| Meta-Reinforcement Learning for Fast and Data-Efficient Spectrum Allocation in Dynamic Wireless Networks | Jul 13, 2025 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding | Jul 13, 2025 | Action SegmentationContrastive Learning | —Unverified | 0 |
| VST-Pose: A Velocity-Integrated Spatiotem-poral Attention Network for Human WiFi Pose Estimation | Jul 13, 2025 | 3D Pose EstimationPose Estimation | CodeCode Available | 0 |
| FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise | Jul 13, 2025 | Federated Learningimage-classification | —Unverified | 0 |
| Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI | Jul 13, 2025 | AI Agent | —Unverified | 0 |
| Prompt Engineering in Segment Anything Model: Methodologies, Applications, and Emerging Challenges | Jul 13, 2025 | Image SegmentationPrompt Engineering | —Unverified | 0 |
| DRAGD: A Federated Unlearning Data Reconstruction Attack Based on Gradient Differences | Jul 13, 2025 | Federated LearningReconstruction Attack | —Unverified | 0 |
| Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models | Jul 13, 2025 | AttributeBenchmarking | CodeCode Available | 0 |
| KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection | Jul 13, 2025 | Fake News DetectionMisinformation | —Unverified | 0 |
| BitParticle: Partializing Sparse Dual-Factors to Build Quasi-Synchronizing MAC Arrays for Energy-efficient DNNs | Jul 13, 2025 | Scheduling | —Unverified | 0 |
| AI-Enhanced Pediatric Pneumonia Detection: A CNN-Based Approach Using Data Augmentation and Generative Adversarial Networks (GANs) | Jul 13, 2025 | ClassificationData Augmentation | CodeCode Available | 0 |
| Planted in Pretraining, Swayed by Finetuning: A Case Study on the Origins of Cognitive Biases in LLMs | Jul 12, 2025 | | —Unverified | 0 |
| Fast3D: Accelerating 3D Multi-modal Large Language Models for Efficient 3D Scene Understanding | Jul 12, 2025 | | CodeCode Available | 0 |
| WellPINN: Accurate Well Representation for Transient Fluid Pressure Diffusion in Subsurface Reservoirs with Physics-Informed Neural Networks | Jul 12, 2025 | | CodeCode Available | 0 |