| CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory | Mar 4, 2025 | CPUGPU | —Unverified | 0 |
| Evaluation of adaptive sampling methods in scenario generation for virtual safety impact assessment of pre-crash safety systems | Mar 2, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPUGPU | CodeCode Available | 1 |
| Two-stream Beats One-stream: Asymmetric Siamese Network for Efficient Visual Tracking | Mar 1, 2025 | CPUGPU | CodeCode Available | 1 |
| AMPLE: Event-Driven Accelerator for Mixed-Precision Inference of Graph Neural Networks | Feb 28, 2025 | CPUGPU | —Unverified | 0 |
| TeleRAG: Efficient Retrieval-Augmented Generation Inference with Lookahead Retrieval | Feb 28, 2025 | CPUGPU | —Unverified | 0 |
| AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs | Feb 27, 2025 | CPUGPU | —Unverified | 0 |
| LLMs Have Rhythm: Fingerprinting Large Language Models Using Inter-Token Times and Network Traffic Analysis | Feb 27, 2025 | CPUGPU | —Unverified | 0 |
| Striving for Faster and Better: A One-Layer Architecture with Auto Re-parameterization for Low-Light Image Enhancement | Feb 27, 2025 | Computational EfficiencyCPU | CodeCode Available | 0 |
| LightFC-X: Lightweight Convolutional Tracker for RGB-X Tracking | Feb 25, 2025 | CPU | CodeCode Available | 1 |
| SparseTransX: Efficient Training of Translation-Based Knowledge Graph Embeddings Using Sparse Matrix Operations | Feb 24, 2025 | CPUGPU | CodeCode Available | 0 |
| A Universal Framework for Compressing Embeddings in CTR Prediction | Feb 21, 2025 | Click-Through Rate PredictionContrastive Learning | CodeCode Available | 0 |
| Distributed U-net model and Image Segmentation for Lung Cancer Detection | Feb 20, 2025 | CPUFederated Learning | —Unverified | 0 |
| Dynamic Low-Rank Sparse Adaptation for Large Language Models | Feb 20, 2025 | CPUGPU | CodeCode Available | 1 |
| Determining Layer-wise Sparsity for Large Language Models Through a Theoretical Perspective | Feb 20, 2025 | CPUGPU | —Unverified | 0 |
| Safe Beyond the Horizon: Efficient Sampling-based MPC with Neural Control Barrier Functions | Feb 20, 2025 | CPUModel Predictive Control | —Unverified | 0 |
| Object-Pose Estimation With Neural Population Codes | Feb 19, 2025 | CPUObject | —Unverified | 0 |
| On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation | Feb 18, 2025 | CPUIntent Detection | —Unverified | 0 |
| A^2ATS: Retrieval-Based KV Cache Reduction via Windowed Rotary Position Embedding and Query-Aware Vector Quantization | Feb 18, 2025 | CPUPosition | —Unverified | 0 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Feb 18, 2025 | Computational EfficiencyCPU | CodeCode Available | 2 |
| Robust 6DoF Pose Tracking Considering Contour and Interior Correspondence Uncertainty for AR Assembly Guidance | Feb 17, 2025 | CPUOptical Flow Estimation | —Unverified | 0 |
| Representation Learning on Out of Distribution in Tabular Data | Feb 14, 2025 | Contrastive LearningCPU | —Unverified | 0 |
| Habitizing Diffusion Planning for Efficient and Effective Decision Making | Feb 10, 2025 | CPUD4RL | CodeCode Available | 1 |
| Weighted-Sum Energy Efficiency Maximization in User-Centric Uplink Cell-Free Massive MIMO | Feb 10, 2025 | CPU | —Unverified | 0 |
| DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis | Feb 10, 2025 | CPUInference Optimization | —Unverified | 0 |