| FLY-TTS: Fast, Lightweight and High-Quality End-to-End Text-to-Speech Synthesis | Jun 30, 2024 | CPUDecoder | —Unverified | 0 |
| Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Jun 26, 2024 | CPUGPU | —Unverified | 0 |
| T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge | Jun 25, 2024 | Computational EfficiencyCPU | CodeCode Available | 4 |
| Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving | Jun 24, 2024 | CPUGPU | CodeCode Available | 7 |
| SLOctolyzer: Fully automatic analysis toolkit for segmentation and feature extracting in scanning laser ophthalmoscopy images | Jun 24, 2024 | AnatomyCPU | CodeCode Available | 1 |
| Towards Dynamic Resource Allocation and Client Scheduling in Hierarchical Federated Learning: A Two-Phase Deep Reinforcement Learning Approach | Jun 21, 2024 | CPUDeep Reinforcement Learning | —Unverified | 0 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous DrivingCPU | CodeCode Available | 1 |
| UpDLRM: Accelerating Personalized Recommendation using Real-World PIM Architecture | Jun 20, 2024 | CPUGPU | —Unverified | 0 |
| GPU-Accelerated DCOPF using Gradient-Based Optimization | Jun 19, 2024 | CPUGPU | CodeCode Available | 0 |
| Sparse High Rank Adapters | Jun 19, 2024 | CPUGPU | —Unverified | 0 |