| GPU-RANC: A CUDA Accelerated Simulation Framework for Neuromorphic Architectures | Apr 24, 2024 | GPU | Unverified | 0 |
| CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Apr 23, 2024 | Denoising, GPU | Code Available | 1 |
| CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture | Apr 22, 2024 | GPU, Quantization | Unverified | 0 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPU, Language Modeling | Code Available | 1 |
| Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments | Apr 22, 2024 | CPU, Federated Learning | Unverified | 0 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense Reasoning, GPU | Code Available | 3 |
| SnapKV: LLM Knows What You are Looking for Before Generation | Apr 22, 2024 | 16k, GPU | Code Available | 3 |
| STROOBnet Optimization via GPU-Accelerated Proximal Recurrence Strategies | Apr 22, 2024 | GPU | Unverified | 0 |
| GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Apr 22, 2024 | GPU, Motion Generation | Unverified | 0 |
| Turbo-CF: Matrix Decomposition-Free Graph Filtering for Fast Recommendation | Apr 22, 2024 | Collaborative Filtering, GPU | Code Available | 0 |