| Approximate Caching for Efficiently Serving Diffusion Models | Dec 7, 2023 | DenoisingGPU | —Unverified | 0 |
| PerSival: Neural-network-based visualisation for pervasive continuum-mechanical simulations in musculoskeletal biomechanics | Dec 7, 2023 | CPUGPU | —Unverified | 0 |
| SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM | Dec 6, 2023 | GPUQuantization | CodeCode Available | 1 |
| MMM: Generative Masked Motion Model | Dec 6, 2023 | GPUmodel | CodeCode Available | 1 |
| On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm | Dec 6, 2023 | Dataset DistillationDiversity | CodeCode Available | 1 |
| Holmes: Towards Distributed Training Across Clusters with Heterogeneous NIC Environment | Dec 6, 2023 | GPUScheduling | —Unverified | 0 |
| A Hardware Evaluation Framework for Large Language Model Inference | Dec 5, 2023 | GPULanguage Modeling | —Unverified | 0 |
| FlexModel: A Framework for Interpretability of Distributed Large Language Models | Dec 5, 2023 | Distributed ComputingGPU | CodeCode Available | 1 |
| Learning to Holistically Detect Bridges from Large-Size VHR Remote Sensing Imagery | Dec 5, 2023 | GPUobject-detection | —Unverified | 0 |
| DIPR: Efficient Point Cloud Registration via Dynamic Iteration | Dec 5, 2023 | GPUPoint Cloud Registration | —Unverified | 0 |