| LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs | Apr 16, 2024 | DecoderGPU | CodeCode Available | 1 |
| Shears: Unstructured Sparsity with Neural Low-rank Adapter Search | Apr 16, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| SparseDM: Toward Sparse Efficient Diffusion Models | Apr 16, 2024 | GPUVideo Generation | —Unverified | 0 |
| Interpolating neural network: A novel unification of machine learning and interpolation theory | Apr 16, 2024 | GPUPhysical Simulations | CodeCode Available | 1 |
| Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation | Apr 16, 2024 | GPUSegmentation | —Unverified | 0 |
| Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units | Apr 16, 2024 | GPU | —Unverified | 0 |
| Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models | Apr 15, 2024 | GPUParameter Prediction | —Unverified | 0 |
| Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Apr 15, 2024 | Computational EfficiencyGPU | CodeCode Available | 0 |
| LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Apr 15, 2024 | GPU | CodeCode Available | 2 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPUImage Generation | —Unverified | 0 |