| ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models | Jun 12, 2024 | GPU | —Unverified | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPUGPU | —Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPUGPU | —Unverified | 0 |
| Sustainable self-supervised learning for speech representations | Jun 11, 2024 | GPUSelf-Supervised Learning | —Unverified | 0 |
| VoxNeuS: Enhancing Voxel-Based Neural Surface Reconstruction via Gradient Interpolation | Jun 11, 2024 | GPUSurface Reconstruction | —Unverified | 0 |
| Benchmarking and Boosting Radiology Report Generation for 3D High-Resolution Medical Images | Jun 11, 2024 | BenchmarkingGPU | —Unverified | 0 |
| FLUX: Fast Software-based Communication Overlap On GPUs Through Kernel Fusion | Jun 11, 2024 | GPU | CodeCode Available | 5 |
| Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models | Jun 11, 2024 | DiversityGPU | CodeCode Available | 2 |
| Label-Looping: Highly Efficient Decoding for Transducers | Jun 10, 2024 | GPUspeech-recognition | —Unverified | 0 |
| Merlin: A Vision Language Foundation Model for 3D Computed Tomography | Jun 10, 2024 | 3D Semantic SegmentationComputed Tomography (CT) | CodeCode Available | 3 |