| Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Mar 26, 2024 | GPU, Image Compression | Code Available | 2 |
| The Unreasonable Ineffectiveness of the Deeper Layers | Mar 26, 2024 | GPU, Quantization | Code Available | 3 |
| SIP: Autotuning GPU Native Schedules via Stochastic Instruction Perturbation | Mar 25, 2024 | GPU | Unverified | 0 |
| Invertible Diffusion Models for Compressed Sensing | Mar 25, 2024 | compressed sensing, GPU | Code Available | 2 |
| MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models | Mar 25, 2024 | GPU, In-Context Learning | Code Available | 1 |
| ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration | Mar 25, 2024 | Computational Efficiency, GPU | Code Available | 1 |
| MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Mar 25, 2024 | GPU | Code Available | 1 |
| Real-time Neuron Segmentation for Voltage Imaging | Mar 25, 2024 | GPU | Unverified | 0 |
| SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Mar 25, 2024 | Decoder, GPU | Code Available | 4 |
| A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters | Mar 24, 2024 | GPU, Scheduling | Unverified | 0 |