| Forecasting GPU Performance for Deep Learning Training and Inference | Jul 18, 2024 | Deep Learning, GPU | Code Available | 2 |
| Attention in SRAM on Tenstorrent Grayskull | Jul 18, 2024 | CPU, GPU | Code Available | 1 |
| LiNR: Model Based Neural Retrieval on GPUs at LinkedIn | Jul 18, 2024 | Attribute, GPU | Unverified | 0 |
| Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark | Jul 18, 2024 | GPU, Image Retrieval | Code Available | 1 |
| WiNet: Wavelet-based Incremental Learning for Efficient Medical Image Registration | Jul 18, 2024 | GPU, Image Registration | Code Available | 1 |
| SmartQuant: CXL-based AI Model Store in Support of Runtime Configurable Weight Quantization | Jul 17, 2024 | GPU, Quantization | Unverified | 0 |
| FastSAM-3DSlicer: A 3D-Slicer Extension for 3D Volumetric Segment Anything Model with Uncertainty Quantification | Jul 17, 2024 | CPU, Domain Adaptation | Code Available | 1 |
| Spectra: Surprising Effectiveness of Pretraining Ternary Language Models at Scale | Jul 17, 2024 | GPU, LAMBADA | Code Available | 2 |
| RoDE: Linear Rectified Mixture of Diverse Experts for Food Large Multi-Modal Models | Jul 17, 2024 | GPU, Nutrition | Unverified | 0 |
| ARTEMIS: A Mixed Analog-Stochastic In-DRAM Accelerator for Transformer Neural Networks | Jul 17, 2024 | CPU, GPU | Unverified | 0 |