| iServe: An Intent-based Serving System for LLMs | Jan 8, 2025 | GPU | —Unverified | 0 | 0 |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Sep 4, 2024 | GPU, Language Modeling | —Unverified | 0 | 0 |
| Isotonic Data Augmentation for Knowledge Distillation | Jul 3, 2021 | Attribute, Data Augmentation | —Unverified | 0 | 0 |
| Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs | Oct 23, 2024 | GPU, Scheduling | —Unverified | 0 | 0 |
| It's always personal: Using Early Exits for Efficient On-Device CNN Personalisation | Feb 2, 2021 | GPU, Model Compression | —Unverified | 0 | 0 |
| JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Mar 5, 2025 | GPU, Mamba | —Unverified | 0 | 0 |
| JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading | Aug 25, 2023 | GPU, reinforcement-learning | —Unverified | 0 | 0 |
| Jellyfish: A Large Language Model for Data Preprocessing | Dec 4, 2023 | GPU, Imputation | —Unverified | 0 | 0 |
| John_Snow_Labs@SMM4H’22: Social Media Mining for Health (#SMM4H) with Spark NLP | Oct 1, 2022 | Classification, GPU | —Unverified | 0 | 0 |
| Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics | Jul 25, 2020 | CPU, GPU | —Unverified | 0 | 0 |
| Joint Scene and Object Tracking for Cost-Effective Augmented Reality Assisted Patient Positioning in Radiation Therapy | Oct 5, 2020 | CPU, GPU | —Unverified | 0 | 0 |
| Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization | Oct 18, 2023 | Computational Efficiency, GPU | —Unverified | 0 | 0 |
| k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning | Nov 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Keras Sig: Efficient Path Signature Computation on GPU in Keras 3 | Jan 14, 2025 | Benchmarking, C++ code | —Unverified | 0 | 0 |
| Kernel Operations on the GPU, with Autodiff, without Memory Overflows | Mar 27, 2020 | GPU | —Unverified | 0 | 0 |
| Kernel-Segregated Transpose Convolution Operation | Sep 8, 2022 | CPU, GPU | —Unverified | 0 | 0 |
| KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning | May 24, 2025 | GPU, parameter-efficient fine-tuning | —Unverified | 0 | 0 |
| Kevin: Multi-Turn RL for Generating CUDA Kernels | Jul 16, 2025 | GPU, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models | Nov 9, 2024 | Document Ranking, GPU | —Unverified | 0 | 0 |
| KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time Rendering | Jun 22, 2022 | GPU, NeRF | —Unverified | 0 | 0 |
| KinectFusion: Real-Time Dense Surface Mapping and Tracking | Oct 26, 2011 | GPU | —Unverified | 0 | 0 |
| Kinetic Compressive Sensing | Mar 27, 2018 | Compressive Sensing, GPU | —Unverified | 0 | 0 |
| KineticNet: Deep learning a transferable kinetic energy functional for orbital-free density functional theory | May 8, 2023 | GPU | —Unverified | 0 | 0 |
| Knowledge Distillation of Transformer-based Language Models Revisited | Jun 29, 2022 | GPU, Knowledge Distillation | —Unverified | 0 | 0 |
| Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image Classification | Jun 26, 2022 | GPU, image-classification | —Unverified | 0 | 0 |