| KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning | May 24, 2025 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Kevin: Multi-Turn RL for Generating CUDA Kernels | Jul 16, 2025 | GPUReinforcement Learning (RL) | —Unverified | 0 |
| KeyB2: Selecting Key Blocks is Also Important for Long Document Ranking with Large Language Models | Nov 9, 2024 | Document RankingGPU | —Unverified | 0 |
| KiloNeuS: A Versatile Neural Implicit Surface Representation for Real-Time Rendering | Jun 22, 2022 | GPUNeRF | —Unverified | 0 |
| KinectFusion: Real-Time Dense Surface Mapping and Tracking | Oct 26, 2011 | GPU | —Unverified | 0 |
| Kinetic Compressive Sensing | Mar 27, 2018 | Compressive SensingGPU | —Unverified | 0 |
| KineticNet: Deep learning a transferable kinetic energy functional for orbital-free density functional theory | May 8, 2023 | GPU | —Unverified | 0 |
| Knowledge Distillation of Transformer-based Language Models Revisited | Jun 29, 2022 | GPUKnowledge Distillation | —Unverified | 0 |
| Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image Classification | Jun 26, 2022 | GPUimage-classification | —Unverified | 0 |
| Knowledge Extracted from Recurrent Deep Belief Network for Real Time Deterministic Control | Jul 11, 2018 | Deep LearningGeneral Classification | —Unverified | 0 |
| Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback | May 30, 2024 | GPUKnowledge Graphs | —Unverified | 0 |
| KPNet: Towards Minimal Face Detector | Mar 17, 2020 | Face DetectionGPU | —Unverified | 0 |
| Kraken: Inherently Parallel Transformers For Efficient Multi-Device Inference | Aug 14, 2024 | GPULanguage Modeling | —Unverified | 0 |
| KunServe: Efficient Parameter-centric Memory Management for LLM Serving | Dec 24, 2024 | GPULanguage Modeling | —Unverified | 0 |
| KurTail : Kurtosis-based LLM Quantization | Mar 3, 2025 | GPULanguage Modeling | —Unverified | 0 |
| KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization | May 7, 2024 | GPULanguage Modeling | —Unverified | 0 |
| KVDirect: Distributed Disaggregated LLM Inference | Dec 13, 2024 | GPUScheduling | —Unverified | 0 |
| KV-Distill: Nearly Lossless Learnable Context Compression for LLMs | Mar 13, 2025 | GPUQuestion Answering | —Unverified | 0 |
| L2PF -- Learning to Prune Faster | Jan 7, 2021 | Autonomous DrivingGPU | —Unverified | 0 |
| L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference | Apr 24, 2025 | GPU | —Unverified | 0 |
| Label Delay in Online Continual Learning | Dec 1, 2023 | Continual LearningGPU | —Unverified | 0 |
| Label-Looping: Highly Efficient Decoding for Transducers | Jun 10, 2024 | GPUspeech-recognition | —Unverified | 0 |
| Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation | Apr 16, 2024 | GPUSegmentation | —Unverified | 0 |
| Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings | May 27, 2024 | Domain AdaptationGPU | —Unverified | 0 |
| LACoS-BLOOM: Low-rank Adaptation with Contrastive objective on 8 bits Siamese-BLOOM | May 10, 2023 | GPULanguage Modeling | —Unverified | 0 |