| Inference-time sparse attention with asymmetric indexing | Feb 12, 2025 | GPU | —Unverified | 0 |
| InferLite: Simple Universal Sentence Representations from Natural Language Inference Data | Oct 1, 2018 | GPUNatural Language Inference | —Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPUStreaming video understanding | —Unverified | 0 |
| InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU | Feb 13, 2025 | GPULanguage Modeling | —Unverified | 0 |
| InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers | Feb 6, 2025 | GPULarge Language Model | —Unverified | 0 |
| Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | May 13, 2024 | GPUTexture Synthesis | —Unverified | 0 |
| Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Sep 11, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| Initial Orbit Determination for the CR3BP using Particle Swarm Optimization | Jul 23, 2022 | GPUPosition | —Unverified | 0 |
| Input Reconstruction Attack against Vertical Federated Large Language Models | Nov 7, 2023 | Federated LearningGPU | —Unverified | 0 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | May 11, 2024 | DenoisingGPU | —Unverified | 0 |
| INS-Conv: Incremental Sparse Convolution for Online 3D Segmentation | Jan 1, 2022 | CPUGPU | —Unverified | 0 |
| In search of the most efficient and memory-saving visualization of high dimensional data | Feb 27, 2023 | CPUDimensionality Reduction | —Unverified | 0 |
| Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units | Apr 16, 2024 | GPU | —Unverified | 0 |
| INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers | Jul 10, 2024 | GPU | —Unverified | 0 |
| Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline | Aug 16, 2020 | Autonomous DrivingAutonomous Vehicles | —Unverified | 0 |
| Instant 3D Object Tracking with Applications in Augmented Reality | Jun 23, 2020 | 3D Object TrackingCPU | —Unverified | 0 |
| Universal Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Sep 18, 2023 | GPUStyle Transfer | —Unverified | 0 |
| INSTA-YOLO: Real-Time Instance Segmentation | Feb 12, 2021 | GPUInstance Segmentation | —Unverified | 0 |
| Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation | Jun 26, 2025 | GPUImage Generation | —Unverified | 0 |
| InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling | May 27, 2025 | DenoisingGPU | —Unverified | 0 |
| InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | Sep 8, 2024 | Edge-computingGPU | —Unverified | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Oct 11, 2023 | 4kDecoder | —Unverified | 0 |
| InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself | Sep 10, 2024 | GPU | —Unverified | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPUQuantization | —Unverified | 0 |
| Integrating Homomorphic Encryption and Trusted Execution Technology for Autonomous and Confidential Model Refining in Cloud | Aug 2, 2023 | Cloud ComputingGPU | —Unverified | 0 |
| Integration of Absolute Orientation Measurements in the KinectFusion Reconstruction pipeline | Feb 12, 2018 | 3D ReconstructionGPU | —Unverified | 0 |
| Interactive Evidence Detection: train state-of-the-art model out-of-domain or simple model interactively? | Nov 1, 2019 | Fact CheckingGPU | —Unverified | 0 |
| InterTrain: Accelerating DNN Training using Input Interpolation | Sep 29, 2021 | GPU | —Unverified | 0 |
| InTune: Reinforcement Learning-based Data Pipeline Optimization for Deep Recommendation Models | Aug 13, 2023 | CPUGPU | —Unverified | 0 |
| Invertible Learned Primal-Dual | Oct 19, 2021 | GPUImage Reconstruction | —Unverified | 0 |
| I/O Lower Bounds for Auto-tuning of Convolutions in CNNs | Dec 31, 2020 | GPU | —Unverified | 0 |
| IRLI: Iterative Re-partitioning for Learning to Index | Mar 17, 2021 | GPUInformation Retrieval | —Unverified | 0 |
| Irrational Complex Rotations Empower Low-bit Optimizers | Jan 22, 2025 | GPUQuantization | —Unverified | 0 |
| Is Architectural Complexity Overrated? Competitive and Interpretable Knowledge Graph Completion with RelatE | May 25, 2025 | GPUKnowledge Graph Completion | —Unverified | 0 |
| iServe: An Intent-based Serving System for LLMs | Jan 8, 2025 | GPU | —Unverified | 0 |
| ISO: Overlap of Computation and Communication within Seqenence For LLM Inference | Sep 4, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Isotonic Data Augmentation for Knowledge Distillation | Jul 3, 2021 | AttributeData Augmentation | —Unverified | 0 |
| Is the GPU Half-Empty or Half-Full? Practical Scheduling Techniques for LLMs | Oct 23, 2024 | GPUScheduling | —Unverified | 0 |
| It's always personal: Using Early Exits for Efficient On-Device CNN Personalisation | Feb 2, 2021 | GPUModel Compression | —Unverified | 0 |
| JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Mar 5, 2025 | GPUMamba | —Unverified | 0 |
| JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading | Aug 25, 2023 | GPUreinforcement-learning | —Unverified | 0 |
| Jellyfish: A Large Language Model for Data Preprocessing | Dec 4, 2023 | GPUImputation | —Unverified | 0 |
| John_Snow_Labs@SMM4H’22: Social Media Mining for Health (#SMM4H) with Spark NLP | Oct 1, 2022 | ClassificationGPU | —Unverified | 0 |
| Jointly Optimizing Preprocessing and Inference for DNN-based Visual Analytics | Jul 25, 2020 | CPUGPU | —Unverified | 0 |
| Joint Scene and Object Tracking for Cost-Effective Augmented Reality Assisted Patient Positioning in Radiation Therapy | Oct 5, 2020 | CPUGPU | —Unverified | 0 |
| Jorge: Approximate Preconditioning for GPU-efficient Second-order Optimization | Oct 18, 2023 | Computational EfficiencyGPU | —Unverified | 0 |
| k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning | Nov 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Keras Sig: Efficient Path Signature Computation on GPU in Keras 3 | Jan 14, 2025 | BenchmarkingC++ code | —Unverified | 0 |
| Kernel Operations on the GPU, with Autodiff, without Memory Overflows | Mar 27, 2020 | GPU | —Unverified | 0 |
| Kernel-Segregated Transpose Convolution Operation | Sep 8, 2022 | CPUGPU | —Unverified | 0 |