| Inference-time sparse attention with asymmetric indexing | Feb 12, 2025 | GPU | —Unverified | 0 |
| InferLite: Simple Universal Sentence Representations from Natural Language Inference Data | Oct 1, 2018 | GPUNatural Language Inference | —Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPUStreaming video understanding | —Unverified | 0 |
| InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU | Feb 13, 2025 | GPULanguage Modeling | —Unverified | 0 |
| InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers | Feb 6, 2025 | GPULarge Language Model | —Unverified | 0 |
| Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | May 13, 2024 | GPUTexture Synthesis | —Unverified | 0 |
| Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Sep 11, 2024 | Autonomous DrivingGPU | —Unverified | 0 |
| Initial Orbit Determination for the CR3BP using Particle Swarm Optimization | Jul 23, 2022 | GPUPosition | —Unverified | 0 |
| Input Reconstruction Attack against Vertical Federated Large Language Models | Nov 7, 2023 | Federated LearningGPU | —Unverified | 0 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | May 11, 2024 | DenoisingGPU | —Unverified | 0 |