| Inference-time sparse attention with asymmetric indexing | Feb 12, 2025 | GPU | —Unverified | 0 |
| InferLite: Simple Universal Sentence Representations from Natural Language Inference Data | Oct 1, 2018 | GPU, Natural Language Inference | —Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPU, Streaming video understanding | —Unverified | 0 |
| InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU | Feb 13, 2025 | GPU, Language Modeling | —Unverified | 0 |
| InfiniteHBD: Building Datacenter-Scale High-Bandwidth Domain for LLM with Optical Circuit Switching Transceivers | Feb 6, 2025 | GPU, Large Language Model | —Unverified | 0 |
| Infinite Texture: Text-guided High Resolution Diffusion Texture Synthesis | May 13, 2024 | GPU, Texture Synthesis | —Unverified | 0 |
| Inf-MLLM: Efficient Streaming Inference of Multimodal Large Language Models on a Single GPU | Sep 11, 2024 | Autonomous Driving, GPU | —Unverified | 0 |
| Initial Orbit Determination for the CR3BP using Particle Swarm Optimization | Jul 23, 2022 | GPU, Position | —Unverified | 0 |
| Input Reconstruction Attack against Vertical Federated Large Language Models | Nov 7, 2023 | Federated Learning, GPU | —Unverified | 0 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Nerual Networks | May 11, 2024 | Denoising, GPU | —Unverified | 0 |
| INS-Conv: Incremental Sparse Convolution for Online 3D Segmentation | Jan 1, 2022 | CPU, GPU | —Unverified | 0 |
| In search of the most efficient and memory-saving visualization of high dimensional data | Feb 27, 2023 | CPU, Dimensionality Reduction | —Unverified | 0 |
| Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units | Apr 16, 2024 | GPU | —Unverified | 0 |
| INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers | Jul 10, 2024 | GPU | —Unverified | 0 |
| Monocular Instance Motion Segmentation for Autonomous Driving: KITTI InstanceMotSeg Dataset and Multi-task Baseline | Aug 16, 2020 | Autonomous Driving, Autonomous Vehicles | —Unverified | 0 |
| Instant 3D Object Tracking with Applications in Augmented Reality | Jun 23, 2020 | 3D Object Tracking, CPU | —Unverified | 0 |
| Universal Photorealistic Style Transfer: A Lightweight and Adaptive Approach | Sep 18, 2023 | GPU, Style Transfer | —Unverified | 0 |
| INSTA-YOLO: Real-Time Instance Segmentation | Feb 12, 2021 | GPU, Instance Segmentation | —Unverified | 0 |
| Instella-T2I: Pushing the Limits of 1D Discrete Latent Space Image Generation | Jun 26, 2025 | GPU, Image Generation | —Unverified | 0 |
| InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling | May 27, 2025 | Denoising, GPU | —Unverified | 0 |
| InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference | Sep 8, 2024 | Edge-computing, GPU | —Unverified | 0 |
| InstructRetro: Instruction Tuning post Retrieval-Augmented Pretraining | Oct 11, 2023 | 4k, Decoder | —Unverified | 0 |
| InstructSing: High-Fidelity Singing Voice Generation via Instructing Yourself | Sep 10, 2024 | GPU | —Unverified | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPU, Quantization | —Unverified | 0 |
| Integrating Homomorphic Encryption and Trusted Execution Technology for Autonomous and Confidential Model Refining in Cloud | Aug 2, 2023 | Cloud Computing, GPU | —Unverified | 0 |