| Quantized Neural Network Inference with Precision Batching | Feb 26, 2020 | GPULanguage Modeling | —Unverified | 0 | 0 |
| QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache | Feb 5, 2025 | GPU | —Unverified | 0 | 0 |
| Quantum Annealing based Power Grid Partitioning for Parallel Simulation | Aug 7, 2024 | CPUGPU | —Unverified | 0 | 0 |
| Quantum-Enhanced Support Vector Machine for Large-Scale Stellar Classification with GPU Acceleration | Nov 21, 2023 | ClassificationComputational Efficiency | —Unverified | 0 | 0 |
| Quantum-inspired tensor network for Earth science | Jan 15, 2023 | GPUQuantum Machine Learning | —Unverified | 0 | 0 |
| Quantum-Powered Personalized Learning | Aug 25, 2024 | Computational EfficiencyGPU | —Unverified | 0 | 0 |
| Quantum Walks-Based Adaptive Distribution Generation with Efficient CUDA-Q Acceleration | Apr 18, 2025 | GPU | —Unverified | 0 | 0 |
| Query-focused Sentence Compression in Linear Time | Apr 19, 2019 | GPUSentence | —Unverified | 0 | 0 |
| Query-focused Sentence Compression in Linear Time | Nov 1, 2019 | GPUSentence | —Unverified | 0 | 0 |
| Query Processing on Tensor Computation Runtimes | Mar 3, 2022 | CPUGPU | —Unverified | 0 | 0 |
| Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization | Dec 13, 2019 | Computational EfficiencyForm | —Unverified | 0 | 0 |
| RADARS: Memory Efficient Reinforcement Learning Aided Differentiable Neural Architecture Search | Sep 13, 2021 | GPUNeural Architecture Search | —Unverified | 0 | 0 |
| RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | Apr 18, 2024 | GPURAG | —Unverified | 0 | 0 |
| RAIN: Real-time Animation of Infinite Video Stream | Dec 27, 2024 | DenoisingGPU | —Unverified | 0 | 0 |
| Ramanujan Bipartite Graph Products for Efficient Block Sparse Neural Networks | Jun 24, 2020 | GPUimage-classification | —Unverified | 0 | 0 |
| Random 2.5D U-net for Fully 3D Segmentation | Oct 23, 2019 | GPUSegmentation | —Unverified | 0 | 0 |
| Random Offset Block Embedding Array (ROBE) for CriteoTB Benchmark MLPerf DLRM Model : 1000 Compression and 3.1 Faster Inference | Aug 4, 2021 | GPUModel Compression | —Unverified | 0 | 0 |
| RapidDock: Unlocking Proteome-scale Molecular Docking | Oct 16, 2024 | Drug DiscoveryGPU | —Unverified | 0 | 0 |
| Ray Tracing Algorithm for Reconfigurable Intelligent Surfaces | Feb 20, 2024 | GPU | —Unverified | 0 | 0 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPUNatural Language Queries | —Unverified | 0 | 0 |
| Re2G: Retrieve, Rerank, Generate | Jan 16, 2022 | Fact CheckingGPU | —Unverified | 0 | 0 |
| READ: Recurrent Adaptation of Large Transformers | May 24, 2023 | GPUTransfer Learning | —Unverified | 0 | 0 |
| Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report | Nov 7, 2022 | Bokeh Effect RenderingDeep Learning | —Unverified | 0 | 0 |
| Experimental implementation of a neural network optical channel equalizer in restricted hardware using pruning and quantization | Sep 15, 2021 | CPUEdge-computing | —Unverified | 0 | 0 |
| Real-time 10,000 km Straight-line Transmission using a Software-defined GPU-Based Receiver | Aug 16, 2021 | GPU | —Unverified | 0 | 0 |