| QiMeng-Xpiler: Transcompiling Tensor Programs for Deep Learning Systems with a Neural-Symbolic Approach | May 4, 2025 | Code Generation, GPU | —Unverified | 0 |
| QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects | Feb 27, 2025 | 3D Pose Estimation, Action Recognition | —Unverified | 0 |
| QoS-Efficient Serving of Multiple Mixture-of-Expert LLMs Using Partial Runtime Reconfiguration | May 10, 2025 | GPU, Mixture-of-Experts | —Unverified | 0 |
| QuadraNet V2: Efficient and Sustainable Training of High-Order Neural Networks with Quadratic Adaptation | May 6, 2024 | GPU | —Unverified | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | Oct 9, 2024 | Causal Language Modeling, GPU | —Unverified | 0 |
| Qualities, challenges and future of genetic algorithms: a literature review | Nov 5, 2020 | Artificial Life, GPU | —Unverified | 0 |
| QuantEase: Optimization-based Quantization for Language Models | Sep 5, 2023 | GPU, Quantization | —Unverified | 0 |
| Quantization-Aware Imitation-Learning for Resource-Efficient Robotic Control | Dec 2, 2024 | Autonomous Driving, Decision Making | —Unverified | 0 |
| Quantized Neural Network Inference with Precision Batching | Feb 26, 2020 | GPU, Language Modeling | —Unverified | 0 |
| QuantSpec: Self-Speculative Decoding with Hierarchical Quantized KV Cache | Feb 5, 2025 | GPU | —Unverified | 0 |