| AntMan: Sparse Low-Rank Compression to Accelerate RNN inference | Oct 2, 2019 | Knowledge DistillationLow-rank compression | —Unverified | 0 |
| Approximate FPGA-based LSTMs under Computation Time Constraints | Jan 7, 2018 | Autonomous VehiclesImage Captioning | —Unverified | 0 |
| ELRT: Efficient Low-Rank Training for Compact Convolutional Neural Networks | Jan 18, 2024 | Low-rank compressionModel Compression | —Unverified | 0 |
| Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization | May 17, 2024 | Bayesian OptimizationLow-rank compression | —Unverified | 0 |
| Adaptive Pruning of Pretrained Transformer via Differential Inclusions | Jan 6, 2025 | Low-rank compression | —Unverified | 0 |
| HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks | Jan 20, 2023 | GPULow-rank compression | —Unverified | 0 |
| LoRC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy | Oct 4, 2024 | GPULow-rank compression | —Unverified | 0 |
| Low-Rank Compression for IMC Arrays | Feb 10, 2025 | Low-rank compressionModel Compression | —Unverified | 0 |
| MLorc: Momentum Low-rank Compression for Large Language Model Adaptation | Jun 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Penrose Tiled Low-Rank Compression and Section-Wise Q&A Fine-Tuning: A General Framework for Domain-Specific Large Language Model Adaptation | Mar 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |