| SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition | Oct 4, 2019 | Inference Optimizationspeech-recognition | —Unverified | 0 |
| SySMOL: Co-designing Algorithms and Hardware for Neural Networks with Heterogeneous Precisions | Nov 23, 2023 | CPUGPU | —Unverified | 0 |
| The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries | Jun 14, 2025 | Bug fixingInference Optimization | —Unverified | 0 |
| Bayesian Active Learning in the Presence of Nuisance Parameters | Oct 23, 2023 | Active LearningExperimental Design | —Unverified | 0 |
| Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation | Nov 29, 2019 | Inference OptimizationSemantic Segmentation | —Unverified | 0 |
| Input Convex Neural Networks | Sep 22, 2016 | ImputationInference Optimization | CodeCode Available | 0 |
| A General Method for Amortizing Variational Filtering | Nov 13, 2018 | Inference OptimizationVariational Inference | CodeCode Available | 0 |
| Iterative Amortized Inference | Jul 24, 2018 | Inference OptimizationVariational Inference | CodeCode Available | 0 |
| Brevity is the soul of sustainability: Characterizing LLM response lengths | Jun 10, 2025 | DecoderInference Optimization | CodeCode Available | 0 |
| LLM-Rank: A Graph Theoretical Approach to Pruning Large Language Models | Oct 17, 2024 | Inference OptimizationNetwork Pruning | CodeCode Available | 0 |