| Hybrid Offline-online Scheduling Method for Large Language Model Inference Optimization | Feb 14, 2025 | GSM8KInference Optimization | —Unverified | 0 |
| Inference Optimization of Foundation Models on AI Accelerators | Jul 12, 2024 | Inference OptimizationModel Compression | —Unverified | 0 |
| The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities | Aug 23, 2024 | Computational EfficiencyInference Optimization | —Unverified | 0 |
| An approach to optimize inference of the DIART speaker diarization pipeline | Aug 5, 2024 | Inference OptimizationKnowledge Distillation | —Unverified | 0 |
| A bi-partite generative model framework for analyzing and simulating large scale multiple discrete-continuous travel behaviour data | Jan 18, 2019 | Bayesian InferenceBIG-bench Machine Learning | —Unverified | 0 |
| Advances and Open Challenges in Federated Foundation Models | Apr 23, 2024 | Computational EfficiencyFederated Learning | —Unverified | 0 |
| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bifocal Neural ASR: Exploiting Keyword Spotting for Inference Optimization | Aug 3, 2021 | Inference OptimizationKeyword Spotting | —Unverified | 0 |
| Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems | Jun 21, 2023 | Inference Optimization | —Unverified | 0 |
| CRVI: Convex Relaxation for Variational Inference | Jul 1, 2018 | Inference Optimizationregression | —Unverified | 0 |