| BING: Binarized Normed Gradients for Objectness Estimation at 300fps | Jun 1, 2014 | CPUObject | —Unverified | 0 |
| Brand New K-FACs: Speeding up K-FAC with Online Decomposition Updates | Oct 16, 2022 | CPU | —Unverified | 0 |
| Endor: Hardware-Friendly Sparse Format for Offloaded LLM Inference | Jun 17, 2024 | CPUGPU | —Unverified | 0 |
| End-to-end Adaptive Distributed Training on PaddlePaddle | Dec 6, 2021 | CPUGPU | —Unverified | 0 |
| Active Semantic Localization with Graph Neural Embedding | May 10, 2023 | CPUDomain Adaptation | —Unverified | 0 |
| End-to-end Optimization of Machine Learning Prediction Queries | May 31, 2022 | BIG-bench Machine LearningCPU | —Unverified | 0 |
| Efficient Inference For Neural Machine Translation | Oct 6, 2020 | CPUDecoder | —Unverified | 0 |
| End-to-End Retrieval with Learned Dense and Sparse Representations Using Lucene | Nov 30, 2023 | CPUInformation Retrieval | —Unverified | 0 |
| Energon: Towards Efficient Acceleration of Transformers Using Dynamic Sparse Attention | Oct 18, 2021 | CPUEdge-computing | —Unverified | 0 |
| An Automatic and Efficient BERT Pruning for Edge AI Systems | Jun 21, 2022 | CPUModel Compression | —Unverified | 0 |