| Representing Edge Flows on Graphs via Sparse Cell Complexes | Sep 4, 2023 | Inference OptimizationRepresentation Learning | CodeCode Available | 0 | 5 |
| Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert Merging | Jun 29, 2025 | Inference OptimizationMixture-of-Experts | CodeCode Available | 0 | 5 |
| Residual-Based Error Corrector Operator to Enhance Accuracy and Reliability of Neural Operator Surrogates of Nonlinear Variational Boundary-Value Problems | Jun 21, 2023 | Inference Optimization | —Unverified | 0 | 0 |
| Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks | May 20, 2024 | Inference OptimizationKnowledge Distillation | —Unverified | 0 | 0 |
| Energy-Efficient Transformer Inference: Optimization Strategies for Time Series Classification | Feb 23, 2025 | ClassificationInference Optimization | —Unverified | 0 | 0 |
| SNDCNN: Self-normalizing deep CNNs with scaled exponential linear units for speech recognition | Oct 4, 2019 | Inference Optimizationspeech-recognition | —Unverified | 0 | 0 |
| Faster MoE LLM Inference for Extremely Large Models | May 6, 2025 | Inference OptimizationMixture-of-Experts | —Unverified | 0 | 0 |
| Federated Learning While Providing Model as a Service: Joint Training and Inference Optimization | Dec 20, 2023 | Federated LearningInference Optimization | —Unverified | 0 | 0 |
| FluidML: Fast and Memory Efficient Inference Optimization | Nov 14, 2024 | Autonomous VehiclesInference Optimization | —Unverified | 0 | 0 |
| Hellinger-Kantorovich Gradient Flows: Global Exponential Decay of Entropy Functionals | Jan 28, 2025 | Inference Optimization | —Unverified | 0 | 0 |