| Enabling Highly Efficient Capsule Networks Processing Through A PIM-Based Architecture Design | Nov 7, 2019 | GPUImage Segmentation | —Unverified | 0 |
| Fast and Efficient 2-bit LLM Inference on GPU: 2/4/16-bit in a Weight Matrix with Asynchronous Dequantization | Nov 28, 2023 | GPUQuantization | —Unverified | 0 |
| Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small | Oct 21, 2024 | GPU | —Unverified | 0 |
| Enabling Efficient Serverless Inference Serving for LLM (Large Language Model) in the Cloud | Nov 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| CAT: A Conditional Adaptation Tailor for Efficient and Effective Instance-Specific Pansharpening on Real-World Data | Apr 14, 2025 | Computational EfficiencyGPU | —Unverified | 0 |
| Vision Transformer Computation and Resilience for Dynamic Inference | Dec 6, 2022 | GPUSemantic Segmentation | —Unverified | 0 |
| Emulating Aerosol Microphysics with Machine Learning | Sep 22, 2021 | BIG-bench Machine LearningGPU | —Unverified | 0 |
| A Fourier Neural Operator Approach for Modelling Exciton-Polariton Condensate Systems | Sep 27, 2023 | GPU | —Unverified | 0 |
| EM-TTS: Efficiently Trained Low-Resource Mongolian Lightweight Text-to-Speech | Mar 13, 2024 | GPUSpeech Synthesis | —Unverified | 0 |
| CASPIANET++: A Multidimensional Channel-Spatial Asymmetric Attention Network with Noisy Student Curriculum Learning Paradigm for Brain Tumor Segmentation | Jul 8, 2021 | Brain Tumor SegmentationGPU | —Unverified | 0 |