| RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose | Mar 13, 2023 | 2D Human Pose Estimation2D Pose Estimation | —Unverified | 0 |
| FlexGen: High-Throughput Generative Inference of Large Language Models with a Single GPU | Mar 13, 2023 | CPUGPU | CodeCode Available | 5 |
| Towards MoE Deployment: Mitigating Inefficiencies in Mixture-of-Expert (MoE) Inference | Mar 10, 2023 | CPUDecoder | —Unverified | 0 |
| Fourier-MIONet: Fourier-enhanced multiple-input neural operators for multiphase modeling of geological carbon sequestration | Mar 8, 2023 | CPUGPU | CodeCode Available | 1 |
| AHPA: Adaptive Horizontal Pod Autoscaling Systems on Alibaba Cloud Container Service for Kubernetes | Mar 7, 2023 | CPUManagement | —Unverified | 0 |
| Run, Don't Walk: Chasing Higher FLOPS for Faster Neural Networks | Mar 7, 2023 | CPUGPU | CodeCode Available | 2 |
| EPAM: A Predictive Energy Model for Mobile AI | Mar 2, 2023 | CPUGPU | —Unverified | 0 |
| BenchDirect: A Directed Language Model for Compiler Benchmarks | Mar 2, 2023 | Active LearningCPU | —Unverified | 0 |
| In search of the most efficient and memory-saving visualization of high dimensional data | Feb 27, 2023 | CPUDimensionality Reduction | —Unverified | 0 |
| MP-Rec: Hardware-Software Co-Design to Enable Multi-Path Recommendation | Feb 21, 2023 | CPUGPU | —Unverified | 0 |