| Fiddler: CPU-GPU Orchestration for Fast Inference of Mixture-of-Experts Models | Feb 10, 2024 | CPUGPU | CodeCode Available | 3 |
| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| A Lightweight Inception Boosted U-Net Neural Network for Routability Prediction | Feb 7, 2024 | AvgCPU | CodeCode Available | 1 |
| ServeFlow: A Fast-Slow Model Architecture for Network Traffic Analysis | Feb 6, 2024 | CPU | —Unverified | 0 |
| ProactivePIM: Accelerating Weight-Sharing Embedding Layer with PIM for Scalable Recommendation System | Feb 6, 2024 | CPURecommendation Systems | —Unverified | 0 |
| Design and Implementation of an Automated Disaster-recovery System for a Kubernetes Cluster Using LSTM | Feb 5, 2024 | CPUManagement | —Unverified | 0 |
| Spin: An Efficient Secure Computation Framework with GPU Acceleration | Feb 4, 2024 | CPUGPU | —Unverified | 0 |
| Root Cause Analysis In Microservice Using Neural Granger Causal Discovery | Feb 2, 2024 | Causal DiscoveryContrastive Learning | CodeCode Available | 1 |
| Asynchronous Distributed Genetic Algorithms with Javascript and JSON | Jan 30, 2024 | CPU | —Unverified | 0 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPUGPU | CodeCode Available | 2 |