| QSync: Quantization-Minimized Synchronous Distributed Training Across Hybrid Devices | Jul 2, 2024 | GPU, Quantization | Code Available | 1 |
| Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs | Jul 1, 2024 | GPU, Mixture-of-Experts | Code Available | 1 |
| LLMEasyQuant: Scalable Quantization for Parallel and Distributed LLM Inference | Jun 28, 2024 | GPU, Quantization | Code Available | 1 |
| ConStyle v2: A Strong Prompter for All-in-One Image Restoration | Jun 26, 2024 | All, GPU | Code Available | 1 |
| SEED: Accelerating Reasoning Tree Construction via Scheduled Speculative Decoding | Jun 26, 2024 | GPU, Management | Code Available | 1 |
| Implicit-Zoo: A Large-Scale Dataset of Neural Implicit Functions for 2D Images and 3D Scenes | Jun 25, 2024 | GPU, image-classification | Code Available | 1 |
| Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients | Jun 25, 2024 | GPU, Language Modeling | Code Available | 1 |
| Enhancing Dropout-based Bayesian Neural Networks with Multi-Exit on FPGA | Jun 20, 2024 | Autonomous Driving, CPU | Code Available | 1 |
| CE-SSL: Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection | Jun 20, 2024 | Computational Efficiency, Electrocardiography (ECG) | Code Available | 1 |
| LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional Adaptation | Jun 18, 2024 | GPU, Natural Language Understanding | Code Available | 1 |