| I-BERT: Integer-only BERT Quantization | Jan 5, 2021 | GPU, Natural Language Inference | Code Available | 2 | 5 |
| HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection | Feb 2, 2022 | Audio Classification, Event Detection | Code Available | 2 | 5 |
| Accelerating Transformer Pre-training with 2:4 Sparsity | Apr 2, 2024 | GPU | Code Available | 2 | 5 |
| CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Mar 28, 2025 | GPU, GSM8K | Code Available | 2 | 5 |
| HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation | Apr 27, 2022 | Domain Adaptation, GPU | Code Available | 2 | 5 |
| HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors | Jul 26, 2024 | Depth Estimation, GPU | Code Available | 2 | 5 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPU, GPU | Code Available | 2 | 5 |
| Holistically-Attracted Wireframe Parsing: From Supervised to Self-Supervised Learning | Oct 24, 2022 | GPU, Self-Supervised Learning | Code Available | 2 | 5 |
| HLSTransform: Energy-Efficient Llama 2 Inference on FPGAs Via High Level Synthesis | Apr 29, 2024 | CPU, Edge-computing | Code Available | 2 | 5 |
| HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference | Apr 8, 2025 | CPU, GPU | Code Available | 2 | 5 |