| Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM Inference | May 28, 2024 | GPU, Text Generation | Code Available | 2 | 5 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPU, GPU | Code Available | 2 | 5 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Jan 12, 2021 | Camouflaged Object Segmentation, Decoder | Code Available | 2 | 5 |
| H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models | Jun 24, 2023 | GPU | Code Available | 2 | 5 |
| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | May 20, 2025 | GPU, Video Generation | Code Available | 2 | 5 |
| GS^3: Efficient Relighting with Triple Gaussian Splatting | Oct 15, 2024 | GPU | Code Available | 2 | 5 |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Jun 28, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 | 5 |
| HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading | Feb 18, 2025 | Computational Efficiency, CPU | Code Available | 2 | 5 |
| GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion | Jun 14, 2024 | 3D Generation, GPU | Code Available | 2 | 5 |
| Gradient Boosting Reinforcement Learning | Jul 11, 2024 | GPU, reinforcement-learning | Code Available | 2 | 5 |