| CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs | Sep 19, 2024 | GPU | CodeCode Available | 1 | 5 |
| Data-efficient LLM Fine-tuning for Code Generation | Apr 17, 2025 | Code GenerationGPU | CodeCode Available | 1 | 5 |
| Data-Efficient Multimodal Fusion on a Single GPU | Dec 15, 2023 | GPUImage Retrieval | CodeCode Available | 1 | 5 |
| Easy and Efficient Transformer : Scalable Inference Solution For large NLP model | Apr 26, 2021 | DecoderGPU | CodeCode Available | 1 | 5 |
| EAGAN: Efficient Two-stage Evolutionary Architecture Search for GANs | Nov 30, 2021 | GPUImage Generation | CodeCode Available | 1 | 5 |
| Transformer Tracking | Mar 29, 2021 | GPUObject Tracking | CodeCode Available | 1 | 5 |
| EdgeNAT: Transformer for Efficient Edge Detection | Aug 20, 2024 | Edge DetectionGPU | CodeCode Available | 1 | 5 |
| Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs | Jul 1, 2024 | GPUMixture-of-Experts | CodeCode Available | 1 | 5 |
| Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Apr 18, 2025 | 3D Object DetectionGPU | CodeCode Available | 1 | 5 |
| LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models | Jul 5, 2025 | BenchmarkingGPU | CodeCode Available | 1 | 5 |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jun 7, 2024 | CPUGPU | CodeCode Available | 1 | 5 |
| CrAM: A Compression-Aware Minimizer | Jul 28, 2022 | GPUImage Classification | CodeCode Available | 1 | 5 |
| Embedding a Differentiable Mel-cepstral Synthesis Filter to a Neural Speech Synthesis System | Nov 21, 2022 | GPUSpeech Synthesis | CodeCode Available | 1 | 5 |
| Crabs: Consuming Resource via Auto-generation for LLM-DoS Attack under Black-box Settings | Dec 18, 2024 | GPU | CodeCode Available | 1 | 5 |
| CPU- and GPU-based Distributed Sampling in Dirichlet Process Mixtures for Large-scale Analysis | Apr 19, 2022 | CPUGPU | CodeCode Available | 1 | 5 |
| A Unified Framework for Implicit Sinkhorn Differentiation | May 13, 2022 | GPU | CodeCode Available | 1 | 5 |
| MGG: Accelerating Graph Neural Networks with Fine-grained intra-kernel Communication-Computation Pipelining on Multi-GPU Platforms | Sep 14, 2022 | GPULayout Design | CodeCode Available | 1 | 5 |
| Dynamic Pooling Improves Nanopore Base Calling Accuracy | May 16, 2021 | GPU | CodeCode Available | 1 | 5 |
| Decentralized Training of Foundation Models in Heterogeneous Environments | Jun 2, 2022 | GPUScheduling | CodeCode Available | 1 | 5 |
| CPM-2: Large-scale Cost-effective Pre-trained Language Models | Jun 20, 2021 | DecoderGPU | CodeCode Available | 1 | 5 |
| MariusGNN: Resource-Efficient Out-of-Core Training of Graph Neural Networks | Feb 4, 2022 | GPU | CodeCode Available | 1 | 5 |
| Aerial Single-View Depth Completion with Image-Guided Uncertainty Estimation | Jan 17, 2020 | Depth CompletionGPU | CodeCode Available | 1 | 5 |
| DeAR: Accelerating Distributed Deep Learning with Fine-Grained All-Reduce Pipelining | Feb 24, 2023 | AllGPU | CodeCode Available | 1 | 5 |
| Dynamic Perceiver for Efficient Visual Recognition | Jun 20, 2023 | Action RecognitionClassification | CodeCode Available | 1 | 5 |
| Dynamic Sparse Training with Structured Sparsity | May 3, 2023 | CPUGPU | CodeCode Available | 1 | 5 |