| Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization | Oct 19, 2021 | CPUGPU | —Unverified | 0 | 0 |
| Fast-COS: A Fast One-Stage Object Detector Based on Reparameterized Attention Vision Transformer for Autonomous Driving | Feb 11, 2025 | Autonomous DrivingComputational Efficiency | —Unverified | 0 | 0 |
| FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs | Dec 30, 2024 | GPUGraph Neural Network | —Unverified | 0 | 0 |
| Communication Optimization for Distributed Training: Architecture, Advances, and Opportunities | Mar 12, 2024 | GPU | —Unverified | 0 | 0 |
| Communication-Free Distributed GNN Training with Vertex Cut | Aug 6, 2023 | GPUNode Classification | —Unverified | 0 | 0 |
| Fast Back-Projection for Non-Line of Sight Reconstruction | Mar 6, 2017 | GPU | —Unverified | 0 | 0 |
| Communication-Efficient TeraByte-Scale Model Training Framework for Online Advertising | Jan 5, 2022 | Click-Through Rate PredictionCPU | —Unverified | 0 | 0 |
| ARAP-GS: Drag-driven As-Rigid-As-Possible 3D Gaussian Splatting Editing with Diffusion Prior | Apr 17, 2025 | 3DGSGPU | —Unverified | 0 | 0 |
| FastAttention: Extend FlashAttention2 to NPUs and Low-resource GPUs | Oct 22, 2024 | CPUGPU | —Unverified | 0 | 0 |
| Fast and Scalable Optimal Transport for Brain Tractograms | Jul 5, 2021 | GPU | —Unverified | 0 | 0 |