| Bag of Tricks for Optimizing Transformer Efficiency | Sep 9, 2021 | CPUDecoder | CodeCode Available | 0 | 5 |
| Longer Attention Span: Increasing Transformer Context Length with Sparse Graph Processing Techniques | Jan 31, 2025 | GPU | CodeCode Available | 0 | 5 |
| Local SGD with Periodic Averaging: Tighter Analysis and Adaptive Synchronization | Oct 30, 2019 | Distributed OptimizationGPU | CodeCode Available | 0 | 5 |
| LoCo: Low-Bit Communication Adaptor for Large-scale Model Training | Jul 5, 2024 | GPU | CodeCode Available | 0 | 5 |
| Dense and Low-Rank Gaussian CRFs Using Deep Embeddings | Oct 1, 2017 | GPUHuman Part Segmentation | CodeCode Available | 0 | 5 |
| Denoiser-based projections for 2-D super-resolution multi-reference alignment | Apr 10, 2022 | GPUSuper-Resolution | CodeCode Available | 0 | 5 |
| LLM-Powered Ensemble Learning for Paper Source Tracing: A GPU-Free Approach | Sep 14, 2024 | Ensemble LearningGPU | CodeCode Available | 0 | 5 |
| LLMPerf: GPU Performance Modeling meets Large Language Models | Mar 14, 2025 | GPU | CodeCode Available | 0 | 5 |
| NeuroSEM: A hybrid framework for simulating multiphysics problems by coupling PINNs and spectral elements | Jul 30, 2024 | CPUGPU | CodeCode Available | 0 | 5 |
| LLM-Barber: Block-Aware Rebuilder for Sparsity Mask in One-Shot for Large Language Models | Aug 20, 2024 | GPU | CodeCode Available | 0 | 5 |