| A Study of Optimizations for Fine-tuning Large Language Models | Jun 4, 2024 | GPU | —Unverified | 0 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 |
| Speeding up Policy Simulation in Supply Chain RL | Jun 4, 2024 | GPU | —Unverified | 0 |
| Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Jun 4, 2024 | Face SwappingGPU | CodeCode Available | 4 |
| SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM | Jun 3, 2024 | DecoderGPU | CodeCode Available | 2 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed OptimizationFederated Learning | CodeCode Available | 1 |
| OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models | Jun 3, 2024 | GPULanguage Modeling | —Unverified | 0 |
| GPU-Accelerated Rule Evaluation and Evolution | Jun 3, 2024 | Explainable artificial intelligenceGPU | —Unverified | 0 |
| D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models | Jun 3, 2024 | GPUMath | —Unverified | 0 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 |