| DeepSeek-V3 Technical Report | Dec 27, 2024 | GPU, Language Modeling | Code Available | 16 |
| MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Dec 27, 2024 | GPU, Quantization | Code Available | 2 |
| Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference | Dec 25, 2024 | CPU, GPU | Unverified | 0 |
| GIMS: Image Matching System Based on Adaptive Graph Construction and Graph Neural Network | Dec 24, 2024 | GPU, graph construction | Code Available | 1 |
| KunServe: Efficient Parameter-centric Memory Management for LLM Serving | Dec 24, 2024 | GPU, Language Modeling | Unverified | 0 |
| GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Dec 23, 2024 | GPU, Language Modeling | Unverified | 0 |
| Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition | Dec 23, 2024 | GPU, Motion Synthesis | Unverified | 0 |
| Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Dec 23, 2024 | 3DGS, GPU | Unverified | 0 |
| Power- and Fragmentation-aware Online Scheduling for GPU Datacenters | Dec 23, 2024 | CPU, GPU | Code Available | 0 |
| CoSurfGS: Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Dec 23, 2024 | 3DGS, GPU | Unverified | 0 |