| Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Jun 1, 2023 | GPUImage Compression | CodeCode Available | 2 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPUGPU | CodeCode Available | 2 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPUImage Generation | CodeCode Available | 2 |
| Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness | May 18, 2023 | CPUGPU | CodeCode Available | 2 |
| CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model | May 11, 2023 | DenoisingGPU | CodeCode Available | 2 |
| OctFormer: Octree-based Transformers for 3D Point Clouds | May 4, 2023 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 2 |
| VPGTrans: Transfer Visual Prompt Generator across LLMs | May 2, 2023 | GPUTransfer Learning | CodeCode Available | 2 |