| Liger Kernel: Efficient Triton Kernels for LLM Training | Oct 14, 2024 | ChunkingGPU | CodeCode Available | 9 |
| ET-Former: Efficient Triplane Deformable Attention for 3D Semantic Scene Completion From Monocular Camera | Oct 14, 2024 | 3D Semantic Scene CompletionDecision Making | —Unverified | 0 |
| DuoAttention: Efficient Long-Context LLM Inference with Retrieval and Streaming Heads | Oct 14, 2024 | GPUQuantization | CodeCode Available | 4 |
| Exploiting Local Features and Range Images for Small Data Real-Time Point Cloud Semantic Segmentation | Oct 14, 2024 | Autonomous DrivingGPU | CodeCode Available | 0 |
| Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Oct 14, 2024 | DiversityGPU | CodeCode Available | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | Oct 14, 2024 | GPURecommendation Systems | —Unverified | 0 |
| KBLaM: Knowledge Base augmented Language Model | Oct 14, 2024 | 8kGPU | CodeCode Available | 5 |
| MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal Sampling | Oct 14, 2024 | Audio-Visual SynchronizationGPU | CodeCode Available | 9 |
| Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | Oct 14, 2024 | GPUImage Generation | —Unverified | 0 |
| SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers | Oct 14, 2024 | DecoderGPU | CodeCode Available | 9 |