| FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search | Sep 25, 2024 | Collaborative FilteringCPU | —Unverified | 0 |
| CNN Mixture-of-Depths | Sep 25, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| INT-FlashAttention: Enabling Flash Attention for INT8 Quantization | Sep 25, 2024 | GPUQuantization | CodeCode Available | 2 |
| Textless NLP -- Zero Resource Challenge with Low Resource Compute | Sep 24, 2024 | Acoustic Unit DiscoveryGPU | —Unverified | 0 |
| CAD: Memory Efficient Convolutional Adapter for Segment Anything | Sep 24, 2024 | DecoderGPU | CodeCode Available | 1 |
| Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed | Sep 24, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| A Modular-based Strategy for Mitigating Gradient Conflicts in Simultaneous Speech Translation | Sep 24, 2024 | GPUMulti-Task Learning | —Unverified | 0 |
| dnaGrinder: a lightweight and high-capacity genomic foundation model | Sep 24, 2024 | DecoderGPU | —Unverified | 0 |
| PipeFill: Using GPUs During Bubbles in Pipeline-parallel LLM Training | Sep 23, 2024 | 8kGPU | —Unverified | 0 |
| TextToon: Real-Time Text Toonify Head Avatar from Single Video | Sep 23, 2024 | Contrastive LearningGPU | —Unverified | 0 |