| A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA | Mar 24, 2024 | Computational Efficiency, GPU | Unverified | 0 |
| Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations | Mar 23, 2024 | GPU, RAG | Unverified | 0 |
| Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | Mar 23, 2024 | GPU, Language Modeling | Unverified | 0 |
| Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression | Mar 23, 2024 | Dimensionality Reduction, GPU | Code Available | 1 |
| Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms | Mar 23, 2024 | Autonomous Navigation, Event-based vision | Unverified | 0 |
| Accelerating Recommender Model Training by Dynamically Skipping Stale Embeddings | Mar 22, 2024 | CPU, GPU | Unverified | 0 |
| ParFormer: A Vision Transformer with Parallel Mixer and Sparse Channel Attention Patch Embedding | Mar 22, 2024 | GPU, Image Classification | Unverified | 0 |
| YOLOv5-6D: Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | Mar 22, 2024 | 6D Pose Estimation using RGB, GPU | Code Available | 2 |
| Towards a Comprehensive, Efficient and Promptable Anatomic Structure Segmentation Model using 3D Whole-body CT Scans | Mar 22, 2024 | GPU, Image Segmentation | Code Available | 1 |
| Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Mar 22, 2024 | Data Augmentation, GPU | Unverified | 0 |