| LCM-LoRA: A Universal Stable-Diffusion Acceleration Module | Nov 9, 2023 | GPUImage Generation | CodeCode Available | 4 |
| GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Nov 8, 2023 | CPUDecoder | CodeCode Available | 1 |
| Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs | Nov 8, 2023 | GPU | —Unverified | 0 |
| LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models | Nov 8, 2023 | 8kGPU | CodeCode Available | 5 |
| A Comprehensive Summarization and Evaluation of Feature Refinement Modules for CTR Prediction | Nov 8, 2023 | BenchmarkingClick-Through Rate Prediction | CodeCode Available | 0 |
| DACBERT: Leveraging Dependency Agreement for Cost-Efficient Bert Pretraining | Nov 8, 2023 | GPUMRPC | —Unverified | 0 |
| Input Reconstruction Attack against Vertical Federated Large Language Models | Nov 7, 2023 | Federated LearningGPU | —Unverified | 0 |
| Estimator-Coupled Reinforcement Learning for Robust Purely Tactile In-Hand Manipulation | Nov 7, 2023 | GPUreinforcement-learning | —Unverified | 0 |
| Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Nov 7, 2023 | CPUGPU | CodeCode Available | 1 |
| Black-Box Prompt Optimization: Aligning Large Language Models without Model Training | Nov 7, 2023 | GPU | CodeCode Available | 2 |