| Apodotiko: Enabling Efficient Serverless Federated Learning in Heterogeneous Environments | Apr 22, 2024 | CPUFederated Learning | —Unverified | 0 |
| MixLoRA: Enhancing Large Language Models Fine-Tuning with LoRA-based Mixture of Experts | Apr 22, 2024 | Common Sense ReasoningGPU | CodeCode Available | 3 |
| STROOBnet Optimization via GPU-Accelerated Proximal Recurrence Strategies | Apr 22, 2024 | GPU | —Unverified | 0 |
| GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting | Apr 22, 2024 | GPUMotion Generation | —Unverified | 0 |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Apr 22, 2024 | DenoisingGPU | —Unverified | 0 |
| Turbo-CF: Matrix Decomposition-Free Graph Filtering for Fast Recommendation | Apr 22, 2024 | Collaborative FilteringGPU | CodeCode Available | 0 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| On-board classification of underwater images using hybrid classical-quantum CNN based method | Apr 19, 2024 | Autonomous VehiclesGPU | —Unverified | 0 |
| Towards Universal Performance Modeling for Machine Learning Training on Multi-GPU Platforms | Apr 19, 2024 | GPU | —Unverified | 0 |
| Scalable Data Assimilation with Message Passing | Apr 19, 2024 | Bayesian InferenceGPU | CodeCode Available | 0 |
| RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation | Apr 18, 2024 | GPURAG | —Unverified | 0 |
| Warped Time Series Anomaly Detection | Apr 18, 2024 | Anomaly DetectionDynamic Time Warping | —Unverified | 0 |
| Partial Large Kernel CNNs for Efficient Super-Resolution | Apr 18, 2024 | Computational EfficiencyGPU | CodeCode Available | 2 |
| TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding | Apr 18, 2024 | GPU | CodeCode Available | 3 |
| FastFace: Fast-converging Scheduler for Large-scale Face Recognition Training with One GPU | Apr 17, 2024 | Face RecognitionGPU | CodeCode Available | 0 |
| LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs | Apr 16, 2024 | DecoderGPU | CodeCode Available | 1 |
| Shears: Unstructured Sparsity with Neural Low-rank Adapter Search | Apr 16, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| SparseDM: Toward Sparse Efficient Diffusion Models | Apr 16, 2024 | GPUVideo Generation | —Unverified | 0 |
| Interpolating neural network: A novel unification of machine learning and interpolation theory | Apr 16, 2024 | GPUPhysical Simulations | CodeCode Available | 1 |
| Label merge-and-split: A graph-colouring approach for memory-efficient brain parcellation | Apr 16, 2024 | GPUSegmentation | —Unverified | 0 |
| Insight Gained from Migrating a Machine Learning Model to Intelligence Processing Units | Apr 16, 2024 | GPU | —Unverified | 0 |
| Optimal Kernel Tuning Parameter Prediction using Deep Sequence Models | Apr 15, 2024 | GPUParameter Prediction | —Unverified | 0 |
| Post-Training Network Compression for 3D Medical Image Segmentation: Reducing Computational Efforts via Tucker Decomposition | Apr 15, 2024 | Computational EfficiencyGPU | CodeCode Available | 0 |
| LoongServe: Efficiently Serving Long-Context Large Language Models with Elastic Sequence Parallelism | Apr 15, 2024 | GPU | CodeCode Available | 2 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPUImage Generation | —Unverified | 0 |