| Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Mar 3, 2024 | 3D Parameter-Efficient Fine-Tuning for ClassificationGPU | CodeCode Available | 2 |
| LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Mar 2, 2024 | GPUQuantization | CodeCode Available | 1 |
| Parallel Hyperparameter Optimization Of Spiking Neural Network | Mar 1, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Feb 29, 2024 | Autonomous DrivingDenoising | CodeCode Available | 0 |
| Efficient Lifelong Model Evaluation in an Era of Rapid Progress | Feb 29, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Feb 29, 2024 | GPU | CodeCode Available | 4 |
| WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Feb 29, 2024 | DiversityGPU | CodeCode Available | 2 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 |
| FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization | Feb 28, 2024 | GPUQuantization | —Unverified | 0 |
| JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability | Feb 27, 2024 | GPUInformation Retrieval | CodeCode Available | 0 |
| Scaling Supervised Local Learning with Augmented Auxiliary Networks | Feb 27, 2024 | GPUimage-classification | CodeCode Available | 0 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Differentiable Biomechanics Unlocks Opportunities for Markerless Motion Capture | Feb 27, 2024 | GPUMarkerless Motion Capture | —Unverified | 0 |
| Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Feb 27, 2024 | GPUImage Retrieval | CodeCode Available | 1 |
| Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows | Feb 27, 2024 | GPUManagement | —Unverified | 0 |
| Single Neuromorphic Memristor closely Emulates Multiple Synaptic Mechanisms for Energy Efficient Neural Networks | Feb 26, 2024 | GPUMeta-Learning | —Unverified | 0 |
| Video-Based Autism Detection with Deep Learning | Feb 26, 2024 | Autism detectionDeep Learning | —Unverified | 0 |
| Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Feb 26, 2024 | GPUMinecraft | CodeCode Available | 3 |
| PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures | Feb 26, 2024 | CPUGPU | CodeCode Available | 1 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 |
| Data-freeWeight Compress and Denoise for Large Language Models | Feb 26, 2024 | GPUQuantization | —Unverified | 0 |
| Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays | Feb 25, 2024 | 16k8k | —Unverified | 0 |
| Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale | Feb 25, 2024 | GPU | —Unverified | 0 |
| Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting | Feb 24, 2024 | GPU | —Unverified | 0 |
| Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning | Feb 24, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |