| NeuroLGP-SM: A Surrogate-assisted Neuroevolution Approach using Linear Genetic Programming | Mar 28, 2024 | Evolutionary Algorithms, GPU | Unverified | 0 |
| Taming Lookup Tables for Efficient Image Retouching | Mar 28, 2024 | CPU, GPU | Code Available | 1 |
| Implementation of the Principal Component Analysis onto High-Performance Computer Facilities for Hyperspectral Dimensionality Reduction: Results and Comparisons | Mar 27, 2024 | Dimensionality Reduction, GPU | Unverified | 0 |
| Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classification | Mar 27, 2024 | Form, GPU | Unverified | 0 |
| Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction | Mar 27, 2024 | 3D Generation, 3DGS | Code Available | 2 |
| LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning | Mar 26, 2024 | GPU, GSM8K | Code Available | 9 |
| Towards a Zero-Data, Controllable, Adaptive Dialog System | Mar 26, 2024 | Articles, GPU | Unverified | 0 |
| The Unreasonable Ineffectiveness of the Deeper Layers | Mar 26, 2024 | GPU, Quantization | Code Available | 3 |
| ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching | Mar 26, 2024 | CPU, GPU | Unverified | 0 |
| Fully-fused Multi-Layer Perceptrons on Intel Data Center GPUs | Mar 26, 2024 | GPU, Image Compression | Code Available | 2 |
| Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models | Mar 26, 2024 | GPU, Image Restoration | Unverified | 0 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPU, Object | Code Available | 2 |
| MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models | Mar 25, 2024 | GPU, In-Context Learning | Code Available | 1 |
| SIP: Autotuning GPU Native Schedules via Stochastic Instruction Perturbation | Mar 25, 2024 | GPU | Unverified | 0 |
| Invertible Diffusion Models for Compressed Sensing | Mar 25, 2024 | compressed sensing, GPU | Code Available | 2 |
| MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Mar 25, 2024 | GPU | Code Available | 1 |
| ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration | Mar 25, 2024 | Computational Efficiency, GPU | Code Available | 1 |
| Real-time Neuron Segmentation for Voltage Imaging | Mar 25, 2024 | GPU | Unverified | 0 |
| SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions | Mar 25, 2024 | Decoder, GPU | Code Available | 4 |
| A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters | Mar 24, 2024 | GPU, Scheduling | Unverified | 0 |
| A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA | Mar 24, 2024 | Computational Efficiency, GPU | Unverified | 0 |
| Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations | Mar 23, 2024 | GPU, RAG | Unverified | 0 |
| Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | Mar 23, 2024 | GPU, Language Modeling | Unverified | 0 |
| Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression | Mar 23, 2024 | Dimensionality Reduction, GPU | Code Available | 1 |
| Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms | Mar 23, 2024 | Autonomous Navigation, Event-based vision | Unverified | 0 |