| Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models | Apr 11, 2024 | GPU, In-Context Learning | Unverified | 0 |
| YOLO based Ocean Eddy Localization with AWS SageMaker | Apr 10, 2024 | GPU, Management | Unverified | 0 |
| GCV-Turbo: End-to-end Acceleration of GNN-based Computer Vision Tasks on FPGA | Apr 10, 2024 | CPU, GPU | Unverified | 0 |
| PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory System | Apr 10, 2024 | CPU, Distributed Optimization | Code Available | 0 |
| LATUP-Net: A Lightweight 3D Attention U-Net with Parallel Convolutions for Brain Tumor Segmentation | Apr 9, 2024 | Brain Tumor Segmentation, GPU | Unverified | 0 |
| ApproxDARTS: Differentiable Neural Architecture Search with Approximate Multipliers | Apr 8, 2024 | GPU, Neural Architecture Search | Unverified | 0 |
| Dense Training, Sparse Inference: Rethinking Training of Mixture-of-Experts Language Models | Apr 8, 2024 | GPU, Mixture-of-Experts | Unverified | 0 |
| Data Stream Sampling with Fuzzy Task Boundaries and Noisy Labels | Apr 7, 2024 | Continual Learning, Fairness | Code Available | 0 |
| GNNBENCH: Fair and Productive Benchmarking for Single-GPU GNN System | Apr 5, 2024 | Benchmarking, GPU | Unverified | 0 |
| Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Apr 4, 2024 | GPU, Language Modeling | Code Available | 0 |
| GPU-Accelerated RSF Level Set Evolution for Large-Scale Microvascular Segmentation | Apr 3, 2024 | GPU, Semantic Segmentation | Unverified | 0 |
| LoSA: Long-Short-range Adapter for Scaling End-to-End Temporal Action Localization | Apr 1, 2024 | Action Localization, GPU | Unverified | 0 |
| Enhancing Reasoning Capacity of SLM using Cognitive Enhancement | Apr 1, 2024 | GPU, Language Modelling | Unverified | 0 |
| Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting | Apr 1, 2024 | Domain Generalization, GPU | Code Available | 0 |
| GAMA-IR: Global Additive Multidimensional Averaging for Fast Image Restoration | Mar 31, 2024 | Deblurring, Denoising | Unverified | 0 |
| Grid Diffusion Models for Text-to-Video Generation | Mar 30, 2024 | GPU, Image Generation | Unverified | 0 |
| DeFT: Decoding with Flash Tree-attention for Efficient Tree-structured LLM Inference | Mar 30, 2024 | GPU | Unverified | 0 |
| FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model | Mar 29, 2024 | GPU, Pose Estimation | Unverified | 0 |
| Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs | Mar 29, 2024 | CPU, GPU | Unverified | 0 |
| Shallow Cross-Encoders for Low-Latency Retrieval | Mar 29, 2024 | CPU, GPU | Code Available | 0 |
| Bespoke Large Language Models for Digital Triage Assistance in Mental Health Care | Mar 28, 2024 | GPU | Unverified | 0 |
| Jamba: A Hybrid Transformer-Mamba Language Model | Mar 28, 2024 | GPU, Language Modeling | Code Available | 0 |
| NeuroLGP-SM: A Surrogate-assisted Neuroevolution Approach using Linear Genetic Programming | Mar 28, 2024 | Evolutionary Algorithms, GPU | Unverified | 0 |
| Parallel Implementations Assessment of a Spatial-Spectral Classifier for Hyperspectral Clinical Applications | Mar 28, 2024 | GPU, Medical Diagnosis | Unverified | 0 |
| Debiasing Cardiac Imaging with Controlled Latent Diffusion Models | Mar 28, 2024 | Denoising, GPU | Code Available | 0 |
| Implementation of the Principal Component Analysis onto High-Performance Computer Facilities for Hyperspectral Dimensionality Reduction: Results and Comparisons | Mar 27, 2024 | Dimensionality Reduction, GPU | Unverified | 0 |
| Fourier or Wavelet bases as counterpart self-attention in spikformer for efficient visual classification | Mar 27, 2024 | Form, GPU | Unverified | 0 |
| Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models | Mar 26, 2024 | GPU, Image Restoration | Unverified | 0 |
| Towards a Zero-Data, Controllable, Adaptive Dialog System | Mar 26, 2024 | Articles, GPU | Unverified | 0 |
| ALISA: Accelerating Large Language Model Inference via Sparsity-Aware KV Caching | Mar 26, 2024 | CPU, GPU | Unverified | 0 |
| SIP: Autotuning GPU Native Schedules via Stochastic Instruction Perturbation | Mar 25, 2024 | GPU | Unverified | 0 |
| Real-time Neuron Segmentation for Voltage Imaging | Mar 25, 2024 | GPU | Unverified | 0 |
| A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters | Mar 24, 2024 | GPU, Scheduling | Unverified | 0 |
| A Unified Module for Accelerating STABLE-DIFFUSION: LCM-LORA | Mar 24, 2024 | Computational Efficiency, GPU | Unverified | 0 |
| Ev-Edge: Efficient Execution of Event-based Vision Algorithms on Commodity Edge Platforms | Mar 23, 2024 | Autonomous Navigation, Event-based vision | Unverified | 0 |
| Fine Tuning LLM for Enterprise: Practical Guidelines and Recommendations | Mar 23, 2024 | GPU, RAG | Unverified | 0 |
| Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention | Mar 23, 2024 | GPU, Language Modeling | Unverified | 0 |
| Accelerating Recommender Model Training by Dynamically Skipping Stale Embeddings | Mar 22, 2024 | CPU, GPU | Unverified | 0 |
| Your Image is My Video: Reshaping the Receptive Field via Image-To-Video Differentiable AutoAugmentation and Fusion | Mar 22, 2024 | Data Augmentation, GPU | Unverified | 0 |
| ParFormer: A Vision Transformer with Parallel Mixer and Sparse Channel Attention Patch Embedding | Mar 22, 2024 | GPU, Image Classification | Unverified | 0 |
| Learning Quadruped Locomotion Using Differentiable Simulation | Mar 21, 2024 | GPU | Unverified | 0 |
| SpikeGraphormer: A High-Performance Graph Transformer with Spiking Graph Attention | Mar 21, 2024 | GPU, Graph Attention | Code Available | 0 |
| Compress3D: a Compressed Latent Space for 3D Generation from a Single Image | Mar 20, 2024 | 3D Generation, 3D geometry | Unverified | 0 |
| PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation? | Mar 20, 2024 | Abstractive Text Summarization, Continual Pretraining | Unverified | 0 |
| Simple Hack for Transformers against Heavy Long-Text Classification on a Time- and Memory-Limited GPU Service | Mar 19, 2024 | Articles, GPU | Unverified | 0 |
| Deep Few-view High-resolution Photon-counting Extremity CT at Halved Dose for a Clinical Trial | Mar 19, 2024 | Diagnostic, GPU | Unverified | 0 |
| Graph Neural Network for Neutrino Physics Event Reconstruction | Mar 18, 2024 | CPU, GPU | Unverified | 0 |
| Towards Real-Time Fast Unmanned Aerial Vehicle Detection Using Dynamic Vision Sensors | Mar 18, 2024 | CPU, Event-based vision | Unverified | 0 |
| VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model | Mar 18, 2024 | Denoising, GPU | Unverified | 0 |
| 3DGS-ReLoc: 3D Gaussian Splatting for Map Representation and Visual ReLocalization | Mar 17, 2024 | 3DGS, GPU | Unverified | 0 |