| LAMM: Language-Assisted Multi-Modal Instruction-Tuning Dataset, Framework, and Benchmark | Jun 11, 2023 | GPU | CodeCode Available | 1 |
| Push: Concurrent Probabilistic Programming for Bayesian Deep Learning | Jun 10, 2023 | Bayesian InferenceDeep Learning | CodeCode Available | 0 |
| Finding Hamiltonian cycles with graph neural networks | Jun 10, 2023 | GPUGraph Neural Network | CodeCode Available | 0 |
| EfficientBioAI: Making Bioimaging AI Models Efficient in Energy, Latency and Representation | Jun 9, 2023 | CPUGPU | CodeCode Available | 1 |
| S^3: Increasing GPU Utilization during Generative Inference for Higher Throughput | Jun 9, 2023 | GPULanguage Modeling | —Unverified | 0 |
| StreetSurf: Extending Multi-view Implicit Surface Reconstruction to Street Views | Jun 8, 2023 | Autonomous DrivingGPU | CodeCode Available | 2 |
| Does Long-Term Series Forecasting Need Complex Attention and Extra Long Inputs? | Jun 8, 2023 | Bayesian OptimizationGPU | CodeCode Available | 1 |
| Optimized Crystallographic Graph Generation for Material Science | Jun 7, 2023 | GPUGraph Generation | CodeCode Available | 0 |
| Modulation Classification Through Deep Learning Using Resolution Transformed Spectrograms | Jun 6, 2023 | ClassificationCPU | —Unverified | 0 |
| Revisiting Neural Retrieval on Accelerators | Jun 6, 2023 | GPUInformation Retrieval | CodeCode Available | 1 |
| Towards Memory-Efficient Training for Extremely Large Output Spaces -- Learning with 500k Labels on a Single Commodity GPU | Jun 6, 2023 | GPU | —Unverified | 0 |
| DVIS: Decoupled Video Instance Segmentation Framework | Jun 6, 2023 | Autonomous DrivingGPU | CodeCode Available | 1 |
| SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression | Jun 5, 2023 | GPULanguage Modelling | CodeCode Available | 2 |
| DVFO: Learning-Based DVFS for Energy-Efficient Edge-Cloud Collaborative Inference | Jun 2, 2023 | Collaborative InferenceCPU | —Unverified | 0 |
| Lightweight Vision Transformer with Bidirectional Interaction | Jun 1, 2023 | GPU | CodeCode Available | 1 |
| Wuerstchen: An Efficient Architecture for Large-Scale Text-to-Image Diffusion Models | Jun 1, 2023 | GPUImage Compression | CodeCode Available | 2 |
| Accelerated Fingerprint Enhancement: A GPU-Optimized Mixed Architecture Approach | Jun 1, 2023 | GPU | —Unverified | 0 |
| Autism Disease Detection Using Transfer Learning Techniques: Performance Comparison Between Central Processing Unit vs Graphics Processing Unit Functions for Neural Networks | Jun 1, 2023 | CPUGPU | —Unverified | 0 |
| Special Session: Approximation and Fault Resiliency of DNN Accelerators | May 31, 2023 | Autonomous DrivingGPU | —Unverified | 0 |
| Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training | May 31, 2023 | GPU | —Unverified | 0 |
| Neuron to Graph: Interpreting Language Model Neurons at Scale | May 31, 2023 | GPULanguage Modeling | CodeCode Available | 0 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | May 30, 2023 | CPUGPU | CodeCode Available | 1 |
| CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network | May 30, 2023 | GPU | —Unverified | 0 |
| SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics | May 29, 2023 | GPUQuantization | —Unverified | 0 |
| Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs | May 29, 2023 | Domain AdaptationGPU | —Unverified | 0 |
| Search-Based Regular Expression Inference on a GPU | May 29, 2023 | CPUGPU | CodeCode Available | 1 |
| Fine-Tuning Language Models with Just Forward Passes | May 27, 2023 | GPUIn-Context Learning | CodeCode Available | 3 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| RT-kNNS Unbound: Using RT Cores to Accelerate Unrestricted Neighbor Search | May 26, 2023 | GPU | —Unverified | 0 |
| Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU | May 26, 2023 | CPUGPU | CodeCode Available | 0 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPUGPU | CodeCode Available | 2 |
| Sliding Window Sum Algorithms for Deep Neural Networks | May 25, 2023 | CPUGPU | —Unverified | 0 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| Dynamic Data Augmentation via MCTS for Prostate MRI Segmentation | May 25, 2023 | Data AugmentationGPU | CodeCode Available | 0 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation | May 24, 2023 | Formal LogicGPU | CodeCode Available | 1 |
| Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models | May 24, 2023 | GPUImage Generation | CodeCode Available | 0 |
| AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content | May 24, 2023 | Document Summarizationdocument understanding | —Unverified | 0 |
| AutoDepthNet: High Frame Rate Depth Map Reconstruction using Commodity Depth and RGB Cameras | May 24, 2023 | Depth EstimationGPU | —Unverified | 0 |
| READ: Recurrent Adaptation of Large Transformers | May 24, 2023 | GPUTransfer Learning | —Unverified | 0 |
| Graph Analysis Using a GPU-based Parallel Algorithm: Quantum Clustering | May 24, 2023 | ClusteringGPU | —Unverified | 0 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | ChatbotGPU | CodeCode Available | 6 |
| An Accelerated Pipeline for Multi-label Renal Pathology Image Segmentation at the Whole Slide Image Level | May 23, 2023 | GPUImage Segmentation | CodeCode Available | 1 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPUImage Generation | CodeCode Available | 2 |
| Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks | May 23, 2023 | AttributeDataset Generation | CodeCode Available | 1 |
| Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding | May 22, 2023 | GPUIn-Context Learning | —Unverified | 0 |
| Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference | May 22, 2023 | Computational EfficiencyGPU | CodeCode Available | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPUQuantization | —Unverified | 0 |
| Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching | May 20, 2023 | CPUGPU | —Unverified | 0 |