| A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Jun 15, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| Coralai: Intrinsic Evolution of Embodied Neural Cellular Automata Ecosystems | Jun 14, 2024 | DiversityGPU | CodeCode Available | 1 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | DenoisingGPU | CodeCode Available | 1 |
| Optimal Kernel Orchestration for Tensor Programs with Korch | Jun 13, 2024 | DiversityGPU | CodeCode Available | 1 |
| TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps | Jun 9, 2024 | GPUImage Generation | CodeCode Available | 1 |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jun 7, 2024 | CPUGPU | CodeCode Available | 1 |
| Queue management for slo-oriented large language model serving | Jun 5, 2024 | BlockingGPU | CodeCode Available | 1 |
| Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Jun 4, 2024 | document understandingGPU | CodeCode Available | 1 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed OptimizationFederated Learning | CodeCode Available | 1 |
| RGFN: Synthesizable Molecular Generation Using GFlowNets | Jun 1, 2024 | GPU | CodeCode Available | 1 |
| μLO: Compute-Efficient Meta-Generalization of Learned Optimizers | May 31, 2024 | GPUZero-shot Generalization | CodeCode Available | 1 |
| Spatio-Spectral Graph Neural Networks | May 29, 2024 | GPUGraph Classification | CodeCode Available | 1 |
| Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba | May 28, 2024 | Computed Tomography (CT)GPU | CodeCode Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPUDefect Detection | CodeCode Available | 1 |
| Sparse Matrix in Large Language Model Fine-tuning | May 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| DAGER: Exact Gradient Inversion for Large Language Models | May 24, 2024 | DecoderFederated Learning | CodeCode Available | 1 |
| ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution | May 23, 2024 | GPUWeather Forecasting | CodeCode Available | 1 |
| Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference | May 23, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Fast inference with Kronecker-sparse matrices | May 23, 2024 | GPUManagement | CodeCode Available | 1 |
| ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification | May 23, 2024 | GPUGSM8K | CodeCode Available | 1 |
| Attention as an RNN | May 22, 2024 | GPUTime Series | CodeCode Available | 1 |
| PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference | May 21, 2024 | GPU | CodeCode Available | 1 |
| Token-wise Influential Training Data Retrieval for Large Language Models | May 20, 2024 | CPUGPU | CodeCode Available | 1 |
| Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging | May 19, 2024 | GPU | CodeCode Available | 1 |
| HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models | May 16, 2024 | GPULanguage Modelling | CodeCode Available | 1 |
| No Time to Waste: Squeeze Time into Channel for Mobile Video Understanding | May 14, 2024 | Action DetectionGPU | CodeCode Available | 1 |
| Computation-Aware Kalman Filtering and Smoothing | May 14, 2024 | GPU | CodeCode Available | 1 |
| The Developing Human Connectome Project: A Fast Deep Learning-based Pipeline for Neonatal Cortical Surface Reconstruction | May 14, 2024 | GPUSurface Reconstruction | CodeCode Available | 1 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 |
| CoSense3D: an Agent-based Efficient Learning Framework for Collective Perception | Apr 29, 2024 | Data VisualizationDecision Making | CodeCode Available | 1 |
| LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report | Apr 29, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method | Apr 23, 2024 | DenoisingGPU | CodeCode Available | 1 |
| Mélange: Cost Efficient Large Language Model Serving by Exploiting GPU Heterogeneity | Apr 22, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| LLMem: Estimating GPU Memory Usage for Fine-Tuning Pre-Trained LLMs | Apr 16, 2024 | DecoderGPU | CodeCode Available | 1 |
| Interpolating neural network: A novel unification of machine learning and interpolation theory | Apr 16, 2024 | GPUPhysical Simulations | CodeCode Available | 1 |
| CATS: Contextually-Aware Thresholding for Sparsity in Large Language Models | Apr 12, 2024 | GPU | CodeCode Available | 1 |
| Lossless Acceleration of Large Language Model via Adaptive N-gram Parallel Decoding | Apr 10, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| FreeEval: A Modular Framework for Trustworthy and Efficient Evaluation of Large Language Models | Apr 9, 2024 | FairnessGPU | CodeCode Available | 1 |
| LIPT: Latency-aware Image Processing Transformer | Apr 9, 2024 | DenoisingGPU | CodeCode Available | 1 |
| Tensorized Ant Colony Optimization for GPU Acceleration | Apr 7, 2024 | CPUGPU | CodeCode Available | 1 |
| GeoT: Tensor Centric Library for Graph Neural Network via Efficient Segment Reduction on GPU | Apr 3, 2024 | GPUGraph Neural Network | CodeCode Available | 1 |
| IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT | Apr 2, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Taming Lookup Tables for Efficient Image Retouching | Mar 28, 2024 | CPUGPU | CodeCode Available | 1 |
| Siamese Vision Transformers are Scalable Audio-visual Learners | Mar 28, 2024 | Contrastive LearningGPU | CodeCode Available | 1 |
| ModeTv2: GPU-accelerated Motion Decomposition Transformer for Pairwise Optimization in Medical Image Registration | Mar 25, 2024 | Computational EfficiencyGPU | CodeCode Available | 1 |
| MetaAligner: Towards Generalizable Multi-Objective Alignment of Language Models | Mar 25, 2024 | GPUIn-Context Learning | CodeCode Available | 1 |
| MEDDAP: Medical Dataset Enhancement via Diversified Augmentation Pipeline | Mar 25, 2024 | GPU | CodeCode Available | 1 |
| Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression | Mar 23, 2024 | Dimensionality ReductionGPU | CodeCode Available | 1 |