| A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Jun 15, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| Coralai: Intrinsic Evolution of Embodied Neural Cellular Automata Ecosystems | Jun 14, 2024 | DiversityGPU | CodeCode Available | 1 |
| Optimal Kernel Orchestration for Tensor Programs with Korch | Jun 13, 2024 | DiversityGPU | CodeCode Available | 1 |
| COVE: Unleashing the Diffusion Feature Correspondence for Consistent Video Editing | Jun 13, 2024 | DenoisingGPU | CodeCode Available | 1 |
| TLCM: Training-efficient Latent Consistency Model for Image Generation with 2-8 Steps | Jun 9, 2024 | GPUImage Generation | CodeCode Available | 1 |
| MEFT: Memory-Efficient Fine-Tuning through Sparse Adapter | Jun 7, 2024 | CPUGPU | CodeCode Available | 1 |
| Queue management for slo-oriented large language model serving | Jun 5, 2024 | BlockingGPU | CodeCode Available | 1 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 |
| Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Jun 4, 2024 | document understandingGPU | CodeCode Available | 1 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed OptimizationFederated Learning | CodeCode Available | 1 |
| RGFN: Synthesizable Molecular Generation Using GFlowNets | Jun 1, 2024 | GPU | CodeCode Available | 1 |
| μLO: Compute-Efficient Meta-Generalization of Learned Optimizers | May 31, 2024 | GPUZero-shot Generalization | CodeCode Available | 1 |
| Spatio-Spectral Graph Neural Networks | May 29, 2024 | GPUGraph Classification | CodeCode Available | 1 |
| Cardiovascular Disease Detection from Multi-View Chest X-rays with BI-Mamba | May 28, 2024 | Computed Tomography (CT)GPU | CodeCode Available | 1 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPUDefect Detection | CodeCode Available | 1 |
| DAGER: Exact Gradient Inversion for Large Language Models | May 24, 2024 | DecoderFederated Learning | CodeCode Available | 1 |
| Sparse Matrix in Large Language Model Fine-tuning | May 24, 2024 | GPULanguage Modeling | CodeCode Available | 1 |
| ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution | May 23, 2024 | GPUWeather Forecasting | CodeCode Available | 1 |
| ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification | May 23, 2024 | GPUGSM8K | CodeCode Available | 1 |
| Fast inference with Kronecker-sparse matrices | May 23, 2024 | GPUManagement | CodeCode Available | 1 |
| Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference | May 23, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Attention as an RNN | May 22, 2024 | GPUTime Series | CodeCode Available | 1 |
| PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference | May 21, 2024 | GPU | CodeCode Available | 1 |
| Token-wise Influential Training Data Retrieval for Large Language Models | May 20, 2024 | CPUGPU | CodeCode Available | 1 |
| Hybrid CNN-Transformer Architecture for Efficient Large-Scale Video Snapshot Compressive Imaging | May 19, 2024 | GPU | CodeCode Available | 1 |