| HETHUB: A Distributed Training System with Heterogeneous Cluster for Large-Scale Models | May 25, 2024 | GPU | Unverified | 0 |
| MINet: Multi-scale Interactive Network for Real-time Salient Object Detection of Strip Steel Surface Defects | May 25, 2024 | CPU, Defect Detection | Code Available | 1 |
| A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning | May 25, 2024 | GPU, regression | Unverified | 0 |
| Accelerating Diffusion Models with Parallel Sampling: Inference at Sub-Linear Time Complexity | May 24, 2024 | GPU | Unverified | 0 |
| Looking Backward: Streaming Video-to-Video Translation with Feature Banks | May 24, 2024 | GPU, Translation | Code Available | 4 |
| DAGER: Exact Gradient Inversion for Large Language Models | May 24, 2024 | Decoder, Federated Learning | Code Available | 1 |
| Sparse Matrix in Large Language Model Fine-tuning | May 24, 2024 | GPU, Language Modeling | Code Available | 1 |
| ARVideo: Autoregressive Pretraining for Self-Supervised Video Representation Learning | May 24, 2024 | GPU, Representation Learning | Unverified | 0 |
| Fast inference with Kronecker-sparse matrices | May 23, 2024 | GPU, Management | Code Available | 1 |
| Fast Bayesian Inference for Neutrino Non-Standard Interactions at Dark Matter Direct Detection Experiments | May 23, 2024 | Bayesian Inference, GPU | Code Available | 0 |
| ArchesWeather: An efficient AI weather forecasting model at 1.5° resolution | May 23, 2024 | GPU, Weather Forecasting | Code Available | 1 |
| CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization | May 23, 2024 | Code Generation, GPU | Code Available | 0 |
| Tele-Aloha: A Low-budget and High-authenticity Telepresence System Using Sparse RGB Cameras | May 23, 2024 | 2k, GPU | Unverified | 0 |
| MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | May 23, 2024 | Action Recognition, Action Segmentation | Unverified | 0 |
| LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models | May 23, 2024 | Computational Efficiency, Decoder | Unverified | 0 |
| ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification | May 23, 2024 | GPU, GSM8K | Code Available | 1 |
| Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference | May 23, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 |
| ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation | May 22, 2024 | GPU | Unverified | 0 |
| Attention as an RNN | May 22, 2024 | GPU, Time Series | Code Available | 1 |
| HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images | May 22, 2024 | GPU, Knowledge Distillation | Unverified | 0 |
| Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization | May 22, 2024 | GPU | Code Available | 0 |
| What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions | May 22, 2024 | Data Valuation, GPU | Code Available | 2 |
| PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference | May 21, 2024 | GPU | Code Available | 1 |
| Personalized Residuals for Concept-Driven Text-to-Image Generation | May 21, 2024 | GPU, Image Generation | Unverified | 0 |
| Parallelization of the K-Means Algorithm with Applications to Big Data Clustering | May 20, 2024 | Clustering, GPU | Unverified | 0 |