| Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models | Dec 12, 2023 | GPUModel Compression | CodeCode Available | 1 |
| GateNet: A novel Neural Network Architecture for Automated Flow Cytometry Gating | Dec 12, 2023 | GPU | CodeCode Available | 1 |
| Compound Text-Guided Prompt Tuning via Image-Adaptive Cues | Dec 11, 2023 | Domain GeneralizationGPU | CodeCode Available | 1 |
| Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections | Dec 8, 2023 | Deep LearningGPU | CodeCode Available | 1 |
| SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM | Dec 6, 2023 | GPUQuantization | CodeCode Available | 1 |
| On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm | Dec 6, 2023 | Dataset DistillationDiversity | CodeCode Available | 1 |
| MMM: Generative Masked Motion Model | Dec 6, 2023 | GPUmodel | CodeCode Available | 1 |
| FlexModel: A Framework for Interpretability of Distributed Large Language Models | Dec 5, 2023 | Distributed ComputingGPU | CodeCode Available | 1 |
| Minuet: Accelerating 3D Sparse Convolutions on GPUs | Dec 1, 2023 | GPU | CodeCode Available | 1 |
| Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Nov 30, 2023 | GPUInductive Bias | CodeCode Available | 1 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPUObject | CodeCode Available | 1 |
| GNNFlow: A Distributed Framework for Continuous Temporal GNN Learning on Dynamic Graphs | Nov 29, 2023 | CPUGPU | CodeCode Available | 1 |
| Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Nov 27, 2023 | GPUNovel View Synthesis | CodeCode Available | 1 |
| vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training | Nov 27, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| SpotServe: Serving Generative Large Language Models on Preemptible Instances | Nov 27, 2023 | GPUGraph Matching | CodeCode Available | 1 |
| ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization | Nov 22, 2023 | GPULanguage Modelling | CodeCode Available | 1 |
| Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots | Nov 21, 2023 | Boundary DetectionEdge-computing | CodeCode Available | 1 |
| LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning | Nov 20, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Nov 15, 2023 | 4k8k | CodeCode Available | 1 |
| InfMLLM: A Unified Framework for Visual-Language Tasks | Nov 12, 2023 | GPUImage Captioning | CodeCode Available | 1 |
| GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Nov 8, 2023 | CPUDecoder | CodeCode Available | 1 |
| Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Nov 7, 2023 | CPUGPU | CodeCode Available | 1 |
| VR-NeRF: High-Fidelity Virtualized Walkable Spaces | Nov 5, 2023 | 2kGPU | CodeCode Available | 1 |
| In Search of Lost Online Test-time Adaptation: A Survey | Oct 31, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| Network Contention-Aware Cluster Scheduling with Reinforcement Learning | Oct 31, 2023 | GPUreinforcement-learning | CodeCode Available | 1 |