| GateNet: A novel Neural Network Architecture for Automated Flow Cytometry Gating | Dec 12, 2023 | GPU | CodeCode Available | 1 |
| Rethinking Compression: Reduced Order Modelling of Latent Features in Large Language Models | Dec 12, 2023 | GPUModel Compression | CodeCode Available | 1 |
| Compound Text-Guided Prompt Tuning via Image-Adaptive Cues | Dec 11, 2023 | Domain GeneralizationGPU | CodeCode Available | 1 |
| Tenplex: Dynamic Parallelism for Deep Learning using Parallelizable Tensor Collections | Dec 8, 2023 | Deep LearningGPU | CodeCode Available | 1 |
| SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM | Dec 6, 2023 | GPUQuantization | CodeCode Available | 1 |
| On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm | Dec 6, 2023 | Dataset DistillationDiversity | CodeCode Available | 1 |
| MMM: Generative Masked Motion Model | Dec 6, 2023 | GPUmodel | CodeCode Available | 1 |
| FlexModel: A Framework for Interpretability of Distributed Large Language Models | Dec 5, 2023 | Distributed ComputingGPU | CodeCode Available | 1 |
| Minuet: Accelerating 3D Sparse Convolutions on GPUs | Dec 1, 2023 | GPU | CodeCode Available | 1 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPUObject | CodeCode Available | 1 |
| Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding | Nov 30, 2023 | GPUInductive Bias | CodeCode Available | 1 |
| GNNFlow: A Distributed Framework for Continuous Temporal GNN Learning on Dynamic Graphs | Nov 29, 2023 | CPUGPU | CodeCode Available | 1 |
| Animatable 3D Gaussian: Fast and High-Quality Reconstruction of Multiple Human Avatars | Nov 27, 2023 | GPUNovel View Synthesis | CodeCode Available | 1 |
| SpotServe: Serving Generative Large Language Models on Preemptible Instances | Nov 27, 2023 | GPUGraph Matching | CodeCode Available | 1 |
| vTrain: A Simulation Framework for Evaluating Cost-effective and Compute-optimal Large Language Model Training | Nov 27, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| ComPEFT: Compression for Communicating Parameter Efficient Updates via Sparsification and Quantization | Nov 22, 2023 | GPULanguage Modelling | CodeCode Available | 1 |
| Mobile-Seed: Joint Semantic Segmentation and Boundary Detection for Mobile Robots | Nov 21, 2023 | Boundary DetectionEdge-computing | CodeCode Available | 1 |
| LQ-LoRA: Low-rank Plus Quantized Matrix Decomposition for Efficient Language Model Finetuning | Nov 20, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Nov 15, 2023 | 4k8k | CodeCode Available | 1 |
| InfMLLM: A Unified Framework for Visual-Language Tasks | Nov 12, 2023 | GPUImage Captioning | CodeCode Available | 1 |
| GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Nov 8, 2023 | CPUDecoder | CodeCode Available | 1 |
| Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Nov 7, 2023 | CPUGPU | CodeCode Available | 1 |
| VR-NeRF: High-Fidelity Virtualized Walkable Spaces | Nov 5, 2023 | 2kGPU | CodeCode Available | 1 |
| In Search of Lost Online Test-time Adaptation: A Survey | Oct 31, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| Network Contention-Aware Cluster Scheduling with Reinforcement Learning | Oct 31, 2023 | GPUreinforcement-learning | CodeCode Available | 1 |
| Prediction of Effective Elastic Moduli of Rocks using Graph Neural Networks | Oct 30, 2023 | GPU | CodeCode Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | DenoisingGPU | CodeCode Available | 1 |
| SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models | Oct 29, 2023 | GPUMixture-of-Experts | CodeCode Available | 1 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPUGPU | CodeCode Available | 1 |
| RedCoast: A Lightweight Tool to Automate Distributed Training of LLMs on Any GPU/TPUs | Oct 25, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Metrically Scaled Monocular Depth Estimation through Sparse Priors for Underwater Robots | Oct 25, 2023 | CPUDecoder | CodeCode Available | 1 |
| LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery | Oct 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| CAPIVARA: Cost-Efficient Approach for Improving Multilingual CLIP Performance on Low-Resource Languages | Oct 20, 2023 | DiversityGPU | CodeCode Available | 1 |
| CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation | Oct 19, 2023 | 2kGPU | CodeCode Available | 1 |
| MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient | Oct 17, 2023 | 3D Object DetectionGPU | CodeCode Available | 1 |
| DialogueLLM: Context and Emotion Knowledge-Tuned Large Language Models for Emotion Recognition in Conversations | Oct 17, 2023 | BenchmarkingEmotion Recognition | CodeCode Available | 1 |
| TRANSOM: An Efficient Fault-Tolerant System for Training LLMs | Oct 16, 2023 | Anomaly DetectionGPU | CodeCode Available | 1 |
| ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion | Oct 16, 2023 | Depth EstimationDepth Prediction | CodeCode Available | 1 |
| Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models | Oct 15, 2023 | CPUGPU | CodeCode Available | 1 |
| G10: Enabling An Efficient Unified GPU Memory and Storage Architecture with Smart Tensor Migrations | Oct 13, 2023 | Deep LearningGPU | CodeCode Available | 1 |
| QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models | Oct 13, 2023 | Computational EfficiencyGPU | CodeCode Available | 1 |
| QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models | Oct 12, 2023 | GPUQuantization | CodeCode Available | 1 |
| No Privacy Left Outside: On the (In-)Security of TEE-Shielded DNN Partition for On-Device ML | Oct 11, 2023 | GPUInference Attack | CodeCode Available | 1 |
| Sparse Fine-tuning for Inference Acceleration of Large Language Models | Oct 10, 2023 | CPUGPU | CodeCode Available | 1 |
| Persis: A Persian Font Recognition Pipeline Using Convolutional Neural Networks | Oct 8, 2023 | BinarizationCPU | CodeCode Available | 1 |
| GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models | Oct 8, 2023 | GPUReinforcement Learning (RL) | CodeCode Available | 1 |
| Surgical Gym: A high-performance GPU-based platform for reinforcement learning with surgical robots | Oct 7, 2023 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs | Oct 3, 2023 | GPU | CodeCode Available | 1 |
| Label Supervised LLaMA Finetuning | Oct 2, 2023 | GPUnamed-entity-recognition | CodeCode Available | 1 |
| Training a Large Video Model on a Single Machine in a Day | Sep 28, 2023 | Action RecognitionCPU | CodeCode Available | 1 |