SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer Jan 30, 2025 Image Generation Model Compression
Code Code Available 9GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers Oct 31, 2022 GPU Language Modelling
Code Code Available 7A Survey on Knowledge Distillation of Large Language Models Feb 20, 2024 Data Augmentation Knowledge Distillation
Code Code Available 5LLM Inference Unveiled: Survey and Roofline Model Insights Feb 26, 2024 Knowledge Distillation Language Modelling
Code Code Available 4Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields Aug 7, 2024 3DGS Model Compression
Code Code Available 3ZipNN: Lossless Compression for AI Models Nov 7, 2024 Model Compression
Code Code Available 3SVD-LLM V2: Optimizing Singular Value Truncation for Large Language Model Compression Mar 16, 2025 Language Modeling Language Modelling
Code Code Available 3SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression Mar 12, 2024 Language Modeling Language Modelling
Code Code Available 3Efficient Reasoning Models: A Survey Apr 15, 2025 Knowledge Distillation Model Compression
Code Code Available 3ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models Aug 16, 2024 GPU Model Compression
Code Code Available 3LightGNN: Simple Graph Neural Network for Recommendation Jan 6, 2025 Computational Efficiency Graph Neural Network
Code Code Available 2On-Device Domain Generalization Sep 15, 2022 Data Augmentation Domain Generalization
Code Code Available 2PromptMM: Multi-Modal Knowledge Distillation for Recommendation with Prompt-Tuning Feb 27, 2024 Knowledge Distillation Model Compression
Code Code Available 2Diffusion Models for Image Restoration and Enhancement -- A Comprehensive Survey Aug 18, 2023 Deblurring Image Restoration
Code Code Available 2Compact 3D Gaussian Representation for Radiance Field Nov 22, 2023 3DGS Model Compression
Code Code Available 2Well-Read Students Learn Better: On the Importance of Pre-training Compact Models Aug 23, 2019 Knowledge Distillation Language Modelling
Code Code Available 2OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Aug 25, 2023 Common Sense Reasoning Computational Efficiency
Code Code Available 2QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning Feb 6, 2024 Image Generation Model Compression
Code Code Available 2Compressing Volumetric Radiance Fields to 1 MB Nov 29, 2022 Model Compression NeRF
Code Code Available 2Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers Jun 25, 2024 Image Generation Model Compression
Code Code Available 2LiDAR-PTQ: Post-Training Quantization for Point Cloud 3D Object Detection Jan 29, 2024 3D Object Detection Autonomous Vehicles
Code Code Available 2Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design May 2, 2024 Model Compression Neural Network Compression
Code Code Available 2Learning Student Networks in the Wild Jun 19, 2021 Knowledge Distillation Model Compression
Code Code Available 2MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models Sep 26, 2024 Large Language Model Model Compression
Code Code Available 2Fast convolutional neural networks on FPGAs with hls4ml Jan 13, 2021 Model Compression Quantization
Code Code Available 2AMC: AutoML for Model Compression and Acceleration on Mobile Devices Feb 10, 2018 AutoML GPU
Code Code Available 2Data-Free Knowledge Distillation for Deep Neural Networks Oct 19, 2017 Data-free Knowledge Distillation Knowledge Distillation
Code Code Available 2MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression Jun 21, 2024 GPU Language Modeling
Code Code Available 2Knowledge Distillation and Student-Teacher Learning for Visual Intelligence: A Review and New Outlooks Apr 13, 2020 Knowledge Distillation Model Compression
Code Code Available 2Towards Lightweight Super-Resolution with Dual Regression Learning Jul 16, 2022 Image Super-Resolution Model Compression
Code Code Available 2Contrastive Representation Distillation Oct 23, 2019 Contrastive Learning Knowledge Distillation
Code Code Available 13DG-STFM: 3D Geometric Guided Student-Teacher Feature Matching Jul 6, 2022 Homography Estimation Model Compression
Code Code Available 1CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution Jul 4, 2022 Compiler Optimization image-classification
Code Code Available 1Designing Large Foundation Models for Efficient Training and Inference: A Survey Sep 3, 2024 Knowledge Distillation Model Compression
Code Code Available 1Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference Jun 26, 2023 CPU Model Compression
Code Code Available 1Contrastive Distillation on Intermediate Representations for Language Model Compression Sep 29, 2020 Knowledge Distillation Language Modeling
Code Code Available 1CrossKD: Cross-Head Knowledge Distillation for Object Detection Jun 20, 2023 Dense Object Detection Knowledge Distillation
Code Code Available 1CompRess: Self-Supervised Learning by Compressing Representations Oct 28, 2020 Linear evaluation Model Compression
Code Code Available 1Compression-Aware Video Super-Resolution Jan 1, 2023 Model Compression Super-Resolution
Code Code Available 1Computation-Efficient Knowledge Distillation via Uncertainty-Aware Mixup Dec 17, 2020 Informativeness Knowledge Distillation
Code Code Available 1Streamlining Redundant Layers to Compress Large Language Models Mar 28, 2024 Model Compression
Code Code Available 1AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression May 17, 2023 Knowledge Distillation Language Modeling
Code Code Available 1Composable Interventions for Language Models Jul 9, 2024 knowledge editing Machine Unlearning
Code Code Available 1Consistent Quantity-Quality Control across Scenes for Deployment-Aware Gaussian Splatting May 15, 2025 3DGS Model Compression
Code Code Available 1DarwinLM: Evolutionary Structured Pruning of Large Language Models Feb 11, 2025 Model Compression
Code Code Available 1Communication-Computation Trade-Off in Resource-Constrained Edge Inference Jun 3, 2020 Edge-computing Model Compression
Code Code Available 1COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models May 26, 2023 Model Compression
Code Code Available 1Compacting, Picking and Growing for Unforgetting Continual Learning Oct 15, 2019 Age And Gender Classification Continual Learning
Code Code Available 1Comprehensive Knowledge Distillation with Causal Intervention Dec 1, 2021 Causal Inference Knowledge Distillation
Code Code Available 1Communication-Efficient Diffusion Strategy for Performance Improvement of Federated Learning with Non-IID Data Jul 15, 2022 Federated Learning Model Compression
Code Code Available 1