Application Specific Compression of Deep Learning Models Sep 9, 2024 Deep Learning Model Compression
Code Code Available 0Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation Sep 8, 2024 3D Reconstruction Model Compression
Code Code Available 0LoCa: Logit Calibration for Knowledge Distillation Sep 7, 2024 image-classification Image Classification
— Unverified 0Foundations of Large Language Model Compression -- Part 1: Weight Quantization Sep 3, 2024 Language Modeling Language Modelling
Code Code Available 0Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique Sep 3, 2024 Data Augmentation Knowledge Distillation
— Unverified 0Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks Sep 2, 2024 Edge-computing image-classification
— Unverified 0MedDet: Generative Adversarial Distillation for Efficient Cervical Disc Herniation Detection Aug 30, 2024 Knowledge Distillation Model Compression
Code Code Available 0Convolutional Neural Network Compression Based on Low-Rank Decomposition Aug 29, 2024 Model Compression Neural Network Compression
— Unverified 0Variational autoencoder-based neural network model compression Aug 25, 2024 Anomaly Detection Image Generation
— Unverified 0MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning Aug 24, 2024 Model Compression
— Unverified 0A Web-Based Solution for Federated Learning with LLM-Based Automation Aug 23, 2024 CPU Federated Learning
— Unverified 0A Survey on Drowsiness Detection -- Modern Applications and Methods Aug 23, 2024 Model Compression Survey
— Unverified 0Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches Aug 20, 2024 GPU Model Compression
— Unverified 0NeR-VCP: A Video Content Protection Method Based on Implicit Neural Representation Aug 20, 2024 Model Compression NER
— Unverified 0MoDeGPT: Modular Decomposition for Large Language Model Compression Aug 19, 2024 GPU Language Modeling
— Unverified 0RepControlNet: ControlNet Reparameterization Aug 17, 2024 Model Compression
— Unverified 0Computer Vision Model Compression Techniques for Embedded Systems: A Survey Aug 15, 2024 Model Compression Survey
Code Code Available 0Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection Aug 14, 2024 Efficient Neural Network Model Compression
— Unverified 0An Effective Information Theoretic Framework for Channel Pruning Aug 14, 2024 Model Compression
— Unverified 0AdapMTL: Adaptive Pruning Framework for Multitask Learning Model Aug 7, 2024 model Model Compression
— Unverified 0DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers Aug 6, 2024 Model Compression Quantization
— Unverified 0Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments Aug 6, 2024 image-classification Image Classification
— Unverified 0Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression Aug 6, 2024 image-classification Image Classification
Code Code Available 0An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design Aug 2, 2024 Model Compression Neural Network Compression
— Unverified 0Artificial Neural Networks for Photonic Applications: From Algorithms to Implementation Aug 2, 2024 Model Compression
— Unverified 0Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs Aug 2, 2024 Machine Translation Model Compression
— Unverified 0NeuSemSlice: Towards Effective DNN Model Maintenance via Neuron-level Semantic Slicing Jul 26, 2024 Model Compression Semantic Similarity
— Unverified 0Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models Jul 22, 2024 Deep Learning image-classification
— Unverified 0Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures Jul 22, 2024 Knowledge Distillation Model Compression
Code Code Available 0LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition Jul 19, 2024 Action Recognition Computational Efficiency
— Unverified 0Compressed models are NOT miniature versions of large models Jul 18, 2024 Adversarial Attack Model Compression
— Unverified 0Mamba-PTQ: Outlier Channels in Recurrent Large Language Models Jul 17, 2024 Mamba Model Compression
— Unverified 0Minimizing PLM-Based Few-Shot Intent Detectors Jul 13, 2024 Data Augmentation Knowledge Distillation
Code Code Available 0Inference Optimization of Foundation Models on AI Accelerators Jul 12, 2024 Inference Optimization Model Compression
— Unverified 0Explicit-NeRF-QA: A Quality Assessment Database for Explicit NeRF Model Compression Jul 11, 2024 Model Compression NeRF
Code Code Available 0Beyond Perplexity: Multi-dimensional Safety Evaluation of LLM Compression Jul 6, 2024 Language Modeling Language Modelling
Code Code Available 0Quantizing YOLOv7: A Comprehensive Study Jul 6, 2024 Model Compression object-detection
— Unverified 0The Impact of Quantization and Pruning on Deep Reinforcement Learning Models Jul 5, 2024 Deep Reinforcement Learning Model Compression
— Unverified 0AMD: Automatic Multi-step Distillation of Large-scale Vision Models Jul 5, 2024 image-classification Image Classification
— Unverified 0Efficient DNN-Powered Software with Fair Sparse Models Jul 3, 2024 Fairness Model Compression
— Unverified 0MLKD-BERT: Multi-level Knowledge Distillation for Pre-trained Language Models Jul 3, 2024 Extractive Question-Answering Knowledge Distillation
— Unverified 0FoldGPT: Simple and Effective Large Language Model Compression Scheme Jul 1, 2024 Language Modeling Language Modelling
— Unverified 0MCNC: Manifold Constrained Network Compression Jun 27, 2024 Model Compression Quantization
— Unverified 0Speeding Up Image Classifiers with Little Companions Jun 24, 2024 image-classification Image Classification
— Unverified 0Exploring compressibility of transformer based text-to-music (TTM) models Jun 24, 2024 Decoder FAD
— Unverified 0Reinforced Knowledge Distillation for Time Series Regression Jun 21, 2024 Knowledge Distillation Model Compression
Code Code Available 0FLoCoRA: Federated learning compression with low-rank adaptation Jun 20, 2024 Federated Learning Model Compression
Code Code Available 0Failure-Resilient Distributed Inference with Model Compression over Heterogeneous Edge Devices Jun 20, 2024 Knowledge Distillation Model Compression
— Unverified 0SDQ: Sparse Decomposed Quantization for LLM Inference Jun 19, 2024 Model Compression Quantization
— Unverified 0Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model Jun 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0