Title | Date | Tags | Code status | Count
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging | Oct 1, 2024 | Computational Efficiency, Knowledge Distillation | Unverified | 0
Trainable pruned ternary quantization for medical signal classification models | Oct 1, 2024 | Model Compression, Quantization | Code Available | 0
Aggressive Post-Training Compression on Extremely Large Language Models | Sep 30, 2024 | Model Compression, Network Pruning | Unverified | 0
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries | Sep 29, 2024 | Knowledge Distillation, Model Compression | Unverified | 0
Value-Based Deep Multi-Agent Reinforcement Learning with Dynamic Sparse Training | Sep 28, 2024 | Model Compression, Multi-agent Reinforcement Learning | Unverified | 0
General Compression Framework for Efficient Transformer Object Tracking | Sep 26, 2024 | Model Compression, Object | Unverified | 0
MaskLLM: Learnable Semi-Structured Sparsity for Large Language Models | Sep 26, 2024 | Large Language Model, Model Compression | Code Available | 2
Search for Efficient Large Language Models | Sep 25, 2024 | GPU, Model Compression | Code Available | 1
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment | Sep 19, 2024 | Knowledge Distillation, Model Compression | Code Available | 0
Applications of Knowledge Distillation in Remote Sensing: A Survey | Sep 18, 2024 | Computational Efficiency, Instance Segmentation | Unverified | 0
ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration | Sep 15, 2024 | Model Compression | Code Available | 0
Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare | Sep 14, 2024 | Data Free Quantization, Image Segmentation | Unverified | 0
NVRC: Neural Video Representation Compression | Sep 11, 2024 | Model Compression, Quantization | Unverified | 0
Application Specific Compression of Deep Learning Models | Sep 9, 2024 | Deep Learning, Model Compression | Code Available | 0
Ultron: Enabling Temporal Geometry Compression of 3D Mesh Sequences using Temporal Correspondence and Mesh Deformation | Sep 8, 2024 | 3D Reconstruction, Model Compression | Code Available | 0
LoCa: Logit Calibration for Knowledge Distillation | Sep 7, 2024 | image-classification, Image Classification | Unverified | 0
Foundations of Large Language Model Compression -- Part 1: Weight Quantization | Sep 3, 2024 | Language Modeling, Language Modelling | Code Available | 0
Designing Large Foundation Models for Efficient Training and Inference: A Survey | Sep 3, 2024 | Knowledge Distillation, Model Compression | Code Available | 1
Efficient Point Cloud Classification via Offline Distillation Framework and Negative-Weight Self-Distillation Technique | Sep 3, 2024 | Data Augmentation, Knowledge Distillation | Unverified | 0
Edge AI: Evaluation of Model Compression Techniques for Convolutional Neural Networks | Sep 2, 2024 | Edge-computing, image-classification | Unverified | 0
Hyper-Compression: Model Compression via Hyperfunction | Sep 1, 2024 | model, Model Compression | Code Available | 1
MedDet: Generative Adversarial Distillation for Efficient Cervical Disc Herniation Detection | Aug 30, 2024 | Knowledge Distillation, Model Compression | Code Available | 0
Convolutional Neural Network Compression Based on Low-Rank Decomposition | Aug 29, 2024 | Model Compression, Neural Network Compression | Unverified | 0
Variational autoencoder-based neural network model compression | Aug 25, 2024 | Anomaly Detection, Image Generation | Unverified | 0
MPruner: Optimizing Neural Network Size with CKA-Based Mutual Information Pruning | Aug 24, 2024 | Model Compression | Unverified | 0
Localize-and-Stitch: Efficient Model Merging via Sparse Task Arithmetic | Aug 24, 2024 | Model Compression, Task Arithmetic | Code Available | 1
A Web-Based Solution for Federated Learning with LLM-Based Automation | Aug 23, 2024 | CPU, Federated Learning | Unverified | 0
A Survey on Drowsiness Detection -- Modern Applications and Methods | Aug 23, 2024 | Model Compression, Survey | Unverified | 0
Pruning By Explaining Revisited: Optimizing Attribution Methods to Prune CNNs and Transformers | Aug 22, 2024 | Model Compression | Code Available | 1
NeR-VCP: A Video Content Protection Method Based on Implicit Neural Representation | Aug 20, 2024 | Model Compression, NER | Unverified | 0
Fine-Tuning and Deploying Large Language Models Over Edges: Issues and Approaches | Aug 20, 2024 | GPU, Model Compression | Unverified | 0
MoDeGPT: Modular Decomposition for Large Language Model Compression | Aug 19, 2024 | GPU, Language Modeling | Unverified | 0
RepControlNet: ControlNet Reparameterization | Aug 17, 2024 | Model Compression | Unverified | 0
ABQ-LLM: Arbitrary-Bit Quantized Inference Acceleration for Large Language Models | Aug 16, 2024 | GPU, Model Compression | Code Available | 3
Computer Vision Model Compression Techniques for Embedded Systems: A Survey | Aug 15, 2024 | Model Compression, Survey | Code Available | 0
An Effective Information Theoretic Framework for Channel Pruning | Aug 14, 2024 | Model Compression | Unverified | 0
Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection | Aug 14, 2024 | Efficient Neural Network, Model Compression | Unverified | 0
Knowledge Distillation with Refined Logits | Aug 14, 2024 | Knowledge Distillation, Model Compression | Code Available | 1
Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Aug 7, 2024 | 3DGS, Model Compression | Code Available | 3
AdapMTL: Adaptive Pruning Framework for Multitask Learning Model | Aug 7, 2024 | model, Model Compression | Unverified | 0
DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers | Aug 6, 2024 | Model Compression, Quantization | Unverified | 0
Compress and Compare: Interactively Evaluating Efficiency and Behavior Across ML Model Compression Experiments | Aug 6, 2024 | image-classification, Image Classification | Unverified | 0
Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression | Aug 6, 2024 | image-classification, Image Classification | Code Available | 0
Artificial Neural Networks for Photonic Applications: From Algorithms to Implementation | Aug 2, 2024 | Model Compression | Unverified | 0
An Efficient Real-Time Object Detection Framework on Resource-Constricted Hardware Devices via Software and Hardware Co-design | Aug 2, 2024 | Model Compression, Neural Network Compression | Unverified | 0
Tensor Train Low-rank Approximation (TT-LoRA): Democratizing AI with Accelerated LLMs | Aug 2, 2024 | Machine Translation, Model Compression | Unverified | 0
NeuSemSlice: Towards Effective DNN Model Maintenance via Neuron-level Semantic Slicing | Jul 26, 2024 | Model Compression, Semantic Similarity | Unverified | 0
Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures | Jul 22, 2024 | Knowledge Distillation, Model Compression | Code Available | 0
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models | Jul 22, 2024 | Deep Learning, image-classification | Unverified | 0
LORTSAR: Low-Rank Transformer for Skeleton-based Action Recognition | Jul 19, 2024 | Action Recognition, Computational Efficiency | Unverified | 0