Node Identifiers: Compact, Discrete Representations for Efficient Graph Learning May 26, 2024 Computational Efficiency Graph Classification
Code Code Available 1PTQ4DiT: Post-training Quantization for Diffusion Transformers May 25, 2024 Image Generation Quantization
Code Code Available 1M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation May 25, 2024 Language Modeling Language Modelling
Code Code Available 1Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models May 23, 2024 Data Compression Image Generation
Code Code Available 1ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification May 23, 2024 GPU GSM8K
Code Code Available 1Nearest is Not Dearest: Towards Practical Defense against Quantization-conditioned Backdoor Attacks May 21, 2024 Quantization
Code Code Available 1Deep Learning-Enabled One-Bit DoA Estimation May 15, 2024 compressed sensing Deep Learning
Code Code Available 1Feature-based Federated Transfer Learning: Communication Efficiency, Robustness and Privacy May 15, 2024 Federated Learning image-classification
Code Code Available 1Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs May 6, 2024 Quantization
Code Code Available 1Vector Quantization for Recommender Systems: A Review and Outlook May 6, 2024 Feature Compression Quantization
Code Code Available 1Gradient-based Automatic Mixed Precision Quantization for Neural Networks On-Chip May 1, 2024 Jet Tagging Quantization
Code Code Available 1Tripod: Three Complementary Inductive Biases for Disentangled Representation Learning Apr 16, 2024 Data Compression Decoder
Code Code Available 1Adapting LLaMA Decoder to Vision Transformer Apr 10, 2024 Computational Efficiency Decoder
Code Code Available 1End-to-End Rate-Distortion Optimized 3D Gaussian Representation Apr 9, 2024 3DGS Quantization
Code Code Available 1Have You Merged My Model? On The Robustness of Large Language Model IP Protection Methods Against Model Merging Apr 8, 2024 Language Modeling Language Modelling
Code Code Available 1BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models Apr 8, 2024 Binarization Quantization
Code Code Available 1Outlier-Efficient Hopfield Layers for Large Transformer-Based Models Apr 4, 2024 Benchmarking Quantization
Code Code Available 1BiPer: Binary Neural Networks using a Periodic Function Apr 1, 2024 Binarization Classification with Binary Neural Network
Code Code Available 1QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs Mar 30, 2024 Quantization
Code Code Available 1Genetic Quantization-Aware Approximation for Non-Linear Operations in Transformers Mar 28, 2024 Quantization Semantic Segmentation
Code Code Available 1AffineQuant: Affine Transformation Quantization for Large Language Models Mar 19, 2024 Quantization
Code Code Available 1MELTing point: Mobile Evaluation of Language Transformers Mar 19, 2024 Benchmarking Quantization
Code Code Available 1Self-Supervised Quantization-Aware Knowledge Distillation Mar 17, 2024 Knowledge Distillation Quantization
Code Code Available 1Representing Domain-Mixing Optical Degradation for Real-World Computational Aberration Correction via Vector Quantization Mar 15, 2024 Domain Adaptation Quantization
Code Code Available 1TaxoLLaMA: WordNet-based Model for Solving Multiple Lexical Semantic Tasks Mar 14, 2024 Domain Adaptation Few-Shot Learning
Code Code Available 1COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization Mar 11, 2024 Quantization
Code Code Available 1FrameQuant: Flexible Low-Bit Quantization for Transformers Mar 10, 2024 Quantization
Code Code Available 1Self-Adapting Large Visual-Language Models to Edge Devices across Visual Modalities Mar 7, 2024 Contrastive Learning Knowledge Distillation
Code Code Available 1LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization Mar 2, 2024 GPU Quantization
Code Code Available 1"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach Mar 1, 2024 Model Compression Quantization
Code Code Available 1NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions Feb 29, 2024 Quantization
Code Code Available 1Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation Feb 21, 2024 Arithmetic Reasoning GSM8K
Code Code Available 1Understanding and Mitigating the Threat of Vec2Text to Dense Retrieval Systems Feb 20, 2024 Quantization Retrieval
Code Code Available 1LaCo: Large Language Model Pruning via Layer Collapse Feb 17, 2024 Knowledge Distillation Language Modeling
Code Code Available 1Hierarchical Prior-based Super Resolution for Point Cloud Geometry Compression Feb 17, 2024 Decoder Quantization
Code Code Available 1EdgeQAT: Entropy and Distribution Guided Quantization-Aware Training for the Acceleration of Lightweight LLMs on the Edge Feb 16, 2024 Quantization
Code Code Available 1PRISE: LLM-Style Sequence Compression for Learning Temporal Action Abstractions in Control Feb 16, 2024 continuous-control Continuous Control
Code Code Available 1A Thorough Examination of Decoding Methods in the Era of LLMs Feb 10, 2024 Quantization
Code Code Available 1Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings Feb 9, 2024 Machine Translation Quantization
Code Code Available 1ApiQ: Finetuning of 2-Bit Quantized Large Language Model Feb 7, 2024 GPU Language Modeling
Code Code Available 1LQER: Low-Rank Quantization Error Reconstruction for LLMs Feb 4, 2024 Knowledge Distillation Quantization
Code Code Available 1Scaling Sparse Fine-Tuning to Large Language Models Jan 29, 2024 parameter-efficient fine-tuning Quantization
Code Code Available 1HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval Jan 14, 2024 Contrastive Learning Image Retrieval
Code Code Available 1EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models Jan 9, 2024 Denoising Image Generation
Code Code Available 1Retraining-free Model Quantization via One-Shot Weight-Coupling Learning Jan 3, 2024 Model Compression Quantization
Code Code Available 1MOC-RVQ: Multilevel Codebook-Assisted Digital Generative Semantic Communication Jan 2, 2024 2k Quantization
Code Code Available 1JointSQ: Joint Sparsification-Quantization for Distributed Learning Jan 1, 2024 Quantization
Code Code Available 1Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training Jan 1, 2024 Adversarial Attack image-classification
Code Code Available 1Spatial-Aware Regression for Keypoint Localization Jan 1, 2024 3D Pose Estimation Pose Estimation
Code Code Available 1Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike Fluctuations Jan 1, 2024 Attribute Image Reconstruction
Code Code Available 1