Norm Tweaking: High-performance Low-bit Quantization of Large Language Models Sep 6, 2023 Model Compression Quantization
— Unverified 0Bandwidth-efficient Inference for Neural Image Compression Sep 6, 2023 Data Compression Image Compression
— Unverified 0RobustEdge: Low Power Adversarial Detection for Cloud-Edge Systems Sep 5, 2023 Adversarial Robustness Quantization
— Unverified 0QuantEase: Optimization-based Quantization for Language Models Sep 5, 2023 GPU Quantization
— Unverified 0On-Chip Hardware-Aware Quantization for Mixed Precision Neural Networks Sep 5, 2023 Quantization
— Unverified 0A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking Sep 5, 2023 Benchmarking Knowledge Distillation
— Unverified 0Compressing Vision Transformers for Low-Resource Visual Learning Sep 5, 2023 Autonomous Navigation image-classification
Code Code Available 0On the fly Deep Neural Network Optimization Control for Low-Power Computer Vision Sep 4, 2023 Quantization
— Unverified 0Softmax Bias Correction for Quantized Generative Models Sep 4, 2023 Language Modeling Language Modelling
— Unverified 0eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models Sep 2, 2023 Clustering CPU
— Unverified 0Few shot font generation via transferring similarity guided global style and quantization local style Sep 2, 2023 Disentanglement Font Generation
Code Code Available 1RepCodec: A Speech Representation Codec for Speech Tokenization Aug 31, 2023 Language Modeling Language Modelling
Code Code Available 1Learning Category Trees for ID-Based Recommendation: Exploring the Power of Differentiable Vector Quantization Aug 31, 2023 Click-Through Rate Prediction Collaborative Filtering
Code Code Available 0SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models Aug 31, 2023 Decoder Language Modeling
Code Code Available 2FPTQ: Fine-grained Post-Training Quantization for Large Language Models Aug 30, 2023 Quantization
— Unverified 0Implementation and Evaluation of Physical Layer Key Generation on SDR based LoRa Platform Aug 30, 2023 Quantization
— Unverified 0Continual Learning for Generative Retrieval over Dynamic Corpora Aug 29, 2023 Continual Learning Quantization
Code Code Available 0Uncovering the Hidden Cost of Model Compression Aug 29, 2023 model Model Compression
Code Code Available 0On-Device Learning with Binary Neural Networks Aug 29, 2023 Continual Learning Quantization
— Unverified 0Low-bit Quantization for Deep Graph Neural Networks with Smoothness-aware Message Propagation Aug 29, 2023 Graph Neural Network Node Classification
Code Code Available 0Maestro: Uncovering Low-Rank Structures via Trainable Decomposition Aug 28, 2023 Low-rank compression Quantization
Code Code Available 0MEMORY-VQ: Compression for Tractable Internet-Scale Memory Aug 28, 2023 Quantization Retrieval
— Unverified 0VQ-Font: Few-Shot Font Generation with Structure-Aware Enhancement and Quantization Aug 27, 2023 Font Generation Quantization
Code Code Available 1Efficient Learned Lossless JPEG Recompression Aug 25, 2023 GPU Image Compression
— Unverified 0OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models Aug 25, 2023 Common Sense Reasoning Computational Efficiency
Code Code Available 2A2Q: Accumulator-Aware Quantization with Guaranteed Overflow Avoidance Aug 25, 2023 Quantization
Code Code Available 0Quantized distributed Nash equilibrium seeking under DoS attacks Aug 24, 2023 Quantization
— Unverified 0Hybrid noise shaping for audio coding using perfectly overlapped window Aug 24, 2023 Quantization
— Unverified 0Robust open-set classification for encrypted traffic fingerprinting Aug 23, 2023 Classification open-set classification
Code Code Available 0Consistent Signal Reconstruction from Streaming Multivariate Time Series Aug 23, 2023 Quantization Time Series
— Unverified 0Compressed Models Decompress Race Biases: What Quantized Models Forget for Fair Face Recognition Aug 23, 2023 Face Recognition Quantization
— Unverified 0Distributed Energy Resource Management: All-Time Resource-Demand Feasibility, Delay-Tolerance, Nonlinearity, and Beyond Aug 22, 2023 All energy management
— Unverified 0Towards Clip-Free Quantized Super-Resolution Networks: How to Tame Representative Images Aug 22, 2023 Quantization Super-Resolution
— Unverified 0Exploring Parameter-Efficient Fine-Tuning Techniques for Code Generation with Large Language Models Aug 21, 2023 Code Generation In-Context Learning
Code Code Available 1Jumping through Local Minima: Quantization in the Loss Landscape of Vision Transformers Aug 21, 2023 Quantization
Code Code Available 1QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection Aug 21, 2023 3D Object Detection Model Compression
— Unverified 0Dataset Quantization Aug 21, 2023 Dataset Distillation object-detection
Code Code Available 2Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts Aug 21, 2023 Image Generation Quantization
— Unverified 0Quantization-based Optimization with Perspective of Quantum Mechanics Aug 20, 2023 global-optimization Quantization
— Unverified 0Analyzing Quantization in TVM Aug 19, 2023 Quantization
— Unverified 0FunQuant: A R package to perform quantization in the context of rare events and time-consuming simulations Aug 18, 2023 Quantization
— Unverified 0NAPA-VQ: Neighborhood Aware Prototype Augmentation with Vector Quantization for Continual Learning Aug 18, 2023 class-incremental learning Class Incremental Learning
Code Code Available 1SHARK: A Lightweight Model Compression Approach for Large-scale Recommender Systems Aug 18, 2023 Model Compression Quantization
— Unverified 0ResQ: Residual Quantization for Video Perception Aug 18, 2023 Optical Flow Estimation Pose Estimation
— Unverified 0JPEG Quantized Coefficient Recovery via DCT Domain Spatial-Frequential Transformer Aug 17, 2023 JPEG Artifact Removal Quantization
— Unverified 0FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs Aug 16, 2023 GPU Mixture-of-Experts
— Unverified 0Precision and Recall Reject Curves for Classification Aug 16, 2023 Classification Quantization
— Unverified 0Characteristics of networks generated by kernel growing neural gas Aug 16, 2023 Clustering Quantization
Code Code Available 0Gradient-Based Post-Training Quantization: Challenging the Status Quo Aug 15, 2023 Quantization
— Unverified 0A Survey on Model Compression for Large Language Models Aug 15, 2023 Benchmarking Knowledge Distillation
— Unverified 0