Universal Joint Source-Channel Coding for Modulation-Agnostic Semantic Communication May 17, 2024 Decoder Quantization
— Unverified 0Flattened one-bit stochastic gradient descent: compressed distributed optimization with controlled variance May 17, 2024 Distributed Optimization Quantization
— Unverified 0Enhancing Perception Quality in Remote Sensing Image Compression via Invertible Neural Network May 17, 2024 Image Compression Quantization
— Unverified 0The Effect of Quantization in Federated Learning: A Rényi Differential Privacy Perspective May 16, 2024 Federated Learning Privacy Preserving
— Unverified 0Properties that allow or prohibit transferability of adversarial attacks among quantized networks May 15, 2024 Quantization
Code Code Available 0Neural Speech Coding for Real-time Communications using Constant Bitrate Scalar Quantization May 14, 2024 Quantization Scheduling
— Unverified 0FDD Massive MIMO: How to Optimally Combine UL Pilot and Limited DL CSI Feedback? May 14, 2024 Quantization
— Unverified 0Goal-oriented compression for L_p-norm-type goal functions: Application to power consumption scheduling May 13, 2024 Data Compression Quantization
— Unverified 0VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling May 13, 2024 Quantization
— Unverified 0Post Training Quantization of Large Language Models with Microscaling Formats May 12, 2024 Language Modeling Language Modelling
— Unverified 0Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization May 12, 2024 Language Modeling Language Modelling
— Unverified 0Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection May 10, 2024 Autonomous Driving GPU
— Unverified 0Compression-Realized Deep Structural Network for Video Quality Enhancement May 10, 2024 Denoising Motion Estimation
— Unverified 0Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models May 10, 2024 AI Agent Model Compression
— Unverified 0SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models May 10, 2024 GPU Quantization
— Unverified 0From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks May 9, 2024 Knowledge Distillation Model Compression
— Unverified 0Custom Gradient Estimators are Straight-Through Estimators in Disguise May 8, 2024 Quantization
— Unverified 0KV Cache is 1 Bit Per Channel: Efficient Large Language Model Inference with Coupled Quantization May 7, 2024 GPU Language Modeling
— Unverified 0Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer May 6, 2024 Efficient ViTs Model Compression
Code Code Available 0Compression-based Privacy Preservation for Distributed Nash Equilibrium Seeking in Aggregative Games May 6, 2024 Quantization
— Unverified 0Quantifying the Capabilities of LLMs across Scale and Precision May 6, 2024 Hallucination Misinformation
— Unverified 0Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment May 6, 2024 Arithmetic Reasoning Code Generation
— Unverified 0DeltaKWS: A 65nm 36nJ/Decision Bio-inspired Temporal-Sparsity-Aware Digital Keyword Spotting IC with 0.6V Near-Threshold SRAM May 6, 2024 channel selection Keyword Spotting
— Unverified 0Joint Discrete Precoding and RIS Optimization for RIS-Assisted MU-MIMO Communication Systems May 5, 2024 Quantization
— Unverified 0Efficient Text-driven Motion Generation via Latent Consistency Training May 5, 2024 Motion Generation Quantization
Code Code Available 0Exploring Extreme Quantization in Spiking Language Models May 4, 2024 Knowledge Distillation Language Modeling
— Unverified 0Three Quantization Regimes for ReLU Networks May 3, 2024 Quantization
— Unverified 0Lightweight Change Detection in Heterogeneous Remote Sensing Images with Online All-Integer Pruning Training May 3, 2024 All Change Detection
— Unverified 0Network reconstruction via the minimum description length principle May 2, 2024 Bayesian Inference Quantization
— Unverified 0Efficient Compression of Multitask Multilingual Speech Models May 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Joint Sequential Fronthaul Quantization and Hardware Complexity Reduction in Uplink Cell-Free Massive MIMO Networks May 2, 2024 Quantization
— Unverified 0Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment May 2, 2024 GPU NVIDIA Jetson Orin Nano
Code Code Available 0Wake Vision: A Tailored Dataset and Benchmark Suite for TinyML Computer Vision Applications May 1, 2024 Human Detection Knowledge Distillation
— Unverified 0When Quantization Affects Confidence of Large Language Models? May 1, 2024 Language Modeling Language Modelling
Code Code Available 0Self-supervised Pre-training of Text Recognizers May 1, 2024 Quantization Transfer Learning
Code Code Available 0Investigating Automatic Scoring and Feedback using Large Language Models May 1, 2024 parameter-efficient fine-tuning Quantization
— Unverified 0Transition Rate Scheduling for Quantization-Aware Training Apr 30, 2024 Quantization Scheduling
— Unverified 0Quantized Context Based LIF Neurons for Recurrent Spiking Neural Networks in 45nm Apr 28, 2024 Quantization
— Unverified 0Enhancing Channel Estimation in Quantized Systems with a Generative Prior Apr 26, 2024 Quantization
— Unverified 0sDAC -- Semantic Digital Analog Converter for Semantic Communications Apr 26, 2024 Quantization Semantic Communication
— Unverified 0MMGRec: Multimodal Generative Recommendation with Transformer Model Apr 25, 2024 model Multimodal Recommendation
— Unverified 0How to Parameterize Asymmetric Quantization Ranges for Quantization-Aware Training Apr 25, 2024 Quantization
— Unverified 0CoST: Contrastive Quantization based Semantic Tokenization for Generative Recommendation Apr 23, 2024 Decoder Language Modelling
— Unverified 0AdaQAT: Adaptive Bit-Width Quantization-Aware Training Apr 22, 2024 Quantization
— Unverified 0CNN-Based Equalization for Communications: Achieving Gigabit Throughput with a Flexible FPGA Hardware Architecture Apr 22, 2024 GPU Quantization
— Unverified 0Latency-Distortion Tradeoffs in Communicating Classification Results over Noisy Channels Apr 22, 2024 Navigate Quantization
— Unverified 0FedMPQ: Secure and Communication-Efficient Federated Learning with Multi-codebook Product Quantization Apr 21, 2024 Federated Learning Quantization
— Unverified 0HybridFlow: Infusing Continuity into Masked Codebook for Extreme Low-Bitrate Image Compression Apr 20, 2024 Decoder Image Compression
— Unverified 0A SER-based Device Selection Mechanism in Multi-bits Quantization Federated Learning Apr 20, 2024 Federated Learning Quantization
— Unverified 0EdgeFusion: On-Device Text-to-Image Generation Apr 18, 2024 Image Generation Knowledge Distillation
— Unverified 0