EdgeFusion: On-Device Text-to-Image Generation | Apr 18, 2024 | Image Generation, Knowledge Distillation | Unverified | 0
LongVQ: Long Sequence Modeling with Vector Quantization on Structured Memory | Apr 17, 2024 | Computational Efficiency, Language Modeling | Unverified | 0
QGen: On the Ability to Generalize in Quantization Aware Training | Apr 17, 2024 | Quantization | Unverified | 0
Neural Network Approach for Non-Markovian Dissipative Dynamics of Many-Body Open Quantum Systems | Apr 17, 2024 | Benchmarking, Quantization | Unverified | 0
Variational quantization for state space models | Apr 17, 2024 | Quantization, State Space Models | Code Available | 0
Comprehensive Survey of Model Compression and Speed up for Vision Transformers | Apr 16, 2024 | Computational Efficiency, Edge-computing | Unverified | 0
Quantization of Large Language Models with an Overdetermined Basis | Apr 15, 2024 | Data Compression, Quantization | Unverified | 0
Efficient and accurate neural field reconstruction using resistive memory | Apr 15, 2024 | CPU, Novel View Synthesis | Unverified | 0
TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models | Apr 15, 2024 | Denoising, Model Optimization | Unverified | 0
SNN4Agents: A Framework for Developing Energy-Efficient Embodied Spiking Neural Networks for Autonomous Agents | Apr 14, 2024 | Quantization | Code Available | 0
Bullion: A Column Store for Machine Learning | Apr 13, 2024 | Quantization, Recommendation Systems | Unverified | 0
Full-Duplex Beyond Self-Interference: The Unlimited Sensing Way | Apr 12, 2024 | Quantization | Unverified | 0
Lossy Image Compression with Foundation Diffusion Models | Apr 12, 2024 | Denoising, Image Compression | Unverified | 0
1-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture | Apr 11, 2024 | All, Lesion Detection | Unverified | 0
Frame Quantization of Neural Networks | Apr 11, 2024 | Quantization | Unverified | 0
Edge-Efficient Deep Learning Models for Automatic Modulation Classification: A Performance Analysis | Apr 11, 2024 | Knowledge Distillation, Model Optimization | Unverified | 0
CQIL: Inference Latency Optimization with Concurrent Computation of Quasi-Independent Layers | Apr 10, 2024 | Quantization | Code Available | 0
Differentiable Search for Finding Optimal Quantization Strategy | Apr 10, 2024 | image-classification, Image Classification | Unverified | 0
Collaborative Edge AI Inference over Cloud-RAN | Apr 9, 2024 | Quantization | Unverified | 0
Encoder-Quantization-Motion-based Video Quality Metrics | Apr 9, 2024 | Quantization, Video Compression | Unverified | 0
Physics of Language Models: Part 3.3, Knowledge Capacity Scaling Laws | Apr 8, 2024 | Quantization | Unverified | 0
Investigating the Impact of Quantization on Adversarial Robustness | Apr 8, 2024 | Adversarial Robustness, Quantization | Unverified | 0
Exploring Quantization and Mapping Synergy in Hardware-Aware Deep Neural Network Accelerators | Apr 8, 2024 | Quantization, Scheduling | Code Available | 0
David and Goliath: An Empirical Evaluation of Attacks and Defenses for QNNs at the Deep Edge | Apr 8, 2024 | Edge-computing, Quantization | Code Available | 0
Gull: A Generative Multifunctional Audio Codec | Apr 7, 2024 | Audio Compression, Audio Source Separation | Unverified | 0
Weakly Supervised Deep Hyperspherical Quantization for Image Retrieval | Apr 7, 2024 | Image Retrieval, Quantization | Code Available | 0
Nanometer Scanning with Micrometer Sensing: Beating Quantization Constraints in Lissajous Trajectory Tracking | Apr 7, 2024 | Quantization | Unverified | 0
What Happens When Small Is Made Smaller? Exploring the Impact of Compression on Small Data Pretrained Language Models | Apr 6, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0
Fine-Tuning, Quantization, and LLMs: Navigating Unintended Outcomes | Apr 5, 2024 | Quantization | Unverified | 0
TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices | Apr 4, 2024 | Quantization, Question Answering | Unverified | 0
Mitigating the Impact of Outlier Channels for Language Model Quantization with Activation Regularization | Apr 4, 2024 | GPU, Language Modeling | Code Available | 0
DI-Retinex: Digital-Imaging Retinex Theory for Low-Light Image Enhancement | Apr 4, 2024 | Image Enhancement, Low-Light Image Enhancement | Unverified | 0
CLaM-TTS: Improving Neural Codec Language Model for Zero-Shot Text-to-Speech | Apr 3, 2024 | Language Modeling, Language Modelling | Unverified | 0
Cherry on Top: Parameter Heterogeneity and Quantization in Large Language Models | Apr 3, 2024 | Quantization | Unverified | 0
DNN Memory Footprint Reduction via Post-Training Intra-Layer Multi-Precision Quantization | Apr 3, 2024 | Edge-computing, Quantization | Unverified | 0
NeRFCodec: Neural Feature Compression Meets Neural Radiance Fields for Memory-Efficient Scene Representation | Apr 2, 2024 | Decoder, Feature Compression | Unverified | 0
Minimize Quantization Output Error with Bias Compensation | Apr 2, 2024 | Quantization | Code Available | 0
On the Effect of Quantization on Dynamic Mode Decomposition | Apr 2, 2024 | Quantization | Unverified | 0
RefQSR: Reference-based Quantization for Image Super-Resolution Networks | Apr 2, 2024 | Image Super-Resolution, Quantization | Unverified | 0
A Novel Audio Representation for Music Genre Identification in MIR | Apr 1, 2024 | Information Retrieval, Music Information Retrieval | Unverified | 0
Instance-Aware Group Quantization for Vision Transformers | Apr 1, 2024 | image-classification, Image Classification | Unverified | 0
Towards Variable and Coordinated Holistic Co-Speech Motion Generation | Mar 30, 2024 | Motion Generation, Quantization | Unverified | 0
Accurate Block Quantization in LLMs with Outliers | Mar 29, 2024 | Quantization | Unverified | 0
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs | Mar 29, 2024 | CPU, GPU | Unverified | 0
QNCD: Quantization Noise Correction for Diffusion Models | Mar 28, 2024 | Denoising, Image Generation | Code Available | 0
Meta-Heuristic Fronthaul Bit Allocation for Cell-free Massive MIMO Systems | Mar 28, 2024 | CPU, Fairness | Unverified | 0
Uncertainty-Aware Deep Video Compression with Ensembles | Mar 28, 2024 | Diversity, Motion Estimation | Unverified | 0
Within the Dynamic Context: Inertia-aware 3D Human Modeling with Pose Sequence | Mar 28, 2024 | Neural Rendering, Quantization | Unverified | 0
Oh! We Freeze: Improving Quantized Knowledge Distillation via Signal Propagation Analysis for Large Language Models | Mar 26, 2024 | Knowledge Distillation, Quantization | Unverified | 0
Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN | Mar 26, 2024 | Knowledge Distillation, Model Compression | Unverified | 0