LRP-QViT: Mixed-Precision Vision Transformer Quantization via Layer-wise Relevance Propagation | Jan 20, 2024 | Quantization | Unverified | 0
Dynamic Q&A of Clinical Documents with Large Language Models | Jan 19, 2024 | Chatbot Decision Making | Unverified | 0
A2Q+: Improving Accumulator-Aware Weight Quantization | Jan 19, 2024 | Quantization | Code Available | 0
Model Compression Techniques in Biometrics Applications: A Survey | Jan 18, 2024 | Fairness Knowledge Distillation | Code Available | 0
Enabling On-device Continual Learning with Binary Neural Networks | Jan 18, 2024 | Continual Learning Quantization | Unverified | 0
Exploration of Activation Fault Reliability in Quantized Systolic Array-Based DNN Accelerators | Jan 17, 2024 | Quantization | Unverified | 0
Hybrid of DiffStride and Spectral Pooling in Convolutional Neural Networks | Jan 17, 2024 | Quantization | Unverified | 0
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models | Jan 16, 2024 | GPU Quantization | Code Available | 3
Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks | Jan 16, 2024 | Classification image-classification | Code Available | 0
TP-Aware Dequantization | Jan 15, 2024 | GPU Quantization | Unverified | 0
Activations and Gradients Compression for Model-Parallel Training | Jan 15, 2024 | image-classification Image Classification | Code Available | 0
MorpheusNet: Resource efficient sleep stage classifier for embedded on-line systems | Jan 14, 2024 | Quantization | Code Available | 0
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval | Jan 14, 2024 | Contrastive Learning Image Retrieval | Code Available | 1
ENTED: Enhanced Neural Texture Extraction and Distribution for Reference-based Blind Face Restoration | Jan 13, 2024 | Blind Face Restoration Quantization | Unverified | 0
Extreme Compression of Large Language Models via Additive Quantization | Jan 11, 2024 | CPU GPU | Code Available | 5
Correlated Quantization for Faster Nonconvex Distributed Optimization | Jan 10, 2024 | Distributed Optimization Quantization | Unverified | 0
Memory-Efficient Fine-Tuning for Quantized Diffusion Model | Jan 9, 2024 | model Quantization | Unverified | 0
EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models | Jan 9, 2024 | Denoising Image Generation | Code Available | 1
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Jan 9, 2024 | GPU Math | Code Available | 3
Detecting Face Synthesis Using a Concealed Fusion Model | Jan 8, 2024 | Computer Security Face Generation | Unverified | 0
A Video Coding Method Based on Neural Network for CLIC2024 | Jan 8, 2024 | Deep Learning Quantization | Unverified | 0
FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | Jan 8, 2024 | Computational Efficiency GPU | Unverified | 0
Data-driven Dynamic Event-triggered Control | Jan 7, 2024 | Quantization | Unverified | 0
A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODE | Jan 5, 2024 | CPU Edge-computing | Unverified | 0
Enhancing Generalization of Invisible Facial Privacy Cloak via Gradient Accumulation | Jan 3, 2024 | Face Recognition Quantization | Unverified | 0
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations | Jan 3, 2024 | Diversity Quantization | Code Available | 7
Retraining-free Model Quantization via One-Shot Weight-Coupling Learning | Jan 3, 2024 | Model Compression Quantization | Code Available | 1
Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels | Jan 2, 2024 | Quantization | Unverified | 0
MOC-RVQ: Multilevel Codebook-Assisted Digital Generative Semantic Communication | Jan 2, 2024 | 2k Quantization | Code Available | 1
PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding | Jan 1, 2024 | Quantization | Unverified | 0
Are Conventional SNNs Really Efficient? A Perspective from Network Quantization | Jan 1, 2024 | Fairness Quantization | Unverified | 0
Transferable Structural Sparse Adversarial Attack Via Exact Group Sparsity Training | Jan 1, 2024 | Adversarial Attack image-classification | Code Available | 1
Boosting Spike Camera Image Reconstruction from a Perspective of Dealing with Spike Fluctuations | Jan 1, 2024 | Attribute Image Reconstruction | Code Available | 1
JointSQ: Joint Sparsification-Quantization for Distributed Learning | Jan 1, 2024 | Quantization | Code Available | 1
PikeLPN: Mitigating Overlooked Inefficiencies of Low-Precision Neural Networks | Jan 1, 2024 | Quantization | Unverified | 0
General Point Model Pretraining with Autoencoding and Autoregressive | Jan 1, 2024 | Decoder Language Modeling | Code Available | 0
Data-Free Quantization via Pseudo-label Filtering | Jan 1, 2024 | Data Free Quantization Model Compression | Unverified | 0
Spatial-Aware Regression for Keypoint Localization | Jan 1, 2024 | 3D Pose Estimation Pose Estimation | Code Available | 1
Enhancing Post-training Quantization Calibration through Contrastive Learning | Jan 1, 2024 | Contrastive Learning Quantization | Unverified | 0
Reg-PTQ: Regression-specialized Post-training Quantization for Fully Quantized Object Detector | Jan 1, 2024 | Object object-detection | Unverified | 0
HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes | Dec 31, 2023 | Quantization Representation Learning | Unverified | 0
Compact Neural Graphics Primitives with Learned Hash Probing | Dec 28, 2023 | Quantization | Unverified | 0
Fast Inference of Mixture-of-Experts Language Models with Offloading | Dec 28, 2023 | Mixture-of-Experts Quantization | Code Available | 4
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones | Dec 28, 2023 | Computational Efficiency Image Captioning | Code Available | 3
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs | Dec 27, 2023 | Benchmarking GPU | Code Available | 0
LeanVec: Searching vectors faster by making them fit | Dec 26, 2023 | Cross-Modal Retrieval Dimensionality Reduction | Code Available | 2
Context-aware Communication for Multi-agent Reinforcement Learning | Dec 25, 2023 | Multi-agent Reinforcement Learning Quantization | Code Available | 1
A-SDM: Accelerating Stable Diffusion through Redundancy Removal and Performance Optimization | Dec 24, 2023 | Quantization | Unverified | 0
Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization | Dec 23, 2023 | Quantization Reinforcement Learning (RL) | Unverified | 0
Efficient Asynchronous Federated Learning with Sparsification and Quantization | Dec 23, 2023 | Federated Learning Quantization | Unverified | 0