FoldToken: Learning Protein Language via Vector Quantization and Beyond Feb 4, 2024 Quantization
— Unverified 0Leveraging Continuously Differentiable Activation Functions for Learning in Quantized Noisy Environments Feb 4, 2024 Quantization
Code Code Available 0Locally-Adaptive Quantization for Streaming Vector Search Feb 3, 2024 Quantization Retrieval
— Unverified 0SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding Feb 2, 2024 Adversarial Attack Quantization
Code Code Available 0FedShift: Tackling Dual Heterogeneity Problem of Federated Learning via Weight Shift Aggregation Feb 2, 2024 Diversity Federated Learning
— Unverified 0Faster Inference of Integer SWIN Transformer by Removing the GELU Activation Feb 2, 2024 GPU image-classification
— Unverified 0Neural Language of Thought Models Feb 2, 2024 Image Generation Object
— Unverified 0Truncated Non-Uniform Quantization for Distributed SGD Feb 2, 2024 Quantization
— Unverified 0Ultrafast jet classification on FPGAs for the HL-LHC Feb 2, 2024 Quantization
Code Code Available 0An Intra-BRNN and GB-RVQ Based END-TO-END Neural Audio Codec Feb 2, 2024 Quantization
— Unverified 0Improved Quantization Strategies for Managing Heavy-tailed Gradients in Distributed Learning Feb 2, 2024 Quantization
— Unverified 0HW-SW Optimization of DNNs for Privacy-preserving People Counting on Low-resolution Infrared Arrays Feb 2, 2024 Neural Architecture Search Privacy Preserving
— Unverified 0Can Large Language Models Understand Context? Feb 1, 2024 In-Context Learning Quantization
— Unverified 0Analog-digital Scheduling for Federated Learning: A Communication-Efficient Approach Feb 1, 2024 Federated Learning Quantization
— Unverified 0Trainable Fixed-Point Quantization for Deep Learning Acceleration on FPGAs Jan 31, 2024 Deep Learning Quantization
— Unverified 0One-Step Forward and Backtrack: Overcoming Zig-Zagging in Loss-Aware Quantization Training Jan 30, 2024 Quantization
Code Code Available 0Effect of Weight Quantization on Learning Models by Typical Case Analysis Jan 30, 2024 Quantization
— Unverified 0Effective Communication with Dynamic Feature Compression Jan 29, 2024 Deep Reinforcement Learning Feature Compression
Code Code Available 0HEQuant: Marrying Homomorphic Encryption and Quantization for Communication-Efficient Private Inference Jan 29, 2024 Quantization
— Unverified 0Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval Jan 27, 2024 Contrastive Learning Image Retrieval
— Unverified 0A Comprehensive Survey of Compression Algorithms for Language Models Jan 27, 2024 Knowledge Distillation Quantization
— Unverified 0LitE-SNN: Designing Lightweight and Efficient Spiking Neural Network through Spatial-Temporal Compressive Network Search and Joint Optimization Jan 26, 2024 Quantization
— Unverified 0MPTQ-ViT: Mixed-Precision Post-Training Quantization for Vision Transformer Jan 26, 2024 Quantization
— Unverified 0Towards Cheaper Inference in Deep Networks with Lower Bit-Width Accumulators Jan 25, 2024 Quantization
— Unverified 0CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks Jan 25, 2024 Model Compression Quantization
— Unverified 0Within-basket Recommendation via Neural Pattern Associator Jan 25, 2024 Quantization
— Unverified 0Value-Driven Mixed-Precision Quantization for Patch-Based Inference on Microcontrollers Jan 24, 2024 Quantization
— Unverified 0Iterated Relevance Matrix Analysis (IRMA) for the identification of class-discriminative subspaces Jan 23, 2024 Dimensionality Reduction Quantization
— Unverified 0Scaling Up Quantization-Aware Neural Architecture Search for Efficient Deep Learning on the Edge Jan 22, 2024 Neural Architecture Search Quantization
— Unverified 0Robustness to distribution shifts of compressed networks for edge devices Jan 22, 2024 Knowledge Distillation Quantization
— Unverified 0Another Way to the Top: Exploit Contextual Clustering in Learned Image Coding Jan 21, 2024 Clustering Image Compression
— Unverified 0Edge-Enabled Real-time Railway Track Segmentation Jan 21, 2024 GPU Quantization
— Unverified 0LRP-QViT: Mixed-Precision Vision Transformer Quantization via Layer-wise Relevance Propagation Jan 20, 2024 Quantization
— Unverified 0Dynamic Q&A of Clinical Documents with Large Language Models Jan 19, 2024 Chatbot Decision Making
— Unverified 0A2Q+: Improving Accumulator-Aware Weight Quantization Jan 19, 2024 Quantization
Code Code Available 0Model Compression Techniques in Biometrics Applications: A Survey Jan 18, 2024 Fairness Knowledge Distillation
Code Code Available 0Enabling On-device Continual Learning with Binary Neural Networks Jan 18, 2024 Continual Learning Quantization
— Unverified 0Exploration of Activation Fault Reliability in Quantized Systolic Array-Based DNN Accelerators Jan 17, 2024 Quantization
— Unverified 0Hybrid of DiffStride and Spectral Pooling in Convolutional Neural Networks Jan 17, 2024 Quantization
— Unverified 0Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks Jan 16, 2024 Classification image-classification
Code Code Available 0Activations and Gradients Compression for Model-Parallel Training Jan 15, 2024 image-classification Image Classification
Code Code Available 0TP-Aware Dequantization Jan 15, 2024 GPU Quantization
— Unverified 0MorpheusNet: Resource efficient sleep stage classifier for embedded on-line systems Jan 14, 2024 Quantization
Code Code Available 0ENTED: Enhanced Neural Texture Extraction and Distribution for Reference-based Blind Face Restoration Jan 13, 2024 Blind Face Restoration Quantization
— Unverified 0Correlated Quantization for Faster Nonconvex Distributed Optimization Jan 10, 2024 Distributed Optimization Quantization
— Unverified 0Memory-Efficient Fine-Tuning for Quantized Diffusion Model Jan 9, 2024 model Quantization
— Unverified 0FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs Jan 8, 2024 Computational Efficiency GPU
— Unverified 0Detecting Face Synthesis Using a Concealed Fusion Model Jan 8, 2024 Computer Security Face Generation
— Unverified 0A Video Coding Method Based on Neural Network for CLIC2024 Jan 8, 2024 Deep Learning Quantization
— Unverified 0Data-driven Dynamic Event-triggered Control Jan 7, 2024 Quantization
— Unverified 0