Abstractive summarization from Audio Transcription | Jul 30, 2024 | Abstractive Text Summarization, Quantization
Enhancing Multi-Stream Beamforming Through CQIs For 5G NR FDD Massive MIMO Communications: A Tuning-Free Scheme | Sep 1, 2024 | Quantization
Brain Inspired Cortical Coding Method for Fast Clustering and Codebook Generation | Nov 17, 2022 | Anomaly Detection, Clustering
Effective Interplay between Sparsity and Quantization: From Theory to Practice | May 31, 2024 | Computational Efficiency, Model Compression
An Inter-Layer Weight Prediction and Quantization for Deep Neural Networks based on a Smoothly Varying Weight Hypothesis | Jul 16, 2019 | Quantization
Adaptive Compression for Communication-Efficient Distributed Training | Oct 31, 2022 | Quantization
Effective and Efficient Mixed Precision Quantization of Speech Foundation Models | Jan 7, 2025 | Model Compression, parameter estimation
HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction | Aug 2, 2024 | Click-Through Rate Prediction, Mixture-of-Experts
Enhancing Generalization of Invisible Facial Privacy Cloak via Gradient Accumulation | Jan 3, 2024 | Face Recognition, Quantization
eDKM: An Efficient and Accurate Train-time Weight Clustering for Large Language Models | Sep 2, 2023 | Clustering, CPU
Edinburgh's Submissions to the 2020 Machine Translation Efficiency Task | Jul 1, 2020 | CPU, GPU
Boosting Distributed Full-graph GNN Training with Asynchronous One-bit Communication | Mar 2, 2023 | GPU, Quantization
LCP: A Low-Communication Parallelization Method for Fast Neural Network Inference in Image Recognition | Mar 13, 2020 | Quantization
Non-linear Canonical Correlation Analysis: A Compressed Representation Approach | Oct 31, 2018 | Dimensionality Reduction, Quantization
POLARON: Precision-aware On-device Learning and Adaptive Runtime-cONfigurable AI acceleration | Jun 10, 2025 | Quantization
Boosted Dense Retriever | Jan 16, 2022 | Quantization, Retrieval
Effective and Fast: A Novel Sequential Single Path Search for Mixed-Precision Quantization | Mar 4, 2021 | Quantization
Boost Vision Transformer with GPU-Friendly Sparsity and Quantization | May 18, 2023 | Benchmarking, GPU
Edge-MultiAI: Multi-Tenancy of Latency-Sensitive Deep Learning Applications on Edge | Nov 14, 2022 | Management, Model Compression
Boosted Dense Retriever | Dec 14, 2021 | Quantization, Retrieval
Effective Quantization Approaches for Recurrent Neural Networks | Feb 7, 2018 | Machine Translation, Quantization
Effective Quantization for Diffusion Models on CPUs | Nov 2, 2023 | Quantization
EdgeMLOps: Operationalizing ML models with Cumulocity IoT and thin-edge.io for Visual quality Inspection | Jan 28, 2025 | Asset Management, Management
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations | Aug 10, 2019 | Knowledge Distillation, Quantization
Effect of Signal Quantization on Performance Measures of a 1st Order One Dimensional Differential Microphone Array | Jun 18, 2025 | Quantization
Effect of Weight Quantization on Learning Models by Typical Case Analysis | Jan 30, 2024 | Quantization
Effects of VLSI Circuit Constraints on Temporal-Coding Multilayer Spiking Neural Networks | Jun 18, 2021 | Quantization
Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion | Dec 9, 2024 | Denoising, Image Generation
Efficient Adaptive Activation Rounding for Post-Training Quantization | Aug 25, 2022 | Quantization
Efficient-Adam: Communication-Efficient Distributed Adam | May 28, 2022 | Quantization
Efficient AI in Practice: Training and Deployment of Efficient LLMs for Industry Applications | Feb 20, 2025 | Knowledge Distillation, Model Compression
Efficient and accurate neural field reconstruction using resistive memory | Apr 15, 2024 | CPU, Novel View Synthesis
Efficient and Effective Methods for Mixed Precision Neural Network Quantization for Faster, Energy-efficient Inference | Jan 30, 2023 | Efficient Neural Network, Quantization
Breaking the waves: asymmetric random periodic features for low-bitrate kernel machines | Apr 14, 2020 | Quantization
Enhancing Diversity for Data-free Quantization | Jan 1, 2025 | Data Free Quantization, Diversity
Efficient and Workload-Aware LLM Serving via Runtime Layer Swapping and KV Cache Resizing | May 24, 2025 | Model Compression, Quantization
Efficient ANN-SNN Conversion with Error Compensation Learning | May 12, 2025 | Quantization
Efficient Arbitrary Precision Acceleration for Large Language Models on GPU Tensor Cores | Sep 26, 2024 | GPU, Management
Enhancing Field-Oriented Control of Electric Drives with Tiny Neural Network Optimized for Micro-controllers | Feb 1, 2025 | Quantization
Enhancing Off-Grid One-Bit DOA Estimation with Learning-Based Sparse Bayesian Approach for Non-Uniform Sparse Array | Dec 14, 2024 | Computational Efficiency, Quantization
Efficient Batch Homomorphic Encryption for Vertically Federated XGBoost | Dec 8, 2021 | Federated Learning, Quantization
Efficient Bitwidth Search for Practical Mixed Precision Neural Network | Mar 17, 2020 | Quantization
Efficient Channel Estimator with Angle-Division Multiple Access | Apr 17, 2018 | Quantization
Bridging the Gap between Gaussian Diffusion Models and Universal Quantization for Image Compression | Apr 3, 2025 | Image Compression, Quantization
Efficient Compression of Multitask Multilingual Speech Models | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition | Jun 6, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR)
EntroLLM: Entropy Encoded Weight Compression for Efficient Large Language Model Inference on Edge Devices | May 5, 2025 | 4k, Language Modeling
Efficient Convolutional Neural Network with Binary Quantization Layer | Nov 21, 2016 | Clustering, Image Segmentation
BRIEDGE: EEG-Adaptive Edge AI for Multi-Brain to Multi-Robot Interaction | Mar 14, 2024 | EEG, Model Compression
ERQ: Error Reduction for Post-Training Quantization of Vision Transformers | Jul 9, 2024 | Quantization, regression