Image Compression using only Attention based Neural Networks | Oct 17, 2023 | Image Compression, Quantization | Unverified | 0
Robustness and Approximation of Discrete-time Mean-field Games under Discounted Cost Criterion | Oct 16, 2023 | Quantization | Unverified | 0
RoomDesigner: Encoding Anchor-latents for Style-consistent and Shape-compatible Indoor Scene Generation | Oct 16, 2023 | Quantization, Scene Generation | Code Available | 1
One-Shot Sensitivity-Aware Mixed Sparsity Pruning for Large Language Models | Oct 14, 2023 | Quantization, Sensitivity | Code Available | 0
LL-VQ-VAE: Learnable Lattice Vector-Quantization For Efficient Representations | Oct 13, 2023 | Quantization | Unverified | 0
QUIK: Towards End-to-End 4-Bit Inference on Generative Large Language Models | Oct 13, 2023 | Computational Efficiency, GPU | Code Available | 1
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic Enhancement | Oct 12, 2023 | Contrastive Learning, Data Augmentation | Code Available | 1
QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models | Oct 12, 2023 | GPU, Quantization | Code Available | 1
LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models | Oct 12, 2023 | Natural Language Understanding, Quantization | Code Available | 2
A Carbon Tracking Model for Federated Learning: Impact of Quantization and Sparsification | Oct 12, 2023 | Federated Learning, Quantization | Unverified | 0
Adaptive Quantization for Key Generation in Low-Power Wide-Area Networks | Oct 11, 2023 | Quantization | Unverified | 0
Cost-Driven Hardware-Software Co-Optimization of Machine Learning Pipelines | Oct 11, 2023 | Quantization | Unverified | 0
QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources | Oct 11, 2023 | GPU, parameter-efficient fine-tuning | Unverified | 0
CacheGen: KV Cache Compression and Streaming for Fast Large Language Model Serving | Oct 11, 2023 | Language Modeling, Language Modelling | Code Available | 5
Sparse Fine-tuning for Inference Acceleration of Large Language Models | Oct 10, 2023 | CPU, GPU | Code Available | 1
Distillation Improves Visual Place Recognition for Low Quality Images | Oct 10, 2023 | Knowledge Distillation, Quantization | Code Available | 0
Efficient-VQGAN: Towards High-Resolution Image Generation with Efficient Vision Transformers | Oct 9, 2023 | Image Generation, Image Reconstruction | Unverified | 0
Vector Quantized Multi-modal Guidance for Alzheimer’s Disease Diagnosis Based on Feature Imputation | Oct 8, 2023 | Imputation, Quantization | Code Available | 0
Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM | Oct 7, 2023 | Quantization | Unverified | 0
Sub-token ViT Embedding via Stochastic Resonance Transformers | Oct 6, 2023 | Depth Estimation, Depth Prediction | Code Available | 0
VaSAB: The variable size adaptive information bottleneck for disentanglement on speech and singing voice | Oct 5, 2023 | Disentanglement, Quantization | Unverified | 0
Learning A Disentangling Representation For PU Learning | Oct 5, 2023 | Clustering, Density Estimation | Unverified | 0
EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models | Oct 5, 2023 | Denoising, Image Generation | Code Available | 1
Hadamard Domain Training with Integers for Class Incremental Quantized Learning | Oct 5, 2023 | Activity Recognition, class-incremental learning | Unverified | 0
Robustness-Guided Image Synthesis for Data-Free Quantization | Oct 5, 2023 | Data Free Quantization, Diversity | Unverified | 0
QuATON: Quantization Aware Training of Optical Neurons | Oct 4, 2023 | Quantization | Unverified | 0
Reinforcement Learning with Foundation Priors: Let the Embodied Agent Efficiently Learn on Its Own | Oct 4, 2023 | Quantization, reinforcement-learning | Unverified | 0
Soft Convex Quantization: Revisiting Vector Quantization with Convex Optimization | Oct 4, 2023 | Image Reconstruction, Quantization | Unverified | 0
Mixture of Quantized Experts (MoQE): Complementary Effect of Low-bit Quantization and Robustness | Oct 3, 2023 | GPU, Machine Translation | Unverified | 0
Discrete, compositional, and symbolic representations through attractor dynamics | Oct 3, 2023 | Quantization | Code Available | 0
Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks | Oct 2, 2023 | Brain Tumor Classification, Brain Tumor Segmentation | Unverified | 0
Compressing LLMs: The Truth is Rarely Pure and Never Simple | Oct 2, 2023 | Quantization, Retrieval | Code Available | 1
MobileNVC: Real-time 1080p Neural Video Compression on a Mobile Device | Oct 2, 2023 | Decoder, GPU | Unverified | 0
DiskANN++: Efficient Page-based Search over Isomorphic Mapped Graph Index using Query-sensitivity Entry Vertex | Sep 30, 2023 | Quantization, Sensitivity | Unverified | 0
Quantization of Deep Neural Networks to facilitate self-correction of weights on Phase Change Memory-based analog hardware | Sep 30, 2023 | Edge-computing, Quantization | Unverified | 0
One-Bit Channel Estimation for IRS-aided Millimeter-Wave Massive MU-MISO System | Sep 29, 2023 | Quantization | Unverified | 0
Pruning Small Pre-Trained Weights Irreversibly and Monotonically Impairs "Difficult" Downstream Tasks in LLMs | Sep 29, 2023 | Quantization | Code Available | 1
Revolutionizing Mobile Interaction: Enabling a 3 Billion Parameter GPT LLM on Mobile | Sep 29, 2023 | Quantization | Unverified | 0
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition | Sep 29, 2023 | Quantization | Code Available | 1
Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head | Sep 29, 2023 | Pose Estimation, Quantization | Code Available | 1
On Uniform Scalar Quantization for Learned Image Compression | Sep 29, 2023 | Image Compression, Quantization | Unverified | 0
Diffusion Models as Stochastic Quantization in Lattice Field Theory | Sep 29, 2023 | Quantization | Code Available | 0
RECOMBINER: Robust and Enhanced Compression with Bayesian Implicit Neural Representations | Sep 29, 2023 | Data Compression, Quantization | Code Available | 1
Network Memory Footprint Compression Through Jointly Learnable Codebooks and Mappings | Sep 29, 2023 | Quantization | Unverified | 0
PB-LLM: Partially Binarized Large Language Models | Sep 29, 2023 | Binarization, Quantization | Code Available | 1
MixQuant: Mixed Precision Quantization with a Bit-width Optimization Search | Sep 29, 2023 | Quantization | Unverified | 0
Pushing Large Language Models to the 6G Edge: Vision, Challenges, and Opportunities | Sep 28, 2023 | Edge-computing, parameter-efficient fine-tuning | Unverified | 0
ModuLoRA: Finetuning 2-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers | Sep 28, 2023 | GPU, Instruction Following | Code Available | 2
Transformer-VQ: Linear-Time Transformers via Vector Quantization | Sep 28, 2023 | 8k, Decoder | Code Available | 2
Rethinking Channel Dimensions to Isolate Outliers for Low-bit Weight Quantization of Large Language Models | Sep 27, 2023 | HumanEval, Language Modeling | Code Available | 0