xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics Jun 20, 2024 Machine Translation Quantization Code Code Available 0
SDQ: Sparse Decomposed Quantization for LLM Inference Jun 19, 2024 Model Compression Quantization — Unverified 0
High-Fidelity Facial Albedo Estimation via Texture Quantization Jun 19, 2024 3D Face Reconstruction Face Reconstruction — Unverified 0
Q-SNNs: Quantized Spiking Neural Networks Jun 19, 2024 Quantization — Unverified 0
Attention-aware Post-training Quantization without Backpropagation Jun 19, 2024 Quantization — Unverified 0
Bayesian-LoRA: LoRA based Parameter Efficient Fine-Tuning using Optimal Quantization levels and Rank Values trough Differentiable Bayesian Gates Jun 18, 2024 parameter-efficient fine-tuning Quantization — Unverified 0
Mixture of Scales: Memory-Efficient Token-Adaptive Binarization for Large Language Models Jun 18, 2024 Binarization Quantization Code Code Available 1
MSE Minimization in RIS-Aided MU-MIMO with Discrete Phase Shifts and Fronthaul Quantization Jun 18, 2024 Quantization — Unverified 0
Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization Jun 17, 2024 Language Modeling Language Modelling — Unverified 0
ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking Jun 17, 2024 Model Optimization Quantization Code Code Available 1
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% Jun 17, 2024 image-classification Image Classification Code Code Available 2
QTIP: Quantization with Trellises and Incoherence Processing Jun 17, 2024 Decoder Quantization Code Code Available 1
Autoregressive Image Generation without Vector Quantization Jun 17, 2024 Image Generation Quantization Code Code Available 5
Deep-Learning-Based Channel Estimation for Distributed MIMO with 1-bit Radio-Over-Fiber Fronthaul Jun 17, 2024 Quantization — Unverified 0
Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization Jun 16, 2024 Quantization Tensor Decomposition — Unverified 0
Promoting Data and Model Privacy in Federated Learning through Quantized LoRA Jun 16, 2024 Federated Learning parameter-efficient fine-tuning — Unverified 0
An Analysis on Quantizing Diffusion Transformers Jun 16, 2024 Conditional Image Generation Denoising — Unverified 0
Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp Jun 16, 2024 Compiler Optimization Language Modeling Code Code Available 0
Evaluating the Generalization Ability of Quantized LLMs: Benchmark, Analysis, and Toolbox Jun 15, 2024 Quantization Code Code Available 1
Memory Faults in Activation-sparse Quantized Deep Neural Networks: Analysis and Mitigation using Sharpness-aware Training Jun 15, 2024 Quantization — Unverified 0
How Should We Extract Discrete Audio Tokens from Self-Supervised Models? Jun 15, 2024 Quantization Self-Supervised Learning — Unverified 0
Optimizing Byte-level Representation for End-to-end ASR Jun 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR) — Unverified 0
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model Jun 14, 2024 All Quantization — Unverified 0
QQQ: Quality Quattuor-Bit Quantization for Large Language Models Jun 14, 2024 Quantization Code Code Available 2
Precipitation Nowcasting Using Physics Informed Discriminator Generative Models Jun 14, 2024 Generative Adversarial Network Quantization — Unverified 0
GEB-1.3B: Open Lightweight Large Language Model Jun 14, 2024 CPU Language Modeling — Unverified 0
Human-level molecular optimization driven by mol-gene evolution Jun 13, 2024 Drug Discovery Quantization — Unverified 0
ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis Jun 13, 2024 Quantization Speech Synthesis — Unverified 0
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models Jun 13, 2024 Math Quantization Code Code Available 2
Q-S5: Towards Quantized State Space Models Jun 13, 2024 Computational Efficiency Quantization Code Code Available 0
MGRQ: Post-Training Quantization For Vision Transformer With Mixed Granularity Reconstruction Jun 13, 2024 Quantization — Unverified 0
OpenVLA: An Open-Source Vision-Language-Action Model Jun 13, 2024 Imitation Learning Language Modelling Code Code Available 9
ME-Switch: A Memory-Efficient Expert Switching Framework for Large Language Models Jun 13, 2024 Code Generation domain classification — Unverified 0
MobileAIBench: Benchmarking LLMs and LMMs for On-Device Use Cases Jun 12, 2024 Benchmarking Model Compression — Unverified 0
Compressive Beam Alignment for Indoor Millimeter-Wave Systems Jun 12, 2024 compressed sensing Quantization — Unverified 0
Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization Jun 12, 2024 Computational Efficiency Pose Estimation — Unverified 0
VALL-E R: Robust and Efficient Zero-Shot Text-to-Speech Synthesis via Monotonic Alignment Jun 12, 2024 Quantization Speech Synthesis — Unverified 0
Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark Jun 12, 2024 Benchmarking Mixture-of-Experts Code Code Available 1
FoldToken2: Learning compact, invariant and generative protein structure language Jun 11, 2024 Decoder Quantization — Unverified 0
Image and Video Tokenization with Binary Spherical Quantization Jun 11, 2024 Decoder Image Generation Code Code Available 3
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text Jun 11, 2024 Quantization Sign Language Production — Unverified 0
TernaryLLM: Ternarized Large Language Model Jun 11, 2024 Knowledge Distillation Language Modeling — Unverified 0
The Impact of Quantization on Retrieval-Augmented Generation: An Analysis of Small LLMs Jun 10, 2024 Quantization RAG — Unverified 0
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution Jun 10, 2024 Image Super-Resolution Quantization Code Code Available 1
Topological Analysis for Detecting Anomalies (TADA) in Time Series Jun 10, 2024 Quantization Time Series — Unverified 0
Low-Rank Quantization-Aware Training for LLMs Jun 10, 2024 GPU parameter-efficient fine-tuning Code Code Available 2
Latent Representation Matters: Human-like Sketches in One-shot Drawing Tasks Jun 10, 2024 Quantization — Unverified 0
Efficient Neural Compression with Inference-time Decoding Jun 10, 2024 Decoder Quantization — Unverified 0
Towards Lightweight Speaker Verification via Adaptive Neural Network Quantization Jun 8, 2024 Quantization Speaker Verification — Unverified 0
From Analog to Digital: Multi-Order Digital Joint Coding-Modulation for Semantic Communication Jun 8, 2024 Dimensionality Reduction Quantization Code Code Available 1