Deep Metric Learning to Rank Jun 1, 2019 Image Retrieval Learning-To-Rank
Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller Sep 25, 2019 Autonomous Navigation Deep Reinforcement Learning
Layer-Wise Quantization: A Pragmatic and Effective Method for Quantizing LLMs Beyond Integer Bit-Levels Jun 25, 2024 Language Modelling Large Language Model
AdaBin: Improving Binary Neural Networks with Adaptive Binary Sets Aug 17, 2022 Classification with Binary Neural Network Quantization
Empirical Evaluation of Deep Learning Model Compression Techniques on the WaveNet Vocoder Nov 20, 2020 Model Compression Quantization
PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off Dec 4, 2023 Binarization Computational Efficiency
SignSGD with Federated Defense: Harnessing Adversarial Attacks through Gradient Sign Decoding Feb 2, 2024 Adversarial Attack Quantization
Open-source FPGA-ML codesign for the MLPerf Tiny Benchmark Jun 23, 2022 Anomaly Detection image-classification
Learning Space Partitions for Nearest Neighbor Search Jan 24, 2019 General Classification graph partitioning
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators Apr 16, 2021 Quantization
Operations Guided Neural Networks for High Fidelity Data-To-Text Generation Sep 8, 2018 Data-to-Text Generation Decoder
SignSGD with Federated Voting Mar 25, 2024 Quantization
Random Entity Quantization for Parameter-Efficient Compositional Knowledge Graph Representation Oct 24, 2023 Knowledge Graphs Quantization
EmbBERT-Q: Breaking Memory Barriers in Embedded NLP Feb 14, 2025 Mamba Quantization
Randomized Quantization is All You Need for Differential Privacy in Federated Learning Jun 20, 2023 All Federated Learning
Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach Oct 14, 2019 Neural Network Compression Quantization
Elastic Product Quantization for Time Series Jan 4, 2022 Quantization Time Series
Learning Semantic Textual Similarity via Topic-informed Discrete Latent Variables Nov 7, 2022 Language Modeling Language Modelling
Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training Jun 13, 2022 Quantization
Task-Based Graph Signal Compression Oct 24, 2021 Quantization
Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer Apr 15, 2022 multimodal generation Quantization
Deep Log-Likelihood Ratio Quantization Mar 11, 2019 Decoder Quantization
Learning Physical-Layer Communication with Quantized Feedback Apr 19, 2019 Quantization
Deep Learning with Low Precision by Half-wave Gaussian Quantization Feb 3, 2017 Deep Learning Quantization
Efficiera Residual Networks: Hardware-Friendly Fully Binary Weight with 2-bit Activation Model Achieves Practical ImageNet Accuracy Oct 15, 2024 Binarization Classification with Binary Weight Network
Efficient Text-driven Motion Generation via Latent Consistency Training May 5, 2024 Motion Generation Quantization
Column-wise Quantization of Weights and Partial Sums for Accurate and Efficient Compute-In-Memory Accelerators Feb 11, 2025 Quantization
Rapid-INR: Storage Efficient CPU-free DNN Training Using Implicit Neural Representation Jun 29, 2023 CPU GPU
Optimal Quantization for Matrix Multiplication Oct 17, 2024 Quantization
Learning Category Trees for ID-Based Recommendation: Exploring the Power of Differentiable Vector Quantization Aug 31, 2023 Click-Through Rate Prediction Collaborative Filtering
Learning Frequency-Specific Quantization Scaling in VVC for Standard-Compliant Task-driven Image Coding Jan 20, 2023 Quantization
Efficient statistical classification of satellite measurements Feb 10, 2012 Classification General Classification
Efficient Speech Translation through Model Compression and Knowledge Distillation May 26, 2025 Knowledge Distillation Model Compression
Optimization of Armv9 architecture general large language model inference performance based on Llama.cpp Jun 16, 2024 Compiler Optimization Language Modeling
Aggregated Learning: A Vector-Quantization Approach to Learning Neural Network Classifiers Jan 12, 2020 Classification General Classification
Understanding and Improving Knowledge Distillation for Quantization-Aware Training of Large Transformer Encoders Nov 20, 2022 Knowledge Distillation Model Compression
An efficient and straightforward online quantization method for a data stream through remove-birth updating Jun 21, 2023 Drift Detection Quantization
An Edge Computing-Based Solution for Real-Time Leaf Disease Classification using Thermal Imaging Nov 6, 2024 Deep Learning Edge-computing
Understanding Cache Boundness of ML Operators on ARM Processors Feb 1, 2021 Quantization
Towards Quantized Model Parallelism for Graph-Augmented MLPs Based on Gradient-Free ADMM Framework May 20, 2021 Quantization
When Quantization Affects Confidence of Large Language Models? May 1, 2024 Language Modeling Language Modelling
Task Vector Quantization for Memory-Efficient Model Merging Mar 10, 2025 image-classification Image Classification
Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment May 2, 2024 GPU NVIDIA Jetson Orin Nano
Learning Convolutional Transforms for Lossy Point Cloud Geometry Compression Mar 20, 2019 Binary Classification Mixed Reality
Learning Compression from Limited Unlabeled Data Sep 1, 2018 CPU GPU
Deep Learning-Based Quantization of L-Values for Gray-Coded Modulation Jun 18, 2019 Quantization
Efficient Quantization-Aware Training on Segment Anything Model in Medical Images and Its Deployment Dec 15, 2024 Image Segmentation Medical Image Segmentation
Optimizing Deep Neural Networks using Safety-Guided Self Compression May 1, 2025 Language Modeling Language Modelling
Efficient Online Inference of Vision Transformers by Training-Free Tokenization Nov 23, 2024 Quantization
Learning compact binary descriptors with unsupervised deep neural networks Jun 1, 2016 Image Retrieval Object