SOTAVerified

Neural Network Compression

Papers

Showing 151193 of 193 papers

TitleStatusHype
Data-Free Knowledge Distillation with Soft Targeted Transfer Set Synthesis0
Deep Neural Network Compression for Aircraft Collision Avoidance Systems0
Differentiable Joint Pruning and Quantization for Hardware Efficiency0
Distilling Critical Paths in Convolutional Neural Networks0
Distilling Pixel-Wise Feature Similarities for Semantic Segmentation0
DKM: Differentiable K-Means Clustering Layer for Neural Network Compression0
DP-Net: Dynamic Programming Guided Deep Neural Network Compression0
Adaptive Error-Bounded Hierarchical Matrices for Efficient Neural Network Compression0
Efficient and Robust Knowledge Distillation from A Stronger Teacher Based on Correlation Matching0
Efficient Micro-Structured Weight Unification and Pruning for Neural Network Compression0
Efficient Neural Network Compression via Transfer Learning for Industrial Optical Inspection0
End-to-End Neural Network Compression via _1_2 Regularized Latency Surrogates0
Entropy-Constrained Training of Deep Neural Networks0
EPSD: Early Pruning with Self-Distillation for Efficient Model Compression0
Evaluation Metrics for DNNs Compression0
Generalized Ternary Connect: End-to-End Learning and Compression of Multiplication-Free Deep Neural Networks0
GranQ: Granular Zero-Shot Quantization with Channel-Wise Activation Scaling in QAT0
Grokking as Compression: A Nonlinear Complexity Perspective0
Guaranteed Quantization Error Computation for Neural Network Model Compression0
Hardware-Guided Symbiotic Training for Compact, Accurate, yet Execution-Efficient LSTM0
HEMP: High-order Entropy Minimization for neural network comPression0
How Informative is the Approximation Error from Tensor Decomposition for Neural Network Compression?0
Hybrid Tensor Decomposition in Neural Network Compression0
Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing0
Learning Filter Pruning Criteria for Deep Convolutional Neural Networks Acceleration0
Lightweight Attribute Localizing Models for Pedestrian Attribute Recognition0
Linearity-based neural network compression0
Compressing 3DCNNs Based on Tensor Train Decomposition0
Low-Rank Matrix Approximation for Neural Network Compression0
Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner0
MINT: Deep Network Compression via Mutual Information-based Neuron Trimming0
Partial Binarization of Neural Networks for Budget-Aware Efficient Learning0
MLPrune: Multi-Layer Pruning for Automated Neural Network Compression0
Model Compression Methods for YOLOv5: A Review0
Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference0
MPDCompress - Matrix Permutation Decomposition Algorithm for Deep Neural Network Compression0
MUC-G4: Minimal Unsat Core-Guided Incremental Verification for Deep Neural Network Compression0
Multi-head Knowledge Distillation for Model Compression0
Filter Distillation for Network Compression0
NETWORK COMPRESSION USING CORRELATION ANALYSIS OF LAYER RESPONSES0
Neural gradients are near-lognormal: improved quantized and sparse training0
Neural Network Compression by Joint Sparsity Promotion and Redundancy Reduction0
Neural Network Compression for Noisy Storage Devices0
Show:102550
← PrevPage 4 of 4Next →

No leaderboard results yet.