
Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (the training set) and outputs a small synthetic distilled dataset, which is evaluated by training models on it and testing them on a separate real dataset (the validation/test set). A good small distilled dataset is not only useful for dataset understanding, but also has a range of applications, such as continual learning, privacy, and neural architecture search.
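The distill-then-evaluate pipeline described above can be made concrete with a short sketch. The following is a minimal, illustrative PyTorch loop in the spirit of meta-gradient dataset distillation, using toy random data and a linear model; the sizes, step sizes, and names such as make_model and SYN_PER_CLASS are assumptions for illustration and do not correspond to any particular method in the paper list below.

```python
# Minimal sketch of dataset distillation (illustrative, not any specific paper's method).
import torch
import torch.nn.functional as F

NUM_CLASSES, DIM, SYN_PER_CLASS = 10, 64, 1           # toy sizes, chosen arbitrarily

# Stand-in for the large real training set to be distilled.
real_x = torch.randn(5000, DIM)
real_y = torch.randint(0, NUM_CLASSES, (5000,))

# The small synthetic distilled dataset: learnable inputs with fixed labels.
syn_x = torch.randn(NUM_CLASSES * SYN_PER_CLASS, DIM, requires_grad=True)
syn_y = torch.arange(NUM_CLASSES).repeat_interleave(SYN_PER_CLASS)
syn_opt = torch.optim.Adam([syn_x], lr=0.1)

def make_model():
    return torch.nn.Linear(DIM, NUM_CLASSES)

for it in range(200):
    model = make_model()                               # fresh random init each iteration
    w, b = model.weight, model.bias

    # Inner step: one gradient update of the model on the synthetic data,
    # kept differentiable so gradients flow back into syn_x.
    inner_loss = F.cross_entropy(syn_x @ w.t() + b, syn_y)
    gw, gb = torch.autograd.grad(inner_loss, (w, b), create_graph=True)
    w2, b2 = w - 0.01 * gw, b - 0.01 * gb

    # Outer step: the updated model should perform well on a batch of real data.
    idx = torch.randint(0, real_x.size(0), (256,))
    outer_loss = F.cross_entropy(real_x[idx] @ w2.t() + b2, real_y[idx])

    syn_opt.zero_grad()
    outer_loss.backward()
    syn_opt.step()

# Evaluation: train a fresh model on the distilled set alone, then test it on real data.
# (In practice the test data is a held-out split, not the training set used here.)
eval_model = make_model()
eval_opt = torch.optim.SGD(eval_model.parameters(), lr=0.1)
for _ in range(100):
    eval_opt.zero_grad()
    F.cross_entropy(eval_model(syn_x.detach()), syn_y).backward()
    eval_opt.step()
with torch.no_grad():
    acc = (eval_model(real_x).argmax(dim=1) == real_y).float().mean()
print(f"Accuracy of a fresh model trained only on the distilled set: {acc:.2%}")
```

In a realistic setting the inner loop would run many steps over a deeper network and the distilled set would be images rather than feature vectors, but the structure (optimize the synthetic data so that models trained on it do well on real data) is the same.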

Papers

Showing 151-200 of 216 papers

Title | Status | Hype
Label Poisoning is All You Need | Code | 1
Linear Mode Connectivity in Sparse Neural Networks | - | 0
DREAM+: Efficient Dataset Distillation by Bidirectional Representative Matching | Code | 1
AST: Effective Dataset Distillation through Alignment with Smooth and High-Quality Expert Trajectories | Code | 0
Does Graph Distillation See Like Vision Dataset Counterpart? | Code | 1
Data Distillation Can Be Like Vodka: Distilling More Times For Better Quality | - | 0
Self-Supervised Dataset Distillation for Transfer Learning | Code | 1
Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching | Code | 1
Can pre-trained models assist in dataset distillation? | Code | 1
DataDAM: Efficient Dataset Distillation with Attention Matching | Code | 1
FedGKD: Unleashing the Power of Collaboration in Federated Graph Neural Networks | - | 0
Multi-Source Domain Adaptation meets Dataset Distillation through Dataset Dictionary Learning | - | 0
Towards Mitigating Architecture Overfitting on Distilled Datasets | Code | 0
Dataset Quantization | Code | 2
Vision-Language Dataset Distillation | Code | 1
Exploring Multilingual Text Data Distillation | Code | 0
Rethinking Data Distillation: Do Not Overlook Calibration | - | 0
Towards Trustworthy Dataset Distillation | Code | 1
Dataset Distillation Meets Provable Subset Selection | - | 0
Squeeze, Recover and Relabel: Dataset Condensation at ImageNet Scale From A New Perspective | Code | 1
Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching | - | 0
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation | Code | 1
On the Size and Approximation Error of Distilled Sets | - | 0
A Comprehensive Study on Dataset Distillation: Performance, Privacy, Robustness and Fairness | - | 0
A Survey on Dataset Distillation: Approaches, Applications and Future Directions | - | 0
Generalizing Dataset Distillation via Deep Generative Prior | Code | 1
DiM: Distilling Dataset into Generative Model | Code | 1
Federated Virtual Learning on Heterogeneous Data with Local-global Distillation | - | 0
DREAM: Efficient Dataset Distillation by Representative Matching | Code | 1
Evaluating the effect of data augmentation and BALD heuristics on distillation of Semantic-KITTI dataset | - | 0
Navya3DSeg -- Navya 3D Semantic Segmentation Dataset & split generation for autonomous vehicles | - | 0
Dataset Distillation with Convexified Implicit Gradients | Code | 1
Understanding Reconstruction Attacks with the Neural Tangent Kernel and Dataset Distillation | - | 0
Dataset Distillation: A Comprehensive Review | - | 0
A Comprehensive Survey of Dataset Distillation | - | 0
Backdoor Attacks Against Dataset Distillation | Code | 1
Few-Shot Dataset Distillation via Translative Pre-Training | - | 0
Slimmable Dataset Condensation | - | 0
On Implicit Bias in Overparameterized Bilevel Optimization | - | 0
Accelerating Dataset Distillation via Model Augmentation | Code | 0
Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation | Code | 1
Scaling Up Dataset Distillation to ImageNet-1K with Constant Memory | Code | 1
Dataset Factorization for Condensation | Code | 1
Dataset Distillation via Factorization | Code | 1
Efficient Dataset Distillation Using Random Feature Approximation | Code | 1
On Divergence Measures for Bayesian Pseudocoresets | Code | 0
Dataset Distillation for Medical Dataset Sharing | - | 0
Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing | - | 0
Dataset Distillation Using Parameter Pruning | - | 0
The Curse of Unrolling: Rate of Differentiating Through Optimization | - | 0

Leaderboard

No leaderboard results yet.