SOTAVerified

Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set), and outputs a small synthetic distilled dataset, which is evaluated via testing models trained on this distilled dataset on a separate real dataset (validation/test set). A good small distilled dataset is not only useful in dataset understanding, but has various applications (e.g., continual learning, privacy, neural architecture search, etc.).

Papers

Showing 5175 of 216 papers

TitleStatusHype
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset DistillationCode1
Generalizing Dataset Distillation via Deep Generative PriorCode1
DiM: Distilling Dataset into Generative ModelCode1
DREAM: Efficient Dataset Distillation by Representative MatchingCode1
Dataset Distillation with Convexified Implicit GradientsCode1
Backdoor Attacks Against Dataset DistillationCode1
Minimizing the Accumulated Trajectory Error to Improve Dataset DistillationCode1
Scaling Up Dataset Distillation to ImageNet-1K with Constant MemoryCode1
Dataset Factorization for CondensationCode1
Dataset Distillation via FactorizationCode1
Efficient Dataset Distillation Using Random Feature ApproximationCode1
Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge EnvironmentsCode1
Remember the Past: Distilling Datasets into Addressable Memories for Neural NetworksCode1
Flexible Dataset Distillation: Learn Labels Instead of ImagesCode1
Soft-Label Dataset Distillation and Text Dataset DistillationCode1
Dataset DistillationCode1
Information-Guided Diffusion Sampling for Dataset Distillation0
Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling0
FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation0
Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledgeCode0
Hyperbolic Dataset Distillation0
Data-Distill-Net: A Data Distillation Approach Tailored for Reply-based Continual Learning0
Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory0
MGD^3: Mode-Guided Dataset Distillation using Diffusion Models0
CONCORD: Concept-Informed Diffusion for Dataset DistillationCode0
Show:102550
← PrevPage 3 of 9Next →

No leaderboard results yet.