SOTAVerified

Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set), and outputs a small synthetic distilled dataset, which is evaluated via testing models trained on this distilled dataset on a separate real dataset (validation/test set). A good small distilled dataset is not only useful in dataset understanding, but has various applications (e.g., continual learning, privacy, neural architecture search, etc.).

Papers

Showing 7180 of 216 papers

TitleStatusHype
Going Beyond Feature Similarity: Effective Dataset Distillation based on Class-Aware Conditional Mutual InformationCode0
Dataset Distillation via Adversarial Prediction MatchingCode0
Curriculum Coarse-to-Fine Selection for High-IPC Dataset DistillationCode0
Dataset Distillation using Neural Feature RegressionCode0
CONCORD: Concept-Informed Diffusion for Dataset DistillationCode0
BACON: Bayesian Optimal Condensation Framework for Dataset DistillationCode0
Image Distillation for Safe Data Sharing in HistopathologyCode0
ATOM: Attention Mixer for Efficient Dataset DistillationCode0
Dataset Distillation for Offline Reinforcement LearningCode0
Exploring Multilingual Text Data DistillationCode0
Show:102550
← PrevPage 8 of 22Next →

No leaderboard results yet.