SOTAVerified

Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set), and outputs a small synthetic distilled dataset, which is evaluated via testing models trained on this distilled dataset on a separate real dataset (validation/test set). A good small distilled dataset is not only useful in dataset understanding, but has various applications (e.g., continual learning, privacy, neural architecture search, etc.).

Papers

Showing 110 of 216 papers

TitleStatusHype
Information-Guided Diffusion Sampling for Dataset Distillation0
Task-Specific Generative Dataset Distillation with Difficulty-Guided SamplingCode0
Dataset Distillation via Vision-Language Category PrototypeCode1
FADRM: Fast and Accurate Data Residual Matching for Dataset DistillationCode1
CaO_2: Rectifying Inconsistencies in Diffusion-Based Dataset DistillationCode1
FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation0
Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledgeCode0
Flowing Datasets with Wasserstein over Wasserstein Gradient FlowsCode1
OD3: Optimization-free Dataset Distillation for Object DetectionCode1
Hyperbolic Dataset DistillationCode0
Show:102550
← PrevPage 1 of 22Next →

No leaderboard results yet.