SOTAVerified

Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set), and outputs a small synthetic distilled dataset, which is evaluated via testing models trained on this distilled dataset on a separate real dataset (validation/test set). A good small distilled dataset is not only useful in dataset understanding, but has various applications (e.g., continual learning, privacy, neural architecture search, etc.).

Papers

Showing 171180 of 216 papers

TitleStatusHype
Information-Guided Diffusion Sampling for Dataset Distillation0
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions0
Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training0
Label-Augmented Dataset Distillation0
Latent Dataset Distillation with Diffusion Models0
Exploring the Impact of Dataset Bias on Dataset DistillationCode0
Dataset Distillation for Offline Reinforcement LearningCode0
Exploring Multilingual Text Data DistillationCode0
Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor DatasetsCode0
TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential RecommendationCode0
Show:102550
← PrevPage 18 of 22Next →

No leaderboard results yet.