Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset (the training set) and outputs a small synthetic distilled dataset, which is evaluated by training models on the distilled dataset and testing them on a separate real dataset (the validation/test set). A good distilled dataset is not only useful for understanding the original data, but also has a variety of applications (e.g., continual learning, privacy, and neural architecture search).
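
For concreteness, below is a minimal sketch of one common family of such methods, gradient matching (in the spirit of Dataset Condensation): the synthetic images are learnable parameters, updated so that the gradients they induce in a randomly initialized network match the gradients induced by real data. This is a simplified illustration, not any specific paper's method; the toy random data, the network in make_model, and hyperparameters like ipc are all illustrative assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

torch.manual_seed(0)

# Toy stand-ins for a real dataset: 32x32 grayscale, 10 classes.
# In practice these would come from a real dataset such as MNIST or CIFAR.
real_images = torch.randn(1000, 1, 32, 32)
real_labels = torch.randint(0, 10, (1000,))

# The distilled dataset: a few learnable synthetic images per class.
ipc = 1  # images per class (assumed hyperparameter)
syn_images = torch.randn(10 * ipc, 1, 32, 32, requires_grad=True)
syn_labels = torch.arange(10).repeat_interleave(ipc)

def make_model():
    # Small illustrative ConvNet; any architecture would do here.
    return nn.Sequential(
        nn.Conv2d(1, 16, 3, padding=1), nn.ReLU(), nn.AvgPool2d(2),
        nn.Flatten(), nn.Linear(16 * 16 * 16, 10),
    )

syn_opt = torch.optim.SGD([syn_images], lr=0.1)

for it in range(100):
    model = make_model()  # fresh randomly initialized network each iteration
    params = list(model.parameters())

    # Gradient of the loss on a real batch (no graph needed).
    idx = torch.randint(0, 1000, (64,))
    real_loss = F.cross_entropy(model(real_images[idx]), real_labels[idx])
    g_real = torch.autograd.grad(real_loss, params)

    # Gradient on the synthetic set; keep the graph so we can
    # backpropagate through it to the synthetic pixels.
    syn_loss = F.cross_entropy(model(syn_images), syn_labels)
    g_syn = torch.autograd.grad(syn_loss, params, create_graph=True)

    # Match the two gradients and update only the synthetic images.
    match = sum(F.mse_loss(a, b.detach()) for a, b in zip(g_syn, g_real))
    syn_opt.zero_grad()
    match.backward()
    syn_opt.step()
```

To evaluate the result under the protocol described above, one would train a fresh model to convergence on (syn_images, syn_labels) and report its accuracy on the real held-out test set.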

Papers

Showing 101–110 of 216 papers

Title | Status | Hype
Risk of Text Backdoor Attacks Under Dataset Distillation | Code | 0
Sequential Subset Matching for Dataset Distillation | Code | 0
TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation | Code | 0
Teddy: Efficient Large-Scale Dataset Distillation via Taylor-Approximated Matching | Code | 0
Towards Adversarially Robust Dataset Distillation by Curvature Regularization | Code | 0
Towards Mitigating Architecture Overfitting on Distilled Datasets | Code | 0
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation | Code | 0
Towards Efficient Deep Hashing Retrieval: Condensing Your Data via Feature-Embedding Matching | | 0
Adaptive Dataset Quantization | | 0
Video Set Distillation: Information Diversification and Temporal Densification | | 0
Page 11 of 22

Leaderboard

No leaderboard results yet.