Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (training set) and outputs a small synthetic distilled dataset, which is evaluated by training models on the distilled dataset and testing them on a separate real dataset (validation/test set). A good small distilled dataset is not only useful for dataset understanding, but also has various applications (e.g., continual learning, privacy, and neural architecture search).
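
The two-stage protocol described above can be sketched in PyTorch as follows. The `distill` routine here is only a hypothetical placeholder that subsamples the real data to keep the sketch runnable; actual methods instead optimize the synthetic images (e.g., by matching gradients, features, or training trajectories). The evaluation step is the part that is common across methods: train a fresh model on the distilled set, then measure accuracy on real held-out data.

```python
# Minimal sketch of the dataset distillation evaluation protocol.
# `distill` is a hypothetical placeholder, not any specific paper's algorithm.
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset


def distill(train_set: TensorDataset, images_per_class: int) -> TensorDataset:
    """Placeholder: return a small synthetic dataset.

    A real distillation algorithm would optimize these examples against the
    full training set; here we simply keep a few real images per class.
    """
    xs, ys = train_set.tensors
    keep = []
    for c in ys.unique():
        idx = (ys == c).nonzero(as_tuple=True)[0][:images_per_class]
        keep.append(idx)
    keep = torch.cat(keep)
    return TensorDataset(xs[keep].clone(), ys[keep].clone())


def evaluate_distilled(distilled: TensorDataset, test_set: TensorDataset,
                       model: nn.Module, epochs: int = 20) -> float:
    """Train a fresh model on the distilled set; report accuracy on real test data."""
    opt = torch.optim.SGD(model.parameters(), lr=0.01, momentum=0.9)
    loss_fn = nn.CrossEntropyLoss()
    loader = DataLoader(distilled, batch_size=64, shuffle=True)
    model.train()
    for _ in range(epochs):
        for x, y in loader:
            opt.zero_grad()
            loss_fn(model(x), y).backward()
            opt.step()
    model.eval()
    correct = 0
    with torch.no_grad():
        for x, y in DataLoader(test_set, batch_size=256):
            correct += (model(x).argmax(dim=1) == y).sum().item()
    return correct / len(test_set)
```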

Papers

Showing 1–50 of 216 papers

Title | Status | Hype
Dataset Distillation with Neural Characteristic Function: A Minmax Perspective | Code | 3
FedCache 2.0: Federated Edge Learning with Knowledge Caching and Dataset Distillation | Code | 2
Dataset Quantization | Code | 2
DD-Ranking: Rethinking the Evaluation of Dataset Distillation | Code | 2
Dataset Distillation by Matching Training Trajectories | Code | 2
Self-supervised Dataset Distillation: A Good Compression Is All You Need | Code | 2
Improve Cross-Architecture Generalization on Dataset Distillation | Code | 1
GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost | Code | 1
Soft-Label Dataset Distillation and Text Dataset Distillation | Code | 1
Dataset Factorization for Condensation | Code | 1
Frequency Domain-based Dataset Distillation | Code | 1
Generative Dataset Distillation Based on Diffusion Model | Code | 1
A Large-Scale Study on Video Action Dataset Condensation | Code | 1
Group Distributionally Robust Dataset Distillation with Risk Minimization | Code | 1
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios | Code | 1
Federated Learning via Decentralized Dataset Distillation in Resource-Constrained Edge Environments | Code | 1
DREAM+: Efficient Dataset Distillation by Bidirectional Representative Matching | Code | 1
Does Graph Distillation See Like Vision Dataset Counterpart? | Code | 1
Dataset Distillation via Factorization | Code | 1
Embarrassingly Simple Dataset Distillation | Code | 1
Dataset Distillation with Convexified Implicit Gradients | Code | 1
FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation | Code | 1
Dataset Quantization with Active Learning based Adaptive Sampling | Code | 1
Flowing Datasets with Wasserstein over Wasserstein Gradient Flows | Code | 1
A Label is Worth a Thousand Images in Dataset Distillation | Code | 1
Generalizing Dataset Distillation via Deep Generative Prior | Code | 1
D^4: Dataset Distillation via Disentangled Diffusion Model | Code | 1
D^4M: Dataset Distillation via Disentangled Diffusion Model | Code | 1
Dancing with Still Images: Video Distillation via Static-Dynamic Disentanglement | Code | 1
DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation | Code | 1
Efficiency for Free: Ideal Data Are Transportable Representations | Code | 1
Distill Gold from Massive Ores: Bi-level Data Pruning towards Efficient Dataset Distillation | Code | 1
DiM: Distilling Dataset into Generative Model | Code | 1
Distilling Dataset into Neural Field | Code | 1
Dataset Distillation | Code | 1
Are Large-scale Soft Labels Necessary for Large-scale Dataset Distillation? | Code | 1
Can pre-trained models assist in dataset distillation? | Code | 1
DREAM: Efficient Dataset Distillation by Representative Matching | Code | 1
CaO_2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation | Code | 1
DataDAM: Efficient Dataset Distillation with Attention Matching | Code | 1
Dataset Distillation via Committee Voting | Code | 1
Efficient Dataset Distillation via Minimax Diffusion | Code | 1
DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation | Code | 1
Dataset Distillation via Vision-Language Category Prototype | Code | 1
Distilling Datasets Into Less Than One Image | Code | 1
Exploiting Inter-sample and Inter-feature Relations in Dataset Distillation | Code | 1
Backdoor Attacks Against Dataset Distillation | Code | 1
Dataset Distillation via Curriculum Data Synthesis in Large Data Era | Code | 1
Efficient Dataset Distillation Using Random Feature Approximation | Code | 1
Flexible Dataset Distillation: Learn Labels Instead of Images | Code | 1

No leaderboard results yet.