SOTAVerified

Dataset Distillation

Dataset distillation is the task of synthesizing a small dataset such that models trained on it achieve high performance on the original large dataset. A dataset distillation algorithm takes as input a large real dataset to be distilled (the training set) and outputs a small synthetic distilled dataset. The distilled dataset is evaluated by training models on it and testing them on a separate real dataset (the validation or test set). A good distilled dataset is useful not only for understanding the original data but also in applications such as continual learning, privacy, and neural architecture search.
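The input/output contract and evaluation protocol described above can be sketched in a few lines. This is a minimal illustration, not any listed paper's method: the toy two-blob data, the class-mean "distillation" step, the nearest-centroid learner, and all function names (`make_real_dataset`, `distill_class_means`, `evaluate_distilled`) are assumptions chosen only to keep the example self-contained.

```python
import numpy as np


def make_real_dataset(n_per_class=500, seed=0):
    """Toy stand-in for a large real dataset: two Gaussian blobs in 2-D."""
    rng = np.random.default_rng(seed)
    x0 = rng.normal(loc=[-2.0, 0.0], scale=1.0, size=(n_per_class, 2))
    x1 = rng.normal(loc=[2.0, 0.0], scale=1.0, size=(n_per_class, 2))
    x = np.vstack([x0, x1])
    y = np.concatenate([np.zeros(n_per_class, dtype=int),
                        np.ones(n_per_class, dtype=int)])
    return x, y


def distill_class_means(x, y):
    """Placeholder 'distillation' (an assumption, not a published method):
    emit one synthetic point per class, namely the class mean."""
    classes = np.unique(y)
    x_syn = np.stack([x[y == c].mean(axis=0) for c in classes])
    return x_syn, classes


def train_nearest_centroid(x_train, y_train):
    """Fit a nearest-centroid classifier on whatever data it is given."""
    classes = np.unique(y_train)
    centroids = np.stack([x_train[y_train == c].mean(axis=0) for c in classes])
    return classes, centroids


def predict(model, x):
    """Assign each row of x to the class of its closest centroid."""
    classes, centroids = model
    sq_dist = ((x[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return classes[sq_dist.argmin(axis=1)]


def evaluate_distilled(seed=0):
    """Distill the real training set, train on the synthetic set only,
    then report accuracy on a separate real test set."""
    x_train, y_train = make_real_dataset(seed=seed)
    x_test, y_test = make_real_dataset(n_per_class=200, seed=seed + 1)
    x_syn, y_syn = distill_class_means(x_train, y_train)
    model = train_nearest_centroid(x_syn, y_syn)  # never sees x_train
    accuracy = float((predict(model, x_test) == y_test).mean())
    return x_syn.shape[0], accuracy


if __name__ == "__main__":
    n_syn, acc = evaluate_distilled()
    print(f"synthetic points: {n_syn}, test accuracy: {acc:.3f}")
```

The point of the sketch is the separation of roles: the distillation step sees the real training set, the learner sees only the synthetic set, and the score comes from held-out real data. Published methods replace `distill_class_means` with an optimized or generative procedure.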

Papers

Showing 1–50 of 216 papers

Title | Status | Hype
Information-Guided Diffusion Sampling for Dataset Distillation | - | 0
Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling | Code | 0
Dataset Distillation via Vision-Language Category Prototype | Code | 1
FADRM: Fast and Accurate Data Residual Matching for Dataset Distillation | Code | 1
CaO_2: Rectifying Inconsistencies in Diffusion-Based Dataset Distillation | Code | 1
FedWSIDD: Federated Whole Slide Image Classification via Dataset Distillation | - | 0
Dataset distillation for memorized data: Soft labels can leak held-out teacher knowledge | Code | 0
Flowing Datasets with Wasserstein over Wasserstein Gradient Flows | Code | 1
OD3: Optimization-free Dataset Distillation for Object Detection | Code | 1
Hyperbolic Dataset Distillation | Code | 0
Data-Distill-Net: A Data Distillation Approach Tailored for Reply-based Continual Learning | - | 0
Diversity-Driven Generative Dataset Distillation Based on Diffusion Model with Self-Adaptive Memory | - | 0
MGD^3: Mode-Guided Dataset Distillation using Diffusion Models | - | 0
CONCORD: Concept-Informed Diffusion for Dataset Distillation | Code | 0
Taming Diffusion for Dataset Distillation with High Representativeness | Code | 1
Exploring Generalized Gait Recognition: Reducing Redundancy and Noise within Indoor and Outdoor Datasets | Code | 0
Contrastive Learning-Enhanced Trajectory Matching for Small-Scale Dataset Distillation | - | 0
DD-Ranking: Rethinking the Evaluation of Dataset Distillation | Code | 2
Beyond Modality Collapse: Representations Blending for Multimodal Dataset Distillation | - | 0
Leveraging Multi-Modal Information to Enhance Dataset Distillation | - | 0
Video Dataset Condensation with Diffusion Models | - | 0
Dataset Distillation with Probabilistic Latent Features | - | 0
UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation | Code | 0
Latent Video Dataset Distillation | - | 0
Distribution-aware Dataset Distillation for Efficient Image Restoration | - | 0
Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions | - | 0
Permutation-Invariant and Orientation-Aware Dataset Distillation for 3D Point Clouds | - | 0
Curriculum Coarse-to-Fine Selection for High-IPC Dataset Distillation | Code | 0
Enhancing Dataset Distillation via Non-Critical Region Refinement | Code | 0
Generative Dataset Distillation using Min-Max Diffusion Model | - | 0
Finding Stable Subnetworks at Initialization with Dataset Distillation | - | 0
Dataset Distillation for Quantum Neural Networks | - | 0
Robust Dataset Distillation by Matching Adversarial Trajectories | - | 0
Distilling Dataset into Neural Field | Code | 1
Understanding Dataset Distillation via Spectral Filtering | - | 0
Secure Federated Data Distillation | - | 0
Does Training with Synthetic Data Truly Protect Privacy? | Code | 0
The Evolution of Dataset Distillation: Toward Scalable and Generalizable Solutions | - | 0
Trust-Aware Diversion for Data-Effective Distillation | - | 0
Dark Distillation: Backdooring Distilled Datasets without Accessing Raw Data | - | 0
TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation | Code | 0
Knowledge Hierarchy Guided Biological-Medical Dataset Distillation for Domain LLM Training | - | 0
On Learning Representations for Tabular Data Distillation | - | 0
Class-Imbalanced-Aware Adaptive Dataset Distillation for Scalable Pretrained Model on Credit Scoring | - | 0
Dataset Distillation as Pushforward Optimal Quantization | - | 0
Dataset Distillation via Committee Voting | Code | 1
FocusDD: Real-World Scene Infusion for Robust Dataset Distillation | - | 0
Generative Dataset Distillation Based on Self-knowledge Distillation | - | 0
Towards Universal Dataset Distillation via Task-Driven Diffusion | - | 0
OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation | - | 0

No leaderboard results yet.