| A Survey on Data Augmentation in Large Model Era | Jan 27, 2024 | Audio Signal ProcessingData Augmentation | CodeCode Available | 2 | 5 |
| EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks | Jan 31, 2019 | Data AugmentationGeneral Classification | CodeCode Available | 2 | 5 |
| Words or Vision: Do Vision-Language Models Have Blind Faith in Text? | Mar 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP | Dec 13, 2024 | Action RecognitionText Augmentation | CodeCode Available | 1 | 5 |
| Text Augmentation for Language Models in High Error Recognition Scenario | Nov 11, 2020 | Data Augmentationspeech-recognition | CodeCode Available | 1 | 5 |
| Story Visualization by Online Text Augmentation with Context Memory | Aug 15, 2023 | Image GenerationSentence | CodeCode Available | 1 | 5 |
| Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning | Mar 4, 2022 | Self-LearningText Augmentation | CodeCode Available | 1 | 5 |
| DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification | Sep 12, 2022 | ClassificationData Augmentation | CodeCode Available | 1 | 5 |
| PairAug: What Can Augmented Image-Text Pairs Do for Radiology? | Apr 7, 2024 | Data Augmentationimage-classification | CodeCode Available | 1 | 5 |
| BootAug: Boosting Text Augmentation via Hybrid Instance Filtering Framework | Oct 6, 2022 | ClassificationData Augmentation | CodeCode Available | 1 | 5 |
| Image, Text, and Speech Data Augmentation using Multimodal LLMs for Deep Learning: A Survey | Jan 29, 2025 | Data AugmentationImage Augmentation | CodeCode Available | 1 | 5 |
| Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks | Oct 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation | Apr 18, 2021 | Data AugmentationGeneral Classification | CodeCode Available | 1 | 5 |
| Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning | Dec 31, 2020 | Adversarial RobustnessData Augmentation | CodeCode Available | 1 | 5 |
| Pretraining Language Models with Text-Attributed Heterogeneous Graphs | Oct 19, 2023 | Graph Neural NetworkLink Prediction | CodeCode Available | 1 | 5 |
| CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts | Nov 28, 2023 | Contrastive LearningData Augmentation | CodeCode Available | 1 | 5 |
| Self-training Improves Pre-training for Few-shot Learning in Task-oriented Dialog Systems | Aug 28, 2021 | dialog state trackingFew-Shot Learning | CodeCode Available | 0 | 5 |
| Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap | Feb 6, 2024 | Domain GeneralizationQuantization | CodeCode Available | 0 | 5 |
| BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset | May 28, 2022 | Image CaptioningMachine Translation | CodeCode Available | 0 | 5 |
| BrightCookies at SemEval-2025 Task 9: Exploring Data Augmentation for Food Hazard Classification | Apr 29, 2025 | Data AugmentationText Augmentation | CodeCode Available | 0 | 5 |
| Contextual Augmentation: Data Augmentation by Words with Paradigmatic Relations | May 16, 2018 | Data AugmentationGeneral Classification | CodeCode Available | 0 | 5 |
| COVID-19 Vaccine Misinformation in Middle Income Countries | Nov 30, 2023 | Language ModellingLarge Language Model | CodeCode Available | 0 | 5 |
| Data Augmentation via Dependency Tree Morphing for Low-Resource Languages | Mar 22, 2019 | Data AugmentationPart-Of-Speech Tagging | CodeCode Available | 0 | 5 |
| Distributional Data Augmentation Methods for Low Resource Language | Sep 9, 2023 | Data AugmentationSynthetic Data Generation | CodeCode Available | 0 | 5 |
| EDDA: A Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection | Mar 23, 2024 | Data AugmentationDecoder | CodeCode Available | 0 | 5 |
| Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation | Jan 12, 2024 | Data AugmentationDiversity | CodeCode Available | 0 | 5 |
| Empirical Study of Text Augmentation on Social Media Text in Vietnamese | Sep 25, 2020 | Data AugmentationGeneral Classification | CodeCode Available | 0 | 5 |
| From Big to Small Without Losing It All: Text Augmentation with ChatGPT for Efficient Sentiment Analysis | Dec 7, 2023 | AllSentiment Analysis | CodeCode Available | 0 | 5 |
| Improving short text classification through global augmentation methods | Jul 7, 2019 | ArticlesClassification | CodeCode Available | 0 | 5 |
| Laser: Efficient Language-Guided Segmentation in Neural Radiance Fields | Jan 31, 2025 | SegmentationText Augmentation | CodeCode Available | 0 | 5 |
| Learning to Compose Domain-Specific Transformations for Data Augmentation | Sep 6, 2017 | Data AugmentationImage Augmentation | CodeCode Available | 0 | 5 |
| Linguistic Knowledge in Data Augmentation for Natural Language Processing: An Example on Chinese Question Matching | Nov 29, 2021 | Data AugmentationLanguage Modelling | CodeCode Available | 0 | 5 |
| RPN: A Word Vector Level Data Augmentation Algorithm in Deep Learning for Language Understanding | Dec 12, 2022 | CoLAData Augmentation | CodeCode Available | 0 | 5 |
| Selective Text Augmentation with Word Roles for Low-Resource Text Classification | Sep 4, 2022 | ClassificationData Augmentation | CodeCode Available | 0 | 5 |
| Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccines | Sep 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Sequence-to-Sequence Data Augmentation for Dialogue Language Understanding | Jul 4, 2018 | Data AugmentationDiversity | CodeCode Available | 0 | 5 |
| STA: Self-controlled Text Augmentation for Improving Text Classifications | Feb 24, 2023 | BenchmarkingText Augmentation | CodeCode Available | 0 | 5 |
| Teaching Specific Scientific Knowledge into Large Language Models through Additional Training | Dec 6, 2023 | Hyperparameter OptimizationLanguage Modeling | CodeCode Available | 0 | 5 |
| Text Data Augmentation Made Simple By Leveraging NLP Cloud APIs | Dec 5, 2018 | Data AugmentationText Augmentation | CodeCode Available | 0 | 5 |
| UCD-CS at TREC 2021 Incident Streams Track | Dec 7, 2021 | HumanitarianMulti-Task Learning | CodeCode Available | 0 | 5 |
| Use Random Selection for Now: Investigation of Few-Shot Selection Strategies in LLM-based Text Augmentation for Classification | Oct 14, 2024 | Data AugmentationFew-Shot Learning | CodeCode Available | 0 | 5 |
| Evaluation Metrics for Text Data Augmentation in NLP | Feb 9, 2024 | Data AugmentationText Augmentation | —Unverified | 0 | 0 |
| ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis | May 12, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | —Unverified | 0 | 0 |
| Text Augmentation in a Multi-Task View | Jan 14, 2021 | Data AugmentationText Augmentation | —Unverified | 0 | 0 |
| Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data | Apr 7, 2025 | ClassificationText Augmentation | —Unverified | 0 | 0 |
| "Hinglish" Language -- Modeling a Messy Code-Mixed Language | Dec 30, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Augmenty: A Python Library for Structured Text Augmentation | Dec 9, 2023 | Dependency Parsingnamed-entity-recognition | —Unverified | 0 | 0 |
| Improving Fast-slow Encoder based Transducer with Streaming Deliberation | Dec 15, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Text Augmentation Techniques in Drug Adverse Effect Detection Task | Jun 1, 2021 | Text Augmentation | —Unverified | 0 | 0 |
| IndiText Boost: Text Augmentation for Low Resource India Languages | Jan 23, 2024 | Data AugmentationMulti Class Text Classification | —Unverified | 0 | 0 |