| EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification Tasks | Jan 31, 2019 | Data AugmentationGeneral Classification | CodeCode Available | 2 |
| A Survey on Data Augmentation in Large Model Era | Jan 27, 2024 | Audio Signal ProcessingData Augmentation | CodeCode Available | 2 |
| GPT3Mix: Leveraging Large-scale Language Models for Text Augmentation | Apr 18, 2021 | Data AugmentationGeneral Classification | CodeCode Available | 1 |
| Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks | Oct 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Image, Text, and Speech Data Augmentation using Multimodal LLMs for Deep Learning: A Survey | Jan 29, 2025 | Data AugmentationImage Augmentation | CodeCode Available | 1 |
| PairAug: What Can Augmented Image-Text Pairs Do for Radiology? | Apr 7, 2024 | Data Augmentationimage-classification | CodeCode Available | 1 |
| Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning | Mar 4, 2022 | Self-LearningText Augmentation | CodeCode Available | 1 |
| Pretraining Language Models with Text-Attributed Heterogeneous Graphs | Oct 19, 2023 | Graph Neural NetworkLink Prediction | CodeCode Available | 1 |
| CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts | Nov 28, 2023 | Contrastive LearningData Augmentation | CodeCode Available | 1 |
| BootAug: Boosting Text Augmentation via Hybrid Instance Filtering Framework | Oct 6, 2022 | ClassificationData Augmentation | CodeCode Available | 1 |
| Text Augmentation for Language Models in High Error Recognition Scenario | Nov 11, 2020 | Data Augmentationspeech-recognition | CodeCode Available | 1 |
| Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuning | Dec 31, 2020 | Adversarial RobustnessData Augmentation | CodeCode Available | 1 |
| Story Visualization by Online Text Augmentation with Context Memory | Aug 15, 2023 | Image GenerationSentence | CodeCode Available | 1 |
| Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIP | Dec 13, 2024 | Action RecognitionText Augmentation | CodeCode Available | 1 |
| Words or Vision: Do Vision-Language Models Have Blind Faith in Text? | Mar 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DoubleMix: Simple Interpolation-Based Data Augmentation for Text Classification | Sep 12, 2022 | ClassificationData Augmentation | CodeCode Available | 1 |
| An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets | Jul 13, 2020 | General ClassificationText Augmentation | —Unverified | 0 |
| What Have Been Learned & What Should Be Learned? An Empirical Study of How to Selectively Augment Text for Classification | Sep 1, 2021 | ClassificationText Augmentation | —Unverified | 0 |
| Data Augmentation for Low-Resource Quechua ASR Improvement | Jul 14, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Experimental Study on Data Augmentation Techniques for Named Entity Recognition on Low-Resource Domains | Nov 21, 2024 | Data Augmentationnamed-entity-recognition | —Unverified | 0 |
| Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation | Dec 5, 2020 | Data Augmentationreinforcement-learning | —Unverified | 0 |
| Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation | Sep 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR | Jul 14, 2020 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation | May 1, 2024 | RetrievalText Augmentation | —Unverified | 0 |
| Drug Re-positioning via Text Augmented Knowledge Graph Embeddings | Oct 20, 2021 | Knowledge Graph EmbeddingsKnowledge Graphs | —Unverified | 0 |
| Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning | Sep 17, 2024 | DiversityRepresentation Learning | —Unverified | 0 |
| Augmenting emotion features in irony detection with Large language modeling | Apr 18, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Empirical Study of Text Augmentation on Social Media Text in Vietnamese | Oct 1, 2020 | Text Augmentation | —Unverified | 0 |
| Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values | Oct 14, 2022 | ClassificationFew-Shot Learning | —Unverified | 0 |
| Entity Aware Syntax Tree Based Data Augmentation for Natural Language Understanding | Sep 6, 2022 | Data AugmentationIntent Detection | —Unverified | 0 |
| Evaluation Metrics for Text Data Augmentation in NLP | Feb 9, 2024 | Data AugmentationText Augmentation | —Unverified | 0 |
| ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis | May 12, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | —Unverified | 0 |
| Augmenty: A Python Library for Structured Text Augmentation | Dec 9, 2023 | Dependency Parsingnamed-entity-recognition | —Unverified | 0 |
| "Hinglish" Language -- Modeling a Messy Code-Mixed Language | Dec 30, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Fast-slow Encoder based Transducer with Streaming Deliberation | Dec 15, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies | Feb 14, 2024 | Sentiment AnalysisText Augmentation | —Unverified | 0 |
| IndiText Boost: Text Augmentation for Low Resource India Languages | Jan 23, 2024 | Data AugmentationMulti Class Text Classification | —Unverified | 0 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data | Apr 7, 2025 | ClassificationText Augmentation | —Unverified | 0 |
| Boosting Event Extraction with Denoised Structure-to-Text Augmentation | May 16, 2023 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs? | Aug 29, 2024 | Data AugmentationText Augmentation | —Unverified | 0 |
| LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model | Mar 12, 2024 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Mitigating Data Imbalance for Software Vulnerability Assessment: Does Data Augmentation Help? | Jul 15, 2024 | Data AugmentationText Augmentation | —Unverified | 0 |
| Back Translation Survey for Improving Text Augmentation | Feb 19, 2021 | SentenceSurvey | —Unverified | 0 |
| Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral | Jan 22, 2025 | Text AugmentationTransfer Learning | —Unverified | 0 |
| Neural Data-to-Text Generation with LM-based Text Augmentation | Feb 6, 2021 | Data-to-Text GenerationText Augmentation | —Unverified | 0 |
| On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech | Jun 9, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 9 | Jun 17, 2024 | Audio Source SeparationPrompt Engineering | —Unverified | 0 |
| Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III | Aug 24, 2020 | AttributePrediction | —Unverified | 0 |
| Probabilistic Linguistic Knowledge and Token-level Text Augmentation | Jun 29, 2023 | Text Augmentation | —Unverified | 0 |