SOTAVerified

Text Augmentation

You can read these blog posts to get an overview of the approaches.

Papers

Showing 150 of 97 papers

TitleStatusHype
EDA: Easy Data Augmentation Techniques for Boosting Performance on Text Classification TasksCode2
A Survey on Data Augmentation in Large Model EraCode2
GPT3Mix: Leveraging Large-scale Language Models for Text AugmentationCode1
Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple TasksCode1
Image, Text, and Speech Data Augmentation using Multimodal LLMs for Deep Learning: A SurveyCode1
PairAug: What Can Augmented Image-Text Pairs Do for Radiology?Code1
Show Me What and Tell Me How: Video Synthesis via Multimodal ConditioningCode1
Pretraining Language Models with Text-Attributed Heterogeneous GraphsCode1
CLAP: Isolating Content from Style through Contrastive Learning with Augmented PromptsCode1
BootAug: Boosting Text Augmentation via Hybrid Instance Filtering FrameworkCode1
Text Augmentation for Language Models in High Error Recognition ScenarioCode1
Better Robustness by More Coverage: Adversarial Training with Mixup Augmentation for Robust Fine-tuningCode1
Story Visualization by Online Text Augmentation with Context MemoryCode1
Building a Multi-modal Spatiotemporal Expert for Zero-shot Action Recognition with CLIPCode1
Words or Vision: Do Vision-Language Models Have Blind Faith in Text?Code1
DoubleMix: Simple Interpolation-Based Data Augmentation for Text ClassificationCode1
An Enhanced Text Classification to Explore Health based Indian Government Policy Tweets0
What Have Been Learned & What Should Be Learned? An Empirical Study of How to Selectively Augment Text for Classification0
Data Augmentation for Low-Resource Quechua ASR Improvement0
An Experimental Study on Data Augmentation Techniques for Named Entity Recognition on Low-Resource Domains0
Data Boost: Text Data Augmentation Through Reinforcement Learning Guided Conditional Generation0
Decoder-only Architecture for Speech Recognition with CTC Prompts and Text Data Augmentation0
Deep Transformer based Data Augmentation with Subword Units for Morphologically Rich Online ASR0
Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation0
Drug Re-positioning via Text Augmented Knowledge Graph Embeddings0
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning0
Augmenting emotion features in irony detection with Large language modeling0
Empirical Study of Text Augmentation on Social Media Text in Vietnamese0
Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values0
Entity Aware Syntax Tree Based Data Augmentation for Natural Language Understanding0
Evaluation Metrics for Text Data Augmentation in NLP0
ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis0
Augmenty: A Python Library for Structured Text Augmentation0
"Hinglish" Language -- Modeling a Messy Code-Mixed Language0
Improving Fast-slow Encoder based Transducer with Streaming Deliberation0
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies0
IndiText Boost: Text Augmentation for Low Resource India Languages0
Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling0
Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data0
Boosting Event Extraction with Denoised Structure-to-Text Augmentation0
LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?0
LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model0
Mitigating Data Imbalance for Software Vulnerability Assessment: Does Data Augmentation Help?0
Back Translation Survey for Improving Text Augmentation0
Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral0
Neural Data-to-Text Generation with LM-based Text Augmentation0
On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech0
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 90
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III0
Probabilistic Linguistic Knowledge and Token-level Text Augmentation0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.