SOTAVerified

Text Augmentation

You can read these blog posts to get an overview of the approaches.

Papers

Showing 2650 of 97 papers

TitleStatusHype
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning0
Augmenting emotion features in irony detection with Large language modeling0
Empirical Study of Text Augmentation on Social Media Text in Vietnamese0
Enabling Classifiers to Make Judgements Explicitly Aligned with Human Values0
Entity Aware Syntax Tree Based Data Augmentation for Natural Language Understanding0
Evaluation Metrics for Text Data Augmentation in NLP0
ExplainableDetector: Exploring Transformer-based Language Modeling Approach for SMS Spam Detection with Explainability Analysis0
Augmenty: A Python Library for Structured Text Augmentation0
"Hinglish" Language -- Modeling a Messy Code-Mixed Language0
Improving Fast-slow Encoder based Transducer with Streaming Deliberation0
Advancing NLP Models with Strategic Text Augmentation: A Comprehensive Study of Augmentation Methods and Curriculum Strategies0
IndiText Boost: Text Augmentation for Low Resource India Languages0
Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling0
Batch Aggregation: An Approach to Enhance Text Classification with Correlated Augmented Data0
Boosting Event Extraction with Denoised Structure-to-Text Augmentation0
LLMs vs Established Text Augmentation Techniques for Classification: When do the Benefits Outweight the Costs?0
LLMvsSmall Model? Large Language Model Based Text Augmentation Enhanced Personality Detection Model0
Mitigating Data Imbalance for Software Vulnerability Assessment: Does Data Augmentation Help?0
Back Translation Survey for Improving Text Augmentation0
Multimodal AI on Wound Images and Clinical Notes for Home Patient Referral0
Neural Data-to-Text Generation with LM-based Text Augmentation0
On the Effectiveness of Neural Text Generation based Data Augmentation for Recognition of Morphologically Rich Speech0
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 90
Prediction of ICD Codes with Clinical BERT Embeddings and Text Augmentation with Label Balancing using MIMIC-III0
Probabilistic Linguistic Knowledge and Token-level Text Augmentation0
Show:102550
← PrevPage 2 of 4Next →

No leaderboard results yet.