SOTAVerified

Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
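As a rough illustration of the Markov-process approach mentioned above, the sketch below builds a first-order (bigram) model from a toy corpus and samples from it. The corpus, the function names `build_bigram_model` and `generate`, and the fixed seed are all assumptions made for this example; it is not taken from any of the listed papers.

```python
import random
from collections import defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return model


def generate(model, start, max_tokens=10, seed=0):
    """Sample a sequence by repeatedly drawing a successor of the last token."""
    rng = random.Random(seed)  # fixed seed so repeated runs give the same sample
    output = [start]
    while len(output) < max_tokens:
        successors = model.get(output[-1])
        if not successors:  # dead end: this token was never followed by anything
            break
        # Duplicates in the list make frequent successors proportionally more likely.
        output.append(rng.choice(successors))
    return output


# Toy corpus (hypothetical); a real system would train on far more text.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
sample = generate(model, start="the", max_tokens=8)
print(" ".join(sample))
```

Each generated word depends only on the word before it, which is why such models produce locally plausible but globally incoherent text compared with the neural approaches named above.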

(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 2101-2150 of 5335 papers

Title | Status | Hype
Utility-Probability Duality of Neural Networks | | 0
CGCE: A Chinese Generative Chat Evaluation Benchmark for General and Financial Domains | Code | 3
Advancing Precise Outline-Conditioned Text Generation with Task Duality and Explicit Outline Control | | 0
Language Model Self-improvement by Reinforcement Learning Contemplation | | 0
QTSumm: Query-Focused Summarization over Tabular Data | Code | 1
Evaluation of African American Language Bias in Natural Language Generation | | 0
APPLS: Evaluating Evaluation Metrics for Plain Language Summarization | Code | 0
Reducing Sensitivity on Speaker Names for Text Generation from Dialogues | Code | 0
Process-To-Text: A Framework for the Quantitative Description of Processes in Natural Language | | 0
Improving Factuality and Reasoning in Language Models through Multiagent Debate | Code | 2
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation | Code | 2
DUBLIN -- Document Understanding By Language-Image Network | | 0
INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback | Code | 1
Cognitive network science reveals bias in GPT-3, ChatGPT, and GPT-4 mirroring math anxiety in high-school students | | 0
Text Generation with Speech Synthesis for ASR Data Augmentation | | 0
Small Language Models Improve Giants by Rewriting Their Outputs | Code | 1
Syntactic Knowledge via Graph Attention with BERT in Machine Translation | | 0
Look-back Decoding for Open-Ended Text Generation | Code | 0
ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness | Code | 0
Non-Autoregressive Document-Level Machine Translation | Code | 0
Evaluating Factual Consistency of Texts with Semantic Role Labeling | Code | 1
GEST: the Graph of Events in Space and Time as a Common Representation between Vision and Language | | 0
A Frustratingly Simple Decoding Method for Neural Text Generation | Code | 1
MacLaSa: Multi-Aspect Controllable Text Generation via Efficient Sampling from Compact Latent Space | Code | 0
MAGE: Machine-generated Text Detection in the Wild | Code | 2
Pruning Pre-trained Language Models with Principled Importance and Self-regularization | Code | 0
Explaining How Transformers Use Context to Build Predictions | Code | 1
VNHSGE: VietNamese High School Graduation Examination Dataset for Large Language Models | Code | 1
DiffCap: Exploring Continuous Diffusion on Image Captioning | | 0
LogiCoT: Logical Chain-of-Thought Instruction-Tuning | Code | 1
LLM-Pruner: On the Structural Pruning of Large Language Models | Code | 3
Pengi: An Audio Language Model for Audio Tasks | Code | 2
BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases | Code | 1
Generating Visual Spatial Description via Holistic 3D Scene Understanding | Code | 1
ReTAG: Reasoning Aware Table to Analytic Text Generation | | 0
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation | Code | 1
DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion | | 0
What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability | Code | 0
Collaborative Generative AI: Integrating GPT-k for Efficient Editing in Text-to-Image Generation | | 0
AIwriting: Relations Between Image Generation and Digital Writing | | 0
ReGen: Zero-Shot Text Classification via Training Data Generation with Progressive Dense Retrieval | Code | 1
Exploiting Biased Models to De-bias Text: A Gender-Fair Rewriting Model | Code | 0
Diffusion Language Models Generation Can Be Halted Early | | 0
Cross-modality Data Augmentation for End-to-End Sign Language Translation | Code | 1
What You See is What You Read? Improving Text-Image Alignment Evaluation | Code | 1
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation | | 0
AR-Diffusion: Auto-Regressive Diffusion Model for Text Generation | Code | 1
Iterative Adversarial Attack on Image-guided Story Ending Generation | | 0
Pre-Training to Learn in Context | Code | 1
Boosting Event Extraction with Denoised Structure-to-Text Augmentation | | 0
Page 43 of 107

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | | Unverified
2 | FactT5B | BLEU | 48.37 | | Unverified
3 | JointGT Baseline | BLEU | 47.51 | | Unverified
4 | FactJointGT | BLEU | 47.39 | | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | | Unverified
6 | T5 | METEOR | 0.12 | | Unverified
7 | BART | METEOR | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | | Unverified
2 | partGAN | BLEU-2 | 0.91 | | Unverified
3 | RankGAN | BLEU-2 | 0.85 | | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | | Unverified
3 | RelGAN | BLEU-2 | 0.88 | | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | | Unverified
5 | RankGAN | BLEU-2 | 0.78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | | Unverified
2 | CRFR | Distinct-3 | 0.52 | | Unverified
3 | KGSF | Distinct-3 | 0.43 | | Unverified
4 | C2CRS | Distinct-3 | 0.33 | | Unverified
5 | KBRD | Distinct-3 | 0.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | | Unverified
3 | BART | METEOR | 0.3 | | Unverified
4 | T5 | METEOR | 0.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | | Unverified
2 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | | Unverified
4 | Beam search | BLEU-1 | 33.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | | Unverified
2 | TGen | METEOR | 0.15 | | Unverified
3 | TGen+ | METEOR | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | | Unverified
2 | SA-VAE | NLL | 327.5 | | Unverified
3 | Aggressive VAE | NLL | 326.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | | Unverified