Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 451–500 of 5335 papers

Title	Date	Tasks	Status	Hype
A Hierarchical Neural Autoencoder for Paragraphs and Documents	Jun 2, 2015	SentenceText Generation	CodeCode Available	1
GLAT: Glancing at Latent Variables for Parallel Text Generation	May 1, 2022	Text Generation	CodeCode Available	1
Attribute First, then Generate: Locally-attributable Grounded Text Generation	Mar 25, 2024	AttributeDocument Summarization	CodeCode Available	1
Global Explainability of BERT-Based Evaluation Metrics by Disentangling along Linguistic Factors	Oct 8, 2021	Language ModelingLanguage Modelling	CodeCode Available	1
Dependency-based Mixture Language Models	Mar 19, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
DeCap: Decoding CLIP Latents for Zero-Shot Captioning via Text-Only Training	Mar 6, 2023	DecoderImage Captioning	CodeCode Available	1
T3: Tree-Autoencoder Constrained Adversarial Text Generation for Targeted Attack	Dec 22, 2019	Adversarial AttackAdversarial Text	CodeCode Available	1
GPT-too: A language-model-first approach for AMR-to-text generation	May 18, 2020	AMR-to-Text GenerationData-to-Text Generation	CodeCode Available	1
Data-to-text Generation with Macro Planning	Feb 4, 2021	Data-to-Text GenerationDecoder	CodeCode Available	1
Graph Pre-training for AMR Parsing and Generation	Mar 15, 2022	Abstract Meaning RepresentationAMR Parsing	CodeCode Available	1
A Benchmark for Evaluating Machine Translation Metrics on Dialects Without Standard Orthography	Nov 28, 2023	Machine TranslationText Generation	CodeCode Available	1
GRUEN for Evaluating Linguistic Quality of Generated Text	Oct 6, 2020	Text Generation	CodeCode Available	1
Data-to-text Generation with Variational Sequential Planning	Feb 28, 2022	Data-to-Text GenerationText Generation	CodeCode Available	1
Handwritten Text Generation from Visual Archetypes	Mar 27, 2023	Text Generation	CodeCode Available	1
Deep Graph Convolutional Encoders for Structured Data to Text Generation	Oct 23, 2018	Data-to-Text GenerationGraph-to-Sequence	CodeCode Available	1
DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM	Oct 23, 2023	DecoderText Generation	CodeCode Available	1
Data Generation for Post-OCR correction of Cyrillic handwriting	Nov 27, 2023	Handwriting generationHandwritten Text Recognition	CodeCode Available	1
Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity	Apr 8, 2020	AMR-to-Text GenerationData-to-Text Generation	CodeCode Available	1
Hello, It's GPT-2 -- How Can I Help You? Towards the Use of Pretrained Language Models for Task-Oriented Dialogue Systems	Jul 12, 2019	Decision MakingLanguage Modeling	CodeCode Available	1
HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation	Aug 15, 2021	DescriptiveEntity Alignment	CodeCode Available	1
MatTools: Benchmarking Large Language Models for Materials Science Tools	May 16, 2025	BenchmarkingQuestion Answering	CodeCode Available	1
How to Select Datapoints for Efficient Human Evaluation of NLG Models?	Jan 30, 2025	HumanEvalMachine Translation	CodeCode Available	1
Automatic Detection of Generated Text is Easiest when Humans are Fooled	Nov 2, 2019	BenchmarkingLanguage Modelling	CodeCode Available	1
Human-Machine Collaboration Approaches to Build a Dialogue Dataset for Hate Speech Countering	Nov 7, 2022	Text Generation	CodeCode Available	1
HWD: A Novel Evaluation Score for Styled Handwritten Text Generation	Oct 31, 2023	Image GenerationPerceptual Distance	CodeCode Available	1
A Comprehensive Survey of Accelerated Generation Techniques in Large Language Models	May 15, 2024	SurveyText Generation	CodeCode Available	1
ILLUMINER: Instruction-tuned Large Language Models as Few-shot Intent Classifier and Slot Filler	Mar 26, 2024	In-Context Learningintent-classification	CodeCode Available	1
Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic Evaluation	Apr 15, 2021	Data-to-Text GenerationQuestion Generation	CodeCode Available	1
An Empirical Study On Contrastive Search And Contrastive Decoding For Open-ended Text Generation	Nov 19, 2022	DiversityText Generation	CodeCode Available	1
Improving Conversational Recommender Systems via Knowledge Graph based Semantic Fusion	Jul 8, 2020	Knowledge GraphsRecommendation Systems	CodeCode Available	1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation	Oct 20, 2020	Image to textNatural Language Inference	CodeCode Available	1
Advancing Spatial Reasoning in Large Language Models: An In-Depth Evaluation and Enhancement Using the StepGame Benchmark	Jan 8, 2024	Relation MappingSpatial Reasoning	CodeCode Available	1
DALD: Improving Logits-based Detector without Logits from Black-box LLMs	Jun 7, 2024	Text DetectionText Generation	CodeCode Available	1
Improving Open-Ended Text Generation via Adaptive Decoding	Feb 28, 2024	DiversityStory Generation	CodeCode Available	1
Data Feedback Loops: Model-driven Amplification of Dataset Biases	Sep 8, 2022	image-classificationImage Classification	CodeCode Available	1
Data-to-Text Bilingual Generation	Nov 24, 2023	Text Generation	CodeCode Available	1
Adaptive Machine Translation with Large Language Models	Jan 30, 2023	DecoderDomain Adaptation	CodeCode Available	1
A Survey of Knowledge-Enhanced Text Generation	Oct 9, 2020	DecoderSurvey	CodeCode Available	1
An Empirical Study of GPT-4o Image Generation Capabilities	Apr 8, 2025	BenchmarkingImage Generation	CodeCode Available	1
InfoLM: A New Metric to Evaluate Summarization & Data2Text Generation	Dec 2, 2021	Language ModelingLanguage Modelling	CodeCode Available	1
Towards Reliable Detection of LLM-Generated Texts: A Comprehensive Evaluation Framework with CUDRT	Jun 13, 2024	BenchmarkingLLM-generated Text Detection	CodeCode Available	1
Adaptive Markup Language Generation for Contextually-Grounded Visual Document Understanding	May 8, 2025	document understandingInstruction Following	CodeCode Available	1
Customizing Language Models with Instance-wise LoRA for Sequential Recommendation	Aug 19, 2024	Mixture-of-ExpertsMulti-Task Learning	CodeCode Available	1
A Knowledge-Enhanced Pretraining Model for Commonsense Story Generation	Jan 15, 2020	Multi-Task LearningStory Generation	CodeCode Available	1
A Langevin-like Sampler for Discrete Distributions	Jun 20, 2022	Efficient ExplorationText Generation	CodeCode Available	1
Automatic Jailbreaking of the Text-to-Image Generative AI Systems	May 26, 2024	Image GenerationInformation Retrieval	CodeCode Available	1
CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation	Apr 2, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Interpreting Language Models with Contrastive Explanations	Feb 21, 2022	Language ModelingLanguage Modelling	CodeCode Available	1
Is ChatGPT a Good NLG Evaluator? A Preliminary Study	Mar 7, 2023	nlg evaluationStory Generation	CodeCode Available	1
DART: Open-Domain Structured Data Record to Text Generation	Jul 6, 2020	Domain GeneralizationSemantic Parsing	CodeCode Available	1

Show:10 25 50

← PrevPage 10 of 107Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified