Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 4251–4300 of 5335 papers

Title	Date	Tasks	Status
Distilling semantically aware orders for autoregressive image generation	Apr 23, 2025	Image GenerationText Generation	—Unverified
Distinctive Similarity of Clausal Coordinate Ellipsis in Russian Compared to Dutch, Estonian, German, and Hungarian	Sep 1, 2015	Text Generation	—Unverified
Distinctive Slogan Generation with Reconstruction	Dec 1, 2020	Abstractive Text SummarizationDecoder	—Unverified
Distributional Information Embedding: A Framework for Multi-bit Watermarking	Jan 27, 2025	Text Generation	—Unverified
Distribution Aware Metrics for Conditional Natural Language Generation	Sep 15, 2022	Diversityspeech-recognition	—Unverified
Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization	Nov 2, 2023	ManagementModel Compression	—Unverified
Diverse Parallel Data Synthesis for Cross-Database Adaptation of Text-to-SQL Parsers	Oct 29, 2022	Data AugmentationNatural Language Queries	—Unverified
Diversifying Neural Text Generation with Part-of-Speech Guided Softmax and Sampling	Nov 16, 2021	DiversityPOS	—Unverified
Diversity as a By-Product: Goal-oriented Language Generation Leads to Linguistic Variation	Jul 1, 2021	DiversityImage Captioning	—Unverified
Diversity Enhanced Table-to-Text Generation via Type Control	May 22, 2022	DiversityTable-to-Text Generation	—Unverified
Diversity of Thought Elicits Stronger Reasoning Capabilities in Multi-Agent Debate Frameworks	Oct 10, 2024	8kDiversity	—Unverified
DLGNet-Task: An End-to-end Neural Network Framework for Modeling Multi-turn Multi-domain Task-Oriented Dialogue	Oct 4, 2020	Dialogue GenerationNatural Language Understanding	—Unverified
Do Captioning Metrics Reflect Music Semantic Alignment?	Nov 18, 2024	Music CaptioningText Generation	—Unverified
DocChat: An Information Retrieval Approach for Chatbot Engines Using Unstructured Documents	Aug 1, 2016	ChatbotCommunity Question Answering	—Unverified
DOCCI: Descriptions of Connected and Contrasting Images	Apr 30, 2024	Image GenerationImage to text	—Unverified
Doctoral Advisor or Medical Condition: Towards Entity-specific Rankings of Knowledge Base Properties [Extended Version]	Sep 20, 2017	Semantic SimilaritySemantic Textual Similarity	—Unverified
Document Level Hierarchical Transformer	Dec 1, 2021	Document Level Machine TranslationImitation Learning	—Unverified
Do DALL-E and Flamingo Understand Each Other?	Dec 23, 2022	Image CaptioningImage Generation	—Unverified
Do dependency parsing metrics correlate with human judgments?	Jul 1, 2015	Dependency ParsingMachine Translation	—Unverified
Does a Large Language Model Really Speak in Human-Like Language?	Jan 2, 2025	Language ModelingLanguage Modelling	—Unverified
Does Meta-learning Help mBERT for Few-shot Question Generation in a Cross-lingual Transfer Setting for Indic Languages?	Oct 1, 2022	Cross-Lingual TransferLanguage Modeling	—Unverified
Does the Order of Training Samples Matter? Improving Neural Data-to-Text Generation with Curriculum Learning	Feb 6, 2021	Data-to-Text GenerationText Generation	—Unverified
Do Large Code Models Understand Programming Concepts? Counterfactual Analysis for Code Predicates	Feb 8, 2024	Code CompletionCode Generation	—Unverified
Do Large Language Models Judge Error Severity Like Humans?	Jun 5, 2025	Text Generation	—Unverified
Do Large Language Models Know about Facts?	Oct 8, 2023	Question AnsweringText Generation	—Unverified
Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023	Jan 31, 2025	ArticlesCaption Generation	—Unverified
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG	May 24, 2023	Dialogue GenerationDiversity	—Unverified
Domain Adaptable Semantic Clustering in Statistical NLG	Mar 1, 2013	ClusteringText Generation	—Unverified
Domain-Specific Image Captioning	Jun 1, 2014	Image CaptioningSentence Compression	—Unverified
Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information	Jun 21, 2022	Language ModelingLanguage Modelling	—Unverified
Don’t Forget About Pronouns: Removing Gender Bias in Language Models without Losing Factual Gender Information	Jan 16, 2022	Language ModelingLanguage Modelling	—Unverified
Don’t Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information	Jul 1, 2022	Language ModelingLanguage Modelling	—Unverified
Don't Mention the Shoe! A Learning to Rank Approach to Content Selection for Image Description Generation	Sep 1, 2016	Image DescriptionImage Retrieval	—Unverified
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation	Jan 16, 2022	Machine TranslationStyle Transfer	—Unverified
DORB: Dynamically Optimizing Multiple Rewards with Bandits	Nov 15, 2020	Data-to-Text GenerationQuestion Generation	—Unverified
DORE: A Dataset For Portuguese Definition Generation	Mar 26, 2024	Definition ModellingText Generation	—Unverified
DOSA: A Dataset of Social Artifacts from Different Indian Geographical Subcultures	Feb 23, 2024	Question AnsweringText Generation	—Unverified
Do sequence-to-sequence VAEs learn global features of sentences?	Apr 16, 2020	Language ModelingLanguage Modelling	—Unverified
DR.BENCH: Diagnostic Reasoning Benchmark for Clinical Natural Language Processing	Sep 29, 2022	DiagnosticNamed Entity Recognition	—Unverified
Beyond Sparse Rewards: Enhancing Reinforcement Learning with Language Model Critique in Text Generation	Jan 14, 2024	Language ModelingLanguage Modelling	—Unverified
DSGPT: Domain-Specific Generative Pre-Training of Transformers for Text Generation in E-commerce Title and Review Summarization	Dec 15, 2021	DecoderText Generation	—Unverified
DSim, a Danish Parallel Corpus for Text Simplification	May 1, 2012	ArticlesMachine Translation	—Unverified
DSPy Assertions: Computational Constraints for Self-Refining Language Model Pipelines	Dec 20, 2023	Language ModelingLanguage Modelling	—Unverified
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models	Mar 5, 2025	HallucinationText Generation	—Unverified
DTLLM-VLT: Diverse Text Generation for Visual Language Tracking Based on LLM	May 20, 2024	Object TrackingText Generation	—Unverified
Dual Latent Variable Model for Low-Resource Natural Language Generation in Dialogue Systems	Nov 10, 2018	DecoderText Generation	—Unverified
Dynamic Traceback Learning for Medical Report Generation	Jan 24, 2024	Image to textMedical Report Generation	—Unverified
Dual use issues in the field of Natural Language Generation	Jan 11, 2025	SurveyText Generation	—Unverified
DUBLIN -- Document Understanding By Language-Image Network	May 23, 2023	Document Classificationdocument understanding	—Unverified
Dutch Humor Detection by Generating Negative Examples	Oct 26, 2020	Binary ClassificationCommon Sense Reasoning	—Unverified

Show:10 25 50

← PrevPage 86 of 107Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified