Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 5335 papers

Title	Date	Tasks	Status	Hype
What Matters In The Structured Pruning of Generative Language Models?	Feb 7, 2023	Text Generation	CodeCode Available	2
In-Context Retrieval-Augmented Language Models	Jan 31, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
Grounding Language Models to Images for Multimodal Inputs and Outputs	Jan 31, 2023	Image RetrievalIn-Context Learning	CodeCode Available	2
eVAE: Evolutionary Variational Autoencoder	Jan 1, 2023	DisentanglementImage Generation	CodeCode Available	2
DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models	Nov 28, 2022	DenoisingLanguage Modeling	CodeCode Available	2
TaTa: A Multilingual Table-to-Text Dataset for African Languages	Oct 31, 2022	Data-to-Text GenerationText Generation	CodeCode Available	2
Contrastive Decoding: Open-ended Text Generation as Optimization	Oct 27, 2022	Language ModelingLanguage Modelling	CodeCode Available	2
Contrastive Search Is What You Need For Neural Text Generation	Oct 25, 2022	Contrastive LearningLanguage Modeling	CodeCode Available	2
DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models	Oct 17, 2022	DiversityText Generation	CodeCode Available	2
Towards a Unified Multi-Dimensional Evaluator for Text Generation	Oct 13, 2022	nlg evaluationQuestion Answering	CodeCode Available	2
Offline RL for Natural Language Generation with Implicit Language Q Learning	Jun 5, 2022	Language ModellingOffline RL	CodeCode Available	2
Multimodality for NLP-Centered Applications: Resources, Advances and Frontiers	Jun 1, 2022	SurveyText Generation	CodeCode Available	2
CoNT: Contrastive Neural Text Generation	May 29, 2022	Code Comment GenerationComment Generation	CodeCode Available	2
Symphony Generation with Permutation Invariant Language Model	May 10, 2022	Audio GenerationDecoder	CodeCode Available	2
Language Models Can See: Plugging Visual Controls in Text Generation	May 5, 2022	Image CaptioningImage-text matching	CodeCode Available	2
A Contrastive Framework for Neural Text Generation	Feb 13, 2022	DiversityText Generation	CodeCode Available	2
Ecco: An Open Source Library for the Explainability of Transformer Language Models	Aug 1, 2021	Text Generation	CodeCode Available	2
Structured Denoising Diffusion Models in Discrete State-Spaces	Jul 7, 2021	DenoisingText Generation	CodeCode Available	2
Measuring Mathematical Problem Solving With the MATH Dataset	Mar 5, 2021	MathMathematical Problem-Solving	CodeCode Available	2
Learning Transferable Visual Models From Natural Language Supervision	Feb 26, 2021	Action RecognitionBenchmarking	CodeCode Available	2
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation	Jan 6, 2021	Text Generation	CodeCode Available	2
Few-Shot Text Generation with Pattern-Exploiting Training	Dec 22, 2020	Headline Generationtext-classification	CodeCode Available	2
SeqGenSQL -- A Robust Sequence Generation Model for Structured Query Language	Nov 7, 2020	Text GenerationText to SQL	CodeCode Available	2
Deep Learning for Text Style Transfer: A Survey	Nov 1, 2020	ArticlesDeep Learning	CodeCode Available	2
Utterance-level Dialogue Understanding: An Empirical Study	Sep 29, 2020	Dialogue UnderstandingGoal-Oriented Dialogue Systems	CodeCode Available	2
The Language Interpretability Tool: Extensible, Interactive Visualizations and Analysis for NLP Models	Aug 12, 2020	counterfactualSentiment Analysis	CodeCode Available	2
Simplifying Paragraph-level Question Generation via Transformer Language Models	May 3, 2020	Language ModelingLanguage Modelling	CodeCode Available	2
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model	Mar 3, 2020	8kLanguage Modeling	CodeCode Available	2
Plug and Play Language Models: A Simple Approach to Controlled Text Generation	Dec 4, 2019	AttributeLanguage Modelling	CodeCode Available	2
Unified Vision-Language Pre-Training for Image Captioning and VQA	Sep 24, 2019	DecoderImage Captioning	CodeCode Available	2
MASS: Masked Sequence to Sequence Pre-training for Language Generation	May 7, 2019	Conversational Response GenerationDecoder	CodeCode Available	2
Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation	Sep 4, 2018	Machine TranslationText Generation	CodeCode Available	2
Mitigating Object Hallucinations via Sentence-Level Early Intervention	Jul 16, 2025	HallucinationMM-Vet	CodeCode Available	1
TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks	Jun 14, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Revisit What You See: Disclose Language Prior in Vision Tokens for Efficient Guided Decoding of LVLMs	Jun 11, 2025	HallucinationObject Hallucination	CodeCode Available	1
Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces	Jun 9, 2025	Image GenerationText Generation	CodeCode Available	1
Time to Talk: LLM Agents for Asynchronous Group Communication in Mafia Games	Jun 5, 2025	Action GenerationAsynchronous Group Communication	CodeCode Available	1
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Jun 4, 2025	FormText Generation	CodeCode Available	1
Neuro2Semantic: A Transfer Learning Framework for Semantic Reconstruction of Continuous Language from Human Intracranial EEG	May 31, 2025	EEGText Generation	CodeCode Available	1
Rethinking Text-based Protein Understanding: Retrieval or LLM?	May 26, 2025	RetrievalText Generation	CodeCode Available	1
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation	May 24, 2025	Semantic SimilaritySemantic Textual Similarity	CodeCode Available	1
ThinkRec: Thinking-based recommendation via LLM	May 21, 2025	Text Generation	CodeCode Available	1
U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding	May 20, 2025	cross-modal alignmentLanguage Modeling	CodeCode Available	1
EEG-to-Text Translation: A Model for Deciphering Human Brain Activity	May 20, 2025	DecoderEEG	CodeCode Available	1
Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning	May 20, 2025	Answer GenerationRAG	CodeCode Available	1
A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations	May 20, 2025	SentenceSentence Classification	CodeCode Available	1
WriteViT: Handwritten Text Generation with Vision Transformer	May 19, 2025	Handwriting generationText Generation	CodeCode Available	1
FastCar: Cache Attentive Replay for Fast Auto-Regressive Video Generation on the Edge	May 17, 2025	Image GenerationScheduling	CodeCode Available	1
MatTools: Benchmarking Large Language Models for Materials Science Tools	May 16, 2025	BenchmarkingQuestion Answering	CodeCode Available	1
MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception	May 11, 2025	Emotion ClassificationLarge Language Model	CodeCode Available	1

Show:10 25 50

← PrevPage 5 of 107Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified