Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 176–200 of 5335 papers

Title	Date	Tasks	Status	Hype
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Apr 16, 2025	GSM8KMath	—Unverified	0
Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items	Apr 15, 2025	BenchmarkingMultiple-choice	—Unverified	0
Enhancing multimodal analogical reasoning with Logic Augmented Generation	Apr 15, 2025	Knowledge GraphsText Generation	CodeCode Available	0
Transferable text data distillation by trajectory matching	Apr 14, 2025	ARCLarge Language Model	—Unverified	0
ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models	Apr 14, 2025	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Joint Action Language Modelling for Transparent Policy Execution	Apr 14, 2025	Language ModellingText Generation	—Unverified	0
Parameterized Synthetic Text Generation with SimpleStories	Apr 12, 2025	DiversityLanguage Modeling	CodeCode Available	1
ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation	Apr 11, 2025	DiversityLanguage Modeling	—Unverified	0
MedHal: An Evaluation Dataset for Medical Hallucination Detection	Apr 11, 2025	HallucinationNatural Language Inference	—Unverified	0
Large Language Models as Span Annotators	Apr 11, 2025	Data-to-Text GenerationMachine Translation	—Unverified	0
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Apr 10, 2025	Machine Translationnlg evaluation	—Unverified	0
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation	Apr 9, 2025	Text Generation	CodeCode Available	0
An Empirical Study of GPT-4o Image Generation Capabilities	Apr 8, 2025	BenchmarkingImage Generation	CodeCode Available	1
Retrieval Augmented Generation with Collaborative Filtering for Personalized Text Generation	Apr 8, 2025	Collaborative FilteringContrastive Learning	CodeCode Available	1
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Apr 7, 2025	GSM8KMath	—Unverified	0
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Apr 7, 2025	Information RetrievalLanguage Modeling	—Unverified	0
IMPersona: Evaluating Individual Level LM Impersonation	Apr 6, 2025	Text Generation	CodeCode Available	0
MSL: Not All Tokens Are What You Need for Tuning LLM as a Recommender	Apr 5, 2025	AllLanguage Modeling	CodeCode Available	1
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models	Apr 4, 2025	GSM8KMathematical Reasoning	—Unverified	0
Align to Structure: Aligning Large Language Models with Structural Information	Apr 4, 2025	Document SummarizationText Generation	CodeCode Available	0
Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task	Apr 4, 2025	Marketingmultimodal generation	—Unverified	0
Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices	Apr 4, 2025	Text Generation	—Unverified	0
Pel, A Programming Language for Orchestrating AI Agents	Apr 3, 2025	Code GenerationText Generation	—Unverified	0
CoLa -- Learning to Interactively Collaborate with Large LMs	Apr 3, 2025	CoLAText Generation	—Unverified	0
State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla	Apr 3, 2025	Data AugmentationText Generation	—Unverified	0

Show:10 25 50

← PrevPage 8 of 214Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified