Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1226–1250 of 5335 papers

Title	Date	Tasks	Status
ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation	Apr 11, 2025	DiversityLanguage Modeling	—Unverified
MedHal: An Evaluation Dataset for Medical Hallucination Detection	Apr 11, 2025	HallucinationNatural Language Inference	—Unverified
Large Language Models as Span Annotators	Apr 11, 2025	Data-to-Text GenerationMachine Translation	—Unverified
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Apr 10, 2025	Machine Translationnlg evaluation	—Unverified
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation	Apr 9, 2025	Text Generation	CodeCode Available
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Apr 7, 2025	GSM8KMath	—Unverified
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Apr 7, 2025	Information RetrievalLanguage Modeling	—Unverified
IMPersona: Evaluating Individual Level LM Impersonation	Apr 6, 2025	Text Generation	CodeCode Available
Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices	Apr 4, 2025	Text Generation	—Unverified
Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task	Apr 4, 2025	Marketingmultimodal generation	—Unverified
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models	Apr 4, 2025	GSM8KMathematical Reasoning	—Unverified
Align to Structure: Aligning Large Language Models with Structural Information	Apr 4, 2025	Document SummarizationText Generation	CodeCode Available
State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla	Apr 3, 2025	Data AugmentationText Generation	—Unverified
CoLa -- Learning to Interactively Collaborate with Large LMs	Apr 3, 2025	CoLAText Generation	—Unverified
Pel, A Programming Language for Orchestrating AI Agents	Apr 3, 2025	Code GenerationText Generation	—Unverified
LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation	Apr 2, 2025	DiagnosticMedical Report Generation	—Unverified
ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation	Apr 2, 2025	Machine TranslationText Generation	—Unverified
GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments	Apr 1, 2025	HallucinationText Generation	—Unverified
Repetitions are not all alike: distinct mechanisms sustain repetition in language models	Apr 1, 2025	AllIn-Context Learning	—Unverified
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations	Apr 1, 2025	ArticlesRAG	—Unverified
Synthesized Annotation Guidelines are Knowledge-Lite Boosters for Clinical Information Extraction	Apr 1, 2025	Few-Shot Learningnamed-entity-recognition	—Unverified
A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System	Apr 1, 2025	Dialogue GenerationEnsemble Learning	—Unverified
Multi-Agent LLM Judge: automatic personalized LLM judge design for evaluating natural language generation applications	Apr 1, 2025	Text Generation	—Unverified
Adaptive Layer-skipping in Pre-trained LLMs	Mar 31, 2025	Text Generation	—Unverified
Optimizing Humor Generation in Large Language Models: Temperature Configurations and Architectural Trade-offs	Mar 31, 2025	Model SelectionText Generation	—Unverified

Show:10 25 50

← PrevPage 50 of 214Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified