Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. The literature refers to this task more formally as "natural language generation" (NLG).

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
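As a rough illustration of the Markov-process approach mentioned above, the following is a minimal sketch of a first-order (bigram) word-level generator. The toy corpus and the function names `build_bigram_model` and `generate` are assumptions made for this example and do not come from any paper listed on this page.

```python
import random
from collections import defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens that follow it in the corpus."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    # Convert to a plain dict so later lookups never create new keys.
    return dict(model)


def generate(model, start, max_tokens=10, seed=None):
    """Sample a token sequence by repeatedly picking a random successor."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_tokens:
        successors = model.get(output[-1])
        if not successors:  # dead end: the last token has no known successor
            break
        # Duplicates in the list make frequent successors more likely.
        output.append(rng.choice(successors))
    return output


# Hypothetical toy corpus, used only to show the mechanics.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
print(" ".join(generate(model, "the", max_tokens=8, seed=0)))
```

Each next word depends only on the current word, which is why such models produce locally plausible but globally incoherent text; the neural methods named above condition on much longer contexts.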

Papers

Showing 1201–1250 of 5335 papers

Title	Date	Tasks	Status
A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories	May 2, 2025	Code Generation, Text Generation	Unverified
Ensuring Reproducibility in Generative AI Systems for General Use Cases: A Framework for Regression Testing and Open Datasets	May 2, 2025	Code Generation, GPR	Code Available
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts	May 2, 2025	Linguistic steganography, Text Generation	Unverified
Graph Synthetic Out-of-Distribution Exposure with Large Language Models	Apr 29, 2025	Out of Distribution (OOD) Detection, Text Generation	Unverified
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts	Apr 29, 2025	All, Diversity	Unverified
YoChameleon: Personalized Vision and Language Generation	Apr 29, 2025	Image Generation, Text Generation	Unverified
Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models	Apr 29, 2025	Diversity, Sensitivity	Unverified
A Platform for Generating Educational Activities to Teach English as a Second Language	Apr 28, 2025	Text Generation	Unverified
Anyprefer: An Agentic Framework for Preference Data Synthesis	Apr 27, 2025	Medical Image Analysis, Text Generation	Unverified
TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation	Apr 25, 2025	Attribute, Text Generation	Unverified
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Apr 24, 2025	Image Generation, Text Generation	Unverified
How Effective are Generative Large Language Models in Performing Requirements Classification?	Apr 23, 2025	Classification, Text Generation	Unverified
Distilling semantically aware orders for autoregressive image generation	Apr 23, 2025	Image Generation, Text Generation	Unverified
ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs	Apr 23, 2025	Decision Making, Knowledge Graphs	Code Available
(Im)possibility of Automated Hallucination Detection in Large Language Models	Apr 23, 2025	Hallucination, Language Identification	Unverified
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering	Apr 20, 2025	counterfactual, Fairness	Unverified
FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models	Apr 20, 2025	Descriptive, Ethics	Unverified
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation	Apr 20, 2025	Attribute, Image Segmentation	Unverified
Density Measures for Language Generation	Apr 19, 2025	Hallucination, Text Generation	Unverified
Sparks of Science: Hypothesis Generation Using Structured Paper Data	Apr 17, 2025	Language Modelling, Text Generation	Unverified
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Apr 16, 2025	GSM8K, Math	Unverified
Enhancing multimodal analogical reasoning with Logic Augmented Generation	Apr 15, 2025	Knowledge Graphs, Text Generation	Code Available
Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items	Apr 15, 2025	Benchmarking, Multiple-choice	Unverified
Joint Action Language Modelling for Transparent Policy Execution	Apr 14, 2025	Language Modelling, Text Generation	Unverified
Transferable text data distillation by trajectory matching	Apr 14, 2025	ARC, Large Language Model	Unverified
ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation	Apr 11, 2025	Diversity, Language Modeling	Unverified
MedHal: An Evaluation Dataset for Medical Hallucination Detection	Apr 11, 2025	Hallucination, Natural Language Inference	Unverified
Large Language Models as Span Annotators	Apr 11, 2025	Data-to-Text Generation, Machine Translation	Unverified
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Apr 10, 2025	Machine Translation, nlg evaluation	Unverified
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation	Apr 9, 2025	Text Generation	Code Available
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Apr 7, 2025	GSM8K, Math	Unverified
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Apr 7, 2025	Information Retrieval, Language Modeling	Unverified
IMPersona: Evaluating Individual Level LM Impersonation	Apr 6, 2025	Text Generation	Code Available
Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices	Apr 4, 2025	Text Generation	Unverified
Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task	Apr 4, 2025	Marketing, multimodal generation	Unverified
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models	Apr 4, 2025	GSM8K, Mathematical Reasoning	Unverified
Align to Structure: Aligning Large Language Models with Structural Information	Apr 4, 2025	Document Summarization, Text Generation	Code Available
State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla	Apr 3, 2025	Data Augmentation, Text Generation	Unverified
CoLa -- Learning to Interactively Collaborate with Large LMs	Apr 3, 2025	CoLA, Text Generation	Unverified
Pel, A Programming Language for Orchestrating AI Agents	Apr 3, 2025	Code Generation, Text Generation	Unverified
LVMed-R2: Perception and Reflection-driven Complex Reasoning for Medical Report Generation	Apr 2, 2025	Diagnostic, Medical Report Generation	Unverified
ContrastScore: Towards Higher Quality, Less Biased, More Efficient Evaluation Metrics with Contrastive Evaluation	Apr 2, 2025	Machine Translation, Text Generation	Unverified
GraphMaster: Automated Graph Synthesis via LLM Agents in Data-Limited Environments	Apr 1, 2025	Hallucination, Text Generation	Unverified
Repetitions are not all alike: distinct mechanisms sustain repetition in language models	Apr 1, 2025	All, In-Context Learning	Unverified
ScholarCopilot: Training Large Language Models for Academic Writing with Accurate Citations	Apr 1, 2025	Articles, RAG	Unverified
Synthesized Annotation Guidelines are Knowledge-Lite Boosters for Clinical Information Extraction	Apr 1, 2025	Few-Shot Learning, named-entity-recognition	Unverified
A Unified Virtual Mixture-of-Experts Framework:Enhanced Inference and Hallucination Mitigation in Single-Model System	Apr 1, 2025	Dialogue Generation, Ensemble Learning	Unverified
Multi-Agent LLM Judge: automatic personalized LLM judge design for evaluating natural language generation applications	Apr 1, 2025	Text Generation	Unverified
Adaptive Layer-skipping in Pre-trained LLMs	Mar 31, 2025	Text Generation	Unverified
Optimizing Humor Generation in Large Language Models: Temperature Configurations and Architectural Trade-offs	Mar 31, 2025	Model Selection, Text Generation	Unverified


Datasets: DART, COCO Captions, EMNLP2017 WMT, ReDial, CommonGen, ROCStories, Chinese Poems, Czech restaurant information, OpenWebText, SciQ, Yahoo Questions, ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified