Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 5335 papers

Title	Date	Tasks	Status	Hype
A Rusty Link in the AI Supply Chain: Detecting Evil Configurations in Model Repositories	May 2, 2025	Code GenerationText Generation	—Unverified	0
A Character-based Diffusion Embedding Algorithm for Enhancing the Generation Quality of Generative Linguistic Steganographic Texts	May 2, 2025	Linguistic steganographyText Generation	—Unverified	0
UniBiomed: A Universal Foundation Model for Grounded Biomedical Image Interpretation	Apr 30, 2025	DiagnosticLarge Language Model	CodeCode Available	1
Graph Synthetic Out-of-Distribution Exposure with Large Language Models	Apr 29, 2025	Out of Distribution (OOD) DetectionText Generation	—Unverified	0
YoChameleon: Personalized Vision and Language Generation	Apr 29, 2025	Image GenerationText Generation	—Unverified	0
Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models	Apr 29, 2025	DiversitySensitivity	—Unverified	0
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts	Apr 29, 2025	AllDiversity	—Unverified	0
Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding	Apr 29, 2025	Code GenerationDensity Estimation	CodeCode Available	1
A Platform for Generating Educational Activities to Teach English as a Second Language	Apr 28, 2025	Text Generation	—Unverified	0
Anyprefer: An Agentic Framework for Preference Data Synthesis	Apr 27, 2025	Medical Image AnalysisText Generation	—Unverified	0
TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation	Apr 25, 2025	AttributeText Generation	—Unverified	0
Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Apr 24, 2025	Image GenerationText Generation	—Unverified	0
LLMSR@XLLM25: Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation	Apr 23, 2025	Text Generation	CodeCode Available	1
ConTextual: Improving Clinical Text Summarization in LLMs with Context-preserving Token Filtering and Knowledge Graphs	Apr 23, 2025	Decision MakingKnowledge Graphs	CodeCode Available	0
(Im)possibility of Automated Hallucination Detection in Large Language Models	Apr 23, 2025	HallucinationLanguage Identification	—Unverified	0
How Effective are Generative Large Language Models in Performing Requirements Classification?	Apr 23, 2025	ClassificationText Generation	—Unverified	0
Distilling semantically aware orders for autoregressive image generation	Apr 23, 2025	Image GenerationText Generation	—Unverified	0
Retrieval Augmented Generation Evaluation in the Era of Large Language Models: A Comprehensive Survey	Apr 21, 2025	Computational EfficiencyInformation Retrieval	CodeCode Available	2
AlignRAG: Leveraging Critique Learning for Evidence-Sensitive Retrieval-Augmented Reasoning	Apr 21, 2025	RAGRetrieval	CodeCode Available	1
LGD: Leveraging Generative Descriptions for Zero-Shot Referring Image Segmentation	Apr 20, 2025	AttributeImage Segmentation	—Unverified	0
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering	Apr 20, 2025	counterfactualFairness	—Unverified	0
FarsEval-PKBETS: A new diverse benchmark for evaluating Persian large language models	Apr 20, 2025	DescriptiveEthics	—Unverified	0
Understanding the Repeat Curse in Large Language Models from a Feature Perspective	Apr 19, 2025	Text Generation	CodeCode Available	1
Density Measures for Language Generation	Apr 19, 2025	HallucinationText Generation	—Unverified	0
Sparks of Science: Hypothesis Generation Using Structured Paper Data	Apr 17, 2025	Language ModellingText Generation	—Unverified	0
Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Apr 16, 2025	GSM8KMath	—Unverified	0
Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items	Apr 15, 2025	BenchmarkingMultiple-choice	—Unverified	0
Enhancing multimodal analogical reasoning with Logic Augmented Generation	Apr 15, 2025	Knowledge GraphsText Generation	CodeCode Available	0
Transferable text data distillation by trajectory matching	Apr 14, 2025	ARCLarge Language Model	—Unverified	0
ReasonDrive: Efficient Visual Question Answering for Autonomous Vehicles with Reasoning-Enhanced Small Vision-Language Models	Apr 14, 2025	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Joint Action Language Modelling for Transparent Policy Execution	Apr 14, 2025	Language ModellingText Generation	—Unverified	0
Parameterized Synthetic Text Generation with SimpleStories	Apr 12, 2025	DiversityLanguage Modeling	CodeCode Available	1
ELSA: A Style Aligned Dataset for Emotionally Intelligent Language Generation	Apr 11, 2025	DiversityLanguage Modeling	—Unverified	0
MedHal: An Evaluation Dataset for Medical Hallucination Detection	Apr 11, 2025	HallucinationNatural Language Inference	—Unverified	0
Large Language Models as Span Annotators	Apr 11, 2025	Data-to-Text GenerationMachine Translation	—Unverified	0
DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?	Apr 10, 2025	Machine Translationnlg evaluation	—Unverified	0
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation	Apr 9, 2025	Text Generation	CodeCode Available	0
An Empirical Study of GPT-4o Image Generation Capabilities	Apr 8, 2025	BenchmarkingImage Generation	CodeCode Available	1
Retrieval Augmented Generation with Collaborative Filtering for Personalized Text Generation	Apr 8, 2025	Collaborative FilteringContrastive Learning	CodeCode Available	1
Synthetic Data Generation & Multi-Step RL for Reasoning & Tool Use	Apr 7, 2025	GSM8KMath	—Unverified	0
Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Apr 7, 2025	Information RetrievalLanguage Modeling	—Unverified	0
IMPersona: Evaluating Individual Level LM Impersonation	Apr 6, 2025	Text Generation	CodeCode Available	0
MSL: Not All Tokens Are What You Need for Tuning LLM as a Recommender	Apr 5, 2025	AllLanguage Modeling	CodeCode Available	1
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models	Apr 4, 2025	GSM8KMathematical Reasoning	—Unverified	0
Align to Structure: Aligning Large Language Models with Structural Information	Apr 4, 2025	Document SummarizationText Generation	CodeCode Available	0
Stance-Driven Multimodal Controlled Statement Generation: New Dataset and Task	Apr 4, 2025	Marketingmultimodal generation	—Unverified	0
Evaluating Compact LLMs for Zero-Shot Iberian Language Tasks on End-User Devices	Apr 4, 2025	Text Generation	—Unverified	0
Pel, A Programming Language for Orchestrating AI Agents	Apr 3, 2025	Code GenerationText Generation	—Unverified	0
CoLa -- Learning to Interactively Collaborate with Large LMs	Apr 3, 2025	CoLAText Generation	—Unverified	0
State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla	Apr 3, 2025	Data AugmentationText Generation	—Unverified	0

Show:10 25 50

← PrevPage 4 of 107Next →

All datasets DART COCO Captions EMNLP2017 WMT ReDial CommonGen ROCStories Chinese Poems Czech restaurant information OpenWebText SciQ Yahoo Questions ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified