Text Generation

Text generation is the task of producing text that appears indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation".

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic evaluation metrics such as METEOR, ROUGE, and BLEU.
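To make the Markov-process approach concrete, here is a minimal sketch of a first-order (bigram) Markov chain text generator. The function names `build_bigram_model` and `generate`, the toy corpus, and the fixed random seed are illustrative assumptions and do not come from any of the listed papers; real systems use far larger corpora and usually higher-order or neural models.

```python
import random
from collections import defaultdict


def build_bigram_model(text):
    """Map each word to the list of words observed immediately after it."""
    words = text.split()
    model = defaultdict(list)
    for current, following in zip(words, words[1:]):
        model[current].append(following)
    return dict(model)


def generate(model, start, max_words=10, seed=0):
    """Sample a word sequence by repeatedly picking a random observed successor."""
    rng = random.Random(seed)  # fixed seed (an assumption) so runs are repeatable
    output = [start]
    while len(output) < max_words:
        successors = model.get(output[-1])
        if not successors:  # dead end: this word was never followed by another
            break
        output.append(rng.choice(successors))
    return " ".join(output)


# Toy corpus, purely illustrative.
corpus = "the cat sat on the mat and the dog sat on the rug"
model = build_bigram_model(corpus)
print(generate(model, start="the", max_words=8))
```

Because successors are stored with repetition, a word that follows another more often is proportionally more likely to be sampled, which is the maximum-likelihood bigram estimate without any smoothing.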

Papers

Showing 51–100 of 5335 papers

Title	Date	Tasks	Status	Hype
GEM: Empowering LLM for both Embedding Generation and Language Understanding	Jun 4, 2025	Decoder, Large Language Model	Unverified	0
Backbone Augmented Training for Adaptations	Jun 4, 2025	Image Generation, Text Generation	Unverified	0
Watermarking Degrades Alignment in Language Models: Analysis and Mitigation	Jun 4, 2025	Text Generation	Code Available	0
Advancing Decoding Strategies: Enhancements in Locally Typical Sampling for LLMs	Jun 3, 2025	Abstractive Text Summarization, Computational Efficiency	Unverified	0
HyperSteer: Activation Steering at Scale with Hypernetworks	Jun 3, 2025	Dictionary Learning, Text Generation	Code Available	2
How do Pre-Trained Models Support Software Engineering? An Empirical Study in Hugging Face	Jun 3, 2025	Code Generation, Text Generation	Unverified	0
The Reader is the Metric: How Textual Features and Reader Profiles Explain Conflicting Evaluations of AI Creative Writing	Jun 3, 2025	Feature Importance, Sentence	Code Available	0
Large Language Models for EEG: A Comprehensive Survey and Taxonomy	Jun 2, 2025	Diagnostic, EEG	Unverified	0
MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs	Jun 2, 2025	Instruction Following, Text Generation	Unverified	0
Probing Neural Topology of Large Language Models	Jun 1, 2025	Functional Connectivity, Graph Matching	Code Available	0
EEG2TEXT-CN: An Exploratory Study of Open-Vocabulary Chinese Text-EEG Alignment via Large Language Model and Contrastive Learning on ChineseEEG	Jun 1, 2025	Contrastive Learning, Decoder	Unverified	0
Neuro2Semantic: A Transfer Learning Framework for Semantic Reconstruction of Continuous Language from Human Intracranial EEG	May 31, 2025	EEG, Text Generation	Code Available	1
Multilingual Gloss-free Sign Language Translation: Towards Building a Sign Language Foundation Model	May 30, 2025	Gloss-free Sign Language Translation, Sign Language Translation	Code Available	0
Adaptive LoRA Merge with Parameter Pruning for Low-Resource Generation	May 30, 2025	Text Generation	Code Available	0
Guiding Generative Storytelling with Knowledge Graphs	May 30, 2025	Knowledge Graphs, RAG	Unverified	0
A Survey of Generative Categories and Techniques in Multimodal Large Language Models	May 29, 2025	Mixture-of-Experts, Self-Supervised Learning	Unverified	0
Large Language Model Meets Constraint Propagation	May 29, 2025	Language Modeling, Language Modelling	Unverified	0
MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection	May 29, 2025	image-classification, Image Classification	Unverified	0
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model	May 29, 2025	Decoder, Image Generation	Code Available	2
How Does Response Length Affect Long-Form Factuality	May 29, 2025	Form, Text Generation	Code Available	0
Document-Level Text Generation with Minimum Bayes Risk Decoding using Optimal Transport	May 29, 2025	Document Level Machine Translation, Image Captioning	Code Available	0
Discriminative Policy Optimization for Token-Level Reward Models	May 29, 2025	GSM8K, Language Modeling	Code Available	0
Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	May 28, 2025	Text Generation	Unverified	0
Universal Visuo-Tactile Video Understanding for Embodied Interaction	May 28, 2025	Friction, Large Language Model	Unverified	0
Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked Data	May 28, 2025	Machine Translation, Paraphrase Generation	Code Available	0
Structured Memory Mechanisms for Stable Context Representation in Large Language Models	May 28, 2025	Question Answering, Text Generation	Unverified	0
OmniAD: Detect and Understand Industrial Anomaly via Multimodal Reasoning	May 28, 2025	Anomaly Detection, Multimodal Reasoning	Unverified	0
LLM-Driven E-Commerce Marketing Content Optimization: Balancing Creativity and Conversion	May 27, 2025	Diversity, Marketing	Unverified	0
MAKIEval: A Multilingual Automatic WiKidata-based Framework for Cultural Awareness Evaluation for LLMs	May 27, 2025	Specificity, Text Generation	Code Available	0
Rethinking Text-based Protein Understanding: Retrieval or LLM?	May 26, 2025	Retrieval, Text Generation	Code Available	1
Enhancing Visual Reliance in Text Generation: A Bayesian Perspective on Mitigating Hallucination in Large Vision-Language Models	May 26, 2025	Hallucination, MME	Unverified	0
Monocle: Hybrid Local-Global In-Context Evaluation for Long-Text Generation with Uncertainty-Based Active Learning	May 26, 2025	Active Learning, In-Context Learning	Unverified	0
Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking	May 26, 2025	Language Modeling, Language Modelling	Code Available	0
Next Token Prediction Is a Dead End for Creativity	May 25, 2025	Prediction, Text Generation	Unverified	0
PatentScore: Multi-dimensional Evaluation of LLM-Generated Patent Claims	May 25, 2025	Text Generation	Unverified	0
Smoothie: Smoothing Diffusion on Token Embeddings for Text Generation	May 24, 2025	Semantic Similarity, Semantic Textual Similarity	Code Available	1
Writing Like the Best: Exemplar-Based Expository Text Generation	May 24, 2025	Text Generation	Code Available	0
Syn3DTxt: Embedding 3D Cues for Scene Text Generation	May 24, 2025	Text Generation	Code Available	0
TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis	May 23, 2025	Optical Character Recognition (OCR), Text Generation	Unverified	0
U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding	May 23, 2025	Benchmarking, Spatial Reasoning	Unverified	0
Towards Practical Defect-Focused Automated Code Review	May 23, 2025	Defect Detection, Text Generation	Unverified	0
LIFEBench: Evaluating Length Instruction Following in Large Language Models	May 22, 2025	Instruction Following, Text Generation	Code Available	0
CASTILLO: Characterizing Response Length Distributions of Large Language Models	May 22, 2025	Instruction Following, Language Modeling	Code Available	0
Can AI Read Between The Lines? Benchmarking LLMs On Financial Nuance	May 22, 2025	Benchmarking, Prompt Engineering	Unverified	0
Exploring the Relationship Between Diversity and Quality in Ad Text Generation	May 22, 2025	Diversity, Machine Translation	Unverified	0
Resource for Error Analysis in Text Simplification: New Taxonomy and Test Collection	May 22, 2025	Misinformation, Text Generation	Unverified	0
Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality	May 22, 2025	Abstractive Text Summarization, Informativeness	Code Available	0
DuFFin: A Dual-Level Fingerprinting Framework for LLMs IP Protection	May 22, 2025	Quantization, Safety Alignment	Code Available	0
AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios	May 22, 2025	Decision Making, Multi-class Classification	Code Available	0
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	May 21, 2025	Document Summarization, Hallucination	Unverified	0

Datasets: DART, COCO Captions, EMNLP2017 WMT, ReDial, CommonGen, ROCStories, Chinese Poems, Czech restaurant information, OpenWebText, SciQ, Yahoo Questions, ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified