Text Generation

Text generation is the task of producing text that appears indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation".

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
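
As a rough illustration of the simplest approach mentioned above, the following sketch builds a first-order (bigram) Markov chain from a toy corpus, samples text from it, and scores the sample with clipped n-gram precision, which is the building block of BLEU rather than the full metric (no brevity penalty, no averaging over n-gram orders). The corpus and the function names (`build_bigram_model`, `generate`, `clipped_ngram_precision`) are assumptions made for this example and do not come from any paper listed here.

```python
import random
from collections import Counter, defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return model


def generate(model, start, max_tokens, rng):
    """Sample a token sequence by walking the first-order Markov chain."""
    output = [start]
    while len(output) < max_tokens:
        candidates = model.get(output[-1])
        if not candidates:  # dead end: this token was never followed by anything
            break
        # Duplicates in the list make frequent continuations more likely.
        output.append(rng.choice(candidates))
    return output


def clipped_ngram_precision(candidate, reference, n):
    """Modified n-gram precision: a component of BLEU, not the full score."""
    def ngrams(tokens):
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    cand_counts, ref_counts = ngrams(candidate), ngrams(reference)
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    matched = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return matched / total


# Toy corpus (hypothetical, for illustration only).
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
sample = generate(model, "the", 8, random.Random(0))
print(" ".join(sample))
print(clipped_ngram_precision(sample, corpus, 2))
```

Every adjacent token pair in the sample occurs somewhere in the corpus, so the text is locally plausible but has no long-range coherence, which is the main limitation that LSTM and Transformer models address. For real evaluation, an established BLEU, ROUGE, or METEOR implementation is preferable to this sketch.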

Papers

Showing 1951–2000 of 5335 papers

Title	Date	Tasks	Status
QCQA: Quality and Capacity-aware grouped Query Attention	Jun 8, 2024	Text Generation	Unverified
Extroversion or Introversion? Controlling The Personality of Your Large Language Models	Jun 7, 2024	Text Generation	Code Available
On Subjective Uncertainty Quantification and Calibration in Natural Language Generation	Jun 7, 2024	In-Context Learning, Machine Translation	Code Available
Annotating FrameNet via Structure-Conditioned Language Generation	Jun 7, 2024	Data Augmentation, Semantic Role Labeling	Code Available
Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMs	Jun 6, 2024	Attribute, Text Generation	Code Available
Evaluating Durability: Benchmark Insights into Multimodal Watermarking	Jun 6, 2024	Text Generation	Unverified
End-to-End Trainable Retrieval-Augmented Generation for Relation Extraction	Jun 6, 2024	Relation, Relation Extraction	Unverified
Effective Context Selection in LLM-based Leaderboard Generation: An Empirical Study	Jun 6, 2024	Articles, Natural Language Inference	Unverified
Uncovering Limitations of Large Language Models in Information Seeking from Tables	Jun 6, 2024	Single Choice Question, Text Generation	Code Available
BEADs: Bias Evaluation Across Domains	Jun 6, 2024	Benchmarking, Bias Detection	Unverified
Confabulation: The Surprising Value of Large Language Model Hallucinations	Jun 6, 2024	Hallucination, Language Modeling	Unverified
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches	Jun 5, 2024	Chatbot, Information Retrieval	Unverified
PatentEval: Understanding Errors in Patent Generation	Jun 5, 2024	Abstract generation, Text Generation	Code Available
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework	Jun 5, 2024	Fact Checking, Hallucination	Unverified
CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMs	Jun 5, 2024	Clustering, Natural Language Inference	Code Available
AD-H: Autonomous Driving with Hierarchical Agents	Jun 5, 2024	Autonomous Driving, Text Generation	Code Available
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback	Jun 4, 2024	reinforcement-learning, Reinforcement Learning	Unverified
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data	Jun 4, 2024	Mathematical Reasoning, Text Generation	Unverified
OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step	Jun 4, 2024	Language Modeling, Language Modelling	Unverified
Order-Independence Without Fine Tuning	Jun 4, 2024	Language Modelling, Multiple-choice	Code Available
The current status of large language models in summarizing radiology report impressions	Jun 4, 2024	Text Generation	Unverified
Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation	Jun 3, 2024	Text Generation	Unverified
Layout Agnostic Scene Text Image Synthesis with Diffusion Models	Jun 3, 2024	Diversity, Image Generation	Unverified
Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation	Jun 3, 2024	Question Answering, Text Generation	Code Available
Role-playing Prompt Framework: Generation and Evaluation	Jun 2, 2024	Natural Language Understanding, Text Generation	Unverified
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models	Jun 2, 2024	Language Modelling, Text Generation	Unverified
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning	Jun 2, 2024	Logical Reasoning, Reading Comprehension	Unverified
The Power of Summary-Source Alignments	Jun 2, 2024	Document Summarization, Multi-Document Summarization	Code Available
Improving Text Generation on Images with Synthetic Captions	Jun 1, 2024	Optical Character Recognition (OCR), Text Generation	Unverified
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models	Jun 1, 2024	Fairness, Instruction Following	Unverified
Evaluating Large Language Model Biases in Persona-Steered Generation	May 30, 2024	Language Modeling, Language Modelling	Code Available
XPrompt: Explaining Large Language Model's Generation via Joint Prompt Attribution	May 30, 2024	Combinatorial Optimization, counterfactual	Unverified
Phantom: General Trigger Attacks on Retrieval Augmented Language Generation	May 30, 2024	Adversarial Text, Chatbot	Unverified
Hidden in Plain Sight: Exploring Chat History Tampering in Interactive Language Models	May 30, 2024	Text Generation	Unverified
WRDScore: New Metric for Evaluation of Natural Language Generation Models	May 29, 2024	Method name prediction, Text Generation	Code Available
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models	May 29, 2024	Language Modelling, SST-2	Unverified
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension	May 29, 2024	Machine Reading Comprehension, RAG	Unverified
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities	May 29, 2024	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation	May 29, 2024	Text Generation	Unverified
MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding	May 28, 2024	Brain Decoding, Image Generation	Unverified
On the Sequence Evaluation based on Stochastic Processes	May 28, 2024	Coherence Evaluation, Contrastive Learning	Unverified
Are PPO-ed Language Models Hackable?	May 28, 2024	Text Generation	Unverified
Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities	May 28, 2024	Chatbot, Text Generation	Unverified
A System for Automatic English Text Expansion	May 28, 2024	News Generation, Text Generation	Unverified
Glauber Generative Model: Discrete Diffusion Models via Binary Classification	May 27, 2024	Binary Classification, Denoising	Unverified
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation	May 27, 2024	Decision Making, model	Unverified
On the Noise Robustness of In-Context Learning for Text Generation	May 27, 2024	In-Context Learning, text-classification	Code Available
UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models	May 27, 2024	Decoder, Diagnostic	Unverified
On Understanding Attention-Based In-Context Learning for Categorical Data	May 27, 2024	Few-Shot Learning, image-classification	Unverified
Augmenting Textual Generation via Topology Aware Retrieval	May 27, 2024	RAG, Retrieval	Unverified

Datasets: DART, COCO Captions, EMNLP2017 WMT, ReDial, CommonGen, ROCStories, Chinese Poems, Czech restaurant information, OpenWebText, SciQ, Yahoo Questions, ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified