Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation".

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include Transformer-based models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic evaluation metrics such as METEOR, ROUGE, and BLEU.
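As a rough illustration of the simplest approach mentioned above, the sketch below builds a first-order Markov (bigram) model from a toy corpus, samples text from it, and scores the sample with clipped n-gram precision, which is the building block of BLEU. It is not full BLEU: there is no brevity penalty and no averaging over n-gram orders. The function names and the toy corpus are hypothetical and chosen only for this example.

```python
import random
from collections import Counter, defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return model


def generate(model, start, max_tokens, rng):
    """Sample a first-order Markov chain, stopping early at a dead end."""
    output = [start]
    while len(output) < max_tokens:
        candidates = model.get(output[-1])
        if not candidates:
            break  # the last token was never followed by anything in the corpus
        output.append(rng.choice(candidates))
    return output


def clipped_ngram_precision(candidate, reference, n):
    """Share of candidate n-grams found in the reference, with counts clipped."""
    def ngrams(seq):
        return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / total


# Toy corpus, invented for illustration only.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
sample = generate(model, "the", 8, random.Random(0))
print(" ".join(sample))
print(clipped_ngram_precision(sample, corpus, 2))
```

Because the sample is drawn from the same corpus it is scored against, the bigram precision will be high; a meaningful evaluation would score generated text against held-out references.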

Papers

Title	Date	Tasks	Status	Hype
Generative AI for Hate Speech Detection: Evaluation and Findings	Nov 16, 2023	Hate Speech Detection, Text Generation	Unverified	0
AMRFact: Enhancing Summarization Factuality Evaluation with AMR-Driven Negative Samples Generation	Nov 16, 2023	Abstractive Text Summarization, Natural Language Inference	Code Available	0
Language and Task Arithmetic with Parameter-Efficient Layers for Zero-Shot Summarization	Nov 15, 2023	Cross-Lingual Transfer, parameter-efficient fine-tuning	Unverified	0
LLMRefine: Pinpointing and Refining Large Language Models via Fine-Grained Actionable Feedback	Nov 15, 2023	Long Form Question Answering, Machine Translation	Unverified	0
Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset	Nov 15, 2023	Bias Detection, Text Generation	Unverified	0
MAP's not dead yet: Uncovering true language model modes by conditioning away degeneracy	Nov 15, 2023	Instruction Following, Language Modeling	Unverified	0
HeLM: Highlighted Evidence augmented Language Model for Enhanced Table-to-Text Generation	Nov 15, 2023	Language Modeling, Language Modelling	Unverified	0
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification	Nov 15, 2023	Hallucination, Retrieval	Code Available	0
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects	Nov 15, 2023	Dialogue Generation, Language Modelling	Unverified	0
Towards Verifiable Text Generation with Symbolic References	Nov 15, 2023	Question Answering, Text Generation	Unverified	0
Contrastive Transformer Learning with Proximity Data Generation for Text-Based Person Search	Nov 15, 2023	Contrastive Learning, Cross-Modal Retrieval	Code Available	0
Token Prediction as Implicit Classification to Identify LLM-Generated Text	Nov 15, 2023	Classification, text-classification	Code Available	1
How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection	Nov 14, 2023	Instruction Following, Large Language Model	Code Available	0
AI-generated text boundary detection with RoFT	Nov 14, 2023	Boundary Detection, Diversity	Code Available	1
UT5: Pretraining Non autoregressive T5 with unrolled denoising	Nov 14, 2023	Denoising, Question Generation	Unverified	0
Insights into Classifying and Mitigating LLMs' Hallucinations	Nov 14, 2023	Hallucination, Machine Translation	Unverified	0
RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge	Nov 14, 2023	counterfactual, Knowledge Graphs	Unverified	0
REST: Retrieval-Based Speculative Decoding	Nov 14, 2023	Language Modeling, Language Modelling	Code Available	2
Semantically Grounded QFormer for Efficient Vision Language Understanding	Nov 13, 2023	Diversity, Image to text	Unverified	0
Controlled Text Generation for Black-box Language Models via Score-based Progressive Editor	Nov 13, 2023	Language Modeling, Language Modelling	Code Available	0
LM-Polygraph: Uncertainty Estimation for Language Models	Nov 13, 2023	Text Generation	Unverified	0
Learning Globally Optimized Language Structure via Adversarial Training	Nov 12, 2023	Adversarial Attack, Text Generation	Unverified	0
Testing LLMs on Code Generation with Varying Levels of Prompt Specificity	Nov 10, 2023	Code Generation, Specificity	Unverified	0
Tamil-Llama: A New Tamil Language Model Based on Llama 2	Nov 10, 2023	Language Modeling, Language Modelling	Code Available	2
All Should Be Equal in the Eyes of Language Models: Counterfactually Aware Fair Text Generation	Nov 9, 2023	All, Fairness	Code Available	0
The Iron(ic) Melting Pot: Reviewing Human Evaluation in Humour, Irony and Sarcasm Generation	Nov 9, 2023	Text Generation	Unverified	0
Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations	Nov 9, 2023	Text Generation	Unverified	0
Model-Based Minimum Bayes Risk Decoding for Text Generation	Nov 9, 2023	Decoder, Text Generation	Code Available	0
TencentLLMEval: A Hierarchical Evaluation of Real-World Capabilities for Human-Aligned LLMs	Nov 9, 2023	Benchmarking, Question Answering	Code Available	1
SEMQA: Semi-Extractive Multi-Source Question Answering	Nov 8, 2023	Attribute, Long Form Question Answering	Code Available	1
Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO	Nov 8, 2023	Quantization, Text Generation	Code Available	4
Prompt Sketching for Large Language Models	Nov 8, 2023	Arithmetic Reasoning, Benchmarking	Unverified	0
Input Reconstruction Attack against Vertical Federated Large Language Models	Nov 7, 2023	Federated Learning, GPU	Unverified	0
Which is better? Exploring Prompting Strategy For LLM-based Metrics	Nov 7, 2023	Text Generation	Code Available	1
Multitask Multimodal Prompted Training for Interactive Embodied Task Completion	Nov 7, 2023	Decoder, Text Generation	Unverified	0
Adapting Pre-trained Generative Models for Extractive Question Answering	Nov 6, 2023	Extractive Question-Answering, Long Form Question Answering	Unverified	0
AnyText: Multilingual Visual Text Generation And Editing	Nov 6, 2023	Image Generation, Optical Character Recognition (OCR)	Code Available	4
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding	Nov 6, 2023	CoLA, Question Answering	Unverified	0
Successor Features for Efficient Multisubject Controlled Text Generation	Nov 3, 2023	Computational Efficiency, Language Modelling	Unverified	0
Grounded Intuition of GPT-Vision's Abilities with Scientific Images	Nov 3, 2023	Benchmarking, counterfactual	Code Available	0
The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization	Nov 2, 2023	Text Generation, Text Summarization	Unverified	0
Divergent Token Metrics: Measuring degradation to prune away LLM components -- and optimize quantization	Nov 2, 2023	Management, Model Compression	Unverified	0
Form follows Function: Text-to-Text Conditional Graph Generation based on Functional Requirements	Nov 1, 2023	Form, Graph Generation	Unverified	0
Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models	Nov 1, 2023	Clinical Knowledge, Diversity	Code Available	1
Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs	Nov 1, 2023	Benchmarking, Question Answering	Unverified	0
Probing Explicit and Implicit Gender Bias through LLM Conditional Text Generation	Nov 1, 2023	Conditional Text Generation, Fairness	Unverified	0
HWD: A Novel Evaluation Score for Styled Handwritten Text Generation	Oct 31, 2023	Image Generation, Perceptual Distance	Code Available	1
Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks	Oct 30, 2023	Fairness, Math	Code Available	0
The Eval4NLP 2023 Shared Task on Prompting Large Language Models as Explainable Metrics	Oct 30, 2023	Machine Translation, Text Generation	Code Available	1
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing	Oct 30, 2023	Language Modelling, Multi-Task Learning	Unverified	0

Datasets: DART, COCO Captions, EMNLP2017 WMT, ReDial, CommonGen, ROCStories, Chinese Poems, Czech restaurant information, OpenWebText, SciQ, Yahoo Questions, ADGEN

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5B Baseline	BLEU	48.74	—	Unverified
2	FactT5B	BLEU	48.37	—	Unverified
3	JointGT Baseline	BLEU	47.51	—	Unverified
4	FactJointGT	BLEU	47.39	—	Unverified
5	Control Prefixes (T5-large)	METEOR	0.41	—	Unverified
6	T5	METEOR	0.12	—	Unverified
7	BART	METEOR	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.95	—	Unverified
2	partGAN	BLEU-2	0.91	—	Unverified
3	RankGAN	BLEU-2	0.85	—	Unverified
4	RelGAN (100)	BLEU-2	0.85	—	Unverified
5	SeqGAN	BLEU-2	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LeakGAN	BLEU-2	0.96	—	Unverified
2	PPOGAN	BLEU-2	0.91	—	Unverified
3	RelGAN	BLEU-2	0.88	—	Unverified
4	SeqGAN	BLEU-2	0.86	—	Unverified
5	RankGAN	BLEU-2	0.78	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniCRS	Distinct-3	0.65	—	Unverified
2	CRFR	Distinct-3	0.52	—	Unverified
3	KGSF	Distinct-3	0.43	—	Unverified
4	C2CRS	Distinct-3	0.33	—	Unverified
5	KBRD	Distinct-3	0.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UniLM	CIDEr	14.92	—	Unverified
2	BART (TextBox 2.0)	CIDEr	12.98	—	Unverified
3	BART	METEOR	0.3	—	Unverified
4	T5	METEOR	0.29	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Beam search + A*esque (beam)	BLEU-1	34.4	—	Unverified
2	Beam search + A*esque (sample)	BLEU-1	34.4	—	Unverified
3	Beam search + A*esque (greedy)	BLEU-1	34.3	—	Unverified
4	Beam search	BLEU-1	33.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	RankGAN	BLEU-2	0.81	—	Unverified
2	SeqGAN	BLEU-2	0.74	—	Unverified
3	LeakGAN	BLEU-2	0.46	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TGen++	METEOR	0.17	—	Unverified
2	TGen	METEOR	0.15	—	Unverified
3	TGen+	METEOR	0.15	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT2-124M	eval_loss	3.12	—	Unverified
2	GPT2-81M-LOOP	eval_loss	3.11	—	Unverified
3	GPT2-Hermite	eval_loss	2.91	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LLaMA-65B+CFG (zero-shot)	Accuracy	96.6	—	Unverified
2	LLaMA-30B+CFG (zero-shot)	Accuracy	96.4	—	Unverified
3	LLaMA-13B+CFG (zero-shot)	Accuracy	95.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CNN-VAE	NLL	332.1	—	Unverified
2	SA-VAE	NLL	327.5	—	Unverified
3	Aggressive VAE	NLL	326.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	10.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	STWGAN-GP	BLEU-3	0.62	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PALM	ROUGE-L	41.41	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	64.34	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AEM+Attention	BLEU-1	14.17	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4	ASR	65.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	ROUGE-L	42.96	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Graph2Seq	BLEU	22	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WGANGP + DGflow	JS-4	0.19	—	Unverified