SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 20512100 of 5335 papers

TitleStatusHype
Generating with Confidence: Uncertainty Quantification for Black-box Large Language ModelsCode1
Grammar Prompting for Domain-Specific Language Generation with Large Language ModelsCode1
Controlled Text Generation with Hidden Representation TransformationsCode0
GlyphControl: Glyph Conditional Control for Visual Text GenerationCode2
Abstractive Summarization as Augmentation for Document-Level Event Detection0
Transformer Language Models Handle Word Frequency in Prediction Head0
A Critical Evaluation of Evaluations for Long-form Question AnsweringCode1
Perceived Trustworthiness of Natural Language Generators0
GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking0
KoSBi: A Dataset for Mitigating Social Bias Risks Towards Safer Large Language Model ApplicationCode2
Why Does Zero-Shot Cross-Lingual Generation Fail? An Explanation and a Solution0
A Unified Framework for Slot based Response Generation in a Multimodal Dialogue SystemCode0
AaKOS: Aspect-adaptive Knowledge-based Opinion Summarization0
Learning to Imagine: Visually-Augmented Natural Language GenerationCode0
Backpack Language ModelsCode1
CREST: A Joint Framework for Rationalization and Counterfactual Text GenerationCode0
AlignScore: Evaluating Factual Consistency with a Unified Alignment FunctionCode4
HowkGPT: Investigating the Detection of ChatGPT-generated University Student Homework through Context-Aware Perplexity Analysis0
Act Like a Radiologist: Radiology Report Generation across Anatomical RegionsCode1
Do GPTs Produce Less Literal Translations?Code0
EDM3: Event Detection as Multi-task Text GenerationCode0
MERGE: Fast Private Text GenerationCode0
RewriteLM: An Instruction-Tuned Large Language Model for Text Rewriting0
Response Generation in Longitudinal Dialogues: Which Knowledge Representation Helps?0
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and MitigationCode1
TOAST: Transfer Learning via Attention SteeringCode1
Balancing Effect of Training Dataset Distribution of Multiple Styles for Multi-Style Text Transfer0
Revisiting Sentence Union Generation as a Testbed for Text ConsolidationCode0
Peek Across: Improving Multi-Document Modeling via Cross-Document Question-AnsweringCode0
Alt-Text with Context: Improving Accessibility for Images on Twitter0
Faithful Low-Resource Data-to-Text Generation through Cycle TrainingCode0
MuLER: Detailed and Scalable Reference-based Evaluation0
Is GPT-4 a Good Data Analyst?Code1
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement TheoryCode1
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuningCode1
Not All Metrics Are Guilty: Improving NLG Evaluation by Diversifying ReferencesCode0
Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and EvaluationCode0
Active Learning for Natural Language Generation0
Identifying Informational Sources in News ArticlesCode0
In-Context Demonstration Selection with Cross Entropy Difference0
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG0
The ACL OCL Corpus: Advancing Open Science in Computational Linguistics0
A Survey of Diffusion Models in Natural Language Processing0
SAIL: Search-Augmented Instruction Learning0
Universal Self-Adaptive Prompting0
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language ModelsCode1
KNN-LM Does Not Improve Open-ended Text Generation0
Trade-Offs Between Fairness and Privacy in Language ModelingCode0
Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking ScenariosCode1
Gender Biases in Automatic Evaluation Metrics for Image CaptioningCode0
Show:102550
← PrevPage 42 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified