SOTAVerified

Text Generation

Text generation is the task of producing text that appears indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation" (NLG).

Text generation can be addressed with Markov processes or with deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or through automatic metrics such as METEOR, ROUGE, and BLEU.
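As a rough illustration of the simplest approach mentioned above, the sketch below builds a first-order Markov (bigram) model from a token list and samples from it. It also includes a clipped n-gram precision function, which is the per-order building block of BLEU but not full BLEU (there is no brevity penalty and no geometric mean over n-gram orders). All function names here are hypothetical and chosen only for this example.

```python
import random
from collections import Counter, defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed immediately after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return model


def generate(model, start, max_tokens=10, seed=0):
    """Sample a token sequence from the first-order Markov chain."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_tokens:
        candidates = model.get(output[-1])
        if not candidates:  # dead end: this token was never followed by anything
            break
        # Duplicates in the list make frequent successors more likely.
        output.append(rng.choice(candidates))
    return output


def clipped_ngram_precision(candidate, reference, n=2):
    """Fraction of candidate n-grams found in the reference, with counts clipped."""
    def ngrams(seq):
        return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / total


if __name__ == "__main__":
    corpus = "the cat sat on the mat".split()
    model = build_bigram_model(corpus)
    sample = generate(model, "the", max_tokens=6, seed=1)
    print(" ".join(sample))
    print(clipped_ngram_precision(sample, corpus, n=2))
```

Because the model only recombines bigrams seen in the corpus, every generated bigram also occurs in the reference. This hints at why n-gram overlap alone is a weak proxy for quality and why human ratings are used alongside automatic metrics.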

(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 1201-1250 of 5335 papers

Title | Status | Hype
A Gold Standard Methodology for Evaluating Accuracy in Data-To-Text Systems | Code | 0
CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language Generations | Code | 0
CEval: A Benchmark for Evaluating Counterfactual Text Generation | Code | 0
Artificial Interrogation for Attributing Language Models | Code | 0
Large Language Models as Sous Chefs: Revising Recipes with GPT-3 | Code | 0
CERET: Cost-Effective Extrinsic Refinement for Text Generation | Code | 0
Argument Undermining: Counter-Argument Generation by Attacking Weak Premises | Code | 0
Understanding Jargon: Combining Extraction and Generation for Definition Modeling | Code | 0
Language Model Sentence Completion with a Parser-Driven Rhetorical Control Method | Code | 0
Language Generation via Combinatorial Constraint Satisfaction: A Tree Search Enhanced Monte-Carlo Approach | Code | 0
Language Generation with Recurrent Generative Adversarial Networks without Pre-training | Code | 0
Causal ATE Mitigates Unintended Bias in Controlled Text Generation | Code | 0
CatVRNN: Generating Category Texts via Multi-task Learning | Code | 0
Argumentative Text Generation in Economic Domain | Code | 0
CatGAN: Category-aware Generative Adversarial Networks with Hierarchical Evolutionary Learning for Category Text Generation | Code | 0
Language Detoxification with Attribute-Discriminative Latent Space | Code | 0
A Brief Study on the Effects of Training Generative Dialogue Models with a Semantic loss | Code | 0
CASTILLO: Characterizing Response Length Distributions of Large Language Models | Code | 0
Lagging Inference Networks and Posterior Collapse in Variational Autoencoders | Code | 0
Language GANs Falling Short | Code | 0
Language Models can Evaluate Themselves via Probability Discrepancy | Code | 0
KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server | Code | 0
Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? | Code | 0
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation Metrics | Code | 0
TIGS: An Inference Algorithm for Text Infilling with Gradient Search | Code | 0
Know3-RAG: A Knowledge-aware RAG Framework with Adaptive Retrieval, Generation, and Filtering | Code | 0
Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling | Code | 0
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation | Code | 0
A General Benchmarking Framework for Text Generation | Code | 0
Can Large Language Models Generate High-quality Patent Claims? | Code | 0
Automated Chess Commentator Powered by Neural Chess Engine | Code | 0
Key Fact as Pivot: A Two-Stage Model for Low Resource Table-to-Text Generation | Code | 0
Keyphrase Extraction for N-best Reranking in Multi-Sentence Compression | Code | 0
Learning Latent Semantic Annotations for Grounding Natural Language to Structured Data | Code | 0
Keeping Notes: Conditional Natural Language Generation with a Scratchpad Mechanism | Code | 0
Judge the Judges: A Large-Scale Evaluation Study of Neural Language Models for Online Review Generation | Code | 0
For Generated Text, Is NLI-Neutral Text the Best Text? | Code | 0
Is Multilingual BERT Fluent in Language Generation? | Code | 0
Can adversarial training learn image captioning ? | Code | 0
A Recurrent BERT-based Model for Question Generation | Code | 0
Is neural language acquisition similar to natural? A chronological probing study | Code | 0
On the Robustness of Editing Large Language Models | Code | 0
Kernelized Bayesian Softmax for Text Generation | Code | 0
ARAML: A Stable Adversarial Training Framework for Text Generation | Code | 0
Calibrating LLM-Based Evaluator | Code | 0
Investigating the Robustness of Natural Language Generation from Logical Forms via Counterfactual Samples | Code | 0
Investigating Strategies for Clause Recommendation | Code | 0
Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's Showerthoughts | Code | 0
Investigating Linguistic Pattern Ordering in Hierarchical Natural Language Generation | Code | 0
Investigating Metric Diversity for Evaluating Long Document Summarisation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | | Unverified
2 | FactT5B | BLEU | 48.37 | | Unverified
3 | JointGT Baseline | BLEU | 47.51 | | Unverified
4 | FactJointGT | BLEU | 47.39 | | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | | Unverified
6 | T5 | METEOR | 0.12 | | Unverified
7 | BART | METEOR | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | | Unverified
2 | partGAN | BLEU-2 | 0.91 | | Unverified
3 | RankGAN | BLEU-2 | 0.85 | | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | | Unverified
3 | RelGAN | BLEU-2 | 0.88 | | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | | Unverified
5 | RankGAN | BLEU-2 | 0.78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | | Unverified
2 | CRFR | Distinct-3 | 0.52 | | Unverified
3 | KGSF | Distinct-3 | 0.43 | | Unverified
4 | C2CRS | Distinct-3 | 0.33 | | Unverified
5 | KBRD | Distinct-3 | 0.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | | Unverified
3 | BART | METEOR | 0.3 | | Unverified
4 | T5 | METEOR | 0.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | | Unverified
2 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | | Unverified
4 | Beam search | BLEU-1 | 33.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | | Unverified
2 | TGen | METEOR | 0.15 | | Unverified
3 | TGen+ | METEOR | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | | Unverified
2 | SA-VAE | NLL | 327.5 | | Unverified
3 | Aggressive VAE | NLL | 326.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | | Unverified