SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 29012950 of 5335 papers

TitleStatusHype
The current status of large language models in summarizing radiology report impressions0
The Detection of Distributional Discrepancy for Text Generation0
The DipInfo-UniTo system for SRST 20180
The E2E NLG Challenge: A Tale of Two Systems0
The Effectiveness of Bidirectional Generative Patent Language Models0
The Effect of Multiple Replies for Natural Language Generation Chatbots0
The First Multilingual Surface Realisation Shared Task (SR’18): Overview and Evaluation Results0
A Novel Task-Oriented Text Corpus in Silent Speech Recognition and its Natural Language Generation Construction Method0
The Future is Agentic: Definitions, Perspectives, and Open Challenges of Multi-Agent Recommender Systems0
The Future of AI-Assisted Writing0
The GEM Benchmark: Natural Language Generation, its Evaluation and Metrics0
The Glass Ceiling of Automatic Evaluation in Natural Language Generation0
The Hyperfitting Phenomenon: Sharpening and Stabilizing LLMs for Open-Ended Text Generation0
The Impact of Artificial Intelligence on the Evolution of Digital Education: A Comparative Study of OpenAI Text Generation Tools including ChatGPT, Bing Chat, Bard, and Ernie0
The Impact of Listener Gaze on Predicting Reference Resolution0
The Impact of Preference Agreement in Reinforcement Learning from Human Feedback: A Case Study in Summarization0
The Impact of Rule-Based Text Generation on the Quality of Abstractive Summaries0
The Iron(ic) Melting Pot: Reviewing Human Evaluation in Humour, Irony and Sarcasm Generation0
The KBGen Challenge0
The Last 10 Metres: Using Visual Analysis and Verbal Communication in Guiding Visually Impaired Smartphone Users to Entrances0
The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination0
The Less the Merrier? Investigating Language Representation in Multilingual Models0
The Magnitude of Categories of Texts Enriched by Language Models0
The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims0
Beyond the Black Box: A Statistical Model for LLM Reasoning and Inference0
ThemePro: A Toolkit for the Analysis of Thematic Progression0
The Methodius Corpus of Rhetorical Discourse Structures and Generated Texts0
The Morpho-syntactic Annotation of Animacy for a Dependency Parser0
The Multilingual Affective Soccer Corpus (MASC): Compiling a biased parallel corpus on soccer reportage in English, German and Dutch0
The Narrow Gate: Localized Image-Text Communication in Vision-Language Models0
The Natural Language Pipeline, Neural Text Generation and Explainability0
The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures0
The Open Source Advantage in Large Language Models (LLMs)0
Theoretical Benefit and Limitation of Diffusion Language Model0
The PARLANCE mobile application for interactive search in English and Mandarin0
The Perils of Using Mechanical Turk to Evaluate Open-Ended Text Generation0
The Pitfalls of Defining Hallucination0
The Power of Combining Data and Knowledge: GPT-4o is an Effective Interpreter of Machine Learning Models in Predicting Lymph Node Metastasis of Lung Cancer0
The Pure Poet: How Good is the Subjective Credibility and Stylistic Quality of Literary Short Texts Written with an Artificial Intelligence Tool as Compared to Texts Written by Human Authors?0
The Reasoning-Memorization Interplay in Language Models Is Mediated by a Single Direction0
The Rocky Road towards a Swedish FrameNet - Creating SweFN0
The role of grammar in transition-probabilities of subsequent words in English text0
The Safety Reminder: A Soft Prompt to Reactivate Delayed Safety Awareness in Vision-Language Models0
The Science of Detecting LLM-Generated Texts0
The Secret's in the Word Order: Text-to-Text Generation for Linguistic Steganography0
The SelectGen Challenge: Finding the Best Training Samples for Few-Shot Neural Text Generation0
The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge0
The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation0
The Structure of Financial Equity Research Reports -- Identification of the Most Frequently Asked Questions in Financial Analyst Reports to Automate Equity Research Using Llama 3 and GPT-40
The Surface Realisation Task: Recent Developments and Future Plans0
Show:102550
← PrevPage 59 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified