SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 37513800 of 5335 papers

TitleStatusHype
Bailicai: A Domain-Optimized Retrieval-Augmented Generation Framework for Medical Applications0
Balancing Effect of Training Dataset Distribution of Multiple Styles for Multi-Style Text Transfer0
Balancing via Generation for Multi-Class Text Classification Improvement0
BanditSpec: Adaptive Speculative Decoding via Bandit Algorithms0
Barch: an English Dataset of Bar Chart Summaries0
BART-light: One Decoder Layer Is Enough0
Basic Principles for Segmenting Thai EDUs0
Bayesian WeakS-to-Strong from Text Classification to Generation0
BEADs: Bias Evaluation Across Domains0
Behavior of Modern Pre-trained Language Models Using the Example of Probing Tasks0
Being data-driven is not enough: Revisiting interactive instruction giving as a challenge for NLG0
BENCHAGENTS: Automated Benchmark Creation with Agent Interaction0
Benchmarking Chinese Medical LLMs: A Medbench-based Analysis of Performance Gaps and Hierarchical Optimization Strategies0
Benchmarking Large Language Model Capabilities for Conditional Generation0
Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains0
Benchmarking Next-Generation Reasoning-Focused Large Language Models in Ophthalmology: A Head-to-Head Evaluation on 5,888 Items0
BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP0
BERT 4EVER@LT-EDI-ACL2022-Detecting signs of Depression from Social Media:Detecting Depression in Social Media using Prompt-Learning and Word-Emotion Cluster0
What BERT Sees: Cross-Modal Transfer for Visual Question Generation0
BERT for Question Generation0
BERT vs GPT for financial engineering0
Best-k Search Algorithm for Neural Text Generation0
Best Practices for Data-Efficient Modeling in NLG:How to Train Production-Ready Neural Models with Less Data0
Best practices for the human evaluation of automatically generated text0
Best Student Forcing: A Simple Training Mechanism in Adversarial Language Generation0
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering0
BetterV: Controlled Verilog Generation with Discriminative Guidance0
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation0
Beyond Generative Artificial Intelligence: Roadmap for Natural Language Generation0
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts0
Beyond Reality: The Pivotal Role of Generative AI in the Metaverse0
Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation0
Beyond Retrieval: Generating Narratives in Conversational Recommender Systems0
Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding0
Beyond Text Generation: Supporting Writers with Continuous Automatic Text Summaries0
LLMs May Perform MCQA by Selecting the Least Incorrect Option0
Beyond Turing: A Comparative Analysis of Approaches for Detecting Machine-Generated Text0
BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs0
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation0
Bidimensional Leaderboards: Generate and Evaluate Language Hand in Hand0
BiDoRA: Bi-level Optimization-Based Weight-Decomposed Low-Rank Adaptation0
Big Bidirectional Insertion Representations for Documents0
Bilingual-GAN: A Step Towards Parallel Text Generation0
BinLin: A Simple Method of Dependency Tree Linearization0
BioInstruct: Instruction Tuning of Large Language Models for Biomedical Natural Language Processing0
Biomedical Large Languages Models Seem not to be Superior to Generalist Models on Unseen Medical Data0
Black Box to White Box: Discover Model Characteristics Based on Strategic Probing0
BLEU Neighbors: A Reference-less Approach to Automatic Evaluation0
Blogging birds: Generating narratives about reintroduced species to promote public engagement0
Book Review: Automatic Text Simplification by Horacio Saggion0
Show:102550
← PrevPage 76 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified