SOTAVerified

Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. The task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or with deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
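As a rough illustration of the Markov-process approach mentioned above, the following is a minimal sketch of a first-order (bigram) generator. The function names `build_bigram_model` and `generate`, the toy corpus, and the fixed seed are all assumptions made for this example; they do not come from any paper listed on this page.

```python
import random
from collections import defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return dict(model)


def generate(model, start, max_tokens=10, seed=0):
    """Sample a token sequence by repeatedly picking an observed successor."""
    rng = random.Random(seed)  # fixed seed (an assumption) so runs are repeatable
    output = [start]
    while len(output) < max_tokens:
        successors = model.get(output[-1])
        if not successors:  # dead end: no token ever followed this one
            break
        output.append(rng.choice(successors))
    return output


# Hypothetical toy corpus, used only for illustration.
corpus = "the cat sat on the mat and the cat slept".split()
model = build_bigram_model(corpus)
print(" ".join(generate(model, start="the", max_tokens=8)))
```

Because successors are stored with repetition, frequent continuations are sampled more often, which approximates the empirical transition probabilities without computing them explicitly.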


(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 1351-1400 of 5335 papers

Title | Status | Hype
End-to-end argumentation knowledge graph construction |  | 0
End-to-end Concept Word Detection for Video Captioning, Retrieval, and Question Answering |  | 0
Capturing Event Argument Interaction via A Bi-Directional Entity-Level Recurrent Decoder |  | 0
Are PPO-ed Language Models Hackable? |  | 0
CaptainGAN: Navigate Through Embedding Space For Better Text Generation |  | 0
A Repository of Rules and Lexical Resources for Discourse Structure Analysis: the Case of Explanation Structures |  | 0
Structured Chain-of-Thought Prompting for Code Generation |  | 0
A Repository of Frame Instance Lexicalizations for Generation |  | 0
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation |  | 0
A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis |  | 0
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data? |  | 0
Can the Transformer Be Used as a Drop-in Replacement for RNNs in Text-Generating GANs? |  | 0
A Repository of Data and Evaluation Resources for Natural Language Generation |  | 0
Adapting Graph Summaries to the Users' Reading Levels |  | 0
Enabling Efficient On-Device Fine-Tuning of LLMs Using Only Inference Engines |  | 0
Can spontaneous spoken language disfluencies help describe syntactic dependencies? An empirical study |  | 0
Can Pretrained Language Models Generate Persuasive, Faithful, and Informative Ad Text for Product Descriptions? |  | 0
Can Neural Image Captioning be Controlled via Forced Attention? |  | 0
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate |  | 0
Are LLMs Aware that Some Questions are not Open-ended? |  | 0
A Brief Introduction to Natural Language Generation within Computational Creativity |  | 0
Can LLMs Automate Fact-Checking Article Writing? |  | 0
Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs |  | 0
Adapting Descriptions of People to the Point of View of a Moving Observer |  | 0
Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inference |  | 0
Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inference |  | 0
Enabling Language Models to Implicitly Learn Self-Improvement |  | 0
Enabling text readability awareness during the micro planning phase of NLG applications |  | 0
End-to-End Differentiable GANs for Text Generation |  | 0
Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval |  | 0
Can Language Model Moderators Improve the Health of Online Discourse? |  | 0
Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? |  | 0
Can Grammarly and ChatGPT accelerate language change? AI-powered technologies and their impact on the English language: wordiness vs. conciseness |  | 0
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators |  | 0
A Game-Based Setup for Data Collection and Task-Based Evaluation of Uncertain Information Presentation |  | 0
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension |  | 0
Can GPT Improve the State of Prior Authorization via Guideline Based Automated Question Answering? |  | 0
Are Fictional Voices Distinguishable? Classifying Character Voices in Modern Drama |  | 0
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery |  | 0
Are Current Decoding Strategies Capable of Facing the Challenges of Visual Dialogue? |  | 0
A Probability--Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors |  | 0
Can AI Read Between The Lines? Benchmarking LLMs On Financial Nuance |  | 0
CaM-Gen: Causally Aware Metric-Guided Text Generation |  | 0
AfriKI: Machine-in-the-Loop Afrikaans Poetry Generation |  | 0
A Block Metropolis-Hastings Sampler for Controllable Energy-based Text Generation |  | 0
CaM-Gen:Causally-aware Metric-guided Text Generation |  | 0
CaM-Gen: Causally-aware Guided Text Generation |  | 0
Call Centre Conversation Summarization: A Pilot Task at Multiling 2015 |  | 0
Architecture for a Trustworthy Quantum Chatbot |  | 0
Empirical Validation of Reichenbach's Tense Framework |  | 0
Page 28 of 107

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 |  | Unverified
2 | FactT5B | BLEU | 48.37 |  | Unverified
3 | JointGT Baseline | BLEU | 47.51 |  | Unverified
4 | FactJointGT | BLEU | 47.39 |  | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 |  | Unverified
6 | T5 | METEOR | 0.12 |  | Unverified
7 | BART | METEOR | 0.11 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 |  | Unverified
2 | partGAN | BLEU-2 | 0.91 |  | Unverified
3 | RankGAN | BLEU-2 | 0.85 |  | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 |  | Unverified
5 | SeqGAN | BLEU-2 | 0.83 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 |  | Unverified
2 | PPOGAN | BLEU-2 | 0.91 |  | Unverified
3 | RelGAN | BLEU-2 | 0.88 |  | Unverified
4 | SeqGAN | BLEU-2 | 0.86 |  | Unverified
5 | RankGAN | BLEU-2 | 0.78 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 |  | Unverified
2 | CRFR | Distinct-3 | 0.52 |  | Unverified
3 | KGSF | Distinct-3 | 0.43 |  | Unverified
4 | C2CRS | Distinct-3 | 0.33 |  | Unverified
5 | KBRD | Distinct-3 | 0.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 |  | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 |  | Unverified
3 | BART | METEOR | 0.3 |  | Unverified
4 | T5 | METEOR | 0.29 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (beam) | BLEU-1 | 34.4 |  | Unverified
2 | Beam search + A*esque (sample) | BLEU-1 | 34.4 |  | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 |  | Unverified
4 | Beam search | BLEU-1 | 33.7 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 |  | Unverified
2 | SeqGAN | BLEU-2 | 0.74 |  | Unverified
3 | LeakGAN | BLEU-2 | 0.46 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 |  | Unverified
2 | TGen | METEOR | 0.15 |  | Unverified
3 | TGen+ | METEOR | 0.15 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 |  | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 |  | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 |  | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 |  | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 |  | Unverified
2 | SA-VAE | NLL | 327.5 |  | Unverified
3 | Aggressive VAE | NLL | 326.7 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 |  | Unverified