SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 12011250 of 5335 papers

TitleStatusHype
Fair Risk Control: A Generalized Framework for Calibrating Multi-group Fairness Risks0
Investigating Wit, Creativity, and Detectability of Large Language Models in Domain-Specific Writing Style Adaptation of Reddit's ShowerthoughtsCode0
On the Evaluation of Machine-Generated Reports0
Advancing human-centric AI for robust X-ray analysis through holistic self-supervised learning0
Controllable Text Generation in the Instruction-Tuning Era0
DynaMo: Accelerating Language Model Inference with Dynamic Multi-Token Sampling0
Integrating A.I. in Higher Education: Protocol for a Pilot Study with 'SAMCares: An Adaptive Learning Hub'Code0
Countering Reward Over-optimization in LLM with Demonstration-Guided Reinforcement LearningCode0
Safe Training with Sensitive In-domain Data: Leveraging Data Fragmentation To Mitigate Linkage Attacks0
DOCCI: Descriptions of Connected and Contrasting Images0
A Framework for Real-time Safeguarding the Text Generation of Large Language Model0
PECC: Problem Extraction and Coding ChallengesCode1
MRScore: Evaluating Radiology Report Generation with LLM-based Reward System0
Retrieval-Augmented Generation with Knowledge Graphs for Customer Service Question Answering0
When to Trust LLMs: Aligning Confidence with Response QualityCode0
Quantifying Memorization and Detecting Training Data of Pre-trained Language Models using Japanese Newspaper0
CEval: A Benchmark for Evaluating Counterfactual Text GenerationCode0
Large Language Models in the Clinic: A Comprehensive BenchmarkCode1
Evaluating Consistency and Reasoning Capabilities of Large Language Models0
BERT vs GPT for financial engineering0
Semantic Routing for Enhanced Performance of LLM-Assisted Intent-Based 5G Core Network Management and OrchestrationCode7
Online Personalizing White-box LLMs Generation with Neural Bandits0
Effective Unsupervised Constrained Text Generation based on Perturbed Masking0
Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language ModelsCode1
Identifying Fairness Issues in Automatically Generated Testing Content0
Towards smaller, faster decoder-only transformers: Architectural variants and their implicationsCode0
Context-Enhanced Language Models for Generating Multi-Paper Citations0
LLM-Personalize: Aligning LLM Planners with Human Preferences via Reinforced Self-Training for Housekeeping Robots0
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models0
Parameter Efficient Fine Tuning: A Comprehensive Analysis Across Applications0
Parameter Efficient Diverse Paraphrase Generation Using Sequence-Level Knowledge Distillation0
From r to Q^*: Your Language Model is Secretly a Q-Function0
Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation0
iRAG: Advancing RAG for Videos with an Incremental Approach0
A Survey on Retrieval-Augmented Text Generation for Large Language Models0
Related Work and Citation Text Generation: A Survey0
Prompt-Guided Generation of Structured Chest X-Ray Report Using a Pre-trained LLM0
Generative Text Steganography with Large Language Model0
White Men Lead, Black Women Help? Benchmarking and Mitigating Language Agency Social Biases in LLMs0
LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-Text Generation?Code1
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text GenerationCode1
KG-CTG: Citation Generation through Knowledge Graph-guided Large Language Models0
Bridging the Gap between Different Vocabularies for LLM EnsembleCode1
Unveiling LLM Evaluation Focused on Metrics: Challenges and Solutions0
WikiSplit++: Easy Data Refinement for Split and RephraseCode0
PMB5: Gaining More Insight into Neural Semantic Parsing with Challenging Benchmarks0
Language Generation in the Limit0
GraSAME: Injecting Token-Level Structural Information to Pretrained Language Models via Graph-guided Self-Attention Mechanism0
Control-DAG: Constrained Decoding for Non-Autoregressive Directed Acyclic T5 using Weighted Finite State AutomataCode0
Continuous Language Model Interpolation for Dynamic and Controllable Text GenerationCode0
Show:102550
← PrevPage 25 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified