SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 10011050 of 5335 papers

TitleStatusHype
Themis: A Reference-free NLG Evaluation Language Model with Flexibility and InterpretabilityCode1
PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization EvaluationCode0
Enhancing Data Privacy in Large Language Models through Private Association Editing0
TALEC: Teach Your LLM to Evaluate in Specific Domain with In-house Criteria by Criteria Division and Zero-shot Plus Few-shotCode0
Discrete Diffusion Language Model for Long Text Summarization0
Variationist: Exploring Multifaceted Variation and Bias in Written Language DataCode1
Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization?Code0
Text-Animator: Controllable Visual Text Video Generation0
Towards a Science Exocortex0
Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors0
Prompt-Consistency Image Generation (PCIG): A Unified Framework Integrating LLMs, Knowledge Graphs, and Controllable Diffusion ModelsCode0
Cascade Reward Sampling for Efficient Decoding-Time AlignmentCode1
Evaluation of Language Models in the Medical Context Under Resource-Constrained SettingsCode0
Directed Domain Fine-Tuning: Tailoring Separate Modalities for Specific Training Tasks0
Revisiting Interpolation Augmentation for Speech-to-Text GenerationCode1
Robust Reinforcement Learning from Corrupted Human Feedback0
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship EmbeddingsCode1
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-PolygraphCode2
Evaluating Diversity in Automatic Poetry GenerationCode0
A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG SystemsCode0
Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning0
In Tree Structure Should Sentence Be GeneratedCode0
A Data-Driven Guided Decoding Mechanism for Diagnostic CaptioningCode0
CityGPT: Empowering Urban Spatial Cognition of Large Language ModelsCode1
ClinicalLab: Aligning Agents for Multi-Departmental Clinical Diagnostics in the Real WorldCode2
Finding Blind Spots in Evaluator LLMs with Interpretable ChecklistsCode1
Adaptable Logical Control for Large Language ModelsCode2
D2O: Dynamic Discriminative Operations for Efficient Generative Inference of Large Language Models0
SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text GenerationCode0
The Comparative Trap: Pairwise Comparisons Amplifies Biased Preferences of LLM Evaluators0
Self-Distillation for Model Stacking Unlocks Cross-Lingual NLU in 200+ Languages0
Generating Educational Materials with Different Levels of Readability using LLMs0
LaMDA: Large Model Fine-Tuning via Spectrally Decomposed Low-Dimensional AdaptationCode1
PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models0
LiLiuM: eBay's Large Language Models for e-commerce0
In-Context Editing: Learning Knowledge from Self-Induced DistributionsCode2
DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed GraphsCode1
Self and Cross-Model Distillation for LLMs: Effective Methods for Refusal Pattern Alignment0
Extrinsic Evaluation of Cultural Competence in Large Language ModelsCode0
Fine-grained Controllable Text Generation through In-context Learning with Feedback0
CodeGemma: Open Code Models Based on Gemma0
Incentivizing Quality Text Generation via Statistical ContractsCode0
CELL your Model: Contrastive Explanations for Large Language Models0
GPT-Powered Elicitation Interview Script Generator for Requirements Engineering Training0
Fairer Preferences Elicit Improved Human-Aligned Large Language Model JudgmentsCode1
Identifying Query-Relevant Neurons in Large Language Models for Long-Form TextsCode0
Post-hoc Utterance Refining Method by Entity Mining for Faithful Knowledge Grounded ConversationsCode0
Intertwining CP and NLP: The Generation of Unreasonably Constrained Sentences0
Facts-and-Feelings: Capturing both Objectivity and Subjectivity in Table-to-Text Generation0
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and GenerationCode2
Show:102550
← PrevPage 21 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified