SOTAVerified

Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation".

Text generation can be addressed with Markov processes or with deep generative models such as LSTMs. More recent methods include BART, GPT, and GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
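As an illustration of the Markov-process approach mentioned above, here is a minimal sketch of a first-order (bigram) Markov chain text generator. The function names `build_bigram_model` and `generate`, the toy corpus, and the whitespace tokenization are all assumptions made for this example; they do not come from any of the listed papers.

```python
import random
from collections import defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed immediately after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return dict(model)


def generate(model, start, max_tokens=10, seed=None):
    """Sample a sequence by repeatedly drawing a successor of the last token."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_tokens:
        successors = model.get(output[-1])
        if not successors:
            # Dead end: the last token was never followed by anything in the corpus.
            break
        # Duplicates in the list make frequent successors proportionally more likely.
        output.append(rng.choice(successors))
    return output


# Toy corpus (hypothetical), tokenized on whitespace.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
sample = generate(model, "the", max_tokens=8, seed=0)
print(" ".join(sample))
```

Every adjacent pair of words in the output was seen in the corpus, which is why such models read as locally fluent but lose coherence over longer spans. Neural models such as LSTMs condition on a much longer context.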

(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 501-550 of 5335 papers

Title | Status | Hype
A Methodology for Generative Spelling Correction via Natural Spelling Errors Emulation across Multiple Domains and Languages | Code | 1
Uni-NLX: Unifying Textual Explanations for Vision and Vision-Language Tasks | Code | 1
LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked | Code | 1
Can Knowledge Graphs Simplify Text? | Code | 1
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models | Code | 1
ZYN: Zero-Shot Reward Models with Yes-No Questions for RLAIF | Code | 1
Evaluating the Generation Capabilities of Large Chinese Language Models | Code | 1
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning | Code | 1
This is not correct! Negation-aware Evaluation of Language Generation Systems | Code | 1
Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text | Code | 1
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples | Code | 1
Selective Generation for Controllable Language Models | Code | 1
COLLIE: Systematic Construction of Constrained Text Generation Tasks | Code | 1
Controllable Data Augmentation for Few-Shot Text Mining with Chain-of-Thought Attribute Manipulation | Code | 1
Copy Is All You Need | Code | 1
Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators | Code | 1
PREADD: Prefix-Adaptive Decoding for Controlled Text Generation | Code | 1
Text Alignment Is An Efficient Unified Model for Massive NLP Tasks | Code | 1
Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models | Code | 1
FairPrism: Evaluating Fairness-Related Harms in Text Generation | Code | 1
ZeroGen: Zero-shot Multimodal Controllable Text Generation with Multiple Oracles | Code | 1
VisText: A Benchmark for Semantically Rich Chart Captioning | Code | 1
Learning to Rank in Generative Retrieval | Code | 1
FunQA: Towards Surprising Video Comprehension | Code | 1
Learning to Generate Better Than Your LLM | Code | 1
Explicit Syntactic Guidance for Neural Text Generation | Code | 1
Generate to Understand for Representation | Code | 1
Question Decomposition Tree for Answering Complex Questions over Knowledge Bases | Code | 1
Click: Controllable Text Generation with Sequence Likelihood Contrastive Learning | Code | 1
Sequential Monte Carlo Steering of Large Language Models using Probabilistic Programs | Code | 1
Adaptive and Personalized Exercise Generation for Online Language Learning | Code | 1
Binary and Ternary Natural Language Generation | Code | 1
Differentiable Tree Operations Promote Compositional Generalization | Code | 1
Preference-grounded Token-level Guidance for Language Model Fine-tuning | Code | 1
Fine-grained Text Style Transfer with Diffusion-Based Language Models | Code | 1
Grammar Prompting for Domain-Specific Language Generation with Large Language Models | Code | 1
Unsupervised Melody-to-Lyric Generation | Code | 1
Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models | Code | 1
A Critical Evaluation of Evaluations for Long-form Question Answering | Code | 1
Act Like a Radiologist: Radiology Report Generation across Anatomical Regions | Code | 1
Backpack Language Models | Code | 1
Self-contradictory Hallucinations of Large Language Models: Evaluation, Detection and Mitigation | Code | 1
Investigating Table-to-Text Generation Capabilities of LLMs in Real-World Information Seeking Scenarios | Code | 1
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory | Code | 1
Leftover Lunch: Advantage-based Offline Reinforcement Learning for Language Models | Code | 1
TOAST: Transfer Learning via Attention Steering | Code | 1
Inference-Time Policy Adapters (IPA): Tailoring Extreme-Scale LMs without Fine-tuning | Code | 1
Is GPT-4 a Good Data Analyst? | Code | 1
QTSumm: Query-Focused Summarization over Tabular Data | Code | 1
INSTRUCTSCORE: Explainable Text Generation Evaluation with Finegrained Feedback | Code | 1
Page 11 of 107

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | | Unverified
2 | FactT5B | BLEU | 48.37 | | Unverified
3 | JointGT Baseline | BLEU | 47.51 | | Unverified
4 | FactJointGT | BLEU | 47.39 | | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | | Unverified
6 | T5 | METEOR | 0.12 | | Unverified
7 | BART | METEOR | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | | Unverified
2 | partGAN | BLEU-2 | 0.91 | | Unverified
3 | RankGAN | BLEU-2 | 0.85 | | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | | Unverified
3 | RelGAN | BLEU-2 | 0.88 | | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | | Unverified
5 | RankGAN | BLEU-2 | 0.78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | | Unverified
2 | CRFR | Distinct-3 | 0.52 | | Unverified
3 | KGSF | Distinct-3 | 0.43 | | Unverified
4 | C2CRS | Distinct-3 | 0.33 | | Unverified
5 | KBRD | Distinct-3 | 0.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | | Unverified
3 | BART | METEOR | 0.3 | | Unverified
4 | T5 | METEOR | 0.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | | Unverified
2 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | | Unverified
4 | Beam search | BLEU-1 | 33.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | | Unverified
2 | TGen | METEOR | 0.15 | | Unverified
3 | TGen+ | METEOR | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | | Unverified
2 | SA-VAE | NLL | 327.5 | | Unverified
3 | Aggressive VAE | NLL | 326.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | | Unverified