SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 16511700 of 5335 papers

TitleStatusHype
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It?Code0
CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMsCode0
POS-Constrained Parallel Decoding for Non-autoregressive GenerationCode0
Positional Encoding to Control Output Sequence LengthCode0
How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text DetectionCode0
HTSS: A Novel Hybrid Text Summarisation and Simplification ArchitectureCode0
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?Code0
How Helpful is Inverse Reinforcement Learning for Table-to-Text Generation?Code0
Automatic Opinion Question GenerationCode0
Enhance Incomplete Utterance Restoration by Joint Learning Token Extraction and Text GenerationCode0
How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation?Code0
How Does Response Length Affect Long-Form FactualityCode0
How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical SurveyCode0
HypoEval: Hypothesis-Guided Evaluation for Natural Language GenerationCode0
High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language ModelsCode0
PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization EvaluationCode0
HistAlign: Improving Context Dependency in Language Generation by Aligning with HistoryCode0
Enhancing Content Planning for Table-to-Text Generation with Data Understanding and VerificationCode0
Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation PracticesCode0
Enhancing Court View Generation with Knowledge Injection and GuidanceCode0
Hierarchical Text Generation using an OutlineCode0
Cross-Lingual Transfer of Debiasing and Detoxification in Multilingual LLMs: An Extensive InvestigationCode0
Automatic Logical Forms improve fidelity in Table-to-Text generationCode0
Hierarchical Attention: What Really Counts in Various NLP TasksCode0
Help! Need Advice on Identifying AdviceCode0
Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay DetectionCode0
Hierarchical Text Generation and Planning for Strategic DialogueCode0
Task-Adaptive Tokenization: Enhancing Long-Form Text Generation Efficacy in Mental Health and BeyondCode0
Cross-Domain Detection of GPT-2-Generated Technical TextCode0
HeavyWater and SimplexWater: Watermarking Low-Entropy Text DistributionsCode0
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty DecodingCode0
Enhancing Paraphrase Type Generation: The Impact of DPO and RLHF Evaluated with Human-Ranked DataCode0
Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text GenerationCode0
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity ApplicationsCode0
EffEval: A Comprehensive Evaluation of Efficiency for MT Evaluation MetricsCode0
Enhancing RWKV-based Language Models for Long-Sequence Text GenerationCode0
CREST: A Joint Framework for Rationalization and Counterfactual Text GenerationCode0
Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network WatermarkingCode0
Helping the Helper: Supporting Peer Counselors via AI-Empowered Practice and FeedbackCode0
How Control Information Influences Multilingual Text Image Generation and Editing?Code0
Identifying Informational Sources in News ArticlesCode0
Handling Divergent Reference Texts when Evaluating Table-to-Text GenerationCode0
An Actor-Critic Algorithm for Sequence PredictionCode0
Handling Rare Items in Data-to-Text GenerationCode0
Creative GANs for generating poems, lyrics, and metaphorsCode0
Automatic Generation of Personalized Comment Based on User ProfileCode0
Hallucination, Monofacts, and Miscalibration: An Empirical InvestigationCode0
Artificial Intelligence versus Maya Angelou: Experimental evidence that people cannot differentiate AI-generated from human-written poetryCode0
Guided Attention for Interpretable Motion CaptioningCode0
Guided Neural Language Generation for Abstractive Summarization using Abstract Meaning RepresentationCode0
Show:102550
← PrevPage 34 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified