SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 19512000 of 5335 papers

TitleStatusHype
QCQA: Quality and Capacity-aware grouped Query Attention0
Extroversion or Introversion? Controlling The Personality of Your Large Language ModelsCode0
On Subjective Uncertainty Quantification and Calibration in Natural Language GenerationCode0
Annotating FrameNet via Structure-Conditioned Language GenerationCode0
Evaluating the Smooth Control of Attribute Intensity in Text Generation with LLMsCode0
Evaluating Durability: Benchmark Insights into Multimodal Watermarking0
End-to-End Trainable Retrieval-Augmented Generation for Relation Extraction0
Effective Context Selection in LLM-based Leaderboard Generation: An Empirical Study0
Uncovering Limitations of Large Language Models in Information Seeking from TablesCode0
BEADs: Bias Evaluation Across Domains0
Confabulation: The Surprising Value of Large Language Model Hallucinations0
The Challenges of Evaluating LLM Applications: An Analysis of Automated, Human, and LLM-Based Approaches0
PatentEval: Understanding Errors in Patent GenerationCode0
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework0
CSS: Contrastive Semantic Similarity for Uncertainty Quantification of LLMsCode0
AD-H: Autonomous Driving with Hierarchical AgentsCode0
Adaptive Preference Scaling for Reinforcement Learning with Human Feedback0
Exploring Mathematical Extrapolation of Large Language Models with Synthetic Data0
OccamLLM: Fast and Exact Language Model Arithmetic in a Single Step0
Order-Independence Without Fine TuningCode0
The current status of large language models in summarizing radiology report impressions0
Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation0
Layout Agnostic Scene Text Image Synthesis with Diffusion Models0
Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language GenerationCode0
Role-playing Prompt Framework: Generation and Evaluation0
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models0
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning0
The Power of Summary-Source AlignmentsCode0
Improving Text Generation on Images with Synthetic Captions0
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models0
Evaluating Large Language Model Biases in Persona-Steered GenerationCode0
XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution0
Phantom: General Trigger Attacks on Retrieval Augmented Language Generation0
Hidden in Plain Sight: Exploring Chat History Tampering in Interactive Language Models0
WRDScore: New Metric for Evaluation of Natural Language Generation ModelsCode0
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models0
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension0
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities0
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation0
MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding0
On the Sequence Evaluation based on Stochastic Processes0
Are PPO-ed Language Models Hackable?0
Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities0
A System for Automatic English Text Expansion0
Glauber Generative Model: Discrete Diffusion Models via Binary Classification0
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation0
On the Noise Robustness of In-Context Learning for Text GenerationCode0
UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models0
On Understanding Attention-Based In-Context Learning for Categorical Data0
Augmenting Textual Generation via Topology Aware Retrieval0
Show:102550
← PrevPage 40 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified