SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 11011150 of 5335 papers

TitleStatusHype
The current status of large language models in summarizing radiology report impressions0
FedMKT: Federated Mutual Knowledge Transfer for Large and Small Language ModelsCode3
MAD: Multi-Alignment MEG-to-Text DecodingCode1
Layout Agnostic Scene Text Image Synthesis with Diffusion Models0
Favi-Score: A Measure for Favoritism in Automated Preference Ratings for Generative AI Evaluation0
TCMBench: A Comprehensive Benchmark for Evaluating Large Language Models in Traditional Chinese MedicineCode2
Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language GenerationCode0
Brainstorming Brings Power to Large Language Models of Knowledge Reasoning0
Get my drift? Catching LLM Task Drift with Activation DeltasCode2
The Power of Summary-Source AlignmentsCode0
Role-playing Prompt Framework: Generation and Evaluation0
FOCUS: Forging Originality through Contrastive Use in Self-Plagiarism for Language Models0
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models0
Improving Text Generation on Images with Synthetic Captions0
XPrompt:Explaining Large Language Model's Generation via Joint Prompt Attribution0
RTGen: Generating Region-Text Pairs for Open-Vocabulary Object DetectionCode1
Phantom: General Trigger Attacks on Retrieval Augmented Language Generation0
Evaluating Large Language Model Biases in Persona-Steered GenerationCode0
Multi-Aspect Controllable Text Generation with Disentangled Counterfactual AugmentationCode1
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic SimilaritiesCode1
Hidden in Plain Sight: Exploring Chat History Tampering in Interactive Language Models0
Language Generation with Strictly Proper Scoring RulesCode1
Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities0
Can GPT Redefine Medical Understanding? Evaluating GPT on Biomedical Machine Reading Comprehension0
LMO-DP: Optimizing the Randomization Mechanism for Differentially Private Fine-Tuning (Large) Language Models0
Alt4Blind: A User Interface to Simplify Charts Alt-Text Creation0
WRDScore: New Metric for Evaluation of Natural Language Generation ModelsCode0
Are PPO-ed Language Models Hackable?0
Hardware-Aware Parallel Prompt Decoding for Memory-Efficient Acceleration of LLM InferenceCode2
Automatic detection of cognitive impairment in elderly people using an entertainment chatbot with Natural Language Processing capabilities0
MindFormer: Semantic Alignment of Multi-Subject fMRI for Brain Decoding0
On the Sequence Evaluation based on Stochastic Processes0
A System for Automatic English Text Expansion0
Augmenting Textual Generation via Topology Aware Retrieval0
UIT-DarkCow team at ImageCLEFmedical Caption 2024: Diagnostic Captioning for Radiology Images Efficiency with Transformer Models0
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation0
Glauber Generative Model: Discrete Diffusion Models via Binary Classification0
On Understanding Attention-Based In-Context Learning for Categorical Data0
A Library for Automatic Natural Language Generation of Spanish Texts0
On the Noise Robustness of In-Context Learning for Text GenerationCode0
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching RegularizationCode0
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions0
Automatic Jailbreaking of the Text-to-Image Generative AI SystemsCode1
Large Language Model Pruning0
Embedding-Aligned Language Models0
Bayesian WeakS-to-Strong from Text Classification to Generation0
Model Cascading for Code: A Cascaded Black-Box Multi-Model Framework for Cost-Efficient Code Completion with Self-Testing0
Athena: Efficient Block-Wise Post-Training Quantization for Large Language Models Using Second-Order Matrix Derivative Information0
Sparse Spectral Training and Inference on Euclidean and Hyperbolic Neural Networks0
Certifiably Robust RAG against Retrieval CorruptionCode1
Show:102550
← PrevPage 23 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified