SOTAVerified

Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.
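As a rough illustration of the two ideas mentioned above, the sketch below trains a first-order (bigram) Markov model on a toy corpus, samples a sequence from it, and scores the sample with clipped n-gram precision, the building block of BLEU. It is a minimal sketch under stated assumptions: the corpus, the function names (`train_bigram_model`, `generate`, `clipped_precision`), and the fixed seed are all hypothetical, and the score omits the brevity penalty and the geometric mean over n-gram orders that full BLEU uses.

```python
import random
from collections import Counter, defaultdict


def train_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    successors = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        successors[current].append(following)
    return successors


def generate(successors, start, max_len=10, seed=0):
    """Sample a sequence by repeatedly drawing a successor of the last token."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_len:
        candidates = successors.get(output[-1])
        if not candidates:  # dead end: this token was never followed by anything
            break
        output.append(rng.choice(candidates))
    return output


def ngrams(tokens, n):
    """Return all contiguous n-grams of a token list as tuples."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def clipped_precision(candidate, reference, n):
    """Modified n-gram precision: candidate counts are clipped by reference counts."""
    cand_counts = Counter(ngrams(candidate, n))
    ref_counts = Counter(ngrams(reference, n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return matched / total


# Toy data (hypothetical, for illustration only).
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = train_bigram_model(corpus)
sample = generate(model, start="the", max_len=8)
reference = "the cat sat on the mat".split()

print(" ".join(sample))
print(clipped_precision(sample, reference, 2))
```

Clipping matters because it stops a degenerate output that repeats one frequent word from earning full unigram credit; the benchmark scores listed on this page come from the respective papers and not from this sketch.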

(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 4651–4700 of 5335 papers

Title | Status | Hype
IMPersona: Evaluating Individual Level LM Impersonation | Code | 0
SyntaxShap: Syntax-aware Explainability Method for Text Generation | Code | 0
Image Content Generation with Causal Reasoning | Code | 0
IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for Natural Language Generation | Code | 0
E2E NLG Challenge: Neural Models vs. Templates | Code | 0
Adaptive Compression of the Latent Space in Variational Autoencoders | Code | 0
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation | Code | 0
Measuring Reliability of Large Language Models through Semantic Consistency | Code | 0
Measuring the Diversity of Automatic Image Descriptions | Code | 0
Mechanistic Behavior Editing of Language Models | Code | 0
Synthesis and Evaluation of a Domain-specific Large Data Set for Dungeons & Dragons | Code | 0
IGD: Token Decisiveness Modeling via Information Gain in LLMs for Personalized Recommendation | Code | 0
Velocidapter: Task-oriented Dialogue Comprehension Modeling Pairing Synthetic Text Generation with Domain Adaptation | Code | 0
Zero-Shot Dialog Generation with Cross-Domain Latent Actions | Code | 0
Conformal Structured Prediction | Code | 0
Recurrent Hierarchical Topic-Guided RNN for Language Generation | Code | 0
Automatic Metrics in Natural Language Generation: A Survey of Current Evaluation Practices | Code | 0
Concept-Level Explainability for Auditing & Steering LLM Responses | Code | 0
Identifying Informational Sources in News Articles | Code | 0
HypoEval: Hypothesis-Guided Evaluation for Natural Language Generation | Code | 0
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement Learning | Code | 0
Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding | Code | 0
Reducing Gender Bias in Word-Level Language Models with a Gender-Equalizing Loss Function | Code | 0
Human vs Automatic Metrics: on the Importance of Correlation Design | Code | 0
Human Speech Perception in Noise: Can Large Language Models Paraphrase to Improve It? | Code | 0
Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation | Code | 0
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text? | Code | 0
MERGE: Fast Private Text Generation | Code | 0
Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image Data | Code | 0
Meta-DiffuB: A Contextualized Sequence-to-Sequence Text Diffusion Model with Meta-Exploration | Code | 0
Reducing Sensitivity on Speaker Names for Text Generation from Dialogues | Code | 0
Meta-Learning for Efficient Fine-Tuning of Large Language Models | Code | 0
Compositional Image Retrieval via Instruction-Aware Contrastive Learning | Code | 0
Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation | Code | 0
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention | Code | 0
HTSS: A Novel Hybrid Text Summarisation and Simplification Architecture | Code | 0
Dynamic layer selection in decoder-only transformers | Code | 0
Towards Better Open-Ended Text Generation: A Multicriteria Evaluation Framework | Code | 0
How You Prompt Matters! Even Task-Oriented Constraints in Instructions Affect LLM-Generated Text Detection | Code | 0
Referenceless Quality Estimation for Natural Language Generation | Code | 0
Understanding the Quality-Diversity Trade-off in Diffusion Language Models | Code | 0
Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency | Code | 0
A Neural Conversation Generation Model via Equivalent Shared Memory Investigation | Code | 0
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images | Code | 0
Automatic Logical Forms improve fidelity in Table-to-Text generation | Code | 0
An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation | Code | 0
compare-mt: A Tool for Holistic Comparison of Language Generation Systems | Code | 0
How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey | Code | 0
Systematic Task Exploration with LLMs: A Study in Citation Text Generation | Code | 0
Dynamic Human Evaluation for Relative Model Comparisons | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | | Unverified
2 | FactT5B | BLEU | 48.37 | | Unverified
3 | JointGT Baseline | BLEU | 47.51 | | Unverified
4 | FactJointGT | BLEU | 47.39 | | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | | Unverified
6 | T5 | METEOR | 0.12 | | Unverified
7 | BART | METEOR | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | | Unverified
2 | partGAN | BLEU-2 | 0.91 | | Unverified
3 | RankGAN | BLEU-2 | 0.85 | | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | | Unverified
3 | RelGAN | BLEU-2 | 0.88 | | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | | Unverified
5 | RankGAN | BLEU-2 | 0.78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | | Unverified
2 | CRFR | Distinct-3 | 0.52 | | Unverified
3 | KGSF | Distinct-3 | 0.43 | | Unverified
4 | C2CRS | Distinct-3 | 0.33 | | Unverified
5 | KBRD | Distinct-3 | 0.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | | Unverified
3 | BART | METEOR | 0.3 | | Unverified
4 | T5 | METEOR | 0.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | | Unverified
2 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | | Unverified
4 | Beam search | BLEU-1 | 33.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | | Unverified
2 | TGen | METEOR | 0.15 | | Unverified
3 | TGen+ | METEOR | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | | Unverified
2 | SA-VAE | NLL | 327.5 | | Unverified
3 | Aggressive VAE | NLL | 326.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | | Unverified