SOTAVerified

Text Generation

Text generation is the task of producing text that is indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation".

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
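As a rough illustration of the simplest approach named above, the sketch below builds a first-order (bigram) Markov chain from a toy corpus, samples from it, and scores the sample with clipped n-gram precision, which is the per-order building block of BLEU. The function names, the toy corpus, and the fixed seed are assumptions made for this example; full BLEU additionally combines several n-gram orders and applies a brevity penalty, which is omitted here.

```python
import random
from collections import Counter, defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    model = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        model[current].append(following)
    return model


def generate(model, start, max_tokens, seed=0):
    """Sample a first-order Markov chain from `start`; stop at a dead end."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_tokens:
        candidates = model.get(output[-1])
        if not candidates:  # token never seen with a successor
            break
        output.append(rng.choice(candidates))
    return output


def clipped_ngram_precision(candidate, reference, n):
    """Fraction of candidate n-grams found in the reference, with counts clipped."""
    def ngrams(seq):
        return Counter(tuple(seq[i:i + n]) for i in range(len(seq) - n + 1))

    cand, ref = ngrams(candidate), ngrams(reference)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / total


# Toy corpus (hypothetical); real systems train on far larger text.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
sample = generate(model, "the", max_tokens=8)
print(" ".join(sample))
print(clipped_ngram_precision(sample, corpus, n=2))
```

Because every transition is copied from the corpus, each bigram in the sample also occurs in the reference; the score still drops below 1 when the sample repeats a bigram more often than the reference does, which is what the clipping is for.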

(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 1301-1350 of 5335 papers

Title | Status | Hype
Personalized Text Generation with Contrastive Activation Steering | - | 0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Code | 0
TPC: Cross-Temporal Prediction Connection for Vision-Language Model Hallucination Reduction | - | 0
Architecture for a Trustworthy Quantum Chatbot | - | 0
Maximizing Signal in Human-Model Preference Alignment | - | 0
Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset | - | 0
FANS -- Formal Answer Selection for Natural Language Math Reasoning Using Lean4 | - | 0
Large language models in finance : what is financial sentiment? | - | 0
DSVD: Dynamic Self-Verify Decoding for Faithful Generation in Large Language Models | - | 0
Implicit Bias in LLMs: A Survey | - | 0
MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Code | 0
BatchGEMBA: Token-Efficient Machine Translation Evaluation with Batched Prompting and Prompt Compression | Code | 0
FourierNAT: A Fourier-Mixing-Based Non-Autoregressive Transformer for Parallel Sequence Generation | - | 0
ChatGPT for President! Presupposed content in politicians versus GPT-generated texts | - | 0
DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models | - | 0
Waste Not, Want Not; Recycled Gumbel Noise Improves Consistency in Natural Language Generation | - | 0
Argument Summarization and its Evaluation in the Era of Large Language Models | - | 0
MedUnifier: Unifying Vision-and-Language Pre-training on Medical Data with Vision Generation Task using Discrete Visual Representations | - | 0
Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity | Code | 0
Multi2: Multi-Agent Test-Time Scalable Framework for Multi-Document Processing | - | 0
ChatMol: A Versatile Molecule Designer Based on the Numerically Enhanced Large Language Model | - | 0
Deterministic or probabilistic? The psychology of LLMs as random number generators | - | 0
Advancements in Natural Language Processing for Automatic Text Summarization | - | 0
AutoHete: An Automatic and Efficient Heterogeneous Training System for LLMs | - | 0
Conversational Planning for Personal Plans | - | 0
An Overview of Large Language Models for Statisticians | - | 0
Steganography Beyond Space-Time with Chain of Multimodal AI | - | 0
Synthetic Text Generation for Training Large Language Models via Gradient Matching | - | 0
Grounded Persuasive Language Generation for Automated Marketing | - | 0
Towards Conditioning Clinical Text Generation for User Control | - | 0
Evaluating the Effect of Retrieval Augmentation on Social Biases | - | 0
Sequence-level Large Language Model Training with Contrastive Preference Optimization | - | 0
The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination | - | 0
Good Representation, Better Explanation: Role of Convolutional Neural Networks in Transformer-Based Remote Sensing Image Captioning | - | 0
PPC-GPT: Federated Task-Specific Compression of Large Language Models via Pruning and Chain-of-Thought Distillation | - | 0
A General Pseudonymization Framework for Cloud-Based LLMs: Replacing Privacy Information in Controlled Text Generation | Code | 0
PAPI: Exploiting Dynamic Parallelism in Large Language Model Decoding with a Processing-In-Memory-Enabled Computing System | - | 0
IPAD: Inverse Prompt for AI Detection -- A Robust and Explainable LLM-Generated Text Detector | Code | 0
Scale Up Composed Image Retrieval Learning via Modification Text Generation | - | 0
Enhancing RWKV-based Language Models for Long-Sequence Text Generation | Code | 0
Machine-generated text detection prevents language model collapse | Code | 0
MCQA-Eval: Efficient Confidence Evaluation in NLG with Gold-Standard Correctness Labels | - | 0
eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables | - | 0
NLoRA: Nyström-Initiated Low-Rank Adaptation for Large Language Models | Code | 0
Entropy-UID: A Method for Optimizing Information Density | - | 0
Optimal word order for non-causal text generation with Large Language Models: the Spanish case | - | 0
Token-Level Density-Based Uncertainty Quantification Methods for Eliciting Truthfulness of Large Language Models | Code | 0
Multimodal Quantitative Language for Generative Recommendation | - | 0
Prompting a Weighting Mechanism into LLM-as-a-Judge in Two-Step: A Case Study | - | 0
D.Va: Validate Your Demonstration First Before You Use It | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | - | Unverified
2 | FactT5B | BLEU | 48.37 | - | Unverified
3 | JointGT Baseline | BLEU | 47.51 | - | Unverified
4 | FactJointGT | BLEU | 47.39 | - | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | - | Unverified
6 | T5 | METEOR | 0.12 | - | Unverified
7 | BART | METEOR | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | - | Unverified
2 | partGAN | BLEU-2 | 0.91 | - | Unverified
3 | RankGAN | BLEU-2 | 0.85 | - | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | - | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | - | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | - | Unverified
3 | RelGAN | BLEU-2 | 0.88 | - | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | - | Unverified
5 | RankGAN | BLEU-2 | 0.78 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | - | Unverified
2 | CRFR | Distinct-3 | 0.52 | - | Unverified
3 | KGSF | Distinct-3 | 0.43 | - | Unverified
4 | C2CRS | Distinct-3 | 0.33 | - | Unverified
5 | KBRD | Distinct-3 | 0.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | - | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | - | Unverified
3 | BART | METEOR | 0.3 | - | Unverified
4 | T5 | METEOR | 0.29 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | - | Unverified
2 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | - | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | - | Unverified
4 | Beam search | BLEU-1 | 33.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | - | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | - | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | - | Unverified
2 | TGen | METEOR | 0.15 | - | Unverified
3 | TGen+ | METEOR | 0.15 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | - | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | - | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | - | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | - | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | - | Unverified
2 | SA-VAE | NLL | 327.5 | - | Unverified
3 | Aggressive VAE | NLL | 326.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | - | Unverified