SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 14011450 of 5335 papers

TitleStatusHype
"My Answer is C": First-Token Probabilities Do Not Match Text Answers in Instruction-Tuned Language ModelsCode0
Generalizing Reward Modeling for Out-of-Distribution Preference LearningCode0
Typographic Text Generation with Off-the-Shelf Diffusion Model0
CEV-LM: Controlled Edit Vector Language Model for Shaping Natural Language GenerationsCode0
UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language ModelsCode0
MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning0
Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative DecodingCode2
GCOF: Self-iterative Text Generation for Copywriting Using Large Language Model0
From Self-Attention to Markov Models: Unveiling the Dynamics of Generative Transformers0
Hallucinations or Attention Misdirection? The Path to Strategic Value Extraction in Business Using Large Language Models0
A Multimodal In-Context Tuning Approach for E-Commerce Product Description GenerationCode1
CHATATC: Large Language Model-Driven Conversational Agents for Supporting Strategic Air Traffic Flow Management0
A Simple but Effective Approach to Improve Structured Language Model Output for Information ExtractionCode1
OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification0
FinBen: A Holistic Financial Benchmark for Large Language ModelsCode4
Exploring the Impact of Table-to-Text Methods on Augmenting LLM-based Question Answering with Domain Hybrid Data0
Bias in Language Models: Beyond Trick Tests and Toward RUTEd Evaluation0
A User-Friendly Framework for Generating Model-Preferred Prompts in Text-to-Image SynthesisCode0
A Touch, Vision, and Language Dataset for Multimodal AlignmentCode2
OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data0
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual ExamplesCode1
Standardize: Aligning Language Models with Expert-Defined Standards for Content GenerationCode0
High-quality Data-to-Text Generation for Severely Under-Resourced Languages with Out-of-the-box Large Language ModelsCode0
HU at SemEval-2024 Task 8A: Can Contrastive Learning Learn Embeddings to Detect Machine-Generated Text?Code0
WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More0
Pride and Prejudice: LLM Amplifies Self-Bias in Self-RefinementCode0
Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as AgentsCode1
LLM can Achieve Self-Regulation via Hyperparameter Aware Generation0
ToBlend: Token-Level Blending With an Ensemble of LLMs to Attack AI-Generated Text Detection0
PEDANTS: Cheap but Effective and Interpretable Answer EquivalenceCode2
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated TextCode1
Controlled Text Generation for Large Language Model with Dynamic Attribute GraphsCode1
VATr++: Choose Your Words Wisely for Handwritten Text Generation0
Neural paraphrasing by automatically crawled and aligned sentence pairs0
Exploring Precision and Recall to assess the quality and diversity of LLMsCode0
Unlocking Structure Measuring: Introducing PDD, an Automatic Metric for Positional Discourse CoherenceCode0
Quantized Embedding Vectors for Controllable Diffusion Language Models0
Structured Language Generation Model for Robust Structure Prediction0
Long-form evaluation of model editingCode0
SyntaxShap: Syntax-aware Explainability Method for Text GenerationCode0
Exploring the Adversarial Capabilities of Large Language Models0
COLD-Attack: Jailbreaking LLMs with Stealthiness and ControllabilityCode2
Visually Dehallucinative Instruction GenerationCode0
A Systematic Review of Data-to-Text NLG0
Can LLMs Produce Faithful Explanations For Fact-checking? Towards Faithful Explainable Fact-Checking via Multi-Agent Debate0
Intrinsic Task-based Evaluation for Referring Expression Generation0
Label-Efficient Model Selection for Text Generation0
Synthesizing Sentiment-Controlled Feedback For Multimodal Text and Image DataCode0
Prompt Perturbation in Retrieval-Augmented Generation based Large Language Models0
CPSDBench: A Large Language Model Evaluation Benchmark and Baseline for Chinese Public Security Domain0
Show:102550
← PrevPage 29 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified