SOTAVerified

Text Generation

Text Generation is the task of generating text with the goal of appearing indistinguishable to human-written text. This task is more formally known as "natural language generation" in the literature.

Text generation can be addressed with Markov processes or deep generative models like LSTMs. Recently, some of the most advanced methods for text generation include BART, GPT and other GAN-based approaches. Text generation systems are evaluated either through human ratings or automatic evaluation metrics like METEOR, ROUGE, and BLEU.

Further readings:

( Image credit: Adversarial Ranking for Language Generation )

Papers

Showing 451500 of 5335 papers

TitleStatusHype
Real-time Verification and Refinement of Language Model Text Generation0
GeoPix: Multi-Modal Large Language Model for Pixel-level Image Understanding in Remote Sensing0
The Magnitude of Categories of Texts Enriched by Language Models0
Dual use issues in the field of Natural Language Generation0
Beyond Flat Text: Dual Self-inherited Guidance for Visual Text Generation0
LLMs Reproduce Stereotypes of Sexual and Gender Minorities0
Beyond Sight: Finetuning Generalist Robot Policies with Heterogeneous Sensors via Language Grounding0
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation0
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States0
Beyond Factual Accuracy: Evaluating Coverage of Diverse Factual Information in Long-form Text GenerationCode0
Visual question answering: from early developments to recent advances -- a survey0
SceneVTG++: Controllable Multilingual Visual Text Generation in the Wild0
GIT-CXR: End-to-End Transformer for Chest X-Ray Report Generation0
Personalized Graph-Based Retrieval for Large Language ModelsCode1
Zero-Shot Statistical Tests for LLM-Generated Text Detection using Finite Sample Concentration InequalitiesCode0
A Survey on Large Language Models with some Insights on their Capabilities and Limitations0
Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration DecodingCode1
Safeguarding Large Language Models in Real-time with Tunable Safety-Performance Trade-offs0
ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning0
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking0
Does a Large Language Model Really Speak in Human-Like Language?0
Multi-Modal Video Feature Extraction for Popularity Prediction0
HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models0
Yo'Chameleon: Personalized Vision and Language Generation0
Incremental Dialogue Management: Survey, Discussion, and Implications for HRI0
Large Language Models Are Read/Write Policy-Makers for Simultaneous GenerationCode1
Zero-Shot Strategies for Length-Controllable Summarization0
Chunk-Distilled Language Modeling0
Have We Designed Generalizable Structural Knowledge Promptings? Systematic Evaluation and Rethinking0
AltGen: AI-Driven Alt Text Generation for Enhancing EPUB Accessibility0
Facilitating large language model Russian adaptation with Learned Embedding PropagationCode1
Enhancing Annotated Bibliography Generation with LLM Ensembles0
Disentangling Preference Representation and Text Generation for Efficient Individual Preference AlignmentCode0
MLLM-SUL: Multimodal Large Language Model for Semantic Scene Understanding and Localization in Traffic ScenariosCode0
Multi-Attribute Constraint Satisfaction via Language Model Rewriting0
Improving Factuality with Explicit Working Memory0
RDPM: Solve Diffusion Probabilistic Models via Recurrent Token Prediction0
Characterizations of Language Generation With Breadth0
Assessing Human Editing Effort on LLM-Generated Texts via Compression-Based Edit DistanceCode0
CharGen: High Accurate Character-Level Visual Text Generation Model with MultiModal Encoder0
Emerging Security Challenges of Large Language Models0
GraphAgent: Agentic Graph Language AssistantCode0
Where am I? Cross-View Geo-localization with Natural Language DescriptionsCode2
EMPRA: Embedding Perturbation Rank Attack against Neural Ranking ModelsCode0
ADEQA: A Question Answer based approach for joint ADE-Suspect Extraction using Sequence-To-Sequence Transformers0
Semi-Supervised Adaptation of Diffusion Models for Handwritten Text Generation0
A Large-Scale Simulation on Large Language Models for Decision-Making in Political Science0
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling0
Qwen2.5 Technical ReportCode13
Rethinking Uncertainty Estimation in Natural Language Generation0
Show:102550
← PrevPage 10 of 107Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.74Unverified
2FactT5BBLEU48.37Unverified
3JointGT BaselineBLEU47.51Unverified
4FactJointGTBLEU47.39Unverified
5Control Prefixes (T5-large)METEOR0.41Unverified
6T5METEOR0.12Unverified
7BARTMETEOR0.11Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.95Unverified
2partGANBLEU-20.91Unverified
3RankGANBLEU-20.85Unverified
4RelGAN (100)BLEU-20.85Unverified
5SeqGANBLEU-20.83Unverified
#ModelMetricClaimedVerifiedStatus
1LeakGANBLEU-20.96Unverified
2PPOGANBLEU-20.91Unverified
3RelGANBLEU-20.88Unverified
4SeqGANBLEU-20.86Unverified
5RankGANBLEU-20.78Unverified
#ModelMetricClaimedVerifiedStatus
1UniCRSDistinct-30.65Unverified
2CRFRDistinct-30.52Unverified
3KGSFDistinct-30.43Unverified
4C2CRSDistinct-30.33Unverified
5KBRDDistinct-30.3Unverified
#ModelMetricClaimedVerifiedStatus
1UniLMCIDEr14.92Unverified
2BART (TextBox 2.0)CIDEr12.98Unverified
3BARTMETEOR0.3Unverified
4T5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1Beam search + A*esque (beam)BLEU-134.4Unverified
2Beam search + A*esque (sample)BLEU-134.4Unverified
3Beam search + A*esque (greedy)BLEU-134.3Unverified
4Beam searchBLEU-133.7Unverified
#ModelMetricClaimedVerifiedStatus
1RankGANBLEU-20.81Unverified
2SeqGANBLEU-20.74Unverified
3LeakGANBLEU-20.46Unverified
#ModelMetricClaimedVerifiedStatus
1TGen++METEOR0.17Unverified
2TGenMETEOR0.15Unverified
3TGen+METEOR0.15Unverified
#ModelMetricClaimedVerifiedStatus
1GPT2-124Meval_loss3.12Unverified
2GPT2-81M-LOOPeval_loss3.11Unverified
3GPT2-Hermiteeval_loss2.91Unverified
#ModelMetricClaimedVerifiedStatus
1LLaMA-65B+CFG (zero-shot)Accuracy96.6Unverified
2LLaMA-30B+CFG (zero-shot)Accuracy96.4Unverified
3LLaMA-13B+CFG (zero-shot)Accuracy95.1Unverified
#ModelMetricClaimedVerifiedStatus
1CNN-VAENLL332.1Unverified
2SA-VAENLL327.5Unverified
3Aggressive VAENLL326.7Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-410.2Unverified
#ModelMetricClaimedVerifiedStatus
1STWGAN-GPBLEU-30.62Unverified
#ModelMetricClaimedVerifiedStatus
1PALMROUGE-L41.41Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L64.34Unverified
#ModelMetricClaimedVerifiedStatus
1AEM+AttentionBLEU-114.17Unverified
#ModelMetricClaimedVerifiedStatus
1GPT-4ASR65.1Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)ROUGE-L42.96Unverified
#ModelMetricClaimedVerifiedStatus
1Graph2SeqBLEU22Unverified
#ModelMetricClaimedVerifiedStatus
1WGANGP + DGflowJS-40.19Unverified