SOTAVerified

Text Generation

Text generation is the task of producing text that appears indistinguishable from human-written text. In the literature, this task is more formally known as "natural language generation" (NLG).

Text generation can be addressed with Markov processes or deep generative models such as LSTMs. More recent methods include pretrained Transformer models such as BART and GPT, as well as GAN-based approaches. Text generation systems are evaluated either through human ratings or with automatic metrics such as METEOR, ROUGE, and BLEU.
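As a rough illustration of the Markov-process approach mentioned above, the sketch below builds a first-order (bigram) model from a toy corpus and samples from it. The corpus, the function names `build_bigram_model` and `generate`, and the fixed seed are all assumptions made for this example; it is not taken from any of the listed papers.

```python
import random
from collections import defaultdict


def build_bigram_model(tokens):
    """Map each token to the list of tokens observed right after it."""
    successors = defaultdict(list)
    for current, following in zip(tokens, tokens[1:]):
        successors[current].append(following)
    return successors


def generate(successors, start, max_tokens=10, seed=0):
    """Sample a sequence by repeatedly drawing a successor of the last token."""
    rng = random.Random(seed)
    output = [start]
    while len(output) < max_tokens:
        candidates = successors.get(output[-1])
        if not candidates:  # dead end: the last token was never followed by anything
            break
        # Repeated successors appear several times in the list, so a uniform
        # draw is proportional to the observed bigram counts.
        output.append(rng.choice(candidates))
    return output


# Toy corpus, hypothetical and for illustration only.
corpus = "the cat sat on the mat and the dog sat on the rug".split()
model = build_bigram_model(corpus)
print(" ".join(generate(model, start="the")))
```

Each generated token depends only on the token before it, which is why such models tend to lose coherence over longer spans and why neural approaches are generally preferred.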


(Image credit: Adversarial Ranking for Language Generation)

Papers

Showing 1701–1750 of 5335 papers

Title | Status | Hype
Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments | | 0
Can Language Models Take A Hint? Prompting for Controllable Contextualized Commonsense Inference | | 0
Reward-RAG: Enhancing RAG with Reward Driven Supervision | | 0
Attention in Large Language Models Yields Efficient Zero-Shot Re-Rankers | | 0
Efficient Length-Generalizable Attention via Causal Retrieval for Long-Context Language Modeling | | 0
Discrete Copula Diffusion | | 0
GADFA: Generator-Assisted Decision-Focused Approach for Opinion Expressing Timing Identification | | 0
Conformal Generative Modeling with Improved Sample Efficiency through Sequential Greedy Filtering | | 0
Exploring Gen-AI applications in building research and industry: A review | | 0
Are LLMs Aware that Some Questions are not Open-ended? | | 0
What is the Role of Large Language Models in the Evolution of Astronomy Research? | | 0
Beyond Single Concept Vector: Modeling Concept Subspace in LLMs with Gaussian Distribution | Code | 0
ThreatGram 101 - Extreme Telegram Replies Data with Threat Levels | Code | 0
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Code | 0
Human Bias in the Face of AI: The Role of Human Judgement in AI Generated Text Evaluation | | 0
Natural Language Generation for Visualizations: State of the Art, Challenges and Future Directions | | 0
See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning | Code | 0
TrojVLM: Backdoor Attack Against Vision Language Models | | 0
On the Power of Decision Trees in Auto-Regressive Language Modeling | | 0
Experimental Evaluation of Machine Learning Models for Goal-oriented Customer Service Chatbot with Pipeline Architecture | | 0
Hit the Sweet Spot! Span-Level Ensemble for Large Language Models | | 0
JoyType: A Robust Design for Multilingual Visual Text Creation | | 0
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation | | 0
EgoLM: Multi-Modal Language Model of Egocentric Motions | | 0
DisGeM: Distractor Generation for Multiple Choice Questions with Span Masking | Code | 0
Trustworthy AI: Securing Sensitive Data in Large Language Models | | 0
Evaluation of Large Language Models for Summarization Tasks in the Medical Domain: A Narrative Review | | 0
Probing Omissions and Distortions in Transformer-based RDF-to-Text Models | | 0
Application of AI-based Models for Online Fraud Detection and Analysis | | 0
AXCEL: Automated eXplainable Consistency Evaluation using LLMs | | 0
Accumulator-Aware Post-Training Quantization | | 0
Overview of the First Shared Task on Clinical Text Generation: RRG24 and "Discharge Me!" | | 0
Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation | | 0
Qualitative Insights Tool (QualIT): LLM Enhanced Topic Modeling | | 0
Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | | 0
A Comprehensive Survey of Bias in LLMs: Current Landscape and Future Directions | | 0
Finetuning LLMs for Comparative Assessment Tasks | | 0
Enabling Efficient On-Device Fine-Tuning of LLMs Using Only Inference Engines | | 0
Towards Efficient and Robust VQA-NLE Data Generation with Large Vision-Language Models | Code | 0
Advancing Video Quality Assessment for AIGC | | 0
Backtracking Improves Generation Safety | | 0
Loop Neural Networks for Parameter Sharing | | 0
JoyHallo: Digital human model for Mandarin | | 0
'Since Lawyers are Males..': Examining Implicit Gender Bias in Hindi Language Generation by LLMs | | 0
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Code | 0
Leveraging Knowledge Graphs and LLMs to Support and Monitor Legislative Systems | | 0
Unlocking Memorization in Large Language Models with Dynamic Soft Prompting | | 0
Mitigating Unsafe Feedback with Learning Constraints | | 0
LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks | | 0
ABHINAW: A method for Automatic Evaluation of Typography within AI-Generated Images | Code | 0
Page 35 of 107

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5B Baseline | BLEU | 48.74 | | Unverified
2 | FactT5B | BLEU | 48.37 | | Unverified
3 | JointGT Baseline | BLEU | 47.51 | | Unverified
4 | FactJointGT | BLEU | 47.39 | | Unverified
5 | Control Prefixes (T5-large) | METEOR | 0.41 | | Unverified
6 | T5 | METEOR | 0.12 | | Unverified
7 | BART | METEOR | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.95 | | Unverified
2 | partGAN | BLEU-2 | 0.91 | | Unverified
3 | RankGAN | BLEU-2 | 0.85 | | Unverified
4 | RelGAN (100) | BLEU-2 | 0.85 | | Unverified
5 | SeqGAN | BLEU-2 | 0.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeakGAN | BLEU-2 | 0.96 | | Unverified
2 | PPOGAN | BLEU-2 | 0.91 | | Unverified
3 | RelGAN | BLEU-2 | 0.88 | | Unverified
4 | SeqGAN | BLEU-2 | 0.86 | | Unverified
5 | RankGAN | BLEU-2 | 0.78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniCRS | Distinct-3 | 0.65 | | Unverified
2 | CRFR | Distinct-3 | 0.52 | | Unverified
3 | KGSF | Distinct-3 | 0.43 | | Unverified
4 | C2CRS | Distinct-3 | 0.33 | | Unverified
5 | KBRD | Distinct-3 | 0.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniLM | CIDEr | 14.92 | | Unverified
2 | BART (TextBox 2.0) | CIDEr | 12.98 | | Unverified
3 | BART | METEOR | 0.3 | | Unverified
4 | T5 | METEOR | 0.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Beam search + A*esque (beam) | BLEU-1 | 34.4 | | Unverified
2 | Beam search + A*esque (sample) | BLEU-1 | 34.4 | | Unverified
3 | Beam search + A*esque (greedy) | BLEU-1 | 34.3 | | Unverified
4 | Beam search | BLEU-1 | 33.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RankGAN | BLEU-2 | 0.81 | | Unverified
2 | SeqGAN | BLEU-2 | 0.74 | | Unverified
3 | LeakGAN | BLEU-2 | 0.46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TGen++ | METEOR | 0.17 | | Unverified
2 | TGen | METEOR | 0.15 | | Unverified
3 | TGen+ | METEOR | 0.15 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT2-124M | eval_loss | 3.12 | | Unverified
2 | GPT2-81M-LOOP | eval_loss | 3.11 | | Unverified
3 | GPT2-Hermite | eval_loss | 2.91 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LLaMA-65B+CFG (zero-shot) | Accuracy | 96.6 | | Unverified
2 | LLaMA-30B+CFG (zero-shot) | Accuracy | 96.4 | | Unverified
3 | LLaMA-13B+CFG (zero-shot) | Accuracy | 95.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CNN-VAE | NLL | 332.1 | | Unverified
2 | SA-VAE | NLL | 327.5 | | Unverified
3 | Aggressive VAE | NLL | 326.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 10.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | STWGAN-GP | BLEU-3 | 0.62 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PALM | ROUGE-L | 41.41 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 64.34 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AEM+Attention | BLEU-1 | 14.17 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 | ASR | 65.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | ROUGE-L | 42.96 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Graph2Seq | BLEU | 22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | WGANGP + DGflow | JS-4 | 0.19 | | Unverified