SOTAVerified

Data-to-Text Generation

A classic problem in natural-language generation (NLG) involves taking structured data, such as a table, as input, and producing text that adequately and fluently describes this data as output. Unlike machine translation, which aims for complete transduction of the sentence to be translated, this form of NLG is usually taken to require addressing (at least) two separate challenges: what to say, the selection of an appropriate subset of the input data to discuss, and how to say it, the surface realization of a generation.

( Image credit: Data-to-Text Generation with Content Selection and Planning )

Papers

Showing 150 of 219 papers

TitleStatusHype
TextBox 2.0: A Text Generation Library with Pre-trained Language ModelsCode3
Prompting for Numerical Sequences: A Case Study on Market Comment GenerationCode2
TaTa: A Multilingual Table-to-Text Dataset for African LanguagesCode2
CoNT: Contrastive Neural Text GenerationCode2
Ontology-Free General-Domain Knowledge Graph-to-Text Generation Dataset Synthesis using Large Language ModelCode1
Bridging the Gap between Different Vocabularies for LLM EnsembleCode1
Prompt Optimization via Adversarial In-Context LearningCode1
Keras GPT Copilot: Integrating the Power of Large Language Models in Deep Learning Model DevelopmentCode1
TabGenie: A Toolkit for Table-to-Text GenerationCode1
ASDOT: Any-Shot Data-to-Text Generation with Pretrained Language ModelsCode1
Leveraging Natural Supervision for Language Representation Learning and GenerationCode1
Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance LearningCode1
NMTScore: A Multilingual Analysis of Translation-based Text Similarity MeasuresCode1
GAP: A Graph-aware Language Model Framework for Knowledge Graph-to-Text GenerationCode1
Neural Pipeline for Zero-Shot Data-to-Text GenerationCode1
Chart-to-Text: A Large-Scale Benchmark for Chart SummarizationCode1
Data-to-text Generation with Variational Sequential PlanningCode1
XAlign: Cross-lingual Fact-to-Text Alignment and Generation for Low-Resource LanguagesCode1
Open Domain Question Answering with A Unified Knowledge InterfaceCode1
Control Prefixes for Parameter-Efficient Text GenerationCode1
Biomedical Data-to-Text Generation via Fine-Tuning TransformersCode1
Plan-then-Generate: Controlled Data-to-Text Generation via PlanningCode1
Improving Encoder by Auxiliary Supervision Tasks for Table-to-Text GenerationCode1
Stage-wise Fine-tuning for Graph-to-Text GenerationCode1
Learning to Reason for Text Generation from Scientific TablesCode1
Data-QuestEval: A Referenceless Metric for Data-to-Text Semantic EvaluationCode1
Structural Adapters in Pretrained Language Models for AMR-to-text GenerationCode1
Structural Information Preserving for Graph-to-Text GenerationCode1
Controlling Hallucinations at Word Level in Data-to-Text GenerationCode1
Data-to-text Generation with Macro PlanningCode1
Data-to-text Generation by Splicing Together Nearest NeighborsCode1
WikiTableT: A Large-Scale Data-to-Text Dataset for Generating Wikipedia Article SectionsCode1
Latent Template Induction with Gumbel-CRFsCode1
Data-to-Text Generation with Iterative Text EditingCode1
Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-trainingCode1
PARENTing via Model-Agnostic Reinforcement Learning to Correct Pathological Behaviors in Data-to-Text GenerationCode1
Chart-to-Text: Generating Natural Language Descriptions for Charts by Adapting the Transformer ModelCode1
KGPT: Knowledge-Grounded Pre-Training for Data-to-Text GenerationCode1
Partially-Aligned Data-to-Text Generation with Distant SupervisionCode1
Investigating Pretrained Language Models for Graph-to-Text GenerationCode1
Text-to-Text Pre-Training for Data-to-Text TasksCode1
GPT-too: A language-model-first approach for AMR-to-text generationCode1
ToTTo: A Controlled Table-To-Text Generation DatasetCode1
Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic FidelityCode1
Few-shot Natural Language Generation for Task-Oriented DialogCode1
Variational Template Machine for Data-to-Text GenerationCode1
Modeling Global and Local Node Contexts for Text Generation from Knowledge GraphsCode1
Revisiting Challenges in Data-to-Text Generation with Fact GroundingCode1
Language Models are Unsupervised Multitask LearnersCode1
Deep Graph Convolutional Encoders for Structured Data to Text GenerationCode1
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Control Prefixes (A1, T5-large)BLEU67.32Unverified
2Control Prefixes (A1, A2, T5-large)BLEU67.15Unverified
3JointGT BaselineBLEU67.08Unverified
4FactT5BBLEU67.04Unverified
5T5B BaselineBLEU67.04Unverified
6FactJointGTBLEU66.89Unverified
7T5-large + Wiki + PositionBLEU66.07Unverified
8HTML (fine-tuning)BLEU65.4Unverified
9T5-smallBLEU65.05Unverified
10TrICy (trK = trk* = 0.24)BLEU64.73Unverified
#ModelMetricClaimedVerifiedStatus
1S_1^RBLEU68.6Unverified
2EDA_CSBLEU67.05Unverified
3TrICy (trK = 0)BLEU66.43Unverified
4SlugBLEU66.19Unverified
5TGenBLEU65.93Unverified
6EDA_CS (TL)BLEU65.8Unverified
7Sys1-PrimaryBLEU65.61Unverified
8ZhangBLEU65.45Unverified
9Self-memoryBLEU65.11Unverified
10GongBLEU64.22Unverified
#ModelMetricClaimedVerifiedStatus
1Control Prefixes (A1, A2, T5-large)BLEU62.27Unverified
2Control Prefixes (A1, T5-large)BLEU61.94Unverified
3T5-large + Wiki + PositionBLEU60.56Unverified
4T5-largeBLEU59.7Unverified
5T5-LargeBLEU57.1Unverified
6HTLM (prefix 0.1%)BLEU56.3Unverified
7DATATUNER_NO_FCBLEU52.9Unverified
8Transformer (Pipeline)BLEU51.68Unverified
#ModelMetricClaimedVerifiedStatus
1Control Prefixes (T5-large)BLEU (Test set)44.15Unverified
2DataTuner_FCBLEU (Test set)43.6Unverified
3TGenBLEU (Test set)40.73Unverified
4LSTMMETEOR (Validation set)0.39Unverified
5TGenMETEOR (Validation set)0.39Unverified
6BARTMETEOR (Validation set)0.37Unverified
7T5METEOR (Validation set)0.37Unverified
#ModelMetricClaimedVerifiedStatus
1HierarchicalEncoder + NR + IRBLEU17.96Unverified
2Hierarchical transformer encoder + conditional copyBLEU17.5Unverified
3Force-CopyBLEU17.26Unverified
4Neural Content Planning + conditional copyBLEU16.5Unverified
5MacroBLEU15.46Unverified
6Encoder-decoder + conditional copyBLEU14.19Unverified
#ModelMetricClaimedVerifiedStatus
1SeqPlanPrecision97.6Unverified
2MacroPrecision97.6Unverified
3Force-CopyPrecision95.4Unverified
4Hierarchical Transformer Encoder + conditional copyPrecision89.46Unverified
5Neural Content Planning + conditional copyPrecision87.47Unverified
6Encoder-decoder + conditional copyPrecision74.8Unverified
#ModelMetricClaimedVerifiedStatus
1T5-3BBLEU49.5Unverified
2LATTICE (T5-base)BLEU48.4Unverified
3BERT-to-BERTBLEU44Unverified
4Pointer GeneratorBLEU41.6Unverified
5NCP+CC (Puduppully et al 2019)BLEU19.2Unverified
6T5METEOR0.36Unverified
#ModelMetricClaimedVerifiedStatus
1Fact-aware embedding with mT5BLEU429.27Unverified
2Bi-lingual mT5BLEU425.88Unverified
3mT5BLEU425Unverified
4Vanilla TransformerBLEU419.9Unverified
5Translate-Output mT5BLEU418.91Unverified
6Graph Attention Network Encoder +Transformer DecoderBLEU418.3Unverified
#ModelMetricClaimedVerifiedStatus
1T5B BaselineBLEU48.47Unverified
2FactT5BBLEU48.37Unverified
3self-mem + new dataBLEU47.76Unverified
4JointGT BaselineBLEU47.51Unverified
5FactJointGTBLEU47.39Unverified
#ModelMetricClaimedVerifiedStatus
1T5-BaseBLEU35.1Unverified
2T5-smallBLEU34.96Unverified
3T2G2BLEU34.91Unverified
4SC-GPT2BLEU30.76Unverified
5HDSABLEU26.48Unverified
#ModelMetricClaimedVerifiedStatus
1Hierarchical Transformer Encoder + conditional copyDLD18.9Unverified
2Neural Content Planning + conditional copyDLD18.58Unverified
3MacroDLD17.7Unverified
4Force-CopyDLD17.26Unverified
5Encoder-decoder + conditional copyDLD8.68Unverified
#ModelMetricClaimedVerifiedStatus
1Hierarchical Transformer Encoder + conditional copyPrecision39.47Unverified
2Force-CopyPrecision34.34Unverified
3Neural Content Planning + conditional copyPrecision34.18Unverified
4MacroPrecision34.1Unverified
5Encoder-decoder + conditional copyPrecision29.49Unverified
#ModelMetricClaimedVerifiedStatus
1SeqPlanBLEU14.29Unverified
2MacroBLEU12.62Unverified
3ENTBLEU11.5Unverified
4Force-CopyBLEU10.5Unverified
#ModelMetricClaimedVerifiedStatus
1SeqPlanDLD22.7Unverified
2MacroDLD21.8Unverified
3Force-CopyDLD21.16Unverified
4ENTDLD20.7Unverified
#ModelMetricClaimedVerifiedStatus
1SeqPlanPrecision95.9Unverified
2MacroPrecision94.4Unverified
3Force-CopyPrecision84.5Unverified
4ENTPrecision81.1Unverified
#ModelMetricClaimedVerifiedStatus
1binmtBLEU score26.35Unverified
2tgenBLEU score21.96Unverified
3massBLEU score17.72Unverified
#ModelMetricClaimedVerifiedStatus
1Force-CopyPrecision49.39Unverified
2SeqPlanPrecision43.3Unverified
3MacroPrecision40.8Unverified
#ModelMetricClaimedVerifiedStatus
1self-mem + new data (random)METEOR46.11Unverified
2self-mem + new data (fixed)METEOR46.07Unverified
#ModelMetricClaimedVerifiedStatus
1Transition based Deep Input LinearizationBLEU80.49Unverified
2GCN + featBLEU0.67Unverified
#ModelMetricClaimedVerifiedStatus
1DataTuner_FCBLEU53.6Unverified
2Bo3BLEU52.1Unverified
#ModelMetricClaimedVerifiedStatus
1mBARTMETEOR0.46Unverified
2mT5METEOR0.29Unverified
#ModelMetricClaimedVerifiedStatus
1mBARTMETEOR0.61Unverified
2mT5METEOR0.18Unverified
#ModelMetricClaimedVerifiedStatus
1StructAdaptBleu48Unverified
#ModelMetricClaimedVerifiedStatus
1T5-largeBLEU45.85Unverified
#ModelMetricClaimedVerifiedStatus
1T5-largeBLEU69.27Unverified
#ModelMetricClaimedVerifiedStatus
1OursBLEU24.56Unverified