SOTAVerified

Question Generation

The goal of Question Generation is to generate a valid and fluent question from a given passage and a target answer. Question Generation has many applications, including powering automatic tutoring systems, improving the performance of Question Answering models, and enabling chatbots to lead a conversation.

Source: Generating Highly Relevant Questions
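
To make the task definition concrete, here is a minimal sketch of what one answer-aware Question Generation instance might look like and how a passage and target answer could be combined into a single model input. The `QGExample` class, the `build_model_input` helper, and the `<hl>` highlight marker are all hypothetical choices made for this illustration. They are not taken from any of the papers or benchmarks listed on this page.

```python
from dataclasses import dataclass


@dataclass
class QGExample:
    """One answer-aware question generation instance (hypothetical layout)."""
    passage: str   # source text the question must be grounded in
    answer: str    # target answer span, expected to occur in the passage
    question: str  # reference question, i.e. the generation target


def build_model_input(passage: str, answer: str, hl: str = "<hl>") -> str:
    """Wrap the first occurrence of the answer span in highlight markers.

    The <hl> marker is an assumed convention for this sketch only.
    """
    start = passage.find(answer)
    if start < 0:
        raise ValueError("answer span not found in passage")
    end = start + len(answer)
    return f"{passage[:start]}{hl} {answer} {hl}{passage[end:]}"


example = QGExample(
    passage="Marie Curie won the Nobel Prize in Physics in 1903.",
    answer="1903",
    question="When did Marie Curie win the Nobel Prize in Physics?",
)
model_input = build_model_input(example.passage, example.answer)
print(model_input)
```

A sequence-to-sequence model would then be trained to map `model_input` to `example.question`. The benchmark metrics below (BLEU, METEOR, ROUGE) score the generated question against such a reference.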

Papers

Showing 221-230 of 664 papers

| Title | Status | Hype |
|---|---|---|
| Automated Educational Question Generation at Different Bloom's Skill Levels using Large Language Models: Strategies and Evaluation | Code | 0 |
| Improving Socratic Question Generation using Data Augmentation and Preference Optimization | Code | 0 |
| Harvesting Paragraph-Level Question-Answer Pairs from Wikipedia | Code | 0 |
| How Should Agents Ask Questions For Situated Learning? An Annotated Dialogue Corpus | Code | 0 |
| CARETS: A Consistency And Robustness Evaluative Test Suite for VQA | Code | 0 |
| Expanding, Retrieving and Infilling: Diversifying Cross-Domain Question Generation with Flexible Templates | Code | 0 |
| Answer-Driven Visual State Estimator for Goal-Oriented Visual Dialogue | Code | 0 |
| CAUS: A Dataset for Question Generation based on Human Cognition Leveraging Large Language Models | Code | 0 |
| IDK-MRC: Unanswerable Questions for Indonesian Machine Reading Comprehension | Code | 0 |
| Harnessing Structured Knowledge: A Concept Map-Based Approach for High-Quality Multiple Choice Question Generation with Effective Distractors | Code | 0 |

Benchmark Results

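| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ERNIE-GENLARGE (beam size=5) | BLEU-4 | 25.41 | | Unverified |
| 2 | BART (TextBox 2.0) | BLEU-4 | 25.08 | | Unverified |
| 3 | ProphetNet + ASGen | BLEU-4 | 24.44 | | Unverified |
| 4 | UniLMv2 | BLEU-4 | 24.43 | | Unverified |
| 5 | ProphetNet + syn. mask + localness | BLEU-4 | 24.37 | | Unverified |
| 6 | ProphetNet | BLEU-4 | 23.91 | | Unverified |
| 7 | UniLM + ASGen | BLEU-4 | 23.7 | | Unverified |
| 8 | UniLM | BLEU-4 | 22.78 | | Unverified |
| 9 | BERTSQG | BLEU-4 | 22.17 | | Unverified |
| 10 | Selector & NQG++ | BLEU-4 | 15.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MDN | BLEU-1 | 65.1 | | Unverified |
| 2 | coco-Caption [Karpathy and Li, 2014] | BLEU-1 | 62.5 | | Unverified |
| 3 | Max (Yang, 2015) | BLEU-1 | 59.4 | | Unverified |
| 4 | Sample (Yang, 2015) | BLEU-1 | 38.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FactJointGT | METEOR | 36.21 | | Unverified |
| 2 | JointGT | METEOR | 36.08 | | Unverified |
| 3 | FactT5B | METEOR | 35.72 | | Unverified |
| 4 | T5B | METEOR | 35.64 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FactT5B | BLEU | 46.1 | | Unverified |
| 2 | JointGT | BLEU | 45.95 | | Unverified |
| 3 | T5B | BLEU | 44.51 | | Unverified |
| 4 | FactJointGT | BLEU | 43.61 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | JointGT | METEOR | 37.69 | | Unverified |
| 2 | FactJointGT | METEOR | 37.55 | | Unverified |
| 3 | FactT5B | METEOR | 37.39 | | Unverified |
| 4 | T5B | METEOR | 37.35 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BART fine-tuned on FairytaleQA | ROUGE-L | 0.53 | | Unverified |
| 2 | BART fine-tuned on NarrativeQA and FairytaleQA | ROUGE-L | 0.52 | | Unverified |
| 3 | BART fine-tuned on NarrativeQA | ROUGE-L | 0.44 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniPoll | ROUGE-1 | 49.6 | | Unverified |
| 2 | T5 | ROUGE-1 | 44.46 | | Unverified |
| 3 | Dual Dec | ROUGE-1 | 38.24 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Info-HCVAE | QAE | 37.18 | | Unverified |
| 2 | HCVAE | QAE | 31.45 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Info-HCVAE | QAE | 71.18 | | Unverified |
| 2 | HCVAE | QAE | 69.46 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Info-HCVAE | QAE | 35.45 | | Unverified |
| 2 | HCVAE | QAE | 30.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MDN | BLEU-1 | 36 | | Unverified |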