SOTAVerified

Towards Automatically Extracting UML Class Diagrams from Natural Language Specifications

2022-10-26Code Available0· sign in to hype

Song Yang, Houari Sahraoui

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In model-driven engineering (MDE), UML class diagrams serve as a way to plan and communicate between developers. However, it is complex and resource-consuming. We propose an automated approach for the extraction of UML class diagrams from natural language software specifications. To develop our approach, we create a dataset of UML class diagrams and their English specifications with the help of volunteers. Our approach is a pipeline of steps consisting of the segmentation of the input into sentences, the classification of the sentences, the generation of UML class diagram fragments from sentences, and the composition of these fragments into one UML class diagram. We develop a quantitative testing framework specific to UML class diagram extraction. Our approach yields low precision and recall but serves as a benchmark for future research.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
UML Classes With Specsthree-step-originalExact Match0.17Unverified

Reproductions