SOTAVerified

Technological taxonomies for hypernym and hyponym retrieval in patent texts

2022-11-14Code Available0· sign in to hype

You Zuo, Yixuan Li, Alma Parias García, Kim Gerdes

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This paper presents an automatic approach to creating taxonomies of technical terms based on the Cooperative Patent Classification (CPC). The resulting taxonomy contains about 170k nodes in 9 separate technological branches and is freely available. We also show that a Text-to-Text Transfer Transformer (T5) model can be fine-tuned to generate hypernyms and hyponyms with relatively high precision, confirming the manually assessed quality of the resource. The T5 model opens the taxonomy to any new technological terms for which a hypernym can be generated, thus making the resource updateable with new terms, an essential feature for the constantly evolving field of technological terminology.

Tasks

Reproductions