SOTAVerified

Automatic Technical Domain Identification

2020-12-01ICON 2020Unverified0· sign in to hype

Hema Ala, Dipti Sharma

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

In this paper we present two Machine Learning algorithms namely Stochastic Gradient Descent and Multi Layer Perceptron to Identify the technical domain of given text as such text provides information about the specific domain. We performed our experiments on Coarse-grained technical domains like Computer Science, Physics, Law, etc for English, Bengali, Gujarati, Hindi, Malayalam, Marathi, Tamil, and Telugu languages, and on fine-grained sub domains for Computer Science like Operating System, Computer Network, Database etc for only English language. Using TFIDF as a feature extraction method we show how both the machine learning models perform on the mentioned languages.

Tasks

Reproductions