Title

Closeness Based New Word Detection Method for Mechanical Design and Manufacturing Area

DOI

10.3966/199115992017102805019

Authors

Qiuyuan Chen;Guang Cheng;Di Li;Jian Zhang

Keywords

information entropy ; left(right) entropy ; logistic regression ; mechanical design ; named entity recognition ; new word detection

Journal

電腦學刊 (Journal of Computers)

Volume and Issue / Publication Date

Vol. 28, No. 5 (2017/10/01)

Pages

210 - 219

Language

English

Abstract

Named entity recognition is widely used in information retrieval, but common methods cannot accurately identify proper nouns from a particular domain. To address named entity recognition in the mechanical design and manufacturing area, this paper proposes a closeness-based method for identifying proper nouns. First, we calculate the entropy of each character, and then define the closeness between two adjacent words based on lexical and statistical features. Finally, we use the logistic regression algorithm to determine the weights in the closeness definition. The proposed method recognizes proper nouns in the mechanical design and manufacturing area more accurately and efficiently.
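
The pipeline described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's exact feature set or closeness formula, so everything here is an assumption made for illustration: `neighbor_entropy` computes the standard left or right Shannon entropy of the characters adjacent to a candidate string, and `closeness` is a hypothetical logistic (sigmoid) combination of features whose weights would, in the paper's setting, be fit by logistic regression rather than chosen by hand.

```python
import math
from collections import Counter


def neighbor_entropy(corpus, term, side):
    """Shannon entropy (bits) of the characters next to `term` in `corpus`.

    side="left" looks at the character before each occurrence,
    side="right" at the character after it. Illustrative helper only.
    """
    counts = Counter()
    start = corpus.find(term)
    while start != -1:
        idx = start - 1 if side == "left" else start + len(term)
        if 0 <= idx < len(corpus):  # skip occurrences at the corpus edge
            counts[corpus[idx]] += 1
        start = corpus.find(term, start + 1)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(-(c / total) * math.log2(c / total) for c in counts.values())


def closeness(features, weights, bias):
    """Hypothetical closeness score: sigmoid of a weighted feature sum.

    `weights` and `bias` are placeholders; the abstract states only that
    logistic regression determines the weights.
    """
    z = bias + sum(w * x for w, x in zip(weights, features))
    return 1.0 / (1.0 + math.exp(-z))


# Toy corpus: "ab" is followed by three different characters.
toy = "abxabyabz"
right_h = neighbor_entropy(toy, "ab", "right")
left_h = neighbor_entropy(toy, "ab", "left")
score = closeness([left_h, right_h], weights=[-1.0, -1.0], bias=2.0)
```

In this toy setup, a low entropy on the side where two adjacent words meet suggests that they tend to occur together, which is why the placeholder weights are negative. The actual features and weights used by the authors may differ.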

Subject Classification: Basic and Applied Sciences > Information Science