题名

Feature Selection with Test Cost Constraint through a Simulated Annealing Algorithm

DOI

10.6138/JIT.2016.17.6.20141119

作者

Junxia Niu;Hong Zhao;William Zhu

关键词

Cost-sensitive learning ; Constraint satisfaction problem ; Feature selection ; Granular computing ; Test cost

期刊名称

網際網路技術學刊

卷期/出版年月

17卷6期(2016 / 11 / 01)

页次

1133 - 1140

内容语文

英文

中文摘要

Cost-sensitive feature selection is one of the most fundamental problems in data mining applications. In real-world situations, we need to pay test cost for acquiring feature values of objects. Due to limited money or time, we also have a constraint on the total test cost. This issue has been formalized as the feature selection with test cost constraint problem, which is treated as a constraint satisfaction problem. An information gain based heuristic algorithm and a genetic algorithm have been adopted to deal with the problem. However, the two algorithms do not produce the optimal solution in most cases. In this paper, on the one hand, we built a constraint satisfaction problem model for the feature selection with test cost constraint problem. On the other hand, a simulated annealing algorithm is designed to solve the feature selection with test cost constraint problem. The proposed algorithm which takes advantage of both the test cost information and the search potential of the simulated annealing algorithm. It also adjusts atoms according to the constraint to ensure the convergence of the algorithm. These algorithms are compared based on performance and results by eight UCI datasets and two representative test cost distributions. The experimental results show that the proposed algorithm is more effective and efficient than the previous algorithms.

主题分类 基礎與應用科學 > 資訊科學