英文摘要
|
Neural network, as a popular approach in data mining, usually has better learning results with relatively high accuracy. It provides good fault-tolerant ability for handling data with noises, and its network structure can also presents the complicated relationships among attributes. However, such black-boxed type of neural network process lacks the ability of explanation to offer the users with comprehensibly manageable knowledge, and the applications of neural network are occasionally restricted. In this paper, a rule induction algorithm is employed to retrieve the explicit rules for interpret the learning results from neural networks. Furthermore, by considering the misclassification costs in the retrieval process, the retrieved rules would be more realistic to practical uses. The proposed approach is based on PRISM algorithm proposed by Cendrowska, and uses the methods of Adacost, Metacost, and information entropy to consider the misclassification costs. An empirical investigation is performed by utilizing g the UCI-ML database to verify the effectiveness of the proposed approach.
|
参考文献
|
-
Boz, O.(2002).Extracting decision trees from trained neural networks.Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.
-
Cendrowska, J.(1987).PRISM: An algorithm for inducing modular rules.International Journal of Man-Machine Studies,27(4),349-370.
-
Chan, P.,Stolfo, S.(1998).Towards scalable learning with non-uniform class and cost distributions: A case study in credit card fraud detection.Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining.
-
Cohen, W. W.(1995).Fast effective rule induction.Proceedings of the Twelfth International Conference on Machine Learning.
-
Craven, M. W.,Shavlik, J. W.(1996).Extracting tree-structured representations of trained networks.Advances in Neural Information Processing Systems,8,24-30.
-
Domingos, P.(1999).Metacost: A general method for making classifiers cost-sensitive.Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining.
-
Drummond, C.,Holte, R.(2000).Exploiting the cost (in)sensitivity of decision tree splitting criteria.Proceedings of the Seventeenth International Conference on Machine Learning.
-
Fan, W.,Stolfo, S. J.,Zhang, J.,Chan, P. K.(1999).AdaCost: Misclassification cost- sensitive boosting.Proceedings of the Sixteenth International Conference on Machine Learning.
-
Fu, L.(1998).A neural-network model for learning domain rules based on its activation function characteristics.IEEE Transactions on Neural Networks,9(5),787-795.
-
Fu, X,Wang, L.(2001).Rule extraction by genetic algorithms based on a simplified RBF neural network.Proceedings of the 2001 Congress on Evolutionary Computation.
-
Han, J.,Kamber, M.(2001).Data Mining: Concepts and Techniques.CA:Morgan Kaufmann.
-
Ikizler, N.(2002).Technical Report BU-CE-0208Technical Report BU-CE-0208,Bilkent University.
-
Liu, H,Srtiono, R.(1997).Feature selection via discretization of numeric attributes.IEEE Transaction on Knowledge and Data,9(4),642-645.
-
Norton, S.W.(1989).Generating better decision trees.Proceedings of the Eleventh International Joint Conference on Artificial Intelligence.
-
Nunez, M.(1991).The use of background knowledge in decision tree induction.Machine Learning,6,231-250.
-
Provest, F.,Fawcett, T.,Kohavi, R.(1998).The case against accuracy estimation for comparing induction algorithms.Proceedings of the Fifteenth International Conference on Machine Learning.
-
Setiono, R.,Leow, W. K.,Zurada, J. M.(2002).Extraction of rules from artificial neural networks for nonlinear regression.IEEE Transactions on Neural Networks,13(3),564-577.
-
Tan, M.(1991).Cost-sensitive reinforcement learning for adaptive classification and control.Proceedings of the Eighth International Workshop on Machine Learning.
-
Thrun, S. B.(1995).Extracting rules from artificial neural networks with distributed representations.Advances in Neural Information Processing Systems,7,505-512.
-
Tickle, A. B.,Golea, M.,Hayward, R.,Diederich, J.(1997).The truth is in there: Current issues in extracting rules from trained feed forward artificial neural networks.Proceedings of the International Conference on Neural Networks.
-
Ting, K. M.,Zheng, Z.(1998).Boosting trees for cost-sensitive classifications.Proceedings of the Tenth European Conference on Machine Learning.
-
Tsukimoto, H.(2000).Extracting rules from trained neural networks.IEEE Transactions On Neural Networks,11(2),377-389.
-
Turney, P.(1995).Cost-sensitive classification: empirical evaluation of a hybrid genetic decision tree induction algorithm.Journal of Artificial Intelligence Research,2,369-409.
-
Turney, P.(2000).Types of cost in inductive concept learning.Proceedings of the Cost-Sensitive Learning Works hop at the Seventeenth International Conference on Machine Learning.
-
Witten, I. H.,Frank, E.(2000).Data Mining: Practical Machine Learning Tools And Techniques With Java Implementations.CA:Morgan Kaufmann.
-
Zhou, Z. H.,Jiang, Y.,Chen, S. F.(2003).Extracting symbolic rules from trained neural network ensembles.AI Communications,16(1),3-15.
-
Zubek, V. B.,Dietterich, T. G.(2002).Pruning improves heuristic search for cost-sensitive learning.Proceedings of the Nineteenth International Conference on Machine Learning.
|