题名 |
Efficiently Mining Frequent Closed Itemsets by Eliminating Data Redundancies |
DOI |
10.29767/ECS.200503.0003 |
作者 |
Fan-Chen Tseng;Ching-Chi Hsu;Kuo-Sheng Fu |
关键词 |
Data Mining, Frequent Closed Itemset ; Transaction Pattern List ; Data redundancy ; Frequent Pattern List (FPL) |
期刊名称 |
Electronic Commerce Studies |
卷期/出版年月 |
3卷1期(2005 / 03 / 31) |
页次 |
39 - 55 |
内容语文 |
英文 |
英文摘要 |
Recently, data mining has been applied in business information and intelligence systems for discovering interesting patterns and knowledge to support decision making processes. One of the most basic and important tasks of data mining is the mining of frequent itemsets, which are sets of items frequently purchased by customers. Many methods have been proposed for this problem. However, mining the complete set of frequent itemsets often leads to a huge solution space. Fortunately, this problem can be reduced to the mining of Frequent Closed Itemsets (FCIs), which results in a much smaller yet representative set of purchase patterns of the customers. Still, there are redundancies in the databases that can be eliminated to enhance both space and time efficiency. In this paper, we propose a novel data structure, the Transaction Pattern List (TPL), for eliminating data redundancies, and design the algorithm TPLFCI-Mining for mining FCIs efficiently with the TPL. Our algorithm is evaluated under more rigorous conditions than previously proposed methods. Experimental results show that our method is efficient for both sparse and dense databases, and is scalable for large databases even at low support thresholds. |
主题分类 |
基礎與應用科學 >
資訊科學 社會科學 > 經濟學 |
参考文献 |
|
被引用次数 |