TY - GEN
T1 - Hierarchical association rule mining in large and dense databases using genetic network programming
AU - Gonzales, Eloy
AU - Shimada, Kaoru
AU - Mabu, Shingo
AU - Hirasawa, Kotaro
AU - Hu, Jinglu
PY - 2007/12/1
Y1 - 2007/12/1
N2 - In this paper we propose a new hierarchical method to extract association rules from large and dense datasets using Genetic Network Programming (GNP) considering a real world database with a huge number of attributes. It uses three ideas. First, the large database is divided into many small datasets. Second, these small datasets are independently processed by the conventional GNP-based mining method (CGNP) in parallel. This level of processing is called Local Level. Finally, new genetic operations are carried out for small datasets considered as individuals in order to improve the number of rules extracted and their quality as well. This level of processing is called Global Level. The amount of small datasets is also important especially for avoiding the overload and improving the general performance; we find the minimum amount of files needed to extract important association rules. The proposed method shows its effectiveness in simulations using a real world large and dense database.
AB - In this paper we propose a new hierarchical method to extract association rules from large and dense datasets using Genetic Network Programming (GNP) considering a real world database with a huge number of attributes. It uses three ideas. First, the large database is divided into many small datasets. Second, these small datasets are independently processed by the conventional GNP-based mining method (CGNP) in parallel. This level of processing is called Local Level. Finally, new genetic operations are carried out for small datasets considered as individuals in order to improve the number of rules extracted and their quality as well. This level of processing is called Global Level. The amount of small datasets is also important especially for avoiding the overload and improving the general performance; we find the minimum amount of files needed to extract important association rules. The proposed method shows its effectiveness in simulations using a real world large and dense database.
KW - Association rules
KW - Data mining
KW - Genetic network programming
KW - Parallel processing
UR - http://www.scopus.com/inward/record.url?scp=50249187805&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=50249187805&partnerID=8YFLogxK
U2 - 10.1109/SICE.2007.4421446
DO - 10.1109/SICE.2007.4421446
M3 - Conference contribution
AN - SCOPUS:50249187805
SN - 4907764286
SN - 9784907764289
T3 - Proceedings of the SICE Annual Conference
SP - 2686
EP - 2693
BT - SICE Annual Conference, SICE 2007
T2 - SICE(Society of Instrument and Control Engineers)Annual Conference, SICE 2007
Y2 - 17 September 2007 through 20 September 2007
ER -