TY - GEN
T1 - Mining association rules from databases with continuous attributes using genetic network programming
AU - Taboada, Karla
AU - Gonzales, Eloy
AU - Shimada, Kaoru
AU - Mabu, Shingo
AU - Hirasawa, Kotaro
AU - Hu, Jinglu
PY - 2007
Y1 - 2007
N2 - Most association rule mining algorithms make use of discretization algorithms for handling continuous attributes. Discretization is a process of transforming a continuous attribute value into a finite number of intervals and assigning each interval to a discrete numerical value. However, by means of methods of discretization, it is difficult to get highest attribute interdependency and at the same time to get lowest number of intervals. In this paper we present an association rule mining algorithm that is suited for continuous valued attributes commonly found in scientific and statistical databases. We propose a method using a new graph-based evolutionary algorithm named "Genetic Network Programming (GNP)" that can deal with continues values directly, that is, without using any discretization method as a preprocessing step. GNP represents its individuals using graph structures and evolve them in order to find a solution; this feature contributes to creating quite compact programs and implicitly memorizing past action sequences. In the proposed method using GNP, the significance of the extracted association rule is measured by the use of the chi-squared test and only important association rules are stored in a pool all together through generations. Results of experiments conducted on a real life database suggest that the proposed method provides an effective technique for handling continuous attributes.
AB - Most association rule mining algorithms make use of discretization algorithms for handling continuous attributes. Discretization is a process of transforming a continuous attribute value into a finite number of intervals and assigning each interval to a discrete numerical value. However, by means of methods of discretization, it is difficult to get highest attribute interdependency and at the same time to get lowest number of intervals. In this paper we present an association rule mining algorithm that is suited for continuous valued attributes commonly found in scientific and statistical databases. We propose a method using a new graph-based evolutionary algorithm named "Genetic Network Programming (GNP)" that can deal with continues values directly, that is, without using any discretization method as a preprocessing step. GNP represents its individuals using graph structures and evolve them in order to find a solution; this feature contributes to creating quite compact programs and implicitly memorizing past action sequences. In the proposed method using GNP, the significance of the extracted association rule is measured by the use of the chi-squared test and only important association rules are stored in a pool all together through generations. Results of experiments conducted on a real life database suggest that the proposed method provides an effective technique for handling continuous attributes.
UR - http://www.scopus.com/inward/record.url?scp=79955285507&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79955285507&partnerID=8YFLogxK
U2 - 10.1109/CEC.2007.4424622
DO - 10.1109/CEC.2007.4424622
M3 - Conference contribution
AN - SCOPUS:79955285507
SN - 1424413400
SN - 9781424413409
T3 - 2007 IEEE Congress on Evolutionary Computation, CEC 2007
SP - 1311
EP - 1317
BT - 2007 IEEE Congress on Evolutionary Computation, CEC 2007
T2 - 2007 IEEE Congress on Evolutionary Computation, CEC 2007
Y2 - 25 September 2007 through 28 September 2007
ER -