TY - GEN
T1 - Text Mining using PrefixSpan constrained by Item Interval and Item Attribute
AU - Sato, Issei
AU - Hirate, Yu
AU - Yamana, Hayato
N1 - Publisher Copyright:
© 2006 IEEE.
PY - 2006
Y1 - 2006
N2 - Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals-The number of items between any two adjacent items in a sequence-And the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.
AB - Applying conventional sequential pattern mining methods to text data extracts many uninteresting patterns, which increases the time to interpret the extracted patterns. To solve this problem, we propose a new sequential pattern mining algorithm by adopting the following two constraints. One is to select sequences with regard to item intervals-The number of items between any two adjacent items in a sequence-And the other is to select sequences with regard to item attributes. Using Amazon customer reviews in the book category, we have confirmed that our method is able to extract patterns faster than the conventional method, and is better able to exclude uninteresting patterns while retaining the patterns of interest.
UR - http://www.scopus.com/inward/record.url?scp=84990879604&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84990879604&partnerID=8YFLogxK
U2 - 10.1109/ICDEW.2006.142
DO - 10.1109/ICDEW.2006.142
M3 - Conference contribution
AN - SCOPUS:84990879604
T3 - ICDEW 2006 - Proceedings of the 22nd International Conference on Data Engineering Workshops
SP - 35
EP - 38
BT - ICDEW 2006 - Proceedings of the 22nd International Conference on Data Engineering Workshops
A2 - Barga, Roger S.
A2 - Zhou, Xiaofang
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 22nd International Conference on Data Engineering Workshops, ICDEW 2006
Y2 - 3 April 2006 through 7 April 2006
ER -