Data mining method from text database

Masahiro Kawano, Junzo Watada*, Takayuki Kawaura

*Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Recently, various types of data are expected to get in information processing according to multi-media technology. Especially, linguistic data are employed in fuzzy systems as well as fuzzy numerical values. In this paper we propose a text minig method based on fuzzy quantification model. In the process of text mining, we will pursue the following steps: 1) Sentences included in a text in Japanese are broken down into words. 2) It is possible to realize common understanding using fuzzy thesaurus that enables us to translate words into synonyms or into upper concepts. In this paper, we employ the method to translate words using Chinese characters or continuous letters of Katakana more then one katakana letter (Japanese alphabet letter) into keywords. The method realizes the high speed of processing without any dictionary for separating words. Fuzzy multivariate analysis is employed to analyze such processed data and to abstract a latent mutual related structure under the data. In other words, we abstract the knowledge from the given text data. At the end we apply the method to mining the text information of libraries and Web pages distributed over a web network and discussing about the application to Kansei engineering.

    Original languageEnglish
    Title of host publicationLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Pages1122-1128
    Number of pages7
    Volume3683 LNAI
    Publication statusPublished - 2005
    Event9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005 - Melbourne
    Duration: 2005 Sept 142005 Sept 16

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume3683 LNAI
    ISSN (Print)03029743
    ISSN (Electronic)16113349

    Other

    Other9th International Conference on Knowledge-Based Intelligent Information and Engineering Systems, KES 2005
    CityMelbourne
    Period05/9/1405/9/16

    Keywords

    • Fuzzy quantification analysis
    • Library data
    • Text mining

    ASJC Scopus subject areas

    • Computer Science(all)
    • Biochemistry, Genetics and Molecular Biology(all)
    • Theoretical Computer Science

    Fingerprint

    Dive into the research topics of 'Data mining method from text database'. Together they form a unique fingerprint.

    Cite this