A rough-set-based two-class classifier for large imbalanced dataset

Junzo Watada, Lee Chuan Lin, Lei Ding, Mohd Ibrahim Shapiai, Lim Chun Chew, Zuwairie Ibrahim, Lee Wen Jau, Marzuki Khalid

    Research output: Contribution to journal › Article › peer-review

    8 Citations (Scopus)


    The objective of this paper is to provide a rough-set-based two-class classifier approach for classifying samples in large, imbalanced datasets. A database holds a great deal of hidden knowledge that can be used in decision making to support commerce, research and other activities. Prediction is a further form of data analysis: it enables us to build a data model from existing data and to predict future trends in the data. In this paper, a method consisting of data scaling, rough sets analysis and a support vector machine with a radial basis function kernel (SVM-RBF) is used to classify a large and imbalanced data set obtained from the semiconductor industry.
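The three stages named in the abstract (data scaling, rough sets analysis, SVM-RBF) can be illustrated with a minimal sketch. The abstract does not give the authors' actual procedure, so everything below is an assumption made for illustration: min-max scaling, equal-width discretization, the rough-set dependency degree as the attribute-relevance measure, the toy data, and all function names. Only the RBF kernel itself is shown; training a full SVM would normally be left to an existing library.

```python
import math
from collections import defaultdict

def min_max_scale(rows):
    """Scale each column to [0, 1]; constant columns map to 0.0."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    highs = [max(c) for c in cols]
    return [
        [(v - lo) / (hi - lo) if hi > lo else 0.0
         for v, lo, hi in zip(row, lows, highs)]
        for row in rows
    ]

def discretize(rows, bins=2):
    """Map scaled values in [0, 1] to integer bins (assumed equal-width)."""
    return [[min(int(v * bins), bins - 1) for v in row] for row in rows]

def dependency_degree(rows, labels, attrs):
    """Rough-set dependency degree: the fraction of samples whose
    equivalence class under `attrs` contains a single label."""
    classes = defaultdict(list)
    for row, y in zip(rows, labels):
        classes[tuple(row[a] for a in attrs)].append(y)
    consistent = sum(len(ys) for ys in classes.values() if len(set(ys)) == 1)
    return consistent / len(rows)

def rbf_kernel(x, z, gamma=1.0):
    """RBF kernel: exp(-gamma * squared Euclidean distance)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))

# Hypothetical toy data: attribute 0 separates the classes, attribute 1 does not.
raw = [[0.0, 5.0], [1.0, 9.0], [9.0, 5.0], [10.0, 9.0]]
labels = [0, 0, 1, 1]

scaled = min_max_scale(raw)
disc = discretize(scaled)

# Keep only attributes that, on their own, fully determine the label.
kept = [a for a in range(len(raw[0]))
        if dependency_degree(disc, labels, [a]) == 1.0]
reduced = [[row[a] for a in kept] for row in scaled]

# Kernel similarity between the first and last sample on the reduced attributes.
similarity = rbf_kernel(reduced[0], reduced[-1])
```

In this sketch the rough-set step acts as a filter that shrinks the attribute set before any kernel computation, which is one plausible reason to place it ahead of an SVM on a large data set. Handling class imbalance (for example through class weights) is not shown.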

    Original language: English
    Pages (from-to): 641-651
    Number of pages: 11
    Journal: Smart Innovation, Systems and Technologies
    Publication status: Published - 2010


    • Imbalanced data
    • RBF kernel function
    • Rough sets analysis
    • SVM classifier

    ASJC Scopus subject areas

    • Computer Science(all)
    • Decision Sciences(all)

