Abstract
This paper presents a method for discriminating between personal and non-personal web pages, which can support surveys of personal opinions about products and services. The proposed method extracts subjective expressions from each page and scores the page by quantitatively evaluating its subjectivity. We evaluated the method on 1200 web pages collected from four categories: products, tourist spots, restaurants, and movies. Compared with categorisation by a general search engine, the proposed method performed significantly better in every category.
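As a rough illustration of the extract-then-score idea described in the abstract, the following minimal Python sketch scores a page by the fraction of its sentences that contain a subjective cue word. It is not the paper's method: the cue lexicon `SUBJECTIVE_CUES`, the function names, the sentence splitter, and the 0.5 threshold are all assumptions made for this example, since the abstract does not specify how expressions are extracted or how the score is computed.

```python
import re

# Hypothetical cue lexicon, assumed for illustration only; the abstract
# does not list the subjective expressions the paper actually uses.
SUBJECTIVE_CUES = {
    "i", "my", "love", "loved", "hate", "hated",
    "great", "terrible", "recommend", "disappointed",
}


def split_sentences(text):
    """Naive sentence splitter on '.', '!' and '?' (an assumption)."""
    return [s.strip() for s in re.split(r"[.!?]+", text) if s.strip()]


def subjectivity_score(text):
    """Return the fraction of sentences containing at least one cue word.

    Returns 0.0 for text with no sentences.
    """
    sentences = split_sentences(text)
    if not sentences:
        return 0.0
    hits = 0
    for sentence in sentences:
        words = set(re.findall(r"[a-z']+", sentence.lower()))
        if words & SUBJECTIVE_CUES:
            hits += 1
    return hits / len(sentences)


def is_personal(text, threshold=0.5):
    """Label a page as personal if its score reaches the assumed threshold."""
    return subjectivity_score(text) >= threshold


review = "I loved this camera. The battery is great. It weighs 300 grams."
spec = "The camera has a 20 megapixel sensor. It weighs 300 grams."
print(is_personal(review), is_personal(spec))
```

In this sketch the review-style text scores higher than the specification-style text because two of its three sentences contain cue words. A real system would presumably need a much richer set of subjective expressions and a threshold tuned per category.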
Original language | English |
---|---|
Pages (from-to) | 62-77 |
Number of pages | 16 |
Journal | International Journal of Business Intelligence and Data Mining |
Volume | 4 |
Issue number | 1 |
DOIs | |
Publication status | Published - 2009 |
Externally published | Yes |
Keywords
- Document classification
- Personal web pages
- Subjective expressions
ASJC Scopus subject areas
- Management Information Systems
- Statistics, Probability and Uncertainty
- Information Systems and Management