Abstract
We developed a simple method of improving the accuracy of rating prediction using feature words extracted from customer reviews. Many rating predictors work well for a small and dense dataset of customer reviews. However, a practical dataset tends to be large and sparse, because it often includes too many products for each customer to buy and evaluate. Data sparseness reduces prediction accuracy. To improve accuracy, we reduced the dimension of the feature vector using feature words extracted by analyzing the relationship between ratings and accompanying review comments instead of using ratings. We applied our method to the Pranking algorithm and evaluated it on a corpus of golf course reviews supplied by a Japanese e-commerce company. We found that by successfully reducing data sparse-ness, our method improves prediction accuracy as measured using RankLoss.
Original language | English |
---|---|
Title of host publication | SIGIR'11 - Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval |
Pages | 1205-1206 |
Number of pages | 2 |
DOIs | |
Publication status | Published - 2011 |
Externally published | Yes |
Event | 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'11 - Beijing Duration: 2011 Jul 24 → 2011 Jul 28 |
Other
Other | 34th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR'11 |
---|---|
City | Beijing |
Period | 11/7/24 → 11/7/28 |
Keywords
- Rating prediction
- Review mining
- Sentiment analysis
ASJC Scopus subject areas
- Information Systems