Dissertation
Predicting ratings of Amazon reviews - Techniques for imbalanced datasets
Authors: --- --- ---
Year: 2017 Publisher: Liège Université de Liège (ULiège)

Abstract

The goal of this dissertation is to predict a user’s numerical rating from the text of their review. To do so, supervised machine learning techniques, and more specifically text classification, are used.
Three distinct approaches are presented: binary classification, which predicts whether the rating of a review is low or high, and multi-class classification and logistic regression, which aim to predict the exact value of the rating for each review. Moreover, three different classifiers (Naïve Bayes, Support Vector Machine and Random Forest) are trained and tested on two different datasets from Amazon. These datasets fall into two major product categories, experience products and search products, and are characterized by an imbalanced class distribution. We overcome this issue by applying sampling techniques to even out the class distributions. Finally, the performance of those classifiers is assessed with evaluation metrics including precision, recall and F1-score.
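The abstract does not say which sampling technique or tooling was used. As a minimal sketch only, the following assumes random oversampling (duplicating minority-class reviews until all classes are the same size) and computes precision, recall and F1-score for one class by hand. The function names and the toy reviews and ratings are hypothetical and are not taken from the dissertation.

```python
import random
from collections import Counter, defaultdict


def random_oversample(samples, labels, seed=0):
    """Duplicate minority-class samples at random until every class
    has as many samples as the largest class (assumed technique)."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for sample, label in zip(samples, labels):
        by_class[label].append(sample)
    target = max(len(group) for group in by_class.values())
    out_samples, out_labels = [], []
    for label, group in by_class.items():
        # Draw the missing samples with replacement from the same class.
        extra = [rng.choice(group) for _ in range(target - len(group))]
        for sample in group + extra:
            out_samples.append(sample)
            out_labels.append(label)
    return out_samples, out_labels


def per_class_scores(y_true, y_pred, positive):
    """Precision, recall and F1-score with one class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Hypothetical toy data: ratings skewed toward 5 stars.
reviews = ["great", "love it", "perfect", "works well", "broke fast", "ok"]
ratings = [5, 5, 5, 5, 1, 3]

balanced_reviews, balanced_ratings = random_oversample(reviews, ratings)
print(Counter(balanced_ratings))  # every rating now appears 4 times

# Hypothetical true and predicted ratings for five held-out reviews.
y_true = [5, 5, 1, 3, 1]
y_pred = [5, 1, 1, 3, 5]
print(per_class_scores(y_true, y_pred, positive=1))
```

In a setup like this, resampling would normally be applied to the training split only, so that the test set keeps the original imbalanced distribution and the per-class scores reflect realistic conditions.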
Our results show that the two most successful classifiers are Naïve Bayes and SVM, with a slight advantage for the latter on both datasets. Binary classification shows quite good results, while making more precise predictions (i.e. on a scale from 1 to 5) is a significantly harder task. Nevertheless, these results are still acceptable.
More practically, our approach enables user feedback to be automatically expressed on a numerical scale, which eases the consumer’s decision process prior to making a purchase. This can in turn be extended to various other situations where no numerical rating system is available, for instance comments on YouTube or Twitter.
