Sentiment analysis of Indonesian hotel reviews: from classical machine learning to deep learning

(1) * Retno Kusumaningrum (Universitas Diponegoro, Indonesia)
(2) Iffa Zainan Nisa (Universitas Diponegoro, Indonesia)
(3) Rizka Putri Nawangsari (Universitas Diponegoro, Indonesia)
(4) Adi Wibowo (Universitas Diponegoro, Indonesia)
*corresponding author


A large number of hotel reviews are now available on the Internet, and they must be analyzed to turn the raw data into actionable information. Deep learning is well suited to this kind of text data, and advances in deep learning have produced many algorithms that can be applied to sentiment analysis tasks. In this study, we compare the performance of classical machine learning algorithms (logistic regression (LR), naïve Bayes (NB), and support vector machine (SVM)) with that of deep learning algorithms such as a convolutional neural network (CNN), using the Word2Vec model, to classify hotel reviews from the Traveloka website as positive or negative. Hyperparameter tuning is applied to both families of methods to determine the parameters that produce the best model. The Word2Vec model uses the skip-gram architecture, hierarchical softmax as the evaluation method, and a vector dimension of 100. The highest average accuracy, 98.08%, was obtained by the CNN with a dropout rate of 0.2, tanh as the convolution activation, softmax as the output activation, and Adam as the optimizer. The findings demonstrate that combining the Word2Vec model with the CNN model yields significantly better accuracy than the classical machine learning methods.
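
To illustrate what the skip-gram setting means in practice, the following minimal sketch generates the (center word, context word) pairs that a skip-gram model is trained on. It is an illustration only and not the authors' implementation: the function name `skipgram_pairs`, the sample review, and the window size are assumptions (the abstract does not state the window size), and the hierarchical softmax training step and the 100-dimensional vectors are not shown.

```python
from typing import List, Tuple


def skipgram_pairs(tokens: List[str], window: int = 2) -> List[Tuple[str, str]]:
    """Return (center, context) training pairs for a skip-gram model.

    `window` is a hypothetical setting; the study's value is not given.
    """
    pairs = []
    for i, center in enumerate(tokens):
        # The context is up to `window` tokens on each side of the center word.
        start = max(0, i - window)
        end = min(len(tokens), i + window + 1)
        for j in range(start, end):
            if j != i:
                pairs.append((center, tokens[j]))
    return pairs


# Hypothetical tokenized review: "kamar bersih dan nyaman"
# ("the room is clean and comfortable").
review = ["kamar", "bersih", "dan", "nyaman"]
pairs = skipgram_pairs(review, window=1)
for center, context in pairs:
    print(center, "->", context)
```

Each pair asks the model to predict a neighboring word from the center word, which is how skip-gram learns vectors that place words used in similar contexts close together. In a real pipeline, a library implementation of Word2Vec would typically be used instead of hand-built pairs.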


sentiment analysis; word2vec; convolutional neural network; classical machine learning; hotel reviews



