A Hybrid Approach for Aspect-based Sentiment Analysis: A Case Study of Hotel Reviews

Main Article Content

Khanista Namee*
Jantima Polpinij
Bancha Luaphol

Abstract

This study presents a method of aspect-based sentiment analysis for customer reviews related to hotels. The considered hotel aspects are staff attentiveness, room cleanliness, value for money and convenience of location. The proposed method consists of two main components. The first component is used to assemble relevant sentences for each hotel aspect into relevant clusters of hotel aspects using BM25. We developed a corpus of keywords called the Keywords of Hotel Aspect (KoHA) Corpus, and the keywords of each aspect were used as queries to assemble relevant sentences of each hotel aspect into relevant clusters. Finally, customer review sentences in each cluster were classified into positive and negative classes using sentiment classifiers. Two algorithms, Support Vector Machines (SVM) with a linear and a RBF kernel, and Convolutional Neural Network (CNN) were applied to develop the sentiment classifier models. The model based on SVM with a linear kernel returned better results than other models with an AUC score of 0.87. Therefore, this model was chosen for the sentiment classification stage. The proposed method was evaluated using recall, precision and F1 with satisfactory results at 0.85, 0.87 and 0.86, respectively. Our proposed method provided an overview of customer feelings based on score, and also provided reasons why customers liked or disliked each aspect of the hotel. The best model from the proposed method was used to compare with a state-of-the-art model. The results show that our method increased recall, precision, and F1 scores by 2.44%, 2.50% and 1.84%, respectively.


Keywords: sentiment analysis; aspect level; Word2Vec; support vector machines; convolutional neural network; BM25


*Corresponding author: Tel.: (+66) 0886514997


                                             E-mail: [email protected]


 

Article Details

Section
Original Research Articles

References

Feldman, R., 2013. Techniques and applications for sentiment analysis. Communications of the ACM, 56(4), 82-89.

Karakaya, F. and Ganim Barnes, N., 2017. Impact of online reviews of customer care experience on brand or company selection. Journal of Consumer Marketing, 27(5), 447-457.

Medhat, W., Hassan, A. and Korashy, H., 2017. Sentiment analysis algorithms and applications: A survey. Ain Shams Engineering Journal, 5(4), 1093-1113.

Nakayama, M., 2013. Biases in consumer reviews: Implications for different categories of goods. Proceedings of the International Conference on Information Resources Management (Conf-IRM), Natal, Brazil, May 22-24, 2013, pp. 1-7.

Park, K., Cha, M., and Rhim, E., 2018. Positivity bias in customer satisfaction ratings. Proceedings of the Web Conference 2018, Lyon, France, April 23-27, 2018, pp. 631-638.

Khan, M.T., Durrani, M., Ali, A., Inayat, I., Khalid, S. and Khan, K.H., 2016. Sentiment analysis and the complex natural language. Complex Adaptive Systems Modeling, 4, DOI: 10.1186/s40294-016-0016-9.

Pang, B., Lee, L. and Vaithyanathan, S., 2002. Thumbs up?: sentiment classification using machine learning techniques. Proceedings of the ACL-02 Conference on Empirical Methods in Natural Language Processing, Stroudsburg, USA, July 6, 2002, pp. 79-86.

Polpinij, J. and Ghose, A.K., 2008. An ontology-based sentiment classification methodology for online consumer reviews. Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, Sydney, Australia, December 9-12, 2008, pp. 518-524.

Alamoudi, E.S. and Alamoudi, N.S., 2021. Sentiment classification and aspect-based sentiment analysis on yelp reviews using deep learning and word embeddings. Journal of Decision Systems, 30(2-3), 259-281.

Batista, F. and Batista, R., 2013. Sentiment analysis and topic classification based on binary maximum entropy classifiers. Procesamiento de Lenguaje Natural, 50, 77-84.

Bouazizi, M. and Ohtsuki, T., 2016. Sentiment analysis: From binary to multi-class classification: A pattern-based approach for multi-class sentiment analysis in Twitter. Proceedings of the 2016 IEEE International Conference on Communications (ICC), Kuala Lumpur, Malaysia, May 22-27, 2016, pp. 1-6.

Gaye, B., Zhang, D. and Wulamu, A., 2021. A Tweet sentiment classification approach using a hybrid stacked ensemble technique. Information, 12(9), DOI: 10.3390/info12090374.

Supriya, S.N., Kallimani, V., Prakash, S. and Akki, C.B., 2016. Twitter sentiment analysis using binary classification technique. Proceedings of the International Conference on Nature of Computation and Communication, Rach Gia, Vietnam, March 17-18, 2016, pp. 391-396.

Taboada, M., Brooke, J., Tofiloski, M., Voli, K. and Sted, M., 2011. Lexicon-based methods for sentiment analysis. Association for Computational Linguistics, 37(2), 267-307.

Afzaal, M., Usman, M. and Fong, A., 2018. Predictive aspect-based sentiment classification of online tourist reviews. Journal of Information Science, 45(3), DOI: 10.1177/0165551518 789872.

Arulmurugan, R., Sabarmathi, K.R. and Anandakumar, H., 2019. Classification of sentence level sentiment analysis using cloud machine learning techniques. Cluster Computing, 22, 1199-1209.

Joshi, N. and Itkat, S.A., 2014. A survey on feature level sentiment analysis. International Journal of Computer Science and Information Technologies, 5(4), 5422-5425.

Liu, H., Chatterjee, I., Zhou, M., Lu, X.S. and Abusorrah, A., 2020. Aspect-based sentiment analysis: A survey of deep learning methods. IEEE Transactions on Computational Social Systems, 7(6), DOI: 10.1109/TCSS.2020.3033302.

Meena, A. and Prabhakar, T., 2007. Sentence level sentiment analysis in the presence of conjuncts using linguistic analysis. Proceedings of the European Conference on Information Retrieval (ECIR), Rome, Italy, April 2-5, 2007, pp. 573-580.

Nazir, A., Rao, Y., Wu, L. and Sun, L., 2020. Issues and challenges of aspect-based sentiment analysis: A comprehensive survey. IEEE Transactions on Affective Computing, 13(2), 845-863, DOI: 10.1109/TAFFC.2020.2970399.

Peng, H., Xu, L., Bing, L., Huang, F., Lu, W., and Si, L., 2020. Knowing what, how and why: A near complete solution for aspect-based sentiment analysis. Proceedings of the AAAI Conference on Artificial Intelligence, 34(05), 8600-8607.

Mikolov, T., Sutskever, I., Chen, K., Corrado, G.S. and Dean, J., 2013. Distributed representations of words and phrases and their compositionality. Advances in Neural Information Processing Systems, 2(13), 3111-3119.

Polpinij, J., Srikanjanapert, N. and Sopon, P., 2018. Word2Vec approach for sentiment classification relating to hotel reviews. Advances in Intelligent Systems and Computing, 566(1), 308-316.

Zhang, W., Xu, W., Chen, G. and Guo, J., 2014. A Feature Extraction Method Based on Word Embedding for Word Similarity Computing. Natural Language Processing and Chinese Computing, 496, 160-167.

Bengio, Y. and Grandvalet, Y., 2005. Bias in estimating the nariance of K-fold cross-validation. In: P. Duchesne and B. Remillard, eds. Statistical Modeling and Analysis for Complex Data Problems. Boston: Springer, pp.75-95.

Chen, K., Zhang, Z., Long, J. and Zhang, H., 2016. Turning from tf-idf to tf-igm for term weighting in text classification. Expert Systems with Applications, 66, 245-260.

Joachims, T., 1998. Text categorization with support vector machines: Learning with many relevant features. European Conference on Machine Learning, Chemnitz, Germany, April 21-23, 1998, pp. 137-142.

Mullen, T. and Collier, N., 2004. Sentiment analysis using support vector machines with diverse information sources. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing, Barcelona, Spain, July 25-26, 2004, pp. 412-418.

Patil, G., Galande, V., Kekan, V. and Dange, K., 2014. Sentiment analysis using support vector machine. International Journal of Innovative Research in Computer and Communication Engineering, 2(1), 2607-2612.

Zainuddin, N. and Selamat, A., 2014. Sentiment analysis using support vector machine. Proceedings of the 2014 International Conference on Computer, Communications, and Control Technology (I4CT), Langkawi, Malaysia, September 2-4, 2014, pp. 333-337.

Kaestner, C.A.A., 2013. Support vector machines and kernel functions for text processing. Revista De Informática Teórica E Aplicada, 20(3), 130-154.

Han, S., Qubo, C. and Meng, H., 2012. Parameter selection in SVM with RBF kernel function. 2012 World Automation Congress, Puerto Vallarta, Mexico, June 24-28, 2012, pp. 1-4.

Kim, H. and Jeong, Y.S., 2019. Sentiment classification using convolutional neural network. Applied Science, 9(11), DOI: 10.3390/app9112347.

Sharma, A.K., Chaurasia, S. and Srivastava, D.K., 2021. Sentimental short sentences classification by using CNN deep learning model with fine tuned Word2Vec. Procedia Computer Science, 167, 1139-1147.

Trotman, A., Puurula, A. and Burgess, B., 2014. Improvements to BM25 and language models examined. Proceedings of the 2014 Australasian Document Computing Symposium, Melbourne, Australia, November 27-28, 2014, pp. 58-65.

Yang, C.Z., Du, H.H., Wu, S.S. and Chen, I.X., 2012. Duplication detection for software bug reports based on BM25 term weighting. Proceedings of the Conference on Technologies and Applications of Artificial Intelligence, Tainan, Taiwan, November 16-18, 2012, pp. 33-38.

Janjua, S.H., Siddiqui, G.F., Sindhu, M.A. and Rashid, U., 2021. Multi-level aspect based sentiment classification of Twitter data: using hybrid approach in deep learning. PeerJ Computer Science,7(21), DOI: 10.7717/peerj-cs.433.