Naznin Sultana; Sellappan Palaniappan

Deceptive Opinion Detection Using Machine Learning Techniques

Full Text (PDF, 592KB), PP.1-7

Views: 0 Downloads: 0

Author(s)

Naznin Sultana ^1,2,* Sellappan Palaniappan ¹

1. Department of Information Technology, Malaysia University of Science & Technology, Petaling Jaya, Malaysia

2. Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh

* Corresponding author.

DOI: https://doi.org/10.5815/ijieeb.2020.01.01

Received: 15 Jun. 2019 / Revised: 12 Aug. 2019 / Accepted: 28 Oct. 2019 / Published: 8 Feb. 2020

Index Terms

Natural Language Processing, Spam Re-view, Opinion Mining, Ensemble Learning, Machine Learning

Abstract

Nowadays, online reviews have become a valuable resource for customer decision making before purchasing a product. Research shows that most of the people look at online reviews before purchasing any product. So, customers reviews are now become a crucial part of doing business online. Since review can either promote or demote a product or a service, so buying and selling fake reviews turns into a profitable business for some people now a days. In the past few years, deceptive review detection has attracted significant attention from both the industrial organizations and academic communities. However, the issue remains to be a challenging problem due to the lack of labeled dataset for supervised learning and evaluation. Also, study shows that both the state of the art computational approaches and human readers acquire an error rate of about 35% to 48% in identifying fake reviews. This study thoroughly investigated and analyzed customers’ online reviews for deception detection using different supervised machine learning methods and proposes a machine learning model using stochastic gradient descent algorithm for the detection of spam review. To reduce bias and variance, bagging and boosting approach was integrated into the model. Furthermore, to select the most appropriate features in the feature selection step, some rules using regular expression were also generated. Experiments on hotel review dataset demonstrate the effectiveness of the proposed approach.

Cite This Paper

Naznin Sultana, Sellappan Palaniappan, "Deceptive Opinion Detection Using Machine Learning Techniques", International Journal of Information Engineering and Electronic Business(IJIEEB), Vol.12, No.1, pp. 1-7, 2020. DOI:10.5815/ijieeb.2020.01.01

Reference

[1]K. Adhav, P. S. Z. Gawali, and P. R. Murumkar, “Survey on Online Spam Review Detection Methods,” vol. 5, no. 6, pp. 7875–7876, 2014.
[2]A. Bhowmick and S. M. Hazarika, “Machine Learning for E-mail Spam Filtering: Review,Techniques and Trends,” Jun. 2016.
[3]E. G. Dada, J. S. Bassi, H. Chiroma, S. M. Abdulhamid, A. O. Adetunmbi, and O. E. Ajibuwa, “Machine learning for email spam filtering: review, approaches and open research problems,” Heliyon, vol. 5, no. 6, p. e01802, Jun. 2019.
[4]A. Rastogi and M. Mehrotra, “Opinion Spam Detection in Online Reviews,” J. Inf. Knowl. Manag., vol. 16, no. 04, p. 1750036, Dec. 2017.
[5]M. Crawford, T. M. Khoshgoftaar, J. D. Prusa, A. N. Rich-ter, and H. Al Najada, “Survey of review spam detection using machine learning techniques,” J. Big Data, vol. 2, no. 1, p. 23, Dec. 2015.
[6]N. Jindal and B. Liu, “Analyzing and detecting review spam,” in Proceedings - IEEE International Conference on Data Mining, ICDM, 2007, pp. 547–552.
[7]E. P. Lim, V. A. Nguyen, N. Jindal, B. Liu, and H. W. Lauw, “Detecting product review spammers using rating behaviors,” in International Conference on Information and Knowledge Management, Proceedings, 2010, pp. 939–948.
[8]G. Fei, A. Mukherjee, B. Liu, M. Hsu, M. Castellanos, and R. Ghosh, “Exploiting burstiness in reviews for review spammer detection,” in Proceedings of the 7th International Conference on Weblogs and Social Media, ICWSM 2013, 2013, pp. 175–184.
[9]S. Xie, G. Wang, S. Lin, and P. S. Yu, “Review spam de-tection via temporal pattern discovery,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2012, pp. 823–831.
[10]C. L. Lai, K. Q. Xu, R. Y. K. Lau, Y. Li, and L. Jing, “To-ward a language modeling approach for consumer review spam detection,” in Proceedings - IEEE International Con-ference on E-Business Engineering, ICEBE 2010, 2010, pp. 1–8.
[11]M. Ott, Y. Choi, C. Cardie, and J. T. Hancock, “Finding deceptive opinion spam by any stretch of the imagination,” in ACL-HLT 2011 - Proceedings of the 49th Annual Meet-ing of the Association for Computational Linguistics: Hu-man Language Technologies, 2011, vol. 1, pp. 309–319.
[12]M. Ott, C. Cardie, and J. T. Hancock, “Negative deceptive opinion spam,” in NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference, 2013, pp. 497–501.
[13]H. Sun, A. Morales, and X. Yan, “Synthetic review spam-ming and defense,” in Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2013, vol. Part F1288, pp. 1088–1096.
[14]A. Mukherjee, V. Venkataraman, B. Liu, and N. Glance, “What yelp fake review filter might be doing?,” in Pro-ceedings of the 7th International Conference on Weblogs and Social Media, ICWSM 2013, 2013, pp. 409–418.

International Journal of Information Engineering and Electronic Business (IJIEEB)