arXiv:2010.04260 [cs.CL]

Fake Reviews Detection through Analysis of Linguistic Features

Faranak Abri, Luis Felipe Gutierrez, Akbar Siami Namin, Keith S. Jones, David R. W. Sears

Published 2020-10-08, Version 1

Online reviews play an integral part in the success or failure of businesses. Prior to purchasing services or goods, customers first review the online comments submitted by previous customers. However, it is possible to artificially boost or hinder some businesses by posting counterfeit and fake reviews. This paper explores a natural language processing approach to identifying fake reviews. We present a detailed analysis of linguistic features for distinguishing fake from trustworthy online reviews. We study 15 linguistic features and measure their significance and importance to the classification schemes employed in this study. Our results indicate that fake reviews tend to include more redundant terms and pauses, and generally contain longer sentences. Applying several machine learning classification algorithms, we were able to discriminate fake from real reviews with high accuracy using these linguistic features.
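To make the three reported signals concrete (redundant terms, pauses, and sentence length), here is a minimal Python sketch of how such features might be computed from raw review text. It is an illustration under stated assumptions, not the authors' feature extractor: the abstract does not list the 15 features or define them, so the function name `linguistic_features`, the `PAUSE_MARKERS` word list, and the three formulas below are all hypothetical choices.

```python
import re
from collections import Counter

# Hypothetical filler words used as a proxy for "pauses"; not taken from the paper.
PAUSE_MARKERS = {"um", "uh", "er", "hmm"}


def linguistic_features(review: str) -> dict:
    """Compute three illustrative features for one review (assumed definitions)."""
    # Crude sentence split on runs of terminal punctuation; empty pieces are dropped.
    sentences = [s for s in re.split(r"[.!?]+", review) if s.strip()]
    # Lowercased word tokens made of letters and apostrophes.
    words = re.findall(r"[a-z']+", review.lower())
    counts = Counter(words)
    n_words = len(words)

    # Redundancy: fraction of tokens that repeat a word already seen in the review.
    redundancy = (n_words - len(counts)) / n_words if n_words else 0.0
    # Pauses: filler words plus literal "..." ellipses in the raw text.
    pause_count = sum(counts[m] for m in PAUSE_MARKERS) + review.count("...")
    # Average sentence length in words.
    avg_sentence_length = n_words / len(sentences) if sentences else 0.0

    return {
        "redundancy": redundancy,
        "pause_count": pause_count,
        "avg_sentence_length": avg_sentence_length,
    }
```

A feature dictionary like this could be computed per review and passed to any standard classifier; which classifiers and which feature definitions the paper actually uses is not stated in the abstract.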

Comments: The pre-print of a paper to appear in the proceedings of the IEEE International Conference on Machine Learning Applications (ICMLA 2020), 11 pages, 3 figures, 5 tables
Categories: cs.CL, cs.IR
Related articles:
arXiv:1809.02637 [cs.CL] (Published 2018-09-07)
Neural Generation of Diverse Questions using Answer Focus, Contextual and Linguistic Features
arXiv:1811.02750 [cs.CL] (Published 2018-11-07)
The relationship between linguistic expression and symptoms of depression, anxiety, and suicidal thoughts: A longitudinal study of blog content
arXiv:2006.00593 [cs.CL] (Published 2020-05-31)
BPGC at SemEval-2020 Task 11: Propaganda Detection in News Articles with Multi-Granularity Knowledge Sharing and Linguistic Features based Ensemble Learning