Insights into TripAdvisor's online reviews: The case of Tehran's hotels

  • Ramina Khorsand
  • , Majid Rafiee
  • , Vahid Kayvanfar*
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

59 Citations (Scopus)

Abstract

User-generated data in TripAdvisor.com consists of considerable amount of useful information that can help managers to provide better services to their customers. This study aims to forecast a new user's rate to a hotel based on information of the hotel and user. To do so, all reviews on all hotels of Tehran on TripAdvisor.com as real data are selected and 8 different supervised machine learning models are applied to the data to select the best method including K-nearest neighbors (KNN), Naïve Bayes, decision tree, logistic regression, support vector machine, neural network, random forest, and gradient boosting. KNN algorithm which uses similarity and distance measures for classification is selected as the best method through conducted comprehensive comparisons, statistical analysis and data-based sensitivity analysis. Since this study investigates an intensive set of data of all hotels in a city in all time, some worthful managerial insights are presented.

Original languageEnglish
Article number100673
JournalTourism Management Perspectives
Volume34
DOIs
Publication statusPublished - 3 Apr 2020
Externally publishedYes

Keywords

  • Customer rating
  • Data mining
  • Hospitality
  • Machine learning
  • Online hotel reviews

Fingerprint

Dive into the research topics of 'Insights into TripAdvisor's online reviews: The case of Tehran's hotels'. Together they form a unique fingerprint.

Cite this