A comparative analysis of major online review platforms: Implications for social media analytics in hospitality and tourism
Introduction
There is a growing literature on social media analytics that combines Web crawling, computational linguistics, machine learning and statistical techniques to collect, analyze, and interpret the so-called big data for business purposes such as tracking trending topics and popular sentiments as well as identifying opinions and beliefs about products (Fan and Gordon, 2014, Lazer et al., 2009). Particularly, online consumer reviews, widely considered a rich data source that reflects consumer experiences and evaluation of products, have been studied to understand a range of research problems in hospitality and tourism (Schuckert, Liu, & Law, 2015b). Studies using online reviews usually employ a sample of review (and related) data, large or small, to extract features or measures that allow the researcher to detect, describe or predict patterns that are meaningful from theoretical or practical perspectives. This literature complements conventional approaches that primarily rely on surveys, personal interviews and other communication-based methods and represents a promising research direction by taking advantage of the abundant, readily available data resources (Xiang, Schwartz, Gerdes, & Uysal, 2015). While this line of research has generated novel insights into hospitality and tourism management, existing studies are limited in that 1) they tend to use a single data source for online reviews and 2) the quality of data is largely anecdotal and often based upon the popularity of the websites from which the data were collected, which substantially limits their generalizability and contribution to knowledge.
With this in mind, this study comparatively examines three major online review platforms, namely TripAdvisor, Expedia, and Yelp, in terms of information quality related to online reviews in these websites with the goal to provide a basis for understanding the methodological challenges and for identifying opportunities for the development of social media analytics in hospitality and tourism. The rest of the paper is organized as follows: the next section, Research Background, reviews related literature to provide the motivations for the present study. In Research Design and Analytical Framework, we outline our methodological approaches and describe key measures and methods used to assess the three platforms with specific research questions. In Data Collection and Analysis, we describe the data collection process and explain, in details, the text analytics procedures to develop key metrics to describe review characteristics as well as statistical analyses conducted to compare and contrast the three platforms. Then, research findings are presented followed by a discussion on implications for both research and practice. Finally, conclusions are drawn, and limitations and directions for future research are discussed.
Section snippets
Research background
Big data analytics has been touted as a new research paradigm that utilizes diverse sources of data and analytical tools to make inferences and predictions about reality (Boyd and Crawford, 2012, Mayer-Schönberger and Cukier, 2013). Particularly, with increasingly powerful natural language processing and machine learning capabilities, textual contents from the Web provide a huge shared cognitive and cultural context and, thus, have been analyzed in many application domains (Halevy, Norvig, &
Research design and analytical framework
In order to understand the methodological challenges related to data quality in social media analytics, we devised a study to assess information quality related to online reviews on three major platforms, namely TripAdvisor, Expedia, and Yelp. The rationale for selecting these platforms was three-fold: 1) they are widely used by online consumers (Gligorijevic, 2016, Yoo et al., 2016); 2) each of them represents a fairly unique “species” of review platforms (i.e., TripAdvisor as the largest
Data collection and analysis
We applied the social media analytics procedure (e.g., Abrahams et al., 2015; Fan & Gordon, 2014) to answer the above research questions. We first collected relevant data from the three platforms. Then, the unstructured data were pre-processed and key metrics including online reviews' linguistic features, sentiment, semantic features, and perceived helpfulness were developed and compared among the platforms. Finally, a set of regression analyses were conducted to examine the relationships
Findings
In this section, we first present the diagnostic analysis in terms of the extent to which the three platforms represent the hotel product from the supply side perspective. Then, we describe the characteristics of online reviews using the metrics we developed to measure information quality of these platforms. Finally, we present the results of regression analyses assessing the relationships between review characteristics and both rating and helpfulness.
Discussion and implications
Motivated by the lack of understanding of data quality in social media-related studies, we applied a series of text analytics techniques to “dissect” three major review platforms in hospitality and tourism. This study shows that TripAdvisor, Expedia, and Yelp, while all incorporating consumer reviews as primary social knowledge, are indeed distinct from each other on a variety of aspects. In terms of representing the supply of the hotel product, there appears to be huge discrepancies between
Conclusions, limitations, and future research
Information technology creates new structures and dynamics in the market; therefore, it is imperative for us to gain a solid understanding of the changing reality, either from knowledge or business perspective (Werthner and Klein, 1999, Xiang et al., 2008). In this study we showed that online review data drawn from three dominant platforms on a specific industry sector and from a specific geographic region can be considerably different in both content and structure. By demonstrating the
Acknowledgements
This study was partially supported by Natural Science Foundation of China Grant# 71531013.
Zheng Xiang, Ph.D., is Associate Professor in the Department of Hospitality and Tourism Management at Virginia Tech, USA. His research interests include travel information search, social media marketing, and business analytics for the tourism and hospitality industries. He is a recipient of Emerging Scholar of Distinction award by the International Academy for the Study of Tourism. He is a board member of International Federation for IT and Travel & Tourism (IFITT) and editorial board member of
References (63)
- et al.
In search of patterns among travellers' hotel ratings in TripAdvisor
Tourism Management
(2016) - et al.
Analysis of the perceived value of online tourism reviews: Influence of readability and reviewer characteristics
Tourism Management
(2016) - et al.
Why do travelers trust TripAdvisor? Antecedents of trust towards consumer-generated media and its influence on recommendation adoption and word of mouth
Tourism Management
(2015) - et al.
In TripAdvisor we trust: Rankings, calculative regimes and abstract systems
Accounting, Organizations and Society
(2011) - et al.
Evaluating content quality and helpfulness of online product reviews: The interplay of reviews helpfulness vs. review content
Electronic Commerce Research and Applications
(2012) - et al.
What makes a useful online review? Implication for travel product websites
Tourism Management
(2015) - et al.
Ecotourism experiences reported online: Classification of satisfaction attributes
Tourism Management
(2012) - et al.
Rating tourism and hospitality journals
Tourism Management
(2006) - et al.
Booking. com: The unexpected scoring system
Tourism Management
(2015) - et al.
Marketing decisions, customer reviews, and business performance: The use of the Toprural website by Spanish rural lodging establishments
Tourism Management
(2014)
Asymmetric effects of online consumer reviews
Annals of Tourism Research
The interactive effects of online reviews on the determinants of swiss hotel performance: A neural network analysis
Tourism Management
Perceived ‘usefulness’ of online consumer reviews: An exploratory investigation across three services categories
Electronic Commerce Research and Applications
Centroid-based summarization of multiple documents
Information Processing & Management
A segmentation of online reviews by language groups: How English and non-English speakers rate hotels differently
International Journal of Hospitality Management
Reconfiguring relations of accountability: Materialization of social media in the travel sector
Accounting, Organizations and Society
Online travel reviews as persuasive communication: The effects of content type, source, and certification logos on consumer behavior
Tourism Management
What can big data and text analytics tell us about hotel guest experience and satisfaction?
International Journal of Hospitality Management
The business value of online consumer reviews and management response to hotel performance
International Journal of Hospitality Management
The impact of online user reviews on hotel room sales
International Journal of Hospitality Management
Dimensions of lodging guest satisfaction among guests with mobility challenges: A mixed-method analysis of web-based texts
Tourism Management
The impact of e-word-of-mouth on the online popularity of restaurants: A comparison of consumer reviews and editor reviews
International Journal of Hospitality Management
The power of expert identity: How website-recognized expert reviews influence travelers' online rating behavior
Tourism Management
An integrated text analytic framework for product defect discovery
Production and Operations Management
Development of a tool for measuring and analyzing computer user satisfaction
Management Science
Evaluating loyalty constructs among hotel reward program members using eWOM
Journal of Hospitality & Tourism Research
Latent dirichlet allocation
The Journal of Machine Learning Research
Critical questions for big data: Provocations for a cultural, technological, and scholarly phenomenon
Information, Communication & Society
Persuasion: Insights from the self-validation hypothesis
Understanding review helpfulness as a function of reviewer reputation, review rating, and review depth
Journal of the Association for Information Science and Technology
Cited by (0)
Zheng Xiang, Ph.D., is Associate Professor in the Department of Hospitality and Tourism Management at Virginia Tech, USA. His research interests include travel information search, social media marketing, and business analytics for the tourism and hospitality industries. He is a recipient of Emerging Scholar of Distinction award by the International Academy for the Study of Tourism. He is a board member of International Federation for IT and Travel & Tourism (IFITT) and editorial board member of several international journals including Journal of Business Research, Journal of Travel Research, and Journal of Hospitality and Tourism Research.
Qianzhou Du is a Ph.D. student in the Department of Business Information Technology at Virginia Tech, USA. His research interests include business intelligence, text analytics, social media analytics, crowd sourcing, and business analytics for the tourism and hospitality industries. He is a recipient of ICTAS doctoral scholarship of Virginia Tech.
Yufeng Ma received the BE degree in Computer Science from Wuhan University, China in 2012. Currently he is a Ph.D. student in the Department of Computer Science at Virginia Tech, USA, where he is co-advised by Dr. Patrick Fan and Dr. Edward. A. Fox. His research interests include data mining, topic modeling in text mining, computer vision, deep learning and artificial intelligence in general.
Weiguo Fan is a full professor of the Department of Accounting and Information Systems and Director of the Center for Business Intelligence and Analytics in the Pamplin College of Business. He specializes in business intelligence, data and text analytics, social network analysis, social media analytics, business data mining using both internal and external data, complex system analysis and modeling, and consumer behavior modeling and analysis. He has applied analytics in financial statement risk analysis, search engine optimization, targeted and precision marketing, sentiment and opinion mining for product quality control, competitive intelligence, and deception detection in large scale business transactions.