From reviews to emotions: Analysing Bragança’s tourism attractions on TripAdvisor
Published Online: Dec 31, 2024
Page range: 299 - 311
Received: Jan 29, 2024
Accepted: Apr 17, 2024
DOI: https://doi.org/10.2478/ejthr-2024-0022
Keywords
© 2024 Elaine Scalabrini et al., published by Sciendo
This work is licensed under the Creative Commons Attribution 4.0 International License.
Over the past decade, sentiment analysis has emerged as a pivotal tool in tourism-related texts, driven by the sheer volume of tourist attractions and the wealth of online information. Tourists increasingly turn to travel websites to access specific information that often eludes standard evaluations of tourist attractions. Forums particularly illuminate specific information needs and their ties to potential destinations. Among these platforms, TripAdvisor has become a favoured choice for posting reviews, ratings, and facilitating online bookings. In this context, this study aims to analyse and assess sentiment in reviews sourced from the online platform TripAdvisor, focusing on tourist attractions in the northern Portuguese destination of Bragança. The research spotlights the disparity between qualitative and quantitative rankings. The study also underscores the importance of data pre-processing, including removing irrelevant information and stop words. Pre-processing was crucial in refining sentiment prediction accuracy, highlighting the differentiated roles of these words in context and meaning. Despite utilising advanced techniques such as tokenisation, TF-IDF weighting, logistic regression, and n-grams, the study’s models encountered challenges in achieving high accuracy in sentiment prediction. Even the incorporation of bigrams did not yield substantial performance improvements, with the models frequently inclined to overestimate negative and positive sentiments.