Predictive models of hotel booking cancellation: a semi-automated analysis of the literature
DOI:
https://doi.org/10.18089/tms.2019.15011Keywords:
Data Science, Forecast, Literature review, Prediction, Revenue ManagementAbstract
This study sought to combine data science tools and capabilities with human judgement and interpretation in order to demonstrate how semiautomatic analysis of the literature can contribute to identifying and synthesising research findings and topics about booking cancellation forecasting. The study also focused on recording in detail the analysis’s full experimental procedure to encourage other researchers to conduct automated literature reviews in order to understand more fully the current tendencies in their field of study. The data were obtained through a keyword search in Scopus and Web of Science databases. The methodology presented not only diminishes human bias but also enhances data visualisation and text mining techniques’ ability to facilitate abstraction, expedite analysis and improve literature reviews. The results show that, despite the importance of forecasting booking cancellations to understanding net demand and improving cancellation and overbooking policies, further research on this subject is needed.
References
Ali, N. B., & Usman, M. (2018). Reliability of search in systematic reviews: Towards a quality assessment framework for the automated-search strategy. Information and Software Technology, 99, 133–147. https://doi.org/10.1016/j.infsof.2018.02.002
Al-Safadi, E. B., & Al-Naffouri, T. Y. (2012). Peak reduction and clipping mitigation in OFDM by augmented compressive sensing. IEEE Transactions on Signal Processing, 60(7), 3834–3839. https://doi.org/10.1109/TSP.2012.2193396
Antonio, N., Almeida, A., & Nunes, L. (2017a). Predicting hotel booking cancellation to decrease uncertainty and increase revenue. Tourism & Management Studies, 13(2), 25–39. https://doi.org/10.18089/tms.2017.13203
Antonio, N., Almeida, A., & Nunes, L. (2017b). Predicting hotel bookings cancellation with a machine learning classification model. In Proceedings from the 16th IEEE International Conference on Machine Learning and Applications (pp. 1049–1054). Cancun, Mexico: IEEE. https://doi.org/10.1109/ICMLA.2017.00-11
Antonio, N., Almeida, A. de, & Nunes, L. (2017c). Using data science to predict hotel booking cancellations. In P. Vasant & K. M (Eds.), Handbook of Research on Holistic Optimization Techniques in the Hospitality, Tourism, and Travel Industry (pp. 141–167). Hershey, PA, USA: Business Science Reference.
Arun, R., Suresh, V., Madhavan, C. E. V., & Murthy, M. N. N. (2010). On finding the natural number of topics with Latent Dirichlet Allocation: Some observations. In Advances in Knowledge Discovery and Data Mining (pp. 391–402). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13657-3_43
Azadeh, S. S., Labib, R., & Savard, G. (2013). Railway demand forecasting in revenue management using neural networks. International Journal of Revenue Management, 7(1), 18. https://doi.org/10.1504/IJRM.2013.053358
Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022.
Bragge, J., Relander, S., Sunikka, A., & Mannonen, P. (2007). Enriching literature reviews with computer-assisted research mining. Case: profiling group support systems research (pp. 243a-243a). IEEE. https://doi.org/10.1109/HICSS.2007.209
Calheiros, A. C., Moro, S., & Rita, P. (2017). Sentiment classification of consumer-generated online reviews using topic modeling. Journal of Hospitality Marketing & Management, 0(0), 1–19. https://doi.org/10.1080/19368623.2017.1310075
Chen, C.-C. (2016). Cancellation policies in the hotel, airline and restaurant industries. Journal of Revenue and Pricing Management, 15(3–4), 270–275. https://doi.org/10.1057/rpm.2016.9
Chiang, W.-C., Chen, J. C., & Xu, X. (2007). An overview of research on revenue management: current issues and future research. International Journal of Revenue Management, 1(1), 97–128.
Cirillo, C., Bastin, F., & Hetrakul, P. (2018). Dynamic discrete choice model for railway ticket cancellation and exchange decisions. Transportation Research Part E: Logistics and Transportation Review, 110, 137–146. https://doi.org/10.1016/j.tre.2017.12.004
Delen, D., & Crossland, M. D. (2008). Seeding the survey and analysis of research literature with text mining. Expert Systems with Applications, 34(3), 1707–1720. https://doi.org/10.1016/j.eswa.2007.01.035
Denizci Guillet, B., & Mohammed, I. (2015). Revenue management research in hospitality and tourism: A critical review of current literature and suggestions for future research. International Journal of Contemporary Hospitality Management, 27(4), 526–560. https://doi.org/10.1108/IJCHM-06-2014-0295
Fabbri, S., Hernandes, E., Di Thommazo, A., Belgamo, A., Zamboni, A., & Silva, C. (2013). Using information visualization and text mining to facilitate the conduction of systematic literature reviews. In J. Cordeiro, L. A. Maciaszek, & J. Filipe (Eds.), Enterprise Information Systems (Vol. 141, pp. 243–256). Berlin, Heidelberg: Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-40654-6_15
Feinerer, I., & Hornik, K. (2017). tm: Text mining package (Version 0.7-3). Retrieved from https://CRAN.R-project.org/package=tm
Fellows, I. (2014). wordcloud: Word clouds (Version 2.5). Retrieved from https://CRAN.R-project.org/package=wordcloud
Feng, L., Chiam, Y. K., & Lo, S. K. (2017). Text-mining techniques and tools for systematic literature reviews: A systematic literature review. In 2017 24th Asia-Pacific Software Engineering Conference (APSEC) (pp. 41–50). https://doi.org/10.1109/APSEC.2017.10
Gayar, N. F. E., Saleh, M., Atiya, A., El-Shishiny, H., Zakhary, A. A. Y. F., & Habib, H. A. A. M. (2011). An integrated framework for advanced hotel revenue management. International Journal of Contemporary Hospitality Management, 23(1), 84–98. https://doi.org/10.1108/09596111111101689
Grun, B., & Hornik, K. (2011). topicmodels: An R Package for fitting topic Models. Journal of Statistical Software, 40(11), 1–30. https://doi.org/10.18637/jss.v040.i13
Guerreiro, J., Rita, P., & Trigueiros, D. (2016). A text mining-based review of cause-related marketing literature. Journal of Business Ethics, 139(1), 111–128. https://doi.org/10.1007/s10551-015-2622-4
Guo, X., Dong, Y., & Ling, L. (2016). Customer perspective on overbooking: The failure of customers to enjoy their reserved services, accidental or intended? Journal of Air Transport Management, 53, 65–72. https://doi.org/10.1016/j.jairtraman.2016.01.001
Haneem, F., Kama, N., Ali, R., & Selamat, A. (2017). Applying data analytics approach in systematic literature review: Master data management case study. In Frontiers in Artificial Intelligence and Applications (Vol. 297, pp. 705–715). Kitakyushu, Japan.
Hornik, K. (2017). NLP: Natural language processing Infrastructure (Version 0.1.11). Retrieved from https://CRAN.R-project.org/package=NLP
Ivanov, S., & Zhechev, V. (2012). Hotel revenue management–A critical literature review. Turizam: Znanstveno-Strucnicasopis, 60(2), 175–197.
Kassambara, A. (2017). Practical guide to cluster analysis in R: Unsupervised machine learning. STHDA.
Kassambara, A., & Mundt, F. (2017). factoextra: Extract and visualize the results of multivariate data analyses (Version 1.0.5). Retrieved from https://CRAN.R-project.org/package=factoextra
Kimes, S. E., & Wirtz, J. (2003). Has revenue management become acceptable? Findings from an International study on the perceived fairness of rate fences. Journal of Service Research, 6(2), 125–135.
Kitchenham, B. A., & Charters, S. (2017). Guidelines for performing Systematic Literature Reviews in Software Engineering (version 2.3) (EBSE Technical Report No. EBSE-2007-01). Durham, UK: Keele University.
Krasteva, R. (2017). Local impact of refugee and migrants crisis on greek tourism industry. Economic Studies Journal, (4), 182–195.
Lan, Y., Ball, M. O., & Karaesmen, I. Z. (2011). Regret in overbooking and fare-class allocation for single leg. Manufacturing & Service Operations Management, 13(2), 194–208. https://doi.org/10.1287/msom.1100.0316
Lee, M. (2018). Modeling and forecasting hotel room demand based on advance booking information. Tourism Management, 66, 62–71. https://doi.org/10.1016/j.tourman.2017.11.004
Lemke, C., Riedel, S., & Gabrys, B. (2009). Dynamic combination of forecasts generated by diversification procedures applied to forecasting of airline cancellations. In IEEE Symposium on Computational Intelligence for Financial Engineering, 2009. CIFEr ’09 (pp. 85–91).
Lemke, C., Riedel, S., & Gabrys, B. (2013). Evolving forecast combination structures for airline revenue management. Journal of Revenue and Pricing Management, 12(3), 221–234. https://doi.org/10.1057/rpm.2012.30
Lewis-Beck, M. S. (2005). Election forecasting: Principles and practice. The British Journal of Politics & International Relations, 7(2), 145–164.
Liu, P. H. (2004). Hotel demand/cancellation analysis and estimation of unconstrained demand using statistical methods. In I. Yeoman & U. McMahon-Beattie (Eds.), Revenue management and pricing: Case studies and applications (pp. 91–101). Cengage Learning EMEA.
Matsuo, Y. (2003). Prediction, forecasting, and chance Discovery. In Y. Ohsawa & P. McBurney (Eds.), Chance discovery. Berlin, Heidelberg: Springer.
McGuire, K. A. (2017). The analytic hospitality executive: implementing data analytics in hotels and casinos. Hoboken, New Jersey: John Wiley & Sons, Inc.
Metzger, A., Franklin, R., & Engel, Y. (2012). Predictive monitoring of heterogeneous service-oriented business networks: the transport and logistics case (pp. 313–322). IEEE. https://doi.org/10.1109/SRII.2012.42
Morales, D. R., & Wang, J. (2010). Forecasting cancellation rates for services booking revenue management using data mining. European Journal of Operational Research, 202(2), 554–562.
Moro, S., Cortez, P., & Rita, P. (2015). Business intelligence in banking: A literature analysis from 2002 to 2013 using text mining and latent Dirichlet allocation. Expert Systems with Applications, 42(3), 1314–1324. https://doi.org/10.1016/j.eswa.2014.09.024
Nikita, M. (2016). ldatunning: Tuning of the Latent Dirichlet Allocation model parameters (Version 0.2.0). Retrieved from https://cran.r-project.org/web/packages/ldatuning/ldatuning.pdf
Noone, B. M., & Lee, C. H. (2011). Hotel overbooking: The effect of overcompensation on customers’ reactions to denied service. Journal of Hospitality & Tourism Research, 35(3), 334–357. https://doi.org/10.1177/1096348010382238
Nunez-Mir, G. C., Iannone, B. V., Pijanowski, B. C., Kong, N., & Fei, S. (2016). Automated content analysis: addressing the big literature challenge in ecology and evolution. Methods in Ecology and Evolution, 7(11), 1262–1272. https://doi.org/10.1111/2041-210X.12602
O’Neil, C., & Schutt, R. (2013). Doing data science. Sebastopol, CA, USA: O’Reilly Media.
Pan, B., & Yang, Y. (2017). Monitoring and forecasting tourist activities with big data. In M. Uysal, Z. Schwartz, & E. Sirakaya-Turk (Eds.), Management science in hospitality and tourism: Theory, practice, and applications (pp. 43–62). Apple Academic Press. Retrieved from http://www.crcnetbase.com/doi/pdfplus/10.1201/b19937-1
Park, J. Y., & Nagy, Z. (2018). Comprehensive analysis of the relationship between thermal comfort and building control research - A data-driven literature review. Renewable and Sustainable Energy Reviews, 82, 2664–2679. https://doi.org/10.1016/j.rser.2017.09.102
Pulugurtha, S. S., & Nambisan, S. S. (2003). A decision-support tool for airline yield management using genetic algorithms. Computer-Aided Civil and Infrastructure Engineering, 18(3), 214–223. https://doi.org/10.1111/1467-8667.00311
R Core Team. (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/
Talluri, K. T., & Van Ryzin, G. (2005). The theory and practice of revenue management. New York, NY: Springer.
Tsafnat, G., Glasziou, P., Choong, M. K., Dunn, A., Galgani, F., & Coiera, E. (2014). Systematic review automation technologies. Systematic Reviews, 3, 74. https://doi.org/10.1186/2046-4053-3-74
Tsai, T.-H. (2011). A temporal case-based procedure for cancellation forecasting: a case study. Current Politics and Economics of South, Southeastern, and Central Asia, 20(2), 159–182.
Weatherford, L. R., & Kimes, S. E. (2003). A comparison of forecasting methods for hotel revenue management. International Journal of Forecasting, 19(3), 401–415. https://doi.org/10.1016/S0169-2070(02)00011-0
Webster, J., & Watson, R. T. (2002). Analyzing the past to prepare for the future: Writing a literature review. MIS Quarterly, 26(3), xiii–xxiii.
Welbers, K., Van Atteveldt, W., & Benoit, K. (2017). Text analysis in R. Communication Methods and Measures, 11(4), 245–265. https://doi.org/10.1080/19312458.2017.1387238
Zakhary, A., Atiya, A. F., El-Shishiny, H., & Gayar, N. (2011). Forecasting hotel arrivals and occupancy using Monte Carlo simulation. Journal of Revenue and Pricing Management, 10(4). https://doi.org/10.1057/rpm.2009.42
Downloads
Published
Issue
Section
License
Copyright (c) 2019 Tourism & Management Studies
This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License.
The journal retains published articles’ copyrights, but they are simultaneously licensed under the Creative Commons Attribution License (CC BY-NC-ND), which allows individuals’ to share the relevant papers as long as authorship and publication in this journal are duly acknowledged.