Predictive models for hotel booking cancellation: A semiautomated analysis of the literature

Nuno Antonio

Abstract


In reservation-based industries, accurate booking cancellation forecast is of foremost importance to estimate demand. By combining data science tools and capabilities with human judgement and interpretation it is possible to demonstrate how the semiautomatic analysis of the literature can contribute to synthetize research findings and identify research topics on the subject of booking cancellation forecasting. The data used was obtained through keyword search in Scopus and Web of Science databases. The methodology presented not only diminishes human bias, but also enhances the fact that data visualization and text mining techniques facilitate abstraction, expedite analysis and contribute to the improvement of reviews. Results show that albeit the importance of bookings’ cancellation forecast, further research on the subject is still needed. By detailing the full experimental procedure of the analysis, this work aims to encourage other authors to conduct automated literature analysis as a means to understand current research in their working fields.

Keywords


Data Science; Forecast; Literature review; Prediction; Revenue Management

Full Text:

PDF

References


Ali, N. B., & Usman, M. (2018). Reliability of search in systematic reviews: Towards a quality assessment framework for the automated-search strategy. Information and Software Technology, 99, 133–147. https://doi.org/10.1016/j.infsof.2018.02.002

Al-Safadi, E. B., & Al-Naffouri, T. Y. (2012). Peak reduction and clipping mitigation in OFDM by augmented compressive sensing. IEEE Transactions on Signal Processing, 60(7), 3834–3839. https://doi.org/10.1109/TSP.2012.2193396

Antonio, N., Almeida, A., & Nunes, L. (2017a). Predicting hotel booking cancellation to decrease uncertainty and increase revenue. Tourism & Management Studies, 13(2), 25–39. https://doi.org/10.18089/tms.2017.13203

Antonio, N., Almeida, A., & Nunes, L. (2017b). Predicting hotel bookings cancellation with a machine learning classification model. In Proceedings from the 16th IEEE International Conference on Machine Learning and Applications (pp. 1049–1054). Cancun, Mexico: IEEE. https://doi.org/10.1109/ICMLA.2017.00-11

Antonio, N., Almeida, A. de, & Nunes, L. (2017c). Using data science to predict hotel booking cancellations. In P. Vasant & K. M (Eds.), Handbook of Research on Holistic Optimization Techniques in the Hospitality, Tourism, and Travel Industry (pp. 141–167). Hershey, PA, USA: Business Science Reference.

Arun, R., Suresh, V., Madhavan, C. E. V., & Murthy, M. N. N. (2010). On finding the natural number of topics with Latent Dirichlet Allocation: Some observations. In Advances in Knowledge Discovery and Data Mining (pp. 391–402). Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-13657-3_43

Azadeh, S. S., Labib, R., & Savard, G. (2013). Railway demand forecasting in revenue management using neural networks. International Journal of Revenue Management, 7(1), 18. https://doi.org/10.1504/IJRM.2013.053358

Blei, D. M., Ng, A. Y., & Jordan, M. I. (2003). Latent dirichlet allocation. Journal of Machine Learning Research, 3(Jan), 993–1022.

Bragge, J., Relander, S., Sunikka, A., & Mannonen, P. (2007). Enriching literature reviews with computer-assisted research mining. Case: profiling group support systems research (pp. 243a-243a). IEEE. https://doi.org/10.1109/HICSS.2007.209

Calheiros, A. C., Moro, S., & Rita, P. (2017). Sentiment classification of consumer-generated online reviews using topic modeling. Journal of Hospitality Marketing & Management, 0(0), 1–19. https://doi.org/10.1080/19368623.2017.1310075

Chen, C.-C. (2016). Cancellation policies in the hotel, airline and restaurant industries. Journal of Revenue and Pricing Management, 15(3–4), 270–275. https://doi.org/10.1057/rpm.2016.9

Chiang, W.-C., Chen, J. C., & Xu, X. (2007). An overview of research on revenue management: current issues and future research. International Journal of Revenue Management, 1(1), 97–128.

Cirillo, C., Bastin, F., & Hetrakul, P. (2018). Dynamic discrete choice model for railway ticket cancellation and exchange decisions. Transportation Research Part E: Logistics and Transportation Review, 110, 137–146. https://doi.org/10.1016/j.tre.2017.12.004

Delen, D., & Crossland, M. D. (2008). Seeding the survey and analysis of research literature with text mining. Expert Systems with Applications, 34(3), 1707–1720. https://doi.org/10.1016/j.eswa.2007.01.035

Denizci Guillet, B., & Mohammed, I. (2015). Revenue management research in hospitality and tourism: A critical review of current literature and suggestions for future research. International Journal of Contemporary Hospitality Management, 27(4), 526–560. https://doi.org/10.1108/IJCHM-06-2014-0295

Fabbri, S., Hernandes, E., Di Thommazo, A., Belgamo, A., Zamboni, A., & Silva, C. (2013). Using information visualization and text mining to facilitate the conduction of systematic literature reviews. In J. Cordeiro, L. A. Maciaszek, & J. Filipe (Eds.), Enterprise Information Systems (Vol. 141, pp. 243–256). Berlin, Heidelberg: Springer Berlin Heidelberg. https://doi.org/10.1007/978-3-642-40654-6_15

Feinerer, I., & Hornik, K. (2017). tm: Text mining package (Version 0.7-3). Retrieved from https://CRAN.R-project.org/package=tm

Fellows, I. (2014). wordcloud: Word clouds (Version 2.5). Retrieved from https://CRAN.R-project.org/package=wordcloud

Feng, L., Chiam, Y. K., & Lo, S. K. (2017). Text-mining techniques and tools for systematic literature reviews: A systematic literature review. In 2017 24th Asia-Pacific Software Engineering Conference (APSEC) (pp. 41–50). https://doi.org/10.1109/APSEC.2017.10

Gayar, N. F. E., Saleh, M., Atiya, A., El-Shishiny, H., Zakhary, A. A. Y. F., & Habib, H. A. A. M. (2011). An integrated framework for advanced hotel revenue management. International Journal of Contemporary Hospitality Management, 23(1), 84–98. https://doi.org/10.1108/09596111111101689

Grun, B., & Hornik, K. (2011). topicmodels: An R Package for fitting topic Models. Journal of Statistical Software, 40(11), 1–30. https://doi.org/10.18637/jss.v040.i13

Guerreiro, J., Rita, P., & Trigueiros, D. (2016). A text mining-based review of cause-related marketing literature. Journal of Business Ethics, 139(1), 111–128. https://doi.org/10.1007/s10551-015-2622-4

Guo, X., Dong, Y., & Ling, L. (2016). Customer perspective on overbooking: The failure of customers to enjoy their reserved services, accidental or intended? Journal of Air Transport Management, 53, 65–72. https://doi.org/10.1016/j.jairtraman.2016.01.001

Haneem, F., Kama, N., Ali, R., & Selamat, A. (2017). Applying data analytics approach in systematic literature review: Master data management case study. In Frontiers in Artificial Intelligence and Applications (Vol. 297, pp. 705–715). Kitakyushu, Japan.

Hornik, K. (2017). NLP: Natural language processing Infrastructure (Version 0.1.11). Retrieved from https://CRAN.R-project.org/package=NLP

Ivanov, S., & Zhechev, V. (2012). Hotel revenue management–A critical literature review. Turizam: Znanstveno-Strucnicasopis, 60(2), 175–197.

Kassambara, A. (2017). Practical guide to cluster analysis in R: Unsupervised machine learning. STHDA.

Kassambara, A., & Mundt, F. (2017). factoextra: Extract and visualize the results of multivariate data analyses (Version 1.0.5). Retrieved from https://CRAN.R-project.org/package=factoextra

Kimes, S. E., & Wirtz, J. (2003). Has revenue management become acceptable? Findings from an International study on the perceived fairness of rate fences. Journal of Service Research, 6(2), 125–135.

Kitchenham, B. A., & Charters, S. (2017). Guidelines for performing Systematic Literature Reviews in Software Engineering (version 2.3) (EBSE Technical Report No. EBSE-2007-01). Durham, UK: Keele University.

Krasteva, R. (2017). Local impact of refugee and migrants crisis on greek tourism industry. Economic Studies Journal, (4), 182–195.

Lan, Y., Ball, M. O., & Karaesmen, I. Z. (2011). Regret in overbooking and fare-class allocation for single leg. Manufacturing & Service Operations Management, 13(2), 194–208. https://doi.org/10.1287/msom.1100.0316

Lee, M. (2018). Modeling and forecasting hotel room demand based on advance booking information. Tourism Management, 66, 62–71. https://doi.org/10.1016/j.tourman.2017.11.004

Lemke, C., Riedel, S., & Gabrys, B. (2009). Dynamic combination of forecasts generated by diversification procedures applied to forecasting of airline cancellations. In IEEE Symposium on Computational Intelligence for Financial Engineering, 2009. CIFEr ’09 (pp. 85–91).

Lemke, C., Riedel, S., & Gabrys, B. (2013). Evolving forecast combination structures for airline revenue management. Journal of Revenue and Pricing Management, 12(3), 221–234. https://doi.org/10.1057/rpm.2012.30

Lewis-Beck, M. S. (2005). Election forecasting: Principles and practice. The British Journal of Politics & International Relations, 7(2), 145–164.

Liu, P. H. (2004). Hotel demand/cancellation analysis and estimation of unconstrained demand using statistical methods. In I. Yeoman & U. McMahon-Beattie (Eds.), Revenue management and pricing: Case studies and applications (pp. 91–101). Cengage Learning EMEA.

Matsuo, Y. (2003). Prediction, forecasting, and chance Discovery. In Y. Ohsawa & P. McBurney (Eds.), Chance discovery. Berlin, Heidelberg: Springer.

McGuire, K. A. (2017). The analytic hospitality executive: implementing data analytics in hotels and casinos. Hoboken, New Jersey: John Wiley & Sons, Inc.

Metzger, A., Franklin, R., & Engel, Y. (2012). Predictive monitoring of heterogeneous service-oriented business networks: the transport and logistics case (pp. 313–322). IEEE. https://doi.org/10.1109/SRII.2012.42

Morales, D. R., & Wang, J. (2010). Forecasting cancellation rates for services booking revenue management using data mining. European Journal of Operational Research, 202(2), 554–562.

Moro, S., Cortez, P., & Rita, P. (2015). Business intelligence in banking: A literature analysis from 2002 to 2013 using text mining and latent Dirichlet allocation. Expert Systems with Applications, 42(3), 1314–1324. https://doi.org/10.1016/j.eswa.2014.09.024

Nikita, M. (2016). ldatunning: Tuning of the Latent Dirichlet Allocation model parameters (Version 0.2.0). Retrieved from https://cran.r-project.org/web/packages/ldatuning/ldatuning.pdf

Noone, B. M., & Lee, C. H. (2011). Hotel overbooking: The effect of overcompensation on customers’ reactions to denied service. Journal of Hospitality & Tourism Research, 35(3), 334–357. https://doi.org/10.1177/1096348010382238

Nunez-Mir, G. C., Iannone, B. V., Pijanowski, B. C., Kong, N., & Fei, S. (2016). Automated content analysis: addressing the big literature challenge in ecology and evolution. Methods in Ecology and Evolution, 7(11), 1262–1272. https://doi.org/10.1111/2041-210X.12602

O’Neil, C., & Schutt, R. (2013). Doing data science. Sebastopol, CA, USA: O’Reilly Media.

Pan, B., & Yang, Y. (2017). Monitoring and forecasting tourist activities with big data. In M. Uysal, Z. Schwartz, & E. Sirakaya-Turk (Eds.), Management science in hospitality and tourism: Theory, practice, and applications (pp. 43–62). Apple Academic Press. Retrieved from http://www.crcnetbase.com/doi/pdfplus/10.1201/b19937-1

Park, J. Y., & Nagy, Z. (2018). Comprehensive analysis of the relationship between thermal comfort and building control research - A data-driven literature review. Renewable and Sustainable Energy Reviews, 82, 2664–2679. https://doi.org/10.1016/j.rser.2017.09.102

Pulugurtha, S. S., & Nambisan, S. S. (2003). A decision-support tool for airline yield management using genetic algorithms. Computer-Aided Civil and Infrastructure Engineering, 18(3), 214–223. https://doi.org/10.1111/1467-8667.00311

R Core Team. (2016). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. Retrieved from https://www.R-project.org/

Talluri, K. T., & Van Ryzin, G. (2005). The theory and practice of revenue management. New York, NY: Springer.

Tsafnat, G., Glasziou, P., Choong, M. K., Dunn, A., Galgani, F., & Coiera, E. (2014). Systematic review automation technologies. Systematic Reviews, 3, 74. https://doi.org/10.1186/2046-4053-3-74

Tsai, T.-H. (2011). A temporal case-based procedure for cancellation forecasting: a case study. Current Politics and Economics of South, Southeastern, and Central Asia, 20(2), 159–182.

Weatherford, L. R., & Kimes, S. E. (2003). A comparison of forecasting methods for hotel revenue management. International Journal of Forecasting, 19(3), 401–415. https://doi.org/10.1016/S0169-2070(02)00011-0

Webster, J., & Watson, R. T. (2002). Analyzing the past to prepare for the future: Writing a literature review. MIS Quarterly, 26(3), xiii–xxiii.

Welbers, K., Van Atteveldt, W., & Benoit, K. (2017). Text analysis in R. Communication Methods and Measures, 11(4), 245–265. https://doi.org/10.1080/19312458.2017.1387238

Zakhary, A., Atiya, A. F., El-Shishiny, H., & Gayar, N. (2011). Forecasting hotel arrivals and occupancy using Monte Carlo simulation. Journal of Revenue and Pricing Management, 10(4). https://doi.org/10.1057/rpm.2009.42






Copyright (c) 2019 Tourism & Management Studies

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.