Abstract
In the last few years, a large hype around the topic of Machine Learning (ML) has emerged; this can be justified thanks to various ML approaches having recently undergone rapid developments and being applied to a wide array of tasks, both in practice as well as in science. Nevertheless, ML approaches still seem to be the exception rather than the rule in tourism literature, most likely due to a sense of hesitation towards understanding and engaging with the technical and theoretical aspects of the topic. As such, this chapter aims to provide an overview and outline the field of Machine Learning by defining basic terms, introducing the most important ML paradigms, and naming relevant algorithms. It also mentions the typical process/procedure, limitations, and challenges of ML as well as developments towards Auto-ML. Thus, a basic understanding of the topic of ML and its importance for tourism is provided here, also serving as a basis for other chapters in this book in which individual algorithms will be presented in more detail.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Keywords
References
Abubakar, A. M., Namin, B. H., Harazneh, I., Arasli, H., & Tunç, T. (2017). Does gender moderates the relationship between favoritism/nepotism, supervisor incivility, cynicism and workplace withdrawal: A neural network and SEM approach. Tourism Management Perspectives, 23, 129–139. https://doi.org/10.1016/j.tmp.2017.06.001
Adam, S. P., Alexandropoulos, S.-A. N., Pardalos, P. M., & Vrahatis, M. N. (2019). No free lunch theorem: A review. In I. C. Demetriou & P. M. Pardalos (Eds.), Springer optimization and its applications. Approximation and optimization (Vol. 145, pp. 57–82). Springer International. https://doi.org/10.1007/978-3-030-12767-1_5
Adeli, H., & Hung, S.-L. (1994). Machine learning: Neural networks, genetic algorithms, and fuzzy systems. 97804710. Retrieved from https://scholar.nctu.edu.tw/zh/publications/machine-learning-neural-networks-genetic-algorithms-and-fuzzy-sys
Aggarwal, C. C. (2015). Data classification: Algorithms and applications. Chapman & Hall/CRC data mining and knowledge discovery series. CRC Press. Retrieved from http://proquest.tech.safaribooksonline.de/9781466586741
Aggarwal, C. C. (2018a). Neural networks and deep learning. Springer International. https://doi.org/10.1007/978-3-319-94463-0
Aggarwal, C. C. (2018b). Neural networks and deep learning: A textbook. Springer International. Retrieved from https://books.google.at/books?id=achqDwAAQBAJ. https://doi.org/10.1007/978-3-319-94463-0
Akerkar, R. (Ed.). (2019a). SpringerBriefs in business. Artificial Intelligence for Business. Springer International. https://doi.org/10.1007/978-3-319-97436-1
Akerkar, R. (2019b). Machine learning. In R. Akerkar (Ed.), SpringerBriefs in business. Artificial intelligence for business (pp. 19–32). Springer International. https://doi.org/10.1007/978-3-319-97436-1_2
Althbiti, A., & Ma, X. (2020). Machine learning. In L. A. Schintler & C. L. McNeely (Eds.), Encyclopedia of big data (pp. 1–5). Springer International. https://doi.org/10.1007/978-3-319-32001-4_539-1
Arabie, P., Hubert, L., & de Soete, G. (1996). Clustering and classification. World Scientific.
Arefieva, V., Egger, R., & Yu, J. (2021). A machine learning approach to cluster destination image on Instagram. Tourism Management, 85, 104318. https://doi.org/10.1016/j.tourman.2021.104318
Awad, M., & Khanna, R. (Eds.). (2015). Books for professionals by professionals. Efficient learning machines: Theories, concepts, and applications for engineers and system designers. Apress Springer Science+Business Media. https://doi.org/10.1007/978-1-4302-5990-9
Ba, L. J., & Caruana, R. (2013, December 21). Do deep nets really need to be deep? Retrieved from http://arxiv.org/pdf/1312.6184v7
Batista e Silva, F., Barranco, R., Proietti, P., Pigaiani, C., & Lavalle, C. (2020). A new European regional tourism typology based on hotel location patterns and geographical criteria. Annals of Tourism Research, 103077. https://doi.org/10.1016/j.annals.2020.103077
Bernstein, A., & Kuleshov, A. (2014). Dimensionality reduction in statistical learning. In ICMLA ‘14: Proceedings of the 2014 13th international conference on machine learning and applications (pp. 330–335). IEEE Computer Society. https://doi.org/10.1109/ICMLA.2014.59
Bezanson, J., Edelman, A., Karpinski, S., & Shah, V. B. (2017). Julia: A fresh approach to numerical computing. SIAM Review, 59(1), 65–98. https://doi.org/10.1137/141000671
Bloom, J. Z. (2005). Market segmentation. Annals of Tourism Research, 32(1), 93–111. https://doi.org/10.1016/j.annals.2004.05.001
Buhalis, D., & Amaranggana, A. (2015). Smart tourism destinations enhancing tourism experience through personalisation of services. In Information and communication technologies in tourism 2015 (pp. 377–389). Springer. https://doi.org/10.1007/978-3-319-14343-9_28
Choo, K., Greplova, E., Fischer, M. H., & Neupert, T. (2020). Machine learning kompakt. Springer Fachmedien Wiesbaden. https://doi.org/10.1007/978-3-658-32268-7
Chua, B.-L., Meng, B., Ryu, H. B., & Han, H. (2021). Participate in volunteer tourism again? Effect of volunteering value on temporal re-participation intention. Journal of Hospitality and Tourism Management, 46, 193–204. https://doi.org/10.1016/j.jhtm.2020.12.003
Cleve, J., & Lämmel, U. (2020). Data mining (3. Aufl.). De Gruyter Studium. De Gruyter Oldenbourg. Retrieved from https://books.google.at/books?id=T0YCEAAAQBAJ
Das, P., Ivkin, N., Bansal, T., Rouesnel, L., Gautier, P., Karnin, Z., et al. (2020). Amazon SageMaker autopilot. In Proceedings of the fourth international workshop on data management for end-to-end machine learning (pp. 1–7). https://doi.org/10.1145/3399579.3399870
de Romblay, A. A. (2017). MLBox’s official documentation. Retrieved from https://mlbox.readthedocs.io/en/latest/
Del Chiappa, G., Atzeni, M., & Ghasemi, V. (2018). Community-based collaborative tourism planning in islands: A cluster analysis in the context of Costa Smeralda. Journal of Destination Marketing and Management, 8, 41–48. https://doi.org/10.1016/j.jdmm.2016.10.005
Deng, N., & Li, X. (2018). Feeling a destination through the “right” photos: A machine learning model for DMOs’ photo selection. Tourism Management, 65, 267–278. https://doi.org/10.1016/j.tourman.2017.09.010
Derek, M., Woźniak, E., & Kulczyk, S. (2019). Clustering nature-based tourists by activity. Social, economic and spatial dimensions. Tourism Management, 75, 509–521. https://doi.org/10.1016/j.tourman.2019.06.014
Dobilas, S. (2021, July 2). Random Forest models: Why are they better than single decision trees? Towards Data Science. Retrieved from https://towardsdatascience.com/random-forest-models-why-are-they-better-than-single-decision-trees-70494c29ccd1
Dy, J., & Brodley, C. (2004). Feature selection for unsupervised learning. Journal of Machine Learning Research, 5, 845–889. https://doi.org/10.1007/SpringerReference_302701
Ebrahimpor-Komleh, H., & Afsharizadeh, M. (2015). Improving prediction based digital image reversible watermarking by neural networks. In 2015 international congress on technology, communication and knowledge (ICTCK) (pp. 201–208). IEEE. https://doi.org/10.1109/ICTCK.2015.7582671.
Edwards, G. (2018, November 18). Machine learning | An introduction—Towards data science. Towards Data Science. Retrieved from https://towardsdatascience.com/machine-learning-an-introduction-23b84d51e6d0#6246
Egger, R. (2022). Introduction: Data science in tourism. In R. Egger (Ed.), Tourism on the verge. Applied data science in tourism (pp. ix–xxiv). Springer.
Egger, R., & Gokce, E. (2022). Natural language processing (NLP): An introduction. In R. Egger (Ed.), Tourism on the verge. Applied data science in tourism (pp. 307–334). Springer.
Egger, R., Kroner, M., & Stöckl, A. (2022). Web scraping. In R. Egger (Ed.), Tourism on the verge. Applied data science in tourism (pp. 67–84). Springer.
Ekman, M. (2021). Learning deep learning: Theory and practice of neural networks, computer vision, Nlp, and transformers using Tensorflow. [S.l.]: ADDISON-WESLEY. Retrieved from https://books.google.at/books?id=W8dFzgEACAAJ
Feurer, M., & Hutter, F. (2019). Hyperparameter optimization. In F. Hutter, L. Kotthoff, & J. Vanschoren (Eds.), Automated machine learning (pp. 3–34). Springer International.
Feurer, M., Klein, A., Eggensperger, K., Springenberg, J. T., Blum, M., & Hutter, F. (2019). Auto-sklearn: Efficient and robust automated machine learning. In F. Hutter, L. Kotthoff, & J. Vanschoren (Eds.), Automated machine learning (pp. 113–134). Springer International.
Fortunato, S., & Hric, D. (2016). Community detection in networks: A user guide. Physics Reports, 659, 1–44. https://doi.org/10.1016/j.physrep.2016.09.002
Ghosh, S., Halappanavar, M., Tumeo, A., Kalyanaraman, A., Lu, H., Chavarria-Miranda, D., et al. (2018). Distributed Louvain algorithm for graph community detection. In 2018 IEEE International Parallel 2018 (pp. 885–895). https://doi.org/10.1109/IPDPS.2018.00098
Gijsbers, P., LeDell, E., Thomas, J., Poirier, S., Bischl, B., & Vanschoren, J. (2019, July 1). An open source AutoML benchmark. Retrieved from http://arxiv.org/pdf/1907.00909v1
Graupe, D. (2013). Principles of artificial neural networks. Advanced series in Circuits & Systems: v.7. World Scientific. Retrieved from https://ebookcentral.proquest.com/lib/subhh/detail.action?docID=1336559
Gretzel, U., Sigala, M., Xiang, Z., & Koo, C. (2015). Smart tourism: Foundations and developments. Electronic Markets, 25(3), 179–188. https://doi.org/10.1007/s12525-015-0196-8
Guerreiro, J., & Rita, P. (2020). How to predict explicit recommendations in online reviews using text mining and sentiment analysis. Journal of Hospitality and Tourism Management, 43, 269–272. https://doi.org/10.1016/j.jhtm.2019.07.001
H20. (2017). AutoML: Automatic machine learning—H2O 3.32.1.3 documentation. Retrieved from http://docs.h2o.ai/h2o/latest-stable/h2o-docs/automl.html
Han, J. (2020). Application of SVM model to environmental resource analysis in tourism development. Journal of Physics: Conference Series, 1629, 12007. https://doi.org/10.1088/1742-6596/1629/1/012007
Hastie, T., Tibshirani, R., & Friedman, J. H. (2009). The elements of statistical learning: Data mining, inference, and prediction (Springer eBook collection mathematics and statistics) (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7
Hutter, F., Kotthoff, L., & Vanschoren, J. (2019). Automated machine learning. Springer International. https://doi.org/10.1007/978-3-030-05318-5
Jahani, A., Goshtasb, H., & Saffariha, M. (2020). Tourism impact assessment modeling of vegetation density for protected areas using data mining techniques. Land Degradation and Development, 31(12), 1502–1519. https://doi.org/10.1002/ldr.3549
Jamal, S., Goyal, S., Grover, A., & Shanker, A. (2018). Machine learning: What, why, and how? In A. Shanker (Ed.), Bioinformatics: Sequences, structures, phylogeny (pp. 359–374). Springer Singapore. https://doi.org/10.1007/978-981-13-1562-6_16
Janakiram, M. (2018, April 15). Why AutoML is set to become the future of artificial intelligence. Forbes. Retrieved from https://www.forbes.com/sites/janakirammsv/2018/04/15/why-automl-is-set-to-become-the-future-of-artificial-intelligence/?sh=bcfd730780ae
Jordan, M. I., & Mitchell, T. M. (2015). Machine learning: Trends, perspectives, and prospects. Science (New York, N.Y.), 349(6245), 255–260. https://doi.org/10.1126/science.aaa8415
Kelm, O., Gerl, K., & Meißner, F. (2020). Machine learning. In I. Borucki, K. Kleinen-von Königslöw, S. Marschall, & T. Zerback (Eds.), Handbuch Politische Kommunikation (pp. 1–9). Springer Fachmedien Wiesbaden. https://doi.org/10.1007/978-3-658-26242-6_55-1
Kim, J., Wei, S., & Ruys, H. (2003). Segmenting the market of West Australian senior tourists using an artificial neural network. Tourism Management, 24(1), 25–34. https://doi.org/10.1016/S0261-5177(02)00050-X
Koo, C., Shin, S., Gretzel, U., Hunter, W. C., & Chung, N. (2016). Conceptualization of smart tourism destination competitiveness. Asia Pacific Journal of Information Systems, 26(4), 561–576. https://doi.org/10.14329/apjis.2016.26.4.561
Langs, G., & Wazir, R. (2019). Machine learning. In S. Papp, W. Weidinger, M. Meir-Huber, B. Ortner, G. Langs, & R. Wazir (Eds.), Handbuch Data Science: Mit Datenanalyse und Machine Learning Wert aus Daten generieren (pp. 178–198). Hanser.
Li, Q., Li, S., Hu, J., Zhang, S., & Hu, J. (2018). Tourism review sentiment classification using a bidirectional recurrent neural network with an attention mechanism and topic-enriched word vectors. Sustainability, 10(9), 3313. https://doi.org/10.3390/su10093313
Li, X., Law, R., Xie, G., & Wang, S. (2021). Review of tourism forecasting research with internet data. Tourism Management, 83, 104245. https://doi.org/10.1016/j.tourman.2020.104245
Lu, J., Meng, Y., Timmermans, H., & Zhang, A. (2021). Modeling hesitancy in airport choice: A comparison of discrete choice and machine learning methods. Transportation Research Part A: Policy and Practice, 147, 230–250. https://doi.org/10.1016/j.tra.2021.03.006
Ma, S., Kirilenko, A. P., & Stepchenkova, S. (2020). Special interest tourism is not so special after all: Big data evidence from the 2017 Great American Solar Eclipse. Tourism Management, 77, 104021. https://doi.org/10.1016/j.tourman.2019.104021
Martinez-Torres, M. R., & Toral, S. L. (2019). A machine learning approach for the identification of the deceptive reviews in the hospitality sector using unique attributes and sentiment orientation. Tourism Management, 75, 393–403. https://doi.org/10.1016/j.tourman.2019.06.003
Martin-Fuentes, E., Fernandez, C., Mateu, C., & Marine-Roig, E. (2018). Modelling a grading scheme for peer-to-peer accommodation: Stars for Airbnb. International Journal of Hospitality Management, 69, 75–83. https://doi.org/10.1016/j.ijhm.2017.10.016
McAuley, J. (2017). CSE 158: Web mining and recommender systems. Retrieved from https://cseweb.ucsd.edu/classes/wi17/cse158-a/
Mich, L. (2020). Artificial intelligence and machine learning. In Z. Xiang, M. Fuchs, U. Gretzel, & W. Höpken (Eds.), Handbook of e-tourism (pp. 1–21). Springer International. https://doi.org/10.1007/978-3-030-05324-6_25-1
Mohri, M., Rostamizadeh, A., & Talwalkar, A. (2018). Foundations of machine learning. In Adaptive computation and machine learning. Cambridge Massachusetts (2nd ed.). The MIT Press.
Moolayil, J. (2019). Learn Keras for deep neural networks. Apress. https://doi.org/10.1007/978-1-4842-4240-7
Neuburger, L., & Egger, R. (2021). Travel risk perception and travel behaviour during the COVID-19 pandemic 2020: A case study of the DACH region. Current Issues in Tourism, 24(7), 1003–1016. https://doi.org/10.1080/13683500.2020.1803807
Ngyuen, C. N., & Zeigermann, O. (2021). Machine learning: Kurz & gut (2. Auflage). dpunkt.verlag, O’Reilly, Preselect.media Gmbh.
Olson, R. S., & Moore, J. H. (2019). TPOT: A tree-based pipeline optimization tool for automating machine learning. In F. Hutter, L. Kotthoff, & J. Vanschoren (Eds.), The springer series on challenges in machine learning. Automated machine learning (pp. 151–160). Springer International. https://doi.org/10.1007/978-3-030-05318-5_8
Ozdemir, S. (2016). Principles of data science: Learn the techniques and math you need to start making sense of your data. Packt. Retrieved from http://proquest.tech.safaribooksonline.de/9781785887918
Palmer, A., José Montaño, J., & Sesé, A. (2006). Designing an artificial neural network for forecasting tourism time series. Tourism Management, 27(5), 781–790. https://doi.org/10.1016/j.tourman.2005.05.006
Papp, S., Weidinger, W., Meir-Huber, M., Ortner, B., Langs, G., & Wazir, R. (Eds.). (2019). Handbuch Data Science: Mit Datenanalyse und Machine Learning Wert aus Daten generieren. Hanser. https://doi.org/10.3139/9783446459755
Park, S., Xu, Y., Jiang, L., Chen, Z., & Huang, S. (2020). Spatial structures of tourism destinations: A trajectory data mining approach leveraging mobile big data. Annals of Tourism Research, 84, 102973. https://doi.org/10.1016/j.annals.2020.102973
Payntar, N. D., Hsiao, W.-L., Covey, R. A., & Grauman, K. (2021). Learning patterns of tourist movement and photography from geotagged photos at archaeological heritage sites in Cuzco, Peru. Tourism Management, 82, 104165. https://doi.org/10.1016/j.tourman.2020.104165
Pérez, J., Barceló, P., & Marinkovic, J. (2021). Attention is Turing-complete. Journal of Machine Learning Research, 22, 1–35. Retrieved from https://www.jmlr.org/papers/volume22/20-302/20-302.pdf
Phillips, P., Zigan, K., Santos Silva, M. M., & Schegg, R. (2015). The interactive effects of online reviews on the determinants of Swiss hotel performance: A neural network analysis. Tourism Management, 50, 130–141. https://doi.org/10.1016/j.tourman.2015.01.028
Provost, F., & Fawcett, T. (2013a). Data science for business: What you need to know about data mining and data-analytic thinking (1st ed.). O’Reilly Media. Retrieved from http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=619895
Provost, F., & Fawcett, T. (2013b). Data science for business: What you need to know about data mining and data-analytic thinking. O’Reilly.
Ramos-Henríquez, J. M., Gutiérrez-Taño, D., & Díaz-Armas, R. J. (2021). Value proposition operationalization in peer-to-peer platforms using machine learning. Tourism Management, 84, 104288. https://doi.org/10.1016/j.tourman.2021.104288
Raschka, S., & Mirjalili, V. (2018). Machine Learning mit Python und Scikit-Learn und TensorFlow: Das umfassende Praxis-Handbuch für Data Science, Deep Learning und Predictive Analytics (K. Lorenzen, Trans.) (2., aktualisierte und erweiterte Auflage). mitp.
Saeed, A., Ozcelebi, T., & Lukkien, J. (2019). Multi-task self-supervised learning for human activity detection. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 3(2), 1–30. https://doi.org/10.1145/3328932
Sanchez, J. S. (2003). Analysis of new techniques to obtain quality training sets. Pattern Recognition Letters, 24(7), 1015–1022. https://doi.org/10.1016/S0167-8655(02)00225-8
Shalev-Shwartz, S., & Ben-David, S. (2014). Understanding machine learning: From theory to algorithms. Cambridge University Press.
Shibata, N., Shinoda, H., Nanba, H., Ishino, A., & Takezawa, T. (2020). Classification and visualization of travel blog entries based on types of tourism. In J. Neidhardt & W. Wörndl (Eds.), Information and communication Technologies in Tourism 2020 (pp. 27–37). Springer International. https://doi.org/10.1007/978-3-030-36737-4_3
Skilton, M., & Hovsepian, F. (2018a). Machine learning. In M. Skilton & F. Hovsepian (Eds.), The 4th industrial revolution (pp. 121–157). Springer International. https://doi.org/10.1007/978-3-319-62479-2_5
Skilton, M., & Hovsepian, F. (Eds.). (2018b). The 4th industrial revolution. Springer International. https://doi.org/10.1007/978-3-319-62479-2
Srihadi, T. F., Hartoyo, Sukandar, D., & Soehadi, A. W. (2016). Segmentation of the tourism market for Jakarta: Classification of foreign visitors’ lifestyle typologies. Tourism Management Perspectives, 19, 32–39. https://doi.org/10.1016/j.tmp.2016.03.005
Thornton, C., Hutter, F., Hoos, H., & Leyton-Brown, K. (2013). Auto-WEKA: Combined selection and Hyperparameter optimization of classification algorithms. In I. S. Dhillon & R. L. Grossman (Eds.), Kdd 2013: The 19th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, August 11–14, 2013, Chicago, Illinois, USA ; [including] the Industry Practice Expo (IPE) [and] co-located workshops (pp. 847–855). ACM.
Tsaur, S.-H., Chiu, Y.-C., & Huang, C.-H. (2002). Determinants of guest loyalty to international tourist hotels—A neural network approach. Tourism Management, 23(4), 397–405. https://doi.org/10.1016/S0261-5177(01)00097-8
Wennker, P. (2020). Machine learning. In P. Wennker (Ed.), Künstliche Intelligenz in der Praxis (pp. 9–37). Springer Fachmedien Wiesbaden. https://doi.org/10.1007/978-3-658-30480-5_2
Xiang, Z., & Fesenmaier, D. R. (2017). Analytics in smart tourism design. Springer International. https://doi.org/10.1007/978-3-319-44263-1
Yu, J., & Egger, R. (2021). Color and engagement in touristic Instagram pictures: A machine learning approach. Annals of Tourism Research, 103204. https://doi.org/10.1016/j.annals.2021.103204
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Further Readings & Other Sources
Further Readings & Other Sources
-
Dulhare, U. N., Ahmad, K., & Ahmad, K. A. B. (Eds.). (2020). Machine learning and big data: Concepts, algorithms, tools and applications. Wiley.
-
Google’s Machine Learning Crash Course. https://developers.google.com/machine-learning/crash-course
-
Ng, A. (2021). Machine Learning—Coursea Course with more than 4 million participants. https://tinyurl.com/andrewng-ml
-
Nwanganga, F., & Chapple, M. (2020). Practical machine learning in R. Wiley.
-
Raschka, S., & Mirjalili, V. (2017). Python machine learning: Machine learning and deep learning with python (2nd ed.). Scikit-Learn, and TensorFlow.
-
Theobald, O. (2021). Machine learning for absolute beginners: A plain English introduction (3rd ed.). Scatterplot Press.
-
Venturi, D. (2019). Provides an extensive list of machine learning courses provided on the Internet. https://www.freecodecamp.org/news/every-single-machine-learning-course-on-the-internet-ranked-by-your-reviews-3c4a7b8026c0/
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Egger, R. (2022). Machine Learning in Tourism: A Brief Overview. In: Egger, R. (eds) Applied Data Science in Tourism. Tourism on the Verge. Springer, Cham. https://doi.org/10.1007/978-3-030-88389-8_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-88389-8_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88388-1
Online ISBN: 978-3-030-88389-8
eBook Packages: Business and ManagementBusiness and Management (R0)