Abstract
There are thousands of academic journals in various fields of study. An article author must spend significant time searching and selecting a journal suitable for the article’s content before submitting it to a journal for consideration. Since many articles are submitted to a journal at a time, it would take time for an editor to review, submit it to reviewers, and inform the results back to the author. If the article is rejected due to mismatched journal content, the author will spend more time to re-submit the article to another journal. Therefore, this research introduced a recommendation system to help the author choose an appropriate journal more effectively, based on TCI Thai Journals Online Database (ThaiJO). Data from Thai and English articles were used for analysis in this research. Our work involved studying the applied data, cleaning the data, and modeling, which included calculating the importance of text by Term Frequency - Inverse Document Frequency (TF-IDF), calculating similarity scores between articles and journals using Cosine Similarity and then ranking the scores to recommend the most suitable journal. This research experiment with modeling between a model from Thai data, a model from English data, and a model using both languages. The experiment shows that when we combine Thai and English keywords and abstract data, the accuracy in the form of hit rate is improved to 0.80650 from applying only English (0.78793) or Thai data (0.62888).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Scopus. https://www.elsevier.com/solutions/scopus/how-scopus-works/content. Accessed 10 Oct 2021
ThaiJo. https://www.tci-thaijo.org/. Accessed 21 Oct 2021
Haddi, E., Liu, X., Shi, Y.: The role of text pre-processing in sentiment analysis. Procedia Comput. Sci. 17, 26–32 (2013)
Sornlertlamvanich, V.: Word segmentation for Thai in machine translation system. Mach. Transl. NECTEC, 556–561 (1993)
PyThaiNLP. https://pythainlp.github.io/docs/2.3/. Accessed 12 Aug 2021
Robertson, S.: Understanding inverse document frequency: on theoretical arguments for IDF. J. Doc. 60(5), 503–520 (2004). https://doi.org/10.1108/00220410410560582
Mizzaro, S., Pavan, M., Scagnetto, I.: Content-based similarity of Twitter users. In: Hanbury, A., Kazai, G., Rauber, A., Fuhr, N. (eds.) ECIR 2015. LNCS, vol. 9022, pp. 507–512. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-16354-3_56
Li, B., Han, L.: Distance weighted cosine similarity measure for text classification. In: Yin, H., et al. (eds.) IDEAL 2013. LNCS, vol. 8206, pp. 611–618. Springer, Heidelberg (2013). https://doi.org/10.1007/978-3-642-41278-3_74
Moreno-Torres, J.G., Saez, J.A., Herrera, F.: Study on the impact of partition-induced dataset shift on k-fold cross-validation. IEEE Trans. Neural Netw. Learn. Syst. 23(8), 1304–1312 (2012). https://doi.org/10.1109/TNNLS.2012.2199516
Magara, M.B., Ojo, S.O., Zuva, T.: A comparative analysis of text similarity measures and algorithms in research paper recommender systems. In: 2018 Conference on Information Communications Technology and Society (ICTAS), pp. 1–5 (2018)
Lee, J., Lee, K., Kim, J.: Personalized academic research paper recommendation system. arXiv:1304.5457 (2013)
Sugiyama, K., Kan, M.Y.: Scholarly paper recommendation via user’s recent research interests. In: Proceedings of the 10th Annual Joint Conference on Digital Libraries, Gold Coast, Queensland, Australia, pp. 29–38. Association for Computing Machinery (2010)
Stopwords ISO. https://github.com/stopwords-iso/stopwords-iso. Accessed 4 Aug 2021
Kuhn, M., Johnson, K.: Over-fitting and model tuning. In: Kuhn, M., Johnson, K. (eds.) Applied Predictive Modeling, pp. 61–92. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-6849-3_4
James, G., Witten, D., Hastie, T., Tibshirani, R.: Resampling methods. In: James, G., Witten, D., Hastie, T., Tibshirani, R. (eds.) An Introduction to Statistical Learning: with Applications in R, pp. 175–201. Springer, New York (2013). https://doi.org/10.1007/978-1-4614-7138-7_5
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Numnonda, N., Chanyachatchawan, S., Tuaycharoen, N. (2022). Journal Recommendation System for Author Using Thai and English Information from Manuscript. In: Meesad, P., Sodsee, S., Jitsakul, W., Tangwannawit, S. (eds) Proceedings of the 18th International Conference on Computing and Information Technology (IC2IT 2022). IC2IT 2022. Lecture Notes in Networks and Systems, vol 453. Springer, Cham. https://doi.org/10.1007/978-3-030-99948-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-030-99948-3_14
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-99947-6
Online ISBN: 978-3-030-99948-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)