Abstract
This paper addresses a problem shared by online marketing platforms: spammers exploit them to distribute spam more easily. This can lead to the platforms' domains being blacklisted by ISPs, which translates into lower deliverability rates and, consequently, lower profits. Platforms normally counter the problem with rule-based systems, which require considerable maintenance and are not easy to edit. Additionally, since the analysis takes place when a contact database is imported, the usual approach of judging message contents directly is not effective, because the messages do not yet exist. The proposed solution, a machine-learning-based system for classifying contact database imports, aims to improve on rule-based systems by drawing on two strengths of machine learning: reliable classification and ease of maintenance. Preliminary results support the viability of this approach, since several algorithms can be applied to it successfully. Of the algorithms tested, AdaBoost and random forest performed best.
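As a rough illustration of classifying an import rather than message content, the sketch below summarises one batch of imported addresses as a small numeric feature vector. The abstract does not list the features the authors used, so every feature here (`invalid_ratio`, `duplicate_ratio`, `role_ratio`, `top_domain_share`), the `ROLE_PREFIXES` set, and the function name `import_features` are hypothetical, chosen only to show the general shape of the idea.

```python
import re
from collections import Counter

# Hypothetical features: the abstract does not say which ones the paper uses.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")  # deliberately loose syntax check
ROLE_PREFIXES = {"info", "admin", "sales", "support", "webmaster"}  # assumed list


def import_features(addresses):
    """Summarise one contact-database import as a dict of numeric features."""
    total = len(addresses)
    if total == 0:
        return {"invalid_ratio": 0.0, "duplicate_ratio": 0.0,
                "role_ratio": 0.0, "top_domain_share": 0.0}

    cleaned = [a.strip().lower() for a in addresses]
    valid = [a for a in cleaned if EMAIL_RE.match(a)]

    # Entries that repeat an address already seen in the same import.
    duplicates = total - len(set(cleaned))
    # Role-based mailboxes such as info@ or sales@.
    role = sum(1 for a in valid if a.split("@", 1)[0] in ROLE_PREFIXES)
    # Share of the whole import held by the single most common domain.
    domains = Counter(a.rsplit("@", 1)[1] for a in valid)
    top_share = max(domains.values()) / total if domains else 0.0

    return {
        "invalid_ratio": (total - len(valid)) / total,
        "duplicate_ratio": duplicates / total,
        "role_ratio": role / total,
        "top_domain_share": top_share,
    }


batch = ["ana@example.com", "info@example.com", "ana@example.com", "not-an-email"]
features = import_features(batch)
```

A vector like this, computed per import and paired with a spam or legitimate label, is the kind of input that a classifier such as AdaBoost or random forest could be trained on. Rule-based thresholds over the same values would have to be tuned by hand.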
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Coelho, D., Madureira, A., Pereira, I., Cunha, B. (2020). A Machine Learning Approach to Contact Databases’ Importation for Spam Prevention. In: Madureira, A., Abraham, A., Gandhi, N., Varela, M. (eds) Hybrid Intelligent Systems. HIS 2018. Advances in Intelligent Systems and Computing, vol 923. Springer, Cham. https://doi.org/10.1007/978-3-030-14347-3_1
DOI: https://doi.org/10.1007/978-3-030-14347-3_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14346-6
Online ISBN: 978-3-030-14347-3
eBook Packages: Intelligent Technologies and Robotics (R0)