Abstract
Machine learning and data mining algorithms usually assume that the training and future data have the same distribution and come from the same feature space. However, in majority of real-world problems, this is not true. In case of Debt portfolio appraisal we have sufficient training data only in another domain of interest, namely in other portfolios. Therefore, only knowledge transfer from these portfolios in inference for new one is possible. In the paper we propose transfer learning and learning based on similarity methods, basing on similarity between training and testing datasets. The proposed approach is examined in real domain debt portfolio valuation.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
Cano, J.R., Herrera, F., Lozano, M.: Using evolutionary algorithms as instance selection for data reduction in KDD: an experimental study. IEEE Transactions on Evolutionary Computation 7(6), 561–575 (2003)
Cha, S.H.: Comprehensive survey on distance/similarity measures between probability density functions. International Journal of Mathematical Models and Methods in Applied Sciences 1(4), 300–307 (2007)
Coifman, R.R., Wickerhauser, M.V.: Entropy-based algorithms for best basis selection. IEEE Transactions on Information Theory 38, 713–718 (1992)
Deza, E., Deza, M.M.: Dictionary of Distances. Elsevier (2006)
Kajdanowicz, T., Kazienko, P.: Prediction of Sequential Values for Debt Recovery. In: Bayro-Corrochano, E., Eklundh, J.-O. (eds.) CIARP 2009. LNCS, vol. 5856, pp. 337–344. Springer, Heidelberg (2009)
Kajdanowicz, T., Plamowski, S., Kaznieko, P.: Training Set Selection Using Entropy Based Distance. In: The IEEE Conference on Applied Electrical Engineering and Computing Technologies, AEECT 2011, pp. 340–344. IEEE Computer Society (2011)
Kurlej, B., Wozniak, M.: Active learning approach to concept drift problem. Logic Journal of the IGPL (2011), doi:doi:10.1093/jigpal/jzr011
Lu, Q., Getoor, L.: Link-based classification using labeled and unlabeled data. In: ICML 2003 Workshop on The Continuum from Labeled to Unlabeled Data in Machine Learning and Data Mining (2003)
Meyer, C.D.: Matrix analysis and applied linear algebra. Society for Industrial and Applied Mathematics (2000)
Pan, S.J., Yang, Q.: A Survey on Transfer Learning. IEEE Transactions on Knowledge and Data Engineering 22(10), 1345–1359 (2010)
Rencher, A.: Methods of multivariate analysis. John Wiley & Sons (2002)
Son, S.-H., Kim, J.-Y.: Data Reduction for Instance-Based Learning using Entropy-Based Partitioning. In: Gavrilova, M.L., Gervasi, O., Kumar, V., Tan, C.J.K., Taniar, D., Laganá, A., Mun, Y., Choo, H. (eds.) ICCSA 2006. LNCS, vol. 3982, pp. 590–599. Springer, Heidelberg (2006)
Theodoris, S., Koutroumbas, K.: Pattern Recognition. Elsevier (2009)
Toussaint, G.T.: Bibliography on estimation of misclassification. IEEE Transactions on Information Theory 20(4), 472–479 (1974)
Ullah, A.: Entropy, divergence and distance measures with econometric applications, Department of Economics. University of California - Riverside (1993)
Zhou, K., Doyle, K., Glover, K.: Robust and Optimal Control. Prentice-Hall (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kajdanowicz, T., Plamowski, S., Kazienko, P., Indyk, W. (2012). Transfer Learning Approach to Debt Portfolio Appraisal. In: Corchado, E., Snášel, V., Abraham, A., Woźniak, M., Graña, M., Cho, SB. (eds) Hybrid Artificial Intelligent Systems. HAIS 2012. Lecture Notes in Computer Science(), vol 7209. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28931-6_5
Download citation
DOI: https://doi.org/10.1007/978-3-642-28931-6_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28930-9
Online ISBN: 978-3-642-28931-6
eBook Packages: Computer ScienceComputer Science (R0)