Abstract
This paper introduces a new approach to fit a linear regression model on interval-valued data. Each example of the learning set is described by a feature vector where each feature value is an interval. In the proposed approach, it is fitted two linear regression models, respectively, on the mid-point and range of the interval values assumed by the variables on the learning set. The prediction of the lower and upper bound of the interval value of the dependent variable is accomplished from its mid-point and range which are estimated from the fitted linear regression models applied to the mid-point and range of each interval values of the independent variables. The evaluation of the proposed prediction method is based on the estimation of the average behaviour of root mean squared error and of the determination coefficient in the framework of a Monte Carlo experience in comparison with the method proposed by Billard and Diday [3].
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Bock, H.H., Diday, E.: Analysis of Symbolic Data: Exploratory Methods for Extracting Statistical Information from Complex Data. Springer, Berlin (2000)
Alfonso, F., BIllard, L., Diday, E.: Régression Linéaire Symbolique avec Variables Taxonomiques. In: Hébrail, et al. (eds.) Actes des IVèmes journées d’Extraction et Gestion des Connaissance, EGC Clermont-Ferrand 2004. RNTI-E-2, vol. 1, pp. 205–210 (2004)
Billard, L., Diday, E.: Regression Analysis for Interval-Valued Data. Data Analysis, Calssification and Related Methods. In: Namur (Belgium), Kiers, H.A.L., et al. (eds.) Proceedings of the Seventh Conference of the International Federation of Classification Societies, IFCS 2000, vol. 1, pp. 369–374. Springer, Heidelberg (2000)
Billard, L., Diday, E.: Symbolic Regression Analysis. Calssification, Clustering and Data Analysis. In: Jajuga, K., et al. (eds.) Proceedings of the Eighenth Conference of the International Federation of Classification Societies, IFCS 2002, Crakow (Poland), vol. 1, pp. 281–288. Springer, Heidelberg (2002)
Billard, L., Diday, E.: From the Statistics of Data to the Statistics of Knowledge: Symbolic Data Analysis. Journal of the American Statistical Association 98, 470–487 (2003)
Draper, N.R., Smith, H.: Applied Regression Analysis. JohnWiley, New York (1981)
Marsaglia, G.: Uniform Random Number Generators. Journ. Assoc. for Computing Machin-ery 12, 83–89 (1965)
Montgomery, D.C., Peck, E.A.: Introduction to Linear Regression Analysis. John Wiley, New York (1982)
Scheffé, H.: The Analysis of Variance. John Wiley, New York (1959)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Carvalho, F.d.A.T., Lima Neto, E.d.A., Tenorio, C.P. (2004). A New Method to Fit a Linear Regression Model for Interval-Valued Data. In: Biundo, S., Frühwirth, T., Palm, G. (eds) KI 2004: Advances in Artificial Intelligence. KI 2004. Lecture Notes in Computer Science(), vol 3238. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30221-6_23
Download citation
DOI: https://doi.org/10.1007/978-3-540-30221-6_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23166-0
Online ISBN: 978-3-540-30221-6
eBook Packages: Springer Book Archive