Abstract
The classification of unknown samples is among the most common problems found in chemometrics. For this purpose, a proper representation of the data is very important. Nowadays, chemical spectral data are analyzed as vectors of discretized data where the variables have not connection, and other aspects of their functional nature e.g. shape differences (structural), are also ignored. In this paper, we study some advanced representations for chemical spectral datasets, and for that we make a comparison of the classification results of 4 datasets by using their traditional representation and two other: Functional Data Analysis and Dissimilarity Representation. These approaches allow taking into account the information that is missing in the traditional representation, thus better classification results can be achieved. Some suggestions are made about the more suitable dissimilarity measures to use for chemical spectral data.
Chapter PDF
Similar content being viewed by others
Keywords
References
Ramsay, J.O., Silverman, B.W.: Functional Data Analysis, New York (1997)
Pekalska, E., Duin, R.P.W.: The Dissimilarity Representation For Pattern Recognition. Foundations and Applications 64 (2005)
Cardot, H., Ferraty, F., Sarda, P.: Functional linear model. Statist. Probab. Lett. 45, 11–22 (1999)
Preda, C., Saporta, G.: PLS regression on stochastic processes. Comput. Statist. Data Anal. 48, 149–158 (2005)
Cérou, F., Guyader, A.: Nearest neighbor classification in infinite dimension. ESAIM: Probability and Statistics 10, 340–350 (2006)
Villa, N., Rossi, F.: Support Vector Machine For Functional Data Classification. In: ESANN 2005 (2005)
Hernández, N., Biscay, R.J., Talavera, I.: Support Vector Regression Methods for Functional Data. In: Rueda, L., Mery, D., Kittler, J. (eds.) CIARP 2007. LNCS, vol. 4756, pp. 564–573. Springer, Heidelberg (2007)
Paclik, P., Duin, R.P.W.: Classifying spectral data using relational representation. In: Spectral Imaging Workshop, Graz, Austria (2003)
Varmuza, K., Karlovits, M., Demuth, W.: Spectral similarity versus structural similarity: infrared spectroscopy. Anal. Chimica Acta 490, 313–324 (2003)
Komsta, L., Skibinski, R., Grech-Baran, M., Galaszkiewicz, A.: Multivariate comparison of drugs UV spectra by hierarchical cluster analysis-comparison of different dissimilarity functions. In: Annales Universitaits Marie Curie-Sklodowska, Lublin, Polonia, vol. 20, pp. 2–13 (2007)
Wold, S.: Chemometrics: Theory and Application. In: Kowalski, B.R. (ed.) ACS Symposium, vol. 52, pp. 243–282 (1977)
Yuhas, R.H., Goetz, A.F.H., Boardman, J.W.: Discrimination among semiarid landscape end members using the spectral angle mapper (SAM) algorithm. In: Third Annual JPL Airborne Geoscience Workshop, Pasadena, CA, pp. 147–149 (1992)
Pekalska, E., Duin, R.P.W.: Prototype selection for finding efficient representations of dissimilarity data. In: Kasturi, R., Laurendeau, D., Suen, C. (eds.) International Conference on Pattern Recognition, Quebec, Canada, vol. 3, pp. 37–40 (2002)
Tablets dataset, http://www.models.kvl.dk/research/data
Tecator dataset, http://lib.stat.cmu.edu/datasets/tecator
Oil dataset, http://cac2008.teledetection.fr/shootout
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Porro, D., Duin, R.W., Talavera, I., Hdez, N. (2009). The Representation of Chemical Spectral Data for Classification. In: Bayro-Corrochano, E., Eklundh, JO. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2009. Lecture Notes in Computer Science, vol 5856. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10268-4_60
Download citation
DOI: https://doi.org/10.1007/978-3-642-10268-4_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10267-7
Online ISBN: 978-3-642-10268-4
eBook Packages: Computer ScienceComputer Science (R0)