Abstract
The class of Schoenberg transformations, embedding Euclidean distances into higher dimensional Euclidean spaces, is presented, and derived from theorems on positive definite and conditionally negative definite matrices. Original results on the arc lengths, angles and curvature of the transformations are proposed, and visualized on artificial data sets by classical multidimensional scaling. A distance-based discriminant algorithm and a robust multidimensional centroid estimate illustrate the theory, closely connected to the Gaussian kernels of Machine Learning.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
ALPAY, D., ATTIA, H., and LEVANONY, D. (2010), “On the Characteristics of a Class of Gaussian Processes Within the White Noise Space Setting”, Stochastic Processes and their Applications, 120, 1074–1104.
BAVAUD, F. (2006), “Spectral Clustering and Multidimensional Scaling: A Unified View”, in Data Science and Classification, eds. V. Batagelj, H.-H. Bock, A. Ferligoj, and A. Ziberna, Heidelberg: Springer, pp. 131–139.
BAVAUD, F. (2009), “Aggregation Invariance in General Clustering Approaches”, Advances in Data Analysis and Classification, 3, 205–225.
BAVAUD, F. (2011), “Robust Estimation of Location through Schoenberg Rransformations”, submitted.
BENZÉCRI, J.-P., and collaborators. (1973), “L’analyse des données. 1 : La taxinomie. 2 : L’analyse des correspondances”, Paris: Dunod.
BERNSTEIN, S. (1929), “Sur les fonctions absolument monotones”, Acta Mathematica, 52, 1–66.
BERG, C., MATEU, J., and PORCU, E. (2008), “The Dagum Family of Isotropic Correlation Functions”, Bernoulli, 14, 1134–1149.
BHATIA, R. (2006), “Infinitely Divisible Matrices”, The American Mathematical Monthly, 113, 221–235.
BLUMENTHAL, L.M. (1953), “Theory and Applications of Distance Geometry”, Oxford: Oxford University Press.
BORG, I., and GROENEN, P.J.F. (1997), Modern Multidimensional Scaling: Theory and Applications, Heidelberg: Springer.
CAMPBELL, N.A. (1980), “Robust Procedures in Multivariate Analysis I : Robust Covariance Estimation”, Applied Statistics, 29, 231–237.
CHEN, D., HE, Q., and WANG, X. (2007), “On Linear Separability of Data Sets in Feature Space”, Neurocomputing, 70, 2441–2448.
CHRISTAKOS, G. (1984), “On the Problem of Permissible Covariance and Variogram Models”, Water Resources Research, 20, 251–265.
CRITCHLEY, F., and FICHET, B. (1994), “The Partial Order by Inclusion of the Principal Classes of Dissimilarity on a Finite Set, and Some of Their Basic Properties”, in: Lecture Notes in Statistics. Classification and Dissimilarity Analysis, ed. B. van Cutsem, Heidelberg: Springer, pp. 5–65.
CRISTIANINI, N., and SHAWE-TAYLOR, J. (2003), “An Introduction to Support Vector Machines and Other Kernel-based Learning Methods”, Cambridge: Cambridge University Press.
CUADRAS, C.M., and FORTINA, J. (1996), “Weighted Continuous Metric Scaling”, in Multidimensional Statistical Analysis and Theory of Random Matrices, eds. A.K. Gupta and V.L. Girko, The Netherlands: VSP, pp. 27–40.
CUADRAS, C.M., FORTINA, J., and OLIVA, F. (1997), “The Proximity of an Individual to a Population with Applications in Discriminant Analysis”, Journal of Classification, 14, 117–136.
FISHER, R.A. (1936), “The Use of Multiple Measurements in Taxonomic Problems”, Annals of Eugenics, 7, 179–188.
FITZGERALD C. H., and HORN, R.A. (1977), “On Fractional Hadamard Powers of Positive Definite Matrices”, Journal of Mathematical Analysis and Applications, 61, 633–642.
GOWER, J.C. (1966), “Some Distance Properties of Latent Root and Vector Methods Used in Multivariate Analysis”, Biometrika, 53, 325–338.
GOWER, J.C. (1982), “Euclidean Distance Geometry”, Mathematical Scientist, 7, 1–14.
GREENACRE, M., and BLASIUS, J. (2006), ”Multiple Correspondence Analysis and Related Methods”, London: Chapman & Hall.
HAMPEL, F.R., RONCHETTI, E.M., ROUSSEEUW, P.J., and STAHEL, W.A. (1986), “Robust Statistics – The Approach Based on Influence Functions”, New York: Wiley.
HÄRDLE, W. (1991), “Smoothing Techniques with Implementation in S”, Heidelberg: Springer.
HAUSSLER, D. (1999), “Convolution Kernels on Discrete Structures”, Technical Report, UCSC-CRL-99-10, University of California at Santa Cruz.
HOFMANN, T., SCH¨OLKOPF, B., and SMOLA, A.J. (2008), “Kernel Methods in Machine Learning”, Annals of Statistics, 36, 1171–1220.
HORN, R.A., and JOHNSON, C.R. (1991), “Topics in Matrix Analysis”, Cambridge: Cambridge University Press.
JOLY, S., and LE CALVE, G. (1986), “Etude des puissances d’une distance”, Statistique et Analyse des Données, 11, 29–50.
KOLMOGOROV, A.N. (1940), “The Wiener Helix and Other Interesting Curves in the Hilbert Space”, Doklady Akademii Nauk SSSR, 26, 115–118.
LEBART, L., MORINEAU, A., and PIRON, M. (1998), Statistique exploratoire multidimensionnelle, Paris: Dunod.
MARDIA, K.V., KENT, J.T., and BIBBY, J.M. (1979), Multivariate Analysis, New York: Academic Press.
MEULMAN, J.J., VAN DER KOOIJ, A., and HEISER, W.J. (2004), “Principal Components Analysis with Nonlinear Optimal Scaling Transformations for Ordinal and Nominal Data”, in The Sage Handbook of Quantitative Methodology for the Social Sciences, ed. D. Kaplan, pp. 49–70.
MICCHELLI, C. (1986), “Interpolation of Scattered Data: Distance Matrices and Conditionally Positive Definite Functions”, Constructive Approximation, 2, 11–22.
VON NEUMANN, J., and SCHOENBERG, I.J. (1941), “Fourier Integrals and Metric Geometry”, Transactions of the American Mathematical Society, 50, 226–251.
RAO, C.R. (1964), “The Use and Interpretation of Principal Component Analysis in Applied Research”, Sankhya A, 26, 329–358.
SCHILLING, R., SONG, R., and VONDRAČEK, Z. (2010), Bernstein Functions: Theory and Applications, Studies in Mathematics, 37, Berlin: de Gruyter.
SCHOENBERG, I.J. (1937), “On Certain Metric Spaces Arising From Euclidean Spaces by a Change of Metric and Their Imbedding in Hilbert Space”, The Annals of Mathematics, 38, 787–793.
SCHOENBERG, I.J. (1938a), “Metric Spaces and Completely Monotone Functions”, The Annals of Mathematics, 39, 811–841.
SCHOENBERG, I.J. (1938b), “Metric Spaces and Positive Definite Functions”, Transactions of the American Mathematical Society, 44, 522–536.
SCHÖLKOPF, B. (2000), “The Kernel Trick for Distances”, Advances in Neural Information Processing Systems, 13, 301–307.
STEIN, M.L. (1999), “Interpolation of Spatial Data: Some Theory for Kriging”, Heidelberg: Springer.
TORGESON,W.S. (1958), ‘Theory and Methods of Scaling, New York: Wiley.
VAPKNIK, V.N. (1995), The Nature of Statistical Learning Theory, Heidelberg: Springer.
VERBOVEN, S., and HUBERT, M. (2005), “LIBRA: a MATLAB library for Robust Analysis”, Chemometrics and Intelligent Laboratory Systems, 75, 127–136.
YOUNG, G., and HOUSEHOLDER, A.S. (1938), “Discussion of a Set of Points in Terms of Their Mutual Distances”, Psychometrika, 3, 19–22.
WILLIAMS, C.K.I. (2002), “On a Connection Between Kernel PCA and Metric Multidimensional Scaling”, Machine Learning, 46, 11–19.
Author information
Authors and Affiliations
Corresponding author
Additional information
The helpful suggestions of two anonymous reviewers are gratefully acknowledged.
Rights and permissions
About this article
Cite this article
Bavaud, F. On the Schoenberg Transformations in Data Analysis: Theory and Illustrations. J Classif 28, 297–314 (2011). https://doi.org/10.1007/s00357-011-9092-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00357-011-9092-x