A Kernel Approach for Learning from almost Orthogonal Patterns

Schölkopf, Bernhard; Weston, Jason; Eskin, Eleazar; Leslie, Christina; Noble, William Stafford

doi:10.1007/3-540-36755-1_44

Bernhard Schölkopf²,
Jason Weston²,
Eleazar Eskin³,
Christina Leslie³ &
…
William Stafford Noble^3,4

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 2430))

Included in the following conference series:

European Conference on Machine Learning

2243 Accesses
12 Citations

Abstract

In kernel methods, all the information about the training data is contained in the Gram matrix. If this matrix has large diagonal values, which arises for many types of kernels, then kernel methods do not perform well. We propose and test several methods for dealing with this problem by reducing the dynamic range of the matrix while preserving the positive definiteness of the Hessian of the quadratic programming problem that one has to solve when training a Support Vector Machine.

Download to read the full chapter text

Chapter PDF

Extraction of Dimension Reduced Features from Empirical Kernel Vector

Opening the Black Box: Revealing Interpretable Sequence Motifs in Kernel-Based Learning Algorithms

Kernel classification using a linear programming approach

Article 14 June 2019

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Bibliography

A. A. Alizadeh et al. Distinct types of diffuse large b-cell lymphoma identified by gene expression profiling. Nature, 403:503–511, 2000. Data available from http://llmpp.nih.gov/lymphoma.
Article Google Scholar
U. Alon, N. Barkai, D. Notterman, K. Gish, S. Ybarra, D. Mack, and A. Levine. Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon cancer tissues probed by oligonucleotide arrays. Cell Biology, 96:6745–6750, 1999.
Google Scholar
C. Berg, J. P. R. Christensen, and P. Ressel. Harmonic Analysis on Semigroups. Springer-Verlag, New York, 1984.
MATH Google Scholar
B. E. Boser, I. M. Guyon, and V. Vapnik. A training algorithm for optimal margin classifiers. In D. Haussler, editor, Proceedings of the 5th Annual ACM Workshop on Computational Learning Theory, pages 144–152, Pittsburgh, PA, July 1992. ACM Press.
Google Scholar
M. P. S. Brown, W. N. Grundy, D. Lin, N. Cristianini, C. Sugnet, T. S. Furey, M. Ares, and D. Haussler. Knowledge-based analysis of microarray gene expression data using support vector machines. Proceedings of the National Academy of Sciences, 97(1):262–267, 2000.
Google Scholar
C. Cortes and V. Vapnik. Support vector networks. Machine Learning, 20: 273–297, 1995.
MATH Google Scholar
I. Guyon, J. Weston, S. Barnhill, and V. Vapnik. Gene selection for cancer classification using support vector machines. Machine Learning, 2001.
Google Scholar
D. Haussler. Convolutional kernels on discrete structures. Technical Report UCSC-CRL-99-10, Computer Science Department, University of California at Santa Cruz, 1999.
Google Scholar
T. S. Jaakkola, M. Diekhans, and D. Haussler. A discriminative framework for detecting remote protein homologies. Journal of Computational Biology, 7: 95–114, 2000.
Article Google Scholar
T. S. Jaakkola and D. Haussler. Exploiting generative models in discriminative classifiers. In M. S. Kearns, S. A. Solla, and D. A. Cohn, editors, Advances in Neural Information Processing Systems 11, Cambridge, MA, 1999. MIT Press.
Google Scholar
C. Leslie, E. Eskin, and W. S. Noble. The spectrum kernel: A string kernel for SVM protein classification. Proceedings of the Pacific Symposium on Biocomputing, 2002. To appear.
Google Scholar
L. Liao and W. S. Noble. Combining pairwise sequence similarity and support vector machines for remote protein homology detection. Proceedings of the Sixth International Conference on Computational Molecular Biology, 2002.
Google Scholar
H. Lodhi, C. Saunders, J. Shawe-Taylor, N. Cristianini, and C. Watkins. Text classification using string kernels. Journal of Machine Learning Research, 2: 419–444, 2002.
Article MATH Google Scholar
A. G. Murzin, S. E. Brenner, T. Hubbard, and C. Chothia. SCOP: A structural classification of proteins database for the investigation of sequences and structures. Journal of Molecular Biology, pages 247:536-540, 1995.
Google Scholar
E. Osuna and F. Girosi. Reducing the run-time complexity in support vector machines. In B. Schölkopf, C. J. C. Burges, and A. J. Smola, editors, Advances in Kernel Methods — Support Vector Learning, pages 271–284, Cambridge, MA, 1999. MIT Press.
Google Scholar
B. Schölkopf and A. J. Smola. Learning with Kernels. MIT Press, Cambridge, MA, 2002.
Google Scholar
K. Tsuda. Support vector classifier with asymmetric kernel function. In M. Verleysen, editor, Proceedings ESANN, pages 183–188, Brussels, 1999. D Facto. K. Tsuda, M. Kawanabe, G. Rätsch, S. Sonnenburg, and K.R. Müller. A new discriminative kernel from probabilistic models. In T.G. Dietterich, S. Becker, and Z. Ghahramani, editors, Advances in Neural Information Processing Systems, volume 14. MIT Press, 2002. To appear.
Google Scholar
V. Vapnik. Estimation of Dependences Based on Empirical Data [in Russian]. Nauka, Moscow, 1979. (English translation: Springer Verlag, New York, 1982).
Google Scholar
V. Vapnik. Statistical Learning Theory. John Wiley and Sons, New York, 1998.
MATH Google Scholar
C. Watkins. Dynamic alignment kernels. In A. J. Smola, P. L. Bartlett, B. Schölkopf, and D. Schuurmans, editors, Advances in Large Margin Classifiers, pages 39–50, Cambridge, MA, 2000. MIT Press.
Google Scholar
J. Weston, A. Elisseeff, and B. Schölkopf. Use of the `0-norm with linear models and kernel methods. Biowulf Technical report, 2001. http://www.conclu.de/~jason/.
J. Weston, F. Pérez-Cruz, O. Bousquet, O. Chapelle, A. Elisseeff, and B. Schölkopf. Feature selection and transduction for prediction of molecular bioactivity for drug design, 2002. http://www.conclu.de/~jason/kdd/kdd.html.
J. Weston and B. Schölkopf. Dealing with large diagonals in kernel matrices. In New Trends in Optimization and Computational algorithms (NTOC 2001), Kyoto, Japan, 2001.
Google Scholar

Download references

Author information

Authors and Affiliations

Max-Planck-Institut für biologische Kybernetik, Spemannstr. 38, D-72076, Tübingen, Germany
Bernhard Schölkopf & Jason Weston
Department of Computer Science, Columbia University, New York
Eleazar Eskin, Christina Leslie & William Stafford Noble
Columbia Genome Center, Columbia University, New York
William Stafford Noble

Authors

Bernhard Schölkopf
View author publications
You can also search for this author in PubMed Google Scholar
Jason Weston
View author publications
You can also search for this author in PubMed Google Scholar
Eleazar Eskin
View author publications
You can also search for this author in PubMed Google Scholar
Christina Leslie
View author publications
You can also search for this author in PubMed Google Scholar
William Stafford Noble
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Helsinki, P.O. Box 26, 00014, Helsinki, Finland
Tapio Elomaa , Heikki Mannila & Hannu Toivonen , &

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Schölkopf, B., Weston, J., Eskin, E., Leslie, C., Noble, W.S. (2002). A Kernel Approach for Learning from almost Orthogonal Patterns. In: Elomaa, T., Mannila, H., Toivonen, H. (eds) Machine Learning: ECML 2002. ECML 2002. Lecture Notes in Computer Science(), vol 2430. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36755-1_44

Download citation

DOI: https://doi.org/10.1007/3-540-36755-1_44
Published: 20 September 2002
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44036-9
Online ISBN: 978-3-540-36755-0
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

A Kernel Approach for Learning from almost Orthogonal Patterns

Abstract

Chapter PDF

Similar content being viewed by others

Extraction of Dimension Reduced Features from Empirical Kernel Vector

Opening the Black Box: Revealing Interpretable Sequence Motifs in Kernel-Based Learning Algorithms

Kernel classification using a linear programming approach

Keywords

Bibliography

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A Kernel Approach for Learning from almost Orthogonal Patterns

Abstract

Chapter PDF

Similar content being viewed by others

Extraction of Dimension Reduced Features from Empirical Kernel Vector

Opening the Black Box: Revealing Interpretable Sequence Motifs in Kernel-Based Learning Algorithms

Kernel classification using a linear programming approach

Keywords

Bibliography

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation