Regret Analysis for Performance Metrics in Multi-Label Classification: The Case of Hamming and Subset Zero-One Loss

Dembczyński, Krzysztof; Waegeman, Willem; Cheng, Weiwei; Hüllermeier, Eyke

doi:10.1007/978-3-642-15880-3_24

Krzysztof Dembczyński^23,25,
Willem Waegeman²⁴,
Weiwei Cheng²³ &
…
Eyke Hüllermeier²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6321))

Included in the following conference series:

Joint European Conference on Machine Learning and Knowledge Discovery in Databases

Abstract

In multi-label classification (MLC), each instance is associated with a subset of labels instead of a single class, as in conventional classification, and this generalization enables the definition of a multitude of loss functions. Indeed, a large number of losses has already been proposed and is commonly applied as performance metrics in experimental studies. However, even though these loss functions are of a quite different nature, a concrete connection between the type of multi-label classifier used and the loss to be minimized is rarely established, implicitly giving the misleading impression that the same method can be optimal for different loss functions. In this paper, we elaborate on risk minimization and the connection between loss functions in MLC, both theoretically and empirically. In particular, we compare two important loss functions, namely the Hamming loss and the subset 0/1 loss. We perform a regret analysis, showing how poor a classifier intended to minimize the subset 0/1 loss can become in terms of Hamming loss and vice versa. The theoretical results are corroborated by experimental studies, and their implications for MLC methods are discussed in a broader context.

Download to read the full chapter text

Chapter PDF

Surrogate regret bounds for generalized classification performance metrics

Article Open access 14 October 2016

A flexible class of dependence-aware multi-label loss functions

Article Open access 13 January 2022

A Blended Metric for Multi-label Optimisation and Evaluation

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Boutell, M., Luo, J., Shen, X., Brown, C.: Learning multi-label scene classification. Pattern Recognition 37(9), 1757–1771 (2004)
Article Google Scholar
Ghamrawi, N., McCallum, A.: Collective multi-label classification. In: CIKM 2005, pp. 195–200 (2005)
Google Scholar
Amit, Y., Dekel, O., Singer, Y.: A boosting algorithm for label covering in multilabel problems. In: JMLR W&P, vol. 2, pp. 27–34 (2007)
Google Scholar
Tsoumakas, G., Katakis, I.: Multi label classification: An overview. Int. J. Data Warehousing and Mining 3(3), 1–13 (2007)
Google Scholar
Cheng, W., Hüllermeier, E.: Combining instance-based learning and logistic regression for multilabel classification. Machine Learning 76(2-3), 211–225 (2009)
Article Google Scholar
Tsoumakas, G., Katakis, I., Vlahavas, I.: Mining multi-label data. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook. Springer, Heidelberg (2010)
Google Scholar
Dembczyński, K., Cheng, W., Hüllermeier, E.: Bayes optimal multilabel classification via probabilistic classifier chains. In: ICML 2010 (2010)
Google Scholar
Taskar, B., Guestrin, C., Koller, D.: Max-margin markov networks. In: NIPS 16. MIT Press, Cambridge (2004)
Google Scholar
McAllester, D.: Generalization bounds and consistency for structured labeling. In: Predicting Structured Data. MIT Press, Cambridge (2007)
Google Scholar
MacKay, D.J.C.: Information Theory, Inference, and Learning Algorithms. Cambridge University Press, Cambridge (2003)
MATH Google Scholar
Breiman, L., Friedman, J.: Predicting multivariate responses in multiple linear regression. J. R. Stat. Soc. Ser. B 69, 3–54 (1997)
Article MathSciNet Google Scholar
Caruana, R.: Multitask learning: A knowledge-based source of inductive bias. Machine Learning 28, 41–75 (1997)
Article Google Scholar
Nelsen, R.: An Introduction to Copulas, 2nd edn. Springer, Heidelberg (2006)
MATH Google Scholar
Witten, I.H., Frank, E.: Data Mining: Practical machine learning tools and techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
MATH Google Scholar
Platt, J.C.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. In: Advances in Large Margin Classifiers, pp. 61–74. MIT Press, Cambridge (1999)
Google Scholar
Dembczyński, K., Kotłowski, W., Słowiński, R.: Maximum likelihood rule ensembles. In: ICML 2008, pp. 224–231 (2008)
Google Scholar
Tsoumakas, G., Vlahavas, I.: Random k-labelsets: An ensemble method for multilabel classification. In: Kok, J.N., Koronacki, J., Lopez de Mantaras, R., Matwin, S., Mladenič, D., Skowron, A. (eds.) ECML 2007. LNCS (LNAI), vol. 4701, pp. 406–417. Springer, Heidelberg (2007)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, Marburg University, Hans-Meerwein-Str, 35039, Marburg, Germany
Krzysztof Dembczyński, Weiwei Cheng & Eyke Hüllermeier
Department of Applied Mathematics, Biometrics and Process Control, Ghent University, Coupure links 653, B-9000, Ghent, Belgium
Willem Waegeman
Institute of Computing Science, Poznań University of Technology, Piotrowo 2, 60-965, Poznań, Poland
Krzysztof Dembczyński

Authors

Krzysztof Dembczyński
View author publications
You can also search for this author in PubMed Google Scholar
Willem Waegeman
View author publications
You can also search for this author in PubMed Google Scholar
Weiwei Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Eyke Hüllermeier
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Departamento de Matemáticas, Estadística y Computación, Universidad de Cantabria, Avenida de los Castros, s/n, 39071, Santander, Spain
José Luis Balcázar
Yahoo! Research Barcelona, Avinguda Diagonal 177, 08018, Barcelona, Spain
Francesco Bonchi
Yahoo! Research Barcelona, Avinguda Diagnonal 177, 08018, Barcelona, Spain
Aristides Gionis
TAO, CNRS-INRIA-LRI, Université Paris-Sud, 91405, Orsay, France
Michèle Sebag

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dembczyński, K., Waegeman, W., Cheng, W., Hüllermeier, E. (2010). Regret Analysis for Performance Metrics in Multi-Label Classification: The Case of Hamming and Subset Zero-One Loss. In: Balcázar, J.L., Bonchi, F., Gionis, A., Sebag, M. (eds) Machine Learning and Knowledge Discovery in Databases. ECML PKDD 2010. Lecture Notes in Computer Science(), vol 6321. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15880-3_24

Download citation

DOI: https://doi.org/10.1007/978-3-642-15880-3_24
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15879-7
Online ISBN: 978-3-642-15880-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Regret Analysis for Performance Metrics in Multi-Label Classification: The Case of Hamming and Subset Zero-One Loss

Abstract

Chapter PDF

Similar content being viewed by others

Surrogate regret bounds for generalized classification performance metrics

A flexible class of dependence-aware multi-label loss functions

A Blended Metric for Multi-label Optimisation and Evaluation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Regret Analysis for Performance Metrics in Multi-Label Classification: The Case of Hamming and Subset Zero-One Loss

Abstract

Chapter PDF

Similar content being viewed by others

Surrogate regret bounds for generalized classification performance metrics

A flexible class of dependence-aware multi-label loss functions

A Blended Metric for Multi-label Optimisation and Evaluation

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation