Abstract
When objects cannot be represented well by a single feature vector, a collection of feature vectors can be used instead. This is the approach taken in Multiple Instance Learning (MIL), where the collection is called a bag of instances. A bag of instances gives an object more internal structure than a single feature vector does; this improves the expressiveness of the representation, but also adds complexity to the classification of the object. This paper shows that, for situations in which no single instance determines the class label of a bag, simple bag dissimilarity measures can significantly outperform standard multiple instance classifiers. In particular, a measure that computes just the average minimum distance between instances, and a measure based on the Earth Mover's Distance, perform very well.
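The average-minimum-distance measure mentioned in the abstract can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the function name `mean_min_distance` and the symmetrisation (averaging the two directed mean nearest-neighbour distances, as in the modified Hausdorff distance of Dubuisson and Jain) are assumptions made here for the example.

```python
import numpy as np

def mean_min_distance(bag_a, bag_b):
    """Average minimum distance between the instances of two bags.

    bag_a, bag_b: 2-D arrays of shape (n_instances, n_features).
    For each instance in one bag, take the Euclidean distance to its
    nearest instance in the other bag; average these, in both
    directions, to obtain a symmetric bag dissimilarity.
    """
    # Pairwise Euclidean distances between all instance pairs
    # (broadcasting gives a matrix of shape (len(bag_a), len(bag_b))).
    d = np.linalg.norm(bag_a[:, None, :] - bag_b[None, :, :], axis=2)
    # Mean nearest-neighbour distance in each direction, then average.
    return 0.5 * (d.min(axis=1).mean() + d.min(axis=0).mean())

# Two toy bags of 2-D instances.
a = np.array([[0.0, 0.0], [1.0, 0.0]])
b = np.array([[0.0, 0.1], [1.0, -0.1]])
print(mean_min_distance(a, b))
```

Such a dissimilarity matrix between all training bags can then be fed to an ordinary classifier in the dissimilarity-space approach.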
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
Cite this paper
Tax, D.M.J., Loog, M., Duin, R.P.W., Cheplygina, V., Lee, W.-J. (2011). Bag Dissimilarities for Multiple Instance Learning. In: Pelillo, M., Hancock, E.R. (eds.) Similarity-Based Pattern Recognition. SIMBAD 2011. Lecture Notes in Computer Science, vol. 7005. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24471-1_16
Print ISBN: 978-3-642-24470-4
Online ISBN: 978-3-642-24471-1