Skip to main content

Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification

  • Conference paper
  • First Online:
Image Processing and Capsule Networks (ICIPCN 2020)

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1200))

Included in the following conference series:

Abstract

In multi-instance problems (MIL), an arbitrary number of instances is associated with a class label. Therefore, the labeling of training data becomes simpler (since it is done together, instead of individually) with the disadvantage that a weakly supervised database is produced [9]. In the PCRY, each restaurant is represented by a set of images that share the attribute label(s) of that establishment. This paper explores the use of previously learned attribute extractors, trained in 3 different databases that are similar and complementary to the PCRY database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Chen, Z., Chi, Z., Fu, H., Feng, D.: Multi-instance multi-label image classification: a neural approach. Neurocomputing 99, 298–306 (2013)

    Article  Google Scholar 

  2. Zhang, Y., Wang, Y., Liu, X.Y., Mi, S., Zhang, M.L.: Large-scale multi-label classification using unknown streaming images. Pattern Recogn. 99, 107100 (2020)

    Article  Google Scholar 

  3. Li, P., Chen, P., Xie, Y., Zhang, D.: Bi-modal learning with channel-wise attention for multi-label image classification. IEEE Access 8, 9965–9977 (2020)

    Article  Google Scholar 

  4. Yu, W.J., Chen, Z.D., Luo, X., Liu, W., Xu, X.S.: DELTA: a deep dual-stream network for multi-label image classification. Pattern Recogn. 91, 322–331 (2019)

    Article  Google Scholar 

  5. Wang, S., Zhu, Y., Yu, L., Chen, H., Lin, H., Wan, X., Fan, X., Heng, P.A.: RMDL: recalibrated multi-instance deep learning for whole slide gastric image classification. Med. Image Anal. 58, 101549 (2019)

    Article  Google Scholar 

  6. Loukas, C., Sgouros, N.P.: Multi-instance multi-label learning for surgical image annotation. Int. J. Med. Robot. Comput. Assist. Surg. 16, e2058 (2019)

    Google Scholar 

  7. Zhang, M., Li, C., Wang, X.: Multi-view metric learning for multi-label image classification. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 2134–2138. IEEE, September 2019

    Google Scholar 

  8. Varela, N., Silva, J., Gonzalez, F.M., Palencia, P., Palma, H.H., Pineda, O.B.: Method for the recovery of images in databases of rice grains from visual content. Procedia Comput. Sci. 170, 983–988 (2020)

    Article  Google Scholar 

  9. Song, L., Liu, J., Qian, B., Sun, M., Yang, K., Sun, M., Abbas, S.: A deep multi-modal CNN for multi-instance multi-label image classification. IEEE Trans. Image Process. 27(12), 6025–6038 (2018)

    Article  MathSciNet  Google Scholar 

  10. Yang, Y., Fu, Z.Y., Zhan, D.C., Liu, Z.B., Jiang, Y.: Semi-supervised multi-modal multi-instance multi-label deep network with optimal transport. IEEE Trans. Knowl. Data Eng. (2019)

    Google Scholar 

  11. Viloria, A., Acuña, G.C., Franco, D.J.A., Hernández-Palma, H., Fuentes, J.P., Rambal, E.P.: Integration of data mining techniques to PostgreSQL database manager system. Procedia Comput. Sci. 155, 575–580 (2019)

    Article  Google Scholar 

  12. Tsoumakas, G., Katakis, I., Vlahavas, I.: Data Mining and Knowledge Discovery Handbook (2009)

    Google Scholar 

  13. Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Ghahramani, Z., Welling, M., et al. (eds.) Advances in Neural Information Processing Systems, pp. 3320–3328 (2014)

    Google Scholar 

  14. Hu, H., Cui, Z., Wu, J., Wang, K.: Metric learning-based multi-instance multi-label classification with label correlation. IEEE Access 7, 109899–109909 (2019)

    Article  Google Scholar 

  15. Bossard, L., Guillaumin, M., Van Gool, L.: Food-101: mining discriminative components with random forests. In: European Conference on Computer Vision (2014)

    Google Scholar 

  16. Zeng, T., Ji, S.: Deep convolutional neural networks for multi-instance multi-task learning. In: 2015 IEEE International Conference on Data Mining, pp. 579–588. IEEE, November 2015

    Google Scholar 

  17. Li, J., Liu, J., Yongkang, W., Nishimura, S., Kankanhalli, M.: Weakly-supervised multi-person action recognition in 360° videos. In: The IEEE Winter Conference on Applications of Computer Vision, pp. 508–516 (2020)

    Google Scholar 

  18. Zhu, M., Li, Y., Pan, Z., Yang, J.: Automatic modulation recognition of compound signals using a deep multi-label classifier: a case study with radar jamming signals. Sig. Process. 169, 107393 (2020)

    Article  Google Scholar 

  19. Yang, H., Tianyi Zhou, J., Cai, J., Soon Ong, Y.: MIML-FCN + : multi-instance multi-label learning via fully convolutional networks with privileged information. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1577–1585 (2017)

    Google Scholar 

  20. Viloria, A., Lezama, O.B.P.: Improvements for determining the number of clusters in k-means for innovation databases in SMEs. ANT/EDI40, pp. 1201–1206 (2019)

    Google Scholar 

  21. Li, D., Wang, J., Zhao, X., Liu, Y., Wang, D.: Multiple kernel-based multi-instance learning algorithm for image classification. J. Vis. Commun. Image Represent. 25(5), 1112–1117 (2014)

    Article  Google Scholar 

  22. Feng, S., Xiong, W., Li, B., Lang, C., Huang, X.: Hierarchical sparse representation based multi-instance semi-supervised learning with application to image categorization. Sig. Process. 94, 595–607 (2014)

    Article  Google Scholar 

  23. Zhu, F., Li, H., Ouyang, W., Yu, N., Wang, X.: Learning spatial regularization with image-level supervisions for multi-label image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5513–5522 (2017)

    Google Scholar 

  24. Shang, J., Hong, S., Zhou, Y., Wu, M., Li, H.: Knowledge guided multi-instance multi-label learning via neural networks in medicines prediction. In: Asian Conference on Machine Learning, pp. 831–846, November 2018

    Google Scholar 

  25. Wu, J.S., Huang, S.J., Zhou, Z.H.: Genome-wide protein function prediction through multi-instance multi-label learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 11(5), 891–902 (2014)

    Article  Google Scholar 

  26. Ding, X., Li, B., Xiong, W., Guo, W., Hu, W., Wang, B.: Multi-instance multi-label learning combining hierarchical context and its application to image annotation. IEEE Trans. Multimed. 18(8), 1616–1627 (2016)

    Article  Google Scholar 

  27. Cheplygina, V., de Bruijne, M., Pluim, J.P.: Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis. Med. Image Anal. 54, 280–296 (2019)

    Article  Google Scholar 

  28. Shakya, S.: Machine learning based nonlinearity determination for optical fiber communication-review. J. Ubiquit. Comput. Commun. Technol. (UCCT) 1(02), 121–127 (2019)

    Google Scholar 

  29. Laib, L., Allili, M.S., Ait-Aoudia, S.: A probabilistic topic model for event-based image classification and multi-label annotation. Sig. Process. Image Commun. 76, 283–294 (2019)

    Article  Google Scholar 

  30. García-Domínguez, M., Domínguez, C., Heras, J., Mata, E., Pascual, V.: FrImCla: a framework for image classification using traditional and transfer learning techniques. IEEE Access 8, 53443–53455 (2020)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jesus Silva .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Silva, J., Varela, N., Mendoza-Palechor, F.E., Lezama, O.B.P. (2021). Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification. In: Chen, J.IZ., Tavares, J.M.R.S., Shakya, S., Iliyasu, A.M. (eds) Image Processing and Capsule Networks. ICIPCN 2020. Advances in Intelligent Systems and Computing, vol 1200. Springer, Cham. https://doi.org/10.1007/978-3-030-51859-2_16

Download citation

Publish with us

Policies and ethics