Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification

Silva, Jesus; Varela, Noel; Mendoza-Palechor, Fabio E.; Lezama, Omar Bonerge Pineda

doi:10.1007/978-3-030-51859-2_16

Jesus Silva¹⁸,
Noel Varela¹⁸,
Fabio E. Mendoza-Palechor¹⁸ &
…
Omar Bonerge Pineda Lezama¹⁹

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1200))

Included in the following conference series:

International Conference on Image Processing and Capsule Networks

918 Accesses
1 Citations

Abstract

In multi-instance problems (MIL), an arbitrary number of instances is associated with a class label. Therefore, the labeling of training data becomes simpler (since it is done together, instead of individually) with the disadvantage that a weakly supervised database is produced [9]. In the PCRY, each restaurant is represented by a set of images that share the attribute label(s) of that establishment. This paper explores the use of previously learned attribute extractors, trained in 3 different databases that are similar and complementary to the PCRY database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Deep F-Measure Maximization in Multi-label Classification: A Comparative Study

Transpose and Mask: Simple and Effective Logit-Based Knowledge Distillation for Multi-attribute and Multi-label Classification

Category-Wise Fine-Tuning for Image Multi-label Classification with Partial Labels

References

Chen, Z., Chi, Z., Fu, H., Feng, D.: Multi-instance multi-label image classification: a neural approach. Neurocomputing 99, 298–306 (2013)
Article Google Scholar
Zhang, Y., Wang, Y., Liu, X.Y., Mi, S., Zhang, M.L.: Large-scale multi-label classification using unknown streaming images. Pattern Recogn. 99, 107100 (2020)
Article Google Scholar
Li, P., Chen, P., Xie, Y., Zhang, D.: Bi-modal learning with channel-wise attention for multi-label image classification. IEEE Access 8, 9965–9977 (2020)
Article Google Scholar
Yu, W.J., Chen, Z.D., Luo, X., Liu, W., Xu, X.S.: DELTA: a deep dual-stream network for multi-label image classification. Pattern Recogn. 91, 322–331 (2019)
Article Google Scholar
Wang, S., Zhu, Y., Yu, L., Chen, H., Lin, H., Wan, X., Fan, X., Heng, P.A.: RMDL: recalibrated multi-instance deep learning for whole slide gastric image classification. Med. Image Anal. 58, 101549 (2019)
Article Google Scholar
Loukas, C., Sgouros, N.P.: Multi-instance multi-label learning for surgical image annotation. Int. J. Med. Robot. Comput. Assist. Surg. 16, e2058 (2019)
Google Scholar
Zhang, M., Li, C., Wang, X.: Multi-view metric learning for multi-label image classification. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 2134–2138. IEEE, September 2019
Google Scholar
Varela, N., Silva, J., Gonzalez, F.M., Palencia, P., Palma, H.H., Pineda, O.B.: Method for the recovery of images in databases of rice grains from visual content. Procedia Comput. Sci. 170, 983–988 (2020)
Article Google Scholar
Song, L., Liu, J., Qian, B., Sun, M., Yang, K., Sun, M., Abbas, S.: A deep multi-modal CNN for multi-instance multi-label image classification. IEEE Trans. Image Process. 27(12), 6025–6038 (2018)
Article MathSciNet Google Scholar
Yang, Y., Fu, Z.Y., Zhan, D.C., Liu, Z.B., Jiang, Y.: Semi-supervised multi-modal multi-instance multi-label deep network with optimal transport. IEEE Trans. Knowl. Data Eng. (2019)
Google Scholar
Viloria, A., Acuña, G.C., Franco, D.J.A., Hernández-Palma, H., Fuentes, J.P., Rambal, E.P.: Integration of data mining techniques to PostgreSQL database manager system. Procedia Comput. Sci. 155, 575–580 (2019)
Article Google Scholar
Tsoumakas, G., Katakis, I., Vlahavas, I.: Data Mining and Knowledge Discovery Handbook (2009)
Google Scholar
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Ghahramani, Z., Welling, M., et al. (eds.) Advances in Neural Information Processing Systems, pp. 3320–3328 (2014)
Google Scholar
Hu, H., Cui, Z., Wu, J., Wang, K.: Metric learning-based multi-instance multi-label classification with label correlation. IEEE Access 7, 109899–109909 (2019)
Article Google Scholar
Bossard, L., Guillaumin, M., Van Gool, L.: Food-101: mining discriminative components with random forests. In: European Conference on Computer Vision (2014)
Google Scholar
Zeng, T., Ji, S.: Deep convolutional neural networks for multi-instance multi-task learning. In: 2015 IEEE International Conference on Data Mining, pp. 579–588. IEEE, November 2015
Google Scholar
Li, J., Liu, J., Yongkang, W., Nishimura, S., Kankanhalli, M.: Weakly-supervised multi-person action recognition in 360° videos. In: The IEEE Winter Conference on Applications of Computer Vision, pp. 508–516 (2020)
Google Scholar
Zhu, M., Li, Y., Pan, Z., Yang, J.: Automatic modulation recognition of compound signals using a deep multi-label classifier: a case study with radar jamming signals. Sig. Process. 169, 107393 (2020)
Article Google Scholar
Yang, H., Tianyi Zhou, J., Cai, J., Soon Ong, Y.: MIML-FCN + : multi-instance multi-label learning via fully convolutional networks with privileged information. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1577–1585 (2017)
Google Scholar
Viloria, A., Lezama, O.B.P.: Improvements for determining the number of clusters in k-means for innovation databases in SMEs. ANT/EDI40, pp. 1201–1206 (2019)
Google Scholar
Li, D., Wang, J., Zhao, X., Liu, Y., Wang, D.: Multiple kernel-based multi-instance learning algorithm for image classification. J. Vis. Commun. Image Represent. 25(5), 1112–1117 (2014)
Article Google Scholar
Feng, S., Xiong, W., Li, B., Lang, C., Huang, X.: Hierarchical sparse representation based multi-instance semi-supervised learning with application to image categorization. Sig. Process. 94, 595–607 (2014)
Article Google Scholar
Zhu, F., Li, H., Ouyang, W., Yu, N., Wang, X.: Learning spatial regularization with image-level supervisions for multi-label image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5513–5522 (2017)
Google Scholar
Shang, J., Hong, S., Zhou, Y., Wu, M., Li, H.: Knowledge guided multi-instance multi-label learning via neural networks in medicines prediction. In: Asian Conference on Machine Learning, pp. 831–846, November 2018
Google Scholar
Wu, J.S., Huang, S.J., Zhou, Z.H.: Genome-wide protein function prediction through multi-instance multi-label learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 11(5), 891–902 (2014)
Article Google Scholar
Ding, X., Li, B., Xiong, W., Guo, W., Hu, W., Wang, B.: Multi-instance multi-label learning combining hierarchical context and its application to image annotation. IEEE Trans. Multimed. 18(8), 1616–1627 (2016)
Article Google Scholar
Cheplygina, V., de Bruijne, M., Pluim, J.P.: Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis. Med. Image Anal. 54, 280–296 (2019)
Article Google Scholar
Shakya, S.: Machine learning based nonlinearity determination for optical fiber communication-review. J. Ubiquit. Comput. Commun. Technol. (UCCT) 1(02), 121–127 (2019)
Google Scholar
Laib, L., Allili, M.S., Ait-Aoudia, S.: A probabilistic topic model for event-based image classification and multi-label annotation. Sig. Process. Image Commun. 76, 283–294 (2019)
Article Google Scholar
García-Domínguez, M., Domínguez, C., Heras, J., Mata, E., Pascual, V.: FrImCla: a framework for image classification using traditional and transfer learning techniques. IEEE Access 8, 53443–53455 (2020)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Universidad de la Costa, St. 58 #66, Barranquilla, Atlántico, Colombia
Jesus Silva, Noel Varela & Fabio E. Mendoza-Palechor
Universidad Tecnológica Centroamericana (UNITEC), San Pedro Sula, Honduras
Omar Bonerge Pineda Lezama

Authors

Jesus Silva
View author publications
You can also search for this author in PubMed Google Scholar
Noel Varela
View author publications
You can also search for this author in PubMed Google Scholar
Fabio E. Mendoza-Palechor
View author publications
You can also search for this author in PubMed Google Scholar
Omar Bonerge Pineda Lezama
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jesus Silva .

Editor information

Editors and Affiliations

Department of Electrical Engineering, Dayeh University, Changhua, Taiwan
Joy Iong-Zong Chen
Instituto de Ciência e Inovação em Engenharia Mecânica e Engenharia Industrial, Departamento de Engenharia Mecânica, Faculdade de Engenharia, Universidade do Porto, Porto, Portugal
João Manuel R. S. Tavares
Department of Electronics and Computer Engineering, Tribhuvan University, Lalitpur, Nepal
Subarna Shakya
College of Engineering, Prince Sattam Bin Abdulaziz University, Al-Kharj, Saudi Arabia
Abdullah M. Iliyasu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Silva, J., Varela, N., Mendoza-Palechor, F.E., Lezama, O.B.P. (2021). Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification. In: Chen, J.IZ., Tavares, J.M.R.S., Shakya, S., Iliyasu, A.M. (eds) Image Processing and Capsule Networks. ICIPCN 2020. Advances in Intelligent Systems and Computing, vol 1200. Springer, Cham. https://doi.org/10.1007/978-3-030-51859-2_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-51859-2_16
Published: 24 July 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51858-5
Online ISBN: 978-3-030-51859-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Deep F-Measure Maximization in Multi-label Classification: A Comparative Study

Transpose and Mask: Simple and Effective Logit-Based Knowledge Distillation for Multi-attribute and Multi-label Classification

Category-Wise Fine-Tuning for Image Multi-label Classification with Partial Labels

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Deep F-Measure Maximization in Multi-label Classification: A Comparative Study

Transpose and Mask: Simple and Effective Logit-Based Knowledge Distillation for Multi-attribute and Multi-label Classification

Category-Wise Fine-Tuning for Image Multi-label Classification with Partial Labels

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation