Abstract
In multi-instance problems (MIL), an arbitrary number of instances is associated with a class label. Therefore, the labeling of training data becomes simpler (since it is done together, instead of individually) with the disadvantage that a weakly supervised database is produced [9]. In the PCRY, each restaurant is represented by a set of images that share the attribute label(s) of that establishment. This paper explores the use of previously learned attribute extractors, trained in 3 different databases that are similar and complementary to the PCRY database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, Z., Chi, Z., Fu, H., Feng, D.: Multi-instance multi-label image classification: a neural approach. Neurocomputing 99, 298–306 (2013)
Zhang, Y., Wang, Y., Liu, X.Y., Mi, S., Zhang, M.L.: Large-scale multi-label classification using unknown streaming images. Pattern Recogn. 99, 107100 (2020)
Li, P., Chen, P., Xie, Y., Zhang, D.: Bi-modal learning with channel-wise attention for multi-label image classification. IEEE Access 8, 9965–9977 (2020)
Yu, W.J., Chen, Z.D., Luo, X., Liu, W., Xu, X.S.: DELTA: a deep dual-stream network for multi-label image classification. Pattern Recogn. 91, 322–331 (2019)
Wang, S., Zhu, Y., Yu, L., Chen, H., Lin, H., Wan, X., Fan, X., Heng, P.A.: RMDL: recalibrated multi-instance deep learning for whole slide gastric image classification. Med. Image Anal. 58, 101549 (2019)
Loukas, C., Sgouros, N.P.: Multi-instance multi-label learning for surgical image annotation. Int. J. Med. Robot. Comput. Assist. Surg. 16, e2058 (2019)
Zhang, M., Li, C., Wang, X.: Multi-view metric learning for multi-label image classification. In: 2019 IEEE International Conference on Image Processing (ICIP), pp. 2134–2138. IEEE, September 2019
Varela, N., Silva, J., Gonzalez, F.M., Palencia, P., Palma, H.H., Pineda, O.B.: Method for the recovery of images in databases of rice grains from visual content. Procedia Comput. Sci. 170, 983–988 (2020)
Song, L., Liu, J., Qian, B., Sun, M., Yang, K., Sun, M., Abbas, S.: A deep multi-modal CNN for multi-instance multi-label image classification. IEEE Trans. Image Process. 27(12), 6025–6038 (2018)
Yang, Y., Fu, Z.Y., Zhan, D.C., Liu, Z.B., Jiang, Y.: Semi-supervised multi-modal multi-instance multi-label deep network with optimal transport. IEEE Trans. Knowl. Data Eng. (2019)
Viloria, A., Acuña, G.C., Franco, D.J.A., Hernández-Palma, H., Fuentes, J.P., Rambal, E.P.: Integration of data mining techniques to PostgreSQL database manager system. Procedia Comput. Sci. 155, 575–580 (2019)
Tsoumakas, G., Katakis, I., Vlahavas, I.: Data Mining and Knowledge Discovery Handbook (2009)
Yosinski, J., Clune, J., Bengio, Y., Lipson, H.: How transferable are features in deep neural networks? In: Ghahramani, Z., Welling, M., et al. (eds.) Advances in Neural Information Processing Systems, pp. 3320–3328 (2014)
Hu, H., Cui, Z., Wu, J., Wang, K.: Metric learning-based multi-instance multi-label classification with label correlation. IEEE Access 7, 109899–109909 (2019)
Bossard, L., Guillaumin, M., Van Gool, L.: Food-101: mining discriminative components with random forests. In: European Conference on Computer Vision (2014)
Zeng, T., Ji, S.: Deep convolutional neural networks for multi-instance multi-task learning. In: 2015 IEEE International Conference on Data Mining, pp. 579–588. IEEE, November 2015
Li, J., Liu, J., Yongkang, W., Nishimura, S., Kankanhalli, M.: Weakly-supervised multi-person action recognition in 360° videos. In: The IEEE Winter Conference on Applications of Computer Vision, pp. 508–516 (2020)
Zhu, M., Li, Y., Pan, Z., Yang, J.: Automatic modulation recognition of compound signals using a deep multi-label classifier: a case study with radar jamming signals. Sig. Process. 169, 107393 (2020)
Yang, H., Tianyi Zhou, J., Cai, J., Soon Ong, Y.: MIML-FCN + : multi-instance multi-label learning via fully convolutional networks with privileged information. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1577–1585 (2017)
Viloria, A., Lezama, O.B.P.: Improvements for determining the number of clusters in k-means for innovation databases in SMEs. ANT/EDI40, pp. 1201–1206 (2019)
Li, D., Wang, J., Zhao, X., Liu, Y., Wang, D.: Multiple kernel-based multi-instance learning algorithm for image classification. J. Vis. Commun. Image Represent. 25(5), 1112–1117 (2014)
Feng, S., Xiong, W., Li, B., Lang, C., Huang, X.: Hierarchical sparse representation based multi-instance semi-supervised learning with application to image categorization. Sig. Process. 94, 595–607 (2014)
Zhu, F., Li, H., Ouyang, W., Yu, N., Wang, X.: Learning spatial regularization with image-level supervisions for multi-label image classification. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5513–5522 (2017)
Shang, J., Hong, S., Zhou, Y., Wu, M., Li, H.: Knowledge guided multi-instance multi-label learning via neural networks in medicines prediction. In: Asian Conference on Machine Learning, pp. 831–846, November 2018
Wu, J.S., Huang, S.J., Zhou, Z.H.: Genome-wide protein function prediction through multi-instance multi-label learning. IEEE/ACM Trans. Comput. Biol. Bioinform. 11(5), 891–902 (2014)
Ding, X., Li, B., Xiong, W., Guo, W., Hu, W., Wang, B.: Multi-instance multi-label learning combining hierarchical context and its application to image annotation. IEEE Trans. Multimed. 18(8), 1616–1627 (2016)
Cheplygina, V., de Bruijne, M., Pluim, J.P.: Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis. Med. Image Anal. 54, 280–296 (2019)
Shakya, S.: Machine learning based nonlinearity determination for optical fiber communication-review. J. Ubiquit. Comput. Commun. Technol. (UCCT) 1(02), 121–127 (2019)
Laib, L., Allili, M.S., Ait-Aoudia, S.: A probabilistic topic model for event-based image classification and multi-label annotation. Sig. Process. Image Commun. 76, 283–294 (2019)
García-Domínguez, M., Domínguez, C., Heras, J., Mata, E., Pascual, V.: FrImCla: a framework for image classification using traditional and transfer learning techniques. IEEE Access 8, 53443–53455 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Silva, J., Varela, N., Mendoza-Palechor, F.E., Lezama, O.B.P. (2021). Deep Learning of Robust Representations for Multi-instance and Multi-label Image Classification. In: Chen, J.IZ., Tavares, J.M.R.S., Shakya, S., Iliyasu, A.M. (eds) Image Processing and Capsule Networks. ICIPCN 2020. Advances in Intelligent Systems and Computing, vol 1200. Springer, Cham. https://doi.org/10.1007/978-3-030-51859-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-030-51859-2_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-51858-5
Online ISBN: 978-3-030-51859-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)