Abstract
Understanding intellectual products such as comics and picture books is one of the important topics in the field of artificial intelligence. Hence, stepwise analysis of a comic story, i.e., features of a part of the image, information features, features relating to continuous scene etc., by human and by a combination of several classifiers was pursued. As the first step in this direction, several classifiers for comics are constructed in this study by utilizing a convolutional neural network, and the results of classification by a human annotator and by a computational method are compared.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Le, Q.V.: Building high-level features using large scale unsupervised learning. In: Acoustics, Speech and Signal Processing (ICASSP), pp. 8595–8598 (2013)
Ueno, M., Mori, N., Matsumoto, K.: 2-Scene comic creating system based on the distribution of picture state transition. In: Advances in Intelligent Systems and Computing, vol. 290, pp. 459–467 (2014)
Eitz, M., Hays, J., Alexa, M.: How Do Humans Sketch Objects? ACM Trans. Graph. (Proc. SIGGRAPH) 31(4), 44:1–44:10 (2012)
Fukushima, K., Miyake, S.: Neocognitron: A new algorithm for pattern recognition tolerant of deformations and shifts in position. Pattern Recognition 15(6), 455–469 (1982)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Tanaka, T., Toyama, F., Miyamichi, J., Shoji, K.: Detection and Classification of Speech Balloons in Comic Images. The journal of the Institute of Image Information and Television Engineers 64(12), 1933–1939 (2010)
Fujino, H.: Konpeito ! 1 (Confetti ! 1)). Houbunsha (2007)
Fujino, H.: Ringo no ki no shitakko de (Under the apple tree). Houbunsha (2005)
Chainer. http://chainer.org/
Hinton, G.E., et al.: Improving neural networks by preventing co-adaptation of feature detectors. arXiv preprint arXiv:1207.0580 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Ueno, M. (2016). Computational Interpretation of Comic Scenes. In: Omatu, S., et al. Distributed Computing and Artificial Intelligence, 13th International Conference. Advances in Intelligent Systems and Computing, vol 474. Springer, Cham. https://doi.org/10.1007/978-3-319-40162-1_42
Download citation
DOI: https://doi.org/10.1007/978-3-319-40162-1_42
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40161-4
Online ISBN: 978-3-319-40162-1
eBook Packages: EngineeringEngineering (R0)