Abstract
Ship detection in optical remote sensing images is a vital yet challenging task. Now, more attention has been focused on increasing detection accuracy, while the detection speed is ignored. However, detection speed is as important as detection precision for ship detection. In this paper, we propose a new model, named ImYOLOv5X, which is based on YOLOv5X combined with a Squeeze-and-Excitation Module for fast and accurate rotated ship detection. Firstly, we incorporate a Squeeze-and-Excitation (SE) module into backbone of YOLOv5X, which enables the model to focus on detection objects, thus improving detection accuracy. Then we design an easy-to-insert module, containing a Convolution Set and Squeeze-and-Excitation Module (CS-SE), which can extract features and weigh the channels of features for prediction. Finally, we introduce the Gaussian Wasserstein Distance (GWD) loss as the regression loss of the model. The GWD loss resolves the boundary discontinuity and inconsistency in training and final detection metric. Extensive experiments on the HRSC2016 dataset show that our model can achieve highest detection accuracy and still maintain fastest detection speed compared with some other models, which proves the effectiveness of our model.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Yao, Y., Jiang, Z., Zhang, H., et al.: Ship detection in optical remote sensing images based on deep convolutional neural networks. J. Appl. Remote Sens. 11(4), 042611 (2017)
Heiselberg, H.: A direct and fast methodology for ship recognition in sentinel-2 multispectral imagery. Remote Sensing 8(12), 1033 (2016)
Dong, C., Liu, J., Xu, F.: Ship detection in optical remote sensing images based on saliency and a rotation-invariant descriptor. Remote Sensing 10(3), 400 (2018)
Jiang, Y., Zhu, X., et al.: R2CNN: Rotational region CNN for orientation robust scene text detection. arXiv preprint arXiv:1706.09579 (2017)
Ma, J., Shao, W., et al.: Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans. Multimedia 20(11), 3111–3122 (2018)
Yang, X., Yan, J., et al.: R3det: Refined single-stage detector with feature refinement for rotating object. In: Proceedings of the AAAI Conference on Artificial Intelligence 35, pp. 3163–3171 (2021)
Yang, X., Yan, et al.: Rethinking rotated object detection with Gaussian wasserstein distance loss. In: Proceedings of the 38th International Conference on Machine Learning, PMLR 139, pp. 11830–11841 (2021)
Ming, Q., Miao, L., et al.: Optimization for arbitrary-oriented object detection via representation invariance loss. IEEE Geosci. Remote Sens. Lett. 19, 1–5 (2021)
Qian, W., Yang, X., et al.: Learning modulated loss for rotated object detection. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 35, no. 3, pp. 2458–2466 (2021)
Hu, J., Shen, L., et al.: Squeeze-and-excitation networks. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Liu, Z., Yuan, L., et al.: A high resolution optical satellite image dataset for ship recognition and some new baselines. In: International Conference on Pattern Recognition Applications and Methods, SciTePress, vol. 2, pp. 324–331 (2017)
Acknowledgements
This work was partially supported by the NSFC under Grant (No.61972315), Shaanxi Province International Science and Technology Cooperation Program Project-Key Projects (No.2022KWZ-14), the National Key Laboratory of Science and Technology on Space Micrwave (No. 6142411412117).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, X., Gao, S., Zhou, J., Xiao, Y. (2023). Rotated Ship Detection with Improved YOLOv5X. In: Xiong, N., Li, M., Li, K., Xiao, Z., Liao, L., Wang, L. (eds) Advances in Natural Computation, Fuzzy Systems and Knowledge Discovery. ICNC-FSKD 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 153. Springer, Cham. https://doi.org/10.1007/978-3-031-20738-9_21
Download citation
DOI: https://doi.org/10.1007/978-3-031-20738-9_21
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20737-2
Online ISBN: 978-3-031-20738-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)