Abstract
The mean-shift (MS) tracking is fast, is easy to implement, and performs well in many conditions especially for object with rotation and deformation. But the existing MS-like algorithms always have inferior performance for two reasons: the loss of pixel’s neighborhood information and lack of template update and scale estimation. We present a new adaptive scale MS algorithm with gradient histogram to settle those problems. The gradient histogram is constructed by gradient features concatenated with color features which are quantized into the 16 × 16 × 16 × 16 bins. To deal with scale change, a scale robust algorithm is adopted which is called background ratio weighting (BRW) algorithm. In order to cope with appearance variation, when the Bhattacharyya coefficient is greater than a threshold the object template is updated and the threshold is set to avoid incorrect updates. The proposed tracker is compared with lots of tracking algorithms, and the experimental results show its effectiveness in both distance precision and overlap precision.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
1 Introductions
Visual object tracking has always been a challenging work especially in the sequences with the deformation and rotation. The MS tracking has an outstanding performance and is easy to implement. It tracks by minimizing the Bhattacharyya distance between two probability density functions represented by a target and target candidate histograms. The histogram is a statistical feature that does not depend on the spatial structure within the search window. This makes it more robust than other algorithms. But it lacks the essential template update and pixel’s neighborhood information leading to a worse accuracy.
The mean-shift algorithm is a nonparametric mode-seeking method for density functions proposed by Fukunaga and Hostetler [1]. Comaniciu et al. [2, 3] use it to track object. And Comaniciu et al. change the window size over multiple runs by a constant factor but produces little effect because the smaller windows usually have higher similarity. The image pyramids and an additional mean-shift algorithm for scale selection had been used after estimating the position to confirm the window size in Collins [4]. But its speed is lower than the conventional MS algorithm. A new histogram that exploits the object neighborhood has been proposed to help discriminate the target which is called background ratio weighting (BRW) in Vojir et al. [5]. This approach is faster than others and has a superior effect in sequences with scale change but performs poorly for grayscale sequences.
Gradient information is crucial for appearance representation since it contains pixel’s neighborhood information and is insensitive to illumination variation but it always be ignored. Based on this observation, we present a novel adaptive scale MS algorithm with gradient histogram. The gradient information is calculated by Canny edge detector which was developed by Canny [6].
Moreover, the BRW algorithm has been used to improve the performance of videos with scale change. The template of target will be updated by liner interpolation only if conditions are met to avoid addition of incorrect information. The template update also can cope with appearance variation of target. The proposed tracker is compared with lots of algorithms, and the experiment results show that it is more robust and accurate.
2 Canny Edge Detector
The Canny edge detector is an edge detection operator that uses a multi-stage algorithm to detect a wide range of edges in images. To remove the noise, a Gaussian filter is applied to the image; the Gaussian filter kernel of size (2k + 1) × (2k + 1) is given by:
Let Io denote the original image and H is a classic Gaussian filter matrix; we get the image without noise I = H * Io. Then we extract intensity gradient of the image with the Sobel operator proposed by Sobel [7]. The gradient information in horizontal and vertical directions is \(G_{x}\) and \(G_{y}\). From this, the edge gradient can be determined by
where
The edge extracted from the gradient value is still quite blurred after processing the \(G\) with Gauss filter and Sobel operator. The non-maximum suppression and double-threshold joint should be applied to the processed image to improve effect of gradient. Then we can get the gradient image \(I_{g}\) to track the target.
3 The Tracking Algorithm
Different from the conventional MS tracking algorithm, we append the Ig to original image to get \(I_{e}\). After the combination of the images, we extract the histogram \({\hat{\mathbf{q}}}\) from \(I_{e}\). To cope with the problem which caused by the size of target changes, we use the BRW-MS instead of conventional MS. We can get \({\hat{\mathbf{q}}}\) from:
and an ellipsoidal region is used to represent target \(\frac{{\left( {x_{i}^{*1} } \right)^{2} }}{{a^{2} }} + \frac{{\left( {x_{i}^{*2} } \right)^{2} }}{{b^{2} }} < 1\) in current frame. The target candidate is given by
where the h is the scale factor. The location of the target is obtained by
where A is
B is
And \(g\left( x \right) = - k^{{\prime }} \left( x \right)\) is the derivative of \(k\left( x \right)\). \(w_{i}\) can be obtained by
where W is
The \(\widehat{{\varvec{bg}}}\) is the histogram of background computed over the neighborhood of the target in the first frame. Let us denote
and
When the location was determined, the \(\hat{q}\) is updated by
4 Experiment
Experiments are conducted on sequences from Object Tracking Benchmark2013 (OTB2013) dataset [8]. The sequences in OTB2013 not only suffer the deformation but also have other change such as fast motion, background clutter, motion blur, and so on. So, we selected six sequences from OTB2013 dataset to show the results. We compared the proposed algorithm with conventional and state-of-the-art algorithms which are available as source code. They are conventional mean-shit algorithm [2], ASMS [5], OAB [9], LOT [10], and CSK [11]. The parameters for those algorithms are set default.
Figure 1 shows the result of six trackers in sequences. The score of distance precision (DP) rate, overlap success (OS) rate, and center location error (CLE) can be obtained from Table 1.
In general, the gradient histogram improves performance of MS algorithm for the gradient histogram. The data in Table 1 show that proposed algorithm has a higher score than others in DP, OS, and CLE which means our tracker is better than others. The sequence dog1 suffers the scale change, and results show standard MS failed to track target because it only uses the gray levels to calculate histogram and it lacks template update. But our tracker can deal well with this condition for the adopting of gradient histogram and RBW algorithm. The proposed tracker has a better performance than other trackers in sequence singer1 which suffers illumination variation since the addition of gradient histogram which does not dependent on the current pixel value but the difference of adjacent pixel. What is more we find that our tracker has a great improvement in gray image compared to the colorful image than the ASMS algorithm since the ASMS algorithm only can acquire information in the gray channel which is scanty.
5 Conclusion
In this paper, an adaptive scale mean-shift algorithm with gradient histogram has been proposed to improve the tracking performance of MS-like algorithms. The gradient histogram is constructed by color histogram and gradient feature calculated by Canny edge detector. To deal with the scale change, the RBW algorithm is adopted. Template update is used to cope with appearance variation when the Bhattacharyya coefficient between the current frame and the template is greater than the threshold. The setting of the threshold makes tracker more robust for incorrect information. The proposed tracker is compared with a lot of algorithms in OTB2013 dataset. The experiment results show the tracker’s effectiveness in deformation, rotation, scale change, and illumination variation. Moreover, our tracker has a better preference in gray sequences than conventional MS-like algorithms.
References
Fukunaga K, Hostetler L. The estimation of the gradient of a density function, with applications in pattern recognition. IEEE Trans Inf Theor. 1975;21(1):32–40.
Comaniciu D, Ramesh V, Meer P. Real-time tracking of non-rigid objects using mean shift. In: Proceedings IEEE conference on computer vision and pattern recognition, 2000. IEEE; 2000, vol. 2. p. 142–9.
Comaniciu D, Ramesh V, Meer P. Kernel-based object tracking. IEEE Trans Pattern Anal Mach Intell. 2003;25(5):564–77.
Collins RT. Mean-shift blob tracking through scale space. In: Proceedings IEEE computer society conference on computer vision and pattern recognition, 2003. IEEE; 2003, vol. 2. p. II-234.
Vojir T, Noskova J, Matas J. Robust scale-adaptive mean-shift for tracking. Pattern Recogn Lett. 2014;49:250–8.
Canny J. A computational approach to edge detection. In: Readings in computer vision; 1987. p. 184–203.
Sobel I. An isotropic 3 × 3 image gradient operator. In: Machine vision for three-dimensional scenes; 1990. p. 376–9.
Wu Y, Lim J, Yang MH. Online object tracking: A benchmark. In: 2013 IEEE conference on computer vision and pattern recognition (CVPR). IEEE; 2013. p. 2411–8.
Grabner H, Grabner M, Bischof H. Real-time tracking via on-line boosting. Bmvc; 2006, vol. 1, no. 5. p. 6.
Oron S, Bar-Hillel A, Levi D, et al. Locally orderless tracking. Int J Comput Vision. 2015;111(2):213–28.
Henriques JF, Caseiro R, Martins P, et al. Exploiting the circulant structure of tracking-by-detection with kernels. In: European conference on computer vision. Springer, Berlin, Heidelberg; 2012. p. 702–15.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant No. 61501139) and the Natural Scientific Research Innovation Foundation in Harbin Institute of Technology (HIT.NSRIF.2013136).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xie, C., Kang, W., Liu, G. (2020). Adaptive Scale Mean-Shift Tracking with Gradient Histogram. In: Liang, Q., Liu, X., Na, Z., Wang, W., Mu, J., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2018. Lecture Notes in Electrical Engineering, vol 516. Springer, Singapore. https://doi.org/10.1007/978-981-13-6504-1_104
Download citation
DOI: https://doi.org/10.1007/978-981-13-6504-1_104
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-6503-4
Online ISBN: 978-981-13-6504-1
eBook Packages: EngineeringEngineering (R0)