
1 Introduction

Fast pedestrian detection in aerial images has been a challenge due to the dynamic nature of the images and hardware constraints. The Integral Image for fast feature calculation [2], Histograms of Oriented Gradients (HOG) [3] and classification by a Support Vector Machine (SVM) based tree-type neural network [4] are some of the initial works. Improved methods for feature extraction include Integral Channel Features [5], Boosted HOG [6], Channel Feature Extrapolation [7] and Search Region Proposal based on a Saliency Map [8]. Pedestrian detection in infrared images has been shown by Zhang et al. [9]. Techniques exploiting input image properties include Image Orientation Adjustment by Xu et al. [10] and Locality-constrained Linear Coding based detection by Yang et al. [11]. Techniques that enhance detection speed by breaking the task into multiple stages include Local Binary Patterns with a HOG-SVM classifier [12], simplified HOG [13], Center-Symmetric Local Binary Patterns (XCS-LBP) [14], Bin-Interleaved HOG [15] and a two-stage linear with non-linear SVM [16], but they need improvement for real-time application. Hardware designs suitable for fast HOG-SVM classification have been presented in [17, 18]. This work proposes a two-stage (sparse-dense) sliding window technique for the pedestrian detection task, which improves over the standard single-stage sliding window technique used with HOG+SVM based classifiers [3, 5]. Researchers have also proposed features other than HOG for better detection, but the present work shows how the detection process itself can be sped up using the existing HOG-SVM classification method to achieve real-time or near real-time performance. Section 2 discusses standard sliding window based pedestrian detection. Section 3 presents the proposed two-stage (sparse-dense) sliding window detection technique. Section 4 discusses the experimental setup, results and analysis. Section 5 concludes the work.

2 HOG-SVM Classification Based Pedestrian Detection

Histogram of Oriented Gradients

Histogram of Oriented Gradients (HOG) was proposed by Dalal and Triggs [3] to extract visual information from an image patch using pixel gradients. The technique has been used widely for classification/detection [5,6,7,8,9]; one can refer to [3] for the HOG feature vector calculation. The parameters used to calculate the HOG feature descriptor for an image patch of dimension \(\left[ {M,N} \right]\) are shown in Table 1, and the resulting descriptor length \(F_{l}\) [3], given by (1), is 3780.

Table 1. HOG feature descriptor parameters
$$ F_{l} = \left( {\frac{M}{{C_{h} }} - 1} \right) \cdot \left( {\frac{N}{{C_{v} }} - 1} \right) \cdot b \cdot B_{h} \cdot B_{v} $$
(1)

HOG feature plots with their corresponding RGB images are shown in Fig. 1.
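As a concrete illustration, a minimal sketch of HOG extraction for a \(\left[ {64,128} \right]\) patch using the scikit-image library is given below; the parameter values (9 orientation bins, \(8 \times 8\) cells, \(2 \times 2\) blocks) are the standard ones from [3] and are assumed to match Table 1.

```python
# A minimal sketch of HOG feature extraction with scikit-image,
# assuming the standard parameters from Dalal and Triggs [3]:
# 9 orientation bins, 8x8-pixel cells and 2x2-cell blocks.
import numpy as np
from skimage.feature import hog

patch = np.random.rand(128, 64)  # stand-in for a grayscale [64, 128] window
features = hog(patch,
               orientations=9,          # b
               pixels_per_cell=(8, 8),  # [C_h, C_v]
               cells_per_block=(2, 2))  # [B_h, B_v]
print(features.shape)  # (3780,), as predicted by Eq. (1)
```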

SVM Classification

The Support Vector Machine (SVM) [19, 20] is a supervised learning based classification algorithm that constructs a separating hyper-plane in the feature space to divide samples of \(m\) classes. The input feature vector (\(p\)-dimensional) and the output label for a sample \(i\) are denoted by \(h_{i} \in R^{p}\) and \(y_{i} \in \left\{ { - 1,1} \right\}\) respectively, where \(i = 1,2,3,...,n\); the Person and Background classes are represented by the labels \(1\) and \(- 1\) respectively.

$$ f\left( {h_{i} } \right) = w^{T} \phi \left( {h_{i} } \right) + b $$
(2)

Here, \(f\left( {h_{i} } \right)\) is the distance of a sample from the decision boundary and its sign indicates the predicted class, \(w\) is the weight vector, \(\phi \left( {h_{i} } \right)\) is a (kernel) function of \(h_{i}\) and \(b\) is the offset of the decision boundary.
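In scikit-learn, which is used later for training (Sect. 4), the signed distance \(f(h_{i})\) of (2) is exposed by `decision_function`; a minimal sketch with stand-in data follows (the random features and alternating labels are illustrative only).

```python
# A minimal sketch of Eq. (2) with scikit-learn: decision_function
# returns the signed distance f(h_i); its sign gives the class.
import numpy as np
from sklearn.svm import LinearSVC

rng = np.random.default_rng(0)
X = rng.normal(size=(40, 3780))        # stand-in HOG feature vectors
y = np.array([1, -1] * 20)             # 1 = Person, -1 = Background
clf = LinearSVC().fit(X, y)

scores = clf.decision_function(X[:3])  # f(h_i) for three samples
labels = np.sign(scores)               # predicted class labels
```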

Fig. 1. Example images from the UAV123 [1] dataset with their HOG features

Detection in an Image

While the HOG-SVM classifier classifies an image patch of dimension \(\left[ {W_{h} ,W_{v} } \right]\), pedestrian detection on a full image is done by extracting patches of dimension \(\left[ {W_{h} ,W_{v} } \right]\) in a sliding window manner from a multi-scale image pyramid. A five-level image pyramid formed by the original and downscaled versions of an image is shown in Fig. 2.

An important factor that determines detection speed is the number of classification steps for an image, denoted by \(HOG_{count}^{I}\), which can be obtained using (3), where \(W_{h}\) and \(W_{v}\) are the dimensions of the HOG window, \(DS_{h}\) and \(DS_{v}\) are the dense sliding strides of the HOG window \(\left[ {W_{h} ,W_{v} } \right]\) in the horizontal and vertical directions respectively, and \(\left[ {M,N} \right]\) are the input image width and height in pixels.

$$ HOG_{count}^{I} = \frac{{M - W_{h} }}{{DS_{h} }} \cdot \frac{{N - W_{v} }}{{DS_{v} }} $$
(3)
Fig. 2. Multi-resolution image pyramid formed from downscaled versions of the original image

The downscaled image shape is given by \(shape_{dn} = \left[ {\frac{M}{{\alpha^{l} }},\frac{N}{{\alpha^{l} }}} \right]\), where \(\alpha = 1.5\) is the downscaling factor and \(l \in \left\{ {0,1,...,\left( {L - 1} \right)} \right\}\) is the image pyramid level. \(L = 5\) is the number of levels in the image pyramid. The number of classification steps for an image pyramid (\(HOG_{count}^{P}\)) can be obtained using (4).

$$ HOG_{count}^{P} = \sum\nolimits_{l = 0}^{L - 1} {\left( {\frac{{M - W_{h} }}{{\alpha^{l} \cdot DS_{h} }} \cdot \frac{{N - W_{v} }}{{\alpha^{l} \cdot DS_{v} }}} \right)} $$
(4)

where \(HOG_{count}^{P}\) denotes the number of HOG-SVM classification steps for an image pyramid with \(L\) levels. The value of \(HOG_{count}^{P}\) for the parameters given in Table 2, using (4), is 3692. This work focuses on reducing the required number of classification steps for an image pyramid by introducing a two-stage (sparse-dense) sliding window technique.
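The following sketch evaluates (4) numerically; the parameter values (input \(\left[ {640,360} \right]\), window \(\left[ {64,128} \right]\), stride \(\left[ {8,8} \right]\), \(\alpha = 1.5\), \(L = 5\)) are assumed from the text and Table 2, and each level's term is floored to a whole window count.

```python
# A minimal sketch of Eq. (4), assuming the parameters used in the text:
# input [640, 360], HOG window [64, 128], dense stride [8, 8],
# alpha = 1.5 and L = 5 pyramid levels.
import math

def hog_count_pyramid(M=640, N=360, Wh=64, Wv=128,
                      DSh=8, DSv=8, alpha=1.5, L=5):
    total = 0
    for l in range(L):
        nh = (M - Wh) / (alpha**l * DSh)  # window positions, horizontal
        nv = (N - Wv) / (alpha**l * DSv)  # window positions, vertical
        total += math.floor(nh * nv)      # whole windows at this level
    return total

print(hog_count_pyramid())  # 3692, matching the value reported above
```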

Table 2. Input image shape, HOG window shape, sliding stride for standard detection technique [3] and image pyramid parameters

3 A Two-Stage Sliding Window

Conventional classifier based detection techniques follow a dense sampling approach to classify patches from the image pyramid into a number of classes [3, 5, 9]. The dense sampling window stride is denoted by \(\left[ {DS_{h} ,DS_{v} } \right]\) and is \(\left[ {8,8} \right]\); here, \(DS_{h}\) and \(DS_{v}\) are the strides in the horizontal and vertical directions respectively. The proposed two-stage sliding window technique divides the detection task into sparse and dense sampling stages. Its block diagram is shown in Fig. 3.

Fig. 3. Block diagram of the two-stage sliding window pedestrian detection

Stage 1: Sparse Sampling

In the first stage, image patches of size \(\left[ {W_{h} ,W_{v} } \right]\) are extracted from all levels of the image pyramid at larger window sliding strides, denoted by \(\left[ {SS_{h} ,SS_{v} } \right]\), compared to the \(\left[ {DS_{h} ,DS_{v} } \right]\) taken by most previous techniques [3, 5, 6, 10, 11]. Here, \(\left[ {SS_{h} ,SS_{v} } \right] \in \left\{ {\left[ {32,64} \right],\left[ {43,90} \right],\left[ {51,102} \right]} \right\}\). The HOG-SVM classifier output represents the distance from the SVM decision boundary and is taken as the confidence of classification. The distance threshold for this stage is denoted by \(Th_{sparse}\), and image patches exceeding \(Th_{sparse}\) are recorded as regions to propose to stage 2. The regions proposed in this stage are of dimension \(\left[ {W_{h} ,W_{v} } \right]\) in their corresponding downscaled image from the pyramid. The shape of a proposed region transformed back to level 0 (to represent the same region in the original image) is denoted by \(PRS_{0}\) and can be obtained using (5), where \(l \in \left\{ {0,1,...,\left( {L - 1} \right)} \right\}\).

$$ PRS_{0} = \left[ {W_{h} ,W_{v} } \right] \cdot \alpha^{l} $$
(5)
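A small sketch of this bookkeeping, with a hypothetical `Proposal` record (not from the paper), shows how a stage-1 hit at pyramid level \(l\) is mapped back to level-0 coordinates via (5).

```python
# A minimal sketch of Eq. (5): mapping a sparse-stage proposal from
# pyramid level l back to level-0 (original image) coordinates.
# The Proposal record here is hypothetical, for illustration only.
from dataclasses import dataclass

@dataclass
class Proposal:
    x: int       # top-left corner in the level-l image
    y: int
    level: int   # pyramid level l at which the window fired

def to_level0(p, Wh=64, Wv=128, alpha=1.5):
    s = alpha ** p.level
    # Both the corner and the window shape scale by alpha^l.
    return (round(p.x * s), round(p.y * s),  # level-0 corner
            round(Wh * s), round(Wv * s))    # PRS_0 = [Wh, Wv] * alpha^l

print(to_level0(Proposal(x=40, y=16, level=2)))  # (90, 36, 144, 288)
```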
Fig. 4. Sparse sampling for region proposal and dense sampling on the proposed regions

The window sliding stride for the sparse sampling stage is calculated from the percentage overlap between consecutive sampling windows. Overlap percentages of \(50\%\), \(30\%\) and \(20\%\) have been taken for experimentation. The window sliding stride values \(\left[ {SS_{h} ,SS_{v} } \right]\) in pixels can be obtained using (6) and are \(\left[ {32,64} \right]\), \(\left[ {43,90} \right]\) and \(\left[ {51,102} \right]\) respectively. The number of HOG-SVM classification steps in the sparse sampling stage is denoted by \(HOG_{count}^{S}\) and can be obtained using (7).

$$ \left[ {SS_{h} ,SS_{v} } \right] = \left( {1 - overlap\% } \right) \cdot \left[ {W_{h} ,W_{v} } \right] $$
(6)
$$ HOG_{count}^{S} = \sum\nolimits_{l = 0}^{L - 1} {\left( {\frac{{M - W_{h} }}{{\alpha^{l} \cdot SS_{h} }} \cdot \frac{{N - W_{v} }}{{\alpha^{l} \cdot SS_{v} }}} \right)} $$
(7)

Stage 2: Dense Sampling

In this stage, the image regions proposed by the first stage are searched for objects by the HOG-SVM classifier with the dense sampling window strides \(\left[ {DS_{h} ,DS_{v} } \right]\). Image patches crossing a threshold \(Th_{dense}\) are the final detections. An example is shown in Fig. 4. Only the regions proposed by stage 1 are processed in this stage, not the whole image pyramid, which saves significant processing time. A flow-chart of the two-stage process is shown in Fig. 5.
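The overall control flow can be sketched as below; `score_window` is a hypothetical helper standing in for HOG extraction followed by the SVM decision function (2), the thresholds take the values reported in Sect. 4, and the extent of the dense search neighbourhood around each sparse hit is an assumption, not specified by the paper.

```python
# A minimal sketch of the two-stage (sparse-dense) sliding window,
# assuming a score_window(img, x, y) helper returning the HOG-SVM
# decision value f(h_i) for the [Wh, Wv] window at top-left (x, y).
def detect_two_stage(pyramid, score_window, Wh=64, Wv=128,
                     SSh=32, SSv=64, DSh=8, DSv=8,
                     th_sparse=0.1, th_dense=1.0, alpha=1.5):
    detections = []
    for l, img in enumerate(pyramid):
        H, W = img.shape[:2]
        # Stage 1: sparse sampling over the whole pyramid level.
        for y in range(0, H - Wv + 1, SSv):
            for x in range(0, W - Wh + 1, SSh):
                if score_window(img, x, y) <= th_sparse:
                    continue
                # Stage 2: dense sampling inside the proposed region
                # (neighbourhood extent around the hit assumed here).
                for yy in range(max(0, y - SSv), min(H - Wv, y + SSv) + 1, DSv):
                    for xx in range(max(0, x - SSh), min(W - Wh, x + SSh) + 1, DSh):
                        s = score_window(img, xx, yy)
                        if s > th_dense:
                            f = alpha ** l  # back to level 0, Eq. (5)
                            detections.append((round(xx * f), round(yy * f),
                                               round(Wh * f), round(Wv * f), s))
    return detections
```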

Fig. 5. Two-stage sliding window pedestrian detector process flow

The number of classification steps for the dense sampling stage is denoted by \(HOG_{count}^{I}\) (3) and depends upon the region proposals; here, \(I\) in \(HOG_{count}^{I}\) represents an image region proposed by the sparse sampling stage. \(\left( {HOG_{count}^{S} + HOG_{count}^{I} } \right)\) is the combined number of classification steps and has been determined as the average value over 1000 pedestrian images from the UAV123 [1] dataset.

Suppressing Duplicate Detections

The dense sampling stage yields multiple detections around an object, as the classifier output crosses \(Th_{dense}\) at several nearby windows. These duplicate detections are suppressed by computing the Intersection-over-Union (IoU) among the detection boxes using (8), where \(R_{{B_{1} }}\) and \(R_{{B_{2} }}\) denote the areas of boxes \(B_{1}\) and \(B_{2}\) respectively. If \(IoU\left( {B_{1} ,B_{2} } \right)\) crosses a threshold \(IoU_{th}\), then the box with the lower confidence (\(f\left( {h_{i} } \right)\), (2)) is discarded. An example can be seen in Fig. 6.

$$ IoU\left( {B_{1} ,B_{2} } \right) = \frac{{R_{{B_{1} }} \cap R_{{B_{2} }} }}{{R_{{B_{1} }} \cup R_{{B_{2} }} }} $$
(8)
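A compact sketch of (8) and the suppression rule follows; boxes are assumed to be \((x, y, w, h, score)\) tuples, as produced by the stage-2 sketch above.

```python
# A minimal sketch of Eq. (8) and duplicate suppression, assuming
# boxes given as (x, y, w, h, score) tuples in level-0 coordinates.
def iou(b1, b2):
    x1, y1 = max(b1[0], b2[0]), max(b1[1], b2[1])
    x2 = min(b1[0] + b1[2], b2[0] + b2[2])
    y2 = min(b1[1] + b1[3], b2[1] + b2[3])
    inter = max(0, x2 - x1) * max(0, y2 - y1)      # R_B1 ∩ R_B2
    union = b1[2] * b1[3] + b2[2] * b2[3] - inter  # R_B1 ∪ R_B2
    return inter / union if union else 0.0

def suppress(boxes, iou_th=0.5):
    keep = []
    for b in sorted(boxes, key=lambda b: b[4], reverse=True):
        # Keep a box only if it does not overlap a more confident one.
        if all(iou(b, k) < iou_th for k in keep):
            keep.append(b)
    return keep
```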
Fig. 6. (a) Sparse sampling stage output, (b) dense sampling stage output, (c) \(IoU\)-thresholded output

4 Experimental Results and Analysis

Dataset

The dataset used for training and testing of the HOG-SVM classifier is UAV123 [1]. It has 11575 images in its person class, out of which 2620 images with pedestrians were randomly selected and a window of size \(\left[ {64,128} \right]\) was cropped from each to form the person class; 2450 windows were cropped for the background class. Positive and negative class formation for training is shown in Fig. 7.

Fig. 7. Person and background dataset creation from UAV123 [1] for training and testing of the SVM classifier

SVM Training

The dataset size is 5070 images (2620 person class and 2450 background class) with a train-test split ratio of 80:20. A 5-fold cross validation scheme has been adopted to split the dataset into 5 mutually exclusive parts: training is done on 4 parts combined and testing on the remaining part, and 5 trials of training/testing have been performed. The scikit-learn [21] Python library has been used to train a Support Vector Classifier (SVC) for binary classification of the Person and Background classes. The classifier parameters are shown in Table 3.
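A minimal sketch of this training loop is given below; since Table 3 is not reproduced here, the SVC hyper-parameters are placeholders, and the feature/label files are hypothetical stand-ins for the 3780-dimensional HOG vectors and their \(\pm 1\) labels.

```python
# A minimal sketch of 5-fold cross-validated SVC training with
# scikit-learn [21]; hyper-parameters are placeholders, not the
# actual values from Table 3.
import numpy as np
from sklearn.svm import SVC
from sklearn.model_selection import StratifiedKFold
from sklearn.metrics import precision_recall_fscore_support, accuracy_score

X = np.load("hog_features.npy")  # hypothetical file of 5070 HOG vectors
y = np.load("labels.npy")        # +1 = Person, -1 = Background

folds = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
for fold, (tr, te) in enumerate(folds.split(X, y)):
    clf = SVC(kernel="linear")   # placeholder for Table 3 parameters
    clf.fit(X[tr], y[tr])
    pred = clf.predict(X[te])
    p, r, f1, _ = precision_recall_fscore_support(y[te], pred,
                                                  average="binary")
    print(fold, p, r, f1, accuracy_score(y[te], pred))
```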

Table 3. SVM parameters

Classification Performance

Precision, Recall, F1-score and Accuracy have been taken as the classification performance metrics [22]. The metrics for 5 trials on test data, with mean values, are presented in Table 4.

Table 4. Classification performance (5-fold cross validation, SD - Standard Deviation)

Detection Performance

UAV123 [1] provides ground-truth bounding boxes for evaluation of the detection results. The prediction bounding box dimension of the HOG-SVM classifier is fixed to \(\left[ {64,128} \right]\), as the classifier is not designed for the bounding box regression task. Centre Prediction Error (CPE) [23] has been taken as the performance metric; it measures the difference between the predicted object centre and the ground-truth box centre. \(CE_{avg}^{{S_{i} }}\) denotes the average CPE for an image sequence \(S_{i}\) and can be obtained using (9), where \(N_{{S_{i} }}\) denotes the number of frames in sequence \(S_{i}\), and \(\left( {x^{G} ,y^{G} } \right)\) and \(\left( {x^{P} ,y^{P} } \right)\) represent the ground-truth box centre and the predicted box centre respectively. The chosen dataset has 23 person-class image sequences, named person1 to person23, captured from a UAV platform. The average of \(CE_{avg}^{{S_{i} }}\) over the 23 sequences is given by (10), where \(i = \left\{ {1,2,3,...,K} \right\}\) and \(K = 23\).

$$ CE_{avg}^{{S_{i} }} = \frac{1}{{N_{{S_{i} }} }}\sum\nolimits_{t = 1}^{{N_{{S_{i} }} }} {\sqrt {\left( {x_{t}^{G} - x_{t}^{P} } \right)^{2} + \left( {y_{t}^{G} - y_{t}^{P} } \right)^{2} } } $$
(9)
$$ CE_{avg} = \frac{1}{K}\sum\nolimits_{i = 1}^{K} {CE_{avg}^{{S_{i} }} } $$
(10)
$$ CE_{avg} \left( \% \right) = \frac{{CE_{avg} \times 100}}{{\sqrt {M^{2} + N^{2} } }} $$
(11)

\(CE_{avg}\) as a percentage of the input image diagonal length is given by (11), to present a metric invariant to the input image dimension. \(CE_{avg} \left( \% \right)\) for the person sequences in UAV123 [1] has been found by running the detector on all 23 sequences and comes out to be 1.47%. The centre coordinate plots in the X and Y directions for person1 are shown in Fig. 8; it can clearly be seen from these graphs that the detector follows the ground-truth coordinates almost all the time. An example image with the ground-truth box, the detection box and a line joining their centres is shown in Fig. 9.
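Equations (9)-(11) can be sketched as follows; `gt` and `pred` are assumed to be per-sequence arrays of box-centre coordinates, and the image size defaults are assumptions matching the input dimension used later.

```python
# A minimal sketch of Eqs. (9)-(11), assuming gt[i] and pred[i] are
# (T_i, 2) arrays of (x, y) box centres for sequence i, and an input
# image of size [M, N].
import numpy as np

def ce_avg_percent(gt, pred, M=640, N=360):
    per_seq = [np.mean(np.linalg.norm(np.asarray(g) - np.asarray(p), axis=1))
               for g, p in zip(gt, pred)]  # Eq. (9), one value per sequence
    ce = np.mean(per_seq)                  # Eq. (10), mean over K sequences
    return 100.0 * ce / np.hypot(M, N)     # Eq. (11), % of image diagonal
```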

Fig. 8. X and Y coordinate detections vs. ground-truth

Fig. 9. Ground-truth box (pink), detection box (white) and line between centres (green)

Proposed Technique Evaluation

In the proposed technique, \(Th_{sparse} = 0.1\) has been taken for the sparse sampling stage, \(Th_{dense} = 1.0\) for the dense sampling stage and \(IoU_{th} = 0.5\) for the IoU thresholding step. The improvement in processing time is shown in Table 5 for different percentages of overlap between consecutive sampling windows in the sparse sampling stage. Testing has been done on a 3.2 GHz CPU machine. A significant reduction in processing time can be seen, as the number of HOG-SVM classification steps has decreased from the original 3692 to 469, 344 and 294 (Table 5).

Table 5. Percentage overlap vs. processing time for the two-stage detector (input size: \(\left[ {640,360} \right]\))

As shown in Fig. 10, the technique gives different region proposals for different sliding strides, and detections are concentrated around the object after the dense sampling stage. It can be observed that the single-stage dense sampling method (Fig. 10-b) yields the maximum number of detections, but most of them are redundant and are removed by IoU thresholding. Sparse sampling with \(\left[ {32,64} \right]\) (Fig. 10-c) and \(\left[ {43,90} \right]\) (Fig. 10-d) strides provides a significant number of region proposals around the object and multiple detections after the dense sampling stage (Fig. 10-f and 10-g). IoU thresholding is then applied to remove the duplicate detections.

The comparison with the single-stage dense sampling technique is shown in Table 5. For an input image of size \(\left[ {640,360} \right]\), the speed improves from 1.95 to 15.36 fps with \(\left[ {SS_{h} ,SS_{v} } \right] = \left[ {32,64} \right]\) (a factor of 7.88) and from 1.95 to 24.48 fps with \(\left[ {SS_{h} ,SS_{v} } \right] = \left[ {51,102} \right]\) (a factor of 10.50).

Fig. 10. (a) Original image, (b) standard dense sampling detection output on the image pyramid, (c) sparse stage output - [32, 64], (d) sparse stage output - [43, 90], (e) sparse stage output - [51, 102], (f) dense sampling output for (c), (g) dense sampling output for (d), (h) dense sampling output for (e)

Detection speed on full images has been compared with existing techniques; the results are presented in Tables 6, 7, 8, 9 and 10 for different input image dimensions.

Table 6. Comparison with existing work (input image size: \(\left[ {320,240} \right]\))
Table 7. Comparison with existing work (input image size: \(\left[ {640,480} \right]\))
Table 8. Comparison with existing work (input image size: \(\left[ {720,400} \right]\))
Table 9. Comparison with existing work (input image size: \(\left[ {1200,400} \right]\))
Table 10. Comparison with existing work (input image size: \(\left[ {640,320} \right]\))

It is evident from the comparison tables that the proposed technique performs better than similar techniques. Moreover, the detection quality of the technique has been quantified in terms of the average Centre Prediction Error (CPE), a standard metric that judges the distance of the predicted bounding box from the ground-truth. The average CPE \(CE_{avg} \left( \% \right)\) for the UAV123 dataset (person class) is only \(1.47\%\) of the image diagonal length. It should also be noted that classification based techniques combined with window sliding always introduce a quantization error in detection (evident in Fig. 9), equal to \(0.5\) times the HOG window sliding stride \(\left[ {DS_{h} ,DS_{v} } \right]\); a trade-off therefore has to be maintained between detection speed and quality when choosing a particular sliding stride. This work has used \(\left[ {8,8} \right]\) as the sliding stride in the dense sampling stage, which is the standard used by other researchers. The presented average CPE value inherently includes this quantization error along with the actual detection error.

5 Conclusion

This work introduced a two-stage (sparse-dense) sliding window sampling technique for a fast pedestrian detector. The first stage is a sparse sampling stage that extracts relevant regions; in the second stage, the proposed regions are processed by the classifier with smaller strides and a larger classification threshold. Thus, a modified version of the simple HOG-SVM detector has been presented, in which visual information in the image is exploited for region proposal and most of the processing time is spent on the proposed regions. The proposed technique can be utilized to run real-time detection on a low-cost processor on a UAV platform, eliminating the dependency on an external system and eventually opening up the scope for more applications of UAV systems.