A New Fast Encoding Algorithm Based on Motion Activity for High Efficiency Video Coding (HEVC)

Lee, Jong-Hyeok; Goswami, Kalyan; Kim, Byung-Gyu

doi:10.1007/978-94-017-8798-7_33

Jong-Hyeok Lee⁵,
Kalyan Goswami⁵ &
Byung-Gyu Kim⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 301))

2156 Accesses

Abstract

High efficiency video coding (HEVC) has three units such as coding unit (CU), prediction unit (PU), and transform unit (TU). It has too many complexities due to improve coding performance. We propose a fast algorithm which can be possible to apply for both CU and PU parts. To reduce the computational complexity, we propose based on rate distortion cost of CU about the parent and current levels to terminate the CU decision early. In terms of PU, we develop fast PU decision based on spatio-temporal and depth correlation for PU level. The Experimental results verify that the proposed algorithm provides up to 51.70 % of time reduction for encoding with a small loss in video quality, compared to HEVC Test Model (HM) version 10.0 software.

Access provided by Autonomous University of Puebla. Download conference paper PDF

Fast encoding algorithm for high-efficiency video coding (HEVC) system based on spatio-temporal correlation

Article 13 January 2015

An Improved Hierarchical Motion Estimation Algorithm

Fast Coding Algorithm for High Efficient Video Coding (HEVC)

Keywords

33.1 Introduction

A New generation video coding standard, called a high efficiency video coding (HEVC) [1], has been developed by the Joint Collaborative Team on Video Coding (JCT-VC) group. The JCT-VC is a group of video coding experts created by ITU-T Study Group 16 (VCEG) and ISO/IEC JTC 1/SC 29/WG 11 (MPEG) in 2010.

Although coding efficiency improvement is solved it has more computational complexity than the H.264/AVC [2] because of tools having high complexity and high resolution of sequences. Therefore, reduction of encoding time with convincing loss is an interesting issue.

The video encoding process in the HEVC has three unit for block structure: (a) a coding unit (CU) is basic block unit like macroblock in the H.264/AVC, (b) a prediction unit (PU) for performing motion estimation, rate-distortion optimization and mode decision, (c) a transform unit (TU) for transform and entropy coding. Encoding process encode from Coding Tree Block (CTB) having largest block size to its 4 child CUs, recursively.

In [3], a tree pruning algorithm is proposed that makes an early termination CU. Shen et al. [4] proposed an early CU size determination algorithm. In [5], a motion vectors merging (MVM) method is proposed that reduce the inter-prediction complexity of the HEVC. The MVM algorithm decide the PU partition size by using motion vectors.

This paper is organized as follows: In Sect. 33.2, the early CU splitting termination (CST) and fast PU decision method is described for proposed algorithm. Section 33.3 presents the coding performance of suggested algorithm combined by two methods. In Sect. 33.4, Concluding comments are given.

33.2 Proposed Work

33.2.1 Fast Coding Unit Splitting Termination (CST) Method

The main objective is to select the lowest RD cost before all combinations are finished. We have designed the ratio function of CU_d(i) at depth ‘d’ can be define as:

$$ r_{i} = \frac{{RD_{cost} (CU_{d} (i))}}{{RD_{cost} (CU_{d - 1} )}}\quad where\;\; \mathop \sum \limits_{i = 0}^{3} r_{i} \le 1 $$

(33.1)

When a CU is split, it will be divided up into 4 child CUs. Therefore, we can define a value of ratio function for each newly created CUs. This parameter r _i is a ratio of the RD costs of the current CU and its parent CU. When the ratio function of a child is lower than its siblings, it has low amount of chance to split in the next depth level. Accordingly, this parameter can be used as a threshold for making the decision to split a current CU to next depth level.

In previous algorithm [3], if a CU is coded as SKIP mode then no further splitting of a CU. Motivating from this fact, we have explored other prediction modes of the PU. INTRA and INTER predictions have different kinds of PU types. INTER modes are 2N × 2N, 2N × N, N × 2N, N × N (presently, this mode is not used), 2N × nU, 2N × nD, nL × 2N and nR × 2N. INTRA modes are 2N × 2N and N × N. SKIP mode is consisted of 2N × 2N size. A CU is divided into 4 CUs of lower dimensions after the completion of all mode calculations. PU modes are processed recursively from the CTB to all children CU.

Table 33.1 have classified all the prediction modes according to motion activities. This kind of motion activity was explored in good approach in the H.264/AVC codec [6, 7].

Table 33.1 Classified weight factors for all prediction modes according to the motion activities

Full size table

Encoded CU has high chance for no splitting in SKIP mode (PU_WF is 0). To more investigate for PU_WF is 1 or 2, We have calculated a local average of the RD cost values of all encoded CUs. In this process, the final PU mode and the corresponding RD cost values are checked after encoded CU. Equations (33.2) and (33.3) perform after encoding of a CU check dimension (d) of CU and PU_WF (wf) in terms of satisfying wf < 3.

$$ sum_{d}^{wf} = \left( {{\text{avg}}_{d}^{\text{wf}} \times count_{d}^{wf} } \right) + RDcost_{d}^{wf} , $$

(33.2)

$$ avg_{d}^{wf} = \frac{{sum_{d}^{wf} }}{{count_{d}^{wf} + 1}} $$

(33.3)

An absolute motion vector (MV) is also calculated in proposed method. An average of the motion vectors can achieve from both List_0 and List_1 in the HM reference software. We only use magnitudes of the MV directions for the x and y in order to calculate simply as shown in Eq. 33.4.

$$ {\text{MV}}_{\text{abs}} = 0.5 \times \left( {\mathop \sum \limits_{i = 0}^{1} \left| {MV_{x} \left( i \right)} \right| + \left| {MV_{y} \left( i \right)} \right|} \right) $$

(33.4)

By using Eq. 33.4, we can know the motion of CU with directly relationship for MV _abs. The CU which has low MV _abs value has high change that will be not divided in current hierarchy of CU. According to above mentioned method, the proposed algorithm can determine which CU need to splitting early. The CU decision part in our algorithm can be divided into two stages. The first stage is to consider the CTU case. In terms of the CTU, we do not have any information from higher levels. Hence, ratio function cannot be calculated. In the second stage, other higher level CUs are considered.

This proposed CU splitting termination (CST) algorithm is check the PU mode weighting factor initially. If PU_WF is 0 meaning SKIP mode, the decision is taken directly that there is no need splitting. If PU_WF is 1 or 2, then we cannot make any decision directly. Hence, this method is divided into two stages in order to use different checking. For the CTU case, this algorithm only use MV _abs and local average RD cost of equivalent dimension of PU for PU_WF = 1 and 2. If RD cost of current PU is less than average RD with same dimension and MV _abs is 0, Encoder is select that there is no need splitting in the CTU case for PU_WF = 1 and 2.

Otherwise, ration function, MV _abs and local average RD cost of suitable dimension of PU for PU_WF = 1 and 2 are used in non-CTU case. A non-CTU case is same process. Since it can use ration function with higher level’s information, if the RD cost is less than average RD cost of suitable dimension of PU and ratio function (R) is less than 0.25, then it also decide that CU is no need to split.

33.2.2 Fast Prediction Unit Decision (FPU) Method

In this section, we propose an effective prediction unit selection method based on correlation and block motion complexity (BMC) that can be applied together with the CST above proposed. A natural video sequence has high spatial and temporal correlations. To analysis these correlations, we define predictor set (Ω) such as PU ₁ to PU ₇. Figure 33.1 shows position from current CU for each PU _i.

To define block motion complexity, we have designed weight factor for according to location from current CU as Table 33.2. Table 33.2 indicates to land more weight in spatial correlation and upper level (depth ‘d−1’) from current depth ‘d’ than other position. Based on the designed mode weight factor (Table 33.1) for motion activities, we defined a block motion complexity (BMC) [8] according to the predefined mode weight factors (γ) and a group of predictors in Ω in order to estimate the PU mode of the current CU.

Table 33.2 Weight factor for PUs (default setting and changing weight factor adaptively according to level of temporal layer)

Full size table

The BMC has been represented as follow [8]:

$$ BMC = \frac{{\mathop \sum \nolimits_{i = 1}^{N} W(i,l) \cdot k_{i} \cdot r_{i} }}{{\mathop \sum \nolimits_{i = 1}^{N} W(i,l) \cdot k_{i} }}, $$

(33.5)

$$ W\left( {i,l} \right) = w_{i} + f\left( {i,l} \right), $$

(33.6)

$$ f\left( {i,l} \right) = round\left( {l \cdot T_{level} } \right) \cdot T_{t} \cdot Tk_{i} $$

(33.7)

where N is the number of PUs equal to 7, W(i, l) is function of the weight factor for each mode. $ \gamma_{i} $ is weight factor for mode of PU _i in Ω. The k _i denote which PUs have available. If current CTB is in boundary region in the frame, then above or left PU information are not available. When PU _i is available, k _i is set to 1; otherwise, k _i is equal to 0.

Equation 33.6 is an adaptive weighting factor function. The value of w _i is weight factor for PU _i. It has property of $ \sum\nolimits_{i = 1}^{N} {w_{i} = 1} $. A value of $l$ denotes the temporal decomposition layer. The HEVC consist of three layers in 8 GOP according to distance between frames. Accordingly, temporal layer can represent Level_1,2,3,4 having 8, 4, 2, and 1 distance. If current frame close with reference frame, it has strong correlation than long distance case. Therefore, setting weight factors is changed when PU has 1 and 2 distance adaptively as second row in Table 33.2. Level_1,2,3,4 is set to 0.02 and is used for normalization of temporal level as l. Tk _i is a control value for changing the weight factors. Tk _i is a control value used to increase or decrease weight factors. It is set to 1, −2, or 0. PUs having the temporal correlations with PU ₄ and PU ₅ are set to be 1. Also, for a PU ₆, it is set to be −2. The others are set to 0.

$$ \left\{ {\begin{array}{ll} {BMC < Th_{1} ,} & {Slow\;motion,} \\ {Th_{1} \leq BMC < Th_{2} ,} & {Medium \;motion,} \\ {Th_{2} \leq BMC,} & {Complex\;motion\;or\;texture} \\ \end{array} } \right. $$

(33.8)

where Th ₁ and Th ₂ are set to 1 and 3 as the mode weight factor for slow motion and complex motion or texture in PU_WF of Table 33.1. If the BMC is slow motion, the proposed method performs SKIP mode and Inter 2N × 2N. In case of medium motion, mode decision process performs modes of slow motion, Inter 2N × N and N × 2N. When motion is complex, all modes are performed, then mode search process selects the best one.

33.3 Experimental Results

The proposed algorithm was implemented on HM 10.0 (HEVC reference software). Test conditions were random access using RA-Main. Standard sequences with 50 frames were used from two or three sequences per each Class with various QP values (22, 27, 32, 37). Details of the encoding environment can be seen in JCTVC-L1100 [9].

Table 33.3 shows results for comparisons between the original HM 10.0 software and the proposed algorithm without any fast options. Moreover, we have considered 3 other algorithms (namely CST, FPU and MVM) which are shown in this table. The time reduction performance of the proposed method is almost 48.72 % on average with some loss as 2.09 % and 0.12 (dB) losses both in bit-rate and Y-PSNR, respectively. It can be noted from the table, that our proposed algorithm gives superior result in terms of time reduction compared with other fast algorithm. However, it suffers from marginal bit rate and PSNR loss. But if we consider the time reduction, then the loss is negligible.

Table 33.3 The performance summarization of original HM reference software 10.0 and the proposed algorithm

Full size table

Figure 33.2 shows the RD performance. In Fig. 33.2a, it is shown that for the bigger QP values larger loss are generated for bit-rate. However, the proposed method is very similar to the original HM 10.0 software. There is negligible loss of bit-rate in Fig. 33.2b.

Although our algorithm has some loss for bit-rate and PSNR, it has significant time saving performance 43.66 %, minimum, and 51.70 %, max. In performance of Class B, the proposed algorithm can achieve very little loss as 0.2 % of bit-rate and 0.06 (dB) of PSNR, and 51.70 % of good time saving performance without any fast options.

Even if our proposed algorithm has some loss, it shows performance more than a other fast algorithm about time reduction. Our method also has good structure that can be running with other fast options in HEVC standard.

33.4 Conclusions

We have proposed a new fast coding algorithm both of early CU splitting termination (CST) and fast PU decision (FPU). The CST algorithm are used the RD costs for different CU dimensions and motion complexity for PU level. The FPU method is based on spatial, temporal and depth correlation information and adaptive weighting factor design using temporal distance. By combine with CST and FPU, our algorithm achieved, on average, a 48.72 % of time saving over the original HM 10.0 software with 2.09 % of bit-rate loss. Our algorithm is useful to where needed much time reduction like real-time video encoding systems.

References

Sullivan GJ, Ohm JR, Han WJ, Wiegand T (2012) Overview of the high efficiency video coding (HEVC) standard. IEEE Trans Circuits Syst Video Technol 22(12):1649–1668
Article Google Scholar
Wiegand T, Sullivan GJ (2007) The H.264/AVC video coding standard. IEEE Signal Process Mag II:148–153
Article Google Scholar
Choi K, Jang ES (2012) Fast coding unit decision method based on coding tree pruning for high efficiency video coding. Opt Eng Lett
Google Scholar
Shen L, Liu Z, Zhang X, Zhao W, Zhang Z (2013) An effective CU size decision method for HEVC encoders. IEEE Trans Multimedia 15(2):465–470
Article Google Scholar
Sampaio F, Bampi S, Grellert M, Agostini L, Mattos J (2012) Motion vectors merging: low complexity prediction unit decision heuristic for the inter prediction of HEVC encoders. In: International conference on multimedia and Expo, July 2012, pp 657–662
Google Scholar
Hosur PI, Ma KK (1999) Motion vector field adaptive fast motion estimation. In: International conference on information, communications and signal processing (ICICS’99)
Google Scholar
Zeng H, Cai C, Ma KK (2009) Fast mode decision for H.264/AVC based on macroblock motion activity. IEEE Trans Circuits Syst Video Technol 19(4):1–11
Article Google Scholar
Lee JH, Park CS, Kim BG, Jun DS, Jung SH, Choi JS (2013) Novel fast PU decision algorithm for the HEVC video standard. In: IEEE International conference on image processing, pp 1982–1985
Google Scholar
Bossen F (2013) Common test conditions and software reference configurations. In: Joint collaborative team on video coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11 12th meeting, Jan 2013
Google Scholar

Download references

Acknowledgments

This research was supported by the KCC (Korea Communications Commission), Korea, under the ETRI R&D support program supervised by the KCA (Korea Communications Agency) (KCA-2012-11921-02001).

Author information

Authors and Affiliations

Department of Computer Engineering, Sun Moon University, Asan, Republic of Korea
Jong-Hyeok Lee, Kalyan Goswami & Byung-Gyu Kim

Authors

Jong-Hyeok Lee
View author publications
You can also search for this author in PubMed Google Scholar
Kalyan Goswami
View author publications
You can also search for this author in PubMed Google Scholar
Byung-Gyu Kim
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Byung-Gyu Kim .

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Seoul University of Science & and Technology (SeoulTech), Seoul, Korea, Republic of (South Korea)
James J. (Jong Hyuk) Park
School of Information Technologies, University of Sydney, Sydney, New South Wales, Australia
Albert Zomaya
Humanitas College, Kyung Hee University, Seoul, Korea, Republic of (South Korea)
Hwa-Young Jeong
Computer Science & Software Engineering, Monmouth University, W. Long Branch, New Jersey, USA
Mohammad Obaidat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lee, JH., Goswami, K., Kim, BG. (2014). A New Fast Encoding Algorithm Based on Motion Activity for High Efficiency Video Coding (HEVC). In: Park, J., Zomaya, A., Jeong, HY., Obaidat, M. (eds) Frontier and Innovation in Future Computing and Communications. Lecture Notes in Electrical Engineering, vol 301. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-8798-7_33

Download citation

DOI: https://doi.org/10.1007/978-94-017-8798-7_33
Published: 19 April 2014
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-017-8797-0
Online ISBN: 978-94-017-8798-7
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

A New Fast Encoding Algorithm Based on Motion Activity for High Efficiency Video Coding (HEVC)

Abstract

Similar content being viewed by others

Fast encoding algorithm for high-efficiency video coding (HEVC) system based on spatio-temporal correlation

An Improved Hierarchical Motion Estimation Algorithm

Fast Coding Algorithm for High Efficient Video Coding (HEVC)

Keywords

33.1 Introduction

33.2 Proposed Work

33.2.1 Fast Coding Unit Splitting Termination (CST) Method

33.2.2 Fast Prediction Unit Decision (FPU) Method

33.3 Experimental Results

33.4 Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

A New Fast Encoding Algorithm Based on Motion Activity for High Efficiency Video Coding (HEVC)

Abstract

Similar content being viewed by others

Fast encoding algorithm for high-efficiency video coding (HEVC) system based on spatio-temporal correlation

An Improved Hierarchical Motion Estimation Algorithm

Fast Coding Algorithm for High Efficient Video Coding (HEVC)

Keywords

33.1 Introduction

33.2 Proposed Work

33.2.1 Fast Coding Unit Splitting Termination (CST) Method

33.2.2 Fast Prediction Unit Decision (FPU) Method

33.3 Experimental Results

33.4 Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation