A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Dolly, D. Raveena Judie; Bala, G. Josemin; Peter, J. Dinesh

doi:10.1007/s13369-017-2839-x

A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Research Article - Special Issue - Computer Engineering and Computer Science
Published: 03 November 2017

Volume 43, pages 4249–4263, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Arabian Journal for Science and Engineering Aims and scope Submit manuscript

A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Download PDF

D. Raveena Judie Dolly¹,
G. Josemin Bala¹ &
J. Dinesh Peter ORCID: orcid.org/0000-0002-4357-7163²

84 Accesses
4 Citations
Explore all metrics

Abstract

Video compression marks its necessity when a huge sized video needs to be transmitted. The process starts with the identification of GoP (group of pictures), which depends on I- (intra), B- (bidirectional) and P- (predicted) frames determination. GoP is fixed, where consecutive frames are placed in an orderly manner based on the GoP size. Conventionally, B-frames lead to buffering of memory within the past and future frames consuming more computational time. Such issues are handled by an adaptive framework for determining frames based on matching criteria rather than fixed GoP. NSEW (North–South–East–West) affine translation (NAT) is proposed for replacing B with either I- or P-frame. The proposed framework involves video compression using affine motion-based free-form transformation and video decompression using warping methodologies for the purpose of compressing and decompressing the video sequence, based on the resulted I- and P-frames. B-spline transformation was also initiated at local level along with global affine transformation to improve the subjective quality of the decompressed video sequence. The methodology was investigated for the file size, computational time, peak-signal-to-noise ratio (PSNR) and Structural Similarity index (SSIM), which proved the superiority of the proposed technique. Further, the methodology was also investigated with optimizing the affine motion parameters (AMP) using nonlinear least squares, Broyden–Fletcher–Goldfarb–Shanno (BFGS) and limited-memory BFGS which yet again proved to be far more superior to conventional techniques with an average PSNR of 38.98 dB with LBFGS. To further improve the subjective quality, affine B-spline-based motion estimation using LBFGS was implemented and observed the average PSNR gain to be 42.03 dB.

Article PDF

Automatic Adaptation of Frames Using NSEW Affine Translation

ASABSA: Adaptive Shape Assisted Block Search Algorithm and Fuzzy Holoentropy-Enabled Cost Function for Motion Vector Computation

Article 19 September 2016

Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding

Article 28 July 2017

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Wang, Y.L.; Wang, J.X.; Lai, Y.W., Su, A.W.Y.: Dynamic GOP structure determination for real-time MPEG-4 advanced simple profile video encoder. In: Proceedings of IEEE International conference on Multimedia and Expo, Amsterdam, pp. 293–296 (2005)
Paul, M.; Lin, W.; Lau, C.T.; Lee, B.S.: Video coding with dynamic background. EURASIP J. Adv. Signal Process. 11, 1–17 (2013)
Google Scholar
Ohm, J.R.; Sullivan, G.J.; Schwarz, H.; Tan, T.K.; Wiegand, T.: Comparison of the coding efficiency of video coding standards—including high efficiency video coding (HEVC). IEEE Trans. Circuits Syst. Video Technol. 22, 1669–1684 (2012)
Article Google Scholar
Mathew, M: Overview of temporal scalability with scalable video coding (SVC). Texas Instruments Application report, pp. 1–7 (2010)
Paul, M.; Lin, W.; Lau, C.T.; Lee, B.S.: A long term reference frame for hierarchical B-picture based video coding. IEEE Trans. Circuits Syst. Video Technol. (2014). doi:10.1109/TCSVT.2014.2302555
Tabatabai, A.J.; Jasinschi, R.S.; Naveen, T.: Motion estimation methods for video compression—a review. J. Frankl. Inst. 335(8), 1411–1441 (1998)
Article MATH Google Scholar
Lu, Y.; Li, Z.N.: Automatic object extraction and reconstruction in active video. Pattern Recognit. 41, 1159–1172 (2008)
Article MATH Google Scholar
Wiegland, T.; Steinbach, E.; Girod, B.: Affine multipicture motion-compensated prediction. IEEE Trans. Circuits Syst. Video Technol. 15, 197–209 (2005)
Article Google Scholar
Theoharis, T.; Papaioannou, G.; Platis, N.; Patrikalakis, N.M.: Graphics and Visualization: Principles and Algorithms. A. K. Peters/CRC Press, Taylor & Francis Group, Wellesley (2008)
Google Scholar
Alavala, C.R.: CAD/CAM Concepts and Applications. PHI Learning Private limited, New Delhi (2009)
Google Scholar
Ding, J.R.; Yang, J.F.: Adaptive group-of-pictures and scene change detection methods based on existing H.264 advanced video coding information. IET Image Process. 2(2), 85–94 (2008)
Article Google Scholar
Test videos. http://media.xiph.org/video/derf/ (2014). Accessed 16 Aug 2014
Gu, S.; Xin, M.; Sciurba, F.C.; Wang, C.; Kaminski, N.; Pu, J.: Bi-directional elastic image registration using B-spline affine transformation. Comput. Med. Imaging Graph. 48(4), 306–314 (2014)
Article Google Scholar
Szeliski, R.; Coughlan, J.: Spline-based image registration. Int. J. Comput. Vis. 22(3), 199–218 (1997)
Article Google Scholar
Gholipour, A.; Kehtarnavaz, N.; Briggs, R.; Devous, M.; Gopinath, K.: Brain functional localization: a survey of image registration techniques. IEEE Trans. Med. Imaging 26(4), 427–451 (2007)
Article Google Scholar
Xiao, Y.; Wei, Z.; Wang, Z.: A limited memory BFGS type-method for large-scale unconstrained optimization. Comput. Math. Appl. 56, 1001–1009 (2008)
Article MathSciNet MATH Google Scholar
Igarta, M.: A study of MPEG-2 and H.264 video coding. Thesis submitted for Master of Science in Electrical and Computer Engineering, Purdue University, US (2004)
Po, L.M.; Ng, K.H.; Cheung, K.W.W.; Wong, K.M.; Uddin, Y.M.S.; Ting, C.W.: Novel directional gradient descent searches for fast block motion estimation. IEEE Trans. Circuits Syst. Video Technol. 19(8), 1189–1195 (2009)
Article Google Scholar
Oh, K.J.; Ho, Y.S.: Adaptive rate-distortion optimization for H.264. In: PCM 2005, Part II. LNCS, vol. 3768, pp. 617–628. Springer, Berlin (2005)
Liu, J.; Qiao, F.; Wei, Q.; Yang, H.: A novel video compression method based on underdetermined blind source separation. In: Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol. 240, pp. 13–20 (2013)
Kumar, V.; Sharma, K.G.; Jalal, A.S.: Macro-block mode decision in MPEG-2 video compression using machine learning. In: Lecture Notes in Electrical Engineering, pp. 149–158 (2013)
Pandian, S.I.A.; Bala, G.J.; Anitha, J.: A pattern based PSO approach for block matching in motion estimation. Eng. Appl. Artif. Intell. 26, 1811–1817 (2013)
Article Google Scholar
Al-Najdawi, N.; Al-Najdawi, M.N.; Tedmori, S.: Employing a novel cross diamond search in a modified hierarchial search motion estimation algorithm for video compression. Inf. Sci. 268, 425–435 (2014)
Article MATH Google Scholar
Kordasiewicz, R.C.; Gallant, M.D.; Shirani, S.: Affine motion prediction based on translational motion vectors. IEEE Trans. Circuits Syst. Video Technol. 17(10), 1388–1394 (2007)
Article Google Scholar
Ahmadi, A.; Pouladi, F.; Salehinejad, H.; Talebi, S.: Fast two-stage global motion estimation: a blocks and pixels sampling approach. Intell. Interact. Multimed. Syst. Serv. 11, 143–151 (2011)
Article Google Scholar
Dimou, A.; van der Vleuten, R.J.; de Haan, G.: Picture-quality optimization for the high definition TV broadcast chain. Technical Note PR-TN 2007/00338- Koninklijke Philips Electronics Nv 2007 (2007)
Xu, D.; Wang, R.: Two-dimensional reversible data hiding-based approach for intra-frame error concealment in H.264/AVC. Signal Process. Image Commun. 47, 289–302 (2016)
Article Google Scholar
Mukherjee, R.; Debattista, K.; Bashford-Rogers, T.; Vangorp, P.; Mantiuk, R.; Bessa, M.; Waterfield, B.; Chalmers, A.: Objective and subjective evaluation of High dynamic range video compression. Signal Process. Image Commun. 47, 426–437 (2016)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, Karunya University, Coimbatore, India
D. Raveena Judie Dolly & G. Josemin Bala
Department of Computer Sciences Technology, Karunya University, Coimbatore, India
J. Dinesh Peter

Authors

D. Raveena Judie Dolly
View author publications
You can also search for this author in PubMed Google Scholar
G. Josemin Bala
View author publications
You can also search for this author in PubMed Google Scholar
J. Dinesh Peter
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to J. Dinesh Peter.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dolly, D.R.J., Bala, G.J. & Peter, J.D. A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters. Arab J Sci Eng 43, 4249–4263 (2018). https://doi.org/10.1007/s13369-017-2839-x

Download citation

Received: 02 May 2017
Accepted: 04 October 2017
Published: 03 November 2017
Issue Date: August 2018
DOI: https://doi.org/10.1007/s13369-017-2839-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Abstract

Article PDF

Similar content being viewed by others

Automatic Adaptation of Frames Using NSEW Affine Translation

ASABSA: Adaptive Shape Assisted Block Search Algorithm and Fuzzy Holoentropy-Enabled Cost Function for Motion Vector Computation

Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters

Abstract

Article PDF

Similar content being viewed by others

Automatic Adaptation of Frames Using NSEW Affine Translation

ASABSA: Adaptive Shape Assisted Block Search Algorithm and Fuzzy Holoentropy-Enabled Cost Function for Motion Vector Computation

Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation