Abstract
Video compression marks its necessity when a huge sized video needs to be transmitted. The process starts with the identification of GoP (group of pictures), which depends on I- (intra), B- (bidirectional) and P- (predicted) frames determination. GoP is fixed, where consecutive frames are placed in an orderly manner based on the GoP size. Conventionally, B-frames lead to buffering of memory within the past and future frames consuming more computational time. Such issues are handled by an adaptive framework for determining frames based on matching criteria rather than fixed GoP. NSEW (North–South–East–West) affine translation (NAT) is proposed for replacing B with either I- or P-frame. The proposed framework involves video compression using affine motion-based free-form transformation and video decompression using warping methodologies for the purpose of compressing and decompressing the video sequence, based on the resulted I- and P-frames. B-spline transformation was also initiated at local level along with global affine transformation to improve the subjective quality of the decompressed video sequence. The methodology was investigated for the file size, computational time, peak-signal-to-noise ratio (PSNR) and Structural Similarity index (SSIM), which proved the superiority of the proposed technique. Further, the methodology was also investigated with optimizing the affine motion parameters (AMP) using nonlinear least squares, Broyden–Fletcher–Goldfarb–Shanno (BFGS) and limited-memory BFGS which yet again proved to be far more superior to conventional techniques with an average PSNR of 38.98 dB with LBFGS. To further improve the subjective quality, affine B-spline-based motion estimation using LBFGS was implemented and observed the average PSNR gain to be 42.03 dB.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Wang, Y.L.; Wang, J.X.; Lai, Y.W., Su, A.W.Y.: Dynamic GOP structure determination for real-time MPEG-4 advanced simple profile video encoder. In: Proceedings of IEEE International conference on Multimedia and Expo, Amsterdam, pp. 293–296 (2005)
Paul, M.; Lin, W.; Lau, C.T.; Lee, B.S.: Video coding with dynamic background. EURASIP J. Adv. Signal Process. 11, 1–17 (2013)
Ohm, J.R.; Sullivan, G.J.; Schwarz, H.; Tan, T.K.; Wiegand, T.: Comparison of the coding efficiency of video coding standards—including high efficiency video coding (HEVC). IEEE Trans. Circuits Syst. Video Technol. 22, 1669–1684 (2012)
Mathew, M: Overview of temporal scalability with scalable video coding (SVC). Texas Instruments Application report, pp. 1–7 (2010)
Paul, M.; Lin, W.; Lau, C.T.; Lee, B.S.: A long term reference frame for hierarchical B-picture based video coding. IEEE Trans. Circuits Syst. Video Technol. (2014). doi:10.1109/TCSVT.2014.2302555
Tabatabai, A.J.; Jasinschi, R.S.; Naveen, T.: Motion estimation methods for video compression—a review. J. Frankl. Inst. 335(8), 1411–1441 (1998)
Lu, Y.; Li, Z.N.: Automatic object extraction and reconstruction in active video. Pattern Recognit. 41, 1159–1172 (2008)
Wiegland, T.; Steinbach, E.; Girod, B.: Affine multipicture motion-compensated prediction. IEEE Trans. Circuits Syst. Video Technol. 15, 197–209 (2005)
Theoharis, T.; Papaioannou, G.; Platis, N.; Patrikalakis, N.M.: Graphics and Visualization: Principles and Algorithms. A. K. Peters/CRC Press, Taylor & Francis Group, Wellesley (2008)
Alavala, C.R.: CAD/CAM Concepts and Applications. PHI Learning Private limited, New Delhi (2009)
Ding, J.R.; Yang, J.F.: Adaptive group-of-pictures and scene change detection methods based on existing H.264 advanced video coding information. IET Image Process. 2(2), 85–94 (2008)
Test videos. http://media.xiph.org/video/derf/ (2014). Accessed 16 Aug 2014
Gu, S.; Xin, M.; Sciurba, F.C.; Wang, C.; Kaminski, N.; Pu, J.: Bi-directional elastic image registration using B-spline affine transformation. Comput. Med. Imaging Graph. 48(4), 306–314 (2014)
Szeliski, R.; Coughlan, J.: Spline-based image registration. Int. J. Comput. Vis. 22(3), 199–218 (1997)
Gholipour, A.; Kehtarnavaz, N.; Briggs, R.; Devous, M.; Gopinath, K.: Brain functional localization: a survey of image registration techniques. IEEE Trans. Med. Imaging 26(4), 427–451 (2007)
Xiao, Y.; Wei, Z.; Wang, Z.: A limited memory BFGS type-method for large-scale unconstrained optimization. Comput. Math. Appl. 56, 1001–1009 (2008)
Igarta, M.: A study of MPEG-2 and H.264 video coding. Thesis submitted for Master of Science in Electrical and Computer Engineering, Purdue University, US (2004)
Po, L.M.; Ng, K.H.; Cheung, K.W.W.; Wong, K.M.; Uddin, Y.M.S.; Ting, C.W.: Novel directional gradient descent searches for fast block motion estimation. IEEE Trans. Circuits Syst. Video Technol. 19(8), 1189–1195 (2009)
Oh, K.J.; Ho, Y.S.: Adaptive rate-distortion optimization for H.264. In: PCM 2005, Part II. LNCS, vol. 3768, pp. 617–628. Springer, Berlin (2005)
Liu, J.; Qiao, F.; Wei, Q.; Yang, H.: A novel video compression method based on underdetermined blind source separation. In: Multimedia and Ubiquitous Engineering. Lecture Notes in Electrical Engineering, vol. 240, pp. 13–20 (2013)
Kumar, V.; Sharma, K.G.; Jalal, A.S.: Macro-block mode decision in MPEG-2 video compression using machine learning. In: Lecture Notes in Electrical Engineering, pp. 149–158 (2013)
Pandian, S.I.A.; Bala, G.J.; Anitha, J.: A pattern based PSO approach for block matching in motion estimation. Eng. Appl. Artif. Intell. 26, 1811–1817 (2013)
Al-Najdawi, N.; Al-Najdawi, M.N.; Tedmori, S.: Employing a novel cross diamond search in a modified hierarchial search motion estimation algorithm for video compression. Inf. Sci. 268, 425–435 (2014)
Kordasiewicz, R.C.; Gallant, M.D.; Shirani, S.: Affine motion prediction based on translational motion vectors. IEEE Trans. Circuits Syst. Video Technol. 17(10), 1388–1394 (2007)
Ahmadi, A.; Pouladi, F.; Salehinejad, H.; Talebi, S.: Fast two-stage global motion estimation: a blocks and pixels sampling approach. Intell. Interact. Multimed. Syst. Serv. 11, 143–151 (2011)
Dimou, A.; van der Vleuten, R.J.; de Haan, G.: Picture-quality optimization for the high definition TV broadcast chain. Technical Note PR-TN 2007/00338- Koninklijke Philips Electronics Nv 2007 (2007)
Xu, D.; Wang, R.: Two-dimensional reversible data hiding-based approach for intra-frame error concealment in H.264/AVC. Signal Process. Image Commun. 47, 289–302 (2016)
Mukherjee, R.; Debattista, K.; Bashford-Rogers, T.; Vangorp, P.; Mantiuk, R.; Bessa, M.; Waterfield, B.; Chalmers, A.: Objective and subjective evaluation of High dynamic range video compression. Signal Process. Image Commun. 47, 426–437 (2016)
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Dolly, D.R.J., Bala, G.J. & Peter, J.D. A Hybrid Tactic Model Intended for Video Compression Using Global Affine Motion and Local Free-Form Transformation Parameters. Arab J Sci Eng 43, 4249–4263 (2018). https://doi.org/10.1007/s13369-017-2839-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13369-017-2839-x