Keywords

2.1 Introduction

One of the most important Quality of Service (QoS) parameters for wireless networks is the probability of occurrence of bit and packet errors measured by the Bit Error Rate (BER) and Packet Error Rates (PER) respectively. Neither single packet losses nor single bit errors can provide a comprehensive imitation modeling of fading channels. In digital systems, errors often occur in packets as a result of transmission conditions. Specifically, signals are attenuated during transmission and this consequently leads to packetization of errors. A group of erroneous packets is essentially a sequence of packets that are either lost in transit or received with error after transmission over a communication channel within a given period of time. Burst Error Length (BEL) is defined as the number of erroneous packets included in a given group of errors [1].

2.2 Method

The H.264/AVC standard is a compendium of innovations and improvements on prior video coding technologies vis-a-vis enhancement of coding efficiency and effective usage over a wide gamut of networks and applications [2]. For a complete analysis of the impact of errors on resultant signal quality, we investigate the influence of the conduit’s (i.e., wireless transmission medium’s) robustness on the perceivable quality of streaming video standard H.264/AVC using the developed Hardware and Software Complex (HSC) [3, 4]. Objective and subjective indicators of video quality were obtained using methods described in [4]. For qualitative assessment, it is imperative to have the video file data before transmission over the network (on the transmitting end), and after reception from the network (at the receiving end). Data required for the qualitative assessment at the transmitting end are: (a) the original unencoded video in YUV format, (b) the encoded video in MPEG-4, (c) transmission start time and (d) type of each packet sent to the network.

The following data are required for qualitative assessment at the receive end: (a) time of reception of each packet from the network, (b) type of each packet received from the network, (c) the encoded video (possibly distorted) in MPEG-4 format, and (d) the decoded video in YUV format for display.

We performed data evaluation by comparing the transmitted and received files.

A. Structure of the Hardware-Software Complex

Data processing is carried out in the three phases described below:

First phase: the time taken in sending and receiving each packet on both sides as well as the packet type are analyzed. This results in a record of the type of frame and the time elapsed between transmitted and received packets. The distorted video file at the receive end is restored using the originally encoded video file and information about lost packets. Subsequently, the video is decoded for playback to the viewer. It is at this stage that the general task of assessing video quality is considered. Video quality indicators always require a comparison of the received (possibly distorted) video frame and the corresponding source frame. In the case of a total loss of frame in transit, the necessary frame synchronization before and after transmission over the network becomes impossible.

Second phase: In this phase of data processing, the problem of quality assessment is resolved on the basis of the analysis of information about frame loses. Substituting the last relayed frame for the lost frame restores frame synchronization. This methodology allows for subsequent frame-wise assessment of video quality.

Third phase: At this stage, the assessment of the quality of decoded video is achieved by means of both the restored and source video files.

Figure 2.1 shows a block diagram of the HSC for assessing the quality of streaming video. The schematic diagram reflects the interaction between modules in the transmission of the digital video from a source through the network connection to the viewer.

Fig. 2.1
figure 1

Block diagram of HSC

The HSC modules interact with the network by using traces containing all the necessary data listed above. Thus, for proper functioning, the HSC requires two traces, the source video and the decoder. The data network can be considered simply as a two-port black-box that introduces delay, packet loss, and possibly packet rearrangement. The network was simulated based on the aforementioned assumptions [5] in the NS2 environment. A detailed description of the functional modules of the HSC is given in [4].

2.3 Simulation Preparation

Video encoding begins with color space conversion from RGB to YUV also known as Y, Cr, Cb i.e., one luma and two chroma components [1]. It is common knowledge that there is a significant correlation of color components in any typical image of the RGB format, which makes it an obviously redundant format in terms of compression. The standard television uses a different representation of images, which also employs three components of the signal, but these components are uncorrelated (i.e., void of inter- componential redundancy). R, G and B components are converted to luminance Y component and two color difference components U and V of the YUV format. Since most information is stored in the luminance component, little information is lost if a thinning of the U and V components is done.

Standard test videos in the YUV format may be used as initial test video sequences. However, these videos have limited playback time and hence do not allow for the assessment of change in quality under prolonged video broadcast. Similarly, a vast amount of experimental data cannot be obtained from them. It is for these reasons that we recorded our own 30-min video in YUV format (send.YUV) with a resolution of 640 × 480 pixels and frame rate of 25 fps using a special software.

The first step is to encode the source video to H.264 format (video stream file). This is done by the video codec (a device used for encoding and decoding video signals).

Video codecs are usually characterized by (a) channel throughput, (b) decoded video distortion rate, (c) startup latency, (d) end-to-end delay, computational complexity, and (e) memory capacity. A good codec is one capable of providing the necessary trade-off vis-a-vis these characteristics [1]. In the next step, the encoded video stream is packaged in an MP4-container for onward transport over the network using the User Datagram Protocol (UDP). The result of encoding the original video is an MP4-file. Since it is necessary to evaluate the quality of video transmitted over the network, the need arises to create a spare decoded YUV file from the newly created MPEG layer-4 file, which serves as the control in evaluating the quality of video transmitted over the network, excluding the impact of the codec. It is thus possible to estimate the influence of a wireless network on the received visual video quality, while excluding encoding and decoding losses. For simulation purposes, it is necessary to create a video trace file that contains the following information: frame number, frame type, frame size, and the number of segments in which the frame is divided into packets. This video trace serves as the input to the simulator network, where the sending and reception of video data occurs. As a result of video transmission over the network, it is necessary to obtain transmission trace files and reception trace files, which contain the following packet data: the transmission/reception time, a unique identifier and trace file size. These two traces are used to determine lost packets in the network. In the end, we obtain files of the sent and received packets containing detailed information about the time of sending from the transmitter and the time of reception by the receiver.

2.4 Modeling and Simulation of Transmission Over Wireless Network

The HSC allows for the simulation of the main types of errors encountered when transmitting video data over wireless networks. The two types of simulation required are as listed below:

A. Bit error simulation

Simulated transmission over a wireless channel model with Additive White Gaussian Noise (AWGN) is conducted. In the process of simulation, certain bits in the sequence are distorted (i.e., inverted) with a given probability. The probability value used is defined by the Bit Error Rate (BER).

B. Packet error simulation

The UDP packets can be manually deleted from the received trace file. This allows for the observation of codec functionality and analysis of change in visual quality in cases of packet loss. At the same time, both the received and undistorted files can be obtained during transmission over an “ideal” channel with unlimited bandwidth and no delay, with subsequent removal of some packets.

2.5 Calculation of Losses and Estimation of Obtained Video Quality

Calculation of losses given the availability of unique id of the package is quite easily achieved. With the aid of the video trace, each packet is assigned a type. Each package of the assigned type that is not included in the received trace is deemed lost. Loss of frame is calculated for any (and all) frame(s) with a lost packet. If the first packet in a frame is lost, then the whole frame is considered lost since the video decoder cannot decode a frame, which is missing the first part. The module for trace assessment estimates received traces. The recovered file must be decoded in YUV format. There are two major methods of estimating the quality of digital video, namely, the subjective and objective methods:

Subjective quality assessment is always based on viewer impression. It is extremely costly, very time consuming and requires specialized equipment. Traditionally, subjective video quality is determined by expert assessment and calculation of the average Mean Opinion Score (MOS), which is assigned a value from 1 to 5 (ITU scale) [6, 7], where 1 and 5 represent worst and best received video quality respectively.

Objective video quality assessment is usually done by measuring the average luminance peak Signal-to-Noise Ratio (PSNR). The PSNR is a traditional metric, which allows for the comparison of any two images [8]. The PSNR module evaluates the objective quality of received video stream in Polynomial Approximation Coding (PAC). The end result is the values of PSNR calculated for the original and distorted image (as shown in Fig. 2.2). MOS values are calculated from the PSNR indicator.

Fig. 2.2
figure 2

PSNR value distribution histogram of video sequence for different values of wireless channel BER

2.6 Results, Analysis and Discussion

In order to study the effect of transmission errors on the resulting video quality, the transmission of a 30-min video over a wireless network with random packet errors in the channel was simulated. Characteristics of sequences used are listed in Table 2.1.

Table 2.1 Characteristics of encoded video

For modeling purposes, the encoded video stream was split into RTP/UDP-packets using the hardware-software tool reported in [4]. Bit error simulation for transmission over a wireless channel was done using an AWGN error generator contained within the PAC structure. Simulation of packet errors during transmission over a wireless channel was done by deleting packets from the received trace file [4]. This allowed us to explore and analyze the change in visual quality during loss of packets. The received and undistorted trace files were obtained for transmission over an “ideal” channel with unlimited bandwidth and no delay in using the NS2 software environment [5], followed by a random removal of packets, according to PER and BER parameters. Quality assessment was carried out using PSNR and MOS indicators, calculated by using hardware and software tools [5]. The standard deviation of the quality of the average PSNR values was calculated using Eq. (2.1) [9].

$$S_{PSNR}^{\prime}=\sqrt{\frac{1}{N-1}\sum \limits_{n=0}^{N-1}(PSNR_n-\overline {PSNR^{\prime}})^2}.$$
(2.1)

A. Effect of bit error

Figure 2.2 shows the effect of BER on the quality of video streaming.

Analysis of the results of streaming video over the simulated wireless network with different values of BER revealed the following:

  1. (i)

    Simulating a wireless channel using AWGN model, and additive, bit errors with a value of BER ≤ 3 × 10−5 does not affect the quality of the video. However, when BER ≥ 4 × 10−3 packet loss in the network reaches its maximum value of ≥99.9 %.

  2. (ii)

    Objectively, excellent quality of video transmission over a channel can be guaranteed for all bit error probabilities less than 1 × 10−4, good quality is in the range of 1 × 10−4 to 4 × 10−4, satisfactory quality is in the range of 4 × 10−4 to 8 × 10−4, poor quality is in the range of 8 × 10−4 to 1 × 10−3, while very bad quality is for any BER > 1 × 10−3.

  3. (iii)

    The histograms of the distribution of PSNR values during simulation and broadcast over a real network in general are of a twin-peak form. One of the peaks characterizes the PSNR value of error-free video stream (the decoder is able to correct bit errors when they are relatively few in the frame). The second peak characterizes PSNR degradation due to the large number of corrupted video frames in fading moments (the decoder is unable to fix large numbers of bit errors). As the number of errors increases, this maximum increases commensurately with a decrease in the second. During transmission, depending on error level, values of either of the maxima increase. In cases when errors in the communication channel are negligible, the PSNR distribution has only one maximum.

Empirical values of BER transitions from an acceptable quality to the poor, according to the relationship between PSNR and MOS [6], are presented in Table 2.2. However, the AWGN model does not allow for adequately simulation of a fading channel. Typically, errors are often long term, since high probability of bit loss occurs in specific periods of transmission, e.g., during poor propagation. Attenuation of the transmitted signal results in packetizing (grouping) of errors. Another cause of error grouping can be physical defects of, and failures inherent in the information storage system. When using VLC, bit error occurrence results in group errors or packetization of errors.

Table 2.2 Relationship between quality indicators and BER

B. Effect of packet error

Figure 2.3 shows the effect of PER indicator on streaming video quality. The range of values of PER, within which the resulting quality is maximal (i.e., almost equal to the original) and minimal are indicated. It is shown that with PER ≤ 1 × 10−4 error does not affect the resultant video quality and can be easily eliminated with decoders and existing methods of error correction. A further change in the quality has a stepwise nature and decreases with increasing PER.

Fig. 2.3
figure 3

PSNR value distribution histogram of video sequence for different values of wireless channel PER

Analyzing the results of streaming video over a simulated wireless network with a given probability of packet loss, we safely conclude that:

  1. (i)

    A PER value of ≤ 1 × 10−4 in simulation of a wireless network does not affect the video quality. When PER ≤ 1 × 10−3 impact of errors on video quality is not noticeable and does not irritate during viewing experience. When PER ≥ 0.1, packet loss in the network has the worst effect on visual quality.

  2. (ii)

    Objectively, excellent quality of video transmission over a channel can be guaranteed for all packet error probabilities less than 1 × 10−3, good quality is in the range of 1 × 10−3–3 × 10−3, satisfactory quality is in the range of 3 × 10−3–1 × 10−2, poor quality is in the range of 1 × 10−2–5 × 10−2, while very bad quality is for any PER > 5 × 10−2.

Histograms of the distribution of values of PSNR when PER ≤ 6 × 10−4, in general, have a bimodal shape. One of the peaks characterizes the value of PSNR of video stream distorted due to packet loss. The second maximum characterizes deterioration in the PSNR of dependent frames. As the number of errors increases, one of the peaks increases due to a decrease in the other.

C. Effect of length of error groups

To study the effect of the length of error groups on resultant quality, the simulation of a 30-min video transfer over a wireless network for the values of PER of 1 × 10−5−5 × 10−2 is repeated, since a visual change in video quality is observed at this range. The simulation of groups of error packets during transmission over a wireless channel was done by means of random deletion of packet groups from the receive trace file with a given BEL. For this particular example BEL = 100 implies that the total random number of consecutively deleted packets does not exceed 100. The total sum of erroneous (deleted) packets in the video sequence for the whole experiment given PER = const. remained the same, irrespective of the value of BEL. Figure 2.4 shows the effect of BEL on the quality of streaming video for PER = 1 × 10−3.

Fig. 2.4
figure 4

PSNR value distribution histogram of video sequence for PER = 1 × 10−3 and varying values of wireless channel BEL

Analyzing the results of streaming video over the simulated wireless network with a given grouping of erroneous packets, we can draw the following conclusions:

  1. (iv)

    For PER ≤ 1 × 10−3 the effect of single packet errors on quality is insignificant and does not irritate the viewing experience.

  2. (v)

    Histograms of the distribution of values of PSNR have two maxima. One of the peaks characterizes the value of PSNR of video frames distorted due to the loss of packets. The second maximum characterizes the deterioration of PSNR of dependent frames. With increasing quantities BEL is one of the peaks decreases as the number of dependent frames are also reduced, whereas the second peak remains unchanged. This is explained by the fact that the single scattered throughout the video sequence error number of distorted frames is large due to error propagation to dependent frames. An increase in the BEL value leads to a decrease in one of the maxima, since the number of dependent frames also decreases, while the second maximum remains the same. This is due to the fact that under singular errors spread across the whole video sequence, the number of distorted frames is large because of the distribution of errors on dependent frames.

  3. (vi)

    Increasing the length of the error groups leads to an increase in the average quality of the video sequence.

  4. (vii)

    Effect of error groups on the quality is more powerful because of the local concentration of errors. However, the average quality of the video sequence increases with increase in the length of the grouping for a given value of probability of occurrence of packet errors.

For BEL ≥ 60 the average quality is almost identical to the original video.

D. Relationship between PER and BEL

The average quality of the experimental video sequence for different values of PER and BEL is shown in Fig. 2.5.

Fig. 2.5
figure 5

Estimate of video sequence quality for different values of PER and BEL

In assessing the impact of erroneous packets received on quality, it is necessary to analyze not only the likelihood of occurrence of errors, but also their structure and length of their grouping. Additionally, the following conclusions can be drawn:

  1. (i)

    Increasing the length of error groupings leads to an increase in the average quality of the video sequence. This is due to the deterioration of a small section of video, where error groups are concentrated, whereas in the case of single bit errors deterioration in the quality of video may be observed across the whole sequence;

  2. (ii)

    When the length of erroneous packets is BEL ≤ 6 the change in quality is minor and identical to the influence of single packet errors (BEL = 1);

  3. (iii)

    When BEL ≥ 60 the average quality is almost identical to the original (PSNR < 90 dB). It is logical to assume that the value of BEL in the longer video sequences, with the same average quality may have a higher value;

  4. (iv)

    The highest dynamics of change in PSNR = 60 dB is observed in two cases: (a) for a fixed PER = 1 × 10−3 and the variable values of BEL; and (b) at BEL ≥ 80 and the varying values of the PER. In other cases, the dynamics is not essential and minimal in the absence of clustering of errors (BEL = 1);

  5. (v)

    With increasing PER, the effect of BEL on quality decreases due to increase in denseness of single errors;

  6. (vi)

    Analysis of the results of PER and BEL shows that for effective assessment of the impact of transmission errors on resultant quality it is necessary to analyze not only the likelihood of errors, but also their structure and length of their grouping. The most realistic and accurate method of modeling statistical errors in communication channels is the use of probability data obtained from real networks.

At BER values ≤ 3 × 10−5 bit errors do not affect the quality of the received video and are easily eliminated by well-known methods of error correction implemented in WiMAX. When BER ≥ 4 × 10−3 packet loss in the network reaches its maximum value and leads to an unacceptable quality of the received video. Ensuring objectively excellent quality of video sequence over a channel can be done for probabilities of bit error rate less than 1 × 10−4; good quality in the range of 1 × 10−4 –4 × 10−4; satisfactory quality in the range of 4 × 10-4 –8 × 10−4; poor quality in the range of 8 × 10−4 –1 × 10−3 and very bad at BER ≥ 1 × 10−3. The use of H.264/AVC video in wireless access systems with VLC codec of variable length leads to a disruption of the synchronization of decoded video sequences and the occurrence of additional grouping of errors, whose impact on the quality for video decoding is much stronger than that of the bit error, since it leads to loss of large segments of the information. It is shown that the quality of the video affects not only the probability of error, but also the structure and length of errors. Analysis of individual errors showed that at PER ≤ 1 × 10−4 packet errors do not affect the quality of the received video and are easily eliminated by well-known methods of error correction deployed in wireless networks. When PER ≤ 1 × 10−3, the effect of errors on quality is not noticeable and does not irritate the viewing experience. For values of PER ≥ 0.1 packet loss in the network leads to an unacceptable quality of the received video. Ensuring objectively excellent quality of video sequence over a channel can be done for probabilities of bit error rate less than 1 × 10−3; good quality in the range of 1 × 10−3–3 × 10−3; satisfactory quality in the range of 3 × 10−3 –1 × 10−2; poor quality in the range of 1 × 10−2–5 × 10−2 and very bad at BER ≥ 5 × 10−2.

To assess the impact on quality of video playback under error grouping conditions of error groups BEL, the use of a regular (deterministic) model is proposed. It is shown that the effect of errors on the average quality is stronger due to local concentration of errors. The average quality of the video sequence at the same time increases with increase in the length of the grouping for a given value of probability of occurrence of packet errors. For groupings of length BEL ≥ 60, average quality is almost identical to that of the source video. With increasing PER, the effect of BEL on quality decreases due to increase in the denseness of single errors. Increase in the BEL leads to an increase in the average quality of the video sequence irrespective of the PER value. The highest dynamics of change in PSNR is observed for fixed PER = 1 × 10−3 and the variable values of BEL; at BEL ≥ 80 as well as for the changing values of the PER. In other cases, the dynamics is not significant and is minimal in the absence of clustering of errors. To assess the quality of video under packetization of errors under real conditions, it is necessary to investigate the actual distribution of packetization of errors in the communication channel.

E. Analysis of error packetization phenomenon

The need to create realistic simulation and mathematical models of behavior of losses in the communication channels based on the apparatus of Markov chains for wireless access systems is a scientific problem of important consequence. Markov processes with the necessary number of states sufficiently describe the mechanism of transmission of information [10], the knowledge of which is necessary to analyze network problems during packet video transmission. The parameters of the model make it possible to determine the quality of transmitted video as well as the statistical parameters of the network. In an experiment carried out by the authors, the matrix of values in Fig. 2.6 was obtained for the Markov model developed for investigating the error packetization phenomenon.

Fig. 2.6
figure 6

The matrix of values

A model describing the length of error intervals and error-free reception for streaming video transmission was developed based on the experimental data obtained as a result of streaming video from a moving source on WiMAX network [11]. Based on the graph of packet loss distribution, an array was formed in which the lost packet corresponds to a logic zero (0) and received packet corresponds to a logic unit (1) (Fig. 2.7).

Fig. 2.7
figure 7

DF of simulated samples of the length of OFF-(a) and ON (b). Periods: curve 1 experiment, curve 2 simulation

Analysis of the quality of received video sequence when simulating Markov model of error packetization shows that the average quality of video sequences is slightly worse than during transmission over a real network. The subjective MOS quality indicator also shows a difference in values: a real WiMAX network returned a mean value of 3.59 (corresponding to satisfactory), while the experiments returned values of 2.72 (corresponding to poor) and 3.01 (corresponding to satisfactory), respectively. The average quality of video sequences when simulating Markov model packetization of errors are similar to those obtained when simulating single packet errors with PER index in the range of 3 × 10−3 to 1 × 10−2. While the length of error group depending on the PER of the specified range attain values of BEL ≤ 10.

2.7 Conclusions

We have presented in this paper the modeling, simulation and analysis of errors inherent in video streams over wireless broadband access networks, by presenting results of investigating the effect of single bit error and bit packet errors on the quality of H.264/AVC standard bursty video streams. A software-hardware composite system that was developed specifically for this purpose was employed in the investigation. Analysis of simulation results led to conclusions and postulations discussed in detail in sections VI A through E for BER, PER, BEL, relationship between PER and BEL, and the effect of error packetization phenomenon respectively.