Stereo Video Transmission Using LDPC Code

Abstract

Stereo video is widely used because it can provide depth information. However, it is difficult to store and transmit stereo video due to the huge data amount. So, high efficient channel encoding algorithm and proper transmission strategy is needed to deal with the video transmission over limited bandwidth channel. In this paper, unequal error protection (UEP) based on low density parity check (LDPC) code was used to transmit stereo video over wireless channel with limited bandwidth. Different correction level LDPC code was used according to the importance of video stream to reconstruction at the receiver. Simulation result shows that the proposed transmission scheme increases the PSNR of reconstructed image, and improves the subjective effect.

Share and Cite:

R. GUO, L. WANG and X. JIANG, "Stereo Video Transmission Using LDPC Code," International Journal of Communications, Network and System Sciences, Vol. 1 No. 3, 2008, pp. 254-259. doi: 10.4236/ijcns.2008.13031.

Stereo video is widely used because it can provide depth information. However, it is difficult to store and transmit stereo video due to the huge data amount. So, high efficient channel encoding algorithm and proper transmission strategy is needed to deal with the video transmission over limited bandwidth channel. In this paper, unequal error protection (UEP) based on low density parity check (LDPC) code was used to transmit stereo video over wireless channel with limited bandwidth. Different correction level LDPC code was used according to the importance of video stream to reconstruction at the receiver. Simulation result shows that the proposed transmission scheme increases the PSNR of reconstructed image, and improves the subjective effect.

1. Introduction

Stereo video is widely used because it can provide depth information in stereoscopic television, video conference, remote control, telemedicine and other fields [1,2]. However, it is difficult to store and transmit stereo video due to the huge data amount. So, high efficient channel encoding algorithm and proper transmission strategy is needed to deal with the video transmission. In practical channel, especially in wireless channel with limited bandwidth, signal errors will inevitably appear at the receiver as a consequence of channel fading, multipath, noise and so on. Stereo video is highly compressed data stream, and is very sensitive to error, which will decrease the quality of reconstructed image at the receiver. Therefore, high efficient error correction technology and transmission control strategy is needed [3]. Recently, with the deep research in LDPC, more and more people focus on the video communication based on LDPC code [4,5].

In this paper, we mainly focus on the stereo video communication on AWGN channel with limited bandwidth using UEP scheme based on LDPC encoding. According to the different contribution to image reconstruction at the receiver, the video stream is divided into different parts [6], then UEP is used to protect the different bits stream in different level. The structure is shown in Figure 1.

2.  Bits Stream Structure and Data Partition of Stereo Video

Conceptually, the structure of H.264 encoder is divided into two layers: Video Coding Layer (VCL) and Network Abstraction Layer (NAL). VCL provides high performance function in video compression, including common definitions of video compression, block, macroblock, sub-graph layer and so on. NAL is responsible for network abstraction, which provides different adaptive capacity for different networks and transmitting package with proper mode. NAL works in two kinds of mode: Single Slice mode and Data Partition mode. When using Data Partition mode, H.264 puts all variable length codes with the same data type together in each frame [6].

Head information includes head information, macroblock type, frame type, predicted residual of motion vectors, frame flag etc. In H.264, this part is called A segmentation, which is the most important part. Intra-frame segmentation is called B segmentation. It loads the coding mode and the correlation coefficient in intra frame blocks. B segmentation works under the effect of A segmentation. Compared with information of inter-frame information segmentation, intra-frame information can prevent further drift, thus it is more effective than inter-frame segmentation. Inter-frame segmentation is called C segmentation. It only includes the coding mode and the correlation coefficient in

Figure 1. The diagram of stereo video communication based on UEP.

inter-frame blocks, it is the biggest segmentation in the video stream. Inter-frame segmentation is relatively subordinate, because it won’t provide synchronous information in encoding or decoding.

Here, we use stereo video compression encoding based on H.264, the output bit stream has the same structure as that of H.264 encoder. Compared with the bit stream of H.264 encoder, the stereo video adds the disparity information of different video channel, such as disparity vector, disparity prediction model and predictive residual etc. Disparity vector and motion vector have the same important level. Once disparity vector goes wrong, the data of assisted video will be affected. So in this paper, we regard the slice head information, types of encoding frame, types of disparity prediction macroblock and disparity vector among different video channel as the same importance as the motion vector in one channel. Here, we divide the bit stream of stereo video into two parts: A segmentation and B segmentation .

A segmentation includes head information, MB-Type, Reference frame, motion vector prediction difference, disparity vector prediction difference, frame ending flag, the encoding mode and the correlation coefficient in intra frame, etc.

B segmentation includes the encoding mode and  correlation coefficient in inter-frame.

We realize stereo video encoder based on H.264/AVC encoder platform. The parameters are listed in Table 1. The outputs of the encoder are two binary data

Table 1. The parameter of video encoder.

file—test.264 and coding pursuit file—trace.txt [7,8]. We divide the bit stream according to these files.

The divided stereo video bit stream will be stored separately in file A and B. We adopt high-level protection to A bit stream because of its higher importance. For the B bit stream, we use low-level protection for the sake of encoding efficiency. At the receiver, if A segmentation data is lost, B will be abandoned. If part of B segmentation is lost, the head information still can be used to improve the effect of error concealment [9,10].

3.  Realization UEP Based on Different Rates Irregular LDPC Codes

3.1 The Principles of Realizing UEP Based on Different Rates Irregular LDPC Codes

The key point of unequal error protection is: under the condition of limited bandwidth and the premise of prior protection to the important parts. We properly allocate the redundancy to the source and channel so as to reduce end-to-end distortion [11].

Assuming the total bit rate (the total bandwidth) is, the parts of source and channel isand respectively. Suppose, the source is divided into A segmentation and B segmentation, so:

(1)

In formula (1):, represent the bits of A and B segmentation used in source encoding respectively., represent the bits of A and B bit stream used in the channel encoding respectively. Assuming the bit rate of channel encoding corresponding to A segmentation and B segmentation are,. Then, we can get the following formula:

, (2)

So, the total transform is:

(3)

Thus the question converts to: with the limited bandwidth and supposed data priority, the total distortion of the video at the receiver is minimized:

(4)

Subject to the following constraint conditions:

(5)

where, represents the total channel bandwidth, andrepresents the total distortion of source and channel, which is calculated by MSE. The relationship between PSNR and MSE is given by:

By adjusting the parameters of channel encoding and bit rate allocating between source encoding and channel reasonably, we can get the optimal reconstructed image at receiver.

UEP needs to allocate bit-rate between source and channel, under the condition that the total bandwidth is decided. According to adjusting the parameters of source encoding or the bit rate of channel encoding , the situation will be different.

3.2 The Performance of UEP When the Source Rate is Decided

When the bit rate of source encoding is pre-assigned, UEP can be realized by just adjusting the bit rate of channel .In this paper, we compared the properties of UEP scheme and equal error protection (EEP) scheme (or) in the same bandwidth. When the total bandwidth is decided, and the bit rate of source is pre-assigned (QP is unchangeable), that is, in the formula (1), and are determined. From, we can know that the bits allocated to channel is also determined. UEP can be realized by adjusting the bit rate of channel encoding with different grade importance, that is by choosing, to determine and, so as to add different redundancy to information with different importance.

After analyzing the segmentations of stereo video bit stream, we found that the ratio of A segmentation and B segmentation is about 1:3 in Race 1 sequence. In order to keep the total bandwidth unchanged, the parameters of error correcting code are shown in the Table 2.

We use irregular LDPC code based on IeIRA permutation matrix with three different encoding rate in the experiment. They are 1/4, 1/2 and 3/4, respectively. The code length of LDPC is 4064 bits, using BP decoding, the maximum iteration number is 80.The length of Turbo code is 3568 bits. When using UEP scheme, the LDPC code with 1/4 rate is used to protect the important data (the bit stream from A group), the LDPC code with 3/4 rate is used to protect the less important data(the bit stream from B group). If A segmentation is lost, B segmentation will be abandoned at the receiver.

In the experiment, we test the properties of UEP, EEP based on LDPC code and Turbo code in the AWGN channel with limited bandwidth by using BPSK modulation. Test sequences–Race1 sequences from Japanese KDDI lab are used. The image size is 320×240.

At first, we give the BER performance of 3 kinds of code in AWGN channel in Figure 2. Clearly, not only the properties of irregular LDPC code based on IeIRA permutation matrix are better than Turbo code with the same length, but also error floors is lower than Turbo code. Figure 3 gives the differences of PSNR properties of image reconstructed by different protection scheme in the case of fixed source rate. (QP is 30 in stereo video compression encoding). Figure 4 shows the reconstructed 7th frame main and assistant video by different encoding strategy at the receiver when SNR is 2.1dB.

Figure 3 and Figure 4 show that the UEP scheme based on LDPC code is always better than the other two schemes based on EEP. This is because we can reduce the error probability of important information by adding more redundancy in the important part of video stream. With the improvement of channel condition(increase of signal-to-noise ratio), this advantage is decreased (Eb/No from 1.8 to 2.4, the advantage of main video stream based on LDPC UEP relative to LDPC EEP is reduced from 2.1 dB to 0.8 dB). What’s more, because of the excellent performance of LDPC code, the performance of EEP scheme based on LDPC code is better than that based on Turbo code.

Table 2. The parameters of various channel coding.

Figure 2. Performance of different channel code.

Figure 3. PSNR Compare of main, assistant video stream in different transmission scheme.

Figure 4 shows the reconstructed 7th frame at the receiver. It is clear that Figure 4(a) is better than Figure 4(b) and Figure 4(c), and Figure 4(b) is a little bit better than Figure 4(c). On condition that, it ensure the important data of A segmentation is fully protected, so we can get better subjective effect by using UEP scheme.

Conflicts of Interest

The authors declare no conflicts of interest.

Copyright © 2024 by authors and Scientific Research Publishing Inc.

Creative Commons License

This work and the related PDF file are licensed under a Creative Commons Attribution 4.0 International License.