Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution

Roy, Sangita; Gupta, Dola B.; Chaudhuri, Sheli Sinha; Banerjee, P. K.

doi:10.1007/978-81-322-1817-3_2

Sangita Roy⁴,
Dola B. Gupta⁴,
Sheli Sinha Chaudhuri⁵ &
…
P. K. Banerjee⁵

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 298))

1032 Accesses
2 Citations

Abstract

In the last 40 years a number of coding techniques for analog sources (speech and images) has been employed. Subband coding, a kind of transform coding, splits analog speech signal into a number of different smaller frequency bands. By subbanding data rate has been reduced to 12.13804 Kbps [Sangita et al. Studies and implementation of subband coder and decoder of speech signal, Proceedings of national conference on electronics, communication and signal processing, 8–16, 1] on 64 Kbps telephone line. In this paper a method has been proposed by which data rate has been reduced to 9.4875 Kbps using Rayleigh distribution where data rate can be reduced to 9.6 Kbps [Crochiere et al. Digital coding of speech in subbands, The BELL System Technical Journal, 2]. Proposed method can save data rate which in turn saves bandwidth as well as spectrum. Moreover this proposed method provides acceptable probability of error and quantization noise i.e. SNR.

Access provided by Autonomous University of Puebla. Download conference paper PDF

A Novel Design Approach of Subband Coder and Decoder of Speech Signal Using Log Normal Probability Distribution

Bandwidth extension of telephone speech using magnitude spectrum data hiding

Article 13 January 2017

Speech Bandwidth Extension Using Data Hiding Based on Discrete Hartley Transform Domain

Article 18 November 2021

Keywords

1 Introduction

Sub-band coding (SBC) is a kind of transform coding [3, 4]. A signal is divided into a number of different frequency bands and encodes each one independently. It enables a data compression by discarding information about frequencies which are masked. The result differs from the original signal, but if the discarded information is chosen carefully, the difference will not be noticeable, or more importantly, objectionable [5–7]. A paper—“A low-complexity audio data compression technique using subband coding (SBC) and a recursively indexed quantizer (RIQ)” compared SBC and RIQ to conventional coding techniques with SNR 2–5 dB higher than that of other coders of similar computational complexity of wideband audio signals [8]. The basic concept of “Frequency Domain Coding of Speech” methods is to divide the speech into frequency components by a filter bank (sub-band coding), or by a suitable transform (transform coding), and then encode them using adaptive PCM. Recent developments and examples of the “Vocoder-driven” adaptive transform coder for low bit-rate applications is also discussed [9]. In “Subband Coding of Speech Signals Using Decimation and Interpolation”—a structure of a two-channel quadrature mirror filter with low pass filter, high pass filter, decimators and interpolators, is proposed to perform subband coding of speech signals in the digital domain. The results show that the proposed structure significantly reduces the error and achieves considerable performance improvement compared to delta-modulation encoding systems [10]. Subband coder reduces and controls quantization noise. Here bit allocation on each subband is done on perceptual criterion. So that quality of the coded signal is improved over the full spectrum coding. Computer simulated data provides 16 and 9.6 Kbps over 64 Kbps data rate [2].

2 Basic Idea of the System

Figure 2.1a is an example of Power Spectral Density (PSD) of an Voice Signal. Here, Voice signal has been considered to be restricted to 3.5 kHz only. Power Spectral Density to be in watt/HZ or dB. In this figure frequency axis is divided into number of subbands (say 0–f1, f1–f2, f2–f3, f3–f4, etc.). The frequency band (0–f₁) is base band signal whereas (f₁–f₂), (f₂–f₃), (f₃–f₄), etc. are band pass signals. Each band will translated to baseband by multiplying with lowest frequency component of the said subband. Here seven subbands have been considered (Fig. 2.1b).

Transmitter consists of one LPF and six BPFs. All BPFs outputs are multiplied by the lowest frequency component of those bands at the multiplier block. Then outputs are PCM and then added by summer. Finally the summed output is put into channel.

At the receiver signals are decoded by seven decoders. Then each signal is passed through LPF of cut-off frequency f₁, f₂–f₁, f₃–f₂ etc. From second to seventh signal outputs are multiplied by their respective lowest frequency components and then passed through BPFs of f₂–f₁, f₃–f₂ etc. Then the outputs are summed up to get the replica of the original signal. Data rate from the above signal is reduced from 64 to 19.5 Kbps [1]. Data rate can be further reduced, if Power spectral density of voice signal is multiplied by the probabilities of occurrences which is 12.13804 Kbps lower than 19.5 Kbps by using MATLAB Simulation [1]. Figure 2.1c shows SNR, quantization noise produced out of subbanding.

3 Proposed Scheme

3.1 Case I: Subbanding Data Rate 12.0128 Kbps

Matlab simulation of subband data rate [1] has been reduced to 12.0128 Kbps where existing telephone line data rate 64 Kbps. Figures 2.2a, 2.2b, 2.2c, 2.2d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error of subbands respectively [11].

3.2 Case II: Subbanding Data Rate 11.0959 Kbps

By changing the bit allocation by perceptual criterion data rate has been reduced to 11.0959 Kbps from 12.0128 Kbps. Figure 2.3a, 2.3b, 2.3c, 2.3d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error of subbands respectively.

3.3 Case III: Subbanding Data Rate 10.0494 Kbps

By changing the bit allocation by perceptual criterion data rate has been reduced further to 10.9494 from 11.0959 Kbps. Figures 2.4a, 2.4b, 2.4c, 2.4d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error of subbands respectively.

3.4 Case IV: Subbanding Data Rate 10.7175 Kbps

By changing the bit allocation by perceptual criterion data rate has been reduced to 10.7175 Kbps from 10.9494 Kbps. Figures 2.5a, 2.5b, 2.5c, 2.5d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error respectively of subbands.

3.5 Case V: Subbanding Data Rate 10.4867 Kbps

By changing the bit allocation by perceptual criterion data rate has been reduced to 10.4867 Kbps from 10.7175 Kbps. Figures 2.6a, 2.6b, 2.6c, 2.6d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error respectively of subbands.

3.6 Case VI: Subbanding Data Rate 9.4875 Kbps Using Rayleigh Distribution

Changing the bit allocation by perceptual criterion and Rayleigh distribution Power Spectral Density, data rate has been reduced to 9.4875 Kbps from 10.4867 Kbps which is lowest in this study [12]. Figures 2.7a, 2.7b, 2.7c, 2.7d have been shown as MATLAB simulation of data rate, cumulative data rate, SNR and probability of bit error respectively of subbands.

The above studied cases are tabulated in Table 2.1.

Table 2.1 Subband coding data rate comparative table

Full size table

4 Conclusion and Future Work

The new proposed method reduces data rate by using Rayleigh distribution. Several bit combinations by perceptual criterion have been taken. Result of those combinations have been tabulated in Table 2.1. It is clear that all the cases reduces the data rate drastically without losing signal to noise ratio criterion as well as probability of bit error. Rayleigh distribution outperforms all the other cases. In future authors would like to optimize those schemes by different algorithms.

References

Sangita R, Gupta DB, Banerjee PK (2012) Studies and implementation of subband coder and decoder of speech signal. In: Proceedings of national conference on electronics, communication and signal processing (NCECS 2012), 19 Sept 2012, pp 8–16
Google Scholar
Crochiere RE, Webber SA, Flanagan JN (1976) Digital coding of speech in subbands. The BELL Syst Technical Journal, October 1976
Google Scholar
Knutson PG,Ramaswamy K, Richardson JW (2001) Subband ADPCM voice encoding and decoding. PCT/US2000/034410, July 2001
Google Scholar
Szczutkowski CF (1986) Subband encoding method and apparatus. EP 0178608 A2, April 1986
Google Scholar
Proakis JG (2001) Digital communications, 4th edn. Mcgraw-Hill, New York
Google Scholar
Leon W, Couch II (1995) Modern communication systems principles and applications. Prentice-Hall, New Jersey
MATH Google Scholar
Taub H, Schilling DL (1986) Principles of communication systems, 2nd edn. Tata Mcgraw-Hill, Noida
Google Scholar
Chen Y-J, Maher RC (1995) Sub-band coding of audio using recursively indexed quantization, Department of Electrical Engineering and Center for Communication and Information Science, University of Nebraska–Lincoln
Google Scholar
Tribolet JM, Crochiere RE (1979) Frequency domain coding of speech. IEEE Trans Acoust Speech Signal Process 27(5):512
Article Google Scholar
Aziz AM (2009) Subband coding of speech signals using decimation and interpolation. In: 13th international conference on aerospace sciences and aviation technology, ASAT-13, 26–28 May 2009, Military Technical College, Kobry Elkobbah
Google Scholar
Roy S (2011) Studies and implementation of subband coder and decoder of speech signal. M-Tech (Communication Engineering) Thesis, WBUT
Google Scholar
Rivet B, Girin L, Jutten C (2007) Log-Rayleigh distribution: a simple and efficient statistical representation of log-spectral coefficients. IEEE Trans Audio Speech Lang Process 15(3):796
Article Google Scholar

Download references

Author information

Authors and Affiliations

ECE Department, Narula Institute of Technology, WBUT, Kolkata, India
Sangita Roy & Dola B. Gupta
ETCE Department, Jadavpur University, Kolkata, India
Sheli Sinha Chaudhuri & P. K. Banerjee

Authors

Sangita Roy
View author publications
You can also search for this author in PubMed Google Scholar
Dola B. Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Sheli Sinha Chaudhuri
View author publications
You can also search for this author in PubMed Google Scholar
P. K. Banerjee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Sangita Roy , Dola B. Gupta , Sheli Sinha Chaudhuri or P. K. Banerjee .

Editor information

Editors and Affiliations

Department of Information Technology, BP Poddar Institute of Management and Technology, Kolkata, West Bengal, India
Sabnam Sengupta
Department of Information Technology, BP Poddar Institute of Management and Technology, Kolkata, West Bengal, India
Kunal Das
Department of Information Technology, BP Poddar Institute of Management and Technology, Kolkata, West Bengal, India
Gitosree Khan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Roy, S., Gupta, D.B., Chaudhuri, S.S., Banerjee, P.K. (2014). Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution. In: Sengupta, S., Das, K., Khan, G. (eds) Emerging Trends in Computing and Communication. Lecture Notes in Electrical Engineering, vol 298. Springer, New Delhi. https://doi.org/10.1007/978-81-322-1817-3_2

Download citation

DOI: https://doi.org/10.1007/978-81-322-1817-3_2
Published: 25 February 2014
Publisher Name: Springer, New Delhi
Print ISBN: 978-81-322-1816-6
Online ISBN: 978-81-322-1817-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics

Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution

Abstract

Similar content being viewed by others

A Novel Design Approach of Subband Coder and Decoder of Speech Signal Using Log Normal Probability Distribution

Bandwidth extension of telephone speech using magnitude spectrum data hiding

Speech Bandwidth Extension Using Data Hiding Based on Discrete Hartley Transform Domain

Keywords

1 Introduction

2 Basic Idea of the System

3 Proposed Scheme

3.1 Case I: Subbanding Data Rate 12.0128 Kbps

3.2 Case II: Subbanding Data Rate 11.0959 Kbps

3.3 Case III: Subbanding Data Rate 10.0494 Kbps

3.4 Case IV: Subbanding Data Rate 10.7175 Kbps

3.5 Case V: Subbanding Data Rate 10.4867 Kbps

3.6 Case VI: Subbanding Data Rate 9.4875 Kbps Using Rayleigh Distribution

4 Conclusion and Future Work

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Studies and Implementation of Subband Coder and Decoder of Speech Signal Using Rayleigh Distribution

Abstract

Similar content being viewed by others

A Novel Design Approach of Subband Coder and Decoder of Speech Signal Using Log Normal Probability Distribution

Bandwidth extension of telephone speech using magnitude spectrum data hiding

Speech Bandwidth Extension Using Data Hiding Based on Discrete Hartley Transform Domain

Keywords

1 Introduction

2 Basic Idea of the System

3 Proposed Scheme

3.1 Case I: Subbanding Data Rate 12.0128 Kbps

3.2 Case II: Subbanding Data Rate 11.0959 Kbps

3.3 Case III: Subbanding Data Rate 10.0494 Kbps

3.4 Case IV: Subbanding Data Rate 10.7175 Kbps

3.5 Case V: Subbanding Data Rate 10.4867 Kbps

3.6 Case VI: Subbanding Data Rate 9.4875 Kbps Using Rayleigh Distribution

4 Conclusion and Future Work

References

Author information

Authors and Affiliations

Corresponding authors

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation