GB2327021A - Speech coding - Google Patents
Speech coding Download PDFInfo
- Publication number
- GB2327021A GB2327021A GB9713814A GB9713814A GB2327021A GB 2327021 A GB2327021 A GB 2327021A GB 9713814 A GB9713814 A GB 9713814A GB 9713814 A GB9713814 A GB 9713814A GB 2327021 A GB2327021 A GB 2327021A
- Authority
- GB
- United Kingdom
- Prior art keywords
- coefficients
- speech
- coding
- coder
- lpc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
Abstract
A CELP speech coder estimates the LPC coefficient values on a continuously adaptive basis, for example using the LMS algorithm. An LPC analysis circuit 4 which may be used in a conventional speech coder (fig.1 not shown) has samples of input speech signal applied to a processor 6 and a predictor 8. The input speech samples are subtractively combined at 10 to obtain a residual value which is also applied to the processor 6 and the processor outputs the LPC coefficients.
Description
SPEECH CODING
TECHNICAL FIELD OF THE INVENTION
This invention relates to a speech coder, for example for use in a radio telecommunications system, such as a mobile telephone system. In particular, the invention relates to the coder itself and to a method of coding a speech signal.
DESCRIPTION OF RELATED ART
Radio communications devices, in particular those used in digital communications systems, typically include a codec, comprising a coder for coding speech signals into a digital signal for transmission and a decoder for retrieving speech signals from received digital signals.
A conventional speech codec is disclosed in the
TIA/EIA standard "TDMA Cellular/PCS - Radio Interface
Enhanced Full Rate Speech Codec", published under number TIA/EIA/IS-641, May 1996. That document discloses a CELP encoder, which includes a linear predictive coding (LPC) analysis module, in which one part of the coding takes place. In this module, a frame of speech samples, for example lasting 20ms, is processed. A window is applied, covering the frame itself, some samples from the last frame, and some samples from a future frame, and the autocorrelation of the signal is estimated as a measure of the variation of the signal with time. From this estimated autocorrelation, the LPC coefficients, which are used to represent the speech signal, are estimated from an algorithm known as the Levinson-Durbin algorithm, which obtains filter coefficients from the autocorrelations by an iterative process. A new set of LPC coefficients is estimated in this way for every frame.
The LPC coefficients, which are estimated in this way, are then used to encode these speech signals, and are themselves quantized and transmitted to the receiver, for use in decoding the received signal. The estimation of the LPC coefficients is optimised by attempting to minimise the power of a residual signal, namely the difference between the signal received at the coder and the signal as coded, that is the error made by the prediction.
A potential problem which arises from this method of estimating the LPC coefficients is that, because the coefficients are calculated only once per frame, there can be significant discontinuities between consecutive values of the coefficients, associated with consecutive frames of the speech signal.
SUMMARY OF THE INVENTION
The present invention seeks to overcome this potential disadvantage of the conventional coder, by estimating the LPC coefficients for each sample of the signal. Because the coefficients are estimated more often, this can result in coefficients which are more continuous, or "smoother" with time, and which therefore allow the signal to be encoded more accurately.
In particular, there is proposed the use of the
LMS (Least Mean Square) algorithm for estimating and coding the speech spectrum. This algorithm is relatively simple to implement, and its properties have been thoroughly investigated due to its use in Adaptive
Differential Pulse Code Modulation (ADPCM) waveform coding, where it is used for a different purpose, namely to minimise the signal that is applied for quantization.
Aspects of the invention relate to the speech coder, and to the method of encoding the speech signal.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 is a block schematic diagram of an encoder in a CELP transmitter.
Figure 2 is a block schematic diagram of an LPCanalysis module forming part of a speech coder in accordance with the invention.
DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS
Figure 1 is a block schematic diagram of a generally conventional code excited linear prediction (CELP) encoder modified in accordance with the invention.
The purpose of Figure 1 is to give a general overview of a CELP encoder, the nature of which is familiar to a person skilled in the art. In Figure 1, the input X is the true speech signal, x(n) is the predicted speech signal, i iopt is the transmitted code book index, (b,N) ops are the transmitted pitch gain and lag(period), and a is the set of LPC coefficients. In view of the well-known form of the CELP encoder, Figure 1 is not considered to need further explanation.
Figure 2 is a block schematic diagram of a part of the speech coder relevant to the present invention.
Coder 2 is the CELP codec, in which the speech spectrum is estimated and coded by a procedure known as Linear
Predictive Coding. The coder 2 is largely conventional, but differs from a conventional device in the form of the LPC-analysis module 4, which is illustrated in more detail. The module 4 receives the same input signals as a LPC-analysis module in a conventional CELP coder, and produces output signals in the same format, which means that the other modules in the CELP encoder, and their interfaces to the LPCanalysis module 4, can be unchanged.
The LPC-analysis module 4 receives the sampled speech signal x(n), which it supplies to a processor 6 and a predictor 8. The processor 6 produces coefficients ai(n), using an algorithm described in more detail below, and these coefficients are used by the predictor 8 to obtain a predicted value x(n) of the signal x(n), according to the equation below:
where ai are the LPC coefficients, n is the sample index and N is the predictor order, that is the number of coefficients which are used to characterise the signal.
The input signal x(n), and the predicted value x(n) thereof, are supplied to a subtractor 10, to obtain a residual value e(n), which represents the error in the prediction, according to the equation:
e(n) = x(n) - (n) The residual e(n) is also supplied to the processor 6, which derives the next set of LPC coefficients using the LMS algorithm: ai(n + 1) = a1(n) + A e(n) x(n - i) in which d is the step size parameter determining the amount by which the coefficients are changed in each iteration.
The properties of this algorithm are well-known in themselves.
The LPC coefficients ai(n) are then output from the analysis module 4. Whenever LPC coefficients are to be used in the analysis-by-synthesis loop in the speech coder, or transmitted to the receiver, they are extracted from the LMS algorithm in the LPC module.
Thus, since coefficients are transmitted to the receiver at the same rate as in a conventional device, there is no additional overhead. However, by estimating the LPC coefficients in such a way that they are continuously adapted from the speech signal, or adapted on a sample basis, for example using the LMS algorithm in this illustrated embodiment, consecutive values of the coefficients will have smaller discontinuities, which will mean that the speech spectrum can be followed more accurately over time.
This should result in less degradation in the synthesized speech spectrum.
Claims (11)
1. A speech coder, comprising a linear predictive coder using linear predictive coding coefficients, characterized in that the linear predictive coding coefficients are derived using an algorithm which is adapted on a sample basis.
2. A speech coder as claimed in claim 1, characterized in that the linear predictive coding coefficients are derived using the LMS algorithm.
3. A speech coder as claimed in claim 1 or 2, comprising means for coding a speech signal using the derived coefficients, and means for quantizing the coefficients for transmission.
4. A CELP speech coder, in which the LPC coefficients are estimated on a sample basis.
5. A CELP speech coder as claimed in claim 4, in which the LPC coefficients are estimated using the LMS algorithm.
6. A mobile communications device, including a speech coder as claimed in any preceding claim.
7. A method of coding a speech signal, the method comprising using a linear predictive coder using linear predictive coding coefficients, characterized in that the linear predictive coding coefficients are derived using an algorithm which is adapted on a sample basis.
8. A method of coding a speech signal as claimed in claim 7, characterized in that the linear predictive coding coefficients are derived using the LMS algorithm.
9. A method of coding a speech signal as claimed in claim 7 or 8, comprising coding the speech signal using the derived coefficients, and quantizing the coefficients for transmission.
10. A method of coding a speech signal using a
CELP speech coder, in which the LPC coefficients are estimated on a sample basis.
11. A method of coding a speech signal using a
CELP speech coder as claimed in claim 10, in which the
LPC coefficients are estimated using the LMS algorithm.
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9713814A GB2327021A (en) | 1997-06-30 | 1997-06-30 | Speech coding |
PCT/EP1998/003932 WO1999001866A1 (en) | 1997-06-30 | 1998-06-26 | Speech coding |
AU83395/98A AU8339598A (en) | 1997-06-30 | 1998-06-26 | Speech coding |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GB9713814A GB2327021A (en) | 1997-06-30 | 1997-06-30 | Speech coding |
Publications (2)
Publication Number | Publication Date |
---|---|
GB9713814D0 GB9713814D0 (en) | 1997-09-03 |
GB2327021A true GB2327021A (en) | 1999-01-06 |
Family
ID=10815163
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB9713814A Withdrawn GB2327021A (en) | 1997-06-30 | 1997-06-30 | Speech coding |
Country Status (3)
Country | Link |
---|---|
AU (1) | AU8339598A (en) |
GB (1) | GB2327021A (en) |
WO (1) | WO1999001866A1 (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2133255A (en) * | 1982-12-23 | 1984-07-18 | Standard Telephones Cables Ltd | Secure speech transmission system |
US4961160A (en) * | 1987-04-30 | 1990-10-02 | Oki Electric Industry Co., Ltd. | Linear predictive coding analysing apparatus and bandlimiting circuit therefor |
EP0500076A2 (en) * | 1991-02-19 | 1992-08-26 | Nec Corporation | Method and arrangement of determining coefficients for linear predictive coding |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2026289B (en) * | 1978-04-12 | 1982-04-21 | Secr Defence | Self-adaptive linear prediction filters |
-
1997
- 1997-06-30 GB GB9713814A patent/GB2327021A/en not_active Withdrawn
-
1998
- 1998-06-26 AU AU83395/98A patent/AU8339598A/en not_active Abandoned
- 1998-06-26 WO PCT/EP1998/003932 patent/WO1999001866A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2133255A (en) * | 1982-12-23 | 1984-07-18 | Standard Telephones Cables Ltd | Secure speech transmission system |
US4961160A (en) * | 1987-04-30 | 1990-10-02 | Oki Electric Industry Co., Ltd. | Linear predictive coding analysing apparatus and bandlimiting circuit therefor |
EP0500076A2 (en) * | 1991-02-19 | 1992-08-26 | Nec Corporation | Method and arrangement of determining coefficients for linear predictive coding |
Also Published As
Publication number | Publication date |
---|---|
WO1999001866A1 (en) | 1999-01-14 |
AU8339598A (en) | 1999-01-25 |
GB9713814D0 (en) | 1997-09-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0628947B1 (en) | Method and device for speech signal pitch period estimation and classification in digital speech coders | |
US4811396A (en) | Speech coding system | |
EP1062661B1 (en) | Speech coding | |
KR100545774B1 (en) | How to Improve Coding Efficiency of Audio Signals | |
EP0785541B1 (en) | Usage of voice activity detection for efficient coding of speech | |
CA2061830C (en) | Speech coding system | |
US4726037A (en) | Predictive communication system filtering arrangement | |
JPH0850500A (en) | Voice encoder and voice decoder as well as voice coding method and voice encoding method | |
KR20010087391A (en) | Synthesis of speech from pitch prototype waveforms by time-synchronous waveform interpolation | |
KR100421648B1 (en) | An adaptive criterion for speech coding | |
CA2216315C (en) | Predictive split-matrix quantization of spectral parameters for efficient coding of speech | |
CA2156558C (en) | Speech-coding parameter sequence reconstruction by classification and contour inventory | |
EP1073039A2 (en) | Speech decoder with gain processing | |
US5313554A (en) | Backward gain adaptation method in code excited linear prediction coders | |
CA2293165A1 (en) | Method for transmitting data in wireless speech channels | |
Mumolo et al. | Adaptive predictive coding of speech by means of Volterra predictors | |
US6801887B1 (en) | Speech coding exploiting the power ratio of different speech signal components | |
US5822722A (en) | Wide-band signal encoder | |
GB2327021A (en) | Speech coding | |
US5708756A (en) | Low delay, middle bit rate speech coder | |
JPS6134697B2 (en) | ||
Ojala et al. | Variable model order LPC quantization | |
JP3330178B2 (en) | Audio encoding device and audio decoding device | |
KR0138878B1 (en) | Method for reducing the pitch detection time of vocoder | |
KR100220783B1 (en) | Speech quantization and error correction method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WAP | Application withdrawn, taken to be withdrawn or refused ** after publication under section 16(1) |