EP1619665A1 - Voice coding apparatus and method using PLP in mobile communications terminal - Google Patents

Voice coding apparatus and method using PLP in mobile communications terminal Download PDF

Info

Publication number
EP1619665A1
EP1619665A1 EP05015989A EP05015989A EP1619665A1 EP 1619665 A1 EP1619665 A1 EP 1619665A1 EP 05015989 A EP05015989 A EP 05015989A EP 05015989 A EP05015989 A EP 05015989A EP 1619665 A1 EP1619665 A1 EP 1619665A1
Authority
EP
European Patent Office
Prior art keywords
signal
plp
coefficient
input signal
voiced
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP05015989A
Other languages
German (de)
French (fr)
Other versions
EP1619665B1 (en
Inventor
Chan-Woo Kim
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LG Electronics Inc
Original Assignee
LG Electronics Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LG Electronics Inc filed Critical LG Electronics Inc
Publication of EP1619665A1 publication Critical patent/EP1619665A1/en
Application granted granted Critical
Publication of EP1619665B1 publication Critical patent/EP1619665B1/en
Not-in-force legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/06Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Definitions

  • the present invention relates to a coding of a mobile communications terminal, and particularly; to a voice coding apparatus and method using a Perceptual Linear Prediction (PLP).
  • PLP Perceptual Linear Prediction
  • mobile communications terminals have provided data communications using numbers, characters, symbols, and the like, and multimedia communications including various image signals as well as voice communications.
  • a plurality of terminal users receive radio channels allocated thereto from a system and transmit and receive required data using radio resources.
  • the radio channels have limited bandwidths in order for the plurality of users to use the radio channels at the same time, and accordingly a data bit rate of each user is deservedly limited.
  • a speech coding using a generic audio coding, a Pulse Code Modulation (PCM), and an Adaptive Delta Pulse Code Modulation(ADPCM) are effectively used at a high-bit rate over 16Kbps, and a Code Excited Linear Prediction (CELP) and other various variations are effectively used at a medium-bit rate at a range of 2.4Kbps to 16Kbps.
  • a coding method using LD-CELP, CS-ACELP, VSELP and MELP and a wideband speech coding can be used at the medium-bit rate.
  • LPC Linear Predictive Coding
  • RELP Residual Excited Linear Predictive
  • Cepstral vocoder have many advantages at a low-bit rate at a range of 75bps to 2.4Kbps.
  • Fig. 1 illustrates a structure of the related art LPC encoder.
  • the related art LPC encoder includes: a correlator 10 for calculating an autocorrelation value r x [n] of an input signal x[n]; an LP coefficient calculator 11for calculating an LP coefficient a L and a gain G by processing the autocorrelation value r x [n]; a V/UV determining unit 12 for determining whether the input signal x[n] is a voiced V signal or a unvoiced UV signal; a pitch calculator 13 for calculating a pitch P of the corresponding signal when the input signal x[n] is the voice V signal; a parameter coding unit 14 for outputting a bit stream by coding the LP coefficient an, the gain G and the pitch P received from the LP coefficient calculator 11 and the pitch calculator 13 according to a V/UV indication bit outputted from the V/UV determining unit 12.
  • the correlator 10 autocorrelates an input signal x[n].
  • the LP coefficient calculator 11 processes an autocorrelation value r x [n] calculated by the correlator 10 so as to calculate an LP coefficient a n and a gain G.
  • the V/UV determining unit 12 determines whether the input signal x[n] is a voiced V signal or a unvoiced UV signal to output a V/UV indication bit, and then outputs only the voiced V signal.
  • the pitch calculator 13 calculates a pitch P of the voiced V signal which is outputted from the V/UV determining unit 12.
  • the parameter coding unit 14 outputs a bit stream by coding (encoding by a low-bit rate) the LP coefficient a n , the gain G, and the pitch P received from the LP coefficient calculator 11 and the pitch calculator 13.
  • a controller processes the bit stream to thusly output it to a radio (wireless) unit (not shown).
  • the radio unit converts the signal outputted from the control unit into a radio (wireless) signal and transmits the converted radio signal.
  • a mobile communications terminal performs the LPC coding to transmit an audio signal by a low-bit rate.
  • a linear predication coefficient is generally used, which does not consider human auditory sensing features. Therefore, for the related art LPC coding operated using the low-bit rate, a compression efficiency is not very high (i.e., 1200Kbps to 2400Kbps) and good sound quality can not be obtained.
  • an object of the present invention is to provide a voice coding apparatus and method of a mobile communications terminal capable of improving compression efficiency and sound quality by performing an LPC coding using a PLP coefficient.
  • a Linear Predictive Coding (LPC) encoder of a mobile communications terminal comprising: a Perceptual Linear Prediction (PLP) coefficient calculator for calculating a PLP coefficient and a gain by processing an input signal; a V/UV determining unit for determining whether the input signal is a voiced signal or a unvoiced signal, and thusly outputting the determination signal and the voiced signal when the input signal is the voiced signal; a pitch calculator for calculating a pitch of the input signal outputted from the V/UV determining unit; and a parameter coding unit for performing a low-bit rate coding using the PLP coefficient, the gain, and the pitch on the basis of the determination signal.
  • PLP Perceptual Linear Prediction
  • a low-bit rate voice coding method of a mobile communications terminal comprising: calculating a Perceptual Linear Prediction (PLP) coefficient and a gain by processing an input signal; determining whether the input signal is a voiced signal and a unvoiced signal, and thereby outputting a determination bit value and the voiced signal when the input signal is determined as the voiced signal; calculating a pitch of the input signal outputted from a V/UV determining unit; and performing a low-bit rate coding using the PLP coefficient, the gain and the pitch on the basis of the determination bit value.
  • PLP Perceptual Linear Prediction
  • the voiced signal is a speech signal.
  • the PLP coefficient has about a 7 th degree for a 8 kHz sampling rate.
  • the present invention provides a low-bit rate voice coding using a Perceptual Linear Prediction (PLP) capable of performing a coding of a degree (an order) lower than that of a Linear Predictive Coding (LPC) in order to perform a voice coding having high compressibility.
  • PLP Perceptual Linear Prediction
  • LPC Linear Predictive Coding
  • the LP is classically well-known, so that a detailed derived formula therefor will not be described.
  • the LP basically refers to obtaining a LP coefficient a k so that a Mean Squared Error (MSE), namely, a value of e[n] can be a minimum value according to Formula (1) as follows.
  • MSE Mean Squared Error
  • the obtained LP coefficient a k has about 8 th to 12 th degrees (orders) for a 8 kHz sampling rate. Therefore, the obtained LP coefficient a k is used for various coding methods (e.g., LPC, CELP, MELP, RELP, etc) using a Linear Prediction (LP), which is disclosed in more detail in Speech coding and synthesis, Amsterdam, the Netherlands: Elsevier, 1995.
  • LPC Linear Prediction
  • the PLP was introduced on a paper of Hermansky in 1990 for the first time.
  • the PLP uses human auditory sensing features similar to the existing Mel-Frequency Cepstral Coefficient (MFCC). Therefore, the present invention performs a low-bit rate voice coding using the PLP coefficient in stead of using the LP coefficient upon performing the LPC for a low-bit rate.
  • MFCC Mel-Frequency Cepstral Coefficient
  • the present invention obtains spectrum using the PLP coefficient.
  • the PLP coefficient reflects a human auditory effect. Accordingly, in aspect of the MSE, a greater error may occur in the spectrum using the PLP coefficient than using the LP. However, the spectrum using the PLP coefficient may have a less error when considering the auditory effect. Also, for coefficient transmissions, in case of LPC, for a typical 8kHz sampling rate, transmissions of about a 10 th degree (order) are used, but for PLP, transmissions of about a 7 th degree (order) are used, thus the bit rate can be lowered.
  • Fig. 2 illustrates a construction of an LPC encoder using the PLP coefficient according to the present invention.
  • an LPC encoder using the PLP coefficient is constructed as same as the related art LPC encoder shown in Fig. 1, except of which the correlator 10 is not included and a PLP coefficient calculator 20 replaces the LP coefficient calculator 11.
  • the PLP coefficient calculator 20 processes a speech signal S[n] to calculate a PLP coefficient a P and a gain G in which the auditory effect is considered.
  • the PLP coefficient calculator 20 receives the speech signal S[n], so as to calculate the PLP coefficient a P and the gain G by sequentially performing operations shown in Fig. 3.
  • the PLP coefficient calculator 20 performs a fast Fourier transform (FFT) of the input signal, namely, the speech signal S[n].
  • FFT fast Fourier transform
  • a critical-bank integration and resampling processing is performed for the Fourier-transformed speech signal to thusly remove noise components from the speech signal S[n] by a frequency unit.
  • the PLP coefficient calculator 20 performs equalizing and loudness processing of the Fourier-transformed speech signal into sound components having magnitudes appropriate for human auditory sensing, and then the speech signal is matched with an output power to allow listening by humans.
  • the PLP coefficient calculator 20 When the power matching is completed, the PLP coefficient calculator 20 performs an inverse discrete Fourier transform of the corresponding speech signal to thereafter obtain a set of Linear equations from the corresponding speech signal. Therefore, the PLP coefficient calculator 20 performs a Cepstral Recursion processing for the set of Linear equations, and thus outputs Cepstral Coefficients of a PLP model, namely, the PLP coefficients ap. In other words, the PLP coefficient calculator 20 outputs to the parameter coding unit 23 a low degree (order) of the PLP coefficients a P and a gain G reflecting the human auditory sensing features as parameter values.
  • the V/UV determining unit 21 outputs a V/UV Indication bit and transfers the speech signal S[n] to the pitch calculator 22.
  • the pitch calculator 22 calculates a pitch P of the speech signal S[n].
  • the parameter coding unit 23 outputs a bit stream by coding (encoding by a low-bit rate) the V/UV Indication bit value, the PLP coefficient a P , the gain G and the pitch P received from the PLP coefficient calculator 20 and the pitch calculator 22.
  • a degree of the transmitted PLP coefficient a P is about a 7 th degree for a 8 kHz sampling rate.
  • a controller processes the bit stream and then outputs the processed bit stream to a radio (wireless) unit (not shown).
  • the radio unit converts the signal outputted from the controller into a radio signal (wireless signal) and transmits it.
  • the LPC is performed by using the PLP coefficient, and thus a compressibility can be improved and voice-grade signal can be transmitted by a more efficient low-bit rate.
  • a higher compressibility can be realized and a quality of signal with high sound quality can be expected by using the PLP coefficient as a parameter rather than using the existing LP coefficient.
  • the voice coding apparatus and method according to the present invention can be used for coding and decoding voice using a low-bit rate, or be used for a device which takes up a small area and performs a voice synthesis using PLP parameters.
  • the voice coding apparatus and method according to the present invention can be used for a speech coding for an application as much as a voice itself is not very important but enough to hear. Also, an effective voice conversation can be performed on the Internet which stores data by a high compressibility or requires a low-bit rate in an embedded system with a limited memory.

Abstract

A voice coding apparatus and method of a mobile communications terminal can embody higher compressibility and ensure high sound quality, compared with the case of using a Linear Prediction (LP) coefficient, by performing a Linear Predictive Coding (LPC) using a Perceptual Linear Prediction (PLP) coefficient.

Description

    BACKGROUND OF THE INVENTION 1. Field of the Invention
  • The present invention relates to a coding of a mobile communications terminal, and particularly; to a voice coding apparatus and method using a Perceptual Linear Prediction (PLP).
  • 2. Background of the Related Art
  • As mobile communication techniques are developed, mobile communications terminals have provided data communications using numbers, characters, symbols, and the like, and multimedia communications including various image signals as well as voice communications. A plurality of terminal users receive radio channels allocated thereto from a system and transmit and receive required data using radio resources. However, the radio channels have limited bandwidths in order for the plurality of users to use the radio channels at the same time, and accordingly a data bit rate of each user is deservedly limited.
  • Therefore, a coding technique has been proposed for transmitting a greater amount of data using above limited data bit rate. Various methods exist as the related art voice coding technique, each of which has several advantages at a certain bit rate.
  • For instance, a speech coding using a generic audio coding, a Pulse Code Modulation (PCM), and an Adaptive Delta Pulse Code Modulation(ADPCM) are effectively used at a high-bit rate over 16Kbps, and a Code Excited Linear Prediction (CELP) and other various variations are effectively used at a medium-bit rate at a range of 2.4Kbps to 16Kbps. In particular, a coding method using LD-CELP, CS-ACELP, VSELP and MELP and a wideband speech coding can be used at the medium-bit rate. Also, a Linear Predictive Coding (LPC), Residual Excited Linear Predictive (RELP), formants vocoder and Cepstral vocoder have many advantages at a low-bit rate at a range of 75bps to 2.4Kbps.
  • Thus, in the related art and the present invention, a method for improving the LPC among coding methods used at the low-bit rate will now be explained.
  • Fig. 1 illustrates a structure of the related art LPC encoder.
  • As illustrated in the drawing, the related art LPC encoder includes: a correlator 10 for calculating an autocorrelation value rx[n] of an input signal x[n]; an LP coefficient calculator 11for calculating an LP coefficient aL and a gain G by processing the autocorrelation value rx[n]; a V/UV determining unit 12 for determining whether the input signal x[n] is a voiced V signal or a unvoiced UV signal; a pitch calculator 13 for calculating a pitch P of the corresponding signal when the input signal x[n] is the voice V signal; a parameter coding unit 14 for outputting a bit stream by coding the LP coefficient an, the gain G and the pitch P received from the LP coefficient calculator 11 and the pitch calculator 13 according to a V/UV indication bit outputted from the V/UV determining unit 12.
  • An operation of the related art LPC encoder having such construction will now be explained.
  • First, the correlator 10 autocorrelates an input signal x[n]. The LP coefficient calculator 11 processes an autocorrelation value rx[n] calculated by the correlator 10 so as to calculate an LP coefficient an and a gain G. At this time, the V/UV determining unit 12 determines whether the input signal x[n] is a voiced V signal or a unvoiced UV signal to output a V/UV indication bit, and then outputs only the voiced V signal. The pitch calculator 13 calculates a pitch P of the voiced V signal which is outputted from the V/UV determining unit 12.
  • Accordingly, when the V/UV indication bit indicates the voiced V signal, the parameter coding unit 14 outputs a bit stream by coding (encoding by a low-bit rate) the LP coefficient an, the gain G, and the pitch P received from the LP coefficient calculator 11 and the pitch calculator 13. Afterwards, a controller (not shown) processes the bit stream to thusly output it to a radio (wireless) unit (not shown). The radio unit converts the signal outputted from the control unit into a radio (wireless) signal and transmits the converted radio signal.
  • Thus, in the related art, a mobile communications terminal performs the LPC coding to transmit an audio signal by a low-bit rate. However, in the related art LPC coding, a linear predication coefficient is generally used, which does not consider human auditory sensing features. Therefore, for the related art LPC coding operated using the low-bit rate, a compression efficiency is not very high (i.e., 1200Kbps to 2400Kbps) and good sound quality can not be obtained.
  • SUMMARY OF THE INVENTION
  • Therefore, an object of the present invention is to provide a voice coding apparatus and method of a mobile communications terminal capable of improving compression efficiency and sound quality by performing an LPC coding using a PLP coefficient.
  • To achieve these and other advantages and in accordance with the purpose of the present invention, as embodied and broadly described herein, there is provided a Linear Predictive Coding (LPC) encoder of a mobile communications terminal comprising: a Perceptual Linear Prediction (PLP) coefficient calculator for calculating a PLP coefficient and a gain by processing an input signal; a V/UV determining unit for determining whether the input signal is a voiced signal or a unvoiced signal, and thusly outputting the determination signal and the voiced signal when the input signal is the voiced signal; a pitch calculator for calculating a pitch of the input signal outputted from the V/UV determining unit; and a parameter coding unit for performing a low-bit rate coding using the PLP coefficient, the gain, and the pitch on the basis of the determination signal.
  • To achieve these and other advantages and in accordance with the purpose of the present invention, as embodied and broadly described herein, there is provided a low-bit rate voice coding method of a mobile communications terminal comprising: calculating a Perceptual Linear Prediction (PLP) coefficient and a gain by processing an input signal; determining whether the input signal is a voiced signal and a unvoiced signal, and thereby outputting a determination bit value and the voiced signal when the input signal is determined as the voiced signal; calculating a pitch of the input signal outputted from a V/UV determining unit; and performing a low-bit rate coding using the PLP coefficient, the gain and the pitch on the basis of the determination bit value.
  • Preferably, the voiced signal is a speech signal.
  • Preferably, the PLP coefficient has about a 7th degree for a 8 kHz sampling rate.
  • The foregoing and other objects, features, aspects and advantages of the present invention will become more apparent from the following detailed description of the present invention when taken in conjunction with the accompanying drawings.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention.
  • In the drawings:
    • Fig. 1 illustrates a structure of a related art LPC encoder using an LP coefficient;
    • Fig. 2 illustrates an LPC encoder using a PLP coefficient according to the present invention; and
    • Fig. 3 illustrates sequential steps, in detail, of calculating a PLP coefficient in Fig. 2.
    DETAILED DESCRIPTION OF THE INVENTION
  • Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
  • The present invention provides a low-bit rate voice coding using a Perceptual Linear Prediction (PLP) capable of performing a coding of a degree (an order) lower than that of a Linear Predictive Coding (LPC) in order to perform a voice coding having high compressibility.
  • First, a difference between the PLP and the LP will now be explained.
  • The LP is classically well-known, so that a detailed derived formula therefor will not be described. The LP basically refers to obtaining a LP coefficient ak so that a Mean Squared Error (MSE), namely, a value of e[n] can be a minimum value according to Formula (1) as follows. e ̲ [ n ] = x ̲ [ n ] - x ^ ̲ [ n ] = k = 0 N pred a k x ̲ [ n - k ]
    Figure imgb0001
  • The obtained LP coefficient a k has about 8th to 12th degrees (orders) for a 8 kHz sampling rate. Therefore, the obtained LP coefficient a k is used for various coding methods (e.g., LPC, CELP, MELP, RELP, etc) using a Linear Prediction (LP), which is disclosed in more detail in Speech coding and synthesis, Amsterdam, the Netherlands: Elsevier, 1995.
  • The PLP was introduced on a paper of Hermansky in 1990 for the first time. The PLP uses human auditory sensing features similar to the existing Mel-Frequency Cepstral Coefficient (MFCC). Therefore, the present invention performs a low-bit rate voice coding using the PLP coefficient in stead of using the LP coefficient upon performing the LPC for a low-bit rate.
  • That is, the present invention obtains spectrum using the PLP coefficient. The PLP coefficient reflects a human auditory effect. Accordingly, in aspect of the MSE, a greater error may occur in the spectrum using the PLP coefficient than using the LP. However, the spectrum using the PLP coefficient may have a less error when considering the auditory effect. Also, for coefficient transmissions, in case of LPC, for a typical 8kHz sampling rate, transmissions of about a 10th degree (order) are used, but for PLP, transmissions of about a 7th degree (order) are used, thus the bit rate can be lowered.
  • Fig. 2 illustrates a construction of an LPC encoder using the PLP coefficient according to the present invention.
  • Referring to the Fig. 2, an LPC encoder using the PLP coefficient is constructed as same as the related art LPC encoder shown in Fig. 1, except of which the correlator 10 is not included and a PLP coefficient calculator 20 replaces the LP coefficient calculator 11.
  • The PLP coefficient calculator 20 processes a speech signal S[n] to calculate a PLP coefficient aP and a gain G in which the auditory effect is considered.
  • An operation of the LPC encoder using the PLP coefficient having such construction according to the present invention will now be explained with reference to the accompanying drawing.
  • First, the PLP coefficient calculator 20 receives the speech signal S[n], so as to calculate the PLP coefficient aP and the gain G by sequentially performing operations shown in Fig. 3.
  • That is, the PLP coefficient calculator 20 performs a fast Fourier transform (FFT) of the input signal, namely, the speech signal S[n]. A critical-bank integration and resampling processing is performed for the Fourier-transformed speech signal to thusly remove noise components from the speech signal S[n] by a frequency unit.
  • Once removing the noise components, the PLP coefficient calculator 20 performs equalizing and loudness processing of the Fourier-transformed speech signal into sound components having magnitudes appropriate for human auditory sensing, and then the speech signal is matched with an output power to allow listening by humans.
  • When the power matching is completed, the PLP coefficient calculator 20 performs an inverse discrete Fourier transform of the corresponding speech signal to thereafter obtain a set of Linear equations from the corresponding speech signal. Therefore, the PLP coefficient calculator 20 performs a Cepstral Recursion processing for the set of Linear equations, and thus outputs Cepstral Coefficients of a PLP model, namely, the PLP coefficients ap. In other words, the PLP coefficient calculator 20 outputs to the parameter coding unit 23 a low degree (order) of the PLP coefficients aP and a gain G reflecting the human auditory sensing features as parameter values.
  • At this time, the V/UV determining unit 21 outputs a V/UV Indication bit and transfers the speech signal S[n] to the pitch calculator 22. The pitch calculator 22 calculates a pitch P of the speech signal S[n].
  • Accordingly, the parameter coding unit 23 outputs a bit stream by coding (encoding by a low-bit rate) the V/UV Indication bit value, the PLP coefficient aP, the gain G and the pitch P received from the PLP coefficient calculator 20 and the pitch calculator 22. Preferably, a degree of the transmitted PLP coefficient aP is about a 7th degree for a 8 kHz sampling rate. Afterwards, a controller (not shown) processes the bit stream and then outputs the processed bit stream to a radio (wireless) unit (not shown). The radio unit converts the signal outputted from the controller into a radio signal (wireless signal) and transmits it.
  • As described above, in the present invention, the LPC is performed by using the PLP coefficient, and thus a compressibility can be improved and voice-grade signal can be transmitted by a more efficient low-bit rate.
  • In addition, in the present invention, a higher compressibility can be realized and a quality of signal with high sound quality can be expected by using the PLP coefficient as a parameter rather than using the existing LP coefficient.
  • Therefore, the voice coding apparatus and method according to the present invention can be used for coding and decoding voice using a low-bit rate, or be used for a device which takes up a small area and performs a voice synthesis using PLP parameters.
  • Furthermore, the voice coding apparatus and method according to the present invention can be used for a speech coding for an application as much as a voice itself is not very important but enough to hear. Also, an effective voice conversation can be performed on the Internet which stores data by a high compressibility or requires a low-bit rate in an embedded system with a limited memory.
  • As the present invention may be embodied in several forms without departing from the spirit or essential characteristics thereof, it should also be understood that the above-described embodiments are not limited by any of the details of the foregoing description, unless otherwise specified, but rather should be construed broadly within its spirit and scope as defined in the appended claims, and therefore all changes and modifications that fall within the metes and bounds of the claims, or equivalence of such metes and bounds are therefore intended to be embraced by the appended claims.

Claims (8)

  1. A voice coding apparatus in a mobile communications terminal comprising:
    a Perceptual Linear Prediction (PLP) coefficient calculator for calculating a PLP coefficient and a gain by processing an input signal;
    a V/UV determining unit for determining whether the input signal is a voiced signal or a unvoiced signal, and thus outputting a determination results and the voiced signal when the input signal is the voiced signal;
    a pitch calculator for calculating a pitch of the input signal outputted from the V/UV determining unit; and
    a parameter coding unit for performing a low-bit rate coding using the PLP coefficient, the gain, and the pitch on the basis of the determination results.
  2. The apparatus of claim 1, wherein the voiced signal is a speech signal.
  3. The apparatus of claim 1, wherein the determination results denotes a bit value for whether the input signal is the voiced signal or the unvoiced signal.
  4. The apparatus of claim 1, wherein a degree of the PLP coefficient is about a 7th degree for a 8 kHz sampling rate.
  5. A voice coding method of a mobile communications terminal comprising:
    calculating a Perceptual Linear Prediction (PLP) coefficient and a gain by processing an input signal;
    determining whether the input signal is a voiced signal and a unvoiced signal, and thereby outputting the determination signal and the voiced signal when the input signal is determined as the voiced signal;
    calculating a pitch of the input signal outputted from a V/UV determining unit; and
    performing a low-bit rate coding using the PLP coefficient, the gain and the pitch on the basis of the determination signal.
  6. The method of claim 5, wherein the voiced signal is a speech signal.
  7. The method of claim 5, wherein the step of calculating the PLP coefficient and the gain comprises:
    performing a fast Fourier transform (FFT) for the input signal;
    performing a critical-bank integration and resampling of the Fourier transformed speech signal to thus remove noise components by a frequency unit;
    performs equalizing and loudness processing of the Fourier-transformed speech signal into sound components having magnitudes appropriate for human auditory sensing, and then matching the speech signal with an appropriate output power;
    performing an inverse discrete Fourier transform of the speech signal matched with the output power, and thereby obtaining a set of linear equations; and
    performing a ceptstral recursion processing for the set of linear equations, and
    thereby obtaining a PLP coefficient and a gain.
  8. The method of claim 5, wherein a degree of the PLP coefficient is about a 7th degree for a 8 kHz sampling rate.
EP05015989A 2004-07-23 2005-07-22 Voice coding apparatus and method using PLP in mobile communications terminal Not-in-force EP1619665B1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
KR1020040057739A KR100619893B1 (en) 2004-07-23 2004-07-23 A method and a apparatus of advanced low bit rate linear prediction coding with plp coefficient for mobile phone

Publications (2)

Publication Number Publication Date
EP1619665A1 true EP1619665A1 (en) 2006-01-25
EP1619665B1 EP1619665B1 (en) 2010-09-08

Family

ID=36080675

Family Applications (1)

Application Number Title Priority Date Filing Date
EP05015989A Not-in-force EP1619665B1 (en) 2004-07-23 2005-07-22 Voice coding apparatus and method using PLP in mobile communications terminal

Country Status (6)

Country Link
EP (1) EP1619665B1 (en)
JP (1) JP2006039559A (en)
KR (1) KR100619893B1 (en)
CN (1) CN1737904A (en)
AT (1) ATE480852T1 (en)
DE (1) DE602005023385D1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101475724B1 (en) * 2008-06-09 2014-12-30 삼성전자주식회사 Audio signal quality enhancement apparatus and method
KR20110001130A (en) * 2009-06-29 2011-01-06 삼성전자주식회사 Apparatus and method for encoding and decoding audio signals using weighted linear prediction transform
WO2023112226A1 (en) * 2021-12-15 2023-06-22 株式会社Peco Remote medical examination system for animal subject

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1199812A1 (en) * 2000-10-20 2002-04-24 Telefonaktiebolaget Lm Ericsson Perceptually improved encoding of acoustic signals
US20040128130A1 (en) * 2000-10-02 2004-07-01 Kenneth Rose Perceptual harmonic cepstral coefficients as the front-end for speech recognition

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040128130A1 (en) * 2000-10-02 2004-07-01 Kenneth Rose Perceptual harmonic cepstral coefficients as the front-end for speech recognition
EP1199812A1 (en) * 2000-10-20 2002-04-24 Telefonaktiebolaget Lm Ericsson Perceptually improved encoding of acoustic signals

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"SPEECH CODING AND SYNTHESIS", 1995, ELSEVIER, article "Linear Prediction (LP), which is disclosed in more detail"
GUNAWAN W ET AL: "PLP coefficients can be quantized at 400 bps", 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING. PROCEEDINGS. (ICASSP). SALT LAKE CITY, UT, MAY 7 - 11, 2001, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), NEW YORK, NY : IEEE, US, vol. VOL. 1 OF 6, 7 May 2001 (2001-05-07), pages 77 - 80, XP010803089, ISBN: 0-7803-7041-4 *

Also Published As

Publication number Publication date
KR100619893B1 (en) 2006-09-19
ATE480852T1 (en) 2010-09-15
KR20060008078A (en) 2006-01-26
CN1737904A (en) 2006-02-22
EP1619665B1 (en) 2010-09-08
JP2006039559A (en) 2006-02-09
DE602005023385D1 (en) 2010-10-21

Similar Documents

Publication Publication Date Title
US20060025991A1 (en) Voice coding apparatus and method using PLP in mobile communications terminal
US8463599B2 (en) Bandwidth extension method and apparatus for a modified discrete cosine transform audio coder
EP1328928B1 (en) Apparatus for bandwidth expansion of a speech signal
US8942988B2 (en) Efficient temporal envelope coding approach by prediction between low band signal and high band signal
US9653088B2 (en) Systems, methods, and apparatus for signal encoding using pitch-regularizing and non-pitch-regularizing coding
JP4302978B2 (en) Pseudo high-bandwidth signal estimation system for speech codec
US10141001B2 (en) Systems, methods, apparatus, and computer-readable media for adaptive formant sharpening in linear prediction coding
US20100169082A1 (en) Enhancing Receiver Intelligibility in Voice Communication Devices
EP3457402A1 (en) Signal processing method and device adaptive to noise environment and terminal device employing same
US7603271B2 (en) Speech coding apparatus with perceptual weighting and method therefor
EP1619665B1 (en) Voice coding apparatus and method using PLP in mobile communications terminal
EP3281197B1 (en) Audio encoder and method for encoding an audio signal
EP2617034B1 (en) Determining pitch cycle energy and scaling an excitation signal
US20030055633A1 (en) Method and device for coding speech in analysis-by-synthesis speech coders
Sun et al. Speech compression

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA HR MK YU

17P Request for examination filed

Effective date: 20060705

AKX Designation fees paid

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20060901

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: LG ELECTRONICS, INC.

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: LG ELECTRONICS, INC.

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 602005023385

Country of ref document: DE

Date of ref document: 20101021

Kind code of ref document: P

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101209

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110110

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110108

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101219

Ref country code: BE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20110609

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602005023385

Country of ref document: DE

Effective date: 20110609

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: IE

Ref legal event code: MM4A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101208

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20100908

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20150615

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20150612

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20150612

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20150710

Year of fee payment: 11

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 602005023385

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20160722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170201

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20160801

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20170331

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20160722

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20160722