WO1995021443A1 - Burst excited linear prediction - Google Patents
Burst excited linear prediction Download PDFInfo
- Publication number
- WO1995021443A1 WO1995021443A1 PCT/US1995/001341 US9501341W WO9521443A1 WO 1995021443 A1 WO1995021443 A1 WO 1995021443A1 US 9501341 W US9501341 W US 9501341W WO 9521443 A1 WO9521443 A1 WO 9521443A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- burst
- waveform
- shape
- accordance
- gain
- Prior art date
Links
- 238000000034 method Methods 0.000 claims abstract description 33
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 15
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 15
- 230000007774 longterm Effects 0.000 claims abstract description 11
- 238000001914 filtration Methods 0.000 claims description 10
- 238000001514 detection method Methods 0.000 claims 1
- 230000002123 temporal effect Effects 0.000 abstract description 3
- 230000005284 excitation Effects 0.000 description 33
- 239000013598 vector Substances 0.000 description 21
- 230000005540 biological transmission Effects 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 230000001755 vocal effect Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 210000001260 vocal cord Anatomy 0.000 description 2
- 229930091051 Arenine Natural products 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000010845 search algorithm Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
Definitions
- the present invention relates to speech processing. More particularly, the present invention relates to a novel and improved method and apparatus for performing linear predictive speech coding using burst excitation vectors.
- vocoders Devices which employ techniques to compress voiced speech by extracting parameters that relate to a model of human speech generation are typically called vocoders. Such devices are composed of an encoder, which analyzes the incoming speech to extract the relevant parameters, and a decoder, which resynthesizes the speech using the parameters which it receives over the transmission channel.
- the model is constantly changes to accurately model the time varying speech signal. Thus the speech is divided into blocks of time, or analysis frames, during which the parameters are calculated. The parameters are then updated for each new frame.
- the Code Excited Linear Predictive Coding (CELP), Stochastic Coding, or Vector Excited Speech Coding coders are of one class.
- An example of a coding algorithm of this particular class is described in the paper "A 4.8 kbps Code Excited Linear Predictive Coder” by Thomas E. Tremain et al., Proceedings of the Mobile Satellite Conference, 1988.
- examples of other vocoders of this type are detailed in copending patent application Serial No. 08/004,484, filed January 14, 1993, entitled “Variable Rate Vocoder” and assigned to the assignee of the present invention, and U.S. Patent No. 4,797,925, entitled “Method For Coding Speech At Low Bit Rates”.
- the function of the vocoder is to compress the digitized speech signal into a low bit rate signal by removing all of the natural redundancies inherent in speech.
- Speech typically has short term redundancies due primarily to the filtering operation of the vocal tract, and long term redundancies due to the excitation of the vocal tract by the vocal cords.
- these operations are modeled by two filters, a short term formant (LPC) filter and a long term pitch filter. Once these redundancies are removed, the resulting residual signal can be modeled as white Gaussian noise, which also must be encoded.
- the process of determining the coding parameters for a given frame of speech is as follows. First, the parameters of the LPC filter are determined by finding the filter coefficients which remove the short term redundancy, due to the vocal tract filtering, in the speech. Second, the parameters of the pitch filter are determined by finding the filter coefficients which remove the long term redundancy, due to the vocal cords, in the speech.' Finally, an excitation signal, which is input to the pitch and LPC filters at the decoder, is chosen by driving the pitch and LPC filters with a number of random excitation waveforms in a codebook, and selecting the particular excitation waveform which causes the output of the two filters to be the closest approximation to the original speech.
- the transmitted parameters relate to three items (1) the LPC filter, (2) the pitch filter, and (3) the codebook excitation.
- CELP coders One shortcoming of CELP coders is the use of random excitation vectors.
- the use of the random excitation vectors fails to take into account the burst like nature of the ideal excitation waveform, which remains after the short-term and long-term redundancies have been removed from the speech signal.
- Unstructured random vectors are not particularly well suited for encoding the burst like residual excitation signal, and result in an inefficient method for coding the residual excitation signal.
- the present invention is a novel and improved method and apparatus for encoding the residual excitation signal which takes into account the burst like nature of such signal.
- the present invention encodes the bursts of large energy in the excitation signal with a burst excitation vector, rather than encoding the entire excitation signal with a random excitation vector.
- the candidate burst waveforms are characterized by a burst shape, a burst gain and burst location. This set of three burst parameters determines an excitation waveform, which is used to drive the LPC and pitch filters so that the output of the filter pair is a close approximation to the target speech signal.
- a method and apparatus for providing more than one set of burst parameters which produces an improved approximation to the target speech signal.
- a set of burst parameters corresponding to one burst is found which results in a minimal difference between the filtered burst waveform and the target speech waveform.
- the waveform produced by filtering this burst by the LPC and pitch filter pair is then subtracted from the target signal, and a subsequent search for a second set of burst parameters is conducted using the new, updated target signal. This iterative procedure is repeated as often as desired to match the target waveform precisely.
- a first method and apparatus which performs the burst excitation search in a closed loop fashion. That is, when the target signal is known, an exhaustive search of all burst shapes, burst gains and burst locations is conducted, with the optimum combination determined by selecting the shape, gain, and location which result in the best match between the filtered burst excitation and the target signal. Alternatively, the number of computations may be reduced by conducting a suboptimal search over only a subset of any of the three parameters.
- a partially open loop method wherein the number of parameters to be searched is greatly reduced by analyzing the residual excitation signal, identifying the locations of greatest energy, and using those locations as the locations of the excitation bursts.
- a single location is identified as described above, a burst gain and shape are identified for the given burst location, the filtered burst signal is subtracted from the target signal, and the residual excitation signal corresponding to the remaining target signal is again analyzed to find a subsequent burst location.
- a plurality of burst locations is first identified by analyzing the residual excitation waveform, and the burst gains and shapes are then determined for the burst locations as described in the first method.
- the first method entails providing a recursive burst set wherein each succeeding burst shape may be derived for its predecessor by removing one or more elements from the beginning of the previous shape sequence and adding one or more elements to the end of the previous shape sequence.
- Another method entails providing a burst set wherein a succeeding burst shape is formed using a linear combination of previous bursts.
- Figure la-c is an illustration of a set of three waveforms
- Figure la is uncoded speech
- Figure lb is speech with short term redundancy removed
- Figure lc is speech with short term and long term speech redundancies removed, also known as the ideal residual excitation waveform
- Figure 2 is a block diagram illustrating the closed loop search mechanism
- Figure 3 is a block diagram illustrating the partially open loop search mechanism.
- Figures la-c illustrate three waveforms with time on the horizontal axis and amplitude on the vertical axis.
- Figure la illustrates a typical example of an uncoded speech signal waveform.
- Figure lb illustrates the same speech signal as Figure la with the short term redundancy removed by means of a formant (LPC) prediction filter.
- the short term redundancy in speech is typically removed by computing a set of autocorrelation coefficients for a speech frame and determining from the autocorrelation coefficients a set of linear prediction coding (LPC) coefficients by techniques that are well known in the art.
- LPC linear prediction coding
- the LPC coefficients may be obtained by the autocorrelation method using Durbin's recursion as discussed in Digital Processing of Speech Signals, Rabiner & Schafer, Prentice-Hall, Inc., 1978. Methods for determining the tap values of the LPC filters are also described in the aforementioned patent application and patent. These LPC coefficients determine a set of tap values for the formant (LPC) filter.
- LPC formant
- Figure lc illustrates the same speech samples as Figure la, but with both short term and long term temporal redundancies removed.
- the short term redundancies are removed as described above and then the residual speech is the filtered by a pitch prediction filter to remove long term temporal redundancies in the speech, the implementation of which is well known in the art.
- the long term redundancies are removed by comparing the current speech frame with a history of previously coded speech. The coder identifies a set of samples from the previous coded excitation signal which, when filtered by the LPC filter, is a best match to the current speech signal.
- This set of samples is specified by a pitch lag, which specifies the number of samples to look backward in time to find the excitation signal which produces the best match, and a pitch gain, which is a multiplicative factor to apply to the set of samples.
- FIG. lc A typical example of the resulting waveform, referred to as the residual excitation waveform, is illustrated in Figure lc.
- the large energy components in the residual excitation waveform typically occur in bursts, which are marked by arrows 1, 2 and 3 in Figure lc.
- the modeling of this target waveform has been accomplished in previous work by seeking to match the entire residual excitation waveform to a random vector in a vector codebook.
- the coder seeks to match the residual excitation waveform with a plurality of burst vectors, thus more closely approximating the large energy segments in the residual excitation waveform.
- FIG. 2 illustrates an exemplary implementation of the present invention.
- the search for the optimum burst shape (B), burst gain (G) and burst location (1) is determined in a closed loop form.
- the input speech frame, s(n) is provided to the summing input of summing element 2.
- each speech frame consists of forty speech samples.
- the optimum pitch lag L * and pitch gain b * determined previously in a pitch search operation is provided to pitch synthesis filter 4.
- the output of pitch synthesis filter 4 provided in accordance with optimum pitch lag L and pitch gain b * is provided to LPC filter 6.
- LPC formant
- LPC memoryless formant
- the tap values of filters 6, 8 and 12 are determined in accordance with these LPC coefficients.
- the output of formant (LPC) synthesis filter 6 is provided to the subtracting input of summing element 2.
- the error signal computed in summing element 2 is provided to perceptual weighting filter 8.
- Perceptual weighting filter 8 filters the signal and provides its output, the target signal, x(n), to the summing input of summing element 18.
- Element 9 exhaustively provides candidate waveforms to the subtracting input of summing element 18.
- Each candidate waveform is identified by a burst shape index value, i, a burst gain, G, and a burst location, 1.
- each candidate waveform consists of forty samples.
- Burst element 10 is provided with a burst shape index value i, in response to which burst element 10 provides a burst vector, Bi, of a predetermined number of samples.
- each of the burst vectors are nine samples long.
- Each burst vector is provided to memoryless formant (LPC) synthesis filter 12 which filters the input burst vector in accordance with the LPC coefficients.
- LPC memoryless formant
- the second input to multiplier 14 is the burst gain values G.
- the gain values can be of a predetermined set of values or can be determined adaptively from characteristics of past and present input speech frames. For each burst vector, all gain values G are exhaustively tested to determine the optimal gain value, or the optimal unquantized gain value for a particular value of 1 and i can be determined using methods known in the art, with the chosen value of G quantized to the nearest of the sixteen different gain values after the search.
- the product from multiplier 14 is provided to variable delay element 16.
- Variable delay element 16 also receives a burst location value, 1 and positions the burst vector within the candidate waveform frame in accordance with the value of 1. If a candidate waveform frame consists of L samples, then the maximum number of locations to be tested is:
- a subset of the number of possible burst locations can be chosen to reduce the resulting data rate. For example, it is possible only to allow a burst to begin at every other sample location. Testing a subset of burst locations will reduce complexity, but will result in a suboptimal coding which in some cases may reduce the resulting speech quality.
- the candidate waveform, wi / G,l(n) is provided to the subtracting input of summing element 18.
- the difference between the target waveform and the candidate waveform is provided to energy computation element 20.
- Energy computation element 20 sums the squares of the members of the weighted error vector in accordance with equation 2 below:
- Minimization element 22 compares each minimum energy value found thus far to the current energy value. If the energy value provided to minimization element 22 is less than the current minimum, the current energy value is stored in minimization element 22 and the current burst shape, burst gain, and burst position values are also stored. After all allowable burst shapes, burst positions, and burst locations have been searched, the best match candidate B * , G * and 1 * are provided by minimization element 22.
- a candidate waveform may consist of more than one burst.
- a first search is conducted and a the best match waveform is identified.
- the best match waveform is then subtracted from the target signal and additional searches are conducted. This process may be repeated for as many bursts as desired.
- it may be desirable to restrict the burst location search so that a previously selected burst location cannot be selected more than once. It has been noticed in noisy speech that burst like noise has a different audible character than random noise. By restricting the bursts to be spaced apart from one another, the resulting excitation signal is closer to random noise and may be perceived as more natural in some circumstances.
- a second partially open loop search may be conducted.
- the apparatus by which the partially open loop search is conducted is illustrated in Figure 3.
- the locations of the burst are determined using an open loop technique, and subsequently the burst shapes and gains are determined in the closed loop fashion described previously.
- the input speech frame, s(n) is provided to the summing input of summing element 30.
- the optimum pitch lag L and pitch gain b * determined previously in a pitch search operation are provided to pitch synthesis filter 32.
- the output of pitch synthesis filter 32 provided in accordance with optimum pitch lag L * and pitch gain b is provided to format (LPC) synthesis filter 34.
- LPC formant
- the output of formant (LPC) synthesis filter 34 is provided to the subtracting input of summing element 30.
- the error signal computed in summing element 30 is provided to all-zeroes perceptual weighting filter 36. All-zeroes perceptual weighting filter 36 filters the signal and provides its output, r(n), to the input of all-poles perceptual weighting filter 37. All- poles perceptual weighting filter 37 outputs the target signal x(n) to the summing input of summing element 48.
- the output of all-zeroes perceptual weighting filer 36, r(n), is also provided to peak detector 54, which analyzes the signal and identifies the location of the largest energy burst in the signal.
- peak detector 54 analyzes the signal and identifies the location of the largest energy burst in the signal. The equation by which the burst location 1 is found is:
- burst element 38 is provided with a burst index value i, in response to which burst element 38 provides burst vector, Bi.
- Bi is provided to memoryless weighted LPC filter 42 which filter the input burst vector in accordance with the LPC coefficients.
- the output of memoryless weighted LPC filter 42 is provided to one input of multiplier 44.
- the second input to multiplier 44 is the burst gain values G.
- the output of multiplier 44 is provided to burst location element 46 which, in accordance with the burst location value 1, positions the burst within the candidate frame.
- the candidate waveforms are subtracted from the target signal in summing element 48.
- the differences are then provided to energy computation element 50 which computes the energy of the error signal as described previously herein.
- the computed energy values are provided to minimization element 52, which as described above detects the minimum error energy and provides the identification parameters B * , G * and 1.
- a multiple burst partially open-loop searches can be done by identifying a first best match waveform, subtracting the unfiltered best match waveform from the output of all-zeroes perceptual weighting filter 36, r(n), and determining the location of the next burst by finding the location in the new, updated r(n) which has the greatest energy, as described above.
- the filtered first best match waveform is subtracted from the target vector, x(n), and the minimization search conducted on the resulting waveform. This process may be repeated as many times as desired. Again it may be desirable to restrict the burst locations to be different from one another for the reasons enumerated earlier herein.
- One simple means of guaranteeing that the burst locations are different is by replacing r(n) with zeroes in the region into which a burst was subtracted before conducting a subsequent burst search.
- the burst elements 10 and 38 may be optimized to reduce the computational complexity of the recursion computations that are necessary in the computation of the filter responses to filters 12 and 42.
- the burst values may be stored as recursive burst set wherein each subsequent burst shape may be derived from its predecessor by removing one or more elements from the beginning of the previous sequence and adding one or more elements to the end of the previous sequence.
- the bursts may be interrelated in other ways. For example, half of the bursts may be the sample inversions of other bursts, or bursts may be constructed using linear combinations of previous bursts. These techniques also reduce the memory required by burst elements 10 and 38 to store all of the candidate burst shapes.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Stereo-Broadcasting Methods (AREA)
- Valve Device For Special Equipments (AREA)
- Valve-Gear Or Valve Arrangements (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Radar Systems Or Details Thereof (AREA)
- Luminescent Compositions (AREA)
- Control Of Eletrric Generators (AREA)
- Amplifiers (AREA)
- Pyrane Compounds (AREA)
- Investigation Of Foundation Soil And Reinforcement Of Foundation Soil By Compacting Or Drainage (AREA)
- Steroid Compounds (AREA)
- Character Spaces And Line Spaces In Printers (AREA)
- Transmission And Conversion Of Sensor Element Output (AREA)
- Stabilization Of Oscillater, Synchronisation, Frequency Synthesizers (AREA)
- Semiconductor Lasers (AREA)
- Control Of Motors That Do Not Use Commutators (AREA)
- Devices Affording Protection Of Roads Or Walls For Sound Insulation (AREA)
Abstract
Description
Claims
Priority Applications (12)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1019960704137A KR100323487B1 (en) | 1994-02-01 | 1995-02-01 | Burst here Linear prediction |
DE69526926T DE69526926T2 (en) | 1994-02-01 | 1995-02-01 | LINEAR PREDICTION THROUGH PULSE PULSE |
BR9506574A BR9506574A (en) | 1994-02-01 | 1995-02-01 | Apparatus and method for encoding residual waveform in a linear prediction encoder in which the short and long period redundancies are removed from the structures of the digitized speech samples resulting in a residual waveform |
DK95909433T DK0744069T3 (en) | 1994-02-01 | 1995-02-01 | Burst-excited linear prediction |
JP7520734A JPH09508479A (en) | 1994-02-01 | 1995-02-01 | Burst excitation linear prediction |
MX9603122A MX9603122A (en) | 1994-02-01 | 1995-02-01 | Burst excited linear prediction. |
SI9530609T SI0744069T1 (en) | 1994-02-01 | 1995-02-01 | Burst excited linear prediction |
AT95909433T ATE218741T1 (en) | 1994-02-01 | 1995-02-01 | LINEAR PREDICTION BY IMPULSIVE EXCITATION |
AU17398/95A AU693519B2 (en) | 1994-02-01 | 1995-02-01 | Burst excited linear prediction |
EP95909433A EP0744069B1 (en) | 1994-02-01 | 1995-02-01 | Burst excited linear prediction |
FI962968A FI962968A (en) | 1994-02-01 | 1996-07-25 | Linear prediction encoder |
HK98110088A HK1011108A1 (en) | 1994-02-01 | 1998-08-21 | Burst excited linear prediction |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US18981494A | 1994-02-01 | 1994-02-01 | |
US189,814 | 1994-02-01 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO1995021443A1 true WO1995021443A1 (en) | 1995-08-10 |
Family
ID=22698876
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1995/001341 WO1995021443A1 (en) | 1994-02-01 | 1995-02-01 | Burst excited linear prediction |
Country Status (17)
Country | Link |
---|---|
US (1) | US5621853A (en) |
EP (1) | EP0744069B1 (en) |
JP (1) | JPH09508479A (en) |
KR (1) | KR100323487B1 (en) |
CN (1) | CN1139988A (en) |
AT (1) | ATE218741T1 (en) |
AU (1) | AU693519B2 (en) |
BR (1) | BR9506574A (en) |
CA (1) | CA2181456A1 (en) |
DE (1) | DE69526926T2 (en) |
DK (1) | DK0744069T3 (en) |
ES (1) | ES2177631T3 (en) |
FI (1) | FI962968A (en) |
HK (1) | HK1011108A1 (en) |
MX (1) | MX9603122A (en) |
PT (1) | PT744069E (en) |
WO (1) | WO1995021443A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999044192A1 (en) * | 1998-02-27 | 1999-09-02 | Lernout & Hauspie Speech Products N.V. | Apparatus and method for hybrid excited linear prediction speech encoding |
Families Citing this family (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6182030B1 (en) | 1998-12-18 | 2001-01-30 | Telefonaktiebolaget Lm Ericsson (Publ) | Enhanced coding to improve coded communication signals |
US7894610B2 (en) * | 2003-12-02 | 2011-02-22 | Thomson Licensing | Method for coding and decoding impulse responses of audio signals |
US8920343B2 (en) | 2006-03-23 | 2014-12-30 | Michael Edward Sabatino | Apparatus for acquiring and processing of physiological auditory signals |
US8700410B2 (en) | 2009-06-18 | 2014-04-15 | Texas Instruments Incorporated | Method and system for lossless value-location encoding |
US10121481B2 (en) * | 2011-03-04 | 2018-11-06 | Telefonaktiebolaget Lm Ericsson (Publ) | Post-quantization gain correction in audio coding |
EP3011555B1 (en) * | 2013-06-21 | 2018-03-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Reconstruction of a speech frame |
BR112015031181A2 (en) * | 2013-06-21 | 2017-07-25 | Fraunhofer Ges Forschung | apparatus and method that realize improved concepts for tcx ltp |
EP3079151A1 (en) * | 2015-04-09 | 2016-10-12 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoder and method for encoding an audio signal |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0532225A2 (en) * | 1991-09-10 | 1993-03-17 | AT&T Corp. | Method and apparatus for speech coding and decoding |
WO1993015503A1 (en) * | 1992-01-27 | 1993-08-05 | Telefonaktiebolaget Lm Ericsson | Double mode long term prediction in speech coding |
EP0573398A2 (en) * | 1992-06-01 | 1993-12-08 | Hughes Aircraft Company | C.E.L.P. Vocoder |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4191853A (en) * | 1978-10-10 | 1980-03-04 | Motorola Inc. | Sampled data filter with time shared weighters for use as an LPC and synthesizer |
US4675863A (en) * | 1985-03-20 | 1987-06-23 | International Mobile Machines Corp. | Subscriber RF telephone system for providing multiple speech and/or data signals simultaneously over either a single or a plurality of RF channels |
JP3102015B2 (en) * | 1990-05-28 | 2000-10-23 | 日本電気株式会社 | Audio decoding method |
US5138661A (en) * | 1990-11-13 | 1992-08-11 | General Electric Company | Linear predictive codeword excited speech synthesizer |
US5353374A (en) * | 1992-10-19 | 1994-10-04 | Loral Aerospace Corporation | Low bit rate voice transmission for use in a noisy environment |
US5341456A (en) * | 1992-12-02 | 1994-08-23 | Qualcomm Incorporated | Method for determining speech encoding rate in a variable rate vocoder |
-
1995
- 1995-02-01 EP EP95909433A patent/EP0744069B1/en not_active Expired - Lifetime
- 1995-02-01 CA CA002181456A patent/CA2181456A1/en not_active Abandoned
- 1995-02-01 CN CN95191398A patent/CN1139988A/en active Pending
- 1995-02-01 BR BR9506574A patent/BR9506574A/en not_active Application Discontinuation
- 1995-02-01 AT AT95909433T patent/ATE218741T1/en not_active IP Right Cessation
- 1995-02-01 WO PCT/US1995/001341 patent/WO1995021443A1/en active IP Right Grant
- 1995-02-01 MX MX9603122A patent/MX9603122A/en unknown
- 1995-02-01 KR KR1019960704137A patent/KR100323487B1/en not_active IP Right Cessation
- 1995-02-01 ES ES95909433T patent/ES2177631T3/en not_active Expired - Lifetime
- 1995-02-01 AU AU17398/95A patent/AU693519B2/en not_active Ceased
- 1995-02-01 DE DE69526926T patent/DE69526926T2/en not_active Expired - Lifetime
- 1995-02-01 PT PT95909433T patent/PT744069E/en unknown
- 1995-02-01 DK DK95909433T patent/DK0744069T3/en active
- 1995-02-01 JP JP7520734A patent/JPH09508479A/en active Pending
- 1995-09-18 US US08/529,374 patent/US5621853A/en not_active Expired - Lifetime
-
1996
- 1996-07-25 FI FI962968A patent/FI962968A/en unknown
-
1998
- 1998-08-21 HK HK98110088A patent/HK1011108A1/en not_active IP Right Cessation
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0532225A2 (en) * | 1991-09-10 | 1993-03-17 | AT&T Corp. | Method and apparatus for speech coding and decoding |
WO1993015503A1 (en) * | 1992-01-27 | 1993-08-05 | Telefonaktiebolaget Lm Ericsson | Double mode long term prediction in speech coding |
EP0573398A2 (en) * | 1992-06-01 | 1993-12-08 | Hughes Aircraft Company | C.E.L.P. Vocoder |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1999044192A1 (en) * | 1998-02-27 | 1999-09-02 | Lernout & Hauspie Speech Products N.V. | Apparatus and method for hybrid excited linear prediction speech encoding |
Also Published As
Publication number | Publication date |
---|---|
EP0744069B1 (en) | 2002-06-05 |
EP0744069A1 (en) | 1996-11-27 |
PT744069E (en) | 2002-10-31 |
DE69526926D1 (en) | 2002-07-11 |
DE69526926T2 (en) | 2003-01-02 |
HK1011108A1 (en) | 1999-07-02 |
MX9603122A (en) | 1997-03-29 |
FI962968A (en) | 1996-09-24 |
ES2177631T3 (en) | 2002-12-16 |
AU693519B2 (en) | 1998-07-02 |
CA2181456A1 (en) | 1995-08-10 |
US5621853A (en) | 1997-04-15 |
AU1739895A (en) | 1995-08-21 |
CN1139988A (en) | 1997-01-08 |
KR970700902A (en) | 1997-02-12 |
FI962968A0 (en) | 1996-07-25 |
BR9506574A (en) | 1997-09-23 |
KR100323487B1 (en) | 2002-07-08 |
JPH09508479A (en) | 1997-08-26 |
DK0744069T3 (en) | 2002-10-07 |
ATE218741T1 (en) | 2002-06-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7191125B2 (en) | Method and apparatus for high performance low bit-rate coding of unvoiced speech | |
EP1224662B1 (en) | Variable bit-rate celp coding of speech with phonetic classification | |
KR100527217B1 (en) | Sound encoder and sound decoder | |
US5751901A (en) | Method for searching an excitation codebook in a code excited linear prediction (CELP) coder | |
WO1998005030A9 (en) | Method and apparatus for searching an excitation codebook in a code excited linear prediction (clep) coder | |
US5621853A (en) | Burst excited linear prediction | |
KR100955126B1 (en) | Vector quantization apparatus | |
EP1212750A1 (en) | Multimode vselp speech coder | |
Miseki et al. | Adaptive bit-allocation between the pole-zero synthesis filter and excitation in CELP | |
MXPA99001099A (en) | Method and apparatus for searching an excitation codebook in a code excited linear prediction (clep) coder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 95191398.0 Country of ref document: CN |
|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AM AT AU BB BG BR BY CA CH CN CZ DE DK EE ES FI GB GE HU JP KE KG KP KR KZ LK LR LT LU LV MD MG MN MW MX NL NO NZ PL PT RO RU SD SE SI SK TJ TT UA UZ VN |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): KE MW SD SZ AT BE CH DE DK ES FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
WWE | Wipo information: entry into national phase |
Ref document number: 1199690034 Country of ref document: VN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2181456 Country of ref document: CA |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1995909433 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 962968 Country of ref document: FI |
|
WWE | Wipo information: entry into national phase |
Ref document number: PA/a/1996/003122 Country of ref document: MX |
|
REG | Reference to national code |
Ref country code: DE Ref legal event code: 8642 |
|
WWP | Wipo information: published in national office |
Ref document number: 1995909433 Country of ref document: EP |
|
WWG | Wipo information: grant in national office |
Ref document number: 1995909433 Country of ref document: EP |