CA2301995C - High quality speech coder at low bit rates - Google Patents

High quality speech coder at low bit rates

Info

Publication number
CA2301995C
Authority
CA
Canada
Prior art keywords
input speech
signal
obtaining
speech signal
pitch prediction
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002301995A
Other languages
French (fr)
Other versions
CA2301995A1 (en)
Inventor
Kazunori Ozawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
NEC Corp
Original Assignee
NEC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP26112196A (JP3360545B2)
Priority claimed from JP30714396A (JP3471542B2)
Application filed by NEC Corp
Priority claimed from CA002213909A (CA2213909C)
Publication of CA2301995A1
Application granted
Publication of CA2301995C
Anticipated expiration
Expired - Fee Related

Landscapes

  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

In a speech coder, a spectral parameter computer obtains a plurality of spectral parameters from an input speech signal and quantizes the obtained spectral parameters. Even at low bit rates, the speech coder requires relatively little computational effort and suffers less deterioration in sound quality.

Claims (12)

THE EMBODIMENTS OF THE INVENTION IN WHICH AN EXCLUSIVE
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS:
1. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position corresponding to a pulse position meeting a predetermined condition with respect to the computed pitch prediction signal, setting a pulse position retrieval range on the basis of a position obtained by shifting the obtained sample position by a predetermined number of samples, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range thus set, and outputting data of the retrieved best position.
2. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position meeting a predetermined condition with respect to the pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting a pulse position retrieval range for retrieving a pulse position on the basis of a position obtained by shifting the obtained sample position by a predetermined number of samples, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range thus set, and outputting data of the retrieved best position.
3. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position corresponding to a pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting pulse position candidates through shifting the obtained sample position by the pitch period on the basis of the position shifted by predetermined numbers of samples from the sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving the position candidates for a best position, and outputting data of the retrieved best position.
4. The speech coder according to one of claims 1, 2 and 3, wherein the excitation quantizer includes a codebook for jointly quantizing the amplitudes or polarities of a plurality of pulses.
5. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position meeting a predetermined condition with respect to the computed pitch prediction signal, setting a plurality of pulse position retrieval ranges on the basis of positions obtained by shifting the obtained sample position by corresponding shift extents, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, making retrieval of the pulse position retrieval ranges to select a best combination of a shift extent and a pulse position, and outputting data of the selected best combination.
6. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting a plurality of pulse position retrieval ranges on the basis of positions obtained by shifting the obtained sample position by corresponding shift extents, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, making retrieval of the pulse position retrieval ranges to select a best combination of a shift extent and a pulse position, and outputting data of the selected best combination.
7. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting pulse position candidates through shifting the obtained sample position by the pitch period on the basis of the position shifted by predetermined numbers of samples from the sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving the position candidates for a best position, and outputting data of the retrieved best position.
8. The speech coder according to one of claims 5, 6 and 7, wherein the excitation quantizer includes a codebook for jointly quantizing the amplitudes or polarities of a plurality of pulses.
9. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, a mode judging means for extracting a characteristic amount from the input speech signal, judging a plurality of modes from the extracted feature quantity, and outputting mode data, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and making pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude signals, obtaining a sample position meeting a predetermined condition with respect to the pitch prediction signal when the mode data represents a predetermined mode, setting a pulse position retrieval range on the basis of the obtained sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range, and outputting data of the retrieved best position.
10. The speech coder according to claim 9, wherein the feature quantity is an average pitch prediction gain.
11. The speech coder according to claim 9, wherein the mode judging means judges the modes on the basis of comparison of the average pitch prediction gain with a plurality of threshold values.
12. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for obtaining a position meeting a predetermined condition with respect to the pitch prediction signal computed in the adaptive codebook means, setting a plurality of pulse position retrieval ranges for respective pulses constituting an excitation signal, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, and retrieving the best positions of the pulses in the pulse position retrieval ranges.
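The excitation quantizer steps recited in claim 1 and the mode judgment of claims 10 and 11 can be illustrated with a minimal Python sketch. It is not the patented implementation, and several details are assumptions made only for illustration: the "predetermined condition" is taken to be the largest absolute amplitude of the pitch prediction signal, the retrieval range is a window of `width` samples starting at the shifted position, only one pulse is searched, and the "best position" maximizes the squared cross-correlation normalized by the impulse response energy. All function and parameter names are hypothetical, as are the threshold values.

```python
from bisect import bisect_right


def cross_correlation(x, h):
    """Return d[n] = sum over k >= n of x[k] * h[k - n] for one frame."""
    frame_len = len(x)
    d = []
    for n in range(frame_len):
        last = min(frame_len, n + len(h))  # h is treated as zero past its length
        d.append(sum(x[k] * h[k - n] for k in range(n, last)))
    return d


def search_pulse_position(x, h, pitch_pred, shift, width):
    """Search one pulse in a range anchored on the pitch prediction signal.

    x: input speech frame, h: impulse response, pitch_pred: pitch prediction
    signal (same length as x). Returns (best_position, polarity).
    """
    frame_len = len(x)
    # Assumption: the "predetermined condition" is the largest |amplitude|.
    anchor = max(range(frame_len), key=lambda n: abs(pitch_pred[n]))
    # Shift the anchor by a fixed number of samples and clip to the frame;
    # the retrieval range is the next `width` samples (an assumed layout).
    start = min(max(anchor + shift, 0), frame_len - 1)
    candidates = range(start, min(start + width, frame_len))

    # Polarity of the cross-correlation for each candidate position.
    d = cross_correlation(x, h)
    polarity = {n: (1 if d[n] >= 0 else -1) for n in candidates}

    def score(n):
        # Assumed criterion: d[n]^2 over the energy of h inside the frame.
        energy = sum(c * c for c in h[: frame_len - n])
        return d[n] * d[n] / energy if energy > 0 else 0.0

    best = max(candidates, key=score)
    return best, polarity[best]


def judge_mode(avg_pitch_gain, thresholds):
    """Map an average pitch prediction gain to a mode index.

    thresholds must be sorted in ascending order; the result lies in
    0..len(thresholds).
    """
    return bisect_right(thresholds, avg_pitch_gain)


if __name__ == "__main__":
    h = [1.0, 0.5, 0.25]                      # toy impulse response
    x = [0.0] * 10
    x[5], x[6], x[7] = -1.0, -0.5, -0.25      # a negated copy of h at sample 5
    pitch_pred = [0.0] * 10
    pitch_pred[4] = 2.0                       # peak of the pitch prediction
    print(search_pulse_position(x, h, pitch_pred, shift=-1, width=4))
    print(judge_mode(6.0, [2.0, 5.0, 8.0]))   # hypothetical thresholds
```

Restricting the search to a short window around the pitch prediction peak is what keeps the number of evaluated positions small; a full multi-pulse coder would repeat the search for each of the M pulses and jointly quantize amplitudes or polarities, which this sketch omits.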
CA002301995A 1996-08-26 1997-08-25 High quality speech coder at low bit rates Expired - Fee Related CA2301995C (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
JP26112196A JP3360545B2 (en) 1996-08-26 1996-08-26 Audio coding device
JP261121/1996 1996-08-26
JP307143/1996 1996-10-31
JP30714396A JP3471542B2 (en) 1996-10-31 1996-10-31 Audio coding device
CA002213909A CA2213909C (en) 1996-08-26 1997-08-25 High quality speech coder at low bit rates

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CA002213909A Division CA2213909C (en) 1996-08-26 1997-08-25 High quality speech coder at low bit rates

Publications (2)

Publication Number Publication Date
CA2301995A1 (en) 1998-02-26
CA2301995C (en) 2005-10-18

Family

ID=27170442

Family Applications (2)

Application Number Title Priority Date Filing Date
CA002301994A Expired - Fee Related CA2301994C (en) 1996-08-26 1997-08-25 High quality speech coder at low bit rates
CA002301995A Expired - Fee Related CA2301995C (en) 1996-08-26 1997-08-25 High quality speech coder at low bit rates

Family Applications Before (1)

Application Number Title Priority Date Filing Date
CA002301994A Expired - Fee Related CA2301994C (en) 1996-08-26 1997-08-25 High quality speech coder at low bit rates

Country Status (1)

Country Link
CA (2) CA2301994C (en)

Also Published As

Publication number Publication date
CA2301994A1 (en) 1998-02-26
CA2301994C (en) 2004-10-26
CA2301995A1 (en) 1998-02-26

Similar Documents

Publication Publication Date Title
CN100583242C (en) Method and apparatus for speech decoding
US5724480A (en) Speech coding apparatus, speech decoding apparatus, speech coding and decoding method and a phase amplitude characteristic extracting apparatus for carrying out the method
EP0890943B1 (en) Voice coding and decoding system
EP0766232B1 (en) Speech coding apparatus
EP1008982B1 (en) Voice encoder, voice decoder, voice encoder/decoder, voice encoding method, voice decoding method and voice encoding/decoding method
RU93058657A (en) VOCODER WITH VARIABLE CODING AND DATA TRANSFER
WO1993010624A3 (en) Progressive transmission of vector quantized data
EP0834863B1 (en) Speech coder at low bit rates
KR100323011B1 (en) Pitch period extractor of audio signal
EP0849724A2 (en) High quality speech coder and coding method
EP0275099B1 (en) Voice analyzing and synthesizing apparatus
CA2301995C (en) High quality speech coder at low bit rates
EP0696793B1 (en) A speech coder
CA2336360A1 (en) Speech coder
CN1139988A (en) Burst excited linear prediction
US7272553B1 (en) Varying pulse amplitude multi-pulse analysis speech processor and method
JPH0784600A (en) Vector quantizing device
JP3490325B2 (en) Audio signal encoding method and decoding method, and encoder and decoder thereof
JPH043879B2 (en)
US4924517A (en) Encoder of a multi-pulse type capable of controlling the number of excitation pulses
KR20040041731A (en) Variable fixed codebook searching method in CELP speech codec, and apparatus thereof
JP3984021B2 (en) Speech / acoustic signal encoding method and electronic apparatus
KR100296409B1 (en) Multi-pulse excitation voice coding method
JPH043878B2 (en)
JPH0496099A (en) Multipulse type voice coding device and complexing device

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed