WO2000008633A1 - Generateur de signaux d'excitation, codeur vocal et decodeur vocal - Google Patents

Generateur de signaux d'excitation, codeur vocal et decodeur vocal Download PDF

Info

Publication number
WO2000008633A1
WO2000008633A1 PCT/JP1999/004137 JP9904137W WO0008633A1 WO 2000008633 A1 WO2000008633 A1 WO 2000008633A1 JP 9904137 W JP9904137 W JP 9904137W WO 0008633 A1 WO0008633 A1 WO 0008633A1
Authority
WO
WIPO (PCT)
Prior art keywords
code vector
signal
noise
vector
excitation signal
Prior art date
Application number
PCT/JP1999/004137
Other languages
English (en)
Japanese (ja)
Inventor
Hiroyuki Ehara
Toshiyuki Morii
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co., Ltd. filed Critical Matsushita Electric Industrial Co., Ltd.
Priority to AU49324/99A priority Critical patent/AU4932499A/en
Publication of WO2000008633A1 publication Critical patent/WO2000008633A1/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04BTRANSMISSION
    • H04B14/00Transmission systems not characterised by the medium used for transmission
    • H04B14/02Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation
    • H04B14/04Transmission systems not characterised by the medium used for transmission characterised by the use of pulse modulation using pulse code modulation

Definitions

  • the present invention relates to a CELP (Code Excited Linear Prediction) type speech coding apparatus in a mobile communication system or the like that encodes and transmits a speech signal.
  • CELP Code Excited Linear Prediction
  • voice information is compressed for efficient use of radio waves and storage media, and is used for voice coding equipment for efficient coding.
  • a method based on the CELP (Code Excited Linear Prediction) method has been widely put to practical use in middle and low bit rates.
  • CELP Linear Prediction
  • speech is divided into a certain fixed frame length (about 5 ms to 50 ms), linear prediction of speech is performed for each frame, and a prediction residual (excitation signal) is obtained by linear prediction for each frame.
  • a prediction residual excitation signal
  • the adaptive code vector is based on the adaptive codebook that stores the driving excitation vector generated in the past, and the noise code vector is the noise that stores a predetermined number of vectors with a specified shape prepared in advance. It is selected from the codebook and used.
  • the random code vector stored in the random codebook includes a random noise sequence vector. For example, a vector generated by arranging some pulses or different pulses at different positions is used. In particular, when the bit length is reduced by increasing the frame length, the quality is improved by using the noise code vector in synchronization with the pitch peak position of the adaptive code vector (phase "Adaptive CELP method" is disclosed in Japanese Unexamined Patent Application Publication No.
  • Fig. 1 shows an example of the configuration of the excitation signal generator provided in the phase-adaptive CELP encoder.
  • an excitation signal is generated by adding the adaptive code vector multiplied by the adaptive codebook gain and the noise code vector after phase adaptation processing multiplied by the noise codebook gain.
  • the phase adaptation process is performed using the phase calculated using the adaptive code vector.
  • An object of the present invention is to perform a phase calculation using an MA-type adaptive code vector output from an MA-type adaptive codebook instead of an adaptive code vector output from an adaptive codebook, to thereby reduce a transmission path error.
  • An object of the present invention is to provide an excitation signal generating device, a voice coding device, and a voice decoding device capable of suppressing propagation.
  • the present invention generates an MA type adaptive code vector using a finite number of noise code vectors used in the past, an adaptive codebook gain, and a pitch period. Then, the phase is calculated using this.
  • FIG. 1 is a block diagram showing a configuration of a conventional phase-adaptive excitation signal generator
  • FIG. 2 is a block diagram showing a configuration of an excitation signal generator according to the first embodiment of the present invention
  • FIG. 3 is a block diagram showing a configuration of an MA-type adaptive codebook provided in the excitation signal generation device according to the first embodiment
  • FIG. 4 is a flowchart showing the flow of the excitation signal generation process in the first embodiment
  • FIG. 5 is a flowchart showing the flow of the MA-type adaptive codebook generation process according to the first embodiment
  • FIG. 6 is a block diagram showing a configuration of a speech coding apparatus and a speech decoding apparatus according to Embodiment 2 of the present invention.
  • FIG. 7 is a block diagram illustrating a configuration of an audio signal transmitting device and a receiving device according to the third embodiment of the present invention.
  • FIG. 2 shows a configuration of the excitation signal generator according to the first embodiment of the present invention.
  • the excitation signal generator shown in the figure includes an adaptive codebook 101, a first random codebook 102, and a second random codebook 103.
  • the adaptive codebook 101 buffers the excitation signal generated in the past, and generates an adaptive code vector using the pitch period (pitch lag) P.
  • the adaptive code vector generated by adaptive codebook 101 is applied to adaptive codebook gain G by multiplier 104. After being multiplied by 1, it is output to the adder 105.
  • the first noise code book 102 stores a predetermined number of noise code vectors having different shapes and stores the first noise code vector specified by the index S 1 of the noise code vector. Output.
  • the first noise code vector is phase-shifted in the phase adaptor 106 by a shift amount described later.
  • the first noise code vector after the phase shift is multiplied by the noise codebook gain G 2 in the multiplier 10 ⁇ and output to the adder 105.
  • the second random codebook 103 stores a predetermined number (finite number) of random code vectors having different shapes, and stores a second random code specified by an index S 2 of the random code vector.
  • the vector is output to the multiplier 108.
  • Multiplier 108 multiplies the second noise code vector by noise codebook gain G 2 and outputs the result to adder 105.
  • the second noise code vector multiplied by the noise codebook gain G 2 is simultaneously provided to the MA adaptive codebook 109.
  • MA-type adaptive codebook 109 generates MA-type adaptive code vector using second noise code vector after noise codebook gain multiplication, adaptive codebook gain G1 and pitch period P. And outputs it to the phase calculator 110.
  • the phase calculator 110 calculates the phase shift amount using the MA type adaptive code vector output from the MA type adaptive code book 109 and the pitch period P, and uses the shift amount as the phase adaptor Output to 6.
  • the excitation signal output from the adder 105 is also input to the adaptive codebook 101 and used to update the adaptive codebook.
  • the pitch period (pitch lag) P is stored in the adaptive codebook 101, the phase calculator 110, and the MA adaptive codebook 109, and the first noise codebook index S1 is stored in the first noise codebook.
  • the second random codebook index S2 is in the second noise codebook 103
  • the adaptive codebook gain G1 is in the multiplier 104 and the MA adaptive codebook 109
  • Noise codebook gain G2 is input to multipliers 107 and 108, respectively. Is done.
  • the adaptive codebook 101 buffers the excitation signal generated in the past as time-series data.Starting at the point specified by the pitch period P, the adaptive codebook is cut out of the adaptive codebook to obtain a multiplier. Output to 104.
  • the adaptive code vector is generated by periodicizing the pitch period (output
  • the adaptive code vector length is equal to the excitation signal vector length output from the excitation signal generator).
  • the multiplier 104 multiplies the adaptive code vector output from the adaptive code book 101 by the adaptive code book gain G 1 to generate a vector of the adaptive code component of the excitation signal vector.
  • the first random codebook 102 extracts the first random code vector specified by the first random codebook index S1 and outputs it to the phase adaptor 106.
  • the phase adaptor 106 the phase shift of the first noise code vector is performed, so that the vector length stored in the first noise code book 102 is output from the excitation signal generator. It has a length longer than the excitation signal vector length by the maximum shift length that can be performed by the phase adaptor 106.
  • the phase adaptor 106 shifts the first noise code vector by the shift value calculated by the phase calculator 110, cuts out only the portion used for generating the excitation signal vector, and multiplies the multiplier by adding Output to 7.
  • the multiplier 107 multiplies the vector output from the phase adaptor 106 by the noise codebook gain G2, and outputs the result to the adder 105.
  • the second noise codebook 103 takes out the second noise code vector specified by the index S2 and outputs it to the multiplier 108.
  • the noise code vector stored in the second noise code book 103 is the same as the excitation signal vector length generated from the present excitation signal generator.
  • the multiplier 108 multiplies the second noise code vector output from the second noise codebook 103 by the noise codebook gain G2.
  • the MA-type adaptive codebook 109 is the second after the noise codebook gain multiplication input in the past. Is generated using the noise code vector and the pitch period P inputted in the past, and further, the MA type adaptive code vector is cut out from the generated MA type adaptive codebook using the current pitch period P to calculate the phase. Output to 1 110
  • the phase calculator 110 searches for a phase position using the MA-type adaptive code vector and the current pitch period P. There are several methods for searching for the phase position. The method of maximizing the correlation value between the pulse train arranged in the pitch period P and the MA adaptive code vector, and the pitch period when using for CELP coding There is a method to maximize the correlation value between the vector obtained by applying a synthesis filter to the pulse train arranged in P and the vector obtained by applying the synthesis filter to the MA adaptive code vector.
  • the adder 105 adds the vectors output from the multiplier 104, the multiplier 107, and the multiplier 108 to generate an excitation signal vector.
  • the generated excitation signal vector is also output to adaptive codebook 101 and used to update adaptive codebook 101.
  • FIG. 3 shows a detailed configuration of the MA-type adaptive codebook 109.
  • the MA-type adaptive codebook 109 forms the second noise code vector after multiplication by the noise codebook gain output from the multiplier 108 into a first delay unit 201 and a second delay unit 200. 2.
  • the third delay unit 203 delays one unit time (one unit time is the time corresponding to the length of the excitation signal vector generated in one generation process).
  • the second noise code vector multiplied by the noise codebook gain three unit times before and output from the third delay device 203 is buffered in the first MA-type adaptive codebook 204.
  • the first MA-type adaptive codebook 204 extracts a first MA-type adaptive code vector starting from a point indicated by a pitch period two unit times before, which will be described later.
  • the result obtained by multiplying the first MA-type adaptive code vector extracted from the first MA-type adaptive codebook 204 by the adaptive codebook gain two unit times later described later in a multiplier 205 Is output to the adder 206.
  • the output of the multiplier 205 and the output of the second delay unit 202 output the noise codebook gain two unit times ago.
  • the second noise code vector after the multiplication is added. This added value is output to the second MA type adaptive codebook 207.
  • the second MA-type adaptive codebook 207 is composed of the first MA-type adaptive codebook and the vector output from the adder 206, and is described below one unit time before.
  • the MA type adaptive code vector is cut out starting from the point indicated by the pitch period.
  • the vector cut out from the second MA-type adaptive codebook 207 is multiplied by an adaptive codebook gain one unit time before described later by a multiplier 208, and the result is output to an adder 209. .
  • the adder 209 the output of the multiplier 209 and the second noise code vector multiplied by the noise codebook gain one unit time ago output from the first delay unit 201 are added. I do.
  • the added value is output to the third MA type adaptive codebook 210.
  • the third MA-type adaptive codebook 210 is formed by connecting the second MA-type adaptive codebook 207 and the vector output from the adder 209, and is specified by the pitch period P. Then, the MA-type adaptive code vector is cut out from the starting point and output to the phase calculator 110.
  • the applied codebook gain G 1 is sequentially delayed by one unit time by the fourth delay unit 2 1 1 and the fifth delay unit 2 1 2.
  • the adaptive codebook gain two unit times before output from the fifth delay unit 2 1 2 is given to the multiplier 205 described above, and the adaptive codebook gain before one unit time output from the fourth delay unit 211 is output.
  • the adaptive codebook gain is provided to the multiplier 208 described above.
  • the pitch period P is sequentially delayed by one unit time by the sixth delay unit 2 13 and the seventh delay unit 2 14.
  • the pitch period two unit times before output from the seventh delay unit 2 14 is given to the first MA type adaptive codebook 204 described above, and output from the sixth delay unit 2 13 1
  • the pitch period before the unit time is given to the second MA type adaptive codebook 207 described above.
  • the second noise code vector after the noise codebook gain multiplication input from the multiplier 108 (hereinafter simply referred to as the noise code vector in this paragraph) is input to the first delay unit,
  • the delay unit 201 outputs the noise code vector S [—1] input one unit time ago (past).
  • the second delay unit 202 receives the noise code vector S [—1] input one unit time ago, and further inputs the noise code vector input one unit time ago (ie, two unit time past). Outputs S [— 2].
  • the third delay unit 203 receives the noise code vector S [ ⁇ 2] input in the past two unit times, and further inputs the noise code vector input one unit time in the past (ie, three unit times in the past). Output the vector S [— 3]. This is equivalent to buffering all the noise code vectors input in the past three unit times and extracting the noise code vectors input in each unit time.
  • the fourth delay unit 2 1 1 receives the adaptive codebook gain G 1, outputs the adaptive codebook gain G [—1] input one unit time ago, and outputs the fifth delay unit 2 1 2 is to input the adaptive codebook gain G [—1] input in the previous one unit time, and to further calculate the adaptive codebook gain G [—2] input one unit time in the past (ie, two unit time past). Output. This is equivalent to buffering all adaptive codebook gains input in the past two unit times and extracting adaptive codebook gains input in each unit time.
  • the sixth delay unit 2 13 receives the pitch period P, outputs the pitch period P [—1] input in the past for one unit time, and the seventh delay unit 2 14
  • the pitch period P [-1] input in the past unit time is input, and the pitch period P [-2] input in the past one unit time (that is, two unit times past) is output. This is equivalent to buffering all pitch periods input in the past two unit times and extracting the pitch period input in each unit time.
  • the first MA type adaptive codebook 204 is a buffer having the maximum pitch period length that the pitch period can take, and the noise code vector past 3 unit time is copied at the end of the buffer. All parts before the copied part are 0.
  • the first MA-type adaptive codebook 2 0 4 uses the pitch period P [-2] of 2 unit time past to calculate the end point of the MA-type codebook (the end point of the random code vector of 3 unit time past) from P [ — Extracts and outputs the first MA-type adaptive code vector starting from the point that has been traced back by the pitch period length indicated by [2].
  • the periodic processing is performed with the pitch period length indicated by P [—2] to obtain a vector of a predetermined length.
  • the first MA type adaptive codebook 204 itself is output to the second MA type adaptive codebook 207.
  • the vector output from the first MA-type adaptive codebook 204 is multiplied by the adaptive codebook gain G [—2] two unit times past in the multiplier 205 and output to the adder 206 Is done.
  • the adder 206 adds the vector output from the multiplier 205 to the noise code vector S [ ⁇ 2] in the past two unit times and outputs the vector to the second MA-type adaptive codebook 207 I do.
  • the second MA-type adaptive codebook 2 07 is a buffer having the same length as that of the first MA-type adaptive codebook, and the vector output from the adder 210 is located at the end of this buffer.
  • the first part is copied, and the first MA adaptive codebook 204 is copied in the previous part.
  • the second MA-type adaptive codebook 207 goes back from the end of the second MA-type adaptive codebook 207 by a pitch period length represented by a pitch period P [—1] one unit time in the past. With the point as the starting point, the second MA-type adaptive code vector is cut out and output in the same manner as when the first MA-type adaptive code vector is cut out. Further, the second MA-type adaptive codebook 2107 itself is output to the third MA-type adaptive codebook 210.
  • the vector output from the second MA-type adaptive codebook 207 is multiplied by the adaptive codebook gain G [-1] of one unit time past in the multiplier 208 and output to the adder 209. Is done.
  • the adder 209 adds the noise code vector S [—1] in the past one unit time and the vector output from the multiplier 209, and outputs the result to the third MA type adaptive codebook 210. .
  • the third MA-type adaptive codebook 210 is a buffer having the same length as the second MA-type adaptive codebook, and the vector output from the adder 209 is located at the end of this buffer.
  • the second MA-type adaptive codebook 207 is copied to the part before it is copied.
  • the third MA-type adaptive codebook 210 is a third MA-type adaptive codebook, starting from a point that has been advanced from the end of the third MA-type adaptive codebook 210 by the pitch period indicated by the current pitch period P.
  • the type adaptive code vector is cut out and output in the same way as when the first MA type adaptive code vector was cut out.
  • the third MA type adaptive code vector is input to the phase calculator 110.
  • the MA adaptive codebook generates four vectors in order to calculate the noise code vector and the adaptive codebook gain and the phase period based on the pitch period within the past three unit times. It is not affected by transmission line errors in the past more than an hour.
  • the noise code vector, adaptive codebook gain, and pitch period within the past three unit times are used.However, with the same configuration, information over the past four unit times or the past two unit times is used. The configuration used is also possible.
  • an adaptive code vector acv [0 to N-1] is generated from an adaptive codebook acb [-Pmax ⁇ -1].
  • Pmax is the maximum value of the pitch period (pitched lag) that can be taken
  • N is the number of signal samples in one unit time
  • [] indicates an array variable.
  • the adaptive codebook acb [] is a buffer that stores only Pmax samples of the excitation signal vector generated in the past, and outputs acb [-P to N-P_1] as the adaptive code vector acv [0 to Nl].
  • P is the pitch period. If N- ⁇ -1 ⁇ 0, it is out of the range of the adaptive codebook acb [], so acv [] is generated by repeatedly using the part of acb [-P ⁇ -1].
  • the MA-type adaptive codebook ma-acv [0 to N-l] is generated from the MA-type adaptive codebook ma-acb [-Pmax: -1].
  • the method of generating ma-acb [] will be described later using FIG.
  • MA-type adaptive codebook ma—acb [] is a buffer that stores only Pmax samples of vectors generated from sound source generation information in the past finite time
  • MA—acb [-P to N-P-1] Type adaptive code vector ma—Output as acv [0 to Nl].
  • N-P-1 ⁇ the MA-type adaptive code vector ma- acv [] is generated in the same way as when the adaptive code vector acv [] is generated from the adaptive codebook acb [].
  • the phase ph is calculated.
  • the phase is calculated by searching for the position of the first impulse of the impulse train vector that maximizes the cross-correlation between the MA-type adaptive code vector ma—acv [] and the impulse train vectors arranged with the pitch period P Is used.
  • search for the position of the first impulse in the impulse train vector that minimizes distortion in the area after the excitation signal is subjected to the synthesis filter.
  • Phase adaptive PS I—CEL P speech coding IEICE Technical Report, SP 94—96 (1 995-02) P. 37 -P. 44 ".
  • a first random code vector scvl [0 to N-1] is generated from the first random codebook SCBl [Slsize] [-MAXph to N-1].
  • the noise codebook SCB1 [] [] stores a vector with a length of N + MAXph as Slsize.
  • MAXph is the maximum value that the phase Ph can take (ph ⁇ 0).
  • the vector SCB1 [S1] [] specified by the first noise code index S1 is extracted, and the SCB1 [S1] [-ph to Nl-ph] is cut out using the phase ph to obtain the first noise. Let the sign vector be scvl [0 to Nl].
  • a second random code vector scv2 [0 to N-l] is generated from the second random codebook SCB2 [S2size] [0 to N-l].
  • the random codebook SCB2 [] [] stores S2size types of vectors having a length of N.
  • the vector SCB2 [S2] [0 to N-1] specified by the second noise code index S2 is set as a second noise code vector scv2 [0 to N-1].
  • step 306 adaptive code vector acv and base multiplied by the adaptive codebook gain G 1 in the [0 ⁇ Nl] vector, the first noise code base vector SCV 1 [0 to N-1] and the second noise code
  • An excitation signal vector exc [0-Nl] is generated by adding a vector obtained by multiplying the sum vector with the vector scv2 [0-Nl] by the noise codebook gain G2.
  • step 307 the adaptive codebook is updated.
  • FIG. 5 is a flowchart showing a specific process of the method of generating an MA-type adaptive codebook in the present embodiment.
  • step 401 the contents of the MA adaptive codebook ma_acb [-Pmax to -l] are cleared to zero.
  • step 402 the second noise code vector after the noise codebook gain multiplication (hereinafter simply referred to as the noise code vector in this paragraph) is stored in the buffer buf—scb [0] [0 to Nl].
  • the pitch period P is stored in the buffer buf_p [0].
  • the adaptive codebook gain G1 is stored in buf—g [0].
  • buf_p [0 to 2] stores the pitch cycle P used in the past
  • buf_p [0] indicates the current pitch cycle
  • buf_p [1] indicates the pitch used in the past one unit time.
  • the period, buf_p [2] stores the pitch period used in the past 2 units of time.
  • the adaptive codebook gain G 1 used in the past is stored in buf-g [0-2]
  • the current adaptive codebook gain is stored in buf-g [0]
  • the current adaptive codebook gain is stored in buf-g [l].
  • an MA-type adaptive codebook ma-acb [] is generated. This is done by copying buf-scb [3] [0-N-1] to ma-1 acb [-N--1].
  • the MA type adaptive codebook is updated. First, ma-act> [-buf-p [2] -Nl-buf_p [2]] is copied to temporary vector 1 Tmp [0-Nl]. This is the same as generating an adaptive code vector using the MA adaptive codebook generated in step 405 as an adaptive codebook and using buf_p [2] as a pitch period.
  • the vector obtained by multiplying the temporary vector Tmp [0 to Nl] by buf—g [2] and buf—scb [2] [0 to N-1 ⁇ is calculated by calo, and ma—acb [-N ⁇ -1 ].
  • step 407 the MA type adaptive codebook is updated again.
  • ma acb [-buf_p [l] -N-1-buf-p [l]] is copied to the temporary vector Tmp [0-N-l]. This is the same as generating the adaptive code vector using the ⁇ ⁇ -type adaptive codebook updated in step 406 as the adaptive codebook and buf_p [1] as the pitch period.
  • the vector obtained by multiplying the temporary vector Tmp [0 ⁇ N-1] by buf-g [l] and buf_scb [1 ⁇ [0 ⁇ N-1] are calculated as ma_acb [-N ⁇ - 1].
  • step 409 the three buffers are updated and all processing ends.
  • the MA-type adaptive codebook is expanded to increase the phase adaptability. It is also possible to increase.
  • phase adaptation process is performed only on one of the noise codebooks. Therefore, compared with the case where the phase adaptation process is performed on both of the two noise codebooks. The resistance to transmission line errors has become stronger.
  • an example of a configuration in which a noise codebook that always performs (MA type) phase adaptation is used is shown.
  • a noise codebook that uses only a general noise codebook that does not perform phase adaptation at all is shown.
  • the present invention is also applicable to a configuration example in which a mode for generating a vector and a mode for generating a random code vector from a random code book performing phase adaptation as described in the present embodiment are switched.
  • FIG. 6 shows an embodiment using the excitation signal generator shown in the first embodiment.
  • FIG. 6A shows a speech encoding device
  • FIG. 6B shows a speech decoding device.
  • an input signal composed of a digitized speech signal or the like is inputted to the LPC analyzer 501 and the adder 506.
  • the LPC analyzer 501 performs a linear prediction analysis to calculate a linear prediction coefficient (LPC) and outputs it to the LPC quantizer 502.
  • LPC linear prediction coefficient
  • the LPC quantizer 502 quantizes the input LPC, outputs the quantized LPC to the synthesis filter 505, and outputs a code L representing the quantized LPC to the decoder.
  • the synthesis filter 505 constructs an LPC synthesis filter using the input quantized LPC.
  • An excitation signal generated by the excitation signal generator 503 is input to the synthesized filter to perform filter processing, and a synthesized speech signal is output to the adder 506.
  • the adder 506 calculates an error between the input data and the synthesized speech signal, and outputs the error to the distortion calculator 507.
  • the distortion calculator 507 calculates the distortion of the synthesized voice signal with respect to the input voice signal in consideration of the auditory weights, etc., based on the error signal output from the adder 506, and sends the resultant to the parameter determiner 504. Output.
  • the parameter determining unit 504 determines parameters (P, S 1, S 2, G 1, G 2) for generating an excitation signal output so as to minimize the distortion output from the distortion calculating unit 507. Adjust. Finally, a combination of parameters that minimizes distortion is output to the decoder side.
  • the encoded LPC information L transmitted from the encoder side is provided to the LPC decoder 508.
  • LPC decoder 508 decodes and decodes the quantized LPC from PC information L and outputs it to synthesis filter 510.
  • the synthesis filter 510 constructs an LPC synthesis filter using the decoding LPC input from the LPC decoder 508, applies a synthesis filter to the excitation signal input from the excitation signal generator 509, and decodes the synthesized speech. Output a signal.
  • the excitation signal generator 509 generates the excitation signal using the parameters (P, SI, S2, Gl, G2) for generating the excitation signal transmitted from the encoder side. Output to the composite file.
  • the synthesis filter 510 on the decoder side and the synthesis filter 505 on the encoder side are exactly the same if there is no error in the transmitted information.
  • the excitation signal generator 503 on the decoder side and the excitation signal generator 509 on the encoder side perform exactly the same operation to generate the same excitation signal if there is no error in the transmitted information.
  • post-processing such as a post filter for improving the auditory quality is added to the decoded synthesized speech signal output from the synthesis filter 510, the quality of the decoded speech signal is further improved.
  • FIG. 7 is a block diagram showing an audio signal transmitter and a receiver provided with the audio encoding or decoding device according to the second embodiment.
  • Figure 7A shows the transmitter and
  • Figure 7B shows the receiver.
  • the audio is converted into an electrical analog signal by the audio input device 601 and output to the AZD converter 602.
  • the analog audio signal is converted into a digital audio signal by the AZD converter 602 and output to the audio encoder 603.
  • Speech encoder 603 performs speech encoding processing, and outputs the encoded information to RF modulator 604.
  • the RF modulator performs an operation for transmitting the information of the encoded voice signal as a radio wave such as modulation, amplification, and code spreading, and outputs the information to the transmission antenna 605.
  • a radio wave (RF signal) 606 is transmitted from the transmitting antenna 605.
  • a radio wave (RF signal) 606 is received by the receiving antenna 607, and the received signal is sent to the RF demodulator 608.
  • the RF demodulator 608 performs processing such as code despreading / demodulation for converting radio signals into encoded information, and outputs the encoded information to the audio decoder 609.
  • the audio decoder 609 performs a decoding process on the encoded information and converts the digital decoded audio signal into a DZA converter 61. Output to 0.
  • the DZA converter 610 converts the digitized decoded audio signal output from the audio decoder 609 into an analog decoded audio signal and outputs the analog decoded audio signal to the audio output device 611.
  • the audio output device 6 11 1 converts the electrical analog decoded audio signal into decoded audio and outputs it.
  • the transmitting device and the receiving device can be used as a mobile device or a base station device of a mobile communication device such as a mobile phone.
  • the medium for transmitting information is not limited to radio waves as described in the present embodiment, but may use optical signals or the like, and may use a wired transmission path.
  • the audio encoding device or the encoding and decoding device described in the second embodiment and the transmitting device and the transmitting and receiving device described in the third embodiment include a magnetic disk, a magneto-optical disk, and a ROM cartridge. It is also possible to realize by recording as software on a recording medium such as, for example, and by using the recording medium, a speech encoding device and a Z decoding device can be used by a personal computer or the like using such a recording medium. And a transmission device / reception device can be realized. That is, by installing the recorded program in a computer, the same function as that of the above-described excitation signal generator can be provided.
  • a phase adaptation process of a noise code vector is performed based on an MA type adaptive code vector generated using a finite number of noise code vectors used in the past, an adaptive codebook gain, and a pitch period. Therefore, the propagation of a transmission path error can be shortened as compared with a method in which a noise code vector is adaptively shifted based on information extracted from an adaptive code vector. Also, since the phase adaptation processing is performed only on one of the noise codebooks, the resistance to transmission path errors can be increased as compared with the case where the phase adaptation processing is performed on both of the two noise codebooks.
  • the noise code vector is not adaptively shifted by the information extracted from the adaptive code vector.
  • the phase adaptation processing is performed only from the data used in the past finite time, the propagation of errors due to the phase adaptation processing can be suppressed within a limited time. It is possible to provide an excitation signal generating device, a voice coding device, and a voice decoding device.
  • the present invention can be used in a communication terminal device such as a base station device and a mobile station in a digital radio communication system.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

On génère un vecteur de code adaptatif MA par l'intermédiaire d'un nombre fini de vecteurs de code de bruit utilisés antérieurement, d'un gain de table de codage adaptative et d'une période de pointe et on calcule le niveau quantitatif du déphasage à partir de ce vecteur de code adaptatif MA, ce qui permet de déphaser le vecteur de code de bruit selon le niveau quantitatif calculé du déphasage.
PCT/JP1999/004137 1998-08-06 1999-08-02 Generateur de signaux d'excitation, codeur vocal et decodeur vocal WO2000008633A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU49324/99A AU4932499A (en) 1998-08-06 1999-08-02 Exciting signal generator, voice coder, and voice decoder

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP10223392A JP2000056799A (ja) 1998-08-06 1998-08-06 励振信号生成装置並びに音声符号化装置及び音声復号化装置
JP10/223392 1998-08-06

Publications (1)

Publication Number Publication Date
WO2000008633A1 true WO2000008633A1 (fr) 2000-02-17

Family

ID=16797435

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP1999/004137 WO2000008633A1 (fr) 1998-08-06 1999-08-02 Generateur de signaux d'excitation, codeur vocal et decodeur vocal

Country Status (3)

Country Link
JP (1) JP2000056799A (fr)
AU (1) AU4932499A (fr)
WO (1) WO2000008633A1 (fr)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0519796A (ja) * 1991-07-08 1993-01-29 Nippon Telegr & Teleph Corp <Ntt> 音声の励振信号符号化・復号化方法
JPH05289698A (ja) * 1992-04-09 1993-11-05 Nippon Telegr & Teleph Corp <Ntt> 音声符号化法
JPH0792999A (ja) * 1993-09-22 1995-04-07 Nippon Telegr & Teleph Corp <Ntt> 音声の励振信号符号化方法および装置
US5787391A (en) * 1992-06-29 1998-07-28 Nippon Telegraph And Telephone Corporation Speech coding by code-edited linear prediction
US5839110A (en) * 1994-08-22 1998-11-17 Sony Corporation Transmitting and receiving apparatus

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0519796A (ja) * 1991-07-08 1993-01-29 Nippon Telegr & Teleph Corp <Ntt> 音声の励振信号符号化・復号化方法
JPH05289698A (ja) * 1992-04-09 1993-11-05 Nippon Telegr & Teleph Corp <Ntt> 音声符号化法
US5787391A (en) * 1992-06-29 1998-07-28 Nippon Telegraph And Telephone Corporation Speech coding by code-edited linear prediction
JPH0792999A (ja) * 1993-09-22 1995-04-07 Nippon Telegr & Teleph Corp <Ntt> 音声の励振信号符号化方法および装置
US5839110A (en) * 1994-08-22 1998-11-17 Sony Corporation Transmitting and receiving apparatus

Also Published As

Publication number Publication date
AU4932499A (en) 2000-02-28
JP2000056799A (ja) 2000-02-25

Similar Documents

Publication Publication Date Title
EP1619664B1 (fr) Appareil de codage et de décodage de la parole et méthodes pour cela
CN101496098B (zh) 用于以与音频信号相关联的帧修改窗口的系统及方法
US5371853A (en) Method and system for CELP speech coding and codebook for use therewith
JP4958780B2 (ja) 符号化装置、復号化装置及びこれらの方法
US7599833B2 (en) Apparatus and method for coding residual signals of audio signals into a frequency domain and apparatus and method for decoding the same
US6202045B1 (en) Speech coding with variable model order linear prediction
WO2001061687A1 (fr) Codec de parole a large bande utilisant differentes frequences d&#39;echantillonnage
CN101006495A (zh) 语音编码装置、语音解码装置、通信装置以及语音编码方法
WO2001052241A1 (fr) Dispositif de codage vocal multimode et dispositif de decodage
EP3214619B1 (fr) Système et procédé pour l&#39;excitation d&#39;un guide mixte de codification pour codage de la parole
EP1699043A1 (fr) Procede et dispositif pour decoder des signaux
JP2003323199A (ja) 符号化装置、復号化装置及び符号化方法、復号化方法
JP3343082B2 (ja) Celp型音声符号化装置
JP3888097B2 (ja) ピッチ周期探索範囲設定装置、ピッチ周期探索装置、復号化適応音源ベクトル生成装置、音声符号化装置、音声復号化装置、音声信号送信装置、音声信号受信装置、移動局装置、及び基地局装置
JP3199142B2 (ja) 音声の励振信号符号化方法および装置
JP2004302259A (ja) 音響信号の階層符号化方法および階層復号化方法
JP4578145B2 (ja) 音声符号化装置、音声復号化装置及びこれらの方法
WO2000008633A1 (fr) Generateur de signaux d&#39;excitation, codeur vocal et decodeur vocal
JP4295372B2 (ja) 音声符号化装置
JP3954716B2 (ja) 音源信号符号化装置、音源信号復号化装置及びそれらの方法、並びに記録媒体
JP2002073097A (ja) Celp型音声符号化装置とcelp型音声復号化装置及び音声符号化方法と音声復号化方法
JP2002169595A (ja) 固定音源符号帳及び音声符号化/復号化装置
JP2003015699A (ja) 固定音源符号帳並びにそれを用いた音声符号化装置及び音声復号化装置
JPH034300A (ja) 音声符号化復号化方式
GB2365297A (en) Data modem compatible with speech codecs

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642