EP0810584A2 - Codeur de signal - Google Patents
Codeur de signal Download PDFInfo
- Publication number
- EP0810584A2 EP0810584A2 EP97108526A EP97108526A EP0810584A2 EP 0810584 A2 EP0810584 A2 EP 0810584A2 EP 97108526 A EP97108526 A EP 97108526A EP 97108526 A EP97108526 A EP 97108526A EP 0810584 A2 EP0810584 A2 EP 0810584A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- signal
- pitch
- sub
- excitation
- bands
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000005284 excitation Effects 0.000 claims abstract description 71
- 230000003595 spectral effect Effects 0.000 claims abstract description 53
- 230000002194 synthesizing effect Effects 0.000 claims description 5
- 230000003044 adaptive effect Effects 0.000 abstract description 21
- 230000006866 deterioration Effects 0.000 abstract description 3
- 239000011295 pitch Substances 0.000 description 77
- 230000004044 response Effects 0.000 description 25
- 238000010586 diagram Methods 0.000 description 17
- 238000013139 quantization Methods 0.000 description 14
- 238000000034 method Methods 0.000 description 9
- 238000010276 construction Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000009331 sowing Methods 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 239000002699 waste material Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0011—Long term prediction filters, i.e. pitch estimation
Definitions
- the present invention relates to a signal coder and, more particularly, to a signal coder for high quality coding of wide-band signals such as speech and music at low bit rates.
- CELP Code Excited Linear Prediction Coding
- M. Schroeder and B. Atal "Code-excited linear prediction: High quality speech at very low bit rates", Proc. ICASSP, pp. 937-940, 1985 (Literature 1), and Kleijn et al, "Improved speech quality and efficient vector quantization in CELP", Proc. ICASSP, pp. 155-158, 1998 (Literature 2).
- spectral parameters representing a spectral characteristic of a speech signal are extracted from the speech signal for each frame (of 20 ms, for instance) through LPC (linear prediction). Also, the frame is divided into sub-frames (of 5 ms, for instance), and parameters in an adaptive codebook (i.e., a delay parameter corresponding to the pitch cycle and a gain parameter) are extracted for each sub-frame on the basis of the past speech signals, for making the pitch prediction of the sub-frame noted above with the adaptive codebook.
- the optimum gain is calculated by selecting an optimum speech codevector from the excitation codebook (i.e., vector quantization codebook) consisting of noise signals of predetermined kinds for the speech signal obtained by the pitch prediction.
- excitation signal is quantized.
- An excitation codevector is selected which minimizes the error power between a synthesized signal from selected noise signals and an excitation signal obtained by the pitch prediction.
- An index representing the kind of the selected codevector, an index representing a gain codevector, the spectral parameter, a delay parameter corresponding to the pitch cycle and a gain parameter are combined in a multiplexer and then transmitted.
- Any of the techniques described above permits obtaining comparatively good sound quality with speech signals.
- speech signals of a plurality of speakers speaking in a conference or the like or music signals produced by a plurality of different musical instruments and containing a plurality of different pitches low bit rates result in extreme sound quality deterioration.
- An object of the present invention is therefore to solve the above problems and provide a signal coder, in which even at a low bit rate the necessary computational effort and sound quality deterioration are relatively less with wide-band speech signals as well as music signals.
- a signal coder comprising: a spectral parameter calculator for obtaining a spectral parameter from an input signal and quantizing the spectral parameter thus obtained; a divider for dividing the input signal into a plurality of sub-bands; a pitch calculator for obtaining pitch data in at least one of the sub-bands and obtaining a pitch prediction signal; a judging unit for obtaining pitch prediction signal in at least one of the sub-bands and executing pitch prediction judgment; and an excitation quantizer for synthesizing the pitch prediction signal, subtracting the obtained pitch prediction signal from the input signal to obtain an excitation signal, and quantizing the obtained excitation signal.
- a signal coder comprising: a spectral parameter calculator for obtaining a spectral parameter from an input signal and quantizing the spectral parameter thus obtained: a mode judging unit for judging the mode of the input signal by extracting a feature quantity therefrom; a divider for dividing the input signal into a plurality of sub-bands in a predetermined mode; a pitch calculator for obtaining pitch data in at least one of the sub-bands and obtaining a pitch prediction signal; a judging unit for making pitch prediction judgment using the pitch prediction signal in at least one of the sub-bands; and an excitation quantizer operable in a predetermined mode to synthesize the pitch prediction signal, obtaining an excitation signal by subtracting the synthesized pitch prediction signal from the input signal, and quantizing the excitation signal thus obtained.
- the excitation signal of the input signal is quantized by expressing it as a plurality of non-zero amplitude pulses.
- a signal coder comprising: a spectral parameter calculator for obtaining a spectral parameter from an input signal and quantizing the spectral parameter thus obtained; a divider for dividing the input signal into a plurality of sub-bands; a pitch calculator for obtaining a plurality of pitch data candidates in at least one of the sub-bands and obtaining a pitch prediction signal for each pitch data candidate; a selector for synthesizing the pitch prediction signal for a combination of pitch data candidates and selecting the best pitch data by using the error signal between the input signal and the pitch prediction signal; and an excitation quantizer for quantizing the error signal.
- a signal coder comprising: a spectral parameter calculator for obtaining a spectral parameter from an input signal and quantizing the spectral parameter thus obtained; a mode judging unit for judging the mode of the input signal by extracting a feature quantity therefrom; a divider for dividing the input signal into a plurality of sub-bands in a predetermined mode; a pitch calculator for obtaining a plurality of pitch data candidates in at least one of the sub-bands and obtaining a pitch prediction signal for each pitch data candidate; a selector operable in a predetermined mode to synthesize the pitch prediction signal for a combination of Pitch data candidates and selecting the best pitch data by using the error signal between the input signal and the pitch prediction signal; and an excitation quantizer for quantizing the error signal.
- the error signal is quantized by expressing it using a plurality of non-zero amplitude pulses.
- Fig. 1 is a block diagram showing a first embodiment of the signal coder according to the present invention.
- This embodiment of the signal coder comprises a frame divider 110, a sub-frame divider 120, a spectral parameter calculator 200, a spectral parameter quantizer 210, a codebook 215, an acoustical sense weighting circuit 230, subtractors 235 and 236, a response signal calculator 240, adaptive codebook circuits 300 1 to 300 U , an impulse response calculator 310, an excitation quantizer 350, an excitation codebook 355, a gain quantizer 365, a gain codebook 366, a multiplexer 400, dividers 410, 415 and 440, judging circuits 420 1 to 420 U for executing the pitch prediction judgment, and a synthesizer 430.
- the frame divider 110 divides a speech signal supplied from an input terminal 100 into frames (of 10 ms, for instance).
- the sub-frame divider 120 divides the speech signal frame into sub-frames (of 5 ms, for instance) shorter than the frame.
- the spectral parameter may be calculated by using the well-known LP analysis, Burg analysis, etc. It is herein assumed that the Burg analysis is used. The Burg analysis is detailed in Nakamizo, "Signal Analysis and System Identification", published by Corona Co., Ltd., 1988, pp. 82-87 (Literature 4), and is not described here.
- LSP Linear Spectrum Pair
- the spectral parameter calculator 200 further outputs the 2-nd sub-frame LSP parameter to the spectral parameter quantizer 210.
- the spectral parameter quantizer 210 efficiently quantizes the LSP parameter of a predetermined sub-frame. Specifically, the 2-nd sub-frame LSP parameter is vector quantized.
- This vector quantization may be executed by well-known method.
- reference may be had to, for instance, Japanese Laid-Open Patent Publication No. 4-171500 (Literature 6), Japanese Laid-Open Patent Publication No. 4-363000 (Literature 7), Japanese Laid-Open Patent Publication No. 5-6199 (Literature 8), and T. Nomura et al, "LSP Coding Using VQ-SVQ With Interpolation in 4.074 kbps M-LCELP Speech Coder", Proc. Mobile Multimedia Communications, pp. B. 2.5, 1993 (Literature 9).
- the spectral parameter quantizer 210 selects and outputs a codevector which minimizes the distortion D j given by Equation (1).
- D j i P W(i)[LSP(i)-QLSP(i) j ] 2
- LSP(i), QLSP(i) j and W(i) are the LSP of the i-th sub-frame, the j-th codevector and the weighting coefficient, respectively, before the quantization.
- the spectral parameter quantizer 210 restores the 1-st sub-frame LSP parameter from the LSP parameter which has been quantized in the 2-nd sub-frame. Specifically, the spectral parameter quantizer 210 restores the 1-st sub-frame LSP parameter through the linear interpolation of the quantized LSP parameter of the 2-nd sub-frame of the current frame and the quantized LSP parameter of the 2-nd sub-frame of the immediately preceding frame. The spectral parameter quantizer 210 can restore the 1-st sub-frame LSP parameter through the linear interpolation after selecting a codevector which minimizes the error power between the LSP parameter before the quantization and that after the quantization.
- the spectral parameter quantizer 210 further outputs an index, which represents the codevector of the quantized LSP parameter of the 2-nd sub-frame, to the multiplexer 400.
- the response signal calculator 240 receives the linear prediction coefficient ⁇ i for each sub-frame from the spectral parameter calculator 200 and also the linear prediction coefficient ⁇ i having been restored through the quantization and interpolation for each sub-frame from the spectral parameter quantizer 210, calculates the response signal x z (n) with an input signal d(n) of zero for one sub-frame by using a value preserved in the filter memory, and outputs the calculated response signal to the subtractor 235.
- the response signal x z (n) is represented by Equation (2).
- N represents the sub-frame length
- ⁇ is a weighting coefficient for controlling the amount of the acoustical sense weighting and has the same value as in Equation (6) given below
- s w (n) and p(n) are a response signal outputted from the weighting signal calculator 360 and an output signal in the right side first term of Equation (6) to be given below as a filter divider term, respectively.
- the subtractor 235 subtracts the response signal x z (n) from the acoustical sense weighting signal x z (n) for one sub-frame as in Equation (5), and outputs the subtracted result x' w (n) to the divider 410 and the subtractor 820.
- x' w (n) x w (n)-x z (n)
- the impulse response calculator 310 calculates the impulse response h w (n) of the acoustical sense weighting filter, the z transform of which is represented by Equation (6), for a predetermined number L of points, and outputs the calculation result to the divider 415 and the excitation quantizer 350.
- the divider 410 divides the subtracted result x' w (n) from the subtractor 235 into a predetermined number U of sub-bands, and outputs these sub-bands as residue signals X' w1 (n) to X' wU (n) to the adaptive codebook circuits 300 1 to 300 U and the judging circuits 420 1 to 420 U .
- the band division may be executed by using a QMF (Quadrature Mirror Filter).
- QMF Quadratture Mirror Filter
- the divider 415 divides the impulse response h w (n) into a predetermined number U of sub-bands, and outputs these sub-bands as corresponding impulse responses h w1 (n) to h wU (n) to corresponding sub-bands of the adaptive codebook circuits 300 1 to 300 U .
- the adaptive codebook circuits 300 1 and 300 U and the judging circuits 420 1 to 420 U are operative in the same way with respect to each sub-band, and as an example the operations of the adaptive codebook circuit 300 1 and the judging circuit 420 1 will be described.
- the adaptive codebook circuit 300 1 receives the past excitation signal v 1 (n) corresponding to a sub-band 1 from the divider 440, the residue signal X' w1 (n) corresponding to the sub-band 1 from the divider 410, and the impulse response signal h w1 (n) corresponding to the sub-band 1 from the divider 410.
- the adaptive codebook circuit 300 1 derives a delay parameter T 1 corresponding to the pitch gain and a pitch gain ⁇ 1 so as to minimize the distortion D T1 in Equation (7), and outputs the obtained data to the judging circuit 420 1 .
- Equation (8) y w1 (n-T 1 ) is given by Equation (8), and the symbol * represents convolution.
- y w1 (n-T 1 ) v 1 (n-T 1 )*h w1 (n)
- the adaptive codebook circuit 300 1 then derives the pitch gain ⁇ 1 as in Equation (9).
- the delay parameter T 1 may be obtained not as an integer sample but as a decimal sample in order to improve the accuracy of extraction of the delay parameter T 1 for speech of women and children.
- P. Kroon et al "Pitch predictors with high temporal resolution", Proc. ICASSP, pp. 661-664, 1990 (Literature 11).
- the adaptive codebook circuit 300 1 quantizes the pitch gain ⁇ 1 with a predetermined quantizing bit number, then executes the pitch prediction as in Equations (10) and (11), and outputs the pitch prediction signal q w1 (n) and the pitch prediction excitation signal g 1 (n) to the judging circuit 420 1 .
- ⁇ ' 1 is the quantized gain.
- the judging circuit 420 1 derives the pitch prediction gain G 1 and executes the judgment as to whether or not to execute the pitch prediction by comparing the derived pitch prediction gain G 1 with a predetermined pitch prediction gain.
- the pitch prediction gain G 1 is derived as in Equation (12).
- the judging circuit 420 1 judges that pitch prediction is activated, and outputs the pitch prediction signal q w1 (n) and the pitch prediction excitation signal g 1 (n) to the synthesizer 430.
- the judging circuit 420 1 judges that the pitch prediction is not activated, and outputs zero amplitude signal to the synthesizer 430.
- the judging circuit 420 1 When the pitch prediction is activated, the judging circuit 420 1 outputs an index representing the delay parameter T 1 and an index representing the quantized gain ⁇ ' 1 to the multiplexer 400.
- the synthesizer 430 receives the pitch prediction signal q w1 (n) and the pitch prediction excitation signal g 1 (n) from the judging circuit 420 1 , executes full band synthesis, and outputs the full band synthesized signal q w (n) to the subtractor 236.
- the synthesizer 430 outputs the full band synthesized excitation signal g(n) to the weighting signal calculator 360.
- the subtractor 236 subtracts the full band synthesized signal g w (n) from the subtracted result X' w (n) from the subtractor 235, and outputs the result of the subtraction as the excitation signal z w (n) to the excitation quantizer 350.
- z w (n) x' w (n)-q w (n)
- the excitation quantizer 350 executes the vector quantization of the excitation signal z w (n) using the excitation codebook 355. Specifically, the excitation quantizer 350 retrieves from the excitation codebook 355 the excitation codevector c j (n) such as to minimize the distortion D j in Equation (14) by using the excitation signal z w (n) as the output of the subtractor 230 and the impulse response h w (n) as the output of the impulse response calculator 310.
- Equation (14) ⁇ (n) and s wj (n) are given by Equations (15) and (16), respectively.
- Equation (16) symbol * represents convolution.
- the excitation quantizer 350 outputs the index representing the selected excitation codevector to the multiplexer 400.
- the gain quantizer 365 selects a gain codevector which minimizes the distortion D t in Equation (17) with respect to the selected excitation codevector by reading out the gain codevectors from the gain codebook 366.
- the excitation codevector gain is vector quantized.
- G' t is a t-th codevector element of a gain codevector stored in the gain codevector 366.
- the gain quantizer 365 outputs an index representing the selected the gain codevector to the multiplexer 400.
- the weighting signal calculator 360 receives an index representing the pitch cycle, an index representing the quantized gain, an index of the excitation codebook 355, and an index representing the gain codebook, reads out a codevector corresponding to these read-out indexes, and derives a drive excitation signal v(n) as in Equation (18).
- v(n) g(n)+G' t c j (n)
- the weighting signal calculator 360 outputs the drive excitation signal v(n) to the divider 440.
- the weighting signal calculator 360 calculates the response signal S w (n) for each sub-frame as in Equation (19) by using the output parameter (LSP parameter) of the spectral parameter calculator 200 and the output parameter (linear prediction coefficient ⁇ i ) of the spectral parameter quantizer 210, and outputs the calculated response signal to the response signal calculator 240.
- the divider 440 executes the band division to sub-bands with respect to the drive the excitation signal v(n) outputted from the weighting signal calculator 360, and outputs the past excitation signals v 1 (n) to v U (n) corresponding to the sub-bands to the adaptive codebooks 300 1 to 300 U .
- Fig. 2 is a block diagram showing a second embodiment of the signal coder according to the present invention.
- the second embodiment of the signal coder is different from the first embodiment of the signal coder shown in Fig. 1 in an excitation quantizer 500, an amplitude codebook 540, a gain quantizer 550, a gain codebook 560, and a weighting signal calculator 570.
- the other component circuits are designated by like reference numerals and not described.
- the excitation quantizer 500 includes a correlation calculator 510, a position calculator 520, and an amplitude quantizer 530.
- the excitation quantizer 500 calculates the positions and amplitudes of M non-zero amplitude pulses in a pulse train.
- the correlation coefficient calculator 510 receiving, from terminals 501 and 502, the subtracted result z w (n) of the subtractor 236 and the impulse response h w (n) of the impulse response calculator 310, calculates two different correlation coefficients ⁇ (n) and ⁇ (p, q) as in Equations (20) and (21), and outputs these correlation coefficients to the position calculator 520 and amplitude quantizer 530.
- the position calculator 520 calculates the positions of a predetermined number M of non-zero amplitude pulses. Specifically, the position calculator 520 obtains for each pulse a pulse position which maximizes an evaluation value D represented by Equation (22) among predetermined position candidates as in Literature 3.
- Table 1 0,5,10,15,20,25,30,35 1,6,11,16,21,26,31,36 2,7,12,17,22,27,32,37 3,8,13,18,23,28,33,38 4,9,14,19,24,29,34,39
- the position calculator 520 selects a position which maximizes Equation (22) for each pulse by checking the position candidates.
- D C k 2 E k
- Equation (22) C k and E k are given by Equations (23) and (24), respectively.
- Equations (23) and (24) m k represents the position of a k-th pulse, and sgn(k) represents the polarity of the k-th pulse.
- the position calculator 520 outputs the position data of the M pulses to the amplitude quantizer 530.
- the amplitude quantizer 530 amplifies the amplitudes of the pulses by using the amplitude codebook 530. Specifically, the amplitude quantizer 530 selects the amplitude codevectors which maximize the evaluation value given by Equation (25). C j 2 /E j
- Equations (26) and (27) g' kj is the amplitude of the k-th pulse in the j-th amplitude codevector.
- the amplitude codevector 540 for the pulse amplitude quantization is preliminarily studied using the speech signal and stored.
- the amplitude quantizer 530 outputs the amplitude codevector index and position data from terminals 503 and 504.
- the gain quantizer 550 quantizes the pulse gain using the gain codebook 560. Specifically, the gain quantizer 550 selects a gain codevector which minimizes the distortion D t in Equation (28), and outputs the index of the selected gain codevector to the multiplexer 400.
- the weighting signal calculator 570 receives the pitch delay index, the quantized gain index, the index of the amplitude codebook 540, and the gain codevector index, reads out a codevector corresponding to the read-out indexes, and derives the drive excitation signal v(n) as in Equation (29).
- v(n) g(n)+G' t g' kj h w (n-m k )
- the weighting signal calculator 570 outputs the drive excitation signal v(n) to the divider 440.
- the weighting signal calculator 570 calculates the response signal s w (n) for each sub-frame as in Equation (30) by using the output parameter (LSP parameter) of the spectral parameter calculator 200 and the output parameter (linear prediction coefficient ⁇ i ' of the spectral parameter quantizer 210, and outputs the calculated response signal to the response signal calculator 240.
- Fig. 4 is a block diagram showing a third embodiment of the signal coder according to the present invention.
- Fig. 4 is different from Fig. 1 in dividers 600, 615 and 620, synthesizer 610 and a mode judging circuit 900.
- the mode judging circuit 900 receives the acoustical sense weighted signal X w (n) for each frame from the heating sense weighting circuit 230, and outputs mode data to the dividers 600, 615 and 620, the synthesizer 610 and the multiplexer 400.
- the mode judgment is executed at this time by using a feature quantity of the current frame.
- the frame mean pitch prediction gain G is used as the feature quantity.
- the frame mean pitch prediction gin G is calculated by using Equation (31) by using Equation (31), for instance.
- Equation (31) L is the number of sub-frames in one frame, and P i and E i are the speech power in the i-th sub-frame in Equation (32) and the pitch prediction error power in Equation (33), respectively.
- T' is the optimum delay for maximizing the frame mean pitch prediction gain G.
- the mode judging circuit 900 classifies the frame mean pitch prediction gain G into a plurality of, for instance four, different modes by comparison to a plurality of different predetermined threshold values.
- the dividers 600, 615 and 620 and synthesizer 610 receive mode data, and in a predetermined mode they perform the same process as in the first embodiment of the signal coder as shown in Fig. 1 by dividing signal into a plurality of sub-bands. In the other modes, they do not perform the signal division into the sub-bands or synthesis of signal.
- Fig. 5 is a block diagram sowing a fourth embodiment of the signal coder according to the present invention.
- This embodiment of the signal coder is obtained by adding the mode judging circuit 900 shown in Fig. 4 to the second embodiment of the signal coder shown in Fig. 2.
- Like parts are thus designated by like reference numerals, and are not described.
- Fig. 6 is a block diagram showing a fifth embodiment of the signal coder according to the present invention. This embodiment of the signal coder is different from the first embodiment of the signal coder shown in Fig. 1 in a selector 700, an adaptive codebook circuits 800 1 to 800 U , a synthesizer 810 and a subtractor 820. These components will now be described.
- the adaptive codebook circuits 800 1 to 800 U are operable in the same way, and only the adaptive codebook 800 1 will be described.
- the adaptive codebook 800 1 calculates a plurality of pitch cycles in the order of minimizing the distortion D T1 in Equation (7), and quantizes these pitch cycles by calculating the pitch gain ⁇ 1 using Equation (9).
- the adaptive codebook circuit 800 1 also calculates the pitch prediction signal q w1 (n) for each of the plurality of pitch cycles as in Equation (10), and outputs the calculated result to the synthesizer 810.
- the synthesizer 810 derives a full bands prediction signal q w (n) k for each of the combinations of all of the candidates from the adaptive codebook circuits 800 1 to 800 U , and outputs these full range prediction signals to the subtractor 820.
- the subtractor 820 subtracts the subtracted result X' w (n) from each prediction signal q w (n) k , and outputs the difference to the selector 700.
- the selector 700 calculates a predicted error power E k in Equation (34) for each of a plurality of subtracted result z w (n) k outputted from the subtractor 820.
- the selector 700 selects a combination which corresponds to a minimum of the predicted error power E k in Equation (34). At this time, the selector 700 outputs the minimum predicted error signal z w (n) k to the excitation quantizer 350, and outputs the corresponding full bands excitation signal g(n) k to the weighting signal calculator 360. The selector 700 outputs an index representing the pitch cycle of the selected candidate and an index representing the quantized pitch gain to the multiplexer 400.
- Fig. 7 is a block diagram showing a sixth embodiment of the signal coder according to the present invention.
- an excitation quantizer 500, an amplitude codebook 540, a gain quantizer 550, a gain codebook 560 and a weighting signal calculator 570 are those used in the second embodiment of the signal code shown in Fig. 2, and they are not described in detail.
- Fig. 8 is a block diagram showing a seventh embodiment of the signal coder according to the present invention.
- This embodiment of the signal coder is obtained by combining the mode judging circuit 900, dividers 600, 615 and 620 and synthesizer 610 shown in Fig. 4 to the fifth embodiment of the signal coder shown in Fig. 7. In a predetermined mode, this embodiment performs the same operation as in the fifth embodiment of the signal coder shown in Fig. 6.
- Fig. 9 is a block diagram showing an eighth embodiment of the signal coder according to the present invention.
- the excitation quantizer 500, amplitude codebook 540, gain quantizer 550, gain codebook 560 and weighting signal calculator 570 shown in Fig. 2 are used in the seventh embodiment of the signal coder shown in Fig. 8, and these components are not described in detail.
- the excitation is represented by a pulse train
- a plurality of pulse position sets may be obtained, and a combination which minimizes E k in Equation (25) may be obtained by retrieving the amplitude codebook for each pulse position set.
- a plurality of such combinations may be outputted to the gain quantizer for selecting a combination of position, amplitude codevector and gain codevector which minimizes the distortion D t in Equation (28) while the gain is quantized.
- the input signal is divided into a plurality of sub-bands, the pitch prediction judgment is executed by obtaining the pitch data in at least one of the sub-bands, and a full band signal is synthesized for quantizing the excitation signal of the input signal.
- the pitch prediction judgment is executed by obtaining the pitch data in at least one of the sub-bands, and a full band signal is synthesized for quantizing the excitation signal of the input signal.
- the mode of signal is judged by extracting a feature quantity from the input signal, and the processing described above is performed only in a predetermined mode. It is thus possible to obtain very useful effects.
- the excitation signal is expressed as a pulse train consisting of M zero-amplitude pulses, and it is thus possible to obtain better sound quality with relatively less retrieving and computational efforts.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP15485096A JP3335841B2 (ja) | 1996-05-27 | 1996-05-27 | 信号符号化装置 |
JP154850/96 | 1996-05-27 |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0810584A2 true EP0810584A2 (fr) | 1997-12-03 |
EP0810584A3 EP0810584A3 (fr) | 1998-10-28 |
Family
ID=15593275
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP97108526A Withdrawn EP0810584A3 (fr) | 1996-05-27 | 1997-05-27 | Codeur de signal |
Country Status (4)
Country | Link |
---|---|
US (1) | US5873060A (fr) |
EP (1) | EP0810584A3 (fr) |
JP (1) | JP3335841B2 (fr) |
CA (1) | CA2205093C (fr) |
Families Citing this family (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6393391B1 (en) * | 1998-04-15 | 2002-05-21 | Nec Corporation | Speech coder for high quality at low bit rates |
DE69712537T2 (de) * | 1996-11-07 | 2002-08-29 | Matsushita Electric Industrial Co., Ltd. | Verfahren zur Erzeugung eines Vektorquantisierungs-Codebuchs |
JP2000305599A (ja) * | 1999-04-22 | 2000-11-02 | Sony Corp | 音声合成装置及び方法、電話装置並びにプログラム提供媒体 |
US6721713B1 (en) | 1999-05-27 | 2004-04-13 | Andersen Consulting Llp | Business alliance identification in a web architecture framework |
US7315815B1 (en) | 1999-09-22 | 2008-01-01 | Microsoft Corporation | LPC-harmonic vocoder with superframe structure |
EP1796083B1 (fr) * | 2000-04-24 | 2009-01-07 | Qualcomm Incorporated | Procédé et appareil de quantification prédictive de trames voisées de la parole |
US7668712B2 (en) * | 2004-03-31 | 2010-02-23 | Microsoft Corporation | Audio encoding and decoding with intra frames and adaptive forward error correction |
US7707034B2 (en) * | 2005-05-31 | 2010-04-27 | Microsoft Corporation | Audio codec post-filter |
US7177804B2 (en) * | 2005-05-31 | 2007-02-13 | Microsoft Corporation | Sub-band voice codec with multi-stage codebooks and redundant coding |
US7831421B2 (en) * | 2005-05-31 | 2010-11-09 | Microsoft Corporation | Robust decoder |
US8712766B2 (en) * | 2006-05-16 | 2014-04-29 | Motorola Mobility Llc | Method and system for coding an information signal using closed loop adaptive bit allocation |
JP5085700B2 (ja) * | 2010-08-30 | 2012-11-28 | 株式会社東芝 | 音声合成装置、音声合成方法およびプログラム |
TR201911121T4 (tr) * | 2012-03-29 | 2019-08-21 | Ericsson Telefon Ab L M | Vektör niceleyici. |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0607989A2 (fr) * | 1993-01-22 | 1994-07-27 | Nec Corporation | Système pour le codage de parole |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CA1255802A (fr) * | 1984-07-05 | 1989-06-13 | Kazunori Ozawa | Codage et decodage de signaux a faible debit binaire utilisant un nombre restreint d'impulsions d'excitation |
JP2940005B2 (ja) * | 1989-07-20 | 1999-08-25 | 日本電気株式会社 | 音声符号化装置 |
US5208862A (en) * | 1990-02-22 | 1993-05-04 | Nec Corporation | Speech coder |
JP2626223B2 (ja) * | 1990-09-26 | 1997-07-02 | 日本電気株式会社 | 音声符号化装置 |
JP3114197B2 (ja) * | 1990-11-02 | 2000-12-04 | 日本電気株式会社 | 音声パラメータ符号化方法 |
JP3151874B2 (ja) * | 1991-02-26 | 2001-04-03 | 日本電気株式会社 | 音声パラメータ符号化方式および装置 |
JP3143956B2 (ja) * | 1991-06-27 | 2001-03-07 | 日本電気株式会社 | 音声パラメータ符号化方式 |
JP2800618B2 (ja) * | 1993-02-09 | 1998-09-21 | 日本電気株式会社 | 音声パラメータ符号化方式 |
-
1996
- 1996-05-27 JP JP15485096A patent/JP3335841B2/ja not_active Expired - Fee Related
-
1997
- 1997-05-27 CA CA002205093A patent/CA2205093C/fr not_active Expired - Fee Related
- 1997-05-27 US US08/863,785 patent/US5873060A/en not_active Expired - Fee Related
- 1997-05-27 EP EP97108526A patent/EP0810584A3/fr not_active Withdrawn
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0607989A2 (fr) * | 1993-01-22 | 1994-07-27 | Nec Corporation | Système pour le codage de parole |
Non-Patent Citations (2)
Title |
---|
GAO YANG ET AL: "MULTIBAND CODE-EXCITED LINEAR PREDICTION (MBCELP) FOR SPEECH CODING" SIGNAL PROCESSING EUROPEAN JOURNAL DEVOTED TO THE METHODS AND APPLICATIONS OF SIGNAL PROCESSING, vol. 31, no. 2, 1 March 1993, pages 215-227, XP000345441 * |
GARCIA-MATEO C ET AL: "Application of a low-delay bank of filters to speech coding" 1994 SIXTH IEEE DIGITAL SIGNAL PROCESSING WORKSHOP (CAT. NO.94TH6585), PROCEEDINGS OF IEEE 6TH DIGITAL SIGNAL PROCESSING WORKSHOP, YOSEMITE NATIONAL PARK, CA, USA, 2-5 OCT. 1994, pages 219-222, XP002076162 ISBN 0-7803-1948-6, 1994, New York, NY, USA, IEEE, USA * |
Also Published As
Publication number | Publication date |
---|---|
JP3335841B2 (ja) | 2002-10-21 |
JPH09319398A (ja) | 1997-12-12 |
CA2205093C (fr) | 2001-01-30 |
EP0810584A3 (fr) | 1998-10-28 |
CA2205093A1 (fr) | 1997-11-27 |
US5873060A (en) | 1999-02-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0802524B1 (fr) | Codeur de parole | |
US5140638A (en) | Speech coding system and a method of encoding speech | |
EP0501421B1 (fr) | Système de codage de parole | |
EP0957472B1 (fr) | Dispositif de codage et décodage de la parole | |
US5633980A (en) | Voice cover and a method for searching codebooks | |
EP0801377B1 (fr) | Appareil pour coder un signal | |
EP1162604B1 (fr) | Codeur de la parole de haute qualité à faible débit binaire | |
US5873060A (en) | Signal coder for wide-band signals | |
EP1005022A1 (fr) | Méthode et système de codage de la parole | |
US7680669B2 (en) | Sound encoding apparatus and method, and sound decoding apparatus and method | |
EP0849724A2 (fr) | Dispositif et procédé de haute qualité pour le codage de la parole | |
EP0745972B1 (fr) | Procédé et dispositif de codage de parole | |
EP0866443B1 (fr) | Codeur de signal de parole | |
EP0871158A2 (fr) | Dispositif de codage de la parole utilisant une excitation multi-impulsionnelle | |
JP3153075B2 (ja) | 音声符号化装置 | |
EP1100076A2 (fr) | Codeur de parole multimode avec lissage du gain | |
JP3092654B2 (ja) | 信号符号化装置 | |
JPH0844397A (ja) | 音声符号化装置 | |
JPH09319399A (ja) | 音声符号化装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): DE FR GB NL SE |
|
PUAL | Search report despatched |
Free format text: ORIGINAL CODE: 0009013 |
|
AK | Designated contracting states |
Kind code of ref document: A3 Designated state(s): DE FR GB NL SE |
|
17P | Request for examination filed |
Effective date: 19980915 |
|
17Q | First examination report despatched |
Effective date: 20020409 |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: 7G 10L 19/12 A |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20040305 |