WO2005112005A1 - スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 - Google Patents
スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 Download PDFInfo
- Publication number
- WO2005112005A1 WO2005112005A1 PCT/JP2005/007438 JP2005007438W WO2005112005A1 WO 2005112005 A1 WO2005112005 A1 WO 2005112005A1 JP 2005007438 W JP2005007438 W JP 2005007438W WO 2005112005 A1 WO2005112005 A1 WO 2005112005A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- lsp
- wideband
- band
- scalable
- quantized
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 39
- 238000006243 chemical reaction Methods 0.000 claims abstract description 86
- 238000013139 quantization Methods 0.000 claims abstract description 76
- 238000004364 calculation method Methods 0.000 claims abstract description 32
- 238000004891 communication Methods 0.000 claims description 17
- 238000009499 grossing Methods 0.000 claims description 17
- 230000035945 sensitivity Effects 0.000 claims description 8
- 230000008859 change Effects 0.000 claims description 7
- 230000002123 temporal effect Effects 0.000 claims description 3
- 230000001131 transforming effect Effects 0.000 claims 1
- 230000001965 increasing effect Effects 0.000 abstract description 5
- 238000010586 diagram Methods 0.000 description 44
- 239000013598 vector Substances 0.000 description 39
- 230000005284 excitation Effects 0.000 description 31
- 238000004458 analytical method Methods 0.000 description 28
- 230000005540 biological transmission Effects 0.000 description 28
- 238000005070 sampling Methods 0.000 description 23
- 230000003044 adaptive effect Effects 0.000 description 21
- 230000015572 biosynthetic process Effects 0.000 description 20
- 238000003786 synthesis reaction Methods 0.000 description 20
- 238000012545 processing Methods 0.000 description 19
- 230000005236 sound signal Effects 0.000 description 15
- 230000000694 effects Effects 0.000 description 8
- 238000005516 engineering process Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 239000010410 layer Substances 0.000 description 5
- 238000001228 spectrum Methods 0.000 description 5
- 239000000470 constituent Substances 0.000 description 4
- 238000012937 correction Methods 0.000 description 4
- 230000006872 improvement Effects 0.000 description 4
- 230000003111 delayed effect Effects 0.000 description 3
- 238000013461 design Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 230000006870 function Effects 0.000 description 3
- 238000010295 mobile communication Methods 0.000 description 3
- 230000010355 oscillation Effects 0.000 description 3
- 230000001902 propagating effect Effects 0.000 description 3
- 230000002441 reversible effect Effects 0.000 description 3
- 238000013459 approach Methods 0.000 description 2
- 230000015556 catabolic process Effects 0.000 description 2
- 239000012792 core layer Substances 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000003595 spectral effect Effects 0.000 description 2
- 230000001629 suppression Effects 0.000 description 2
- 241001237745 Salamis Species 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000006866 deterioration Effects 0.000 description 1
- 230000002542 deteriorative effect Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000010363 phase shift Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 235000015175 salami Nutrition 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 238000013519 translation Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L25/00—Baseband systems
- H04L25/38—Synchronous or start-stop systems, e.g. for Baudot code
- H04L25/40—Transmitting circuits; Receiving circuits
- H04L25/49—Transmitting circuits; Receiving circuits using code conversion at the transmitter; using predistortion; using insertion of idle bits for obtaining a desired frequency spectrum; using three or more amplitude levels ; Baseband coding techniques specific to data transmission systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
- G10L19/07—Line spectrum pair [LSP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L25/00—Baseband systems
- H04L25/02—Details ; arrangements for supplying electrical power along data transmission lines
- H04L25/20—Repeater circuits; Relay circuits
Definitions
- the present invention relates to a scalable encoding device, a scalable decoding device, and methods thereof.
- the present invention relates to a scalable encoding apparatus, a scalable decoding apparatus, a scalable encoding method, and a scalable encoding apparatus used when performing voice communication in a mobile communication system or a packet communication system using the Internet protocol. And a scalable decoding method.
- packets such as VoIP (Voice over IP) or the like
- VoIP Voice over IP
- packets may be discarded on a transmission path due to congestion or the like.
- Patent Document 1 discloses a method in which core layer coding information and enhancement layer coding information are packed into separate packets and transmitted using scalable coding.
- Another application of packet communication is multicast communication (one-to-many communication) using a network in which a thick line (broadband line) and a thin line (line with a low transmission rate) are mixed. Even when multipoint communication is performed on such an uneven network, it is not necessary to send different encoded information for each network if the encoded information is hierarchized corresponding to each network. Therefore, scalable coding is effective.
- Patent Document 2 discloses a bandwidth scalable coding technique having scalability in a signal bandwidth, that is, a frequency axis direction.
- Patent Document 2 discloses an example of a CELP system in which spectrum envelope information of a speech signal is represented by LSP (line spectrum pair) parameters.
- LSP line spectrum pair
- the LSP parameters (narrowband coded LSP) are converted to LSP parameters for wideband speech coding using the following (Equation 1), and the converted LSP parameters are converted to a wideband speech coding section (enhancement layer).
- Equation 1 the quantum obtained by the coding layer
- Equation 1 the converted LSP parameters are converted to a wideband speech coding section (enhancement layer).
- fw (i) is the i-th LSP parameter of the wideband signal
- fn (i) is the i-th LSP parameter of the narrowband signal
- P is the LSP analysis order of the narrowband signal
- P is the LSP of the wideband signal.
- Patent Document 2 describes an example in which the sampling frequency is 8 kHz as a narrowband signal, the sampling frequency is 16 kHz as a wideband signal, and the broadband LSP analysis order is twice the narrowband LSP analysis order. Therefore, the conversion from the narrow-band LSP to the wide-band LSP can be performed by a simple equation as represented by (Equation 1). However, the position of the P-order LSP parameter on the lower order side of the broadband LSP is determined for the entire wideband signal including the (P-P) order on the higher order side. It does not correspond to the following LSP parameters.
- Non-Patent Document 1 instead of setting the conversion coefficient by which the i-th order narrowband LSP parameter of (Equation 1) is multiplied by 0.5, the conversion coefficient is expressed by the following (Equation 2). There is disclosed a method of obtaining an optimum transform coefficient ⁇ G) for each order using a coefficient optimization algorithm.
- fw_n (i) a (i) X L (i) + j8 (i) X fn-n (i)
- fw_n (i) is the i-th broadband quantized LSP parameter in the n-th frame
- a (i) XL (i) is the i-th element of the vector obtained by quantizing the prediction error signal (ex (i) is the i-th element
- L (i) is the LSP prediction residual vector
- ⁇ (i) is the weighting factor for the predicted wideband LSP
- fn_n (i) is the narrowband LSP parameter in the nth frame.
- Patent Document 1 Japanese Patent Application Laid-Open No. 2003-241799
- Patent Document 2 JP-A-11-30997
- Non-Patent Document 1 K. Koishida et al, "Enhancing MPEG-4 CELP by jointly optimized inter / intra-frame LSP predictors, IEEE Speech Coding Workshop 2000,
- FIG. 1 shows a signal obtained by subjecting a wideband signal to band limitation, that is, a signal obtained by once down-sampling and then up-sampling a wide-band signal to return to the original sampling frequency.
- FIG. 8 is a diagram showing an example of narrow-band LSP parameters obtained by performing the LSP analysis in FIG.
- FIG. 2 shows a wideband signal corresponding to the narrowband LSP parameter shown in FIG.
- FIG. 9 is a diagram showing an example of a broadband LSP parameter obtained by performing the LSP analysis in FIG.
- the horizontal axis is time (analysis frame number)
- the vertical axis is normalized frequency (1.0 is the Nyquist frequency, 8 kHz in the example in the figure).
- This change is caused by a difference in frequency components (mainly high-frequency components) included in the wideband signal, not included in the narrowband signal.
- FIG. 3 is a diagram showing ideal conversion coefficients when converting a narrowband LSP obtained for each order into a wideband LSP using the LSP data shown in FIGS. 1 and 2.
- the coefficient is a value obtained by dividing the wideband LSP by the narrowband LSP
- the horizontal axis is time (analysis frame number)
- the order is shown as an example when the order is 0th, 4th, or 8th.
- the ideal value of the conversion coefficient fluctuates with time. That is, the conversion factor when converting a narrowband LSP to a wideband LSP, or in other words, the ideal value of the conversion factor when predicting a wideband LSP from a narrowband LSP varies with time, Even if the conversion coefficient obtained by the design method shown in Non-Patent Document 1 is used, if the conversion coefficient is a fixed value, it is not possible to accurately represent an ideal conversion coefficient that fluctuates with time.
- an object of the present invention is to improve the conversion performance from a narrowband LSP to a wideband LSP, that is, to improve the prediction accuracy when predicting a wideband LSP from a narrowband LSP, and to provide a high-performance band scalable LSP codec
- An object of the present invention is to provide a scalable coding apparatus, a scalable decoding apparatus, a scalable coding method, and a scalable decoding method that can be realized.
- a scalable encoding device is a scalable encoding device that generates a narrowband and a wideband quantized LSP parameter having scalability in the frequency axis direction from an input signal. Narrowing the LSP parameter, narrowband encoding means for generating a narrowband first quantization LSP parameter, conversion means for converting the frequency band of the first quantized LSP parameter into a wideband, Wideband encoding means for performing encoding of LSP parameters of a wideband input signal using the first quantized LSP parameters after conversion to a wideband to generate second quantized LSP parameters of a wideband; Based on the relationship between the first and second quantized LSP parameters generated in A calculating means for calculating a conversion coefficient used in the converting means.
- the present invention it is possible to improve the conversion performance from a narrowband LSP to a wideband LSP and realize a high-performance band scalable LSP encoding.
- FIG. 1 A diagram showing an example of LSP parameters in a narrow band.
- FIG. 4 is a block diagram showing a main configuration of a scalable coding apparatus according to Embodiment 1.
- FIG. 5 is a block diagram showing a main configuration inside a wideband LSP coding apparatus according to Embodiment 1. 6] Block diagram showing main components inside transform coefficient calculating section according to Embodiment 1.
- FIG. 7 is a block diagram showing a main configuration of a scalable decoding device according to the first embodiment.
- FIG. 8 is a block diagram showing a main configuration inside a wideband LSP decoding device according to the first embodiment.
- FIG. 10 is a block diagram showing a main configuration inside a wideband LSP decoding unit according to Embodiment 2.
- FIG. 11 is a block diagram showing a main configuration inside a wideband LSP decoding unit according to Embodiment 2.
- FIG. 12 A block diagram showing a main configuration of a scalable coding apparatus according to Embodiment 3.
- FIG. 13 A block diagram showing a main configuration inside a transform coefficient calculation unit according to Embodiment 3.
- FIG. FIG. 15 is a block diagram showing a main configuration of a scalable decoding device according to Embodiment 3.
- FIG. 15 A block diagram showing a main configuration of a scalable decoding device according to Embodiment 4.
- FIG. 16 Embodiment 4.
- FIG. 17 is a block diagram showing a main configuration of a scalable decoding device according to [FIG. 17] A block diagram showing a main configuration of a wideband LSP coding device according to Embodiment 5 [FIG. 18] Conversion according to Embodiment 5 Block diagram showing the main configuration of the coefficient calculator
- FIG. 19 is a block diagram showing a main configuration of a scalable coding apparatus according to a fifth embodiment.
- FIG. 20 is a block diagram showing a main configuration of a wideband LSP coding section according to the sixth embodiment.
- FIG. 22 is a block diagram showing a main configuration of a wideband LSP encoding unit according to Embodiment 7. BEST MODE FOR CARRYING OUT THE INVENTION
- FIG. 4 is a block diagram showing a main configuration of the scalable coding apparatus according to Embodiment 1 of the present invention.
- Scalable coding apparatus includes down-sampling section 101, LSP analysis section (for narrowband) 102, narrowband LSP coding section 103, sound source coding section (for narrowband). 104, phase correction unit 105, LSP analysis unit (for wideband) 106, wideband LSP coding unit 107, excitation coding unit (for wideband) 108, transform coefficient calculation unit 109, upsampling unit 110, adder 111, And a multiplexing unit 112.
- Each part of the scalable coding apparatus according to the present embodiment performs the following operation.
- Down-sampling section 101 performs down-sampling processing on the input audio signal, and outputs a narrow-band signal to LSP analysis section (for narrow band) 102 and excitation codec section (for narrow band) 104.
- the input audio signal is a digitized signal, and is subjected to preprocessing such as HPF and background noise suppression processing as necessary.
- LSP analysis section (for narrow band) 102 calculates an LSP (line spectrum pair) parameter for the narrow band signal input from down sampling section 101, and outputs the result to narrow band LSP coding section 103. I do.
- Narrowband LSP encoding section 103 encodes the narrowband LSP parameters input from LSP analysis section (for narrowband) 102 and converts the quantized narrowband LSP parameters to wideband LSP encoding section. 107, a transform coefficient calculating section 109, and an excitation coding section (for narrow band) 104. Further, narrowband LSP encoding section 103 outputs the encoded data to multiplexing section 112.
- the excitation coding section (for narrowband) 104 converts the quantized narrowband LSP parameters input from the narrowband LSP coding section 103 into linear prediction coefficients, and obtains the obtained linear prediction coefficients.
- a linear prediction synthesis filter is constructed using the coefficients.
- Excitation coding section 104 calculates an auditory weighting error between the synthesized signal synthesized using the linear prediction synthesis filter and the narrowband input signal separately input from down-sampling section 101, The encoding of the excitation parameters that minimizes the target weighting error is performed.
- the obtained encoded information is sent to the multiplexing unit 112. Is output.
- excitation codec unit 104 generates a narrowband decoded audio signal and outputs it to upsample unit 110.
- narrow-band LSP coding section 103 or excitation coding section (for narrow-band) 104 is a circuit generally used in CELP-type speech coding apparatus using LSP parameters.
- the technology described in Patent Document 2 or ITU-T Recommendation G.729 can be used.
- Up-sampling section 110 receives the narrow-band decoded speech signal synthesized by excitation coding section 104, performs up-sampling processing, and outputs the result to adder 111.
- Adder 111 receives as input the phase-corrected input signal from phase corrector 105 and the narrowband decoded speech signal upsampled from upsampler 110, and obtains a difference signal between the two signals to generate a sound signal. Output to encoder (for wideband) 108.
- the phase correction section 105 is for correcting a phase shift (delay) generated in the down-sample section 101 and the up-sample section 110, and the down-sample processing and the up-sample processing perform linear phase low-pass.
- the input signal is delayed by the delay caused by the linear phase low-pass filter, and the LSP analysis unit (for wide band) 106 and the adder 111 are used.
- LSP analysis section (for wideband) 106 receives a wideband signal output from phase correction section 105, performs a known LSP analysis, and outputs wideband LSP parameters obtained to wideband LSP encoding section 107. I do.
- Transform coefficient calculating section 109 includes a narrowband quantized LSP previously output from narrowband LSP encoding section 103 and a wideband quantized LSP previously output from wideband LSP encoding section 107. , And transform coefficients are obtained and output to the wideband LSP encoding unit 107.
- the wideband LSP encoding unit 107 converts the narrowband quantized LSP input from the narrowband LSP encoding unit 103 by the transform coefficient input from the transform coefficient calculation unit 109 to convert to a wideband LSP. Then, the predicted wideband LSP is obtained by multiplying the wideband LSP by a weighting factor. Then, the error signal between the wideband LSP input from the LSP analysis unit (for wideband) 106 and the predicted wideband LSP obtained is coded using a method such as Bezel quantization, and the amount of wideband obtained is obtained. The child LSP is output to the excitation codec shading unit (for wideband) 108.
- the quantized LSP is It is expressed as (Equation 3) below.
- fw_n (i) a (i) X L (i) + j8 (i) X
- fw_n (i) is the i-th broadband quantized LSP parameter in the n-th frame
- a (i) XL (i) is the i-th element of the vector obtained by quantizing the prediction error signal (ex (i) is the i-th element
- L (i) is the LSP prediction residual vector
- ⁇ (i) is the weighting factor for the predicted wideband LSP
- fw_n—1 (i) is the wideband quantization LSP parameter in the (n ⁇ 1) th frame
- fn n-1 (i) is a narrowband quantized LSP parameter in the (n1) th frame
- fn_n (i) is a narrowband LSP parameter in the nth frame.
- wideband LSP encoding section 107 outputs the obtained code information to multiplexing section 112.
- the weighting factor a (i) by which the above LSP prediction residual vector is multiplied may be a fixed value of 1.0, a constant obtained by separate learning, or a plurality of values obtained by separate learning. You may prepare the coefficients as a codebook and select one from them.
- Excitation coding section (for wideband) 108 converts the quantized wideband LSP parameters input from wideband LSP coding section 107 into linear prediction coefficients, and uses the obtained linear prediction coefficients. To construct a linear prediction synthesis filter. Then, an auditory weighting error between the synthesized signal synthesized using the linear predictive synthesis filter and the phase-corrected input signal is determined, and a sound source parameter that minimizes the auditory weighting error is determined.
- excitation signal encoding unit 108 separately receives an error signal between the wideband input signal and the narrowband decoded signal after up-sampling from adder 111, and outputs this error signal and excitation code encoding unit 108 An error with respect to the decoded signal generated in step (1) is obtained, and the sound source parameters are determined so that the one obtained by applying the auditory weighting to this error is minimized.
- the obtained code information of the excitation parameter is output to multiplexing section 112.
- this sound source encoding for example, “K. Koishiaa et al, ⁇ lo—koit / s oandwidth scalable audio coder based on the ./9 standard,” IEEE Proc. ICASSP 2000, pp.1149-1152, 2000 Is disclosed.
- the multiplexing section 112 receives narrowband LSP encoding information from the narrowband LSP encoding section 103, and the excitation encoding section (for narrowband) 104 outputs the excitation code of the narrowband signal from the excitation encoding section (for narrowband) 104.
- the information is transmitted from the wideband LSP encoding unit 107 to the wideband LSP encoding information. From), the excitation code information of the wideband signal is input.
- the multiplexing unit 112 multiplexes these pieces of information and sends them out as a bit stream to the transmission path. Note that the bit stream is framed or packetized into a transmission channel frame according to the specifications of the transmission path. Also, in order to increase the resistance to transmission line errors, error protection and error detection codes are added, and interleave processing is applied.
- FIG. 5 is a block diagram showing a main configuration inside broadband LSP coding section 107 described above.
- the wideband LSP encoding section 107 includes an error minimizing section 121, an LSP codebook 122, a weighting factor codebook 123, amplifiers 124 to 126, and adders 127 and 128.
- the adder 127 calculates an error between the LSP parameter serving as a quantization target input from the LSP analysis unit 106 and a quantized LSP parameter candidate input from the adder 128, and calculates the obtained error. Is output to the error minimizing section 121.
- the error calculation may be a square error between the input LSP vectors. Also, if weighting is performed according to the characteristics of the input LSP vector, the quality of the audibility can be further improved. For example, in ITU-T Recommendation G.729, error minimization is performed using the weighted square error (weighted Euclidean distance) of Equation (21) in Chapter 3.2.4 (Quantization of the LSP coefficients).
- Error minimizing section 121 selects an LSP vector and a weight coefficient vector that minimize the error output from adder 127 from among LSP codebook 122 and weight coefficient codebook 123, respectively.
- the corresponding index is encoded and output to the multiplexing unit 112 (S1 Do
- LSP codebook 122 outputs the stored LSP vector to amplifier 124.
- the LSP vector stored in the LSP codebook 122 is the LSP vector of the wideband LSP predicted based on the narrowband quantized LSP output from the amplifier 125 (the wideband LSP input from the LSP analysis unit 106). Is the prediction residual vector).
- the weighting coefficient codebook 123 selects one set from the stored weighting coefficient sets, and the weight of the selected weighting coefficient set also determines the coefficient for the amplifier 124 and the coefficient for the amplifier 125 as an amplifier 124 And output to 125. It should be noted that this weighting factor set also includes the weighting factor power prepared for each of the LSP orders for each of the amplifiers 124 and 125.
- the amplifier 124 multiplies the LSP vector input from the LSP codebook 122 by the weight coefficient for the amplifier 124 output from the weight coefficient codebook 123, and outputs the result to the adder 128.
- the amplifier 125 converts the vector of the wideband LSP input from the amplifier 126, that is, the vector of the wideband LSP obtained by converting the narrowband LSP after quantization into an amplifier output from the weight coefficient codebook 123. The result is multiplied by the weighting coefficient for 125 and output to the adder 128.
- Adder 128 calculates the sum of the LSP vectors output from amplifiers 124 and 125, and outputs the sum to adder 127.
- the sum of the LSP vectors determined by error minimizing section 121 to minimize the error is output to excitation code converting section 108 and transform coefficient calculating section 109 as a wideband quantized LSP parameter.
- the LSP parameter output as a wideband quantized LSP parameter is a stable condition (if the nth LSP is larger than the 0th to (n ⁇ 1) th LSPs, that is, the LSP is the order If the value does not satisfy the order, the adder 128 operates to satisfy the stability condition of the LSP.
- the operation is generally performed so as to be longer than the predetermined interval.
- Amplifier 126 multiplies the LSP parameter input from narrowband LSP encoding section 103 by the coefficient input from transform coefficient calculation section 109, and outputs the result to amplifier 125.
- the LSP parameter input from narrowband LSP encoding unit 103 to amplifier 126 may be the quantization result of narrowband LSP encoding unit 103 as it is, but may be upsampled to obtain the sampling frequency of the wideband signal. It is even better to match with the order of the wideband LSP.
- an up-sampling method an impulse response of an LPC synthesis filter that can obtain a narrow-band LSP force is up-sampled, an auto-correlation is obtained from the up-sampled impulse response (for example, see Patent Document 2), A method of converting a number into an LSP of a desired order by a known method, and the like are not limited to this.
- FIG. 6 is a block diagram showing a main configuration inside transform coefficient calculating section 109 shown in FIG.
- the transform coefficient calculation unit 109 includes delay units 131 and 132, a divider 133, a limiter 134, and a smoothing unit 135.
- the delay unit 131 converts the narrowband LSP parameters input from the narrowband LSP encoding unit 103 into It is delayed by one processing unit time (the LSP parameter update cycle) and output to the divider 133.
- the narrow-band LSP input from the narrow-band LSP encoding unit 103 may be the parameter narrow-band LSP as it is, but it is more preferable to up-sample and equalize the order.
- Delay unit 132 delays the wideband LSP parameter input from wideband LSP encoding unit 107 by one processing unit time (the update period of the LSP parameter), and outputs the result to divider 133.
- Divider 133 combines the wideband LSP parameter quantized one processing unit time before input from delay unit 132 with the narrow band LSP parameter quantized one processing unit time before input from delay unit 131. Divide by the band LSP parameter and output the result of the division to the limiter 134. If the order of the narrowband LSP parameter output from the delay unit 131 is different from the order of the wideband LSP meter output from the delay unit 132, only the smaller order (usually the order of the narrowband LSP parameter) Performs division and outputs.
- the limiter 134 clips the division result input from the divider 133 with a preset upper limit value and lower limit value (when the limit value is exceeded, the limit value is reset to the upper limit value, and when the value falls below the lower limit value, The processing is reset to the lower limit, and the result is output to the smoothing unit 135.
- the upper limit value and the lower limit value may be the same for all orders, but it is more preferable to set an optimum value for each order.
- Smoothing unit 135 temporally smoothes the division result after clipping input from limiter 134, and outputs the result to wideband LSP encoding unit 107 as a transform coefficient. This smoothing process can be realized, for example, by using the following (Equation 4).
- X (i) is a transform coefficient applied to the i-th narrowband LSP parameter in the n-th processing unit time
- K is a smoothing coefficient and takes a value of 0 ⁇ K ⁇ 1
- ⁇ (i) is the result of dividing the i-th order LSP parameter output from the limiter 134.
- FIG. 7 is a block diagram showing a main configuration of a scalable decoding device that decodes encoded information encoded by the scalable encoding device.
- This scalable decoding device includes a demultiplexing unit 151, a sound source decoding unit (for narrow band) 152. , Narrowband LSP decoding unit 153, sound source decoding unit (for wideband) 154, transform coefficient calculation unit 155, wideband LSP decoding unit 156, speech synthesis unit (for narrowband) 157, speech synthesis unit (for wideband) 158, an up-sampling section 159, and an adder 160.
- Demultiplexing section 151 receives the coded information coded by the scalable coding apparatus described above, separates the coded information into coding information for each parameter, and converts the narrowband excitation coding information into the excitation decoding section (narrowband).
- Band 152 the narrowband LSP coding information to the narrowband LSP decoding unit 153, the wideband excitation coding information to the excitation decoding unit (for wideband) 154, and the wideband LSP coding information to the wideband LSP.
- Output to the decryption unit 156 respectively.
- the excitation decoding section (for narrowband) 152 converts the encoded information of the narrowband excitation signal input from the demultiplexing section 151 into an excitation encoding section (for narrowband) of the above scalable encoding apparatus. Decoding is performed using the reverse of the processing performed in step 104, and the quantized narrow-band sound source signal is output to the voice synthesizer (for narrow-band) 157.
- Narrowband LSP decoding section 153 performs encoding of narrowband LSP input from demultiplexing section 151 in narrowband LSP coding section 103 of the above scalable coding apparatus. Decoding is performed by the reverse of the processing, and the resulting narrowband quantized LSP is output to a speech synthesis unit (for narrowband) 157, a transform coefficient calculation unit 155, and a wideband LSP decoding unit 156.
- Speech synthesis section (for narrowband) 157 converts the quantized narrowband LSP parameters input from narrowband LSP decoding section 153 into linear prediction coefficients, and converts the obtained linear prediction coefficients. To construct a linear prediction synthesis filter.
- the speech synthesis section (for narrow band) 157 drives this linear prediction synthesis filter with the narrow band quantized excitation signal input from the sound source decoding section (for narrow band) 152 to synthesize a decoded speech signal, Output as a narrowband decoded audio signal.
- This narrow-band decoded audio signal is output to up-sampling section 159 to obtain a wide-band decoded audio signal. Note that this narrowband decoded audio signal may be used as it is as a final output. When the narrow-band decoded audio signal is used as the final output as it is, it is common to output it after performing post-processing such as a post-filter to improve subjective quality.
- Up-sampling section 159 performs up-sampling processing on the narrow-band audio signal input from speech synthesizing section (for narrow band) 157, and outputs the result to adder 160.
- Excitation decoding section (for wideband) 154 converts encoded information of the wideband excitation signal input from demultiplexing section 151 into excitation coding section (for wideband) 108 of the above-described scalable encoding apparatus. Decoding is performed by a process reverse to the performed process, and the obtained wideband quantized excitation signal is output to a voice synthesis unit (for wideband) 158.
- Transform coefficient calculating section 155 includes a narrowband quantized LSP previously input from narrowband LSP decoding section 153, and a wideband quantized LSP previously input from wideband LSP decoding section 156. , And transform coefficients are obtained and output to the wideband LSP decoding unit 156.
- Wideband LSP decoding section 156 multiplies the narrow-band quantized LSP input from narrow-band LSP decoding section 153 by a transform coefficient input from transform coefficient calculating section 155 to convert to a wideband LSP. Then, the predicted wideband LSP is obtained by multiplying the wideband LSP by a weighting factor. Note that the weighting factor uses the same value as the weighting factor used in the wideband LSP coding unit 107 of the scalable coding device. Also, the wideband LSP decoding unit 156, based on the wideband LSP coding information input from the demultiplexing unit 151, quantizes the wideband LSP prediction residual (input wideband LSP on the coding side and the prediction Decoding error from the wideband LSP).
- wideband LSP decoding section 156 adds the quantized wideband LSP prediction residual and the predicted wideband LSP already obtained above to decode the wideband quantized LSP.
- the obtained broadband quantized LSP parameters are output to speech synthesis section (for wideband) 158 and transform coefficient calculation section 155.
- Speech synthesis section (for wideband) 158 converts the quantized wideband LSP parameters input from wideband LSP decoding section 156 into linear prediction coefficients, and performs linear prediction synthesis using the obtained linear prediction coefficients. Build a filter.
- the speech synthesis section (for wideband) 158 drives this linear predictive synthesis filter with the wideband quantized sound source signal input from the sound source decoding section (for wideband) 154 and decodes the wideband decoded speech signal (mainly high-frequency component). ) And outputs the result to adder 160.
- Adder 160 includes a narrow-band decoded speech signal after up-sampling input from up-sampling section 159 and a wide-band decoded speech signal (mainly high-frequency component) input from speech synthesis section (for wide band) 158. , And outputs the final wideband decoded audio signal.
- a narrow-band decoded speech signal after up-sampling input from up-sampling section 159 and a wide-band decoded speech signal (mainly high-frequency component) input from speech synthesis section (for wide band) 158. , And outputs the final wideband decoded audio signal.
- FIG. 8 is a block diagram showing a main configuration inside broadband LSP decoding section 156 described above.
- the wideband LSP decoding section 156 includes an index decoding section 161, an LSP codebook 162, a weight coefficient codebook 163, amplifiers 164 to 166, and an adder 167.
- Index decoding section 161 obtains the wideband LSP encoding information from demultiplexing section 151, decodes the index information for LSP codebook 162 and weighting factor codebook 163, and decomposes each piece of index information. Output to codebook.
- LSP codebook 162 acquires an LSP codebook index from index decoding section 161, extracts an LSP vector specified by the index from the codebook, and outputs the LSP vector to amplifier 164. If the codebook power is an S-split type or has a multi-stage configuration, a plurality of subcodebook powers are also extracted to generate an LSP vector.
- the weighting factor codebook 163 acquires the weighting factor codebook index from the index decoding unit 161, extracts a weighting factor set specified by the index from the codebook card, and extracts the amplifier 164 from the extracted factor set.
- Coefficient subset (for the LSP codebook) (a coefficient power that multiplies each order of the LSP vector) to the amplifier 164, and a coefficient subset for the amplifier 165 (for the narrowband LSP) (coefficient to multiply each order of the predicted wideband LSP vector) Output to the amplifier 165, respectively.
- the amplifier 164 multiplies the LSP vector input from the LSP codebook 162 by the weight coefficient for the amplifier 164 input from the weight coefficient codebook 163, and outputs the result to the adder 167.
- the amplifier 165 multiplies the vector of the wideband LSP converted from the quantized narrowband LSP input from the amplifier 166 by the weight coefficient for the amplifier 165 input from the weight coefficient codebook 163. Output to adder 167.
- the calo calculator 167 calculates the sum of the LSP vectors input from the amplifiers 164 and 165, and as a quantized (decoded) wideband LSP parameter, a speech synthesis unit (for wideband) 158 and a transform coefficient calculation unit Output to 155. If the LSP parameter output as a wideband quantized LSP parameter does not satisfy the stability condition, that is, if the n-th LSP is in the 0th to (n ⁇ 1) th If it is smaller than the LSP (when the LSP does not increase in the order of the order), perform operations to satisfy the stability condition of the LSP. Note that even when the interval between adjacent quantized LSPs is smaller than the predetermined interval, the interval will be longer than the predetermined interval. Operate as follows.
- transform coefficient calculating section 155 shown in FIG. 7 is basically the same as that of transform coefficient calculating section 109 shown in FIG. Therefore, although detailed description is omitted, the input to the delay unit 131 in the transform coefficient calculation unit 155 is from the narrowband LSP decoding unit 153, and the input to the delay unit 132 is from the wideband LSP decoding unit 156. The point that the output of the smoothing unit 135 is sent to the wideband LSP decoding unit 156 is different from the transform coefficient calculating unit 109 shown in FIG.
- transform coefficient calculation section 109 uses a narrowband and wideband quantized LSP norameter encoded in a past frame (for example, the immediately preceding frame or the like). Then, an approximate value of the ideal transform coefficient in the past frame is obtained, and the transform coefficient to the narrowband quantized LSP force in the current frame is determined based on the approximate value. Specifically, the approximate value of the ideal transform coefficient is obtained by dividing the wideband quantized LSP of the past frame by the narrowband quantized LSP of the same frame.
- the conversion coefficient when estimating the narrowband LSP parameter power by multiplying the wideband LSP parameter by the conversion coefficient XG), the conversion coefficient is adaptively determined for each frame by utilizing the relationship between the past narrowband LSP parameter and the wideband LSP parameter. To determine. Therefore, the conversion coefficient changes over time. By employing this configuration, it is possible to improve the prediction accuracy when predicting a wideband LSP from a narrowband LSP.
- the above transform coefficient can be calculated only from the narrowband and wideband quantized LSP parameters in the past frame, for example, information is separately transmitted from the encoding side on the decoding side. No need to get. That is, it is possible to improve the encoding performance of wideband LSP parameters without increasing the transmission rate of communication.
- the above-mentioned transform coefficients can be directly obtained by a predetermined calculation of the narrow-band and wide-band LSP parameter forces in the past frame. You don't need to keep it.
- the limiter 134 in the conversion coefficient calculation unit 109 sets the conversion coefficient to, for example, about 10% above and below the average value so that the calculated conversion coefficient does not become an extreme value.
- the restrictions that can be kept within the limit are obtained. For example, when the voice mode switches from voiced mode to unvoiced mode or unvoiced mode to voiced mode, the LSP parameter fluctuates greatly, and the calculated conversion coefficient also fluctuates to a reasonable value. Sometimes . If the transform coefficient fluctuates greatly in a short period of time, the prediction using the LSP ratio of the wideband Z narrowband of the previous frame becomes ineffective, and works in the direction of increasing the error.
- the power of the LSP codebook trying to correct such a large error is to provide such a large error and vector in the codebook, so that the prediction error is small and the error in the case increases. Will be.
- the relationship between the transform coefficient and the LSP codebook falls into a kind of oscillation state, and it is necessary to adopt a configuration that can balance the two well so that this does not occur.
- a conversion coefficient is obtained for all frames according to the above-described arithmetic expression.
- the conversion coefficient is provided with an upper limit and a lower limit, and the calculated conversion coefficient If not, make the correction so that the conversion factor falls within this range.
- the conversion coefficient actually used for the conversion can be set to a value within a certain range, so that the continuity (or quasi-stationarity) of the conversion coefficient is guaranteed, and the oscillation state does not occur.
- the prediction ability due to the transform coefficient is limited, and the prediction error may increase.However, if the range is limited to the vicinity of the “fixed value” when the transform coefficient is a fixed value, the prediction error fixes the transform coefficient.
- the approximate value of the transform coefficient is obtained by dividing the wideband quantized LSP of the immediately preceding frame by the narrowband quantized LSP of the immediately preceding frame.
- the transform coefficient used in the current frame is obtained by dividing the approximate value by the average transform coefficient. Neighboring (for example, a range of about 10% before and after or a range of about standard deviation of the conversion coefficient).
- the above-described conversion coefficient is subjected to the smoothing processing between the analysis frames (between the preceding and succeeding frames) so as to fluctuate gradually with time. For this reason, the conversion coefficient changes gently with respect to changes in the LSP parameter, and can be prevented from being excessively sensitive to a transmission path error. Further, since the value of the transform coefficient is stable, the design of the corresponding LSP code vector codebook becomes easy. The predicted value of the quantized LSP is And the LSP code vector, if one of the parameters goes out of control, the other goes out of control, the relationship between them diverges (the above-mentioned oscillation state), and a codebook with good performance can be designed. It is because it disappears. With the above configuration, for example, the SD performance can be improved by 0.05 dB. Note that the amount of improvement depends on the number of quantization bits and the frame length.
- the present invention is also applicable when using an MA predictor.
- the MA prediction coefficient is stored in the weight coefficient codebook 163, and the number of dimensions of the weight coefficient vector is increased by the MA prediction order.
- transform coefficient calculating section 109 includes both limiter 134 and smoothing section 135, but the configuration is such that only one of them is provided. May be.
- the conversion coefficient when the calculated conversion coefficient fluctuates greatly, the conversion coefficient is corrected to be within a certain range, whereby the prediction when predicting the wideband LSP from the narrowband LSP is performed stably. It was made to be.
- the LSP parameter fluctuates by paying attention to the quantized LSP parameter and observing a change in the quantized LSP parameter! Judgment is made, and the conversion coefficient used for conversion is switched.
- a narrow-band quantized LSP parameter obtained by a narrow-band LSP coding unit on the encoding side or a narrow-band LSP decoding unit on the decoding side.
- the stationary mode is determined when the LSP parameter fluctuates, and when the quantized LSP parameter of the narrow band fluctuates, the mode is determined to be the unsteady mode.
- the code book and the weight coefficient code book are switched and used. That is, in the steady mode, adaptive control is performed by calculating the conversion coefficient for each frame in accordance with the above equation (Equation 2), whereas in the unsteady mode, the above equation (Equation 3) is used. , Set the conversion coefficient to a fixed or semi-fixed value.
- the semi-fixed value means that a plurality of conversion coefficients are set in advance, and the conversion coefficients are switched according to the coding result (sound quality) of the audio signal.
- the conversion coefficients are switched according to the coding result (sound quality) of the audio signal.
- the basic configuration of the scalable coding apparatus according to Embodiment 2 of the present invention is the same as that of the scalable coding apparatus according to Embodiment 1. Therefore, detailed description of the scalable coding apparatus according to the present embodiment will be omitted, and transform coefficient calculating section 109a and wideband LSP coding section 107a having different configurations will be described in detail below. The same components are denoted by the same reference numerals, and description thereof will be omitted.
- FIG. 9 is a block diagram showing a main configuration inside transform coefficient calculating section 109a.
- the conversion coefficient calculation unit 109a includes a mode determination unit 201, a coefficient table 202, and a switching switch 203, instead of the limiter 134.
- the conversion coefficient calculation unit 109a uses the calculated conversion coefficient and the conversion coefficient stored in the coefficient table in advance according to the mode determination result of the mode determination unit 201.
- the mode determination unit 201 includes the narrow-band quantization LSP input from the narrow-band LSP encoding unit 103, and the narrow-band quantization LSP output by the delay unit 131, which is quantized one processing unit time ago.
- the distance (change amount) from the LSP is calculated, and whether the mode is the stationary mode or the non-stationary mode is determined based on the calculated distance. For example, when the calculated distance is equal to or less than a preset threshold, the mode is determined to be the steady mode, and when the calculated distance exceeds the threshold, the mode is determined to be the non-stationary mode.
- the result of the determination is output to wideband LSP encoding section 107a and switching switch 203. Note that the calculated distance may be used as it is for threshold determination, or may be used for threshold determination after smoothing between frames.
- switch 203 When the determination result of mode determination section 201 is the steady mode, switch 203 outputs the transform coefficient output from smoothing section 135 to wideband LSP coding section 107a.
- the determination result of the determination unit 201 is the non-stationary mode, switching is performed so as to output the transform coefficients stored in the coefficient table to the wideband LSP coding unit 107a.
- Fig. 10 is a block diagram showing a main configuration inside wideband LSP coding section 107a.
- the LSP codebook and the weighting factor codebook are each composed of sub-codebooks for the number of modes (here, two) (LSP codebooks 222-1 and 222-2, weighting factor codebook). 223-1, 223-2), and the switching switches 224, 225 select one of the sub-codebooks based on the mode information input from the mode determining unit 201.
- the basic configuration of the scalable decoding device according to the second embodiment of the present invention is the same as that of the scalable decoding device according to the first embodiment. Therefore, detailed description will be omitted, and the following description will be given of the transform coefficient calculating section 155a and the wideband LSP decoding section 156a having different configurations. Note that the same components are denoted by the same reference numerals, and description thereof will be omitted.
- transform coefficient calculating section 155a The internal configuration of transform coefficient calculating section 155a is basically the same as that of transform coefficient calculating section 109a shown in FIG. Therefore, the input to the delay unit 131 is from the narrowband LSP decoding unit 153, the input to the delay unit 132 is from the wideband LSP decoding unit 156a, and the output of the smoothing unit 135 is It is different from the transform coefficient calculating unit 109a shown in FIG. 9 in that the conversion to the wideband LSP decoding unit 156a is performed. In addition, the numbering of the mode determination unit is 251 for the sake of convenience to distinguish it from the mode determination unit 201 on the encoding side.
- FIG. 11 is a block diagram showing a main configuration inside wideband LSP decoding section 156a.
- the LSP codebook and the weighting factor codebook are each composed of sub-codebooks for the number of modes (here, two) (LSP codebooks 262-1, 262-2, and weighting factor codebook). 263-1, 2 63-2), and the switching switches 264 and 265 select one of the sub-codebooks based on the mode information input from the mode determination unit 251.
- the continuity of the input unquantized wideband LSP or the narrowband LSP quantized in the current frame is determined, and the stationary state (inter-frame The calculated conversion coefficient is selectively used only when it is determined that the variation is small, and the conversion coefficient separately stored in the table is determined when it is determined to be unsteady (the variation between frames is large). Used. In other words, the calculated conversion coefficient and the conversion coefficient designed in advance and stored in the table are switched based on the stationarity of the LSP parameter. [0100] By employing the above configuration, it is possible to improve the prediction accuracy when predicting a wideband LSP from a narrowband LSP.
- the decoding side can determine the variation of the LSP parameter without transmitting the mode information from the encoding side. Since there is no need to transmit mode information from the encoding side, resources of the communication system are not consumed.
- a change in the quantized LSP parameter in a narrow band is observed, and the presence or absence of a change in the LSP parameter is determined (mode determination).
- mode determination the presence or absence of a change in the LSP parameter is determined.
- the narrow-band quantized LSP parameters may fluctuate.
- decoding of the current frame is performed based on the result of the past mode determination, so that in the method of Embodiment 2, if the past mode determination is incorrect, The error propagates to subsequent processing.
- the encoding side newly installs a mode determination unit that performs mode determination using wideband LSP parameters, and transmits the obtained mode determination result to the decoding side.
- the decoding side newly installs a mode decoding unit for decoding the mode determination result.
- FIG. 12 is a block diagram showing a main configuration of a scalable coding apparatus according to Embodiment 3 of the present invention. Note that this scalable coding device has the same basic configuration as the scalable coding device shown in Embodiment 1 (see FIG. 4), and the same components have the same codes. And a description thereof will be omitted.
- the mode determining unit 301 basically performs the same operation as the mode determining unit 201 (251) shown in the second embodiment. That is, the distance between the LSP parameter delayed by one processing unit time and the current LSP parameter is calculated, and the steady mode is set when the distance is equal to or less than a predetermined threshold. Judge as the steady mode.
- this embodiment differs from the second embodiment in that the input information used is a wideband LSP parameter output from LSP analysis section (for wideband) 106.
- the determination result of the mode determination unit 301 is a conversion coefficient
- the information is output to calculation section 109b and wideband LSP encoding section 107a, and the encoding information of the mode information is output to multiplexing section 112.
- the wideband LSP coding unit 107a has already been described in the second embodiment.
- mode determination section 301 determines stationary Z non-stationary using wideband LSP parameters that are not encoded information (quantized LSP parameters). It is possible to cope with a signal having a large fluctuation in
- mode determination section 301 multiplexes the obtained mode result together with other encoding parameters and transmits the result to the decoding side. In this way, since the mode information is transmitted to the decoding side, even if the decoding side makes a mistake in determining the mode information once, the next mode information is transmitted in the succeeding frame. The influence of the decision error is not propagated, and the transmission path error tolerance is improved.
- FIG. 13 is a block diagram showing a main configuration inside transform coefficient calculating section 109b.
- the transform coefficient calculating section 109b has the same basic configuration as the transform coefficient calculating section 109a of the second embodiment shown in FIG. 9, and only different points will be described below.
- the conversion coefficient calculation unit 109b does not include a mode determination unit inside, and inputs only the mode determination result from the outside. Then, conversion coefficient calculating section 109b switches the switch according to the input mode determination result. Specifically, in the steady mode, the switching switch 203 is switched so that the transform coefficient output from the smoothing unit 135 is output to the wideband LSP encoding unit 107a. In the non-stationary mode, the switching switch 203 is switched so that a transform coefficient designed in advance by offline learning or the like is output from the coefficient table 202 to the wideband LSP encoding unit 107a.
- FIG. 14 is a block diagram showing a main configuration of a scalable decoding device according to Embodiment 3 of the present invention.
- This scalable decoding device also has the same basic configuration as scalable decoding device shown in Embodiment 1 (see FIG. 7), and the same components are denoted by the same reference numerals. The description is omitted.
- the difference from the scalable decoding device shown in the first embodiment is that the scalable decoding device has a new mode decoding unit 351 and decodes the output information of the mode determining unit 301 of the scalable coding device according to the present embodiment.
- the decoded information is converted by the transform coefficient calculation unit 155b and And output to the wideband LSP decoding unit 156a.
- the transform coefficient calculating section 155b has the same basic configuration as the transform coefficient calculating section 109b on the encoding side (see FIG. 13).
- mode determination may be performed based on the conversion gain of the conversion coefficient.
- the conversion gain of this conversion coefficient indicates how close the ⁇ wideband quantization LSP / narrowband quantization LS PJ ratio '' of the previous frame is to the ⁇ input wideband LSP / narrowband quantization LSP '' ratio of the current frame. .
- a feature of this embodiment is that the mode information is not transmitted to the decoding side by the encoding side, and the mode determination is performed by the narrowband LSP encoding section on the encoding side or the narrow band LSP encoding section on the decoding side. This is done inside the department.
- FIG. 15 is a block diagram showing a main configuration of a scalable coding apparatus according to Embodiment 4 of the present invention. Note that this scalable coding device has the same basic configuration as the scalable coding device shown in Embodiment 3 (see FIG. 12), and the same components have the same reference characters. And description thereof is omitted.
- narrow-band LSP coding section 103c performs multi-mode coding
- transform coefficient calculating section 109b uses the mode information (S41). Mode switching and mode switching of the wideband LSP encoding unit 107a.
- FIG. 16 is a block diagram showing a main configuration of a scalable decoding device according to Embodiment 4 of the present invention. Note that this scalable decoding device also has the same basic configuration as scalable decoding device shown in Embodiment 3 (see FIG. 14), and the same components are denoted by the same reference numerals. And description thereof is omitted.
- narrowband LSP decoding device 153c is provided with a mode information decoding function. That is, narrowband LSP decoding section 153c performs multimode decoding, and outputs the mode information (S42) to transform coefficient calculating section 155b and wideband LSP decoding section 156a.
- the transform coefficient calculation unit 155b and the wideband LSP decoding unit 156a perform mode switching using the mode information (S42) input from the narrowband LSP decoding unit 153c.
- the mode of the wideband LSP encoding is switched using the mode information of the narrowband LSP encoded information, so that the wideband LSP encoding section can be switched without additional bits.
- the mode of the wideband LSP decoding unit or the transform coefficient unit can be switched. Also, since the mode information is transmitted, even if there is a transmission path error, it is possible to prevent the influence of the error from propagating to subsequent frames.
- mode determination is performed prior to LSP quantization, and the codebook to be searched is switched based on the mode determination result. That is, since the mode determination is performed in an open loop before quantization, the mode that minimizes the quantization error is not always selected.
- the mode determination according to the third embodiment is performed based on the LSP parameters before quantization. However, even though the LSP parameters before quantization fluctuate, the mode determination is not necessarily performed. The LSP parameters after quantization do not always fluctuate, and just because the LSP parameters before quantization are stationary does not mean that the LSP parameters after quantization are necessarily stationary.
- the mode is not determined by the open loop, but by the closed loop.
- Perform mode determination That is, if there are two or more modes in the stationary mode Z non-stationary mode, codebook search is actually performed in all modes, and the quantization error (quantization distortion) is determined based on the result. Select the mode to minimize.
- the wideband LSP coding unit a mode in which the transform coefficient is obtained and the wideband LSP is quantized, and a mode in which the wideband LSP is quantized using a predetermined fixed transform coefficient, Quantization is actually performed using both modes, and the result of quantization in the mode with the smaller quantization error is selected as the final quantization result.
- FIG. 17 is a block diagram showing a main configuration of wideband LSP encoding section 107d according to Embodiment 5 of the present invention.
- broadband LSP encoding section 107d has the same basic configuration as broadband LSP encoding section 107a shown in Embodiment 2 (see FIG. 10), and the same constituent elements include The same reference numerals are given and the description is omitted.
- the error minimizing section 121d performs a codebook search for all modes, and calculates an LSP vector and a weight coefficient vector that minimize the quantization error from the codebooks for all modes, using the LSP codebook 222-1, It also selects the respective intermediate powers of 222-2 and weight coefficient codebooks 223-1 and 223-2, encodes the corresponding index, and outputs the result to multiplexing section 112 (S11). At this time, the selected LSP vector and mode information (information indicating which mode codebook power is selected) S51 for generating the weight coefficient vector are also output to multiplexing section 112.
- FIG. 18 is a block diagram showing a main configuration of transform coefficient calculating section 109d according to Embodiment 5 of the present invention.
- transform coefficient calculating section 109d has the same basic configuration as transform coefficient calculating section 109a shown in Embodiment 2 (see FIG. 9), and the same components are denoted by the same reference numerals. And the description is omitted.
- Transform coefficient calculation section 109d switches the prediction coefficient to be used according to control signal C51 output from error minimization section 121d in wideband LSP encoding section 107d. That is, the conversion coefficient calculation unit 109d switches whether to express the quantized LSP by the force (Expression 3) represented by (Expression 2) according to the control signal C51.
- the transform coefficient calculation unit 109d actually performs quantization, and determines whether or not to perform quantization using (Equation 3) based on the quantization result. Therefore, (Equation 3) Since the mode using (Equation 3) is selected only for a frame that is expected to reliably improve performance by the quantization of, a high prediction performance can be obtained.
- the wideband Z narrowband quantized LSP parameter ratio of the previous frame is close to the wideband Z narrowband LSP parameter ratio of the current frame, and only for the frame.
- (Equation 3) quantization using (Equation 3) is performed for a frame in which the wideband Z narrowband LSP parameter ratio is determined to be steady, not for a frame in which the wideband Z narrowband LSP parameter is determined to be steady.
- error resilience can be improved. Because, according to the present embodiment, it is almost guaranteed that the quantization LSP parameter ratio in the wideband Z narrowband is steady in the section where the quantization mode according to (Equation 3) is continuously selected.
- the quantized LSP parameter ratio of the wideband Z narrowband of the frame two frames before or even earlier is not necessarily steady. Therefore, if the immediately preceding frame is incorrect, the quantized LSP parameter ratio of the wideband Z narrowband two frames before, which may not be stationary, may be used as an approximate value instead of this frame. At this time, a decoding result significantly different from the decoding result when there is no error will be obtained.
- the mode according to (Equation 2) is selected. Therefore, the prediction coding is reset at this stage, so that it is possible to prevent the error from propagating to the subsequent frame, and the error resilience is further improved.
- FIG. 19 is a block diagram showing a main configuration of a scalable coding apparatus according to Embodiment 5 of the present invention, which is provided with wideband LSP coding section 107d and transform coefficient calculating section 109d. is there.
- the signal (S11, S51) output from the wideband LSP coding unit 107d is different from the scalable coding device shown in the first to fourth embodiments.
- the configuration of the scalable decoding device according to the present embodiment is the same as that of scalable decoding device shown in Embodiment 3 (see FIG. 14), and a description thereof will not be repeated.
- Embodiments 1 to 5 positively utilize the quantization result of the previous frame to predict the current frame, so that the quantization performance can be improved. Therefore, it is particularly effective for applications having no or almost no transmission path errors.
- the error may propagate to a subsequent frame for a relatively long time.
- the wideband quantization LSP is predicted from the current narrowband quantization LSP using the relationship between the past narrowband quantization LSP and the wideband quantization LSP.
- different quantization results may be generated between the encoding device and the decoding device.
- the decoding device does not correctly perform prediction in the subsequent frame, and thus an error propagates to the subsequent frame.
- error propagation occurs in Embodiments 2 to 5 because the mode using the prediction using the past quantized LSP is continuous, and a transmission path error occurs in this continuous frame. Limited to the case.
- Embodiment 6 of the present invention even when a transmission path error occurs, the effect of the transmission path error is reduced by applying the method of incorporating the forgetting element in Embodiment 5. That is, in the calculation of the wideband quantization LSP of the current frame, the adaptive prediction mode component using the quantization result of the previous frame and the quantization result of the past frame are used. No fixed prediction mode component (fixed value) is used in combination. As a result, even if a transmission path error occurs in a frame in the adaptive prediction mode, the adaptive prediction component is forgotten with the lapse of time by using a fixed value, and the internal state of the encoding device and the decoding device becomes time consuming. It can be approached as time passes, and the effect of transmission path errors is reduced.
- the internal states of the encoding and decoding devices are both reset in a frame switched to the fixed prediction mode, and the effect of transmission path errors is reduced. Propagation to subsequent frames is avoided, improving error resilience
- FIG. 20 is a block diagram showing a main configuration of wideband LSP encoding section 107e according to the present embodiment.
- FIG. 21 is a block diagram showing a main configuration of transform coefficient calculating section 109e according to the present embodiment.
- the wideband LSP encoding unit 107e and the transform coefficient calculating unit 109e in Embodiment 5 are different from the wideband LSP encoding unit 107d (see FIG. 17) and the transform coefficient calculating unit 109d (see FIG. 18). It is used for Therefore, in the present embodiment, only the wideband LSP coding unit 107e and the transform coefficient calculating unit 109e will be described for the scalable encoding device and the scalable decoding device.
- the constituent elements of wideband LSP coding section 107e and transform coefficient calculating section 109e exhibit the same functions as those of wideband LSP coding section 107d and transform coefficient calculating section 109d.
- the same reference numerals are given to the constituent elements, and the description is omitted.
- amplifier 126-1 receives the LSP parameters input from narrowband LSP encoding section 103, and inputs the LSP parameters from coefficient table 2202-2 in transform coefficient calculating section 109e. The multiplication result is multiplied, and the result of the multiplication is output to the amplifier 125-1.
- the amplifier 126-2 performs a smoothing operation on the LSP parameter input from the narrowband LSP coding section 103 in the steady mode (adaptive prediction mode).
- the multiplication result is multiplied by the conversion coefficient output from the dangling unit 135, and is multiplied by the conversion coefficient stored in the coefficient table 202-1 in the non-stationary mode (fixed prediction mode). Output to amplifier 125-2. Therefore, the amplifiers 126-1 and 126-2 constitute the multiplication means in the present invention.
- amplifiers 125-1 and 125-2 respectively convert the wideband LSP vectors input from amplifiers 126-1 and 126-2, that is, the narrowband quantized LSP.
- the vector of the wideband LSP obtained by the conversion is multiplied by a predetermined weighting factor output from the weighting factor codebooks 223-1 and 223-2, and the multiplication result is output to the adder 128. Then, the adder 128 calculates the sum of the LSP vector output from the amplifier 124, the amplifiers 125-1 and 125-2, and outputs the addition result to the adder 127.
- the fixed transform coefficient is always multiplied to the narrow-band quantized LSP of the amplifier 126-1 and the amplifiers 125-1, 125-2. That is, the signal input to the adder 128 via the amplifiers 126-1 and 125-1 is a signal transmitted from the narrow-band quantized LSP input from the narrow-band LSP coding unit 103. Unless affected by errors, it is not affected by transmission path errors that occurred in the past.
- the amplifier 126-2 also multiplies the fixed transform coefficient by the narrowband quantization LSP, so that information is not exchanged between the previous and next frames, and the transmission path error that occurred in the past has occurred. Does not propagate to subsequent frames. As a result, according to the present embodiment, even when a transmission line error occurs, it is possible to suppress the influence of the error from propagating to the subsequent frame and improve error resilience.
- two coefficient tables 202-1 and 202-2 are arranged in transform coefficient calculating section 109e, and two amplifiers 126-1 are provided in wideband LSP encoding section 107e correspondingly. , 126-2 are arranged, but the present invention is not limited to this case, and more coefficient tables 202 and amplifiers 126 may be arranged.
- the present invention is not limited to this case. Only one coefficient table 202 is arranged in the transform coefficient calculating section 109e, and the same transform coefficient is input from the coefficient table 202 to the two amplifiers 126-1 and 126-2 in the wideband LSP coding section 107e. It may be done.
- the present invention is not limited to this case.
- the output of the divider 133 may be directly connected to the switching switch 203 without disposing the smoothing unit 135. With this configuration, when the switch 203 is switched to the coefficient table 202-1, the propagation of the transmission path error can be completely reset.
- transform coefficient calculating section 109e shown in FIG. 21 can be used instead of transform coefficient calculating section 155b in the scalable decoding device (see FIG. 14) shown in the third embodiment.
- the configuration ratio of the adaptive prediction mode component is reduced.
- the weighting factor is designed to be low (for example, 50% or less), while the adaptive prediction mode component is used when wideband quantization LSP is predicted based on the high frequency component of the voice signal. If the weighting factor is designed so that the composition ratio of the components becomes high (for example, 50% or more), error resilience and quantization performance in subjective quality can be harmonized.
- Embodiment 7 of the present invention differs from Embodiment 6 in that the ratio between the fixed prediction mode component and the adaptive prediction mode component in the prediction of the wideband quantized LSP is determined for each frame based on the error sensitivity of the narrowband quantized LSP.
- Decide adaptively That is, in the sixth embodiment, the weighting factors output from weighting factor codebooks 223-1 and 223-2 are the default values. However, in the present embodiment, the weighting factors selected in the stationary mode are set.
- the codebook 223-1 is sequentially updated with the weight coefficient calculated from the narrowband quantization LSP of the current frame.
- FIG. 22 is a block diagram showing a main configuration of wideband LSP encoding section 107f according to the present embodiment.
- the wideband LSP encoding unit 107f is used in Embodiment 6 instead of the wideband LSP encoding unit 107e (see FIG. 20). Therefore, in the present embodiment, the scalable coding apparatus will be described only for wideband LSP coding section 107f. Further, in the present embodiment, with respect to the constituent elements of wideband LSP encoding section 107f, the same elements as those of broadband LSP encoding section 107e are denoted by the same reference numerals, and the same reference numerals are assigned thereto. Description is omitted.
- Broadband LSP encoding unit 107f is different from wideband LSP encoding unit 107e shown in Embodiment 6, in that it further includes a weighting factor calculator 2201.
- the weighting factor calculator 2201 performs “weighting according to error sensitivity” for each frame. For example, based on the narrowband quantized LSP input from the narrowband LSP encoding unit 103, for example, “R. Salami et al, "Design and Description of C-ACELP: A Toll Quality 8 kb / s Speech Coder, IEEE Trans, on Speech and Audio Process., vol. 6, no.2, pp.116-130, March 1998.” Equation (9) of “K. Paliwal and BS Atal,” Efficient Vector Quantization of LPC
- the weighting factor for weighting factor codebook 223-1 is calculated using the calculated weighting. Then, the weighting factor calculator 2201 sequentially updates the contents of the weighting factor codebook 223-1 with the weighting factor calculated for each frame. Also, in the present embodiment, the weighting factor calculator 2201 sets the ratio of the fixed prediction mode component in the prediction of the wideband quantization LSP to be higher as the calculated weight is larger (the error sensitivity is larger) (for example, The ratio of the fixed prediction mode component is set to 50% or more), while the smaller the weight, the better the quantization performance. Practice. Then, the weighting factor calculator 2201 updates the content of the weighting factor codebook 223-1 so as to approach the optimum composition ratio (generally, the ratio of the adaptive prediction mode component increases) obtained by this learning.
- weighting factor calculator 2201 selects weighting factor codebook 223-1 selected in the steady mode based on the error sensitivity of the narrowband quantized LSP of the current frame. Optimizes the ratio of fixed prediction mode components to adaptive prediction mode components in LSP prediction to maximize the quantization performance while suppressing the degradation of error resilience in order to sequentially update the contents of can do. For example, if the weighting factor calculator 2221 sets the ratio of the fixed prediction mode component to 100% in the prediction of the wideband quantized LSP, that is, it is connected to the amplifier 126-1, which multiplies the narrowband quantized LSP by the fixed transform coefficient.
- the error resilience can be improved.
- the weighting factor calculator 2201 sets the ratio of the adaptive prediction mode component to 100%, the quantization performance can be improved instead of deteriorating the error resilience. If the weighting factor calculator 2201 sets the ratio between the fixed prediction mode component and the adaptive prediction mode component to, for example, 50% or 50%, an effect of improving the quantization performance derived from the adaptive prediction mode component occurs. In addition, since the effect of the transmission path error due to the fixed prediction mode component is diluted according to the number of calculations in the wideband LSP coding section 107f, the influence of the transmission path error can be made difficult to propagate to subsequent frames.
- the content of weighting factor codebook 223-1 is sequentially updated for each frame by weighting factor calculator 2201, so that the error sensitivity of narrowband quantized LSP is reduced for each frame.
- weighting factor calculator 2201 Even when the state transitions to, adaptively harmonize the quantization performance improvement effect derived from the adaptive prediction mode component and the error resilience degradation suppression effect derived from the fixed prediction mode component, which are in a trade-off relationship with each other. Can be.
- the weighting factor calculator 2201 uses the fixed prediction for the low frequency component. It is preferable to determine the weighting factor so that the ratio of the mode component is high, while the ratio of the adaptive prediction mode component is high for the high frequency component.
- weight coefficient multiplier 2201 calculates a weight coefficient for weight coefficient codebook 223-1 based on the error sensitivity of narrowband quantized LSP. The present invention is not limited to this case.
- the weighting factor multiplier 2201 may calculate the weighting factor for the weighting factor codebook 223-1 as well as the offline learning data power.
- the scalable encoding device and the scalable decoding device according to the present invention are not limited to the above embodiments, and can be implemented with various modifications. For example, the embodiments can be combined as appropriate.
- the scalable coding apparatus and the scalable decoding apparatus according to the present invention can also be mounted on a communication terminal apparatus and a base station apparatus in a mobile communication system.
- a communication terminal device and a base station device having the same can be provided.
- the present invention is also applicable to the force ISP (Immittance Spectrum Pairs) parameter described in the case where the LSP parameter is encoded and Z-decoded.
- LSF Line Spectral Frequency
- LSF Line Spectral Frequency
- the ratio of the LSP parameter of the wideband Z narrowband one frame before is used as the narrowband-one-wideband transform coefficient in the current frame.
- the ratio of the quantized LSP parameter of the current frame wideband Z narrowband is estimated or extrapolated, and the obtained value is used as the narrowband wideband transform coefficient of the current frame. Good to use!
- three or more force modes may be described as an example in which the mode is composed of two modes, the steady mode and the non-stationary mode.
- band scalable coding when there are two layers of band scalable coding, that is, a band scalable coding or band having two frequency band powers of a narrow band and a wide band.
- scalable decoding has been described as an example, the present invention can be applied to band scalable decoding or band scalable decoding composed of three or more frequency bands (layers).
- the power described in the case of configuring the present invention by hardware as an example can also be realized by software.
- the algorithm of the scalable encoding method or the scalable decoding method according to the present invention is described in a programming language, and this program is stored in a memory and executed by information processing means. Functions similar to those of the scalable decoding device or the scalable decoding device of the invention can be realized.
- Each functional block used in the description of each of the above embodiments is typically implemented as an LSI which is an integrated circuit. These may be individually integrated into one chip, or may be integrated into one chip so as to include a part or all of them.
- an LSI it may be referred to as an IC, a system LSI, a super LSI, or a unoratora LSI.
- the method of circuit integration is not limited to LSI, and may be realized by a dedicated circuit or a general-purpose processor. After the LSI is manufactured, an FPGA (Field Programmable Gate Array) that can be programmed or a reconfigurable processor that can reconfigure the connection or setting of circuit cells inside the LSI may be used.
- FPGA Field Programmable Gate Array
- reconfigurable processor that can reconfigure the connection or setting of circuit cells inside the LSI may be used.
- a scalable encoding apparatus, a scalable decoding apparatus, a scalable encoding method, and a scalable decoding method according to the present invention are used in a communication apparatus in a mobile communication system, a packet communication system using an Internet protocol, or the like. It can be applied to applications.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Computer Networks & Wireless Communication (AREA)
- Power Engineering (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
BRPI0510303-7A BRPI0510303A (pt) | 2004-04-27 | 2005-04-19 | dispositivo de codificação escalável, dispositivo de decodificação escalável, e seu método |
EP05734658A EP1755109B1 (en) | 2004-04-27 | 2005-04-19 | Scalable encoding and decoding apparatuses and methods |
US11/587,379 US8271272B2 (en) | 2004-04-27 | 2005-04-19 | Scalable encoding device, scalable decoding device, and method thereof |
JP2006513512A JP4546464B2 (ja) | 2004-04-27 | 2005-04-19 | スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 |
CN2005800131755A CN1947174B (zh) | 2004-04-27 | 2005-04-19 | 可扩展编码装置、可扩展解码装置、可扩展编码方法以及可扩展解码方法 |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004-132113 | 2004-04-27 | ||
JP2004132113 | 2004-04-27 | ||
JP2004-259036 | 2004-09-06 | ||
JP2004259036 | 2004-09-06 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2005112005A1 true WO2005112005A1 (ja) | 2005-11-24 |
Family
ID=35394383
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/007438 WO2005112005A1 (ja) | 2004-04-27 | 2005-04-19 | スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 |
Country Status (8)
Country | Link |
---|---|
US (1) | US8271272B2 (ja) |
EP (1) | EP1755109B1 (ja) |
JP (1) | JP4546464B2 (ja) |
KR (1) | KR20070009644A (ja) |
CN (1) | CN1947174B (ja) |
BR (1) | BRPI0510303A (ja) |
RU (1) | RU2006137841A (ja) |
WO (1) | WO2005112005A1 (ja) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1949693A1 (en) * | 2005-10-12 | 2008-07-30 | Samsung Electronics Co., Ltd. | Method and apparatus for processing/transmitting bit-stream, and method and apparatus for receiving/processing bit-stream |
WO2008120438A1 (ja) * | 2007-03-02 | 2008-10-09 | Panasonic Corporation | ポストフィルタ、復号装置およびポストフィルタ処理方法 |
JP2010520504A (ja) * | 2007-03-02 | 2010-06-10 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | レイヤード・コーデックのためのポストフィルタ |
WO2010101190A1 (ja) * | 2009-03-06 | 2010-09-10 | 株式会社エヌ・ティ・ティ・ドコモ | 音信号符号化方法、音信号復号方法、符号化装置、復号装置、音信号処理システム、音信号符号化プログラム、及び、音信号復号プログラム |
JP2012088729A (ja) * | 2011-12-06 | 2012-05-10 | Ntt Docomo Inc | 音信号符号化方法、音信号復号方法、符号化装置、復号装置、音信号処理システム、音信号符号化プログラム、及び、音信号復号プログラム |
Families Citing this family (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
BRPI0510014B1 (pt) * | 2004-05-14 | 2019-03-26 | Panasonic Intellectual Property Corporation Of America | Dispositivo de codificação, dispositivo de decodificação e método do mesmo |
EP1785985B1 (en) * | 2004-09-06 | 2008-08-27 | Matsushita Electric Industrial Co., Ltd. | Scalable encoding device and scalable encoding method |
KR20100006492A (ko) * | 2008-07-09 | 2010-01-19 | 삼성전자주식회사 | 부호화 방식 결정 방법 및 장치 |
CA2972808C (en) * | 2008-07-10 | 2018-12-18 | Voiceage Corporation | Multi-reference lpc filter quantization and inverse quantization device and method |
US20120053949A1 (en) * | 2009-05-29 | 2012-03-01 | Nippon Telegraph And Telephone Corp. | Encoding device, decoding device, encoding method, decoding method and program therefor |
US8964966B2 (en) * | 2010-09-15 | 2015-02-24 | Avaya Inc. | Multi-microphone system to support bandpass filtering for analog-to-digital conversions at different data rates |
MY185091A (en) | 2011-04-21 | 2021-04-30 | Samsung Electronics Co Ltd | Method of quantizing linear predictive coding coefficients, sound encoding method, method of de-quantizing linear predictive coding coefficients, sound decoding method, and recording medium |
CN105244034B (zh) * | 2011-04-21 | 2019-08-13 | 三星电子株式会社 | 针对语音信号或音频信号的量化方法以及解码方法和设备 |
US9117455B2 (en) * | 2011-07-29 | 2015-08-25 | Dts Llc | Adaptive voice intelligibility processor |
EP3279895B1 (en) * | 2011-11-02 | 2019-07-10 | Telefonaktiebolaget LM Ericsson (publ) | Audio encoding based on an efficient representation of auto-regressive coefficients |
EP2830062B1 (en) | 2012-03-21 | 2019-11-20 | Samsung Electronics Co., Ltd. | Method and apparatus for high-frequency encoding/decoding for bandwidth extension |
US20140307808A1 (en) * | 2012-12-14 | 2014-10-16 | Boris Kudryashov | Protection against packet loss during transmitting video information |
EP3742440B1 (en) * | 2013-04-05 | 2024-07-31 | Dolby International AB | Audio decoder for interleaved waveform coding |
CN104143336B (zh) * | 2013-05-29 | 2015-12-02 | 腾讯科技(深圳)有限公司 | 一种获取语音信号的平滑谱的方法和装置 |
EP3399522B1 (en) * | 2013-07-18 | 2019-09-11 | Nippon Telegraph and Telephone Corporation | Linear prediction analysis device, method, program, and storage medium |
EP2830061A1 (en) | 2013-07-22 | 2015-01-28 | Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
JP6270992B2 (ja) * | 2014-04-24 | 2018-01-31 | 日本電信電話株式会社 | 周波数領域パラメータ列生成方法、周波数領域パラメータ列生成装置、プログラム及び記録媒体 |
EP3139383B1 (en) * | 2014-05-01 | 2019-09-25 | Nippon Telegraph and Telephone Corporation | Coding and decoding of a sound signal |
JP6270993B2 (ja) * | 2014-05-01 | 2018-01-31 | 日本電信電話株式会社 | 符号化装置、及びその方法、プログラム、記録媒体 |
CN106486129B (zh) * | 2014-06-27 | 2019-10-25 | 华为技术有限公司 | 一种音频编码方法和装置 |
WO2016142002A1 (en) | 2015-03-09 | 2016-09-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder, method for encoding an audio signal and method for decoding an encoded audio signal |
US10824917B2 (en) | 2018-12-03 | 2020-11-03 | Bank Of America Corporation | Transformation of electronic documents by low-resolution intelligent up-sampling |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3237089B2 (ja) * | 1994-07-28 | 2001-12-10 | 株式会社日立製作所 | 音響信号符号化復号方法 |
JP2891193B2 (ja) * | 1996-08-16 | 1999-05-17 | 日本電気株式会社 | 広帯域音声スペクトル係数量子化装置 |
TW326070B (en) * | 1996-12-19 | 1998-02-01 | Holtek Microelectronics Inc | The estimation method of the impulse gain for coding vocoder |
SE512719C2 (sv) * | 1997-06-10 | 2000-05-02 | Lars Gustaf Liljeryd | En metod och anordning för reduktion av dataflöde baserad på harmonisk bandbreddsexpansion |
JP3134817B2 (ja) | 1997-07-11 | 2001-02-13 | 日本電気株式会社 | 音声符号化復号装置 |
CA2429832C (en) * | 2000-11-30 | 2011-05-17 | Matsushita Electric Industrial Co., Ltd. | Lpc vector quantization apparatus |
JP2003241799A (ja) | 2002-02-15 | 2003-08-29 | Nippon Telegr & Teleph Corp <Ntt> | 音響符号化方法、復号化方法、符号化装置、復号化装置及び符号化プログラム、復号化プログラム |
KR100503415B1 (ko) * | 2002-12-09 | 2005-07-22 | 한국전자통신연구원 | 대역폭 확장을 이용한 celp 방식 코덱간의 상호부호화 장치 및 그 방법 |
-
2005
- 2005-04-19 KR KR1020067022317A patent/KR20070009644A/ko not_active Application Discontinuation
- 2005-04-19 CN CN2005800131755A patent/CN1947174B/zh not_active Expired - Fee Related
- 2005-04-19 US US11/587,379 patent/US8271272B2/en active Active
- 2005-04-19 EP EP05734658A patent/EP1755109B1/en not_active Not-in-force
- 2005-04-19 BR BRPI0510303-7A patent/BRPI0510303A/pt not_active Application Discontinuation
- 2005-04-19 JP JP2006513512A patent/JP4546464B2/ja not_active Expired - Fee Related
- 2005-04-19 RU RU2006137841/09A patent/RU2006137841A/ru not_active Application Discontinuation
- 2005-04-19 WO PCT/JP2005/007438 patent/WO2005112005A1/ja not_active Application Discontinuation
Non-Patent Citations (3)
Title |
---|
EHARA H. ET AL.: "Kyotaiiki - Kotaiiki Yosoku Model ni Motozuku Taiiki Scalable LSP Ryoshika.", DAI 3 KAI FORUM ON INFORMATION TECHNOLOGY KOEN RONBUNSHU, 20 August 2004 (2004-08-20), pages 139 - 141, XP002992976 * |
FURUI, TASAKI, KODERA, WATANABE: "Vector Ryoshika to Joho Asshuku", KORONASHA, 10 November 1998 (1998-11-10), pages 694 - 703, XP002903458 * |
KOISHIDA X. ET AL.: "Enhancing MPEG-4CELP by jointly optimized inter/intra-frame LSP predictors.", PROC. IEEE WORKSHOP ON SPEECH CODING, 2000, pages 90 - 92, XP010520051 * |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8212693B2 (en) | 2005-10-12 | 2012-07-03 | Samsung Electronics Co., Ltd. | Bit-stream processing/transmitting and/or receiving/processing method, medium, and apparatus |
EP1949693A4 (en) * | 2005-10-12 | 2010-05-19 | Samsung Electronics Co Ltd | METHOD AND DEVICE FOR PROCESSING / SENDING A BITSTROM AND METHOD AND DEVICE FOR RECEIVING / PROCESSING A BITSTROM |
EP1949693A1 (en) * | 2005-10-12 | 2008-07-30 | Samsung Electronics Co., Ltd. | Method and apparatus for processing/transmitting bit-stream, and method and apparatus for receiving/processing bit-stream |
WO2008120438A1 (ja) * | 2007-03-02 | 2008-10-09 | Panasonic Corporation | ポストフィルタ、復号装置およびポストフィルタ処理方法 |
JP2010520504A (ja) * | 2007-03-02 | 2010-06-10 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | レイヤード・コーデックのためのポストフィルタ |
US8599981B2 (en) | 2007-03-02 | 2013-12-03 | Panasonic Corporation | Post-filter, decoding device, and post-filter processing method |
CN102737642A (zh) * | 2009-03-06 | 2012-10-17 | 株式会社Ntt都科摩 | 声音信号编解码方法、编解码装置和声音信号处理系统 |
CN102341851B (zh) * | 2009-03-06 | 2013-09-04 | 株式会社Ntt都科摩 | 声音信号编码方法、声音信号解码方法、编码装置、解码装置、声音信号处理系统 |
CN102341851A (zh) * | 2009-03-06 | 2012-02-01 | 株式会社Ntt都科摩 | 声音信号编码方法、声音信号解码方法、编码装置、解码装置、声音信号处理系统、声音信号编码程序以及声音信号解码程序 |
CN102737641A (zh) * | 2009-03-06 | 2012-10-17 | 株式会社Ntt都科摩 | 声音信号编解码方法、编解码装置和声音信号处理系统 |
JP2010210680A (ja) * | 2009-03-06 | 2010-09-24 | Ntt Docomo Inc | 音信号符号化方法、音信号復号方法、符号化装置、復号装置、音信号処理システム、音信号符号化プログラム、及び、音信号復号プログラム |
TWI385649B (zh) * | 2009-03-06 | 2013-02-11 | Ntt Docomo Inc | An audio signal coding method, an audio signal decoding method, an encoding apparatus, a decoding apparatus, an audio signal processing system, an audio signal coding program recording medium, and an audio signal decoding program recording medium |
RU2482554C1 (ru) * | 2009-03-06 | 2013-05-20 | Нтт Докомо, Инк. | Способ кодирования аудиосигнала, способ декодирования аудиосигнала, устройство кодирования, устройство декодирования, система обработки аудиосигнала, программа кодирования аудиосигнала и программа декодирования аудиосигнала |
US9214161B2 (en) | 2009-03-06 | 2015-12-15 | Ntt Docomo, Inc. | Audio signal encoding method, audio signal decoding method, encoding device, decoding device, audio signal processing system, audio signal encoding program, and audio signal decoding program |
RU2493619C1 (ru) * | 2009-03-06 | 2013-09-20 | Нтт Докомо, Инк. | Способ кодирования аудиосигнала, способ декодирования аудиосигнала, устройство кодирования, устройство декодирования, система обработки аудиосигнала, программа кодирования аудиосигнала и программа декодирования аудиосигнала |
RU2493620C1 (ru) * | 2009-03-06 | 2013-09-20 | Нтт Докомо, Инк. | Способ кодирования аудиосигнала, способ декодирования аудиосигнала, устройство кодирования, устройство декодирования, система обработки аудиосигнала, программа кодирования аудиосигнала и программа декодирования аудиосигнала |
WO2010101190A1 (ja) * | 2009-03-06 | 2010-09-10 | 株式会社エヌ・ティ・ティ・ドコモ | 音信号符号化方法、音信号復号方法、符号化装置、復号装置、音信号処理システム、音信号符号化プログラム、及び、音信号復号プログラム |
US8666754B2 (en) | 2009-03-06 | 2014-03-04 | Ntt Docomo, Inc. | Audio signal encoding method, audio signal decoding method, encoding device, decoding device, audio signal processing system, audio signal encoding program, and audio signal decoding program |
US8751245B2 (en) | 2009-03-06 | 2014-06-10 | Ntt Docomo, Inc | Audio signal encoding method, audio signal decoding method, encoding device, decoding device, audio signal processing system, audio signal encoding program, and audio signal decoding program |
CN102737641B (zh) * | 2009-03-06 | 2014-07-02 | 株式会社Ntt都科摩 | 声音信号编解码方法、编解码装置和声音信号处理系统 |
CN102737642B (zh) * | 2009-03-06 | 2014-07-30 | 株式会社Ntt都科摩 | 声音信号编解码方法、编解码装置和声音信号处理系统 |
JP2012088729A (ja) * | 2011-12-06 | 2012-05-10 | Ntt Docomo Inc | 音信号符号化方法、音信号復号方法、符号化装置、復号装置、音信号処理システム、音信号符号化プログラム、及び、音信号復号プログラム |
Also Published As
Publication number | Publication date |
---|---|
RU2006137841A (ru) | 2008-05-10 |
BRPI0510303A (pt) | 2007-10-02 |
EP1755109A4 (en) | 2008-04-02 |
US20070223577A1 (en) | 2007-09-27 |
KR20070009644A (ko) | 2007-01-18 |
JPWO2005112005A1 (ja) | 2008-03-27 |
EP1755109B1 (en) | 2012-08-15 |
JP4546464B2 (ja) | 2010-09-15 |
US8271272B2 (en) | 2012-09-18 |
CN1947174A (zh) | 2007-04-11 |
EP1755109A1 (en) | 2007-02-21 |
CN1947174B (zh) | 2012-03-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4546464B2 (ja) | スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 | |
RU2418324C2 (ru) | Поддиапазонный речевой кодекс с многокаскадными таблицами кодирования и избыточным кодированием | |
KR101344174B1 (ko) | 오디오 신호 처리 방법 및 오디오 디코더 장치 | |
JP6173288B2 (ja) | マルチモードオーディオコーデックおよびそれに適応されるcelp符号化 | |
JP5143193B2 (ja) | スペクトル包絡情報量子化装置、スペクトル包絡情報復号装置、スペクトル包絡情報量子化方法及びスペクトル包絡情報復号方法 | |
JP5270025B2 (ja) | パラメータ復号装置およびパラメータ復号方法 | |
JP4394578B2 (ja) | 可変ビットレート通話符号化における線形予測パラメータの強力な予測ベクトル量子化方法と装置 | |
JP7209032B2 (ja) | 音声符号化装置および音声符号化方法 | |
JPH09127996A (ja) | 音声復号化方法及び装置 | |
Bouzid et al. | Switched split vector quantizer applied for encoding the LPC parameters of the 2.4 Kbits/s MELP speech coder |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006513512 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007223577 Country of ref document: US Ref document number: 11587379 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005734658 Country of ref document: EP Ref document number: 1020067022317 Country of ref document: KR Ref document number: 2006137841 Country of ref document: RU Ref document number: 200580013175.5 Country of ref document: CN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: DE |
|
WWP | Wipo information: published in national office |
Ref document number: 1020067022317 Country of ref document: KR |
|
WWP | Wipo information: published in national office |
Ref document number: 2005734658 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 11587379 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: PI0510303 Country of ref document: BR |