WO2006049205A1 - Scalable decoding device and scalable encoding device - Google Patents
- Publication number
- WO2006049205A1 (PCT/JP2005/020201)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- spectrum
- decoding
- frequency band
- unit
- information
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/038—Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L27/00—Modulated-carrier systems
- H04L27/02—Amplitude-modulated carrier systems, e.g. using on-off keying; Single sideband or vestigial sideband modulation
- H04L27/06—Demodulator circuits; Receiver circuits
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
Definitions
- The present invention relates to a scalable decoding device and a scalable encoding device used when speech signals and audio signals are communicated in a mobile communication system or in a packet communication system using the Internet Protocol.
- The band scalable speech coding scheme encodes speech signals hierarchically, and the quality of the decoded signal improves as the number of coding layers increases. Since the bit rate can be varied by increasing or decreasing the number of coding layers, the transmission line capacity can be used effectively.
- On the decoder side, it is sufficient to receive at least the encoded data of the base layer, so loss of the encoded information of the additional layers on the transmission line can be tolerated to a certain extent. The scheme is therefore highly resistant to transmission path errors.
- The frequency band of the speech signal to be encoded widens as the number of coding layers increases.
- A conventional telephone-band speech encoding method is used for the base layer (core layer).
- The additional layers are configured so that wideband speech, such as the 7 kHz band, can be encoded.
- The band scalable speech coding scheme can be used for both telephone-band voice service terminals and high-quality wideband voice service terminals, and can also handle multipoint communication that includes both kinds of terminal.
- Because the encoded information is hierarchical, error tolerance can be increased by devising the transmission method, and the bit rate is easy to control on the encoder side or on the transmission path. For this reason, the band scalable speech coding scheme is attracting attention as a future speech coding scheme for communication.
- the MDCT coefficient is coded using a scale factor and fine structure information for each band.
- the scale factor is Huffman encoded and the fine structure is vector quantized.
- the auditory importance of each band is calculated using the decoding result of the scale factor, and bit allocation to each band is determined.
- The bandwidth of each band is unequal, and is set in advance so that the higher the band, the wider it is.
- transmission information is classified into the following four groups.
- the decoded signal of the core codec is output.
- <Case 3> When B information is received in addition to A information, a high-frequency component is generated from the decoded signal of the core codec, producing a decoded signal with a wider band than the decoded signal of the core codec.
- The decoded B information is used to generate the high-frequency spectrum shape. In voiced frames, mirroring is performed in such a way that the harmonic structure does not collapse. In unvoiced frames, the high-frequency component is generated using random noise.
- Non-Patent Document 1: B. Kovesi et al., "A scalable speech and audio coding scheme with continuous bitrate flexibility," in Proc. IEEE ICASSP 2004, pp. I-273 to I-276
- In Non-Patent Document 1, the high-frequency component is generated by mirroring. Because the mirroring is performed so as not to break the harmonic structure, the harmonic structure is maintained. However, the low-frequency harmonic structure then appears as a mirror image in the high-frequency range. In general, the harmonic structure of a voiced signal weakens toward higher frequencies, so the high range often does not show as pronounced a harmonic structure as the low range. In other words, even if the valleys between harmonics are deep in the low-frequency range, they may be shallow in the high-frequency range, and in some cases the harmonic structure itself may not be clear. In the above prior art, therefore, an excessive harmonic structure tends to appear in the high-frequency component, which degrades the quality of the decoded speech signal.
- An object of the present invention is to obtain a high-quality decoded signal with little degradation of the high-frequency spectrum, even when a speech (audio) signal is decoded by generating the high-frequency spectrum from the low-frequency spectrum.
- The scalable decoding device of the present invention includes first decoding means that decodes encoded information of a low frequency band to obtain a decoded signal of the low frequency band, and second decoding means that obtains a decoded signal of a high frequency band from the decoded signal of the low frequency band and encoded information of the high frequency band.
- The second decoding means includes conversion means that converts the decoded signal of the low frequency band to obtain a spectrum of the low frequency band, adjustment means that adjusts the amplitude of the spectrum of the low frequency band, and generation means that artificially generates a spectrum of the high frequency band using the amplitude-adjusted spectrum of the low frequency band and the encoded information of the high frequency band.
- FIG. 1 is a block diagram showing a configuration of a scalable decoding device according to Embodiment 1 of the present invention.
- FIG. 2 is a block diagram showing a configuration of a scalable encoding device according to Embodiment 1 of the present invention.
- FIG. 3 is a block diagram showing a configuration of a second layer decoding unit according to Embodiment 1 of the present invention.
- FIG. 4 is a block diagram showing a configuration of a second layer encoding unit according to Embodiment 1 of the present invention.
- FIG. 5 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 6 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 7 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 8 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 9 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 10 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 11 is a schematic diagram showing how a high-frequency component is generated in the high-frequency spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 12 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 13 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 1 of the present invention.
- FIG. 14 is a block diagram showing a configuration of a second layer decoding unit according to Embodiment 2 of the present invention.
- FIG. 15 is a block diagram showing a configuration of a second layer encoding unit according to Embodiment 2 of the present invention.
- FIG. 16 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 2 of the present invention.
- FIG. 17 is a block diagram showing a configuration of a spectrum decoding unit according to Embodiment 2 of the present invention.
- FIG. 18 is a block diagram showing a configuration of a first spectrum encoding unit according to Embodiment 2 of the present invention.
- FIG. 19 is a block diagram showing a configuration of an extended band decoding unit according to Embodiment 2 of the present invention.
- FIG. 20 is a block diagram showing the configuration of the extended band decoding unit according to Embodiment 2 of the present invention.
- FIG. 21 is a block diagram showing a configuration of an extended band decoding unit according to Embodiment 2 of the present invention.
- FIG. 22 is a block diagram showing a configuration of an extended band decoding unit according to Embodiment 2 of the present invention.
- FIG. 23 is a schematic diagram showing how a high-frequency component is generated in the extended band decoding unit according to Embodiment 2 of the present invention.
- FIG. 24 is a block diagram showing a configuration of an extended band encoding unit according to Embodiment 2 of the present invention.
- FIG. 25 is a schematic diagram showing the contents of the bitstream received by the separation unit of the scalable decoding device according to Embodiment 2 of the present invention.
- FIG. 26 is a block diagram showing a configuration of an extended band decoding unit according to the third embodiment of the present invention.
- FIG. 1 is a block diagram showing a configuration of a scalable decoding apparatus 100 that forms, for example, a band scalable audio (acoustic) signal decoding apparatus.
- the scalable decoding device 100 includes a separation unit 101, a first layer decoding unit 102, and a second layer decoding unit 103.
- Separating section 101 receives a bitstream transmitted from a scalable encoding device, which will be described later, separates it into first layer encoding parameters and second layer encoding parameters, and outputs them to first layer decoding unit 102 and second layer decoding unit 103, respectively.
- First layer decoding unit 102 decodes the first layer encoding parameters input from separating section 101 and outputs a first layer decoded signal. This first layer decoded signal is also output to second layer decoding section 103.
- Second layer decoding section 103 decodes the second layer encoding parameters input from separating section 101, using the first layer decoded signal input from first layer decoding section 102, and outputs a second layer decoded signal.
- FIG. 2 shows an example of the configuration of a scalable encoding apparatus 200 corresponding to the scalable decoding apparatus 100 of FIG. 1.
- First layer encoding unit 201 encodes the input speech signal (original signal) and outputs the obtained encoding parameters to first layer decoding unit 202 and multiplexing unit 203.
- First layer encoding unit 201 realizes band scalability between the first layer and the second layer by performing down-sampling, low-pass filtering, and the like when encoding.
- First layer decoding unit 202 generates a first layer decoded signal from the encoding parameters input from first layer encoding unit 201 and outputs it to second layer encoding unit 204.
- Second layer encoding unit 204 encodes the input speech signal (original signal) using the first layer decoded signal input from first layer decoding unit 202, and outputs the obtained encoding parameters to multiplexing unit 203.
- When encoding, second layer encoding unit 204 performs up-sampling of the first layer decoded signal, in accordance with the processing (down-sampling or low-pass filtering) performed by first layer encoding unit 201, and phase adjustment to align the phase of the first layer decoded signal with that of the input speech signal.
- Multiplexing unit 203 multiplexes the encoding parameters input from first layer encoding unit 201 and those input from second layer encoding unit 204, and outputs the result as a bitstream.
- FIG. 3 is a block diagram showing the configuration of second layer decoding section 103.
- Second layer decoding section 103 includes separating section 301, scaling coefficient decoding section 302, fine spectrum decoding section 303, frequency domain conversion section 304, spectrum decoding section 305, and time domain conversion section 306.
- Separating section 301 separates the input second layer encoding parameters into encoding parameters representing the scaling coefficients (scaling coefficient parameters) and encoding parameters representing the spectral fine structure (fine spectrum parameters), and outputs them to scaling coefficient decoding unit 302 and fine spectrum decoding unit 303, respectively.
- Scaling coefficient decoding unit 302 decodes the input scaling coefficient parameters to obtain low-frequency and high-frequency scaling coefficients, and outputs these decoding scaling coefficients to both spectrum decoding unit 305 and fine spectrum decoding unit 303.
- Fine spectrum decoding unit 303 calculates the auditory importance of each band using the decoding scaling coefficients input from scaling coefficient decoding unit 302, and determines the number of bits allocated to the fine spectrum information of each band.
- Fine spectrum decoding unit 303 then decodes the fine spectrum parameters input from separating section 301 to obtain decoded fine spectrum information for each band, and outputs it to spectrum decoding unit 305. When information from the first layer decoded signal is also used to calculate the auditory importance, the output of frequency domain transform unit 304 is also input to fine spectrum decoding unit 303.
- Frequency domain transform section 304 transforms the input first layer decoded signal into a frequency domain spectral parameter (for example, MDCT coefficient) and outputs it to spectrum decoding section 305.
- Spectrum decoding unit 305 decodes the spectrum of the second layer decoded signal from the first layer decoded signal converted into the frequency domain (input from frequency domain converting unit 304), the decoding scaling coefficients (low frequency and high frequency) input from scaling coefficient decoding unit 302, and the decoded fine spectrum information input from fine spectrum decoding unit 303, and outputs it to time domain conversion unit 306.
- Time domain conversion section 306 converts the second layer decoded signal input from spectrum decoding section 305 into a time domain signal and outputs it as a second layer decoded signal.
- An example of the configuration of second layer encoding unit 204 corresponding to second layer decoding unit 103 in FIG. 3 is shown in FIG. 4.
- the input audio signal is input to auditory masking calculation section 401 and frequency domain conversion section 402A.
- Auditory masking calculation unit 401 calculates auditory masking for each subband having a predetermined bandwidth, and outputs it to scaling coefficient encoding unit 403 and fine spectrum encoding unit 404.
- Human hearing has an auditory masking characteristic: while a certain signal is being heard, a sound at a nearby frequency is difficult to hear even if it enters the ear. Using the auditory masking described above, efficient spectrum coding can be realized by allocating few quantization bits to frequency spectra where quantization distortion is difficult to hear and many quantization bits to frequency spectra where it is easy to hear.
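The masking-driven bit allocation described above can be illustrated with a minimal sketch. The greedy rule, the function name `allocate_bits`, and the figure of roughly 6.02 dB of noise reduction per bit are assumptions made for illustration; the text does not specify the allocation rule actually used.

```python
def allocate_bits(energy_db, mask_db, total_bits):
    """Greedy sketch: hand out bits to the band whose quantization
    noise is currently the most audible (hypothetical rule)."""
    # Signal-to-mask ratio: how far each band sits above its masking threshold.
    smr = [e - m for e, m in zip(energy_db, mask_db)]
    bits = [0] * len(smr)
    for _ in range(total_bits):
        # Assumption: each added bit lowers quantization noise by about 6.02 dB.
        audible = [s - 6.02 * b for s, b in zip(smr, bits)]
        worst = max(range(len(smr)), key=lambda i: audible[i])
        if audible[worst] <= 0:
            break  # all remaining noise is already below the masking threshold
        bits[worst] += 1
    return bits


# Band 0 is far above its mask, band 2 is fully masked.
print(allocate_bits([60.0, 40.0, 30.0], [40.0, 35.0, 35.0], 6))
```

In this sketch a band whose energy lies below its masking threshold receives no bits at all, which matches the idea of spending few bits where distortion is hard to hear.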
- Frequency domain conversion section 402A converts the input speech signal into frequency domain spectral parameters (for example, MDCT coefficients), and outputs them to scaling coefficient encoding section 403 and fine spectrum encoding section 404.
- Frequency domain transform section 402B transforms the input first layer decoded signal into frequency domain spectral parameters (for example, MDCT coefficients), and outputs them to scaling coefficient encoding section 403 and fine spectrum encoding section 404.
- Scaling coefficient encoding unit 403 uses the auditory masking information input from auditory masking calculation unit 401 to encode the difference spectrum between the spectral parameters input from frequency domain converter 402A and the first layer decoded spectrum input from frequency domain converter 402B, obtains scaling coefficient parameters, and outputs them to encoding parameter multiplexing unit 405 and fine spectrum encoding unit 404. In this example, the scaling coefficient parameter for the high-frequency spectrum and the scaling coefficient parameter for the low-frequency spectrum are output separately.
- Fine spectrum encoding unit 404 decodes the scaling coefficient parameters (low frequency and high frequency) input from scaling coefficient encoding unit 403 to obtain decoding scaling coefficients (low frequency and high frequency), and uses them to normalize the difference spectrum between the spectral parameters input from frequency domain transform section 402A and the first layer decoded spectrum input from frequency domain transform section 402B.
- Fine spectrum encoding unit 404 encodes the normalized difference spectrum and outputs the encoded difference spectrum (fine spectrum encoding parameters) to encoding parameter multiplexing unit 405.
- Fine spectrum encoding unit 404 also calculates the auditory importance of each band of the fine spectrum using the decoding scaling coefficients (low frequency and high frequency), and allocates bits according to that importance.
- the first layer decoded spectrum may be used to calculate this auditory importance.
- Encoding parameter multiplexing unit 405 multiplexes the high-frequency spectrum scaling coefficient parameter and the low-frequency spectrum scaling coefficient parameter input from scaling coefficient encoding unit 403 with the fine spectrum encoding parameters input from fine spectrum encoding unit 404, and outputs the result as the first spectrum encoding parameter.
- FIGS. 5 to 9 are block diagrams showing configurations of spectrum decoding unit 305.
- FIG. 5 shows a configuration for executing processing when the first layer decoded signal, all decoding scaling coefficients (low frequency and high frequency), and all fine spectrum decoding information are received normally.
- FIG. 6 shows a configuration for executing processing when a part of the high frequency fine spectrum decoding information is not received.
- the difference from FIG. 5 is that the output result of adder A is input to high-frequency spectrum decoding unit 602.
- The spectrum of the band that should have been decoded using the high-frequency fine spectrum decoding information that was not received is generated in a pseudo manner by the method described later.
- FIG. 7 shows a configuration for executing processing when none of the high-frequency fine spectrum decoding information is received (and, in addition, part of the low-frequency fine spectrum decoding information is not received). The difference from FIG. 6 is that fine spectrum decoding information is not input to high-frequency spectrum decoding section 702. The spectrum of the band that should have been decoded using the high-frequency fine spectrum decoding information that was not received is generated in a pseudo manner by the method described later.
- FIG. 8 shows a configuration for executing processing when none of the fine spectrum decoding information is received and part of the low-frequency decoding scaling coefficients is not received.
- The differences from FIG. 7 are that fine spectrum decoding information is not input, there is no output from low-frequency spectrum decoding unit 801, and there is no adder A.
- The spectrum of the band that should have been decoded using the high-frequency fine spectrum decoding information that was not received is generated in a pseudo manner by the method described later.
- FIG. 9 shows a configuration for executing processing when only high-frequency decoding scaling coefficients are received (including cases where some high-frequency decoding scaling coefficients are not received). The difference from FIG. 8 is that there is no low-frequency spectrum decoding unit receiving the low-frequency decoding scaling coefficients. A method of generating the high-frequency spectrum in a pseudo manner from only the received high-frequency decoding scaling coefficients will be described later.
- the spectrum decoding unit 305 in FIG. 5 includes a low-frequency spectrum decoding unit 501, a high-frequency spectrum decoding unit 502, an adder A, and an adder B.
- Low-frequency spectrum decoding unit 501 decodes the low-frequency spectrum using the low-frequency decoding scaling coefficients input from scaling coefficient decoding unit 302 and the fine spectrum decoding information input from fine spectrum decoding unit 303, and outputs it to adder A.
- the decoded spectrum is calculated by multiplying the fine spectrum decoding information by the decoding scaling factor.
- Adder A adds the decoded low-frequency spectrum (residual) input from low-frequency spectrum decoding unit 501 to the first layer decoded signal (spectrum) input from frequency domain transform unit 304 to obtain the decoded low-frequency spectrum, and outputs it to adder B.
- High-frequency spectrum decoding section 502 decodes the high-frequency spectrum using the high-frequency decoding scaling coefficients input from scaling coefficient decoding section 302 and the fine spectrum decoding information input from fine spectrum decoding section 303, and outputs it to adder B.
- Adder B combines the decoded low-frequency spectrum input from adder A with the decoded high-frequency spectrum input from high-frequency spectrum decoding unit 502 to generate the spectrum of the entire band (all low and high frequency bands), and outputs it as the decoded spectrum.
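As an illustration of the FIG. 5 data flow, the following minimal sketch multiplies fine spectrum values by their band scaling coefficient, adds the low-band residual to the first layer spectrum (adder A), and appends the high band (adder B). The function names and the list-of-bands data layout are hypothetical; they are not taken from the patent.

```python
def decode_bands(scales, fine_bands):
    """Decoded spectrum of each band = fine spectrum values x band scaling coefficient."""
    out = []
    for gain, fine in zip(scales, fine_bands):
        out.extend(gain * x for x in fine)
    return out


def decode_spectrum(first_layer_low, low_scales, low_fine, high_scales, high_fine):
    # Low band: decoded residual from the low-frequency spectrum decoder.
    low_residual = decode_bands(low_scales, low_fine)
    if len(low_residual) != len(first_layer_low):
        raise ValueError("low-band residual and first layer spectrum differ in length")
    # Adder A: residual + first layer decoded spectrum.
    low = [a + r for a, r in zip(first_layer_low, low_residual)]
    # High band: decoded directly from scaling coefficients and fine spectrum.
    high = decode_bands(high_scales, high_fine)
    # Adder B: join low and high bands into the full-band decoded spectrum.
    return low + high


full = decode_spectrum(
    [1.0, 2.0, 3.0, 4.0],            # first layer spectrum (low band)
    [2.0, 0.5], [[0.5, -0.5], [2.0, 4.0]],
    [0.5], [[1.0, 0.0]],
)
print(full)
```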
- FIG. 6 differs from FIG. 5 only in the operation of the high frequency spectrum decoding unit 602.
- High-frequency spectrum decoding unit 602 decodes the high-frequency spectrum using the high-frequency decoding scaling coefficients input from scaling coefficient decoding unit 302 and the high-frequency fine spectrum decoding information input from fine spectrum decoding unit 303.
- At this time, the high-frequency fine spectrum decoding information of some bands has not been received, so the high-frequency spectrum of those bands cannot be decoded accurately. High-frequency spectrum decoding section 602 therefore generates the high-frequency spectrum of those bands in a pseudo manner, using the decoding scaling coefficients, the low-frequency decoded spectrum input from adder A, and the high-frequency spectrum of the bands that were received and decoded accurately.
- FIG. 7 shows the operation when, relative to FIGS. 5 and 6, none of the high-frequency fine spectrum decoding information is received. In this case, high-frequency spectrum decoding unit 702 decodes the high-frequency spectrum using only the high-frequency decoding scaling coefficients input from scaling coefficient decoding unit 302.
- Low-frequency spectrum decoding unit 701 decodes the low-frequency spectrum using the low-frequency decoding scaling coefficients input from scaling coefficient decoding unit 302 and the low-frequency fine spectrum decoding information input from fine spectrum decoding unit 303.
- Because the low-frequency fine spectrum decoding information of some bands has not been received, those bands are not decoded and are set to a zero spectrum.
- the spectrum of the corresponding band output via the adders A and B is the first layer decoded signal (spectrum) itself.
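A minimal sketch of this fallback, assuming a missing band is represented by `None` (a convention chosen here for illustration, not specified in the text): bands without fine spectrum information contribute a zero residual, so the output for those bands equals the first layer spectrum.

```python
def add_low_band_residual(first_layer, residual_bands, widths):
    """Adder A with missing bands: a band given as None contributes a zero
    residual, so the first layer spectrum passes through unchanged."""
    out = list(first_layer)
    pos = 0
    for band, width in zip(residual_bands, widths):
        if band is not None:
            for i, r in enumerate(band):
                out[pos + i] += r
        pos += width
    return out


# Second band was not received: its output is the first layer spectrum itself.
print(add_low_band_residual([1.0, 2.0, 3.0, 4.0], [[0.5, -0.5], None], [2, 2]))
```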
- FIG. 8 shows the operation when, relative to FIG. 7, none of the low-frequency fine spectrum decoding information is received.
- Low-frequency spectrum decoding unit 801 receives low-frequency decoding scaling coefficients but no fine spectrum decoding information, and therefore does not perform decoding.
- FIG. 9 shows the operation when, relative to FIG. 8, no low-frequency decoding scaling coefficients are input. In high-frequency spectrum decoding unit 902, when some (high-frequency) decoding scaling coefficients are not input, the spectrum of the corresponding band is output as zero.
- FIG. 10 shows the configuration of the high-frequency spectrum decoding unit 902 in more detail.
- the high-frequency spectrum decoding unit 902 in FIG. 10 includes an amplitude adjustment unit 1011, a pseudo spectrum generation unit 1012, and a scaling unit 1013.
- Amplitude adjustment section 1011 adjusts the amplitude of the first layer decoded signal spectrum input from frequency domain transform section 304, and outputs the result to pseudo spectrum generation section 1012.
- Pseudo spectrum generation unit 1012 generates a high-frequency spectrum in a pseudo manner using the amplitude-adjusted first layer decoded signal spectrum input from amplitude adjustment unit 1011, and outputs it to scaling unit 1013.
- Scaling unit 1013 scales the spectrum input from pseudo spectrum generation unit 1012 and outputs it to adder B.
- FIG. 11 is a schematic diagram showing an example of the above-described series of processes for generating a high-frequency spectrum in a pseudo manner.
- the amplitude of the decoded signal spectrum of the first layer is adjusted.
- The amplitude adjustment method can be a constant multiple in the log domain ($\gamma \times S$, where $\gamma$ is an amplitude adjustment coefficient (real number) in the range $0 \le \gamma \le 1$ and $S$ is the log spectrum), or a constant power in the linear domain ($s^{\gamma}$, where $s$ is the linear spectrum).
- the adjustment factor may be a fixed constant.
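The two forms of amplitude adjustment are equivalent, since multiplying a log spectrum by a constant corresponds to raising the linear magnitude to that power. A minimal sketch follows; the symbol `gamma` for the adjustment coefficient and the sign-preserving handling of negative coefficients (as MDCT coefficients can be negative) are assumptions made for illustration.

```python
import math


def adjust_log(log_spectrum, gamma):
    """Constant multiple in the log domain: gamma * S."""
    return [gamma * S for S in log_spectrum]


def adjust_linear(spectrum, gamma):
    """Constant power in the linear domain: |s| ** gamma.
    Assumption: the sign of each coefficient is kept unchanged."""
    return [math.copysign(abs(s) ** gamma, s) for s in spectrum]


# A peak of 100 next to a valley of 1 becomes 10 next to 1:
# the peak-to-valley ratio shrinks from 100 to 10.
print(adjust_linear([100.0, -1.0, 4.0], 0.5))
```

With a coefficient below 1 the peaks are compressed more than the valleys, which flattens the harmonic structure before the spectrum is reused for the high band.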
- However, it is more preferable to prepare a plurality of appropriate adjustment coefficients according to an index representing the depth of the valleys of the harmonic spectrum in the low-frequency range (for example, directly, the variance of the spectral amplitude in the low-frequency range, or indirectly, the pitch gain value in first layer encoding unit 201), and to selectively use the corresponding adjustment coefficient according to that index. It is also possible to selectively use the adjustment coefficient according to the characteristics of each vowel, using low-frequency spectrum shape (envelope) information and pitch period information. Further, the optimum adjustment coefficient may be separately encoded on the encoder side and transmitted as transmission information.
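Selecting a coefficient from an index can be sketched as a table lookup. Everything concrete here is hypothetical: the use of log-amplitude variance as the index, the thresholds, the coefficient values in `GAMMA_TABLE`, and the assumption that deeper valleys call for a smaller coefficient.

```python
import math


def log_amplitude_variance(spectrum, floor=1e-9):
    """Variance of the log spectral amplitude, used as a valley-depth index."""
    logs = [math.log(max(abs(s), floor)) for s in spectrum]
    mean = sum(logs) / len(logs)
    return sum((x - mean) ** 2 for x in logs) / len(logs)


# Hypothetical table of (upper bound on the index, adjustment coefficient).
GAMMA_TABLE = [(0.5, 1.0), (2.0, 0.8), (float("inf"), 0.6)]


def select_gamma(low_spectrum, table=GAMMA_TABLE):
    index = log_amplitude_variance(low_spectrum)
    for upper, gamma in table:
        if index < upper:
            return gamma
    return table[-1][1]


print(select_gamma([1.0, 1.0, 1.0, 1.0]))        # flat spectrum: no flattening
print(select_gamma([100.0, 0.01, 100.0, 0.01]))  # deep valleys: strongest flattening
```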
- FIG. 11 shows an example of mirroring that generates the high-frequency spectrum as a mirror image of the low-frequency spectrum.
- Other possible methods are to generate the high-frequency spectrum by shifting the amplitude-adjusted spectrum toward higher frequencies along the frequency axis, or by pitch filtering the amplitude-adjusted spectrum along the frequency axis using a pitch lag obtained from the low-frequency spectrum.
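The processing chain of FIG. 11 (amplitude adjustment followed by mirroring) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the implementation described in the patent: the function names, the use of plain Python lists for spectra, and the choice to keep the sign of each coefficient while raising its magnitude to the power gamma are all hypothetical.

```python
def adjust_amplitude(spectrum, gamma):
    """Constant-power adjustment in the linear domain.

    Assumption: the sign of each coefficient is preserved and only the
    magnitude is raised to the power gamma (gamma between 0 and 1).
    """
    return [(1.0 if s >= 0 else -1.0) * (abs(s) ** gamma) for s in spectrum]


def mirror_high_band(low_band, n_high):
    """Pseudo high band as a mirror image of the low band about its upper edge."""
    if n_high > len(low_band):
        raise ValueError("mirroring cannot produce more bins than the low band has")
    last = len(low_band) - 1
    return [low_band[last - k] for k in range(n_high)]


def generate_pseudo_high_band(low_band, gamma, n_high, scale):
    """Amplitude adjustment, mirroring, then a single overall scaling factor."""
    adjusted = adjust_amplitude(low_band, gamma)
    return [scale * s for s in mirror_high_band(adjusted, n_high)]
```

A value of gamma below 1 flattens the spectrum, which makes the harmonic valleys of the generated high band shallower than those of the low band.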
- FIG. 12 shows a case where first layer spectrum information (for example, decoded LSP parameters) is input to amplitude adjustment unit 1211 from first layer decoding unit 102.
- In this case, amplitude adjustment unit 1211 determines the adjustment coefficient used for amplitude adjustment based on the input first layer spectrum information.
- First layer pitch information (pitch period and pitch gain) may also be used to determine the adjustment coefficient.
- FIG. 13 shows a case where an amplitude adjustment coefficient is separately input to the amplitude adjustment unit 1311.
- the amplitude adjustment coefficient is quantized and encoded on the encoder side and transmitted.
- FIG. 14 is a block diagram showing the configuration of second layer decoding section 103 according to Embodiment 2 of the present invention.
- Second layer decoding section 103 in FIG. 14 includes separating section 1401, spectrum decoding section 1402A, extended band decoding section 1403, spectrum decoding section 1402B, frequency domain transform section 1404, and time domain conversion section 1405.
- Separating section 1401 separates the second layer coding parameter into a first spectrum coding parameter, an extended band coding parameter, and a second spectrum coding parameter, and outputs them to spectrum decoding section 1402A, extended band decoding section 1403, and spectrum decoding section 1402B, respectively.
- Frequency domain transform section 1404 transforms the first layer decoded signal input from first layer decoding section 102 into frequency domain parameters (for example, MDCT coefficients), and outputs the resulting first layer decoded signal spectrum to spectrum decoding section 1402A.
- Spectrum decoding unit 1402A decodes the first spectrum coding parameter input from separation unit 1401 to obtain the quantized spectrum of the first layer coding error, adds it to the first layer decoded signal spectrum input from frequency domain transform unit 1404, and outputs the sum to extended band decoding unit 1403 as the first decoded spectrum.
- Spectrum decoding unit 1402A mainly reduces the first layer coding error in the low-frequency components.
- Extended band decoding unit 1403 decodes various parameters from the extended band coding parameter input from separation unit 1401 and, based on the first decoded spectrum input from spectrum decoding unit 1402A, decodes and generates the high-frequency spectrum using the decoded parameters. Extended band decoding unit 1403 then outputs the full-band spectrum to spectrum decoding unit 1402B as the second decoded spectrum.
- Spectrum decoding unit 1402B decodes the second spectrum coding parameter input from separation unit 1401 to obtain the quantized spectrum of the coding error of the second decoded spectrum, adds it to the second decoded spectrum input from extended band decoding unit 1403, and outputs the sum to time domain conversion unit 1405 as the third decoded spectrum.
- Time domain conversion unit 1405 converts the third decoded spectrum input from spectrum decoding unit 1402B into a time domain signal and outputs it as the second layer decoded signal.
- In FIG. 14, a configuration in which one or both of spectrum decoding section 1402A and spectrum decoding section 1402B are not provided may also be employed.
- When spectrum decoding unit 1402A is not provided, the first layer decoded signal spectrum output from frequency domain transform unit 1404 is input to extended band decoding unit 1403.
- When spectrum decoding unit 1402B is not provided, the second decoded spectrum output from extended band decoding unit 1403 is input to time domain conversion unit 1405.
- FIG. 15 shows an example of the configuration of second layer coding unit 204 corresponding to second layer decoding unit 103 in FIG.
- the audio signal (original signal) is input to auditory masking calculation section 1501 and frequency domain conversion section 1502A.
- Auditory masking calculation section 1501 calculates auditory masking using the input audio signal, and outputs the result to first spectrum coding section 1503, extended band coding section 1504, and second spectrum coding section 1505.
- Frequency domain transform section 1502A transforms the input audio signal into frequency domain spectrum parameters (for example, MDCT coefficients), and outputs them to first spectrum coding section 1503, extended band coding section 1504, and second spectrum coding section 1505.
- Frequency domain transform section 1502B converts the input first layer decoded signal into a spectrum parameter such as MDCT, and outputs the spectrum parameter to first spectrum coding section 1503.
- First spectrum coding unit 1503 uses the auditory masking input from auditory masking calculation unit 1501 to encode the error spectrum between the input speech signal spectrum input from frequency domain transform unit 1502A and the first layer decoded spectrum input from frequency domain transform unit 1502B, and outputs the result as the first spectrum coding parameter. It also outputs the first decoded spectrum, obtained by decoding the first spectrum coding parameter, to extended band coding unit 1504.
- Extended band coding unit 1504 uses the auditory masking input from auditory masking calculation unit 1501 to encode the error spectrum between the input speech signal spectrum input from frequency domain transform unit 1502A and the first decoded spectrum input from first spectrum coding unit 1503, and outputs the result as the extended band coding parameter. It also outputs the second decoded spectrum, obtained by decoding the extended band coding parameter, to second spectrum coding unit 1505.
- Second spectrum coding unit 1505 uses the auditory masking input from auditory masking calculation unit 1501 to encode the error spectrum between the input speech signal spectrum input from frequency domain transform unit 1502A and the second decoded spectrum input from extended band coding unit 1504, and outputs the result as the second spectrum coding parameter.
- Separation unit 1601 separates the input coding parameter into a coding parameter representing the scaling coefficients (scaling coefficient parameter) and a coding parameter representing the spectral fine structure (fine spectrum parameter), and outputs them to scaling coefficient decoding unit 1602 and fine spectrum decoding unit 1603, respectively.
- Scaling coefficient decoding unit 1602 decodes the input scaling coefficient parameter to obtain a low-band scaling coefficient and a high-band scaling coefficient, and outputs these decoded scaling coefficients to both spectrum decoding unit 1604 and fine spectrum decoding unit 1603.
- Fine spectrum decoding unit 1603 calculates the auditory importance of each band using the decoded scaling coefficients input from scaling coefficient decoding unit 1602, and obtains the number of bits allocated to the fine spectrum information of each band. It then decodes the fine spectrum parameter input from separation unit 1601 to obtain the decoded fine spectrum information of each band, and outputs it to spectrum decoding unit 1604. The information of decoded spectrum A may also be used to calculate the auditory importance; in that case, decoded spectrum A is also input to fine spectrum decoding unit 1603.
- Spectrum decoding unit 1604 decodes decoded spectrum B from the input decoded spectrum A, the decoded scaling coefficients (low band and high band) input from scaling coefficient decoding unit 1602, and the decoded fine spectrum information input from fine spectrum decoding unit 1603, and outputs decoded spectrum B.
- The correspondence between FIG. 16 and FIG. 14 is as follows.
- When FIG. 16 represents spectrum decoding unit 1402A, the coding parameter in FIG. 16 corresponds to the first spectrum coding parameter in FIG. 14, decoded spectrum A to the first layer decoded signal spectrum, and decoded spectrum B to the first decoded spectrum.
- When FIG. 16 represents spectrum decoding unit 1402B, the coding parameter in FIG. 16 corresponds to the second spectrum coding parameter in FIG. 14, decoded spectrum A to the second decoded spectrum, and decoded spectrum B to the third decoded spectrum.
- FIG. 18 shows an example of the configuration of first spectrum encoding section 1503 corresponding to spectrum decoding sections 1402A and 1402B in FIG.
- First spectrum coding unit 1503 shown in FIG. 18 includes scaling coefficient coding unit 403, fine spectrum coding unit 404, coding parameter multiplexing unit 405, and spectrum decoding unit 1604. Their operation is the same as that described with reference to FIGS. 4 and 16, so the description is omitted here. If the first layer decoded spectrum in FIG. 18 is replaced with the second decoded spectrum and the first spectrum coding parameter is replaced with the second spectrum coding parameter, the resulting configuration is that of second spectrum coding unit 1505. However, the configuration of second spectrum coding unit 1505 does not include spectrum decoding unit 1604.
- FIG. 17 shows the configuration of spectrum decoding units 1402A and 1402B in the case where no scaling coefficients are used.
- In this case, spectrum decoding units 1402A and 1402B include auditory importance and bit allocation calculation unit 1701, fine spectrum decoding unit 1702, and spectrum decoding unit 1703.
- Auditory importance and bit allocation calculation unit 1701 obtains the auditory importance of each band from the input decoded spectrum A, and determines the bit allocation to each band according to that auditory importance.
- The obtained auditory importance and bit allocation information are output to fine spectrum decoding unit 1702.
- Fine spectrum decoding unit 1702 decodes the input coding parameter based on the auditory importance and bit allocation information input from auditory importance and bit allocation calculation unit 1701 to obtain the decoded fine spectrum information of each band, and outputs it to spectrum decoding unit 1703.
- Spectrum decoding unit 1703 adds the decoded fine spectrum information input from fine spectrum decoding unit 1702 to the input decoded spectrum A, and outputs the sum as decoded spectrum B.
- The correspondence between FIG. 17 and FIG. 14 is as follows.
- When FIG. 17 represents spectrum decoding unit 1402A, the coding parameter in FIG. 17 corresponds to the first spectrum coding parameter in FIG. 14, decoded spectrum A to the first layer decoded signal spectrum, and decoded spectrum B to the first decoded spectrum.
- When FIG. 17 represents spectrum decoding unit 1402B, the coding parameter in FIG. 17 corresponds to the second spectrum coding parameter in FIG. 14, decoded spectrum A to the second decoded spectrum, and decoded spectrum B to the third decoded spectrum.
- the first spectrum encoding unit corresponding to spectrum decoding units 1402A and 1402B in FIG. 17 can be configured.
- FIG. 19 is a block diagram showing the configuration of the extended band decoding unit 1403.
- Extended band decoding unit 1403 shown in FIG. 19 includes separating unit 1901, amplitude adjusting unit 1902, filter state setting unit 1903, filtering unit 1904, spectral residual shape codebook 1905, spectral residual gain codebook 1906, multiplier 1907, scale factor decoding unit 1908, scaling unit 1909, and spectrum synthesis unit 1910.
- Separation section 1901 separates the coding parameters input from separation section 1401 in FIG. 14 into amplitude adjustment coefficient coding parameters, lag coding parameters, residual shape coding parameters, residual gain coding parameters, and scale factor coding parameters, and outputs them to amplitude adjustment unit 1902, filtering unit 1904, spectral residual shape codebook 1905, spectral residual gain codebook 1906, and scale factor decoding unit 1908, respectively.
- Amplitude adjustment section 1902 decodes the amplitude adjustment coefficient coding parameter input from separation section 1901, uses the decoded amplitude adjustment coefficient to adjust the amplitude of the first decoded spectrum input from spectrum decoding section 1402A in FIG. 14, and outputs the amplitude-adjusted first decoded spectrum to filter state setting unit 1903.
- The amplitude adjustment is performed by a method expressed as S(n)^γ, where S(n) is the spectral amplitude in the linear domain, n is the frequency, and γ is the decoded amplitude adjustment coefficient.
- Spectrum residual shape codebook 1905 decodes the residual shape coding parameters input from separation section 1901 and outputs a spectral residual shape vector corresponding to the decoding result to multiplier 1907.
- the spectral residual gain codebook 1906 decodes the residual gain encoding parameter input from the separation unit 1901 and outputs the residual gain corresponding to the decoding result to the multiplier 1907.
- Multiplier 1907 multiplies the residual shape vector C[n] input from spectral residual shape codebook 1905 by the residual gain g input from spectral residual gain codebook 1906, and outputs the product gC[n] to filtering unit 1904.
- Scale factor decoding section 1908 decodes the scale factor encoding parameter input from separation section 1901, and outputs the decoded scale factor to scaling section 1909.
- the scaling unit 1909 multiplies the spectrum S [Nn to Nw] input from the filtering unit 1904 by the scale factor input from the scale factor decoding unit 1908, and outputs the result to the spectrum combining unit 1910.
- Spectrum synthesis unit 1910 assigns the first decoded spectrum input from spectrum decoding unit 1402A of FIG. 14 to the low band (S[0 to Nn]) and the spectrum input from scaling unit 1909 to the high band (S[Nn to Nw]), and outputs the resulting spectrum to spectrum decoding unit 1402B in FIG. 14 as the second decoded spectrum.
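The combination of low band and high band performed by spectrum synthesis unit 1910 can be illustrated with the short sketch below. It assumes half-open index ranges (bins 0 to Nn-1 for the low band and Nn to Nw-1 for the high band) and plain Python lists; the text only writes the ranges as S[0 to Nn] and S[Nn to Nw], so the exact boundary handling and the function name are assumptions.

```python
def synthesize_second_decoded_spectrum(first_decoded, scaled_high, nn, nw):
    """Place the first decoded spectrum in the low band and the scaled
    spectrum from the scaling unit in the high band.

    Assumption: the low band occupies bins [0, nn) and the high band
    occupies bins [nn, nw).
    """
    if len(first_decoded) < nn:
        raise ValueError("first decoded spectrum is shorter than the low band")
    if len(scaled_high) != nw - nn:
        raise ValueError("high band length must equal nw - nn")
    return list(first_decoded[:nn]) + list(scaled_high)
```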
- FIG. 20 shows the configuration of extended band decoding section 1403 when the spectral residual shape coding parameter and the spectral residual gain coding parameter cannot be completely received.
- In this case, the information that can be completely received is the amplitude adjustment coefficient coding parameter, the lag coding parameter, and the scale factor coding parameter.
- Separation unit 2001 separates the coding parameters input from separation unit 1401 of FIG. 14 into amplitude adjustment coefficient coding parameters, lag coding parameters, and scale factor coding parameters, and outputs them to amplitude adjustment unit 1902, filtering unit 2002, and scale factor decoding unit 1908, respectively.
- FIG. 21 shows the configuration of extended band decoding section 1403 when lag encoding parameters cannot be received.
- In this case, the information that can be completely received is the amplitude adjustment coefficient coding parameter and the scale factor coding parameter.
- In FIG. 21, filter state setting unit 1903 and filtering unit 2002 are replaced by pseudo spectrum generation unit 2102.
- the configuration other than the separation unit 2101 and the pseudo spectrum generation unit 2102 is the same as each unit in FIG.
- Separation section 2101 separates the coding parameter input from separation section 1401 in FIG. 14 into an amplitude adjustment coefficient coding parameter and a scale factor coding parameter, and outputs them to amplitude adjustment section 1902 and scale factor decoding section 1908, respectively.
- the pseudo spectrum generation unit 2102 generates a high frequency spectrum in a pseudo manner using the first decoded spectrum after amplitude adjustment input from the amplitude adjustment unit 1902, and outputs it to the scaling unit 1909.
- Specific methods for generating the high-frequency spectrum include mirroring, which generates the high-frequency spectrum as a mirror image of the low-frequency spectrum; shifting the amplitude-adjusted spectrum toward higher frequencies; and obtaining a pitch lag from the low-frequency spectrum and using it to pitch filter the amplitude-adjusted spectrum along the frequency axis.
- a pseudo spectrum may be generated using a randomly generated noise spectrum.
- FIG. 22 shows the configuration of extended band decoding section 1403 when amplitude adjustment information cannot be received.
- the information that can be completely received is the scale factor encoding parameter.
- Separation section 2201 separates the scale factor coding parameter from the coding parameter input from separation section 1401 in FIG. 14, and outputs it to scale factor decoding section 1908.
- Pseudospectrum generation section 2202 generates a high-frequency spectrum in a pseudo manner using the first decoded spectrum, and outputs it to scaling section 1909.
- Specific methods of generating the high-frequency spectrum include mirroring, which generates the high-frequency spectrum as a mirror image of the low-frequency spectrum; shifting the amplitude-adjusted spectrum toward higher frequencies; and obtaining a pitch lag from the low-frequency spectrum and using it to pitch filter the amplitude-adjusted spectrum along the frequency axis.
- a pseudo spectrum may be generated using a randomly generated noise spectrum.
- The amplitude adjustment method can be a constant multiple in the log domain (γ × S, where S is the log spectrum) or a constant power in the linear domain (s^γ, where s is the linear spectrum).
- As the adjustment coefficient, it is good to use a typical coefficient required to match the depth of the harmonic valleys in the low band to the depth of the harmonic valleys in the high band for voiced sound.
- The adjustment coefficient may be a fixed constant, but it is more preferable to prepare several appropriate adjustment coefficients according to an index indicating the depth of the harmonic valleys in the low-frequency spectrum (for example, directly, the variance of the spectral amplitude in the low band, or, indirectly, the pitch gain value in first layer coding unit 201), and to select the adjustment coefficient that corresponds to that index. The adjustment coefficient can also be selected according to the characteristics of each vowel, using low-band spectral shape (envelope) information, pitch period information, and the like. The details are the same as for the pseudo spectrum generation described in Embodiment 1, so the description is omitted here.
- FIG. 23 is a schematic diagram showing a series of operations for generating a high frequency component in the configuration of FIG. As shown in FIG. 23, first, the amplitude of the first decoded spectrum is adjusted. Next, using the amplitude-adjusted first decoded spectrum as the filter state of the pitch filter, a filtering process (pitch filtering) is performed in the frequency axis direction to generate a high frequency component. Next, the generated high frequency component is scaled for each scaling coefficient band to generate the final high-frequency spectrum. Then, the second decoded spectrum is generated by combining the generated high-frequency spectrum with the first decoded spectrum.
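The pitch filtering step of FIG. 23 can be illustrated with a minimal sketch. It assumes a single-tap filter that copies the bin located `lag` positions lower on the frequency axis and optionally adds a gain-scaled residual value; the exact filter form is not given in this part of the text, so the recursion, the function name, and the parameter names are assumptions made for illustration only.

```python
def pitch_filter_high_band(low_band, lag, n_high, residual=None):
    """Generate n_high high-band bins by filtering along the frequency axis.

    Assumed recursion (single tap): S[k] = S[k - lag] + residual[k - Nn],
    where the amplitude-adjusted low band is the initial filter state and
    residual, if given, already includes the gain (g * C[n]).
    """
    nn = len(low_band)
    if not 0 < lag <= nn:
        raise ValueError("lag must be between 1 and the low band length")
    state = list(low_band)  # filter state, extended as bins are generated
    for i in range(n_high):
        value = state[nn + i - lag]
        if residual is not None:
            value += residual[i]
        state.append(value)
    return state[nn:]
```

Because newly generated bins are appended to the filter state, a lag shorter than the high band simply repeats the last `lag` bins of the low band periodically.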
- FIG. 24 shows an example of the configuration of the extended band coding unit 1504 corresponding to the extended band decoding unit 1403 in FIG.
- Amplitude adjustment section 2401 adjusts the amplitude of the first decoded spectrum input from first spectrum coding section 1503 using the input speech signal spectrum input from frequency domain conversion section 1502A, outputs the coding parameter of the amplitude adjustment coefficient, and outputs the amplitude-adjusted first decoded spectrum to filter state setting unit 2402.
- Amplitude adjustment unit 2401 adjusts the amplitude so that the ratio between the maximum and minimum amplitude of the first decoded spectrum (its dynamic range) approaches the dynamic range of the high band of the input speech signal spectrum. The amplitude adjustment methods described above can be used. For example, the amplitude can be adjusted using a conversion equation such as equation (1), where S1 is the spectrum before conversion and S1' is the spectrum after conversion.
- Amplitude adjustment unit 2401 selects, from a plurality of candidates prepared in advance, the amplitude adjustment coefficient γ for which the dynamic range of the amplitude-adjusted first decoded spectrum is closest to the dynamic range of the high band of the input speech signal spectrum, and outputs the coding parameter of the selected amplitude adjustment coefficient γ to multiplexing unit 203.
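The candidate selection performed by amplitude adjustment unit 2401 can be sketched as below. This is an illustration under assumptions: the dynamic range is taken as the ratio of the largest to the smallest magnitude, closeness is measured in the log domain, a small floor avoids division by zero, and the returned index stands in for the coding parameter. None of these details, nor the function names, are specified in the text.

```python
import math


def dynamic_range(spectrum, floor=1e-12):
    """Ratio of maximum to minimum magnitude (floor avoids division by zero)."""
    mags = [max(abs(s), floor) for s in spectrum]
    return max(mags) / min(mags)


def select_amplitude_adjustment(first_decoded, original_high, candidates):
    """Return (index, gamma) of the candidate whose adjusted dynamic range
    is closest to that of the original high band.

    Assumption: closeness is the absolute difference of log dynamic ranges.
    """
    target = math.log(dynamic_range(original_high))

    def error(gamma):
        adjusted = [abs(s) ** gamma for s in first_decoded]
        return abs(math.log(dynamic_range(adjusted)) - target)

    index = min(range(len(candidates)), key=lambda i: error(candidates[i]))
    return index, candidates[index]
```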
- Filter state setting section 2402 sets the amplitude-adjusted first decoded spectrum input from amplitude adjustment section 2401 as the internal state of the pitch filter, in the same manner as filter state setting section 1903 in FIG.
- Lag setting unit 2403 sequentially outputs lag T to filtering unit 2404 while gradually changing T within a predetermined search range.
- Spectral residual shape codebook 2405 stores a plurality of spectral residual shape vector candidates and, in accordance with an instruction from search unit 2406, sequentially selects and outputs spectral residual shape vectors from all of the candidates or from predetermined candidates.
- Spectral residual gain codebook 2407 stores a plurality of spectral residual gain candidates and, in accordance with an instruction from search unit 2406, sequentially selects and outputs spectral residual gains from all of the candidates or from predetermined candidates.
- Multiplier 2408 multiplies the spectral residual shape vector candidate output from spectral residual shape codebook 2405 by the spectral residual gain candidate output from spectral residual gain codebook 2407. The result is output to the filtering unit 2404.
- Filtering unit 2404 performs filtering using the internal state of the pitch filter set by filter state setting unit 2402, the lag T output from lag setting unit 2403, and the gain-adjusted spectral residual shape vector, and calculates an estimate of the input speech signal spectrum. This operation is the same as the operation of filtering unit 1904 in FIG.
- Search unit 2406 determines, by analysis by synthesis (AbS), the combination of lag, spectral residual shape vector, and spectral residual gain that maximizes the cross-correlation between the high band of the input speech signal spectrum (original spectrum) and the output signal of filtering unit 2404. Auditory masking is used to determine the perceptually most similar combination.
- The search also takes into account the scaling by the scale factor that is performed later.
- The lag coding parameter, spectral residual shape vector coding parameter, and spectral residual gain coding parameter determined by search section 2406 are output to multiplexing section 203 and extended band decoding section 2409.
- The pitch coefficient T, the spectral residual shape vector, and the spectral residual gain may be determined simultaneously.
- Alternatively, they may be determined one after another to reduce the amount of computation.
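The sequential variant of the search, in which the lag is determined first, can be sketched as a closed-loop search that synthesizes a high-band estimate for each lag and keeps the one with the highest normalized cross-correlation. The sketch makes several simplifying assumptions: a single-tap pitch filter, no residual codebooks, and no auditory masking weighting. Function and parameter names are hypothetical.

```python
import math


def normalized_correlation(a, b):
    """Normalized cross-correlation of two equal-length sequences."""
    num = sum(x * y for x, y in zip(a, b))
    den = math.sqrt(sum(x * x for x in a) * sum(y * y for y in b))
    return num / den if den > 0 else 0.0


def search_lag(low_band, original_high, lag_min, lag_max):
    """Analysis-by-synthesis search over the lag only.

    Each candidate lag is used to synthesize a high-band estimate from the
    low band (assumed recursion S[k] = S[k - lag]); the lag whose estimate
    correlates best with the original high band is returned with its score.
    """
    nn = len(low_band)
    best_lag, best_score = None, -math.inf
    for lag in range(lag_min, lag_max + 1):
        state = list(low_band)
        for i in range(len(original_high)):
            state.append(state[nn + i - lag])
        score = normalized_correlation(original_high, state[nn:])
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag, best_score
```

In the same spirit, the residual shape vector and gain could then be searched with the lag held fixed, which trades some accuracy for a much smaller number of synthesized candidates.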
- Extended band decoding section 2409 decodes the first decoded spectrum using the amplitude adjustment coefficient coding parameter output from amplitude adjustment section 2401 and the lag coding parameter, spectral residual shape vector coding parameter, and spectral residual gain coding parameter output from search section 2406, generates an estimated spectrum of the input speech signal spectrum (that is, the spectrum before scaling), and outputs it to scale factor coding unit 2410.
- the decoding procedure is the same as that of the extended band decoding unit 1403 in FIG. 19 (except for the processing of the scaling unit 1909 and the spectrum synthesis unit 1910 in FIG. 19).
- Scale factor coding unit 2410 uses the high band of the input speech signal spectrum (original spectrum) output from frequency domain transform unit 1502A, the estimated spectrum output from extended band decoding unit 2409, and the auditory masking to encode the perceptually most suitable scale factor (scaling coefficient) of the estimated spectrum, and outputs the coding parameter to multiplexing unit 203.
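One simple way to picture a per-band scale factor is as the gain that matches the energy of the estimated spectrum to the energy of the original high band in each band. The sketch below uses that energy-matching rule as an assumption; it ignores auditory masking and quantization, and the band edge representation and function names are hypothetical rather than taken from the text.

```python
import math


def band_scale_factors(original_high, estimated_high, band_edges):
    """Energy-matching scale factor for each band [edge[i], edge[i+1]).

    Assumption: the factor is sqrt(E_original / E_estimated); a band whose
    estimate has zero energy gets a factor of 0.0.
    """
    factors = []
    for start, stop in zip(band_edges[:-1], band_edges[1:]):
        e_orig = sum(x * x for x in original_high[start:stop])
        e_est = sum(x * x for x in estimated_high[start:stop])
        factors.append(math.sqrt(e_orig / e_est) if e_est > 0 else 0.0)
    return factors


def apply_scale_factors(estimated_high, band_edges, factors):
    """Multiply each band of the estimated spectrum by its scale factor."""
    scaled = list(estimated_high)
    for (start, stop), f in zip(zip(band_edges[:-1], band_edges[1:]), factors):
        for k in range(start, stop):
            scaled[k] = f * estimated_high[k]
    return scaled
```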
- FIG. 25 is a schematic diagram showing the contents of the bitstream received by the separation unit 101 in FIG.
- A plurality of coding parameters are time-multiplexed in the bitstream.
- the left side of FIG. 25 shows MSB (Most Significant Bit, the bit having the highest importance in the bitstream), and the right side shows LSB (Least Significant Bit, the bit having the lowest importance in the bitstream).
- When the bits from the LSB up to (1) are discarded, decoding can be performed with the configuration of FIG. 20.
- When the bits from the LSB up to (2) are discarded, decoding can be performed with the configuration of FIG. 21.
- When the bits from the LSB up to (3) are discarded, decoding can be performed with the configuration of FIG. 22. If the bits from the LSB up to (4) are discarded, the first layer decoded signal is used as the output signal.
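The relationship between the truncation point of the bitstream and the decoder configuration can be summarized as a lookup table. The sketch below is only an illustration: the integer `discard_level` (0 meaning nothing is discarded, 1 to 4 meaning the bits from the LSB up to positions (1) to (4) are discarded), the parameter names, and the mapping of level 0 to the full configuration of FIG. 19 are assumptions introduced here.

```python
# Hypothetical table: discard level -> (configuration, usable parameters).
CONFIGURATIONS = {
    0: ("FIG. 19", ("amplitude_adjustment", "lag", "residual_shape",
                    "residual_gain", "scale_factor")),
    1: ("FIG. 20", ("amplitude_adjustment", "lag", "scale_factor")),
    2: ("FIG. 21", ("amplitude_adjustment", "scale_factor")),
    3: ("FIG. 22", ("scale_factor",)),
    4: ("first layer only", ()),
}


def select_configuration(discard_level):
    """Return the decoder configuration for a given truncation level."""
    if discard_level not in CONFIGURATIONS:
        raise ValueError("discard_level must be an integer from 0 to 4")
    return CONFIGURATIONS[discard_level]
```

Ordering the parameters from most to least important in the bitstream is what allows the decoder to degrade gracefully as more of the tail is dropped.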
- FIG. 19 shows a configuration including spectral residual shape codebook 1905, spectral residual gain codebook 1906, and multiplier 1907, but a configuration that does not include these can also be adopted.
- In this case, the encoder side does not need to transmit the coding parameters of the residual shape vector and the residual gain, so communication can be performed at a low bit rate.
- The decoding procedure in this case differs from the description given with reference to FIG. 19 only in that there is no decoding of the spectral residual information (shape and gain). That is, the decoding procedure is the same as that described with reference to FIG. 20, and in the bitstream of FIG. 25 the position of (1) becomes the LSB.
- the present embodiment shows another configuration of extended band decoding section 1403 of second layer decoding section 103 shown in FIG. 14 in the second embodiment.
- In this embodiment, the decoding parameters of the frame are determined using the decoding parameters decoded from the extended band coding parameters of the frame and of the previous frame, together with the data loss information for the received bitstream of the frame, and the second decoded spectrum is then decoded.
- FIG. 26 is a block diagram showing the configuration of extended band decoding section 1403 according to Embodiment 3 of the present invention.
- Amplitude adjustment coefficient decoding unit 2601 decodes the amplitude adjustment coefficient from the amplitude adjustment coefficient coding parameter.
- Lag decoding unit 2602 decodes the lag from the lag coding parameter.
- Decoding parameter control unit 2603 uses the decoded parameters, the received data loss information, and the decoding parameters of the previous frame output from buffers 2604a to 2604e to determine the decoding parameters used for decoding the second decoded spectrum of the frame.
- Buffers 2604a to 2604e store, respectively, the amplitude adjustment coefficient, lag, residual shape vector, spectral residual gain, and scale factor, which are the decoding parameters of the frame.
- the other configuration in FIG. 26 is the same as the configuration of extended band decoding section 1403 in FIG.
- The coding parameters of the decoding parameters included in the extended band coding parameter, which is part of the second layer coded data of the frame (that is, the scale factor, lag, amplitude adjustment coefficient, residual shape vector, and spectral residual gain), are decoded by decoding units 1908, 2602, 2601, 1905, and 1906, respectively.
- Decoding parameter control unit 2603 uses each decoded parameter and the decoding parameters of the previous frame to determine, based on the received data loss information, the decoding parameters used for decoding the second decoded spectrum of the frame.
- The received data loss information indicates which part of the extended band coding parameter cannot be used by extended band decoding unit 1403 because of loss (packet loss, or errors detected as a result of transmission errors).
- The second decoded spectrum is decoded using the decoding parameters of the frame obtained by decoding parameter control section 2603 and the first decoded spectrum. The specific operation is the same as that of extended band decoding unit 1403 of FIG. 19 in Embodiment 2, so the description is omitted.
- Decoding parameter control section 2603 substitutes the decoding parameter of the corresponding frequency band of the previous frame for the decoding parameter of any frequency band whose coding parameter could not be obtained because of loss.
- T(n, m): lag of the m-th frequency band of the n-th frame
- γ(n, m): amplitude adjustment coefficient of the m-th frequency band of the n-th frame
- g(n, m): spectral residual gain of the m-th frequency band of the n-th frame
- As the decoding parameter corresponding to the lost coding parameter, the decoding parameter of the m-th band of the previous frame (the (n-1)-th frame) is output.
- The corresponding parameters of the previous frame may be substituted for all five types of decoding parameters or for any combination of them.
- the decoding parameters decoded using the received encoding parameters of the frame are output as they are.
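The substitution rule can be sketched as follows: for each parameter type and band, the value from the previous frame is used where the loss information marks the entry as lost, and the value decoded from the current frame is passed through otherwise. The dictionary layout, the `lost` set of (name, band) pairs, and the function name are assumptions introduced for this illustration.

```python
def conceal_lost_parameters(current, previous, lost):
    """Return the decoding parameters to use for frame n.

    current, previous: dict mapping a parameter name to a list with one
        value per frequency band (frame n and frame n-1 respectively).
    lost: set of (parameter name, band index) pairs that the data loss
        information marks as unusable in frame n.
    """
    result = {}
    for name, prev_values in previous.items():
        cur_values = current.get(name, [None] * len(prev_values))
        result[name] = [
            prev_values[m] if (name, m) in lost else cur_values[m]
            for m in range(len(prev_values))
        ]
    return result
```

For example, if only the lag of band 1 is lost, `conceal_lost_parameters(cur, prev, {("lag", 1)})` keeps every received value and replaces that single entry with the lag of band 1 from the previous frame.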
- In second layer frame compensation, the corresponding decoding parameters of the previous frame are used as the extended band decoding parameters for the entire high band of the frame.
- Decoding may be performed by the method described above only when the correlation between the previous frame and the frame is higher than a threshold; when the correlation is lower than the threshold, decoding may be performed by the method of Embodiment 2, which is closed within the frame.
- Indices of the correlation between the previous frame and the frame include the correlation coefficient and the spectral distance between the two frames, calculated using spectral envelope information such as the LPC parameters obtained from the first layer coding parameters, information on the voiced stationarity of the signal such as the pitch period and pitch gain parameters, the first layer low-band decoded signal, or the first layer low-band decoded spectrum itself.
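The threshold decision between the two concealment strategies can be illustrated with the sketch below. It assumes that the inter-frame correlation is measured as the normalized cross-correlation of the first layer low-band spectra of the previous and current frames, and that the threshold is 0.8; both choices, as well as the function names and the returned labels, are hypothetical.

```python
import math


def spectral_correlation(prev_low, cur_low):
    """Normalized cross-correlation between two low-band spectra."""
    num = sum(p * c for p, c in zip(prev_low, cur_low))
    den = math.sqrt(sum(p * p for p in prev_low) * sum(c * c for c in cur_low))
    return num / den if den > 0 else 0.0


def choose_concealment(prev_low, cur_low, threshold=0.8):
    """Use previous-frame parameters only when the frames are similar.

    Returns "previous_frame" when the correlation exceeds the threshold,
    otherwise "intra_frame" (decoding closed within the current frame).
    """
    if spectral_correlation(prev_low, cur_low) > threshold:
        return "previous_frame"
    return "intra_frame"
```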
- the decoding parameter control unit 2603 for the frequency band in which data loss of the frame has occurred, decodes the decoding parameter of the frequency band of the previous frame, and the previous frame and the frame.
- the decoding parameters of the frequency band are obtained using the decoding parameters of the frequency band adjacent to the frequency band.
- the decoding parameter corresponding to the lost coding parameter is obtained as follows, using the decoding parameter of the m-th band of the previous frame (the (n-1)-th frame) and the decoding parameters of the band adjacent to that frequency band (the same adjacent band in the previous frame and the current frame).
- the decoding parameters decoded using the received encoding parameters of the current frame are output as they are.
- decoding is performed by the method described above only when the correlation is higher than the threshold.
- the decoding parameter may be obtained using the parameter of the same frequency band of the previous frame, or by the method described in the second embodiment.
- for a frequency band in which loss of the extended band encoding parameter has occurred, the decoded spectrum of the vector decoding unit 1402B in the second layer decoding unit 103 shown in FIG. 14 is not added.
- the extended band decoding unit 1403 may be configured not to include a spectral residual shape codebook, a spectral residual gain codebook, and a multiplier.
- although a two-layer configuration has been shown as an example, three or more layers may be used.
- the scalable decoding device and the scalable encoding device according to the present invention are not limited to the above-described Embodiments 1 to 3, and can be implemented with various modifications.
- the scalable decoding device and the scalable coding device according to the present invention can be mounted on a communication terminal device and a base station device in a mobile communication system, whereby a communication terminal device and a base station device having the same effects as described above can be provided.
- the present invention can also be realized by software.
- Each functional block used in the description of each of the above embodiments is typically realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include some or all of them.
- although the term LSI is used here, the circuit may also be called an IC (integrated circuit), a system LSI, a super LSI, or an ultra LSI, depending on the degree of integration.
- the method of circuit integration is not limited to LSIs, and implementation using dedicated circuitry or general purpose processors is also possible.
- an FPGA (Field Programmable Gate Array), or a reconfigurable processor in which the connections and settings of circuit cells inside the LSI can be reconfigured, may be used.
- the decoding process is performed by the decoding method according to the second feature. For this reason, when the present invention is applied to a system designed so that protection against transmission path errors and against loss or discarding of coded information is strongest for the scale factor, followed in order by the amplitude adjustment coefficient, the lag, and the spectral residual (that is, the scale factor is the most strongly protected against errors, or is transmitted preferentially on the transmission path), degradation in the quality of decoded speech due to transmission path errors can be minimized. In addition, since the decoded speech quality changes gradually in units of the parameters described above, finer-grained scalability than before can be achieved.
- a buffer stores each decoding parameter decoded from the extended band coding parameters used for decoding the previous frame.
- a decoding parameter control unit determines the decoding parameters of the current frame using the decoding parameters of the current frame and the previous frame and the data loss information for the received bit stream of the current frame.
- a second decoded spectrum is generated using the first decoded spectrum of the current frame and the decoding parameters output from the decoding parameter control unit. For this reason, even when some or all of the extended band coded data, obtained by encoding the high frequency spectrum using a filter having the low frequency spectrum as its internal state, is lost and cannot be used for decoding, loss compensation can be performed by using the decoding parameters of the previous frame, which have high similarity, and a high-quality signal can be decoded even when data loss occurs.
- for a frequency band of the current frame in which data loss has occurred, the decoding parameter control unit may obtain the decoding parameter of that band using the decoding parameter of the same band in the previous frame and the decoding parameters of the frequency band adjacent to that band in the previous frame and the current frame.
- the scalable decoding device and the scalable coding device of the present invention can be applied to uses such as a mobile communication system and a packet communication system using the Internet protocol.
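The previous-frame substitution described above, gated by the inter-frame correlation, can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the decoding parameter control unit: the names `BandParams`, `control_decoding_params`, and `fallback` are hypothetical, only the three parameters named legibly in the text (lag, amplitude adjustment factor, spectral residual gain) are modelled, and the behaviour when the correlation exactly equals the threshold is an arbitrary choice because the text does not specify it.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class BandParams:
    """Decoding parameters of one extended-band frequency band (hypothetical container)."""
    lag: int               # T(n, m)
    amp_adjust: float      # amplitude adjustment factor
    residual_gain: float   # g(n, m)


def control_decoding_params(
    received: List[Optional[BandParams]],
    previous: List[BandParams],
    correlation: float,
    threshold: float,
    fallback: Callable[[int], BandParams],
) -> List[BandParams]:
    """Choose the decoding parameters for every band m of the current frame.

    received[m] is None when the coding parameter of band m was lost.
    previous[m] holds the buffered parameters of band m in the previous frame.
    fallback(m) stands in for a method closed within the current frame.
    """
    output = []
    for m, params in enumerate(received):
        if params is not None:
            # No loss: output the decoded parameters as they are.
            output.append(params)
        elif correlation > threshold:
            # Loss and similar frames: reuse band m of the previous frame.
            output.append(previous[m])
        else:
            # Loss and dissimilar frames (assumption: equality also lands here).
            output.append(fallback(m))
    return output


# Example: band 1 of the current frame is lost.
previous_frame = [BandParams(40, 0.8, 1.0), BandParams(52, 0.6, 0.5)]
current_frame = [BandParams(41, 0.7, 0.9), None]
concealed = control_decoding_params(
    current_frame, previous_frame,
    correlation=0.9, threshold=0.5,
    fallback=lambda m: BandParams(0, 0.0, 0.0),
)
```

Passing the intra-frame method as a callable keeps the sketch independent of how the second embodiment derives its parameters, which is not reproduced here.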
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006542422A JP4977472B2 (ja) | 2004-11-05 | 2005-11-02 | スケーラブル復号化装置 |
US11/718,437 US7983904B2 (en) | 2004-11-05 | 2005-11-02 | Scalable decoding apparatus and scalable encoding apparatus |
BRPI0517780-4A BRPI0517780A2 (pt) | 2004-11-05 | 2005-11-02 | aparelho de decodificação escalável e aparelho de codificação escalável |
EP05805495.8A EP1808684B1 (en) | 2004-11-05 | 2005-11-02 | Scalable decoding apparatus |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004322954 | 2004-11-05 | ||
JP2004-322954 | 2004-11-05 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006049205A1 true WO2006049205A1 (ja) | 2006-05-11 |
Family
ID=36319210
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/020201 WO2006049205A1 (ja) | 2004-11-05 | 2005-11-02 | スケーラブル復号化装置およびスケーラブル符号化装置 |
Country Status (8)
Country | Link |
---|---|
US (1) | US7983904B2 (ja) |
EP (1) | EP1808684B1 (ja) |
JP (1) | JP4977472B2 (ja) |
KR (1) | KR20070084002A (ja) |
CN (1) | CN101048649A (ja) |
BR (1) | BRPI0517780A2 (ja) |
RU (2) | RU2404506C2 (ja) |
WO (1) | WO2006049205A1 (ja) |
Cited By (27)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008016925A2 (en) * | 2006-07-31 | 2008-02-07 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
JP2008058953A (ja) * | 2006-07-26 | 2008-03-13 | Nec (China) Co Ltd | 音声透かしをベースとするメディア・プログラムの識別方法及び装置 |
WO2008072737A1 (ja) * | 2006-12-15 | 2008-06-19 | Panasonic Corporation | 符号化装置、復号装置およびこれらの方法 |
WO2008114078A1 (en) * | 2007-03-16 | 2008-09-25 | Nokia Corporation | En encoder |
WO2008120437A1 (ja) * | 2007-03-02 | 2008-10-09 | Panasonic Corporation | 符号化装置、復号装置およびそれらの方法 |
JP2009545775A (ja) * | 2006-07-31 | 2009-12-24 | クゥアルコム・インコーポレイテッド | ゲインファクタ制限のためのシステム、方法及び装置 |
JP2010515090A (ja) * | 2006-12-28 | 2010-05-06 | アクトイマジン | 音声コード化の方法および装置 |
JP2010522346A (ja) * | 2006-12-28 | 2010-07-01 | アクトイマジン | 音声コード化の方法および装置 |
JP2011154383A (ja) * | 2007-03-02 | 2011-08-11 | Panasonic Corp | 音声符号化装置、音声復号装置およびそれらの方法 |
CN101089951B (zh) * | 2006-06-16 | 2011-08-31 | 北京天籁传音数字技术有限公司 | 频带扩展编码方法及装置和解码方法及装置 |
US8260609B2 (en) | 2006-07-31 | 2012-09-04 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of inactive frames |
RU2471252C2 (ru) * | 2007-03-02 | 2012-12-27 | Панасоник Корпорэйшн | Устройство кодирования и способ кодирования |
WO2013027629A1 (ja) | 2011-08-24 | 2013-02-28 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
WO2013027630A1 (ja) | 2011-08-24 | 2013-02-28 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
CN103366751A (zh) * | 2012-03-28 | 2013-10-23 | 北京天籁传音数字技术有限公司 | 一种声音编解码装置及其方法 |
JP2014531056A (ja) * | 2011-10-21 | 2014-11-20 | サムスン エレクトロニクスカンパニー リミテッド | フレームエラー隠匿方法及びその装置、並びにオーディオ復号化方法及びその装置 |
CN104969291A (zh) * | 2013-02-08 | 2015-10-07 | 高通股份有限公司 | 执行用于增益确定的滤波的系统及方法 |
US9361900B2 (en) | 2011-08-24 | 2016-06-07 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US9406312B2 (en) | 2010-04-13 | 2016-08-02 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
JP2017016141A (ja) * | 2012-03-29 | 2017-01-19 | ▲ホア▼▲ウェイ▼技術有限公司Huawei Technologies Co.,Ltd. | 信号符号化および復号化の方法および装置 |
US9583112B2 (en) | 2010-04-13 | 2017-02-28 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US9659573B2 (en) | 2010-04-13 | 2017-05-23 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
JP2017102299A (ja) * | 2015-12-02 | 2017-06-08 | パナソニックIpマネジメント株式会社 | 音声信号復号装置及び音声信号復号方法 |
US9691410B2 (en) | 2009-10-07 | 2017-06-27 | Sony Corporation | Frequency band extending device and method, encoding device and method, decoding device and method, and program |
US9767824B2 (en) | 2010-10-15 | 2017-09-19 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US9875746B2 (en) | 2013-09-19 | 2018-01-23 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US10692511B2 (en) | 2013-12-27 | 2020-06-23 | Sony Corporation | Decoding apparatus and method, and program |
Families Citing this family (50)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1744139B1 (en) * | 2004-05-14 | 2015-11-11 | Panasonic Intellectual Property Corporation of America | Decoding apparatus and method thereof |
JP4977471B2 (ja) | 2004-11-05 | 2012-07-18 | パナソニック株式会社 | 符号化装置及び符号化方法 |
JP4899359B2 (ja) * | 2005-07-11 | 2012-03-21 | ソニー株式会社 | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
CN101273403B (zh) * | 2005-10-14 | 2012-01-18 | 松下电器产业株式会社 | 可扩展编码装置、可扩展解码装置以及其方法 |
BRPI0619258A2 (pt) * | 2005-11-30 | 2011-09-27 | Matsushita Electric Ind Co Ltd | aparelho de codificação de sub-banda e método de codificação de sub-banda |
DE602006015097D1 (de) * | 2005-11-30 | 2010-08-05 | Panasonic Corp | Skalierbare codierungsvorrichtung und skalierbares codierungsverfahren |
US8352254B2 (en) * | 2005-12-09 | 2013-01-08 | Panasonic Corporation | Fixed code book search device and fixed code book search method |
FR2912249A1 (fr) * | 2007-02-02 | 2008-08-08 | France Telecom | Codage/decodage perfectionnes de signaux audionumeriques. |
US9466307B1 (en) * | 2007-05-22 | 2016-10-11 | Digimarc Corporation | Robust spectral encoding and decoding methods |
CA2690433C (en) * | 2007-06-22 | 2016-01-19 | Voiceage Corporation | Method and device for sound activity detection and sound signal classification |
JP5098530B2 (ja) * | 2007-09-12 | 2012-12-12 | 富士通株式会社 | 復号化装置、復号化方法および復号化プログラム |
CN100524462C (zh) * | 2007-09-15 | 2009-08-05 | 华为技术有限公司 | 对高带信号进行帧错误隐藏的方法及装置 |
US9872066B2 (en) * | 2007-12-18 | 2018-01-16 | Ibiquity Digital Corporation | Method for streaming through a data service over a radio link subsystem |
EP2224432B1 (en) * | 2007-12-21 | 2017-03-15 | Panasonic Intellectual Property Corporation of America | Encoder, decoder, and encoding method |
JP5485909B2 (ja) * | 2007-12-31 | 2014-05-07 | エルジー エレクトロニクス インコーポレイティド | オーディオ信号処理方法及び装置 |
EP2251861B1 (en) * | 2008-03-14 | 2017-11-22 | Panasonic Intellectual Property Corporation of America | Encoding device and method thereof |
EP2255534B1 (en) * | 2008-03-20 | 2017-12-20 | Samsung Electronics Co., Ltd. | Apparatus and method for encoding using bandwidth extension in portable terminal |
JP2009300707A (ja) * | 2008-06-13 | 2009-12-24 | Sony Corp | 情報処理装置および方法、並びにプログラム |
KR101424944B1 (ko) * | 2008-12-15 | 2014-08-01 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 오디오 인코더 및 대역폭 확장 디코더 |
WO2010070770A1 (ja) * | 2008-12-19 | 2010-06-24 | 富士通株式会社 | 音声帯域拡張装置及び音声帯域拡張方法 |
EP2490217A4 (en) * | 2009-10-14 | 2016-08-24 | Panasonic Ip Corp America | ENCODING DEVICE, ENCODING METHOD AND CORRESPONDING METHODS |
JP5295380B2 (ja) | 2009-10-20 | 2013-09-18 | パナソニック株式会社 | 符号化装置、復号化装置およびこれらの方法 |
KR101309671B1 (ko) * | 2009-10-21 | 2013-09-23 | 돌비 인터네셔널 에이비 | 결합된 트랜스포저 필터 뱅크에서의 오버샘플링 |
EP2555188B1 (en) * | 2010-03-31 | 2014-05-14 | Fujitsu Limited | Bandwidth extension apparatuses and methods |
BR112012032746A2 (pt) * | 2010-06-21 | 2016-11-08 | Panasonic Corp | dispositivo de descodificação, dispositivo de codificação, e métodos para os mesmos. |
US8762158B2 (en) * | 2010-08-06 | 2014-06-24 | Samsung Electronics Co., Ltd. | Decoding method and decoding apparatus therefor |
US9230551B2 (en) * | 2010-10-18 | 2016-01-05 | Nokia Technologies Oy | Audio encoder or decoder apparatus |
WO2012144128A1 (ja) * | 2011-04-20 | 2012-10-26 | パナソニック株式会社 | 音声音響符号化装置、音声音響復号装置、およびこれらの方法 |
US8620646B2 (en) * | 2011-08-08 | 2013-12-31 | The Intellisis Corporation | System and method for tracking sound pitch across an audio signal using harmonic envelope |
CN103366749B (zh) * | 2012-03-28 | 2016-01-27 | 北京天籁传音数字技术有限公司 | 一种声音编解码装置及其方法 |
EP2842322A1 (en) * | 2012-04-24 | 2015-03-04 | Telefonaktiebolaget LM Ericsson (Publ) | Encoding and deriving parameters for coded multi-layer video sequences |
US9601125B2 (en) * | 2013-02-08 | 2017-03-21 | Qualcomm Incorporated | Systems and methods of performing noise modulation and gain adjustment |
CN108364657B (zh) | 2013-07-16 | 2020-10-30 | 超清编解码有限公司 | 处理丢失帧的方法和解码器 |
EP2830061A1 (en) * | 2013-07-22 | 2015-01-28 | Fraunhofer Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an encoded audio signal using temporal noise/patch shaping |
CN105745703B (zh) * | 2013-09-16 | 2019-12-10 | 三星电子株式会社 | 信号编码方法和装置以及信号解码方法和装置 |
US8879858B1 (en) * | 2013-10-01 | 2014-11-04 | Gopro, Inc. | Multi-channel bit packing engine |
KR101782454B1 (ko) * | 2013-12-06 | 2017-09-28 | 후아웨이 테크놀러지 컴퍼니 리미티드 | 이미지 복호화 장치, 이미지 부호화 장치, 및 부호화된 데이터 변환 장치 |
CN111370008B (zh) * | 2014-02-28 | 2024-04-09 | 弗朗霍弗应用研究促进协会 | 解码装置、编码装置、解码方法、编码方法、终端装置、以及基站装置 |
ES2878061T3 (es) * | 2014-05-01 | 2021-11-18 | Nippon Telegraph & Telephone | Dispositivo de generación de secuencia envolvente combinada periódica, método de generación de secuencia envolvente combinada periódica, programa de generación de secuencia envolvente combinada periódica y soporte de registro |
CN110875048B (zh) * | 2014-05-01 | 2023-06-09 | 日本电信电话株式会社 | 编码装置、及其方法、记录介质 |
CN106683681B (zh) | 2014-06-25 | 2020-09-25 | 华为技术有限公司 | 处理丢失帧的方法和装置 |
EP4293666A3 (en) | 2014-07-28 | 2024-03-06 | Samsung Electronics Co., Ltd. | Signal encoding method and apparatus and signal decoding method and apparatus |
JP2016038435A (ja) * | 2014-08-06 | 2016-03-22 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
US10825467B2 (en) * | 2017-04-21 | 2020-11-03 | Qualcomm Incorporated | Non-harmonic speech detection and bandwidth extension in a multi-source environment |
US10431231B2 (en) * | 2017-06-29 | 2019-10-01 | Qualcomm Incorporated | High-band residual prediction with time-domain inter-channel bandwidth extension |
CN110556122B (zh) * | 2019-09-18 | 2024-01-19 | 腾讯科技(深圳)有限公司 | 频带扩展方法、装置、电子设备及计算机可读存储介质 |
CN113113032B (zh) * | 2020-01-10 | 2024-08-09 | 华为技术有限公司 | 一种音频编解码方法和音频编解码设备 |
CN112309408A (zh) * | 2020-11-10 | 2021-02-02 | 北京百瑞互联技术有限公司 | 一种扩展lc3音频编解码带宽的方法、装置及存储介质 |
CN113724725B (zh) * | 2021-11-04 | 2022-01-18 | 北京百瑞互联技术有限公司 | 一种蓝牙音频啸叫检测抑制方法、装置、介质及蓝牙设备 |
CN114664319A (zh) * | 2022-03-28 | 2022-06-24 | 北京百度网讯科技有限公司 | 频带扩展方法、装置、设备、介质及程序产品 |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2779886B2 (ja) * | 1992-10-05 | 1998-07-23 | 日本電信電話株式会社 | 広帯域音声信号復元方法 |
JP2964879B2 (ja) * | 1994-08-22 | 1999-10-18 | 日本電気株式会社 | ポストフィルタ |
JP3707153B2 (ja) * | 1996-09-24 | 2005-10-19 | ソニー株式会社 | ベクトル量子化方法、音声符号化方法及び装置 |
US6453288B1 (en) * | 1996-11-07 | 2002-09-17 | Matsushita Electric Industrial Co., Ltd. | Method and apparatus for producing component of excitation vector |
GB2351889B (en) | 1999-07-06 | 2003-12-17 | Ericsson Telefon Ab L M | Speech band expansion |
US7742927B2 (en) * | 2000-04-18 | 2010-06-22 | France Telecom | Spectral enhancing method and device |
EP1405303A1 (en) * | 2001-06-28 | 2004-04-07 | Koninklijke Philips Electronics N.V. | Wideband signal transmission system |
DE60208426T2 (de) * | 2001-11-02 | 2006-08-24 | Matsushita Electric Industrial Co., Ltd., Kadoma | Vorrichtung zur signalkodierung, signaldekodierung und system zum verteilen von audiodaten |
JP3926726B2 (ja) * | 2001-11-14 | 2007-06-06 | 松下電器産業株式会社 | 符号化装置および復号化装置 |
JP2003323199A (ja) * | 2002-04-26 | 2003-11-14 | Matsushita Electric Ind Co Ltd | 符号化装置、復号化装置及び符号化方法、復号化方法 |
JP3881946B2 (ja) * | 2002-09-12 | 2007-02-14 | 松下電器産業株式会社 | 音響符号化装置及び音響符号化方法 |
BRPI0305710B1 (pt) * | 2002-08-01 | 2017-11-07 | Panasonic Corporation | "apparatus and method of decoding of audio" |
JP3861770B2 (ja) * | 2002-08-21 | 2006-12-20 | ソニー株式会社 | 信号符号化装置及び方法、信号復号装置及び方法、並びにプログラム及び記録媒体 |
US7844451B2 (en) * | 2003-09-16 | 2010-11-30 | Panasonic Corporation | Spectrum coding/decoding apparatus and method for reducing distortion of two band spectrums |
-
2005
- 2005-11-02 BR BRPI0517780-4A patent/BRPI0517780A2/pt not_active IP Right Cessation
- 2005-11-02 RU RU2007116937/09A patent/RU2404506C2/ru not_active IP Right Cessation
- 2005-11-02 WO PCT/JP2005/020201 patent/WO2006049205A1/ja active Application Filing
- 2005-11-02 JP JP2006542422A patent/JP4977472B2/ja not_active Expired - Fee Related
- 2005-11-02 US US11/718,437 patent/US7983904B2/en not_active Expired - Fee Related
- 2005-11-02 KR KR1020077010273A patent/KR20070084002A/ko not_active Application Discontinuation
- 2005-11-02 EP EP05805495.8A patent/EP1808684B1/en not_active Not-in-force
- 2005-11-02 CN CNA2005800373627A patent/CN101048649A/zh active Pending
-
2010
- 2010-10-01 RU RU2010140339/09A patent/RU2434324C1/ru not_active IP Right Cessation
Non-Patent Citations (2)
Title |
---|
KOVESI B ET AL: "A Scalable Speech and Audio Coding Scheme with Continuous Bitrate Flexibility.", PROC OF ICASSP-04., 17 March 2004 (2004-03-17), pages I-273 - 276, XP010717618 * |
OSHIKIRI M. ET AL: "Pichi Filtering ni Motozuku Spectre Fugoka o Mochiita Choko Taiiki Schelable Onsei Fugoka no Kaizen.", THE ACUSTICAL SOCIETY OF JAPAN 2004 NEN SHUKI KENKYU HAPPYOKAI KOEN RONBUNSHU-I., 21 September 2004 (2004-09-21), pages 297 - 298, XP002998459 * |
Cited By (56)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101089951B (zh) * | 2006-06-16 | 2011-08-31 | 北京天籁传音数字技术有限公司 | 频带扩展编码方法及装置和解码方法及装置 |
JP2008058953A (ja) * | 2006-07-26 | 2008-03-13 | Nec (China) Co Ltd | 音声透かしをベースとするメディア・プログラムの識別方法及び装置 |
US7957977B2 (en) | 2006-07-26 | 2011-06-07 | Nec (China) Co., Ltd. | Media program identification method and apparatus based on audio watermarking |
EP2741288A3 (en) * | 2006-07-31 | 2014-08-06 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
US8260609B2 (en) | 2006-07-31 | 2012-09-04 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of inactive frames |
WO2008016925A3 (en) * | 2006-07-31 | 2008-08-14 | Qualcomm Inc | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
US9454974B2 (en) | 2006-07-31 | 2016-09-27 | Qualcomm Incorporated | Systems, methods, and apparatus for gain factor limiting |
US9324333B2 (en) | 2006-07-31 | 2016-04-26 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of inactive frames |
WO2008016925A2 (en) * | 2006-07-31 | 2008-02-07 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
JP2009545775A (ja) * | 2006-07-31 | 2009-12-24 | クゥアルコム・インコーポレイテッド | ゲインファクタ制限のためのシステム、方法及び装置 |
US8532984B2 (en) | 2006-07-31 | 2013-09-10 | Qualcomm Incorporated | Systems, methods, and apparatus for wideband encoding and decoding of active frames |
JP5339919B2 (ja) * | 2006-12-15 | 2013-11-13 | パナソニック株式会社 | 符号化装置、復号装置およびこれらの方法 |
WO2008072737A1 (ja) * | 2006-12-15 | 2008-06-19 | Panasonic Corporation | 符号化装置、復号装置およびこれらの方法 |
US8560328B2 (en) | 2006-12-15 | 2013-10-15 | Panasonic Corporation | Encoding device, decoding device, and method thereof |
JP2010522346A (ja) * | 2006-12-28 | 2010-07-01 | アクトイマジン | 音声コード化の方法および装置 |
JP2010515090A (ja) * | 2006-12-28 | 2010-05-06 | アクトイマジン | 音声コード化の方法および装置 |
US8935161B2 (en) | 2007-03-02 | 2015-01-13 | Panasonic Intellectual Property Corporation Of America | Encoding device, decoding device, and method thereof for secifying a band of a great error |
JP2009042733A (ja) * | 2007-03-02 | 2009-02-26 | Panasonic Corp | 符号化装置、復号装置およびそれらの方法 |
US8543392B2 (en) | 2007-03-02 | 2013-09-24 | Panasonic Corporation | Encoding device, decoding device, and method thereof for specifying a band of a great error |
RU2471252C2 (ru) * | 2007-03-02 | 2012-12-27 | Панасоник Корпорэйшн | Устройство кодирования и способ кодирования |
JP2011154384A (ja) * | 2007-03-02 | 2011-08-11 | Panasonic Corp | 音声符号化装置、音声復号装置およびそれらの方法 |
RU2502138C2 (ru) * | 2007-03-02 | 2013-12-20 | Панасоник Корпорэйшн | Кодирующее устройство, декодирующее устройство и способ |
JP2011154383A (ja) * | 2007-03-02 | 2011-08-11 | Panasonic Corp | 音声符号化装置、音声復号装置およびそれらの方法 |
EP2747080A3 (en) * | 2007-03-02 | 2014-08-06 | Panasonic Intellectual Property Corporation of America | Encoding device, decoding device, and method thereof |
EP2747079A3 (en) * | 2007-03-02 | 2014-08-13 | Panasonic Intellectual Property Corporation of America | Encoding device, decoding device, and method thereof |
WO2008120437A1 (ja) * | 2007-03-02 | 2008-10-09 | Panasonic Corporation | 符号化装置、復号装置およびそれらの方法 |
US8935162B2 (en) | 2007-03-02 | 2015-01-13 | Panasonic Intellectual Property Corporation Of America | Encoding device, decoding device, and method thereof for specifying a band of a great error |
WO2008114078A1 (en) * | 2007-03-16 | 2008-09-25 | Nokia Corporation | En encoder |
US9691410B2 (en) | 2009-10-07 | 2017-06-27 | Sony Corporation | Frequency band extending device and method, encoding device and method, decoding device and method, and program |
US9406312B2 (en) | 2010-04-13 | 2016-08-02 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US10224054B2 (en) | 2010-04-13 | 2019-03-05 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US10546594B2 (en) | 2010-04-13 | 2020-01-28 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US10381018B2 (en) | 2010-04-13 | 2019-08-13 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US10297270B2 (en) | 2010-04-13 | 2019-05-21 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US9679580B2 (en) | 2010-04-13 | 2017-06-13 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US9659573B2 (en) | 2010-04-13 | 2017-05-23 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US9583112B2 (en) | 2010-04-13 | 2017-02-28 | Sony Corporation | Signal processing apparatus and signal processing method, encoder and encoding method, decoder and decoding method, and program |
US9767824B2 (en) | 2010-10-15 | 2017-09-19 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US10236015B2 (en) | 2010-10-15 | 2019-03-19 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US9842603B2 (en) | 2011-08-24 | 2017-12-12 | Sony Corporation | Encoding device and encoding method, decoding device and decoding method, and program |
US9361900B2 (en) | 2011-08-24 | 2016-06-07 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US9390717B2 (en) | 2011-08-24 | 2016-07-12 | Sony Corporation | Encoding device and method, decoding device and method, and program |
WO2013027629A1 (ja) | 2011-08-24 | 2013-02-28 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
WO2013027630A1 (ja) | 2011-08-24 | 2013-02-28 | ソニー株式会社 | 符号化装置および方法、復号装置および方法、並びにプログラム |
JP2014531056A (ja) * | 2011-10-21 | 2014-11-20 | サムスン エレクトロニクスカンパニー リミテッド | フレームエラー隠匿方法及びその装置、並びにオーディオ復号化方法及びその装置 |
CN103366751B (zh) * | 2012-03-28 | 2015-10-14 | 北京天籁传音数字技术有限公司 | 一种声音编解码装置及其方法 |
CN103366751A (zh) * | 2012-03-28 | 2013-10-23 | 北京天籁传音数字技术有限公司 | 一种声音编解码装置及其方法 |
US9899033B2 (en) | 2012-03-29 | 2018-02-20 | Huawei Technologies Co., Ltd. | Signal coding and decoding methods and devices |
JP2017016141A (ja) * | 2012-03-29 | 2017-01-19 | ▲ホア▼▲ウェイ▼技術有限公司Huawei Technologies Co.,Ltd. | 信号符号化および復号化の方法および装置 |
US10600430B2 (en) | 2012-03-29 | 2020-03-24 | Huawei Technologies Co., Ltd. | Signal decoding method, audio signal decoder and non-transitory computer-readable medium |
CN104969291A (zh) * | 2013-02-08 | 2015-10-07 | 高通股份有限公司 | 执行用于增益确定的滤波的系统及方法 |
US9875746B2 (en) | 2013-09-19 | 2018-01-23 | Sony Corporation | Encoding device and method, decoding device and method, and program |
US10692511B2 (en) | 2013-12-27 | 2020-06-23 | Sony Corporation | Decoding apparatus and method, and program |
US11705140B2 (en) | 2013-12-27 | 2023-07-18 | Sony Corporation | Decoding apparatus and method, and program |
WO2017094203A1 (ja) * | 2015-12-02 | 2017-06-08 | パナソニックIpマネジメント株式会社 | 音声信号復号装置及び音声信号復号方法 |
JP2017102299A (ja) * | 2015-12-02 | 2017-06-08 | パナソニックIpマネジメント株式会社 | 音声信号復号装置及び音声信号復号方法 |
Also Published As
Publication number | Publication date |
---|---|
EP1808684B1 (en) | 2014-07-30 |
CN101048649A (zh) | 2007-10-03 |
JP4977472B2 (ja) | 2012-07-18 |
EP1808684A1 (en) | 2007-07-18 |
RU2434324C1 (ru) | 2011-11-20 |
KR20070084002A (ko) | 2007-08-24 |
JPWO2006049205A1 (ja) | 2008-05-29 |
US20080126082A1 (en) | 2008-05-29 |
US7983904B2 (en) | 2011-07-19 |
EP1808684A4 (en) | 2010-07-14 |
BRPI0517780A2 (pt) | 2011-04-19 |
RU2404506C2 (ru) | 2010-11-20 |
RU2007116937A (ru) | 2008-11-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4977472B2 (ja) | スケーラブル復号化装置 | |
JP5383676B2 (ja) | 符号化装置、復号装置およびこれらの方法 | |
JP4859670B2 (ja) | 音声符号化装置および音声符号化方法 | |
JP4977471B2 (ja) | 符号化装置及び符号化方法 | |
KR101363793B1 (ko) | 부호화 장치, 복호 장치 및 그 방법 | |
US8433581B2 (en) | Audio encoding device and audio encoding method | |
US20090262945A1 (en) | Stereo encoding device, stereo decoding device, and stereo encoding method | |
JP5036317B2 (ja) | スケーラブル符号化装置、スケーラブル復号化装置、およびこれらの方法 | |
JP4606418B2 (ja) | スケーラブル符号化装置、スケーラブル復号装置及びスケーラブル符号化方法 | |
KR20070029754A (ko) | 음성 부호화 장치 및 그 방법과, 음성 복호화 장치 및 그방법 | |
JPWO2009057327A1 (ja) | 符号化装置および復号装置 | |
WO2006129615A1 (ja) | スケーラブル符号化装置およびスケーラブル符号化方法 | |
JP5340378B2 (ja) | チャネル信号生成装置、音響信号符号化装置、音響信号復号装置、音響信号符号化方法及び音響信号復号方法 | |
JP4373693B2 (ja) | 音響信号の階層符号化方法および階層復号化方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states | Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KN KP KR KZ LC LK LR LS LT LU LV LY MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
AL | Designated countries for regional patents | Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
WWE | Wipo information: entry into national phase | Ref document number: 2006542422 Country of ref document: JP |
WWE | Wipo information: entry into national phase | Ref document number: 200580037362.7 Country of ref document: CN |
WWE | Wipo information: entry into national phase | Ref document number: 11718437 Country of ref document: US |
WWE | Wipo information: entry into national phase | Ref document number: 2005805495 Country of ref document: EP |
WWE | Wipo information: entry into national phase | Ref document number: 2007116937 Country of ref document: RU Ref document number: 1020077010273 Country of ref document: KR |
NENP | Non-entry into the national phase | Ref country code: DE |
WWP | Wipo information: published in national office | Ref document number: 2005805495 Country of ref document: EP |
WWP | Wipo information: published in national office | Ref document number: 11718437 Country of ref document: US |
ENP | Entry into the national phase | Ref document number: PI0517780 Country of ref document: BR Kind code of ref document: A2 Effective date: 20070504 |