WO2012032759A1 - 符号化装置及び符号化方法 - Google Patents
符号化装置及び符号化方法 Download PDFInfo
- Publication number
- WO2012032759A1 WO2012032759A1 PCT/JP2011/004960 JP2011004960W WO2012032759A1 WO 2012032759 A1 WO2012032759 A1 WO 2012032759A1 JP 2011004960 W JP2011004960 W JP 2011004960W WO 2012032759 A1 WO2012032759 A1 WO 2012032759A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- suppression
- spectrum
- celp
- encoding
- unit
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 60
- 230000001629 suppression Effects 0.000 claims abstract description 351
- 238000001228 spectrum Methods 0.000 claims abstract description 235
- 230000000670 limiting effect Effects 0.000 claims abstract description 41
- 238000006243 chemical reaction Methods 0.000 claims abstract description 3
- 238000004364 calculation method Methods 0.000 claims description 24
- 230000001174 ascending effect Effects 0.000 claims description 8
- 230000007423 decrease Effects 0.000 claims description 6
- 230000015556 catabolic process Effects 0.000 abstract description 3
- 238000006731 degradation reaction Methods 0.000 abstract description 3
- 238000011156 evaluation Methods 0.000 description 49
- 238000000605 extraction Methods 0.000 description 16
- 230000002829 reductive effect Effects 0.000 description 16
- 230000005236 sound signal Effects 0.000 description 10
- 230000006866 deterioration Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 230000002238 attenuated effect Effects 0.000 description 4
- 230000003247 decreasing effect Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000000926 separation method Methods 0.000 description 4
- 230000003044 adaptive effect Effects 0.000 description 3
- 239000000284 extract Substances 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- ZMOQBTRTDSZZRU-UHFFFAOYSA-N 2-(1,2-dichloroethyl)pyridine;hydrochloride Chemical compound Cl.ClCC(Cl)C1=CC=CC=N1 ZMOQBTRTDSZZRU-UHFFFAOYSA-N 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 230000005284 excitation Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 230000036961 partial effect Effects 0.000 description 1
- 238000013139 quantization Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0212—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/20—Vocoders using multiple modes using sound class specific coding, hybrid encoders or object based coding
Definitions
- the present invention relates to an encoding device and an encoding method.
- Non-Patent Document 1 Hierarchical structure of CELP (Code Excited Linear Prediction) coding suitable for audio signals and transform coding suitable for music signals as coding methods that can compress voice and music with low bit rate and high sound quality
- a combined encoding method has been proposed (see, for example, Non-Patent Document 1).
- the audio signal and the music signal may be collectively referred to as an acoustic signal.
- the encoding apparatus first encodes an input signal by the CELP encoding scheme to generate CELP encoded data.
- the encoding apparatus converts a residual spectrum obtained by converting a residual signal (hereinafter referred to as a CELP residual signal) between an input signal and a CELP decoded signal (decoding result of CELP encoded data) into a frequency domain.
- a CELP residual signal a residual spectrum obtained by converting a residual signal (hereinafter referred to as a CELP residual signal) between an input signal and a CELP decoded signal (decoding result of CELP encoded data) into a frequency domain.
- a transform coding method a method has been proposed in which a pulse is generated at a frequency having a large residual spectrum energy and the information of the pulse is coded (see Non-Patent Document 1).
- the CELP encoding method is suitable for audio signal encoding, but the audio signal quality deteriorates because the encoding model differs for music signals. Therefore, when a music signal is encoded by the above encoding method, the CELP residual signal component becomes large, so that it is difficult to improve the sound quality even if the CELP residual signal (residual spectrum) is encoded by transform encoding. There are challenges.
- Non-Patent Document 1 when the sampling frequency of the input signal is 16 kHz, the CELP component amplitude is suppressed (hereinafter referred to as CELP suppression) only in the middle band of 0.8 kHz to 5.5 kHz. Is called.
- the encoding apparatus does not directly perform transform coding on the CELP residual signal, but before that, another transform coding method (for example, Non-Patent Document 1 (Section 6.11. (Refer to 6.1)) to reduce the CELP component residual signal. For this reason, the encoding apparatus does not perform CELP suppression on the frequency component encoded by the above-described another transform encoding method even in the middle band.
- CELP suppression coefficient indicating the degree (intensity) of CELP suppression is uniform at frequencies other than the frequency where CELP suppression is not performed in the middle band.
- CELP suppression coefficients are stored in a code book (hereinafter referred to as a CELP suppression coefficient code book) for each CELP suppression strength.
- the encoding device Before performing transform coding, the encoding device performs CELP suppression by multiplying the CELP component (CELP decoded signal) and the CELP suppression coefficient stored in the CELP suppression coefficient codebook, and A residual spectrum with a CELP decoded signal (CELP decoded signal after CELP suppression) is obtained, and the residual spectrum is transcoded. This transform coding is performed on all CELP suppression coefficients.
- the encoding device calculates a residual signal between the signal obtained by adding the decoded signal of the transform encoded data and the CELP decoded signal in which the CELP component is suppressed and the input signal, and the energy of the residual signal (hereinafter, The CELP suppression coefficient that minimizes the coding distortion is determined, and the searched CELP suppression coefficient (the CELP suppression coefficient that minimizes the coding distortion) is encoded.
- the encoding apparatus can perform transform encoding with minimum encoding distortion for the entire band.
- main selection a series of processes for performing transform coding for each CELP suppression coefficient and determining a CELP suppression coefficient that minimizes coding distortion (residual signal energy)
- the decoding device suppresses the CELP component of the CELP decoded signal using the CELP suppression coefficient transmitted from the encoding device, and adds the transform-coded decoded signal to the CELP decoded signal in which the CELP component is suppressed. Accordingly, the decoding apparatus can obtain a decoded signal in which deterioration of sound quality due to CELP encoding is suppressed when encoding is performed by combining CELP encoding and transform encoding in a hierarchical structure.
- An object of the present invention is to select a part (hereinafter referred to as “preliminary selection”) of input signals (hereinafter referred to as target signals) for transform coding processing generated for each CELP suppression coefficient,
- An object of the present invention is to provide an encoding device and an encoding method that can reduce the amount of processing in the encoding device while limiting deterioration in encoding quality by limiting the targets for transform encoding.
- An encoding apparatus includes a first encoding unit that outputs a spectrum of a first decoded signal generated by decoding a first code obtained by first encoding of an input signal; A suppression unit that suppresses the amplitude of the spectrum of the first decoded signal using a suppression coefficient indicated from a plurality of suppression coefficients to generate a suppression spectrum, and uses the spectrum of the input signal and the suppression spectrum.
- a residual spectrum calculating unit that calculates a residual spectrum, and using the spectrum of the input signal and the residual spectrum, a predetermined number of suppression coefficients are preliminarily selected, and the preselected suppression coefficient is the suppression And a residual spectrum calculated by inputting a suppression spectrum generated by using the instructed suppression coefficient in the suppression unit to the residual spectrum calculation unit.
- the second encoded signal generated by decoding the second code obtained by the second encoding, the suppression spectrum, the input signal spectrum, And a second encoding unit for determining one suppression coefficient from the instructed suppression coefficients.
- An encoding method includes a first encoding step of outputting a spectrum of a first decoded signal generated by decoding a first code obtained by first encoding on an input signal; A suppression step of generating a suppression spectrum by suppressing the amplitude of the spectrum of the first decoded signal using a suppression coefficient indicated from a plurality of suppression coefficients, and using the spectrum of the input signal and the suppression spectrum Using the residual spectrum calculating step for calculating the residual spectrum and the spectrum of the input signal and the residual spectrum, a predetermined number of suppression coefficients used in the suppression step are preselected, and the preselected A preliminary selection step of setting a suppression coefficient to the instructed suppression coefficient, and a suppression spectrum generated by using the instructed suppression coefficient in the suppression step.
- the second decoding generated by decoding the second code obtained by the second encoding using the residual spectrum calculated in the residual spectrum calculating step by using the residual spectrum A second encoding step of determining one suppression coefficient from the instructed suppression coefficients using a signal spectrum, the suppression spectrum, and the input signal spectrum;
- a method of sequentially performing transform coding on all CELP suppression coefficient candidates As compared with the above, it is possible to reduce the amount of processing in the encoding device while suppressing deterioration in encoding quality.
- FIG. 1 is a block diagram showing a configuration of an encoding apparatus according to Embodiment 1 of the present invention.
- the block diagram which shows the structure of the decoding apparatus which concerns on Embodiment 1 of this invention.
- Block diagram showing a configuration of an encoding apparatus according to Embodiment 2 of the present invention.
- the audio signal and the music signal are collectively referred to as an acoustic signal. That is, the acoustic signal represents any signal of substantially only an audio signal, substantially only a music signal, or a signal in which an audio signal and a music signal are mixed.
- the encoding device and the decoding device according to the present invention have a hierarchy for performing at least two encodings.
- CELP coding is used as a coding suitable for a speech signal
- transform coding is used as a coding suitable for a music signal.
- the coding device and the decoding device are CELP codes.
- An encoding method in which encoding and transform encoding are combined in a hierarchical structure is used.
- FIG. 1 is a block diagram showing the main configuration of coding apparatus 100 according to Embodiment 1 of the present invention.
- the encoding apparatus 100 encodes input signals such as speech and music using an encoding scheme in which CELP encoding and transform encoding are combined in a hierarchical structure, and outputs encoded data.
- an encoding apparatus 100 includes an MDCT (Modified Discrete Cosine Transform) unit 101, a CELP encoding unit 102, an MDCT unit 103, a CELP component suppressing unit 104, and a CELP residual signal spectrum calculation.
- MDCT Modified Discrete Cosine Transform
- Unit 105 pulse position estimation unit 106, estimated pulse attenuation unit 107, estimated distortion evaluation unit 108, main selection candidate limiting unit 109, transform coding unit 110, addition unit 111, distortion evaluation unit 112, and multiplexing unit 113. .
- Each unit performs the following operations.
- the MDCT unit 101 performs an MDCT process on an input signal to generate an input signal spectrum. MDCT section 101 then outputs the generated input signal spectrum to CELP residual signal spectrum calculation section 105, distortion evaluation section 112, and estimated distortion evaluation section 108.
- the CELP encoder 102 encodes the input signal by the CELP encoding method to generate CELP encoded data.
- the CELP encoding unit 102 decodes the generated CELP encoded data (local decoding) to generate a CELP decoded signal.
- CELP encoding section 102 then outputs the CELP encoded data to multiplexing section 113 and outputs the CELP decoded signal to MDCT section 103.
- the MDCT unit 103 performs MDCT processing on the CELP decoded signal input from the CELP encoding unit 102 to generate a CELP decoded signal spectrum. MDCT section 103 then outputs the generated CELP decoded signal spectrum to CELP component suppression section 104.
- the CELP encoding unit 102 and the MDCT unit 103 output the spectrum of the first decoded signal generated by decoding the first code obtained by the first encoding for the input signal. Operates as an encoding unit.
- the CELP component suppression unit 104 includes a CELP suppression coefficient codebook in which a CELP suppression coefficient indicating the degree (strength) of CELP suppression is stored.
- the CELP suppression coefficient codebook stores four types of CELP suppression coefficients ranging from 1.0, which means no suppression, to 0.5, which halves the amplitude of the CELP component. That is, the value of the CELP suppression coefficient becomes smaller as the degree (strength) of CELP suppression is larger.
- the CELP suppression coefficients are stored in ascending or descending order of the degree (strength) of CELP suppression. Each CELP suppression coefficient is assigned an index (CELP suppression coefficient index) in ascending or descending order with respect to the degree (strength) of CELP suppression.
- CELP component suppression section 104 selects a CELP suppression coefficient from the CELP suppression coefficient codebook in accordance with the CELP suppression coefficient index input from estimated distortion evaluation section 108, main selection candidate limiting section 109 or distortion evaluation section 112. .
- CELP component suppression section 104 multiplies the selected CELP suppression coefficient for each frequency component of the CELP decoded signal spectrum input from MDCT section 103 to calculate a CELP component suppression spectrum.
- CELP component suppression section 104 then outputs the CELP component suppression spectrum to CELP residual signal spectrum calculation section 105 and addition section 111.
- CELP residual signal spectrum calculation section 105 calculates a CELP residual signal spectrum that is a difference between the input signal spectrum input from MDCT section 101 and the CELP component suppression spectrum input from CELP component suppression section 104. Specifically, the CELP residual signal spectrum calculation unit 105 obtains a CELP residual signal spectrum by subtracting the CELP component suppression spectrum from the input signal spectrum. CELP residual signal spectrum calculation section 105 then outputs the CELP residual signal spectrum to transform coding section 110, pulse position estimation section 106, and estimated pulse attenuation section 107.
- the pulse position estimation unit 106 performs transform coding using the CELP residual signal spectrum (a signal to be subjected to transform coding, which may be referred to as a target signal hereinafter) input from the CELP residual signal spectrum calculation unit 105.
- the pulse position encoded by the unit 110 (for example, a frequency having a large amplitude of the CELP residual signal spectrum) is estimated. Then, the pulse position estimation unit 106 outputs the estimated pulse position (estimated pulse position) to the estimated pulse attenuation unit 107.
- the estimated pulse attenuation unit 107 attenuates the amplitude at the estimated pulse position input from the pulse position estimation unit 106 out of the CELP residual signal spectrum input from the CELP residual signal spectrum calculation unit 105. Then, estimated pulse attenuation section 107 outputs the attenuated spectrum to estimated distortion evaluation section 108 as a transform encoded estimated residual spectrum.
- the estimated distortion evaluation unit 108 uses the input signal spectrum input from the MDCT unit 101 and the converted encoded estimation residual spectrum input from the estimated pulse attenuation unit 107 to encode coding distortion (distortion energy) by transform coding. The estimated strain energy that is the estimated value of) is calculated. Then, the estimated distortion evaluation unit 108 outputs the estimated distortion energy to the main selection candidate limiting unit 109.
- the estimated distortion evaluation unit 108 supplies the CELP suppression coefficient index to be evaluated to the CELP component suppression unit 104 in order to obtain a transform coding estimation residual spectrum corresponding to the CELP suppression coefficient to be evaluated in a preliminary selection search described later. Output.
- the estimated distortion evaluation unit 108 is a transform-coded estimated residual spectrum that is a result of sequential processing by the CELP component suppressing unit 104, the CELP residual signal spectrum calculating unit 105, the pulse position estimating unit 106, and the estimated pulse attenuating unit 107.
- the main selection candidate limiting unit 109 searches CELP suppression coefficients stored in the CELP suppression codebook in a CELP search that will be described later. Limit candidates of suppression coefficients (CELP suppression coefficients used for transform coding). Then, main selection candidate limiting section 109 outputs a CELP suppression coefficient index indicating a limited candidate CELP suppression coefficient to CELP component suppression section 104.
- CELP suppression coefficient groups collectively including the CELP suppression coefficient candidates limited here, and CELP suppression coefficient indexes corresponding to the CELP suppression coefficient candidates corresponding to the limited CELP suppression coefficient candidates, Sometimes called.
- the pulse position estimating unit 106, the estimated pulse attenuating unit 107, the estimated distortion evaluating unit 108, and the main selection candidate limiting unit 109 use the input signal spectrum and the CELP residual signal spectrum to calculate a predetermined number. It operates as a preselection unit that preselects the CELP suppression coefficient and instructs the CELP component suppression unit 104 about the preselected CELP suppression coefficient.
- CELP component suppression section 104 CELP residual signal spectrum calculation section 105, pulse position estimation section 106, estimated pulse attenuation section 107, estimated distortion evaluation section 108, and main selection candidate limiting section. 109 constitutes a closed loop.
- Each component constituting the closed loop includes a CELP corresponding to a CELP suppression coefficient index indicated by the estimated distortion evaluation unit 108 among the CELP suppression coefficients stored in the CELP suppression codebook included in the CELP component suppression unit 104.
- a candidate (CELP suppression coefficient index) to be searched in a main selection search described later is searched.
- this search process is referred to as “preliminary selection search”.
- the transform coding unit 110 codes the CELP residual signal spectrum (target signal) input from the CELP residual signal spectrum calculation unit 105 by transform coding to generate transform coded data.
- transform coding section 110 decodes the generated transform coded data (local decoding) to generate a transform coded decoded signal spectrum.
- transform coding section 110 performs coding so as to reduce distortion between the CELP residual signal spectrum and the transform coded decoded signal spectrum.
- the transform coding unit 110 performs coding so as to reduce the distortion by raising a pulse at a frequency where the amplitude (energy) of the CELP residual signal spectrum is large.
- transform coding section 110 outputs the transform coded data obtained by the coding to distortion evaluation section 112 and outputs the transform coded decoded signal spectrum to adding section 111.
- Adder 111 adds the CELP component suppression spectrum input from CELP component suppressor 104 and the transform encoded decoded signal spectrum input from transform encoder 110 to calculate a decoded signal spectrum, and obtains a decoded signal spectrum. Is output to the distortion evaluation unit 112.
- the distortion evaluation unit 112 scans a part of the CLEP suppression coefficients stored in the CELP suppression coefficient codebook included in the CELP component suppression unit 104 (the CELP suppression coefficient index limited by the selection candidate limiting unit 109). Then, a CELP suppression coefficient index that minimizes distortion between the input signal spectrum input from MDCT section 101 and the decoded signal spectrum input from addition section 111 (that is, encoding distortion due to transform coding) is searched. That is, the distortion evaluation unit 112 controls the CELP component suppression unit 104 (outputs the CELP suppression coefficient index) so as to perform CELP suppression using the CELP suppression coefficients corresponding to the partial indexes.
- the distortion evaluation unit 112 outputs the CELP suppression coefficient index that minimizes the calculated distortion to the multiplexing unit 113 as the CELP suppression coefficient optimum index, and includes the transform encoded data input from the transform encoding unit 110.
- the transform encoded data (transform encoded data at the time of minimum distortion) corresponding to the CELP suppression coefficient optimum index is output to multiplexing section 113.
- the transform coding unit 110, the addition unit 111, and the distortion evaluation unit 112 use the CELP suppression coefficient instructed by the above-described preliminary selection unit as the CELP suppression spectrum generated by the CELP component suppression unit 104.
- Transform coding (second coding) is performed using the CELP residual signal spectrum calculated by inputting to CELP residual signal spectrum calculation section 105, and transform coded data (second coding) obtained by transform coding is used.
- CELP component suppression section 104 CELP residual signal spectrum calculation section 105, transform coding section 110, addition section 111, and distortion evaluation section 112 constitute a closed loop.
- Each component constituting the closed loop uses a CELP suppression coefficient index indicated by the selection candidate limiting unit 109 among a plurality of CELP suppression coefficients stored in the CELP suppression codebook included in the CELP component suppression unit 104.
- a decoded signal spectrum is generated using a corresponding CELP suppression coefficient, and a candidate (CELP suppression coefficient index) that minimizes distortion (encoding distortion due to transform coding) between the input signal spectrum and the decoded signal spectrum is searched.
- this search process is referred to as “main selection search”.
- the multiplexing unit 113 multiplexes the CELP encoded data input from the CELP encoding unit 102, the converted encoded data (transformed encoded data at the time of minimum distortion) and the CELP suppression coefficient optimum index input from the distortion evaluation unit 112.
- the multiplexed result is transmitted to the decoding device as encoded data.
- the decoding device 200 decodes the encoded data transmitted from the encoding device 100 and outputs a decoded signal.
- FIG. 2 is a block diagram showing the main configuration of the decoding apparatus 200.
- the decoding apparatus 200 includes a separation unit 201, a transform coding / decoding unit 202, a CELP decoding unit 203, an MDCT unit 204, a CELP component suppressing unit 205, an adding unit 206, and an IMDCT (Inverse Modified Ccre Transform) unit. 207. Each unit performs the following operations.
- demultiplexing section 201 transmits encoded data including CELP encoded data, transform encoded data, and CELP suppression coefficient optimum index from encoding apparatus 100 (FIG. 1) to the transmission path. (Not shown). Separating section 201 separates the encoded data into CELP encoded data, transform encoded data, and CELP suppression coefficient optimum index. Separation section 201 then outputs the CELP encoded data to CELP decoding section 203, outputs the transform encoded data to transform encoding decoding section 202, and outputs the CELP suppression coefficient optimum index to CELP component suppression section 205.
- the transform coding / decoding unit 202 decodes the transform coding data input from the separation unit 201 to generate a transform coding / decoding signal spectrum, and outputs the transform coding / decoding signal spectrum to the adding unit 206.
- CELP decoding section 203 decodes the CELP encoded data input from demultiplexing section 201 and outputs the CELP decoded signal to MDCT section 204.
- MDCT section 204 performs MDCT processing on the CELP decoded signal input from CELP decoding section 203 to generate a CELP decoded signal spectrum. MDCT section 204 then outputs the generated CELP decoded signal spectrum to CELP component suppressing section 205.
- the CELP component suppression unit 205 includes a CELP suppression coefficient code book similar to the CELP suppression coefficient code book included in the CELP component suppression unit 104.
- the CELP suppression coefficient codebook included in the CELP component suppression unit 205 may be basically the same CELP suppression coefficient codebook as the CELP suppression coefficient codebook included in the CELP component suppression unit 104. Etc. are not necessarily the same.
- the CELP component suppression unit 205 multiplies the CELP suppression coefficient corresponding to the CELP suppression coefficient optimal index input from the separation unit 201 for each frequency component of the CELP decoded signal spectrum input from the MDCT unit 204, thereby obtaining a CELP decoded signal.
- a CELP component suppression spectrum in which the spectrum (CELP component) is suppressed is calculated.
- CELP component suppression section 205 then outputs the calculated CELP component suppression spectrum to addition section 206.
- the adding unit 206 receives the CELP component suppression spectrum input from the CELP component suppressing unit 205, and the transform encoded decoded signal spectrum input from the transform encoding / decoding unit 202. Are added to calculate the decoded signal spectrum. Then, addition section 206 outputs the calculated decoded signal spectrum to IMDCT section 207.
- the IMDCT unit 207 performs IMDCT processing on the decoded signal spectrum input from the adding unit 206 and outputs a decoded signal.
- coding is performed so that a pulse is generated at a frequency where the amplitude of an input signal (here, CELP residual signal spectrum) is large.
- the number of pulses to be set and the error between the pulse amplitude and the input signal differ depending on the set bit rate or the frequency characteristic of the signal. Therefore, the coding distortion in the transform coding cannot be accurately obtained unless the coding is actually performed.
- the pulse position encoded in the transform encoding can be estimated by using a statistical method.
- the CELP residual signal spectrum is normally distributed.
- transform coding a pulse is generated at a frequency having a larger amplitude, and pulse information is encoded.
- the encoding apparatus 100 determines the pulse position encoded by the transform encoding unit 110 on the assumption that the pulse is encoded at the upper 10% frequency having the largest amplitude in the CELP residual signal spectrum.
- a threshold value (amplitude threshold value) is calculated.
- the absolute value average Iavg [j] of the CELP residual signal spectrum is calculated according to the following equation (1).
- Iavg [j] represents the absolute value average of the CELP residual signal spectrum at the CELP suppression coefficient index j
- i represents the frequency sample number
- Cr represents the amplitude of the CELP residual signal spectrum.
- the total number of CELP suppression coefficient indexes is M
- the total number of frequency samples is N.
- the threshold value Ithr is calculated according to the following equation (3), for example, using the absolute value average Iavg [j] calculated by the equation (1) and the standard deviation ⁇ [j] calculated by the equation (2). .
- ⁇ is a constant that controls the value of the threshold value Ithr. For example, when the threshold is set so that the top 10% frequency having the largest amplitude is selected from the CELP residual signal spectrum, the value of ⁇ is set to about 1.6. For example, when the threshold value is set so that the upper 5% frequency having the largest amplitude is selected from the CELP residual signal spectrum, the value of ⁇ is set to about 2.0.
- the set value of ⁇ can be obtained according to a normal distribution table.
- the pulse position estimation unit 106 estimates the pulse position (estimated pulse position) encoded by the transform encoding unit 110 by using the threshold value Ithr shown in Expression (3). Specifically, the pulse position estimation unit 106 estimates the pulse position encoded by the transform encoding unit 110 in the CELP suppression coefficient index j according to the following equation (4).
- the pulse position estimation unit 106 efficiently calculates the position of the pulse obtained as a result of encoding by the transform encoding unit 110 based on the distribution characteristics of the CELP residual signal spectrum (target signal) with a low calculation amount. Is estimated. Specifically, the pulse position estimation unit 106 calculates the threshold (Ithr) calculated based on the amplitude of the CELP residual signal spectrum (target signal) or the absolute value statistic, and the amplitude of the CELP residual signal spectrum. In comparison, the pulse (estimated pulse position) encoded by the transform encoding unit 110 is estimated.
- the pulse position estimation unit 106 only needs to determine the threshold value of the amplitude, and the pulse position estimated to be encoded by the transform encoding unit 110 is smaller than the processing amount of the transform encoding unit 110. It becomes possible to specify by the processing amount.
- the statistical amount used by the pulse position estimation unit 106 may include at least the standard deviation ⁇ .
- the estimated pulse attenuating unit 107 calculates a transform coding estimated residual spectrum Cra according to the following equation (5).
- ⁇ indicates how much the amplitude of the CELP residual signal spectrum remains as an error at the estimated pulse position (that is, indicates the degree of attenuation), and is a constant not less than 0 and less than 1 (hereinafter referred to as an estimated residual coefficient).
- ⁇ is set to 0.0 when the error at the estimated pulse position is regarded as zero, and ⁇ is set to 0.1 when an error of 10% is expected at the estimated pulse position. That is, the estimated pulse attenuating unit 107 multiplies the amplitude of the CELP residual signal spectrum by an estimated residual coefficient (a value not less than 0 and less than 1), thereby obtaining a transform-coded estimated residual spectrum (that is, a decoded signal spectrum).
- Estimated value is calculated. In this way, estimating the error due to transform coding by multiplying the CELP residual signal spectrum by a constant greater than or equal to 0 and less than 1 calculates the error so that a predetermined SNR (Signal Noise Ratio) is obtained by transform coding. Will be.
- SNR Signal Noise Ratio
- the estimated distortion evaluation unit 108 uses the input signal spectrum and the transform coding estimation residual spectrum according to the following equation (7), and estimates the strain energy that is an estimated value of the coding distortion (distortion energy) by transform coding. Ee is calculated (hereinafter also referred to as estimated distortion evaluation).
- S represents the input signal spectrum.
- the estimated distortion evaluation unit 108 calculates the estimated distortion energy for the transform-coded estimated residual spectrum in which the spectrum amplitude at the estimated pulse position is attenuated to a ratio of 0 or more and less than 1. Thereby, the estimated distortion evaluation unit 108 estimates the estimated distortion energy at the pulse position estimated to be encoded by the transform encoding unit 110 with a processing amount smaller than the processing amount of the transform encoding unit 110. It becomes possible.
- the estimated distortion evaluation unit 108 operates to scan all the CELP suppression coefficient indexes. That is, estimated distortion evaluation section 108 outputs all CELP suppression coefficient indexes to CELP component suppression section 104.
- the main selection candidate limiting unit 109 limits candidates of CELP suppression coefficients (CELP suppression coefficients used for transform coding) that are search targets of the main selection search based on the distribution of estimated distortion energy. That is, the main selection candidate limiting unit 109 preselects a predetermined number of CELP suppression coefficients among a plurality of CELP suppression coefficients stored in the CELP suppression coefficient codebook based on the estimated distortion energy.
- Method 1 a preliminary selection search is performed for the largest and smallest CELP suppression coefficients, and it is determined that the larger estimated distortion energy is less likely to be selected in this selection search, and the CELP suppression coefficient is determined. By excluding from the main selection search, the processing amount of the main selection search is reduced.
- the selection candidate limiting unit 109 compares Ee [1] with Ee [4].
- the selection candidate limiting unit 109 uses the estimated distortion energy when the maximum value is used and the estimated distortion energy when the minimum value is used among the plurality of CELP suppression coefficients stored in the CELP component suppressing unit 104.
- the CELP suppression coefficient having the larger estimated distortion energy is subjected to the main selection search (CELP suppression of the main selection search). Excluded from the coefficient group). That is, by performing the preliminary selection search, one search target candidate in the main selection search is reduced.
- the amount of processing (reduction) for transform coding in the main selection search is larger than the amount of processing in two operations in the preliminary selection search, the entire coding apparatus 100 is used. The amount of processing is reduced.
- Method 1 the preliminary selection search is performed only for the necessary minimum CELP suppression coefficients (here, two CELP suppression coefficients of the maximum value and the minimum value).
- a CELP suppression coefficient having a large estimated distortion energy is excluded from the target of the main selection search.
- the preliminary selection search is performed with all the CELP suppression coefficients, and the CELP suppression coefficients that are highly likely to be selected in the main selection search are limited from the estimated distortion energy, thereby reducing the processing amount of the main selection search.
- the candidate with the lowest estimated distortion energy is always left as a candidate for the main selection search.
- the CELP suppression coefficient of the index (one or both) adjacent to the CELP suppression coefficient index assigned to the remaining candidates is also left as a candidate for the main selection search. This is because when the CELP suppression coefficient index is arranged in ascending or descending order with respect to the degree of suppression, there is a possibility that these CELP suppression coefficient candidates are selected as the candidate having the smallest distortion energy during the main selection search. This is because is higher than CELP suppression coefficient candidates other than the smallest candidate and candidates adjacent thereto.
- the selection candidate limiting unit 109 searches for the minimum estimated distortion energy among the estimated distortion energies Ee [1] to Ee [4], and stores the CELP suppression coefficient index corresponding to the minimum estimated distortion energy. .
- the main selection candidate limiting unit 109 calculates the estimated distortion energy corresponding to the CELP suppression coefficient indexes before and after (both ends) of the stored CELP suppression coefficient index (that is, the CELP suppression coefficient index corresponding to the minimum estimated distortion energy). In comparison, the CELP suppression coefficient index with the smaller estimated distortion energy is stored.
- the selection candidate limiting unit 109 stores the CELP suppression coefficient index stored in the process of (1) (that is, the CELP suppression coefficient index corresponding to the minimum estimated distortion energy) and the process of (2). Two types of CELP suppression coefficients of the CELP suppression coefficient index are limited as CELP suppression coefficient groups in the main selection search.
- the selection candidate limiting unit 109 includes a CELP suppression coefficient (first CELP suppression coefficient) having the smallest estimated distortion energy among the plurality of CELP suppression coefficients stored in the CELP component suppressing unit 104, and the estimated distortion.
- the CELP suppression coefficient (second CELP suppression coefficient) with a small estimated distortion energy is specified as the target of this selective search. That is, this selection candidate limiting unit 109 is assigned to the CELP suppression coefficient (first CELP suppression coefficient) with the smallest estimated distortion energy and the CELP suppression coefficient with the smallest estimated distortion energy among the plurality of CELP suppression coefficients.
- the CELP suppression coefficient (second CELP suppression coefficient) having the smaller estimated distortion energy is used as a predetermined number of CELP suppression coefficients. Pre-select.
- the processing amount (reduced amount) of the transform encoding in the main selection search is larger than the processing amount in the four operations in the preliminary selection search, the entire encoding apparatus 100 is used.
- the amount of processing is reduced. That is, as in Method 1, when the amount of processing for transform coding in the main selection search is larger than the amount of processing in two operations in the preliminary selection search, the entire coding apparatus 100 is used. The amount of processing is reduced.
- Method 2 although the preliminary selection search is performed for all the CELP suppression coefficients, the CELP suppression coefficient group that is the target of this selection search is more narrowly limited as compared with Method 1. Thereby, the processing amount in the main selection search can be reduced as compared with the method 1.
- the CELP suppression coefficient with the smallest estimated distortion energy and the CELP suppression coefficient with the smaller estimated distortion energy among the CELP suppression coefficients corresponding to the CELP suppression coefficient indexes at both ends of the CELP suppression coefficient are selected. It becomes the object of search. That is, in the preliminary selection search, a CELP suppression coefficient that is highly likely to be determined as an optimum CELP suppression coefficient (a CELP suppression coefficient with the minimum distortion energy) in the main selection search is searched. Therefore, in the method 2, it is possible to reduce the processing amount in the encoding device 100 while suppressing deterioration in encoding quality as compared with a case where all CELP suppression coefficients are searched in the main selection search.
- the main selection candidate limiting unit 109 includes a CELP suppression coefficient (for example, CELP suppression coefficient index j) having the smallest estimated distortion energy among a plurality of CELP suppression coefficients stored in the CELP component suppression unit 104.
- CELP suppression coefficient groups for example, CELP suppression coefficient indexes [j ⁇ 1] and [j + 1]
- this selection candidate limiting unit 109 corresponds to the CELP suppression coefficient with the smallest estimated distortion energy among the plurality of CELP suppression coefficients and the indexes before and after the index assigned to the CELP suppression coefficient with the smallest estimated distortion energy.
- Two CELP suppression coefficients may be preselected as a predetermined number of CELP suppression coefficients.
- the CELP suppression coefficient group limiting methods 1 and 2 that are the targets of the main selection search in the main selection candidate limiting unit 109 have been described above.
- the method 1 compared with the method 2, by widening the target of the main selection search, the performance degradation of the main selection search due to limiting the target of the main selection search can be further reduced.
- the processing amount in the main selection search can be further reduced.
- the estimated distortion evaluation unit 108 outputs the CELP suppression coefficient index to be searched in the preliminary selection search to the CELP component suppression unit 104. Accordingly, the transform distortion estimated residual spectrum is input to the estimated distortion evaluation unit 108 for each CELP suppression coefficient index, and the estimated distortion evaluation unit 108 calculates estimated distortion energy corresponding to each CELP suppression coefficient index. Based on the estimated distortion energy, the main selection candidate limiting unit 109 limits the CELP suppression coefficient index to be searched for in the main selection search in which distortion evaluation is actually performed using transform coding. That is, encoding apparatus 100 specifies a CELP suppression coefficient that is expected (estimated) that the distortion energy of transform encoding in the main selection search is smaller in the preliminary selection search.
- the encoding device 100 in the main selection search, only the CELP suppression coefficient index group instructed from the main selection candidate limiting unit 109 is used to perform the transform coding by the transform coding unit 110, and the distortion evaluation unit 112. The search for the CELP suppression coefficient that minimizes the distortion energy is performed. Then, the CELP suppression coefficient index corresponding to the CELP suppression coefficient that minimizes the distortion energy is output to multiplexing section 113, and the CELP suppression coefficient index is sent to decoding apparatus 200 as part of the encoded data of encoding apparatus 100. Sent.
- encoding apparatus 100 statistically estimates the pulse positions encoded by transform encoding, calculates the estimated distortion energy estimated at the estimated pulse positions, and calculates the estimated distortion energy.
- a smaller CELP suppression coefficient is limited to a CELP suppression coefficient group to be subjected to the main selection search (preliminary selection search). Then, encoding apparatus 100 performs transform coding for each CELP suppression coefficient whose candidates are limited in the preliminary selection search, and determines a CELP suppression coefficient that minimizes the energy (distortion energy) of the residual signal (this book). Selective search).
- the encoding apparatus 100 reduces the number of times that transform encoding is performed by using only the CELP suppression coefficient that is expected to have low distortion energy as the target of the main selection search in the preliminary selection search.
- the pulse position estimation unit 106 estimates the pulse position
- the estimated pulse attenuation unit 107 calculates the transform coding estimation residual spectrum
- the estimated distortion evaluation unit 108 calculates the transform coding estimation residual spectrum
- the distortion energy can be calculated with a smaller processing amount than the processing in the transform coding unit 110. Therefore, by limiting the CELP suppression coefficient group that is the target of the main selection search in the preliminary selection search in advance, compared with the case where transform coding is sequentially performed on all the CELP suppression coefficients, The amount of processing can be reduced.
- the CELP suppression coefficient that is estimated to have a low estimated distortion energy that is, the CELP suppression coefficient that is highly likely to be evaluated as the minimum distortion energy in the main selection search, is a candidate for the main selection search.
- Limit As a result, it is possible to suppress deterioration in encoding quality due to limiting the CELP suppression coefficient group to be subjected to the main selection search.
- transform coding is performed on all CELP suppression coefficient candidates in a coding scheme that combines coding suitable for audio signals and coding suitable for music signals in a hierarchical structure.
- transform coding is performed on all CELP suppression coefficient candidates in a coding scheme that combines coding suitable for audio signals and coding suitable for music signals in a hierarchical structure.
- values used for the main selection search are not calculated again during the main selection search.
- the value calculated during the preliminary selection search may be used.
- the encoding apparatus can further reduce the processing amount during the main selection search.
- FIG. 3 is a block diagram showing the main configuration of coding apparatus 300 according to Embodiment 2 of the present invention.
- the same components as those in the first embodiment (FIG. 1) are denoted by the same reference numerals, and the description thereof is omitted.
- the encoding apparatus 300 shown in FIG. 3 is different from the encoding apparatus 100 shown in FIG. 1 in that a target signal feature extraction unit 301 is added. Further, the pulse position estimation unit 302 and the estimated pulse attenuation unit 303 are different from the first embodiment in that feature information output from the target signal feature extraction unit 301 is added as an input signal.
- the target signal feature extraction unit 301 uses the CELP residual signal spectrum (target signal) input from the CELP residual signal spectrum calculation unit 105 to extract the features of the target signal. To do.
- FPC Fast Pulse Coding
- the number of pulses that can be encoded increases when the variation in the amplitude of the spectrum to be encoded (here, the CELP residual signal spectrum) is small, and the number of pulses that can be encoded when the variation in the amplitude of the spectrum to be encoded is large.
- the number is smaller.
- the target signal with energy concentrated in a certain band has a smaller number of pulses encoded with FPC, and the target signal with energy distributed over the entire band has a larger number of pulses encoded with FPC. .
- the encoding apparatus 300 can extract the features of the target signal (CELP residual signal spectrum) and predict the number of pulses encoded by FPC based on the extracted features. That is, the pulse position of the target signal can be accurately estimated in the preliminary selection search.
- CELP residual signal spectrum CELP residual signal spectrum
- the target signal feature extraction unit 301 extracts the ratio between the average value of the amplitude of the target signal and the maximum value of the amplitude as the feature of the target signal. Specifically, the target signal feature extraction unit 301 calculates the average value Iavg of the amplitude of the target signal according to the equation (1). Further, the target signal feature extraction unit 301 sets the maximum value of the absolute value amplitude of the target signal as tmax.
- the larger the value of tmax / Iavg the higher the possibility that energy is concentrated in a specific band. That is, the larger the value of tmax / Iavg, the higher the possibility that the variation in spectrum will be greater.
- the target signal feature extraction unit 301 determines that the number of target signal pulses to be estimated in the preliminary selection search should be reduced as the value of tmax / Iavg increases.
- the smaller the value of tmax / Iavg the higher the possibility that the target signal feature extraction unit 301 will disperse the energy over the entire band, so the number of target signal pulses to be estimated in the preliminary selection search should be increased.
- the target signal feature extraction unit 301 generates, as feature information K, information related to the number of pulses of the target signal predicted based on the feature of the target signal according to the following equation (8) according to the value of tmax / Iavg. .
- ⁇ h is a threshold value set in advance to determine whether or not to reduce the number of pulses estimated in the preliminary selection search (pulse position estimation unit 302), and ⁇ l is estimated in the preliminary selection search. This is a threshold value set in advance to determine whether or not to increase the number of pulses.
- the pulse position estimation unit 302 uses the CELP residual signal spectrum (target signal) input from the CELP residual signal spectrum calculation unit 105 and the feature information K input from the target signal feature extraction unit 301 to convert code
- the pulse position (estimated pulse position) encoded by the conversion unit 110 is estimated.
- the pulse position estimation unit 302 uses a threshold value Ithr [j] shown in the following equation (9) instead of the equation (3) used in the first embodiment (pulse position estimation unit 106).
- Equation (9) the value of ⁇ is adaptively corrected for each frame in accordance with the value of the feature information K (0.9, 1.0, 1.1), and is selected by the pulse position estimation unit 302.
- the number of pulses to be controlled is adaptively controlled.
- the pulse position estimation unit 302 corrects the first embodiment (Equation (3)) using the feature information K input from the target signal feature extraction unit 301 as shown in Equation (9).
- pulse position estimating section 302 sets the number of pulses to be estimated to be small, and in equation (8), tmax / Iavg ⁇ In the case of ⁇ l (when the variation in the spectrum is small), a large number of pulses to be estimated is set. That is, the pulse position estimation unit 302 sets the number of pulses to be estimated according to the characteristics of the CELP residual signal spectrum, and estimates the position of the set number of pulses. For example, the pulse position estimation unit 302 sets the number of pulses so as to decrease as the amplitude variation in each band of the CELP residual signal spectrum increases.
- the estimated pulse attenuation unit 303 uses the feature information input from the target signal feature extraction unit 301 to input from the pulse position estimation unit 302 out of the CELP residual signal spectrum input from the CELP residual signal spectrum calculation unit 105.
- the spectrum of the estimated pulse position to be attenuated is attenuated.
- estimated pulse attenuating section 303 calculates transform encoded estimated residual spectrum Cra according to the following expression (10) instead of expression (5) used in Embodiment 1 (estimated pulse attenuating section 107). calculate.
- Equation (10) the value of the estimated residual count ⁇ is adaptively corrected for each frame in accordance with the value of the feature information K (0.9, 1.0, 1.1), and the estimated pulse attenuation unit.
- the degree of attenuation (estimated error amount) at 303 is adaptively controlled.
- the estimated pulse attenuation unit 303 corrects the first embodiment (Equation (5)) using the feature information K input from the target signal feature extraction unit 301 as shown in Equation (10).
- feature information K 0.9. Therefore, “ ⁇ ” becomes “ ⁇ / 0.9”, and control is performed so that the error in the estimated pulse position becomes larger.
- estimated pulse attenuation section 303 increases the degree of spectrum attenuation, and tmax / Iavg in Equation (8).
- the attenuation degree of the spectrum is decreased. That is, the estimated pulse attenuating unit 303 sets the attenuation degree of the CELP residual signal spectrum so as to increase as the variation in amplitude in each band of the CELP residual signal spectrum increases.
- the SNR calculated based on the estimated value of the transform coding error changes adaptively according to the variation in the spectrum amplitude.
- the SNR at that time is expressed by the following equation (11).
- encoding apparatus 300 is encoded by transform encoding section 110 in accordance with the characteristics of target signal (CELP residual signal spectrum) (here, variation in spectrum amplitude (tmax / Iavg)).
- the number of pulses and the pulse error (attenuation degree in the estimated pulse attenuation unit 303) are adaptively controlled.
- encoding apparatus 300 can estimate distortion energy at a pulse position estimated to be encoded by transform encoding section 110 with higher accuracy than in the first embodiment.
- the encoding apparatus 300 estimates the estimated pulse position, calculates the transform encoded estimation residual spectrum in the estimated pulse attenuation unit 107, and calculates the distortion energy in the estimated distortion evaluation unit 108. The calculation can be performed with a smaller processing amount than the processing in the transform encoding unit 110.
- the coding is compared with the first embodiment. Compared with the method of sequentially performing transform coding on all CELP suppression coefficient candidates while further suppressing quality degradation, the processing amount in the coding apparatus can be reduced.
- the present invention is not limited to the case where the variation in the spectrum amplitude is used as the feature of the target signal.
- the tone characteristic of the target signal may be used as the feature of the target signal.
- the tone property is an index indicating the size of the peak of the spectrum or the size of the dynamic range.
- the ratio of the geometric mean to the arithmetic mean of the target signal or its absolute value is measured, and when this ratio is close to 0, it can be determined that the tone property is high.
- the target signal feature extraction unit 301 measures the tone property of the target signal.
- the pulse position estimation unit 302 sets the number of pulses so as to decrease as the tone property increases. For example, the pulse position estimator 302 sets a large threshold value when the tone characteristic of the target signal is high, and controls the number of estimated pulses to be small, and decreases the threshold value when the tone characteristic of the target signal is low. Then, the control may be performed so that the estimated number of pulses is increased. Further, the estimated pulse attenuating unit 303 sets the degree of attenuation of the CELP residual signal spectrum so as to increase as the tone property increases.
- the estimated pulse attenuating unit 303 controls to reduce the residual signal (error) by decreasing the estimated residual coefficient (increasing the degree of attenuation) when the tone characteristic of the target signal is high, When the tone characteristic of the target signal is low, the estimated residual coefficient is increased (the degree of attenuation is decreased), and control is performed so that the residual signal (error) increases. As described above, even when the tone characteristic is used as the feature of the target signal, the same effect as in the present embodiment can be obtained.
- the noise characteristics of the target signal may be used as a feature of the target signal.
- the noise characteristic is an index indicating a small energy bias of the target signal.
- the energy for each band is measured by dividing the target signal into several bands, and when the energy dispersion for each band is small, it can be determined that the noise characteristic is high.
- the target signal feature extraction unit 301 measures the noise characteristics of the target signal.
- the pulse position estimation unit 302 sets the number of pulses so as to increase as the noise property increases. For example, the pulse position estimation unit 302 performs control so that the number of estimated pulses is increased when the target signal has high noise characteristics, and increases the threshold value when the target signal has low noise characteristics.
- control may be performed so that the estimated number of pulses is reduced.
- the estimated pulse attenuating unit 303 sets the attenuation degree of the CELP residual signal spectrum so as to decrease as the noise characteristic increases. That is, the estimated pulse attenuating unit 303 performs control so that the residual signal (error) is increased by increasing the estimated residual coefficient (decreasing the attenuation degree) when the noise characteristic of the target signal is high, When the noise characteristic of the target signal is low, the estimated residual coefficient may be reduced (increase the degree of attenuation) to control the residual signal (error) to be small. As described above, even when the noise characteristic is used as the feature of the target signal, the same effect as in the present embodiment can be obtained.
- the pulse position estimation unit assumes that the input signal (CELP residual signal spectrum) to the transform coding unit is a normal distribution, and a threshold value for selecting an upper frequency with a large amplitude.
- (Ithr) is set has been described.
- the pulse position estimation unit sets a threshold (Ithr) according to the distribution model. May be.
- the pulse position estimation unit may estimate the number of pulses exceeding the upper limit value of the number of pulses encoded by the transform encoding unit.
- the pulse position estimation unit may control the estimated number of pulses using the upper limit value.
- the pulse position estimation unit may exclude pulses having a smaller amplitude, or may exclude pulses on a higher frequency side.
- the pulse position estimation unit may determine the pulse to be excluded by combining other conditions that can be calculated from the characteristics of the signal in addition to the above-described conditions of the amplitude and frequency band.
- the CELP suppression coefficients stored in the CELP suppression coefficient codebook are stored in ascending or descending order of the degree of CELP suppression.
- the suppression coefficient candidates when using a method that does not depend on the stored order, it is not always necessary to use ascending order or descending order.
- CELP coding has been described as an example of coding suitable for a speech signal.
- the present invention is based on ADPCM (Adaptive Differential Pulse Code Modulation), APC (Adaptive Prediction Coding), ATC ( It can also be realized using Adaptive Transform Coding), TCX Transform Coded Excitation, etc., and the same effect can be obtained.
- ADPCM Adaptive Differential Pulse Code Modulation
- APC Adaptive Prediction Coding
- ATC It can also be realized using Adaptive Transform Coding
- TCX Transform Coded Excitation etc.
- transform coding As an example of coding suitable for a music signal.
- a residual signal between a decoded signal and an input signal of a coding method suitable for a voice signal is used as a frequency Any method can be used as long as it allows efficient coding in a region.
- FPC Fractorial PulsedingCoding
- AVQ Algebraic Vector Quantization
- the encoded data output from the encoding devices 100 and 300 is received by the decoding device 200.
- the present invention is not limited to this. That is, the decoding apparatus 200 is output by an encoding apparatus that can generate encoded data having encoded data necessary for decoding, even if the encoded data is not generated in the configuration of the encoding apparatuses 100 and 300. If it is encoded data, it can be decoded.
- each functional block used in the description of each of the above embodiments is typically realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them. Although referred to as LSI here, it may be referred to as IC, system LSI, super LSI, or ultra LSI depending on the degree of integration.
- the method of circuit integration is not limited to LSI, and implementation with a dedicated circuit or a general-purpose processor is also possible.
- An FPGA Field Programmable Gate Array
- a reconfigurable / processor that can reconfigure the connection and setting of circuit cells inside the LSI may be used.
- the present invention can reduce the calculation amount of the entire apparatus while suppressing deterioration in encoding quality, and can be applied to, for example, a packet communication system and a mobile communication system.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
図1は、本発明の実施の形態1に係る符号化装置100の主要な構成を示すブロック図である。符号化装置100は、音声及び音楽等の入力信号を、CELP符号化と変換符号化とを階層構造にして組み合わせた符号化方式を用いて符号化して、符号化データを出力する。図1に示すように、符号化装置100は、MDCT(Modified Discrete Cosine Transform:修正離散コサイン変換)部101、CELP符号化部102、MDCT部103、CELP成分抑圧部104、CELP残差信号スペクトル算出部105、パルス位置推定部106、推定パルス減衰部107、推定歪評価部108、本選択候補限定部109、変換符号化部110、加算部111、歪評価部112及び多重化部113を具備する。各部は以下の動作を行う。
方法1では、CELP抑圧係数の最も大きい係数と最も小さい係数とについて予備選択探索を行い、推定歪エネルギが大きい方は本選択探索で選択される可能性が小さいと判断し、そのCELP抑圧係数を本選択探索から除外することで、本選択探索の処理量を減らす。
方法2では、全てのCELP抑圧係数で予備選択探索を行い、推定歪エネルギから本選択探索でも選択される可能性の高いCELP抑圧係数を限定することで、本選択探索の処理量を減らす。この時最も推定歪エネルギが小さい候補は必ず本選択探索の候補として残すようにする。そして、残された候補に付与されたCELP抑圧係数インデックスに隣接するインデックス(片方または両方)のCELP抑圧係数も本選択探索の候補として残すようにする。これは、CELP抑圧係数インデックスが抑圧の程度に関して昇順または降順に配置されている場合に、これらCELP抑圧係数候補が本選択探索時に歪エネルギが最も小さい候補として選択される可能性が、推定歪エネルギが最小の候補およびそれに隣接する候補以外のCELP抑圧係数候補よりも高いからである。
図3は、本発明の実施の形態2に係る符号化装置300の主要な構成を示すブロック図である。なお、図3において、実施の形態1(図1)と同一の構成要素には同一の符号を付しその説明を省略する。図3に示す符号化装置300では、図1に示す符号化装置100に対してターゲット信号特徴抽出部301が追加される点が異なる。また、パルス位置推定部302及び推定パルス減衰部303には、ターゲット信号特徴抽出部301から出力される特徴情報が入力信号として追加される点が実施の形態1と異なる。
200 復号装置
101,103,204 MDCT部
102 CELP符号化部
104,205 CELP成分抑圧部
105 CELP残差信号スペクトル算出部
106,302 パルス位置推定部
107,303 推定パルス減衰部
108 推定歪評価部
109 本選択候補限定部
110 変換符号化部
111,206 加算部
112 歪評価部
113 多重化部
201 分離部
202 変換符号化復号部
203 CELP復号部
207 IMDCT部
301 ターゲット信号特徴抽出部
Claims (17)
- 入力信号に対する第1の符号化により得られた第1符号を復号して生成された第1復号信号のスペクトルを出力する第1符号化部と、
前記第1復号信号のスペクトルの振幅を、複数の抑圧係数の中から指示された抑圧係数を用いて抑圧して抑圧スペクトルを生成する抑圧部と、
前記入力信号のスペクトルと前記抑圧スペクトルとを用いて残差スペクトルを算出する残差スペクトル算出部と、
前記入力信号のスペクトルと前記残差スペクトルとを用いて、所定の数の抑圧係数を予備選択し、前記予備選択された抑圧係数を前記抑圧部に対して指示する予備選択部と、
前記指示された抑圧係数を前記抑圧部で用いて生成された抑圧スペクトルを前記残差スペクトル算出部に入力して算出された残差スペクトルを用いて第2の符号化を行い、前記第2の符号化により得られた第2符号を復号して生成された第2復号信号のスペクトルと、前記抑圧スペクトルと、前記入力信号のスペクトルと、を用いて、前記指示された抑圧係数の中から一つの抑圧係数を決定する第2符号化部と、
を具備する符号化装置。 - 前記第2符号化部は、
前記残差スペクトルに対して立てられたパルスを前記第2の符号化で符号化し、前記第2の符号化による符号化歪が最小となる前記抑圧係数を探索し、
前記予備選択部は、
前記残差スペクトルを用いて前記パルスの位置を推定する推定手段と、
前記残差スペクトルのうち、推定された前記パルスの位置における振幅を減衰させて推定残差スペクトルを生成する減衰手段と、
前記推定残差スペクトルと前記入力信号のスペクトルとを用いて、前記符号化歪の推定エネルギである推定歪エネルギを算出する算出手段と、
前記推定歪エネルギに基づいて、前記複数の抑圧係数のうち、前記所定の数の抑圧係数を予備選択する候補限定手段と、
を具備する請求項1記載の符号化装置。 - 前記複数の抑圧係数は、抑圧の程度に関して昇順又は降順でインデックスが付与され、
前記候補限定手段は、
最大のインデックス及び最小のインデックスに対応する前記抑圧係数のうち、前記推定歪エネルギが大きい方の抑圧係数を、前記所定の数の抑圧係数から除外する、
請求項2記載の符号化装置。 - 前記複数の抑圧係数は、抑圧の程度に関して昇順又は降順でインデックスが付与され、
前記候補限定手段は、
前記複数の抑圧係数のうちの前記推定歪エネルギが最も小さい抑圧係数と、前記推定歪エネルギが最も小さい抑圧係数に付与されたインデックスの前後のインデックスに対応する2つの抑圧係数と、を前記所定の数の抑圧係数として予備選択する、
請求項2記載の符号化装置。 - 前記複数の抑圧係数は、抑圧の程度に関して昇順又は降順でインデックスが付与され、
前記候補限定手段は、
前記複数の抑圧係数のうちの前記推定歪エネルギが最も小さい第1の抑圧係数と、前記第1の抑圧係数に付与されたインデックスの前後のインデックスに対応する2つの抑圧係数のうち前記推定歪エネルギが小さい方の第2の抑圧係数と、を前記所定の数の抑圧係数として予備選択する、
請求項2記載の符号化装置。 - 前記推定手段は、
前記残差スペクトルの振幅の統計量に基づいて算出される閾値と、前記残差スペクトルの振幅とを比較して、前記パルスの位置を推定する、
請求項2記載の符号化装置。 - 前記統計量は、少なくとも前記振幅の標準偏差を含む、
請求項6記載の符号化装置。 - 前記減衰手段は、
推定された前記パルスの位置におけるスペクトルの振幅に、0以上1未満の値を有する係数を乗算して前記振幅を減衰させる、
請求項2記載の符号化装置。 - 前記推定手段は、
前記残差スペクトルの特徴に応じて、推定する前記パルスの本数を設定し、設定された本数の前記パルスの位置を推定する、
請求項2記載の符号化装置。 - 前記特徴は、前記残差スペクトルの各帯域における振幅のばらつきであり、
前記推定手段は、
前記パルスの本数を、前記ばらつきが大きくなるほど少なくするように設定する、
請求項9記載の符号化装置。 - 前記特徴は、前記残差スペクトルのトーン性であり、
前記推定手段は、
前記パルスの本数を、前記トーン性が高くなるほど少なくするように設定する、
請求項9記載の符号化装置。 - 前記特徴は、前記残差スペクトルの雑音性であり、
前記推定手段は、
前記パルスの本数を、前記雑音性が高くなるほど多くするように設定する、
請求項9記載の符号化装置。 - 前記減衰手段は、
前記残差スペクトルの特徴に応じて、推定された前記パルスの位置におけるスペクトルの振幅を減衰させる、
請求項2記載の符号化装置。 - 前記特徴は、前記残差スペクトルの各帯域における振幅のばらつきであり、
前記減衰手段は、
前記スペクトルの減衰度合を、前記ばらつきが大きくなるほど大きくするように設定する、
請求項13記載の符号化装置。 - 前記特徴は、前記残差スペクトルのトーン性であり、
前記減衰手段は、
前記スペクトルの減衰度合を、前記トーン性が高くなるほど大きくするように設定する、
請求項13記載の符号化装置。 - 前記特徴は、前記残差スペクトルの雑音性であり、
前記減衰手段は、
前記スペクトルの減衰度合を、前記雑音性が高くなるほど小さくするように設定する、
請求項13記載の符号化装置。 - 入力信号に対する第1の符号化により得られた第1符号を復号して生成された第1復号信号のスペクトルを出力する第1符号化ステップと、
前記第1復号信号のスペクトルの振幅を、複数の抑圧係数の中から指示された抑圧係数を用いて抑圧して抑圧スペクトルを生成する抑圧ステップと、
前記入力信号のスペクトルと前記抑圧スペクトルとを用いて残差スペクトルを算出する残差スペクトル算出ステップと、
前記入力信号のスペクトルと前記残差スペクトルとを用いて、前記抑圧ステップで用いる所定の数の抑圧係数を予備選択し、前記予備選択された抑圧係数を前記指示された抑圧係数に設定する予備選択ステップと、
前記指示された抑圧係数を前記抑圧ステップで用いて生成された抑圧スペクトルを用いて前記残差スペクトル算出ステップで算出された残差スペクトルを用いて第2の符号化を行い、前記第2の符号化により得られた第2符号を復号して生成された第2復号信号のスペクトルと、前記抑圧スペクトルと、前記入力信号のスペクトルと、を用いて、前記指示された抑圧係数の中から一つの抑圧係数を決定する第2符号化ステップと、
を有する符号化方法。
Priority Applications (8)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
RU2013110317/08A RU2013110317A (ru) | 2010-09-10 | 2011-09-05 | Кодирующее устройство и способ кодирования |
JP2012532859A JP5679470B2 (ja) | 2010-09-10 | 2011-09-05 | 符号化装置及び符号化方法 |
SG2013016431A SG188413A1 (en) | 2010-09-10 | 2011-09-05 | Encoder apparatus and encoding method |
BR112013005683A BR112013005683A2 (pt) | 2010-09-10 | 2011-09-05 | aparelho codificador e método de codificação |
AU2011300248A AU2011300248B2 (en) | 2010-09-10 | 2011-09-05 | Encoder apparatus and encoding method |
US13/820,760 US9361892B2 (en) | 2010-09-10 | 2011-09-05 | Encoder apparatus and method that perform preliminary signal selection for transform coding before main signal selection for transform coding |
CN201180040472.4A CN103069483B (zh) | 2010-09-10 | 2011-09-05 | 编码装置以及编码方法 |
KR1020137005813A KR20130108281A (ko) | 2010-09-10 | 2011-09-05 | 부호화 장치 및 부호화 방법 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2010-203657 | 2010-09-10 | ||
JP2010203657 | 2010-09-10 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2012032759A1 true WO2012032759A1 (ja) | 2012-03-15 |
Family
ID=45810369
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2011/004960 WO2012032759A1 (ja) | 2010-09-10 | 2011-09-05 | 符号化装置及び符号化方法 |
Country Status (10)
Country | Link |
---|---|
US (1) | US9361892B2 (ja) |
JP (1) | JP5679470B2 (ja) |
KR (1) | KR20130108281A (ja) |
CN (1) | CN103069483B (ja) |
AU (1) | AU2011300248B2 (ja) |
BR (1) | BR112013005683A2 (ja) |
RU (1) | RU2013110317A (ja) |
SG (1) | SG188413A1 (ja) |
TW (1) | TW201218188A (ja) |
WO (1) | WO2012032759A1 (ja) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013051210A1 (ja) * | 2011-10-07 | 2013-04-11 | パナソニック株式会社 | 符号化装置及び符号化方法 |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2581904B1 (en) * | 2010-06-11 | 2015-10-07 | Panasonic Intellectual Property Corporation of America | Audio (de)coding apparatus and method |
US8914515B2 (en) * | 2011-10-28 | 2014-12-16 | International Business Machines Corporation | Cloud optimization using workload analysis |
PL2951820T3 (pl) * | 2013-01-29 | 2017-06-30 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Urządzenie i sposób wyboru jednego spośród pierwszego algorytmu kodowania i drugiego algorytmu kodowania |
EP3503095A1 (en) | 2013-08-28 | 2019-06-26 | Dolby Laboratories Licensing Corp. | Hybrid waveform-coded and parametric-coded speech enhancement |
KR101826237B1 (ko) * | 2014-03-24 | 2018-02-13 | 니폰 덴신 덴와 가부시끼가이샤 | 부호화 방법, 부호화 장치, 프로그램 및 기록 매체 |
CN107851442B (zh) * | 2015-04-13 | 2021-07-20 | 日本电信电话株式会社 | 匹配装置、判定装置、它们的方法、程序及记录介质 |
US10325588B2 (en) | 2017-09-28 | 2019-06-18 | International Business Machines Corporation | Acoustic feature extractor selected according to status flag of frame of acoustic signal |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007043648A1 (ja) * | 2005-10-14 | 2007-04-19 | Matsushita Electric Industrial Co., Ltd. | 変換符号化装置および変換符号化方法 |
WO2008072733A1 (ja) * | 2006-12-15 | 2008-06-19 | Panasonic Corporation | 符号化装置および符号化方法 |
JP2009042739A (ja) * | 2007-03-02 | 2009-02-26 | Panasonic Corp | 符号化装置、復号装置およびそれらの方法 |
JP2009094666A (ja) * | 2007-10-05 | 2009-04-30 | Nippon Telegr & Teleph Corp <Ntt> | 多重ベクトル量子化方法、装置、プログラム及びその記録媒体 |
US20090112607A1 (en) | 2007-10-25 | 2009-04-30 | Motorola, Inc. | Method and apparatus for generating an enhancement layer within an audio coding system |
JP2010203657A (ja) | 2009-03-03 | 2010-09-16 | Kanazawa Inst Of Technology | 住宅換気システム |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6263312B1 (en) * | 1997-10-03 | 2001-07-17 | Alaris, Inc. | Audio compression and decompression employing subband decomposition of residual signal and distortion reduction |
BRPI0612579A2 (pt) * | 2005-06-17 | 2012-01-03 | Matsushita Electric Ind Co Ltd | pàs-filtro, decodificador e mÉtodo de pàs-filtraÇço |
CN101273404B (zh) * | 2005-09-30 | 2012-07-04 | 松下电器产业株式会社 | 语音编码装置以及语音编码方法 |
JP4708446B2 (ja) | 2007-03-02 | 2011-06-22 | パナソニック株式会社 | 符号化装置、復号装置およびそれらの方法 |
-
2011
- 2011-09-05 RU RU2013110317/08A patent/RU2013110317A/ru unknown
- 2011-09-05 JP JP2012532859A patent/JP5679470B2/ja not_active Expired - Fee Related
- 2011-09-05 BR BR112013005683A patent/BR112013005683A2/pt not_active IP Right Cessation
- 2011-09-05 US US13/820,760 patent/US9361892B2/en active Active
- 2011-09-05 CN CN201180040472.4A patent/CN103069483B/zh not_active Expired - Fee Related
- 2011-09-05 AU AU2011300248A patent/AU2011300248B2/en not_active Expired - Fee Related
- 2011-09-05 SG SG2013016431A patent/SG188413A1/en unknown
- 2011-09-05 KR KR1020137005813A patent/KR20130108281A/ko not_active Application Discontinuation
- 2011-09-05 WO PCT/JP2011/004960 patent/WO2012032759A1/ja active Application Filing
- 2011-09-09 TW TW100132614A patent/TW201218188A/zh unknown
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007043648A1 (ja) * | 2005-10-14 | 2007-04-19 | Matsushita Electric Industrial Co., Ltd. | 変換符号化装置および変換符号化方法 |
WO2008072733A1 (ja) * | 2006-12-15 | 2008-06-19 | Panasonic Corporation | 符号化装置および符号化方法 |
JP2009042739A (ja) * | 2007-03-02 | 2009-02-26 | Panasonic Corp | 符号化装置、復号装置およびそれらの方法 |
JP2009094666A (ja) * | 2007-10-05 | 2009-04-30 | Nippon Telegr & Teleph Corp <Ntt> | 多重ベクトル量子化方法、装置、プログラム及びその記録媒体 |
US20090112607A1 (en) | 2007-10-25 | 2009-04-30 | Motorola, Inc. | Method and apparatus for generating an enhancement layer within an audio coding system |
JP2010203657A (ja) | 2009-03-03 | 2010-09-16 | Kanazawa Inst Of Technology | 住宅換気システム |
Non-Patent Citations (1)
Title |
---|
TOMOFUMI YAMANASHI ET AL.: "ITU-T G.718- development of speech/audio codec for next- generation mobile communication systems", PANASONIC TECHNICAL JOURNAL, vol. 55, no. 1, 15 April 2009 (2009-04-15), pages 21 - 26 * |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2013051210A1 (ja) * | 2011-10-07 | 2013-04-11 | パナソニック株式会社 | 符号化装置及び符号化方法 |
JPWO2013051210A1 (ja) * | 2011-10-07 | 2015-03-30 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | 符号化装置及び符号化方法 |
US9558752B2 (en) | 2011-10-07 | 2017-01-31 | Panasonic Intellectual Property Corporation Of America | Encoding device and encoding method |
Also Published As
Publication number | Publication date |
---|---|
AU2011300248B2 (en) | 2014-05-15 |
AU2011300248A1 (en) | 2013-03-28 |
SG188413A1 (en) | 2013-04-30 |
TW201218188A (en) | 2012-05-01 |
BR112013005683A2 (pt) | 2018-01-23 |
KR20130108281A (ko) | 2013-10-02 |
CN103069483B (zh) | 2014-10-22 |
US9361892B2 (en) | 2016-06-07 |
JPWO2012032759A1 (ja) | 2014-01-20 |
JP5679470B2 (ja) | 2015-03-04 |
RU2013110317A (ru) | 2014-10-20 |
US20130166308A1 (en) | 2013-06-27 |
CN103069483A (zh) | 2013-04-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP5679470B2 (ja) | 符号化装置及び符号化方法 | |
EP1806736B1 (en) | Scalable encoding apparatus, scalable decoding apparatus, and methods thereof | |
JP5173800B2 (ja) | 音声符号化装置、音声復号化装置、およびこれらの方法 | |
KR101343267B1 (ko) | 주파수 세그먼트화를 이용한 오디오 코딩 및 디코딩을 위한 방법 및 장치 | |
US8099275B2 (en) | Sound encoder and sound encoding method for generating a second layer decoded signal based on a degree of variation in a first layer decoded signal | |
KR20080049085A (ko) | 음성 부호화 장치 및 음성 부호화 방법 | |
KR20080047443A (ko) | 변환 부호화 장치 및 변환 부호화 방법 | |
MXPA05006664A (es) | Metodo y dispositivo para la cuantizacion robusta de vector predictivo de parametros de prediccion lineal en la codificacion de conversacion a velocidad variable de bits. | |
KR20090087920A (ko) | 부호화 장치, 복호 장치 및 이들의 방법 | |
KR102072365B1 (ko) | 고급 양자화기 | |
CN102119414A (zh) | 用于在超帧中量化和逆量化线性预测系数滤波器的设备和方法 | |
US8898057B2 (en) | Encoding apparatus, decoding apparatus and methods thereof | |
WO2011086923A1 (ja) | 符号化装置、復号装置、スペクトル変動量算出方法及びスペクトル振幅調整方法 | |
JP5711733B2 (ja) | 復号装置、符号化装置及びこれらの方法 | |
WO2013051210A1 (ja) | 符号化装置及び符号化方法 | |
JP2003233397A (ja) | オーディオ符号化装置、オーディオ符号化プログラム及びオーディオ符号化データ伝送装置 | |
US9640191B2 (en) | Apparatus and method for processing an encoded signal and encoder and method for generating an encoded signal | |
KR20060064694A (ko) | 디지털 음성 코더들에서의 고조파 잡음 가중 | |
WO2012053146A1 (ja) | 符号化装置及び符号化方法 | |
Nagisetty et al. | Super-wideband fine spectrum quantization for low-rate high-quality MDCT coding mode of the 3GPP EVS codec |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 201180040472.4 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 11823238 Country of ref document: EP Kind code of ref document: A1 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2012532859 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 13820760 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: 20137005813 Country of ref document: KR Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2011823238 Country of ref document: EP |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
ENP | Entry into the national phase |
Ref document number: 2011300248 Country of ref document: AU Date of ref document: 20110905 Kind code of ref document: A |
|
ENP | Entry into the national phase |
Ref document number: 2013110317 Country of ref document: RU Kind code of ref document: A |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112013005683 Country of ref document: BR |
|
ENP | Entry into the national phase |
Ref document number: 112013005683 Country of ref document: BR Kind code of ref document: A2 Effective date: 20130308 |