WO2007116809A1 - ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 - Google Patents
ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 Download PDFInfo
- Publication number
- WO2007116809A1 WO2007116809A1 PCT/JP2007/056955 JP2007056955W WO2007116809A1 WO 2007116809 A1 WO2007116809 A1 WO 2007116809A1 JP 2007056955 W JP2007056955 W JP 2007056955W WO 2007116809 A1 WO2007116809 A1 WO 2007116809A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- channel signal
- delay time
- time difference
- decoding
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 238000012937 correction Methods 0.000 claims description 106
- 230000000630 rising effect Effects 0.000 claims description 56
- 238000004364 calculation method Methods 0.000 claims description 54
- 230000005236 sound signal Effects 0.000 claims description 35
- 230000003111 delayed effect Effects 0.000 claims description 7
- 238000005314 correlation function Methods 0.000 claims description 2
- 238000013459 approach Methods 0.000 claims 1
- 230000002194 synthesizing effect Effects 0.000 claims 1
- 230000015556 catabolic process Effects 0.000 abstract description 2
- 238000006731 degradation reaction Methods 0.000 abstract description 2
- 238000002955 isolation Methods 0.000 abstract 1
- 238000004891 communication Methods 0.000 description 35
- 238000010586 diagram Methods 0.000 description 22
- 238000000926 separation method Methods 0.000 description 13
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 description 11
- 230000006870 function Effects 0.000 description 9
- 238000010295 mobile communication Methods 0.000 description 6
- 238000012545 processing Methods 0.000 description 6
- 238000001514 detection method Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 238000013139 quantization Methods 0.000 description 4
- 230000005540 biological transmission Effects 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 230000006866 deterioration Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000011084 recovery Methods 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000002238 attenuated effect Effects 0.000 description 1
- 230000001934 delay Effects 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
Definitions
- stereo speech coding apparatus stereo speech decoding apparatus, and methods thereof
- the present invention relates to a stereo speech coding apparatus that encodes a stereo speech signal, a stereo speech decoding apparatus corresponding to the stereo speech coding apparatus, and a method thereof.
- V () V x (n-d-k) *
- a is a k-th order prediction coefficient as a prediction parameter that minimizes the prediction error.
- x (n) represents one channel signal at sample number n
- yXn represents the predicted other channel signal at sample number n.
- monaural communication is expected to reduce communication costs because it has a low bit rate, and mobile phones that support only monaural communication are less expensive because of their smaller circuit scale, and high-quality voice communication is desired. This is because users who don't have enough will purchase mobile phones that only support mono communication. Accordingly, mobile phones that support stereo communication and mobile phones that support monaural communication are mixed in a single communication system, and the communication system needs to support both stereo communication and monaural communication. Arise. Furthermore, in a mobile communication system, communication data is exchanged by radio signals, so some communication data may be lost depending on the propagation path environment. Therefore, it is very useful if the mobile phone has a function that can restore the communication data of the remaining received data power even if a part of the communication data is lost.
- Non-Patent Literature 1 Hendrik Fucns, Improving Joint Stereo Audio and oding by Adaptive Inter— Channel Prediction, Applications of Signal Processing to Audio and Acoustics ⁇ Final Program and Paper Summaries ⁇ IEEE Workshop on Pages: 39-42, (17— 20 Oct. 1993)
- Non-Patent Document 2 ISO / IEC 14496-3: 1999 (B.14 Scalable AAC with core coder) Invention Disclosure
- Non-Patent Document 1 is not able to predict as expressed by the above-described equation (1). If the order of the prediction coefficient is increased in order to reduce the prediction error, that is, if the number of prediction parameters is increased, the code bit rate will increase. is there. Conversely, if the order of the prediction coefficient is reduced for the purpose of suppressing the code bit rate, there is a problem that the prediction performance deteriorates and audio quality degradation occurs in the audio signal obtained on the decoding side. In addition, when the technique of Non-Patent Document 1 is applied to scalable coding as in Non-Patent Document 2, it is necessary to obtain a prediction coefficient for not only a stereo signal but also a monaural signal, and further, a sign bit. The rate increases.
- An object of the present invention is to encode and transmit a smaller amount of information, thereby reducing the bit rate and suppressing deterioration in sound quality, and a stereo audio encoding device, stereo audio decoding device, and It is to provide these methods.
- the stereo speech decoding apparatus of the present invention is a monophonic signal obtained by combining a preceding channel signal that precedes a stereo speech signal having two channel forces and a succeeding channel signal that is delayed in time.
- Monaural signal decoding means for decoding the encoded code information
- rising position decoding means for decoding encoded information in which the rising position changing from a silent section to a voiced section of the stereo audio signal is encoded
- Delay time difference decoding means for decoding code information in which the delay time difference between the preceding channel signal and the subsequent channel signal is encoded, and encoding in which the amplitude ratio between the subsequent channel signal and the preceding channel signal is encoded
- the preceding channel signal is recovered using amplitude ratio decoding means for decoding information, the monaural signal, the delay time difference, and the rising position.
- prior channel signal decoding means for, with the preceding channel signal, using said amplitude ratio, the effect of the invention adopts a configuration having a, a subsequent channel signal decoding means for decoding
- the stereo speech code is not encoded with the prediction coefficient between both channels, and less information about the rising position of the stereo signal, the delay time difference between both channels, and the amplitude ratio.
- FIG. 1 is a block diagram showing the main configuration of a stereo speech coding apparatus according to Embodiment 1.
- FIG. 2 is a diagram for explaining a rising position of a stereo speech signal according to Embodiment 1.
- 3] A diagram for explaining a delay time difference and an amplitude ratio between an L channel signal and an R channel signal according to Embodiment 1.
- FIG. 4 is a block diagram showing a main configuration of a stereo speech decoding apparatus according to Embodiment 1.
- FIG. 5 is a block diagram showing a detailed configuration of a stereo signal decoding unit according to Embodiment 1.
- FIG. 6 is a diagram for explaining the principle of stereo audio signal decoding processing in the stereo audio decoding device according to Embodiment 1;
- FIG. 7 is a diagram showing a stereo audio signal according to Embodiment 1 in a table.
- FIG. 8 is a block diagram showing the main configuration of a stereo speech coding apparatus according to Embodiment 2.
- FIG. 9 is a block diagram showing a detailed configuration of a second layer decoder according to the second embodiment.
- FIG. 10 is a block diagram showing the main configuration of a stereo speech decoding apparatus according to Embodiment 2.
- FIG. 11 is a block diagram showing the main configuration of a stereo speech coding apparatus according to Embodiment 3.
- FIG. 12 is a block diagram showing the main configuration of a stereo speech coding apparatus according to Embodiment 4.
- BEST MODE FOR CARRYING OUT THE INVENTION is a block diagram showing the main configuration of a stereo speech coding apparatus according to Embodiment 4.
- FIG. 1 is a block diagram showing the main configuration of stereo speech coding apparatus 100 according to Embodiment 1 of the present invention.
- stereo speech coding apparatus 100 includes first layer (base layer) encoder 140 and second layer (enhancement layer) encoder 150, and performs scalable coding of a stereo speech signal.
- the first layer encoder 140 includes a monaural signal generation unit 101 and a monaural signal encoding unit 102, and performs encoding of the monaural signal.
- Second layer encoder 150 includes rising position detector 103, rising position code encoder 104, delay time difference calculator 105, delay time difference encoder 106, amplitude ratio calculator 107, and amplitude ratio code.
- the encoding unit 108 is provided to perform stereo signal encoding. Each layer encoder transmits the obtained encoding parameter to stereo audio decoding apparatus 200 described later.
- the monaural signal generation unit 101 also generates a monaural signal S (n) from the input stereo audio signal, that is, the L channel signal S (n) and the R channel signal S (n).
- the monaural signal S (n) is generated by obtaining an average value of the L channel signal S (n) and the R channel signal S (n) according to the following equation (2).
- n indicates the sample number of the stereo audio signal.
- the monaural signal encoding unit 102 encodes the monaural signal S (n) generated by the monaural signal generation unit 101 using a CELP (Code Excited Linear Prediction) encoding method, and obtains the mono signal obtained.
- CELP Code Excited Linear Prediction
- Ral signal code key parameter P is transmitted to stereo speech decoding apparatus 200.
- the vocal tract information of the audio signal is encoded by obtaining an LSP parameter, and the sound source information of the audio signal is specified by identifying one of the previously stored audio models. Encode with an index indicating the model.
- Second layer encoder 150 uses R channel signal S (n) and R channel signal S () input to stereo speech coding apparatus 100 as the rising position, L channel signal S (
- the rising position detection unit 103 receives the input L channel signal S (n) and R channel signal.
- the rising position of the stereo audio signal is detected from signal S (n).
- a stereo sound signal has a silent section in which the amplitude of the sound signal is zero and a sound section in which the amplitude of the sound signal is not zero.
- the position where the audio signal starts to transition from the silent section to the voiced section is called the rising position B.
- the L channel signal S (n) and the R channel signal S (n) acquired at different positions of the signal generated by the same sound source are separated by the distance from the sound source.
- one channel signal precedes the preceding channel, while the other channel signal becomes the following channel signal, and the amplitude is attenuated by the amplitude of the preceding channel signal. is doing.
- the L channel signal S (
- the starting position is indicated by time axis 0.
- the rising position detection unit 103 detects the start position of the section where the silent period ends and only the L-channel signal exists as the rising position B, and information on the detected rising position B Output to part 104.
- the information about the rising position B is information identifying whether the channel signal preceding in time near the sound source power is the L channel signal or the R channel signal, and the amplitude of the preceding channel from zero to non- Contains both information indicating the position to turn to zero.
- the rising position code key unit 104 codes information related to the rising position B input from the rising position detection unit 103 and obtains the obtained rising position code parameter P
- Delay time difference calculation section 105 uses L channel signal S (n) and R channel signal S (n) input to stereo speech coding apparatus 100, and uses the L channel signal S (n) according to the following equation (3).
- ⁇ ( ⁇ ) is the cross-correlation function of L channel signal S ( ⁇ ) and R channel signal S ( ⁇ ).
- ⁇ indicates the number of samples contained in one frame
- m is the value for the L channel signal S (n).
- the delay time difference calculation unit 105 is an L channel.
- the value of T becomes a positive number and is delayed with respect to the L channel signal S 01) and the third channel signal S (n).
- the value of T is negative.
- the L channel signal is Since the case of leading the R channel signal is taken as an example, the value of T is a positive number.
- the delay time difference calculation unit 105 outputs the calculated delay time difference T to the delay time difference code unit 106 and the amplitude ratio calculation unit 107.
- the delay time difference encoding unit 106 encodes the delay time difference T input from the delay time difference calculation unit 105 and transmits the encoding parameter P to the stereo speech decoding apparatus 200.
- the amplitude ratio calculation unit 107 calculates the L channel signal S (n), the R channel signal S (n) input to the stereo speech coding apparatus 100, and the delay time difference calculated by the delay time difference calculation unit 105.
- the oscillation R width ratio g between the L channel signal S (n) and the R channel signal S (n) is calculated according to the following equation (4).
- a and A are R channel signal S (n) and L channel signal S (n).
- the average amplitude in one frame is shown.
- the amplitude ratio calculation unit 107 outputs the calculated amplitude ratio g to the amplitude ratio encoding unit 108.
- Figure 3 shows the delay time difference T and the amplitude ratio g between the L channel signal S (n) and the R channel signal S (n) calculated by the delay time difference calculation unit 105 and the amplitude ratio calculation unit 107, respectively.
- FIG. 3 is a diagram showing a delay time difference and an amplitude ratio between the L channel signal S (n) and the R channel signal S (n) acquired at different positions of signals generated by the same sound source.
- L channel signal S (n) L channel signal S (n)
- R channel signal S (n) R channel signal
- FIG. 3A shows the L channel signal S (n)
- FIG. 3B shows the relationship between the R channel signal S (n) and the L channel signal S (and R and n).
- the L channel signal S (n) is calculated as a delay time difference calculation unit.
- the signal length from device B to time axis 0 matches the delay time difference T.
- the amplitude of the signal S ′ (n) is multiplied by the amplitude ratio g calculated by the amplitude ratio calculation unit 107, the signal S (n) is the same sound source.
- this is a generated signal, it ideally matches the R channel signal S (n). For example, this figure And are the amplitudes of the R channel signal S (n) corresponding to time t, respectively.
- the amplitude ratio coding unit 108 codes the amplitude ratio g input from the amplitude ratio calculating unit 107, and transmits the obtained coding parameter P to the stereo speech decoding apparatus 200.
- the code processing in the stereo speech coding apparatus 100 is performed in units of frames, and the monaural signal code key parameter P, the rising position code key parameter P,
- Delay time difference encoding parameter P and amplitude ratio encoding parameter P are generated and
- FIG. 4 is a block diagram showing the main configuration of stereo speech decoding apparatus 200 according to the present embodiment.
- stereo audio decoding apparatus 200 includes first layer (base layer) decoder 240 and second layer (enhancement layer) decoder 250 corresponding to stereo audio encoding apparatus 100.
- the first layer decoder 240 includes a monaural signal decoding unit 201, and decodes monaural signals in units of frames using the monaural signal code key parameter P transmitted from the stereo speech coding apparatus 100.
- Second layer decoder 250 includes rising position decoding section 202 and stereo signal decoding section 203, and rising position code key parameter P transmitted from stereo speech coding apparatus 100.
- monaural signal decoding section 201 decodes a monaural signal using monaural signal code key parameter P transmitted from monaural signal code section 102 of stereo speech coding apparatus 100. To output a monaural decoded signal S ⁇ (n). here
- a CELP decoding scheme is used corresponding to the encoding scheme used in monaural signal encoding section 102. If the second layer decoder 250 is unable to decode the stereo signal, the stereo audio decoding signal generated by the stereo audio decoding device 200 is composed only of the monaural decoding signal S ⁇ (n), Monaural audio signal.
- the monaural signal decoding unit 201 outputs the monaural decoded signal S ⁇ (n) to the stereo signal decoding unit 203.
- rising position decoding section 202 recovers code key parameter P transmitted from rising position code key section 104 of stereo speech coding apparatus 100.
- Stereo signal decoding section 203 receives amplitude ratio encoding parameter P transmitted from amplitude ratio encoding section 108 of stereo speech encoding apparatus 100, delay time difference encoding g of stereo speech encoding apparatus 100
- the stereo signal is decoded, and the L channel decoded signal (n) and the R channel decoded signal ⁇ Output (n).
- FIG. 5 is a block diagram showing a detailed configuration of stereo signal decoding section 203 according to the present embodiment.
- stereo signal decoding section 203 includes amplitude ratio decoding section 231, delay time difference decoding section 232, preceding channel decoded signal separating section 233, subsequent channel decoded signal generating section 234, repetition operation control section 235, A preceding channel decoded signal storage unit 236 and a subsequent channel decoded signal storage unit 237 are provided.
- Amplitude ratio decoding section 231 decodes amplitude ratio encoding parameter P transmitted from amplitude ratio coding section 108 of stereo speech coding apparatus 100, and obtains decoded amplitude ratio g ⁇ as subsequent channel g.
- Delay time difference decoding section 232 decodes delay time difference code key parameter P transmitted from delay time difference encoding section 106 of stereo speech encoding apparatus 100, and obtains the obtained delay time difference.
- the difference is output to the preceding channel decoded signal separation unit 233 and the iterative calculation control unit 235.
- the preceding channel decoded signal separation unit 233 is configured to decode the monaural decoded signal ⁇ (n) input from the monaural signal decoding unit 201 and the decoding delay time difference input from the delay time difference decoding unit 232 ⁇ " ⁇ rise position decoding Using the decoding rising position B input from the unit 202 and the subsequent channel decoded signal S ⁇ (n) input from the subsequent channel decoded signal generation unit 234, monaural
- the preceding channel decoded signal ⁇ (n) is separated from the decoded signal S ⁇ (n).
- the L channel is the preceding channel and the R channel is the subsequent channel.
- the row channel decoded signal separation unit 233 repeats the same calculation in all sections based on the control of the iterative calculation control unit 235 in the above-described separation process.
- the preceding channel decoded signal separation unit 233 converts the obtained L channel decoded signal ⁇ (n) into the succeeding channel decoded signal generation.
- Subsequent channel decoded signal generation section 234 uses the decoded amplitude input from amplitude ratio decoding section 231 and the L channel decoded signal S ⁇ (n) input from preceding channel decoded signal separation section 233 to perform subsequent channel decoding.
- Signal that is, the R channel decoded signal in this embodiment.
- the subsequent channel decoded signal generation unit 234 performs the above processing.
- the subsequent channel decoded signal generation unit 234 precedes the generated R channel decoded signal ⁇ (n).
- the iterative calculation control unit 235 uses the decoding delay time difference T input from the delay time difference decoding unit 232 and the decoding rising position ⁇ input from the rising position decoding unit 202 to use the preceding channel decoded signal separation unit 233, And subsequent channel decoded signal generator 23
- L channel signal S ⁇ (n) and R channel decoded signal S ⁇ (n) are generated.
- the preceding channel decoded signal storage unit 236 and the succeeding channel decoded signal storage unit 237 are respectively input to the preceding channel decoded signal separating unit 233 and the succeeding channel decoded signal generating unit 234.
- S ⁇ (n) and R channel decoded signal ⁇ (n) and R are stored, and L channel decoded signal S ⁇ (n) corresponding to the same delay time difference T unit is stored.
- S (n) and S (n) are the L channel signal and the R channel signal, respectively.
- N indicates the sample number.
- One frame consists of N samples.
- a solid line indicates the L channel signal S (n)
- a broken line indicates the R channel.
- Signal S (n) and the solid and broken lines in FIG.
- the R channel signal S (n) is shown at the same time.
- the case where the delay time difference T is smaller than one frame length is taken as an example, and the section from the rising position B to the first delay time difference T is shown as section 0.
- one frame of L channel signal S (n) is divided into interval 1 and interval 2 for each delay time difference T.
- the letters (1) and (2) indicate the section number. Since the frame length is not always an integral multiple of the delay time difference T, the last interval in one frame may be shorter than the delay time difference T.
- one frame of the R channel signal S (n) is also divided into sections 1 for each delay time difference T,
- Section 2 ... is separated.
- the R channel signal of each section is indicated by S (1) (n), S (2) (n), ...
- the stereo speech decoding apparatus 200 is connected to the monaural decoded signal ⁇ according to the following equation (5).
- the signal ⁇ (Q) (n) corresponding to interval 0 in (n) is replaced with the L channel decoded signal S ⁇ (Q) ( n ) in interval 0.
- the waveform of the R channel signal S ( ⁇ ) indicated by a broken line is an L channel indicated by a solid line.
- the amplitude of the R channel signal S ( ⁇ ) is multiplied by the amplitude ratio g (g ⁇ l) to the L channel signal S ( ⁇ ).
- L channel signal S (n) and R channel signal S (n) are
- Equation (6) The relationship shown in Equation (6) is satisfied.
- stereo speech decoding apparatus 200 scales L section decoded signal S ⁇ ( — T) in section 0 using the following equation (7), and R channel signal ⁇ (1 ) (n) to find R
- the stereo speech decoding apparatus 200 can decode stereo speech.
- stereo audio decoding apparatus 200 first detects L channel signal S in monaural signal S (n) not in a section where L channel signal S (n) and R channel signal S (n) are mixed.
- the stereo speech decoding apparatus 200 identifies and
- FIG. 7 is a diagram showing the stereo audio signals shown in FIG. 6 in a table.
- the first line shows the frame order
- the second line shows the section number.
- the third row shows the range of possible values for sample number n
- the fourth and fifth rows show the L channel signal and R channel signal corresponding to each section, respectively.
- stereo audio signal decoding procedure in stereo audio decoding apparatus 200 will be described in detail.
- monaural signal decoding section 201 decodes monaural signal code parameter P to obtain monaural decoded signals S ⁇ (n).
- the rising position decoding unit 202 decodes the rising position code key parameter P.
- the amplitude ratio decoding unit 231 decodes the amplitude ratio sign key parameter P, and decodes the amplitude ratio g g
- the delay time difference decoding unit 232 decodes and decodes the delay time difference encoding parameter P.
- the preceding channel decoded signal separation unit 233 performs decoding delay time difference T monaural decoded signal. Using the signal S ⁇ (n) and the decoding rising position, the L channel decoded signal S ⁇ (Q) (n) in section 0 is obtained.
- subsequent channel decoded signal generation section 234 obtains R channel decoded signal S ⁇ ( ⁇ ) in section 1 according to the above equation (7).
- monaural signal S ( ⁇ ) is obtained as an average value of L channel signal S ( ⁇ ) and R channel signal S ( ⁇ ), so that the preceding channel recovery R
- the signal signal separation unit 233 performs the L channel decoded signal ⁇ ⁇ signal in section 1 according to the following equation (8).
- Equation (8) equation (7) is substituted. That is, ⁇ (0) (nT) (0 ⁇ n ⁇ T) corresponding to the L channel decoded signal in section 0 obtained by the preceding channel decoded signal separation unit 233 is input to the subsequent channel decoded signal generation unit 234.
- the preceding channel decoded signal separating unit 233 and the succeeding channel decoded signal generating unit 234 perform the operations shown in the above equations (7) and (8) based on the control of the iterative operation control unit 235.
- the L channel decoded signal ( ⁇ ) and the R channel decoded signal ( ⁇ ) in all intervals are obtained while recursively repeating in the interval 2 and thereafter.
- Equation (7) the R channel signal ⁇ (2) ( ⁇ ) in section 2 is similarly calculated as shown in Equation (7).
- L channel decoded signal ⁇ ( ⁇ ) and R channel decoded signal ⁇ "( ⁇ ) and R in interval j + 1 are R channel decoded signal ⁇ ( 2 ) ( ⁇ ) and R channel decoded in interval 2
- the signal ⁇ ( 2 ) ( ⁇ ) can be obtained by recursively using the result of the operation in interval j in the same way as the method of finding R.
- the R channel decoded signal in interval j + 1 ⁇ ° ") Is obtained according to equation (11) below:
- J J is an integer that satisfies J * T ⁇ N ⁇ Li + ⁇ ) • ⁇
- preceding channel decoded signal separation section 233 may obtain L channel decoded signal S ⁇ ⁇ (n) using only monaural decoded signal S ⁇ (n) according to equation (17) above. .
- Such a place may obtain L channel decoded signal S ⁇ ⁇ (n) using only monaural decoded signal S ⁇ (n) according to equation (17) above. .
- R channel decoded signal S ⁇ ⁇ (n) is scaled from L channel decoded signal S ⁇ ⁇ (n).
- the stereo speech coding apparatus replaces the monaural signal and the prediction information of the L channel signal and the R channel signal in all sections with the monaural signal.
- the signal, rise position, delay time difference, and amplitude ratio are encoded and transmitted to the stereo audio decoding device.
- the stereo speech decoding apparatus decodes a stereo speech signal by performing repetitive calculations using code key information that is also transmitted with a stereo speech code. Since the amount of information of the rise position, delay time difference, and amplitude ratio is smaller than the prediction information of the L channel signal and R channel signal in all sections, according to this embodiment, the prediction coefficient is reduced and more Stereo audio signals can be transmitted at a low bit rate.
- the stereo audio signal power channel signal and the R channel signal are composed of two channels, and the L channel signal is closer to the sound source power than the R channel signal. Even if the R channel signal is closer to the sound source power than the L channel signal, the present embodiment can be applied. In such a case, in the interval 0 from the voice rising position to the first delay time difference T, the L channel No signal exists, only R channel signal exists Exists. Furthermore, even when a stereo audio signal has a signal strength of three or more channels, the present embodiment can be applied with appropriate modifications.
- the stereo decoding apparatus performs the scale adjustment of the L channel signal in section 0 and performs decoding as the R channel signal in section 1 has been described as an example. May be stored in advance and used as the R channel signal (or L channel signal) in section 1.
- the case where the CELP code method is used as the monaural signal code method has been described as an example, but another code method different from the CELP code method is used. Also good.
- a stereo audio signal is encoded and transmitted.
- a stereo audio signal consisting of a silence interval and a sound interval is encoded and transmitted. You may do it.
- FIG. 8 is a block diagram showing the main configuration of stereo speech coding apparatus 300 according to Embodiment 2 of the present invention.
- Stereo speech coding apparatus 300 has the same basic configuration as stereo speech coding apparatus 100 (see FIG. 1) shown in Embodiment 1, and the same components are the same. Reference numerals are assigned and explanations thereof are omitted.
- Stereo speech coding apparatus 300 further includes a first layer decoder 240a, a second layer decoder 450a, an error signal calculation unit 301, and an error signal coding unit 302, so that the stereo speech coding apparatus 300 described in Embodiment 1 is provided. This is different from speech encoding apparatus 100.
- first layer decoder 240a, second layer decoder 450a, error signal calculation unit 301, error signal coding unit 302, and second layer encoder 150 constitute second layer encoder 350.
- the first layer decoder 240a, second layer decoder 450a, error signal calculation unit 301, error signal coding unit 302, and second layer encoder 150 constitute
- first layer decoding as a local decoder
- the encoder 240a has the same configuration and function as the first layer decoder 240 included in the stereo speech decoding apparatus 200 according to Embodiment 1. That is, the first layer decoder 240a receives the monaural signal code key parameter P generated by the monaural signal code unit 102, decodes the monaural signal, and obtains the monaural decoded signal S ⁇ (n) obtained.
- Second layer decoder 45 Oa as another local decoder of stereo speech coding apparatus 300, monaural decoded signal S ⁇ (n) generated by first layer decoder 240a, is generated by rising position encoding section 104.
- Second layer decoder 450a outputs the generated L channel decoded signal S ⁇ (n) and R channel decoded signal S ⁇ (n) to error signal calculating section 301.
- the detailed configuration of the second layer decoder 450a will be described later.
- Error signal calculation section 301 includes L channel signal S (n) and R channel signal S (n), which are input signals of stereo speech coding apparatus 300, and the L channel generated by the second layer decoder. R
- L channel error signal A S (n) and R channel error signal A S (n) are calculated.
- the error signal calculation unit 301 calculates the calculated L channel error signal A S (n) and the R channel.
- the error signal A S (n) is output to the error signal sign key unit 302.
- Error signal sign key unit 302 is an L channel error signal calculated by error signal calculation unit 301.
- a S (n) and R channel error signal A S (n) are signed and the L channel error signal is coded R
- a stereo speech decoding apparatus that converts parameter P and R channel error signal coding parameter P
- FIG. 9 is a block diagram showing a detailed configuration of second layer decoder 450a according to the present embodiment.
- the second layer decoder 450a is the second layer decoder shown in the first embodiment.
- the same basic configuration as that of the DA 250 (see FIG. 4) is given, and the same components are denoted by the same reference numerals and the description thereof is omitted.
- Second layer decoder 450a is different from second layer decoder 250 shown in the first embodiment in that error signal decoding section 401 and decoded signal correction section 402 are further provided.
- Error signal decoding section 401 decodes L channel error signal encoding parameter P and R channel error signal encoding parameter P input from error signal encoding section 302, and generates an L channel error.
- the decoded signal (n) and the R channel error decoded signal (n) are output to the decoded signal correction unit 402.
- the decoded signal correction unit 402 generates an L channel error decoded signal (n), an R channel error decoded signal (n) generated by the error signal decoding unit 401, and an R signal generated by the stereo signal decoding unit 203.
- a decoded signal S ⁇ (n) is generated and output to the stereo signal decoding unit 203.
- L-channel decoded signal S ⁇ (n) and R-channel decoded signal S ⁇ (n) are used for decoding the stereo audio signal in the next section of the R-telo signal decoding unit 203.
- the encoding parameters generated by stereo speech encoding apparatus 300 and transmitted to stereo speech decoding apparatus 400 are monaural signal encoding parameter P and rising position encoding parameter P.
- FIG. 10 is a block diagram showing the main configuration of stereo speech decoding apparatus 400 according to the present embodiment.
- stereo audio decoding apparatus 400 includes first layer decoder 240 and second layer decoder 450.
- First layer decoder 240 of stereo speech decoding apparatus 400 4 has the same configuration and function as the first layer decoder 240 shown in FIG. 4, and a description thereof will be omitted here.
- Second layer decoder 450 of stereo speech decoding apparatus 400 has the same configuration and function as second layer decoder 450a shown in FIG. That is, the second layer decoder 450 transmits the rising position encoding parameter P, the delay time difference encoding parameter P, the amplitude ratio encoding parameter P, L transmitted from the stereo speech encoding apparatus 300.
- the stereo speech coding apparatus has an L channel error signal code parameter P and an R channel error signal code signal compared to Embodiment 1.
- the stereo speech coding apparatus can generate and output the L channel decoded signal S ⁇ (n) and the R channel decoded signal S ⁇ (n) with less error, and R
- the stereo encoding device obtains the rising position coding information and transmits it to the stereo decoding device has been described as an example.
- the stereo coding device has the rising position detection unit.
- the rising position code key unit is not provided, and the stereo decoding device does not include the rising position decoding unit, and the rising position is detected by the error signal correction unit and the stereo signal decoding unit on the stereo decoding device side to perform decoding. It's okay.
- the case where the error signals of both the L channel signal and the R channel signal are encoded is described as an example.
- the error signal of the preceding channel signal which is the L channel signal in this embodiment, is described. Only the sign may be entered.
- the quality of the stereo audio signal decoded by the stereo audio decoding device is higher when encoding the error signal of both the L channel signal and the R channel signal than when encoding only the error signal of the preceding channel signal. Can be further improved.
- the case where feedback is not provided to the L channel decoded signal and the R channel decoded signal power stereo signal decoding unit output from the stereo speech decoding apparatus has been described as an example.
- Output L channel decoded signal If the stereo audio decoding device is forced to be fed back to the stereo signal decoding unit and used in units of delay time difference, the stereo speech decoding apparatus further converts the L channel decoded signal and the R channel decoded signal with less error. Can be obtained and output.
- FIG. 11 is a block diagram showing the main configuration of stereo speech coding apparatus 500 according to Embodiment 3 of the present invention.
- Stereo speech coding apparatus 500 has the same basic configuration as stereo speech coding apparatus 100 (see FIG. 1) shown in Embodiment 1, and the same components are assigned the same reference numerals. The description is omitted.
- Stereo speech coding apparatus 500 further includes a delay time difference correction value calculation unit 501, a delay time difference correction value encoding unit 502, an amplitude ratio correction value calculation unit 503, and an amplitude ratio correction value encoding unit 504. This is different from stereo speech coding apparatus 100 shown in the first embodiment.
- Delay time difference correction value calculation section 501 calculates L channel signal S (n) and R channel signal S (n) in a length corresponding to delay time difference T input from R delay time difference calculation section 105. Dividing into K intervals, the delay R between the L channel signal S (kT + n) and the R channel signal S (kT + n) in each interval R
- the delay time difference T is the fluctuation amount ⁇ ⁇ ⁇ ⁇ ⁇ with respect to the delay time difference T, that is, the delay time k k
- the delay time difference correction value calculation unit 501 first uses the following equation (22) to cross-correlate the L channel signal S (kT + n) and the R channel signal S (kT + n) in the k interval. Function then R
- T indicates the number of samples included in each section, and indicates the number of shift samples of the R channel signal S (n) with respect to the L channel signal S (k and n).
- ⁇ () is in k section
- the delay time difference calculation unit 105 calculates the value of ⁇ that maximizes the value of ⁇ ( ⁇ ) as L
- the delay time difference T is the L channel signal and the R channel in one frame.
- the delay time difference T indicates the signal delay time difference, while the delay time difference T
- the delay time difference correction value calculation unit 501 calculates the amount of variation of the delay time difference T in the k interval with respect to the delay time difference T as the delay time difference correction value ⁇ ⁇ ⁇ ⁇ ⁇ in the k interval using the following equation (23).
- the delay time difference correction value calculation unit 501 uses the calculated delay time difference correction value ⁇ ⁇ as the delay time k.
- Difference correction value sign ⁇ part 502 is output to the delay time difference T in the k interval and the amplitude ratio correction value calculation k
- the delay time difference correction value sign key unit 502 encodes the delay time difference correction value ⁇ input from the delay time difference correction value calculation unit 501, and generates a generated delay time difference correction value code parameter k parameter P. Are transmitted to a stereo speech decoding apparatus (not shown) according to the present embodiment.
- the amplitude ratio correction value calculation unit 503 delays the L channel signal S (n) and the R channel signal S (n), R
- Delay time difference T input from time difference calculation unit 105 is divided into K sections having length T, and delay time difference T input from delay time difference correction value calculation unit 501 and amplitude ratio calculation unit 1 k
- the scale channel signal S (kT + n) is calculated as a fluctuation amount Ag with respect to the amplitude ratio g, that is, an amplitude ratio correction value Ag in the k R k k k section.
- the amplitude ratio g between the signal S (kT + n) and the L channel signal S (kT + n) is calculated.
- the amplitude ratio g indicates the amplitude ratio of the L channel signal and the R channel signal in one frame as a whole, whereas the amplitude ratio g indicates the L channel in each section in one frame.
- an amplitude ratio correction value calculation unit 503 The following equation (25) is used to calculate the amount of fluctuation of the amplitude ratio g in the k interval with respect to the amplitude ratio g as kk
- the amplitude ratio correction value calculation unit 503 performs the R channel signal S (kT + n) in the k section.
- the ratio with the ratio g is calculated as the amplitude ratio correction value Ag.
- the amplitude ratio correction value calculation unit 503 calculates k
- the amplitude ratio correction value ⁇ g thus output is output to the amplitude ratio correction value sign key section 504.
- the amplitude ratio correction value sign key unit 504 encodes the amplitude ratio correction value ⁇ g input from the amplitude ratio correction value calculation unit 503, and implements the generated amplitude ratio correction value sign key parameter P.
- Stereo speech decoding apparatus has the basic configuration and functions of stereo speech decoding apparatus 200 according to Embodiment 1 of the present invention, and includes delay time difference correction values ⁇ and k and Stereo audio recovery in that the stereo audio is decoded using the amplitude ratio correction value ⁇ g.
- the delay time difference correction value encoding parameter P the delay time difference correction value encoding parameter P
- the amplitude ratio decoding unit 231 also encodes the amplitude ratio correction value encoding parameter ⁇ g
- the stereo speech coding apparatus divides one frame of stereo speech signal into a plurality of sections with a length corresponding to the delay time difference, and in each section.
- the delay time difference correction value ⁇ ⁇ and the amplitude ratio correction value A g are smaller than the delay time difference T k k k and the amplitude ratio g in the k interval, so that the stereo audio signal can be transmitted at a lower bit rate.
- the delay time difference correction value calculation unit 501 has the following equation (22):
- the case where the cross-correlation value is calculated using the k interval whose length is the delay time difference T as the calculation range has been described as an example, but is not limited to this, and includes the k interval (T—A a) to (T A b)
- the cross-correlation value may be calculated using the range section as the calculation range.
- delay time difference correction value encoding unit 502 individually encodes the delay time difference correction value ⁇ in each section, and sets K delay time difference correction value encoding parameters.
- the case of generating k data P has been described as an example, but K delay time difference correction values ⁇ ⁇ are summarized.
- ATk k is encoded, and one delay time difference correction value sign key parameter (for example, P
- amplitude ratio correction value sign key section 504 individually codes amplitude ratio correction value ⁇ g in each section, and K amplitude ratio correction value sign key parameters.
- K amplitude ratio correction values Ag are collectively signed, and one k
- An amplitude ratio correction value encoding parameter (for example, P) may be generated.
- FIG. 12 is a block diagram showing the main configuration of stereo speech coding apparatus 700 according to the present embodiment.
- Stereo speech coding apparatus 700 has the same basic configuration as stereo speech coding apparatus 500 (see FIG. 11) shown in Embodiment 3 of the present invention. Reference numerals are assigned and explanations thereof are omitted.
- Delay time difference correction value coding unit 702 of stereo speech coding apparatus 700, amplitude ratio correction value coding unit 704, delay time difference correction value coding unit 502 of stereo speech coding apparatus 500, amplitude ratio correction value coding unit There is a difference in part of the process from 2004, and a different symbol is attached to indicate that.
- Delay time difference correction value code unit 702 further incorporates a first code bit table, and is input from delay time difference correction value calculation unit 501 using the built-in first code bit table. This is different from the delay time difference correction value sign unit 502 in that the delay time difference correction value is signed.
- the first code key table is a code k for each section for signing the delay time difference correction value ⁇ (l ⁇ k ⁇ K) in each section input from the delay time difference correction value calculation unit 501.
- the total number of bits is denoted as M, and the delay time difference correction value ⁇ in each interval k is encoded k
- the delay time difference correction value sign ⁇ part 702 is a section closer to the tail of the frame than a section near the head of the frame, that is, a section having a larger section number k More encoded bits are allocated to the encoding of the delay time difference correction value ⁇ k in FIG.
- the amplitude ratio correction value code key unit 704 further includes a second code key bit table, and is input from the amplitude ratio correction value calculation unit 503 using the built-in second code bit table.
- the difference from the amplitude ratio correction value encoding unit 504 is that the amplitude ratio correction value is encoded.
- the second coding bit table is used to code the amplitude ratio correction value A g (l ⁇ k ⁇ K) in each section input from the amplitude ratio correction value calculation unit 503. ⁇ Has the number of bits. 1 frame k
- N is the total number of bits for signing all the amplitude ratio correction values ⁇ ⁇ in the system.
- Equation (k) indicates the number of scalar quantization bits.
- the amplitude ratio correction value sign ⁇ part 704 is a section closer to the tail of the frame than a section near the beginning of the frame, that is, a section having a larger section number k.
- Stereo speech decoding apparatus 800 (not shown) according to the present embodiment performs scanning according to equation (17). Obtain the teleo speech decoded signal, and further, delay time difference correction value ⁇ ⁇ and amplitude ratio correction value
- the Leo speech decoding apparatus 800 recursively uses the delay time difference T and the amplitude ratio g in order to obtain a stereo speech decoded signal for each section in one frame, the section number k increases and the required stereo is obtained. The error of the speech decoded signal also increases. The reason is that the interval number k increases and the delay time difference correction value ⁇ ⁇ and the amplitude ratio correction value Ag increase.
- the sound quality of the speech decoded signal can be improved.
- the stereo speech coding apparatus encodes the amplitude ratio correction value and the amplitude ratio correction value in the section closer to the tail of the frame than the section near the head of the frame. Since more encoded bits are allocated, the prediction error can be reduced and the sound quality of the stereo audio decoded signal can be improved.
- the present invention is not limited to this. It is also possible to divide all K sections in one frame into a plurality of blocks, and increase the number of code bits for each block as it is closer to the tail of the frame. In other words, the same sign bit 3 ⁇ 4c is used as the sign of the delay time difference correction value or the amplitude ratio correction value in each section in the same block.
- the code bit allocation method according to the present embodiment is applied to the second embodiment of the present invention, the effect of reducing the prediction error can be obtained.
- the error signal coding unit 302 quantizes the L channel error signal and the R channel error signal input from the error signal calculation unit 301, the error signal coding unit 302 quantizes the frame from the beginning of the frame. The closer to the end of the frame, the more the number of bits may be used for quantization.
- stereo speech coding apparatus stereo speech decoding apparatus, and methods according to the present invention are not limited to the above embodiments, and can be implemented with various modifications.
- the stereo speech coding apparatus and stereo speech decoding apparatus are mobile communication devices.
- the communication terminal device and the base station device can be installed in the communication terminal device and the base station device in the communication system, thereby providing the communication terminal device and the base station device having the same effects as described above.
- the stereo speech coding apparatus, the stereo speech decoding apparatus, and these methods according to the present invention can also be used in a wired communication system.
- the stereo signal code section according to the present invention and the normal stereo signal code section are both provided, and the mode switching section is based on the degree of correlation between the L channel signal and the R channel signal. It is also possible to adopt a configuration in which the stereo signal code key section used for is switched. If the correlation between the L channel signal and the R channel signal is less than the threshold value, the L channel signal and the R channel signal are encoded separately using a normal stereo signal encoding unit. If the degree of correlation between the L channel signal and the R channel signal is higher than the threshold value, the L channel signal and the R channel signal are encoded using the stereo signal encoding unit according to the present invention. .
- the power described with reference to an example in which the present invention is configured by nodeware can also be realized by software.
- the stereo speech coding method of the present invention is described by describing the processing algorithm of the stereo speech coding method according to the present invention in a programming language, storing this program in a memory and executing it by the information processing means. It is possible to realize the same function as the key device.
- each functional block used in the description of each of the above embodiments is typically realized as an LSI that is an integrated circuit. These may be individually made into one chip, or may be made into one chip to include some or all of them!
- IC integrated circuit
- system LSI system LSI
- super LSI super LSI
- unroller LSI etc.
- the method of circuit integration is not limited to LSI, and may be realized by a dedicated circuit or a general-purpose processor.
- Reconfigurable FPGA Field Programmable Gate Array
- Reconfigurable FPGA Field Programmable Gate Array
- the stereo speech coding apparatus, the stereo speech decoding apparatus, and these methods according to the present invention can be applied to applications such as a communication terminal apparatus in a mobile communication system.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/295,073 US20090276210A1 (en) | 2006-03-31 | 2007-03-29 | Stereo audio encoding apparatus, stereo audio decoding apparatus, and method thereof |
JP2008509811A JPWO2007116809A1 (ja) | 2006-03-31 | 2007-03-29 | ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006099913 | 2006-03-31 | ||
JP2006-099913 | 2006-03-31 | ||
JP2006272132 | 2006-10-03 | ||
JP2006-272132 | 2006-10-03 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2007116809A1 true WO2007116809A1 (ja) | 2007-10-18 |
Family
ID=38581103
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2007/056955 WO2007116809A1 (ja) | 2006-03-31 | 2007-03-29 | ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 |
Country Status (3)
Country | Link |
---|---|
US (1) | US20090276210A1 (ja) |
JP (1) | JPWO2007116809A1 (ja) |
WO (1) | WO2007116809A1 (ja) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2009142017A1 (ja) * | 2008-05-22 | 2009-11-26 | パナソニック株式会社 | ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法 |
WO2010084756A1 (ja) * | 2009-01-22 | 2010-07-29 | パナソニック株式会社 | ステレオ音響信号符号化装置、ステレオ音響信号復号装置およびそれらの方法 |
JP5413839B2 (ja) * | 2007-10-31 | 2014-02-12 | パナソニック株式会社 | 符号化装置および復号装置 |
WO2022097236A1 (ja) * | 2020-11-05 | 2022-05-12 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
WO2022097235A1 (ja) * | 2020-11-05 | 2022-05-12 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491393B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491394B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491395B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
Families Citing this family (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008132826A1 (ja) * | 2007-04-20 | 2008-11-06 | Panasonic Corporation | ステレオ音声符号化装置およびステレオ音声符号化方法 |
US8359196B2 (en) * | 2007-12-28 | 2013-01-22 | Panasonic Corporation | Stereo sound decoding apparatus, stereo sound encoding apparatus and lost-frame compensating method |
EP2254110B1 (en) * | 2008-03-19 | 2014-04-30 | Panasonic Corporation | Stereo signal encoding device, stereo signal decoding device and methods for them |
CN101989429B (zh) * | 2009-07-31 | 2012-02-01 | 华为技术有限公司 | 转码方法、装置、设备以及系统 |
US9813262B2 (en) * | 2012-12-03 | 2017-11-07 | Google Technology Holdings LLC | Method and apparatus for selectively transmitting data using spatial diversity |
US9979531B2 (en) | 2013-01-03 | 2018-05-22 | Google Technology Holdings LLC | Method and apparatus for tuning a communication device for multi band operation |
US10229697B2 (en) | 2013-03-12 | 2019-03-12 | Google Technology Holdings LLC | Apparatus and method for beamforming to obtain voice and noise signals |
KR101808810B1 (ko) * | 2013-11-27 | 2017-12-14 | 한국전자통신연구원 | 음성/무음성 구간 검출 방법 및 장치 |
US10074373B2 (en) * | 2015-12-21 | 2018-09-11 | Qualcomm Incorporated | Channel adjustment for inter-frame temporal shift variations |
US10872611B2 (en) * | 2017-09-12 | 2020-12-22 | Qualcomm Incorporated | Selecting channel adjustment method for inter-frame temporal shift variations |
CN113948097A (zh) * | 2020-07-17 | 2022-01-18 | 华为技术有限公司 | 多声道音频信号编码方法和装置 |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0629859A (ja) * | 1992-03-02 | 1994-02-04 | American Teleph & Telegr Co <Att> | デジタル入力信号符号化方法 |
JPH0651795A (ja) * | 1992-03-02 | 1994-02-25 | American Teleph & Telegr Co <Att> | 信号量子化装置及びその方法 |
JPH0675590A (ja) * | 1992-03-02 | 1994-03-18 | American Teleph & Telegr Co <Att> | 知覚モデルに基づく音声信号符号化方法とその装置 |
JP2005529520A (ja) * | 2002-06-05 | 2005-09-29 | ソニック・フォーカス・インク | 音響仮想現実エンジンおよび配信された音声改善のための新技術 |
WO2006003813A1 (ja) * | 2004-07-02 | 2006-01-12 | Matsushita Electric Industrial Co., Ltd. | オーディオ符号化及び復号化装置 |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5812971A (en) * | 1996-03-22 | 1998-09-22 | Lucent Technologies Inc. | Enhanced joint stereo coding method using temporal envelope shaping |
US6345246B1 (en) * | 1997-02-05 | 2002-02-05 | Nippon Telegraph And Telephone Corporation | Apparatus and method for efficiently coding plural channels of an acoustic signal at low bit rates |
US5890125A (en) * | 1997-07-16 | 1999-03-30 | Dolby Laboratories Licensing Corporation | Method and apparatus for encoding and decoding multiple audio channels at low bit rates using adaptive selection of encoding method |
DE19742655C2 (de) * | 1997-09-26 | 1999-08-05 | Fraunhofer Ges Forschung | Verfahren und Vorrichtung zum Codieren eines zeitdiskreten Stereosignals |
US7240001B2 (en) * | 2001-12-14 | 2007-07-03 | Microsoft Corporation | Quality improvement techniques in an audio encoder |
EP1619664B1 (en) * | 2003-04-30 | 2012-01-25 | Panasonic Corporation | Speech coding apparatus, speech decoding apparatus and methods thereof |
US7809579B2 (en) * | 2003-12-19 | 2010-10-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Fidelity-optimized variable frame length encoding |
WO2006025313A1 (ja) * | 2004-08-31 | 2006-03-09 | Matsushita Electric Industrial Co., Ltd. | 音声符号化装置、音声復号化装置、通信装置及び音声符号化方法 |
US7783480B2 (en) * | 2004-09-17 | 2010-08-24 | Panasonic Corporation | Audio encoding apparatus, audio decoding apparatus, communication apparatus and audio encoding method |
EP2138999A1 (en) * | 2004-12-28 | 2009-12-30 | Panasonic Corporation | Audio encoding device and audio encoding method |
US8296134B2 (en) * | 2005-05-13 | 2012-10-23 | Panasonic Corporation | Audio encoding apparatus and spectrum modifying method |
-
2007
- 2007-03-29 WO PCT/JP2007/056955 patent/WO2007116809A1/ja active Application Filing
- 2007-03-29 US US12/295,073 patent/US20090276210A1/en not_active Abandoned
- 2007-03-29 JP JP2008509811A patent/JPWO2007116809A1/ja active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0629859A (ja) * | 1992-03-02 | 1994-02-04 | American Teleph & Telegr Co <Att> | デジタル入力信号符号化方法 |
JPH0651795A (ja) * | 1992-03-02 | 1994-02-25 | American Teleph & Telegr Co <Att> | 信号量子化装置及びその方法 |
JPH0675590A (ja) * | 1992-03-02 | 1994-03-18 | American Teleph & Telegr Co <Att> | 知覚モデルに基づく音声信号符号化方法とその装置 |
JP2005529520A (ja) * | 2002-06-05 | 2005-09-29 | ソニック・フォーカス・インク | 音響仮想現実エンジンおよび配信された音声改善のための新技術 |
WO2006003813A1 (ja) * | 2004-07-02 | 2006-01-12 | Matsushita Electric Industrial Co., Ltd. | オーディオ符号化及び復号化装置 |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5413839B2 (ja) * | 2007-10-31 | 2014-02-12 | パナソニック株式会社 | 符号化装置および復号装置 |
WO2009142017A1 (ja) * | 2008-05-22 | 2009-11-26 | パナソニック株式会社 | ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法 |
WO2010084756A1 (ja) * | 2009-01-22 | 2010-07-29 | パナソニック株式会社 | ステレオ音響信号符号化装置、ステレオ音響信号復号装置およびそれらの方法 |
US8504378B2 (en) | 2009-01-22 | 2013-08-06 | Panasonic Corporation | Stereo acoustic signal encoding apparatus, stereo acoustic signal decoding apparatus, and methods for the same |
WO2022097236A1 (ja) * | 2020-11-05 | 2022-05-12 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
WO2022097235A1 (ja) * | 2020-11-05 | 2022-05-12 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491393B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491394B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
JP7491395B2 (ja) | 2020-11-05 | 2024-05-28 | 日本電信電話株式会社 | 音信号精製方法、音信号復号方法、これらの装置、プログラム及び記録媒体 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2007116809A1 (ja) | 2009-08-20 |
US20090276210A1 (en) | 2009-11-05 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2007116809A1 (ja) | ステレオ音声符号化装置、ステレオ音声復号装置、およびこれらの方法 | |
KR101221918B1 (ko) | 신호 처리 방법 및 장치 | |
US8311810B2 (en) | Reduced delay spatial coding and decoding apparatus and teleconferencing system | |
US7904292B2 (en) | Scalable encoding device, scalable decoding device, and method thereof | |
WO2009081567A1 (ja) | ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法 | |
WO2006070757A1 (ja) | 音声符号化装置および音声符号化方法 | |
JP4555299B2 (ja) | スケーラブル符号化装置およびスケーラブル符号化方法 | |
JP2008146081A (ja) | 冗長性低減方法 | |
JPWO2008007700A1 (ja) | 音声復号装置、音声符号化装置、および消失フレーム補償方法 | |
JPH06202696A (ja) | 音声復号化装置 | |
JPWO2009084226A1 (ja) | ステレオ音声復号装置、ステレオ音声符号化装置、および消失フレーム補償方法 | |
JP4842147B2 (ja) | スケーラブル符号化装置およびスケーラブル符号化方法 | |
WO2006104017A1 (ja) | 音声符号化装置および音声符号化方法 | |
US20090041255A1 (en) | Scalable encoding device and scalable encoding method | |
WO2005066937A1 (ja) | 信号復号化装置及び信号復号化方法 | |
JP4287637B2 (ja) | 音声符号化装置、音声符号化方法及びプログラム | |
US7860711B2 (en) | Transmitter and receiver for speech coding and decoding by using additional bit allocation method | |
JP4558734B2 (ja) | 信号復号化装置 | |
WO2009122757A1 (ja) | ステレオ信号変換装置、ステレオ信号逆変換装置およびこれらの方法 | |
JP4365653B2 (ja) | 音声信号送信装置、音声信号伝送システム及び音声信号送信方法 | |
US20100121633A1 (en) | Stereo audio encoding device and stereo audio encoding method | |
JP5425066B2 (ja) | 量子化装置、符号化装置およびこれらの方法 | |
JP3811110B2 (ja) | ディジタル信号符号化方法、復号化方法、これらの装置、プログラム及び記録媒体 | |
JP2005091749A (ja) | 音源信号符号化装置、及び音源信号符号化方法 | |
JP4351684B2 (ja) | ディジタル信号復号化方法、装置、プログラム及び記録媒体 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07740393 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2008509811 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: 12295073 Country of ref document: US |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07740393 Country of ref document: EP Kind code of ref document: A1 |