WO2006025337A1 - ステレオ信号生成装置およびステレオ信号生成方法 - Google Patents
ステレオ信号生成装置およびステレオ信号生成方法 Download PDFInfo
- Publication number
- WO2006025337A1 WO2006025337A1 PCT/JP2005/015674 JP2005015674W WO2006025337A1 WO 2006025337 A1 WO2006025337 A1 WO 2006025337A1 JP 2005015674 W JP2005015674 W JP 2005015674W WO 2006025337 A1 WO2006025337 A1 WO 2006025337A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- signal
- stereo
- sign
- monaural
- channel
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims description 25
- 238000001228 spectrum Methods 0.000 claims abstract description 73
- 238000004364 calculation method Methods 0.000 claims description 33
- 238000004458 analytical method Methods 0.000 claims description 29
- 230000007274 generation of a signal involved in cell-cell signaling Effects 0.000 claims description 26
- 238000006243 chemical reaction Methods 0.000 claims description 8
- 238000012546 transfer Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 10
- 238000012545 processing Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 230000010354 integration Effects 0.000 description 3
- 238000000926 separation method Methods 0.000 description 3
- 230000005236 sound signal Effects 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 238000009826 distribution Methods 0.000 description 1
- 230000009977 dual effect Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010422 painting Methods 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S1/00—Two-channel systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/02—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo four-channel type, e.g. in which rear channel signals are derived from two-channel stereo signals
Definitions
- the present invention relates to a stereo signal generation device and a stereo signal generation method, and more particularly to a stereo signal generation device and a stereo signal generation method for generating a stereo signal from a monaural signal and signal parameters.
- the stereo function helps to improve the perceived audio quality.
- One application of the stereo function is high-quality teleconference equipment that can identify the location of a speaker in a situation where there are multiple speakers at the same time.
- stereo audio codecs are less common than stereo audio codecs.
- stereophonic sound codes can be realized in various ways, and stereo functions are considered standard in audio codes.
- Stereo effects can be achieved by coding the left and right channels independently as dual mono.
- Joint stereo can be performed using mid-side (MS) stereo and intensity (I) stereo. By using these two methods together, a higher compression ratio can be realized.
- MS mid-side
- I intensity
- MS stereo the correlation between stereo channels is used.
- MS stereo stereo imaging of signals that are prone to aliasing distortion is also affected when coding is done at low bit rates for narrow bandwidth transmission.
- I stereo the ability of the human auditory system to decompose high frequency components is reduced in the high frequency region, so I stereo is effective only in the high frequency region and not in the low frequency region. ,.
- most speech codecs are considered to be parametric coding that functions by modeling the human vocal tract with parameters using a variation of the linear prediction method.
- the joint stereo method is also a stereo method. Suitable for audio codecs.
- Non-Patent Document 1 As another speech codec method, there is a method using cross channel prediction (see Non-Patent Document 1, for example). In this method, redundancy such as intensity difference, delay difference, and spatial difference between stereophonic sound channels is modeled by utilizing the existence of interchannel correlation in the stereoacoustic signal.
- an audio codec method there is a method using parametric spatial audio (see, for example, Patent Document 1).
- the basic idea of this method is to represent a speech signal using a set of parameters. These parameters representing the audio signal are used on the decoding side to re-synthesize a signal that is perceptually similar to the original sound.
- the parameters are calculated for each band.
- Each subband consists of several frequency components or band coefficients, and the number of components increases with higher frequency subbands.
- One of the parameters calculated for this is the inter-channel level difference. This parameter is the power ratio between the left channel (L channel) and the right channel (R channel).
- This inter-channel level difference is used on the decoding side to correct the band coefficient. Since one interchannel level difference is calculated for each subband, the same interchannel level difference is applied to all band coefficients in that subband. This means that the same modification factor is applied to all band coefficients in the subband.
- Patent Document 1 International Publication No. 03Z090208 Pamphlet
- Non-Patent Literature 1 Ramprashad, b. A., Stereophonic CELP coding using Cross channel Prediction, Proc.IEEE Workshop on Speech Coding, Pages: 136—138, (17-20 Sept. 2000)
- bit rate is lower as a result of using one inter-channel level difference for each subband. Adjustment of level changes across components is fairly coarse
- An object of the present invention is to provide a stereo signal generation device and a stereo signal generation method capable of obtaining a stereo signal with good reproducibility at a low bit rate.
- the stereo signal generation device of the present invention includes a conversion means for converting a time domain monaural signal obtained from the signal power of each of the left and right channels of the stereo signal into a frequency domain monaural signal, and a first of the frequency domain monaural signals.
- a power calculating means for obtaining a power spectrum of 1; a first power spectrum of the first power spectrum and a power spectrum of a left channel of the stereo signal; The first scaling ratio for the left channel is also obtained, and the second scaling ratio for the right channel is calculated from the second difference between the first power spectrum and the right channel power spectrum of the stereo signal.
- a scaling ratio calculation means to be calculated; and the frequency domain monaural signal is multiplied by the first scaling ratio to generate a left channel signal of the stereo signal, and the second scaling is applied to the frequency domain monaural signal.
- a multiplication unit that multiplies the ratio to generate a right channel signal of the stereo signal.
- a stereo signal can be obtained with a low bit rate and good reproducibility.
- FIG. 1 is a power spectrum plot diagram according to one embodiment of the present invention.
- FIG. 2 Power spectrum plot diagram according to the above embodiment
- FIG. 4 Power spectrum plot according to the above embodiment.
- FIG. 5 Power spectrum plot of stereo signal frame according to the above embodiment (L channel)
- FIG. 6 Power spectrum plot of stereo signal frame according to the above embodiment (R channel)
- FIG. 7 is a block diagram showing the configuration of the code decoding system according to the above embodiment
- FIG. 8 is a block diagram showing the configuration of the LPC analysis unit according to the above embodiment
- FIG. 9 is a block diagram showing a configuration of a power spectrum calculation unit according to the above embodiment
- FIG. 10 is a block diagram showing a configuration of a stereo signal generation device according to the above embodiment.
- FIG. 11 is a block diagram showing another configuration of the stereo signal generation device according to the above embodiment.
- FIG. 12 is a block diagram showing a configuration of a power spectrum calculation unit according to the above embodiment
- FIG. 13 is a block diagram showing another configuration of the LPC analysis unit according to the above embodiment
- FIG. 14 is a block diagram showing another configuration of the power spectrum calculation unit according to the above embodiment.
- BEST MODE FOR CARRYING OUT THE INVENTION a stereo signal is generated using a mono signal and a set of LPC parameters from a stereo source.
- L channel and R channel stereo signals are generated using the L channel and R channel electric vector envelopes and the monaural signal.
- the power spectrum envelope can be thought of as an approximation to the energy spread of each channel. Therefore, the L channel and R channel signals can be generated using the approximate energy dispersion of the L channel and the R channel in addition to the monaural signal.
- the monaural signal can be coded and decoded using a standard speech coder Z decoder or audio coder Z decoder.
- the spectral envelope is calculated using the properties of the LPC analysis.
- the envelope of the signal power spectrum P can be obtained by plotting the transfer function H (z) of the all-pole filter as shown in the following equation (1).
- the dotted line represents the actual signal power
- the solid line represents the envelope of the signal power obtained using the above equation (1).
- FIG. 5 and FIG. 6 show power spectrum plots of a stereo signal frame.
- Figure 5 shows the L channel envelope
- Figure 6 shows the R channel envelope. From Fig. 5 and Fig. 6, it can be seen that the L channel envelope force and the R channel envelope force are different from each other.
- the L channel signal and the R channel signal of the stereo signal can be configured based on the power spectrum and the monaural signal of the L channel and the R channel. Therefore, this departure In MEI, the stereo output signal is generated using only LPC parameters from a stereo source in addition to the monaural signal.
- the monaural signal can be encoded by a standard encoder.
- LPC parameters are transmitted as additional information, transmission of LPC parameters requires less bandwidth than when encoded L channel signals and R channel signals are transmitted independently.
- FIG. 7 shows the configuration of a coding Z decoding system according to an embodiment of the present invention.
- the encoding device is configured to include a downmixing unit 10, a encoding unit 20, an LPC analysis unit 30, and a multiplexing unit 40.
- the decoding device includes a separation unit 60, a decoding unit 70, an electric spectrum calculation unit 80, and a stereo signal generation device 90. It is assumed that the L channel signal L and the R channel signal R input to the encoder are already in a digital format.
- the downmixing unit 10 generates a time-domain monaural signal M by downmixing the input L signal and R signal.
- the encoding unit 20 encodes the monaural signal M and outputs it to the multiplexing unit 40.
- the code key unit 20 may be a shift of the audio encoder or the audio encoder.
- the LPC analysis unit 30 analyzes the L signal and the R signal by LPC analysis, obtains LPC parameters for the L channel and the R channel, and outputs them to the multiplexing unit 40.
- the multiplexing unit 40 transmits a bit stream obtained by multiplexing the encoded monaural data and the LPC parameter to the decoding device via the communication path 50.
- the separation unit 60 separates the received bit stream into monaural data and LPC parameters.
- the monaural data is input to the decoding unit 70, and the LPC parameters are input to the power spectrum calculation unit 80.
- the decoding unit 70 decodes monaural data. Thereby, a monaural signal M ′ t in the time domain is obtained.
- the mono signal M ′ in the time domain is input to the stereo signal generator 90. At the same time, it is output from the decoding device.
- the power spectrum calculation unit 80 obtains power spectra P and ⁇ ⁇ ⁇ of the L channel and the R channel using the input LPC parameters.
- the plot of the power spectrum obtained here is
- the power spectrum ⁇ , ⁇ is the stereo signal generator 90
- the stereo signal generator 90 generates and outputs a stereo signal R ′ using these three parameters, that is, the time domain monaural signal ⁇ and the power spectrum ⁇ and ⁇ .
- the LPC analysis unit 30 includes an LPC analysis unit 301a for the L channel and an LPC analysis unit 30 lb for the R channel.
- the L channel LPC parameters and the R channel LPC parameters are multiplexed with the monaural data by the multiplexing unit 40 to generate a bit stream. This bit stream is transmitted to the decoding device via the communication path 50.
- the electric spectrum calculation unit 80 includes impulse response forming units 801a and 801b, FT (frequency conversion) units 802a and 802b, and logarithmic calculation units 803a and 803b.
- the power spectrum calculation unit 80 receives the LPC parameters (that is, the LPC coefficient a a) and the LPC gain GG of each channel obtained by separating the bit stream by the separation unit 60.
- the impulse response forming unit 801a performs LPC coefficient a and LPC.
- the impulse response h (n) is formed using the gain G and output to the FT unit 802a.
- the logarithmic operation unit 803a obtains and plots the logarithmic amplitude of the transfer function response H (z). This gives an envelope of the approximate power spectrum P of the L channel signal.
- the power spectrum P is expressed by the following equation (3).
- the impulse response forming unit 801b force LPC coefficient a and
- the impulse response h (n) is formed using the LPC gain G and output to the FT unit 802b.
- the unit 802b converts the impulse response h (n) into the frequency domain to obtain a transfer function H (z).
- the logarithmic operation unit 803b obtains and plots the logarithmic amplitude of the transfer function response H (z). to this
- the force spectrum P is expressed by the following equation (5).
- L channel power spectrum P and R channel power spectrum P are stereo signals
- the stereo signal generating apparatus 90 receives the time domain monaural signal M ′ decoded by the decoding unit 70.
- the stereo signal generator 90 includes a time domain monaural signal M ′ and an L channel power spectrum. P and R channel power spectrum P is input.
- the FT (frequency conversion) unit 901 uses a time domain monaural signal M ′ using a frequency conversion function.
- the power spectrum calculation unit 902 obtains the power spectrum P of the monaural signal M' according to the following equation (6). Note that the monaural signal M 'is zero.
- the power spectrum calculation unit 902 sets the power spectrum P to zero.
- the subtraction unit 903a sets the difference value D to zero.
- the scaling ratio calculation unit 904a uses the difference value D to perform the L check according to the following equation (8).
- Scaling ratio S is set to 1.
- subtraction unit 903b obtains difference D between R channel electric spectrum P and monaural signal power spectrum P according to the following equation (9).
- the subtractor 903a sets the difference value D to zero.
- the scaling ratio calculation unit 904b uses the difference value D to perform an R check according to the following equation (10).
- Multiplication section 905a multiplies monaural signal M 'and scaling ratio S for the L channel, as shown in the following equation (11). Further, the multiplication unit 905b is configured as shown in the following formula (12).
- the code determining unit 100 performs the following processing to determine the correct codes of the L channel signal R ′ and the R channel signal R ′′.
- the sum signal M is obtained by the adder 906a and the divider 907a according to the following equation (13).
- the adding unit 906a adds the L channel signal R ′ and the R channel signal R ′′, and the dividing unit 907a divides the result by two.
- the subtraction unit 906b and the division unit 907b obtain the difference signal M according to the following equation (14).
- the subtraction unit 906a obtains the difference between the L channel signal and the R channel signal R ", and the division unit 907b divides the result by 2.
- the absolute value calculation unit 908a calculates the absolute value of the sum signal M ; and the subtraction unit 910a calculates the absolute value of the monaural signal M ′ calculated by the absolute value calculation unit 909 and the absolute value of the sum signal M.
- the absolute value calculation unit 911a obtains the absolute value D of the difference value calculated by the absolute value calculation unit 910a.
- the absolute value D is input to the comparison unit 915.
- the absolute value calculation unit 908b obtains the absolute value of the difference signal M
- the subtraction unit 910b calculates the absolute value of the monaural signal M ′ calculated by the absolute value calculation unit 909 and the absolute value of the difference signal M.
- the absolute value calculation unit 911b obtains the absolute value D of the difference value calculated by the absolute value calculation unit 910b. Therefore, the absolute value D calculated by the absolute value calculation unit 9 l ib is expressed by the following equation (16).
- the absolute value D is input to the comparison unit 915.
- the sign of the monaural signal M ' is determined by the determination unit 912, and the determination result S is compared with the ratio.
- the sign of the sum signal M is determined by the determination unit 913a, and the determination result S is input to the comparison unit 915. Further, the sign of the difference signal M is determined by the determination unit 913b.
- the determination result S is input to the comparison unit 915. Furthermore, obtained by the multiplication unit 905a
- the L channel signal ' is input to the comparison unit 915 as it is, and the sign of the L channel signal' is inverted by the inversion unit 914a to be input to the comparison unit 915. Further, the R channel signal R ′′ obtained by the multiplication unit 905b is directly input to the comparison unit 915, and the sign of the R channel signal R ′′ is inverted by the inversion unit 914b to become ⁇ R ′′. Input to 915.
- the comparison unit 915 determines the correct codes of the L channel signal R 'and the R channel signal R ". In comparison unit 915, first, comparison is made between absolute value D and absolute value D. And
- the comparison unit 915 When the absolute value D is less than or equal to the absolute value D, the comparison unit 915 finally outputs the time region.
- the L channel output signal in the domain and the R channel output signal R ′ in the time domain are determined to have the same sign, either positive or negative. Further, the comparison unit 915 compares the code S and the code S in order to determine the actual codes of the L channel output signal and the R channel output signal R ′.
- the comparison unit 915 compares the positive L channel signal when the code S and the code S are the same.
- the comparator 915 outputs the negative L channel signal.
- the comparison unit 915 when the absolute value D is larger than the absolute value D, the comparison unit 915 finally outputs it.
- the time domain L channel output signal and the time domain R channel output signal R ′ are determined to have different signs. Further, the comparison unit 915 determines the actual sign of the L channel output signal and the R channel output signal R ′ in order to determine the sign S and
- the negative L channel signal ' is the L channel output signal and the positive R channel signal R "is the R channel output signal R'.
- the comparator 915 if the code S and the code S are different, the comparator 915 ,
- the positive L channel signal ' is the L channel output signal
- the negative R channel signal R " is the R channel output signal R'.
- the processing in this comparison unit 915 is summarized as follows: And Equation (20).
- the code determination unit 100 sets the code of the signal of one channel as the code of the average value of the immediately preceding signal and the immediately following signal in the channel, It can also be determined that the signal on the other channel is opposite in sign to the signal on that one channel.
- the processing in the code determination unit 100 is expressed by the following equation (23) or equation (24).
- the L channel signal and the R channel signal whose codes are determined as described above are output to IFT (Inverse Frequency Conversion) section 916a and IFT section 916b, respectively. Then, the IFT unit 916a converts the L-channel signal in the frequency domain into the time domain and outputs it as the final L-channel output signal. Also, the IFT unit 916b converts the frequency domain R channel signal into the time domain and outputs it as the final R channel output signal R ′.
- IFT Inverse Frequency Conversion
- the accuracy of the output stereo signal is related to the accuracy of the monaural signal M 'and the power spectra P, ⁇ of the L channel and the R channel.
- Monaural signal ⁇ is the original mono
- the accuracy of the output stereo signal is how close the L and R channel power spectra ⁇ and ⁇ are to the original power spectrum.
- the power spectra ⁇ and ⁇ are the LPC parameters for each channel and R
- an LPC analysis filter having a higher filter order ⁇ can represent a spectral envelope more accurately.
- the stereo signal generation apparatus adopts the configuration shown in FIG. 11, that is, the configuration in which the time-domain monaural signal ⁇ is directly input to the power spectrum calculation unit 902, the power
- the configuration of the spectrum calculation unit 902 is as shown in FIG.
- the LPC analysis unit 9021 displays the LPC parameters of the time domain monaural signal M.
- the impulse response forming unit 9022 forms an impulse response h (n) using this LPC parameter.
- the logarithm calculation unit 9024 calculates the logarithm of the transfer function H (z) and multiplies the calculation result by the coefficient
- M is the following formula ( Represented by 25).
- the present invention can also be applied to coding and decoding using subbands.
- the configuration of the LPC analysis unit 30 is as shown in FIG. 13
- the configuration of the electric spectrum calculation unit 80 is as shown in FIG.
- the SB (subband) analysis filters 302a and 302b separate the input L channel signal and R channel signal into 1 to N subbands.
- the LPC parameter of the L channel and the LPC parameter of the R channel of each subband are multiplexed with monaural data by the multiplexing unit 40, and a bit stream is generated. This bit stream is transmitted to the decoding device via the communication path 50.
- the impulse response forming unit 804a uses the LPC coefficient a and the LPC gain G of each subband 1 to N for each subband.
- FT section 805a has subband 1
- the logarithmic operation unit 806a obtains the logarithmic amplitude of the transfer function response H (z) of each of the subbands 1 to N to obtain the power spectrum P for each subband.
- impulse response forming unit 804b force impulse response h (n for each subband using LPC coefficient a and LPC gain G of each subband 1 to N
- the FT unit 805b converts the impulse response h (n) of subbands 1 to N into the frequency domain to obtain a transfer function H (z) of subbands 1 to N.
- the logarithmic operation unit 806b obtains the logarithmic amplitude of the transfer function response H (z) of each subband 1 to N.
- the decoding device Obtain the power spectrum P for each subband. [0070] In this manner, in the decoding device, the same processing as described above is performed for each subpand. After the same processing as described above is performed for all subbands, the subband synthesis filter synthesizes the outputs of all subbands to generate a final output stereo signal.
- Table 1 shows the L channel and Table 2 shows the R channel.
- the output signal and R channel output signal R ' are as follows.
- the L channel is shown in Table 3
- the R channel is shown in Table 4.
- the output signal and R channel output signal R ' are as follows.
- Table 5 shows the L channel and Table 6 shows the R channel.
- the output signal and the R channel output signal R ' are as follows.
- Table 7 shows the L channel and Table 8 shows the R channel.
- the channel output signal and the R channel output signal R ′ are as follows.
- each functional block used in the description of each of the above embodiments is typically realized as an LSI which is an integrated circuit. These may be individually made into one chip, or may be made into one chip so as to include a part or all of them.
- IC integrated circuit
- system LSI system LSI
- super LSI non-linear LSI
- non-linear LSI depending on the difference in power integration as LSI.
- the method of circuit integration is not limited to LSI's, and implementation using dedicated circuitry or general purpose processors is also possible. It is also possible to use a field programmable gate array (FPGA) that can be programmed after LSI manufacture and a reconfigurable processor that can reconfigure the connection and settings of circuit cells inside the LSI.
- FPGA field programmable gate array
- the present invention can be used for transmission, distribution and storage media of digital audio signals and digital audio signals.
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Stereophonic System (AREA)
- Stereo-Broadcasting Methods (AREA)
Abstract
Description
Claims
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/573,760 US8019087B2 (en) | 2004-08-31 | 2005-08-29 | Stereo signal generating apparatus and stereo signal generating method |
BRPI0515128-7A BRPI0515128A (pt) | 2004-08-31 | 2005-08-29 | aparelho de geração de sinal estéreo e método de geração de sinal estéreo |
EP05775181A EP1786239A1 (en) | 2004-08-31 | 2005-08-29 | Stereo signal generating apparatus and stereo signal generating method |
JP2006532681A JP4832305B2 (ja) | 2004-08-31 | 2005-08-29 | ステレオ信号生成装置およびステレオ信号生成方法 |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2004-252027 | 2004-08-31 | ||
JP2004252027 | 2004-08-31 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2006025337A1 true WO2006025337A1 (ja) | 2006-03-09 |
Family
ID=35999990
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2005/015674 WO2006025337A1 (ja) | 2004-08-31 | 2005-08-29 | ステレオ信号生成装置およびステレオ信号生成方法 |
Country Status (8)
Country | Link |
---|---|
US (1) | US8019087B2 (ja) |
EP (1) | EP1786239A1 (ja) |
JP (1) | JP4832305B2 (ja) |
KR (1) | KR20070056081A (ja) |
CN (1) | CN101010985A (ja) |
BR (1) | BRPI0515128A (ja) |
RU (1) | RU2007107348A (ja) |
WO (1) | WO2006025337A1 (ja) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008102527A1 (ja) * | 2007-02-20 | 2008-08-28 | Panasonic Corporation | マルチチャンネル復号装置、マルチチャンネル復号方法、プログラム及び半導体集積回路 |
JP2018511827A (ja) * | 2015-03-09 | 2018-04-26 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
JP2021196562A (ja) * | 2020-06-17 | 2021-12-27 | Toa株式会社 | 音響通信システム、音響送信装置、音響受信装置、プログラムおよび音響信号送信方法 |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8270439B2 (en) * | 2005-07-08 | 2012-09-18 | Activevideo Networks, Inc. | Video game system using pre-encoded digital audio mixing |
US8074248B2 (en) | 2005-07-26 | 2011-12-06 | Activevideo Networks, Inc. | System and method for providing video content associated with a source image to a television in a communication network |
US8103516B2 (en) * | 2005-11-30 | 2012-01-24 | Panasonic Corporation | Subband coding apparatus and method of coding subband |
US20090018824A1 (en) * | 2006-01-31 | 2009-01-15 | Matsushita Electric Industrial Co., Ltd. | Audio encoding device, audio decoding device, audio encoding system, audio encoding method, and audio decoding method |
CA2874451C (en) * | 2006-10-16 | 2016-09-06 | Dolby International Ab | Enhanced coding and parameter representation of multichannel downmixed object coding |
FR2911031B1 (fr) * | 2006-12-28 | 2009-04-10 | Actimagine Soc Par Actions Sim | Procede et dispositif de codage audio |
FR2911020B1 (fr) * | 2006-12-28 | 2009-05-01 | Actimagine Soc Par Actions Sim | Procede et dispositif de codage audio |
US9826197B2 (en) | 2007-01-12 | 2017-11-21 | Activevideo Networks, Inc. | Providing television broadcasts over a managed network and interactive content over an unmanaged network to a client device |
WO2008088741A2 (en) | 2007-01-12 | 2008-07-24 | Ictv, Inc. | Interactive encoded content system including object models for viewing on a remote device |
KR101379263B1 (ko) | 2007-01-12 | 2014-03-28 | 삼성전자주식회사 | 대역폭 확장 복호화 방법 및 장치 |
KR101452722B1 (ko) * | 2008-02-19 | 2014-10-23 | 삼성전자주식회사 | 신호 부호화 및 복호화 방법 및 장치 |
KR101756834B1 (ko) * | 2008-07-14 | 2017-07-12 | 삼성전자주식회사 | 오디오/스피치 신호의 부호화 및 복호화 방법 및 장치 |
WO2010084756A1 (ja) * | 2009-01-22 | 2010-07-29 | パナソニック株式会社 | ステレオ音響信号符号化装置、ステレオ音響信号復号装置およびそれらの方法 |
CN105225667B (zh) | 2009-03-17 | 2019-04-05 | 杜比国际公司 | 编码器系统、解码器系统、编码方法和解码方法 |
US8194862B2 (en) * | 2009-07-31 | 2012-06-05 | Activevideo Networks, Inc. | Video game system with mixing of independent pre-encoded digital audio bitstreams |
AU2011315950B2 (en) | 2010-10-14 | 2015-09-03 | Activevideo Networks, Inc. | Streaming digital video between video devices using a cable television system |
WO2012138660A2 (en) | 2011-04-07 | 2012-10-11 | Activevideo Networks, Inc. | Reduction of latency in video distribution networks using adaptive bit rates |
EP2523472A1 (en) | 2011-05-13 | 2012-11-14 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method and computer program for generating a stereo output signal for providing additional output channels |
EP2815582B1 (en) | 2012-01-09 | 2019-09-04 | ActiveVideo Networks, Inc. | Rendering of an interactive lean-backward user interface on a television |
CN103220058A (zh) * | 2012-01-20 | 2013-07-24 | 旭扬半导体股份有限公司 | 音频数据与视觉数据同步装置及其方法 |
US9800945B2 (en) | 2012-04-03 | 2017-10-24 | Activevideo Networks, Inc. | Class-based intelligent multiplexing over unmanaged networks |
US9123084B2 (en) | 2012-04-12 | 2015-09-01 | Activevideo Networks, Inc. | Graphical application integration with MPEG objects |
MX345622B (es) * | 2013-01-29 | 2017-02-08 | Fraunhofer Ges Forschung | Decodificador para generar una señal de audio mejorada en frecuencia, método de decodificación, codificador para generar una señal codificada y metodo de codificación utilizando informacion secundaria de selección compacta. |
WO2014145921A1 (en) | 2013-03-15 | 2014-09-18 | Activevideo Networks, Inc. | A multiple-mode system and method for providing user selectable video content |
EP3005712A1 (en) | 2013-06-06 | 2016-04-13 | ActiveVideo Networks, Inc. | Overlay rendering of user interface onto source video |
US9219922B2 (en) | 2013-06-06 | 2015-12-22 | Activevideo Networks, Inc. | System and method for exploiting scene graph information in construction of an encoded video sequence |
US9294785B2 (en) | 2013-06-06 | 2016-03-22 | Activevideo Networks, Inc. | System and method for exploiting scene graph information in construction of an encoded video sequence |
JP2016536856A (ja) * | 2013-10-02 | 2016-11-24 | ストーミングスイス・ゲゼルシャフト・ミト・ベシュレンクテル・ハフツング | 二つ以上の基本信号からのマルチチャンネル信号の導出 |
US9788029B2 (en) | 2014-04-25 | 2017-10-10 | Activevideo Networks, Inc. | Intelligent multiplexing using class-based, multi-dimensioned decision logic for managed networks |
KR20180075610A (ko) * | 2015-10-27 | 2018-07-04 | 앰비디오 인코포레이티드 | 사운드 스테이지 향상을 위한 장치 및 방법 |
CN108269577B (zh) | 2016-12-30 | 2019-10-22 | 华为技术有限公司 | 立体声编码方法及立体声编码器 |
US11176954B2 (en) * | 2017-04-10 | 2021-11-16 | Nokia Technologies Oy | Encoding and decoding of multichannel or stereo audio signals |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0847096A (ja) * | 1994-06-28 | 1996-02-16 | Internatl Business Mach Corp <Ibm> | ディジタル・サラウンド・サウンド方法及び装置 |
JP2002050969A (ja) * | 2000-08-03 | 2002-02-15 | Yrp Kokino Idotai Tsushin Kenkyusho:Kk | 通信路復号方法および装置 |
JP2004112825A (ja) * | 1995-10-25 | 2004-04-08 | Sarnoff Corp | ビデオ画像符号化方法 |
JP2004173250A (ja) * | 2002-11-21 | 2004-06-17 | Microsoft Corp | 複数因子分解可逆変換(multiplefactorizationreversibletransform)を用いたプログレッシブ・ツー・ロスレス埋込みオーディオ・コーダ(ProgressivetoLosslessEmbeddedAudioCoder:PLEAC) |
JP2004537739A (ja) * | 2000-10-18 | 2004-12-16 | ノキア コーポレーション | 音声コーデックにおける擬似高帯域信号の推定方法およびシステム |
JP2005519339A (ja) * | 2002-03-08 | 2005-06-30 | コニンクリーケ・ケイピーエヌ・ナムローゼ・フェンノートシャップ | システムの伝送品質を測定する方法及びシステム |
JP2005534947A (ja) * | 2001-11-20 | 2005-11-17 | シーラス ロジック,インコーポレイテッド | 心理音響ベースで圧縮する際に形成されるノイズの許容可能な歪みに基づくスケールファクタのフィードフォワード予測 |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH1132399A (ja) * | 1997-05-13 | 1999-02-02 | Sony Corp | 符号化方法及び装置、並びに記録媒体 |
US6230130B1 (en) | 1998-05-18 | 2001-05-08 | U.S. Philips Corporation | Scalable mixing for speech streaming |
US7006636B2 (en) * | 2002-05-24 | 2006-02-28 | Agere Systems Inc. | Coherence-based audio coding and synthesis |
US7292901B2 (en) * | 2002-06-24 | 2007-11-06 | Agere Systems Inc. | Hybrid multi-channel/cue coding/decoding of audio signals |
US20030035553A1 (en) * | 2001-08-10 | 2003-02-20 | Frank Baumgarte | Backwards-compatible perceptual coding of spatial cues |
JP3598993B2 (ja) | 2001-05-18 | 2004-12-08 | ソニー株式会社 | 符号化装置及び方法 |
JP2003015697A (ja) | 2001-06-29 | 2003-01-17 | Matsushita Electric Ind Co Ltd | オーディオ符号化用のビット割当て方法 |
SE0202159D0 (sv) * | 2001-07-10 | 2002-07-09 | Coding Technologies Sweden Ab | Efficientand scalable parametric stereo coding for low bitrate applications |
EP1500083B1 (en) | 2002-04-22 | 2006-06-28 | Koninklijke Philips Electronics N.V. | Parametric multi-channel audio representation |
US8340302B2 (en) | 2002-04-22 | 2012-12-25 | Koninklijke Philips Electronics N.V. | Parametric representation of spatial audio |
ES2280736T3 (es) | 2002-04-22 | 2007-09-16 | Koninklijke Philips Electronics N.V. | Sintetizacion de señal. |
AU2003234763A1 (en) * | 2002-04-26 | 2003-11-10 | Matsushita Electric Industrial Co., Ltd. | Coding device, decoding device, coding method, and decoding method |
JP2006503319A (ja) | 2002-10-14 | 2006-01-26 | コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ | 信号フィルタリング |
CN1748443B (zh) * | 2003-03-04 | 2010-09-22 | 诺基亚有限公司 | 多声道音频扩展支持 |
ES2295837T3 (es) * | 2004-03-12 | 2008-04-16 | Nokia Corporation | Sistesis de una señal de audio monofonica sobre la base de una señal de audio multicanal codificada. |
US7720230B2 (en) * | 2004-10-20 | 2010-05-18 | Agere Systems, Inc. | Individual channel shaping for BCC schemes and the like |
-
2005
- 2005-08-29 US US11/573,760 patent/US8019087B2/en not_active Expired - Fee Related
- 2005-08-29 JP JP2006532681A patent/JP4832305B2/ja not_active Expired - Fee Related
- 2005-08-29 EP EP05775181A patent/EP1786239A1/en not_active Withdrawn
- 2005-08-29 KR KR1020077004878A patent/KR20070056081A/ko not_active Application Discontinuation
- 2005-08-29 WO PCT/JP2005/015674 patent/WO2006025337A1/ja active Application Filing
- 2005-08-29 RU RU2007107348/09A patent/RU2007107348A/ru not_active Application Discontinuation
- 2005-08-29 BR BRPI0515128-7A patent/BRPI0515128A/pt not_active Application Discontinuation
- 2005-08-29 CN CNA2005800287049A patent/CN101010985A/zh active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0847096A (ja) * | 1994-06-28 | 1996-02-16 | Internatl Business Mach Corp <Ibm> | ディジタル・サラウンド・サウンド方法及び装置 |
JP2004112825A (ja) * | 1995-10-25 | 2004-04-08 | Sarnoff Corp | ビデオ画像符号化方法 |
JP2002050969A (ja) * | 2000-08-03 | 2002-02-15 | Yrp Kokino Idotai Tsushin Kenkyusho:Kk | 通信路復号方法および装置 |
JP2004537739A (ja) * | 2000-10-18 | 2004-12-16 | ノキア コーポレーション | 音声コーデックにおける擬似高帯域信号の推定方法およびシステム |
JP2005534947A (ja) * | 2001-11-20 | 2005-11-17 | シーラス ロジック,インコーポレイテッド | 心理音響ベースで圧縮する際に形成されるノイズの許容可能な歪みに基づくスケールファクタのフィードフォワード予測 |
JP2005519339A (ja) * | 2002-03-08 | 2005-06-30 | コニンクリーケ・ケイピーエヌ・ナムローゼ・フェンノートシャップ | システムの伝送品質を測定する方法及びシステム |
JP2004173250A (ja) * | 2002-11-21 | 2004-06-17 | Microsoft Corp | 複数因子分解可逆変換(multiplefactorizationreversibletransform)を用いたプログレッシブ・ツー・ロスレス埋込みオーディオ・コーダ(ProgressivetoLosslessEmbeddedAudioCoder:PLEAC) |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008102527A1 (ja) * | 2007-02-20 | 2008-08-28 | Panasonic Corporation | マルチチャンネル復号装置、マルチチャンネル復号方法、プログラム及び半導体集積回路 |
JP5328637B2 (ja) * | 2007-02-20 | 2013-10-30 | パナソニック株式会社 | マルチチャンネル復号装置、マルチチャンネル復号方法、プログラム及び半導体集積回路 |
US11741973B2 (en) | 2015-03-09 | 2023-08-29 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US11238874B2 (en) | 2015-03-09 | 2022-02-01 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US10395661B2 (en) | 2015-03-09 | 2019-08-27 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
JP2020038374A (ja) * | 2015-03-09 | 2020-03-12 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
US10777208B2 (en) | 2015-03-09 | 2020-09-15 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
US11107483B2 (en) | 2015-03-09 | 2021-08-31 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
JP7469350B2 (ja) | 2015-03-09 | 2024-04-16 | フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
US10388287B2 (en) | 2015-03-09 | 2019-08-20 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
JP7077290B2 (ja) | 2015-03-09 | 2022-05-30 | フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
JP2022088470A (ja) * | 2015-03-09 | 2022-06-14 | フラウンホッファー-ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
JP2018511827A (ja) * | 2015-03-09 | 2018-04-26 | フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ | マルチチャンネル信号を符号化するためのオーディオエンコーダおよび符号化されたオーディオ信号を復号化するためのオーディオデコーダ |
US11881225B2 (en) | 2015-03-09 | 2024-01-23 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder for encoding a multichannel signal and audio decoder for decoding an encoded audio signal |
JP7385531B2 (ja) | 2020-06-17 | 2023-11-22 | Toa株式会社 | 音響通信システム、音響送信装置、音響受信装置、プログラムおよび音響信号送信方法 |
JP2021196562A (ja) * | 2020-06-17 | 2021-12-27 | Toa株式会社 | 音響通信システム、音響送信装置、音響受信装置、プログラムおよび音響信号送信方法 |
Also Published As
Publication number | Publication date |
---|---|
JPWO2006025337A1 (ja) | 2008-05-08 |
JP4832305B2 (ja) | 2011-12-07 |
US8019087B2 (en) | 2011-09-13 |
RU2007107348A (ru) | 2008-09-10 |
CN101010985A (zh) | 2007-08-01 |
KR20070056081A (ko) | 2007-05-31 |
BRPI0515128A (pt) | 2008-07-08 |
EP1786239A1 (en) | 2007-05-16 |
US20080154583A1 (en) | 2008-06-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4832305B2 (ja) | ステレオ信号生成装置およびステレオ信号生成方法 | |
KR102230727B1 (ko) | 광대역 정렬 파라미터 및 복수의 협대역 정렬 파라미터들을 사용하여 다채널 신호를 인코딩 또는 디코딩하기 위한 장치 및 방법 | |
JP4934427B2 (ja) | 音声信号復号化装置及び音声信号符号化装置 | |
JP4963962B2 (ja) | マルチチャネル信号符号化装置およびマルチチャネル信号復号装置 | |
US8290783B2 (en) | Apparatus for mixing a plurality of input data streams | |
EP2109861B1 (en) | Audio decoder | |
RU2409911C2 (ru) | Декодирование бинауральных аудиосигналов | |
JP4918490B2 (ja) | エネルギー整形装置及びエネルギー整形方法 | |
US8352249B2 (en) | Encoding device, decoding device, and method thereof | |
US20130230176A1 (en) | Method and an Apparatus for Encoding/Decoding a Multichannel Audio Signal | |
US8036390B2 (en) | Scalable encoding device and scalable encoding method | |
WO2006035810A1 (ja) | スケーラブル符号化装置、スケーラブル復号装置、及びこれらの方法 | |
US20080162148A1 (en) | Scalable Encoding Apparatus And Scalable Encoding Method | |
US8548615B2 (en) | Encoder | |
JPWO2008132826A1 (ja) | ステレオ音声符号化装置およびステレオ音声符号化方法 | |
JP2006323314A (ja) | マルチチャネル音声信号をバイノーラルキュー符号化する装置 | |
KR20070035410A (ko) | 멀티 채널 오디오 신호의 공간 정보 부호화/복호화 방법 및장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU LV MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 2006532681 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2005775181 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11573760 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200580028704.9 Country of ref document: CN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007107348 Country of ref document: RU |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020077004878 Country of ref document: KR |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWP | Wipo information: published in national office |
Ref document number: 2005775181 Country of ref document: EP |
|
ENP | Entry into the national phase |
Ref document number: PI0515128 Country of ref document: BR |