EP1697930A1 - Device and method for processing a multi-channel signal - Google Patents
Device and method for processing a multi-channel signalInfo
- Publication number
- EP1697930A1 EP1697930A1 EP05715611A EP05715611A EP1697930A1 EP 1697930 A1 EP1697930 A1 EP 1697930A1 EP 05715611 A EP05715611 A EP 05715611A EP 05715611 A EP05715611 A EP 05715611A EP 1697930 A1 EP1697930 A1 EP 1697930A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- prediction
- channel
- block
- similarity
- spectral values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 20
- 230000003595 spectral effect Effects 0.000 claims abstract description 54
- 238000001914 filtration Methods 0.000 claims abstract description 31
- 238000004364 calculation method Methods 0.000 claims description 9
- 238000004590 computer program Methods 0.000 claims description 6
- 238000001228 spectrum Methods 0.000 claims description 5
- 238000004422 calculation algorithm Methods 0.000 claims description 2
- 230000006866 deterioration Effects 0.000 abstract 1
- 238000013139 quantization Methods 0.000 description 10
- 238000007493 shaping process Methods 0.000 description 10
- 230000002123 temporal effect Effects 0.000 description 9
- 230000005236 sound signal Effects 0.000 description 6
- 230000004913 activation Effects 0.000 description 5
- 230000001360 synchronised effect Effects 0.000 description 4
- 230000009466 transformation Effects 0.000 description 4
- 230000000694 effects Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000011524 similarity measure Methods 0.000 description 3
- 230000009849 deactivation Effects 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001052 transient effect Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 238000000691 measurement method Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/03—Spectral prediction for preventing pre-echo; Temporary noise shaping [TNS], e.g. in MPEG2 or MPEG4
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
Definitions
- the present invention relates to audio encoders and, more particularly, to audio encoders that are transform-based, i.e., at which a temporal representation is converted to a spectral representation at the beginning of the encoder pipeline.
- FIG. 3 A known transformation-based audio encoder is shown in FIG. 3.
- the encoder shown in FIG. 3 is shown in the international standard ISO / IEC 14496-3: 2001 (E), Subpart 4, page 4 and is also known in the art as an AAC encoder.
- An audio signal to be coded is fed in at an input 1000. This is first fed to a scaling stage 1002, in which a so-called AAC gain control is carried out in order to determine the level of the audio signal. Side information from the scaling is supplied to a bit stream formatter 1004, as shown by the arrow between block 1002 and block 1004. The scaled audio signal is then fed to an MDCT filter bank 1006. In the AAC encoder, the filter bank implements a modified discrete cosine transformation with 50% overlapping windows, the window length being determined by a block 1008.
- block 1008 is provided for transient signals to be windowed with shorter windows and more stationary signals to be windowed with longer windows.
- the purpose of this is to achieve a higher time resolution (at the expense of frequency resolution) due to the shorter windows for transient signals.
- higher frequency resolution at the expense of time resolution
- each subband signal having a certain limited bandwidth, which is caused by the corresponding subband channel in the filter bank 1006 is set, and wherein each subband signal has a certain number of subband samples.
- the following example shows the case in which the filter bank outputs successive blocks of MDCT spectral coefficients, which generally speaking represent successive short-term spectra of the audio signal to be coded at input 1000.
- TNS temporary noise shaping
- the TNS technique is used to shape the temporal form of the quantization noise within each window of the transformation. This is achieved by applying a filtering process to parts of the spectral data of each channel.
- the coding is done on a window basis.
- the following steps are carried out in order to apply the TNS tool to a window of spectral data, that is to say to a block of spectral values.
- a frequency range is selected for the TNS tool.
- a suitable choice is to cover a frequency range from 1.5 kHz up to the highest possible scale factor band with a filter. It should be noted that this frequency range depends on the sampling rate as specified in the AAC standard (ISO / IEC 14496-3: 2001 (E)).
- spectral MDCT coefficients lie in the selected target frequency range. For increased stability, coefficients corresponding to frequencies below 2.5 kHz are excluded from this process.
- Usual LPC procedures as are known from speech processing, can be used for the LPC calculation, for example the well-known Levinson-Durbin algorithm. The calculation is carried out for the maximum permissible order of the noise shaping filter.
- the expected prediction gain PG is obtained as a result of the LPC calculation. Furthermore, the reflection coefficients or Parcor coefficients are obtained.
- the TNS tool is not used. In this case, control information is written into the bit stream so that a decoder knows that no TNS processing has been carried out.
- TNS processing is applied.
- the reflection coefficients are quantized.
- the order of the noise shaping filter used is determined by removing all reflection coefficients with an absolute value less than a threshold from the "tail" of the reflection coefficient array. The number of remaining reflection coefficients is on the order of the noise shaping filter.
- a suitable threshold is 0.1.
- the remaining reflection coefficients are typically converted to linear prediction coefficients, which technique is also known as a "step-up" procedure.
- the calculated LPC coefficients are then used as encoder noise shaping filter coefficients, that is to say as prediction filter coefficients.
- This FIR filter is carried out over the specified target frequency range.
- An auto-regressive filter is used for the decoding, while a so-called oving average filter is used for the coding.
- the page information for the TNS tool is fed to the bit stream formatter, as shown by the arrow shown between the TNS processing block 1010 and the bit stream formatter 1004 in FIG. 3.
- the center / side encoder 1012 is active when the audio signal to be encoded is a multi-channel signal, that is to say a stereo signal with a left channel and a right channel.
- the left and right stereo channels have been processed separately from one another, i.e. scaled, transformed by the filter bank, subjected to TNS processing or not, etc.
- the middle / side encoder it is first checked whether a middle / side coding makes sense, that is to say brings a coding gain at all.
- a center / side coding will bring a coding gain if the left and right channels are more similar, because then the center channel, i.e. the sum of the left and right channels, is almost equal to the left or right channel, apart from scaling by a factor of 1/2, while the side channel has very small values because it is equal to the difference between the left and right channels.
- a permitted disturbance per scale factor band is supplied to the quantizer 1014 by a psycho-acoustic model 1020.
- the quantizer works iteratively, i. H. an outer iteration loop is first called, which then calls an inner iteration loop.
- an outer iteration loop is first called, which then calls an inner iteration loop.
- a block of values is first quantized at the input of quantizer 1014.
- the inner loop quantizes the MDCT coefficients, consuming a certain number of bits.
- the outer loop calculates the distortion and modified energy of the coefficients using the scale factor to call an inner loop again. This process is iterated until a certain set of conditions is met.
- the signal is reconstructed in order to calculate the disturbance introduced by the quantization and to compare it with the permitted disturbance provided by the psycho-acoustic model 1020. Furthermore, the scale factors are increased by iteration from iteration to iteration, for each iteration of the outer iteration loop.
- the iteration that is, the analysis-by-synthesis method is ended, and the scale factors obtained are encoded, as is carried out in block 1014 and in coded form to bitstream formatter 1004. as indicated by the arrow drawn between block 1014 and block 1004.
- the quantized values are then fed to the entropy encoder 1016, which typically performs entropy coding for multiple scale factor bands using multiple Huffman code tables to translate the quantized values into a binary format.
- entropy coding in the form of Huffman coding uses code tables which are created on the basis of expected signal statistics and in which frequently occurring values are given shorter code words than less frequently occurring values.
- the entropy-coded values are then also supplied as actual main information to the bit stream formatter 1004, which then outputs the coded audio signal on the output side in accordance with a specific bit stream syntax.
- predictive filtering is used in the TNS processing block 1010 to temporally form the quantization noise within a coding frame.
- the temporal shaping of the quantization noise is carried out by filtering the spectral coefficients over the frequency in the encoder before the quantization and subsequent inverse filtering in the decoder.
- the TNS processing causes the envelope of the quantization noise to be shifted below the envelope of the signal in order to avoid pre-echo artifacts.
- the application of the TNS results from an estimate of the prediction gain of the filtering, as was explained above.
- the filter coefficients for each coding frame are determined via a correlation measure.
- the filter coefficients are calculated separately for each channel. They are also transmitted separately in the coded bit stream.
- a disadvantage of the activation / deactivation of the TNS concept is the fact that for each stereo channel, if once TNS processing has been activated due to the good expected coding gain, TNS filtering takes place separately for each channel. So this is not a problem with relatively different channels.
- the left and right channels are relatively similar, in an extreme example, the left and right channels have exactly the same useful information as a speaker, for example, and differ only in terms of the noise inevitably contained in the channels technology nevertheless calculates and uses a separate TNS filter for each channel.
- TNS filter Since the TNS filter is directly dependent on the left and right channels, and is relatively sensitive to the spectral data of the left and right channels in particular, it is also used in the case of a signal in which the left and right channels are very similar In the case of a so-called "quasi-mono signal", TNS processing is carried out for each channel with its own prediction filter. This means that due to the different filter coefficients, different temporal noise shaping takes place in the two stereo channels.
- the known procedure has a possibly possibly more serious disadvantage.
- the TNS output values that is to say the spectral residual values
- the TNS output values are subjected to a center / side coding in the center / side encoder 1002 of FIG. 3. While the two channels were relatively the same before TNS processing, this cannot be said after TNS processing.
- the described stereo effect which was introduced by the separate TNS processing, makes the spectral residual values of the two channels more dissimilar than they actually would be. This leads to one immediate decrease in coding gain due to the center / side coding, which is particularly disadvantageous especially for applications in which a low bit rate is required.
- the known TNS activation is therefore problematic for stereo signals that use similar but not exactly identical signal information in both channels, such as mono-similar speech signals. If different filter coefficients are determined for the TNS detection for both channels, this leads to a temporally different shaping of the quantization noise in the channels. This can lead to audible artifacts, e.g. B. the original mono-like sound through these temporal differences gets an undesirable stereo character. Furthermore, as has been explained, the TNS-modified spectrum is subjected to middle / side coding in a subsequent step. Different filters in both channels additionally reduce the similarity of the spectral coefficients and thus the center / side gain.
- DE 19829284C2 discloses a method and an apparatus for processing a temporal stereo signal and a method and an apparatus for decoding an audio bit stream coded using prediction over frequency.
- the left, the right and the mono channel of their own prediction over the frequency i. H. undergo TNS processing. This means that a complete prediction can be made for each channel.
- the prediction coefficients for the left channel can be calculated, which are then used to filter the right channel and the mono channel.
- the object of the present invention is to create a concept for processing a multi-channel signal. fen, which enables less artifacts and yet a good compression of the information.
- the present invention is based on the finding that if the left and right channels are similar, ie exceed a similarity measure, the same TNS filtering must be used for both channels. This ensures that no pseudo stereo artifacts are introduced into the multichannel signal by the TNS processing, since by using the same prediction filter for both channels, the temporal shaping of the quantization noise takes place identically for both channels, ie that no pseudo stereo artifacts can be heard.
- the similarity of the signals after TNS filtering corresponds to the similarity of the input signals to the filters and not, as in the prior art, to the similarity of the input signals, which is still reduced by different filters.
- FIG. 1 shows a block diagram of a device according to the invention for processing a multi-channel signal
- FIG. 2 shows a preferred embodiment of the device for determining a similarity and the device for performing the prediction filtering
- FIG. 3 shows a block diagram of a known audio encoder in accordance with the AAC standard.
- FIG. 1 shows a device for processing a multichannel signal, the multichannel signal being represented by a block of spectral values for at least two channels, as shown by L and R.
- the blocks of spectral values are represented by e.g. B. MDCT filtering by means of an MDCT filter bank 10 determined from time-domain samples l (t) or r (t) for each channel.
- the blocks of spectral values for each channel are then fed to a device 12 for determining a similarity between the two channels.
- the device for determining the similarity between the two channels can also, as shown in FIG. 1, under Use time domain samples l (t) or r (t) for each channel.
- the device 12 for determining the similarity between the first and the second channel is operative to generate a control signal on a control line 14, which has at least two states, one of which expresses, based on a measure of similarity or alternatively a measure of dissimilarity. that the blocks of spectral values of the two channels are similar, or that in its other state states that the blocks of spectral values are different for each channel.
- the decision as to whether similarity or dissimilarity prevails can be made using a preferably numerical similarity measure.
- Both the block of spectral values for the left channel and the block of spectral values for the right channel are fed to a device 16 for performing a prediction filtering.
- a prediction filtering is carried out over the frequency, the device being designed for performing, in order to carry out the prediction over the frequency, a common prediction filter 16a for the block of spectral values of the first channel and for the block of spectral values of the second channel if the similarity is greater than a threshold similarity.
- the device 16 for performing the prediction filtering is informed by the device 12 for determining a similarity that the two blocks of spectral values for each channel are dissimilar, that is to say have a similarity that is smaller than a threshold similarity, the device 16 for performing a similarity Prediction filtering, apply different filters 16b to the left and right channels.
- the output signals of the device 16 are thus spectral residual values of the left channel at an output 18a as well as spectral residual values of the right channel at an output 18b, wherein, depending on the similarity of the left and the right channel, the spectral residual values of the two channels using the same prediction filter (Case 16a) or using different prediction filters (case 16b).
- the spectral residual values of the left and the right channel can either be directly or after several processing operations, as they are e.g. B. are provided in the AAC standard, a center / side stereo encoder which outputs the center signal as an half of the sum of the left and right channel at an output 21a, while the side signal as half the difference of the left and right channel is output.
- the side signal if there was previously a high similarity between the channels, is now smaller than in the case where different TNS filters are used for similar channels due to the synchronization of the TNS processing of the two channels become, which therefore, due to the fact that the side signal is smaller, promises a higher coding gain.
- a preferred exemplary embodiment of the present invention is shown below with reference to FIG. 2, in which the first stage of the TNS calculation is already carried out in the device 12 for determining a similarity, namely the calculation of the Parcor or reflection coefficients and the Prediction gain for both the left channel and the right channel, as represented by blocks 12a, 12b.
- This TNS processing thus provides both the filter coefficients for the prediction filter ultimately to be used as well as the prediction gain, this prediction gain also being required to decide whether TNS processing should be carried out at all or not.
- the prediction gain for the first left channel which is denoted by PG1 in FIG. 2, as well as the prediction gain for the right channel, which is denoted by PG2 in FIG. 2, is fed into a similarity measure determination device, which is shown in FIG. 2 is designated 12c.
- This similarity determination device is effective to calculate the absolute amount of the difference or the relative difference of the two prediction gains and to see whether it is below a predetermined deviation threshold S. If the absolute amount of the difference in the prediction gains is below the threshold S, it is assumed that the two signals are similar and the question in block 12c is answered with yes. If, on the other hand, it is found that the difference is greater than the similarity threshold S, the question is answered with no.
- the device 16 uses a common filter for both channels L and R, while if the question is answered in block 12c, separate filters with no are used, i.e. TNS processing, as in the prior art the technology can be carried out.
- the device 16 is supplied with a set of filter coefficients FKL for the left channel and a set of filter coefficients FKR for the right channel by the devices 12a and 12b.
- a special selection is made in a block 16c for filtering by means of a common filter.
- block 16c it is decided which channel has the greater energy. If it is determined that the left channel has the greater energy, the filter coefficients FKL calculated by the device 12a for the left channel are used for the common filtering. If, on the other hand, it is determined in block 16c that the right channel has the greater energy, the set of filter coefficients FKR which has been calculated for the right channel in the device 12b is used for the common filtering.
- both the time signal and the spectral signal can be used for energy determination. Due to the fact that transformation artifacts that may have already occurred are contained in the spectral signal, it is preferred to use the spectral signals of the left and right channels for the “energy decision” in block 16c.
- TNS synchronization that is to say the use of the same filter coefficients, is used for both channels if the prediction gains for the left and right channels differ by less than three percent. If the two channels differ by more than three percent, the question in block 12c of FIG. 2 is answered with “no”.
- the prediction gains of the two channels in the Filtering compared. If a difference in the prediction gains falls below a certain threshold, the same TNS filtering is applied to both channels in order to avoid the problems described.
- the reflection coefficients of the two separately calculated TNS filters can also be compared.
- the similarity determination can also be achieved using other details of the signal, so that when a similarity has been determined, only the TNS filter coefficient set for the channel that is used for the prediction filtering of both stereo channels has to be calculated. This has the advantage that when looking at Figure 2 and when the signals are similar, only either block 12a or block 12b will be active.
- the concept according to the invention can also be used to further reduce the bit rate of the coded signal. While different TNS side information is transmitted for both channels when using two different reflection coefficients, when filtering the two channels with the same prediction filter, TNS information only has to be transmitted once for both channels. Therefore, the inventive concept can also achieve a reduction in the bit rate in such a way that a set of TNS side information is "saved" if the left and right channels are similar.
- the concept according to the invention is not fundamentally limited to stereo signals, but could be used in a multi-channel environment between different channel pairs or groups of more than 2 channels.
- a determination of the cross-correlation measure k between the left and right channels or a determination of the TNS prediction gain and the TNS filter coefficients can be carried out separately for each channel.
- the synchronization decision is made if k exceeds a threshold (e.g. 0.6) and MS stereo coding is activated.
- a threshold e.g. 0.6
- MS stereo coding is activated.
- the MS criterion can also be omitted.
- the reference channel is determined, whose TNS filter is to be adopted for the other channel. For example, the channel with the greater energy is used as the reference channel.
- the TNS filter coefficients are then copied from the reference channel to the other channel.
- the TNS prediction gain and the TNS filter coefficients are determined separately for each channel. Then a decision is made. If the prediction gain of both channels does not differ by more than a certain amount, e.g. B. 3%, the synchronization takes place.
- the reference channel can also be chosen arbitrarily if one can assume that the channels are similar.
- the TNS filter coefficients are copied from the reference channel to the other channel, whereupon the synchronized or non-synchronized TNS filters are applied to the spectrum.
- Alternative options are as follows: Whether TNS is basically activated in a channel depends on the prediction gain in this channel. If this exceeds a certain threshold, TNS is activated for this channel.
- a TNS synchronization for 2 channels is carried out if TNS was only activated in one of the two channels.
- the condition is then that, for example, the prediction gain is similar, ie a channel just above the activation limit and a channel just below the activation limit.
- the activation of TNS for both channels with the same coefficients is then derived from this comparison, or under certain circumstances also the deactivation for both channels.
- the method according to the invention for processing a multi-channel signal can be implemented in hardware or in software.
- the implementation can take place on a digital storage medium, in particular a floppy disk or CD with electronically readable control signals, which can cooperate with a programmable computer system such that the method is carried out.
- the invention thus also consists in a computer program product with a program code stored on a machine-readable carrier for carrying out the method according to the invention when the computer program product runs on a computer.
- the invention can thus be implemented as a computer program with a program code for carrying out the method if the computer program runs on a computer.
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004009954A DE102004009954B4 (en) | 2004-03-01 | 2004-03-01 | Apparatus and method for processing a multi-channel signal |
PCT/EP2005/002110 WO2005083678A1 (en) | 2004-03-01 | 2005-02-28 | Device and method for processing a multi-channel signal |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1697930A1 true EP1697930A1 (en) | 2006-09-06 |
EP1697930B1 EP1697930B1 (en) | 2007-06-13 |
Family
ID=34894904
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP05715611A Active EP1697930B1 (en) | 2004-03-01 | 2005-02-28 | Device and method for processing a multi-channel signal |
Country Status (18)
Country | Link |
---|---|
US (1) | US7340391B2 (en) |
EP (1) | EP1697930B1 (en) |
JP (1) | JP4413257B2 (en) |
KR (1) | KR100823097B1 (en) |
CN (1) | CN1926608B (en) |
AT (1) | ATE364882T1 (en) |
AU (1) | AU2005217517B2 (en) |
BR (1) | BRPI0507207B1 (en) |
CA (1) | CA2558161C (en) |
DE (2) | DE102004009954B4 (en) |
DK (1) | DK1697930T3 (en) |
ES (1) | ES2286798T3 (en) |
HK (1) | HK1095194A1 (en) |
IL (1) | IL177213A (en) |
NO (1) | NO339114B1 (en) |
PT (1) | PT1697930E (en) |
RU (1) | RU2332727C2 (en) |
WO (1) | WO2005083678A1 (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8063809B2 (en) | 2008-12-29 | 2011-11-22 | Huawei Technologies Co., Ltd. | Transient signal encoding method and device, decoding method and device, and processing system |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7809579B2 (en) * | 2003-12-19 | 2010-10-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Fidelity-optimized variable frame length encoding |
US7725324B2 (en) * | 2003-12-19 | 2010-05-25 | Telefonaktiebolaget Lm Ericsson (Publ) | Constrained filter encoding of polyphonic signals |
US9626973B2 (en) * | 2005-02-23 | 2017-04-18 | Telefonaktiebolaget L M Ericsson (Publ) | Adaptive bit allocation for multi-channel audio encoding |
KR100718416B1 (en) | 2006-06-28 | 2007-05-14 | 주식회사 대우일렉트로닉스 | Method for coding stereo audio signal between channels using prediction filter |
JP4940888B2 (en) * | 2006-10-23 | 2012-05-30 | ソニー株式会社 | Audio signal expansion and compression apparatus and method |
KR20080053739A (en) * | 2006-12-11 | 2008-06-16 | 삼성전자주식회사 | Apparatus and method for encoding and decoding by applying to adaptive window size |
US20100100372A1 (en) * | 2007-01-26 | 2010-04-22 | Panasonic Corporation | Stereo encoding device, stereo decoding device, and their method |
US8086465B2 (en) | 2007-03-20 | 2011-12-27 | Microsoft Corporation | Transform domain transcoding and decoding of audio data using integer-reversible modulated lapped transforms |
US7991622B2 (en) * | 2007-03-20 | 2011-08-02 | Microsoft Corporation | Audio compression and decompression using integer-reversible modulated lapped transforms |
ATE547786T1 (en) * | 2007-03-30 | 2012-03-15 | Panasonic Corp | CODING DEVICE AND CODING METHOD |
CN101067931B (en) * | 2007-05-10 | 2011-04-20 | 芯晟(北京)科技有限公司 | Efficient configurable frequency domain parameter stereo-sound and multi-sound channel coding and decoding method and system |
EP2264698A4 (en) * | 2008-04-04 | 2012-06-13 | Panasonic Corp | Stereo signal converter, stereo signal reverse converter, and methods for both |
EP2273493B1 (en) * | 2009-06-29 | 2012-12-19 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Bandwidth extension encoding and decoding |
EP3779977B1 (en) * | 2010-04-13 | 2023-06-21 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder for processing stereo audio using a variable prediction direction |
US8891775B2 (en) * | 2011-05-09 | 2014-11-18 | Dolby International Ab | Method and encoder for processing a digital stereo audio signal |
CN104269173B (en) * | 2014-09-30 | 2018-03-13 | 武汉大学深圳研究院 | The audio bandwidth expansion apparatus and method of switch mode |
CN108352163B (en) * | 2015-09-25 | 2023-02-21 | 沃伊斯亚吉公司 | Method and system for decoding left and right channels of a stereo sound signal |
CN107659888A (en) * | 2017-08-21 | 2018-02-02 | 广州酷狗计算机科技有限公司 | Identify the method, apparatus and storage medium of pseudostereo audio |
EP3483879A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Analysis/synthesis windowing function for modulated lapped transformation |
EP3483882A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Controlling bandwidth in encoders and/or decoders |
EP3483880A1 (en) * | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal noise shaping |
EP3483883A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio coding and decoding with selective postfiltering |
EP3483886A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Selecting pitch lag |
EP3483878A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio decoder supporting a set of different loss concealment tools |
WO2019091573A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters |
EP3483884A1 (en) | 2017-11-10 | 2019-05-15 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Signal filtering |
WO2019091576A1 (en) | 2017-11-10 | 2019-05-16 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits |
CN108962268B (en) * | 2018-07-26 | 2020-11-03 | 广州酷狗计算机科技有限公司 | Method and apparatus for determining monophonic audio |
CN112151045A (en) | 2019-06-29 | 2020-12-29 | 华为技术有限公司 | Stereo coding method, stereo decoding method and device |
CN111654745B (en) * | 2020-06-08 | 2022-10-14 | 海信视像科技股份有限公司 | Multi-channel signal processing method and display device |
CN112053669B (en) * | 2020-08-27 | 2023-10-27 | 海信视像科技股份有限公司 | Method, device, equipment and medium for eliminating human voice |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5488665A (en) * | 1993-11-23 | 1996-01-30 | At&T Corp. | Multi-channel perceptual audio compression system with encoding mode switching among matrixed channels |
US5812971A (en) * | 1996-03-22 | 1998-09-22 | Lucent Technologies Inc. | Enhanced joint stereo coding method using temporal envelope shaping |
US5913187A (en) * | 1997-08-29 | 1999-06-15 | Nortel Networks Corporation | Nonlinear filter for noise suppression in linear prediction speech processing devices |
DE19747132C2 (en) * | 1997-10-24 | 2002-11-28 | Fraunhofer Ges Forschung | Methods and devices for encoding audio signals and methods and devices for decoding a bit stream |
DE19829284C2 (en) * | 1998-05-15 | 2000-03-16 | Fraunhofer Ges Forschung | Method and apparatus for processing a temporal stereo signal and method and apparatus for decoding an audio bit stream encoded using prediction over frequency |
US6771723B1 (en) * | 2000-07-14 | 2004-08-03 | Dennis W. Davis | Normalized parametric adaptive matched filter receiver |
US6622117B2 (en) * | 2001-05-14 | 2003-09-16 | International Business Machines Corporation | EM algorithm for convolutive independent component analysis (CICA) |
KR100443405B1 (en) * | 2001-07-05 | 2004-08-09 | 주식회사 이머시스 | The equipment redistribution change of multi channel headphone audio signal for multi channel speaker audio signal |
GB0124352D0 (en) * | 2001-10-11 | 2001-11-28 | 1 Ltd | Signal processing device for acoustic transducer array |
BRPI0308691B1 (en) * | 2002-04-10 | 2018-06-19 | Koninklijke Philips N.V. | "Methods for encoding a multi channel signal and for decoding multiple channel signal information, and arrangements for encoding and decoding a multiple channel signal" |
JP2007009804A (en) * | 2005-06-30 | 2007-01-18 | Tohoku Electric Power Co Inc | Schedule system for output-power control of wind power-plant |
JP2007095002A (en) * | 2005-09-30 | 2007-04-12 | Noritsu Koki Co Ltd | Photograph processor |
-
2004
- 2004-03-01 DE DE102004009954A patent/DE102004009954B4/en not_active Expired - Lifetime
-
2005
- 2005-02-28 DE DE502005000864T patent/DE502005000864D1/en active Active
- 2005-02-28 AT AT05715611T patent/ATE364882T1/en active
- 2005-02-28 KR KR1020067016991A patent/KR100823097B1/en active IP Right Grant
- 2005-02-28 PT PT05715611T patent/PT1697930E/en unknown
- 2005-02-28 CN CN2005800068249A patent/CN1926608B/en active Active
- 2005-02-28 RU RU2006134641/09A patent/RU2332727C2/en active
- 2005-02-28 DK DK05715611T patent/DK1697930T3/en active
- 2005-02-28 WO PCT/EP2005/002110 patent/WO2005083678A1/en active IP Right Grant
- 2005-02-28 BR BRPI0507207A patent/BRPI0507207B1/en active IP Right Grant
- 2005-02-28 CA CA2558161A patent/CA2558161C/en active Active
- 2005-02-28 JP JP2007501191A patent/JP4413257B2/en active Active
- 2005-02-28 AU AU2005217517A patent/AU2005217517B2/en active Active
- 2005-02-28 ES ES05715611T patent/ES2286798T3/en active Active
- 2005-02-28 EP EP05715611A patent/EP1697930B1/en active Active
-
2006
- 2006-08-01 IL IL177213A patent/IL177213A/en active IP Right Grant
- 2006-08-14 US US11/464,315 patent/US7340391B2/en active Active
- 2006-09-29 NO NO20064431A patent/NO339114B1/en unknown
-
2007
- 2007-02-12 HK HK07101657A patent/HK1095194A1/en unknown
Non-Patent Citations (1)
Title |
---|
See references of WO2005083678A1 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8063809B2 (en) | 2008-12-29 | 2011-11-22 | Huawei Technologies Co., Ltd. | Transient signal encoding method and device, decoding method and device, and processing system |
Also Published As
Publication number | Publication date |
---|---|
DE102004009954A1 (en) | 2005-09-29 |
PT1697930E (en) | 2007-09-25 |
ES2286798T3 (en) | 2007-12-01 |
CA2558161A1 (en) | 2005-09-09 |
IL177213A0 (en) | 2006-12-10 |
IL177213A (en) | 2011-10-31 |
BRPI0507207A8 (en) | 2018-06-12 |
CN1926608A (en) | 2007-03-07 |
DK1697930T3 (en) | 2007-10-08 |
JP2007525718A (en) | 2007-09-06 |
BRPI0507207B1 (en) | 2018-12-26 |
KR100823097B1 (en) | 2008-04-18 |
DE502005000864D1 (en) | 2007-07-26 |
CN1926608B (en) | 2010-05-05 |
JP4413257B2 (en) | 2010-02-10 |
EP1697930B1 (en) | 2007-06-13 |
NO339114B1 (en) | 2016-11-14 |
BRPI0507207A (en) | 2007-06-12 |
HK1095194A1 (en) | 2007-04-27 |
RU2332727C2 (en) | 2008-08-27 |
AU2005217517A1 (en) | 2005-09-09 |
DE102004009954B4 (en) | 2005-12-15 |
US20070033056A1 (en) | 2007-02-08 |
RU2006134641A (en) | 2008-04-10 |
WO2005083678A1 (en) | 2005-09-09 |
US7340391B2 (en) | 2008-03-04 |
KR20060121982A (en) | 2006-11-29 |
ATE364882T1 (en) | 2007-07-15 |
NO20064431L (en) | 2006-09-29 |
AU2005217517B2 (en) | 2008-06-26 |
CA2558161C (en) | 2010-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1697930B1 (en) | Device and method for processing a multi-channel signal | |
EP1687810B1 (en) | Device and method for determining a quantiser step size | |
EP3544003B1 (en) | Device and method of determining an estimated value | |
EP1145227B1 (en) | Method and device for error concealment in an encoded audio-signal and method and device for decoding an encoded audio signal | |
DE69233094T2 (en) | Method and arrangement for data compression in which quantization bits are allocated to a block in a current frame depending on a block in a past frame | |
DE4320990B4 (en) | Redundancy reduction procedure | |
DE19736669C1 (en) | Beat detection method for time discrete audio signal | |
DE602004005020T2 (en) | AUDIO SIGNAL SYNTHESIS | |
EP1953739B1 (en) | Method and device for reducing noise in a decoded signal | |
WO1999004506A1 (en) | Method for coding an audio signal | |
WO1999004505A1 (en) | Method for signalling a noise substitution during audio signal coding | |
EP1397799B1 (en) | Method and device for processing time-discrete audio sampled values | |
EP1825461A1 (en) | Method and apparatus for artificially expanding the bandwidth of voice signals | |
DE10236694A1 (en) | Equipment for scalable coding and decoding of spectral values of signal containing audio and/or video information by splitting signal binary spectral values into two partial scaling layers | |
DE69932861T2 (en) | METHOD FOR CODING AN AUDIO SIGNAL WITH A QUALITY VALUE FOR BIT ASSIGNMENT | |
DE60311334T2 (en) | Method and device for coding and decoding a digital information signal | |
WO2001043503A2 (en) | Method and device for processing a stereo audio signal | |
EP1277346B1 (en) | Device and method for analysing a spectral representation of a decoded time-variable signal | |
DE19742201C1 (en) | Method of encoding time discrete audio signals, esp. for studio use | |
DE4209382C1 (en) | ||
DE10065363B4 (en) | Apparatus and method for decoding a coded data signal |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20060721 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWAN |
|
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: SCHUG, MICHAEL Inventor name: GROESCHL, ALEXANDER Inventor name: HERRE, JUERGEN |
|
GRAS | Grant fee paid |
Free format text: ORIGINAL CODE: EPIDOSNIGR3 |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1095194 Country of ref document: HK |
|
GRAA | (expected) grant |
Free format text: ORIGINAL CODE: 0009210 |
|
DAX | Request for extension of the european patent (deleted) | ||
AK | Designated contracting states |
Kind code of ref document: B1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU MC NL PL PT RO SE SI SK TR |
|
REG | Reference to a national code |
Ref country code: GB Ref legal event code: FG4D Free format text: NOT ENGLISH |
|
REG | Reference to a national code |
Ref country code: CH Ref legal event code: EP |
|
REG | Reference to a national code |
Ref country code: IE Ref legal event code: FG4D Free format text: LANGUAGE OF EP DOCUMENT: GERMAN |
|
REF | Corresponds to: |
Ref document number: 502005000864 Country of ref document: DE Date of ref document: 20070726 Kind code of ref document: P |
|
GBT | Gb: translation of ep patent filed (gb section 77(6)(a)/1977) |
Effective date: 20070705 |
|
REG | Reference to a national code |
Ref country code: SE Ref legal event code: TRGR |
|
REG | Reference to a national code |
Ref country code: PT Ref legal event code: SC4A Free format text: AVAILABILITY OF NATIONAL TRANSLATION Effective date: 20070912 |
|
ET | Fr: translation filed | ||
REG | Reference to a national code |
Ref country code: DK Ref legal event code: T3 |
|
REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1095194 Country of ref document: HK |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: PL Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
REG | Reference to a national code |
Ref country code: ES Ref legal event code: FG2A Ref document number: 2286798 Country of ref document: ES Kind code of ref document: T3 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CZ Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 Ref country code: SI Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 Ref country code: BG Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070913 Ref country code: IS Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071013 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: SK Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 Ref country code: LT Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
PLBE | No opposition filed within time limit |
Free format text: ORIGINAL CODE: 0009261 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: GR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070914 |
|
26N | No opposition filed |
Effective date: 20080314 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: RO Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: EE Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: CY Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: HU Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20071214 |
|
PG25 | Lapsed in a contracting state [announced via postgrant information from national office to epo] |
Ref country code: TR Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT Effective date: 20070613 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 12 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 13 |
|
REG | Reference to a national code |
Ref country code: FR Ref legal event code: PLFP Year of fee payment: 14 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: NL Payment date: 20230220 Year of fee payment: 19 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: MC Payment date: 20230216 Year of fee payment: 19 Ref country code: LU Payment date: 20230216 Year of fee payment: 19 Ref country code: IE Payment date: 20230215 Year of fee payment: 19 Ref country code: FR Payment date: 20230220 Year of fee payment: 19 Ref country code: FI Payment date: 20230222 Year of fee payment: 19 Ref country code: ES Payment date: 20230317 Year of fee payment: 19 Ref country code: DK Payment date: 20230220 Year of fee payment: 19 Ref country code: CH Payment date: 20230307 Year of fee payment: 19 Ref country code: AT Payment date: 20230215 Year of fee payment: 19 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: SE Payment date: 20230220 Year of fee payment: 19 Ref country code: PT Payment date: 20230220 Year of fee payment: 19 Ref country code: IT Payment date: 20230228 Year of fee payment: 19 Ref country code: GB Payment date: 20230221 Year of fee payment: 19 Ref country code: DE Payment date: 20230216 Year of fee payment: 19 Ref country code: BE Payment date: 20230220 Year of fee payment: 19 |
|
P01 | Opt-out of the competence of the unified patent court (upc) registered |
Effective date: 20230512 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: LU Payment date: 20240220 Year of fee payment: 20 |
|
PGFP | Annual fee paid to national office [announced via postgrant information from national office to epo] |
Ref country code: ES Payment date: 20240319 Year of fee payment: 20 Ref country code: NL Payment date: 20240220 Year of fee payment: 20 Ref country code: IE Payment date: 20240216 Year of fee payment: 20 |