EP0910927A1 - Procede de codage et de decodage de valeurs spectrales stereophoniques - Google Patents

Procede de codage et de decodage de valeurs spectrales stereophoniques

Info

Publication number
EP0910927A1
EP0910927A1 EP97925036A EP97925036A EP0910927A1 EP 0910927 A1 EP0910927 A1 EP 0910927A1 EP 97925036 A EP97925036 A EP 97925036A EP 97925036 A EP97925036 A EP 97925036A EP 0910927 A1 EP0910927 A1 EP 0910927A1
Authority
EP
European Patent Office
Prior art keywords
spectral values
stereo
coding
stereo audio
section
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP97925036A
Other languages
German (de)
English (en)
Other versions
EP0910927B1 (fr
Inventor
Uwe Gbur
Martin Dietz
Bodo Teichmann
Karlheinz Brandenburg
Heinz GERHÄUSER
Jürgen HERRE
James Johnston
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
AT&T Labs Inc
Nokia of America Corp
Original Assignee
Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Lucent Technologies Inc
AT&T Labs Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
First worldwide family litigation filed litigation Critical https://patents.darts-ip.com/?family=7799742&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=EP0910927(A1) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Application filed by Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV, Lucent Technologies Inc, AT&T Labs Inc filed Critical Fraunhofer Gesellschaft zur Forderung der Angewandten Forschung eV
Publication of EP0910927A1 publication Critical patent/EP0910927A1/fr
Application granted granted Critical
Publication of EP0910927B1 publication Critical patent/EP0910927B1/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/008Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form

Definitions

  • the present invention relates to encoding and decoding stereo audio spectral values, and more particularly to indicating the fact that stereo intensity encoding is active.
  • Modern audio coding methods or decoding methods which operate according to the MPEG layer 3 standard, for example, are able to compress the data rate of digital audio signals by a factor of twelve, for example, without noticeably deteriorating the quality thereof.
  • the redundancy and irrelevance of the two channels among one another is also used in the stereo case.
  • the MS stereo method known to those skilled in the art essentially uses the redundancy of the two channels with one another, a sum of the two channels and a difference between the two channels being calculated, which then each transmit as modified channel data for the left and right channel become.
  • the redundancy between the two channels removed in the encoder is added again in the decoder. This means that the MS stereo procedure is exactly reconstructive.
  • the intensity stereo method primarily uses stereo irrelevance.
  • stereo irrelevance it can be said that the spatial perception of the human hearing system depends on the frequency of the perceived audio signals. At lower frequencies, both the amount and phase information of both stereo signals are evaluated by the human auditory system, the perception of high-frequency components being based primarily on the analysis of the energy-time envelopes of both channels. The exact phase information of the signals in both channels is therefore not relevant for spatial perception. This property of the human ear is used to use the stereo irrelevance for further data reduction of audio signals by the intensity stereo method.
  • the stereo intensity method cannot resolve precise location information at high frequencies, it is therefore possible to transmit a common energy envelope for both channels instead of two stereo channels L, R from an intensity limit frequency determined in the encoder.
  • a common energy envelope for both channels instead of two stereo channels L, R from an intensity limit frequency determined in the encoder.
  • roughly quantified direction information is also transmitted as side information.
  • the bit savings can be up to 50%.
  • the IS method in the decoder is not exactly reconstructive.
  • mode_extension_bit indicates that the IS method is active at all in a block of stereo audio spectral values, each block having an associated one Mode_extension_bit.
  • FIG. 1 shows a basic illustration of the known IS method.
  • L ⁇ and R ⁇ here represent the stereo audio spectra values of channel L and channel R in any scale factor band.
  • the use of the IS method is only permitted above a certain IS cutoff frequency, in order to avoid coding errors in the coded Introduce stereo audio spectral values. Therefore, the left and right channels must be coded separately in a range from 0 Hz to the IS cutoff frequency.
  • the determination of the IS cutoff frequency as such is carried out in a separate algorithm which does not form part of this invention. From this limit frequency, the encoder encodes the sum signal of the left channel 10 and the right channel 12, which is formed at the summation point 14.
  • scaling information 16 for channel L and scaling information 18 for channel R are also necessary for decoding.
  • scale factors for the left and right channels are transmitted.
  • the scaling information 16 and 18 are transmitted as side information in addition to the coded spectral values of the channel L and the channel R.
  • a decoder supplies decoded audio signal values to a decoded channel L '20 or to a decoded channel R' 22, the scaling information 16 for channel R and the scaling information 18 for channel L with the decoded stereo audio spectral values of the respective channels an L multiplier 24 or an R multiplier 26 in order to decode the originally coded stereo audio spectral values again.
  • the stereo audio spectral values for each channel are grouped into so-called scale factor bands. These bands are adapted to the perceptual properties of the hearing. Each of these bands can be amplified with an additional factor, the so-called scale factor, which is transmitted as side information for the respective channel and which represents part of the scaling information 16 and the scaling information 18 from FIG. 1. These factors shape an interference noise introduced by quantization in such a way that it is "masked" taking psychoacoustic considerations into account and thus becomes inaudible.
  • FIG. 2a shows a format of the encoded right channel R, which is used, for example, in an audio coding method MPEG layer 3. All further explanations regarding the intensity stereo coding also relate to the method according to the MPEG layer 3 standard.
  • the individual scale factor bands 28, into which the stereo audio spectral values are grouped, are shown schematically in the first line in FIG. 2a.
  • the same bandwidth of the scale factor bands drawn in FIG. 2a only serves for clarity of presentation and will not occur in practice due to the psychoacoustic properties of the auditory system.
  • the third line of FIG. 2a contains part of the page information 34 for the right channel.
  • This part of the side information 34 shown consists, on the one hand, of the scale factors skf for the area below the IS cut-off frequency and of direction information rinfo 36 for the area above the IS cut-off frequency 32.
  • This directional information is also used in the intensity stereo method to ensure a rough spatial resolution of the IS-coded frequency range.
  • This direction information rinfo 36 which is also called intensity positions (is_pos), is therefore transmitted in the right channel instead of the scale factors. It should be noted once again that below the IS cutoff frequency, the scale factors 34 corresponding to the scale factor bands 28 are still present in the right channel. The intensity positions 36 indicate the perceived stereo imaging position (the ratio from left to right) of the signal source within the respective scale factor bands 28. In each scale factor band 28 above the IS cutoff frequency, the decoded values of the transmitted stereo audio spectral values are scaled according to the MPEG Layer 3 method by the following scaling factors k L for the left channel and k R for the right channel:
  • is_ratio tan (is_pos- ⁇ r / 12) (3)
  • R ⁇ and L ⁇ represent the intensity stereo decoded stereo audio spectral values.
  • the transition from the quantized sum spectral values not equal to zero to the zero values in the right channel can implicitly indicate the IS cut-off frequency to the decoder with the MPEG Layer 3 standard.
  • the transmitted channel L is thus calculated as the sum of the left and the right channel
  • the transmitted direction information can be determined using the following equation:
  • nint [x] represents the function "next integer", where E L and E R are the energies in the respective scale factor bands of the left and right channels.
  • the stereo audio spectral values are grouped into the scale factor bands, these bands being adapted to the perceptual properties of the hearing.
  • these scale factor bands are now divided into exactly three regions. In order to Areas with the same signal statistics should now be grouped. This is advantageous for the redundancy reduction now taking place by means of the known Huffman coding.
  • the non-backward-compatible NBC coding method which is currently in the standardization process, differs from the standard audio coding method MPEG Layer 3, among other things, in that not only exactly three regions from scale factor bands are allowed in the bitstream syntax for this method, but that so-called sections or "sections" can be present in any number and can have any number of scale factor bands.
  • a section is now assigned a corresponding Huffman table from a plurality of such tables in analogy to the previously described method in MPEG Layer 3 to achieve a maximum redundancy reduction, which table is then to be used for decoding. In extreme cases, for example, a section consists of only a single scale factor band. In practice, however, this is unlikely to occur, since the page information required would then be much too large.
  • the NBC method has a total of 16 Huffman coding table numbers that are transmitted as 4-bit values. This means that one of the twelve existing coding table numbers can be selected.
  • the object of the present invention is to provide methods for coding or decoding stereo audio spectral values, in which information relevant to the coding or decoding is signaled with a minimal amount of side information. This object is achieved by a method for encoding stereo audio spectral values according to claim 1 and by a method for decoding stereo audio spectral values partially encoded in the intensity stereo method according to claim 2.
  • the present invention is based on the recognition that additional coding table numbers which are not used to refer to coding tables can indicate other information relevant for a section.
  • the "additional" code table numbers are the code table numbers that do not refer to code tables. Due to a 4-bit coding of twelve different coding table numbers, the numbers 13, 14 and 15 are, as it were, freely available for assignment with other information.
  • two (no. 14 and no. 15) of the three (no. 13, no. 14 and no. 15) additional coding table numbers are used in order to, on the one hand, refer to an intensity which is present in a section. Coding and on the other hand to point out the mutual phase relationship of IS-coded stereo audio spectral values in two stereo channels.
  • the additional unused coding table number 13 can be used to indicate adaptive Huffman coding.
  • 2a shows a format of the data in the presence of stereo intensity coding for the right channel for the standard MPEG Layer 3
  • 2b shows a format of the data in the presence of stereo intensity coding for the right channel for the MPEG-NBC method
  • FIG. 3 is a schematic block diagram of a decoder that implements the present invention.
  • a method for encoding stereo audio spectral values and the method for decoding stereo audio spectral values partially encoded in the intensity stereo method according to a first exemplary embodiment of the present invention use novel signaling of the presence of the intensity stereo encoding within a section.
  • the first 12 coding table numbers correspond to actual coding tables.
  • the last and the penultimate coding table number it is now signaled that the stereo intensity method is used within the section to which this coding table number is assigned.
  • FIG. 2b shows a format of the data for the right channel R in the presence of stereo intensity coding, using the MPEG2-NBC method.
  • FIG. 2a or to the MPEG Layer 3 method, is that a user now has the flexibility to selectively insert or deactivate an intensity stereo coding of the stereo audio spectral values for each section even above the IS cut-off frequency 32 to switch off.
  • the IS cut-off frequency is therefore no longer a correct cut-off frequency, since with the NBC method, the IS coding can also be switched off or on again above the IS cut-off frequency.
  • the scale factors transmitted in a section with IS coding for the right channel now also represent the direction information 36 analogously to the prior art, these values themselves also being subjected to a difference and Huffman coding.
  • the right channel as already mentioned, there are no stereo audio spectral values in the scale factor bands that are not IS-coded, but a zero spectrum.
  • the left channel contains the sum signal of the left and right channels. However, the sum signal is normalized in such a way that its energy within the respective scale factor bands after IS decoding corresponds to the energy of the left channel. Therefore, the left channel can also be adopted unchanged in the decoding device if IS coding is used and does not have to be determined specifically by means of a re-scaling rule.
  • the stereo audio spectral values of the right channel can now be calculated back from the stereo audio spectral values of the left channel using the direction information is_pos 36, which are present in the side information of the right channel.
  • the stereo intensity method produces two coherent signals for the left or right channel, which differ only in their amplitude, ie intensity, depending on the direction information is_pos 36 (equations (4) and (5)).
  • the stereo intensity coding is signaled by means of two "unreal" coding table numbers, a phase relationship of the two channels to one another can be included. If the channels have the same phase position, the back-calculation rule according to the invention to be carried out in the decoder is as follows:
  • R ⁇ in the two previous equations denotes the back-calculated, i.e. decoded, stereo audio spectral values of the right channel
  • sfb denotes the scale factor band 28 to which the direction information is_pos 36 are assigned
  • L ⁇ denotes the stereo audio spectral values of the left channel, which are adopted unchanged in the decoder.
  • Coding table number 15 now indicates whether the first retroactive accounting step should be used, while coding table number 14 indicates that the second retroactive accounting rule should be used, i.e. that the two channels are out of phase.
  • a phase discriminator can be provided which, from a certain phase discriminator output value, which can be, for example, 90 °, determines that the signals are out of phase, the same being considered to be in phase with a phase difference of less than 90 °.
  • a section which consists of at least one scale factor band exists, by means of the code table numbers 14 or 15, the phase relationship of the two channels to one another is determined.
  • the side information caused by IS and phase signaling is 8 bits for a section, which is composed of four bits for the section length and four bits for the coding table number 14 or 15. If an audio signal is to be encoded which has frequent changes in the phase position in scale factor bands of its stereo audio spectral values, then according to the first exemplary embodiment a new section ("section") must be started each time the phase position is reversed from scale factor band to scale factor band.
  • a signal with a frequently changing phase position therefore generates a large number of sections, since each section can only display either the in-phase or the out-of-phase of its stereo audio spectral values in the two channels due to the coding table number assigned to it.
  • An unfavorable signal will therefore lead to a large number of sections and thus to a large amount of page information.
  • a second exemplary embodiment of the present invention allows a phase-factor coding on a scale factor band basis in a section in which the intensity coding is active.
  • this method according to the second exemplary embodiment of the present invention using an MS mask, which is described below, it is possible to encode phase factor by scale factor band without increasing the number of sections and without any additional expenditure.
  • center-side method and the intensity stereo method are mutually exclusive in a scale factor band. These two methods are therefore orthogonal.
  • MS coding of stereo audio spectral values is used in a bit stream
  • a signaling bit in the side information will be set accordingly globally turn on the MS coding. Setting this bit means that an MS bit mask is transmitted, with which it is possible to selectively switch MS coding on or off for each scale factor band (scfbd).
  • One bit is reserved in the MS bit mask for each scale factor band, which is why the length of the bit mask corresponds to the number of scale factor bands.
  • the MS scale factor information is not necessary, since the MS coding must not be activated here.
  • the MS bit mask can be used for other signaling in this area. It is therefore possible to display details of the IS coding using the MS bit mask.
  • the information relating to the phase position of the channels is specified in a section by means of the coding table numbers 14 and 15 in IS coding.
  • the coding table numbers also indicate that IS coding is active at all in a section.
  • the MS bit mask is used in the second exemplary embodiment of the present invention to allow scale factor bands with different phase positions in one section.
  • the MS bit mask is now used to indicate the phase relationship of the individual scale factor bands in this section in relation to the coding table number, which signals that IS coding is active in a section. If a bit in the MS bit mask for a scale factor band is not set (ie zero), the phase information indicated by the coding table number for the section in which the scale factor band is located is retained, while if a (ie one) bit is set in the MS bit mask for the scale factor band which is inverted by the phase table of the two channels indicated by the coding table number for the section in which the scale factor band is located. In principle, it is an EXCLUSIVE-OR link between the one indicated by the coding table number Phase position and the MS bit mask.
  • phase relationships of the two stereo channels L and R calculated from the coding table number and MS bit mask in a scale factor band located in a section in which the IS coding is used are as follows:
  • the described second exemplary embodiment of the present invention thus allows scale factor bands with stereo audio spectral values with different phase positions to occur in one section, as a result of which fewer sections than in the first exemplary embodiment have to be formed for coding. This means that less page information also has to be transmitted.
  • the additional coding table numbers can also be used to display other information relevant for a section.
  • Further information relevant to a section can, for example, indicate the use of an adaptive ven Huffman coding in one section.
  • an adapted Huffman table can be generated depending on the signal statistics.
  • the coding table number 13 instructs the coding device not to use any of the twelve fixed Huffman tables, but to use an adapted Huffman table which is not known a priori to the decoder. This is advantageous if the signal statistics in a section cannot be optimally coded, ie compressed, with one of the twelve fixed coding tables.
  • the coding is no longer fixed to the twelve fixed Huffman tables, but can generate and use a table that is optimally adapted to the signal statistics.
  • the information about the adaptive coding table is transmitted as additional page information.
  • a decoding device requires this additional side information in order to calculate back from it the adapted Huffman table used in the coding, in order to be able to correctly decode the Huffman-coded stereo audio spectral values again.
  • Audio spectral values partially coded using the intensity stereo method are each supplied to inverse quantizers 38 and 40, the inverse quantizers reversing the quantization introduced during coding.
  • the dequantized stereo audio spectral values then arrive in an MS decoder 42.
  • This MS decoder 42 reverses the middle-side coding introduced in the encoder.
  • An IS decoder 44 now uses the previously described recalculation regulations (7) and (8) in order to obtain the original stereo audio spectral values again for the IS-coded scale factor bands.
  • Respective reverse transformation devices for the left or right channel now convert the stereo audio spectral values into stereo audio time evaluate L (t), R (t).
  • the inverse transformers 46 and 48 can be implemented by an inverse MDCT, for example.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Stereo-Broadcasting Methods (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
EP97925036A 1996-07-12 1997-06-03 Procede de codage et de decodage de valeurs spectrales stereophoniques Expired - Lifetime EP0910927B1 (fr)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
DE19628292A DE19628292B4 (de) 1996-07-12 1996-07-12 Verfahren zum Codieren und Decodieren von Stereoaudiospektralwerten
DE19628292 1996-07-12
PCT/EP1997/002874 WO1998003036A1 (fr) 1996-07-12 1997-06-03 Procede de codage et de decodage de valeurs spectrales stereophoniques

Publications (2)

Publication Number Publication Date
EP0910927A1 true EP0910927A1 (fr) 1999-04-28
EP0910927B1 EP0910927B1 (fr) 2000-01-12

Family

ID=7799742

Family Applications (1)

Application Number Title Priority Date Filing Date
EP97925036A Expired - Lifetime EP0910927B1 (fr) 1996-07-12 1997-06-03 Procede de codage et de decodage de valeurs spectrales stereophoniques

Country Status (14)

Country Link
US (1) US6771777B1 (fr)
EP (1) EP0910927B1 (fr)
JP (1) JP3622982B2 (fr)
KR (1) KR100316582B1 (fr)
AT (1) ATE188832T1 (fr)
AU (1) AU712196B2 (fr)
CA (1) CA2260090C (fr)
DE (2) DE19628292B4 (fr)
DK (1) DK0910927T3 (fr)
ES (1) ES2143868T3 (fr)
GR (1) GR3032444T3 (fr)
NO (1) NO317570B1 (fr)
PT (1) PT910927E (fr)
WO (1) WO1998003036A1 (fr)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7016547B1 (en) 2002-06-28 2006-03-21 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US7143030B2 (en) 2001-12-14 2006-11-28 Microsoft Corporation Parametric compression/decompression modes for quantization matrices for digital audio
US7240001B2 (en) 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7299190B2 (en) 2002-09-04 2007-11-20 Microsoft Corporation Quantization and inverse quantization for audio
US7460990B2 (en) 2004-01-23 2008-12-02 Microsoft Corporation Efficient coding of digital media spectral data using wide-sense perceptual similarity
US7502743B2 (en) 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
US7539612B2 (en) 2005-07-15 2009-05-26 Microsoft Corporation Coding and decoding scale factor information
US7562021B2 (en) 2005-07-15 2009-07-14 Microsoft Corporation Modification of codewords in dictionary used for efficient coding of digital media spectral data
US7630882B2 (en) 2005-07-15 2009-12-08 Microsoft Corporation Frequency segmentation to obtain bands for efficient coding of digital media
US7688894B2 (en) 2003-09-07 2010-03-30 Microsoft Corporation Scan patterns for interlaced video content
US7782954B2 (en) 2003-09-07 2010-08-24 Microsoft Corporation Scan patterns for progressive video content
US7831434B2 (en) 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
US7953604B2 (en) 2006-01-20 2011-05-31 Microsoft Corporation Shape and scale parameters for extended-band frequency coding
US8069052B2 (en) 2002-09-04 2011-11-29 Microsoft Corporation Quantization and inverse quantization for audio

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6539357B1 (en) * 1999-04-29 2003-03-25 Agere Systems Inc. Technique for parametric coding of a signal containing information
US6735561B1 (en) * 2000-03-29 2004-05-11 At&T Corp. Effective deployment of temporal noise shaping (TNS) filters
US7099830B1 (en) * 2000-03-29 2006-08-29 At&T Corp. Effective deployment of temporal noise shaping (TNS) filters
ATE387044T1 (de) * 2000-07-07 2008-03-15 Nokia Siemens Networks Oy Verfahren und vorrichtung für die perzeptuelle tonkodierung von einem mehrkanal tonsignal mit verwendung der kaskadierten diskreten cosinustransformation oder der modifizierten diskreten cosinustransformation
US8605911B2 (en) 2001-07-10 2013-12-10 Dolby International Ab Efficient and scalable parametric stereo coding for low bitrate audio coding applications
SE0202159D0 (sv) * 2001-07-10 2002-07-09 Coding Technologies Sweden Ab Efficientand scalable parametric stereo coding for low bitrate applications
WO2003046891A1 (fr) 2001-11-29 2003-06-05 Coding Technologies Ab Procede permettant d'ameliorer la reconstruction des hautes frequences
EP2282310B1 (fr) 2002-09-04 2012-01-25 Microsoft Corporation Codage entropique par adaptation du mode de codage entre le codage à longueur de plage et le codage par niveau
US7433824B2 (en) * 2002-09-04 2008-10-07 Microsoft Corporation Entropy coding by adapting coding between level and run-length/level modes
SE0202770D0 (sv) 2002-09-18 2002-09-18 Coding Technologies Sweden Ab Method for reduction of aliasing introduces by spectral envelope adjustment in real-valued filterbanks
US7724827B2 (en) * 2003-09-07 2010-05-25 Microsoft Corporation Multi-layer run level encoding and decoding
KR20050027179A (ko) * 2003-09-13 2005-03-18 삼성전자주식회사 오디오 데이터 복원 방법 및 그 장치
JPWO2006004048A1 (ja) * 2004-07-06 2008-04-24 松下電器産業株式会社 オーディオ信号符号化装置、オーディオ信号復号化装置、方法、及びプログラム
EP1866911B1 (fr) * 2005-03-30 2010-06-09 Koninklijke Philips Electronics N.V. Codage audio multicanaux pouvant etre mis a l'echelle
US7684981B2 (en) 2005-07-15 2010-03-23 Microsoft Corporation Prediction of spectral coefficients in waveform coding and decoding
US7693709B2 (en) 2005-07-15 2010-04-06 Microsoft Corporation Reordering coefficients for waveform coding or decoding
KR100851970B1 (ko) * 2005-07-15 2008-08-12 삼성전자주식회사 오디오 신호의 중요주파수 성분 추출방법 및 장치와 이를이용한 저비트율 오디오 신호 부호화/복호화 방법 및 장치
US7599840B2 (en) * 2005-07-15 2009-10-06 Microsoft Corporation Selectively using multiple entropy models in adaptive coding and decoding
US7565018B2 (en) * 2005-08-12 2009-07-21 Microsoft Corporation Adaptive coding and decoding of wide-range coefficients
US8599925B2 (en) 2005-08-12 2013-12-03 Microsoft Corporation Efficient coding and decoding of transform blocks
US7933337B2 (en) 2005-08-12 2011-04-26 Microsoft Corporation Prediction of transform coefficients for image compression
US8190425B2 (en) 2006-01-20 2012-05-29 Microsoft Corporation Complex cross-correlation parameters for multi-channel audio
US8184710B2 (en) 2007-02-21 2012-05-22 Microsoft Corporation Adaptive truncation of transform coefficient data in a transform-based digital media codec
US7774205B2 (en) 2007-06-15 2010-08-10 Microsoft Corporation Coding of sparse digital media spectral data
US7761290B2 (en) 2007-06-15 2010-07-20 Microsoft Corporation Flexible frequency and time partitioning in perceptual transform coding of audio
US8046214B2 (en) 2007-06-22 2011-10-25 Microsoft Corporation Low complexity decoder for complex transform coding of multi-channel sound
US8249883B2 (en) 2007-10-26 2012-08-21 Microsoft Corporation Channel extension coding for multi-channel source
KR101444102B1 (ko) 2008-02-20 2014-09-26 삼성전자주식회사 스테레오 오디오의 부호화, 복호화 방법 및 장치
US8179974B2 (en) 2008-05-02 2012-05-15 Microsoft Corporation Multi-level representation of reordered transform coefficients
US8406307B2 (en) 2008-08-22 2013-03-26 Microsoft Corporation Entropy coding/decoding of hierarchically organized data
JP6061121B2 (ja) * 2011-07-01 2017-01-18 ソニー株式会社 オーディオ符号化装置、オーディオ符号化方法、およびプログラム

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE3310480C2 (de) * 1983-03-23 1986-02-13 Seitzer, Dieter, Prof. Dr.-Ing., 8520 Erlangen Digitales Codierverfahren für Audiosignale
JPS59188764A (ja) 1983-04-11 1984-10-26 Hitachi Ltd メモリ装置
DE3943881B4 (de) * 1989-04-17 2008-07-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Digitales Codierverfahren
JP3131249B2 (ja) 1991-08-23 2001-01-31 日本放送協会 混合音声信号受信装置
EP0559348A3 (fr) * 1992-03-02 1993-11-03 AT&T Corp. Processeur ayant une boucle de réglage du débit pour un codeur/décodeur perceptuel
CA2090052C (fr) 1992-03-02 1998-11-24 Anibal Joao De Sousa Ferreira Methode et appareil de codage di signaux audio
DE4236989C2 (de) * 1992-11-02 1994-11-17 Fraunhofer Ges Forschung Verfahren zur Übertragung und/oder Speicherung digitaler Signale mehrerer Kanäle
JP3292522B2 (ja) 1992-11-25 2002-06-17 京セラ株式会社 携帯電話機
JP3150475B2 (ja) 1993-02-19 2001-03-26 松下電器産業株式会社 量子化方法
US5581653A (en) 1993-08-31 1996-12-03 Dolby Laboratories Licensing Corporation Low bit-rate high-resolution spectral envelope coding for audio encoder and decoder
DE4331376C1 (de) 1993-09-15 1994-11-10 Fraunhofer Ges Forschung Verfahren zum Bestimmen der zu wählenden Codierungsart für die Codierung von wenigstens zwei Signalen
DE4331367C2 (de) * 1993-09-15 1996-04-18 Lewin Martin Innenmuffe zur Dichtung von Rohrstößen in Rohrleitungen
US5488665A (en) 1993-11-23 1996-01-30 At&T Corp. Multi-channel perceptual audio compression system with encoding mode switching among matrixed channels
JP3435674B2 (ja) 1994-05-06 2003-08-11 日本電信電話株式会社 信号の符号化方法と復号方法及びそれを使った符号器及び復号器
US5864802A (en) * 1995-09-22 1999-01-26 Samsung Electronics Co., Ltd. Digital audio encoding method utilizing look-up table and device thereof

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO9803036A1 *

Cited By (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7143030B2 (en) 2001-12-14 2006-11-28 Microsoft Corporation Parametric compression/decompression modes for quantization matrices for digital audio
US7155383B2 (en) 2001-12-14 2006-12-26 Microsoft Corporation Quantization matrices for jointly coded channels of audio
US7240001B2 (en) 2001-12-14 2007-07-03 Microsoft Corporation Quality improvement techniques in an audio encoder
US7249016B2 (en) 2001-12-14 2007-07-24 Microsoft Corporation Quantization matrices using normalized-block pattern of digital audio
US7930171B2 (en) 2001-12-14 2011-04-19 Microsoft Corporation Multi-channel audio encoding/decoding with parametric compression/decompression and weight factors
US7016547B1 (en) 2002-06-28 2006-03-21 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US7218790B2 (en) 2002-06-28 2007-05-15 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US7340103B2 (en) 2002-06-28 2008-03-04 Microsoft Corporation Adaptive entropy encoding/decoding for screen capture content
US7801735B2 (en) 2002-09-04 2010-09-21 Microsoft Corporation Compressing and decompressing weight factors using temporal prediction for audio data
US8069052B2 (en) 2002-09-04 2011-11-29 Microsoft Corporation Quantization and inverse quantization for audio
US7299190B2 (en) 2002-09-04 2007-11-20 Microsoft Corporation Quantization and inverse quantization for audio
US7502743B2 (en) 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
US7782954B2 (en) 2003-09-07 2010-08-24 Microsoft Corporation Scan patterns for progressive video content
US7688894B2 (en) 2003-09-07 2010-03-30 Microsoft Corporation Scan patterns for interlaced video content
US7460990B2 (en) 2004-01-23 2008-12-02 Microsoft Corporation Efficient coding of digital media spectral data using wide-sense perceptual similarity
US7630882B2 (en) 2005-07-15 2009-12-08 Microsoft Corporation Frequency segmentation to obtain bands for efficient coding of digital media
US7562021B2 (en) 2005-07-15 2009-07-14 Microsoft Corporation Modification of codewords in dictionary used for efficient coding of digital media spectral data
US7539612B2 (en) 2005-07-15 2009-05-26 Microsoft Corporation Coding and decoding scale factor information
US7831434B2 (en) 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US7953604B2 (en) 2006-01-20 2011-05-31 Microsoft Corporation Shape and scale parameters for extended-band frequency coding
US7885819B2 (en) 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
US9741354B2 (en) 2007-06-29 2017-08-22 Microsoft Technology Licensing, Llc Bitstream syntax for multi-process audio decoding

Also Published As

Publication number Publication date
KR100316582B1 (ko) 2002-02-28
AU712196B2 (en) 1999-10-28
US6771777B1 (en) 2004-08-03
NO317570B1 (no) 2004-11-15
NO990106D0 (no) 1999-01-11
DE59701014D1 (de) 2000-02-17
DE19628292B4 (de) 2007-08-02
EP0910927B1 (fr) 2000-01-12
DK0910927T3 (da) 2000-05-08
CA2260090A1 (fr) 1998-01-22
NO990106L (no) 1999-03-10
ES2143868T3 (es) 2000-05-16
ATE188832T1 (de) 2000-01-15
JP3622982B2 (ja) 2005-02-23
DE19628292A1 (de) 1998-01-15
KR20000022435A (ko) 2000-04-25
WO1998003036A1 (fr) 1998-01-22
CA2260090C (fr) 2000-10-17
JP2000505266A (ja) 2000-04-25
AU3031897A (en) 1998-02-09
GR3032444T3 (en) 2000-05-31
PT910927E (pt) 2000-04-28

Similar Documents

Publication Publication Date Title
DE19628292B4 (de) Verfahren zum Codieren und Decodieren von Stereoaudiospektralwerten
EP0910928B1 (fr) Codage et decodage de signaux audio au moyen d'un procede stereo en intensite et de prediction
DE69927505T2 (de) Verfahren zum einfügen von zusatzdaten in einen audiodatenstrom
DE69210064T2 (de) Teilbandkodierer und Sender unter Verwendung dieses Kodierers
DE19747132C2 (de) Verfahren und Vorrichtungen zum Codieren von Audiosignalen sowie Verfahren und Vorrichtungen zum Decodieren eines Bitstroms
EP0931386B1 (fr) Procede de signalisation d'une substitution de bruit lors du codage d'un signal audio
EP0954909B1 (fr) Procede de codage d'un signal audio
DE19921122C1 (de) Verfahren und Vorrichtung zum Verschleiern eines Fehlers in einem codierten Audiosignal und Verfahren und Vorrichtung zum Decodieren eines codierten Audiosignals
DE4135070C1 (fr)
DE19742655C2 (de) Verfahren und Vorrichtung zum Codieren eines zeitdiskreten Stereosignals
DE4222623C2 (de) Verfahren zum Übertragen oder Speichern von digitalisierten Tonsignalen
EP0611516B1 (fr) Procede de reduction de donnees dans la transmission et/ou la mise en memoire de signaux numeriques de plusieurs canaux dependants
DE4217276C1 (fr)
EP1926082A1 (fr) Procédé de codage échelonnable de signaux stéréo
DE4430864C2 (de) Verfahren zum unbemerktem Übertragen und/oder Speichern von Zusatzinformationen innerhalb eines quellencodierten, datenreduzierten Audiosignals
DE19742201C1 (de) Verfahren und Vorrichtung zum Codieren von Audiosignalen
DE19747119C2 (de) Verfahren und Vorrichtungen zum Codieren bzw. Decodieren eines Audiosignals bzw. eines Bitstroms
DE10113322C2 (de) Verfahren zur Codierung von Audiodaten
DE19840853B4 (de) Verfahren und Vorrichtungen zum Codieren eines Audiosignals
DE19617654C1 (de) Verfahren zum Codieren eines zwei- oder mehrkanaligen Tonsignals

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19990105

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAG Despatch of communication of intention to grant

Free format text: ORIGINAL CODE: EPIDOS AGRA

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

RIN1 Information on inventor provided before grant (corrected)

Inventor name: JOHNSTON, JAMES

Inventor name: HERRE, JUERGEN

Inventor name: GERHAEUSER, HEINZ

Inventor name: BRANDENBURG, KARLHEINZ

Inventor name: TEICHMANN, BODO

Inventor name: DIETZ, MARTIN

Inventor name: GBUR, UWE

17Q First examination report despatched

Effective date: 19990601

GRAH Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOS IGRA

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE

ITF It: translation for a ep patent filed

Owner name: JACOBACCI & PERANI S.P.A.

REF Corresponds to:

Ref document number: 188832

Country of ref document: AT

Date of ref document: 20000115

Kind code of ref document: T

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

ET Fr: translation filed
REF Corresponds to:

Ref document number: 59701014

Country of ref document: DE

Date of ref document: 20000217

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: GERMAN

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20000313

REG Reference to a national code

Ref country code: PT

Ref legal event code: SC4A

Free format text: AVAILABILITY OF NATIONAL TRANSLATION

Effective date: 20000131

REG Reference to a national code

Ref country code: DK

Ref legal event code: T3

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2143868

Country of ref document: ES

Kind code of ref document: T3

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed
REG Reference to a national code

Ref country code: GB

Ref legal event code: IF02

REG Reference to a national code

Ref country code: CH

Ref legal event code: PFA

Owner name: AT & T LABORATORIES/RESEARCH

Free format text: FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.#LEONRODSTRASSE 54#80636 MUENCHEN (DE) $ AT & T LABORATORIES/RESEARCH#180 PARK AVENUE#FLORHAM PARK, NJ 07932 (US) $ LUCENT TECHNOLOGIES#BELL LABORATORIES, 600 MOUNTAIN AVENUE#MURRAY HILL, NJ 07974-0636 (US) -TRANSFER TO- AT & T LABORATORIES/RESEARCH#180 PARK AVENUE#FLORHAM PARK, NJ 07932 (US) $ LUCENT TECHNOLOGIES#BELL LABORATORIES, 600 MOUNTAIN AVENUE#MURRAY HILL, NJ 07974-0636 (US) $ FRAUNHOFER-GESELLSCHAFT ZUR FOERDERUNG DER ANGEWANDTEN FORSCHUNG E.V.#HANSASTRASSE 27 C#80686 MUENCHEN (DE)

REG Reference to a national code

Ref country code: DE

Ref legal event code: R039

Ref document number: 59701014

Country of ref document: DE

Ref country code: DE

Ref legal event code: R008

Ref document number: 59701014

Country of ref document: DE

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 59701014

Country of ref document: DE

Ref country code: DE

Ref legal event code: R040

Ref document number: 59701014

Country of ref document: DE

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20160622

Year of fee payment: 20

Ref country code: CH

Payment date: 20160621

Year of fee payment: 20

Ref country code: ES

Payment date: 20160622

Year of fee payment: 20

Ref country code: GR

Payment date: 20160621

Year of fee payment: 20

Ref country code: GB

Payment date: 20160628

Year of fee payment: 20

Ref country code: IE

Payment date: 20160621

Year of fee payment: 20

Ref country code: MC

Payment date: 20160621

Year of fee payment: 20

Ref country code: LU

Payment date: 20160622

Year of fee payment: 20

Ref country code: FI

Payment date: 20160620

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: AT

Payment date: 20160620

Year of fee payment: 20

Ref country code: BE

Payment date: 20160621

Year of fee payment: 20

Ref country code: PT

Payment date: 20160525

Year of fee payment: 20

Ref country code: FR

Payment date: 20160621

Year of fee payment: 20

Ref country code: SE

Payment date: 20160621

Year of fee payment: 20

Ref country code: DK

Payment date: 20160622

Year of fee payment: 20

Ref country code: NL

Payment date: 20160621

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20160621

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 59701014

Country of ref document: DE

REG Reference to a national code

Ref country code: DK

Ref legal event code: EUP

Effective date: 20170603

REG Reference to a national code

Ref country code: NL

Ref legal event code: MK

Effective date: 20170602

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20170602

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK07

Ref document number: 188832

Country of ref document: AT

Kind code of ref document: T

Effective date: 20170603

REG Reference to a national code

Ref country code: IE

Ref legal event code: MK9A

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20170602

REG Reference to a national code

Ref country code: SE

Ref legal event code: EUG

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20170614

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20170603

REG Reference to a national code

Ref country code: ES

Ref legal event code: FD2A

Effective date: 20180508

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20170604