EP1989706A2 - Vorrichtung für wahrnehmungsgewichtung bei der tonkodierung/-dekodierung - Google Patents

Vorrichtung für wahrnehmungsgewichtung bei der tonkodierung/-dekodierung

Info

Publication number
EP1989706A2
EP1989706A2 EP07731586A EP07731586A EP1989706A2 EP 1989706 A2 EP1989706 A2 EP 1989706A2 EP 07731586 A EP07731586 A EP 07731586A EP 07731586 A EP07731586 A EP 07731586A EP 1989706 A2 EP1989706 A2 EP 1989706A2
Authority
EP
European Patent Office
Prior art keywords
perceptual weighting
filter
band
gain compensation
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP07731586A
Other languages
English (en)
French (fr)
Other versions
EP1989706B1 (de
Inventor
Stéphane RAGOT
Romain Trilling
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP1989706A2 publication Critical patent/EP1989706A2/de
Application granted granted Critical
Publication of EP1989706B1 publication Critical patent/EP1989706B1/de
Not-in-force legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • G10L19/0208Subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Definitions

  • the present invention relates to a perceptual weighting device for encoding / decoding an audio signal in a given frequency band. It also relates to a hierarchical audio encoder and decoder comprising a coding / decoding device according to the invention.
  • the invention finds a particularly advantageous application in the field of transmission and storage of digital signals, such as audio-frequency signals of speech, music, etc.
  • waveform coding methods, such as MIC or ADPCM (PCM or ADPCM) coding
  • CELP coding Code Excited Linear Prediction
  • the coder generates a fixed rate bit stream.
  • This fixed rate constraint simplifies the implementation and use of the encoder and decoder, commonly referred to together as "coded". Examples of such systems are: ITU-T G.711 coding at 64 kbit / s, ITU-T G.729 coding at 8 kbit / s or GSM-EFR at 12.2 kbit / s.
  • variable rate bit stream the bit rate values being taken in a pre-defined set. It is thus possible to distinguish several multi-rate coding techniques, more flexible than the fixed rate coding:
  • the multi-mode coding controlled by the source and / or the channel as implemented in the AMR-NB, AMR-WB, SMV or VMR-WB systems the hierarchical coding, or "scalable" coding, which generates a so-called hierarchical bit stream because it includes a core rate and one or more enhancement layer (s).
  • the 48, 56 and 64 kbit / s G.722 system is a simple example of scalable rate scaling.
  • the MPEG-4 CELP codec is scalable in terms of bit rate and bandwidth; other examples of such coders are found in the article by B. Kovesi, D. Massaloux, A. Sollaud, "A Scalable Speech and Audio Coding Scheme with Continuous Bitrate Flexibility", ICASSP 2004.
  • the invention is of interest here more particularly to hierarchical coding.
  • the bit stream comprises a base layer, or core, and one or more enhancement layers.
  • the base layer is generated by a fixed low rate codec, known as a "core coded", guaranteeing the minimum quality of the coding; this layer must be received by the decoder to maintain an acceptable level of quality.
  • Improvement layers are used to improve the quality; it may happen that they are not all received by the decoder.
  • the main advantage of hierarchical coding is that it allows an adaptation of the bit rate by simple truncation of the bit stream.
  • the number of layers namely the number of possible truncations of the bitstream, defines the granularity of the coding: we speak of coding with high granularity if the bitstream comprises few layers (of the order of 2 to 4), while a fine granular coding allows for example a step of the order of 1 kbit / s.
  • the invention relates to scalable bandwidth and bandwidth coding techniques with a CELP heart-coder in a telephone band and one or more band-enhanced enhancement layer with respect to the actual telephone band.
  • Examples of such systems are given in the article by H. Taddei et al, Scalable Three Bitrate (8, 14.2 and 24 kbit / s) Audio Coder; 107th Convention AES, 199, with a high granularity of 8, 14.2 and 24 kbit / s, and with fine granularity of 6.4 to 32 kbit / s in the article by B. Kovesi et al supra.
  • G.729EV Embedded Variable Bitrate
  • the objective of the G.729EV standardization is to obtain a G.729 core hierarchical encoder, producing a signal whose band extends from the narrow band (300-3400 Hz) to the broadband (50-7000 Hz). ) at a rate of 8 to 32 kbit / s for conversational services.
  • This encoder is inherently interoperable with Recommendation G.729, which ensures compatibility with existing VoIP devices.
  • perceptual weighting filtering allows to put shaped the coding noise by attenuating the signal at frequencies where its intensity is strong and where the noise can be more easily masked.
  • the most common perceptual weighting filters used in narrowband CELP coding are of the form ⁇ (z / y ⁇ ) / ⁇ (z / y 2 ) where 0 ⁇ 2 ⁇ ⁇ i ⁇ 1 and ⁇ (z) represents the spectrum LPC of a signal segment of length 5 to 30 ms.
  • the synthesis analysis in CELP coding thus amounts to minimizing the quadratic error in a signal domain perceptually weighted by this type of filter.
  • the technical problem to be solved by the object of the present invention is to propose a perceptual weighting device for encoding / decoding an audio signal in a given frequency band, which would make it possible to carry out a full perceptual weighting filtering.
  • band that is to say on the whole of said given frequency band, in particular the 0-8000 Hz wide band of a hierarchical audio coder, without this operation leading to long and resource-intensive calculations.
  • the solution to the technical problem posed consists, according to the present invention in that, said coding / decoding being carried out in a plurality of adjacent subbands in said given frequency band, said device comprises, in at least one subband, a perceptual weighting filter with gain compensation adapted to achieve the spectral continuity between the output signal of said gain-compensated perceptual weighting filter and the signals in the sub-bands adjacent to said sub-band.
  • the perceptual weighting device performs the desired filtering in one or more subbands and not in the overall coding / decoding band, which limits the complexity of the calculations.
  • the possible disparity of the gains of perceptual weighting filtering from one subband to another is solved thanks to a gain compensation which ensures the spectral continuity over the entire width of the frequency band.
  • the invention therefore makes it possible to obtain a homogeneous band at the output of the perceptual weighting filtering even if the subbands that constitute it have been treated separately from this point of view.
  • each subband can be filtered or not by perceptual weighting.
  • the spectral continuity can therefore be ensured between a filtered sub-band and another unfiltered, or between two filtered subbands.
  • said gain-compensated perceptual weighting filter comprises a perceptual weighting filter and a gain compensation module.
  • said perceptual weighting filter with gain compensation comprises a perceptual weighting filter incorporating said gain compensation.
  • said perceptual weighting filter in the first subband is of the form ((z / y ⁇ ) / ((z / y) where ((z) represents a linear prediction filter.
  • the invention proposes that said gain compensation multiplies by a factor / ⁇ c equal to:
  • the invention also relates to a hierarchical audio encoder in a frequency band decomposed into a first and a second adjacent subbands, said encoder comprising: a heart coder for coding an original signal in the first subband of said frequency band,
  • a stage for calculating a residual signal from said original signal and the signal coming from said core coder a device for perceptually weighting said residual signal, characterized in that said perceptual weighting device comprises a perceptual weighting filter with compensation. gain circuit adapted to achieve the spectral continuity between the output signal of said perceptual weighting filter with gain compensation and the signal in the second subband.
  • only the first subband is subject to perceptual weighting filtering, the second subband not being filtered.
  • said gain-compensated perceptual weighting filter comprises a perceptual weighting filter in the first sub-band
  • the invention provides that said perceptual weighting filter in the first subband is of the form ⁇ (z / y ⁇ ) / ⁇ (z / y 2 ) where A 1 (Z) represents a linear prediction filter.
  • said gain compensation in the first subband performs a multiplication by a factor / ⁇ c ⁇ equal to:
  • the signal from the perceptual weighting device in the first subband and the original signal in the second subband are respectively applied to transform analysis modules, and said transform analysis modules are connected to a transform encoder in said frequency band.
  • said encoder also comprises a device for perceptual weighting of the original signal in the second subband, comprising a perceptual weighting filter with gain compensation able to achieve the spectral continuity between the output signal of said perceptual weighting filter with gain compensation and the output signal of the device of perceptual weighting in the first sub-band.
  • said perceptual weighting filter with gain compensation comprises a perceptual weighting filter in the second band
  • said perceptual weighting filter in the second subband is of the form 2 2 (z / y'i ) / 2 (z / y 'where $ ⁇ 2 (z) represents a linear prediction filter in this case
  • said gain compensation in the second subband performs a multiplication by a factor / ⁇ c2 equal to.:
  • the signal from the perceptual weighting device in the first subband and the signal from the perceptual weighting device in the second subband are respectively applied to transform analysis modules, and said analysis modules to transformed are connected to a transform encoder in said frequency band.
  • the invention further relates to a hierarchical audio decoder in a frequency band decomposed into first and second adjacent sub-bands, said decoder comprising: - a core decoder for decoding in the first sub-band of said frequency band a received signal encoded by the encoder according to the invention, - a device for inverse perceptual weighting of a signal representative of the weighted residual signal in the first sub-band by the perceptual weighting device of said encoder, characterized in that said inverse perceptual weighting device comprises a perceptual weighting filter with gain compensation, inverse of the perceptual weighting filter with gain compensation of the encoder in the first subband.
  • said decoder also comprises an inverse perceptual weighting device of the decoded signal in the second subband, comprising a perceptual weighting filter with gain compensation, inverse of the perceptual weighting filter with gain compensation of the encoder in the second subband.
  • said gain-compensated perceptual weighting filter comprises a perceptual weighting filter in the second band
  • said gain-compensated inverse perceptual weighting filter comprises an inverse perceptual weighting filter in the second band. subband.
  • said inverse perceptual weighting filter in the second subband is of the form
  • the coefficients of the linear prediction filter ⁇ 2 (z) are provided by a band extension module.
  • the invention further relates to a perceptual weighting method for encoding an audio signal in a given frequency band, wherein said encoding is performed in a plurality of adjacent subbands in said given frequency band, said method comprises, in at least one sub-band, a perceptual weighting step with gain compensation adapted to achieve the spectral continuity between the signal from said perceptual weighting step with gain compensation and the signals in the adjacent subbands to said sub-band.
  • the invention relates to a perceptual weighting method for decoding an audio signal encoded in a given frequency band in accordance with the perceptual weighting method for encoding said signal, which is remarkable in that said method comprises - band, a perceptual weighting step with gain compensation, inverse of said perceptual weighting step with gain compensation.
  • FIG. 1 is a diagram of a hierarchical audio coder of the prior art, comprising a full-band perceptual weighting filter before transform coding.
  • FIG. 2 is a high-level diagram of a hierarchical audio coder according to the invention.
  • FIG. 3 is a diagram of the perceptual weighting device of the encoder of FIG. 2.
  • FIG. 4 is a spectrum giving the amplitude of a filtered and gain-compensated signal according to the invention in a first sub-band and the amplitude of an unfiltered signal in a second sub-band.
  • FIG. 5 is a high-level diagram of a hierarchical audio decoder according to the invention.
  • FIG. 6 is a diagram of a variant of the hierarchical audio coder of FIG. 2.
  • FIG. 7 is a diagram of a variant of the hierarchical audio decoder of FIG.
  • FIG. 8 is a spectrum giving the amplitude of a filtered signal then gain-compensated according to the invention in a first sub-band and the amplitude of a filtered signal then equalized according to the invention in a second sub-band .
  • FIG. 2 shows a subband audio coder at rates ranging from 8 to 32 kbit / s. This figure gives the different steps of the corresponding coding method.
  • the input signal in a so-called “extended” 50 to 7000 Hz frequency band sampled at 16 kHz is first decomposed into 2 adjacent subbands by QMF quadrature mirror filtering ("Quadrature Mirror").
  • the first sub-band, or low band, from 0 to 4000 Hz is obtained by low-pass filtering L 300 and decimation 301, and the second sub-band. band, or high band, from 4000 to 8000 Hz by high-pass filtering H 302 and decimation 303.
  • the filters L 300 and H 302 are of length 64 and conform to those described in the article of J. Johnston, ICASSP, vol. 5, pp. 291-294, 1980.
  • the first sub-band is pre-processed by a high-pass filter 304 eliminating the components below 50 Hz before coding by a narrow-band CELP 305 core coder.
  • the high-pass filtering takes into account the fact that the broadband is defined as covering the range 50-7000 Hz.
  • the narrow-band CELP coding corresponds to that described in Figure 1; it is a cascaded CELP coding comprising as a first stage a modified G.729 coding (ITU-T G.729 Recommendation, Coding of Speech at 8 kbps using Conjugate Structure Algebraic Code Excited Linear Prediction (CS-ACELP ), March 1996) without a pre-processing filter, and as a second stage an additional fixed dictionary.
  • CS-ACELP Conjugate Structure Algebraic Code Excited Linear Prediction
  • the residual signal e related to the error due to the CELP coding is calculated by the stage 306 and then perceptually weighted by a device 307 comprising a perceptual weighting filter to obtain the signal x 1 o in the time domain.
  • This signal is analyzed by Modified Discrete Cosine Transform (MDCT) 308 to obtain the discrete spectrum X 1 o in the frequency domain.
  • MDCT Modified Discrete Cosine Transform
  • the device 307 for perceptual weighting is shown in FIG. 3.
  • This device W 1 (Z) comprises a perceptual weighting filter ⁇ (z / y ⁇ ) / ⁇ (z / y 2 ) comprising the filter stages 501 and 502 respectively by A 1 (ZZy 1 ) and 1 / A 1 (ZZy 2 ).
  • the linear prediction filter A 1 (Z) is derived from narrowband CELP coding.
  • the perceptual weighting device 307 also comprises a gain compensation module 503 for multiplying the perceptually weighted signal from the filter 501, 502 by the factor / ⁇ ci defined by:
  • the second subband, or high band is first unfolded spectrally 309 to compensate for the folding due to high pass filter 302 combined with decimation 303.
  • This high band is then pre-processed by a low pass filter 310 eliminating the components between 7000 and 8000 Hz in the original signal.
  • the resulting signal xu in the time domain is transformed by MDCT 311 to obtain the discrete spectrum s in the frequency domain.
  • a band extension 312 is made from x M elX M.
  • the MDCT transformation is implemented using P. Duhamel's algorithm. , Y. Mahieux, JP Small, A Fast Algorithm for the Implementation of Aliasing Cancellation ', ICASSP, vol. 3, pp.2209-2212, 1991.
  • the low band MDCT and high band X ⁇ o and Xu spectra are encoded in the transform coding module 313.
  • the different bitstreams generated by the coding modules 305, 312 and 313 are multiplexed and structured into a hierarchical bitstream in the multiplexer 314.
  • the coding is performed by sample blocks (or frames) of 20 ms, ie 320 samples.
  • the coding rate is 8, 12, 14 to 32 kbit / s.
  • FIG. 4 shows the decomposition of the total frequency band into a first sub-band, the low band between 0 and 4 kHz, and a second sub-band, the high band between 4 and 8 kHz.
  • the MDCT encoder 313 applies to these two sub-bands with:
  • FIG. 5 This figure illustrates the decoding steps of the signal coded by said encoder.
  • the bits describing each frame of 20 ms are demultiplexed in the demultiplexer 700.
  • a decoding operation of 8 to 32 kbit / s is presented, although in practice the bit stream can be truncated to 8, 12, 14 or between 14 and 32 kbit / s.
  • the bit stream of the 8 and 12 kbit / s layers is used by the CELP decoder 701 to generate a first synthesis in the first subband, or narrow band, between 0 and 4000 Hz.
  • the portion of the bit stream associated with the layer at 14 kbit / s is decoded by the band extension module 702 and the signal obtained in the second subband, or high band, between 4000 and 7000 Hz is converted by MDCT 703 into an X h ⁇ spectrum.
  • Decoding MDCT 704 generates from the bit stream associated with the bit rates of 14 to 32 kbit / s a reconstructed spectrum X 10 Qn low band and a reconstructed spectrum X h , in high band.
  • the extended band output signal is obtained via a bank of QMF synthesis filters which perform the oversampling operations 710 and 712, low-pass filtering 711 and high-pass filtering. 713 and addition 714.
  • a perceptual decoding step with gain compensation is performed by the inverse perceptual weighting device 707 Wi (z) ⁇ ⁇ comprising an inverse perceptual weighting filter ⁇ i (z / ⁇ 2 ) / ⁇ i (z / ⁇ i) and a modulus of gain compensation for multiplying the signal from said inverse perceptual weighting filter by the factor 1 / faci with:
  • the t are the coefficients of the filter A 1 (Z) resulting from the CELP coding in narrow band.
  • the coefficients ⁇ are kept constant in each 5 ms subframe.
  • FIG. 2 An alternative embodiment of the encoder of FIG. 2 is shown in FIG.
  • the perceptual weighting device 912 with highband gain compensation W 2 (z) takes the same form as the filter W 1 (Z) in the low band. It is therefore a filter of type ⁇ 2 (z / y 'JZA 2 (ZZy' 2 ) followed by a gain compensation factor fac 2 defined as:
  • fac 2 IAA 2 (ZZf 1 ) ZA 2 (ZZf 2 )
  • I for z 1, ie the frequency 0 Hz or DC component in the high band which corresponds to 4 kHz once this frequency is returned to the input signal before QMF filtering.
  • the MDCT coder applies to these two sub-bands with:
  • the gain compensation in low and high bands by the fac et et 2 factors respectively ensure a continuity of the responses of the 4 kHz filters. It is this continuity which then makes it possible to code the two discrete spectra X ⁇ o and X h , into a single vector X. Again, it is important to note that the value 0 dB used here to define the continuity between low and high bands n is indicative.
  • the hierarchical audio decoder corresponding to this variant is described in FIG. 7. Compared to the decoder of the previous embodiment, the only difference consists in recovering the quantized LPC coefficients, ⁇ 2 (z), used by the module 1002.
  • the inverse filter W 2 (Z) '1 in the high band is of type ⁇ 2 (z / y' 2 ) / ⁇ 2 (z / y'i) followed by the gain compensation factor l / fac 2 wherefac 2 a has been defined above.
  • the invention furthermore covers a computer program comprising a sequence of instructions stored on a medium for execution by a computer or a dedicated device, which is remarkable in that, during the execution of these instructions, the latter executes the method of perceptual weighting object of the invention for coding and / or decoding.
  • the aforementioned computer program is for example a directly executable program implanted in a perceptual weighting device object of the invention. It is understood that the invention is not limited to the only embodiments that have just been described. In particular, it will be noted that
  • the numerical values of the adjustable parameters ⁇ lt ⁇ 2 , ⁇ 'i and y ' 2 may be different from those chosen above,
  • the fac compensation factor can be applied before filtering by A (ZZy 1 ) ZA (ZZy 2 ) or between the filters A (ZZy 1 ) and ⁇ (zZ ⁇ 2 ) or else integrated into one of the filters A (ZZy 1 ) or ⁇ (zZy 2 ). It is the same for the factor / ⁇ c 2 and the corresponding inverse filters, the perceptual weighting filter is not necessarily of the form ((z / ⁇ i ) / ((z / ⁇ 2 ),
  • the number of sub-bands defined in the total frequency band may be greater than 2.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
EP07731586A 2006-02-14 2007-02-07 Vorrichtung für wahrnehmungsgewichtung bei der tonkodierung/-dekodierung Not-in-force EP1989706B1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0650538 2006-02-14
PCT/FR2007/050760 WO2007093726A2 (fr) 2006-02-14 2007-02-07 Dispositif de ponderation perceptuelle en codage/decodage audio

Publications (2)

Publication Number Publication Date
EP1989706A2 true EP1989706A2 (de) 2008-11-12
EP1989706B1 EP1989706B1 (de) 2011-10-26

Family

ID=36952401

Family Applications (1)

Application Number Title Priority Date Filing Date
EP07731586A Not-in-force EP1989706B1 (de) 2006-02-14 2007-02-07 Vorrichtung für wahrnehmungsgewichtung bei der tonkodierung/-dekodierung

Country Status (7)

Country Link
US (1) US8260620B2 (de)
EP (1) EP1989706B1 (de)
JP (1) JP5117407B2 (de)
KR (1) KR101366124B1 (de)
CN (1) CN101385079B (de)
AT (1) ATE531037T1 (de)
WO (1) WO2007093726A2 (de)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7461106B2 (en) 2006-09-12 2008-12-02 Motorola, Inc. Apparatus and method for low complexity combinatorial coding of signals
GB2448201A (en) * 2007-04-04 2008-10-08 Zarlink Semiconductor Inc Cancelling non-linear echo during full duplex communication in a hands free communication system.
US8576096B2 (en) * 2007-10-11 2013-11-05 Motorola Mobility Llc Apparatus and method for low complexity combinatorial coding of signals
US8209190B2 (en) * 2007-10-25 2012-06-26 Motorola Mobility, Inc. Method and apparatus for generating an enhancement layer within an audio coding system
US20090234642A1 (en) * 2008-03-13 2009-09-17 Motorola, Inc. Method and Apparatus for Low Complexity Combinatorial Coding of Signals
US8639519B2 (en) * 2008-04-09 2014-01-28 Motorola Mobility Llc Method and apparatus for selective signal coding based on core encoder performance
JP5551694B2 (ja) * 2008-07-11 2014-07-16 フラウンホッファー−ゲゼルシャフト ツァ フェルダールング デァ アンゲヴァンテン フォアシュンク エー.ファオ 多くのスペクトルエンベロープを計算するための装置および方法
BRPI0910511B1 (pt) * 2008-07-11 2021-06-01 Fraunhofer-Gesellschaft Zur Forderung Der Angewandten Forschung E.V. Aparelho e método para decodificar e codificar um sinal de áudio
KR101170466B1 (ko) 2008-07-29 2012-08-03 한국전자통신연구원 Mdct 영역에서의 후처리 방법, 및 장치
WO2010032992A2 (ko) * 2008-09-18 2010-03-25 한국전자통신연구원 Mdct기반의 코너와 이종의 코더간 변환에서의 인코딩 장치 및 디코딩 장치
FR2938688A1 (fr) * 2008-11-18 2010-05-21 France Telecom Codage avec mise en forme du bruit dans un codeur hierarchique
US8219408B2 (en) * 2008-12-29 2012-07-10 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
US8175888B2 (en) * 2008-12-29 2012-05-08 Motorola Mobility, Inc. Enhanced layered gain factor balancing within a multiple-channel audio coding system
US8200496B2 (en) * 2008-12-29 2012-06-12 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
US8140342B2 (en) * 2008-12-29 2012-03-20 Motorola Mobility, Inc. Selective scaling mask computation based on peak detection
CA2780962C (en) * 2009-11-19 2017-09-05 Telefonaktiebolaget L M Ericsson (Publ) Methods and arrangements for loudness and sharpness compensation in audio codecs
US8423355B2 (en) * 2010-03-05 2013-04-16 Motorola Mobility Llc Encoder for audio signal including generic audio and speech frames
US8428936B2 (en) * 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
CN102223527B (zh) * 2010-04-13 2013-04-17 华为技术有限公司 频带加权量化编解码方法和装置
KR101747917B1 (ko) 2010-10-18 2017-06-15 삼성전자주식회사 선형 예측 계수를 양자화하기 위한 저복잡도를 가지는 가중치 함수 결정 장치 및 방법
FR2969360A1 (fr) * 2010-12-16 2012-06-22 France Telecom Codage perfectionne d'un etage d'amelioration dans un codeur hierarchique
US9037456B2 (en) * 2011-07-26 2015-05-19 Google Technology Holdings LLC Method and apparatus for audio coding and decoding
JP5737077B2 (ja) * 2011-08-30 2015-06-17 富士通株式会社 オーディオ符号化装置、オーディオ符号化方法及びオーディオ符号化用コンピュータプログラム
US8712076B2 (en) 2012-02-08 2014-04-29 Dolby Laboratories Licensing Corporation Post-processing including median filtering of noise suppression gains
US9173025B2 (en) 2012-02-08 2015-10-27 Dolby Laboratories Licensing Corporation Combined suppression of noise, echo, and out-of-location signals
US9129600B2 (en) 2012-09-26 2015-09-08 Google Technology Holdings LLC Method and apparatus for encoding an audio signal
FR3008533A1 (fr) * 2013-07-12 2015-01-16 Orange Facteur d'echelle optimise pour l'extension de bande de frequence dans un decodeur de signaux audiofrequences
EP3503095A1 (de) 2013-08-28 2019-06-26 Dolby Laboratories Licensing Corp. Hybride wellenformcodierte und parametercodierte spracherweiterung
FR3011408A1 (fr) * 2013-09-30 2015-04-03 Orange Re-echantillonnage d'un signal audio pour un codage/decodage a bas retard
CN107113357B (zh) 2014-12-23 2021-05-28 杜比实验室特许公司 与语音质量估计相关的改进方法和设备
WO2017050398A1 (en) 2015-09-25 2017-03-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Encoder, decoder and methods for signal-adaptive switching of the overlap ratio in audio transform coding
EP3288031A1 (de) 2016-08-23 2018-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und verfahren zur codierung eines audiosignals mit einem kompensationswert
US20190051286A1 (en) * 2017-08-14 2019-02-14 Microsoft Technology Licensing, Llc Normalization of high band signals in network telephony communications
CN113196387B (zh) * 2019-01-13 2024-10-18 华为技术有限公司 一种用于音频编解码的计算机实现的方法和电子设备

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
JP3139602B2 (ja) * 1995-03-24 2001-03-05 日本電信電話株式会社 音響信号符号化方法及び復号化方法
FR2734389B1 (fr) * 1995-05-17 1997-07-18 Proust Stephane Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme
US5778335A (en) * 1996-02-26 1998-07-07 The Regents Of The University Of California Method and apparatus for efficient multiband celp wideband speech and music coding and decoding
KR100261253B1 (ko) * 1997-04-02 2000-07-01 윤종용 비트율 조절이 가능한 오디오 부호화/복호화 방법및 장치
US6182031B1 (en) * 1998-09-15 2001-01-30 Intel Corp. Scalable audio coding system
DE60035453T2 (de) * 1999-05-11 2008-03-20 Nippon Telegraph And Telephone Corp. Auswahl des Synthesefilters für eine CELP Kodierung von breitbandigen Audiosignalen
US6691082B1 (en) 1999-08-03 2004-02-10 Lucent Technologies Inc Method and system for sub-band hybrid coding
US6446037B1 (en) * 1999-08-09 2002-09-03 Dolby Laboratories Licensing Corporation Scalable coding method for high quality audio
CA2290037A1 (en) * 1999-11-18 2001-05-18 Voiceage Corporation Gain-smoothing amplifier device and method in codecs for wideband speech and audio signals
WO2001075759A1 (en) 2000-03-27 2001-10-11 Russell Randall A School commerce system and method
EP1287521A4 (de) 2000-03-28 2005-11-16 Tellabs Operations Inc Wahrnehmungsbezogene spektrale gewichtung von frequenzbändern für die adaptive rauschlöschung
US6523003B1 (en) * 2000-03-28 2003-02-18 Tellabs Operations, Inc. Spectrally interdependent gain adjustment techniques
DE60230925D1 (de) * 2001-12-25 2009-03-05 Ntt Docomo Inc Signalcodierung
US7283966B2 (en) * 2002-03-07 2007-10-16 Microsoft Corporation Scalable audio communications utilizing rate-distortion based end-to-end bit allocation
WO2003077235A1 (en) * 2002-03-12 2003-09-18 Nokia Corporation Efficient improvements in scalable audio coding
US7502743B2 (en) * 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
US20040098255A1 (en) * 2002-11-14 2004-05-20 France Telecom Generalized analysis-by-synthesis speech coding method, and coder implementing such method
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
US7676043B1 (en) * 2005-02-28 2010-03-09 Texas Instruments Incorporated Audio bandwidth expansion
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO2007093726A2 *

Also Published As

Publication number Publication date
JP5117407B2 (ja) 2013-01-16
KR101366124B1 (ko) 2014-02-21
WO2007093726A2 (fr) 2007-08-23
ATE531037T1 (de) 2011-11-15
US20090076829A1 (en) 2009-03-19
CN101385079A (zh) 2009-03-11
JP2009527017A (ja) 2009-07-23
CN101385079B (zh) 2012-08-29
EP1989706B1 (de) 2011-10-26
KR20080093450A (ko) 2008-10-21
WO2007093726A3 (fr) 2007-10-18
US8260620B2 (en) 2012-09-04

Similar Documents

Publication Publication Date Title
EP1989706B1 (de) Vorrichtung für wahrnehmungsgewichtung bei der tonkodierung/-dekodierung
EP1905010B1 (de) Hierarchischen Audio-kodierung/-dekodierung
EP2115741B1 (de) Fortgeschrittene kodierung/dekodierung von digitalen tonsignalen
EP2452337B1 (de) Zuweisung von bits bei einer verstärkten codierung/decodierung zur verbesserung einer hierarchischen codierung/decodierung digitaler tonsignale
EP1907812B1 (de) Verfahren zum umschalten der raten- und bandbreitenskalierbaren audiodecodierungsrate
EP2452336B1 (de) Verbesserte codierung/decodierung digitaler audiosignale
EP2366177B1 (de) Codieren eines audio-digitalsignals mit rauschtransformation in einem skalierbaren codierer
WO2007096551A2 (fr) Procede de codage binaire d'indices de quantification d'une enveloppe d'un signal, procede de decodage d'une enveloppe d'un signal et modules de codage et decodage correspondants
EP2239731B1 (de) Kodiervorrichtung, dekodiervorrichtung und verfahren dafür
EP2104936B1 (de) Transformationsködierung mit geringer verzögerung unter verwendung von gewichtgsfenstern
FR2897733A1 (fr) Procede de discrimination et d'attenuation fiabilisees des echos d'un signal numerique dans un decodeur et dispositif correspondant
EP2005424A2 (de) Verfahren zur nachverarbeitung eines signals in einem audiodecoder
EP2652735B1 (de) Verbesserte kodierung einer verbesserungsstufe bei einem hierarchischen kodierer
FR2737360A1 (fr) Procedes de codage et de decodage de signaux audiofrequence, codeur et decodeur pour la mise en oeuvre de tels procedes
FR2980620A1 (fr) Traitement d'amelioration de la qualite des signaux audiofrequences decodes
WO2013135997A1 (fr) Modification des caractéristiques spectrales d'un filtre de prédiction linéaire d'un signal audionumérique représenté par ses coefficients lsf ou isf

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20080911

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20100924

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

DAX Request for extension of the european patent (deleted)
GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 602007018217

Country of ref document: DE

Effective date: 20111222

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20111026

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20111026

REG Reference to a national code

Ref country code: AT

Ref legal event code: MK05

Ref document number: 531037

Country of ref document: AT

Kind code of ref document: T

Effective date: 20111026

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120226

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120227

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120127

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120126

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20120228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

26N No opposition filed

Effective date: 20120727

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120229

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602007018217

Country of ref document: DE

Effective date: 20120727

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20120206

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20111026

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20120207

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20070207

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 10

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20160121

Year of fee payment: 10

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20160127

Year of fee payment: 10

Ref country code: FR

Payment date: 20160121

Year of fee payment: 10

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 602007018217

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170207

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20171031

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170901

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170207