EP1907812B1 - Procede de commutation de debit en decodage audio scalable en debit et largeur de bande - Google Patents

Procede de commutation de debit en decodage audio scalable en debit et largeur de bande Download PDF

Info

Publication number
EP1907812B1
EP1907812B1 EP06779036A EP06779036A EP1907812B1 EP 1907812 B1 EP1907812 B1 EP 1907812B1 EP 06779036 A EP06779036 A EP 06779036A EP 06779036 A EP06779036 A EP 06779036A EP 1907812 B1 EP1907812 B1 EP 1907812B1
Authority
EP
European Patent Office
Prior art keywords
post
signal
rates
rate
processed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Not-in-force
Application number
EP06779036A
Other languages
German (de)
English (en)
French (fr)
Other versions
EP1907812A2 (fr
Inventor
Stéphane RAGOT
David Virette
Balazs Kovesi
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP1907812A2 publication Critical patent/EP1907812A2/fr
Application granted granted Critical
Publication of EP1907812B1 publication Critical patent/EP1907812B1/fr
Not-in-force legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Definitions

  • the present invention relates to a rate switching method for decoding an audio signal coded by a multi-rate audio coding system and more particularly a scalable audio scalability and possibly bandwidth encoding system. It also relates to an application of said method to a bit rate and bandwidth scalable audio decoding system and a bandwidth scalable and scalable audio decoder.
  • the invention finds a particularly advantageous application in the field of the transmission of speech and / or audio signals over voice-over-IP packet networks, in order to provide a quality which can be modulated according to the capacity of the transmission channel.
  • the method according to the invention makes it possible to obtain non-artifact transitions between the different bit rates of a scalable audio encoder / decoder (codec) in bandwidth and bandwidth, especially in the case of transitions between the telephone band and the band.
  • codec scalable audio encoder / decoder
  • the broadband in the context of scalable bit rate and bandwidth audio coding with a telephone band core with rate dependent post-processing and one or more broadband enhancement layers.
  • the term “telephone band” or “narrow band” the frequency band located between 300 and 3400 Hz, while the term “broadband” is reserved for the band spreading from 50 to 7000 Hz.
  • Waveform coding methods, such as MIC or ADPCM (PCM or ADPCM), methods of "parametric analysis by synthesis analysis” such as CELP coding ("Code Excited Linear Prediction"), and methods of "perceptual coding in subbands or by transform".
  • PCM or ADPCM PCM or ADPCM
  • CELP coding Code Excited Linear Prediction
  • perceptual coding in subbands or by transform a post-processing is generally used to improve the quality.
  • This post-processing typically includes adaptive post-filtering and high-pass filtering.
  • the encoder In conventional speech coding, the encoder generates a fixed rate bit stream. This fixed rate constraint simplifies the implementation and use of the encoder and the decoder. Examples of such systems are given by the G.711 coding at 64 kbit / s or the G.729 coding at 8 kbit / s
  • Rate switching is easy to achieve if the coding is based on all the bit rates on the representation by the same coding model of a signal audio in the same bandwidth.
  • the signal is defined in a telephone band (300-3400 Hz) and the coding is based on the ACELP model ("Algebraic Code Excited Linear Prediction"), except for the generation of noise. comfort, which is nevertheless achieved by a model of the LPC type ("Linear Predictive Coding") compatible with the ACELP model.
  • the AMR-NB coding conventionally uses a post-processing in the form of an adaptive post-filtering and a high-pass filtering, the coefficients of the adaptive post-filtering being dependent on the decoding bit rate.
  • no precautions are taken to deal with potential problems related to the use of variable post-processing parameters depending on the rate.
  • AMR-WB wide band CELP coding does not use post-processing, mainly for reasons of complexity.
  • Flow switching is even more problematic in scalable audio scalability and bandwidth encoding. Indeed, in this case the coding is based on different models and bandwidths depending on the rate.
  • the bit stream comprises a base layer and one or more enhancement layers.
  • the base layer is generated by a fixed low rate codec, termed a "core codec", which guarantees the minimum quality of the coding.
  • This layer must be received by the decoder to maintain an acceptable level of quality. Improvement layers are used to improve quality. If they are all sent by the coder, it may happen that they are not all received by the decoder.
  • the main advantage of hierarchical coding is that it allows an adaptation of the bit rate by simple truncation of the bit stream.
  • the number of layers namely the number of possible truncations of the bit stream, defines the granularity of the coding.
  • Hierarchical coding techniques that are scalable in rate and bandwidth with a CELP heart-type coder in a telephone band and one or more broadband enhancement layer (s). Examples of such systems are given in H. Taddei et al., Scalable Three Bitrate (8, 14.2 and 24 kbit / s) Audio Coder; 107th AES Convention, 1999 with a high granularity of 8, 14.2 and 24 kbit / s, and in B. Kovesi, D. Massaloux, A. Sollaud, A scalable speech and audio coding scheme with continuous bitrate flexibility, ICASSP 2004 with fine granularity from 6.4 to 32 kbit / s, or the MPEG-4 CELP coding.
  • international demand WO 02/060075 discloses an optimized decimation system for converting the enlarged band to the telephone band.
  • the process proposed in the international application WO 01/48931 is in fact a band extension technique which consists in generating a pseudo-wide band signal from a telephone band signal, in particular by extracting a "spectral profile".
  • Similar techniques known from the prior art mainly address the problems related to the switching of the broadband to the telephone band seeking to avoid band reduction by the use of a band extension technique without transmission of information for generating an expanded band signal from the received bandband signal. It should be noted that these methods do not seek to really control the transition between bandwidths and that they also have the disadvantage of to rely on band extension techniques whose quality is very variable and which can not therefore ensure stable output quality.
  • a post-processing is performed on the decoding during transitions to simulate a continuous variation of the bandwidth.
  • the technical problem to be solved by the object of the present invention is to propose a method of switching the rate at the decoding of an audio signal coded by a multi-rate audio coding system, said decoding comprising at least one step of rate-dependent post-processing, which would make it possible to process the transitions between different rates for which post-processing is used according to the decoding rate, so as to eliminate the particularly sensitive artefacts during rapid rate variations at decoding.
  • a post-processing introduces a phase shift on the signal, and the use of two different post-treatments involves phase continuity problems during transitions.
  • the invention also relates to a computer program comprising code instructions for implementing the method according to the invention when said program is executed by a computer.
  • the invention further relates to an application of the method according to the invention to an audio scalable scalable audio decoding system.
  • the invention further relates to an application of the method according to the invention to a bit rate and bandwidth scalable audio decoding system in which the initial bit rate is obtained by at least a first decoding layer in a first frequency band, and the final rate is obtained by at least one second decoding layer, called the extension layer of said first frequency band in a second frequency band, the post-processing step being applied to the decoding performed at the initial rate.
  • the invention further relates to an application of the method according to the invention to a bit rate and bandwidth scalable audio decoding system in which the final bit rate is obtained by at least a first decoding layer in a first frequency band, and the initial rate is obtained by at least one second decoding layer, called the extension layer of said first frequency band in a second frequency band, the post-processing step being applied to the decoding performed at the final rate.
  • extended band is that of the "enlarged band” defined above, said first band being in this case the telephone band.
  • the invention also relates to a multi-rate audio decoder as claimed in claim 10.
  • the invention is now described in the context of a scalable audio codec in bit rate and bandwidth.
  • the scalable bandwidth and bandwidth coding structure considered herein has a CELP coder in the form of a telephone band, a particular case of which uses the G.729A coder as described in ITU-T G729 Recommendation, Coding of Speech at 8 kbps using Conjugate Structure Algebraic Excited Linear Prediction Code (CS-ACELP), March 1996 and in R. Salami et al., Description of ITU-T Recommendation G.729 Annex A: 8 kbit / s Reduced Complexity CS-ACELP codec, ICASSP 1997 .
  • CELP core coding In the CELP core coding, three enhancement stages are added, namely an improvement in CELP coding in a telephone band, a band extension and a transform predictive coding.
  • the flow switching considered here will involve switching between the telephone band and the enlarged band and vice versa.
  • the figure 1 gives a diagram of the encoder used.
  • a 50-7000 Hz bandwidth audio signal sampled at 16 kHz is cut into frames of 320 samples, or 20 ms.
  • a high-pass filtering 101 of 50 Hz cut-off frequency is applied to the input signal.
  • the resulting signal, called S WB is reused in several branches of the encoder.
  • a low pass filtering and a two subsampling, 102, of 16 to 8 kHz are applied to the signal S WB .
  • This operation makes it possible to obtain a sampled telephone band signal at 8 kHz.
  • This signal is processed by the heart encoder 103, according to a CELP coding.
  • This coding corresponds here to the G.729A coder, which generates the core of the bit stream with a bit rate of 8 kbit / s.
  • a first enhancement layer introduces a second CELP coding stage 103.
  • This second stage consists of an innovative dictionary that enriches the CELP excitation and offers a quality improvement, especially on unvoiced sounds.
  • the rate of this second coding stage is 4 kbit / s and the associated parameters are the positions and the signs of the pulses as well as the gain of the associated innovative dictionary for each subframe of 40 samples (5 ms at 8 kHz).
  • the decoding of the core coder and the first enhancement layer are performed to obtain the synthesis signal 104 in a 12 kbit / s telephone band.
  • Over-sampling by two from 8 to 16 kHz and low-pass filtering 105 make it possible to obtain the sampled version at 16 kHz of the first two stages of the encoder.
  • the third enhancement layer makes it possible to switch to an enlarged band 106.
  • the input signal S WB can be pre-processed by a pre-emphasis filter. This filter makes it possible to better represent the high frequencies from the broadband linear prediction filter. To compensate for the effect of the pre-emphasis filter, a de-emphasis inverse filter is then used in the synthesis. An alternative to this coding and decoding structure will not use any pre-emphasis and de-emphasis filters.
  • the next step is to calculate and quantify the wideband linear prediction filters.
  • the order of the linear prediction filter is 18, but in a variant, a lower prediction order will be chosen, for example 16.
  • the linear prediction filter can be calculated by the autocorrelation method and the algorithm of Levinson-Durbin.
  • This broadband linear prediction filter A WB (z) is quantized using a prediction of these coefficients from the NB (z) filter from the telephone band core encoder.
  • the coefficients can then be quantized using, for example, a multistage vector quantization and using the LSF (Line Spectrum Frequency) parameters dequantized from the telephone band core encoder as described in FIG. H. Ehara, T. Morii, M. Oshikiri and K. Yoshida, Predictive VQ for scalable bandwidth LSP quantization, ICASSP 2005 .
  • the excitation in broadband is obtained from the parameters of the telephone band excitation of the core encoder: the fundamental period delay or "pitch", the associated gain as well as the algebraic excitations of the core coder and the first layer of enrichment of CELP excitation and associated gains.
  • This excitation is generated by using an oversampled version of the parameters of the excitation of the telephone band stages.
  • This excitation in broadband is then shaped by the synthesis filter ⁇ WB (Z) calculated previously.
  • the de-emphasis filter is applied to the output signal of the synthesis filter.
  • the signal obtained is an expanded band signal which is not adjusted in energy.
  • high-pass filtering is applied to the broadband synthesis signal.
  • the same high-pass filter is applied to the error signal corresponding to the difference between the original delayed signal and the synthesis signal of the two previous stages.
  • This gain is calculated by a ratio of energy between the two signals.
  • the quantized gain g WB is then applied to the signal S 14 WB per subframe of 80 samples (5 ms at 16 kHz), the signal thus obtained is added to the synthesis signal of the preceding stage to create the signal in an enlarged band. corresponding to the bit rate of 14 kbit / s.
  • the further coding is performed in the frequency domain using a transform predictive coding scheme.
  • TDAC Time Domain Aliasing Cancellation
  • a Modified Discrete Cosine Transform (or MDCT) is applied, on the one hand, 110, on blocks of 640 samples of the weighted input signal with an overlap of 50% (refresh of the MDCT analysis every 20 ms), and, on the other hand, 112, on the weighted synthesis signal from the previous 14 kbit / s bandwidth stage (same block length and same overlay rate).
  • the MDCT spectrum to be encoded, 113 corresponds to the difference between the weighted input signal and the 14 kbit / s synthesis signal for the 0 to 3400 Hz band, and the 3400 Hz to 7000 weighted input signal. Hz.
  • the spectrum is limited to 7000 Hz by setting the last 40 coefficients to zero (only the first 280 coefficients are coded).
  • the spectrum is divided into 18 bands: a band of 8 coefficients and 17 bands of 16 coefficients.
  • the energy of the MDCT coefficients is calculated (scale factors).
  • the 18 scale factors constitute the spectral envelope of the weighted signal which is then quantized, coded and transmitted in the frame.
  • the figure 3 shows the format of the binary train.
  • the dynamic bit allocation is based on the energy of the spectrum bands from the dequantized version of the spectral envelope. This makes it possible to have compatibility between the bit allocation of the encoder and the decoder.
  • the normalized MDCT coefficients (fine structure) in each band are then quantized by vector quantizers using size and dimension nested dictionaries, the dictionaries being composed of a permutation code union as described in C. Lamblin et al. , Vector quantization in variable size and resolution, patent PCT FR 04 00219 , 2004 .
  • the information on the core coder, the CELP enrichment stage in the telephone band, the broadband CELP stage and finally the spectral envelope and the standardized coded coefficients are multiplexed and transmitted in a frame.
  • the figure 2 represents a block diagram of the decoder associated with the encoder of the figure 1 .
  • An inverse MDCT is then applied to the decoded MDCT coefficients 220, and filtering by the weighted synthesis filter 221 provides the output signal.
  • Block 205 represents a "cross-fade” module
  • the post-processing 203, 204 in the broad sense which is part of the G.729A decoder is applied in telephone band, before over-sampling.
  • this post-processing is not activated because, at the encoder, the encoding of the higher floors has been calculated from the version without post-processing of the telephone band.
  • Post-processing, 203 and 204 introduces a phase shift of the signal.
  • a smooth transition must be ensured.
  • the figure 4 describes the realization of block 205 which provides this slow transition between the post-processed and non-post-processed telephone band signal by applying cross-fades.
  • Step 401 examines whether the current frame is a voice band frame or not, that is, whether the current frame rate is 8 or 12 kbit / s.
  • a step 402 is called to check whether the previous frame was post-processed or not in the telephone band (which amounts to checking whether the bit rate of the previous frame was 8-12 kbit / s or not) .
  • the non-post-processed signal S 1 is copied into the signal S 3 .
  • the signal S 3 will contain the result of a cross-fade, where the weight of the non-post-processed component S 1 increases while the weight of the post-filtered component S 2 decreases.
  • Step 404 is followed by step 405 which updates the prevPF flag with the value 0.
  • step 406 it is checked whether in the previous frame the post-processing was active or not in the telephone band.
  • step 408 the post-processed signal S 2 is copied into the signal S 3 .
  • the signal S 3 is calculated, in step 407, as the result of a cross-fade, where this time the weight of the non-post-processed component S 1 decreases while the weight of the post-treated component S 2 increases.
  • step 409 is called to update the prevPF flag with the value 1.
  • the effective bandwidth of the final output of the decoder is the telephone band (signal S 1 ).
  • a post-processing is applied in telephone band, before over-sampling.
  • the post-processing used for rates of 8 or 12 kbit / s and the post-processing used for rates greater than or equal to 14 kbit / s introduce signal phase differences different from each other.
  • This slow transition between the telephone band signals with the different post-treatments is carried out by applying cross-fades (which give the signal S 3 ).
  • the post-processed signal S2 is copied into the signal S3.
  • the signal S3 is calculated as the result of a crossfade, where this time the weight of the post-processed component S1 decreases while the weight of the post-treated component S2 increases.
  • Block 209 calculates the broadband linear prediction filters required for the band extension and transform prediction decoding stages. This calculation is necessary in the case where only the telephone band portion of the bitstream of a frame is received after having received an expanded band frame and it is desired to carry out a band extension in order to maintain the band. band effect.
  • a set of LSF is extrapolated from the LSF of the telephone band core decoder. One can for example evenly distribute 8 LSF on the band between the last LSF from the telephone band and the Nyquist frequency. This allows the linear prediction filter to be stretched to a flat amplitude response filter for high frequencies.
  • Block 213 realizes the gain adaptation used for the band extension according to the present invention.
  • the organizational charts corresponding to this block are described in figures 5 and 7 .
  • the principle of adaptive attenuation of gain applied to the high band is described in figure 5 .
  • the calculation of the gain of the first broadband decoding layer is done, 501, according to two possibilities.
  • the gain is obtained by decoding 503.
  • a extrapolation of the gain associated with this decoding layer is carried out, 502. For example, it is possible to calculate the gain by aligning the energy of the low band of the broadband decoding stage with the actual decoding of the telephone band. previously realized.
  • a counter of the number of previously received wideband frames is updated, 504, according to the principle described in FIG. figure 7 .
  • this counter is used to parameterize the attenuation applied to the gain of the first wide band decoding stage, 505.
  • the figure 7 represents the flowchart of the count management of the number of received wideband frames.
  • the update of the counter is done as follows. If the current frame is an expanded band frame, then if the gain associated with the first wideband decode stage has been received (block 501 of the figure 5 ) and that the previous frame was also an expanded band frame, then the counter is incremented by 1 and saturated with the value MAX_COUNT_RCV. This value corresponds to the number of frames during which the broadband decoded signal will be attenuated when switching between a telephone bandwidth to an enlarged bandwidth.
  • the counter is set to 0. Otherwise, if the previous frame was an expanded band frame and the counter has a value less than MAX_COUNT_RCV, the counter is also set to 0. In all other cases, the counter remains at the value previous.
  • Block 219 performs the adaptive attenuation of the transform prediction coding enhancement layers according to the present invention as described in FIG. figure 6 .
  • This figure gives the flowchart of the adaptive attenuation procedure of the transform predictive decoding layer. Firstly, it is checked whether the spectral envelope of this layer has been totally received, 601. If this is the case, then an attenuation of the MDCT coefficients of correction of the low band 0-3500 Hz is carried out, 602, in using the received broadband frame counter and the attenuation table defined in the figure 9 .
  • the number of received broadband frames is monitored. If this number is less than MAX_COUNT_RCV, the MDCT coefficients corresponding to the first bandwidth broadband decoding stage with information transmission are used for the transform prediction decoding stage. On the other hand, if the counter has the maximum value, the procedure of upgrading the energy of the bands of the predictive decoding by transforming with the decoded spectral envelope is carried out.

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
EP06779036A 2005-07-22 2006-07-10 Procede de commutation de debit en decodage audio scalable en debit et largeur de bande Not-in-force EP1907812B1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0552286 2005-07-22
PCT/FR2006/050697 WO2007010158A2 (fr) 2005-07-22 2006-07-10 Procede de commutation de debit en decodage audio scalable en debit et largeur de bande

Publications (2)

Publication Number Publication Date
EP1907812A2 EP1907812A2 (fr) 2008-04-09
EP1907812B1 true EP1907812B1 (fr) 2010-12-01

Family

ID=36177265

Family Applications (1)

Application Number Title Priority Date Filing Date
EP06779036A Not-in-force EP1907812B1 (fr) 2005-07-22 2006-07-10 Procede de commutation de debit en decodage audio scalable en debit et largeur de bande

Country Status (10)

Country Link
US (1) US8630864B2 (ru)
EP (1) EP1907812B1 (ru)
JP (1) JP5009910B2 (ru)
KR (1) KR101295729B1 (ru)
CN (1) CN101263554B (ru)
AT (1) ATE490454T1 (ru)
DE (1) DE602006018618D1 (ru)
ES (1) ES2356492T3 (ru)
RU (1) RU2419171C2 (ru)
WO (1) WO2007010158A2 (ru)

Families Citing this family (39)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7461106B2 (en) 2006-09-12 2008-12-02 Motorola, Inc. Apparatus and method for low complexity combinatorial coding of signals
WO2008066071A1 (en) * 2006-11-29 2008-06-05 Panasonic Corporation Decoding apparatus and audio decoding method
EP2132732B1 (en) * 2007-03-02 2012-03-07 Telefonaktiebolaget LM Ericsson (publ) Postfilter for layered codecs
EP2128858B1 (en) * 2007-03-02 2013-04-10 Panasonic Corporation Encoding device and encoding method
US8599981B2 (en) 2007-03-02 2013-12-03 Panasonic Corporation Post-filter, decoding device, and post-filter processing method
US8576096B2 (en) * 2007-10-11 2013-11-05 Motorola Mobility Llc Apparatus and method for low complexity combinatorial coding of signals
US8209190B2 (en) * 2007-10-25 2012-06-26 Motorola Mobility, Inc. Method and apparatus for generating an enhancement layer within an audio coding system
BRPI0818927A2 (pt) * 2007-11-02 2015-06-16 Huawei Tech Co Ltd Método e aparelho para a decodificação de áudio
US9872066B2 (en) * 2007-12-18 2018-01-16 Ibiquity Digital Corporation Method for streaming through a data service over a radio link subsystem
DE102008009720A1 (de) * 2008-02-19 2009-08-20 Siemens Enterprise Communications Gmbh & Co. Kg Verfahren und Mittel zur Dekodierung von Hintergrundrauschinformationen
US20090234642A1 (en) * 2008-03-13 2009-09-17 Motorola, Inc. Method and Apparatus for Low Complexity Combinatorial Coding of Signals
US8639519B2 (en) * 2008-04-09 2014-01-28 Motorola Mobility Llc Method and apparatus for selective signal coding based on core encoder performance
PL3246918T3 (pl) * 2008-07-11 2023-11-06 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Dekoder audio, sposób dekodowania sygnału audio oraz program komputerowy
US20100057473A1 (en) * 2008-08-26 2010-03-04 Hongwei Kong Method and system for dual voice path processing in an audio codec
US20100063825A1 (en) * 2008-09-05 2010-03-11 Apple Inc. Systems and Methods for Memory Management and Crossfading in an Electronic Device
KR101670063B1 (ko) * 2008-09-18 2016-10-28 한국전자통신연구원 Mdct 기반의 코더와 이종의 코더 간 변환에서의 인코딩 장치 및 디코딩 장치
US8140342B2 (en) * 2008-12-29 2012-03-20 Motorola Mobility, Inc. Selective scaling mask computation based on peak detection
US8175888B2 (en) 2008-12-29 2012-05-08 Motorola Mobility, Inc. Enhanced layered gain factor balancing within a multiple-channel audio coding system
US8219408B2 (en) * 2008-12-29 2012-07-10 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
US8200496B2 (en) * 2008-12-29 2012-06-12 Motorola Mobility, Inc. Audio signal decoder and method for producing a scaled reconstructed audio signal
KR101622950B1 (ko) * 2009-01-28 2016-05-23 삼성전자주식회사 오디오 신호의 부호화 및 복호화 방법 및 그 장치
FR2947944A1 (fr) * 2009-07-07 2011-01-14 France Telecom Codage/decodage perfectionne de signaux audionumeriques
US8423355B2 (en) * 2010-03-05 2013-04-16 Motorola Mobility Llc Encoder for audio signal including generic audio and speech frames
US8428936B2 (en) * 2010-03-05 2013-04-23 Motorola Mobility Llc Decoder for audio signal including generic audio and speech frames
US8886523B2 (en) * 2010-04-14 2014-11-11 Huawei Technologies Co., Ltd. Audio decoding based on audio class with control code for post-processing modes
US9047875B2 (en) * 2010-07-19 2015-06-02 Futurewei Technologies, Inc. Spectrum flatness control for bandwidth extension
JP5489900B2 (ja) 2010-07-27 2014-05-14 ヤマハ株式会社 音響データ通信装置
NO2669468T3 (ru) * 2011-05-11 2018-06-02
RU2480904C1 (ru) * 2012-06-01 2013-04-27 Анна Валерьевна Хуторцева Способ совместной фильтрации и дифференциальной импульсно-кодовой модуляции-демодуляции сигналов
CN103516440B (zh) 2012-06-29 2015-07-08 华为技术有限公司 语音频信号处理方法和编码装置
US9129600B2 (en) 2012-09-26 2015-09-08 Google Technology Holdings LLC Method and apparatus for encoding an audio signal
AU2013366552B2 (en) * 2012-12-21 2017-03-02 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Comfort noise addition for modeling background noise at low bit-rates
RU2639952C2 (ru) * 2013-08-28 2017-12-25 Долби Лабораторис Лайсэнзин Корпорейшн Гибридное усиление речи с кодированием формы сигнала и параметрическим кодированием
EP3217612A4 (en) * 2014-04-21 2017-11-22 Samsung Electronics Co., Ltd. Device and method for transmitting and receiving voice data in wireless communication system
KR102244612B1 (ko) 2014-04-21 2021-04-26 삼성전자주식회사 무선 통신 시스템에서 음성 데이터를 송신 및 수신하기 위한 장치 및 방법
US10049684B2 (en) * 2015-04-05 2018-08-14 Qualcomm Incorporated Audio bandwidth selection
JP7285830B2 (ja) * 2017-09-20 2023-06-02 ヴォイスエイジ・コーポレーション Celpコーデックにおいてサブフレーム間にビット配分を割り振るための方法およびデバイス
KR102383195B1 (ko) * 2017-10-27 2022-04-08 프라운호퍼-게젤샤프트 추르 푀르데룽 데어 안제반텐 포르슝 에 파우 디코더에서의 노이즈 감쇠
JPWO2022009505A1 (ru) * 2020-07-07 2022-01-13

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0728494A (ja) * 1993-07-09 1995-01-31 Nippon Steel Corp 圧縮符号化音声信号復号化方法および装置
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US7145898B1 (en) * 1996-11-18 2006-12-05 Mci Communications Corporation System, method and article of manufacture for selecting a gateway of a hybrid communication system architecture
US6904110B2 (en) * 1997-07-31 2005-06-07 Francois Trans Channel equalization system and method
FI980132A (fi) * 1998-01-21 1999-07-22 Nokia Mobile Phones Ltd Adaptoituva jälkisuodatin
JP2000259195A (ja) * 1999-01-08 2000-09-22 Matsushita Electric Ind Co Ltd デコード回路及びそれを用いた再生装置
JP2000267686A (ja) * 1999-03-19 2000-09-29 Victor Co Of Japan Ltd 信号伝送方式及び復号化装置
US6496794B1 (en) * 1999-11-22 2002-12-17 Motorola, Inc. Method and apparatus for seamless multi-rate speech coding
GB2357682B (en) 1999-12-23 2004-09-08 Motorola Ltd Audio circuit and method for wideband to narrowband transition in a communication device
FI115329B (fi) * 2000-05-08 2005-04-15 Nokia Corp Menetelmä ja järjestely lähdesignaalin kaistanleveyden vaihtamiseksi tietoliikenneyhteydessä, jossa on valmiudet useisiin kaistanleveyksiin
JP2003050598A (ja) * 2001-08-06 2003-02-21 Mitsubishi Electric Corp 音声復号装置
CA2388439A1 (en) * 2002-05-31 2003-11-30 Voiceage Corporation A method and device for efficient frame erasure concealment in linear predictive based speech codecs
US6590833B1 (en) * 2002-08-08 2003-07-08 The United States Of America As Represented By The Secretary Of The Navy Adaptive cross correlator
US7502743B2 (en) * 2002-09-04 2009-03-10 Microsoft Corporation Multi-channel audio encoding and decoding with multi-channel transform selection
CA2992125C (en) * 2004-03-01 2018-09-25 Dolby Laboratories Licensing Corporation Reconstructing audio signals with multiple decorrelation techniques and differentially coded parameters
US7668712B2 (en) * 2004-03-31 2010-02-23 Microsoft Corporation Audio encoding and decoding with intra frames and adaptive forward error correction
US20110022924A1 (en) * 2007-06-14 2011-01-27 Vladimir Malenovsky Device and Method for Frame Erasure Concealment in a PCM Codec Interoperable with the ITU-T Recommendation G. 711
US8554551B2 (en) * 2008-01-28 2013-10-08 Qualcomm Incorporated Systems, methods, and apparatus for context replacement by audio level
JP5241921B2 (ja) * 2008-07-29 2013-07-17 ドルビー ラボラトリーズ ライセンシング コーポレイション 電子音響チャンネルの適応制御とイコライゼーションの方法
US8924222B2 (en) * 2010-07-30 2014-12-30 Qualcomm Incorporated Systems, methods, apparatus, and computer-readable media for coding of harmonic signals

Also Published As

Publication number Publication date
WO2007010158A3 (fr) 2007-05-10
JP5009910B2 (ja) 2012-08-29
US8630864B2 (en) 2014-01-14
RU2008106750A (ru) 2009-08-27
KR101295729B1 (ko) 2013-08-12
RU2419171C2 (ru) 2011-05-20
EP1907812A2 (fr) 2008-04-09
US20090306992A1 (en) 2009-12-10
DE602006018618D1 (de) 2011-01-13
WO2007010158A2 (fr) 2007-01-25
JP2009503559A (ja) 2009-01-29
CN101263554B (zh) 2011-12-28
ATE490454T1 (de) 2010-12-15
ES2356492T3 (es) 2011-04-08
KR20080033997A (ko) 2008-04-17
CN101263554A (zh) 2008-09-10

Similar Documents

Publication Publication Date Title
EP1907812B1 (fr) Procede de commutation de debit en decodage audio scalable en debit et largeur de bande
EP1905010B1 (fr) Codage/décodage audio hiérarchique
EP1989706B1 (fr) Dispositif de ponderation perceptuelle en codage/decodage audio
EP2277172B1 (fr) Dissimulation d'erreur de transmission dans un signal audionumerique dans une structure de decodage hierarchique
EP2656343B1 (fr) Codage de son à bas retard alternant codage prédictif et codage par transformée
EP2366177B1 (fr) Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique
CA2766864C (fr) Codage/decodage perfectionne de signaux audionumeriques
EP3175443B1 (fr) Détermination d'un budget de codage d'une trame de transition lpd/fd
EP3069340A2 (fr) Transition d'un codage/décodage par transformée vers un codage/décodage prédictif
EP2005424A2 (fr) Procede de post-traitement d'un signal dans un decodeur audio
US7974839B2 (en) Method, medium, and apparatus encoding scalable wideband audio signal
Sinder et al. Recent speech coding technologies and standards
Ogunfunmi et al. Scalable and Multi-Rate Speech Coding for Voice-over-Internet Protocol (VoIP) Networks

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20080205

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

17Q First examination report despatched

Effective date: 20091228

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

DAX Request for extension of the european patent (deleted)
GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

REF Corresponds to:

Ref document number: 602006018618

Country of ref document: DE

Date of ref document: 20110113

Kind code of ref document: P

REG Reference to a national code

Ref country code: NL

Ref legal event code: VDEP

Effective date: 20101201

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2356492

Country of ref document: ES

Kind code of ref document: T3

Effective date: 20110408

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

LTIE Lt: invalidation of european patent or patent extension

Effective date: 20101201

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110301

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110302

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110401

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20110401

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

26N No opposition filed

Effective date: 20110902

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 602006018618

Country of ref document: DE

Effective date: 20110902

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20110731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20110710

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: HU

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20101201

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20160623

Year of fee payment: 11

Ref country code: GB

Payment date: 20160627

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20160622

Year of fee payment: 11

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20160627

Year of fee payment: 11

Ref country code: DE

Payment date: 20160622

Year of fee payment: 11

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 602006018618

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170710

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20180330

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170710

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20180201

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170731

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170710

REG Reference to a national code

Ref country code: ES

Ref legal event code: FD2A

Effective date: 20181107

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170711