WO2007007001A2 - Dispositif de codage/decodage hierarchique - Google Patents
Dispositif de codage/decodage hierarchique Download PDFInfo
- Publication number
- WO2007007001A2 WO2007007001A2 PCT/FR2006/050690 FR2006050690W WO2007007001A2 WO 2007007001 A2 WO2007007001 A2 WO 2007007001A2 FR 2006050690 W FR2006050690 W FR 2006050690W WO 2007007001 A2 WO2007007001 A2 WO 2007007001A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- coding
- band
- signal
- transform
- frequency band
- Prior art date
Links
- 239000010410 layer Substances 0.000 claims abstract description 52
- 230000003595 spectral effect Effects 0.000 claims abstract description 42
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 36
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 36
- 238000004458 analytical method Methods 0.000 claims abstract description 16
- 230000005236 sound signal Effects 0.000 claims abstract description 6
- 239000012792 core layer Substances 0.000 claims abstract description 5
- 238000000034 method Methods 0.000 claims description 29
- 230000006872 improvement Effects 0.000 claims description 13
- 238000001228 spectrum Methods 0.000 claims description 9
- 239000004606 Fillers/Extenders Substances 0.000 claims description 4
- 238000004364 calculation method Methods 0.000 claims description 3
- 238000004590 computer program Methods 0.000 claims description 2
- 230000000750 progressive effect Effects 0.000 claims description 2
- 230000009466 transformation Effects 0.000 claims description 2
- 230000006978 adaptation Effects 0.000 claims 1
- 230000005540 biological transmission Effects 0.000 abstract description 8
- 230000002708 enhancing effect Effects 0.000 abstract 1
- 230000005284 excitation Effects 0.000 description 36
- 238000001914 filtration Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 8
- 238000013139 quantization Methods 0.000 description 8
- 230000003044 adaptive effect Effects 0.000 description 6
- 238000012805 post-processing Methods 0.000 description 4
- 238000013459 approach Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 3
- 101150012579 ADSL gene Proteins 0.000 description 2
- 102100020775 Adenylosuccinate lyase Human genes 0.000 description 2
- 108700040193 Adenylosuccinate lyases Proteins 0.000 description 2
- 230000003111 delayed effect Effects 0.000 description 2
- 230000009977 dual effect Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Chemical compound CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 230000007175 bidirectional communication Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000006854 communication Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000010363 phase shift Effects 0.000 description 1
- 238000011002 quantification Methods 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/06—Determination or coding of the spectral characteristics, e.g. of the short-term prediction coefficients
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/12—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
Definitions
- the present invention relates to a hierarchical audio coding system. It also relates to a hierarchical audio coder and decoder.
- the invention finds a particularly advantageous application in the field of the transmission of speech and / or audio signals over voice-over-IP packet networks. More specifically, the invention makes it possible, in this context, to provide a modular quality ranging from a telephone band to an enlarged band, as a function of the capacity of the transmission bit rate and while guaranteeing interoperability with an existing core. in telephone band.
- the first category includes quantization techniques with or without memory such as MIC or ADPCM (PCM or ADPCM) coding.
- the second category includes techniques that represent the signal using a model, usually linear predictive, but whose parameters are determined using methods derived from waveform coding. For this reason, this category is often referred to as hybrid coding.
- CELP coding (“Code Excited Linear Prediction") belongs to this second category.
- the input signal is encoded using a model "source-filter” inspired speech production process.
- the transmitted parameters represent separately the source (also called “excitation") and the filter.
- the filter is usually an all-pole filter.
- Notions Basic information on the coding of audio-frequency signals, and more particularly CELP coding and quantification, is given in particular in the following works: WB. Kleijn and KK Paliwal Editors, Speech Coding and Synthesis, Elsevier, 1995, and Nicolas Moreau, Signal Compression Techniques, Technical and Scientific Collection of Telecommunications, Masson, 1995.
- the third category includes coding techniques such as MPEG 1 and 2 Layer IH, more known as MP3, or MPEG 4 AAC.
- the G.729 system recommended to I 1 UlT-T is an example of coding
- CELP designed for voiceband speech signals (300-3400 Hz) sampled at 8 kHz. It operates at a fixed rate of 8 kbit / s with frames of 10 ms. Its detailed operation is specified in ITU-T Recommendation G.729, Coding of Speech at 8 kbps using Conjugate Structure Algebraic Code Excited Linear Prediction (CS-ACELP), March 1996.
- Figure 1 (c) shows how the G.729 decoder reconstructs the speech signal from the data provided by the decelerator (112). The excitation is reconstructed by subframes of 5 ms by adding two contributions:
- LPC Linear Predictive Coding
- a (z) (120) of order 10 the coefficients of which are decoded (119) in the domain of LSF spectral line pairs ("Line Spectrum Frequency") and interpolated by sub -5 ms frame.
- the reconstructed signal is then processed by an adaptive post-filter (121) and a post-processing high-pass filter (122).
- the decoder of FIG. 1 (c) thus relies on the "source-filter” model to synthesize the signal. associated with this model are listed in the table of figure 2 distinguishing those describing the excitation and those which describe the filter.
- Figure 1 (a) shows a very high level diagram of the G.729 encoder. It thus highlights the preprocessing high pass filtering (101), the LPC analysis and quantization (102), the excitation coding (103) and the coded parameter multiplexing (104).
- the LPC preprocessing and analysis and quantization blocks of the G.729 encoder are not discussed here; reference can be made to the above ITU-T Recommendation for further details.
- the operation of the coding of the excitation is shown schematically in Figure 1 (b). This shows how the excitation parameters listed in FIG. 2 are determined and quantified.
- the excitation is coded in 3 steps:
- the excitation parameters are determined by minimizing the quadratic error (111) between the CELP target (105) and the filtered excitation by W (z) / ⁇ (z) (110). This process of synthesis analysis is detailed in the ITU-T Recommendation mentioned above.
- G.729 In practice the complexity of the coder / decoder (coded) G.729 is relatively high (around 18 WMOPS ("Weighted Million Operations Per Second")). To meet the needs of applications such as simultaneous voice and data transmission over DSVD (Digital Simultaneous Voice and Data) modems, an interoperable system of lesser complexity (about 9 WMOPS) has also been recommended to I 1 ITU. T: the G.729A codec. The latter is described and compared to G, 729 in R. Salami et al., Description of ITU-T Recommendation G.729 Annex A: 8 kbit / s Reduced Complexity CS-ACELP coded, ICASSP 1997.
- G.729 and G.729A the one which allows more to reduce the complexity of G, 729 concerns the search in the ACELP dictionary; in the G.729A coder a deep search first of the 4 signed pulses replaces the nested loop search used in the G.729 encoder. Because of its low complexity, the G.729A codec is now widely used in voice over IP and ATM (300-3400 Hz) applications. With the development of fiber optics and broadband networks such as ADSL, it is now possible to deploy new services such as bi-directional communications of much better quality than conventional systems in telephone band. A step in this direction is to provide an "extended band" quality, that is to say considering audio-frequency signals sampled at 16 kHz and restricted to a useful band of 50-7000 Hz. The quality obtained is then similar to that of the AM radio.
- IP networks and connection points (telephone modems, ADSL, LAN, WiFi, etc.) is highly heterogeneous in terms of throughput, quality of service characterized by jitter, packet loss rate, etc.
- Terminals reproducing sounds sometimes differ in terms of sample rate and number of audio channels. It is sometimes difficult to know in advance at the encoder level the actual capacity of the terminals.
- the hierarchical coding is to generate a bit stream from which we can decode all or part.
- the hierarchical coding comprises a core layer and one or more improvement layers.
- the core layer is generated by a fixed low rate codec, called a "core”, which guarantees the minimum quality of the coding.
- This layer must be received by the decoder to maintain an acceptable level of quality. Improvement layers are used to improve quality. However, it may happen that they are not all received by the decoder because of transmission faults, for example in the case of congestion of an IP network. This technique therefore offers great flexibility in the choice of flow and the quality of reconstruction.
- the encoder always works assuming the flow rate is maximum. However, at any point in the communication chain, the bit rate can be adapted by simply truncating the bitstream.
- hierarchical coding makes it possible to deploy broadband quality progressively, relying on a standard CELP coding in a telephone band (such as ITU-T G.729 or G.729A standards).
- a standard CELP coding in a telephone band (such as ITU-T G.729 or G.729A standards).
- coding uses an encoder 8 kbit / s G.729 core, an intermediate bandband enhancement layer to 14.2 kbit / s, followed by an enhanced bandwidth enhancement layer by transform coding to 24 kbit / s s.
- Valin is shown schematically in FIG. 3.
- a telephone band signal (300-3400 Hz) is extended to the extended band 0-8000 Hz by adding (31) three contributions:
- the telephone band signal for example coded by the system
- the extension envelope can be realized, for example by codebook mapping techniques, without transmission of auxiliary information or with explicit information requiring transmission by quantization at a low additional bit rate.
- the narrowband LPC residual signal (or excitation) is calculated by the block (36).
- excitation resultant sampled at 8 kHz is extended to the sampling frequency of 16 kHz by block (37).
- This operation can be performed in the field of excitation by employing non-linearity, oversampling and filtering, in order to extend the harmonic structure and whiten the full-band excitation.
- the extended excitation is then shaped by the full-band synthesis filter 1 / B WB (z) (38) and the result is limited by the high-pass filtering (39) at the band 3400-8000 Hz.
- the non-linear phase of the pre- and post-treatment is rarely taken into account.
- the improvement layers based on the coding of a signal difference between original (pre-processed or not) and synthesis of the lower layer have very poor performance if the non-linear phase (or group delay) Pre- and post-treatment filters are not compensated for or eliminated.
- the object of the invention is to remedy the various problems stated above by proposing a system for encoding a hierarchical audio signal, comprising, at least, a parametric coded core layer by synthesis analysis in a first frequency band.
- a band extender layer for expanding said first frequency band into a second frequency band, said extended band, notable in that said system also includes a layer for improving the quality of audio coding in the extended band , based on transform coding using a spectral parameter from said band extension layer.
- extended band means a frequency band resulting from the extension of a first band, the telephone band between
- said system also comprises an audio coding quality improvement layer in said first frequency band.
- said spectral parameter is a spectral envelope derived from the band extension layer.
- said spectral envelope is specified by an extended band linear prediction filter, or said spectral envelope is given by the energy per subband of the signal.
- said spectral parameter is at least a part of the signal transform synthesized by the band extension layer.
- said system comprises a module for progressively adjusting the energy in the subbands of the signal transform synthesized by the band extension layer.
- said parametric coding by synthesis analysis is a CELP coding.
- said CELP coding is a G.729 coding or a G.729A coding.
- the coding system proposed by the invention is a hierarchical coding system capable of operating for example at rates of 8 and 12 kbit / s and at all rates between 14 and 32 kbit / s.
- the coding / decoding system according to (invention) makes it possible to obtain:
- the invention also relates to a method for implementing the coding system according to the first embodiment, comprising the following steps;
- said method also comprises a step of producing an audio coding quality enhancement layer using transform coding, said transform coding of said residual signal using said spectral envelope.
- the invention further relates to a method for implementing the coding system according to the second embodiment, comprising the following steps;
- said method also comprises a step of producing an enhancement layer using a transform coding of said residual signal, said transform coding using the signal transform synthesized by the band extender layer.
- said method comprises a step of gradually adjusting the energy in the sub-bands of the signal transform synthesized by the band extension layer.
- the invention also relates to a computer program comprising program instructions for implementing the steps of the method according to the invention when said program is executed by a computer. Furthermore, the invention relates to a first hierarchical audio coder, comprising:
- a parametric encoding heart coder by synthesis analysis for coding an original signal in a first frequency band, a coding stage in an extension of the first frequency band, comprising a spectral envelope,
- said coder also comprises a stage for improving the quality of the audio coding extended band by coding by transformed including an inverse transform, using said spectral envelope.
- the invention relates to a second hierarchical audio coder, comprising: a parametric encoding heart coder by synthesis analysis, intended to encode an original signal in a first frequency band,
- said coder also comprises a stage for improving the quality of the audio coding extended band by coding by transformed, using the signal transform synthesized by the band extension layer.
- the invention also relates to a first hierarchical audio decoder, comprising:
- a decoding stage in an extension of the first frequency band comprising a spectral envelope
- said decoder also comprises a stage for improving the quality of the audio decoding extended band by transform decoding including an inverse transform, using said spectral envelope.
- the invention relates to a second hierarchical audio decoder, comprising: a parametric encoding core decoder by synthesis analysis for decoding in a first frequency band a received signal coded by the second coder,
- decoder in an extension of the first frequency band, characterized in that said decoder also comprises a stage for improving the quality of the audio decoding extended band by transform decoding including an inverse transform, using the transform of the decoder. signal synthesized by the band extension layer.
- Figure 4 (a) is a diagram of the first three stages of an encoder according to the present invention.
- Fig. 4 (b) is a diagram of the fourth coder coding stage of Fig. 4 (a).
- Fig. 5 is a table of the coefficients of the low-pass filter used in the present invention.
- Fig. 6 is a table of coefficients of the high pass filter used to generate an enlarged band enhancement signal according to the invention.
- Fig. 7 is a table specifying the sub-banding of the MDCT spectra according to the invention.
- FIG. 8 is a table giving the number of bits allocated for each frame to each of the parameters of an encoder and a decoder according to the present invention.
- Figure 9 shows the structure of the bit stream associated with the present invention.
- Figure 10 (a) is a general diagram of the four-layer decoder of the present invention.
- FIG. 10 (b) is a detail diagram of the transform predictive decoding stage of the decoder of Fig. 10 (a). All of FIGS. 4 (a) to 10 (b) describe a hierarchical coding / decoding system consisting of an encoder and a decoder which will now be described successively.
- extended band refers to the particular case of a 300-3400 Hz telephone band extended to the 50-7000 Hz range.
- Figure 4 (a) gives a block diagram of the encoder.
- An original audio signal of useful band between 50 and 7000 Hz and sampled at 16 kHz is cut into a frame of 320 samples, or 20 ms.
- High-pass filtering 601 of 50Hz cut-off frequency is applied to the input signal.
- the resulting signal, called S WB is reused in several branches of the encoder and corresponds to the actually encoded signal.
- low-pass filtering (whose coefficients are provided in the table of FIG. 5) and subsampling by two 602 are applied to S WB .
- This signal is processed by the heart coder 603, type CELP G.729A + coding, for example.
- the G.729A + coder corresponds here to the G.729 coder without high pass filter pretreatment, and for which the search in the ACELP dictionary has been replaced by that of the G.729A as described previously.
- Variants of this embodiment may use G.729A, G.729 or other CELP encoders without preprocessing.
- This coding gives the heart of the bit stream with a bit rate of 8 kbit / s in the case of the G.729A + encoder.
- a first enhancement layer introduces a second CELP coding stage 603.
- This second stage consists of an innovative code consists of four additional pulses ⁇ 1 for a subframe of 5 ms (equivalent to dictionary DCui the G.729A), these pulses are scaled by a gain g enh -
- This dictionary enriches the CELP excitation and offers a quality improvement, especially on unvoiced sounds.
- the rate of this second coding stage is 4 kbit / s and the associated parameters are the positions and the signs of the pulses and the associated gain for each subframe of 40 samples (5 ms at 8 kHz).
- this coding stage uses other modes of improvement, for example those described in the De lacovo article cited above.
- the decoding of the core encoder and the first enhancement layer are performed to obtain the 12 kbit / s telephone band synthesis signal. It is important to note that the adaptive post-filtering and post-processing (high-pass filtering) of the core encoder are disabled in order to take into account the non-linear phase shift of these operations; the difference between the original pre-processed signal and the 8 and 12 kbit / s synthesis is minimized.
- Over-sampling and low-pass filtering 604 make it possible to obtain the sampled version at 16 kHz of the first two stages of the encoder.
- the second enhancement layer also known as a band extension layer, makes it possible to switch to an enlarged band.
- the input signal S WB can be filtered by a pre-emphasis filter 605 with This filter makes it possible to better represent the high frequencies from the broadband linear prediction filter.
- a dual deemphasis filter 606 is then used in the synthesis.
- no pre-emphasis and de-emphasis filters are integrated into the coding and decoding structure.
- the next step is to calculate and quantify the wideband linear prediction filter 607.
- the order of the linear prediction filter is 18, but in a variant of this embodiment, another prediction order, for example lower (16), is chosen.
- the linear prediction filter can be calculated by the autocorrelation method and the Levinson-Durbin algorithm,
- This broadband linear prediction filter WB (z) is quantized using a prediction of these coefficients possibly from the filter NB (z) from the heart coder 603 in a telephone band.
- the coefficients can then be quantized using, for example, multi-stage vector quantization and using the dequantized LSF parameters of the telephone band heart coder, as described in the article by H. Ehara, T. Morii, M. Oshikiri and K. Yoshida, Predictive VQ for scalable bandwidth LSP quantization, ICASSP 2005.
- the wideband excitation 608 is obtained from the parameters of the telephone band excitation of the core coder: the pitch delay, the associated gain as well as the algebraic excitations of the core coder and the first enrichment layer. CELP excitation and associated gains. This excitation is generated by using an over-sampled version of the parameters of the excitation of the telephone band stages. In a variant of this embodiment, the excitation is calculated from the "pitch" delay and the associated gain, these parameters being used to generate a harmonic excitation from a white noise. In this variant, the excitation of the algebraic dictionary is replaced by a white noise.
- This excitation in broadband is then filtered by the synthesis filter 609 calculated previously.
- the de-emphasis filter 606 is applied to the output signal of the synthesis filter.
- the signal obtained is an expanded band signal which is not adjusted in energy.
- high pass filtering 611 (whose coefficients are given in the table of FIG. 6) is applied to the signal of broadband synthesis.
- the same high-pass filter 612 is applied to the error signal corresponding to the difference between the delayed original signal 610 and the synthesis signal of the two preceding stages.
- the gain to be applied to the synthesis signal of the high band is calculated by a ratio of energy between the two signals.
- the gain gw ⁇ 611 is then applied to the signal S 14 UB by subframe of 80 samples (5 ms at 16 kHz), the signal thus obtained is added to the synthesis signal of the preceding stage to create the broadband signal corresponding to the 14 kbit / s rate.
- the further coding is performed in the frequency domain using a transform predictive coding scheme using the linear prediction filter from the band extension layer.
- This coding stage is the enhancement quality improvement layer in the extended band.
- FIG. 4 (b) describes this part of the encoder. Delayed input signals
- a modified discrete cosine transform (or MDCT) is applied: on the one hand, on blocks of 640 samples of the weighted input signal 618 with an overlap of 50% (refresh of the MDCT analysis every 20 ms ), on the other hand, on the weighted synthesis signal 619 from the previous 14 kbit / s bandwidth stage (same block length and same recovery rate).
- the MDCT spectrum to be encoded 620 corresponds to the difference between the weighted input signal and the 14 kbit / s synthesis signal for the 0 to 3400 Hz band, and the 3400 Hz to 7000 Hz weighted input signal.
- the spectrum is limited to 7000 Hz by setting the last 40 coefficients to zero (only the first 280 coefficients are coded).
- the spectrum is divided into 18 bands: a band of 8 coefficients and 17 bands of 16 coefficients as described in the table of Figure 7.
- a variant of this embodiment uses 20 bands of equal widths (14 coefficients).
- the energy of the MDCT coefficients is calculated (scale factors).
- the 18 scale factors constitute the spectral envelope of the weighted signal which is then quantized, coded and transmitted in the frame.
- Dynamic bit allocation is based on spectrum band energy from the dequantized version of the spectral envelope. This makes it possible to have compatibility between the bit allocation of the encoder and the decoder.
- the bit allocation in the Time Domain Aliasing Cancellation (TDAC) module 620 is done in two phases. First, a first calculation of the number of bits to be allocated to each band is performed; each of the values obtained is rounded to the rate of the nearest available dictionary. If the total flow allocated is not exactly equal to that available, a second phase is used to perform the readjustment. This step is done by an iterative procedure based on an energetic criterion that adds or removes bits to the bands as described in the article by Y. Mahieux and JP.
- the bits are added to the bands where the perceptual improvement is the most important. (higher energy).
- the extraction of bits on the bands is dual.
- the normalized MDGT coefficients (fine structure) in each band are then quantized by vector quantizers using dictionnaries nested in size and resolution, the dictionaries being composed of a union of permutation codes as described in the international application WO / 0400219 .
- the information on the core coder, the CELP enrichment stage in the telephone band, the broadband CELP stage and finally the spectral envelope and the coded standard coefficients are multiplexed and transmitted in a frame.
- the number of bits allocated to each of the parameters of the encoder and decoder is specified in the table of FIG. 8.
- the structure of the frame of the bitstream is described in FIG.
- the module 701 demultiplexes the parameters contained in the bit stream. There are several decoding cases depending on the number of bits received for a frame, the first three cases are described from Figure 10 (a) and the last case from Figure 10 (b):
- the first concerns the reception of the minimum number of bits by the decoder. In this case "Seui Ie first floor is decoded. So, only the train Binary relating to the CELP core decoder 702 (G.729A +) is received and decoded. This synthesis can be processed by the adaptive post-filter and the postprocessing of the G.729 decoder. This signal is oversampled and filtered to produce a signal sampled at 16 kHz (703).
- 2- concerns the reception of the number of bits relative to the first and second decoding stages. In this case, the core decoder as well as the first enhancement stage of the CELP excitation are decoded. This synthesis can be processed by the adaptive post-filter and the postprocessing of the G.729 decoder. This signal is then oversampled and filtered to produce a signal sampled at 16 kHz (703).
- the third case corresponds to the reception of the number of bits relative to the first three decoding stages.
- the first two decoding stages are first performed as in case 2, then the band extension module generates a signal sampled at 16 kHz after decoding the parameters of the spectral line pairs (WB-LSF). ) in broadband (704) as well as gains associated with excitation.
- the broadband excitation is generated from the parameters of the core encoder and the first enhancement stage of the CELP 705 excitation. This excitation is then filtered by the synthesis filter 706 and optionally by the deceleration filter 707. in the case where a pre-emphasis filter has been used at the encoder.
- a high-pass filter 708 is applied to the obtained signal and the energy of the band-extension signal is adjusted with the associated gains (709) every 5 ms.
- This signal is then added to the sampled 16 kHz telephone band signal obtained from the first two decoding stages.
- this signal is filtered in the transformed domain by setting to 0 the last 40 MDCT coefficients before passing through the inverse MDCT transform 713 and the weighted synthesis filter 714.
- This last case corresponds to the decoding of the last stage of the decoder (FIG. 10 (b)).
- This stage corresponds to the quality improvement layer of the decoding in the extended band.
- This last stage consists of a transform predictive decoder using the finite prediction filter derived from the band extension layer. Step 3 described above is first realized. Then, depending on the number of additional bits received, the decoding scheme is adapted:
- the partial or complete spectral envelope is used. for adjusting the energy of the MDCT coefficient bands (722) between 3400 Hz and 7000 Hz (720) corresponding to a portion of the signal transform generated by the band extension stage 711. This system makes it possible to obtain a progressive improvement of the audio quality according to the number of bits received.
- the number of bits corresponds to the totality of the spectral envelope and a part or the whole of the fine structure.
- the bit allocation is performed in the same way as at the encoder 716.
- the decoded MDCT coefficients are calculated from the spectral envelope 715 and the dequantized fine structure 717.
- the procedure of the preceding paragraph is used, that is to say that the MDCT coefficients calculated on the signal obtained by the band extension - which constitute a spectral parameter derived from the band-extension layer, are adjusted in energy from the received spectral envelope (722).
- the MDCT spectrum used for the synthesis therefore consists of: on the one hand, the synthesis signal of the two first decoding stages added to the decoded error signal in the bands between 0 and 3400 Hz (718 and 719); on the other hand, for the bands between 3400 Hz and 7000 Hz decoded MDCT coefficients in the bands where the fine structure has been received and MDCT coefficients of the energy-adjusted band extension stage for the other spectral bands (721 and 722),
- An inverse MDCT transformation is then applied to the decoded MDCT coefficients (713) and filtering by the weighted synthesis filter (714) provides the signal. Release.
- the transform predictive coding / decoding stage will operate entirely on the difference signal between the original signal and the synthesis signal of the band extension stage between 0 and 7000 Hz.
- the band extension will be performed on coding and decoding in the transformed domain from a spectral envelope given by the energy per subband of the signal, and a coding of the fine structure.
- This spectral envelope can be quantified by vector quantization.
- the broadband enhancement stage uses TDAC-type transform coding as previously described (without weighting filtering).
- the spectral envelope that is given by the energy per subband of the signal and which constitutes a spectral parameter is transmitted in the band extension stage and will be reused by the broadband enhancement layer.
- the first coded frequency band could correspond to the enlarged 50-7000 Hz band and the second coded frequency band could be an FM (50-15000 z) or hifi band (20-24000 Hz).
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020087003000A KR101303145B1 (ko) | 2005-07-13 | 2006-07-07 | 계층적 오디오 신호를 코딩하기 위한 시스템, 오디오 신호를 코딩하는 방법, 컴퓨터-판독가능한 매체 및 계층적 오디오 디코더 |
AT06779029T ATE511179T1 (de) | 2005-07-13 | 2006-07-07 | Hierarchischen audio-kodierung/-dekodierung |
JP2008520925A JP5112309B2 (ja) | 2005-07-13 | 2006-07-07 | 階層符号化/復号化装置 |
BRPI0612987-0A BRPI0612987A2 (pt) | 2005-07-13 | 2006-07-07 | dispositivo de codificaÇço/decodificaÇço hierÁrquica |
US11/988,758 US8374853B2 (en) | 2005-07-13 | 2006-07-07 | Hierarchical encoding/decoding device |
CN2006800336707A CN101263553B (zh) | 2005-07-13 | 2006-07-07 | 分级编码/解码设备 |
EP06779029A EP1905010B1 (fr) | 2005-07-13 | 2006-07-07 | Codage/décodage audio hiérarchique |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0552199A FR2888699A1 (fr) | 2005-07-13 | 2005-07-13 | Dispositif de codage/decodage hierachique |
FR0552199 | 2005-07-13 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007007001A2 true WO2007007001A2 (fr) | 2007-01-18 |
WO2007007001A3 WO2007007001A3 (fr) | 2007-04-12 |
Family
ID=36608212
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2006/050690 WO2007007001A2 (fr) | 2005-07-13 | 2006-07-07 | Dispositif de codage/decodage hierarchique |
Country Status (9)
Country | Link |
---|---|
US (1) | US8374853B2 (fr) |
EP (1) | EP1905010B1 (fr) |
JP (1) | JP5112309B2 (fr) |
KR (1) | KR101303145B1 (fr) |
CN (1) | CN101263553B (fr) |
AT (1) | ATE511179T1 (fr) |
BR (1) | BRPI0612987A2 (fr) |
FR (1) | FR2888699A1 (fr) |
WO (1) | WO2007007001A2 (fr) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010003624A2 (fr) | 2008-07-09 | 2010-01-14 | Sanofi-Aventis | Composés hétérocycliques, leurs procédés de préparation, médicaments comprenant lesdits composés et leur utilisation |
WO2012120057A1 (fr) | 2011-03-08 | 2012-09-13 | Sanofi | Nouveaux dérivés phényl-oxathiazine substitués, procédé pour leur préparation, agent pharmaceutique contenant ces composés et leur utilisation |
CN115116457A (zh) * | 2022-06-15 | 2022-09-27 | 腾讯科技(深圳)有限公司 | 音频编码及解码方法、装置、设备、介质及程序产品 |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7461106B2 (en) * | 2006-09-12 | 2008-12-02 | Motorola, Inc. | Apparatus and method for low complexity combinatorial coding of signals |
EP2096632A4 (fr) * | 2006-11-29 | 2012-06-27 | Panasonic Corp | Appareil de décodage, et procédé de décodage audio |
US8576096B2 (en) * | 2007-10-11 | 2013-11-05 | Motorola Mobility Llc | Apparatus and method for low complexity combinatorial coding of signals |
US8209190B2 (en) * | 2007-10-25 | 2012-06-26 | Motorola Mobility, Inc. | Method and apparatus for generating an enhancement layer within an audio coding system |
US20090234642A1 (en) * | 2008-03-13 | 2009-09-17 | Motorola, Inc. | Method and Apparatus for Low Complexity Combinatorial Coding of Signals |
KR100916400B1 (ko) | 2008-04-07 | 2009-09-07 | 현대자동차주식회사 | 후드용 안전후크 구조 |
US8639519B2 (en) * | 2008-04-09 | 2014-01-28 | Motorola Mobility Llc | Method and apparatus for selective signal coding based on core encoder performance |
FR2938688A1 (fr) * | 2008-11-18 | 2010-05-21 | France Telecom | Codage avec mise en forme du bruit dans un codeur hierarchique |
US8219408B2 (en) * | 2008-12-29 | 2012-07-10 | Motorola Mobility, Inc. | Audio signal decoder and method for producing a scaled reconstructed audio signal |
US8140342B2 (en) * | 2008-12-29 | 2012-03-20 | Motorola Mobility, Inc. | Selective scaling mask computation based on peak detection |
US8175888B2 (en) * | 2008-12-29 | 2012-05-08 | Motorola Mobility, Inc. | Enhanced layered gain factor balancing within a multiple-channel audio coding system |
US8200496B2 (en) * | 2008-12-29 | 2012-06-12 | Motorola Mobility, Inc. | Audio signal decoder and method for producing a scaled reconstructed audio signal |
KR101433701B1 (ko) | 2009-03-17 | 2014-08-28 | 돌비 인터네셔널 에이비 | 적응형으로 선택가능한 좌/우 또는 미드/사이드 스테레오 코딩과 파라메트릭 스테레오 코딩의 조합에 기초한 진보된 스테레오 코딩 |
FR2947945A1 (fr) * | 2009-07-07 | 2011-01-14 | France Telecom | Allocation de bits dans un codage/decodage d'amelioration d'un codage/decodage hierarchique de signaux audionumeriques |
FR2947944A1 (fr) * | 2009-07-07 | 2011-01-14 | France Telecom | Codage/decodage perfectionne de signaux audionumeriques |
CN101989429B (zh) * | 2009-07-31 | 2012-02-01 | 华为技术有限公司 | 转码方法、装置、设备以及系统 |
ES2805349T3 (es) | 2009-10-21 | 2021-02-11 | Dolby Int Ab | Sobremuestreo en un banco de filtros de reemisor combinado |
CN102081926B (zh) * | 2009-11-27 | 2013-06-05 | 中兴通讯股份有限公司 | 格型矢量量化音频编解码方法和系统 |
CN102081927B (zh) * | 2009-11-27 | 2012-07-18 | 中兴通讯股份有限公司 | 一种可分层音频编码、解码方法及系统 |
US8423355B2 (en) * | 2010-03-05 | 2013-04-16 | Motorola Mobility Llc | Encoder for audio signal including generic audio and speech frames |
US8428936B2 (en) * | 2010-03-05 | 2013-04-23 | Motorola Mobility Llc | Decoder for audio signal including generic audio and speech frames |
WO2011142709A2 (fr) * | 2010-05-11 | 2011-11-17 | Telefonaktiebolaget Lm Ericsson (Publ) | Procédé et dispositif de traitement de signaux audio |
KR101918904B1 (ko) * | 2010-06-04 | 2019-02-08 | 소니 주식회사 | 화상 처리 장치 및 방법 |
US8904027B2 (en) | 2010-06-30 | 2014-12-02 | Cable Television Laboratories, Inc. | Adaptive bit rate for data transmission |
JP5695074B2 (ja) * | 2010-10-18 | 2015-04-01 | パナソニック インテレクチュアル プロパティ コーポレーション オブアメリカPanasonic Intellectual Property Corporation of America | 音声符号化装置および音声復号化装置 |
KR101525185B1 (ko) | 2011-02-14 | 2015-06-02 | 프라운호퍼 게젤샤프트 쭈르 푀르데룽 데어 안겐반텐 포르슝 에. 베. | 트랜지언트 검출 및 품질 결과를 사용하여 일부분의 오디오 신호를 코딩하기 위한 장치 및 방법 |
ES2639646T3 (es) | 2011-02-14 | 2017-10-27 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codificación y decodificación de posiciones de impulso de pistas de una señal de audio |
BR112013020482B1 (pt) * | 2011-02-14 | 2021-02-23 | Fraunhofer Ges Forschung | aparelho e método para processar um sinal de áudio decodificado em um domínio espectral |
CN103477387B (zh) | 2011-02-14 | 2015-11-25 | 弗兰霍菲尔运输应用研究公司 | 使用频谱域噪声整形的基于线性预测的编码方案 |
US9536534B2 (en) | 2011-04-20 | 2017-01-03 | Panasonic Intellectual Property Corporation Of America | Speech/audio encoding apparatus, speech/audio decoding apparatus, and methods thereof |
US9601122B2 (en) * | 2012-06-14 | 2017-03-21 | Dolby International Ab | Smooth configuration switching for multichannel audio |
US9129600B2 (en) | 2012-09-26 | 2015-09-08 | Google Technology Holdings LLC | Method and apparatus for encoding an audio signal |
FR3008533A1 (fr) * | 2013-07-12 | 2015-01-16 | Orange | Facteur d'echelle optimise pour l'extension de bande de frequence dans un decodeur de signaux audiofrequences |
CN105493182B (zh) * | 2013-08-28 | 2020-01-21 | 杜比实验室特许公司 | 混合波形编码和参数编码语音增强 |
KR102271852B1 (ko) * | 2013-11-02 | 2021-07-01 | 삼성전자주식회사 | 광대역 신호 생성방법 및 장치와 이를 채용하는 기기 |
FR3017484A1 (fr) * | 2014-02-07 | 2015-08-14 | Orange | Extension amelioree de bande de frequence dans un decodeur de signaux audiofrequences |
PL3128513T3 (pl) * | 2014-03-31 | 2019-11-29 | Fraunhofer Ges Forschung | Koder, dekoder, sposób kodowania, sposób dekodowania i program |
FR3024582A1 (fr) | 2014-07-29 | 2016-02-05 | Orange | Gestion de la perte de trame dans un contexte de transition fd/lpd |
CN108549048B (zh) * | 2018-03-23 | 2021-10-22 | 武汉大学 | 一种多频WiFi外辐射源雷达相参处理方法 |
WO2020253941A1 (fr) * | 2019-06-17 | 2020-12-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Codeur audio avec un nombre dépendant du signal et une commande de précision, décodeur audio, et procédés et programmes informatiques associés |
US20220277754A1 (en) * | 2019-08-20 | 2022-09-01 | Dolby International Ab | Multi-lag format for audio coding |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004000219A2 (fr) | 2002-06-20 | 2003-12-31 | Novalar Pharmaceuticals, Inc. | Formulations stabilisees d'antagonistes de recepteurs adrenergiques alpha et leur utilisation |
Family Cites Families (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2779886B2 (ja) * | 1992-10-05 | 1998-07-23 | 日本電信電話株式会社 | 広帯域音声信号復元方法 |
US5455888A (en) * | 1992-12-04 | 1995-10-03 | Northern Telecom Limited | Speech bandwidth extension method and apparatus |
JP3483958B2 (ja) * | 1994-10-28 | 2004-01-06 | 三菱電機株式会社 | 広帯域音声復元装置及び広帯域音声復元方法及び音声伝送システム及び音声伝送方法 |
FR2729247A1 (fr) * | 1995-01-06 | 1996-07-12 | Matra Communication | Procede de codage de parole a analyse par synthese |
JP3139602B2 (ja) * | 1995-03-24 | 2001-03-05 | 日本電信電話株式会社 | 音響信号符号化方法及び復号化方法 |
ATE302991T1 (de) * | 1998-01-22 | 2005-09-15 | Deutsche Telekom Ag | Verfahren zur signalgesteuerten schaltung zwischen verschiedenen audiokodierungssystemen |
CA2252170A1 (fr) * | 1998-10-27 | 2000-04-27 | Bruno Bessette | Methode et dispositif pour le codage de haute qualite de la parole fonctionnant sur une bande large et de signaux audio |
KR20010080646A (ko) * | 1998-12-01 | 2001-08-22 | 린다 에스. 스티븐슨 | 강화된 파형 보간 코더 |
US6446037B1 (en) * | 1999-08-09 | 2002-09-03 | Dolby Laboratories Licensing Corporation | Scalable coding method for high quality audio |
KR20010101422A (ko) * | 1999-11-10 | 2001-11-14 | 요트.게.아. 롤페즈 | 매핑 매트릭스에 의한 광대역 음성 합성 |
FI115329B (fi) * | 2000-05-08 | 2005-04-15 | Nokia Corp | Menetelmä ja järjestely lähdesignaalin kaistanleveyden vaihtamiseksi tietoliikenneyhteydessä, jossa on valmiudet useisiin kaistanleveyksiin |
BR0109237A (pt) * | 2001-01-16 | 2002-12-03 | Koninkl Philips Electronics Nv | Codificador paramétrico, método de codificação paramétrica, decodificador paramétrico, método de decodificação, fluxo de dados incluindo dados de código senoidais, e, meio de armazenamento |
SE0101175D0 (sv) * | 2001-04-02 | 2001-04-02 | Coding Technologies Sweden Ab | Aliasing reduction using complex-exponential-modulated filterbanks |
DE60214027T2 (de) * | 2001-11-14 | 2007-02-15 | Matsushita Electric Industrial Co., Ltd., Kadoma | Kodiervorrichtung und dekodiervorrichtung |
EP1423847B1 (fr) * | 2001-11-29 | 2005-02-02 | Coding Technologies AB | Reconstruction des hautes frequences |
CN1266673C (zh) * | 2002-03-12 | 2006-07-26 | 诺基亚有限公司 | 可伸缩音频编码的有效改进 |
JP3881946B2 (ja) | 2002-09-12 | 2007-02-14 | 松下電器産業株式会社 | 音響符号化装置及び音響符号化方法 |
WO2003091989A1 (fr) * | 2002-04-26 | 2003-11-06 | Matsushita Electric Industrial Co., Ltd. | Codeur, decodeur et procede de codage et de decodage |
JP2003323199A (ja) * | 2002-04-26 | 2003-11-14 | Matsushita Electric Ind Co Ltd | 符号化装置、復号化装置及び符号化方法、復号化方法 |
SE0202770D0 (sv) * | 2002-09-18 | 2002-09-18 | Coding Technologies Sweden Ab | Method for reduction of aliasing introduces by spectral envelope adjustment in real-valued filterbanks |
EP1543307B1 (fr) * | 2002-09-19 | 2006-02-22 | Matsushita Electric Industrial Co., Ltd. | Procede et appareil de decodage audio |
KR100917464B1 (ko) * | 2003-03-07 | 2009-09-14 | 삼성전자주식회사 | 대역 확장 기법을 이용한 디지털 데이터의 부호화 방법,그 장치, 복호화 방법 및 그 장치 |
US7318035B2 (en) * | 2003-05-08 | 2008-01-08 | Dolby Laboratories Licensing Corporation | Audio coding systems and methods using spectral component coupling and spectral component regeneration |
US20050004793A1 (en) * | 2003-07-03 | 2005-01-06 | Pasi Ojala | Signal adaptation for higher band coding in a codec utilizing band split coding |
KR100513729B1 (ko) * | 2003-07-03 | 2005-09-08 | 삼성전자주식회사 | 계층적인 대역폭 구조를 갖는 음성 압축 및 복원 장치와그 방법 |
JP4679049B2 (ja) * | 2003-09-30 | 2011-04-27 | パナソニック株式会社 | スケーラブル復号化装置 |
CN100507485C (zh) * | 2003-10-23 | 2009-07-01 | 松下电器产业株式会社 | 频谱编码装置和频谱解码装置 |
RU2381569C2 (ru) * | 2004-01-28 | 2010-02-10 | Конинклейке Филипс Электроникс Н.В. | Способ и устройство масштабирования сигнала по времени |
CA2457988A1 (fr) * | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methodes et dispositifs pour la compression audio basee sur le codage acelp/tcx et sur la quantification vectorielle a taux d'echantillonnage multiples |
EP1742202B1 (fr) * | 2004-05-19 | 2008-05-07 | Matsushita Electric Industrial Co., Ltd. | Dispositif de codage, dispositif de décodage et méthode pour cela |
US20060023748A1 (en) * | 2004-07-09 | 2006-02-02 | Chandhok Ravinder P | System for layering content for scheduled delivery in a data network |
EP1785985B1 (fr) * | 2004-09-06 | 2008-08-27 | Matsushita Electric Industrial Co., Ltd. | Dispositif de codage extensible et procede de codage extensible |
DE602007002385D1 (de) * | 2006-02-06 | 2009-10-22 | France Telecom | Verfahren und vorrichtung zur hierarchischen kodiecodierverfahren und gerät, programme und signal |
BRPI0818927A2 (pt) * | 2007-11-02 | 2015-06-16 | Huawei Tech Co Ltd | Método e aparelho para a decodificação de áudio |
-
2005
- 2005-07-13 FR FR0552199A patent/FR2888699A1/fr active Pending
-
2006
- 2006-07-07 JP JP2008520925A patent/JP5112309B2/ja not_active Expired - Fee Related
- 2006-07-07 EP EP06779029A patent/EP1905010B1/fr not_active Not-in-force
- 2006-07-07 US US11/988,758 patent/US8374853B2/en not_active Expired - Fee Related
- 2006-07-07 AT AT06779029T patent/ATE511179T1/de not_active IP Right Cessation
- 2006-07-07 BR BRPI0612987-0A patent/BRPI0612987A2/pt not_active IP Right Cessation
- 2006-07-07 KR KR1020087003000A patent/KR101303145B1/ko active IP Right Grant
- 2006-07-07 CN CN2006800336707A patent/CN101263553B/zh not_active Expired - Fee Related
- 2006-07-07 WO PCT/FR2006/050690 patent/WO2007007001A2/fr active Application Filing
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004000219A2 (fr) | 2002-06-20 | 2003-12-31 | Novalar Pharmaceuticals, Inc. | Formulations stabilisees d'antagonistes de recepteurs adrenergiques alpha et leur utilisation |
Non-Patent Citations (9)
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010003624A2 (fr) | 2008-07-09 | 2010-01-14 | Sanofi-Aventis | Composés hétérocycliques, leurs procédés de préparation, médicaments comprenant lesdits composés et leur utilisation |
WO2012120057A1 (fr) | 2011-03-08 | 2012-09-13 | Sanofi | Nouveaux dérivés phényl-oxathiazine substitués, procédé pour leur préparation, agent pharmaceutique contenant ces composés et leur utilisation |
CN115116457A (zh) * | 2022-06-15 | 2022-09-27 | 腾讯科技(深圳)有限公司 | 音频编码及解码方法、装置、设备、介质及程序产品 |
Also Published As
Publication number | Publication date |
---|---|
CN101263553B (zh) | 2013-10-02 |
FR2888699A1 (fr) | 2007-01-19 |
JP2009501351A (ja) | 2009-01-15 |
US8374853B2 (en) | 2013-02-12 |
EP1905010B1 (fr) | 2011-05-25 |
WO2007007001A3 (fr) | 2007-04-12 |
ATE511179T1 (de) | 2011-06-15 |
BRPI0612987A2 (pt) | 2010-12-14 |
KR101303145B1 (ko) | 2013-09-09 |
JP5112309B2 (ja) | 2013-01-09 |
CN101263553A (zh) | 2008-09-10 |
US20090326931A1 (en) | 2009-12-31 |
KR20080032160A (ko) | 2008-04-14 |
EP1905010A2 (fr) | 2008-04-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1905010B1 (fr) | Codage/décodage audio hiérarchique | |
EP1907812B1 (fr) | Procede de commutation de debit en decodage audio scalable en debit et largeur de bande | |
EP1989706B1 (fr) | Dispositif de ponderation perceptuelle en codage/decodage audio | |
EP2277172B1 (fr) | Dissimulation d'erreur de transmission dans un signal audionumerique dans une structure de decodage hierarchique | |
CA2766777C (fr) | Allocation de bits dans un codage/decodage d'amelioration d'un codage/decodage hierarchique de signaux audionumeriques | |
CA2512179C (fr) | Procede de codage et de decodage audio a debit variable | |
CA2766864C (fr) | Codage/decodage perfectionne de signaux audionumeriques | |
EP2115741A1 (fr) | Codage/decodage perfectionnes de signaux audionumeriques | |
WO2007096551A2 (fr) | Procede de codage binaire d'indices de quantification d'une enveloppe d'un signal, procede de decodage d'une enveloppe d'un signal et modules de codage et decodage correspondants | |
EP1692689A1 (fr) | Procede de codage multiple optimise | |
EP3069340B1 (fr) | Transition d'un codage/décodage par transformée vers un codage/décodage prédictif | |
EP3175443B1 (fr) | Détermination d'un budget de codage d'une trame de transition lpd/fd | |
EP2005424A2 (fr) | Procede de post-traitement d'un signal dans un decodeur audio | |
JP2010520504A (ja) | レイヤード・コーデックのためのポストフィルタ | |
FR2737360A1 (fr) | Procedes de codage et de decodage de signaux audiofrequence, codeur et decodeur pour la mise en oeuvre de tels procedes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 2008520925 Country of ref document: JP Ref document number: 304/DELNP/2008 Country of ref document: IN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWW | Wipo information: withdrawn in national office |
Ref document number: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087003000 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006779029 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200680033670.7 Country of ref document: CN |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 06779029 Country of ref document: EP Kind code of ref document: A2 |
|
WWP | Wipo information: published in national office |
Ref document number: 2006779029 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 11988758 Country of ref document: US |
|
ENP | Entry into the national phase |
Ref document number: PI0612987 Country of ref document: BR Kind code of ref document: A2 Effective date: 20080114 |