WO2010058117A1 - Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique - Google Patents

Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique Download PDF

Info

Publication number
WO2010058117A1
WO2010058117A1 PCT/FR2009/052194 FR2009052194W WO2010058117A1 WO 2010058117 A1 WO2010058117 A1 WO 2010058117A1 FR 2009052194 W FR2009052194 W FR 2009052194W WO 2010058117 A1 WO2010058117 A1 WO 2010058117A1
Authority
WO
WIPO (PCT)
Prior art keywords
signal
coding
quantization
improvement
noise
Prior art date
Application number
PCT/FR2009/052194
Other languages
English (en)
French (fr)
Inventor
Balazs Kovesi
Stéphane RAGOT
Alain Le Guyader
Original Assignee
France Telecom
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom filed Critical France Telecom
Priority to CN2009801546871A priority Critical patent/CN102282611B/zh
Priority to KR1020117014240A priority patent/KR101339857B1/ko
Priority to EP09768200.9A priority patent/EP2366177B1/fr
Priority to JP2011543801A priority patent/JP5474088B2/ja
Priority to US13/129,483 priority patent/US8965773B2/en
Publication of WO2010058117A1 publication Critical patent/WO2010058117A1/fr

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering
    • G10L19/265Pre-filtering, e.g. high frequency emphasis prior to encoding
    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques

Definitions

  • the present invention relates to the field of coding digital signals.
  • the coding according to the invention is particularly suitable for the transmission and / or storage of digital signals such as audio-frequency signals (speech, music or other).
  • the present invention relates more particularly to the coding of the ADPCM type of waveform coding (for "Pulse Modulation and Adaptive Differential Coding"), or "ADPCM” (for "Adaptive Differential Pulse Code Modulation”) in English, and in particular to the coding of a nested code ADPCM type for delivering scalable bit stream quantization indices.
  • ITU-T Recommendation G.722 or ITU-T G.121 The general principle of nested code ADPCM coding / decoding specified by ITU-T Recommendation G.722 or ITU-T G.121 is as described with reference to Figures 1 and 2.
  • FIG. 1 thus represents an encoder with nested codes of the ADPCM type.
  • a prediction module 110 making it possible to give the prediction of the signal from the previous samples of the quantized error signal Q v is the scaling factor, and reconstituted signal where n is the current moment.
  • a subtraction module 120 which subtracts from the input signal x (n) its prediction to obtain a prediction error signal denoted e (ri).
  • a quantization module 130 of the error signal which receives as input the error signal e (n) to give quantization indices constituted of B + K bits.
  • the quantification module is nested codes that is to say it has a core quantizer B bits and quantizers bit that are embedded on the quantizer heart.
  • the bit quantization index at the output of the module of quantification is transmitted via the transmission channel 140 to the decoder as described with reference to FIG. 2.
  • the encoder also includes:
  • an inverse quantization module 120 for outputting a quantized error signal on B bits
  • an adaptation module 170 p quantizers and inverse quantizers to give a level control parameter v (n) still called scale factor, for the next moment;
  • the dashed portion referenced 155 represents the low-rate local decoder which contains the predictors 165 and 175 and the inverse quantizer 120.
  • This local decoder thus makes it possible to adapt the inverse quantizer to 170 from the low bit rate index I B (n) and adapt the predictors 165 and 175 from the reconstructed low bit rate data.
  • the nested code ADPCM decoder of FIG. 2 receives as input the indices from transmission channel 140, version of possibly disturbed by bit errors, and performs inverse quantization by the inverse quantization module 210 (Q B ) of bit rate B bits per sample to obtain the signal
  • Q B inverse quantization module 210
  • the symbol "'" indicates a value received at the decoder, possibly different from that transmitted by the encoder due to transmission errors.
  • the output signal r ' B (n) for B bits will be equal to the sum of the signal prediction and the output of the B-bit inverse quantizer.
  • This part 255 of the decoder is identical to the low speed local decoder 155 of FIG.
  • the decoder can improve the restored signal.
  • the output will be equal to the sum of the prediction and the output of the inverse quantizer 230 to B + 1.
  • G.722 nested code ADPCM (hereinafter referred to as G.722) coding broadband signals which are defined with a minimum bandwidth of [50-7000 Hz] and sampled at 16 kHz.
  • G.722 coding is an ADPCM coding of each of the two signal subbands [50-4000 Hz] and [4000-7000 Hz] obtained by decomposition of the signal by quadrature mirror filters.
  • the low band is coded by a 6, 5 and 4 bit nested code ADPCM coding while the high band is coded by a 2 bit ADPCM coder per sample.
  • the total bit rate will be 64, 56 or 48 bit / s depending on the number of bits used for decoding the low band.
  • This coding was first used in ISDN (Digital Integrated Services Network) and then in IP network audio coding applications.
  • the 8 bits are distributed in the following manner as represented in FIG.
  • bits I LS and I 16 may be "stolen" or replaced by data and constitute the improvement bits of the low band. Bits constitute the core bits of the low band.
  • a quantized signal frame according to the G.722 standard consists of 8, 7 or 6 bit coded quantization indices.
  • the transmission frequency of the index being 8 kHz
  • the bit rate will be 64, 56 or 48 kbit / s.
  • the quantization noise spectrum will be relatively flat as shown in FIG. 4.
  • the spectrum of the signal is also represented in FIG. 4 (here a voiced signal block). This spectrum has a great dynamic ( ⁇ 40dB). It can be seen that in areas of low energy, the noise is very close to the signal and is therefore not necessarily masked. It can then become audible in these regions, essentially in the frequency range [2000-2500 Hz] in Figure 4.
  • Coding noise formatting is therefore necessary. Coding noise formatting suitable for nested code coding would also be desirable.
  • G.711.1 Wideband embedded extension for G.711
  • modulation code "or” G.711.1 A wideband extension to ITU-T G.711 ". Y. Hiwasaki, S. Sasaki, H. Ohmuro, T. Mon, J. Seong, MS Lee, B. Kovesi, S. Ragot, J.-L. Garcia, C. Marro, LM, J. Xu, V. Malenovsky, J. Lapierre, R. Lefebvre. EUSIPCO, Lausanne, 2008.
  • This recommendation thus describes coding with coding noise formatting for heart rate coding.
  • a perceptual filter for encoding noise shaping is calculated based on past decoded signals from a reverse core quantizer.
  • a local heart rate decoder thus makes it possible to calculate the noise shaping filter.
  • this noise shaping filter is possible to calculate from decoded heart rate signals.
  • a quantizer delivering improvement bits is used at the encoder.
  • the decoder receiving the core bit stream and the improvement bits, calculates the coding noise shaping filter in the same way as the coder from the decoded heart rate signal and applies this filter to the output signal of the decoder.
  • inverse quantizer of the enhancement bits the shaped high-speed signal being obtained by adding the filtered signal to the decoded heart signal.
  • the shaping of the noise thus improves the perceptual quality of the heart rate signal. It offers a limited improvement in quality for improvement bits. Indeed, the formatting of the coding noise is not carried out for the coding of the improvement bits, the input of the quantizer being the same for the quantization of the core as for the improved quantization.
  • the decoder must then remove a resulting parasitic component by a matched filtering, when the improvement bits are decoded in addition to the core bits.
  • This technique is not used in existing standard scalable decoders of the G.722 or G.727 decoder type. There is therefore a need to improve the quality of the signals regardless of the bit rate while remaining compatible with standard scalable existing decoders.
  • the present invention improves the situation.
  • the enhancement coding comprises a step of obtaining a coding noise shaping filter used to determine a target signal and that the scalar quantization indices of said enhancement signal, are determined by minimizing the error between a set of possible scalar quantization values and said target signal.
  • the signal received at the decoder can therefore be decoded by a standard decoder capable of decoding the heart rate and nested rate signal that does not require noise shaping calculation or correction term.
  • the quality of the decoded signal is therefore improved whatever the bit rate available to the decoder.
  • an embodiment of the determination of the target signal is such that for a current improvement coding stage, the method comprises the following steps for a current sample:
  • the set of possible scalar quantization values and the quantization value of the error signal for the current sample are values designating quantization reconstruction levels, scaled by a parameter. level control calculated with respect to the heart rate quantization indices.
  • the values are adapted to the output level of the core coding.
  • the values designating quantization reconstruction levels for an improvement stage k are defined by the difference between the values designating the reconstruction levels of the quantization of a nested quantizer with B + k bits, B denotes the number of bits of the core coding and the values designating the quantization reconstruction levels of a nested quantizer at B + k-1 bits, the reconstruction levels of the nested quantizer at B + k bits being defined by doubling the levels. of reconstruction of the nested quantizer at B + k-1 bits.
  • the values designating quantization reconstruction levels for the improvement stage k are stored in a memory space and indexed according to the core rate and enhancement quantization indices.
  • Output values of the enhancement quantizer are not recalculated for each sampling time by subtracting the output values of the quantizer at B + k bit from those of the quantizer at B + k-1 bits. They are more, for example, arranged 2 by 2 in a table easily indexable by the index of the previous stage.
  • the number of possible scalar quantization values varies for each sample.
  • the number of coded samples of said improvement signal giving the scalar quantization indices is smaller than the number of samples of the input signal.
  • a possible embodiment of the core coding is for example an ADPCM coding using a scalar quantization and a prediction filter.
  • Another possible embodiment of the core coding is, for example, a PCM coding.
  • the core coding may also comprise a formatting of the coding noise, for example with the following steps for a current sample:
  • a shaping of the coding noise of less complexity is thus carried out for the core coding.
  • the noise shaping filter is defined by an ARMA filter or a succession of ARMA filters.
  • this type of weighting function with a numerator value and a denominator value has the advantage that the denominator value takes signal peaks into account and the numerator value to mitigate these values. peaks thus providing an optimal shaping of the quantization noise.
  • the succession of cascade ARMA filters makes it possible to better model the masking filter by modeling components of the envelope of the signal spectrum and of periodicity or quasi-periodicity components.
  • the noise shaping filter is decomposed into two ARMA filtering cells in a cascade of decoupled spectral slope and formational pitch.
  • each filter is adapted according to the spectral characteristics of the input signal and is therefore suitable for signals having various types of spectral slopes.
  • the noise shaping filter (W (z)) used by the enhancement coding is also used by the core coding, thus reducing the implementation complexity.
  • the noise shaping filter is calculated as a function of said input signal so as best to adapt to different input signals.
  • the noise shaping filter is calculated from a signal locally decoded by the core coding.
  • the present invention also relates to a hierarchical coder of a digital audio signal for a current frame of the input signal comprising:
  • a core coding stage delivering a scalar quantization index for each sample of the current frame
  • At least one improvement coding stage delivering scalar quantization indices for each coded sample of an improvement signal.
  • the encoder is such that the enhancement coding stage comprises a module for obtaining a coding noise shaping filter used to determine a target signal and a quantization module delivering the scalar quantization indices of said signal. improvement method by minimizing the error between a set of possible scalar quantization values and said target signal.
  • ElIe also relates to a computer program comprising code instructions for implementing the steps of the encoding method according to the invention, when these instructions are executed by a processor.
  • the invention relates to a storage means readable by a processor storing a computer program as described.
  • FIG. 1 illustrates a coder of the ADPCM type with nested codes according to the state of the art and as previously described;
  • FIG. 2 illustrates a decoder of the ADPCM type with nested codes according to the state of the art and as described above;
  • FIG. 3 illustrates an example of a quantization index frame of a coder of the ADPCM type with nested codes according to the state of the art and as previously described;
  • FIG. 4 represents a spectrum of a signal block with respect to the spectrum of a quantization noise present in an encoder not implementing the present invention
  • FIG. 5 represents a block diagram of an embedded coder and a coding method according to a general embodiment of the invention
  • FIGS. 6a and 6b show a block diagram of an improvement coding stage and an improvement coding method according to the invention
  • FIG. 7 illustrates different configurations of decoders adapted to the decoding of a signal derived from the coding according to the invention
  • FIG. 8 represents a block diagram of a first detailed embodiment of an encoder according to the invention and of a coding method according to the invention
  • FIG. 9 illustrates an example of calculation of a coding noise for the core coding stage of an encoder according to the invention
  • FIG. 10 illustrates a detailed function for calculating a coding noise of FIG. 9
  • FIG. 11 illustrates an example of obtaining a set of quantization reconstruction levels according to the coding method of the invention
  • FIG. 12 illustrates a representation of the improvement signal according to the coding method of the invention
  • FIG. 13 illustrates a flowchart representing the steps of a first embodiment of the calculation of the masking filter for coding according to the invention
  • FIG. 14 illustrates a flowchart representing the steps of a second embodiment of the calculation of the masking filter for coding according to the invention
  • FIG. 15 represents a block diagram of a second detailed embodiment of an encoder according to the invention and of a coding method according to the invention
  • FIG. 16 represents a block diagram of a third detailed embodiment of an encoder according to the invention and of a coding method according to the invention.
  • FIG. 17 represents a possible embodiment of an encoder according to the invention.
  • This encoder comprises a heart rate encoding stage 500 with B-bit quantization, of the type for example ADPCM encoding such as the G.722 or G.727 or PCM standardized coder for "Coded pulse modulation” or PCM in English (for "Pulse code modulation") such as the standardized encoder G.71 1 and modified according to the outputs of block 520.
  • the block referenced 510 represents this core coding stage with shaping of the coding noise, that is to say masking of the noise of the core coding, described in more detail later with reference to FIGS. 8, 15 or 16.
  • the invention as presented also relates to the case where no masking of the coding noise in the core part is performed.
  • core coder is used in a broad sense in this document.
  • core encoder an existing multi-rate encoder such as ITU-T G.722 56 or 64 kbit / s.
  • core encoder at 0 kbit / s, that is to say, apply the enhancement coding technique that is the subject of the present invention from the first step of the coding. In the latter case the improvement coding becomes core coding.
  • the core coding stage described here with reference to FIG. 5, with noise shaping comprises a filtering module 520 performing the prediction P r (z) from the quantization noise q B (n) and the noise of the noise. filtered quantization q B (n) to provide a prediction signal p
  • the filtered quantization noise q B (n) is obtained for example by adding K M partial predictions of the filtered noise to the quantization noise as described later with reference to FIG. 9.
  • the heart coding stage receives the signal x (n) as input and outputs the quantization index I B (n), the reconstructed signal r B (n) from I B (n) and the factor d quantizer scale v (n) in the case for example of an ADPCM coding as described with reference to FIG.
  • the encoder as represented in FIG. 5 also comprises several improvement coding stages.
  • the EAl stage (530), the EAk stage (540) and the EAk2 stage (550) are represented here.
  • each enhancement coding stage k inputs the signal x (n), the optimal index. , concatenation of the coding index heart and indexes of the previous improvement stages J or of equivalently all these indices, the signal reconstructed in the previous step the parameters of the masking filter and, if appropriate, the scale factor v (n) in the case of adaptive coding.
  • This improvement stage outputs the quantization index of the improvement bits of this coding stage which will be concatenated with the index in the concatenation module 560.
  • the improvement stage k also outputs the reconstructed signal .
  • the index J k (n) represents here a bit for each sample of index n; however, in the general case J k (n) can represent several bits per sample if the number of possible quantization values is greater than 2.
  • stages correspond to bits to be transmitted which will be concatenated to the index so that the resulting index can be decoded by a standard decoder as shown and subsequently described in Figure 7. It is therefore not necessary to change the remote decoder; in addition, no additional information is needed to "inform" the remote decoder of the processing performed at the encoder.
  • bits correspond to improvement bits by increasing the bit rate and masking and require an additional decoding module described with reference to FIG. 7.
  • the encoder of FIG. 5 also comprises a module 580 for calculating the noise shaping filter or masking filter, from the input signal or the coefficients of the synthesis filters of the encoder as described later with reference to FIGS. and 14. It should be noted that the 580 module could have as input the decoded signal locally rather than the original signal.
  • the enhancement coding stages as shown herein provide enhancement bits providing increased signal quality to the decoder, regardless of the rate of the decoded signal and without modifying the decoder and therefore without any additional complexity to the decoder.
  • a module Eak of FIG. 5 representing an improvement coding stage k according to one embodiment of the invention is now described with reference to FIG. 6a.
  • the improvement coding performed by this coding stage includes a quantization step which outputs an index and a value of quantization minimizing the error between a set of possible quantization values and a target signal determined by using the encoding noise shaping filter.
  • the stage k makes it possible to obtain the improvement bit or a group of bits
  • It comprises an EAk-I module for subtracting the input signal x (n) from the signal synthesized at the stage for each preceding sample a current frame and the signa of the previous stage for sample n, to give a coding error signal
  • a weighted quadratic error criterion will be minimized in the quantization step, so that noise Spectrally shaped is less audible.
  • the stage k thus comprises a filter module EAk-2 of the error signal by the weighting function W (z).
  • This weighting function can also be used for noise shaping in the heart coding stage.
  • the noise shaping filter is here equal to the inverse of the spectral weighting, that is to say:
  • This formatting filter is ARMA type ("AutoRegressive Moving Average"). Its transfer function includes a numerator of order N N and a denominator of order Np.
  • the EAk-I block serves essentially to define the memories of the non-recursive part of the filter W (z), which correspond to the denominator of H M (z).
  • the definition of the memories of the recursive part of W (z) is not shown for the sake of brevity, but it is deduced from and of
  • This filtering module outputs a corresponding filtered signal to the target signal.
  • spectral weighting The role of spectral weighting is to shape the spectrum of the coding error, which is achieved by minimizing the energy of the weighted error.
  • An EAk-3 quantization module performs the quantization step which, based on possible quantization output values, seeks to minimize the weighted error criterion according to the following equation:
  • This module EAk-3 makes a quantification of improvement Q enh having as first output the value of the optimal bit J k to concatenate with the index of the previous stage and for the second output, the signal output of the quantizer for the optimal index J k where v (n) represents a scale factor defined by the core coding to adapt the output level of the quantizers.
  • the improvement coding stage finally comprises an EAk-4 module for adding the quantized error signal. to the synthesized signal at
  • the number of samples to be kept in memory is therefore equal to the number of coefficients of the denominator of the noise shaping filter.
  • the index n is incremented by one unit.
  • the invention shown in Figure 6a can be achieved by equivalent variants.
  • the reconstructed signal can be broken down into a part determined only by the samples already available (past samples samples present from previous stages, memories of the filters) and another part to be determined s opt (n) depending solely on the sample present to be optimized.
  • the calculation of the error to be minimized which is the error weighted between the input signal and the reconstructed signal r can also be broken down into two parts.
  • W (z) the weighted difference by W (z) between the input sample x (n) and (EAK-I and EAK-2 modules of Figure 6a).
  • the value thus obtained is the target signal at the instant n which is reduced to a single target value, it is to be calculated once for each possible value of quantification. Then in the loop For optimization, we must simply find out of all possible scalar quantization values the one that is closest in the sense of the Euclidean distance of this target value.
  • Another variant of calculating the target value is to carry out two weighting filterings W (z). The first weights the difference between the input signal and the reconstructed signal of the previous stage The second filter has zero input but these memories are updated using enh (n) v (n). The difference between the outputs of these two filterings gives the same target signal.
  • Block 601 gives the coding error of the previous stage
  • Block 602 derives one by one all the possible scalar quantization values en h, which are subtracted from block 603 to obtain the coding error of the current stage.
  • This error is weighted by the noise shaping filter W (z) (block 604) and minimized (block 605) for controlling the block 602.
  • W (z) the noise shaping filter
  • block 605 the value locally decoded by the enhancement coding stage is) (block 606).
  • the decoding device implemented depends on the signal transmission rate and for example the origin of the signal depending on whether it comes from an ISDN network 710 for example or an IP network 720.
  • the restored signal resulting from this decoding will benefit from improved quality through the enhancement coding stages implemented in the encoder.
  • an additional decoder 730 then performs inverse quantization in addition to inverse quantifications at B + 1 and B + 2 bits described with reference to Figure 2 to provide the quantized error that added to the prediction signal will give the enhanced broadband signal
  • the heart rate coding stage 800 performs an ADPCM type coding with a formatting of coding noise.
  • the heart coding stage comprises a module 810 for calculating the prediction of the signal x B (n) produced from the preceding samples of the quantized error signal via the low flow rate index I B (n) of the core layer and the reconstructed signal r as that described with reference to FIG.
  • a subtraction module 801 of the prediction to the input signal x (n) is provided to obtain a prediction error signal B
  • the core coder also comprises a noise prediction module P r (z) 802 made from previous samples of the quantization noise and filtered noise
  • An addition module 803 of the noise prediction / to the prediction error signal d is also provided to obtain an error signal noted e B (n).
  • a core quantization module Q B receives the error signal as input to give quantization indices
  • the optimal quantification index I B (ri) and the quantified value minimize the error criterion where the values y B (ri) are the reconstructed levels and v (n) the scaling factor from the quantizer adaptation module 804.
  • the reconstruction levels of the core quantizer Q B are defined by the table VI of the article of X. Master. "7 kHz audio coding within 64 kbit / s", IEEE Journal on Selected Areas in Communication, Vol.6-2, February 1988 ".
  • the quantization index I B (n) of B bits at the output of the quantization module Q B will be multiplexed in the multiplexing module 830 with the bits improvement J t , ..., J ⁇ before being transmitted via the transmission channel 840 to the decoder as described with reference to Figure 7.
  • the core coding stage also comprises a module 805 for calculating the quantization noise, the difference between the input of the quantizer and its output a module 806 for calculating the quantization noise filtered by addition of the quantization noise to the quantization noise prediction and a module 807 for calculating the reconstructed signal in adding the signal prediction to the quantized error r
  • the adaptation module 804 of the quantizer Q B gives a parameter control level v (n) still called scale factor for the next moment
  • the prediction module 810 comprises an adaptation module 811 from the samples of the reconstructed quantized error signal and possibly reconstructed quantized error signal Q filtered by
  • the module 850 Cale Mask detailed later is intended to provide the coding noise shaping filter that can be used by both the core coding stage and the improvement coding stages, either from the signal of the coding noise. input, either from the signal locally decoded by the core coding (at the heart rate), or from the prediction filter coefficients calculated in the ADPCM coding by a simplified gradient algorithm.
  • the noise shaping filter can be obtained from the coefficients of a prediction filter used for heart rate coding, by adding damping constants and adding a de-rating filter. emphasis.
  • the masking module is also possible to use the masking module only in the improvement stages; this alternative is advantageous in the case where the core coding uses few bits per sample, in which case the coding error is not white noise and the signal-to-noise ratio is very low - this situation is reflected in ADPCM coding at 2 bits per sample of the high band (4000-8000 Hz) in G.722, in which case feedback noise shaping is not effective.
  • the noise shaping of the core coding corresponding to blocks 802, 803, 805, 806 in Figure 8, is optional.
  • the invention as shown in FIG. 16 applies even for a reduced ADPCM core coding to blocks 801, 804, 807, 810, 811, 820.
  • FIG. 9 describes in more detail the module 802 performing the calculation of the quantization noise prediction by an ARMA filter (for "AutoRegressive to Adjusted Average” of general expression:
  • the filter is represented by cascading ARMA filtering cells 900, 901, 902:
  • Quantization noise filtered of Figure 9 at the end of this filter cascade, will be given as a function of the quantization noise by :
  • FIG. 10 shows in more detail a module F k (z) 901.
  • the quantization noise at the output of this cell k is given by:
  • perceived coding is shaped by the filter so less audible.
  • an ARMA filter cell can be deduced
  • This type of weighting function with a numerator value and a denominator value has the advantage that the denominator value takes into account the signal peaks and the numerator value of attenuating these peaks. thus an optimal shaping of the quantization noise.
  • the values of g] and g 2 are such that: 0
  • a slight shaping from the fine structure of the signal revealing the periodicities of the signal reduces the perceived quantization noise between the harmonics of the signal.
  • the improvement is particularly significant in the case of signals with a fundamental frequency or a relatively high pitch, for example greater than 200 Hz.
  • An ARMA cell for long-term noise shaping is given by:
  • the coder also comprises several improvement coding stages. Two stages EA1 and EAk are represented here.
  • This coding stage comprises an EAk-I module for subtracting from the input signal x ⁇ ) the signal r B + k (n) formed of the signal synthesized at the stage kr B + k (n) for the instants of sampling n - l, ..., n - N D and signal synthesized at the stage k-1 for the moment n, to give a coding error signal e B + k (n).
  • An EAk-2 filtering module of e B + k (n) by the weighting function W (z) is also included in the coding stage k.
  • This weighting function is equal to the inverse of the masking filter H M (z) given by the core coding such as previously described.
  • a filtered signal e w + (rc) is obtained.
  • the stage k also comprises an addition module EAk-4 of the quantized error signal to the signal synthesized on the previous stage to give the signal synthesized upstairs
  • the filtered error signal is then given in z-transformed notation, by:
  • a partial reconstituted signal r B + k (n) is calculated from the signal reconstructed on the previous stage r B + k ⁇ ] (n) and past samples of the signal
  • This signal is subtracted from the signal x (n) to give the error signal
  • the error signal is filtered by the filter having an ARMA filtering cell W x to give:
  • the weighted error criterion is to minimize the squared error for the two values (or N G values if several bits) of possible outputs of the quantizer:
  • cascade filtering is carried out.
  • the output of the first filter cell will be equal to:
  • the improvement bits are obtained bit by bit or group of bits per group of bits in cascade improvement stages.
  • the improvement bits according to the invention are calculated in such a way that the improvement signal at the output of the standard decoder is reconstructed with shaping of the quantization noise.
  • the values designating levels of quantization reconstructions for an improvement stage k are defined by the difference between the values designating the quantization reconstruction levels of a nested quantizer at B +. k bits, where B denotes the number of bits of the core coding and the values designating the quantization reconstruction levels of a nested quantizer at B + k-1 bits, the reconstruction levels of the nested quantizer at B + k bits being defined by duplication of reconstruction levels of the nested quantizer at B + k-1 bits.
  • v (n) representing the scale factor defined by the core coding to adapt the output level of the fixed quantizers.
  • the quantization for the quantizers at B, B + 1, ..., B + K bits was performed at one time by identifying in which decision range the quantizer at B + K bits is the value e (n) to quantify.
  • the present invention provides a different method. Knowing the quantized value from the quantizer at B + k-1 bits, the quantization of the signal e ⁇ + li (n) at the input of the quantizer is done by minimizing the quantization error and without using the decision thresholds, which advantageously makes it possible to reduce the calculation noise for a fixed-point implantation of the product enh tel than:
  • a weighted quadratic error criterion will be minimized so that the spectrally shaped noise is less audible.
  • the spectral weighting function used is W (z), which can also be used for noise shaping in the core coding stage.
  • the restored heart signal is equal to the sum of the prediction and the output of the inverse quantizer, ie:
  • the two reconstructed signals possible at the k-stage are given as a function of the signal actually reconstructed at the k-1 stage by the following equation:
  • the signal is defined as being equal to the sum of the two signals: representing the concatenation of all the values
  • the filtered error signal () will be equal to:
  • the optimal index J k is the one that minimizes the criterion E for realizing scalar quantization from both levels of improvement calculated from the scalar quantizer reconstruction levels to bits and knowing the optimal core index and clues or equivalent
  • the quantizer output value for the optimal index is: and the value of the reconstituted signal at time n will be given by:
  • FIGS. 13 and 14 illustrate two masking filter calculation embodiments implemented by the module 850 for calculating the masking filter.
  • a current signal block that corresponds to the block of the current frame completed by a segment sample of the previous frame is taken into account.
  • the signal is pre-processed (pre-emphasis processing) before the calculation in E60 of the correlation coefficients by a filter A 1 (z) whose coefficient or coefficients are either fixed are adapted by linear prediction as described in the patent FR2742568.
  • the signal to be analyzed S p (n) is calculated by inverse filtering:
  • the signal block is then weighted at E 61 by a Hanning window or a window formed by the concatenation of sub-windows, as known from the state of the art.
  • Constants allow you to adjust the spectrum of Masking filter including the first two that regulate the slope of the filter spectrum.
  • FIG. 1 A second example of implementation of the masking filter, of low complexity, is illustrated with reference to FIG.
  • the principle here is to directly use the synthesis filter of the ARMA filter reconstruction of the decoded signal with a de-emphasis applied by a compensation filter according to the slope of the input signal.
  • the ARMA ADPCA predictor has 2 denominator coefficients.
  • the compensation filter calculated in E71 will be of the form:
  • masking filter Another very simple form of masking filter is that obtained by taking only the denominator of the ARMA predictor with a slight damping: with for example
  • This AR partial reconstruction filter of the signal leads to a reduced complexity.
  • the coefficients of the filter to be buffered over a signal frame can be fixed or several times per frame to maintain a smoothing effect.
  • One way to perform the smoothing is to detect sudden changes in dynamics on the signal at the input of the quantizer or equivalent but minimum complexity directly on the output indices of the quantizer. Between two abrupt variations of indices we obtain an area where the spectral characteristics are less fluctuating, and therefore with ADPCM coefficients more suitable for the purpose of masking.
  • the pitch period is calculated, for example, by minimizing the quadratic long-term prediction error at the input e B (n) of the quantizer Q B of FIG. 8, by maximizing the correlation coefficient:
  • Pitch is such that:
  • the pitch prediction gain Cor f (i) used to generate the masking filters is given by:
  • FIG. 15 proposes a second embodiment of an encoder according to the invention.
  • This embodiment uses prediction modules in place of the filter modules described with reference to FIG. 8, for both the core coding stage and the improvement coding stages.
  • the ADPCM type encoder with heart quantization noise formatting includes a noise prediction module 1505. of reconstruction difference between the input signal x (n) and the low speed synthesized signal r B (n) and an addition module 1510 of the prediction to the input signal x (n),
  • the core encoder also comprises a noise prediction calculation module P N , (Z) 1530 made from the preceding quantization noise samples and a subtraction module 1540 of the prediction thus obtained to the prediction error signal to obtain an error signal noted e B (n).
  • a core quantization module Q B in 1550 minimizes the quadratic error criterion where the values are the levels reconstructed and v (n) the scaling factor from the adaptation module 1560 of the quantizer.
  • the quantization module receives as input the error signal to output quantization indices I B (n) and the quantized signal
  • the Q B core quantizer reconstruction levels are defined by Table VI of the X. Master article. "7 kHz audio coding within 64 kbit / s". IEEE Journal on Selected Areas in Communication, Vol.6-2, February 1988 ".
  • the quantification index I B (n) of B bits at the output of the quantization module Q B will be multiplexed at 830 with the improvement bits J 1 ,..., J k before being transmitted via the transmission channel. 840 to the decoder as described with reference to Figure 7.
  • a quantization noise calculation module 1570 makes the difference between quantizer input and quantizer output
  • a module 1580 calculates the reconstructed signal by adding the prediction of the signal to the quantized error
  • Q Adapt adaptation module 1560 quantifier gives a level of control parameter v (n) also called scale factor for the next moment.
  • An adaptation module P Ada t 811 of the prediction module performs an adaptation from the passed samples of the reconstructed signal and the signal of quantized error reconstructed
  • the improvement stage EAk comprises an EAk-10 module for subtracting the signal reconstructed at the stage preceding the input signal x (n) for give the signal
  • the signal filtering is carried out by the filter module EAk-11 by the filter to give the filtered signal
  • An EAk-12 module for calculating a prediction signal also calculated from the previous quantized samples of the quantized error signal and samples thereof. signal filtered by.
  • the EA-k enhancement stage also includes a module EA-kl3 subtraction of the prediction ) to the signal to give a target signal
  • the improvement quantification module performs a step of minimizing the quadratic error criterion:
  • This module receives as input the signal and provides for first output the quantized signal and for second output the index J k .
  • the reconstructed levels of the N + k-bit nested quantizer are computed by splitting the nested output levels of the quantizer to B + k-1 bits. Difference values between these reconstructed levels of the nested quantizer at B + k bits and those of the quantizer at B + k-1 bits are calculated. The difference values are then stored once and for all in processor memory and are indexed by the combination of the core quantization index and quantizer enhancement indices of the previous stages.
  • An addition module EAk-15 of the output signal of the quantizer e to the prediction P is also integrated into the improvement stage k as well as a module EAk-16 for adding the signal preceding the signal reconstituted on the previous stage to give the signal restored to the floor
  • the previously detailed module Cale Mask 850 provides the masking filter either from the input signal (FIG. 13) or from the coefficients of the ADPCM synthesis filters as explained with reference. in Figure 14.
  • the improvement stage k implements the following steps for a current sample: - obtaining a difference signal) by calculating the difference between the input signal x (n) of the hierarchical coding and a reconstructed signal derived from an improvement coding of a preceding improvement coding stage;
  • Fig. 15 is given for a masking filter consisting of a single ARMA cell for simple explanation purposes. It is understood that cascading multi-cell ARMA generalization will be performed according to the method described by equations 7 to 17 and FIGS. 9 and 10.
  • FIG. 16 represents a third embodiment of the invention with this time a PCM core coding stage.
  • the core coding stage 1600 comprises a coding noise shaping via a prediction module P, (z) 1610 calculating the noise prediction. from previous samples of standardized G.71 1 MIC quantization noise and filtered noise
  • noise shaping of the core coding corresponding to the blocks 1610, 1620, 1640 and 1650 in Figure 16, is optional.
  • the invention as shown in FIG. 16 applies even for a MIC core coding reduced to block 1630.
  • a module 1620 performs the addition of the prediction) to the signal input x (n) to obtain an error signal noted e (n).
  • a heart quantization module 1630 receives as input the signal e (") to give quantization indices I B (n). The optimal quantification index I B (n) and the quantized value minimize
  • PCM Pulse Code Modulation
  • the quantization index of B bits at the output of the module of quantification will be concatenated in 830 with the improvement bits J t , ..., J ⁇ before being transmitted via the transmission channel 840 to the standard G.71 1 type decoder.
  • a quantization noise calculation module 1640 makes the difference between the input of the quantizer MIC and the quantized output
  • a filtered quantization noise calculation module 1650 performs the addition of the quantization noise to the quantization noise prediction Improvement coding consists of improving the quality of the decoded signal by successive addition of quantization bits while keeping optimal formatting of the reconstruction noise for the intermediate rates.
  • stage k making it possible to obtain the improvement MIC bit J k or a group of bits is described by the EAk block.
  • This improvement coding stage is similar to that described with reference to FIG. 8.
  • It comprises an EAk-I subtraction module of the input signal x (n) of the signal r B + k (n) formed of the signal synthesized at the stage k for the samples and the signal synthesized at the k-1 stage for the moment n for give a coding error signal e .
  • An EAk-4 addition module of the quantized error signal at signal synthesized in the previous step gives the signal synthesized in step k.
  • the signal and the memories of the filter are adapted as described previously for Figures 6 and 8.
  • the module 850 calculates the masking filter used for both the core coding and the enhancement coding. It is possible to envisage other versions of the hierarchical coder, represented in FIGS. 8, 15 or 16.
  • the number of possible quantization values in the improvement coding varies for each coded sample.
  • Improvement coding uses a variable number of bits depending on the samples to be coded.
  • the number of enhancement bits allocated can be adapted according to an allocation rule, fixed or variable.
  • An example of a variable allocation is given, for example, by the low band enhancement MIC coding in ITU-T G.711.1.
  • the allocation algorithm if it is variable must use information available to the remote decoder, so that no additional information needs to be transmitted, which is the case for example in the ITU standard -T G.711.1.
  • the number of coded samples of the improvement signal giving the scalar quantization indices (J k (n)) in the enhancement coding may be less than the number of samples of the signal of enhancement. 'Entrance. This variant is deduced from the previous variant when the number of improvement bits allocated is set to zero for some samples.
  • an encoder as described according to the first, the second or the third embodiment in the sense of the invention typically comprises a ⁇ P processor cooperating with a memory block BM including a storage and / or working memory, as well as a aforementioned MEM buffer memory as a means for storing, for example, quantization values of the preceding coding stages or a dictionary of quantization reconstruction levels or any other data necessary for the implementation of the coding method as described with reference to FIGS. 6, 8, 15 and 16.
  • This encoder receives as input successive frames of the digital signal x (n) and delivers concatenated quantization indices I B + K.
  • the memory block BM may comprise a computer program comprising the code instructions for implementing the steps of the method according to the invention when these instructions are executed by a ⁇ P processor of the encoder and in particular a coding at a predetermined rate called core rate, delivering a scalar quantization index for each sample of the current frame and at least one enhancement coding delivering scalar quantization indices for each coded sample of an enhancement signal.
  • This enhancement coding comprises a step of obtaining a coding noise shaping filter used to determine a target signal. The scalar quantization indices of said enhancement signal are determined by minimizing the error between a set of possible scalar quantization values and said target signal.
  • a means of storage readable by a computer or a processor, integrated or not integrated with the encoder, possibly removable, stores a computer program implementing a coding method according to the invention.
  • Figures 8, 15 or 16 may for example illustrate the algorithm of such a computer program.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Theoretical Computer Science (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
PCT/FR2009/052194 2008-11-18 2009-11-17 Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique WO2010058117A1 (fr)

Priority Applications (5)

Application Number Priority Date Filing Date Title
CN2009801546871A CN102282611B (zh) 2008-11-18 2009-11-17 数字音频信号的分级编码方法和分级编码器
KR1020117014240A KR101339857B1 (ko) 2008-11-18 2009-11-17 계층적 코더 내에 잡음 성형을 구비한 코딩
EP09768200.9A EP2366177B1 (fr) 2008-11-18 2009-11-17 Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique
JP2011543801A JP5474088B2 (ja) 2008-11-18 2009-11-17 スケーラブルエンコーダでのノイズ変換を伴う音声デジタル信号の符号化
US13/129,483 US8965773B2 (en) 2008-11-18 2009-11-17 Coding with noise shaping in a hierarchical coder

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0857839 2008-11-18
FR0857839A FR2938688A1 (fr) 2008-11-18 2008-11-18 Codage avec mise en forme du bruit dans un codeur hierarchique

Publications (1)

Publication Number Publication Date
WO2010058117A1 true WO2010058117A1 (fr) 2010-05-27

Family

ID=40661226

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FR2009/052194 WO2010058117A1 (fr) 2008-11-18 2009-11-17 Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique

Country Status (7)

Country Link
US (1) US8965773B2 (ko)
EP (1) EP2366177B1 (ko)
JP (1) JP5474088B2 (ko)
KR (1) KR101339857B1 (ko)
CN (1) CN102282611B (ko)
FR (1) FR2938688A1 (ko)
WO (1) WO2010058117A1 (ko)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012080649A1 (fr) 2010-12-16 2012-06-21 France Telecom Codage perfectionne d'un etage d'amelioration dans un codeur hierarchique

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2911228A1 (fr) * 2007-01-05 2008-07-11 France Telecom Codage par transformee, utilisant des fenetres de ponderation et a faible retard.
FR2938688A1 (fr) * 2008-11-18 2010-05-21 France Telecom Codage avec mise en forme du bruit dans un codeur hierarchique
FR2961980A1 (fr) * 2010-06-24 2011-12-30 France Telecom Controle d'une boucle de retroaction de mise en forme de bruit dans un codeur de signal audionumerique
FR2972875A1 (fr) * 2011-03-17 2012-09-21 France Telecom Procede et dispositif de filtrage lors d'un changement de filtre arma
JP6079230B2 (ja) * 2012-12-28 2017-02-15 株式会社Jvcケンウッド 付加情報挿入装置、付加情報挿入方法、付加情報挿入プログラム、付加情報抽出装置、付加情報抽出方法、及び付加情報抽出プログラム
EP2951825B1 (en) 2013-01-29 2021-11-24 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for generating a frequency enhanced signal using temporal smoothing of subbands
US9667959B2 (en) 2013-03-29 2017-05-30 Qualcomm Incorporated RTP payload format designs
US9466305B2 (en) 2013-05-29 2016-10-11 Qualcomm Incorporated Performing positional analysis to code spherical harmonic coefficients
US9502044B2 (en) 2013-05-29 2016-11-22 Qualcomm Incorporated Compression of decomposed representations of a sound field
US9922656B2 (en) 2014-01-30 2018-03-20 Qualcomm Incorporated Transitioning of ambient higher-order ambisonic coefficients
US9502045B2 (en) 2014-01-30 2016-11-22 Qualcomm Incorporated Coding independent frames of ambient higher-order ambisonic coefficients
US10770087B2 (en) 2014-05-16 2020-09-08 Qualcomm Incorporated Selecting codebooks for coding vectors decomposed from higher-order ambisonic audio signals
US9620137B2 (en) 2014-05-16 2017-04-11 Qualcomm Incorporated Determining between scalar and vector quantization in higher order ambisonic coefficients
US9852737B2 (en) 2014-05-16 2017-12-26 Qualcomm Incorporated Coding vectors decomposed from higher-order ambisonics audio signals
EP4002359A1 (en) * 2014-06-10 2022-05-25 MQA Limited Digital encapsulation of audio signals
US9747910B2 (en) 2014-09-26 2017-08-29 Qualcomm Incorporated Switching between predictive and non-predictive quantization techniques in a higher order ambisonics (HOA) framework
CN109416913B (zh) * 2016-05-10 2024-03-15 易默森服务有限责任公司 自适应音频编解码系统、方法、装置及介质
US10756755B2 (en) 2016-05-10 2020-08-25 Immersion Networks, Inc. Adaptive audio codec system, method and article
US10699725B2 (en) 2016-05-10 2020-06-30 Immersion Networks, Inc. Adaptive audio encoder system, method and article
US10770088B2 (en) 2016-05-10 2020-09-08 Immersion Networks, Inc. Adaptive audio decoder system, method and article
US11281312B2 (en) 2018-01-08 2022-03-22 Immersion Networks, Inc. Methods and apparatuses for producing smooth representations of input motion in time and space
US10763885B2 (en) * 2018-11-06 2020-09-01 Stmicroelectronics S.R.L. Method of error concealment, and associated device
US11380343B2 (en) 2019-09-12 2022-07-05 Immersion Networks, Inc. Systems and methods for processing high frequency audio signal
KR102491948B1 (ko) * 2021-06-04 2023-01-27 한국 천문 연구원 단일 gnss 수신기를 이용한 고위도 전리권 플라즈마 불균질 수평방향 속력 결정 방법

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2742568A1 (fr) 1995-12-15 1997-06-20 Catherine Quinquis Procede d'analyse par prediction lineaire d'un signal audiofrequence, et procedes de codage et de decodage d'un signal audiofrequence en comportant application
WO2007093726A2 (fr) * 2006-02-14 2007-08-23 France Telecom Dispositif de ponderation perceptuelle en codage/decodage audio
US20070208557A1 (en) * 2006-03-03 2007-09-06 Microsoft Corporation Perceptual, scalable audio compression

Family Cites Families (72)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3688097A (en) * 1970-05-20 1972-08-29 Bell Telephone Labor Inc Digital attenuator for non-linear pulse code modulation signals
US4386237A (en) * 1980-12-22 1983-05-31 Intelsat NIC Processor using variable precision block quantization
DE3411962A1 (de) * 1983-03-31 1984-10-31 Sansui Electric Co., Ltd., Tokio/Tokyo Datenuebertragungseinrichtung
CA1220282A (en) * 1985-04-03 1987-04-07 Northern Telecom Limited Transmission of wideband speech signals
JP3707116B2 (ja) * 1995-10-26 2005-10-19 ソニー株式会社 音声復号化方法及び装置
TW321810B (ko) * 1995-10-26 1997-12-01 Sony Co Ltd
JP3587920B2 (ja) * 1995-12-21 2004-11-10 株式会社日立国際電気 送信方法及び受信方法
JPH10105194A (ja) * 1996-09-27 1998-04-24 Sony Corp ピッチ検出方法、音声信号符号化方法および装置
KR100335609B1 (ko) * 1997-11-20 2002-10-04 삼성전자 주식회사 비트율조절이가능한오디오부호화/복호화방법및장치
JPH11224099A (ja) * 1998-02-06 1999-08-17 Sony Corp 位相量子化装置及び方法
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
US7423983B1 (en) * 1999-09-20 2008-09-09 Broadcom Corporation Voice and data exchange over a packet based network
US6931058B1 (en) * 2000-05-19 2005-08-16 Scientific-Atlanta, Inc. Method and apparatus for the compression and/or transport and/or decompression of a digital signal
US7161931B1 (en) * 1999-09-20 2007-01-09 Broadcom Corporation Voice and data exchange over a packet based network
US6604070B1 (en) * 1999-09-22 2003-08-05 Conexant Systems, Inc. System of encoding and decoding speech signals
AU2094201A (en) * 1999-12-13 2001-06-18 Broadcom Corporation Voice gateway with downstream voice synchronization
FI115329B (fi) * 2000-05-08 2005-04-15 Nokia Corp Menetelmä ja järjestely lähdesignaalin kaistanleveyden vaihtamiseksi tietoliikenneyhteydessä, jossa on valmiudet useisiin kaistanleveyksiin
SE0001727L (sv) * 2000-05-10 2001-11-11 Global Ip Sound Ab Överföring över paketförmedlade nät
EP1339041B1 (en) * 2000-11-30 2009-07-01 Panasonic Corporation Audio decoder and audio decoding method
US6614370B2 (en) * 2001-01-26 2003-09-02 Oded Gottesman Redundant compression techniques for transmitting data over degraded communication links and/or storing data on media subject to degradation
US6650762B2 (en) * 2001-05-31 2003-11-18 Southern Methodist University Types-based, lossy data embedding
SE521600C2 (sv) * 2001-12-04 2003-11-18 Global Ip Sound Ab Lågbittaktskodek
US6829579B2 (en) * 2002-01-08 2004-12-07 Dilithium Networks, Inc. Transcoding method and system between CELP-based speech codes
US7158572B2 (en) * 2002-02-14 2007-01-02 Tellabs Operations, Inc. Audio enhancement communication techniques
WO2003071522A1 (fr) * 2002-02-20 2003-08-28 Matsushita Electric Industrial Co., Ltd. Procede de production de vecteur de source sonore fixe et table de codage de source sonore fixe
DE60214599T2 (de) * 2002-03-12 2007-09-13 Nokia Corp. Skalierbare audiokodierung
US7330812B2 (en) * 2002-10-04 2008-02-12 National Research Council Of Canada Method and apparatus for transmitting an audio stream having additional payload in a hidden sub-channel
US7408918B1 (en) * 2002-10-07 2008-08-05 Cisco Technology, Inc. Methods and apparatus for lossless compression of delay sensitive signals
US20040208169A1 (en) * 2003-04-18 2004-10-21 Reznik Yuriy A. Digital audio signal compression method and apparatus
EP1619664B1 (en) * 2003-04-30 2012-01-25 Panasonic Corporation Speech coding apparatus, speech decoding apparatus and methods thereof
DE602004004950T2 (de) * 2003-07-09 2007-10-31 Samsung Electronics Co., Ltd., Suwon Vorrichtung und Verfahren zum bitraten-skalierbaren Sprachkodieren und -dekodieren
EP1513137A1 (en) * 2003-08-22 2005-03-09 MicronasNIT LCC, Novi Sad Institute of Information Technologies Speech processing system and method with multi-pulse excitation
DE602004013031T2 (de) * 2003-10-10 2009-05-14 Agency For Science, Technology And Research Verfahren zum codieren eines digitalen signals in einen skalierbaren bitstrom, verfahren zum decodieren eines skalierbaren bitstroms
CA2457988A1 (en) * 2004-02-18 2005-08-18 Voiceage Corporation Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization
US7272567B2 (en) * 2004-03-25 2007-09-18 Zoran Fejzo Scalable lossless audio codec and authoring tool
WO2006018748A1 (en) * 2004-08-17 2006-02-23 Koninklijke Philips Electronics N.V. Scalable audio coding
US7949520B2 (en) * 2004-10-26 2011-05-24 QNX Software Sytems Co. Adaptive filter pitch extraction
US8170879B2 (en) * 2004-10-26 2012-05-01 Qnx Software Systems Limited Periodic signal enhancement system
KR100707174B1 (ko) * 2004-12-31 2007-04-13 삼성전자주식회사 광대역 음성 부호화 및 복호화 시스템에서 고대역 음성부호화 및 복호화 장치와 그 방법
CN100592389C (zh) * 2008-01-18 2010-02-24 华为技术有限公司 合成滤波器状态更新方法及装置
JP4887279B2 (ja) * 2005-02-01 2012-02-29 パナソニック株式会社 スケーラブル符号化装置およびスケーラブル符号化方法
US8102872B2 (en) * 2005-02-01 2012-01-24 Qualcomm Incorporated Method for discontinuous transmission and accurate reproduction of background noise information
US20070147518A1 (en) * 2005-02-18 2007-06-28 Bruno Bessette Methods and devices for low-frequency emphasis during audio compression based on ACELP/TCX
KR100707184B1 (ko) * 2005-03-10 2007-04-13 삼성전자주식회사 오디오 부호화 및 복호화 장치와 그 방법 및 기록 매체
FR2888699A1 (fr) * 2005-07-13 2007-01-19 France Telecom Dispositif de codage/decodage hierachique
US7991611B2 (en) * 2005-10-14 2011-08-02 Panasonic Corporation Speech encoding apparatus and speech encoding method that encode speech signals in a scalable manner, and speech decoding apparatus and speech decoding method that decode scalable encoded signals
US7490036B2 (en) * 2005-10-20 2009-02-10 Motorola, Inc. Adaptive equalizer for a coded speech signal
US7831434B2 (en) * 2006-01-20 2010-11-09 Microsoft Corporation Complex-transform channel coding with extended-band frequency coding
US8781842B2 (en) * 2006-03-07 2014-07-15 Telefonaktiebolaget Lm Ericsson (Publ) Scalable coding with non-casual predictive information in an enhancement layer
DE102006022346B4 (de) * 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
KR101542069B1 (ko) * 2006-05-25 2015-08-06 삼성전자주식회사 고정 코드북 검색 방법 및 장치와 그를 이용한 음성 신호의부호화/복호화 방법 및 장치
US7933770B2 (en) * 2006-07-14 2011-04-26 Siemens Audiologische Technik Gmbh Method and device for coding audio data based on vector quantisation
DE602007012116D1 (de) * 2006-08-15 2011-03-03 Dolby Lab Licensing Corp Arbiträre formung einer temporären rauschhüllkurve ohne nebeninformation
JP2008170488A (ja) * 2007-01-06 2008-07-24 Yamaha Corp 波形圧縮装置、波形伸長装置、プログラムおよび圧縮データの生産方法
US8199835B2 (en) * 2007-05-30 2012-06-12 International Business Machines Corporation Systems and methods for adaptive signal sampling and sample quantization for resource-constrained stream processing
JP5618826B2 (ja) * 2007-06-14 2014-11-05 ヴォイスエイジ・コーポレーション Itu.t勧告g.711と相互運用可能なpcmコーデックにおいてフレーム消失を補償する装置および方法
US8719012B2 (en) * 2007-06-15 2014-05-06 Orange Methods and apparatus for coding digital audio signals using a filtered quantizing noise
US7885819B2 (en) * 2007-06-29 2011-02-08 Microsoft Corporation Bitstream syntax for multi-process audio decoding
WO2009010674A1 (fr) * 2007-07-06 2009-01-22 France Telecom Codage hierarchique de signaux audionumeriques
KR100912826B1 (ko) * 2007-08-16 2009-08-18 한국전자통신연구원 G.711 코덱의 음질 향상을 위한 향상 계층 부호화 및복호화 장치와 그 방법
US8271273B2 (en) * 2007-10-04 2012-09-18 Huawei Technologies Co., Ltd. Adaptive approach to improve G.711 perceptual quality
US8515767B2 (en) * 2007-11-04 2013-08-20 Qualcomm Incorporated Technique for encoding/decoding of codebook indices for quantized MDCT spectrum in scalable speech and audio codecs
ATE500588T1 (de) * 2008-01-04 2011-03-15 Dolby Sweden Ab Audiokodierer und -dekodierer
EP2144230A1 (en) * 2008-07-11 2010-01-13 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Low bitrate audio encoding/decoding scheme having cascaded switches
ES2592416T3 (es) * 2008-07-17 2016-11-30 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Esquema de codificación/decodificación de audio que tiene una derivación conmutable
FR2938688A1 (fr) * 2008-11-18 2010-05-21 France Telecom Codage avec mise en forme du bruit dans un codeur hierarchique
GB2466668A (en) * 2009-01-06 2010-07-07 Skype Ltd Speech filtering
GB2465047B (en) * 2009-09-03 2010-09-22 Peter Graham Craven Prediction of signals
US8452606B2 (en) * 2009-09-29 2013-05-28 Skype Speech encoding using multiple bit rates
FR2961980A1 (fr) * 2010-06-24 2011-12-30 France Telecom Controle d'une boucle de retroaction de mise en forme de bruit dans un codeur de signal audionumerique
US9263049B2 (en) * 2010-10-25 2016-02-16 Polycom, Inc. Artifact reduction in packet loss concealment
FR2969360A1 (fr) * 2010-12-16 2012-06-22 France Telecom Codage perfectionne d'un etage d'amelioration dans un codeur hierarchique

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2742568A1 (fr) 1995-12-15 1997-06-20 Catherine Quinquis Procede d'analyse par prediction lineaire d'un signal audiofrequence, et procedes de codage et de decodage d'un signal audiofrequence en comportant application
WO2007093726A2 (fr) * 2006-02-14 2007-08-23 France Telecom Dispositif de ponderation perceptuelle en codage/decodage audio
US20070208557A1 (en) * 2006-03-03 2007-09-06 Microsoft Corporation Perceptual, scalable audio compression

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
HIWASAKI Y ET AL: "A G.711 EMBEDDED WIDEBAND SPEECH CODING FOR VOIP CONFERENCES", IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, INFORMATION & SYSTEMS SOCIETY, TOKYO, JP, vol. E89D, no. 9, 1 September 2006 (2006-09-01), pages 2542 - 2552, XP001542710, ISSN: 0916-8532 *
LAPIERRE J, LEFEBVRE R, BESSETE R, MALENOVSKY V,SALAMI R: "Noise shaping in an ITU-T G.711-interoperable embedded codec", EUSIPCO, 16TH EUROPEAN SIGNAL PROCESSING CONFERENCE, 25 August 2008 (2008-08-25) - 29 September 2008 (2008-09-29), pages 1 - 5, XP002530134, Retrieved from the Internet <URL:http://www.eurasip.org/Proceedings/Eusipco/Eusipco2008/papers/1569105406.pdf> [retrieved on 20090529] *
X. MAITRE: "7 kHz audio coding within 64 kbit/s", IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATION, vol. 6-2, February 1988 (1988-02-01)
Y. HIWASAKI; S. SASAKI; H. OHMURO; T. MORI; J. SEONG; M. S. LEE; B. KÔVESI; S. RAGOT; J.-L. GARCIA; C. MARRO: "G.711.1: A wideband extension to ITU-T G.711", EUSIPCO, 2008
Y.HIWASAKI ET AL.: "G.711.1: A wideband extension to ITU-T G.711", 16TH EUROPEAN SIGNAL PROCESSING CONFERENCE EUSIPCO 2008, 25 August 2008 (2008-08-25), Lausanne, XP002562145, Retrieved from the Internet <URL:http://www.eurasip.org/Proceedings/Eusipco/Eusipco2008/papers/1569104469.pdf> [retrieved on 20100108] *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012080649A1 (fr) 2010-12-16 2012-06-21 France Telecom Codage perfectionne d'un etage d'amelioration dans un codeur hierarchique
JP2014501395A (ja) * 2010-12-16 2014-01-20 オランジュ 階層型符号器における改良ステージの改良符号化

Also Published As

Publication number Publication date
EP2366177A1 (fr) 2011-09-21
US20110224995A1 (en) 2011-09-15
KR101339857B1 (ko) 2013-12-10
KR20110095387A (ko) 2011-08-24
EP2366177B1 (fr) 2015-10-21
JP2012509515A (ja) 2012-04-19
FR2938688A1 (fr) 2010-05-21
CN102282611B (zh) 2013-05-08
JP5474088B2 (ja) 2014-04-16
US8965773B2 (en) 2015-02-24
CN102282611A (zh) 2011-12-14

Similar Documents

Publication Publication Date Title
EP2366177B1 (fr) Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique
EP1907812B1 (fr) Procede de commutation de debit en decodage audio scalable en debit et largeur de bande
EP2586133B1 (fr) Contrôle d&#39;une boucle de rétroaction de mise en forme de bruit dans un codeur de signal audionumérique
EP2452337B1 (fr) Allocation de bits dans un codage/décodage d&#39;amélioration d&#39;un codage/décodage hiérarchique de signaux audionumériques
FR2761801A1 (fr) Methode de codage audio et dispositif
EP2452336B1 (fr) Codage/décodage perfectionne de signaux audionumériques
EP1989706A2 (fr) Dispositif de ponderation perceptuelle en codage/decodage audio
EP1692689A1 (fr) Procede de codage multiple optimise
EP2652735B1 (fr) Codage perfectionne d&#39;un etage d&#39;amelioration dans un codeur hierarchique
WO2007107670A2 (fr) Procede de post-traitement d&#39;un signal dans un decodeur audio
WO2009004227A1 (fr) Codage de signaux audionumériques
WO2011144863A1 (fr) Codage avec mise en forme du bruit dans un codeur hierarchique
FR2987931A1 (fr) Modification des caracteristiques spectrales d&#39;un filtre de prediction lineaire d&#39;un signal audionumerique represente par ses coefficients lsf ou isf.
FR2990552A1 (fr) Traitement d&#39;amelioration de la qualite des signaux audiofrequences
FR2737360A1 (fr) Procedes de codage et de decodage de signaux audiofrequence, codeur et decodeur pour la mise en oeuvre de tels procedes
WO2015145050A1 (fr) Estimation d&#39;un bruit de codage introduit par un codage en compression de type micda
FR3007184A1 (fr) Controle du traitement d&#39;attenuation d&#39;un bruit de quantification introduit par un codage en compresssion

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200980154687.1

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09768200

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 13129483

Country of ref document: US

ENP Entry into the national phase

Ref document number: 2011543801

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2009768200

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 20117014240

Country of ref document: KR

Kind code of ref document: A