EP2005424A2 - Method for post-processing a signal in an audio decoder - Google Patents
Method for post-processing a signal in an audio decoderInfo
- Publication number
- EP2005424A2 EP2005424A2 EP07731774A EP07731774A EP2005424A2 EP 2005424 A2 EP2005424 A2 EP 2005424A2 EP 07731774 A EP07731774 A EP 07731774A EP 07731774 A EP07731774 A EP 07731774A EP 2005424 A2 EP2005424 A2 EP 2005424A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- frequency
- signal
- envelope
- module
- post
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
Definitions
- the present invention relates to a method of post-processing a signal in an audio decoder.
- the invention finds a particularly advantageous application in the field of transmission and storage of digital signals such as audio-frequency signals: speech, music, etc.
- the encoder In conventional speech coding, the encoder generates a fixed rate bit stream. This fixed rate constraint simplifies the implementation and use of the encoder and decoder (called “coded" set). Examples of such systems are: ITU-T G.711 coding at 64 kbit / s, ITU-T G.729 coding at 8 kbit / s or GSM-EFR at 12.2 kbit / s.
- variable rate bit stream In some applications, such as mobile telephony or voice over IP, it is preferable to generate a variable rate bit stream, the bit rate values being taken in a predefined set.
- multi-rate coding techniques can be distinguished that are more flexible than fixed rate coding:
- the multi-mode coding controlled by the source and / or the channel as implemented in the AMR-NB, AMR-WB, SMV 1 or VMR-WB systems, hierarchical coding, or "scalable" coding, which generates a so-called hierarchical bitstream because it comprises a core rate and one or more improvement layer (s).
- the 48, 56 and 64 kbit / s G.722 system is a simple example of scalable rate scaling.
- the MPEG-4 CELP codec is scalable in terms of bit rate and bandwidth.
- Other examples of such encoders are found in the articles by B. Kovesi, D. Massaloux, A. Sollaud, A Scalable Speech and Audio Coding Scheme with Continuous Bitrate Flexibility, ICASSP 2004, and H. Taddéi et al, A Scalable. Three Bitrate (8, 14.2 and 24 kbit / s) Audio Coder; 107th AES Convention, 1999. - Multiple description coding.
- the invention is more particularly concerned with hierarchical coding.
- the basic concept of hierarchical audio coding for example, is illustrated in the article by Y. Hiwasaki, T. Mori, H. Ohmuro, J. Ikedo, D. Tokumoto, and A. Kataoka, Scalable Speech Coding Technology for High-Quality. Ubiquitous Communications, NTT Technical Review, March 2004.
- the bitstream includes a base layer and one or more enhancement layers.
- the base layer is generated by a fixed low rate codec, known as a "core coded", guaranteeing the minimum quality of the coding; this layer must be received by the decoder to maintain an acceptable level of quality. Improvement layers are used to improve the quality; it may happen that they are not all received by the decoder.
- the main advantage of hierarchical coding is that it allows an adaptation of the bit rate by simple truncation of the bit stream.
- the number of layers that is to say the number of possible truncations of the bit stream, defines the granularity of the coding: we speak of coding with "high granularity” if the bit stream comprises few layers, of the order of 2 to 4 with steps in the range of 4 to 8 kbit / s; a "fine granularity" coding allows a large number of layers with a step of the order of 1 kbit / s.
- the invention relates to scalable rate and bandwidth encoding techniques with a CELP heart-type coder in a telephone band and one or more broadband enhancement layer (s).
- a CELP heart-type coder in a telephone band and one or more broadband enhancement layer (s).
- broadband enhancement layer s
- Examples of such systems are given in the aforementioned article H. Taddéi et al with a high granularity of 8, 14.2, 24 kbit / s, and in the aforementioned article by B. Kovesi with fine granularity of 6.4 to 32 kbit / s.
- G.729EV EV for Embedded Variable Bitrate
- the objective of the G.729EV standardization is to obtain a G.729 core hierarchical encoder, producing a signal whose band extends from the narrow band (300-3400 Hz) to the broadband (50-7000 Hz). ) at a rate of 8 to 32 kbit / s for conversational services.
- This encoder is inherently interoperable with Recommendation G.729, which ensures compatibility with existing VoIP devices.
- the input audio signals are sampled at 16 kHz over a useful band of 50 to 7000 Hz.
- the high band typically corresponds to frequencies between 3400 Hz. and 7000 Hz.
- This band is coded according to a band extension technique based on the time and frequency envelope encoder extraction, these envelopes being then applied to the decoder to a reconstructed synthetic excitation signal in the high band. from the parameters estimated in the low band (between 50 and 3400 Hz) sampled at 8 kHz.
- the low band will be designated in the sequence "first frequency band"; the high band is then called "second frequency band".
- This band extension technique is shown schematically in FIG.
- the high frequency components of the original signal are isolated by a bandpass filter (100) between 3400 and 7000 Hz.
- the temporal and frequency envelopes of the signal are calculated respectively by the modules (101) and (102). These envelopes are quantized together with 2 kbit / s at the block (103).
- the synthetic excitation from the reconstruction module (104) is then shaped by a scaling module (106) from the time envelope and by a filtering module (107) from the frequency envelope.
- the band extension mechanism that has just been described with reference to the ITU-T SG16 / WP3 D214 codec is therefore based on the shaping of a synthetic excitation by temporal and frequency envelopes.
- the application of such a model is delicate and causes the appearance of artifacts in the form of very audible one-time "clicks" due to strong amplitude overruns.
- the technical problem to be solved by the object of the present invention is to propose a method of post-processing, in an audio decoder, a signal reconstructed by temporal and frequency formatting of an excitation signal obtained.
- temporal and frequency formatting being made from a temporal envelope and a frequency envelope received and decoded in a second frequency band.
- said method comprises the steps consisting in comparing the amplitude of said reconstructed signal with said received and decoded time envelope, and, in case of exceeding at least one threshold function of said temporal envelope, to apply to said reconstructed signal an amplitude compression.
- the method according to the invention compensates for the lack of adequate coupling between the excitation and the shaping functions by means of a post-processing by amplitude compression of the audio signal supplied by the decoder in the second frequency band, or high band.
- said amplitude compression consists in applying to the amplitude of said signal at least one linear attenuation if said amplitude is greater than at least one trigger threshold according to said received and decoded time envelope.
- the method of the invention has the advantage of being adaptive in the sense that the triggering threshold is variable since it follows the value of the time envelope received and decoded.
- the invention also relates to a computer program comprising program code instructions for implementing the post-processing method according to the invention when said program is executed on a computer.
- the invention further relates to a post-processing module, in an audio decoder, of a signal reconstructed by shaping an excitation signal obtained from at least one estimated parameter in a first frequency band. , said temporal and frequency formatting being made from a time envelope and a frequency envelope received and decoded in a second frequency band, the module being remarkable in that it comprises a comparator of the amplitude said reconstructed signal to said received and decoded time envelope and amplitude compression means adapted, in case of a positive comparison, to apply to said reconstructed signal an amplitude compression.
- an audio decoder comprising a module for estimating at least one parameter of an excitation signal in a first frequency band, a module for reconstructing a signal of excitation from said parameter, a decoding module of a temporal envelope in a second frequency band, a module (802) for decoding a frequency envelope in a second frequency band, a module (805) for setting in temporal form of said excitation signal, by means, at least, of said decoded time envelope ( ⁇ ) and a frequency forming module (807) of said excitation signal, by means of, at least, said frequency envelope decoded, remarkable in that said decoder comprises a post-processing module according to the invention.
- FIG. 1 is a diagram of a high-band coding / decoding stage in accordance with the prior art.
- FIG. 2 is a high level diagram of a hierarchical audio coder to
- FIG. 3 is a diagram of the high band encoder for the 13.65 kbit / s mode of the coder of FIG. 2.
- FIG. 4 is a diagram showing the frame division performed by the high band encoder of FIG.
- FIG. 5 is a high-level diagram of an 8, 12, 13.65 kbit / s hierarchical audio decoder associated with the coder of FIG. 2.
- Fig. 6 is a diagram of the high band decoder for the 13.65 kbit / s mode of the decoder of Fig. 5.
- Fig. 7 is a flowchart of a first embodiment of an amplitude compression function.
- FIG. 8 is a graph of the amplitude compression function of FIG. 7.
- Fig. 9 is a flowchart of a second embodiment of an amplitude compression function.
- Figure 10 is a graph of the amplitude compression function of Figure 9.
- Fig. 11 is a flowchart of a third embodiment of an amplitude compression function.
- FIG. 12 is a graph of the amplitude compression function of FIG. 11. It will be recalled that the present invention is more particularly part of an overall hierarchical audio coding and decoding scheme in subbands operating at three possible rates: 8, 12 or 13.65 kbit / s. In practice, the encoder always operates at the maximum rate of 13.65 kbit / s, while the decoder can receive the heart at 8 kbit / s and one or two enhancement layers at 12 or 13.65 kbit / s.
- the hierarchical audio coder is shown schematically in FIG.
- the broadband input signal sampled at 16 kHz is first decomposed into two subbands by QMF ("Quadrature Mirror”) filtering.
- QMF Quadrature Mirror
- the first frequency band, or low band, between 0 and 4000 Hz is obtained by low-pass filtering L and decimation 401, and the second frequency band, or high band, between 4000 and 8000 Hz by filtering 402 passes. H and decimation 403.
- the filters L and H are of length 64 and conform to those described in the J. Johnston article, ICASSP, flight. 5, pp. 291-294, 1980.
- the low band is pre-processed by a high pass filter 404 eliminating components below 50 Hz before CELP 405 coding in 8 and 12 kbit / s narrowband.
- This high-pass filtering takes account of the fact that the wide band is defined as covering the interval 50-7000 Hz.
- the narrow-band CELP coding corresponds to that of the ITU-T SG16 / WP3 D135 coder ( ITU-T, COM 16, D135 (WP 3/16), "France Telecom G729EV Candidate: High level description and complexity evaluation," Q.10 / 16, Study Period 2005-2008, Geneva, 26 July - 5 August 2005) ; it is a cascaded CELP encoding comprising as a first 8 kbit / s stage a modified G.729 coding (ITU-T G729 Recommendation, Coding of Speech at 8 kbps using Conjugate Structure Algebraic Code Excited Linear Prediction ( CS-ACELP), March 1996) without a pre-processing filter and as a second stage at 12 kbit / s an additional fixed CELP dictionary.
- CELP coding allows to determine the parameters of the excitation signal in the low band.
- the high band is first folded spectrally 406 to compensate for the folding due to the high pass filter 402 combined with the decimation 403.
- the high band is then pretreated by a low pass filter 407 eliminating the components between 3000 and 4000 Hz. of the high band, that is to say the components between 7000 and 8000 Hz of the original signal.
- a band extension 408, or high band coding, at 13.65 kbit / s is realized.
- the different bit streams generated by the coding modules 405 and 408 are multiplexed and structured into a hierarchical bit stream in the multiplexer 409.
- the coding is done in blocks of samples, or frames, of 20 ms, ie 320 samples.
- the hierarchical coding rate is 8, 12 and 13.65 kbit / s.
- the high band encoder 408 is detailed in FIG. 3. Its principle is similar to the parametric band extension of the ITU-T SG16 / WP3 D214 encoder.
- the high band signal x h i is coded in frames of N / 2 samples, where N is the number of samples of the original broadband frame and the division by 2 is due to the decimation by 2 of the high band.
- N / 2 160 samples, or 20 ms at 8 kHz sampling.
- time and frequency envelopes are extracted by the modules 600 and 601 as in the ITU-T SG16 / WP3 D214 encoder. These envelopes are then jointly quantized in block 602.
- This operation requires future samples, commonly called “lookahead” because the spectral analysis uses a temporal window centered on the current frame that overflows on the future frame.
- the frequent envelope extraction can be carried out for example as follows: calculation of the short-term spectrum with windowing of the current frame and lookahead, and discrete Fourier transform,
- the frequency envelope is thus defined as the rms value of each of the sub-bands of the signal Xh ,.
- Each frame of 20 ms consists of 160 samples:
- the time envelope of the current frame is calculated as follows:
- the time envelope is thus defined as the rms value of each of the 16 subframes of the signal X h ,.
- FIG. 5 represents a hierarchical audio decoder associated with the encoder which has just been described with reference to FIGS. 2 and 3.
- the bits describing each frame of 20 ms are demultiplexed by the demultiplexer 500.
- the bitstream of the 8 and 12 kbit / s layers is used by the decoding module 501 CELP to generate the parameters of synthesis of the excitation signal in the band.
- the low band synthetic speech signal is then postfiltered by block 502.
- the portion of the bit stream associated with the 13.65 kbit / s layer is decoded by the band extension module 503.
- the expanded band output signal, sampled at 16 kHz, is obtained through the synthesis QMF filter bank 504, 505, 507, 508 and 509, incorporating the reverse folding 506.
- the high band decoder 503 of FIG. 5 is described in detail in FIG.
- This decoder repeats the principle of synthesis of the high band described for the coder of FIG. 1, with however two modifications: a frequency envelope interpolation module 806 and a post-processing module 808. These two frequency envelope interpolation and post-processing modules are intended to improve the quality of coding in the high band.
- the module 806 interpolates between the frequency envelope of the preceding frame and the frequency envelope of the current frame so that this envelope evolves every 10 ms, instead of 20 ms.
- the high band decoder of FIG. 6 demultiplexes in the demultiplexer 800 the parameters received in the bitstream and decodes the time and frequency envelope information in the modules 801 and
- a synthetic excitation signal is generated in a reconstruction module 803 from the CELP excitation parameters received by the 8 and 12 kbit / s layers. This excitation is filtered in the 804 low-pass filter to keep only the frequencies between 0 and 3000 Hz which correspond to the 4000 to 7000 Hz band of the original signal. As in the encoder of FIG. 1, the synthetic excitation signal is shaped by the modules 805 and 807:
- the output of the temporal shaping module 805 ideally has an effective value (r.m.s.) per subframes which corresponds to the decoded time envelope; the module 805 therefore corresponds to the application of an adaptive gain in time,
- the output of the frequency shaping module 807 ideally has an effective value (rms) per sub-band which corresponds to the decoded frequency envelope; the module 807 can be realized by means of a filterbank or a transform with overlap.
- the signal resulting from the shaping of the excitation is finally processed by the post-processing module 808 to obtain the reconstructed high band y.
- the post-processing module 808 will now be described in detail.
- the post-processing performed by the module 808 consists in applying to the signal x coming from the frequency shaping module 807 an amplitude compression so as to limit the amplitude of the signal and thus avoid the artifacts that could occur as a result of lack of coupling between excitation and shaping.
- this post-treatment acts instantaneously, that is to say sample per sample without causing a delay in treatment
- the trigger threshold for the amplitude compression is provided by the time envelope as decoded by the time envelope decoding module 801.
- the post-processing is of the adaptive type because the value of ⁇ changes at each subframe of 10 samples, namely every 1.25 ms,
- the decoded time envelope for the current frame corresponds to a temporal support offset by 2 ms, ie 16 samples, as illustrated in FIG. 4.
- the adaptive post-processing keeps in memory the effective value (rms) of the two sub-bits. -sames associated with the "lookahead": these two subframes correspond to the two subframes of the beginning of the current frame.
- the flowchart of FIG. 7 details a first compression function, denoted C 1 (X), of post-processing.
- C 1 (X) a first compression function
- the beginning and end of the calculation are identified by blocks 1000 and 1006.
- the value of the output is first initialized at x (block 1001). Then two tests are done (blocks 1002 and
- FIG. 8 clearly shows that the function Ci (x) performs symmetrical amplitude compression with a "trigger threshold" set at +/- ⁇ . More precisely, the slope of Fi (x / ⁇ ) is 1 between [-1. + 1] and 1/16 elsewhere. Equivalently, the slope of Ci (x) is 1 between [- ⁇ , + ⁇ ] and 1/16 elsewhere.
- FIGS. 9 to 12 Two variants of the post-processing are described in FIGS. 9 to 12.
- the corresponding functions are denoted respectively C 2 (X) and C 3 (X).
- the post-processing C 2 (x) shown in FIGS. 9 and 10 is identical to C- ⁇ (x) but with a value of the "trigger threshold" which goes from +/- ⁇ to +/- 2 ⁇ .
- the slope of C 2 (x) is 1 between [-2 ⁇ , + 2 ⁇ ] and 1/16 elsewhere.
- the post-processing C 3 (x) is a more evolved variant of Ci (x), in which the amplitude compression is performed in two successive steps.
- the trip interval is always set to [- ⁇ , + ⁇ ] (blocks 1402 and 1406), whereas the value of y is only attenuated by a factor Vi, unless the value of y modified by blocks 1403 and 1407 is outside the range [-2.5 ⁇ , + 2.5 ⁇ ] in which case the value of y is further modified by blocks 1405 and 1409.
- C 3 ( x) The operation of C 3 ( x) is illustrated in Figure 12 where we can see that the slope of C 3 (x) is: - 1/16 on [- ⁇ , -4 ⁇ ] and [4 ⁇ , + ⁇ ], - 1/2 on [-Aa, - ⁇ ] and [ ⁇ , 4 ⁇ ] and - 1 on [- ⁇ , + ⁇ ].
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0650954 | 2006-03-20 | ||
PCT/FR2007/050959 WO2007107670A2 (en) | 2006-03-20 | 2007-03-20 | Method for post-processing a signal in an audio decoder |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2005424A2 true EP2005424A2 (en) | 2008-12-24 |
Family
ID=37500047
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP07731774A Withdrawn EP2005424A2 (en) | 2006-03-20 | 2007-03-20 | Method for post-processing a signal in an audio decoder |
Country Status (6)
Country | Link |
---|---|
US (1) | US20090299755A1 (en) |
EP (1) | EP2005424A2 (en) |
JP (1) | JP5457171B2 (en) |
KR (1) | KR101373207B1 (en) |
CN (1) | CN101405792B (en) |
WO (1) | WO2007107670A2 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101008508B1 (en) * | 2006-08-15 | 2011-01-17 | 브로드콤 코포레이션 | Re-phasing of decoder states after packet loss |
JP4932917B2 (en) | 2009-04-03 | 2012-05-16 | 株式会社エヌ・ティ・ティ・ドコモ | Speech decoding apparatus, speech decoding method, and speech decoding program |
EP2362376A3 (en) | 2010-02-26 | 2011-11-02 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Apparatus and method for modifying an audio signal using envelope shaping |
CN103069484B (en) * | 2010-04-14 | 2014-10-08 | 华为技术有限公司 | Time/frequency two dimension post-processing |
JP5997592B2 (en) | 2012-04-27 | 2016-09-28 | 株式会社Nttドコモ | Speech decoder |
CN110890101B (en) * | 2013-08-28 | 2024-01-12 | 杜比实验室特许公司 | Method and apparatus for decoding based on speech enhancement metadata |
JP6035270B2 (en) * | 2014-03-24 | 2016-11-30 | 株式会社Nttドコモ | Speech decoding apparatus, speech encoding apparatus, speech decoding method, speech encoding method, speech decoding program, and speech encoding program |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07193548A (en) * | 1993-12-25 | 1995-07-28 | Sony Corp | Noise reduction processing method |
US5945932A (en) * | 1997-10-30 | 1999-08-31 | Audiotrack Corporation | Technique for embedding a code in an audio signal and for detecting the embedded code |
GB2351889B (en) * | 1999-07-06 | 2003-12-17 | Ericsson Telefon Ab L M | Speech band expansion |
WO2001022401A1 (en) * | 1999-09-20 | 2001-03-29 | Koninklijke Philips Electronics N.V. | Processing circuit for correcting audio signals, receiver, communication system, mobile apparatus and related method |
JP3810257B2 (en) * | 2000-06-30 | 2006-08-16 | 松下電器産業株式会社 | Voice band extending apparatus and voice band extending method |
SE0004818D0 (en) * | 2000-12-22 | 2000-12-22 | Coding Technologies Sweden Ab | Enhancing source coding systems by adaptive transposition |
US7590525B2 (en) * | 2001-08-17 | 2009-09-15 | Broadcom Corporation | Frame erasure concealment for predictive speech coding based on extrapolation of speech waveform |
US7173966B2 (en) * | 2001-08-31 | 2007-02-06 | Broadband Physics, Inc. | Compensation for non-linear distortion in a modem receiver |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
US6988066B2 (en) * | 2001-10-04 | 2006-01-17 | At&T Corp. | Method of bandwidth extension for narrow-band speech |
US20030187663A1 (en) * | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
CA2457988A1 (en) * | 2004-02-18 | 2005-08-18 | Voiceage Corporation | Methods and devices for audio compression based on acelp/tcx coding and multi-rate lattice vector quantization |
US7720230B2 (en) * | 2004-10-20 | 2010-05-18 | Agere Systems, Inc. | Individual channel shaping for BCC schemes and the like |
US8204261B2 (en) * | 2004-10-20 | 2012-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Diffuse sound shaping for BCC schemes and the like |
CN1937496A (en) | 2005-09-21 | 2007-03-28 | 日电(中国)有限公司 | Extensible false name certificate system and method |
-
2007
- 2007-03-20 US US12/225,462 patent/US20090299755A1/en not_active Abandoned
- 2007-03-20 JP JP2009500896A patent/JP5457171B2/en not_active Expired - Fee Related
- 2007-03-20 WO PCT/FR2007/050959 patent/WO2007107670A2/en active Application Filing
- 2007-03-20 CN CN200780010053XA patent/CN101405792B/en not_active Expired - Fee Related
- 2007-03-20 KR KR1020087025600A patent/KR101373207B1/en not_active IP Right Cessation
- 2007-03-20 EP EP07731774A patent/EP2005424A2/en not_active Withdrawn
Non-Patent Citations (1)
Title |
---|
None * |
Also Published As
Publication number | Publication date |
---|---|
KR20080109038A (en) | 2008-12-16 |
US20090299755A1 (en) | 2009-12-03 |
WO2007107670A3 (en) | 2007-11-08 |
CN101405792B (en) | 2012-09-05 |
KR101373207B1 (en) | 2014-03-12 |
WO2007107670A2 (en) | 2007-09-27 |
CN101405792A (en) | 2009-04-08 |
JP5457171B2 (en) | 2014-04-02 |
JP2009530679A (en) | 2009-08-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1989706B1 (en) | Device for perceptual weighting in audio encoding/decoding | |
EP1907812B1 (en) | Method for switching rate- and bandwidth-scalable audio decoding rate | |
EP1905010B1 (en) | Hierarchical audio encoding/decoding | |
EP2115741B1 (en) | Advanced encoding / decoding of audio digital signals | |
EP2366177B1 (en) | Encoding of an audio-digital signal with noise transformation in a scalable encoder | |
EP2002428B1 (en) | Method for trained discrimination and attenuation of echoes of a digital signal in a decoder and corresponding device | |
EP2452337B1 (en) | Allocation of bits in an enhancement coding/decoding for improving a hierarchical coding/decoding of digital audio signals | |
EP2452336B1 (en) | Improved coding /decoding of digital audio signals | |
WO2007096551A2 (en) | Method for binary coding of quantization indices of a signal envelope, method for decoding a signal envelope and corresponding coding and decoding modules | |
EP2277172A1 (en) | Concealment of transmission error in a digital signal in a hierarchical decoding structure | |
EP2080195A1 (en) | Synthesis of lost blocks of a digital audio signal, with pitch period correction | |
EP2586133A1 (en) | Controlling a noise-shaping feedback loop in a digital audio signal encoder | |
WO2007107670A2 (en) | Method for post-processing a signal in an audio decoder | |
EP2936488B1 (en) | Effective attenuation of pre-echos in a digital audio signal | |
EP2347411B1 (en) | Pre-echo attenuation in a digital audio signal | |
EP2652735B1 (en) | Improved encoding of an improvement stage in a hierarchical encoder | |
FR2737360A1 (en) | Audio digital signal coding method of successive sample blocks - using spectral analysis to select vector dictionary for each sample block and allocating vector and scalar quantisation bits |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20081010 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC MT NL PL PT RO SE SI SK TR |
|
17Q | First examination report despatched |
Effective date: 20090130 |
|
RIN1 | Information on inventor provided before grant (corrected) |
Inventor name: RAGOT, STEPHANE Inventor name: GUILLAUME, CYRIL |
|
DAX | Request for extension of the european patent (deleted) | ||
RAP1 | Party data changed (applicant data changed or rights of an application transferred) |
Owner name: ORANGE |
|
GRAP | Despatch of communication of intention to grant a patent |
Free format text: ORIGINAL CODE: EPIDOSNIGR1 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/04 20130101AFI20170223BHEP Ipc: G10L 21/02 20130101ALI20170223BHEP Ipc: G10L 19/24 20130101ALI20170223BHEP |
|
INTG | Intention to grant announced |
Effective date: 20170310 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
18D | Application deemed to be withdrawn |
Effective date: 20170721 |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 21/02 20130101ALI20170223BHEP Ipc: G10L 19/24 20130101ALI20170223BHEP Ipc: G10L 19/04 20130101AFI20170223BHEP |
|
RIC1 | Information provided on ipc code assigned before grant |
Ipc: G10L 19/04 20130101AFI20170223BHEP Ipc: G10L 19/24 20130101ALI20170223BHEP Ipc: G10L 21/02 20130101ALI20170223BHEP |