WO2007107670A2 - Procede de post-traitement d'un signal dans un decodeur audio - Google Patents
Procede de post-traitement d'un signal dans un decodeur audio Download PDFInfo
- Publication number
- WO2007107670A2 WO2007107670A2 PCT/FR2007/050959 FR2007050959W WO2007107670A2 WO 2007107670 A2 WO2007107670 A2 WO 2007107670A2 FR 2007050959 W FR2007050959 W FR 2007050959W WO 2007107670 A2 WO2007107670 A2 WO 2007107670A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- frequency
- signal
- envelope
- module
- post
- Prior art date
Links
- 238000012805 post-processing Methods 0.000 title claims abstract description 27
- 238000000034 method Methods 0.000 title claims abstract description 26
- 230000005284 excitation Effects 0.000 claims abstract description 28
- 230000002123 temporal effect Effects 0.000 claims abstract description 24
- 230000006835 compression Effects 0.000 claims abstract description 23
- 238000007906 compression Methods 0.000 claims abstract description 23
- 238000007493 shaping process Methods 0.000 claims abstract description 17
- 238000004590 computer program Methods 0.000 claims description 2
- 230000001960 triggered effect Effects 0.000 claims 1
- 230000006870 function Effects 0.000 description 12
- 238000010586 diagram Methods 0.000 description 6
- 230000015572 biosynthetic process Effects 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 238000003786 synthesis reaction Methods 0.000 description 5
- 230000003044 adaptive effect Effects 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 230000002238 attenuated effect Effects 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000008901 benefit Effects 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000006872 improvement Effects 0.000 description 2
- 238000013139 quantization Methods 0.000 description 2
- 230000005236 sound signal Effects 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000012360 testing method Methods 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000012797 qualification Methods 0.000 description 1
- 230000004044 response Effects 0.000 description 1
- 230000002441 reversible effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000010183 spectrum analysis Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
Definitions
- the present invention relates to a method of post-processing a signal in an audio decoder.
- the invention finds a particularly advantageous application in the field of transmission and storage of digital signals such as audio-frequency signals: speech, music, etc.
- the encoder In conventional speech coding, the encoder generates a fixed rate bit stream. This fixed rate constraint simplifies the implementation and use of the encoder and decoder (called “coded" set). Examples of such systems are: ITU-T G.711 coding at 64 kbit / s, ITU-T G.729 coding at 8 kbit / s or GSM-EFR at 12.2 kbit / s.
- variable rate bit stream In some applications, such as mobile telephony or voice over IP, it is preferable to generate a variable rate bit stream, the bit rate values being taken in a predefined set.
- multi-rate coding techniques can be distinguished that are more flexible than fixed rate coding:
- the multi-mode coding controlled by the source and / or the channel as implemented in the AMR-NB, AMR-WB, SMV 1 or VMR-WB systems, hierarchical coding, or "scalable" coding, which generates a so-called hierarchical bitstream because it comprises a core rate and one or more improvement layer (s).
- the 48, 56 and 64 kbit / s G.722 system is a simple example of scalable rate scaling.
- the MPEG-4 CELP codec is scalable in terms of bit rate and bandwidth.
- Other examples of such encoders are found in the articles by B. Kovesi, D. Massaloux, A. Sollaud, A Scalable Speech and Audio Coding Scheme with Continuous Bitrate Flexibility, ICASSP 2004, and H. Taddéi et al, A Scalable. Three Bitrate (8, 14.2 and 24 kbit / s) Audio Coder; 107th AES Convention, 1999. - Multiple description coding.
- the invention is more particularly concerned with hierarchical coding.
- the basic concept of hierarchical audio coding for example, is illustrated in the article by Y. Hiwasaki, T. Mori, H. Ohmuro, J. Ikedo, D. Tokumoto, and A. Kataoka, Scalable Speech Coding Technology for High-Quality. Ubiquitous Communications, NTT Technical Review, March 2004.
- the bitstream includes a base layer and one or more enhancement layers.
- the base layer is generated by a fixed low rate codec, known as a "core coded", guaranteeing the minimum quality of the coding; this layer must be received by the decoder to maintain an acceptable level of quality. Improvement layers are used to improve the quality; it may happen that they are not all received by the decoder.
- the main advantage of hierarchical coding is that it allows an adaptation of the bit rate by simple truncation of the bit stream.
- the number of layers that is to say the number of possible truncations of the bit stream, defines the granularity of the coding: we speak of coding with "high granularity” if the bit stream comprises few layers, of the order of 2 to 4 with steps in the range of 4 to 8 kbit / s; a "fine granularity" coding allows a large number of layers with a step of the order of 1 kbit / s.
- the invention relates to scalable rate and bandwidth encoding techniques with a CELP heart-type coder in a telephone band and one or more broadband enhancement layer (s).
- a CELP heart-type coder in a telephone band and one or more broadband enhancement layer (s).
- broadband enhancement layer s
- Examples of such systems are given in the aforementioned article H. Taddéi et al with a high granularity of 8, 14.2, 24 kbit / s, and in the aforementioned article by B. Kovesi with fine granularity of 6.4 to 32 kbit / s.
- G.729EV EV for Embedded Variable Bitrate
- the objective of the G.729EV standardization is to obtain a G.729 core hierarchical encoder, producing a signal whose band extends from the narrow band (300-3400 Hz) to the broadband (50-7000 Hz). ) at a rate of 8 to 32 kbit / s for conversational services.
- This encoder is inherently interoperable with Recommendation G.729, which ensures compatibility with existing VoIP devices.
- the input audio signals are sampled at 16 kHz over a useful band of 50 to 7000 Hz.
- the high band typically corresponds to frequencies between 3400 Hz. and 7000 Hz.
- This band is coded according to a band extension technique based on the time and frequency envelope encoder extraction, these envelopes being then applied to the decoder to a reconstructed synthetic excitation signal in the high band. from the parameters estimated in the low band (between 50 and 3400 Hz) sampled at 8 kHz.
- the low band will be designated in the sequence "first frequency band"; the high band is then called "second frequency band".
- This band extension technique is shown schematically in FIG.
- the high frequency components of the original signal are isolated by a bandpass filter (100) between 3400 and 7000 Hz.
- the temporal and frequency envelopes of the signal are calculated respectively by the modules (101) and (102). These envelopes are quantized together with 2 kbit / s at the block (103).
- the synthetic excitation from the reconstruction module (104) is then shaped by a scaling module (106) from the time envelope and by a filtering module (107) from the frequency envelope.
- the band extension mechanism that has just been described with reference to the ITU-T SG16 / WP3 D214 codec is therefore based on the shaping of a synthetic excitation by temporal and frequency envelopes.
- the application of such a model is delicate and causes the appearance of artifacts in the form of very audible one-time "clicks" due to strong amplitude overruns.
- the technical problem to be solved by the object of the present invention is to propose a method of post-processing, in an audio decoder, a signal reconstructed by temporal and frequency formatting of an excitation signal obtained.
- temporal and frequency formatting being made from a temporal envelope and a frequency envelope received and decoded in a second frequency band.
- said method comprises the steps consisting in comparing the amplitude of said reconstructed signal with said received and decoded time envelope, and, in case of exceeding at least one threshold function of said temporal envelope, to apply to said reconstructed signal an amplitude compression.
- the method according to the invention compensates for the lack of adequate coupling between the excitation and the shaping functions by means of a post-processing by amplitude compression of the audio signal supplied by the decoder in the second frequency band, or high band.
- said amplitude compression consists in applying to the amplitude of said signal at least one linear attenuation if said amplitude is greater than at least one trigger threshold according to said received and decoded time envelope.
- the method of the invention has the advantage of being adaptive in the sense that the triggering threshold is variable since it follows the value of the time envelope received and decoded.
- the invention also relates to a computer program comprising program code instructions for implementing the post-processing method according to the invention when said program is executed on a computer.
- the invention further relates to a post-processing module, in an audio decoder, of a signal reconstructed by shaping an excitation signal obtained from at least one estimated parameter in a first frequency band. , said temporal and frequency formatting being made from a time envelope and a frequency envelope received and decoded in a second frequency band, the module being remarkable in that it comprises a comparator of the amplitude said reconstructed signal to said received and decoded time envelope and amplitude compression means adapted, in case of a positive comparison, to apply to said reconstructed signal an amplitude compression.
- an audio decoder comprising a module for estimating at least one parameter of an excitation signal in a first frequency band, a module for reconstructing a signal of excitation from said parameter, a decoding module of a temporal envelope in a second frequency band, a module (802) for decoding a frequency envelope in a second frequency band, a module (805) for setting in temporal form of said excitation signal, by means, at least, of said decoded time envelope ( ⁇ ) and a frequency forming module (807) of said excitation signal, by means of, at least, said frequency envelope decoded, remarkable in that said decoder comprises a post-processing module according to the invention.
- FIG. 1 is a diagram of a high-band coding / decoding stage in accordance with the prior art.
- FIG. 2 is a high level diagram of a hierarchical audio coder to
- FIG. 3 is a diagram of the high band encoder for the 13.65 kbit / s mode of the coder of FIG. 2.
- FIG. 4 is a diagram showing the frame division performed by the high band encoder of FIG.
- FIG. 5 is a high-level diagram of an 8, 12, 13.65 kbit / s hierarchical audio decoder associated with the coder of FIG. 2.
- Fig. 6 is a diagram of the high band decoder for the 13.65 kbit / s mode of the decoder of Fig. 5.
- Fig. 7 is a flowchart of a first embodiment of an amplitude compression function.
- FIG. 8 is a graph of the amplitude compression function of FIG. 7.
- Fig. 9 is a flowchart of a second embodiment of an amplitude compression function.
- Figure 10 is a graph of the amplitude compression function of Figure 9.
- Fig. 11 is a flowchart of a third embodiment of an amplitude compression function.
- FIG. 12 is a graph of the amplitude compression function of FIG. 11. It will be recalled that the present invention is more particularly part of an overall hierarchical audio coding and decoding scheme in subbands operating at three possible rates: 8, 12 or 13.65 kbit / s. In practice, the encoder always operates at the maximum rate of 13.65 kbit / s, while the decoder can receive the heart at 8 kbit / s and one or two enhancement layers at 12 or 13.65 kbit / s.
- the hierarchical audio coder is shown schematically in FIG.
- the broadband input signal sampled at 16 kHz is first decomposed into two subbands by QMF ("Quadrature Mirror”) filtering.
- QMF Quadrature Mirror
- the first frequency band, or low band, between 0 and 4000 Hz is obtained by low-pass filtering L and decimation 401, and the second frequency band, or high band, between 4000 and 8000 Hz by filtering 402 passes. H and decimation 403.
- the filters L and H are of length 64 and conform to those described in the J. Johnston article, ICASSP, flight. 5, pp. 291-294, 1980.
- the low band is pre-processed by a high pass filter 404 eliminating components below 50 Hz before CELP 405 coding in 8 and 12 kbit / s narrowband.
- This high-pass filtering takes account of the fact that the wide band is defined as covering the interval 50-7000 Hz.
- the narrow-band CELP coding corresponds to that of the ITU-T SG16 / WP3 D135 coder ( ITU-T, COM 16, D135 (WP 3/16), "France Telecom G729EV Candidate: High level description and complexity evaluation," Q.10 / 16, Study Period 2005-2008, Geneva, 26 July - 5 August 2005) ; it is a cascaded CELP encoding comprising as a first 8 kbit / s stage a modified G.729 coding (ITU-T G729 Recommendation, Coding of Speech at 8 kbps using Conjugate Structure Algebraic Code Excited Linear Prediction ( CS-ACELP), March 1996) without a pre-processing filter and as a second stage at 12 kbit / s an additional fixed CELP dictionary.
- CELP coding allows to determine the parameters of the excitation signal in the low band.
- the high band is first folded spectrally 406 to compensate for the folding due to the high pass filter 402 combined with the decimation 403.
- the high band is then pretreated by a low pass filter 407 eliminating the components between 3000 and 4000 Hz. of the high band, that is to say the components between 7000 and 8000 Hz of the original signal.
- a band extension 408, or high band coding, at 13.65 kbit / s is realized.
- the different bit streams generated by the coding modules 405 and 408 are multiplexed and structured into a hierarchical bit stream in the multiplexer 409.
- the coding is done in blocks of samples, or frames, of 20 ms, ie 320 samples.
- the hierarchical coding rate is 8, 12 and 13.65 kbit / s.
- the high band encoder 408 is detailed in FIG. 3. Its principle is similar to the parametric band extension of the ITU-T SG16 / WP3 D214 encoder.
- the high band signal x h i is coded in frames of N / 2 samples, where N is the number of samples of the original broadband frame and the division by 2 is due to the decimation by 2 of the high band.
- N / 2 160 samples, or 20 ms at 8 kHz sampling.
- time and frequency envelopes are extracted by the modules 600 and 601 as in the ITU-T SG16 / WP3 D214 encoder. These envelopes are then jointly quantized in block 602.
- This operation requires future samples, commonly called “lookahead” because the spectral analysis uses a temporal window centered on the current frame that overflows on the future frame.
- the frequent envelope extraction can be carried out for example as follows: calculation of the short-term spectrum with windowing of the current frame and lookahead, and discrete Fourier transform,
- the frequency envelope is thus defined as the rms value of each of the sub-bands of the signal Xh ,.
- Each frame of 20 ms consists of 160 samples:
- the time envelope of the current frame is calculated as follows:
- the time envelope is thus defined as the rms value of each of the 16 subframes of the signal X h ,.
- FIG. 5 represents a hierarchical audio decoder associated with the encoder which has just been described with reference to FIGS. 2 and 3.
- the bits describing each frame of 20 ms are demultiplexed by the demultiplexer 500.
- the bitstream of the 8 and 12 kbit / s layers is used by the decoding module 501 CELP to generate the parameters of synthesis of the excitation signal in the band.
- the low band synthetic speech signal is then postfiltered by block 502.
- the portion of the bit stream associated with the 13.65 kbit / s layer is decoded by the band extension module 503.
- the expanded band output signal, sampled at 16 kHz, is obtained through the synthesis QMF filter bank 504, 505, 507, 508 and 509, incorporating the reverse folding 506.
- the high band decoder 503 of FIG. 5 is described in detail in FIG.
- This decoder repeats the principle of synthesis of the high band described for the coder of FIG. 1, with however two modifications: a frequency envelope interpolation module 806 and a post-processing module 808. These two frequency envelope interpolation and post-processing modules are intended to improve the quality of coding in the high band.
- the module 806 interpolates between the frequency envelope of the preceding frame and the frequency envelope of the current frame so that this envelope evolves every 10 ms, instead of 20 ms.
- the high band decoder of FIG. 6 demultiplexes in the demultiplexer 800 the parameters received in the bitstream and decodes the time and frequency envelope information in the modules 801 and
- a synthetic excitation signal is generated in a reconstruction module 803 from the CELP excitation parameters received by the 8 and 12 kbit / s layers. This excitation is filtered in the 804 low-pass filter to keep only the frequencies between 0 and 3000 Hz which correspond to the 4000 to 7000 Hz band of the original signal. As in the encoder of FIG. 1, the synthetic excitation signal is shaped by the modules 805 and 807:
- the output of the temporal shaping module 805 ideally has an effective value (r.m.s.) per subframes which corresponds to the decoded time envelope; the module 805 therefore corresponds to the application of an adaptive gain in time,
- the output of the frequency shaping module 807 ideally has an effective value (rms) per sub-band which corresponds to the decoded frequency envelope; the module 807 can be realized by means of a filterbank or a transform with overlap.
- the signal resulting from the shaping of the excitation is finally processed by the post-processing module 808 to obtain the reconstructed high band y.
- the post-processing module 808 will now be described in detail.
- the post-processing performed by the module 808 consists in applying to the signal x coming from the frequency shaping module 807 an amplitude compression so as to limit the amplitude of the signal and thus avoid the artifacts that could occur as a result of lack of coupling between excitation and shaping.
- this post-treatment acts instantaneously, that is to say sample per sample without causing a delay in treatment
- the trigger threshold for the amplitude compression is provided by the time envelope as decoded by the time envelope decoding module 801.
- the post-processing is of the adaptive type because the value of ⁇ changes at each subframe of 10 samples, namely every 1.25 ms,
- the decoded time envelope for the current frame corresponds to a temporal support offset by 2 ms, ie 16 samples, as illustrated in FIG. 4.
- the adaptive post-processing keeps in memory the effective value (rms) of the two sub-bits. -sames associated with the "lookahead": these two subframes correspond to the two subframes of the beginning of the current frame.
- the flowchart of FIG. 7 details a first compression function, denoted C 1 (X), of post-processing.
- C 1 (X) a first compression function
- the beginning and end of the calculation are identified by blocks 1000 and 1006.
- the value of the output is first initialized at x (block 1001). Then two tests are done (blocks 1002 and
- FIG. 8 clearly shows that the function Ci (x) performs symmetrical amplitude compression with a "trigger threshold" set at +/- ⁇ . More precisely, the slope of Fi (x / ⁇ ) is 1 between [-1. + 1] and 1/16 elsewhere. Equivalently, the slope of Ci (x) is 1 between [- ⁇ , + ⁇ ] and 1/16 elsewhere.
- FIGS. 9 to 12 Two variants of the post-processing are described in FIGS. 9 to 12.
- the corresponding functions are denoted respectively C 2 (X) and C 3 (X).
- the post-processing C 2 (x) shown in FIGS. 9 and 10 is identical to C- ⁇ (x) but with a value of the "trigger threshold" which goes from +/- ⁇ to +/- 2 ⁇ .
- the slope of C 2 (x) is 1 between [-2 ⁇ , + 2 ⁇ ] and 1/16 elsewhere.
- the post-processing C 3 (x) is a more evolved variant of Ci (x), in which the amplitude compression is performed in two successive steps.
- the trip interval is always set to [- ⁇ , + ⁇ ] (blocks 1402 and 1406), whereas the value of y is only attenuated by a factor Vi, unless the value of y modified by blocks 1403 and 1407 is outside the range [-2.5 ⁇ , + 2.5 ⁇ ] in which case the value of y is further modified by blocks 1405 and 1409.
- C 3 ( x) The operation of C 3 ( x) is illustrated in Figure 12 where we can see that the slope of C 3 (x) is: - 1/16 on [- ⁇ , -4 ⁇ ] and [4 ⁇ , + ⁇ ], - 1/2 on [-Aa, - ⁇ ] and [ ⁇ , 4 ⁇ ] and - 1 on [- ⁇ , + ⁇ ].
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2009500896A JP5457171B2 (ja) | 2006-03-20 | 2007-03-20 | オーディオデコーダ内で信号を後処理する方法 |
KR1020087025600A KR101373207B1 (ko) | 2006-03-20 | 2007-03-20 | 오디오 디코더에서 신호를 사후-프로세싱하는 방법 |
CN200780010053XA CN101405792B (zh) | 2006-03-20 | 2007-03-20 | 用于在音频解码器中对信号进行后处理的方法 |
US12/225,462 US20090299755A1 (en) | 2006-03-20 | 2007-03-20 | Method for Post-Processing a Signal in an Audio Decoder |
EP07731774A EP2005424A2 (fr) | 2006-03-20 | 2007-03-20 | Procede de post-traitement d'un signal dans un decodeur audio |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0650954 | 2006-03-20 | ||
FR0650954 | 2006-03-20 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007107670A2 true WO2007107670A2 (fr) | 2007-09-27 |
WO2007107670A3 WO2007107670A3 (fr) | 2007-11-08 |
Family
ID=37500047
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2007/050959 WO2007107670A2 (fr) | 2006-03-20 | 2007-03-20 | Procede de post-traitement d'un signal dans un decodeur audio |
Country Status (6)
Country | Link |
---|---|
US (1) | US20090299755A1 (fr) |
EP (1) | EP2005424A2 (fr) |
JP (1) | JP5457171B2 (fr) |
KR (1) | KR101373207B1 (fr) |
CN (1) | CN101405792B (fr) |
WO (1) | WO2007107670A2 (fr) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
RU2631155C1 (ru) * | 2014-03-24 | 2017-09-19 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
US9779744B2 (en) * | 2009-04-03 | 2017-10-03 | Ntt Docomo, Inc. | Speech decoder with high-band generation and temporal envelope shaping |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008022181A2 (fr) * | 2006-08-15 | 2008-02-21 | Broadcom Corporation | Mise à jour des états de décodeur après un masquage de perte de paquet |
EP2362375A1 (fr) | 2010-02-26 | 2011-08-31 | Fraunhofer-Gesellschaft zur Förderung der Angewandten Forschung e.V. | Dispositif et procédé de modification d'un signal audio par mise en forme de son envelope |
CN103069484B (zh) * | 2010-04-14 | 2014-10-08 | 华为技术有限公司 | 时/频二维后处理 |
JP5997592B2 (ja) | 2012-04-27 | 2016-09-28 | 株式会社Nttドコモ | 音声復号装置 |
EP3503095A1 (fr) | 2013-08-28 | 2019-06-26 | Dolby Laboratories Licensing Corp. | Amélioration hybride de la parole codée du front d'onde et de paramètres |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2351889A (en) * | 1999-07-06 | 2001-01-10 | Ericsson Telefon Ab L M | Speech band expansion |
WO2001022401A1 (fr) * | 1999-09-20 | 2001-03-29 | Koninklijke Philips Electronics N.V. | Circuit de traitement pour corriger les signaux audio, recepteur, systeme de communications, appareil mobile et procede correspondant |
US20030187663A1 (en) * | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
WO2005078706A1 (fr) * | 2004-02-18 | 2005-08-25 | Voiceage Corporation | Procedes et dispositifs pour l'accentuation a basse frequence lors de la compression audio basee sur les technologies acelp/tcx (codage a prediction lineaire a excitation de code/codage par transformee d'excitation) |
Family Cites Families (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH07193548A (ja) * | 1993-12-25 | 1995-07-28 | Sony Corp | 雑音低減処理方法 |
US5945932A (en) * | 1997-10-30 | 1999-08-31 | Audiotrack Corporation | Technique for embedding a code in an audio signal and for detecting the embedded code |
JP3810257B2 (ja) * | 2000-06-30 | 2006-08-16 | 松下電器産業株式会社 | 音声帯域拡張装置及び音声帯域拡張方法 |
SE0004818D0 (sv) * | 2000-12-22 | 2000-12-22 | Coding Technologies Sweden Ab | Enhancing source coding systems by adaptive transposition |
US7590525B2 (en) * | 2001-08-17 | 2009-09-15 | Broadcom Corporation | Frame erasure concealment for predictive speech coding based on extrapolation of speech waveform |
US7173966B2 (en) * | 2001-08-31 | 2007-02-06 | Broadband Physics, Inc. | Compensation for non-linear distortion in a modem receiver |
US6988066B2 (en) * | 2001-10-04 | 2006-01-17 | At&T Corp. | Method of bandwidth extension for narrow-band speech |
US6895375B2 (en) * | 2001-10-04 | 2005-05-17 | At&T Corp. | System for bandwidth extension of Narrow-band speech |
US8204261B2 (en) * | 2004-10-20 | 2012-06-19 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Diffuse sound shaping for BCC schemes and the like |
US7720230B2 (en) * | 2004-10-20 | 2010-05-18 | Agere Systems, Inc. | Individual channel shaping for BCC schemes and the like |
CN1937496A (zh) | 2005-09-21 | 2007-03-28 | 日电(中国)有限公司 | 可延展伪名证书系统和方法 |
-
2007
- 2007-03-20 JP JP2009500896A patent/JP5457171B2/ja not_active Expired - Fee Related
- 2007-03-20 KR KR1020087025600A patent/KR101373207B1/ko not_active IP Right Cessation
- 2007-03-20 CN CN200780010053XA patent/CN101405792B/zh not_active Expired - Fee Related
- 2007-03-20 US US12/225,462 patent/US20090299755A1/en not_active Abandoned
- 2007-03-20 EP EP07731774A patent/EP2005424A2/fr not_active Withdrawn
- 2007-03-20 WO PCT/FR2007/050959 patent/WO2007107670A2/fr active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2351889A (en) * | 1999-07-06 | 2001-01-10 | Ericsson Telefon Ab L M | Speech band expansion |
WO2001022401A1 (fr) * | 1999-09-20 | 2001-03-29 | Koninklijke Philips Electronics N.V. | Circuit de traitement pour corriger les signaux audio, recepteur, systeme de communications, appareil mobile et procede correspondant |
US20030187663A1 (en) * | 2002-03-28 | 2003-10-02 | Truman Michael Mead | Broadband frequency translation for high frequency regeneration |
WO2005078706A1 (fr) * | 2004-02-18 | 2005-08-25 | Voiceage Corporation | Procedes et dispositifs pour l'accentuation a basse frequence lors de la compression audio basee sur les technologies acelp/tcx (codage a prediction lineaire a excitation de code/codage par transformee d'excitation) |
Non-Patent Citations (2)
Title |
---|
"High level description of the scalable 8-32 kbit/s algorithm submitted to the Qualification Test by Matsushita, Mindspeed and siemens" COMM 16 - D214 - E (WP 3/16). INTERNATIONAL TELECOMMUNICATION UNION., 26 juillet 2005 (2005-07-26), - 5 août 2005 (2005-08-05) XP008072690 Geneva, CH * |
ATKINSON I A ET AL: "1.6 kbit/s LP vocoder using time envelope" ELECTRONICS LETTERS, IEE STEVENAGE, GB, vol. 31, no. 7, 30 mars 1995 (1995-03-30), pages 517-519, XP006002638 ISSN: 0013-5194 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9779744B2 (en) * | 2009-04-03 | 2017-10-03 | Ntt Docomo, Inc. | Speech decoder with high-band generation and temporal envelope shaping |
US10366696B2 (en) | 2009-04-03 | 2019-07-30 | Ntt Docomo, Inc. | Speech decoder with high-band generation and temporal envelope shaping |
RU2631155C1 (ru) * | 2014-03-24 | 2017-09-19 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
RU2654141C1 (ru) * | 2014-03-24 | 2018-05-16 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
RU2707722C2 (ru) * | 2014-03-24 | 2019-11-28 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
RU2718421C1 (ru) * | 2014-03-24 | 2020-04-02 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
RU2732951C1 (ru) * | 2014-03-24 | 2020-09-24 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
RU2751150C1 (ru) * | 2014-03-24 | 2021-07-08 | Нтт Докомо, Инк. | Устройство аудиодекодирования, устройство аудиокодирования, способ аудиодекодирования, способ аудиокодирования, программа аудиодекодирования и программа аудиокодирования |
EP4293667A3 (fr) * | 2014-03-24 | 2024-06-12 | Ntt Docomo, Inc. | Dispositif de codage audio et procédé de codage audio |
Also Published As
Publication number | Publication date |
---|---|
KR20080109038A (ko) | 2008-12-16 |
CN101405792A (zh) | 2009-04-08 |
US20090299755A1 (en) | 2009-12-03 |
KR101373207B1 (ko) | 2014-03-12 |
WO2007107670A3 (fr) | 2007-11-08 |
JP5457171B2 (ja) | 2014-04-02 |
JP2009530679A (ja) | 2009-08-27 |
CN101405792B (zh) | 2012-09-05 |
EP2005424A2 (fr) | 2008-12-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP1989706B1 (fr) | Dispositif de ponderation perceptuelle en codage/decodage audio | |
EP1907812B1 (fr) | Procede de commutation de debit en decodage audio scalable en debit et largeur de bande | |
EP1905010B1 (fr) | Codage/décodage audio hiérarchique | |
EP2366177B1 (fr) | Codage de signal audionumerique avec mise en forme du bruit dans un codeur hierarchique | |
EP2002428B1 (fr) | Procede de discrimination et d'attenuation fiabilisees des echos d'un signal numerique dans un decodeur et dispositif correspondant | |
EP2452337B1 (fr) | Allocation de bits dans un codage/décodage d'amélioration d'un codage/décodage hiérarchique de signaux audionumériques | |
EP2115741A1 (fr) | Codage/decodage perfectionnes de signaux audionumeriques | |
EP2586133B1 (fr) | Contrôle d'une boucle de rétroaction de mise en forme de bruit dans un codeur de signal audionumérique | |
WO2007096551A2 (fr) | Procede de codage binaire d'indices de quantification d'une enveloppe d'un signal, procede de decodage d'une enveloppe d'un signal et modules de codage et decodage correspondants | |
EP2277172A1 (fr) | Dissimulation d'erreur de transmission dans un signal audionumerique dans une structure de decodage hierarchique | |
FR2907586A1 (fr) | Synthese de blocs perdus d'un signal audionumerique,avec correction de periode de pitch. | |
WO2007107670A2 (fr) | Procede de post-traitement d'un signal dans un decodeur audio | |
EP2452336A1 (fr) | Codage/décodage perfectionne de signaux audionumériques | |
EP2936488B1 (fr) | Atténuation efficace de pré-échos dans un signal audionumérique | |
EP2347411B1 (fr) | Attenuation de pre-echos dans un signal audionumerique | |
EP2132732B1 (fr) | Post-filtre pour des codecs en couche | |
EP2652735B1 (fr) | Codage perfectionne d'un etage d'amelioration dans un codeur hierarchique | |
FR2737360A1 (fr) | Procedes de codage et de decodage de signaux audiofrequence, codeur et decodeur pour la mise en oeuvre de tels procedes |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07731774 Country of ref document: EP Kind code of ref document: A2 |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2009500896 Country of ref document: JP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 200780010053.X Country of ref document: CN |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 5148/CHENP/2008 Country of ref document: IN |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007731774 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 1020087025600 Country of ref document: KR |
|
WWE | Wipo information: entry into national phase |
Ref document number: 12225462 Country of ref document: US |