EP1316087A1 - Übertragungsfehler-verdeckung in einem audiosignal - Google Patents

Übertragungsfehler-verdeckung in einem audiosignal

Info

Publication number
EP1316087A1
EP1316087A1 EP01969857A EP01969857A EP1316087A1 EP 1316087 A1 EP1316087 A1 EP 1316087A1 EP 01969857 A EP01969857 A EP 01969857A EP 01969857 A EP01969857 A EP 01969857A EP 1316087 A1 EP1316087 A1 EP 1316087A1
Authority
EP
European Patent Office
Prior art keywords
signal
samples
valid
synthesis
decoder
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
EP01969857A
Other languages
English (en)
French (fr)
Other versions
EP1316087B1 (de
Inventor
Balazs Kovesi
Dominique Massaloux
David Deleam
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Orange SA
Original Assignee
France Telecom SA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by France Telecom SA filed Critical France Telecom SA
Publication of EP1316087A1 publication Critical patent/EP1316087A1/de
Application granted granted Critical
Publication of EP1316087B1 publication Critical patent/EP1316087B1/de
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005Correction of errors induced by the transmission channel, if related to the coding algorithm

Definitions

  • the present invention relates to techniques for concealing consecutive transmission errors in transmission systems using any type of digital coding of the speech and / or sound signal.
  • time coders which carry out the compression of samples of digitized signal sample by sample
  • the coded values are then transformed into a binary train which will be transmitted on a transmission channel.
  • disturbances can affect the transmitted signal and produce errors on the bit stream received by the decoder. These errors can occur in isolation in the bitstream but very frequently occur in bursts. It is then a packet of bits corresponding to a complete portion of signal which is erroneous or not received. This type of problem is encountered for example for transmissions on mobile networks. It is also encountered in transmissions on packet networks and in particular on internet-type networks.
  • a general object of the invention is to improve, for any system of speech and sound compression, the subjective quality of the speech signal restored to the decoder when, due to poor quality of the transmission channel or due to the loss or non-reception of a packet in a packet transmission system, a set of consecutive coded data has been lost.
  • Most predictive coding algorithms offer techniques for recovering erased frames ([GSM-FR], [REC G.723.1A], [SALAMI], [HONKA EN], [COX-2], [CHEN- 2], [CHEN-3], [CHEN-4], [CHEN-5], [CHEN-6], [CHEN-7], [KROON-2], [WATKINS]).
  • the decoder is informed of the occurrence of a frame erased in one way or another, for example in the case of radio mobile systems by the transmission of the frame erasure information coming from the channel decoder.
  • the purpose of the devices for recovering erased frames is to extrapolate the parameters of the erased frame from the last previous frame (s) considered to be valid.
  • Some parameters manipulated or coded by predictive coders have a strong inter-frame correlation (case of short-term prediction parameters, also called “Lear” of "Linear Predictive Coding” (see [RABINER]) which represent the spectral envelope, and long-term prediction settings for voiced sounds, for example). Because of this correlation, it is much it is more advantageous to reuse the parameters of the last valid frame to synthesize the erased frame than to use erroneous or random parameters.
  • the LPC filter is obtained from the LPC parameters of the last valid frame either by copying the parameters or with the introduction of a certain damping (cf. encoder G723.1 [REC G.723.1A]).
  • the voicing is detected to determine the degree of harmonicity of the signal at the level of the erased frame ([SALAMI], this detection taking place as follows:
  • the procedures for concealing erased frames are strongly linked to the decoder and use modules of this decoder, such as the signal synthesis module. They use also intermediate signals available within this decoder such as the excitation signal passed and stored during the processing of valid frames preceding the erased frames.
  • the techniques for reconstructing erased frames are also based on the coding structure used: algorithms, such as [PICTEL, MAHIEUX-2], aim to regenerate the lost transformed coefficients from the values taken by these coefficients before erasure.
  • a prior art which can be considered as closest to the present invention is that which is described in [COMBESCURE], which proposed a method for concealing erased frames equivalent to that used in CELP coders for a transform coder.
  • the disadvantages of the proposed method were the introduction of audible spectral distortions
  • the invention allows for the concealment of erased frames without marked distortion at higher error rates and / or for longer erased intervals.
  • the energy of the synthesis signal thus generated is controlled using a gain calculated and adapted sample by sample.
  • the gain for controlling the synthesis signal is advantageously calculated as a function of at least one of the following parameters: energy values previously stored for the samples corresponding to valid data, fundamental period for the voiced sounds, or any parameter characterizing the frequency spectrum.
  • the gain applied to the synthesis signal decreases progressively as a function of the duration during which the synthesis samples are generated.
  • the contents of the memories used for the decoding processing are updated as a function of the synthesis samples generated.
  • a coding analogous to that implemented at the transmitter is implemented at least partially on the synthesized samples possibly followed by a decoding operation (possibly partial), the data obtained serving to regenerate the memories of the decoder.
  • this optionally partial coding-decoding operation can be advantageously used to regenerate the first erased frame because it makes it possible to exploit the content of the memories of the decoder before the cut, when these memories contain information not provided by the last valid samples. decoded (for example in the case of addition-overlap transformers, see paragraph 5.2.2.2.1 point 10).
  • an excitation signal is generated at the input of the short-term prediction operator which, in the neighboring zone, is the sum of a harmonic component and a weakly harmonic component or non harmonic, and in the voiced zone limited to the non harmonic component.
  • the harmonic component is advantageously obtained by implementing a filtering by means of the long-term prediction operator applied to a residual signal calculated by implementing reverse short-term filtering on the stored samples.
  • the other component can be determined with the idea of a long-term prediction operator to which pseudo-random disturbances (for example gain or period disturbances) are applied.
  • pseudo-random disturbances for example gain or period disturbances
  • the harmonic component represents the low frequencies of the spectrum, while the other component represents the high frequency part.
  • the long-term prediction operator is determined from the samples of valid stored frames, with a number of samples used for this estimation varying between a minimum value and a value equal to at least twice the estimated fundamental period for voiced sound.
  • the residual signal is advantageously modified by treatments of the non-linear type to eliminate amplitude peaks.
  • voice activity is detected by estimating noise parameters when the signal is considered to be non-active, and parameters of the synthesized signal are made to tend towards those of the estimated noise.
  • the spectral envelope of the noise of the decoded samples is estimated. valid and one generates a synthesized signal evolving towards a signal having the same spectral envelope.
  • the invention also provides a method for processing sound signals, characterized in that a discrimination is made between speech and musical sounds and when musical sounds are detected, a method of the aforementioned type is implemented without the estimation of a long-term prediction operator, the excitation signal being limited to a non-harmonic component obtained for example by generating uniform white noise.
  • the invention further relates to a device for concealing a transmission error in an audio-digital signal which receives as input a decoded signal which is transmitted to it by a decoder and which generates missing or erroneous samples in this decoded signal, characterized in that 'It includes processing means capable of implementing the above method.
  • It also relates to a transmission system comprising at least one encoder, at least one transmission channel, a module capable of detecting that transmitted data has been lost or is greatly erroneous, at least one decoder and an error concealment device which receives the decoded signal, characterized in that this error concealment device is a device of the aforementioned type.
  • FIG. 1 is a block diagram illustrating a transmission system according to a possible embodiment of the invention
  • FIG. 2 and Figure 3 are block diagrams illustrating an implementation according to a possible embodiment of the invention.
  • FIGS. 4 to 6 schematically illustrate the windows used with the error concealment method according to a possible embodiment of one invention
  • Figures 7 and 8 are schematic representations illustrating a possible embodiment of the invention in the case of musical signals.
  • FIG. 1 shows a device for coding and decoding the digital audio signal, comprising an encoder 1, a transmission channel 2, a module 3 making it possible to detect that the transmitted data has been lost or is strongly erroneous, a decoder 4, and a module 5 for concealing errors or lost packets in accordance with a possible embodiment of the invention.
  • this module in addition to the indication of erased data, receives the decoded signal in valid period and transmits signals used to update it to the decoder.
  • module 5 is based on:
  • the memory of the decoded samples is updated, containing a sufficient number of samples for the regeneration of any periods erased subsequently.
  • a signal of the order of 20 to 40 ms is stored.
  • the energy of the valid frames is also calculated and the energies corresponding to the last valid frames processed are stored in memory (typically of the order of 5 s).
  • a method for detecting voiced sounds (processing 12 in FIG. 3: V / NV detection, for "voiced / unvoiced") is used on the last stored data. For example one can use for that the normalized correlation ([KLEIJN]), or the criterion presented in the example of realization which follows.
  • a residual signal is calculated by reverse filtering LPC (processing 10) of the last stored samples. This signal is then used to generate an excitation signal from the LPC 11 synthesis filter (see below).
  • the synthesis of the replacement samples is carried out by introducing an excitation signal (calculated in 13 from the signal at the output of the inverse LPC filter) in the LPC synthesis filter 11 (l / A (z)) calculated in 1.
  • This excitation signal is generated in two different ways depending on whether the signal is voiced or unvoiced:
  • the excitation signal is the sum of two signals, one strongly harmonic component and the other less harmonic or not at all.
  • the strongly harmonic component is obtained by LTP filtering (processing module 14) using the parameters calculated in 2, of the residual signal mentioned in 3.
  • the second component can also be obtained by LTP filtering but made non-periodic by random modifications of the parameters, by generation of a pseudo-random signal.
  • the residual signal used for generating the excitation is processed to eliminate the amplitude peaks significantly above the average.
  • the energy of the synthesis signal is controlled using a gain calculated and adapted sample by sample. In the case where the erasure period is relatively long, it is necessary to gradually lower the energy of the synthesis signal.
  • the gain adaptation law is calculated according to different parameters: stored energy values before erasure (see in 1), fundamental period, and local stationarity of the signal at the time of cutting.
  • the system includes a module allowing the discrimination of stationary (like music) and non-stationary (like speech) sounds, different adaptation laws can also be used.
  • the first half of the memory of the last frame correctly received contains fairly precise information on the first half of the first lost frame (its weight in the addition-recovery is more important than that of the current frame). This information can also be used to calculate the adaptive gain.
  • the synthesis parameters can also be changed. If the system is coupled to a voice activity detection device with estimation of the noise parameters (such as [REC-G.723.1A], [SALAMI-2],
  • KLEIJN predictions
  • This information is normally available both to the coder, who must have done this for these preceding samples have a form of local decoding, and at the remote decoder present at the reception. As soon as the transmission channel is disturbed and the remote decoder no longer has the same information as the local decoder present at transmission, there is desynchronization between the encoder and the decoder.
  • this desynchronization can cause audible degradations which can last a long time or even increase over time if there are instabilities in the structure. In this case, it is therefore important to endeavor to resynchronize the coder and the decoder, that is to say to make an estimation of the memories of the decoder as close as possible to those of the coder.
  • resynchronization techniques depend on the coding structure used. One will be presented, the principle of which is general in this patent, but the complexity of which is potentially significant.
  • One possible method consists in introducing into the decoder on reception a coding module of the same type as that present on the transmission, making it possible to carry out the coding-decoding of the samples of the signal produced by the techniques mentioned in the preceding paragraph during the periods deleted. In this way the memories necessary to decode the following samples are completed with a priori similar data.
  • This update can be carried out at the time of production of the replacement samples, which distributes the complexity over the entire erasure zone, but is combined with the synthesis procedure described above.
  • the above procedure can also be limited to an intermediate zone at the start of the period of valid data succeeding an erased period, the updating procedure then being combined with the decoding operation. .
  • TDAC or TCDM ([MAHIEUX]) type transform coders are particularly addressed.
  • Broadband encoder (50-7000 Hz) at 24 kb / s or 32 kb / s. 20 ms frame (320 samples).
  • a binary frame contains the coded parameters obtained by the TDAC transformation on a window. After decoding these parameters, by doing the reverse transformation TDAC, we obtain an output frame of 20 ms which is the sum of the second half of the previous window and the first half of the current window.
  • the two parts of windows used for the reconstruction of the frame n have been marked in bold.
  • a lost binary frame disturbs the reconstruction of two consecutive frames (the current one and the next one, Figure 5).
  • FIG. 6 binary frame
  • the memory of the decoded samples is updated.
  • This memory is used for LPC and LTP analyzes of the signal passed in the event of erasure of a binary frame.
  • the LPC analysis is performed over a signal period of 20 ms (320 samples).
  • LTP analysis requires more samples to be stored.
  • the number of samples stored is equal to twice the maximum value of the pitch. For example, if the maximum value of the MaxPitch pitch is fixed at 320 samples (50 Hz, 20 ms), the last 640 samples will be memorized (40 ms of the signal).
  • MaxCorr 0.6
  • Tj the position of this maximum
  • MaxCorrL Corr (T] _) If ⁇ > MinPitch and MaxCorrL> 0.75 * MaxCorr, we choose i as the new fundamental period.
  • T p is less than MaxPitch / 2
  • we can check if it is really a voiced frame by looking for the local maximum of the correlation around 2 * TP (TPP) and checking if Corr (T PP )> 0.4. If Corr (T) ⁇ 0.4 and if the signal energy decreases, we set DiminFlag l and we decrease the value of MaxCorr, otherwise we look for the next local maximum between the current T P and MaxPitch.
  • Another voicing criterion consists in checking whether at least in 2/3 of the cases the signal delayed by the fundamental period has the same sign as the non-delayed signal.
  • the voicing decision also takes into account the signal energy: if the energy is strong, the value of MaxCorr is increased, so it is more likely that the frame is decided voiced. On the other hand, if the energy is very low, the value of MaxCorr is reduced.
  • this vector of Tp samples is processed.
  • the method used in our example is as follows: "We calculate the mean MeanAmpl of the absolute values of the last Tp samples of the residual signal.
  • the excitation signal is the sum of two signals, a strongly harmonic component limited in band at the low frequencies of the excb spectrum and another less harmonic limited to the highest frequencies exch.
  • the coefficients [0.15, 0.7, 0.15] correspond to a low pass FIR filter of 3 dB attenuation at Fs /.
  • the second component is also obtained by an LTP filtering made non-periodic by the random modification of its fundamental period Tph.
  • Tph is chosen as the integer part of a random real value Tpa.
  • the initial value of Tpa is equal to Tp then it is modified sample by sample by adding a random value in [-0.5, 0.5].
  • the voiced excitation is then the sum of these 2 components:
  • the excitation signal exe is also obtained by LTP filtering of order 3 with the coefficients [0.15, 0.7, 0.15] but it is made non-periodic by increasing the fundamental period d 'a value equal to 1 every 10 samples, and inversion of the sign with a probability of 0.2.
  • the memory of the decoder is updated for decoding the next frame (synchronization of the encoder and the decoder, see paragraph 5.1.4).
  • the addition-recovery technique makes it possible to check whether the synthesized voiced signal corresponds well to the original signal or not because for the first half of the first frame lost the weight of the last window memory correctly received is greater ( figure 6). So by taking the correlation between the first half of the first synthesized frame and the first half of the frame obtained after the TDAC g reverse TDAC operations, we can estimate the similarity between the lost frame and the replacement frame. A weak correlation ( ⁇ 0.65) indicates that the original signal is enough different from that obtained by the replacement method, and it is better to decrease the energy of the latter quickly to the minimum level.
  • points 1-6 relate to the analysis of the decoded signal preceding the first erased frame and allowing the construction of a synthesis model (LPC and possibly LTP) of this signal.
  • LPC synthesis model
  • the analysis is not repeated, the replacement of the lost signal is based on the parameters (LPC coefficients, pitch, MaxCorr, ResMem) calculated during the first erased frame.
  • Such processing implements the following steps for the music synthesis module, illustrated in FIG. 8:
  • the synthesis of the replacement samples is carried out by introducing an excitation signal into the LPC synthesis filter (l / A (z)) calculated in step 19.
  • This excitation signal - calculated in a step 20 - is a white noise whose amplitude is chosen to obtain a signal having the same energy as that of the last N samples stored in valid period.
  • the filtering step is referenced by 21.
  • Example of the control of the amplitude of the residual signal If the excitation is presented as a uniform white noise multiplied by a gain, one can calculate this gain G as follows:
  • Durbin's algorithm gives the energy of the residual signal. Knowing also the energy of the signal to be modeled, the gain G ⁇ c: of the LPC filter is estimated as the ratio of these two energies. Calculation of the target energy:
  • the target energy is estimated equal to the energy of the last N samples stored in a valid period (N is typically ⁇ the length of the signal used for the LPC analysis).
  • the energy of the synthesized signal is the product of the energy of white noise by G 2 and G ⁇ ⁇ . We choose G so that this energy is equal to the target energy.
  • the energy of the synthesis signal is controlled at using a gain calculated and adapted sample by sample. In the case where the erasure period is relatively long, it is necessary to gradually lower the energy of the synthesis signal.
  • the gain adaptation law can be calculated as a function of various parameters such as the energy values memorized before erasure, and local stationarity of the signal at the time of cutting. 6. Evolution of the synthesis procedure over time:
  • the synthesis parameters can also be changed. If the system is coupled to a device for detecting voice activity or musical signals with estimation of the noise parameters (such as [REC-G.723.1A],
  • the technique which has just been described has the advantage of being usable with any type of coder; in particular it makes it possible to remedy the problems of lost bit packets for time or transform coders, on speech and music signals with good performance: indeed in the present technique, the only signals memorized during periods when the data transmitted are valid are the samples from the decoder, information that is available regardless of the coding structure used.
  • AT&T DA Kapilo, RV Cox
  • FEC frame erasure concealment
  • GSM-FR GSM Recommendation 06.11. "Substitution and muting of lost frames for full rate speech traffic channels”. ETSI / TC SMG, ver. : 3.0.1. , February 1992.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Input Circuits Of Receivers And Coupling Of Receivers And Audio Equipment (AREA)
  • Automobile Manufacture Line, Endless Track Vehicle, Trailer (AREA)
  • Arrangements For Transmission Of Measured Signals (AREA)
EP01969857A 2000-09-05 2001-09-05 Übertragungsfehler-verdeckung in einem audiosignal Expired - Lifetime EP1316087B1 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
FR0011285 2000-09-05
FR0011285A FR2813722B1 (fr) 2000-09-05 2000-09-05 Procede et dispositif de dissimulation d'erreurs et systeme de transmission comportant un tel dispositif
PCT/FR2001/002747 WO2002021515A1 (fr) 2000-09-05 2001-09-05 Dissimulation d'erreurs de transmission dans un signal audio

Publications (2)

Publication Number Publication Date
EP1316087A1 true EP1316087A1 (de) 2003-06-04
EP1316087B1 EP1316087B1 (de) 2008-01-02

Family

ID=8853973

Family Applications (1)

Application Number Title Priority Date Filing Date
EP01969857A Expired - Lifetime EP1316087B1 (de) 2000-09-05 2001-09-05 Übertragungsfehler-verdeckung in einem audiosignal

Country Status (11)

Country Link
US (2) US7596489B2 (de)
EP (1) EP1316087B1 (de)
JP (1) JP5062937B2 (de)
AT (1) ATE382932T1 (de)
AU (1) AU2001289991A1 (de)
DE (1) DE60132217T2 (de)
ES (1) ES2298261T3 (de)
FR (1) FR2813722B1 (de)
HK (1) HK1055346A1 (de)
IL (2) IL154728A0 (de)
WO (1) WO2002021515A1 (de)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015044609A1 (fr) 2013-09-30 2015-04-02 Orange Re-echantillonnage d'un signal audio pour un codage/decodage a bas retard
CN109313905A (zh) * 2016-03-07 2019-02-05 弗劳恩霍夫应用研究促进协会 对不同的频带根据不同的阻尼因子淡出隐藏的音频帧的错误隐藏单元、音频解码器及相关方法和计算机程序

Families Citing this family (69)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030163304A1 (en) * 2002-02-28 2003-08-28 Fisseha Mekuria Error concealment for voice transmission system
FR2849727B1 (fr) * 2003-01-08 2005-03-18 France Telecom Procede de codage et de decodage audio a debit variable
JP4303687B2 (ja) 2003-01-30 2009-07-29 富士通株式会社 音声パケット消失隠蔽装置,音声パケット消失隠蔽方法,受信端末および音声通信システム
US7835916B2 (en) * 2003-12-19 2010-11-16 Telefonaktiebolaget Lm Ericsson (Publ) Channel signal concealment in multi-channel audio systems
KR100587953B1 (ko) * 2003-12-26 2006-06-08 한국전자통신연구원 대역-분할 광대역 음성 코덱에서의 고대역 오류 은닉 장치 및 그를 이용한 비트스트림 복호화 시스템
JP4761506B2 (ja) * 2005-03-01 2011-08-31 国立大学法人北陸先端科学技術大学院大学 音声処理方法と装置及びプログラム並びに音声システム
KR100915726B1 (ko) * 2005-04-28 2009-09-04 지멘스 악티엔게젤샤프트 잡음 억제 방법 및 장치
US7831421B2 (en) * 2005-05-31 2010-11-09 Microsoft Corporation Robust decoder
US8620644B2 (en) * 2005-10-26 2013-12-31 Qualcomm Incorporated Encoder-assisted frame loss concealment techniques for audio coding
US7805297B2 (en) 2005-11-23 2010-09-28 Broadcom Corporation Classification-based frame loss concealment for audio signals
US8417185B2 (en) 2005-12-16 2013-04-09 Vocollect, Inc. Wireless headset and method for robust voice data communication
US8160874B2 (en) * 2005-12-27 2012-04-17 Panasonic Corporation Speech frame loss compensation using non-cyclic-pulse-suppressed version of previous frame excitation as synthesis filter source
US7885419B2 (en) * 2006-02-06 2011-02-08 Vocollect, Inc. Headset terminal with speech functionality
US7773767B2 (en) 2006-02-06 2010-08-10 Vocollect, Inc. Headset terminal with rear stability strap
CA2658962A1 (en) * 2006-07-27 2008-01-31 Nec Corporation Sound data decoding apparatus
US8015000B2 (en) * 2006-08-03 2011-09-06 Broadcom Corporation Classification-based frame loss concealment for audio signals
RU2437170C2 (ru) * 2006-10-20 2011-12-20 Франс Телеком Ослабление чрезмерной тональности, в частности, для генерирования возбуждения в декодере при отсутствии информации
EP1921608A1 (de) * 2006-11-13 2008-05-14 Electronics And Telecommunications Research Institute Verfahren für die Einfügung von Vektorinformationen zum Schätzen von Sprachdaten in der Phase der Neusynchronisierung von Schlüsseln, Verfahren zum Übertragen von Vektorinformationen und Verfahren zum Schätzen der Sprachdaten bei der Neusynchronisierung von Schlüsseln unter Verwendung der Vektorinformationen
KR100862662B1 (ko) 2006-11-28 2008-10-10 삼성전자주식회사 프레임 오류 은닉 방법 및 장치, 이를 이용한 오디오 신호복호화 방법 및 장치
JP4504389B2 (ja) * 2007-02-22 2010-07-14 富士通株式会社 隠蔽信号生成装置、隠蔽信号生成方法および隠蔽信号生成プログラム
US9129590B2 (en) * 2007-03-02 2015-09-08 Panasonic Intellectual Property Corporation Of America Audio encoding device using concealment processing and audio decoding device using concealment processing
US7853450B2 (en) * 2007-03-30 2010-12-14 Alcatel-Lucent Usa Inc. Digital voice enhancement
US8126707B2 (en) * 2007-04-05 2012-02-28 Texas Instruments Incorporated Method and system for speech compression
WO2008146466A1 (ja) * 2007-05-24 2008-12-04 Panasonic Corporation オーディオ復号装置、オーディオ復号方法、プログラム及び集積回路
KR100906766B1 (ko) * 2007-06-18 2009-07-09 한국전자통신연구원 키 재동기 구간의 음성 데이터 예측을 위한 음성 데이터송수신 장치 및 방법
US8607127B2 (en) 2007-09-21 2013-12-10 France Telecom Transmission error dissimulation in a digital signal with complexity distribution
FR2929466A1 (fr) * 2008-03-28 2009-10-02 France Telecom Dissimulation d'erreur de transmission dans un signal numerique dans une structure de decodage hierarchique
CN101588341B (zh) * 2008-05-22 2012-07-04 华为技术有限公司 一种丢帧隐藏的方法及装置
KR20090122143A (ko) * 2008-05-23 2009-11-26 엘지전자 주식회사 오디오 신호 처리 방법 및 장치
MX2011000375A (es) * 2008-07-11 2011-05-19 Fraunhofer Ges Forschung Codificador y decodificador de audio para codificar y decodificar tramas de una señal de audio muestreada.
USD605629S1 (en) 2008-09-29 2009-12-08 Vocollect, Inc. Headset
JP2010164859A (ja) * 2009-01-16 2010-07-29 Sony Corp オーディオ再生装置、情報再生システム、オーディオ再生方法、およびプログラム
CN101609677B (zh) 2009-03-13 2012-01-04 华为技术有限公司 一种预处理方法、装置及编码设备
US8160287B2 (en) 2009-05-22 2012-04-17 Vocollect, Inc. Headset with adjustable headband
US8438659B2 (en) 2009-11-05 2013-05-07 Vocollect, Inc. Portable computing device and headset interface
ES2686889T3 (es) * 2009-12-14 2018-10-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Dispositivo de cuantificación vectorial, dispositivo de codificación de voz, procedimiento de cuantificación vectorial y procedimiento de codificación de voz
PT2676267T (pt) 2011-02-14 2017-09-26 Fraunhofer Ges Forschung Codificação e descodificação de posições de pulso de faixas de um sinal de áudio
CN103493129B (zh) 2011-02-14 2016-08-10 弗劳恩霍夫应用研究促进协会 用于使用瞬态检测及质量结果将音频信号的部分编码的装置与方法
PL2676268T3 (pl) 2011-02-14 2015-05-29 Fraunhofer Ges Forschung Urządzenie i sposób przetwarzania zdekodowanego sygnału audio w domenie widmowej
BR112013020324B8 (pt) * 2011-02-14 2022-02-08 Fraunhofer Ges Forschung Aparelho e método para supressão de erro em fala unificada de baixo atraso e codificação de áudio
PL2676266T3 (pl) 2011-02-14 2015-08-31 Fraunhofer Ges Forschung Układ kodowania na bazie predykcji liniowej wykorzystujący kształtowanie szumu w dziedzinie widmowej
AU2012217158B2 (en) 2011-02-14 2014-02-27 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Information signal representation using lapped transform
US8849663B2 (en) 2011-03-21 2014-09-30 The Intellisis Corporation Systems and methods for segmenting and/or classifying an audio signal from transformed audio information
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US9026434B2 (en) * 2011-04-11 2015-05-05 Samsung Electronic Co., Ltd. Frame erasure concealment for a multi rate speech and audio codec
US8620646B2 (en) 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8548803B2 (en) 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
CN107068156B (zh) * 2011-10-21 2021-03-30 三星电子株式会社 帧错误隐藏方法和设备以及音频解码方法和设备
CN104321815B (zh) * 2012-03-21 2018-10-16 三星电子株式会社 用于带宽扩展的高频编码/高频解码方法和设备
US9123328B2 (en) * 2012-09-26 2015-09-01 Google Technology Holdings LLC Apparatus and method for audio frame loss recovery
WO2014083380A1 (en) * 2012-11-27 2014-06-05 Nokia Corporation A shared audio scene apparatus
US9437203B2 (en) * 2013-03-07 2016-09-06 QoSound, Inc. Error concealment for speech decoder
FR3004876A1 (fr) * 2013-04-18 2014-10-24 France Telecom Correction de perte de trame par injection de bruit pondere.
PT3285254T (pt) 2013-10-31 2019-07-09 Fraunhofer Ges Forschung Descodificador de áudio e método para fornecer uma informação de áudio descodificada utilizando uma ocultação de erro com base num sinal de excitação no domínio de tempo
BR122022008597B1 (pt) 2013-10-31 2023-01-31 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Decodificador áudio e método para fornecer uma informação de áudio decodificada utilizando uma dissimulação de erro que modifica um sinal de excitação de domínio de tempo
US9437211B1 (en) * 2013-11-18 2016-09-06 QoSound, Inc. Adaptive delay for enhanced speech processing
EP2922056A1 (de) 2014-03-19 2015-09-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und zugehöriges Computerprogramm zur Erzeugung eines Fehlerverschleierungssignals unter Verwendung von Leistungskompensation
EP2922054A1 (de) * 2014-03-19 2015-09-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und zugehöriges Computerprogramm zur Erzeugung eines Fehlerverschleierungssignals unter Verwendung einer adaptiven Rauschschätzung
EP2922055A1 (de) 2014-03-19 2015-09-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung, Verfahren und zugehöriges Computerprogramm zur Erzeugung eines Fehlerverschleierungssignals mit einzelnen Ersatz-LPC-Repräsentationen für individuelle Codebuchinformationen
TWI602172B (zh) 2014-08-27 2017-10-11 弗勞恩霍夫爾協會 使用參數以加強隱蔽之用於編碼及解碼音訊內容的編碼器、解碼器及方法
KR102547480B1 (ko) * 2014-12-09 2023-06-26 돌비 인터네셔널 에이비 Mdct-도메인 에러 은닉
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
CN109155134B (zh) * 2016-03-07 2023-05-23 弗劳恩霍夫应用研究促进协会 隐藏音频帧丢失的错误隐藏单元、音频解码器和相关方法
EP3553777B1 (de) * 2018-04-09 2022-07-20 Dolby Laboratories Licensing Corporation Verdecken von paketverlusten mit niedriger komplexität für transcodierte audiosignale
US10763885B2 (en) 2018-11-06 2020-09-01 Stmicroelectronics S.R.L. Method of error concealment, and associated device
CN111063362B (zh) * 2019-12-11 2022-03-22 中国电子科技集团公司第三十研究所 一种数字语音通信噪音消除和语音恢复方法及装置

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2746033B2 (ja) * 1992-12-24 1998-04-28 日本電気株式会社 音声復号化装置
US5574825A (en) * 1994-03-14 1996-11-12 Lucent Technologies Inc. Linear prediction coefficient generation during frame erasure or packet loss
CA2142391C (en) * 1994-03-14 2001-05-29 Juin-Hwey Chen Computational complexity reduction during frame erasure or packet loss
CA2177413A1 (en) * 1995-06-07 1996-12-08 Yair Shoham Codebook gain attenuation during frame erasures
US5732389A (en) * 1995-06-07 1998-03-24 Lucent Technologies Inc. Voiced/unvoiced classification of speech for excitation codebook selection in celp speech decoding during frame erasures
US5699485A (en) * 1995-06-07 1997-12-16 Lucent Technologies Inc. Pitch delay modification during frame erasures
CN100583242C (zh) * 1997-12-24 2010-01-20 三菱电机株式会社 声音译码方法和声音译码装置
FR2774827B1 (fr) * 1998-02-06 2000-04-14 France Telecom Procede de decodage d'un flux binaire representatif d'un signal audio
US6449590B1 (en) * 1998-08-24 2002-09-10 Conexant Systems, Inc. Speech encoder using warping in long term preprocessing
US6556966B1 (en) * 1998-08-24 2003-04-29 Conexant Systems, Inc. Codebook structure for changeable pulse multimode speech coding
US6188980B1 (en) * 1998-08-24 2001-02-13 Conexant Systems, Inc. Synchronized encoder-decoder frame concealment using speech coding parameters including line spectral frequencies and filter coefficients
US6240386B1 (en) * 1998-08-24 2001-05-29 Conexant Systems, Inc. Speech codec employing noise classification for noise compensation
JP3365360B2 (ja) * 1999-07-28 2003-01-08 日本電気株式会社 音声信号復号方法および音声信号符号化復号方法とその装置
US7590525B2 (en) * 2001-08-17 2009-09-15 Broadcom Corporation Frame erasure concealment for predictive speech coding based on extrapolation of speech waveform

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See references of WO0221515A1 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015044609A1 (fr) 2013-09-30 2015-04-02 Orange Re-echantillonnage d'un signal audio pour un codage/decodage a bas retard
US10403296B2 (en) 2013-09-30 2019-09-03 Koninklijke Philips N.V. Resampling an audio signal for low-delay encoding/decoding
CN109313905A (zh) * 2016-03-07 2019-02-05 弗劳恩霍夫应用研究促进协会 对不同的频带根据不同的阻尼因子淡出隐藏的音频帧的错误隐藏单元、音频解码器及相关方法和计算机程序
CN109313905B (zh) * 2016-03-07 2023-05-23 弗劳恩霍夫应用研究促进协会 隐藏音频帧丢失的错误隐藏单元、音频解码器及相关方法

Also Published As

Publication number Publication date
FR2813722A1 (fr) 2002-03-08
ES2298261T3 (es) 2008-05-16
IL154728A (en) 2008-07-08
FR2813722B1 (fr) 2003-01-24
US20040010407A1 (en) 2004-01-15
AU2001289991A1 (en) 2002-03-22
EP1316087B1 (de) 2008-01-02
US20100070271A1 (en) 2010-03-18
HK1055346A1 (en) 2004-01-02
WO2002021515A1 (fr) 2002-03-14
DE60132217T2 (de) 2009-01-29
US7596489B2 (en) 2009-09-29
ATE382932T1 (de) 2008-01-15
US8239192B2 (en) 2012-08-07
IL154728A0 (en) 2003-10-31
JP5062937B2 (ja) 2012-10-31
DE60132217D1 (de) 2008-02-14
JP2004508597A (ja) 2004-03-18

Similar Documents

Publication Publication Date Title
EP1316087B1 (de) Übertragungsfehler-verdeckung in einem audiosignal
EP2277172B1 (de) Verbergung von übertragungsfehlern in einem digitalsignal in einer hierarchischen decodierungsstruktur
AU2003233724B2 (en) Method and device for efficient frame erasure concealment in linear predictive based speech codecs
RU2419891C2 (ru) Способ и устройство эффективной маскировки стирания кадров в речевых кодеках
EP2535893B1 (de) Einrichtung und Verfahren zum Verbergen verlorener Rahmen
EP2080195B1 (de) Synthese verlorener blöcke eines digitalen audiosignals
EP1051703B1 (de) Verfahren zur dekodierung eines audiosignals mit korrektur von übertragungsfehlern
EP2080194B1 (de) Dämpfung von stimmüberlagerung, im besonderen zur erregungserzeugung bei einem decoder in abwesenheit von informationen
EP3175444B1 (de) Rahmenverlustverwaltung in einem fd-/ldp-übergangskontext
EP2347411B1 (de) Vor-echo-dämpfung in einem digitalaudiosignal
FR2830970A1 (fr) Procede et dispositif de synthese de trames de substitution, dans une succession de trames representant un signal de parole
MX2008008477A (es) Metodo y dispositivo para ocultamiento eficiente de borrado de cuadros en codec de voz

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20030324

AK Designated contracting states

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

APBN Date of receipt of notice of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA2E

APBR Date of receipt of statement of grounds of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNNOA3E

APBV Interlocutory revision of appeal recorded

Free format text: ORIGINAL CODE: EPIDOSNIRAPE

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: FRENCH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REF Corresponds to:

Ref document number: 60132217

Country of ref document: DE

Date of ref document: 20080214

Kind code of ref document: P

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20080408

REG Reference to a national code

Ref country code: ES

Ref legal event code: FG2A

Ref document number: 2298261

Country of ref document: ES

Kind code of ref document: T3

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

REG Reference to a national code

Ref country code: HK

Ref legal event code: GR

Ref document number: 1055346

Country of ref document: HK

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080602

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080402

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20081003

BERE Be: lapsed

Owner name: FRANCE TELECOM

Effective date: 20080930

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20080930

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20080930

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20080930

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20080930

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20080905

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080102

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20080403

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 16

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 17

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 18

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20200819

Year of fee payment: 20

Ref country code: GB

Payment date: 20200819

Year of fee payment: 20

Ref country code: DE

Payment date: 20200819

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: IT

Payment date: 20200824

Year of fee payment: 20

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: ES

Payment date: 20201001

Year of fee payment: 20

REG Reference to a national code

Ref country code: DE

Ref legal event code: R071

Ref document number: 60132217

Country of ref document: DE

REG Reference to a national code

Ref country code: GB

Ref legal event code: PE20

Expiry date: 20210904

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20210904

REG Reference to a national code

Ref country code: ES

Ref legal event code: FD2A

Effective date: 20211228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF EXPIRATION OF PROTECTION

Effective date: 20210906