EP2245621B1 - Procédé et moyens d encodage d informations de bruit de fond - Google Patents

Procédé et moyens d encodage d informations de bruit de fond Download PDF

Info

Publication number
EP2245621B1
EP2245621B1 EP09711908.5A EP09711908A EP2245621B1 EP 2245621 B1 EP2245621 B1 EP 2245621B1 EP 09711908 A EP09711908 A EP 09711908A EP 2245621 B1 EP2245621 B1 EP 2245621B1
Authority
EP
European Patent Office
Prior art keywords
sid
background noise
narrowband
speech
parameters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
EP09711908.5A
Other languages
German (de)
English (en)
Other versions
EP2245621A1 (fr
Inventor
Herve Taddei
Stefan Schandl
Panji Setiawan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Unify GmbH and Co KG
Original Assignee
Unify GmbH and Co KG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Unify GmbH and Co KG filed Critical Unify GmbH and Co KG
Publication of EP2245621A1 publication Critical patent/EP2245621A1/fr
Application granted granted Critical
Publication of EP2245621B1 publication Critical patent/EP2245621B1/fr
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012Comfort noise or silence coding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • G10L19/24Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding

Definitions

  • the invention relates to methods and means for encoding background noise information in speech signal coding methods.
  • Such a limited frequency range is also provided in many speech signal coding methods for today's digital telecommunications.
  • a bandwidth limitation of the analog signal Prior to a coding process, a bandwidth limitation of the analog signal is performed for this purpose.
  • a codec is used which, due to the described bandwidth limitation in the frequency range between 300 Hz and 3400 Hz, is also referred to below as narrow-band speech codec (Narrow Band Speech Codec).
  • the term codec is understood to mean both the coding rule for the digital coding of audio signals and the decoding rule for the decoding of data with the aim of reconstructing the audio signal.
  • a narrowband speech codec is known from ITU-T Recommendation G.729.
  • a transmission of a narrowband speech signal with a data rate of 8 kbit / s is provided.
  • broadband speech codecs Wide Band Speech Codec
  • Such an extended frequency range is e.g. between a frequency of 50 Hz and 7000 Hz.
  • a wideband voice codec is known from ITU-T Recommendation G.729.EV.
  • coding methods for broadband speech codecs are made scalable.
  • scalability it is meant here that the transmitted coded data includes various demarcated blocks containing the narrowband portion, the wideband portion and / or the full bandwidth of the coded voice signal.
  • such a scalable design allows for backward compatibility on the receiver side and, on the other hand, offers a simple possibility of adapting the data rate and the size of transmitted data frames in the transmission channel in the case of limited data transmission capacities in the transmission channel.
  • a compression of the data to be transmitted For a reduction of the data transmission rate by a codec is usually provided a compression of the data to be transmitted. Compression is achieved, for example, by coding methods, parameters for an excitation signal and filter parameters being determined for encoding the speech data. The filter parameters and parameters specifying the excitation signal are then transmitted to the receiver. There, a synthetic speech signal is synthesized using the codec, which is as similar as possible to the original speech signal in terms of a subjective hearing impression. With the help of this method, also known as "analysis-by-synthesis", the determined and digitized samples are not transmitted, but determined parameters that allow a receiver-side synthesis of the speech signal.
  • a further measure for reducing the data transmission rate is provided by a method for discontinuous transmission (Discontinuous Transmission), which is also familiar in the art under the term DTX.
  • DTX discontinuous Transmission
  • the basic goal of DTX is to reduce the data transfer rate in the event of a speech break.
  • a pause detection (Voice Activity Detection, VAD) is used on the part of the transmitter, which recognizes when a certain signal level falls below a speech break.
  • VAD Voice Activity Detection
  • the receiver is not expected to have complete silence during a speech break.
  • a complete silence on the receiver side would lead to irritation or even to the suspicion of a breakdown of the connection.
  • methods for generating a so-called comfort noise are applied.
  • Comfort noise is noise that is synthesized to fill silence phases on the receiver's side.
  • the comfort noise serves as a subjective impression of a continuing connection, without claiming the data transmission rate intended for the transmission of speech signals. In other words, less effort is required to code the speech data for the transmitter-side coding of the noise. For a receiver-side still perceived as realistic synthesizing the comfort noise data are transmitted at a much lower data rate.
  • the data transmitted here are also referred to in the art as SID (Silence Insertion Description).
  • G.729.1 SID has an embedded structure with a core SID equal to the G.729 SID and a first and second extension layer.
  • the first enhancement layer adds some parameters for narrowband comfort noise, while the second enhancement layer adds wideband information, with the SID much smaller than any other frame.
  • a marker (M) bit should be set to 1 when using DTX in the RTP header.
  • ITU-T standard G.729.1 (05/2006): SERIES G: TRANSMISSION SYSTEMS AND MEDIA, DIGITAL SYSTEMS AND NETWORKS, Digital terminal equipments - Coding of analogue signals by methods other than PCM, "G.729-based embedded variable bit-rate coder: to 8-32 kbps scalable wideband coder bitstream interoperable with G.729", ITU-T RECOMMENDATION G.729.1, approved on 29 May 2006 (2006-05 -29) by ITU-T Study Group 16 (2005-2008), International Telecommunication Union, Geneva, CH, ITU-T Rec.
  • the encoding of the background noise information occurs over either the entire bandwidth of the input noise signal or over a portion of the bandwidth of the input noise signal.
  • the encoded noise signal is transmitted in the form of SID frames via the DTX method and reconstructed on the receiver side.
  • the reconstructed, ie synthesized, comfort noise may therefore have a different quality than the speech information synthesized on the receiver side. This adversely affects the reception of the recipient.
  • the object of the invention is to provide an improved implementation of the DTX method in scalable speech codecs.
  • a basic idea of the invention is to provide the scalability known for the transmission of speech information analogously to the formation of a SID frame.
  • the inventive method for encoding a SID frame for transmission of background noise information using a scalable speech signal encoding method provides for encoding a narrowband first and a wideband second portion of the background noise information.
  • the encoding is usually done at the same time and in different ways. However, the encoding of a share can of course also take place with a time offset before or after an encoding of another share. Likewise, the encoding of the two components can optionally also be carried out in the same way.
  • a SID frame is formed with separate regions for the first and the second component. In other words, in the SID frame, this means that a first data area receives the data for the encoded first portion, while a second data area separate from it receives the data for the encoded second portion.
  • An essential advantage of the invention is that it can be determined on the receiver side whether comfort noise should be based on the broadband portion of the transmitted SID frames or on the basis of the narrowband portion.
  • This is of particular advantage for the receiver-side acoustic reception in a situation where the transmission rate for speech information frames has been reduced so that only narrowband speech information is transmitted. Namely, as synthesized in the current state of the art, narrowband speech information in conjunction with broadband noise, this is very irritating for the receiver.
  • the said reduction of the transmission rate for speech information frames can be caused, for example, by a high congestion of the network between transmitter and receiver.
  • the much smaller SID frames are not affected by such a network bottleneck. For them, there is no compulsion to reduce their data transfer rate or their content.
  • a third portion is provided in the definition of the SID frame.
  • This contains encoded background noise parameters, which are encoded with an increased data rate, although the third component still contains narrowband data (extended narrowband data or "enhanced low band").
  • the advantage of defining the SID frame with this third component is the ability to reproduce a noise signal in a quality enhanced in comparison to conventional narrowband coding while remaining in compliance with the G.729.B standard.
  • the single FIGURE shows a structure of a SID frame according to the invention.
  • discontinuous transmission (DTX) methods for the transmission of background noise information currently do not support the scalable character intended for the transmission of the speech information.
  • narrow-band speech codecs such as e.g. 3GPP AMR, ITU-T G.729 and on the other hand broadband speech codecs, e.g. 3GPP AMR-WB, ITU-T G.722.
  • a narrow-band speech codec encodes speech signals at a sampling frequency of 8 kHz with a bandwidth which is usually in the frequency range between 300 and 3400 Hz.
  • a wideband speech codec encodes a speech signal having a sampling frequency of 16 kHz at a bandwidth in a frequency range between 50 and 7000 Hz.
  • Some of these codecs use DTX techniques, that is, discontinuous transmission techniques to reduce the overall transmission rate in the communication channel.
  • DTX discontinuous transmission techniques to reduce the overall transmission rate in the communication channel.
  • SID frames are transmitted, with the bandwidth of the SID frames corresponding to the bandwidth of the voice signal.
  • the background noise during a speech break is described.
  • This codec G.729.1 is a scalable speech codec in which the DTX method is currently not scalable across the entire bandwidth.
  • the coding method can be characterized as follows during an active speech period, in contrast to a speech pause recognized as »Silent Period «:
  • the speech signal is split into two parts, namely a narrowband (lowband) part and a broadband (highband) part. Both signals are sampled at a sampling frequency of 8 kHz.
  • the division into a narrowband and a broadband component takes place in a special bandpass filter, which is also referred to as QMF (Quadrature Mirror Filter).
  • the narrowband portion of the speech signal is encoded at a data rate of 8 and 12 kbit / s.
  • a CELP Code Excited Linear Prediction
  • the narrowband component is further modified taking into account the »Transform Codec « section of G.729.1.
  • the broadband portion of the current frame again assuming it contains voice signals, is encoded at a data rate of 14 kbit / s using the TDBWE (Time Domain Bandwidth Extension) method.
  • TDBWE Time Domain Bandwidth Extension
  • the speech signal is also split into a narrowband and a broadband component, with both components sampled at a frequency of 8 kHz.
  • the decomposition also takes place via a QMF filter.
  • the narrowband portion is encoded using narrow-band SID information.
  • This narrowband SID information is sent to the receiver at a later time in a SID frame compatible with the G.729 standard. Further measures as described above can contribute to an improvement of the narrowband SID component.
  • the broadband component is encoded using a modified TDBWE method.
  • the speech signal is further encoded at a data rate of 14 kbit / s, while at the same time the background noise detected during the speech pause is evaluated and corresponding parameters are set.
  • the background noise is evaluated with regard to the energy of the noise signal and its frequency distribution.
  • the temporal fine structure is not evaluated, but merely an average of the energy is formed over the frame.
  • FIG shows a SID frame with separate areas for a narrowband first portion LB ("Low Band”), a broadband second portion HB ("High Band”) and an itermediate third portion ELB ("Enhanced Low Band”).
  • LB narrowband first portion
  • HB broadband second portion
  • ELB Enhanced Low Band
  • the first component LB contains encoded background noise parameters, which are encoded at a data rate of 8 kbit / s or below.
  • the data length of the first component LB is, for example, 15 bits.
  • the second component HB contains encoded background noise parameters, which are encoded with a data rate between 14 kbit / s and 32 kbit / s.
  • the data length of the second component HB is for example 19 bits.
  • the third component ELB contains encoded background noise parameters, which are encoded with a data rate of greater than 8 kbit / s, for example 12 kbit / s.
  • the data length of the third component ELB is 9 bits, for example.
  • characteristics of the background noise are learned on the part of the encoder.
  • the characteristics include in particular the temporal distribution as well as the spectral form of the background noise.
  • a filtering method is used, which takes into account temporal and spectral parameters of background noise from previous frames. If there are significant changes in the character or magnitude of background noise, a decision is made based on threshold values as to whether there is a need to update the learned parameters.
  • the following procedure is carried out:
  • a "regular" ie a speech signal containing, frame is received
  • the data rate for such regular frames is usually 8 kbit / s or above.
  • comfort noise is synthesized, and in the case of a wideband SID, a broadband comfort noise is synthesized and output with a read-out gain.
  • the embodiments relate to further details for incorporating the DTX method into broadband codecs such as e.g. G.729.1, and further methods for modifying the TDBWE method, which include synthesizing comfort noise during non-active frames, i. Frames without language information, support.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Mobile Radio Communication Systems (AREA)

Claims (10)

  1. Procédé d'encodage d'une trame SID (SID) pour une transmission d'informations de bruit de fond en application d'un procédé de codage de signaux vocaux modulable avec les étapes suivantes :
    encodage d'une première partie à bande étroite (LB) et d'une deuxième partie à bande large (HB) de l'information de bruit de fond ;
    formation de la trame SID (SID) avec des domaines séparés pour la première (LB) et la deuxième (HB) partie ;
    prédiction de la modularité pour la transmission d'informations vocales analogue à celle lors de la formation de la trame SID (SID) de sorte qu'il est déterminé côté récepteur, si un bruit de confort a lieu sur la base de la deuxième partie à bande large (HB) de la trame SID (SID) transmise ou sur la base de la première partie à bande étroite (LB), dans lequel, lorsque des informations vocales à bande large sont transmises, le bruit de confort a lieu sur la base de la deuxième partie à bande large (HB) de la trame SID (SID) transmise, et, lorsque des informations vocales à bande étroite sont transmises, le bruit de confort a lieu sur la base de la première partie à bande étroite (HB) de la trame SID (SID) transmise ;
    dans lequel une troisième partie à bande étroite (ELB) est encodée et que la trame SID (SID) est formée avec un domaine séparé supplémentaire pour la troisième partie (ELB) ;
    dans lequel la troisième partie à bande étroite (ELB) est rendue dans une qualité accrue par rapport à la première partie à bande étroite par un débit de données accru de l'encodage de paramètres temporels et spectraux du bruit de fond et en conformité avec le standard G.729.B ;
    dans lequel la première partie (LB) de l'information de bruit de fond est encodée conformément aux directives de codage du standard G.729.B ;
    caractérisé en ce
    que la deuxième partie (HB) de l'information de bruit de fond est encodée conformément à un procédé TDBWE modifié,
    dans lequel une simplification dans le procédé TDBWE modifié est atteinte par le fait qu'un encodage dans le domaine temporel correspond seulement à l'énergie du signal dans le domaine temporel.
  2. Procédé selon l'une quelconque des revendications précédentes, caractérisé en ce que pendant une pause vocale du côté d'un encodeur, des caractéristiques d'un bruit de fond sont analysées et prises en compte, donc apprises, dans lequel les caractéristiques comprennent en particulier la distribution temporelle tout comme la forme spectrale du bruit de fond.
  3. Procédé selon la revendication 2, caractérisé en ce qu'un procédé de filtrage, qui tient compte de paramètres temporels et spectraux du bruit de fond de trames précédentes, est appliqué pour le procédé d'apprentissage.
  4. Procédé selon la revendication 3, caractérisé en ce que s'il résulte des modifications significatives dans le caractère ou dans l'intensité du bruit de fond, une décision est prise sur la base de paramètres de valeur limite (Threshold Values), s'il y a besoin d'actualiser les paramètres appris.
  5. Procédé selon la revendication 4, caractérisé en ce que des trames SID (SID) sont envoyées, lorsqu'une modification significative de la deuxième partie à bande large (HB) du bruit de fond est détectée ou lorsqu'une actualisation de la première partie à bande étroite (LB) doit être envoyée.
  6. Procédé selon l'une quelconque des revendications précédentes, caractérisé en ce que pendant une période d'excédent, qui commence après un changement d'une phase vocale active à une pause vocale pour l'apprentissage du bruit de fond, des procédés de filtrage pour l'association d'une plus grande importance d'une trame actuelle que la trame précédente sont appliqués.
  7. Procédé selon l'une quelconque des revendications précédentes, caractérisé en ce que des paramètres d'énergie filtrés dans la deuxième partie à bande large (HB) sont utilisées pour la description d'un bruit de fond, qui comprennent des paramètres d'enveloppes dans le domaine temporel (tenv_fidx) et/ou des paramètres d'enveloppes dans le domaine de fréquence (fenv_fidx[i]).
  8. Procédé selon la revendication 7, caractérisé en ce qu'un indice respectif (idx) identifie une trame respective, dans lequel l'enveloppe est formée dans le domaine de fréquence par un nombre adapté de valeurs de fréquence (i={1, ..., NB-SUBBANDS}) pour la description des propriétés spectrales du bruit de fond.
  9. Codec avec des moyens pour la réalisation du procédé selon l'une quelconque des revendications 1 à 8.
  10. Codec selon la revendication 9, caractérisé par une mise en oeuvre dans le standard ITU-T G.729.1 connu en soi.
EP09711908.5A 2008-02-19 2009-02-02 Procédé et moyens d encodage d informations de bruit de fond Active EP2245621B1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE102008009719A DE102008009719A1 (de) 2008-02-19 2008-02-19 Verfahren und Mittel zur Enkodierung von Hintergrundrauschinformationen
PCT/EP2009/051118 WO2009103608A1 (fr) 2008-02-19 2009-02-02 Procédé et moyens d’encodage d’informations de bruit de fond

Publications (2)

Publication Number Publication Date
EP2245621A1 EP2245621A1 (fr) 2010-11-03
EP2245621B1 true EP2245621B1 (fr) 2019-05-01

Family

ID=40652248

Family Applications (1)

Application Number Title Priority Date Filing Date
EP09711908.5A Active EP2245621B1 (fr) 2008-02-19 2009-02-02 Procédé et moyens d encodage d informations de bruit de fond

Country Status (8)

Country Link
US (2) US20100318352A1 (fr)
EP (1) EP2245621B1 (fr)
JP (1) JP5361909B2 (fr)
KR (2) KR101364983B1 (fr)
CN (1) CN101952886B (fr)
DE (1) DE102008009719A1 (fr)
RU (1) RU2461080C2 (fr)
WO (1) WO2009103608A1 (fr)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101483495B (zh) * 2008-03-20 2012-02-15 华为技术有限公司 一种背景噪声生成方法以及噪声处理装置
CN103187065B (zh) 2011-12-30 2015-12-16 华为技术有限公司 音频数据的处理方法、装置和系统
EP2936486B1 (fr) 2012-12-21 2018-07-18 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Ajout de bruit de confort pour modeler un bruit d'arrière-plan à des débits binaires faibles
AU2013366642B2 (en) * 2012-12-21 2016-09-22 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Generation of a comfort noise with high spectro-temporal resolution in discontinuous transmission of audio signals
MX345622B (es) * 2013-01-29 2017-02-08 Fraunhofer Ges Forschung Decodificador para generar una señal de audio mejorada en frecuencia, método de decodificación, codificador para generar una señal codificada y metodo de codificación utilizando informacion secundaria de selección compacta.
CN104217723B (zh) * 2013-05-30 2016-11-09 华为技术有限公司 信号编码方法及设备
MY181026A (en) * 2013-06-21 2020-12-16 Fraunhofer Ges Forschung Apparatus and method realizing improved concepts for tcx ltp
JP6035270B2 (ja) * 2014-03-24 2016-11-30 株式会社Nttドコモ 音声復号装置、音声符号化装置、音声復号方法、音声符号化方法、音声復号プログラム、および音声符号化プログラム
EP2980790A1 (fr) 2014-07-28 2016-02-03 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Appareil et procédé de sélection de mode de génération de bruit de confort
KR101701623B1 (ko) * 2015-07-09 2017-02-13 라인 가부시키가이샤 VoIP 통화음성 대역폭 감소를 은닉하는 시스템 및 방법
US10978096B2 (en) * 2017-04-25 2021-04-13 Qualcomm Incorporated Optimized uplink operation for voice over long-term evolution (VoLte) and voice over new radio (VoNR) listen or silent periods

Family Cites Families (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI105001B (fi) * 1995-06-30 2000-05-15 Nokia Mobile Phones Ltd Menetelmä odotusajan selvittämiseksi puhedekooderissa epäjatkuvassa lähetyksessä ja puhedekooderi sekä lähetin-vastaanotin
US5960389A (en) 1996-11-15 1999-09-28 Nokia Mobile Phones Limited Methods for generating comfort noise during discontinuous transmission
RU2237296C2 (ru) * 1998-11-23 2004-09-27 Телефонактиеболагет Лм Эрикссон (Пабл) Кодирование речи с функцией изменения комфортного шума для повышения точности воспроизведения
US7124079B1 (en) * 1998-11-23 2006-10-17 Telefonaktiebolaget Lm Ericsson (Publ) Speech coding with comfort noise variability feature for increased fidelity
US6424938B1 (en) * 1998-11-23 2002-07-23 Telefonaktiebolaget L M Ericsson Complex signal activity detection for improved speech/noise classification of an audio signal
US6397177B1 (en) * 1999-03-10 2002-05-28 Samsung Electronics, Co., Ltd. Speech-encoding rate decision apparatus and method in a variable rate
CA2290037A1 (fr) * 1999-11-18 2001-05-18 Voiceage Corporation Dispositif amplificateur a lissage du gain et methode pour codecs de signaux audio et de parole a large bande
JP3761795B2 (ja) * 2000-04-10 2006-03-29 三菱電機株式会社 ディジタル回線多重化装置
US6889187B2 (en) * 2000-12-28 2005-05-03 Nortel Networks Limited Method and apparatus for improved voice activity detection in a packet voice network
US20030120484A1 (en) * 2001-06-12 2003-06-26 David Wong Method and system for generating colored comfort noise in the absence of silence insertion description packets
US20030112758A1 (en) * 2001-12-03 2003-06-19 Pang Jon Laurent Methods and systems for managing variable delays in packet transmission
EP1808852A1 (fr) * 2002-10-11 2007-07-18 Nokia Corporation Procédé d'interopération entre des codecs à large bande à haute vitesse adaptative (AMR-WB) et à large bande à débit binaire variable multimode (VMR-WB)
RU2331933C2 (ru) * 2002-10-11 2008-08-20 Нокиа Корпорейшн Способы и устройства управляемого источником широкополосного кодирования речи с переменной скоростью в битах
US7391768B1 (en) * 2003-05-13 2008-06-24 Cisco Technology, Inc. IPv4-IPv6 FTP application level gateway
US20050004793A1 (en) * 2003-07-03 2005-01-06 Pasi Ojala Signal adaptation for higher band coding in a codec utilizing band split coding
ES2634511T3 (es) * 2004-07-23 2017-09-28 Iii Holdings 12, Llc Aparato de codificación de audio y procedimiento de codificación de audio
US20060149536A1 (en) * 2004-12-30 2006-07-06 Dunling Li SID frame update using SID prediction error
CA2593247A1 (fr) * 2005-01-10 2006-11-16 Quartics, Inc. Architecture integree destinee au traitement unifie de medias visuels
CN100592389C (zh) * 2008-01-18 2010-02-24 华为技术有限公司 合成滤波器状态更新方法及装置
EP1897085B1 (fr) * 2005-06-18 2017-05-31 Nokia Technologies Oy Systeme et procede destines a la transmission adaptative de parametres de bruit de confort au cours d'une transmission vocale discontinue
US7546237B2 (en) * 2005-12-23 2009-06-09 Qnx Software Systems (Wavemakers), Inc. Bandwidth extension of narrowband speech
US8260609B2 (en) * 2006-07-31 2012-09-04 Qualcomm Incorporated Systems, methods, and apparatus for wideband encoding and decoding of inactive frames
US8725499B2 (en) * 2006-07-31 2014-05-13 Qualcomm Incorporated Systems, methods, and apparatus for signal change detection
US7796626B2 (en) * 2006-09-26 2010-09-14 Nokia Corporation Supporting a decoding of frames
CN101246688B (zh) * 2007-02-14 2011-01-12 华为技术有限公司 一种对背景噪声信号进行编解码的方法、系统和装置
US8032359B2 (en) * 2007-02-14 2011-10-04 Mindspeed Technologies, Inc. Embedded silence and background noise compression
KR101290622B1 (ko) * 2007-11-02 2013-07-29 후아웨이 테크놀러지 컴퍼니 리미티드 오디오 복호화 방법 및 장치
US8554550B2 (en) * 2008-01-28 2013-10-08 Qualcomm Incorporated Systems, methods, and apparatus for context processing using multi resolution analysis
CN101335000B (zh) * 2008-03-26 2010-04-21 华为技术有限公司 编码的方法及装置

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
"CODING OF SPEECH AT 8 KBIT/S USING CONJUGATE STRUCTURE ALGEBRAIC-CODE-EXCITED LINEAR-PREDICTION (CS-ACELP). ANNEX B: A SILENCE COMPRESSION SCHEME FOR G.729 OPTIMIZED FOR TERMINALS CONFORMING TO RECOMMENDATION V.70", ITU-T RECOMMENDATION G.729, XX, XX, 1 November 1996 (1996-11-01), pages COMPLETE23, XP002259964 *
"G.729 based Embedded Variable bit-rate coder: An 8-32 kbit/s scalable wideband coder bitstream interoperable with G.729; G.729.1 (05/06)", ITU-T DRAFT STUDY PERIOD 2005-2008, INTERNATIONAL TELECOMMUNICATION UNION, GENEVA ; CH, no. G.729.1 (05/06), 29 May 2006 (2006-05-29), XP017404590 *

Also Published As

Publication number Publication date
US20100318352A1 (en) 2010-12-16
DE102008009719A1 (de) 2009-08-20
KR101364983B1 (ko) 2014-02-20
RU2010138563A (ru) 2012-04-10
US20160035360A1 (en) 2016-02-04
JP2011512563A (ja) 2011-04-21
JP5361909B2 (ja) 2013-12-04
RU2461080C2 (ru) 2012-09-10
CN101952886B (zh) 2013-03-06
KR20100120217A (ko) 2010-11-12
WO2009103608A1 (fr) 2009-08-27
EP2245621A1 (fr) 2010-11-03
CN101952886A (zh) 2011-01-19
KR20120089378A (ko) 2012-08-09

Similar Documents

Publication Publication Date Title
EP2245621B1 (fr) Procédé et moyens d encodage d informations de bruit de fond
DE60117471T2 (de) Breitband-signalübertragungssystem
DE3688980T2 (de) Verfahren zur Multigeschwindigkeitskodierung von Signalen und Einrichtung zur Durchführung dieses Verfahrens.
AT405346B (de) Verfahren zum herleiten der nachwirkperiode in einem sprachdecodierer bei diskontinuierlicher übertragung, sowie sprachcodierer und sender-empfänger
DE69917677T2 (de) SPRACHKODIERUNG MIT VERäNDERBAREM KOMFORT-RAUSCHEN FüR VERBESSERTER WIEDERGABEQUALITäT
EP3217583B1 (fr) Décodeur et procédé de décodage d'une suite de paquets de données
DE60319590T2 (de) Verfahren zur codierung und decodierung von audio mit variabler rate
EP1953739B1 (fr) Procédé et dispositif pour réduire le bruit dans un signal décodé
DE69730721T2 (de) Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form
EP1338004A1 (fr) Procede et dispositif pour produire ou decoder un flux de donnees modulable compte tenu d'une banque de bits, codeur et codeur modulable
DE60224005T2 (de) Verfahren und vorrichtung zur verarbeitung von mehreren audiobitströmen
EP2245620B1 (fr) Procédé et moyens d'encodage d'informations de bruit de fond
EP1677286A1 (fr) Procédé pour l'adaptation de paramètres de génération de bruit de confort
EP1327243A1 (fr) Procede et dispositif pour produire un flux de donnees modulable et procede et dispositif pour decoder un flux de donnees modulable
EP2245622B1 (fr) Procédés et moyens pour décoder des informations de bruit de fond
EP0658874A1 (fr) Procédé et dispositif de circuit pour l'agrandissement de la largeur de signaux de langage à bande étroite
DE2303497C2 (de) Verfahren zur Übertragung von Sprachsignalen
EP1433166B1 (fr) Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite
EP1390946B1 (fr) Procede d'evaluation d'un parametre de codeur-decodeur
DE69834993T2 (de) Sprachübertragungssystem
DE69836454T2 (de) Kommunikationsnetzwerk zur übertragung von sprachsignalen
EP1834322B1 (fr) Procede de codage d'un signal analogique
DE19906223B4 (de) Verfahren und Funk-Kommunikationssystem zur Sprachübertragung, insbesondere für digitale Mobilkummunikationssysteme
CH680030A5 (fr)
DE102005000830A1 (de) Verfahren zur Bandbreitenerweiterung

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20100812

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

AX Request for extension of the european patent

Extension state: AL BA RS

RIN1 Information on inventor provided before grant (corrected)

Inventor name: SETIAWAN, PANJI

Inventor name: SCHANDL, STEFAN

Inventor name: TADDEI, HERVE

DAX Request for extension of the european patent (deleted)
RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: UNIFY GMBH & CO. KG

17Q First examination report despatched

Effective date: 20150706

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: UNIFY GMBH & CO. KG

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: EXAMINATION IS IN PROGRESS

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: GRANT OF PATENT IS INTENDED

INTG Intention to grant announced

Effective date: 20181129

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE PATENT HAS BEEN GRANTED

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO SE SI SK TR

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

Ref country code: AT

Ref legal event code: REF

Ref document number: 1127980

Country of ref document: AT

Kind code of ref document: T

Effective date: 20190515

REG Reference to a national code

Ref country code: DE

Ref legal event code: R096

Ref document number: 502009015751

Country of ref document: DE

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: GERMAN

REG Reference to a national code

Ref country code: NL

Ref legal event code: MP

Effective date: 20190501

REG Reference to a national code

Ref country code: LT

Ref legal event code: MG4D

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: HR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: NO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190801

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190901

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: LT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190802

Ref country code: BG

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190801

Ref country code: LV

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IS

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190901

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: EE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: CZ

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: RO

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: SK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

REG Reference to a national code

Ref country code: DE

Ref legal event code: R097

Ref document number: 502009015751

Country of ref document: DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

26N No opposition filed

Effective date: 20200204

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

REG Reference to a national code

Ref country code: BE

Ref legal event code: MM

Effective date: 20200229

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MC

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200202

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200229

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200229

REG Reference to a national code

Ref country code: DE

Ref legal event code: R082

Ref document number: 502009015751

Country of ref document: DE

Representative=s name: SCHAAFHAUSEN PATENTANWAELTE PARTNERSCHAFTSGESE, DE

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200202

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200229

REG Reference to a national code

Ref country code: AT

Ref legal event code: MM01

Ref document number: 1127980

Country of ref document: AT

Kind code of ref document: T

Effective date: 20200202

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20200202

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: MK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20190501

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20240216

Year of fee payment: 16

Ref country code: GB

Payment date: 20240222

Year of fee payment: 16

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20240222

Year of fee payment: 16