EP1677286A1 - Method for adapting comfort noise generation parameters - Google Patents
- Publication number
- EP1677286A1 (application EP05111976A)
- Authority
- EP
- European Patent Office
- Prior art keywords
- cng
- parameters
- qrc
- cnp
- coefficients
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/012—Comfort noise or silence coding
Definitions
- the present invention relates to a method for adapting comfort noise generation (CNG) parameters provided for generating a background noise signal in a telecommunication system consisting of a packet-oriented telecommunication network and at least one first and second communication device connected thereto.
- CNG comfort noise generation
- the CNG parameters are generated in the first communication device and inserted into at least one Silence Insertion Descriptor (SID) transmission frame, which is transmitted over the packet-oriented telecommunications network to the second communication device.
- SID Silence Insertion Descriptor
- VoIP Voice over Internet Protocol
- CDMA Code Division Multiple Access
- the receiving unit uses the CNG parameters to create an artificial, pleasant background noise ("comfort noise"). Many different techniques exist for producing CNG parameters and then reconstructing the background noise from them; they implement predefined and at least partially standardized protocols for the exchange of CNG parameters in both the transmitting unit and the receiving unit.
- a non-binding definition of such CNG parameters with regard to the transmission frame to be used, or the "comfort noise payload" transmitted in a data packet, is provided in standard G.711 Appendix II of the ITU Telecommunication Standardization Sector (ITU-T), which already stipulates that the "comfort noise payload" may include a parameter indicative of the noise signal level and a plurality of parameters indicative of background noise spectral characteristics in the form of filter coefficients.
- the essential advantage of the method according to the invention is that the transmitted CNG parameters are compared with a predetermined CNG parameter format and, in the event of a deviation, adapted to that format by removing individual CNG parameters and/or replacing faulty, missing, or incompatible CNG parameters with predetermined target CNG parameters.
- the predetermined CNG parameter format limits the number of transmitted CNG parameters to a maximum of 11 parameters, comprising one Quantized Energy Level parameter QEP and 10 QRC coefficients.
- the limitation of the number of parameters to a maximum of 11 parameters, of which 10 are formed as spectral parameters, allows the use of commercially available filter units and reduces both the hardware implementation costs and the computational effort within the telecommunications system.
- a telecommunication system 1, in particular a packet-oriented telecommunication system, is shown by way of a schematic structure diagram. It has a first communication device 2 and a second communication device 3, which are interconnected, for example, via a packet-oriented or IP-oriented communication network 4.
- the data transmission via the IP-oriented communication network 4 takes place here by means of data packets.
- the first and second communication device 2, 3 may be configured as gateway computer systems that are of different technical structure and to which, in turn, communication terminals such as an IP telephone or client computer systems, etc. are connected (not shown in the figures).
- a transmitting unit 5 is provided in the first communication device 2 by way of example and a receiving unit 6 is provided in the second communication device 3, which are designed to transmit the data packets via the IP-oriented communication network 4, for example according to the transmission standard G.711 of the ITU.
- the transmission standard G.726 of the ITU can be used.
- the transmission unit 5 has a "Voice Activity Detection" (VAD) unit 7, which is connected via a connecting line to, for example, an input 12. A data signal or voice data signal received at the input 12 is passed to the VAD unit 7, which detects whether the data signal lacks voice data to be transmitted. If no voice data is available, the VAD unit 7 generates a so-called "Silence Insertion Descriptor" (SID) transmission frame, which is further processed in the transmission unit 5 and subsequently transmitted to the receiving unit 6 of the second communication device 3. This process continues until voice data is again available in the transmission unit 5.
- a "Discontinuous Transmission" (DTX) unit 8 is provided in the transmission unit 5, which is likewise connected via connecting lines to the input 12 of the first communication device 2 and to the VAD unit 7. With the aid of the DTX unit 8, the SID transmission frames SID generated during a continuous speech pause are counted, and thus the frequency with which SID transmission frames SID are generated or transmitted during speech pauses is determined.
- DTX discontinuous transmission
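The counting performed by the DTX unit can be illustrated with a minimal sketch. It assumes the VAD decision is already available as one boolean per frame and that a SID frame is emitted on the first silent frame of a pause and then every `sid_interval` frames. Both the function name and the `sid_interval` parameter are hypothetical; the text does not state how often SID frames are sent.

```python
from typing import Iterable, List


def count_sid_frames_per_pause(voice_active: Iterable[bool],
                               sid_interval: int = 8) -> List[int]:
    """Count the SID frames generated in each continuous speech pause.

    voice_active: one VAD decision per frame (True = speech present).
    sid_interval: assumed spacing of SID frames inside a pause (hypothetical).
    """
    counts: List[int] = []
    silent_run = 0      # silent frames seen so far in the current pause
    sid_count = 0       # SID frames generated in the current pause
    in_pause = False
    for active in voice_active:
        if active:
            if in_pause:
                counts.append(sid_count)   # pause just ended
            in_pause = False
            silent_run = 0
            sid_count = 0
        else:
            # First silent frame and every sid_interval-th frame after it.
            if silent_run % sid_interval == 0:
                sid_count += 1
            silent_run += 1
            in_pause = True
    if in_pause:
        counts.append(sid_count)           # stream ended inside a pause
    return counts
```

For example, two speech frames followed by ten silent frames, one speech frame and three silent frames give one count per pause, from which the SID frequency during pauses can be derived.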
- the VAD unit 7 is connected via a connecting line to a first "Comfort Noise Generation" (CNG) unit 9, which is also connected via a further connecting line to the input 12.
- the background noise present in the speech pause is characterized by "Comfort Noise Generation" parameters CNP, which specify in particular the background noise volume by means of a "Quantized Energy Level" parameter QEP and, where appropriate, the spectral properties of the background noise by means of several "Quantized Reflection Coefficients" QRC.
- CNP Comfort Noise Generation parameters
- QEP Quantized Energy Level parameter
- QRC Quantized Reflection Coefficients
- the determined Comfort Noise Generation parameters CNP, i.e. the Quantized Energy Level parameter QEP and the Quantized Reflection Coefficients QRC, are inserted into the SID transmission frame SID.
- a first speech signal unit 10 is provided in the transmitting unit 5 of the first communication device 2, which is connected to the input 12 of the first communication device 2.
- a voice data signal received via the input 12 is coded and inserted into a user data transmission frame VP.
- the generated user data transmission frames VP and the generated SID transmission frames SID are then inserted into data packets (not shown) and transmitted via the IP-oriented telecommunications network 4.
- a multiplex unit 11 is connected via connecting lines, which packs the user data transmission frames VP or the SID transmission frames SID into at least one data packet and forwards it to the output E2 of the first communication device 2 for transmission over the IP-oriented telecommunications network 4.
- a demultiplexer unit 12 is connected, which reads out the transmission frames VP or SID contained in the received data packets and forwards them either to a connected second speech signal unit 13 or to a second "Comfort Noise Generation" (CNG) unit 14.
- the information contained in the SID transmission frame SID is read out and evaluated to generate a background noise.
- a control unit 15 and a memory unit 16 are provided in the receiving unit 6 by way of example, which are provided for controlling the CNG unit 14 and the second voice signal unit 13 as well as for storing data, in particular the last received "Comfort Noise Generation" parameters CNP.
- FIG. 2 shows by way of example the first byte indicating the "Quantized Energy Level" parameter QEP within the SID transmission frame SID.
- the noise signal level is given here in -dBov, where the values 0 to 127 represent levels from 0 to -127 dBov.
- 8 bits are provided which correspond to the first byte of the SID transmission frame SID. In this case, the bit at the zeroth bit position is set to the value 0, and the remaining first to seventh bits represent the actual noise signal level value, the "Most Significant Bit" (MSB) being provided at the first bit position.
- MSB Most Significant Bit
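As a minimal sketch of the level byte described above: the functions below map a level in dBov to the 7-bit magnitude and back. It is assumed that bit positions are counted from the most significant end of the byte, so that the "zeroth" bit is the top bit and stays 0. The function names are illustrative and not taken from the source.

```python
def encode_noise_level(level_dbov: float) -> int:
    """Map a noise level between 0 and -127 dBov to the first SID byte.

    The byte stores the magnitude (0..127); the top bit is always 0.
    Levels outside the representable range are clamped.
    """
    magnitude = round(-level_dbov)
    magnitude = max(0, min(127, magnitude))
    return magnitude & 0x7F


def decode_noise_level(first_byte: int) -> int:
    """Return the noise level in dBov encoded in the first SID byte."""
    return -(first_byte & 0x7F)
```

A level of -56 dBov is thus carried as the byte value 56, and any byte value from 0 to 127 decodes to a level between 0 and -127 dBov.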
- the "Quantized Reflection Coefficients" QRC are transmitted in the second to (M + 1)-th bytes within the SID transmission frame SID, the first QRC coefficient N_1 being transmitted in the second byte, the second QRC coefficient N_2 in the third byte, etc.
- the M-th QRC coefficient N_M is transmitted last, the number M of QRC coefficients QRC determining the order of the digital filter by which the background noise is formed from a Gaussian random signal or stochastic random noise signal.
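The text does not state how a reflection coefficient is mapped to its byte. The sketch below assumes the uniform 8-bit rule k = 258 * (N - 127) / 32768 for N = 0..254, as commonly given for the comfort noise payload of G.711 Appendix II and RFC 3389; this rule is an assumption here and should be checked against the standard. The function names are illustrative.

```python
def dequantize_reflection(index: int) -> float:
    """Convert a QRC byte N (0..254) to a reflection coefficient k.

    Assumed rule (not stated in the text): k = 258 * (N - 127) / 32768.
    """
    if not 0 <= index <= 254:
        raise ValueError("QRC index must be in 0..254")
    return 258.0 * (index - 127) / 32768.0


def quantize_reflection(k: float) -> int:
    """Convert a reflection coefficient k (-1 < k < 1) to a QRC byte."""
    index = round(k * 32768.0 / 258.0) + 127
    return max(0, min(254, index))
```

Under this assumption the byte value 127 corresponds to k = 0, and a frame with M such bytes drives a synthesis filter of order M.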
- the extracted CNG parameters CNP are analyzed by first splitting them into the Quantized Energy Level parameter QEP and the Quantized Reflection Coefficients QRC and then determining the number M of transmitted QRC coefficients N_1 to N_M.
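A minimal sketch of this splitting step, assuming the CNG parameters arrive as a byte string whose first byte is the QEP and whose remaining bytes are the QRC coefficients; the function name is hypothetical.

```python
from typing import List, Tuple


def split_cng_parameters(payload: bytes) -> Tuple[int, List[int], int]:
    """Split CNG parameters into QEP, the QRC list, and the count M."""
    if len(payload) == 0:
        raise ValueError("CNG payload must contain at least the QEP byte")
    qep = payload[0]            # first byte: Quantized Energy Level
    qrc = list(payload[1:])     # bytes 2..M+1: reflection coefficients
    return qep, qrc, len(qrc)
```

A four-byte payload therefore yields one QEP value and M = 3 coefficients.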
- the parameter values are checked byte by byte to determine whether they lie within the value range, and do not exceed the number of bytes, specified by a predetermined CNG parameter format CNPF.
- All remaining, i.e. non-compliant, CNG parameters CNP are first adjusted in a third step 19 to the predetermined CNG parameter format CNPF.
- standard filters for the generation of the background noise signal can advantageously be used, whereby the adaptation of the filter order of the filters provided in the different transmitting and receiving units is eliminated.
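The length adaptation can be sketched as follows, assuming the predetermined format allows at most 11 bytes (one QEP byte plus ten QRC bytes) and that surplus coefficients are simply dropped from the end. The names are illustrative, and the truncation rule is one plausible reading of "removing individual CNG parameters".

```python
MAX_CNG_BYTES = 11  # 1 QEP byte + 10 QRC bytes


def exceeds_format(payload: bytes) -> bool:
    """Return True if the payload carries more parameters than allowed."""
    return len(payload) > MAX_CNG_BYTES


def adapt_to_format(payload: bytes) -> bytes:
    """Drop surplus QRC bytes so that at most 11 bytes remain."""
    return payload[:MAX_CNG_BYTES]
```

With the order capped at ten, the receiving side can always use a fixed tenth-order filter regardless of what the sender transmitted.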
- in a fourth step 20, the CNG parameters CNP, which now comprise at most 11 bytes, are checked in terms of content, i.e. the Quantized Energy Level parameter QEP and the remaining QRC coefficients QRC are analyzed in greater detail and, for example, missing, incomplete, erroneous, or incompatible parameters are replaced by target CNG parameters SCNP.
- the target CNG parameters SCNP are taken from a "Set of Golden Parameters" SGP which is stored in the memory unit 16.
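A minimal sketch of the replacement step follows. The golden values used here are placeholders, since the actual Set of Golden Parameters is not given in the text. The validity ranges (QEP up to 127, QRC up to 254) and the choice to replace each bad or missing coefficient individually are likewise assumptions.

```python
# Hypothetical placeholder values; the real golden set is not disclosed here.
GOLDEN_QEP = 60
GOLDEN_QRC = [127] * 10   # byte 127 is assumed to mean "coefficient 0"


def replace_invalid_parameters(payload: bytes) -> bytes:
    """Return an 11-byte parameter set with bad entries replaced.

    A missing or out-of-range QEP or QRC byte is replaced by the
    corresponding golden value.
    """
    if len(payload) > 0 and payload[0] <= 127:
        qep = payload[0]
    else:
        qep = GOLDEN_QEP
    qrc = []
    for i in range(10):
        position = i + 1
        if position < len(payload) and payload[position] <= 254:
            qrc.append(payload[position])
        else:
            qrc.append(GOLDEN_QRC[i])
    return bytes([qep] + qrc)
```

The result always has the full 11-byte layout, so later stages need no special handling for short or damaged frames.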
- the "Set of Golden Parameters" SGP in a preferred embodiment comprises a golden Quantized Energy Level parameter GQEP and ten golden QRC coefficients GQRC, which were determined by extensive analyses of numerous test files containing standardized or trial speech samples. For this purpose, a spectral analysis of the speech samples was carried out after they had been subjected to high-pass filtering, window filtering, an autocorrelation function, and the Levinson-Durbin algorithm. The Set of Golden Parameters SGP is selected such that the generated background noise lies in a uniform frequency range between 900 and 3400 Hz, i.e. the received signal energy is distributed approximately evenly over that range. In particular, care was taken to ensure that only a few frequency components fall within the frequency range of 300 to 900 Hz, which produces a louder impression in human hearing.
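The last two stages of the analysis chain named above, autocorrelation followed by the Levinson-Durbin recursion, can be sketched as below. High-pass filtering and windowing are omitted, and the sign convention (prediction polynomial A(z) = 1 + a_1 z^-1 + ...) is an assumption, since conventions differ between codecs. This is a textbook form of the recursion, not necessarily the exact procedure used to derive the golden set.

```python
from typing import List, Sequence, Tuple


def autocorrelation(samples: Sequence[float], max_lag: int) -> List[float]:
    """Return r[0..max_lag] of the (already windowed) samples."""
    n = len(samples)
    return [sum(samples[i] * samples[i + lag] for i in range(n - lag))
            for lag in range(max_lag + 1)]


def levinson_durbin(r: Sequence[float],
                    order: int) -> Tuple[List[float], List[float]]:
    """Solve the normal equations for LPC and reflection coefficients.

    Returns (a, k) with a[0] == 1.0 and k[i-1] the i-th reflection
    coefficient, for A(z) = 1 + a[1] z^-1 + ... + a[order] z^-order.
    """
    a = [1.0] + [0.0] * order
    reflection: List[float] = []
    error = r[0]
    for i in range(1, order + 1):
        if error <= 0.0:
            raise ValueError("autocorrelation is not positive definite")
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / error
        reflection.append(k)
        previous = a[:]
        for j in range(1, i):
            a[j] = previous[j] + k * previous[i - j]
        a[i] = k
        error *= 1.0 - k * k      # remaining prediction error energy
    return a, reflection
```

For a first-order autoregressive autocorrelation such as r = [1, 0.5, 0.25], the recursion yields a first coefficient of -0.5 and a second coefficient of zero, as expected.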
- the thus adapted CNG parameters CNP * are then smoothed in a fifth step 21 with regard to the signal level of the background noise that can be generated thereby. This is done, for example, analogously to the method defined in ITU Standard G.711 Appendix II.
- the adapted QRC coefficients QRC* are converted into "Linear Prediction Coefficient" (LPC) coefficients LPC using the Levinson-Durbin algorithm.
- LPC Linear Prediction Coefficient
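The conversion from reflection coefficients to LPC coefficients corresponds to the step-up part of the Levinson-Durbin recursion. A minimal sketch, assuming dequantized coefficients and the convention A(z) = 1 + a_1 z^-1 + ... (sign conventions vary between codecs, so this is an assumption):

```python
from typing import List, Sequence


def reflection_to_lpc(reflection: Sequence[float]) -> List[float]:
    """Build LPC coefficients a[0..M] from M reflection coefficients.

    Each stage raises the order by one:
    a_new[j] = a_old[j] + k * a_old[i - j], with a_new[i] = k.
    """
    a = [1.0]
    for k in reflection:
        padded = a + [0.0]
        last = len(padded) - 1
        a = [padded[j] + k * padded[last - j] for j in range(len(padded))]
    return a
```

Ten coefficients cost only a few dozen multiply-add operations per SID frame with this recursion; storing precomputed LPC coefficients for the golden set avoids even that.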
- to conserve resources, golden LPC coefficients LPC, which are also stored in the memory unit 16, can be used directly in place of the golden QRC coefficients GQRC.
- a computationally intensive determination of the associated LPC coefficients LPC for the QRC coefficients QRC* taken from the Set of Golden Parameters SGP is thereby avoided.
- a Gaussian random signal is generated, which is subjected to a calibration.
- the generated Gaussian random signal is filtered through a filter unit configured with the determined LPC coefficients LPC, thereby generating the background noise signal that is superimposed on the speech data signal.
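The synthesis step can be sketched as an all-pole filter driven by scaled Gaussian noise. The code assumes LPC coefficients in the form a[0] = 1, a[1..M] for A(z) = 1 + a_1 z^-1 + ..., and full scale normalized to 1.0. Scaling only the excitation by the level in dBov is a simplification of the "calibration" mentioned above, not the patent's exact procedure; the function name and parameters are illustrative.

```python
import random
from typing import List, Optional, Sequence


def synthesize_comfort_noise(lpc: Sequence[float],
                             level_dbov: float,
                             num_samples: int,
                             seed: Optional[int] = None) -> List[float]:
    """Filter scaled Gaussian noise through the all-pole filter 1/A(z)."""
    rng = random.Random(seed)
    gain = 10.0 ** (level_dbov / 20.0)     # dBov to linear amplitude
    order = len(lpc) - 1
    history = [0.0] * order                # history[0] is y[n-1]
    output: List[float] = []
    for _ in range(num_samples):
        excitation = gain * rng.gauss(0.0, 1.0)
        # y[n] = e[n] - sum_{j=1..M} a[j] * y[n-j]
        y = excitation - sum(lpc[j] * history[j - 1]
                             for j in range(1, order + 1))
        output.append(y)
        if order:
            history = [y] + history[:-1]
    return output
```

Calling it with, for example, `[1.0, -0.5]`, a level of -40 dBov and 160 samples produces one 20 ms frame at 8 kHz of low-pass-coloured noise.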
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102004063290A DE102004063290A1 (de) | 2004-12-29 | 2004-12-29 | Verfahren zur Anpassung von Comfort Noise Generation Parametern |
Publications (1)
Publication Number | Publication Date |
---|---|
EP1677286A1 (fr) | 2006-07-05 |
Family
ID=36084186
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP05111976A Withdrawn EP1677286A1 (fr) | 2004-12-29 | 2005-12-12 | Method for adapting comfort noise generation parameters |
Country Status (4)
Country | Link |
---|---|
US (1) | US20060143001A1 (fr) |
EP (1) | EP1677286A1 (fr) |
CN (1) | CN1801327A (fr) |
DE (1) | DE102004063290A1 (fr) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7715372B2 (en) * | 2006-12-28 | 2010-05-11 | Verizon Services Organization Inc. | Method and system for inserting selected comfort signal |
US7688810B2 (en) * | 2006-12-28 | 2010-03-30 | Verizon Services Organization Inc. | Method and system for inserting comfort signal in reaction to events |
US7613175B2 (en) * | 2006-12-28 | 2009-11-03 | Verizon Services Organization Inc. | Method and system for inserting user defined comfort signal |
CN101303855B (zh) * | 2007-05-11 | 2011-06-22 | 华为技术有限公司 | 一种舒适噪声参数产生方法和装置 |
CN104156509A (zh) * | 2014-07-24 | 2014-11-19 | 西北工业大学 | 一种噪声合成方法 |
EP2980801A1 (fr) * | 2014-07-28 | 2016-02-03 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Procédé d'estimation de bruit dans un signal audio, estimateur de bruit, encodeur audio, décodeur audio et système de transmission de signaux audio |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP0843301A2 (fr) * | 1996-11-15 | 1998-05-20 | Nokia Mobile Phones Ltd. | Méthodes pour générer un bruit de confort durant une transmission discontinue |
WO2000031719A2 (fr) * | 1998-11-23 | 2000-06-02 | Telefonaktiebolaget Lm Ericsson (Publ) | Codage de la parole avec variabilite du bruit de confort pour une fidelite accrue |
WO2002043048A2 (fr) * | 2000-11-27 | 2002-05-30 | Nokia Corporation | Procede et systeme de generation de bruit de confort dans les communications telephoniques |
US20020120439A1 (en) * | 2001-02-28 | 2002-08-29 | Telefonaktiebolaget Lm Ericsson (Publ) | Method and apparatus for providing comfort noise in communication system with discontinuous transmission |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6269331B1 (en) * | 1996-11-14 | 2001-07-31 | Nokia Mobile Phones Limited | Transmission of comfort noise parameters during discontinuous transmission |
US7423983B1 (en) * | 1999-09-20 | 2008-09-09 | Broadcom Corporation | Voice and data exchange over a packet based network |
US20030120484A1 (en) * | 2001-06-12 | 2003-06-26 | David Wong | Method and system for generating colored comfort noise in the absence of silence insertion description packets |
- 2004-12-29: DE application DE102004063290A, published as DE102004063290A1 (de), status: not active, ceased
- 2005-12-12: EP application EP05111976A, published as EP1677286A1 (fr), status: not active, withdrawn
- 2005-12-29: US application US11/321,482, published as US20060143001A1 (en), status: not active, abandoned
- 2005-12-29: CN application CNA2005101074003A, published as CN1801327A (zh), status: active, pending
Non-Patent Citations (1)
Title |
---|
ITU: "G.711 Appendix II: A comfort noise payload definition for ITU-T G.711 use in packet based multimedia communication systems", ITU-T RECOMMENDATION G.711 APPENDIX II, February 2000 (2000-02-01), pages 1 - 11, XP002315190 * |
Also Published As
Publication number | Publication date |
---|---|
CN1801327A (zh) | 2006-07-12 |
DE102004063290A1 (de) | 2006-07-13 |
US20060143001A1 (en) | 2006-06-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE69631318T2 (de) | Verfahren und Vorrichtung zur Erzeugung von Hintergrundrauschen in einem digitalen Übertragungssystem | |
EP2245621B1 (fr) | Procédé et moyens d encodage d informations de bruit de fond | |
DE60012860T2 (de) | Verfahren zur Verarbeitung mehrerer digitaler Audiodatenströme | |
DE60034484T2 (de) | Verfahren und vorrichtung in einem kommunikationssystem | |
DE69910240T2 (de) | Vorrichtung und verfahren zur wiederherstellung des hochfrequenzanteils eines überabgetasteten synthetisierten breitbandsignals | |
DE60129327T2 (de) | Anordnung und verfahren in bezug auf die vermittlung von sprache | |
DE60029147T2 (de) | Qualitätsverbesserung eines audiosignals in einem digitalen netzwerk | |
DE69738401T2 (de) | Fehlerverdeckung in einem digitalen audioempfänger | |
DE60117471T2 (de) | Breitband-signalübertragungssystem | |
EP1388147B1 (fr) | Procede d'agrandissement de la largeur de bande d'un signal vocal filtre en bande etroite, en particulier d'un signal vocal emis par un appareil de telecommunication | |
DE60319590T2 (de) | Verfahren zur codierung und decodierung von audio mit variabler rate | |
DE60122751T2 (de) | Verfahren und vorrichtung für die objektive bewertung der sprachqualität ohne referenzsignal | |
WO2007073949A1 (fr) | Procede et dispositif pour elargir artificiellement la largeur de bande de signaux vocaux | |
EP0978172B1 (fr) | Procede de masquage des erreurs dans un courant de donnees audio | |
EP1677286A1 (fr) | Procédé pour l'adaptation de paramètres de génération de bruit de confort | |
DE3009677A1 (de) | Verfahren zur erkennung von sprache und sprachpausen | |
EP1023777B1 (fr) | Procede et dispositif pour limiter un courant de donnees audio dont le debit binaire peut etre mis a l'echelle | |
EP2245620B1 (fr) | Procédé et moyens d'encodage d'informations de bruit de fond | |
EP0135229A2 (fr) | Procédé et circuit pour la réalisation de connexions en conférence dans un système de commutation | |
EP0658874A1 (fr) | Procédé et dispositif de circuit pour l'agrandissement de la largeur de signaux de langage à bande étroite | |
DE60225505T2 (de) | Verfahren und Vorrichtung zur Analyse von Prädiktionsparametern | |
EP2044588A2 (fr) | Procédé et système de réduction de la réception de messages indésirables | |
DE60210597T2 (de) | Vorrichtung zur adpcdm sprachkodierung mit spezifischer anpassung der schrittwerte | |
EP1433166B1 (fr) | Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite | |
EP1390946B1 (fr) | Procede d'evaluation d'un parametre de codeur-decodeur |
Legal Events
Code | Title | Description |
---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase | Free format text: ORIGINAL CODE: 0009012 |
AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
AX | Request for extension of the european patent | Extension state: AL BA HR MK YU |
17P | Request for examination filed | Effective date: 20061214 |
17Q | First examination report despatched | Effective date: 20070126 |
AKX | Designation fees paid | Designated state(s): DE FR GB IT |
RBV | Designated contracting states (corrected) | Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
RAP1 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG |
RAP3 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: NOKIA SIEMENS NETWORKS S.P.A. |
RAP3 | Party data changed (applicant data changed or rights of an application transferred) | Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG |
STAA | Information on the status of an ep patent application or granted ep patent | Free format text: STATUS: THE APPLICATION HAS BEEN WITHDRAWN |
18W | Application withdrawn | Effective date: 20080311 |