EP1433166B1 - Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite - Google Patents

Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite Download PDF

Info

Publication number
EP1433166B1
EP1433166B1 EP01978183A EP01978183A EP1433166B1 EP 1433166 B1 EP1433166 B1 EP 1433166B1 EP 01978183 A EP01978183 A EP 01978183A EP 01978183 A EP01978183 A EP 01978183A EP 1433166 B1 EP1433166 B1 EP 1433166B1
Authority
EP
European Patent Office
Prior art keywords
speech
extender
wideband
speech signal
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
EP01978183A
Other languages
German (de)
English (en)
Other versions
EP1433166B8 (fr
EP1433166A1 (fr
Inventor
Stefano Ambrosius Klinke
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Solutions and Networks GmbH and Co KG
Original Assignee
Nokia Siemens Networks GmbH and Co KG
Nokia Solutions and Networks SpA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Siemens Networks GmbH and Co KG, Nokia Solutions and Networks SpA filed Critical Nokia Siemens Networks GmbH and Co KG
Publication of EP1433166A1 publication Critical patent/EP1433166A1/fr
Application granted granted Critical
Publication of EP1433166B1 publication Critical patent/EP1433166B1/fr
Publication of EP1433166B8 publication Critical patent/EP1433166B8/fr
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • speech encoders For compressing the data transmission rate in speech signals, speech encoders, also referred to as speech codecs, are used. They are mainly used in mobile radio systems.
  • GSM Global System for Mobile Communication
  • speech encoders that work according to the Linear Predictive Coding (LPC) method.
  • LPC Linear Predictive Coding
  • a sampling rate of the speech signal of 8 kHz results in a data rate of 104 kbit / s at a resolution of 13 bits.
  • this data rate is reduced to a constant 13 KBit / s (so-called code rate) by means of LPC.
  • full-rate codecs or improved full-rate codecs are used, for example, in GSM.
  • a half-rate codec Half Rate Codec
  • the bit rate can be greatly reduced with correspondingly reduced voice quality, namely to 5.6 kbit / s.
  • UMTS Universal Mobile Telecommunications System
  • voice encoders which can encode voice signals with a variable bit rate.
  • One such speech coder is, for example, the Adaptive Multirate (AMR) speech coder, which allows encoding at various bit rates. It was designed for GSM cellular systems, but is also intended for use in UMTS mobile radio systems are used as standard speech coders.
  • AMR Adaptive Multirate
  • the bit rate can be adapted to the bandwidth available for transmitting the coded speech signal. If sufficient bandwidth is available for transmission, the speech signal is coded at a high bit rate. This is also referred to as broadband coding. Otherwise, ie with a low bandwidth, is coded with a low bit rate (narrowband coding).
  • the adaptation of the bit rate can take place during the transmission of a speech signal.
  • the bandwidth of a transmission channel in the form of the available bit rate is continuously measured. If the available bit rate drops below a predetermined threshold during a transmission of the speech signal, the coding is switched so that the speech signal is narrow-band coded.
  • a broadband coding takes place, for example, at a sampling frequency of about 16 kHz, while a narrow-band coding takes place at a sampling frequency of 8 kHz.
  • a voice frequency range up to 8 kHz in the second case up to 4 kHz is covered.
  • the problem is caused by the switching of the bit rate fluctuation of the signal quality and the associated quality variation of a communication link. Due to the predetermined threshold, the switching takes place relatively abruptly, so that the quality of the connection can suddenly drop during a conversation.
  • US-A-5 581 652 discloses a speech extender which, in a training phase, generates codebooks with which narrowband speech signals are converted into broadband in an application phase.
  • the speech quality of a speech extender is to be further improved. Furthermore, a method for estimating a wideband speech signal based on a narrow-band speech signal is to be specified, which enables an improved speech quality.
  • the core of the invention is to make an adaptation to a communication terminal and / or to a speaker during a voice signal transmission. As a result, the voice quality can be improved again compared to known methods and speech extender.
  • the invention relates to a speech extender which is adapted to estimate a wideband speech signal based on a narrow-band speech signal. Further it is adaptive such that it adapts a codebook to a communication terminal and / or to a speaker. The adaptation takes place during a voice transmission. This allows the Konnier to constantly adapt to the remote party.
  • the speech extender analyzes and stores at least one speech parameter and uses it for adaptation.
  • the at least one speech parameter is a wideband speech parameter that occurs during a speech transmission.
  • the at least one speech parameter can be speaker and / or communication terminal-specific.
  • the speech extender can be used in various mobile phones and adapt to their acoustic properties. Furthermore, it can address different users, i. adapt to their acoustic properties such as different speech frequency spectra.
  • speech parameters therefore preferably characteristic acoustic properties of the communication terminal and / or the speaker are used, such as frequency characteristics, attenuation of certain frequencies or frequency ranges and the frequency spectrum of the voice of the speaker.
  • Such speech parameters can be determined in particular by measurements during a speech transmission.
  • the speech extender makes estimates by evaluating at least one stored speech parameter.
  • different speech parameters can be used for the adaptation. These are stored after their determination and are therefore available for adaptation at any time. It would also be conceivable to constantly update the stored speech parameters in order to always be optimally adapted to the current acoustic conditions.
  • the speech extender can be used in a speech coder of a mobile and / or base station designed for a third generation mobile radio system.
  • the third generation mobile radio system may be UMTS.
  • the speech extender is preferably implemented in hardware, in particular in an integrated circuit, and / or in software.
  • An implementation in hardware offers the advantage that the speech extender can be integrated on a chip together with other essential circuit elements of the mobile radio terminal. For example, a chip manufacturer may offer such language extenders to mobile terminal manufacturers.
  • an implementation in software offers the advantage of easier changeability of the speech extender, and above all of the subsequent change, in particular if the software of the speech extender is stored in an erasable and rewritable memory such as an EEPROM.
  • the invention relates to a method for estimating a wideband speech signal based on a narrow-band speech signal. According to the method, an adaptation of a codebook to a communication terminal and / or to a speaker is carried out during estimation.
  • the method can be advantageously used in a speech coder of a mobile and / or base station, the or are designed for a third generation mobile radio system, in particular UMTS.
  • the mobile station is a mobile radio terminal and the method is implemented in hardware, in particular in an integrated circuit, and / or at least partially in software.
  • a broadband excitation signal and broadband filter coefficients are required for the synthesis filter in the speech coder. Since usually only the narrow-band excitation signal and the narrow-band filter coefficients are known, it is necessary to carry out a transformation from "narrowband" to "broadband". This is done by means of a broadband speech extender.
  • the excitation signal can be extended, for example, by a non-linear signal processing. Another possibility is to superimpose the excitation signal with white noise.
  • the filter coefficients can be estimated by using two codebooks.
  • the entries of the codebooks represent possible sets of filter coefficients.
  • a narrowband and a wideband codebook are trained. Since they are trained simultaneously with the same excitation signal (once narrowband and once wideband), the relationship between the entries of both codebooks is known. For example, entry 1 of the narrow-band codebook corresponds to the entry 2 of the wideband codebook.
  • the narrow-band filter coefficients are calculated from the narrow-band speech. These coefficients are compared with the entries of the narrow-band coefficient codebook and the best matching entry is chosen. Since, as already mentioned above, the relationship between the codebooks is known, the optimal filter coefficients for the speech synthesis filter of the wideband speech extender are estimated in this way.
  • the speech extender according to the invention achieves a further improvement of the speech quality. It can be used particularly advantageously in all communication systems in which variable bit rate speech coders can be used, which can code both narrowband and broadband, for example in the case of UMTS.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Mobile Radio Communication Systems (AREA)

Claims (10)

  1. Dispositif d'extension vocale exécuté de telle sorte qu'il évalue un signal vocal à large bande au moyen d'un signal vocal à bande étroite, caractérisé en ce qu'il adapte un dictionnaire de manière adaptative à un terminal de communication et/ou à un locuteur pendant une transmission vocale à large bande et comprend des moyens grâce auxquels il analyse, enregistre et utilise pour l'adaptation au moins un paramètre vocal de large bande intervenant pendant la transmission vocale.
  2. Dispositif d'extension vocale selon la revendication 1, caractérisé en ce que le au moins un paramètre vocal est un paramètre spécifique à un locuteur et/ou à un terminal de communication.
  3. Dispositif d'extension vocale selon la revendication 2, caractérisé en ce qu'il procède à une évaluation en exploitant au moins un paramètre vocal enregistré.
  4. Dispositif d'extension vocale selon l'une des revendications précédentes, caractérisé en ce qu'il est utilisé dans un codeur vocal d'une station mobile et/ou de base exécutée resp. exécutées pour un système de radiotéléphonie mobile de troisième génération, notamment l'UMTS.
  5. Dispositif d'extension vocale selon la revendication 4, caractérisé en ce que la station mobile est un terminal de radiotéléphonie mobile et le dispositif d'extension vocale est réalisé sous forme matérielle, notamment dans un circuit intégré, et/ou au moins partiellement sous forme logicielle.
  6. Procédé d'évaluation d'un signal vocal à large bande au moyen d'un signal vocal à bande étroite, caractérisé en ce qu'une adaptation adaptative d'un dictionnaire à un terminal de communication et/ou à un locuteur est exécutée lors d'une transmission vocale à large bande et au moins un paramètre vocal de large bande intervenant pendant la transmission vocale est analysé, enregistré et utilisé pour l'adaptation.
  7. Procédé selon la revendication 6, caractérisé en ce que le au moins un paramètre vocal est un paramètre spécifique à un locuteur et/ou à un terminal de communication.
  8. Procédé selon la revendication 7, caractérisé en ce que l'évaluation se fait en exploitant au moins un paramètre vocal enregistré.
  9. Procédé selon l'une des revendications 6 à 8, caractérisé en ce qu'il est utilisé dans un codeur vocal d'une station mobile et/ou de base exécutée resp. exécutées pour un système de radiotéléphonie mobile de troisième génération, notamment l'UMTS.
  10. Procédé selon la revendication 9, caractérisé en ce que la station mobile est un terminal de radiotéléphonie mobile et le procédé est exécuté dans un matériel, notamment dans un circuit intégré, et/ou au moins partiellement dans un logiciel.
EP01978183A 2001-09-28 2001-09-28 Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite Expired - Lifetime EP1433166B8 (fr)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/DE2001/003729 WO2003036623A1 (fr) 2001-09-28 2001-09-28 Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite

Publications (3)

Publication Number Publication Date
EP1433166A1 EP1433166A1 (fr) 2004-06-30
EP1433166B1 true EP1433166B1 (fr) 2007-11-14
EP1433166B8 EP1433166B8 (fr) 2008-01-02

Family

ID=5648296

Family Applications (1)

Application Number Title Priority Date Filing Date
EP01978183A Expired - Lifetime EP1433166B8 (fr) 2001-09-28 2001-09-28 Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite

Country Status (5)

Country Link
US (1) US20040243400A1 (fr)
EP (1) EP1433166B8 (fr)
CN (1) CN100403401C (fr)
DE (1) DE50113277D1 (fr)
WO (1) WO2003036623A1 (fr)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004090870A1 (fr) * 2003-04-04 2004-10-21 Kabushiki Kaisha Toshiba Procede et dispositif pour le codage ou le decodage de signaux audio large bande
US8818797B2 (en) 2010-12-23 2014-08-26 Microsoft Corporation Dual-band speech encoding
KR102244612B1 (ko) * 2014-04-21 2021-04-26 삼성전자주식회사 무선 통신 시스템에서 음성 데이터를 송신 및 수신하기 위한 장치 및 방법
US9837089B2 (en) * 2015-06-18 2017-12-05 Qualcomm Incorporated High-band signal generation
US10847170B2 (en) 2015-06-18 2020-11-24 Qualcomm Incorporated Device and method for generating a high-band signal from non-linearly processed sub-ranges

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4311877A (en) * 1979-12-19 1982-01-19 Kahn Leonard R Method and means for improving the reliability of systems that transmit relatively wideband signals over two or more relatively narrowband transmission circuits
US4330689A (en) * 1980-01-28 1982-05-18 The United States Of America As Represented By The Secretary Of The Navy Multirate digital voice communication processor
EP0474812B1 (fr) * 1990-03-08 1995-09-20 Telefonaktiebolaget L M Ericsson Systeme et procede d'affectation dynamique de numeros d'acheminement d'appels a des abonnes mobiles
JP2779886B2 (ja) * 1992-10-05 1998-07-23 日本電信電話株式会社 広帯域音声信号復元方法
US5455888A (en) * 1992-12-04 1995-10-03 Northern Telecom Limited Speech bandwidth extension method and apparatus
WO1995002288A1 (fr) * 1993-07-07 1995-01-19 Picturetel Corporation Reduction de bruits de fond pour l'amelioration de la qualite de voix
US5668837A (en) * 1993-10-14 1997-09-16 Ericsson Inc. Dual-mode radio receiver for receiving narrowband and wideband signals
DE69619284T3 (de) * 1995-03-13 2006-04-27 Matsushita Electric Industrial Co., Ltd., Kadoma Vorrichtung zur Erweiterung der Sprachbandbreite
US5706335A (en) * 1995-04-10 1998-01-06 Corporate Computer Systems Method and appartus for transmitting coded audio signals through a transmission channel with limited bandwidth
US5806025A (en) * 1996-08-07 1998-09-08 U S West, Inc. Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank
US5901145A (en) * 1997-02-28 1999-05-04 Telefonaktiebolaget L M Ericsson (Publ) Mobile station handoff between a spread spectrum communications system and a frequency division communications system
DE19804581C2 (de) * 1998-02-05 2000-08-17 Siemens Ag Verfahren und Funk-Kommunikationssystem zur Übertragung von Sprachinformation
EP0945852A1 (fr) * 1998-03-25 1999-09-29 BRITISH TELECOMMUNICATIONS public limited company Synthèse de la parole
GB2357682B (en) * 1999-12-23 2004-09-08 Motorola Ltd Audio circuit and method for wideband to narrowband transition in a communication device
US6704711B2 (en) * 2000-01-28 2004-03-09 Telefonaktiebolaget Lm Ericsson (Publ) System and method for modifying speech signals
CN1235192C (zh) * 2001-06-28 2006-01-04 皇家菲利浦电子有限公司 传输系统以及用于接收窄带音频信号的接收机和方法

Also Published As

Publication number Publication date
DE50113277D1 (de) 2007-12-27
EP1433166B8 (fr) 2008-01-02
EP1433166A1 (fr) 2004-06-30
US20040243400A1 (en) 2004-12-02
WO2003036623A1 (fr) 2003-05-01
CN1630896A (zh) 2005-06-22
CN100403401C (zh) 2008-07-16

Similar Documents

Publication Publication Date Title
EP1388147B1 (fr) Procede d'agrandissement de la largeur de bande d'un signal vocal filtre en bande etroite, en particulier d'un signal vocal emis par un appareil de telecommunication
DE102008016502B4 (de) Verfahren zur Datenübermittlung über einen Sprachkanal eines drahtlosen Kommunikationsnetzes unter Verwendung einer kontinuierlichen Signalmodulation
DE69727895T2 (de) Verfahren und Vorrichtung zur Sprachkodierung
EP2245621B1 (fr) Procédé et moyens d encodage d informations de bruit de fond
DE69730721T2 (de) Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form
WO2007073949A1 (fr) Procede et dispositif pour elargir artificiellement la largeur de bande de signaux vocaux
DE69820362T2 (de) Nichtlinearer Filter zur Geräuschunterdrückung in linearen Prädiktions-Sprachkodierungs-Vorrichtungen
EP1433166B1 (fr) Dispositif d'extension vocale et procede pour evaluer un signal vocal a large bande au moyen d'un signal vocal a bande etroite
DE4211945C1 (fr)
DE4343366C2 (de) Verfahren und Schaltungsanordnung zur Vergrößerung der Bandbreite von schmalbandigen Sprachsignalen
EP1677286A1 (fr) Procédé pour l'adaptation de paramètres de génération de bruit de confort
EP1430674A1 (fr) Dispositif et procede de suppression de signaux parasites periodiques
DE10252070B4 (de) Kommunikationsendgerät mit parametrierter Bandbreitenerweiterung und Verfahren zur Bandbreitenerweiterung dafür
DE60210597T2 (de) Vorrichtung zur adpcdm sprachkodierung mit spezifischer anpassung der schrittwerte
EP2245622B1 (fr) Procédés et moyens pour décoder des informations de bruit de fond
EP1561205A1 (fr) Procede pour elargir la bande passante d'un signal vocal filtre sur une bande etroite
WO2002058055A1 (fr) Procede et dispositif pour convertir en signaux vocaux des signaux vocaux a codage parametrique de differentes largeurs de bandes
EP1390947B1 (fr) Procede pour la reception de signaux
DE4236315C1 (de) Verfahren zur Sprachcodierung
DE19906223B4 (de) Verfahren und Funk-Kommunikationssystem zur Sprachübertragung, insbesondere für digitale Mobilkummunikationssysteme
WO2006072526A1 (fr) Procede d'extension de bande passante
DE10136491B4 (de) Verfahren und Vorrichtung zur Verbesserung der Sprachqualität auf transparenten Telekommunikations-Übertragungswegen
WO2006072519A1 (fr) Procede de codage d'un signal analogique

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20040212

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

RIN1 Information on inventor provided before grant (corrected)

Inventor name: KLINKE, STEFANO, AMBROSIUS

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB

RAP3 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA SIEMENS NETWORKS S.P.A.

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

RAP4 Party data changed (patent owner data changed or rights of a patent transferred)

Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG

REF Corresponds to:

Ref document number: 50113277

Country of ref document: DE

Date of ref document: 20071227

Kind code of ref document: P

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20080131

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20080815

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20080912

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20080918

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20080919

Year of fee payment: 8

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20090928

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20100531

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090930

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100401

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090928