EP1433166B1 - Speech extender and method for estimating a broadband speech signal from a narrowband speech signal - Google Patents

Speech extender and method for estimating a broadband speech signal from a narrowband speech signal Download PDF

Info

Publication number
EP1433166B1
EP1433166B1 EP01978183A EP01978183A EP1433166B1 EP 1433166 B1 EP1433166 B1 EP 1433166B1 EP 01978183 A EP01978183 A EP 01978183A EP 01978183 A EP01978183 A EP 01978183A EP 1433166 B1 EP1433166 B1 EP 1433166B1
Authority
EP
European Patent Office
Prior art keywords
speech
extender
wideband
speech signal
signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
EP01978183A
Other languages
German (de)
French (fr)
Other versions
EP1433166A1 (en
EP1433166B8 (en
Inventor
Stefano Ambrosius Klinke
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nokia Solutions and Networks GmbH and Co KG
Original Assignee
Nokia Siemens Networks GmbH and Co KG
Nokia Solutions and Networks SpA
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nokia Siemens Networks GmbH and Co KG, Nokia Solutions and Networks SpA filed Critical Nokia Siemens Networks GmbH and Co KG
Publication of EP1433166A1 publication Critical patent/EP1433166A1/en
Publication of EP1433166B1 publication Critical patent/EP1433166B1/en
Application granted granted Critical
Publication of EP1433166B8 publication Critical patent/EP1433166B8/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Definitions

  • speech encoders For compressing the data transmission rate in speech signals, speech encoders, also referred to as speech codecs, are used. They are mainly used in mobile radio systems.
  • GSM Global System for Mobile Communication
  • speech encoders that work according to the Linear Predictive Coding (LPC) method.
  • LPC Linear Predictive Coding
  • a sampling rate of the speech signal of 8 kHz results in a data rate of 104 kbit / s at a resolution of 13 bits.
  • this data rate is reduced to a constant 13 KBit / s (so-called code rate) by means of LPC.
  • full-rate codecs or improved full-rate codecs are used, for example, in GSM.
  • a half-rate codec Half Rate Codec
  • the bit rate can be greatly reduced with correspondingly reduced voice quality, namely to 5.6 kbit / s.
  • UMTS Universal Mobile Telecommunications System
  • voice encoders which can encode voice signals with a variable bit rate.
  • One such speech coder is, for example, the Adaptive Multirate (AMR) speech coder, which allows encoding at various bit rates. It was designed for GSM cellular systems, but is also intended for use in UMTS mobile radio systems are used as standard speech coders.
  • AMR Adaptive Multirate
  • the bit rate can be adapted to the bandwidth available for transmitting the coded speech signal. If sufficient bandwidth is available for transmission, the speech signal is coded at a high bit rate. This is also referred to as broadband coding. Otherwise, ie with a low bandwidth, is coded with a low bit rate (narrowband coding).
  • the adaptation of the bit rate can take place during the transmission of a speech signal.
  • the bandwidth of a transmission channel in the form of the available bit rate is continuously measured. If the available bit rate drops below a predetermined threshold during a transmission of the speech signal, the coding is switched so that the speech signal is narrow-band coded.
  • a broadband coding takes place, for example, at a sampling frequency of about 16 kHz, while a narrow-band coding takes place at a sampling frequency of 8 kHz.
  • a voice frequency range up to 8 kHz in the second case up to 4 kHz is covered.
  • the problem is caused by the switching of the bit rate fluctuation of the signal quality and the associated quality variation of a communication link. Due to the predetermined threshold, the switching takes place relatively abruptly, so that the quality of the connection can suddenly drop during a conversation.
  • US-A-5 581 652 discloses a speech extender which, in a training phase, generates codebooks with which narrowband speech signals are converted into broadband in an application phase.
  • the speech quality of a speech extender is to be further improved. Furthermore, a method for estimating a wideband speech signal based on a narrow-band speech signal is to be specified, which enables an improved speech quality.
  • the core of the invention is to make an adaptation to a communication terminal and / or to a speaker during a voice signal transmission. As a result, the voice quality can be improved again compared to known methods and speech extender.
  • the invention relates to a speech extender which is adapted to estimate a wideband speech signal based on a narrow-band speech signal. Further it is adaptive such that it adapts a codebook to a communication terminal and / or to a speaker. The adaptation takes place during a voice transmission. This allows the Konnier to constantly adapt to the remote party.
  • the speech extender analyzes and stores at least one speech parameter and uses it for adaptation.
  • the at least one speech parameter is a wideband speech parameter that occurs during a speech transmission.
  • the at least one speech parameter can be speaker and / or communication terminal-specific.
  • the speech extender can be used in various mobile phones and adapt to their acoustic properties. Furthermore, it can address different users, i. adapt to their acoustic properties such as different speech frequency spectra.
  • speech parameters therefore preferably characteristic acoustic properties of the communication terminal and / or the speaker are used, such as frequency characteristics, attenuation of certain frequencies or frequency ranges and the frequency spectrum of the voice of the speaker.
  • Such speech parameters can be determined in particular by measurements during a speech transmission.
  • the speech extender makes estimates by evaluating at least one stored speech parameter.
  • different speech parameters can be used for the adaptation. These are stored after their determination and are therefore available for adaptation at any time. It would also be conceivable to constantly update the stored speech parameters in order to always be optimally adapted to the current acoustic conditions.
  • the speech extender can be used in a speech coder of a mobile and / or base station designed for a third generation mobile radio system.
  • the third generation mobile radio system may be UMTS.
  • the speech extender is preferably implemented in hardware, in particular in an integrated circuit, and / or in software.
  • An implementation in hardware offers the advantage that the speech extender can be integrated on a chip together with other essential circuit elements of the mobile radio terminal. For example, a chip manufacturer may offer such language extenders to mobile terminal manufacturers.
  • an implementation in software offers the advantage of easier changeability of the speech extender, and above all of the subsequent change, in particular if the software of the speech extender is stored in an erasable and rewritable memory such as an EEPROM.
  • the invention relates to a method for estimating a wideband speech signal based on a narrow-band speech signal. According to the method, an adaptation of a codebook to a communication terminal and / or to a speaker is carried out during estimation.
  • the method can be advantageously used in a speech coder of a mobile and / or base station, the or are designed for a third generation mobile radio system, in particular UMTS.
  • the mobile station is a mobile radio terminal and the method is implemented in hardware, in particular in an integrated circuit, and / or at least partially in software.
  • a broadband excitation signal and broadband filter coefficients are required for the synthesis filter in the speech coder. Since usually only the narrow-band excitation signal and the narrow-band filter coefficients are known, it is necessary to carry out a transformation from "narrowband" to "broadband". This is done by means of a broadband speech extender.
  • the excitation signal can be extended, for example, by a non-linear signal processing. Another possibility is to superimpose the excitation signal with white noise.
  • the filter coefficients can be estimated by using two codebooks.
  • the entries of the codebooks represent possible sets of filter coefficients.
  • a narrowband and a wideband codebook are trained. Since they are trained simultaneously with the same excitation signal (once narrowband and once wideband), the relationship between the entries of both codebooks is known. For example, entry 1 of the narrow-band codebook corresponds to the entry 2 of the wideband codebook.
  • the narrow-band filter coefficients are calculated from the narrow-band speech. These coefficients are compared with the entries of the narrow-band coefficient codebook and the best matching entry is chosen. Since, as already mentioned above, the relationship between the codebooks is known, the optimal filter coefficients for the speech synthesis filter of the wideband speech extender are estimated in this way.
  • the speech extender according to the invention achieves a further improvement of the speech quality. It can be used particularly advantageously in all communication systems in which variable bit rate speech coders can be used, which can code both narrowband and broadband, for example in the case of UMTS.

Description

Die Erfindung betrifft einen Sprachextender nach dem Oberbegriff von Anspruch 1 und ein Verfahren zum Schätzen eines breitbandigen Sprachsignals anhand eines schmalbandigen Sprachsignals nach dem Oberbegriff von Anspruch 6.The invention relates to a speech extender according to the preamble of claim 1 and to a method for estimating a wideband speech signal using a narrow-band speech signal according to the preamble of claim 6.

Zur Komprimierung der Datenübertragungsrate bei Sprachsignalen werden Sprachkodierer, auch als Sprach-Codecs bezeichnet, eingesetzt. Sie werden vor allem in Mobilfunksystemen eingesetzt. In GSM-Mobilfunksystemen, (Global System for Mobile Communication) werden Sprachkodierer verwendet, die nach der Methode des Linear Predictive Coding (LPC) arbeiten. Hierbei wird im Gegensatz zu einer Signalformkodierung nicht das Sprachsignal selber, sondern sein Entstehungsprozess im menschlichen Sprechtrakt nachgebildet. Bei einer Abtastrate des Sprachsignals von 8 kHz ergibt sich bei einer Auflösung von 13 Bit eine Datenrate von 104 KBit/s. Bei GSM wird mittels LPC diese Datenrate auf konstante 13 KBit/s (sogenannte Coderate) reduziert. Hierzu werden beispielsweise bei GSM Vollraten- (Full rate Codec) oder verbesserte Vollraten-Codecs (Enhanced Full rate Codec) eingesetzt. Mit einem Halbraten-Codec (Half rate Codec) lässt sich die Bitrate bei entsprechend verminderter Sprachqualität stark reduzieren, nämlich auf 5,6 KBit/s.For compressing the data transmission rate in speech signals, speech encoders, also referred to as speech codecs, are used. They are mainly used in mobile radio systems. GSM (Global System for Mobile Communication) mobile systems use speech encoders that work according to the Linear Predictive Coding (LPC) method. Here, in contrast to a waveform coding not the speech signal itself, but its development process is reproduced in the human speech tract. At a sampling rate of the speech signal of 8 kHz results in a data rate of 104 kbit / s at a resolution of 13 bits. With GSM, this data rate is reduced to a constant 13 KBit / s (so-called code rate) by means of LPC. For this purpose, full-rate codecs or improved full-rate codecs are used, for example, in GSM. With a half-rate codec (Half Rate Codec), the bit rate can be greatly reduced with correspondingly reduced voice quality, namely to 5.6 kbit / s.

In zukünftigen Kommunikationssystemen wie beispielsweise UMTS (Universale Mobile Telecommunications System) werden Sprachkodierer eingesetzt, die mit einer variablen Bitrate Sprachsignale kodieren können. Ein derartiger Sprachkodierer ist beispielsweise der Adaptive Multirate (AMR) Sprachkodierer, der eine Kodierung mit verschiedenen Bitraten ermöglicht. Er wurde für GSM-Mobilfunksysteme entworfen, soll jedoch auch in UMTS-Mobilfunksystemen als Standard-Sprachkodierer zur Anwendung kommen.In future communication systems such as UMTS (Universal Mobile Telecommunications System) voice encoders are used, which can encode voice signals with a variable bit rate. One such speech coder is, for example, the Adaptive Multirate (AMR) speech coder, which allows encoding at various bit rates. It was designed for GSM cellular systems, but is also intended for use in UMTS mobile radio systems are used as standard speech coders.

Bei derartigen Sprachkodierern kann die Bitrate an die zur Übertragung des kodierten Sprachsignals zur Verfügung stehende Bandbreite angepasst werden. Steht ausreichend Bandbreite für die Übertragung zur Verfügung, so wird das Sprachsignal mit einer hohen Bitrate kodiert. Dies wird auch als breitbandige Kodierung bezeichnet. Andernfalls, also bei einer geringen Bandbreite, wird mit einer niedrigen Bitrate kodiert (schmalbandige Kodierung).In such speech encoders, the bit rate can be adapted to the bandwidth available for transmitting the coded speech signal. If sufficient bandwidth is available for transmission, the speech signal is coded at a high bit rate. This is also referred to as broadband coding. Otherwise, ie with a low bandwidth, is coded with a low bit rate (narrowband coding).

Die Anpassung der Bitrate kann während der Übertragung eines Sprachsignals erfolgen. Hierzu wird laufend die Bandbreite eines Übertragungskanals in Form der verfügbaren Bitrate gemessen. Sinkt die verfügbare Bitrate während einer Übertragung des Sprachsignals unter eine vorgegebene Schwelle, wird die Kodierung umgeschaltet, so dass das Sprachsignal schmalbandig kodiert wird.The adaptation of the bit rate can take place during the transmission of a speech signal. For this purpose, the bandwidth of a transmission channel in the form of the available bit rate is continuously measured. If the available bit rate drops below a predetermined threshold during a transmission of the speech signal, the coding is switched so that the speech signal is narrow-band coded.

Es können beispielsweise zwei unterschiedliche Kodierungen mit entsprechend verschiedenen Bitraten vorgesehen sein. Eine breitbandige Kodierung erfolgt beispielsweise bei einer Abtastfrequenz von etwa 16 kHz, während eine schmalbandige Kodierung bei einer Abtastfrequenz von 8 kHz stattfindet. Im ersten Fall wird ein Sprach-Frequenzbereich bis 8 kHz, im zweiten Fall bis 4 kHz abgedeckt. Problematisch ist jedoch die durch das Umschalten der Bitrate verursachte Schwankung der Signalqualität und der damit einhergehenden Qualitätsschwankung einer Kommunikationsverbindung. Durch die vorgegebene Schwelle erfolgt das Umschalten relativ abrupt, so dass während eines Gesprächs plötzlich die Qualität der Verbindung absinken kann.For example, two different codes with correspondingly different bit rates may be provided. A broadband coding takes place, for example, at a sampling frequency of about 16 kHz, while a narrow-band coding takes place at a sampling frequency of 8 kHz. In the first case, a voice frequency range up to 8 kHz, in the second case up to 4 kHz is covered. The problem, however, is caused by the switching of the bit rate fluctuation of the signal quality and the associated quality variation of a communication link. Due to the predetermined threshold, the switching takes place relatively abruptly, so that the quality of the connection can suddenly drop during a conversation.

Durch den Einsatz eines sogenannten Breitband-Sprachextenders (Wideband Speech Extender) oder kurz Sprachextenders kann ohne Änderung eines Kommunikationsprotokolls eine deutlich besbessere Sprachqualität auch bei schmalbandiger Kodierung und der qualitätsmindernde Effekt des abrupten Umschaltens etwas verringert werden. Ein solcher Breitband-Sprachextender ist beispielsweise aus der WO 01/56021 bekannt. Der Breitband-Sprachextender schätzt hierzu anhand des schmalbandigen Signals das breitbandige Sprachsignal durch Synthese der Sprachfrequenzanteile oberhalb von 4 kHz. Dadurch wird ein zu starker Qualitätsverlust beim Umschalten der Bitrate zur Kodierung vermieden. Das durch den Breitband-Sprachextender zur Verfügung gestellte Verfahren zur Adaption der Bitrate eines Sprachsignals funktioniert für nahezu alle Sprecher recht gut, insbesondere bietet es eine durchaus akzeptable Sprachqualität.By using a so-called wideband speech extender (Wideband Speech Extender) or short language extender a much better can be achieved without changing a communication protocol Speech quality even with narrow-band coding and the quality-reducing effect of the abrupt switching slightly reduced. Such a broadband speech extender is for example from WO 01/56021 known. For this purpose, the wideband speech extender estimates the broadband speech signal by synthesizing the speech frequency components above 4 kHz on the basis of the narrowband signal. This avoids excessive loss of quality when switching the bit rate for encoding. The method provided by the wideband speech extender for adapting the bit rate of a speech signal works quite well for almost all speakers, in particular it offers quite acceptable speech quality.

US-A-5 581 652 offenbart einen Sprachextender, der in einer Trainingsphase Codebücher erstellt, mit denen in einer Anwendungsphase schmalbandige Sprachsignale in breitbandige umgesetzt werden. US-A-5 581 652 discloses a speech extender which, in a training phase, generates codebooks with which narrowband speech signals are converted into broadband in an application phase.

Mit der vorliegenden Erfindung soll die Sprachqualität eines Sprachextenders, insbesondere eines Breitband-Sprachextenders weiter verbessert werden. Ferner soll ein Verfahren zum Schätzen eines breitbandigen Sprachsignals anhand eines schmalbandigen Sprachsignals angegeben werden, daß eine verbesserte Sprachqualität ermöglicht.With the present invention, the speech quality of a speech extender, in particular a wideband speech extender, is to be further improved. Furthermore, a method for estimating a wideband speech signal based on a narrow-band speech signal is to be specified, which enables an improved speech quality.

Diese Verbesserung wird durch einen Sprachextender mit den Merkmalen nach Anspruch 1 erreicht. Ferner wird mit dem fahren mit den Merkmalen von Anspruch 6 eine deutliche Verbesserung bei der Schätzung von breitbandigen Sprachsignalen erzielt. Bevorzugte Ausgestaltungen des Sprachextenders und des Verfahrens zum Schätzen eines breitbandigen Sprachsignals anhand eines schmalbandigen Sprachsignals sind Gegenstand der jeweiligen abhängigen Ansprüche.This improvement is achieved by a Sprachxtender with the features of claim 1. Furthermore, a significant improvement in the estimation of wideband speech signals is achieved by driving with the features of claim 6. Preferred embodiments of the speech extender and of the method for estimating a wideband speech signal on the basis of a narrowband speech signal are the subject of the respective dependent claims.

Kern der Erfindung ist es, eine Adaption an ein Kommunikationsendgerät und/oder an einen Sprecher während einer Sprachsignalübertragung vorzunehmen. Hierdurch kann die Sprachqualität nochmals gegenüber bekannten Verfahren und Sprachextendern verbessert werden.The core of the invention is to make an adaptation to a communication terminal and / or to a speaker during a voice signal transmission. As a result, the voice quality can be improved again compared to known methods and speech extender.

Die Erfindung betrifft konkret einen Sprachextender, der derart ausgebildet ist, daß er anhand eines schmalbandigen Sprachsignals ein breitbandiges Sprachsignal schätzt. Ferner ist er derart adaptiv, dass er ein Codebuch an ein Kommunikationsendgerät und/oder an einen Sprecher anpasst. Die Anpassung erfolgt während einer Sprachübertragung. Dadurch kann sich der Sprachextender laufend an den Fernteilnehmer anpassen.Specifically, the invention relates to a speech extender which is adapted to estimate a wideband speech signal based on a narrow-band speech signal. Further it is adaptive such that it adapts a codebook to a communication terminal and / or to a speaker. The adaptation takes place during a voice transmission. This allows the Sprachxtender to constantly adapt to the remote party.

Der Sprachextender analysiert und speichert mindestens einen Sprachparameter und verwendet ihn zur Anpassung. Bei dem mindestens einen Sprachparameter handelt es sich um einen breitbandigen Sprachparameter, der während einer Sprachübertragung auftritt. Insbesondere kann der mindestens eine Sprachparameter sprecher- und/oder Kommunikationsendgerät-spezifisch sein. Beispielsweise kann der Sprachextender in verschiedenen Mobiltelefonen eingesetzt werden und sich an deren akustische Eigenschaften anpassen. Ferner kann er sich an verschiedene Benutzer, d.h. an deren akustische Eigenschaften wie unterschiedliche Sprach-Frequenzspektren adaptieren. Als Sprachparameter werden also vorzugsweise charakteristische akustische Eigenschaften des Kommunikationsendgerätes und/oder des Sprechers herangezogen, beispielsweise Frequenzverläufe, Dämpfungen bestimmter Frequenzen oder Frequenzbereiche sowie das Frequenzspektrum der Stimme des Sprechers. Derartige Sprachparameter können insbesondere durch Messungen während einer Sprachübertragung ermittelt werden.The speech extender analyzes and stores at least one speech parameter and uses it for adaptation. The at least one speech parameter is a wideband speech parameter that occurs during a speech transmission. In particular, the at least one speech parameter can be speaker and / or communication terminal-specific. For example, the speech extender can be used in various mobile phones and adapt to their acoustic properties. Furthermore, it can address different users, i. adapt to their acoustic properties such as different speech frequency spectra. As speech parameters therefore preferably characteristic acoustic properties of the communication terminal and / or the speaker are used, such as frequency characteristics, attenuation of certain frequencies or frequency ranges and the frequency spectrum of the voice of the speaker. Such speech parameters can be determined in particular by measurements during a speech transmission.

Vorzugsweise nimmt der Sprachextender Schätzungen vor, indem er mindestens einen gespeicherten Sprachparameter auswertet. Wie bereits oben erläutert, können verschiedene Sprachparameter zur Adaption herangezogen werden. Diese werden nach ihrer Ermittlung gespeichert und stehen dadurch für die Adaption jederzeit zur Verfügung. Denkbar wäre es auch, die gespeicherten Sprachparameter laufend zu aktualisieren, um immer optimale an die aktuellen akustischen Bedingungen angepasst zu sein.Preferably, the speech extender makes estimates by evaluating at least one stored speech parameter. As already explained above, different speech parameters can be used for the adaptation. These are stored after their determination and are therefore available for adaptation at any time. It would also be conceivable to constantly update the stored speech parameters in order to always be optimally adapted to the current acoustic conditions.

Schließlich kann der Sprachextender in einem Sprachkodierer einer Mobil- und/oder Basisstation eingesetzt werden, die für ein Mobilfunksystem der dritten Generation ausgebildet sind bzw. ist. Bei dem Mobilfunksystem der dritten Generation kann es sich insbesondere um UMTS handeln.Finally, the speech extender can be used in a speech coder of a mobile and / or base station designed for a third generation mobile radio system. In particular, the third generation mobile radio system may be UMTS.

Ist die Mobilstation ein Mobilfunkendgerät, vorzugsweise ein Mobiltelefon, ist der Sprachextender vorzugsweise in Hardware, insbesondere in einer integrierten Schaltung, und/oder in Software implementiert. Eine Implementierung in Hardware bietet den Vorteil, dass der Sprachextender zusammen mit anderen wesentlichen Schaltungselementen des Mobilfunkendgerätes auf einem Chip integriert sein kann. Beispielsweise kann ein Chiphersteller derartige Sprachextender für Produzenten von Mobilfunkendgeräten anbieten. Dagegen bietet eine Implementierung in Software den Vorteil der leichteren Änderbarkeit des Sprachextenders, und vor allem der nachträglichen Änderung, insbesondere wenn die Software des Sprachextenders in einem lösch- und wiederbeschreibbaren Speicher wie beispielsweise einem EEPROM abgelegt ist.If the mobile station is a mobile radio terminal, preferably a mobile telephone, the speech extender is preferably implemented in hardware, in particular in an integrated circuit, and / or in software. An implementation in hardware offers the advantage that the speech extender can be integrated on a chip together with other essential circuit elements of the mobile radio terminal. For example, a chip manufacturer may offer such language extenders to mobile terminal manufacturers. On the other hand, an implementation in software offers the advantage of easier changeability of the speech extender, and above all of the subsequent change, in particular if the software of the speech extender is stored in an erasable and rewritable memory such as an EEPROM.

Ferner betrifft die Erfindung ein Verfahren zum Schätzen eines breitbandigen Sprachsignals anhand eines schmalbandigen Sprachsignals. Verfahrensgemäß wird beim Schätzen eine Anpassung eines Codebuchs an ein Kommunikationsendgerät und/oder an einen Sprecher durchgeführt.Furthermore, the invention relates to a method for estimating a wideband speech signal based on a narrow-band speech signal. According to the method, an adaptation of a codebook to a communication terminal and / or to a speaker is carried out during estimation.

Es wird mindestens ein Sprachparameter analysiert, gespeichert und zur Anpassung verwendet. Der mindestens eine Sprachparameter ist ein breitbandiger Sprachparameter, der während einer Sprachübertragung auftritt. Insbesondere kann der mindestens eine Sprachparameter Sprecher- und/oder Kommunikationsendgerät-spezifisch sein.At least one language parameter is analyzed, stored and used for adaptation. The at least one speech parameter is a wideband speech parameter that occurs during a speech transmission. In particular, the at least one speech parameter may be speaker and / or communication terminal-specific.

Das Verfahren kann vorteilhaft in einem Sprachkodierer einer Mobil- und/oder Basisstation eingesetzt werden, die bzw. das für ein Mobilfunksystem der dritten Generation, insbesondere UMTS, ausgebildet sind bzw. ist. In einer bevorzugten Ausführungsform ist die Mobilstation ein Mobilfunkendgerät und das Verfahren in Hardware, insbesondere in einer integrierten Schaltung, und/oder zumindest teilweise in Software implementiert.The method can be advantageously used in a speech coder of a mobile and / or base station, the or are designed for a third generation mobile radio system, in particular UMTS. In a preferred embodiment, the mobile station is a mobile radio terminal and the method is implemented in hardware, in particular in an integrated circuit, and / or at least partially in software.

Nachfolgend wird die Erfindung anhand von Ausführungsbeispielen erläutert.The invention will be explained below with reference to exemplary embodiments.

Zuerst soll die Funktionsweise eines Breitband-Sprachextenders beispielhaft erläutert werden, um den Einsatz der Erfindung besser verstehen zu können.First, the operation of a broadband speech extender will be exemplified to better understand the use of the invention.

Um Sprachfrequenzanteile oberhalb einer bestimmten Frequenz zu generieren, benötigt man ein breitbandiges Anregungssignal und breitbandige Filterkoeffizienten für den Synthesefilter im Sprachkodierer. Da in der Regel nur das schmalbandige.Anregungssignal und die schmalbandigen Filterkoeffizienten bekannt sind, ist es notwendig, eine Transformation "Schmalband" zu "Breitband" durchzuführen. Dies erfolgt mittels eines Breitband-Sprachextenders. Das Anregungssignal kann zum Beispiel durch eine nicht lineare Signalverarbeitung erweitert werden. Eine andere Möglichkeit liegt im Überlagern des Anregungssignals mit weißem Rauschen.In order to generate speech frequency components above a certain frequency, a broadband excitation signal and broadband filter coefficients are required for the synthesis filter in the speech coder. Since usually only the narrow-band excitation signal and the narrow-band filter coefficients are known, it is necessary to carry out a transformation from "narrowband" to "broadband". This is done by means of a broadband speech extender. The excitation signal can be extended, for example, by a non-linear signal processing. Another possibility is to superimpose the excitation signal with white noise.

Die Filterkoeffizienten können durch den Einsatz von zwei Codebüchern geschätzt werden. Die Einträge der Codebücher stellen mögliche Sätze von Filterkoeffizienten dar. In einer Trainingsphase werden ein schmalbandiges und ein breitbandiges Codebuch trainiert. Da sie gleichzeitig mit dem gleichen Anregungssignal (einmal schmalbandig und einmal breitbandig) trainiert werden, ist der Zusammenhang zwischen den Einträgen beider Codebücher bekannt. Beispielsweise korrespondiert Eintrag 1 des schmalbandigen Codebuches mit dem Eintrag 2 des breitbandigen Codebuches.The filter coefficients can be estimated by using two codebooks. The entries of the codebooks represent possible sets of filter coefficients. In a training phase, a narrowband and a wideband codebook are trained. Since they are trained simultaneously with the same excitation signal (once narrowband and once wideband), the relationship between the entries of both codebooks is known. For example, entry 1 of the narrow-band codebook corresponds to the entry 2 of the wideband codebook.

In der Einsatzphase des Breitband-Sprachextenders werden beide Codebücher folgendermaßen benutzt:

  1. 1. Vom schmalbandigen Sprachsignal werden die schmalbandigen Filterkoeffizienten berechnet.
  2. 2. Diese Filterkoeffizienten werden mit den Einträgen des schmalbandigen Codebuchs verglichen.
  3. 3. Der am besten passende Eintrag wird ausgewählt. Da der Zusammenhang zwischen den Codebüchern bekannt ist, werden die optimalen breitbandigen Filterkoeffizienten gleichzeitig gefunden.
In the broadband speech extender deployment phase, both codebooks are used as follows:
  1. 1. From the narrow-band speech signal, the narrow-band filter coefficients are calculated.
  2. 2. These filter coefficients are compared with the entries of the narrowband codebook.
  3. 3. The best matching entry is selected. Since the relationship between the codebooks is known, the optimal wideband filter coefficients are found simultaneously.

Anhand der gefundenen breitbandigen Filterkoeffizienten und des erweiterten Anregungssignals werden Sprachsignalabschnitte synthetisiert.On the basis of the found broadband filter coefficients and the extended excitation signal speech signal sections are synthesized.

Während der Spracherweiterung werden also die schmalbandigen Filterkoeffizienten von der schmalbandigen Sprache berechnet. Diese Koeffizienten werden mit den Einträgen des Codebuches mit schmalbandigen Koeffizienten verglichen und der am besten passende Eintrag wird gewählt. Da - wie oben bereits erwähnt - der Zusammenhang zwischen den Codebüchern bekannt ist, werden auf diese Weise die optimalen Filterkoeffizienten für den Sprachsynthesefilter des Breitband-Sprachextenders geschätzt.During language extension, therefore, the narrow-band filter coefficients are calculated from the narrow-band speech. These coefficients are compared with the entries of the narrow-band coefficient codebook and the best matching entry is chosen. Since, as already mentioned above, the relationship between the codebooks is known, the optimal filter coefficients for the speech synthesis filter of the wideband speech extender are estimated in this way.

Das Verfahren kann auch adaptiv beispielsweise in folgenden Schritten realisiert werden:

  1. 1. Während der breitbandigen Sprachübertragung werden die "echten" Filterkoeffizienten (FK) berechnet und der am besten passende Eintrag des Codebuches mit breitbandigen Sprachkoeffizienten wird gesucht.
  2. 2. Der Codebucheintrag wird neu berechnet. CB - Vector new = C * FK + 1 - c * CB Vector old
    Figure imgb0001
  3. 3. Während der schmalbandigen Sprachübertragung werden die breitbandigen Filterkoeffizienten anhand des modifizierten Codebuches geschätzt.
The method can also be implemented adaptively, for example in the following steps:
  1. 1. During wideband speech transmission, the "true" filter coefficients (FK) are calculated and the best matching entry of the codebook with broadband speech coefficients is searched.
  2. 2. The codebook entry is recalculated. CB - Vector new = C * FK + 1 - c * CB ~ Vector old
    Figure imgb0001
  3. 3. During narrowband speech transmission, the wideband filter coefficients are estimated from the modified codebook.

Eine Alternative wird in folgenden Schritten realisiert:

  1. 1. Während der breitbandigen Sprachübertragung werden die "echten" Filterkoeffizienten (FK)berechnet.
  2. 2. Vom breitbandigen Sprachsignal wird das schmalbandige Sprachsignal berechnet und die schmalbandigen Filterkoeffizienten werden ermittelt.
  3. 3. Der am besten passenden Eintrag des Codebuches mit schmalbandigen Koeffizienten für die schmalbandigen Filterkoeffizienten wird gesucht.
  4. 4. Mittels des bekannten Zusammenhanges zwischen den Codebüchern wird der optimale Eintrag des Codebuches mit breitbandigen Koeffizienten gesucht.
  5. 5. Dieser Codebucheintrag wird neu berechnet. CB - Vector new = C * FK + 1 - c * CB - Vector old
    Figure imgb0002
  6. 6. Während der schmalbandigen Sprachübertragung werden die breitbandigen Filterkoeffizienten anhand des modifizierten Codebuches geschätzt.
An alternative is realized in the following steps:
  1. 1. During wideband speech transmission, the "true" filter coefficients (FK) are calculated.
  2. 2. The wideband speech signal is used to calculate the narrow-band speech signal and the narrow-band filter coefficients are determined.
  3. 3. The best matching entry of the codebook with narrowband coefficients for the narrowband filter coefficients is searched.
  4. 4. By means of the known relationship between the codebooks the optimal entry of the codebook is searched with broadband coefficients.
  5. 5. This codebook entry is recalculated. CB - Vector new = C * FK + 1 - c * CB - Vector old
    Figure imgb0002
  6. 6. During narrowband speech transmission, the wideband filter coefficients are estimated from the modified codebook.

Durch den erfindungsgemäßen Sprachextender wird eine weitere Verbesserung der Sprachqualität erzielt. Er kann besonders vorteilhaft in allen Kommunikationssystemen eingesetzt werden, in denen Sprachkodierer mit variabler Bitrate angewendet werden, die sowohl schmalbandig als auch breitbandig kodieren können, beispielsweise bei UMTS.The speech extender according to the invention achieves a further improvement of the speech quality. It can be used particularly advantageously in all communication systems in which variable bit rate speech coders can be used, which can code both narrowband and broadband, for example in the case of UMTS.

Claims (10)

  1. Speech extender which is embodied so that it estimates a wideband speech signal on the basis of a narrowband speech signal, characterized in that it matches a codebook adaptively to a communication terminal and/or to a speaker during a wideband speech transmission and features means with which it analyses, stores and uses for adaptation at least one wideband speech parameter which arises during speech transmission.
  2. Speech extender according to claim 1, characterized in that the at least one speech parameter is speaker-specific and/or communication terminal-specific.
  3. Speech extender according to claim 2, characterized in that it estimates by evaluating at least one stored speech parameter.
  4. Speech extender according to one of the previous claims, characterized in that it is used in a speech codec of a mobile station and/or base station which are or is embodied for a mobile radio system of the third generation, especially UMTS.
  5. Speech extender according to claim 4, characterized in that the mobile station is a mobile radio terminal and the speech extender is implemented in hardware, especially in an integrated circuit, and/or at least partly in software.
  6. Method for estimating a wideband speech signal based on a narrowband speech signal characterized in that, in a wideband speech transmission an adaptive matching of a codebook to a communication terminal and/or to a speaker is performed and at least one wideband speech parameter which occurs during speech transmission is analysed, stored and used for adaptation.
  7. Method according to claim 6, characterized in that the at least one speech parameter is speaker- and/or communication terminal-specific.
  8. Method according to claim 7, characterized in that estimation is undertaken on the basis of at least one stored speech parameter.
  9. Method according to one of the claims 6 to 8, characterized in that it is used in a speech codec of a mobile station and/or base station which are or is embodied for a mobile radio system of the third generation, especially UMTS.
  10. Method according to claim 9, characterized in that the mobile station is a mobile radio terminal and the method is implemented in hardware, especially in an integrated circuit, and/or at least partly in software.
EP01978183A 2001-09-28 2001-09-28 Speech extender and method for estimating a broadband speech signal from a narrowband speech signal Expired - Lifetime EP1433166B8 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/DE2001/003729 WO2003036623A1 (en) 2001-09-28 2001-09-28 Speech extender and method for estimating a broadband speech signal from a narrowband speech signal

Publications (3)

Publication Number Publication Date
EP1433166A1 EP1433166A1 (en) 2004-06-30
EP1433166B1 true EP1433166B1 (en) 2007-11-14
EP1433166B8 EP1433166B8 (en) 2008-01-02

Family

ID=5648296

Family Applications (1)

Application Number Title Priority Date Filing Date
EP01978183A Expired - Lifetime EP1433166B8 (en) 2001-09-28 2001-09-28 Speech extender and method for estimating a broadband speech signal from a narrowband speech signal

Country Status (5)

Country Link
US (1) US20040243400A1 (en)
EP (1) EP1433166B8 (en)
CN (1) CN100403401C (en)
DE (1) DE50113277D1 (en)
WO (1) WO2003036623A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2004090870A1 (en) * 2003-04-04 2004-10-21 Kabushiki Kaisha Toshiba Method and apparatus for encoding or decoding wide-band audio
US8818797B2 (en) 2010-12-23 2014-08-26 Microsoft Corporation Dual-band speech encoding
KR102244612B1 (en) * 2014-04-21 2021-04-26 삼성전자주식회사 Appratus and method for transmitting and receiving voice data in wireless communication system
US10847170B2 (en) 2015-06-18 2020-11-24 Qualcomm Incorporated Device and method for generating a high-band signal from non-linearly processed sub-ranges
US9837089B2 (en) * 2015-06-18 2017-12-05 Qualcomm Incorporated High-band signal generation

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4311877A (en) * 1979-12-19 1982-01-19 Kahn Leonard R Method and means for improving the reliability of systems that transmit relatively wideband signals over two or more relatively narrowband transmission circuits
US4330689A (en) * 1980-01-28 1982-05-18 The United States Of America As Represented By The Secretary Of The Navy Multirate digital voice communication processor
ATE128311T1 (en) * 1990-03-08 1995-10-15 Ericsson Telefon Ab L M SYSTEM AND METHOD FOR DYNAMIC ALLOCATION OF STRIP ROUTING NUMBERS.
JP2779886B2 (en) * 1992-10-05 1998-07-23 日本電信電話株式会社 Wideband audio signal restoration method
US5455888A (en) * 1992-12-04 1995-10-03 Northern Telecom Limited Speech bandwidth extension method and apparatus
JP3626492B2 (en) * 1993-07-07 2005-03-09 ポリコム・インコーポレイテッド Reduce background noise to improve conversation quality
US5668837A (en) * 1993-10-14 1997-09-16 Ericsson Inc. Dual-mode radio receiver for receiving narrowband and wideband signals
DE69619284T3 (en) * 1995-03-13 2006-04-27 Matsushita Electric Industrial Co., Ltd., Kadoma Device for expanding the voice bandwidth
US5706335A (en) * 1995-04-10 1998-01-06 Corporate Computer Systems Method and appartus for transmitting coded audio signals through a transmission channel with limited bandwidth
US5806025A (en) * 1996-08-07 1998-09-08 U S West, Inc. Method and system for adaptive filtering of speech signals using signal-to-noise ratio to choose subband filter bank
US5901145A (en) * 1997-02-28 1999-05-04 Telefonaktiebolaget L M Ericsson (Publ) Mobile station handoff between a spread spectrum communications system and a frequency division communications system
DE19804581C2 (en) * 1998-02-05 2000-08-17 Siemens Ag Method and radio communication system for the transmission of voice information
EP0945852A1 (en) * 1998-03-25 1999-09-29 BRITISH TELECOMMUNICATIONS public limited company Speech synthesis
GB2357682B (en) * 1999-12-23 2004-09-08 Motorola Ltd Audio circuit and method for wideband to narrowband transition in a communication device
US6704711B2 (en) * 2000-01-28 2004-03-09 Telefonaktiebolaget Lm Ericsson (Publ) System and method for modifying speech signals
CN1235192C (en) * 2001-06-28 2006-01-04 皇家菲利浦电子有限公司 Wideband signal transmission system

Also Published As

Publication number Publication date
CN1630896A (en) 2005-06-22
EP1433166A1 (en) 2004-06-30
CN100403401C (en) 2008-07-16
US20040243400A1 (en) 2004-12-02
WO2003036623A1 (en) 2003-05-01
DE50113277D1 (en) 2007-12-27
EP1433166B8 (en) 2008-01-02

Similar Documents

Publication Publication Date Title
EP1388147B1 (en) Method for enlarging the band width of a narrow-band filtered voice signal, especially a voice signal emitted by a telecommunication appliance
DE102008016502B4 (en) A method for data transmission over a voice channel of a wireless communication network using continuous signal modulation
DE69727895T2 (en) Method and apparatus for speech coding
DE69932575T2 (en) LANGUAGE CODIER AND LANGUAGE DECODER
EP2245621B1 (en) Method and means for encoding background noise information
DE69730721T2 (en) METHOD AND DEVICES FOR NOISE CONDITIONING OF SIGNALS WHICH REPRESENT AUDIO INFORMATION IN COMPRESSED AND DIGITIZED FORM
DE60118631T2 (en) METHOD FOR REPLACING TRACKED AUDIO DATA
WO2007073949A1 (en) Method and apparatus for artificially expanding the bandwidth of voice signals
DE69820362T2 (en) Non-linear filter for noise suppression in linear predictive speech coding devices
EP1433166B1 (en) Speech extender and method for estimating a broadband speech signal from a narrowband speech signal
DE4211945C1 (en)
DE4343366C2 (en) Method and circuit arrangement for increasing the bandwidth of narrowband speech signals
EP1677286A1 (en) Process for adaptation of comfort noise generation parameters
EP1430674A1 (en) Device and method for suppressing periodic interference signals
DE10252070B4 (en) Communication terminal with parameterized bandwidth extension and method for bandwidth expansion therefor
DE60210597T2 (en) DEVICE FOR ADPCDM LANGUAGE CODING WITH SPECIFIC ADJUSTMENT OF THE STEP VALUES
EP2245622B1 (en) Method and means for decoding background noise information
WO2004044894A1 (en) Method for enlarging the bandwidth of a narrow-band filtered speech signal
WO2002058055A1 (en) Method and system for converting voice signals of different band widths, which have been encoded using parameters, into voice signals
EP1390947B1 (en) Method for signal reception
DE4236315C1 (en) Method of speech coding
DE19906223B4 (en) Method and radio communication system for voice transmission, in particular for digital mobile communication systems
WO2006072526A1 (en) Method for bandwidth extension
DE10136491B4 (en) Method and device for improving voice quality on transparent telecommunication transmission paths
WO2006072519A1 (en) Analog signal encoding method

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 20040212

AK Designated contracting states

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

RIN1 Information on inventor provided before grant (corrected)

Inventor name: KLINKE, STEFANO, AMBROSIUS

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

RAP1 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): DE FR GB

RAP3 Party data changed (applicant data changed or rights of an application transferred)

Owner name: NOKIA SIEMENS NETWORKS S.P.A.

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

RAP4 Party data changed (patent owner data changed or rights of a patent transferred)

Owner name: NOKIA SIEMENS NETWORKS GMBH & CO. KG

REF Corresponds to:

Ref document number: 50113277

Country of ref document: DE

Date of ref document: 20071227

Kind code of ref document: P

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20080131

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20080815

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20080912

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20080918

Year of fee payment: 8

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: DE

Payment date: 20080919

Year of fee payment: 8

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20090928

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20100531

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090930

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20100401

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20090928