EP1288914B1 - Method for the correction of measured speech quality values - Google Patents

Method for the correction of measured speech quality values Download PDF

Info

Publication number
EP1288914B1
EP1288914B1 EP02012790A EP02012790A EP1288914B1 EP 1288914 B1 EP1288914 B1 EP 1288914B1 EP 02012790 A EP02012790 A EP 02012790A EP 02012790 A EP02012790 A EP 02012790A EP 1288914 B1 EP1288914 B1 EP 1288914B1
Authority
EP
European Patent Office
Prior art keywords
speech
speech quality
value
correction
quality
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
EP02012790A
Other languages
German (de)
French (fr)
Other versions
EP1288914A2 (en
EP1288914A3 (en
Inventor
Jens Dr. Berger
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Deutsche Telekom AG
Original Assignee
Deutsche Telekom AG
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Deutsche Telekom AG filed Critical Deutsche Telekom AG
Publication of EP1288914A2 publication Critical patent/EP1288914A2/en
Publication of EP1288914A3 publication Critical patent/EP1288914A3/en
Application granted granted Critical
Publication of EP1288914B1 publication Critical patent/EP1288914B1/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02168Noise filtering characterised by the method used for estimating noise the estimation exclusively taking place during speech pauses

Definitions

  • the invention relates to instrumental methods for measuring the speech quality of recorded or transmitted speech signals. It is based on voice quality metrics assumed that e.g. with the ITU-T standard P.862 ("Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs ", ITU-T, Geneva, 2001) become.
  • P.862 Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs
  • the perceived speech quality z. B. in telephone or broadcasting is mainly caused by speech-simultaneous disturbances, ie disturbances during the Voice activity, determined. But also noises in speech pauses go into that Quality judgment, especially with high quality voice reproduction quality.
  • Speech quality determinations of speech signals are usually auditory ("subjective") investigations carried out with test subjects.
  • the goal of instrumental ("objective") speech quality assessment procedures are out Properties of the speech signal to be evaluated by means of suitable calculation methods To determine characteristic values that determine the speech quality of the speech signal to be evaluated describe without having to resort to judgments from test subjects.
  • Known methods for instrumental determination of speech quality determine the voice quality based on a comparison between undisturbed reference speech signal (source speech signal) and the to be evaluated and possibly disturbed signal.
  • Such measuring methods are used in so-called sample connection systems, in where a known reference speech signal (source speech signal) is input to the source and z. B. transmitted over a telephone connection and recorded on the sink. After Recording of the speech signal is the calculation of a speech quality value.
  • source speech signal source speech signal
  • Instrumental methods for speech quality determination are usually limited to the Evaluation of sections with voice activity.
  • the current ITU-T standard P.862 is also restricted to sections when determining the speech quality active language. Especially with high quality speech and sounds, the occur exclusively in speech pauses, provide these methods (eg ITU-T Rec. P.862) unreliable quality values.
  • the voice quality is here too evaluated optimistically, since the perceived by a listener voice quality on the whole signal including possible noises in the speech pauses relates.
  • Some instrumental methods for speech quality determination such. B. the method according to ITU-T Rec. P.862, take into account the calculation of the speech quality values Noises in the speech pauses not. The resulting readings are thereby especially at high quality playback in voice activity but occurring noises in speech breaks, unreliable. With the present method, the background noise be taken into account in the speech pauses in determining the speech quality values.
  • the solution of the problem as defined in claim 1 assumes that the background noise in the Speech pauses regarding their disturbing influence on the perceived speech quality be rated. For this purpose, intensity characteristic values of the background noise are determined and with these values, the speech quality measurements made by an instrumental Method, for. B. ITU-T Rec. P.862, were corrected corrected.
  • the voice quality value is obtained by using the source voice signal and the evaluating disturbed speech signal, z.
  • the method of ITU-T Rec. P.962 calculated.
  • These speech signals are also available to the subsequent correction process Input parameters available.
  • the correction method described here requires nor the calculated speech quality value, the z. B. with the method according to ITU-T Rec. P.862 was calculated.
  • one or more intensity characteristics are obtained of the noise in speech pauses.
  • This can be z.
  • the average loudness according to ISO 532 of the background noise in speech pauses A very efficient process for this is described in the patent application DE 101 20 168.
  • other intensity characteristics e.g. Sharpness, impulsiveness, fluctuation strength
  • Sharpness, impulsiveness, fluctuation strength can be included in the correction value. It It is assumed that increasing intensity characteristics of one as well Increasing noise caused by the noise in speech breaks and thus too lead to a greater reduction in perceived speech quality.
  • the voice quality is rated too high by the speech quality measurement methods described with good speech reproduction quality and simultaneous noises in the speech pauses.
  • One or more background noise intensity values in speech pauses are used to correct the measured speech quality value. Assuming that the measured speech quality score is on a scale from 1 (low quality) to 5 (very good quality) according to ITU-T Recommendation P.800 ("Methods for objective and subjective assessment of quality", ITU-T, Geneva 1996 ), all values above a certain speech quality threshold (eg, in the method of ITU-T Rec. P.862 above 3.0) are reduced when background noise occurs in speech pauses.
  • This reduction is dependent on the intensity characteristics of the background noise, the proportion of speech pauses in the total signal PA and the speech quality value Y.
  • a general description of this correction is for all speech quality values Y above a speech quality threshold YS in which only one intensity score (here N) is taken into account. This can be z.
  • the function a (N) represents a unique and increasing weighting function of the intensity parameter N.
  • the corrected speech quality value is always less than or equal to the uncorrected value.
  • the correction is small if the intensity of the noise is low ( N is small), there are few speech pauses ( PA small) or the speech quality value is close to the speech quality threshold ( Y-YS small). More correct is when strong pauses sound with otherwise high voice quality Y occur.
  • the intensity characteristics are weighted with the weighting functions a (N), b (M) and c (O), respectively. Since an increasing disturbance with increasing values is assumed, here also clear and increasing weighting functions are to be set.
  • the exemplary embodiment presented here shows, by way of example, a correction of the speech quality values determined using the method according to ITU-T Rec. P.862 "PESQ" (as of 2001).
  • This method provides a speech quality value by comparing an undisturbed source speech signal with the disturbed speech signal to be evaluated. These two speech signals are used to determine the average loudness of the background noise according to patent application DE 101 20 168.
  • the thus calculated value N in sone is used to correct the quality value calculated by the method according to ITU-T Rec. P.862 "PESQ" (state 2001) used.
  • the value of the threshold excess YD is determined, ie by what proportion the measured speech quality Y exceeds the speech quality threshold YS .

Abstract

A speech quality estimate correction system uses a sum of background noise measurements in speech pauses with multiplication of the measured value by a pause ratio constant.

Description

VorbemerkungPreliminary note

Die Erfindung bezieht sich auf instrumentelle Verfahren zur Messung der Sprachqualität von aufgezeichneten oder übertragenen Sprachsignalen. Dabei wird von Sprachqualitätsmesswerten ausgegangen, die z.B. mit dem ITU-T Standard P.862 ("Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs", ITU-T, Genf, 2001) bestimmt werden.The invention relates to instrumental methods for measuring the speech quality of recorded or transmitted speech signals. It is based on voice quality metrics assumed that e.g. with the ITU-T standard P.862 ("Perceptual Evaluation of Speech Quality (PESQ), an Objective Method for end-to-end Speech Quality Assessment of Narrow-band Telephone Networks and Speech Codecs ", ITU-T, Geneva, 2001) become.

Die empfundene Sprachqualität z. B. bei Telefonverbindungen oder Rundfunkübertragungen wird hauptsächlich von sprachsimultanen Störungen, also Störungen während der Sprachaktivität, bestimmt. Aber auch Geräusche in Sprachpausen gehen in das Qualitätsurteil ein, das insbesondere bei hochqualitativer Sprachwiedergabequalität.The perceived speech quality z. B. in telephone or broadcasting is mainly caused by speech-simultaneous disturbances, ie disturbances during the Voice activity, determined. But also noises in speech pauses go into that Quality judgment, especially with high quality voice reproduction quality.

Sprachqualitätsbestimmungen von Sprachsignalen werden in der Regel mittels auditiver ("subjektiver") Untersuchungen mit Versuchspersonen vorgenommen. Das Ziel von instrumentellen ("objektiven") Verfahren zur Sprachqualitätsbestimmung ist es dagegen, aus Eigenschaften des zu bewertenden Sprachsignals mittels geeigneter Rechenverfahren Kennwerte zu ermitteln, die die Sprachqualität des zu bewertenden Sprachsignals beschreiben ohne auf Urteile von Versuchspersonen zurückgreifen zu müssen.Speech quality determinations of speech signals are usually auditory ("subjective") investigations carried out with test subjects. The goal of instrumental ("objective") speech quality assessment procedures, however, are out Properties of the speech signal to be evaluated by means of suitable calculation methods To determine characteristic values that determine the speech quality of the speech signal to be evaluated describe without having to resort to judgments from test subjects.

Stand der Technik und Nachteile bekannter VerfahrenState of the art and disadvantages of known methods

Bekannte Verfahren zur instrumentellen Bestimmung der Sprachqualität (z. B. ITU-T Rec. P.862 "PESQ") bestimmen die Sprachqualität anhand eines Vergleichs zwischen ungestörtem Referenzsprachsignal (Quellsprachsignal) und dem zu bewertenden und möglicherweise gestörten Signal.Known methods for instrumental determination of speech quality (eg ITU-T Rec. P.862 "PESQ") determine the voice quality based on a comparison between undisturbed reference speech signal (source speech signal) and the to be evaluated and possibly disturbed signal.

Solche Messverfahren werden in sogenannten Probeverbindungssystemen eingesetzt, bei denen ein bekanntes Referenzsprachsignal (Quellsprachsignal) an der Quelle eingespeist und z. B. über eine Telefonverbindung übertragen und an der Senke aufgezeichnet wird. Nach der Aufzeichnung des Sprachsignals erfolgt die Berechnung eines Sprachqualitätswertes.Such measuring methods are used in so-called sample connection systems, in where a known reference speech signal (source speech signal) is input to the source and z. B. transmitted over a telephone connection and recorded on the sink. After Recording of the speech signal is the calculation of a speech quality value.

Instrumentelle Verfahren zur Sprachqualitätsbestimmung beschränken sich meist auf die Auswertung von Abschnitten mit Sprachaktivität. Der aktuelle ITU-T Standard P.862 beschränkt sich bei der Bestimmung der Sprachqualität ebenfalls nur auf Abschnitte mit aktiver Sprache. Insbesondere bei hochqualitativer Sprachwiedergabe und Geräuschen, die ausschließlich in Sprachpausen auftreten, liefern diese Verfahren (z. B. Messverfahren nach ITU-T Rec. P.862) unzuverlässige Qualitätswerte. Die Sprachqualität wird hier zu optimistisch bewertet, da die von einem Zuhörer empfundene Sprachqualität sich auf das gesamte Signal einschließlich möglicher Geräusche in den Sprachpausen bezieht.Instrumental methods for speech quality determination are usually limited to the Evaluation of sections with voice activity. The current ITU-T standard P.862 is also restricted to sections when determining the speech quality active language. Especially with high quality speech and sounds, the occur exclusively in speech pauses, provide these methods (eg ITU-T Rec. P.862) unreliable quality values. The voice quality is here too evaluated optimistically, since the perceived by a listener voice quality on the whole signal including possible noises in the speech pauses relates.

Aufgabetask

Einige instrumentelle Verfahren zur Sprachqualitätsbestimmung, wie z. B. das Verfahren nach ITU-T Rec. P.862, berücksichtigen bei der Berechnung der Sprachqualitätswerte die Geräusche in den Sprachpausen nicht. Die sich ergebenden Messwerte sind dadurch, insbesondere bei hoher Wiedergabequalität bei Sprachaktivität aber auftretenden Geräuschen in Sprachpausen, unzuverlässig. Mit dem vorliegenden Verfahren sollen die Hintergeräusche in den Sprachpausen bei der Bestimmung der Sprachqualitätswerte berücksichtigt werden.Some instrumental methods for speech quality determination, such. B. the method according to ITU-T Rec. P.862, take into account the calculation of the speech quality values Noises in the speech pauses not. The resulting readings are thereby especially at high quality playback in voice activity but occurring noises in speech breaks, unreliable. With the present method, the background noise be taken into account in the speech pauses in determining the speech quality values.

Lösungsprinzipsolution principle

Die Lösung der Aufgabe, wie im Anspruch 1 definiert, geht davon aus, dass die Hintergrundgeräusche in den Sprachpausen hinsichtlich ihres störenden Einflusses auf die empfundene Sprachqualität bewertet werden. Dazu werden Intensitätskennwerte des Hintergrundgeräusches ermittelt und mit diesen Werten die Sprachqualitätsmesswerte, die durch ein instrumentales Verfahren, z. B. nach ITU-T Rec. P.862, gewonnenen wurden, korrigiert.The solution of the problem as defined in claim 1 assumes that the background noise in the Speech pauses regarding their disturbing influence on the perceived speech quality be rated. For this purpose, intensity characteristic values of the background noise are determined and with these values, the speech quality measurements made by an instrumental Method, for. B. ITU-T Rec. P.862, were corrected corrected.

Zunächst wird der Sprachqualitätswert unter Benutzung des Quellsprachsignals und des zu bewertenden gestörten Sprachsignals, z. B. mit dem Verfahren nach ITU-T Rec. P.962, berechnet. Diese Sprachsignale stehen dem anschließenden Korrekturverfahren ebenfalls als Eingangsparameter zur Verfügung. Zusätzlich benötigt das hier beschriebene Korrekturverfahren noch den berechneten Sprachqualitätswert, der z. B. mit dem Verfahren nach ITU-T Rec. P.862 berechnet wurde.First, the voice quality value is obtained by using the source voice signal and the evaluating disturbed speech signal, z. By the method of ITU-T Rec. P.962, calculated. These speech signals are also available to the subsequent correction process Input parameters available. In addition, the correction method described here requires nor the calculated speech quality value, the z. B. with the method according to ITU-T Rec. P.862 was calculated.

Ausgehend von den beiden Sprachsignalen werden ein oder mehrere Intensitätskennwerte des Geräuschs in Sprachpausen berechnet. Dies kann z. B. die mittlere Lautheit nach ISO 532 der Hintergrundgeräusche in Sprachpausen sein. Ein sehr effizientes Verfahren dafür ist in der Patentanmeldung DE 101 20 168 beschrieben. Auch andere Intensitätskennwerte (z.B. Schärfe, Impulshaftigkeit, Schwankungsstärke) können in den Korrekturwert einfließen. Es wird davon ausgegangen, dass zunehmende Intensitätskennwerte von einer ebenfalls zunehmenden Störung durch das Geräusch in Sprachpausen verursacht werden und damit zu einer stärkeren Verminderung der empfundenen Sprachqualität führen.Starting from the two speech signals, one or more intensity characteristics are obtained of the noise in speech pauses. This can be z. For example, the average loudness according to ISO 532 of the background noise in speech pauses. A very efficient process for this is described in the patent application DE 101 20 168. Also other intensity characteristics (e.g. Sharpness, impulsiveness, fluctuation strength) can be included in the correction value. It It is assumed that increasing intensity characteristics of one as well Increasing noise caused by the noise in speech breaks and thus too lead to a greater reduction in perceived speech quality.

Die Sprachqualität wird bei guter Sprachwiedergabequalität und gleichzeitigen Geräuschen in den Sprachpausen von den beschriebenen Sprachqualitätsmessverfahren zu hoch bewertet. Ein oder mehrere Intensitätswerte des Hintergrundgeräusches in Sprachpausen werden dazu benutzt, den gemessenen Sprachqualitätswert zu korrigieren. Ausgehend davon, dass der gemessene Sprachqualitätswert auf einer Skala von 1 (niedrige Qualität) bis 5 (sehr gute Qualität) entsprechend der ITU-T Recommendation P.800 ("Methods for objective and subjective assessment of quality", ITU-T, Genf 1996), abgebildet wird, werden alle Werte oberhalb eines bestimmten Sprachqualitätsschwellwertes (z. B. bei dem Verfahren nach ITU-T Rec. P.862 oberhalb von 3,0) bei Auftreten von Hintergrundgeräuschen in Sprachpausen reduziert. Diese Reduktion ist von den Intensitätskennwerten des Hintergrundgeräusches, dem Anteil der Sprachpausen im Gesamtsignal PA und dem Sprachqualitätswert Y abhängig. Eine allgemeine Beschreibung dieser Korrektur ist für alle Sprachqualitätswerte Y oberhalb eines Sprachqualitätsschwellenwertes YS, bei der nur ein Intensitätskennwert (hier N) berücksichtigt wird. Dies kann z. B. die mittlere Lautheit des Hintergrundgeräusches in Sprachpausen sein: Y(korrigiert) = Y- ((Y- YS)) * a(N) * PA). The voice quality is rated too high by the speech quality measurement methods described with good speech reproduction quality and simultaneous noises in the speech pauses. One or more background noise intensity values in speech pauses are used to correct the measured speech quality value. Assuming that the measured speech quality score is on a scale from 1 (low quality) to 5 (very good quality) according to ITU-T Recommendation P.800 ("Methods for objective and subjective assessment of quality", ITU-T, Geneva 1996 ), all values above a certain speech quality threshold (eg, in the method of ITU-T Rec. P.862 above 3.0) are reduced when background noise occurs in speech pauses. This reduction is dependent on the intensity characteristics of the background noise, the proportion of speech pauses in the total signal PA and the speech quality value Y. A general description of this correction is for all speech quality values Y above a speech quality threshold YS in which only one intensity score (here N) is taken into account. This can be z. For example, the average loudness of background noise in speech pauses may be: Y (corrected) = Y- ((Y-YS)) * a (N) * PA).

Die Funktion a(N) stellt eine eindeutige und steigende Wichtungsfunktion des Intensitätskennwertes N dar. Der korrigierte Sprachqualitätswert ist stets kleiner oder gleich dem unkorrigierten Wert. Die Korrektur fällt gering aus, wenn die Intensität des Geräusches gering ist (N ist klein), nur wenig Sprachpausen auftreten (PA klein) oder sich der Sprachqualitätswert in der Nähe des Sprachqualitätsschwellwertes befindet (Y- YS klein). Stärker wird korrigiert, wenn starke Pausengeräusche bei ansonsten hoher Sprachqualität Y auftreten. In Analogie dazu, kann die Korrektur der Sprachqualitätswerte auch mit Berücksichtigung mehrerer Intensitätskennwerte (hier als Beispiel N, M und O) erfolgen: Y(korrigiert) = Y - ((Y- YS)) * a(N) *b(M *c(O) * PA) The function a (N) represents a unique and increasing weighting function of the intensity parameter N. The corrected speech quality value is always less than or equal to the uncorrected value. The correction is small if the intensity of the noise is low ( N is small), there are few speech pauses ( PA small) or the speech quality value is close to the speech quality threshold ( Y-YS small). More correct is when strong pauses sound with otherwise high voice quality Y occur. By analogy with this, the correction of the speech quality values can also take place with consideration of several intensity characteristics (here as examples N, M and O): Y (corrected) = Y - ((Y-YS)) * a (N) * b (M * c (O) * PA)

Die Intensitätskennwerte werden mit den Wichtungsfunktionen a(N), b(M) bzw. c(O) gewichtet. Da von einer zunehmenden Störung bei steigenden Werten ausgegangen wird, sind hier auch eindeutige und steigende Wichtungsfunktionen anzusetzen.The intensity characteristics are weighted with the weighting functions a (N), b (M) and c (O), respectively. Since an increasing disturbance with increasing values is assumed, here also clear and increasing weighting functions are to be set.

Ausführungsbeispiel für die Korrektur von Sprachqualitätswerten, die mit dem Verfahren "PESQ".nach ITU-T P.862 bestimmt wurdenEmbodiment for the correction of speech quality values associated with the Method "PESQ" according to ITU-T P.862

Das hier vorgestellte Ausführungsbeispiel zeigt beispielhaft eine Korrektur der mit dem Verfahren nach ITU-T Rec. P.862 "PESQ" (Stand 2001) ermittelten Sprachqualitätswerte. Dieses Verfahren liefert durch den Vergleich eines ungestörten Quellsprachsignals mit dem zu bewertenden gestörten Sprachsignal einen Sprachqualitätswert. Diese beiden Sprachsignale dienen zur Bestimmung der mittleren Lautheit des Hintergrundgeräuschs entsprechend Patentanmeldung DE 101 20 168. Der so berechnete Wert N in sone wird zur Korrektur des mit dem Verfahren nach ITU-T Rec. P.862 "PESQ" (Stand 2001) berechneten Qualitätswertes benutzt. Die Korrektur des Qualitätswertes setzt erst ein, wenn N einen Wert von 1,3 sone überschritten hat (unterer Schwellwert): N1 = N -1,3, wenn N1 < 0, dann N1 = 0
   wenn N1 > 4, dann N1 = 4.
The exemplary embodiment presented here shows, by way of example, a correction of the speech quality values determined using the method according to ITU-T Rec. P.862 "PESQ" (as of 2001). This method provides a speech quality value by comparing an undisturbed source speech signal with the disturbed speech signal to be evaluated. These two speech signals are used to determine the average loudness of the background noise according to patent application DE 101 20 168. The thus calculated value N in sone is used to correct the quality value calculated by the method according to ITU-T Rec. P.862 "PESQ" (state 2001) used. The correction of the quality value does not start until N exceeds a value of 1.3 sone (lower threshold): N1 = N -1.3, if N1 <0, then N1 = 0
if N1> 4, then N1 = 4.

Der Wert der Schwellenüberschreitung N1 wird für die weiteren Schritte benutzt. Negative N1 werden nicht zugelassen, sondern auf N1 = 0 gesetzt. Ebenso wird N1 bei N1 = 4 begrenzt.The value of the threshold exceeded N1 is used for the further steps. Negative N1 are not allowed, but set to N1 = 0 . Likewise, N1 is limited at N1 = 4 .

In einem zweiten Schritt wird der Wert N1 mit einer Konstanten und dem Anteil der Sprachpausen im Gesamtsignal PA (PA = 0...1) gewichtet: N2 = 0,45 * N1 * PA In a second step, the value N1 is weighted with a constant and the proportion of speech pauses in the overall signal PA (PA = 0... 1) : N2 = 0.45 * N1 * PA

Im nächsten Schritt wird der Wert der Schwellüberschreitung YD ermittelt, d.h. um welchen Anteil die gemessene Sprachqualität Y den Sprachqualitätsschwellwert YS überschreitet. Der dafür erforderliche Sprachqualitätsschwellwert YS ist gleich 3.0: YD = Y - YS,    wenn YD < 0, dann YD = 0In the next step, the value of the threshold excess YD is determined, ie by what proportion the measured speech quality Y exceeds the speech quality threshold YS . The required language quality threshold YS is equal to 3.0: YD = Y - YS, if YD <0, then YD = 0

Der Korrekturwert YK, der die Reduktion des Sprachqualitätswertes durch das Hintergrundgeräusch in Sprachpausen beschreibt, ist der mit der Schwellwertüberschreitung gewichtete Intensitätswert N2: YK = YD * N2 The correction value YK, which describes the reduction of the speech quality value by the background noise in speech pauses, is the intensity value N2 weighted with the threshold exceeding: YK = YD * N2

Dieser Korrekturwert wird vom ursprünglichen Y subtrahiert und damit ergibt sich der korrigierte Wert für die Sprachqualität Y(korrigiert): Y(korrigiert) = Y- YK. This correction value is subtracted from the original Y, resulting in the corrected value for speech quality Y (corrected) : Y (corrected) = Y-YK.

Claims (2)

  1. Method for correction of measured speech quality values, the speech quality measurements being obtained by instrumental methods analyzing only the signal segments with speech activity, wherein:
    the noise in the speech pauses is taken into account in determining the speech quality values,
    the intensity of the background noise in the speech pauses is determined for this purpose and these characteristic values are used for the correction of the measured speech quality values, and
    the correction is performed by the means that the measured speech quality value is reduced by an amount calculated from one or more weighted characteristic intensity values of the background noise, the proportion of speech pauses in the speech signal being evaluated, and the measured speech quality value.
  2. Method according to claim 1, characterized in that the correction for speech quality values measured in conformity with the ITU-T standard P. 862 is performed by the means that the mean loudness in speech pauses is determined as a characteristic intensity value and the correction calculation takes into account the amount exceeding the threshold value of 1.3 sone but no more than a maximum of 4.0 sone, and said value is multiplied by a constant factor (0.45), the proportion of speech pauses in the signal as a whole, and the proportion of the measured speech quality value exceeding the speech quality threshold of 3.0, resulting in a correction value to be subtracted from the measured speech quality value in order to determine the corrected speech quality value.
EP02012790A 2001-08-29 2002-06-10 Method for the correction of measured speech quality values Expired - Lifetime EP1288914B1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE10142846A DE10142846A1 (en) 2001-08-29 2001-08-29 Procedure for the correction of measured speech quality values
DE10142846 2001-08-29

Publications (3)

Publication Number Publication Date
EP1288914A2 EP1288914A2 (en) 2003-03-05
EP1288914A3 EP1288914A3 (en) 2004-05-19
EP1288914B1 true EP1288914B1 (en) 2005-12-21

Family

ID=7697364

Family Applications (1)

Application Number Title Priority Date Filing Date
EP02012790A Expired - Lifetime EP1288914B1 (en) 2001-08-29 2002-06-10 Method for the correction of measured speech quality values

Country Status (3)

Country Link
EP (1) EP1288914B1 (en)
AT (1) ATE313846T1 (en)
DE (2) DE10142846A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2944640A1 (en) * 2009-04-17 2010-10-22 France Telecom METHOD AND DEVICE FOR OBJECTIVE EVALUATION OF THE VOICE QUALITY OF A SPEECH SIGNAL TAKING INTO ACCOUNT THE CLASSIFICATION OF THE BACKGROUND NOISE CONTAINED IN THE SIGNAL.

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FI94810C (en) * 1993-10-11 1995-10-25 Nokia Mobile Phones Ltd A method for identifying a poor GSM speech frame
US5684921A (en) * 1995-07-13 1997-11-04 U S West Technologies, Inc. Method and system for identifying a corrupted speech message signal
US5809414A (en) * 1995-11-22 1998-09-15 Northern Telecom Limited User out-of-range indication for digital wireless systems
SE506341C2 (en) * 1996-04-10 1997-12-08 Ericsson Telefon Ab L M Method and apparatus for reconstructing a received speech signal
EP0980064A1 (en) * 1998-06-26 2000-02-16 Ascom AG Method for carrying an automatic judgement of the transmission quality of audio signals
DE19840548C2 (en) * 1998-08-27 2001-02-15 Deutsche Telekom Ag Procedures for instrumental language quality determination
GB9911777D0 (en) * 1999-05-20 1999-07-21 Univ Southampton Transceiver

Also Published As

Publication number Publication date
ATE313846T1 (en) 2006-01-15
DE50205328D1 (en) 2006-01-26
DE10142846A1 (en) 2003-03-20
EP1288914A2 (en) 2003-03-05
EP1288914A3 (en) 2004-05-19

Similar Documents

Publication Publication Date Title
DE19952538C2 (en) Automatic gain control in a speech recognition system
DE60108401T2 (en) SYSTEM FOR INCREASING LANGUAGE QUALITY
DE60023517T2 (en) CLASSIFICATION OF SOUND SOURCES
DE69334139T2 (en) Testing of communication device
EP1386307B1 (en) Method and device for determining a quality measure for an audio signal
DE60122751T2 (en) METHOD AND DEVICE FOR OBJECTIVE EVALUATION OF LANGUAGE QUALITY WITHOUT REFERENCE SIGNAL
DE19647399C1 (en) Hearing-appropriate quality assessment of audio test signals
DE10017646A1 (en) Noise suppression in the time domain
DE60205232T2 (en) METHOD AND DEVICE FOR DETERMINING THE QUALITY OF A LANGUAGE SIGNAL
DE69918635T2 (en) Apparatus and method for speech processing
DE10214407C1 (en) Measuring, evaluating and noise recognition device for signal transmission or storage systems, has estimator which supplies estimated parameter to model system whose output indicates instantaneous noise and distortion
DE60222770T2 (en) IMPROVED METHOD FOR DETERMINING THE QUALITY OF A LANGUAGE SIGNAL
EP1077448A2 (en) Speech recognition taking into account fluctuations of loudness
DE102013011761A1 (en) Motor vehicle has estimating unit and background noise spectrum unit that are designed to open dynamic filter with low background noise and close with strong background noise
EP1048025B1 (en) Method for objective voice quality evaluation
EP1634277B1 (en) Extraction of test signal sections for measuring the quality of an audio signal
EP1382034B1 (en) Method for determining intensity parameters of background noise in speech pauses of voice signals
DE2021126A1 (en) Speech recognition device
DE60110541T2 (en) Method for speech recognition with noise-dependent normalization of the variance
EP1288914B1 (en) Method for the correction of measured speech quality values
DE60305306T2 (en) Apparatus and method for binaural quality assessment
EP1005016A2 (en) Method and circuit arrangement for measuring speech level in a speech processing system
DE102013005844B3 (en) Method for measuring quality of speech signal transmitted through e.g. voice over internet protocol, involves weighing partial deviations of each frames of time lengths of reference, and measuring speech signals by weighting factor
EP0902416B1 (en) Method and device for recognizing a speech input during an announcement
DE10026872A1 (en) Procedure for calculating a voice activity decision (Voice Activity Detector)

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

AX Request for extension of the european patent

Extension state: AL LT LV MK RO SI

RIC1 Information provided on ipc code assigned before grant

Ipc: 7H 04M 3/22 B

Ipc: 7G 10L 19/00 A

17P Request for examination filed

Effective date: 20041119

AKX Designation fees paid

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

GRAP Despatch of communication of intention to grant a patent

Free format text: ORIGINAL CODE: EPIDOSNIGR1

GRAS Grant fee paid

Free format text: ORIGINAL CODE: EPIDOSNIGR3

GRAA (expected) grant

Free format text: ORIGINAL CODE: 0009210

AK Designated contracting states

Kind code of ref document: B1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE TR

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: IT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT;WARNING: LAPSES OF ITALIAN PATENTS WITH EFFECTIVE DATE BEFORE 2007 MAY HAVE OCCURRED AT ANY TIME BEFORE 2007. THE CORRECT EFFECTIVE DATE MAY BE DIFFERENT FROM THE ONE RECORDED.

Effective date: 20051221

Ref country code: IE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: NL

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: FI

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

REG Reference to a national code

Ref country code: GB

Ref legal event code: FG4D

Free format text: NOT ENGLISH

REG Reference to a national code

Ref country code: CH

Ref legal event code: EP

REG Reference to a national code

Ref country code: IE

Ref legal event code: FG4D

Free format text: LANGUAGE OF EP DOCUMENT: GERMAN

REF Corresponds to:

Ref document number: 50205328

Country of ref document: DE

Date of ref document: 20060126

Kind code of ref document: P

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: SE

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

Ref country code: GR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

Ref country code: DK

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060321

GBT Gb: translation of ep patent filed (gb section 77(6)(a)/1977)

Effective date: 20060306

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: ES

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060401

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: PT

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20060522

NLV1 Nl: lapsed or annulled due to failure to fulfill the requirements of art. 29p and 29m of the patents act
PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: BE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: LI

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: CH

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

Ref country code: MC

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060630

REG Reference to a national code

Ref country code: IE

Ref legal event code: FD4D

ET Fr: translation filed
PLBE No opposition filed within time limit

Free format text: ORIGINAL CODE: 0009261

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: NO OPPOSITION FILED WITHIN TIME LIMIT

26N No opposition filed

Effective date: 20060922

REG Reference to a national code

Ref country code: CH

Ref legal event code: PL

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: AT

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060610

BERE Be: lapsed

Owner name: DEUTSCHE TELEKOM A.G.

Effective date: 20060630

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: TR

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

Ref country code: LU

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20060610

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: CY

Free format text: LAPSE BECAUSE OF FAILURE TO SUBMIT A TRANSLATION OF THE DESCRIPTION OR TO PAY THE FEE WITHIN THE PRESCRIBED TIME-LIMIT

Effective date: 20051221

REG Reference to a national code

Ref country code: FR

Ref legal event code: PLFP

Year of fee payment: 15

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: GB

Payment date: 20160628

Year of fee payment: 15

Ref country code: DE

Payment date: 20160622

Year of fee payment: 15

PGFP Annual fee paid to national office [announced via postgrant information from national office to epo]

Ref country code: FR

Payment date: 20160621

Year of fee payment: 15

REG Reference to a national code

Ref country code: DE

Ref legal event code: R119

Ref document number: 50205328

Country of ref document: DE

GBPC Gb: european patent ceased through non-payment of renewal fee

Effective date: 20170610

REG Reference to a national code

Ref country code: FR

Ref legal event code: ST

Effective date: 20180228

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: GB

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170610

Ref country code: DE

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20180103

PG25 Lapsed in a contracting state [announced via postgrant information from national office to epo]

Ref country code: FR

Free format text: LAPSE BECAUSE OF NON-PAYMENT OF DUE FEES

Effective date: 20170630