WO2000013173A1 - Verfahren zur instrumentellen sprachqualitätsbestimmung - Google Patents
Verfahren zur instrumentellen sprachqualitätsbestimmung Download PDFInfo
- Publication number
- WO2000013173A1 WO2000013173A1 PCT/EP1999/005972 EP9905972W WO0013173A1 WO 2000013173 A1 WO2000013173 A1 WO 2000013173A1 EP 9905972 W EP9905972 W EP 9905972W WO 0013173 A1 WO0013173 A1 WO 0013173A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- spectral
- evaluated
- calculated
- signal
- speech signal
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 29
- 238000013441 quality evaluation Methods 0.000 title abstract 3
- 230000003595 spectral effect Effects 0.000 claims abstract description 55
- 230000010354 integration Effects 0.000 claims abstract description 3
- 238000004364 calculation method Methods 0.000 claims description 7
- 238000001228 spectrum Methods 0.000 claims 3
- 238000005457 optimization Methods 0.000 abstract description 3
- 239000013589 supplement Substances 0.000 abstract 1
- 230000006870 function Effects 0.000 description 12
- 230000005540 biological transmission Effects 0.000 description 7
- 238000013459 approach Methods 0.000 description 5
- 230000006735 deficit Effects 0.000 description 3
- 238000012360 testing method Methods 0.000 description 3
- 238000012937 correction Methods 0.000 description 2
- 238000011156 evaluation Methods 0.000 description 2
- 238000004891 communication Methods 0.000 description 1
- 230000000052 comparative effect Effects 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000001303 quality assessment method Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
Definitions
- the invention relates to a method for instrumental ("objective") speech quality determination, in which characteristic values for determining the speech quality (speech quality) are derived by comparing properties of a speech signal to be evaluated with properties of a reference speech signal (undisturbed signal).
- Speech quality determinations of speech signals are generally carried out by means of auditory ("subjective") examinations with test subjects.
- the aim of instrumental ("objective") methods for determining speech quality is to determine from the properties of the speech signal to be assessed, using suitable computing methods, characteristic values which describe the speech quality of the speech signal to be assessed, without having to resort to judgments from test subjects.
- the calculated parameters and the underlying method for instrumental language quality determination are considered recognized if a high correlation to the results of auditory comparative examinations is achieved.
- the language quality values obtained by means of auditory examinations thus represent the target values that are to be achieved by instrumental methods.
- Known methods for instrumental speech quality determination are based on a comparison of a reference speech signal with the speech signal to be evaluated.
- the reference speech signal and the speech signal to be evaluated are segmented into short time segments.
- the spectral properties of the two signals are compared in these segments.
- the spectral intensity map calculated in this way for each period of time under consideration can be understood as a series of numerical values in which the number of individual values corresponds to the number of frequency bands used, the numerical values themselves represent the calculated intensity values and a continuous index of the frequency bands describes the sequence of the numerical values.
- the limits of the frequency bands used are kept constant on the frequency axis.
- the calculated intensities of the speech signal to be evaluated and the reference speech signal in each band are compared with one another.
- the difference between the two values, or the similarity of the two resulting spectral intensity images, is the basis for the calculation of a quality value
- a disadvantage of the methods known today in such cases is that when comparing the speech signal to be evaluated with a reference speech signal, differences between the two signal sections in the selected display level flow into the quality characteristic to be calculated, which are not or hardly at all - also perceptible in the auditory test - lead to qualitative impairment.
- Frequency band limitations and spectral deformations of the speech signal to be evaluated e.g. caused by filter properties of the telephone device or the transmission channel
- the object of the invention is to reduce the influence of spectral limitations and deformations of the speech signal to be evaluated and of shifts in spectral short-term maxima before comparing the spectral properties of a signal to be tested with a reference speech signal and calculating a quality value in instrumental methods.
- a spectral weighting function is generated in the invention described here, which is based on medium spectral envelopes, e.g. the average spectral power density, based on the speech signal to be evaluated and the reference speech signal. This also enables the method to be used for non-linear and time-variant transmission.
- the spectral weighting function is calculated from the quotients of the base values of the mean spectral power density of the signal Phi ⁇ (f) to be evaluated and that of the input signal of the transmission system Phi ⁇ (f) in such a way that the weighting function over
- the evaluation function a (f) can weight the weighting function W ⁇ (f) differently over the effective range, in the simplest case it is constant 1.
- the spectral weighting function W ⁇ (f) calculated in this way approximates the mean spectral envelopes of the speech signal and the reference speech signal to be evaluated, so that differences between the two spectral envelopes are only incorporated to a reduced extent in the calculated quality value.
- the spectral weighting function W ⁇ (f) can be applied to the reference speech signal.
- the average spectral power density of the reference speech signal is approximated to the signal to be evaluated (FIG. 2a).
- the spectral weighting function can be applied inverted to the signal to be evaluated. This is equalized and, with regard to its average spectral power density, approximated to the reference speech signal (FIG. 2b).
- Another part of the invention relates to the correction of shifts in short-term spectral maxima caused by the transmission systems.
- the intensity is integrated in frequency bands for each time period.
- the result is a series of intensity values for each spectral representation of a signal section, each individual value representing the intensity in a frequency band.
- the shifts in short-term spectral maxima can lead to deviating calculated intensities in the frequency bands of the reference speech signal and the speech signal to be evaluated.
- variable band limits for calculating the spectral intensity mapping is not only limited to the signal in which the described spectral weighting function W ⁇ (f) is also used, but can also be applied to the other signal and even to both signals, ( see FIGS. 2a and 2b).
- a special exemplary embodiment shows an implementation according to FIG. 3, which is referred to as TOSQA (Telecommunication Objective Speech Quality Assessment). This involves advanced preprocessing of the reference speech signal.
- TOSQA Telecommunication Objective Speech Quality Assessment
- speech pauses are recognized here by means of a speech pause recognizer and do not go into the quality measure.
- the reference speech signal and the speech signal to be evaluated are also filtered with a bandpass 300 ... 3400 Hz and the frequency response of a telephone handset is filtered.
- the spectral power density is integrated in frequency groups, which form the basis for the calculation of the specific loudness.
- the calculated loudness patterns are supplemented by an error evaluation function.
- the calculated quality value is formed from the mean value of the co-correlation coefficients of the specific loudnesses for each short time segment under consideration from the number of evaluated speech segments.
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims
Priority Applications (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE59907623T DE59907623D1 (de) | 1998-08-27 | 1999-08-14 | Verfahren zur instrumentellen sprachqualitätsbestimmung |
EP99942871A EP1048025B1 (de) | 1998-08-27 | 1999-08-14 | Verfahren zur instrumentellen sprachqualitätsbestimmung |
US09/530,389 US7013266B1 (en) | 1998-08-27 | 1999-08-14 | Method for determining speech quality by comparison of signal properties |
AT99942871T ATE253765T1 (de) | 1998-08-27 | 1999-08-14 | Verfahren zur instrumentellen sprachqualitätsbestimmung |
CA002305652A CA2305652A1 (en) | 1998-08-27 | 1999-08-14 | Method for instrumental voice quality evaluation |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19840548A DE19840548C2 (de) | 1998-08-27 | 1998-08-27 | Verfahren zur instrumentellen Sprachqualitätsbestimmung |
DE19840548.0 | 1998-08-27 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000013173A1 true WO2000013173A1 (de) | 2000-03-09 |
Family
ID=7879918
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/EP1999/005972 WO2000013173A1 (de) | 1998-08-27 | 1999-08-14 | Verfahren zur instrumentellen sprachqualitätsbestimmung |
Country Status (6)
Country | Link |
---|---|
US (1) | US7013266B1 (de) |
EP (1) | EP1048025B1 (de) |
AT (1) | ATE253765T1 (de) |
CA (1) | CA2305652A1 (de) |
DE (2) | DE19840548C2 (de) |
WO (1) | WO2000013173A1 (de) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2001065543A1 (en) * | 2000-02-29 | 2001-09-07 | Telefonaktiebolaget Lm Ericsson (Publ) | Compensation for linear filtering using frequency weighting factors |
EP1241663A1 (de) * | 2001-03-13 | 2002-09-18 | Koninklijke KPN N.V. | Verfahren und Vorrichtung zur Sprachqualitätsbestimmung |
US7200561B2 (en) * | 2001-08-23 | 2007-04-03 | Nippon Telegraph And Telephone Corporation | Digital signal coding and decoding methods and apparatuses and programs therefor |
DE10142846A1 (de) * | 2001-08-29 | 2003-03-20 | Deutsche Telekom Ag | Verfahren zur Korrektur von gemessenen Sprachqualitätswerten |
DE10150519B4 (de) | 2001-10-12 | 2014-01-09 | Hewlett-Packard Development Co., L.P. | Verfahren und Anordnung zur Sprachverarbeitung |
US7305341B2 (en) | 2003-06-25 | 2007-12-04 | Lucent Technologies Inc. | Method of reflecting time/language distortion in objective speech quality assessment |
EP1492084B1 (de) * | 2003-06-25 | 2006-05-17 | Psytechnics Ltd | Vorrichtung und Verfahren zur binauralen Qualitätsbeurteilung |
CA2580763C (en) * | 2004-09-20 | 2014-07-29 | John Gerard Beerends | Frequency compensation for perceptual speech analysis |
EP2249333B1 (de) * | 2009-05-06 | 2014-08-27 | Nuance Communications, Inc. | Verfahren und Vorrichtung zur Schätzung einer Grundfrequenz eines Sprachsignals |
EP2474975B1 (de) * | 2010-05-21 | 2013-05-01 | SwissQual License AG | Verfahren zur Schätzung der Sprachqualität |
US9373341B2 (en) * | 2012-03-23 | 2016-06-21 | Dolby Laboratories Licensing Corporation | Method and system for bias corrected speech level determination |
CN112233693B (zh) * | 2020-10-14 | 2023-12-01 | 腾讯音乐娱乐科技(深圳)有限公司 | 一种音质评估方法、装置和设备 |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5621854A (en) * | 1992-06-24 | 1997-04-15 | British Telecommunications Public Limited Company | Method and apparatus for objective speech quality measurements of telecommunication equipment |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE3708002A1 (de) * | 1987-03-12 | 1988-09-22 | Telefonbau & Normalzeit Gmbh | Messverfahren zum beurteilen der guete von sprachcodierern und/oder uebertragungsstrecken |
US4860360A (en) * | 1987-04-06 | 1989-08-22 | Gte Laboratories Incorporated | Method of evaluating speech |
SE517836C2 (sv) * | 1995-02-14 | 2002-07-23 | Telia Ab | Metod och anordning för fastställande av talkvalitet |
NL9500512A (nl) * | 1995-03-15 | 1996-10-01 | Nederland Ptt | Inrichting voor het bepalen van de kwaliteit van een door een signaalbewerkingscircuit te genereren uitgangssignaal, alsmede werkwijze voor het bepalen van de kwaliteit van een door een signaalbewerkingscircuit te genereren uitgangssignaal. |
ATE205009T1 (de) * | 1996-05-21 | 2001-09-15 | Koninkl Kpn Nv | Vorrichtung und verfahren zur bestimmung der qualität eines ausgangssignals, das von einem signalverarbeitungsschaltkreis erzeugt werden soll |
-
1998
- 1998-08-27 DE DE19840548A patent/DE19840548C2/de not_active Expired - Fee Related
-
1999
- 1999-08-14 AT AT99942871T patent/ATE253765T1/de active
- 1999-08-14 DE DE59907623T patent/DE59907623D1/de not_active Expired - Lifetime
- 1999-08-14 EP EP99942871A patent/EP1048025B1/de not_active Expired - Lifetime
- 1999-08-14 WO PCT/EP1999/005972 patent/WO2000013173A1/de active IP Right Grant
- 1999-08-14 US US09/530,389 patent/US7013266B1/en not_active Expired - Lifetime
- 1999-08-14 CA CA002305652A patent/CA2305652A1/en not_active Abandoned
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5621854A (en) * | 1992-06-24 | 1997-04-15 | British Telecommunications Public Limited Company | Method and apparatus for objective speech quality measurements of telecommunication equipment |
Also Published As
Publication number | Publication date |
---|---|
DE19840548C2 (de) | 2001-02-15 |
ATE253765T1 (de) | 2003-11-15 |
DE19840548A1 (de) | 2000-03-02 |
EP1048025B1 (de) | 2003-11-05 |
CA2305652A1 (en) | 2000-03-09 |
US7013266B1 (en) | 2006-03-14 |
EP1048025A1 (de) | 2000-11-02 |
DE59907623D1 (de) | 2003-12-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60009206T2 (de) | Rauschunterdrückung mittels spektraler Subtraktion | |
DE69520067T2 (de) | Verfahren und Einrichtung zur Kennzeichnung eines Eingangssignales | |
DE69401514T2 (de) | Vom rechenaufwand her effiziente adaptive bitzuteilung für kodierverfahren und kodiereinrichtung | |
DE69613646T2 (de) | Verfahren zur Sprachdetektion bei starken Umgebungsgeräuschen | |
DE69529356T2 (de) | Wellenforminterpolation mittels Zerlegung in Rauschen und periodische Signalanteile | |
DE19952538C2 (de) | Automatische Verstärkungsregelung in einem Spracherkennungssystem | |
DE69420400T2 (de) | Verfahren und gerät zur sprechererkennung | |
DE69329511T2 (de) | Verfahren und Einrichtung zum Unterscheiden zwischen stimmhaften und stimmlosen Lauten | |
DE69535452T2 (de) | Verfahren und Vorrichtung zur Auswahl der Kodierrate in einem Vocoder mit Variabler Rate | |
DE3306730C2 (de) | ||
DE69121312T2 (de) | Geräuschsignalvorhersagevorrichtung | |
DE69423692T2 (de) | Sprachkodiergerät und Verfahren unter Verwendung von Klassifikationsregeln | |
EP0938831B1 (de) | Gehörangepasste qualitätsbeurteilung von audiosignalen | |
DE69730721T2 (de) | Verfahren und vorrichtungen zur geräuschkonditionierung von signalen welche audioinformationen darstellen in komprimierter und digitalisierter form | |
EP1048025B1 (de) | Verfahren zur instrumentellen sprachqualitätsbestimmung | |
DE60122751T2 (de) | Verfahren und vorrichtung für die objektive bewertung der sprachqualität ohne referenzsignal | |
DE69614937T2 (de) | Verfahren und System zur Spracherkennung mit verringerter Erkennungszeit unter Berücksichtigung von Veränderungen der Hintergrundgeräusche | |
DE69720134T2 (de) | Spracherkenner unter Verwendung von Grundfrequenzintensitätsdaten | |
DE602004010634T2 (de) | Verfahren und system zur sprachqualitätsvorhersage eines audioübertragungssystems | |
DE69616724T2 (de) | Verfahren und System für die Spracherkennung | |
DE3043516C2 (de) | Verfahren und Vorrichtung zur Spracherkennung | |
DE2636032B2 (de) | Elektrische Schaltungsanordnung zum Extrahieren der Grundschwingungsperiode aus einem Sprachsignal | |
DE19505435C1 (de) | Verfahren und Vorrichtung zum Bestimmen der Tonalität eines Audiosignals | |
DE69112855T2 (de) | Sprachsignalverarbeitungsvorrichtung. | |
DE10157535B4 (de) | Verfahren und Vorrichtung zur Reduzierung zufälliger, kontinuierlicher, instationärer Störungen in Audiosignalen |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
WWE | Wipo information: entry into national phase |
Ref document number: 1999942871 Country of ref document: EP |
|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): CA US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
ENP | Entry into the national phase |
Ref document number: 2305652 Country of ref document: CA Ref country code: CA Ref document number: 2305652 Kind code of ref document: A Format of ref document f/p: F |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWP | Wipo information: published in national office |
Ref document number: 1999942871 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 09530389 Country of ref document: US |
|
WWG | Wipo information: grant in national office |
Ref document number: 1999942871 Country of ref document: EP |