CA2891453C - Method of and apparatus for evaluating intelligibility of a degraded speech signal - Google Patents

Info

Publication number
CA2891453C
Authority
CA
Canada
Prior art keywords
signal
degraded
speech
frames
disturbance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CA2891453A
Other languages
English (en)
French (fr)
Other versions
CA2891453A1 (en)
Inventor
John Gerard Beerends
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nederlandse Organisatie voor Toegepast Natuurwetenschappelijk Onderzoek TNO
Original Assignee
Nederlandse Organisatie voor Toegepast Natuurwetenschappelijk Onderzoek TNO
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nederlandse Organisatie voor Toegepast Natuurwetenschappelijk Onderzoek TNO
Publication of CA2891453A1
Application granted
Publication of CA2891453C
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/69 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/005 Correction of errors induced by the transmission channel, if related to the coding algorithm
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208 Noise filtering
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11C STATIC STORES
    • G11C2207/00 Indexing scheme relating to arrangements for writing information into, or reading information out from, a digital store
    • G11C2207/16 Solid state audio
    • G PHYSICS
    • G11 INFORMATION STORAGE
    • G11C STATIC STORES
    • G11C7/00 Arrangements for writing information into, or reading information out from, a digital store
    • G11C7/16 Storage of analogue signals in digital stores using an arrangement comprising analogue/digital [A/D] converters, digital memories and digital/analogue [D/A] converters

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Detection And Prevention Of Errors In Transmission (AREA)
CA2891453A 2012-11-16 2013-11-15 Method of and apparatus for evaluating intelligibility of a degraded speech signal Active CA2891453C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP12193082.0 2012-11-16
EP12193082.0A EP2733700A1 (en) 2012-11-16 2012-11-16 Method of and apparatus for evaluating intelligibility of a degraded speech signal
PCT/NL2013/050824 WO2014077690A1 (en) 2012-11-16 2013-11-15 Method of and apparatus for evaluating intelligibility of a degraded speech signal

Publications (2)

Publication Number Publication Date
CA2891453A1 (en) 2014-05-22
CA2891453C (en) 2023-10-10

Family

ID=47216118

Family Applications (1)

Application Number Title Priority Date Filing Date
CA2891453A Active CA2891453C (en) 2012-11-16 2013-11-15 Method of and apparatus for evaluating intelligibility of a degraded speech signal

Country Status (7)

Country Link
US (1) US9472202B2 (en)
EP (2) EP2733700A1 (en)
JP (1) JP6522508B2 (ja)
CN (1) CN104919525B (zh)
AU (1) AU2013345546B2 (en)
CA (1) CA2891453C (en)
WO (1) WO2014077690A1 (en)

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2595145A1 (en) * 2011-11-17 2013-05-22 Nederlandse Organisatie voor toegepast-natuurwetenschappelijk onderzoek TNO Method of and apparatus for evaluating intelligibility of a degraded speech signal
US10255487B2 (en) * 2015-12-24 2019-04-09 Casio Computer Co., Ltd. Emotion estimation apparatus using facial images of target individual, emotion estimation method, and non-transitory computer readable medium
WO2017127367A1 (en) 2016-01-19 2017-07-27 Dolby Laboratories Licensing Corporation Testing device capture performance for multiple speakers
CN106409287B (zh) * 2016-12-12 2019-12-13 天津大学 Device and method for improving speech intelligibility of patients with muscular atrophy or neurodegenerative disease
US10726855B2 (en) * 2017-03-15 2020-07-28 Guardian Glass, Llc. Speech privacy system and/or associated method
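CN107895582A (zh) * 2017-10-16 2018-04-10 中国电子科技集团公司第二十八研究所 Speaker-adaptive speech emotion recognition method for the multi-source information domain
CN107958673B (zh) * 2017-11-28 2021-05-11 北京先声教育科技有限公司 Spoken language scoring method and device
CN111785292B (zh) * 2020-05-19 2023-03-31 厦门快商通科技股份有限公司 Speech reverberation intensity estimation method, device and storage medium based on image recognition
CN117711435A (zh) * 2023-12-20 2024-03-15 书行科技(北京)有限公司 Audio processing method and device, electronic equipment and computer-readable storage medium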

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0809236B1 (en) * 1996-05-21 2001-08-29 Koninklijke KPN N.V. Device for determining the quality of an output signal to be generated by a signal processing circuit, and also method
EP1241663A1 (en) * 2001-03-13 2002-09-18 Koninklijke KPN N.V. Method and device for determining the quality of speech signal
EP1465156A1 (en) * 2003-03-31 2004-10-06 Koninklijke KPN N.V. Method and system for determining the quality of a speech signal
ES2313413T3 (es) * 2004-09-20 2009-03-01 Nederlandse Organisatie Voor Toegepast-Natuurwetenschappelijk Onderzoek Tno Frequency compensation for perceptual speech analysis
JP4745916B2 (ja) * 2006-06-07 2011-08-10 日本電信電話株式会社 Noise-suppressed speech quality estimation device, method and program
ATE470931T1 (de) * 2007-10-11 2010-06-15 Koninkl Kpn Nv Method and system for measuring the speech intelligibility of an audio transmission system
CN101609686B (zh) * 2009-07-28 2011-09-14 南京大学 Objective evaluation method based on subjective evaluation of speech enhancement algorithms
ES2526126T3 (es) * 2009-08-14 2015-01-07 Koninklijke Kpn N.V. Method, computer program product and system for determining a perceived quality of an audio system
DK2465113T3 (en) * 2009-08-14 2015-04-07 Koninkl Kpn Nv Method, computer program product and system for determining a perceived quality of an audio system

Also Published As

Publication number Publication date
EP2920785B1 (en) 2018-08-08
EP2920785A1 (en) 2015-09-23
AU2013345546B2 (en) 2018-08-30
CA2891453A1 (en) 2014-05-22
JP2015535100A (ja) 2015-12-07
US20150340047A1 (en) 2015-11-26
EP2733700A1 (en) 2014-05-21
AU2013345546A1 (en) 2015-06-11
CN104919525B (zh) 2018-02-06
US9472202B2 (en) 2016-10-18
JP6522508B2 (ja) 2019-05-29
WO2014077690A1 (en) 2014-05-22
CN104919525A (zh) 2015-09-16

Similar Documents

Publication Publication Date Title
CA2891453C (en) Method of and apparatus for evaluating intelligibility of a degraded speech signal
EP3120356B1 (en) Method of and apparatus for evaluating quality of a degraded speech signal
EP2780909B1 (en) Method of and apparatus for evaluating intelligibility of a degraded speech signal
JP4879180B2 (ja) Frequency compensation for perceptual speech analysis
WO2011018428A1 (en) Method and system for determining a perceived quality of an audio system
US8818798B2 (en) Method and system for determining a perceived quality of an audio system
WO2009046949A1 (en) Method and system for speech intelligibility measurement of an audio transmission system
US9659565B2 (en) Method of and apparatus for evaluating intelligibility of a degraded speech signal, through providing a difference function representing a difference between signal frames and an output signal indicative of a derived quality parameter
US20230260528A1 (en) Method of determining a perceptual impact of reverberation on a perceived quality of a signal, as well as computer program product

Legal Events

Date Code Title Description
EEER Examination request

Effective date: 20181012