CN105378839B - 用于测量话语信号质量的系统和方法 - Google Patents
用于测量话语信号质量的系统和方法 Download PDFInfo
- Publication number
- CN105378839B CN105378839B CN201480036085.7A CN201480036085A CN105378839B CN 105378839 B CN105378839 B CN 105378839B CN 201480036085 A CN201480036085 A CN 201480036085A CN 105378839 B CN105378839 B CN 105378839B
- Authority
- CN
- China
- Prior art keywords
- quality
- distortion
- signal
- electronic device
- prospect
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/69—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for evaluating synthetic or decoded voice signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Telephone Function (AREA)
Applications Claiming Priority (13)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201361839796P | 2013-06-26 | 2013-06-26 | |
| US201361839800P | 2013-06-26 | 2013-06-26 | |
| US201361839807P | 2013-06-26 | 2013-06-26 | |
| US61/839,800 | 2013-06-26 | ||
| US61/839,807 | 2013-06-26 | ||
| US61/839,796 | 2013-06-26 | ||
| US201361876177P | 2013-09-10 | 2013-09-10 | |
| US61/876,177 | 2013-09-10 | ||
| US201361888945P | 2013-10-09 | 2013-10-09 | |
| US61/888,945 | 2013-10-09 | ||
| US14/314,019 US9679555B2 (en) | 2013-06-26 | 2014-06-24 | Systems and methods for measuring speech signal quality |
| US14/314,019 | 2014-06-24 | ||
| PCT/US2014/044163 WO2014210204A1 (en) | 2013-06-26 | 2014-06-25 | Systems and methods for measuring speech signal quality |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN105378839A CN105378839A (zh) | 2016-03-02 |
| CN105378839B true CN105378839B (zh) | 2019-03-19 |
Family
ID=52116446
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201480036085.7A Active CN105378839B (zh) | 2013-06-26 | 2014-06-25 | 用于测量话语信号质量的系统和方法 |
Country Status (6)
| Country | Link |
|---|---|
| US (2) | US9679555B2 (enExample) |
| EP (1) | EP3014613A1 (enExample) |
| JP (1) | JP6339187B2 (enExample) |
| KR (1) | KR20160023767A (enExample) |
| CN (1) | CN105378839B (enExample) |
| WO (2) | WO2014210208A1 (enExample) |
Families Citing this family (23)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9679555B2 (en) | 2013-06-26 | 2017-06-13 | Qualcomm Incorporated | Systems and methods for measuring speech signal quality |
| US10148526B2 (en) * | 2013-11-20 | 2018-12-04 | International Business Machines Corporation | Determining quality of experience for communication sessions |
| US11888919B2 (en) | 2013-11-20 | 2024-01-30 | International Business Machines Corporation | Determining quality of experience for communication sessions |
| CN105679335B (zh) * | 2015-12-21 | 2019-08-13 | 南京华苏科技有限公司 | 基于无线分析的语音质量评估方法及系统 |
| CN106920546B (zh) * | 2015-12-23 | 2020-03-20 | 小米科技有限责任公司 | 智能识别语音的方法及装置 |
| WO2017127367A1 (en) | 2016-01-19 | 2017-07-27 | Dolby Laboratories Licensing Corporation | Testing device capture performance for multiple speakers |
| US10090001B2 (en) | 2016-08-01 | 2018-10-02 | Apple Inc. | System and method for performing speech enhancement using a neural network-based combined symbol |
| CN108346434B (zh) * | 2017-01-24 | 2020-12-22 | 中国移动通信集团安徽有限公司 | 一种语音质量评估的方法和装置 |
| KR102017244B1 (ko) * | 2017-02-27 | 2019-10-21 | 한국전자통신연구원 | 자연어 인식 성능 개선 방법 및 장치 |
| KR102623514B1 (ko) * | 2017-10-23 | 2024-01-11 | 삼성전자주식회사 | 음성신호 처리장치 및 그 동작방법 |
| CN108874761A (zh) * | 2018-05-31 | 2018-11-23 | 阿里巴巴集团控股有限公司 | 一种智能写作方法和装置 |
| EP3598639A1 (en) | 2018-07-20 | 2020-01-22 | Sonion Nederland B.V. | An amplifier with a symmetric current profile |
| US10951169B2 (en) | 2018-07-20 | 2021-03-16 | Sonion Nederland B.V. | Amplifier comprising two parallel coupled amplifier units |
| WO2020225850A1 (ja) * | 2019-05-07 | 2020-11-12 | 日本電信電話株式会社 | 音響品質評価装置、音響品質評価方法、およびプログラム |
| US11178311B2 (en) * | 2019-08-21 | 2021-11-16 | Adobe Inc. | Context aware color reduction |
| US10965806B1 (en) | 2020-01-31 | 2021-03-30 | Noble Systems Corporation | Auto-correcting voice quality in real-time |
| JP2022036862A (ja) * | 2020-08-24 | 2022-03-08 | 日本放送協会 | 音声客観評価装置及びそのプログラム |
| US12153648B2 (en) * | 2021-10-15 | 2024-11-26 | Microsoft Technology Licensing, Llc | Quality estimation models for various signal characteristics |
| WO2024044246A1 (en) * | 2022-08-26 | 2024-02-29 | Dolby Laboratories Licensing Corporation | System and method for evaluation of an audio signal processing algorithm |
| US12363319B2 (en) | 2023-06-14 | 2025-07-15 | Microsoft Technology Licensing, Llc | Object-based context-based decoder correction |
| US12469507B2 (en) | 2023-06-14 | 2025-11-11 | Microsoft Technology Licensing, Llc | Predictive context-based decoder correction |
| US20240420707A1 (en) * | 2023-06-14 | 2024-12-19 | Microsoft Technology Licensing, Llc | Voice context-based decoder correction |
| CN117610416B (zh) * | 2023-11-24 | 2024-08-02 | 四川大学 | 一种基于有限差分无监督学习的噪声传播快速预测方法 |
Citations (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1617222A (zh) * | 2003-06-25 | 2005-05-18 | 朗迅科技公司 | 客观语音质量评估中反映时间/语言失真的方法 |
| CN102496369A (zh) * | 2011-12-23 | 2012-06-13 | 中国传媒大学 | 一种基于失真校正的压缩域音频质量客观评价方法 |
| CN102549657A (zh) * | 2009-08-14 | 2012-07-04 | 皇家Kpn公司 | 用于确定音频系统的感知质量的方法和系统 |
| EP2595153A1 (en) * | 2011-11-18 | 2013-05-22 | Samsung Electronics Co., Ltd | Sound quality evaluation apparatus and method thereof |
| EP2595145A1 (en) * | 2011-11-17 | 2013-05-22 | Nederlandse Organisatie voor toegepast -natuurwetenschappelijk onderzoek TNO | Method of and apparatus for evaluating intelligibility of a degraded speech signal |
Family Cites Families (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CA2230188A1 (en) | 1998-03-27 | 1999-09-27 | William C. Treurniet | Objective audio quality measurement |
| JP4240878B2 (ja) | 2001-12-13 | 2009-03-18 | 四一 安藤 | 音声認識方法及び音声認識装置 |
| FR2835125B1 (fr) | 2002-01-24 | 2004-06-18 | Telediffusion De France Tdf | Procede d'evaluation d'un signal audio numerique |
| US7308403B2 (en) | 2002-07-01 | 2007-12-11 | Lucent Technologies Inc. | Compensation for utterance dependent articulation for speech quality assessment |
| US7327985B2 (en) * | 2003-01-21 | 2008-02-05 | Telefonaktiebolaget Lm Ericsson (Publ) | Mapping objective voice quality metrics to a MOS domain for field measurements |
| US7707031B2 (en) * | 2005-03-01 | 2010-04-27 | Telefonaktiebolaget Lm Ericsson (Publ) | Large scale measurement of subjective quality in mobile communications systems |
| US20060200346A1 (en) * | 2005-03-03 | 2006-09-07 | Nortel Networks Ltd. | Speech quality measurement based on classification estimation |
| US7856355B2 (en) | 2005-07-05 | 2010-12-21 | Alcatel-Lucent Usa Inc. | Speech quality assessment method and system |
| WO2007089189A1 (en) * | 2006-01-31 | 2007-08-09 | Telefonaktiebolaget Lm Ericsson (Publ). | Non-intrusive signal quality assessment |
| US20070203694A1 (en) | 2006-02-28 | 2007-08-30 | Nortel Networks Limited | Single-sided speech quality measurement |
| EP2028651A1 (en) | 2007-08-24 | 2009-02-25 | Sound Intelligence B.V. | Method and apparatus for detection of specific input signal contributions |
| US8238563B2 (en) | 2008-03-20 | 2012-08-07 | University of Surrey-H4 | System, devices and methods for predicting the perceived spatial quality of sound processing and reproducing equipment |
| WO2010031109A1 (en) | 2008-09-19 | 2010-03-25 | Newsouth Innovations Pty Limited | Method of analysing an audio signal |
| US20120020484A1 (en) | 2009-01-30 | 2012-01-26 | Telefonaktiebolaget Lm Ericsson (Publ) | Audio Signal Quality Prediction |
| FR2944640A1 (fr) | 2009-04-17 | 2010-10-22 | France Telecom | Procede et dispositif d'evaluation objective de la qualite vocale d'un signal de parole prenant en compte la classification du bruit de fond contenu dans le signal. |
| JP5606764B2 (ja) | 2010-03-31 | 2014-10-15 | クラリオン株式会社 | 音質評価装置およびそのためのプログラム |
| US8447596B2 (en) | 2010-07-12 | 2013-05-21 | Audience, Inc. | Monaural noise suppression based on computational auditory scene analysis |
| US9679555B2 (en) | 2013-06-26 | 2017-06-13 | Qualcomm Incorporated | Systems and methods for measuring speech signal quality |
-
2014
- 2014-06-24 US US14/314,019 patent/US9679555B2/en active Active
- 2014-06-24 US US14/314,022 patent/US9830905B2/en active Active
- 2014-06-25 WO PCT/US2014/044168 patent/WO2014210208A1/en not_active Ceased
- 2014-06-25 EP EP14739335.9A patent/EP3014613A1/en not_active Withdrawn
- 2014-06-25 CN CN201480036085.7A patent/CN105378839B/zh active Active
- 2014-06-25 KR KR1020167000189A patent/KR20160023767A/ko not_active Withdrawn
- 2014-06-25 JP JP2016523900A patent/JP6339187B2/ja active Active
- 2014-06-25 WO PCT/US2014/044163 patent/WO2014210204A1/en not_active Ceased
Patent Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1617222A (zh) * | 2003-06-25 | 2005-05-18 | 朗迅科技公司 | 客观语音质量评估中反映时间/语言失真的方法 |
| CN100573662C (zh) * | 2003-06-25 | 2009-12-23 | 朗迅科技公司 | 客观语音质量评估中反映时间和语言失真的方法和系统 |
| CN102549657A (zh) * | 2009-08-14 | 2012-07-04 | 皇家Kpn公司 | 用于确定音频系统的感知质量的方法和系统 |
| EP2595145A1 (en) * | 2011-11-17 | 2013-05-22 | Nederlandse Organisatie voor toegepast -natuurwetenschappelijk onderzoek TNO | Method of and apparatus for evaluating intelligibility of a degraded speech signal |
| EP2595153A1 (en) * | 2011-11-18 | 2013-05-22 | Samsung Electronics Co., Ltd | Sound quality evaluation apparatus and method thereof |
| CN102496369A (zh) * | 2011-12-23 | 2012-06-13 | 中国传媒大学 | 一种基于失真校正的压缩域音频质量客观评价方法 |
Non-Patent Citations (2)
| Title |
|---|
| "Objective evaluation of speech signal quality by the prediction of multiple foreground diagnostic acceptability measure attributes";SEN DEEP ET AL;《THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA》;20120501;第131卷(第5期);全文 * |
| "Predicting foreground SH,SL and BNH dam scores for multidimensional objective measure of speech quality";SEN D;《ACOUSTICS,SPEECH,AND SIGNAL PROCESSING》;20040517;第1卷;全文 * |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2014210204A1 (en) | 2014-12-31 |
| US9679555B2 (en) | 2017-06-13 |
| US20150006164A1 (en) | 2015-01-01 |
| JP2016525702A (ja) | 2016-08-25 |
| JP6339187B2 (ja) | 2018-06-06 |
| EP3014613A1 (en) | 2016-05-04 |
| CN105378839A (zh) | 2016-03-02 |
| US20150006162A1 (en) | 2015-01-01 |
| WO2014210208A1 (en) | 2014-12-31 |
| KR20160023767A (ko) | 2016-03-03 |
| US9830905B2 (en) | 2017-11-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN105378839B (zh) | 用于测量话语信号质量的系统和方法 | |
| Vary et al. | Digital Speech Transmission and Enhancement | |
| Torcoli et al. | Objective measures of perceptual audio quality reviewed: An evaluation of their application domain dependence | |
| Falk et al. | A non-intrusive quality and intelligibility measure of reverberant and dereverberated speech | |
| CN107293286B (zh) | 一种基于网络配音游戏的语音样本收集方法 | |
| Kleijn et al. | Optimizing speech intelligibility in a noisy environment: A unified view | |
| Dubey et al. | Non-intrusive speech quality assessment using several combinations of auditory features | |
| CN113838471A (zh) | 基于神经网络的降噪方法、系统、电子设备及存储介质 | |
| US20220122623A1 (en) | Real-Time Voice Timbre Style Transform | |
| Sen et al. | Objective evaluation of speech signal quality by the prediction of multiple foreground diagnostic acceptability measure attributes | |
| Lei et al. | Enhancing real-world far-field speech with supervised adversarial training | |
| Deng et al. | Modeling and estimating acoustic transfer functions of external ears with or without headphones | |
| Beerends et al. | Degradation decomposition of the perceived quality of speech signals on the basis of a perceptual modeling approach | |
| Mawalim et al. | OBISHI: objective binaural intelligibility score for the hearing impaired | |
| Zheng et al. | Evaluation of deep marginal feedback cancellation for hearing aids using speech and music | |
| Ashkanichenarlogh et al. | Towards Clinically Feasible Nonintrusive Quality and Intelligibility Indices for Hearing Aids | |
| Kitawaki et al. | Objective quality assessment of wideband speech coding | |
| Andersen | Speech intelligibility prediction for hearing aid systems | |
| Möller et al. | Analytic assessment of telephone transmission impact on ASR performance using a simulation model | |
| Salehi et al. | Nonintrusive speech quality estimation based on Perceptual Linear Prediction | |
| Kobayashi et al. | Performance Evaluation of an Ambient Noise Clustering Method for Objective Speech Intelligibility Estimation | |
| Voran | Estimation of speech intelligibility and quality | |
| Zheng et al. | On objective assessment of audio quality—A review | |
| Reimes | Instrumental assessment of near-end perceived listening effort | |
| PATRICK | DEVELOPMENT OF AN IMPROVED LOGISTIC MAPPINGFUNCTION FOR OBJECTIVE ASSESSMENT OF QUALITY OF RECEIVED SPEECH OVER MOBILE TELEPHONE NETWORKS |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| GR01 | Patent grant | ||
| GR01 | Patent grant |