CN103797535B - 减少语音辨识系统中的漏报 - Google Patents

减少语音辨识系统中的漏报 Download PDF

Info

Publication number
CN103797535B
CN103797535B CN201280040735.6A CN201280040735A CN103797535B CN 103797535 B CN103797535 B CN 103797535B CN 201280040735 A CN201280040735 A CN 201280040735A CN 103797535 B CN103797535 B CN 103797535B
Authority
CN
China
Prior art keywords
score
consistence
component sound
identification result
time length
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201280040735.6A
Other languages
English (en)
Chinese (zh)
Other versions
CN103797535A (zh
Inventor
乔纳森·肖
彼得·韦尔默郎
斯蒂芬·萨顿
罗伯特·萨瓦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sensory Inc
Original Assignee
Sensory Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sensory Inc filed Critical Sensory Inc
Publication of CN103797535A publication Critical patent/CN103797535A/zh
Application granted granted Critical
Publication of CN103797535B publication Critical patent/CN103797535B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/10Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Auxiliary Devices For Music (AREA)
CN201280040735.6A 2011-08-24 2012-08-17 减少语音辨识系统中的漏报 Active CN103797535B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US13/217,134 2011-08-24
US13/217,134 US8781825B2 (en) 2011-08-24 2011-08-24 Reducing false positives in speech recognition systems
PCT/US2012/051345 WO2013028518A1 (en) 2011-08-24 2012-08-17 Reducing false positives in speech recognition systems

Publications (2)

Publication Number Publication Date
CN103797535A CN103797535A (zh) 2014-05-14
CN103797535B true CN103797535B (zh) 2016-06-08

Family

ID=47744890

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280040735.6A Active CN103797535B (zh) 2011-08-24 2012-08-17 减少语音辨识系统中的漏报

Country Status (5)

Country Link
US (1) US8781825B2 (enExample)
JP (1) JP6030135B2 (enExample)
CN (1) CN103797535B (enExample)
DE (1) DE112012003479T5 (enExample)
WO (1) WO2013028518A1 (enExample)

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8768707B2 (en) 2011-09-27 2014-07-01 Sensory Incorporated Background speech recognition assistant using speaker verification
CN104157284A (zh) * 2013-05-13 2014-11-19 佳能株式会社 语音命令检测方法和系统,以及信息处理系统
US9147397B2 (en) 2013-10-29 2015-09-29 Knowles Electronics, Llc VAD detection apparatus and method of operating the same
PL3065131T3 (pl) * 2015-03-06 2021-01-25 Zetes Industries S.A. Sposób i układ przetwarzania końcowego rezultatu rozpoznawania mowy
US10019992B2 (en) 2015-06-29 2018-07-10 Disney Enterprises, Inc. Speech-controlled actions based on keywords and context thereof
KR102437689B1 (ko) * 2015-09-16 2022-08-30 삼성전자주식회사 음성 인식 서버 및 그 제어 방법
WO2019047220A1 (zh) * 2017-09-11 2019-03-14 深圳传音通讯有限公司 一种应用程序启动方法及终端、计算机可读存储介质
US20230317070A1 (en) * 2022-03-31 2023-10-05 Vocollect, Inc. Apparatuses, systems, and methods for speech recognition by speech rate and hint-based techniques

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1162365A (zh) * 1994-11-01 1997-10-15 英国电讯公司 语音识别
US7657433B1 (en) * 2006-09-08 2010-02-02 Tellme Networks, Inc. Speech recognition accuracy with multi-confidence thresholds
GB2468203A (en) * 2009-02-27 2010-09-01 Autonomy Corp Ltd A speech recognition system using multiple resolution analysis

Family Cites Families (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4837831A (en) * 1986-10-15 1989-06-06 Dragon Systems, Inc. Method for creating and using multiple-word sound models in speech recognition
US5193142A (en) * 1990-11-15 1993-03-09 Matsushita Electric Industrial Co., Ltd. Training module for estimating mixture gaussian densities for speech-unit models in speech recognition systems
US5390278A (en) * 1991-10-08 1995-02-14 Bell Canada Phoneme based speech recognition
CA2088080C (en) * 1992-04-02 1997-10-07 Enrico Luigi Bocchieri Automatic speech recognizer
US5794198A (en) * 1994-10-28 1998-08-11 Nippon Telegraph And Telephone Corporation Pattern recognition method
US5893059A (en) * 1997-04-17 1999-04-06 Nynex Science And Technology, Inc. Speech recoginition methods and apparatus
JPH11311994A (ja) * 1998-04-30 1999-11-09 Sony Corp 情報処理装置および方法、並びに提供媒体
US6223155B1 (en) * 1998-08-14 2001-04-24 Conexant Systems, Inc. Method of independently creating and using a garbage model for improved rejection in a limited-training speaker-dependent speech recognition system
US6138095A (en) * 1998-09-03 2000-10-24 Lucent Technologies Inc. Speech recognition
US6266633B1 (en) 1998-12-22 2001-07-24 Itt Manufacturing Enterprises Noise suppression and channel equalization preprocessor for speech and speaker recognizers: method and apparatus
CN1366659A (zh) 2000-02-10 2002-08-28 皇家菲利浦电子有限公司 具有音调变化检测的纠错方法
EP1189202A1 (en) * 2000-09-18 2002-03-20 Sony International (Europe) GmbH Duration models for speech recognition
GB2370401A (en) * 2000-12-19 2002-06-26 Nokia Mobile Phones Ltd Speech recognition
US6959278B1 (en) * 2001-04-05 2005-10-25 Verizon Corporate Services Group Inc. Systems and methods for implementing segmentation in speech recognition systems
US7103542B2 (en) * 2001-12-14 2006-09-05 Ben Franklin Patent Holding Llc Automatically improving a voice recognition system
US6724866B2 (en) 2002-02-08 2004-04-20 Matsushita Electric Industrial Co., Ltd. Dialogue device for call screening and classification
JP4437047B2 (ja) * 2004-02-20 2010-03-24 本田技研工業株式会社 音声対話装置
JP4322785B2 (ja) * 2004-11-24 2009-09-02 株式会社東芝 音声認識装置、音声認識方法および音声認識プログラム
KR100655491B1 (ko) * 2004-12-21 2006-12-11 한국전자통신연구원 음성인식 시스템에서의 2단계 발화 검증 방법 및 장치
JP2007017733A (ja) * 2005-07-08 2007-01-25 Sharp Corp 入力装置、入力システム、入力方法、入力処理プログラム、および、プログラム記録媒体
CN1963917A (zh) * 2005-11-11 2007-05-16 株式会社东芝 评价语音的分辨力、说话人认证的注册和验证方法及装置
JP4758919B2 (ja) * 2007-01-22 2011-08-31 日本放送協会 音声認識装置及び音声認識プログラム
US9646603B2 (en) * 2009-02-27 2017-05-09 Longsand Limited Various apparatus and methods for a speech recognition system
US20110004473A1 (en) 2009-07-06 2011-01-06 Nice Systems Ltd. Apparatus and method for enhanced speech recognition

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1162365A (zh) * 1994-11-01 1997-10-15 英国电讯公司 语音识别
US7657433B1 (en) * 2006-09-08 2010-02-02 Tellme Networks, Inc. Speech recognition accuracy with multi-confidence thresholds
GB2468203A (en) * 2009-02-27 2010-09-01 Autonomy Corp Ltd A speech recognition system using multiple resolution analysis

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
A New Framework For Large Vocabulary Keyword Spotting Using Two-Pass Confidence Measure;Yingna Chen;Tao Hou;Sha Meng;Shan Zhong;Jia Liu,;《IEEE》;20061231;68-71 *
Discriminative keyword spotting;Joseph Keshet;David Grangier;Samy Bengio,;《Science Direct》;20081231;317-329 *

Also Published As

Publication number Publication date
US8781825B2 (en) 2014-07-15
CN103797535A (zh) 2014-05-14
DE112012003479T5 (de) 2014-05-22
JP2014524599A (ja) 2014-09-22
WO2013028518A1 (en) 2013-02-28
JP6030135B2 (ja) 2016-11-24
US20130054242A1 (en) 2013-02-28

Similar Documents

Publication Publication Date Title
CN103797535B (zh) 减少语音辨识系统中的漏报
US20230409102A1 (en) Low-power keyword spotting system
US10304440B1 (en) Keyword spotting using multi-task configuration
Mitra et al. Articulatory features from deep neural networks and their role in speech recognition
CN108417201B (zh) 单信道多说话人身份识别方法及系统
JPWO2009078256A1 (ja) 発音変動規則抽出装置、発音変動規則抽出方法、および発音変動規則抽出用プログラム
CN109036381A (zh) 语音处理方法及装置、计算机装置及可读存储介质
CN106710585B (zh) 语音交互过程中的多音字播报方法及系统
CN111433847A (zh) 语音转换的方法及训练方法、智能装置和存储介质
Seong et al. Dysarthric speech recognition error correction using weighted finite state transducers based on context–dependent pronunciation variation
CN115768342B (zh) 合成患者特定的语音模型
CN111145748B (zh) 音频识别置信度确定方法、装置、设备及存储介质
JP6373621B2 (ja) 話し方評価装置、話し方評価方法、プログラム
CN112420021A (zh) 学习方法、说话者识别方法以及记录介质
Chen et al. The USTC System for Voice Conversion Challenge 2016: Neural Network Based Approaches for Spectrum, Aperiodicity and F0 Conversion.
Mendiratta et al. Automatic speech recognition using optimal selection of features based on hybrid ABC-PSO
JPH0643895A (ja) 音声認識装置
Zhao et al. Time-frequency kernel-based CNN for speech recognition.
CN112634859B (zh) 用于文本相关说话人识别的数据增强方法及系统
JP7173339B2 (ja) 発話評価装置、発話評価方法、およびプログラム
Alam et al. Speaker Verification Under Adverse Conditions Using i-Vector Adaptation and Neural Networks.
Prabhu et al. Fuzzy logic based Nam speech recognition for Tamil syllables
JP2001109491A (ja) 連続音声認識装置および方法
JP2019045532A (ja) 音声認識装置、車載システム及びコンピュータプログラム
Sudoh et al. Post-dialogue confidence scoring for unsupervised statistical language model training

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant