CN100524458C - 用于操作语音识别系统的方法 - Google Patents

用于操作语音识别系统的方法 Download PDF

Info

Publication number
CN100524458C
CN100524458C CNB2003801025294A CN200380102529A CN100524458C CN 100524458 C CN100524458 C CN 100524458C CN B2003801025294 A CNB2003801025294 A CN B2003801025294A CN 200380102529 A CN200380102529 A CN 200380102529A CN 100524458 C CN100524458 C CN 100524458C
Authority
CN
China
Prior art keywords
quality
noise
reception
speech recognition
recognition system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB2003801025294A
Other languages
English (en)
Chinese (zh)
Other versions
CN1708782A (zh
Inventor
A·库伊曼
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nuance Communications Inc
Original Assignee
Koninklijke Philips Electronics NV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics NV filed Critical Koninklijke Philips Electronics NV
Publication of CN1708782A publication Critical patent/CN1708782A/zh
Application granted granted Critical
Publication of CN100524458C publication Critical patent/CN100524458C/zh
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Telephonic Communication Services (AREA)
  • Monitoring And Testing Of Transmission In General (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Telephone Function (AREA)
  • Machine Translation (AREA)
  • Selective Calling Equipment (AREA)
CNB2003801025294A 2002-11-02 2003-10-24 用于操作语音识别系统的方法 Expired - Fee Related CN100524458C (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE10251113.6 2002-11-02
DE10251113A DE10251113A1 (de) 2002-11-02 2002-11-02 Verfahren zum Betrieb eines Spracherkennungssystems

Publications (2)

Publication Number Publication Date
CN1708782A CN1708782A (zh) 2005-12-14
CN100524458C true CN100524458C (zh) 2009-08-05

Family

ID=32115143

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2003801025294A Expired - Fee Related CN100524458C (zh) 2002-11-02 2003-10-24 用于操作语音识别系统的方法

Country Status (8)

Country Link
US (1) US8781826B2 (enExample)
EP (1) EP1561203B1 (enExample)
JP (2) JP2006505003A (enExample)
CN (1) CN100524458C (enExample)
AT (1) ATE421139T1 (enExample)
AU (1) AU2003269418A1 (enExample)
DE (2) DE10251113A1 (enExample)
WO (1) WO2004042698A1 (enExample)

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE10224816A1 (de) * 2002-06-05 2003-12-24 Philips Intellectual Property Eine mobile Einheit und ein Verfahren zur Steuerung einer mobilen Einheit
JP2007501444A (ja) * 2003-05-08 2007-01-25 ボイス シグナル テクノロジーズ インコーポレイテッド 信号対雑音比による音声認識方法
US7406422B2 (en) * 2004-07-20 2008-07-29 Hewlett-Packard Development Company, L.P. Techniques for improving collaboration effectiveness
US9135913B2 (en) * 2006-05-26 2015-09-15 Nec Corporation Voice input system, interactive-type robot, voice input method, and voice input program
EP2107553B1 (en) * 2008-03-31 2011-05-18 Harman Becker Automotive Systems GmbH Method for determining barge-in
EP2148325B1 (en) * 2008-07-22 2014-10-01 Nuance Communications, Inc. Method for determining the presence of a wanted signal component
JP5156043B2 (ja) * 2010-03-26 2013-03-06 株式会社東芝 音声判別装置
DE102010055297A1 (de) * 2010-12-21 2012-06-21 Brose Fahrzeugteile Gmbh & Co. Kommanditgesellschaft, Hallstadt Verfahren zur Erzeugung einer Bedienmeldung beim Auftreten eines Bedienereignisses
DE112012006165T5 (de) * 2012-03-30 2015-01-08 Intel Corporation Touchscreen-Anwenderschnittstelle mit Spracheingabe
KR101987255B1 (ko) * 2012-08-20 2019-06-11 엘지이노텍 주식회사 음성 인식 장치 및 이의 음성 인식 방법
CN103065631B (zh) * 2013-01-24 2015-07-29 华为终端有限公司 一种语音识别的方法、装置
CN103971680B (zh) * 2013-01-24 2018-06-05 华为终端(东莞)有限公司 一种语音识别的方法、装置
US20140358535A1 (en) * 2013-05-28 2014-12-04 Samsung Electronics Co., Ltd. Method of executing voice recognition of electronic device and electronic device using the same
US9293135B2 (en) * 2013-07-02 2016-03-22 Volkswagen Ag Countermeasures for voice recognition deterioration due to exterior noise from passing vehicles
US9613619B2 (en) 2013-10-30 2017-04-04 Genesys Telecommunications Laboratories, Inc. Predicting recognition quality of a phrase in automatic speech recognition systems
GB2523984B (en) * 2013-12-18 2017-07-26 Cirrus Logic Int Semiconductor Ltd Processing received speech data
CN104767652B (zh) * 2014-01-08 2020-01-17 杜比实验室特许公司 监视数字传输环境性能的方法
US9516165B1 (en) * 2014-03-26 2016-12-06 West Corporation IVR engagements and upfront background noise
US9953646B2 (en) 2014-09-02 2018-04-24 Belleau Technologies Method and system for dynamic speech recognition and tracking of prewritten script
CN107147972A (zh) * 2016-03-01 2017-09-08 卡讯电子股份有限公司 音频讯号输出控制方法及系统
US10283138B2 (en) * 2016-10-03 2019-05-07 Google Llc Noise mitigation for a voice interface device
US10923101B2 (en) * 2017-12-26 2021-02-16 International Business Machines Corporation Pausing synthesized speech output from a voice-controlled device
CN108986796A (zh) * 2018-06-21 2018-12-11 广东小天才科技有限公司 一种语音搜索方法及装置
JP7388006B2 (ja) * 2019-06-03 2023-11-29 コニカミノルタ株式会社 画像処理装置及びプログラム
KR20190084912A (ko) * 2019-06-28 2019-07-17 엘지전자 주식회사 사용자의 액션에 따라 제어 가능한 인공 지능 장치 및 그의 동작 방법
KR20210017392A (ko) * 2019-08-08 2021-02-17 삼성전자주식회사 전자 장치 및 이의 음성 인식 방법
US11037571B2 (en) * 2019-10-04 2021-06-15 Motorola Solutions, Inc. Speech-based two-way radio assistant

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1242553A (zh) * 1998-03-24 2000-01-26 松下电器产业株式会社 用于噪声环境的语音检测系统
WO2000072307A1 (en) * 1999-05-25 2000-11-30 Koninklijke Kpn N.V. Speech-processing system
CN1300417A (zh) * 1999-04-19 2001-06-20 摩托罗拉公司 使用外部语音活动检测的噪声抑制
US6336091B1 (en) * 1999-01-22 2002-01-01 Motorola, Inc. Communication device for screening speech recognizer input
EP1085501A3 (en) * 1999-09-14 2002-01-09 Canon Kabushiki Kaisha Client-server based speech recognition
US20020019734A1 (en) * 2000-06-29 2002-02-14 Bartosik Heinrich Franz Recording apparatus for recording speech information for a subsequent off-line speech recognition

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4720802A (en) * 1983-07-26 1988-01-19 Lear Siegler Noise compensation arrangement
JP2589468B2 (ja) * 1986-02-18 1997-03-12 松下電器産業株式会社 音声認識装置
GB8608289D0 (en) * 1986-04-04 1986-05-08 Pa Consulting Services Noise compensation in speech recognition
US5033088A (en) * 1988-06-06 1991-07-16 Voice Processing Corp. Method and apparatus for effectively receiving voice input to a voice recognition system
JPH0675588A (ja) * 1992-08-27 1994-03-18 Fujitsu Ltd 音声認識装置
US5870705A (en) * 1994-10-21 1999-02-09 Microsoft Corporation Method of setting input levels in a voice recognition system
WO1996025733A1 (en) * 1995-02-15 1996-08-22 British Telecommunications Public Limited Company Voice activity detection
FI100840B (fi) * 1995-12-12 1998-02-27 Nokia Mobile Phones Ltd Kohinanvaimennin ja menetelmä taustakohinan vaimentamiseksi kohinaises ta puheesta sekä matkaviestin
US5765130A (en) * 1996-05-21 1998-06-09 Applied Language Technologies, Inc. Method and apparatus for facilitating speech barge-in in connection with voice recognition systems
CA2292959A1 (en) * 1997-05-06 1998-11-12 Speechworks International, Inc. System and method for developing interactive speech applications
US5956675A (en) 1997-07-31 1999-09-21 Lucent Technologies Inc. Method and apparatus for word counting in continuous speech recognition useful for reliable barge-in and early end of speech detection
US5970446A (en) * 1997-11-25 1999-10-19 At&T Corp Selective noise/channel/coding models and recognizers for automatic speech recognition
JPH11352995A (ja) * 1998-06-08 1999-12-24 Toshiba Tec Corp 音声認識装置
JP3893763B2 (ja) 1998-08-17 2007-03-14 富士ゼロックス株式会社 音声検出装置
US6246986B1 (en) * 1998-12-31 2001-06-12 At&T Corp. User barge-in enablement in large vocabulary speech recognition systems
US6574601B1 (en) * 1999-01-13 2003-06-03 Lucent Technologies Inc. Acoustic speech recognizer system and method
US6381570B2 (en) * 1999-02-12 2002-04-30 Telogy Networks, Inc. Adaptive two-threshold method for discriminating noise from speech in a communication signal
US6505155B1 (en) * 1999-05-06 2003-01-07 International Business Machines Corporation Method and system for automatically adjusting prompt feedback based on predicted recognition accuracy
US6724864B1 (en) * 2000-01-20 2004-04-20 Comverse, Inc. Active prompts
WO2001056015A1 (en) * 2000-01-27 2001-08-02 Koninklijke Philips Electronics N.V. Speech detection device having two switch-off criterions
US6466654B1 (en) * 2000-03-06 2002-10-15 Avaya Technology Corp. Personal virtual assistant with semantic tagging
JP3903410B2 (ja) * 2000-06-01 2007-04-11 三菱電機株式会社 音声入力制御システム
GB2367467B (en) * 2000-09-30 2004-12-15 Mitel Corp Noise level calculator for echo canceller
US7117442B1 (en) 2001-02-01 2006-10-03 International Business Machines Corporation Efficient presentation of database query results through audio user interfaces
JP2002244696A (ja) * 2001-02-20 2002-08-30 Kenwood Corp 音声認識による制御装置
US6754310B1 (en) * 2001-03-08 2004-06-22 3Com Corporation Telephony interface device for providing diagnostic information to a telephone
JP2002297186A (ja) 2001-03-30 2002-10-11 Kddi Corp 音声認識装置
CN1266625C (zh) 2001-05-04 2006-07-26 微软公司 用于web启用的识别的服务器
US20030046069A1 (en) * 2001-08-28 2003-03-06 Vergin Julien Rivarol Noise reduction system and method
US7069221B2 (en) * 2001-10-26 2006-06-27 Speechworks International, Inc. Non-target barge-in detection
US7295982B1 (en) * 2001-11-19 2007-11-13 At&T Corp. System and method for automatic verification of the understandability of speech
US7103542B2 (en) * 2001-12-14 2006-09-05 Ben Franklin Patent Holding Llc Automatically improving a voice recognition system
JP3984526B2 (ja) * 2002-10-21 2007-10-03 富士通株式会社 音声対話システム及び方法

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1242553A (zh) * 1998-03-24 2000-01-26 松下电器产业株式会社 用于噪声环境的语音检测系统
US6336091B1 (en) * 1999-01-22 2002-01-01 Motorola, Inc. Communication device for screening speech recognizer input
CN1300417A (zh) * 1999-04-19 2001-06-20 摩托罗拉公司 使用外部语音活动检测的噪声抑制
WO2000072307A1 (en) * 1999-05-25 2000-11-30 Koninklijke Kpn N.V. Speech-processing system
EP1085501A3 (en) * 1999-09-14 2002-01-09 Canon Kabushiki Kaisha Client-server based speech recognition
US20020019734A1 (en) * 2000-06-29 2002-02-14 Bartosik Heinrich Franz Recording apparatus for recording speech information for a subsequent off-line speech recognition

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
Intelligent Barge-in In Conversational Systems. N.Strom and S.Seneff.Proceeding of The International Conference On Spoken Language Processing. 2000 *

Also Published As

Publication number Publication date
DE60325881D1 (de) 2009-03-05
WO2004042698A8 (en) 2005-05-19
US8781826B2 (en) 2014-07-15
ATE421139T1 (de) 2009-01-15
CN1708782A (zh) 2005-12-14
DE10251113A1 (de) 2004-05-19
WO2004042698A1 (en) 2004-05-21
US20060200345A1 (en) 2006-09-07
JP2011022600A (ja) 2011-02-03
JP2006505003A (ja) 2006-02-09
EP1561203A1 (en) 2005-08-10
EP1561203B1 (en) 2009-01-14
AU2003269418A1 (en) 2004-06-07

Similar Documents

Publication Publication Date Title
CN100524458C (zh) 用于操作语音识别系统的方法
US9571638B1 (en) Segment-based queueing for audio captioning
US6744860B1 (en) Methods and apparatus for initiating a voice-dialing operation
US9071947B1 (en) On-hold processing for telephonic systems
US8305939B2 (en) Selective teleconference interruption
US20140330562A1 (en) Method and Apparatus for Obtaining Information from the Web
EP1414227A1 (en) Event detection for multiple voice channel communications
CN109982228B (zh) 一种麦克风故障检测方法及移动终端
US20110307246A1 (en) Methods And Systems For Changing A Communication Quality Of A Communication Session Based On A Meaning Of Speech Data
KR20060008061A (ko) 푸시 투 토크형 이동 통신 단말기의 음성 검출 및 인식을이용한 발언권 관리 장치와 방법
EP3084633A1 (en) Attribute-based audio channel arbitration
US20100245111A1 (en) End user control of music on hold
US9263063B2 (en) Switching off DTX for music
JP5251588B2 (ja) 携帯電話端末装置及び通話伝達の判断方法
US20130151248A1 (en) Apparatus, System, and Method For Distinguishing Voice in a Communication Stream
CN101951560B (zh) 一种在集群通话中控制话权状态的方法及系统
CN107957860A (zh) 可自动调整声音输出的方法及电子装置
CN113271491B (zh) 电子装置以及播放控制方法
EP1287675A2 (en) Method and apparatus for audio signal based answer call message generation
JPS6345950A (ja) 対話形音声応答装置
US20070129037A1 (en) Mute processing apparatus and method
US20100303214A1 (en) One-way voice detection voicemail
KR20070066263A (ko) 무선 헤드셋을 통한 단문 메시지의 음성 메시지 변환서비스 시스템 및 방법
JP5902921B2 (ja) 無線通信システム
CN106533485B (zh) 一种联系人辨别方法及可穿戴设备

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: NUANCE COMMUNICATION INC.

Free format text: FORMER OWNER: KONINKLIKE PHILIPS ELECTRONICS N.V.

Effective date: 20121227

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20121227

Address after: Massachusetts

Patentee after: Nuance Communications, Inc.

Address before: Holland Ian Deho Finn

Patentee before: Koninklijke Philips Electronics N.V.

CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090805