CN105765656B - 控制计算装置的语音辨识过程 - Google Patents

控制计算装置的语音辨识过程 Download PDF

Info

Publication number
CN105765656B
CN105765656B CN201480064081.XA CN201480064081A CN105765656B CN 105765656 B CN105765656 B CN 105765656B CN 201480064081 A CN201480064081 A CN 201480064081A CN 105765656 B CN105765656 B CN 105765656B
Authority
CN
China
Prior art keywords
computing device
user
speech
signal
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201480064081.XA
Other languages
English (en)
Chinese (zh)
Other versions
CN105765656A (zh
Inventor
朴基炫
郑玄旭
阿拉温德·桑卡兰
帕拉舒拉姆·卡达迪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Qualcomm Inc
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc filed Critical Qualcomm Inc
Publication of CN105765656A publication Critical patent/CN105765656A/zh
Application granted granted Critical
Publication of CN105765656B publication Critical patent/CN105765656B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/226Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • User Interface Of Digital Computer (AREA)
  • Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
  • Measuring And Recording Apparatus For Diagnosis (AREA)
CN201480064081.XA 2013-12-09 2014-12-08 控制计算装置的语音辨识过程 Active CN105765656B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US14/100,934 US9564128B2 (en) 2013-12-09 2013-12-09 Controlling a speech recognition process of a computing device
US14/100,934 2013-12-09
PCT/US2014/069110 WO2015088980A1 (en) 2013-12-09 2014-12-08 Controlling a speech recognition process of a computing device

Publications (2)

Publication Number Publication Date
CN105765656A CN105765656A (zh) 2016-07-13
CN105765656B true CN105765656B (zh) 2019-08-20

Family

ID=52118040

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201480064081.XA Active CN105765656B (zh) 2013-12-09 2014-12-08 控制计算装置的语音辨识过程

Country Status (6)

Country Link
US (1) US9564128B2 (enExample)
EP (1) EP3080809B1 (enExample)
JP (1) JP6259094B2 (enExample)
KR (1) KR101810806B1 (enExample)
CN (1) CN105765656B (enExample)
WO (1) WO2015088980A1 (enExample)

Families Citing this family (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9697828B1 (en) * 2014-06-20 2017-07-04 Amazon Technologies, Inc. Keyword detection modeling using contextual and environmental information
US10413246B2 (en) * 2014-06-23 2019-09-17 Eldad Izhak HOCHMAN Detection of human-machine interaction errors
US20160253996A1 (en) * 2015-02-27 2016-09-01 Lenovo (Singapore) Pte. Ltd. Activating voice processing for associated speaker
US10055563B2 (en) * 2015-04-15 2018-08-21 Mediatek Inc. Air writing and gesture system with interactive wearable device
US10147423B2 (en) * 2016-09-29 2018-12-04 Intel IP Corporation Context-aware query recognition for electronic devices
KR102580408B1 (ko) 2016-10-17 2023-09-19 하만인터내셔날인더스트리스인코포레이티드 음성 기능을 갖는 휴대용 오디오 디바이스
US10665243B1 (en) * 2016-11-11 2020-05-26 Facebook Technologies, Llc Subvocalized speech recognition
US10332523B2 (en) 2016-11-18 2019-06-25 Google Llc Virtual assistant identification of nearby computing devices
EP4044176A1 (en) 2016-12-19 2022-08-17 Rovi Guides, Inc. Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application
US10313782B2 (en) * 2017-05-04 2019-06-04 Apple Inc. Automatic speech recognition triggering system
EP3634296B1 (en) * 2017-06-06 2025-07-30 Intuitive Surgical Operations, Inc. Systems and methods for state-based speech recognition in a teleoperational system
CN109147770B (zh) * 2017-06-16 2023-07-28 阿里巴巴集团控股有限公司 声音识别特征的优化、动态注册方法、客户端和服务器
DE102017214164B3 (de) * 2017-08-14 2019-01-17 Sivantos Pte. Ltd. Verfahren zum Betrieb eines Hörgeräts und Hörgerät
US10522160B2 (en) 2017-08-18 2019-12-31 Intel Corporation Methods and apparatus to identify a source of speech captured at a wearable electronic device
US10764668B2 (en) 2017-09-07 2020-09-01 Lightspeed Aviation, Inc. Sensor mount and circumaural headset or headphones with adjustable sensor
US10701470B2 (en) 2017-09-07 2020-06-30 Light Speed Aviation, Inc. Circumaural headset or headphones with adjustable biometric sensor
KR20190052394A (ko) * 2017-11-08 2019-05-16 삼성전자주식회사 복수의 마이크를 이용하여 기능을 실행하기 위한 방법 및 그 전자 장치
US10847173B2 (en) * 2018-02-13 2020-11-24 Intel Corporation Selection between signal sources based upon calculated signal to noise ratio
CN108735219B (zh) * 2018-05-09 2021-08-31 深圳市宇恒互动科技开发有限公司 一种声音识别控制方法及装置
US11315553B2 (en) 2018-09-20 2022-04-26 Samsung Electronics Co., Ltd. Electronic device and method for providing or obtaining data for training thereof
US11138334B1 (en) * 2018-10-17 2021-10-05 Medallia, Inc. Use of ASR confidence to improve reliability of automatic audio redaction
US10739864B2 (en) 2018-12-31 2020-08-11 International Business Machines Corporation Air writing to speech system using gesture and wrist angle orientation for synthesized speech modulation
WO2020181461A1 (en) * 2019-03-11 2020-09-17 Nokia Shanghai Bell Co., Ltd. Conditional display of object characteristics
WO2020219113A1 (en) * 2019-04-23 2020-10-29 Google Llc Personalized talking detector for electronic device
CN112071311B (zh) * 2019-06-10 2024-06-18 Oppo广东移动通信有限公司 控制方法、控制装置、穿戴设备和存储介质
CN112216277A (zh) * 2019-07-12 2021-01-12 Oppo广东移动通信有限公司 通过耳机进行语音识别的方法、耳机、语音识别装置
WO2021087121A1 (en) * 2019-11-01 2021-05-06 Starkey Laboratories, Inc. Ear-based biometric identification
US11521643B2 (en) * 2020-05-08 2022-12-06 Bose Corporation Wearable audio device with user own-voice recording
CN113823288B (zh) * 2020-06-16 2025-01-03 华为技术有限公司 一种语音唤醒的方法、电子设备、可穿戴设备和系统
JP7354992B2 (ja) * 2020-11-19 2023-10-03 トヨタ自動車株式会社 発言評価システム、発言評価方法、及び、プログラム
WO2023080296A1 (ko) * 2021-11-08 2023-05-11 엘지전자 주식회사 Ar 디바이스 및 ar 디바이스 제어 방법
US11573635B1 (en) 2022-01-04 2023-02-07 United Arab Emirates University Face mask for accurate location of sensors relative to a users face, a communication enabling face mask and a communication system including the face mask
US20240221751A1 (en) * 2023-01-04 2024-07-04 Wispr Al, Inc. Wearable silent speech device, systems, and methods

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1442845A (zh) * 2002-03-04 2003-09-17 株式会社Ntt都科摩 语音识别系统及方法、语音合成系统及方法及程序产品
EP1503368A1 (en) * 2003-07-29 2005-02-02 Microsoft Corporation Head mounted multi-sensory audio input system
CN1601604A (zh) * 2003-09-19 2005-03-30 株式会社Ntt都科摩 说话时段检测设备及方法、语音识别处理设备
CN101222703A (zh) * 2007-01-12 2008-07-16 杭州波导软件有限公司 一种基于语音辨识的移动终端的身份验证方法

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS5852695A (ja) * 1981-09-25 1983-03-28 日産自動車株式会社 車両用音声検出装置
US4696031A (en) * 1985-12-31 1987-09-22 Wang Laboratories, Inc. Signal detection and discrimination using waveform peak factor
US5293452A (en) * 1991-07-01 1994-03-08 Texas Instruments Incorporated Voice log-in using spoken name input
US5638436A (en) * 1994-01-12 1997-06-10 Dialogic Corporation Voice detection
SE519244C2 (sv) * 1995-12-06 2003-02-04 Telia Ab Anordning och metod vid talsyntes
US6493436B1 (en) * 2001-02-13 2002-12-10 3Com Corporation System for correcting failures of music on transfer
JP3908965B2 (ja) 2002-02-28 2007-04-25 株式会社エヌ・ティ・ティ・ドコモ 音声認識装置及び音声認識方法
US20040064314A1 (en) * 2002-09-27 2004-04-01 Aubert Nicolas De Saint Methods and apparatus for speech end-point detection
JP4447857B2 (ja) 2003-06-20 2010-04-07 株式会社エヌ・ティ・ティ・ドコモ 音声検出装置
JP2006171226A (ja) * 2004-12-14 2006-06-29 Sony Corp 音声処理装置
JP4847022B2 (ja) * 2005-01-28 2011-12-28 京セラ株式会社 発声内容認識装置
US20070100611A1 (en) * 2005-10-27 2007-05-03 Intel Corporation Speech codec apparatus with spike reduction
JP4678773B2 (ja) * 2005-12-05 2011-04-27 Kddi株式会社 音声入力評価装置
US8682652B2 (en) * 2006-06-30 2014-03-25 Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic
JP4381404B2 (ja) 2006-09-25 2009-12-09 株式会社エヌ・ティ・ティ・ドコモ 音声合成システム、音声合成方法、音声合成プログラム
JP4836290B2 (ja) * 2007-03-20 2011-12-14 富士通株式会社 音声認識システム、音声認識プログラムおよび音声認識方法
WO2008128208A1 (en) * 2007-04-12 2008-10-23 Magneto Inertial Sensing Technology, Inc. Infant sid monitor based on accelerometer
CN101645265B (zh) * 2008-08-05 2011-07-13 中兴通讯股份有限公司 一种音频类别的实时识别方法及装置
US8600067B2 (en) * 2008-09-19 2013-12-03 Personics Holdings Inc. Acoustic sealing analysis system
US8249870B2 (en) 2008-11-12 2012-08-21 Massachusetts Institute Of Technology Semi-automatic speech transcription
US20110246187A1 (en) 2008-12-16 2011-10-06 Koninklijke Philips Electronics N.V. Speech signal processing
US8412525B2 (en) * 2009-04-30 2013-04-02 Microsoft Corporation Noise robust speech classifier ensemble
US20120284022A1 (en) * 2009-07-10 2012-11-08 Alon Konchitsky Noise reduction system using a sensor based speech detector
US20110010172A1 (en) * 2009-07-10 2011-01-13 Alon Konchitsky Noise reduction system using a sensor based speech detector
WO2011015237A1 (en) * 2009-08-04 2011-02-10 Nokia Corporation Method and apparatus for audio signal classification
ES2371619B1 (es) * 2009-10-08 2012-08-08 Telefónica, S.A. Procedimiento de detección de segmentos de voz.
US9330667B2 (en) * 2010-10-29 2016-05-03 Iflytek Co., Ltd. Method and system for endpoint automatic detection of audio record
US20120130154A1 (en) * 2010-11-23 2012-05-24 Richie Sajan Voice Volume Modulator
CN102103858B (zh) * 2010-12-15 2013-07-24 方正国际软件有限公司 一种基于语音的控制方法及系统
US9318129B2 (en) * 2011-07-18 2016-04-19 At&T Intellectual Property I, Lp System and method for enhancing speech activity detection using facial feature detection
JP5790238B2 (ja) * 2011-07-22 2015-10-07 ソニー株式会社 情報処理装置、情報処理方法及びプログラム
US20130151248A1 (en) * 2011-12-08 2013-06-13 Forrest Baker, IV Apparatus, System, and Method For Distinguishing Voice in a Communication Stream

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1442845A (zh) * 2002-03-04 2003-09-17 株式会社Ntt都科摩 语音识别系统及方法、语音合成系统及方法及程序产品
EP1503368A1 (en) * 2003-07-29 2005-02-02 Microsoft Corporation Head mounted multi-sensory audio input system
CN1601604A (zh) * 2003-09-19 2005-03-30 株式会社Ntt都科摩 说话时段检测设备及方法、语音识别处理设备
CN101222703A (zh) * 2007-01-12 2008-07-16 杭州波导软件有限公司 一种基于语音辨识的移动终端的身份验证方法

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
3GPP Organizational Partners.3GPP TS 26.094 version 11 .0.0 Release 11 Adaptive Multi-Rate (AMR) speech codec *
Voice Activity Detector (VAD).《3GPP》.2012,第1-27页. *

Also Published As

Publication number Publication date
KR20160095141A (ko) 2016-08-10
US9564128B2 (en) 2017-02-07
JP6259094B2 (ja) 2018-01-10
US20150161998A1 (en) 2015-06-11
WO2015088980A1 (en) 2015-06-18
EP3080809A1 (en) 2016-10-19
KR101810806B1 (ko) 2017-12-19
JP2016540250A (ja) 2016-12-22
CN105765656A (zh) 2016-07-13
EP3080809B1 (en) 2017-10-18

Similar Documents

Publication Publication Date Title
CN105765656B (zh) 控制计算装置的语音辨识过程
US10856070B2 (en) Throat microphone system and method
CN110072434B (zh) 用于辅助听力设备使用的声音声学生物标记的使用
CN111475206B (zh) 用于唤醒可穿戴设备的方法及装置
CN113544768A (zh) 使用多传感器的语音识别
WO2020228095A1 (zh) 实时语音唤醒的音频设备、运行方法、装置及存储介质
CN109346075A (zh) 通过人体振动识别用户语音以控制电子设备的方法和系统
US8155966B2 (en) Apparatus and method for producing an audible speech signal from a non-audible speech signal
WO2020155490A1 (zh) 基于语音分析的管理音乐的方法、装置和计算机设备
KR20150104345A (ko) 음성 합성 장치 및 음성 합성 방법
EP3641344B1 (en) A method for operating a hearing instrument and a hearing system comprising a hearing instrument
EP3884850B1 (en) Systems and methods for biomarker analysis on a hearing device
JP2012230535A (ja) 電子機器および電子機器の制御プログラム
TWI749663B (zh) 發聲監控之方法及系統
CN116368818A (zh) 一种优化骨传导耳机工作状态的方法
JP2009178783A (ja) コミュニケーションロボット及びその制御方法
WO2017202002A1 (zh) 基于骨传导的听力健康检测系统及方法
CN113767431B (zh) 语音检测的方法和系统
CN113948109A (zh) 一种基于声音识别生理现象的系统
CN113409809B (zh) 语音降噪方法、装置及设备
WO2021051403A1 (zh) 一种语音控制方法、装置、芯片、耳机及系统
WO2019238061A1 (zh) 通过人体振动识别用户语音的方法和设备
CN113810819B (zh) 一种基于耳腔振动的静默语音采集处理方法及设备
CN111401912B (zh) 移动支付方法,电子设备及存储介质
CN118136021A (zh) 一种基于振动信号的录音角色切换装置及方法

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant