CN105765656B - Controlling a speech recognition process of a computing device - Google Patents
- Publication number
- CN105765656B (application CN201480064081.XA)
- Authority
- CN
- China
- Prior art keywords
- computing device
- user
- speech
- signal
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- User Interface Of Digital Computer (AREA)
- Measurement Of The Respiration, Hearing Ability, Form, And Blood Characteristics Of Living Organisms (AREA)
- Measuring And Recording Apparatus For Diagnosis (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US14/100,934 US9564128B2 (en) | 2013-12-09 | 2013-12-09 | Controlling a speech recognition process of a computing device |
| US14/100,934 | 2013-12-09 | | |
| PCT/US2014/069110 WO2015088980A1 (en) | 2013-12-09 | 2014-12-08 | Controlling a speech recognition process of a computing device |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN105765656A (zh) | 2016-07-13 |
| CN105765656B (zh) | 2019-08-20 |
Family
ID=52118040
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201480064081.XA Active CN105765656B (zh) | 2013-12-09 | 2014-12-08 | Controlling a speech recognition process of a computing device |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US9564128B2 (en) |
| EP (1) | EP3080809B1 (en) |
| JP (1) | JP6259094B2 (en) |
| KR (1) | KR101810806B1 (en) |
| CN (1) | CN105765656B (en) |
| WO (1) | WO2015088980A1 (en) |
Families Citing this family (33)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US9697828B1 (en) * | 2014-06-20 | 2017-07-04 | Amazon Technologies, Inc. | Keyword detection modeling using contextual and environmental information |
| US10413246B2 (en) * | 2014-06-23 | 2019-09-17 | Eldad Izhak HOCHMAN | Detection of human-machine interaction errors |
| US20160253996A1 (en) * | 2015-02-27 | 2016-09-01 | Lenovo (Singapore) Pte. Ltd. | Activating voice processing for associated speaker |
| US10055563B2 (en) * | 2015-04-15 | 2018-08-21 | Mediatek Inc. | Air writing and gesture system with interactive wearable device |
| US10147423B2 (en) * | 2016-09-29 | 2018-12-04 | Intel IP Corporation | Context-aware query recognition for electronic devices |
| KR102580408B1 (ko) | 2016-10-17 | 2023-09-19 | 하만인터내셔날인더스트리스인코포레이티드 | Portable audio device with voice capabilities |
| US10665243B1 (en) * | 2016-11-11 | 2020-05-26 | Facebook Technologies, Llc | Subvocalized speech recognition |
| US10332523B2 (en) | 2016-11-18 | 2019-06-25 | Google Llc | Virtual assistant identification of nearby computing devices |
| EP4044176A1 (en) | 2016-12-19 | 2022-08-17 | Rovi Guides, Inc. | Systems and methods for distinguishing valid voice commands from false voice commands in an interactive media guidance application |
| US10313782B2 (en) * | 2017-05-04 | 2019-06-04 | Apple Inc. | Automatic speech recognition triggering system |
| EP3634296B1 (en) * | 2017-06-06 | 2025-07-30 | Intuitive Surgical Operations, Inc. | Systems and methods for state-based speech recognition in a teleoperational system |
| CN109147770B (zh) * | 2017-06-16 | 2023-07-28 | 阿里巴巴集团控股有限公司 | Optimization and dynamic registration method for voice recognition features, client, and server |
| DE102017214164B3 (de) * | 2017-08-14 | 2019-01-17 | Sivantos Pte. Ltd. | Method for operating a hearing aid, and hearing aid |
| US10522160B2 (en) | 2017-08-18 | 2019-12-31 | Intel Corporation | Methods and apparatus to identify a source of speech captured at a wearable electronic device |
| US10764668B2 (en) | 2017-09-07 | 2020-09-01 | Lightspeed Aviation, Inc. | Sensor mount and circumaural headset or headphones with adjustable sensor |
| US10701470B2 (en) | 2017-09-07 | 2020-06-30 | Light Speed Aviation, Inc. | Circumaural headset or headphones with adjustable biometric sensor |
| KR20190052394A (ko) * | 2017-11-08 | 2019-05-16 | 삼성전자주식회사 | Method for executing a function using a plurality of microphones, and electronic device thereof |
| US10847173B2 (en) * | 2018-02-13 | 2020-11-24 | Intel Corporation | Selection between signal sources based upon calculated signal to noise ratio |
| CN108735219B (zh) * | 2018-05-09 | 2021-08-31 | 深圳市宇恒互动科技开发有限公司 | Voice recognition control method and device |
| US11315553B2 (en) | 2018-09-20 | 2022-04-26 | Samsung Electronics Co., Ltd. | Electronic device and method for providing or obtaining data for training thereof |
| US11138334B1 (en) * | 2018-10-17 | 2021-10-05 | Medallia, Inc. | Use of ASR confidence to improve reliability of automatic audio redaction |
| US10739864B2 (en) | 2018-12-31 | 2020-08-11 | International Business Machines Corporation | Air writing to speech system using gesture and wrist angle orientation for synthesized speech modulation |
| WO2020181461A1 (en) * | 2019-03-11 | 2020-09-17 | Nokia Shanghai Bell Co., Ltd. | Conditional display of object characteristics |
| WO2020219113A1 (en) * | 2019-04-23 | 2020-10-29 | Google Llc | Personalized talking detector for electronic device |
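| CN112071311B (zh) * | 2019-06-10 | 2024-06-18 | Oppo广东移动通信有限公司 | Control method, control device, wearable device, and storage medium |
| CN112216277A (zh) * | 2019-07-12 | 2021-01-12 | Oppo广东移动通信有限公司 | Method for speech recognition via earphones, earphones, and speech recognition device |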
| WO2021087121A1 (en) * | 2019-11-01 | 2021-05-06 | Starkey Laboratories, Inc. | Ear-based biometric identification |
| US11521643B2 (en) * | 2020-05-08 | 2022-12-06 | Bose Corporation | Wearable audio device with user own-voice recording |
| CN113823288B (zh) * | 2020-06-16 | 2025-01-03 | 华为技术有限公司 | Voice wake-up method, electronic device, wearable device, and system |
| JP7354992B2 (ja) * | 2020-11-19 | 2023-10-03 | トヨタ自動車株式会社 | Utterance evaluation system, utterance evaluation method, and program |
| WO2023080296A1 (ko) * | 2021-11-08 | 2023-05-11 | 엘지전자 주식회사 | AR device and method for controlling AR device |
| US11573635B1 (en) | 2022-01-04 | 2023-02-07 | United Arab Emirates University | Face mask for accurate location of sensors relative to a users face, a communication enabling face mask and a communication system including the face mask |
| US20240221751A1 (en) * | 2023-01-04 | 2024-07-04 | Wispr Al, Inc. | Wearable silent speech device, systems, and methods |
Citations (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN1442845A (zh) * | 2002-03-04 | 2003-09-17 | 株式会社Ntt都科摩 | Speech recognition system and method, speech synthesis system and method, and program product |
| EP1503368A1 (en) * | 2003-07-29 | 2005-02-02 | Microsoft Corporation | Head mounted multi-sensory audio input system |
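| CN1601604A (zh) * | 2003-09-19 | 2005-03-30 | 株式会社Ntt都科摩 | Speech period detection device and method, and speech recognition processing device |
| CN101222703A (zh) * | 2007-01-12 | 2008-07-16 | 杭州波导软件有限公司 | Identity verification method for a mobile terminal based on speech recognition |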
Family Cites Families (32)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS5852695A (ja) * | 1981-09-25 | 1983-03-28 | 日産自動車株式会社 | Voice detection device for vehicles |
| US4696031A (en) * | 1985-12-31 | 1987-09-22 | Wang Laboratories, Inc. | Signal detection and discrimination using waveform peak factor |
| US5293452A (en) * | 1991-07-01 | 1994-03-08 | Texas Instruments Incorporated | Voice log-in using spoken name input |
| US5638436A (en) * | 1994-01-12 | 1997-06-10 | Dialogic Corporation | Voice detection |
| SE519244C2 (sv) * | 1995-12-06 | 2003-02-04 | Telia Ab | Device and method for speech synthesis |
| US6493436B1 (en) * | 2001-02-13 | 2002-12-10 | 3Com Corporation | System for correcting failures of music on transfer |
| JP3908965B2 (ja) | 2002-02-28 | 2007-04-25 | 株式会社エヌ・ティ・ティ・ドコモ | Speech recognition device and speech recognition method |
| US20040064314A1 (en) * | 2002-09-27 | 2004-04-01 | Aubert Nicolas De Saint | Methods and apparatus for speech end-point detection |
| JP4447857B2 (ja) | 2003-06-20 | 2010-04-07 | 株式会社エヌ・ティ・ティ・ドコモ | Voice detection device |
| JP2006171226A (ja) * | 2004-12-14 | 2006-06-29 | Sony Corp | Speech processing device |
| JP4847022B2 (ja) * | 2005-01-28 | 2011-12-28 | 京セラ株式会社 | Utterance content recognition device |
| US20070100611A1 (en) * | 2005-10-27 | 2007-05-03 | Intel Corporation | Speech codec apparatus with spike reduction |
| JP4678773B2 (ja) * | 2005-12-05 | 2011-04-27 | Kddi株式会社 | Voice input evaluation device |
| US8682652B2 (en) * | 2006-06-30 | 2014-03-25 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung E.V. | Audio encoder, audio decoder and audio processor having a dynamically variable warping characteristic |
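| JP4381404B2 (ja) | 2006-09-25 | 2009-12-09 | 株式会社エヌ・ティ・ティ・ドコモ | Speech synthesis system, speech synthesis method, and speech synthesis program |
| JP4836290B2 (ja) * | 2007-03-20 | 2011-12-14 | 富士通株式会社 | Speech recognition system, speech recognition program, and speech recognition method |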
| WO2008128208A1 (en) * | 2007-04-12 | 2008-10-23 | Magneto Inertial Sensing Technology, Inc. | Infant sid monitor based on accelerometer |
| CN101645265B (zh) * | 2008-08-05 | 2011-07-13 | 中兴通讯股份有限公司 | Method and device for real-time recognition of audio categories |
| US8600067B2 (en) * | 2008-09-19 | 2013-12-03 | Personics Holdings Inc. | Acoustic sealing analysis system |
| US8249870B2 (en) | 2008-11-12 | 2012-08-21 | Massachusetts Institute Of Technology | Semi-automatic speech transcription |
| US20110246187A1 (en) | 2008-12-16 | 2011-10-06 | Koninklijke Philips Electronics N.V. | Speech signal processing |
| US8412525B2 (en) * | 2009-04-30 | 2013-04-02 | Microsoft Corporation | Noise robust speech classifier ensemble |
| US20120284022A1 (en) * | 2009-07-10 | 2012-11-08 | Alon Konchitsky | Noise reduction system using a sensor based speech detector |
| US20110010172A1 (en) * | 2009-07-10 | 2011-01-13 | Alon Konchitsky | Noise reduction system using a sensor based speech detector |
| WO2011015237A1 (en) * | 2009-08-04 | 2011-02-10 | Nokia Corporation | Method and apparatus for audio signal classification |
| ES2371619B1 (es) * | 2009-10-08 | 2012-08-08 | Telefónica, S.A. | Method for detecting speech segments |
| US9330667B2 (en) * | 2010-10-29 | 2016-05-03 | Iflytek Co., Ltd. | Method and system for endpoint automatic detection of audio record |
| US20120130154A1 (en) * | 2010-11-23 | 2012-05-24 | Richie Sajan | Voice Volume Modulator |
| CN102103858B (zh) * | 2010-12-15 | 2013-07-24 | 方正国际软件有限公司 | Voice-based control method and system |
| US9318129B2 (en) * | 2011-07-18 | 2016-04-19 | At&T Intellectual Property I, Lp | System and method for enhancing speech activity detection using facial feature detection |
| JP5790238B2 (ja) * | 2011-07-22 | 2015-10-07 | ソニー株式会社 | Information processing device, information processing method, and program |
| US20130151248A1 (en) * | 2011-12-08 | 2013-06-13 | Forrest Baker, IV | Apparatus, System, and Method For Distinguishing Voice in a Communication Stream |
2013
- 2013-12-09: US, application US14/100,934, publication US9564128B2 (en), status Active

2014
- 2014-12-08: KR, application KR1020167018329A, publication KR101810806B1 (ko), status Active
- 2014-12-08: WO, application PCT/US2014/069110, publication WO2015088980A1 (en), status Ceased (not active)
- 2014-12-08: CN, application CN201480064081.XA, publication CN105765656B (zh), status Active
- 2014-12-08: EP, application EP14815195.4A, publication EP3080809B1 (en), status Active
- 2014-12-08: JP, application JP2016536943A, publication JP6259094B2 (ja), status Active
Non-Patent Citations (2)
| Title |
|---|
| 3GPP Organizational Partners. 3GPP TS 26.094 version 11.0.0 Release 11 Adaptive Multi-Rate (AMR) speech codec * |
| Voice Activity Detector (VAD). 3GPP, 2012, pp. 1-27. * |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20160095141A (ko) | 2016-08-10 |
| US9564128B2 (en) | 2017-02-07 |
| JP6259094B2 (ja) | 2018-01-10 |
| US20150161998A1 (en) | 2015-06-11 |
| WO2015088980A1 (en) | 2015-06-18 |
| EP3080809A1 (en) | 2016-10-19 |
| KR101810806B1 (ko) | 2017-12-19 |
| JP2016540250A (ja) | 2016-12-22 |
| CN105765656A (zh) | 2016-07-13 |
| EP3080809B1 (en) | 2017-10-18 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
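| CN105765656B (zh) | | Controlling a speech recognition process of a computing device |
| US10856070B2 (en) | | Throat microphone system and method |
| CN110072434B (zh) | | Use of voice acoustic biomarkers to assist hearing device use |
| CN111475206B (zh) | | Method and apparatus for waking up a wearable device |
| CN113544768A (zh) | | Speech recognition using multiple sensors |
| WO2020228095A1 (zh) | | Audio device with real-time voice wake-up, operating method, apparatus, and storage medium |
| CN109346075A (zh) | | Method and system for recognizing a user's speech through human body vibration to control an electronic device |
| US8155966B2 (en) | | Apparatus and method for producing an audible speech signal from a non-audible speech signal |
| WO2020155490A1 (zh) | | Method, apparatus, and computer device for managing music based on speech analysis |
| KR20150104345A (ko) | | Speech synthesis apparatus and speech synthesis method |
| EP3641344B1 (en) | | A method for operating a hearing instrument and a hearing system comprising a hearing instrument |
| EP3884850B1 (en) | | Systems and methods for biomarker analysis on a hearing device |
| JP2012230535A (ja) | | Electronic device and control program for electronic device |
| TWI749663B (zh) | | Method and system for vocalization monitoring |
| CN116368818A (zh) | | Method for optimizing the working state of bone conduction earphones |
| JP2009178783A (ja) | | Communication robot and control method thereof |
| WO2017202002A1 (zh) | | Bone conduction-based hearing health detection system and method |
| CN113767431B (zh) | | Method and system for voice detection |
| CN113948109A (zh) | | System for recognizing physiological phenomena based on sound |
| CN113409809B (zh) | | Speech noise reduction method, apparatus, and device |
| WO2021051403A1 (zh) | | Voice control method, apparatus, chip, earphones, and system |
| WO2019238061A1 (zh) | | Method and device for recognizing a user's speech through human body vibration |
| CN113810819B (zh) | | Method and device for silent speech acquisition and processing based on ear canal vibration |
| CN111401912B (zh) | | Mobile payment method, electronic device, and storage medium |
| CN118136021A (zh) | | Device and method for switching recording roles based on vibration signals |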
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |