CN114144831B - Activating speech recognition - Google Patents
Activating speech recognition
- Publication number
- CN114144831B (application CN202080052825.1A)
- Authority
- CN
- China
- Prior art keywords
- hand
- response
- detector
- speech recognition
- detecting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3206—Monitoring of events, devices or parameters that trigger a change in power modality
- G06F1/3231—Monitoring the presence, absence or movement of users
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/10—Image acquisition
- G06V10/12—Details of acquisition arrangements; Constructional details thereof
- G06V10/14—Optical characteristics of the device performing the acquisition or on the illumination arrangements
- G06V10/143—Sensing or illuminating at different wavelengths
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/20—Scenes; Scene-specific elements in augmented reality scenes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/107—Static hand or arm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/227—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/526,608 US11437031B2 (en) | 2019-07-30 | 2019-07-30 | Activating speech recognition based on hand patterns detected using plurality of filters |
| US16/526,608 | 2019-07-30 | | |
| PCT/US2020/044127 WO2021021970A1 (en) | | 2020-07-30 | Activating speech recognition |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN114144831A (zh) | 2022-03-04 |
| CN114144831B (zh) | 2025-07-25 |
Family
ID=72087256
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN202080052825.1A (Active), CN114144831B (zh) | 2019-07-30 | 2020-07-30 | Activating speech recognition |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US11437031B2 |
| EP (1) | EP4004908B1 |
| JP (1) | JP7645230B2 |
| KR (1) | KR102926603B1 |
| CN (1) | CN114144831B |
| BR (1) | BR112022000922A2 |
| PH (1) | PH12021553299A1 |
| TW (1) | TWI871343B |
| WO (1) | WO2021021970A1 |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20210015234A (ko) * | 2019-08-01 | 2021-02-10 | 삼성전자주식회사 | Electronic device and method for controlling execution of a function according to a voice command thereof |
| US11682391B2 (en) * | 2020-03-30 | 2023-06-20 | Motorola Solutions, Inc. | Electronic communications device having a user interface including a single input interface for electronic digital assistant and voice control access |
| US11862189B2 (en) * | 2020-04-01 | 2024-01-02 | Qualcomm Incorporated | Method and apparatus for target sound detection |
| US11590929B2 (en) * | 2020-05-05 | 2023-02-28 | Nvidia Corporation | Systems and methods for performing commands in a vehicle using speech and image recognition |
| US20220201083A1 (en) * | 2020-12-22 | 2022-06-23 | Cerence Operating Company | Platform for integrating disparate ecosystems within a vehicle |
| KR20230092180A (ko) * | 2021-12-17 | 2023-06-26 | 현대자동차주식회사 | Vehicle and control method thereof |
| US12412565B2 (en) * | 2022-01-28 | 2025-09-09 | Syntiant Corp. | Prediction based wake-word detection and methods for use therewith |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN103869967A (zh) * | 2012-12-14 | 2014-06-18 | 歌乐株式会社 | Control device, vehicle, and portable terminal |
| CN207758675U (zh) * | 2017-12-29 | 2018-08-24 | 广州视声光电有限公司 | Trigger-type vehicle-mounted rearview mirror |
Family Cites Families (20)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE9902229L (sv) * | 1999-06-07 | 2001-02-05 | Ericsson Telefon Ab L M | Apparatus and method of controlling a voice controlled operation |
| US20020077830A1 (en) * | 2000-12-19 | 2002-06-20 | Nokia Corporation | Method for activating context sensitive speech recognition in a terminal |
| US8958848B2 (en) * | 2008-04-08 | 2015-02-17 | Lg Electronics Inc. | Mobile terminal and menu control method thereof |
| KR20090107364A (ko) * | 2008-04-08 | 2009-10-13 | 엘지전자 주식회사 | Mobile terminal and menu control method thereof |
| KR101502003B1 (ko) * | 2008-07-08 | 2015-03-12 | 엘지전자 주식회사 | Mobile terminal and text input method thereof |
| KR20100007625A (ko) * | 2008-07-14 | 2010-01-22 | 엘지전자 주식회사 | Mobile terminal and menu display method thereof |
| KR101537693B1 (ko) * | 2008-11-24 | 2015-07-20 | 엘지전자 주식회사 | Terminal and control method thereof |
| JP5229083B2 (ja) * | 2009-04-14 | 2013-07-03 | ソニー株式会社 | Information processing device, information processing method, and program |
| US9551590B2 (en) * | 2009-08-28 | 2017-01-24 | Robert Bosch Gmbh | Gesture-based information and command entry for motor vehicle |
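| KR101795574B1 (ko) * | 2011-01-06 | 2017-11-13 | 삼성전자주식회사 | Electronic device controlled by motion and control method thereof |
| JP2013080015A (ja) * | 2011-09-30 | 2013-05-02 | Toshiba Corp | Speech recognition device and speech recognition method |
| JP6211256B2 (ja) * | 2012-09-26 | 2017-10-11 | 株式会社ナビタイムジャパン | Information processing device, information processing method, and information processing program |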
| US10272920B2 (en) * | 2013-10-11 | 2019-04-30 | Panasonic Intellectual Property Corporation Of America | Processing method, program, processing apparatus, and detection system |
| KR101546709B1 (ko) * | 2013-11-25 | 2015-08-24 | 현대자동차주식회사 | Speech recognition apparatus, vehicle having the same, and method thereof |
| KR101643560B1 (ko) * | 2014-12-17 | 2016-08-10 | 현대자동차주식회사 | Speech recognition apparatus, vehicle having the same, and method thereof |
| CN105373227B (zh) * | 2015-10-29 | 2019-03-26 | 小米科技有限责任公司 | Method and device for intelligently turning off an electronic device |
| CN105869637B (zh) * | 2016-05-26 | 2019-10-15 | 百度在线网络技术(北京)有限公司 | Voice wake-up method and device |
| CN107197090B (zh) * | 2017-05-18 | 2020-07-14 | 维沃移动通信有限公司 | Method for receiving a voice signal and mobile terminal |
| JP7091983B2 (ja) * | 2018-10-01 | 2022-06-28 | トヨタ自動車株式会社 | Equipment control device |
| CN209571226U (zh) * | 2018-12-20 | 2019-11-01 | 深圳市朗强科技有限公司 | Speech recognition device and system |
-
2019
- 2019-07-30 US US16/526,608 patent/US11437031B2/en active Active
-
2020
- 2020-07-30 CN CN202080052825.1A patent/CN114144831B/zh active Active
- 2020-07-30 PH PH1/2021/553299A patent/PH12021553299A1/en unknown
- 2020-07-30 TW TW109125736A patent/TWI871343B/zh active
- 2020-07-30 BR BR112022000922A patent/BR112022000922A2/pt unknown
- 2020-07-30 EP EP20757126.6A patent/EP4004908B1/en active Active
- 2020-07-30 JP JP2022504699A patent/JP7645230B2/ja active Active
- 2020-07-30 KR KR1020227002030A patent/KR102926603B1/ko active Active
- 2020-07-30 WO PCT/US2020/044127 patent/WO2021021970A1/en not_active Ceased
Also Published As
| Publication number | Publication date |
|---|---|
| KR20220041831A (ko) | 2022-04-01 |
| JP2022543201A (ja) | 2022-10-11 |
| US11437031B2 (en) | 2022-09-06 |
| BR112022000922A2 (pt) | 2022-03-08 |
| TW202121115A (zh) | 2021-06-01 |
| WO2021021970A1 (en) | 2021-02-04 |
| PH12021553299A1 (en) | 2022-08-01 |
| CN114144831A (zh) | 2022-03-04 |
| EP4004908B1 (en) | 2024-10-09 |
| EP4004908C0 (en) | 2024-10-09 |
| KR102926603B1 (ko) | 2026-02-11 |
| EP4004908A1 (en) | 2022-06-01 |
| JP7645230B2 (ja) | 2025-03-13 |
| US20210035571A1 (en) | 2021-02-04 |
| TWI871343B (zh) | 2025-02-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
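| CN114144831B (zh) | | Activating speech recognition |
| KR102216048B1 (ko) | | Apparatus and method for recognizing voice commands |
| KR101981878B1 (ko) | | Control of an electronic device based on direction of speech |
| CN108447472B (zh) | | Voice wake-up method and device |
| CN111696570B (zh) | | Speech signal processing method, apparatus, device, and storage medium |
| EP3792911A1 (en) | | Method for detecting key term in speech signal, device, terminal, and storage medium |
| KR20220031610A (ko) | | Multi-modal user interface |
| CN111833872B (zh) | | Voice control method, apparatus, device, system, and medium for an elevator |
| CN113380275B (zh) | | Speech processing method, apparatus, smart device, and storage medium |
| CN105556593A (zh) | | Method and device for preprocessing an audio signal |
| CN113744736B (zh) | | Command word recognition method, apparatus, electronic device, and storage medium |
| US9928851B2 (en) | | Voice verifying system and voice verifying method which can determine if voice signal is valid or not |
| CN114220420A (zh) | | Multi-modal voice wake-up method, apparatus, and computer-readable storage medium |
| US9791925B2 (en) | | Information acquisition method, information acquisition system, and non-transitory recording medium for user of motor vehicle |
| US11682392B2 (en) | | Information processing apparatus |
| US9967393B2 (en) | | Mobile electronic apparatus, method for controlling mobile electronic apparatus, and non-transitory computer readable recording medium |
| CN111681654A (zh) | | Voice control method, apparatus, electronic device, and storage medium |
| CN114299945B (zh) | | Speech signal recognition method, apparatus, electronic device, storage medium, and product |
| CN115769567B (zh) | | Audio gain selection |
| CN112329909B (zh) | | Method, apparatus, and storage medium for generating a neural network model |
| CN116189718B (zh) | | Voice activity detection method, apparatus, device, and storage medium |
| JP7722374B2 (ja) | | Information processing device and information processing method |
| JP2023162857A (ja) | | Voice dialogue device and voice dialogue method |
| JP2026016007A (ja) | | Information processing device, system, information processing method, and program |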
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | PB01 | Publication | |
| | SE01 | Entry into force of request for substantive examination | |
| | GR01 | Patent grant | |