KR102926603B1 - 음성 인식의 활성화 - Google Patents
음성 인식의 활성화Info
- Publication number
- KR102926603B1 KR102926603B1 KR1020227002030A KR20227002030A KR102926603B1 KR 102926603 B1 KR102926603 B1 KR 102926603B1 KR 1020227002030 A KR1020227002030 A KR 1020227002030A KR 20227002030 A KR20227002030 A KR 20227002030A KR 102926603 B1 KR102926603 B1 KR 102926603B1
- Authority
- KR
- South Korea
- Prior art keywords
- hand
- audio signal
- detector
- delete delete
- processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F1/00—Details not covered by groups G06F3/00 - G06F13/00 and G06F21/00
- G06F1/26—Power supply means, e.g. regulation thereof
- G06F1/32—Means for saving power
- G06F1/3203—Power management, i.e. event-based initiation of a power-saving mode
- G06F1/3206—Monitoring of events, devices or parameters that trigger a change in power modality
- G06F1/3231—Monitoring the presence, absence or movement of users
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/017—Gesture based interaction, e.g. based on a set of recognized hand gestures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/10—Image acquisition
- G06V10/12—Details of acquisition arrangements; Constructional details thereof
- G06V10/14—Optical characteristics of the device performing the acquisition or on the illumination arrangements
- G06V10/143—Sensing or illuminating at different wavelengths
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/20—Scenes; Scene-specific elements in augmented reality scenes
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/107—Static hand or arm
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/227—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of the speaker; Human-factor methodology
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- General Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Measuring Pulse, Heart Rate, Blood Pressure Or Blood Flow (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US16/526,608 US11437031B2 (en) | 2019-07-30 | 2019-07-30 | Activating speech recognition based on hand patterns detected using plurality of filters |
| US16/526,608 | 2019-07-30 | ||
| PCT/US2020/044127 WO2021021970A1 (en) | 2019-07-30 | 2020-07-30 | Activating speech recognition |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20220041831A KR20220041831A (ko) | 2022-04-01 |
| KR102926603B1 true KR102926603B1 (ko) | 2026-02-11 |
Family
ID=72087256
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020227002030A Active KR102926603B1 (ko) | 2019-07-30 | 2020-07-30 | 음성 인식의 활성화 |
Country Status (9)
| Country | Link |
|---|---|
| US (1) | US11437031B2 (https=) |
| EP (1) | EP4004908B1 (https=) |
| JP (1) | JP7645230B2 (https=) |
| KR (1) | KR102926603B1 (https=) |
| CN (1) | CN114144831B (https=) |
| BR (1) | BR112022000922A2 (https=) |
| PH (1) | PH12021553299A1 (https=) |
| TW (1) | TWI871343B (https=) |
| WO (1) | WO2021021970A1 (https=) |
Families Citing this family (7)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR20210015234A (ko) * | 2019-08-01 | 2021-02-10 | 삼성전자주식회사 | 전자 장치, 및 그의 음성 명령에 따른 기능이 실행되도록 제어하는 방법 |
| US11682391B2 (en) * | 2020-03-30 | 2023-06-20 | Motorola Solutions, Inc. | Electronic communications device having a user interface including a single input interface for electronic digital assistant and voice control access |
| US11862189B2 (en) * | 2020-04-01 | 2024-01-02 | Qualcomm Incorporated | Method and apparatus for target sound detection |
| US11590929B2 (en) * | 2020-05-05 | 2023-02-28 | Nvidia Corporation | Systems and methods for performing commands in a vehicle using speech and image recognition |
| US20220201083A1 (en) * | 2020-12-22 | 2022-06-23 | Cerence Operating Company | Platform for integrating disparate ecosystems within a vehicle |
| KR20230092180A (ko) * | 2021-12-17 | 2023-06-26 | 현대자동차주식회사 | 차량 및 그의 제어방법 |
| US12412565B2 (en) * | 2022-01-28 | 2025-09-09 | Syntiant Corp. | Prediction based wake-word detection and methods for use therewith |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130085757A1 (en) | 2011-09-30 | 2013-04-04 | Kabushiki Kaisha Toshiba | Apparatus and method for speech recognition |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE9902229L (sv) * | 1999-06-07 | 2001-02-05 | Ericsson Telefon Ab L M | Apparatus and method of controlling a voice controlled operation |
| US20020077830A1 (en) * | 2000-12-19 | 2002-06-20 | Nokia Corporation | Method for activating context sensitive speech recognition in a terminal |
| US8958848B2 (en) * | 2008-04-08 | 2015-02-17 | Lg Electronics Inc. | Mobile terminal and menu control method thereof |
| KR20090107364A (ko) * | 2008-04-08 | 2009-10-13 | 엘지전자 주식회사 | 이동 단말기 및 그 메뉴 제어방법 |
| KR101502003B1 (ko) * | 2008-07-08 | 2015-03-12 | 엘지전자 주식회사 | 이동 단말기 및 그 텍스트 입력 방법 |
| KR20100007625A (ko) * | 2008-07-14 | 2010-01-22 | 엘지전자 주식회사 | 이동 단말기 및 그 메뉴 표시 방법 |
| KR101537693B1 (ko) * | 2008-11-24 | 2015-07-20 | 엘지전자 주식회사 | 단말기 및 그 제어 방법 |
| JP5229083B2 (ja) * | 2009-04-14 | 2013-07-03 | ソニー株式会社 | 情報処理装置、情報処理方法及びプログラム |
| US9551590B2 (en) * | 2009-08-28 | 2017-01-24 | Robert Bosch Gmbh | Gesture-based information and command entry for motor vehicle |
| KR101795574B1 (ko) * | 2011-01-06 | 2017-11-13 | 삼성전자주식회사 | 모션에 의해 제어되는 전자기기 및 그 제어 방법 |
| JP6211256B2 (ja) * | 2012-09-26 | 2017-10-11 | 株式会社ナビタイムジャパン | 情報処理装置、情報処理方法、および情報処理プログラム |
| JP6030430B2 (ja) * | 2012-12-14 | 2016-11-24 | クラリオン株式会社 | 制御装置、車両及び携帯端末 |
| US10272920B2 (en) * | 2013-10-11 | 2019-04-30 | Panasonic Intellectual Property Corporation Of America | Processing method, program, processing apparatus, and detection system |
| KR101546709B1 (ko) * | 2013-11-25 | 2015-08-24 | 현대자동차주식회사 | 음성 인식 장치, 그를 가지는 차량 및 그 방법 |
| KR101643560B1 (ko) * | 2014-12-17 | 2016-08-10 | 현대자동차주식회사 | 음성 인식 장치, 그를 가지는 차량 및 그 방법 |
| CN105373227B (zh) * | 2015-10-29 | 2019-03-26 | 小米科技有限责任公司 | 一种智能关闭电子设备的方法及装置 |
| CN105869637B (zh) * | 2016-05-26 | 2019-10-15 | 百度在线网络技术(北京)有限公司 | 语音唤醒方法和装置 |
| CN107197090B (zh) * | 2017-05-18 | 2020-07-14 | 维沃移动通信有限公司 | 一种语音信号的接收方法及移动终端 |
| CN207758675U (zh) | 2017-12-29 | 2018-08-24 | 广州视声光电有限公司 | 一种触发式车载后视镜 |
| JP7091983B2 (ja) * | 2018-10-01 | 2022-06-28 | トヨタ自動車株式会社 | 機器制御装置 |
| CN209571226U (zh) * | 2018-12-20 | 2019-11-01 | 深圳市朗强科技有限公司 | 一种语音识别装置及系统 |
-
2019
- 2019-07-30 US US16/526,608 patent/US11437031B2/en active Active
-
2020
- 2020-07-30 CN CN202080052825.1A patent/CN114144831B/zh active Active
- 2020-07-30 PH PH1/2021/553299A patent/PH12021553299A1/en unknown
- 2020-07-30 TW TW109125736A patent/TWI871343B/zh active
- 2020-07-30 BR BR112022000922A patent/BR112022000922A2/pt unknown
- 2020-07-30 EP EP20757126.6A patent/EP4004908B1/en active Active
- 2020-07-30 JP JP2022504699A patent/JP7645230B2/ja active Active
- 2020-07-30 KR KR1020227002030A patent/KR102926603B1/ko active Active
- 2020-07-30 WO PCT/US2020/044127 patent/WO2021021970A1/en not_active Ceased
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20130085757A1 (en) | 2011-09-30 | 2013-04-04 | Kabushiki Kaisha Toshiba | Apparatus and method for speech recognition |
Also Published As
| Publication number | Publication date |
|---|---|
| KR20220041831A (ko) | 2022-04-01 |
| JP2022543201A (ja) | 2022-10-11 |
| CN114144831B (zh) | 2025-07-25 |
| US11437031B2 (en) | 2022-09-06 |
| BR112022000922A2 (pt) | 2022-03-08 |
| TW202121115A (zh) | 2021-06-01 |
| WO2021021970A1 (en) | 2021-02-04 |
| PH12021553299A1 (en) | 2022-08-01 |
| CN114144831A (zh) | 2022-03-04 |
| EP4004908B1 (en) | 2024-10-09 |
| EP4004908C0 (en) | 2024-10-09 |
| EP4004908A1 (en) | 2022-06-01 |
| JP7645230B2 (ja) | 2025-03-13 |
| US20210035571A1 (en) | 2021-02-04 |
| TWI871343B (zh) | 2025-02-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR102926603B1 (ko) | 음성 인식의 활성화 | |
| JP7646063B2 (ja) | デジタルアシスタントのためのボイストリガ | |
| US11069343B2 (en) | Voice activation method, apparatus, electronic device, and storage medium | |
| KR102216048B1 (ko) | 음성 명령 인식 장치 및 방법 | |
| EP3179474B1 (en) | User focus activated voice recognition | |
| KR101981878B1 (ko) | 스피치의 방향에 기초한 전자 디바이스의 제어 | |
| EP2959474B1 (en) | Hybrid performance scaling for speech recognition | |
| US9418651B2 (en) | Method and apparatus for mitigating false accepts of trigger phrases | |
| EP3792911A1 (en) | Method for detecting key term in speech signal, device, terminal, and storage medium | |
| CN111833872B (zh) | 对电梯的语音控制方法、装置、设备、系统及介质 | |
| KR102492727B1 (ko) | 전자장치 및 그 제어방법 | |
| CN105556593A (zh) | 预处理音频信号的方法和设备 | |
| CN113744736B (zh) | 命令词识别方法、装置、电子设备及存储介质 | |
| CN114220420A (zh) | 多模态语音唤醒方法、装置及计算机可读存储介质 | |
| US11682392B2 (en) | Information processing apparatus | |
| CN111681654A (zh) | 语音控制方法、装置、电子设备及存储介质 | |
| CN114299945B (zh) | 语音信号的识别方法、装置、电子设备、存储介质及产品 | |
| CN115769567B (zh) | 音频增益选择 | |
| CN116189718B (zh) | 语音活性检测方法、装置、设备及存储介质 |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| D18-X000 | Deferred examination requested |
St.27 status event code: A-1-2-D10-D18-exm-X000 |
|
| D19-X000 | Deferred examination accepted |
St.27 status event code: A-1-2-D10-D19-exm-X000 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| P22-X000 | Classification modified |
St.27 status event code: A-2-2-P10-P22-nap-X000 |
|
| E13-X000 | Pre-grant limitation requested |
St.27 status event code: A-2-3-E10-E13-lim-X000 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PA0302 | Request for accelerated examination |
St.27 status event code: A-1-2-D10-D16-exm-PA0302 |
|
| E902 | Notification of reason for refusal | ||
| PE0902 | Notice of grounds for rejection |
St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
|
| P11 | Amendment of application requested |
Free format text: ST27 STATUS EVENT CODE: A-2-2-P10-P11-NAP-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| D20 | Deferred examination resumed |
Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D20-EXM-X000 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| D20-X000 | Deferred examination resumed |
St.27 status event code: A-1-2-D10-D20-exm-X000 |
|
| D22 | Grant of ip right intended |
Free format text: ST27 STATUS EVENT CODE: A-1-2-D10-D22-EXM-PE0701 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| PE0701 | Decision of registration |
St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
|
| F11 | Ip right granted following substantive examination |
Free format text: ST27 STATUS EVENT CODE: A-2-4-F10-F11-EXM-PR0701 (AS PROVIDED BY THE NATIONAL OFFICE) |
|
| PR0701 | Registration of establishment |
St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
|
| PR1002 | Payment of registration fee |
St.27 status event code: A-2-2-U10-U12-oth-PR1002 Fee payment year number: 1 |
|
| U12 | Designation fee paid |
Free format text: ST27 STATUS EVENT CODE: A-2-2-U10-U12-OTH-PR1002 (AS PROVIDED BY THE NATIONAL OFFICE) Year of fee payment: 1 |
|
| PG1601 | Publication of registration |
St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
|
| Q13 | Ip right document published |
Free format text: ST27 STATUS EVENT CODE: A-4-4-Q10-Q13-NAP-PG1601 (AS PROVIDED BY THE NATIONAL OFFICE) |