SG11201906576WA - Speech wakeup method, apparatus, and electronic device - Google Patents
Speech wakeup method, apparatus, and electronic device
- Publication number
- SG11201906576WA
- Authority
- SG
- Singapore
- Prior art keywords
- speech wakeup
- electronic device
- speech
- wakeup method
- wakeup
- Prior art date
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
- G10L2015/0631—Creating reference templates; Clustering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Abstract
A speech wakeup method, apparatus, and electronic device are disclosed in embodiments of this specification. The method includes: implementing speech wakeup by using a speech wakeup model that includes a Deep Neural Network (DNN) and a Connectionist Temporal Classifier (CTC). The speech wakeup model can be obtained by training with general speech data.
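As a rough illustration of how a DNN and a CTC stage can be combined for wakeup, the sketch below scores a keyword against per-frame symbol probabilities using the standard CTC forward recursion. It is not taken from the specification: the blank index, the function names `ctc_sequence_probability` and `should_wake`, the toy posteriors, and the fixed decision threshold are all assumptions made for illustration, and the DNN itself is replaced by hand-written probabilities.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol


def ctc_sequence_probability(posteriors: Sequence[Sequence[float]],
                             labels: Sequence[int]) -> float:
    """Total probability of all frame alignments that collapse to `labels`.

    posteriors[t][k] stands in for the DNN output: the probability of
    symbol k (for example, a phoneme) at frame t.
    """
    # Interleave blanks around the labels: [a, b] -> [blank, a, blank, b, blank]
    extended: List[int] = [BLANK]
    for label in labels:
        extended += [label, BLANK]
    n = len(extended)

    # alpha[s]: probability of all partial alignments ending at extended[s]
    alpha = [0.0] * n
    alpha[0] = posteriors[0][BLANK]
    if n > 1:
        alpha[1] = posteriors[0][extended[1]]

    for frame in posteriors[1:]:
        new_alpha = [0.0] * n
        for s in range(n):
            total = alpha[s]                      # stay on the same symbol
            if s >= 1:
                total += alpha[s - 1]             # advance by one position
            # Skip the blank between two labels only if the labels differ
            if s >= 2 and extended[s] != BLANK and extended[s] != extended[s - 2]:
                total += alpha[s - 2]
            new_alpha[s] = total * frame[extended[s]]
        alpha = new_alpha

    # A valid alignment ends on the last label or on the trailing blank
    return alpha[-1] + (alpha[-2] if n > 1 else 0.0)


def should_wake(posteriors: Sequence[Sequence[float]],
                keyword_labels: Sequence[int],
                threshold: float = 0.5) -> bool:
    """Hypothetical decision rule: wake when the keyword score passes a threshold."""
    return ctc_sequence_probability(posteriors, keyword_labels) >= threshold


# Toy input: two frames, two symbols (blank and symbol 1), uniform probabilities
toy_posteriors = [[0.5, 0.5], [0.5, 0.5]]
keyword_score = ctc_sequence_probability(toy_posteriors, [1])
```

With the toy input, three of the four possible two-frame alignments collapse to the single-symbol keyword, so the score is 0.75. A real system would obtain the posteriors from a trained network and would likely work in the log domain to avoid underflow on long utterances.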
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710514348.6A CN107358951A (en) | 2017-06-29 | 2017-06-29 | A kind of voice awakening method, device and electronic equipment |
PCT/CN2018/092899 WO2019001428A1 (en) | 2017-06-29 | 2018-06-26 | Voice wake-up method and device and electronic device |
Publications (1)
Publication Number | Publication Date |
---|---|
SG11201906576WA (en) | 2019-08-27 |
Family
ID=60274110
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SG11201906576WA (en) | 2017-06-29 | 2018-06-26 | Speech wakeup method, apparatus, and electronic device |
Country Status (11)
Country | Link |
---|---|
US (2) | US20200013390A1 (en) |
EP (1) | EP3579227B1 (en) |
JP (1) | JP6877558B2 (en) |
KR (1) | KR102181836B1 (en) |
CN (1) | CN107358951A (en) |
ES (1) | ES2878137T3 (en) |
PH (1) | PH12019501674A1 (en) |
PL (1) | PL3579227T3 (en) |
SG (1) | SG11201906576WA (en) |
TW (1) | TWI692751B (en) |
WO (1) | WO2019001428A1 (en) |
Families Citing this family (34)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107358951A (en) * | 2017-06-29 | 2017-11-17 | 阿里巴巴集团控股有限公司 | A kind of voice awakening method, device and electronic equipment |
CN108320733B (en) * | 2017-12-18 | 2022-01-04 | 上海科大讯飞信息科技有限公司 | Voice data processing method and device, storage medium and electronic equipment |
CN108182937B (en) * | 2018-01-17 | 2021-04-13 | 出门问问创新科技有限公司 | Keyword recognition method, device, equipment and storage medium |
US11488002B2 (en) * | 2018-02-15 | 2022-11-01 | Atlazo, Inc. | Binary neural network accelerator engine methods and systems |
CN108597523B (en) * | 2018-03-23 | 2019-05-17 | 平安科技(深圳)有限公司 | Identified by speaking person method, server and computer readable storage medium |
CN111066082B (en) * | 2018-05-25 | 2020-08-28 | 北京嘀嘀无限科技发展有限公司 | Voice recognition system and method |
CN110619871B (en) * | 2018-06-20 | 2023-06-30 | 阿里巴巴集团控股有限公司 | Voice wakeup detection method, device, equipment and storage medium |
US11257481B2 (en) * | 2018-10-24 | 2022-02-22 | Tencent America LLC | Multi-task training architecture and strategy for attention-based speech recognition system |
CN111276138B (en) * | 2018-12-05 | 2023-07-18 | 北京嘀嘀无限科技发展有限公司 | Method and device for processing voice signal in voice wake-up system |
CN109886386B (en) * | 2019-01-30 | 2020-10-27 | 北京声智科技有限公司 | Method and device for determining wake-up model |
CN109872713A (en) * | 2019-03-05 | 2019-06-11 | 深圳市友杰智新科技有限公司 | A kind of voice awakening method and device |
CN110310628B (en) * | 2019-06-27 | 2022-05-20 | 百度在线网络技术(北京)有限公司 | Method, device and equipment for optimizing wake-up model and storage medium |
US11081102B2 (en) * | 2019-08-16 | 2021-08-03 | Ponddy Education Inc. | Systems and methods for comprehensive Chinese speech scoring and diagnosis |
JP7098587B2 (en) * | 2019-08-29 | 2022-07-11 | 株式会社東芝 | Information processing device, keyword detection device, information processing method and program |
CN110634468B (en) * | 2019-09-11 | 2022-04-15 | 中国联合网络通信集团有限公司 | Voice wake-up method, device, equipment and computer readable storage medium |
CN110648659B (en) * | 2019-09-24 | 2022-07-01 | 上海依图信息技术有限公司 | Voice recognition and keyword detection device and method based on multitask model |
CN110648668A (en) * | 2019-09-24 | 2020-01-03 | 上海依图信息技术有限公司 | Keyword detection device and method |
CN110970016B (en) * | 2019-10-28 | 2022-08-19 | 苏宁云计算有限公司 | Awakening model generation method, intelligent terminal awakening method and device |
CN110853629A (en) * | 2019-11-21 | 2020-02-28 | 中科智云科技有限公司 | Speech recognition digital method based on deep learning |
CN110992929A (en) * | 2019-11-26 | 2020-04-10 | 苏宁云计算有限公司 | Voice keyword detection method, device and system based on neural network |
US11341954B2 (en) * | 2019-12-17 | 2022-05-24 | Google Llc | Training keyword spotters |
JP7438744B2 (en) | 2019-12-18 | 2024-02-27 | 株式会社東芝 | Information processing device, information processing method, and program |
CN111640426A (en) * | 2020-06-10 | 2020-09-08 | 北京百度网讯科技有限公司 | Method and apparatus for outputting information |
CN111883121A (en) * | 2020-07-20 | 2020-11-03 | 北京声智科技有限公司 | Awakening method and device and electronic equipment |
CN112233655A (en) * | 2020-09-28 | 2021-01-15 | 上海声瀚信息科技有限公司 | Neural network training method for improving voice command word recognition performance |
CN112669818B (en) * | 2020-12-08 | 2022-12-02 | 北京地平线机器人技术研发有限公司 | Voice wake-up method and device, readable storage medium and electronic equipment |
CN112733272A (en) * | 2021-01-13 | 2021-04-30 | 南昌航空大学 | Method for solving vehicle path problem with soft time window |
US20220293088A1 (en) * | 2021-03-12 | 2022-09-15 | Samsung Electronics Co., Ltd. | Method of generating a trigger word detection model, and an apparatus for the same |
CN113113007A (en) * | 2021-03-30 | 2021-07-13 | 北京金山云网络技术有限公司 | Voice data processing method and device, electronic equipment and storage medium |
KR102599480B1 (en) * | 2021-05-18 | 2023-11-08 | 부산대학교 산학협력단 | System and Method for automated training keyword spotter |
CN113160823A (en) * | 2021-05-26 | 2021-07-23 | 中国工商银行股份有限公司 | Voice awakening method and device based on pulse neural network and electronic equipment |
KR20230068087A (en) * | 2021-11-10 | 2023-05-17 | 삼성전자주식회사 | Electronic apparatus and method for controlling thereof |
CN113990296B (en) * | 2021-12-24 | 2022-05-27 | 深圳市友杰智新科技有限公司 | Training method and post-processing method of voice acoustic model and related equipment |
CN115862604B (en) * | 2022-11-24 | 2024-02-20 | 镁佳(北京)科技有限公司 | Voice awakening model training and voice awakening method and device and computer equipment |
Family Cites Families (23)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05128286A (en) * | 1991-11-05 | 1993-05-25 | Ricoh Co Ltd | Keyword spotting system by neural network |
JP2007179239A (en) * | 2005-12-27 | 2007-07-12 | Kenwood Corp | Schedule management device and program |
US9117449B2 (en) * | 2012-04-26 | 2015-08-25 | Nuance Communications, Inc. | Embedded system for construction of small footprint speech recognition with user-definable constraints |
US9177547B2 (en) * | 2013-06-25 | 2015-11-03 | The Johns Hopkins University | System and method for processing speech to identify keywords or other information |
CN104378723A (en) * | 2013-08-16 | 2015-02-25 | 上海耐普微电子有限公司 | Microphone with voice wake-up function |
US9715660B2 (en) * | 2013-11-04 | 2017-07-25 | Google Inc. | Transfer learning for deep neural network based hotword detection |
US9443522B2 (en) * | 2013-11-18 | 2016-09-13 | Beijing Lenovo Software Ltd. | Voice recognition method, voice controlling method, information processing method, and electronic apparatus |
CN105096935B (en) * | 2014-05-06 | 2019-08-09 | 阿里巴巴集团控股有限公司 | A kind of pronunciation inputting method, device and system |
US10783900B2 (en) * | 2014-10-03 | 2020-09-22 | Google Llc | Convolutional, long short-term memory, fully connected deep neural networks |
CN106463112B (en) * | 2015-04-10 | 2020-12-08 | 华为技术有限公司 | Voice recognition method, voice awakening device, voice recognition device and terminal |
CN106297774B (en) * | 2015-05-29 | 2019-07-09 | 中国科学院声学研究所 | A kind of the distributed parallel training method and system of neural network acoustic model |
TWI639153B (en) * | 2015-11-03 | 2018-10-21 | 絡達科技股份有限公司 | Electronic apparatus and voice trigger method therefor |
JP6679898B2 (en) * | 2015-11-24 | 2020-04-15 | 富士通株式会社 | KEYWORD DETECTION DEVICE, KEYWORD DETECTION METHOD, AND KEYWORD DETECTION COMPUTER PROGRAM |
US10755698B2 (en) * | 2015-12-07 | 2020-08-25 | University Of Florida Research Foundation, Inc. | Pulse-based automatic speech recognition |
CN106887227A (en) * | 2015-12-16 | 2017-06-23 | 芋头科技(杭州)有限公司 | A kind of voice awakening method and system |
CN105632486B (en) * | 2015-12-23 | 2019-12-17 | 北京奇虎科技有限公司 | Voice awakening method and device of intelligent hardware |
US10229672B1 (en) * | 2015-12-31 | 2019-03-12 | Google Llc | Training acoustic models using connectionist temporal classification |
CN105931633A (en) * | 2016-05-30 | 2016-09-07 | 深圳市鼎盛智能科技有限公司 | Speech recognition method and system |
CN106098059B (en) * | 2016-06-23 | 2019-06-18 | 上海交通大学 | Customizable voice awakening method and system |
CN106611597B (en) * | 2016-12-02 | 2019-11-08 | 百度在线网络技术(北京)有限公司 | Voice awakening method and device based on artificial intelligence |
CN106782536B (en) * | 2016-12-26 | 2020-02-28 | 北京云知声信息技术有限公司 | Voice awakening method and device |
CN107221326B (en) * | 2017-05-16 | 2021-05-28 | 百度在线网络技术(北京)有限公司 | Voice awakening method and device based on artificial intelligence and computer equipment |
CN107358951A (en) * | 2017-06-29 | 2017-11-17 | 阿里巴巴集团控股有限公司 | A kind of voice awakening method, device and electronic equipment |
2017
- 2017-06-29: CN, application CN201710514348.6A, publication CN107358951A, status Pending
2018
- 2018-03-14: TW, application TW107108572A, publication TWI692751B, status Active
- 2018-06-26: ES, application ES18823086T, publication ES2878137T3, status Active
- 2018-06-26: WO, application PCT/CN2018/092899, publication WO2019001428A1, status Unknown
- 2018-06-26: KR, application KR1020197022130A, publication KR102181836B1, status IP Right Grant
- 2018-06-26: PL, application PL18823086T, publication PL3579227T3, status Unknown
- 2018-06-26: SG, application SG11201906576WA, publication SG11201906576WA, status Unknown
- 2018-06-26: JP, application JP2019539235A, publication JP6877558B2, status Active
- 2018-06-26: EP, application EP18823086.6A, publication EP3579227B1, status Active
2019
- 2019-07-19: PH, application PH12019501674A, publication PH12019501674A1, status Unknown
- 2019-09-16: US, application US16/571,468, publication US20200013390A1, status Abandoned
2020
- 2020-01-28: US, application US16/774,422, publication US10748524B2, status Active
Also Published As
Publication number | Publication date |
---|---|
EP3579227A4 (en) | 2020-02-26 |
WO2019001428A1 (en) | 2019-01-03 |
PL3579227T3 (en) | 2021-10-18 |
ES2878137T3 (en) | 2021-11-18 |
JP6877558B2 (en) | 2021-05-26 |
PH12019501674A1 (en) | 2020-06-01 |
TWI692751B (en) | 2020-05-01 |
US20200168207A1 (en) | 2020-05-28 |
CN107358951A (en) | 2017-11-17 |
TW201905897A (en) | 2019-02-01 |
KR102181836B1 (en) | 2020-11-25 |
EP3579227A1 (en) | 2019-12-11 |
EP3579227B1 (en) | 2021-06-09 |
KR20190134594A (en) | 2019-12-04 |
JP2020517977A (en) | 2020-06-18 |
US20200013390A1 (en) | 2020-01-09 |
US10748524B2 (en) | 2020-08-18 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
SG11201906576WA (en) | Speech wakeup method, apparatus, and electronic device | |
EP4024232A4 (en) | Text processing model training method, and text processing method and apparatus | |
EP3540637A4 (en) | Neural network model training method, device and storage medium for image processing | |
EP3611725A4 (en) | Voice signal processing model training method, electronic device, and storage medium | |
EP3866163A4 (en) | Voiceprint identification method, model training method and server | |
PH12019501009A1 (en) | Face liveness detection method and apparatus, and electronic device | |
EP3882808A4 (en) | Face detection model training method and apparatus, and face key point detection method and apparatus | |
EP3819835A4 (en) | Risk identification model training method and apparatus, and server | |
EP3611657A4 (en) | Model training method and method, apparatus, and device for determining data similarity | |
EP3716156A4 (en) | Neural network model training method and apparatus | |
EP3579160A4 (en) | Learned model generating method, learned model generating device, and learned model use device | |
EP3633549A4 (en) | Facial detection training method, apparatus and electronic device | |
WO2018149898A3 (en) | Methods and systems for network self-optimization using deep learning | |
EP3154054A3 (en) | Method and apparatus for training language model and recognizing speech | |
SG11202100918SA (en) | Model Training Method And Apparatus Based On Gradient Boosting Decision Tree | |
SG11202010669RA (en) | Classification model generation method and apparatus, and data identification method and apparatus | |
EP3540652A4 (en) | Method, device, chip and system for training neural network model | |
SG11202008385YA (en) | Disease prediction method and apparatus based on long short-term memory model, and computer device | |
EP4036803A4 (en) | Neural network model processing method and apparatus, computer device, and storage medium | |
EP3951646A4 (en) | Image recognition network model training method, image recognition method and device | |
EP3059699A3 (en) | Neural network training method and apparatus, and recognition method and apparatus | |
EP2977936A3 (en) | Neural network training method and apparatus, and data processing apparatus | |
EP3657428A4 (en) | Data learning server, and method for generating and using learning model thereof | |
EP3579169A4 (en) | Learned model provision method, and learned model provision device | |
GB2540062A (en) | Systems, apparatuses and methods for communication flow modification |