SG11201906576WA - Speech wakeup method, apparatus, and electronic device - Google Patents

Speech wakeup method, apparatus, and electronic device

Info

Publication number
SG11201906576WA
SG11201906576WA SG11201906576WA SG11201906576WA SG11201906576WA SG 11201906576W A SG11201906576W A SG 11201906576WA SG 11201906576W A SG11201906576W A SG 11201906576WA SG 11201906576W A SG11201906576W A SG 11201906576WA SG 11201906576W A SG11201906576W A SG 11201906576WA
Authority
SG
Singapore
Prior art keywords
speech wakeup
electronic device
speech
wakeup method
wakeup
Prior art date
Application number
SG11201906576WA
Inventor
Zhiming Wang
Jun Zhou
Xiaolong Li
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Publication of SG11201906576WA publication Critical patent/SG11201906576WA/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/16Speech classification or search using artificial neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0631Creating reference templates; Clustering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Abstract

A speech wakeup method, apparatus, and electronic device are disclosed in embodiments of this specification. The method includes: implementing speech wakeup by using a speech wakeup model that includes a Deep Neural Network (DNN) and a Connectionist Temporal 5 Classifier (CTC). The speech wakeup model can be obtained by training with general speech data.
SG11201906576WA 2017-06-29 2018-06-26 Speech wakeup method, apparatus, and electronic device SG11201906576WA (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201710514348.6A CN107358951A (en) 2017-06-29 2017-06-29 A kind of voice awakening method, device and electronic equipment
PCT/CN2018/092899 WO2019001428A1 (en) 2017-06-29 2018-06-26 Voice wake-up method and device and electronic device

Publications (1)

Publication Number Publication Date
SG11201906576WA true SG11201906576WA (en) 2019-08-27

Family

ID=60274110

Family Applications (1)

Application Number Title Priority Date Filing Date
SG11201906576WA SG11201906576WA (en) 2017-06-29 2018-06-26 Speech wakeup method, apparatus, and electronic device

Country Status (11)

Country Link
US (2) US20200013390A1 (en)
EP (1) EP3579227B1 (en)
JP (1) JP6877558B2 (en)
KR (1) KR102181836B1 (en)
CN (1) CN107358951A (en)
ES (1) ES2878137T3 (en)
PH (1) PH12019501674A1 (en)
PL (1) PL3579227T3 (en)
SG (1) SG11201906576WA (en)
TW (1) TWI692751B (en)
WO (1) WO2019001428A1 (en)

Families Citing this family (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107358951A (en) * 2017-06-29 2017-11-17 阿里巴巴集团控股有限公司 A kind of voice awakening method, device and electronic equipment
CN108320733B (en) * 2017-12-18 2022-01-04 上海科大讯飞信息科技有限公司 Voice data processing method and device, storage medium and electronic equipment
CN108182937B (en) * 2018-01-17 2021-04-13 出门问问创新科技有限公司 Keyword recognition method, device, equipment and storage medium
US11488002B2 (en) * 2018-02-15 2022-11-01 Atlazo, Inc. Binary neural network accelerator engine methods and systems
CN108597523B (en) * 2018-03-23 2019-05-17 平安科技(深圳)有限公司 Identified by speaking person method, server and computer readable storage medium
CN111066082B (en) * 2018-05-25 2020-08-28 北京嘀嘀无限科技发展有限公司 Voice recognition system and method
CN110619871B (en) * 2018-06-20 2023-06-30 阿里巴巴集团控股有限公司 Voice wakeup detection method, device, equipment and storage medium
US11257481B2 (en) * 2018-10-24 2022-02-22 Tencent America LLC Multi-task training architecture and strategy for attention-based speech recognition system
CN111276138B (en) * 2018-12-05 2023-07-18 北京嘀嘀无限科技发展有限公司 Method and device for processing voice signal in voice wake-up system
CN109886386B (en) * 2019-01-30 2020-10-27 北京声智科技有限公司 Method and device for determining wake-up model
CN109872713A (en) * 2019-03-05 2019-06-11 深圳市友杰智新科技有限公司 A kind of voice awakening method and device
CN110310628B (en) * 2019-06-27 2022-05-20 百度在线网络技术(北京)有限公司 Method, device and equipment for optimizing wake-up model and storage medium
US11081102B2 (en) * 2019-08-16 2021-08-03 Ponddy Education Inc. Systems and methods for comprehensive Chinese speech scoring and diagnosis
JP7098587B2 (en) * 2019-08-29 2022-07-11 株式会社東芝 Information processing device, keyword detection device, information processing method and program
CN110634468B (en) * 2019-09-11 2022-04-15 中国联合网络通信集团有限公司 Voice wake-up method, device, equipment and computer readable storage medium
CN110648659B (en) * 2019-09-24 2022-07-01 上海依图信息技术有限公司 Voice recognition and keyword detection device and method based on multitask model
CN110648668A (en) * 2019-09-24 2020-01-03 上海依图信息技术有限公司 Keyword detection device and method
CN110970016B (en) * 2019-10-28 2022-08-19 苏宁云计算有限公司 Awakening model generation method, intelligent terminal awakening method and device
CN110853629A (en) * 2019-11-21 2020-02-28 中科智云科技有限公司 Speech recognition digital method based on deep learning
CN110992929A (en) * 2019-11-26 2020-04-10 苏宁云计算有限公司 Voice keyword detection method, device and system based on neural network
US11341954B2 (en) * 2019-12-17 2022-05-24 Google Llc Training keyword spotters
JP7438744B2 (en) 2019-12-18 2024-02-27 株式会社東芝 Information processing device, information processing method, and program
CN111640426A (en) * 2020-06-10 2020-09-08 北京百度网讯科技有限公司 Method and apparatus for outputting information
CN111883121A (en) * 2020-07-20 2020-11-03 北京声智科技有限公司 Awakening method and device and electronic equipment
CN112233655A (en) * 2020-09-28 2021-01-15 上海声瀚信息科技有限公司 Neural network training method for improving voice command word recognition performance
CN112669818B (en) * 2020-12-08 2022-12-02 北京地平线机器人技术研发有限公司 Voice wake-up method and device, readable storage medium and electronic equipment
CN112733272A (en) * 2021-01-13 2021-04-30 南昌航空大学 Method for solving vehicle path problem with soft time window
US20220293088A1 (en) * 2021-03-12 2022-09-15 Samsung Electronics Co., Ltd. Method of generating a trigger word detection model, and an apparatus for the same
CN113113007A (en) * 2021-03-30 2021-07-13 北京金山云网络技术有限公司 Voice data processing method and device, electronic equipment and storage medium
KR102599480B1 (en) * 2021-05-18 2023-11-08 부산대학교 산학협력단 System and Method for automated training keyword spotter
CN113160823A (en) * 2021-05-26 2021-07-23 中国工商银行股份有限公司 Voice awakening method and device based on pulse neural network and electronic equipment
KR20230068087A (en) * 2021-11-10 2023-05-17 삼성전자주식회사 Electronic apparatus and method for controlling thereof
CN113990296B (en) * 2021-12-24 2022-05-27 深圳市友杰智新科技有限公司 Training method and post-processing method of voice acoustic model and related equipment
CN115862604B (en) * 2022-11-24 2024-02-20 镁佳(北京)科技有限公司 Voice awakening model training and voice awakening method and device and computer equipment

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH05128286A (en) * 1991-11-05 1993-05-25 Ricoh Co Ltd Keyword spotting system by neural network
JP2007179239A (en) * 2005-12-27 2007-07-12 Kenwood Corp Schedule management device and program
US9117449B2 (en) * 2012-04-26 2015-08-25 Nuance Communications, Inc. Embedded system for construction of small footprint speech recognition with user-definable constraints
US9177547B2 (en) * 2013-06-25 2015-11-03 The Johns Hopkins University System and method for processing speech to identify keywords or other information
CN104378723A (en) * 2013-08-16 2015-02-25 上海耐普微电子有限公司 Microphone with voice wake-up function
US9715660B2 (en) * 2013-11-04 2017-07-25 Google Inc. Transfer learning for deep neural network based hotword detection
US9443522B2 (en) * 2013-11-18 2016-09-13 Beijing Lenovo Software Ltd. Voice recognition method, voice controlling method, information processing method, and electronic apparatus
CN105096935B (en) * 2014-05-06 2019-08-09 阿里巴巴集团控股有限公司 A kind of pronunciation inputting method, device and system
US10783900B2 (en) * 2014-10-03 2020-09-22 Google Llc Convolutional, long short-term memory, fully connected deep neural networks
CN106463112B (en) * 2015-04-10 2020-12-08 华为技术有限公司 Voice recognition method, voice awakening device, voice recognition device and terminal
CN106297774B (en) * 2015-05-29 2019-07-09 中国科学院声学研究所 A kind of the distributed parallel training method and system of neural network acoustic model
TWI639153B (en) * 2015-11-03 2018-10-21 絡達科技股份有限公司 Electronic apparatus and voice trigger method therefor
JP6679898B2 (en) * 2015-11-24 2020-04-15 富士通株式会社 KEYWORD DETECTION DEVICE, KEYWORD DETECTION METHOD, AND KEYWORD DETECTION COMPUTER PROGRAM
US10755698B2 (en) * 2015-12-07 2020-08-25 University Of Florida Research Foundation, Inc. Pulse-based automatic speech recognition
CN106887227A (en) * 2015-12-16 2017-06-23 芋头科技(杭州)有限公司 A kind of voice awakening method and system
CN105632486B (en) * 2015-12-23 2019-12-17 北京奇虎科技有限公司 Voice awakening method and device of intelligent hardware
US10229672B1 (en) * 2015-12-31 2019-03-12 Google Llc Training acoustic models using connectionist temporal classification
CN105931633A (en) * 2016-05-30 2016-09-07 深圳市鼎盛智能科技有限公司 Speech recognition method and system
CN106098059B (en) * 2016-06-23 2019-06-18 上海交通大学 Customizable voice awakening method and system
CN106611597B (en) * 2016-12-02 2019-11-08 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence
CN106782536B (en) * 2016-12-26 2020-02-28 北京云知声信息技术有限公司 Voice awakening method and device
CN107221326B (en) * 2017-05-16 2021-05-28 百度在线网络技术(北京)有限公司 Voice awakening method and device based on artificial intelligence and computer equipment
CN107358951A (en) * 2017-06-29 2017-11-17 阿里巴巴集团控股有限公司 A kind of voice awakening method, device and electronic equipment

Also Published As

Publication number Publication date
EP3579227A4 (en) 2020-02-26
WO2019001428A1 (en) 2019-01-03
PL3579227T3 (en) 2021-10-18
ES2878137T3 (en) 2021-11-18
JP6877558B2 (en) 2021-05-26
PH12019501674A1 (en) 2020-06-01
TWI692751B (en) 2020-05-01
US20200168207A1 (en) 2020-05-28
CN107358951A (en) 2017-11-17
TW201905897A (en) 2019-02-01
KR102181836B1 (en) 2020-11-25
EP3579227A1 (en) 2019-12-11
EP3579227B1 (en) 2021-06-09
KR20190134594A (en) 2019-12-04
JP2020517977A (en) 2020-06-18
US20200013390A1 (en) 2020-01-09
US10748524B2 (en) 2020-08-18

Similar Documents

Publication Publication Date Title
SG11201906576WA (en) Speech wakeup method, apparatus, and electronic device
EP4024232A4 (en) Text processing model training method, and text processing method and apparatus
EP3540637A4 (en) Neural network model training method, device and storage medium for image processing
EP3611725A4 (en) Voice signal processing model training method, electronic device, and storage medium
EP3866163A4 (en) Voiceprint identification method, model training method and server
PH12019501009A1 (en) Face liveness detection method and apparatus, and electronic device
EP3882808A4 (en) Face detection model training method and apparatus, and face key point detection method and apparatus
EP3819835A4 (en) Risk identification model training method and apparatus, and server
EP3611657A4 (en) Model training method and method, apparatus, and device for determining data similarity
EP3716156A4 (en) Neural network model training method and apparatus
EP3579160A4 (en) Learned model generating method, learned model generating device, and learned model use device
EP3633549A4 (en) Facial detection training method, apparatus and electronic device
WO2018149898A3 (en) Methods and systems for network self-optimization using deep learning
EP3154054A3 (en) Method and apparatus for training language model and recognizing speech
SG11202100918SA (en) Model Training Method And Apparatus Based On Gradient Boosting Decision Tree
SG11202010669RA (en) Classification model generation method and apparatus, and data identification method and apparatus
EP3540652A4 (en) Method, device, chip and system for training neural network model
SG11202008385YA (en) Disease prediction method and apparatus based on long short-term memory model, and computer device
EP4036803A4 (en) Neural network model processing method and apparatus, computer device, and storage medium
EP3951646A4 (en) Image recognition network model training method, image recognition method and device
EP3059699A3 (en) Neural network training method and apparatus, and recognition method and apparatus
EP2977936A3 (en) Neural network training method and apparatus, and data processing apparatus
EP3657428A4 (en) Data learning server, and method for generating and using learning model thereof
EP3579169A4 (en) Learned model provision method, and learned model provision device
GB2540062A (en) Systems, apparatuses and methods for communication flow modification