SG11202001627XA - Speech recognition method, apparatus, and computer readable storage medium

Speech recognition method, apparatus, and computer readable storage medium

Info

Publication number
SG11202001627XA
Authority
SG
Singapore
Prior art keywords
storage medium
computer readable
readable storage
speech recognition
recognition method
Prior art date
Application number
SG11202001627XA
Inventor
Hao Liang
Ning Cheng
Jianzong Wang
Jing Xiao
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2017-10-23
Filing date
2017-11-28
Publication date
2020-03-30
Application filed by Ping An Technology Shenzhen Co Ltd

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/28 Constructional details of speech recognition systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N20/00 Machine learning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/044 Recurrent networks, e.g. Hopfield networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N7/00 Computing arrangements based on specific mathematical models
    • G06N7/01 Probabilistic graphical models, e.g. probabilistic networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/02 Feature extraction for speech recognition; Selection of recognition unit
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/16 Speech classification or search using artificial neural networks
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • G06N3/084 Backpropagation, e.g. using gradient descent
SG11202001627XA 2017-10-23 2017-11-28 Speech recognition method, apparatus, and computer readable storage medium SG11202001627XA (en)

Applications Claiming Priority (2)

Application Number Publication Priority Date Filing Date Title
CN201710994268.5A CN107680597B (en) 2017-10-23 2017-10-23 Audio recognition method, device, equipment and computer readable storage medium
PCT/CN2017/113230 WO2019080248A1 (en) 2017-10-23 2017-11-28 Speech recognition method, device, and apparatus, and computer readable storage medium

Publications (1)

Publication Number Publication Date
SG11202001627XA (en) 2020-03-30

Family

ID=61141446

Family Applications (1)

Application Number Publication Priority Date Filing Date Title
SG11202001627XA SG11202001627XA (en) 2017-10-23 2017-11-28 Speech recognition method, apparatus, and computer readable storage medium

Country Status (4)

Country Link
US (1) US11081103B2 (en)
CN (1) CN107680597B (en)
SG (1) SG11202001627XA (en)
WO (1) WO2019080248A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114038465A (en) * 2021-04-28 2022-02-11 北京有竹居网络技术有限公司 Voice processing method and device and electronic equipment

Families Citing this family (56)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8676904B2 (en) 2008-10-02 2014-03-18 Apple Inc. Electronic devices with voice command and contextual data processing capabilities
US10199051B2 (en) 2013-02-07 2019-02-05 Apple Inc. Voice trigger for a digital assistant
US10170123B2 (en) 2014-05-30 2019-01-01 Apple Inc. Intelligent assistant for home automation
US9338493B2 (en) 2014-06-30 2016-05-10 Apple Inc. Intelligent automated assistant for TV user interactions
US10747498B2 (en) 2015-09-08 2020-08-18 Apple Inc. Zero latency digital assistant
US10691473B2 (en) 2015-11-06 2020-06-23 Apple Inc. Intelligent automated assistant in a messaging environment
DK180048B1 (en) 2017-05-11 2020-02-04 Apple Inc. MAINTAINING THE DATA PROTECTION OF PERSONAL INFORMATION
DK179496B1 (en) 2017-05-12 2019-01-15 Apple Inc. USER-SPECIFIC Acoustic Models
DK201770428A1 (en) 2017-05-12 2019-02-18 Apple Inc. Low-latency intelligent automated assistant
CN108520741B (en) 2018-04-12 2021-05-04 科大讯飞股份有限公司 Method, device and equipment for restoring ear voice and readable storage medium
CN108664460A (en) * 2018-04-16 2018-10-16 北京天使软件技术有限公司 Voice form-filling device, method, system and storage medium
US10928918B2 (en) 2018-05-07 2021-02-23 Apple Inc. Raise to speak
DK180639B1 (en) 2018-06-01 2021-11-04 Apple Inc DISABILITY OF ATTENTION-ATTENTIVE VIRTUAL ASSISTANT
CN108922513B (en) * 2018-06-04 2023-03-17 平安科技(深圳)有限公司 Voice distinguishing method and device, computer equipment and storage medium
CN108877775B (en) * 2018-06-04 2023-03-31 平安科技(深圳)有限公司 Voice data processing method and device, computer equipment and storage medium
CN110619871B (en) * 2018-06-20 2023-06-30 阿里巴巴集团控股有限公司 Voice wakeup detection method, device, equipment and storage medium
CN108776795A (en) * 2018-06-20 2018-11-09 邯郸学院 Method for identifying ID, device and terminal device
CN108962223A (en) * 2018-06-25 2018-12-07 厦门快商通信息技术有限公司 Voice gender identification method, equipment and medium based on deep learning
CN108935188A (en) * 2018-07-05 2018-12-07 平安科技(深圳)有限公司 Pig disease identification method, apparatus and electronic equipment
CN108922521B (en) * 2018-08-15 2021-07-06 合肥讯飞数码科技有限公司 Voice keyword retrieval method, device, equipment and storage medium
US11462215B2 (en) 2018-09-28 2022-10-04 Apple Inc. Multi-modal inputs for voice commands
CN109559735B (en) * 2018-10-11 2023-10-27 平安科技(深圳)有限公司 Voice recognition method, terminal equipment and medium based on neural network
CN109346103B (en) * 2018-10-30 2023-03-28 交通运输部公路科学研究所 Audio detection method for road tunnel traffic incident
CN110517679B (en) * 2018-11-15 2022-03-08 腾讯科技(深圳)有限公司 Artificial intelligence audio data processing method and device and storage medium
CN110166826B (en) * 2018-11-21 2021-10-08 腾讯科技(深圳)有限公司 Video scene recognition method and device, storage medium and computer equipment
US11114103B2 (en) 2018-12-28 2021-09-07 Alibaba Group Holding Limited Systems, methods, and computer-readable storage media for audio signal processing
CN109658921A (en) * 2019-01-04 2019-04-19 平安科技(深圳)有限公司 Audio signal processing method, equipment and computer readable storage medium
CN109872713A (en) * 2019-03-05 2019-06-11 深圳市友杰智新科技有限公司 Voice awakening method and device
US11348573B2 (en) 2019-03-18 2022-05-31 Apple Inc. Multimodality in digital assistant systems
DK201970509A1 (en) 2019-05-06 2021-01-15 Apple Inc Spoken notifications
US11307752B2 (en) 2019-05-06 2022-04-19 Apple Inc. User configurable task triggers
JP7242903B2 (en) * 2019-05-14 2023-03-20 ドルビー ラボラトリーズ ライセンシング コーポレイション Method and Apparatus for Utterance Source Separation Based on Convolutional Neural Networks
CN110277088B (en) * 2019-05-29 2024-04-09 平安科技(深圳)有限公司 Intelligent voice recognition method, intelligent voice recognition device and computer readable storage medium
US11289073B2 (en) * 2019-05-31 2022-03-29 Apple Inc. Device text to speech
US11468890B2 (en) 2019-06-01 2022-10-11 Apple Inc. Methods and user interfaces for voice-based control of electronic devices
CN110288999B (en) * 2019-07-02 2020-12-11 腾讯科技(深圳)有限公司 Speech recognition method, speech recognition device, computer equipment and storage medium
KR20210010133A (en) * 2019-07-19 2021-01-27 삼성전자주식회사 Speech recognition method, learning method for speech recognition and apparatus thereof
CN110534098A (en) * 2019-10-09 2019-12-03 国家电网有限公司客户服务中心 Age-enhanced speech recognition enhancement method and device
CN111128235A (en) * 2019-12-05 2020-05-08 厦门快商通科技股份有限公司 Age prediction method, device and equipment based on voice
CN111145765B (en) * 2019-12-31 2022-04-15 思必驰科技股份有限公司 Audio processing method and device, electronic equipment and storage medium
CN112750425B (en) * 2020-01-22 2023-11-03 腾讯科技(深圳)有限公司 Speech recognition method, device, computer equipment and computer readable storage medium
CN113470662A (en) * 2020-03-31 2021-10-01 微软技术许可有限责任公司 Generating and using text-to-speech data for keyword spotting systems and speaker adaptation in speech recognition systems
CN113593539A (en) * 2020-04-30 2021-11-02 阿里巴巴集团控股有限公司 Streaming end-to-end voice recognition method and device and electronic equipment
US11061543B1 (en) 2020-05-11 2021-07-13 Apple Inc. Providing relevant data items based on context
CN111667817A (en) * 2020-06-22 2020-09-15 平安资产管理有限责任公司 Voice recognition method, device, computer system and readable storage medium
CN111696526B (en) * 2020-06-22 2021-09-10 北京达佳互联信息技术有限公司 Method for generating voice recognition model, voice recognition method and device
US11490204B2 (en) 2020-07-20 2022-11-01 Apple Inc. Multi-device audio adjustment coordination
US11438683B2 (en) 2020-07-21 2022-09-06 Apple Inc. User identification using headphones
CN112216270B (en) * 2020-10-09 2024-02-06 携程计算机技术(上海)有限公司 Speech phoneme recognition method and system, electronic equipment and storage medium
US11942078B2 (en) * 2021-02-26 2024-03-26 International Business Machines Corporation Chunking and overlap decoding strategy for streaming RNN transducers for speech recognition
CN112820279B (en) * 2021-03-12 2024-02-09 深圳市臻络科技有限公司 Parkinson detection model construction method based on voice context dynamic characteristics
US11948550B2 (en) * 2021-05-06 2024-04-02 Sanas.ai Inc. Real-time accent conversion model
CN113724718B (en) * 2021-09-01 2022-07-29 宿迁硅基智能科技有限公司 Target audio output method, device and system
CN113724690B (en) * 2021-09-01 2023-01-03 宿迁硅基智能科技有限公司 PPG feature output method, target audio output method and device
CN113611285B (en) * 2021-09-03 2023-11-24 哈尔滨理工大学 Language identification method based on stacked bidirectional time sequence pooling
CN116415166A (en) * 2021-12-28 2023-07-11 深圳大学 Multi-keyboard mixed key sound identification method, device, equipment and storage medium

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20120072145A (en) * 2010-12-23 2012-07-03 한국전자통신연구원 Method and apparatus for recognizing speech
CN104952448A (en) * 2015-05-04 2015-09-30 张爱英 Method and system for enhancing features by aid of bidirectional long-term and short-term memory recurrent neural networks
US10332509B2 (en) * 2015-11-25 2019-06-25 Baidu USA, LLC End-to-end speech recognition
CN106803422B (en) * 2015-11-26 2020-05-12 中国科学院声学研究所 Language model reestimation method based on long-time and short-time memory network
CN105679316A (en) * 2015-12-29 2016-06-15 深圳微服机器人科技有限公司 Voice keyword identification method and apparatus based on deep neural network
CN105869624B (en) * 2016-03-29 2019-05-10 腾讯科技(深圳)有限公司 The construction method and device of tone decoding network in spoken digit recognition
US10949736B2 (en) * 2016-11-03 2021-03-16 Intel Corporation Flexible neural network accelerator and methods therefor
US20180330718A1 (en) * 2017-05-11 2018-11-15 Mitsubishi Electric Research Laboratories, Inc. System and Method for End-to-End speech recognition

Also Published As

Publication number Publication date
CN107680597B (en) 2019-07-09
CN107680597A (en) 2018-02-09
US20210074264A1 (en) 2021-03-11
US11081103B2 (en) 2021-08-03
WO2019080248A1 (en) 2019-05-02

Similar Documents

Publication Publication Date Title
SG11202001627XA (en) Speech recognition method, apparatus, and computer readable storage medium
EP3806089A4 (en) Mixed speech recognition method and apparatus, and computer readable storage medium
EP3648099A4 (en) Voice recognition method, device, apparatus, and storage medium
ZA201901031B (en) Data storage, data check, and data linkage method and apparatus
SG11202100900QA (en) Text-based speech synthesis method and device, computer device, and non-transitory computer-readable storage medium
EP3748629C0 (en) Identification method for voice keywords, computer-readable storage medium, and computer device
EP3584790A4 (en) Voiceprint recognition method, device, storage medium, and background server
EP3435035C0 (en) Route-deviation recognition method, terminal and storage medium
EP3828885C0 (en) Voice denoising method and apparatus, computing device and computer readable storage medium
EP3447769A4 (en) Speech detection method and apparatus, and storage medium
EP3370188A4 (en) Facial verification method, device, and computer storage medium
EP3584786A4 (en) Voice recognition method, electronic device, and computer storage medium
EP3605537A4 (en) Speech emotion detection method and apparatus, computer device, and storage medium
EP3321842A4 (en) Lane line recognition modeling method, apparatus, storage medium, and device, recognition method and apparatus, storage medium, and device
EP3319081A4 (en) On-board voice command identification method and apparatus, and storage medium
EP3373293A4 (en) Speech recognition method and apparatus
EP3588490A4 (en) Speech conversion method, computer device, and storage medium
EP3605407A4 (en) Information processing device, information processing method, and computer-readable storage medium
SG11201911625YA (en) Target recognition method and apparatus, storage medium, and electronic device
EP3349165A4 (en) Recognition method and apparatus for user relationship, storage medium, and server
EP3196801A4 (en) Face recognition method, device and computer readable storage medium
EP3188074A4 (en) Fingerprint information dynamic updating method and fingerprint recognition apparatus
SG11202001873SA (en) Semantic recognition method, electronic device , and computer-readable storage medium
EP3605400A4 (en) Information processing device, information processing method, and computer-readable storage medium
EP3309691A4 (en) Search recommendation method and apparatus, device, and computer storage medium