CN110070867B - 语音指令识别方法、计算机装置及计算机可读存储介质 - Google Patents
语音指令识别方法、计算机装置及计算机可读存储介质 Download PDFInfo
- Publication number
- CN110070867B CN110070867B CN201910342260.XA CN201910342260A CN110070867B CN 110070867 B CN110070867 B CN 110070867B CN 201910342260 A CN201910342260 A CN 201910342260A CN 110070867 B CN110070867 B CN 110070867B
- Authority
- CN
- China
- Prior art keywords
- neural network
- convolution
- layer
- convolutional neural
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 47
- 238000003860 storage Methods 0.000 title claims abstract description 20
- 238000004364 calculation method Methods 0.000 claims abstract description 54
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 51
- 238000013528 artificial neural network Methods 0.000 claims abstract description 29
- 238000004590 computer program Methods 0.000 claims description 24
- 230000006870 function Effects 0.000 claims description 11
- 238000012545 processing Methods 0.000 claims description 11
- 230000003213 activating effect Effects 0.000 claims description 6
- 230000004913 activation Effects 0.000 claims description 6
- 238000000354 decomposition reaction Methods 0.000 claims 1
- 230000008569 process Effects 0.000 abstract description 7
- 230000000306 recurrent effect Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000003062 neural network model Methods 0.000 description 2
- 230000009467 reduction Effects 0.000 description 2
- 230000001413 cellular effect Effects 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 238000013135 deep learning Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Complex Calculations (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910342260.XA CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910342260.XA CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110070867A CN110070867A (zh) | 2019-07-30 |
CN110070867B true CN110070867B (zh) | 2022-03-11 |
Family
ID=67369049
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910342260.XA Active CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110070867B (zh) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110718211B (zh) * | 2019-09-26 | 2021-12-21 | 东南大学 | 一种基于混合压缩卷积神经网络的关键词识别系统 |
CN111583940A (zh) * | 2020-04-20 | 2020-08-25 | 东南大学 | 极低功耗关键词唤醒神经网络电路 |
CN112185360B (zh) * | 2020-09-28 | 2024-07-02 | 苏州科达科技股份有限公司 | 语音数据识别方法、多人会议的语音激励方法及相关设备 |
CN113611289B (zh) * | 2021-08-06 | 2024-06-18 | 上海汽车集团股份有限公司 | 一种语音识别方法和装置 |
CN113409773B (zh) * | 2021-08-18 | 2022-01-18 | 中科南京智能技术研究院 | 一种二值化神经网络语音唤醒方法及系统 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106575379A (zh) * | 2014-09-09 | 2017-04-19 | 英特尔公司 | 用于神经网络的改进的定点整型实现方式 |
CN107808150A (zh) * | 2017-11-20 | 2018-03-16 | 珠海习悦信息技术有限公司 | 人体视频动作识别方法、装置、存储介质及处理器 |
CN108573708A (zh) * | 2017-03-08 | 2018-09-25 | 恩智浦有限公司 | 用于促进可靠样式检测的方法和系统 |
CN109448707A (zh) * | 2018-12-18 | 2019-03-08 | 北京嘉楠捷思信息技术有限公司 | 一种语音识别方法及装置、设备、介质 |
Family Cites Families (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105760933A (zh) * | 2016-02-18 | 2016-07-13 | 清华大学 | 卷积神经网络的逐层变精度定点化方法及装置 |
US10460747B2 (en) * | 2016-05-10 | 2019-10-29 | Google Llc | Frequency based audio analysis using neural networks |
CN108009625B (zh) * | 2016-11-01 | 2020-11-06 | 赛灵思公司 | 人工神经网络定点化后的微调方法和装置 |
KR102224510B1 (ko) * | 2016-12-09 | 2021-03-05 | 베이징 호라이즌 인포메이션 테크놀로지 컴퍼니 리미티드 | 데이터 관리를 위한 시스템들 및 방법들 |
CN107688849B (zh) * | 2017-07-28 | 2021-04-13 | 赛灵思电子科技(北京)有限公司 | 一种动态策略定点化训练方法及装置 |
CN107679618B (zh) * | 2017-07-28 | 2021-06-11 | 赛灵思电子科技(北京)有限公司 | 一种静态策略定点化训练方法及装置 |
CN107679622B (zh) * | 2017-09-06 | 2020-08-14 | 清华大学 | 一种面向神经网络算法的模拟感知计算架构 |
WO2019075604A1 (zh) * | 2017-10-16 | 2019-04-25 | 深圳市大疆创新科技有限公司 | 数据定点化方法和装置 |
CN107783960B (zh) * | 2017-10-23 | 2021-07-23 | 百度在线网络技术(北京)有限公司 | 用于抽取信息的方法、装置和设备 |
CN107993651B (zh) * | 2017-12-29 | 2021-01-19 | 深圳和而泰数据资源与云技术有限公司 | 一种语音识别方法、装置、电子设备及存储介质 |
CN108596328B (zh) * | 2018-04-26 | 2021-02-02 | 北京市商汤科技开发有限公司 | 一种定点化方法及装置、计算机设备 |
CN109036385A (zh) * | 2018-10-19 | 2018-12-18 | 北京旋极信息技术股份有限公司 | 一种语音指令识别方法、装置及计算机存储介质 |
CN109448719B (zh) * | 2018-12-11 | 2022-09-09 | 杭州易现先进科技有限公司 | 神经网络模型建立方法及语音唤醒方法、装置、介质和设备 |
-
2019
- 2019-04-26 CN CN201910342260.XA patent/CN110070867B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106575379A (zh) * | 2014-09-09 | 2017-04-19 | 英特尔公司 | 用于神经网络的改进的定点整型实现方式 |
CN108573708A (zh) * | 2017-03-08 | 2018-09-25 | 恩智浦有限公司 | 用于促进可靠样式检测的方法和系统 |
CN107808150A (zh) * | 2017-11-20 | 2018-03-16 | 珠海习悦信息技术有限公司 | 人体视频动作识别方法、装置、存储介质及处理器 |
CN109448707A (zh) * | 2018-12-18 | 2019-03-08 | 北京嘉楠捷思信息技术有限公司 | 一种语音识别方法及装置、设备、介质 |
Also Published As
Publication number | Publication date |
---|---|
CN110070867A (zh) | 2019-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110070867B (zh) | 语音指令识别方法、计算机装置及计算机可读存储介质 | |
CN109840589B (zh) | 一种在fpga上运行卷积神经网络的方法和装置 | |
CN110136744B (zh) | 一种音频指纹生成方法、设备及存储介质 | |
CN110335587B (zh) | 语音合成方法、系统、终端设备和可读存储介质 | |
CN110929865B (zh) | 网络量化方法、业务处理方法及相关产品 | |
CN110751944B (zh) | 构建语音识别模型的方法、装置、设备和存储介质 | |
CN112508125A (zh) | 一种图像检测模型的高效全整数量化方法 | |
CN112767927A (zh) | 一种提取语音特征的方法、装置、终端及存储介质 | |
CN110059804B (zh) | 数据处理方法及装置 | |
CN111275166B (zh) | 基于卷积神经网络的图像处理装置、设备及可读存储介质 | |
CN115457975A (zh) | 婴儿哭声和咳嗽声检测方法、装置、存储介质及终端设备 | |
CN114141237A (zh) | 语音识别方法、装置、计算机设备和存储介质 | |
CN112652299B (zh) | 时间序列语音识别深度学习模型的量化方法及装置 | |
CN111048065B (zh) | 文本纠错数据生成方法及相关装置 | |
CN116306672A (zh) | 一种数据处理方法及其装置 | |
CN111667045A (zh) | 多通道神经网络模型训练方法、装置及计算机存储介质 | |
CN111401069A (zh) | 会话文本的意图识别方法、意图识别装置及终端 | |
CN110852348B (zh) | 特征图处理方法、图像处理方法及装置 | |
CN116153326A (zh) | 语音分离方法、装置、电子设备及可读存储介质 | |
CN112489687A (zh) | 一种基于序列卷积的语音情感识别方法及装置 | |
CN117292024B (zh) | 基于语音的图像生成方法、装置、介质及电子设备 | |
CN111797984A (zh) | 一种用于多任务神经网络的量化和硬件加速方法及装置 | |
CN110717578A (zh) | 神经网络压缩方法、图像处理方法及装置 | |
CN112926724A (zh) | 注塑成型产品良率的评分方法、装置及电子设备 | |
CN113808613B (zh) | 一种轻量化的语音去噪方法、系统、设备及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190730 Assignee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Assignor: ZHUHAI SPACETOUCH Ltd. Contract record no.: X2022980021423 Denomination of invention: Speech instruction recognition method, computer device and computer readable storage medium Granted publication date: 20220311 License type: Exclusive License Record date: 20221115 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Speech instruction recognition method, computer device and computer readable storage medium Effective date of registration: 20221118 Granted publication date: 20220311 Pledgee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Pledgor: ZHUHAI SPACETOUCH Ltd. Registration number: Y2022980022393 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20231228 Granted publication date: 20220311 Pledgee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Pledgor: ZHUHAI SPACETOUCH Ltd. Registration number: Y2022980022393 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
EC01 | Cancellation of recordation of patent licensing contract |
Assignee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Assignor: ZHUHAI SPACETOUCH Ltd. Contract record no.: X2022980021423 Date of cancellation: 20240103 |
|
EC01 | Cancellation of recordation of patent licensing contract |