CN110070867A - 语音指令识别方法、计算机装置及计算机可读存储介质 - Google Patents
语音指令识别方法、计算机装置及计算机可读存储介质 Download PDFInfo
- Publication number
- CN110070867A CN110070867A CN201910342260.XA CN201910342260A CN110070867A CN 110070867 A CN110070867 A CN 110070867A CN 201910342260 A CN201910342260 A CN 201910342260A CN 110070867 A CN110070867 A CN 110070867A
- Authority
- CN
- China
- Prior art keywords
- convolution
- layer
- convolutional neural
- neural networks
- output valve
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 238000003860 storage Methods 0.000 title claims abstract description 19
- 238000009434 installation Methods 0.000 title claims abstract description 14
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 53
- 238000013528 artificial neural network Methods 0.000 claims abstract description 19
- 238000004590 computer program Methods 0.000 claims description 20
- 238000012545 processing Methods 0.000 claims description 14
- 238000004364 calculation method Methods 0.000 abstract description 25
- 230000008569 process Effects 0.000 abstract description 5
- 239000000284 extract Substances 0.000 abstract description 4
- 230000006870 function Effects 0.000 description 8
- 230000001537 neural effect Effects 0.000 description 5
- 230000004913 activation Effects 0.000 description 4
- 230000008859 change Effects 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000000306 recurrent effect Effects 0.000 description 4
- 238000011160 research Methods 0.000 description 3
- 230000007423 decrease Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 239000011159 matrix material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 210000004218 nerve net Anatomy 0.000 description 2
- 230000002035 prolonged effect Effects 0.000 description 2
- 241001269238 Data Species 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- FFBHFFJDDLITSX-UHFFFAOYSA-N benzyl N-[2-hydroxy-4-(3-oxomorpholin-4-yl)phenyl]carbamate Chemical compound OC1=C(NC(=O)OCC2=CC=CC=C2)C=CC(=C1)N1CCOCC1=O FFBHFFJDDLITSX-UHFFFAOYSA-N 0.000 description 1
- 229910002056 binary alloy Inorganic materials 0.000 description 1
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 210000003739 neck Anatomy 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 238000003672 processing method Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
- 238000012549 training Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Complex Calculations (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910342260.XA CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910342260.XA CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110070867A true CN110070867A (zh) | 2019-07-30 |
CN110070867B CN110070867B (zh) | 2022-03-11 |
Family
ID=67369049
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910342260.XA Active CN110070867B (zh) | 2019-04-26 | 2019-04-26 | 语音指令识别方法、计算机装置及计算机可读存储介质 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110070867B (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110718211A (zh) * | 2019-09-26 | 2020-01-21 | 东南大学 | 一种基于混合压缩卷积神经网络的关键词识别系统 |
CN111583940A (zh) * | 2020-04-20 | 2020-08-25 | 东南大学 | 极低功耗关键词唤醒神经网络电路 |
CN112185360A (zh) * | 2020-09-28 | 2021-01-05 | 苏州科达科技股份有限公司 | 语音数据识别方法、多人会议的语音激励方法及相关设备 |
CN113409773A (zh) * | 2021-08-18 | 2021-09-17 | 中科南京智能技术研究院 | 一种二值化神经网络语音唤醒方法及系统 |
CN113611289A (zh) * | 2021-08-06 | 2021-11-05 | 上海汽车集团股份有限公司 | 一种语音识别方法和装置 |
Citations (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105760933A (zh) * | 2016-02-18 | 2016-07-13 | 清华大学 | 卷积神经网络的逐层变精度定点化方法及装置 |
CN106575379A (zh) * | 2014-09-09 | 2017-04-19 | 英特尔公司 | 用于神经网络的改进的定点整型实现方式 |
CN107679618A (zh) * | 2017-07-28 | 2018-02-09 | 北京深鉴科技有限公司 | 一种静态策略定点化训练方法及装置 |
CN107679622A (zh) * | 2017-09-06 | 2018-02-09 | 清华大学 | 一种面向神经网络算法的模拟感知计算架构 |
CN107688849A (zh) * | 2017-07-28 | 2018-02-13 | 北京深鉴科技有限公司 | 一种动态策略定点化训练方法及装置 |
CN107783960A (zh) * | 2017-10-23 | 2018-03-09 | 百度在线网络技术(北京)有限公司 | 用于抽取信息的方法、装置和设备 |
CN107808150A (zh) * | 2017-11-20 | 2018-03-16 | 珠海习悦信息技术有限公司 | 人体视频动作识别方法、装置、存储介质及处理器 |
CN107993651A (zh) * | 2017-12-29 | 2018-05-04 | 深圳和而泰数据资源与云技术有限公司 | 一种语音识别方法、装置、电子设备及存储介质 |
CN108009625A (zh) * | 2016-11-01 | 2018-05-08 | 北京深鉴科技有限公司 | 人工神经网络定点化后的微调方法和装置 |
WO2018103736A1 (en) * | 2016-12-09 | 2018-06-14 | Beijing Horizon Information Technology Co., Ltd. | Systems and methods for data management |
CN108573708A (zh) * | 2017-03-08 | 2018-09-25 | 恩智浦有限公司 | 用于促进可靠样式检测的方法和系统 |
CN108596328A (zh) * | 2018-04-26 | 2018-09-28 | 北京市商汤科技开发有限公司 | 一种定点化方法及装置、计算机设备 |
CN108701250A (zh) * | 2017-10-16 | 2018-10-23 | 深圳市大疆创新科技有限公司 | 数据定点化方法和装置 |
CN109036385A (zh) * | 2018-10-19 | 2018-12-18 | 北京旋极信息技术股份有限公司 | 一种语音指令识别方法、装置及计算机存储介质 |
CN109155006A (zh) * | 2016-05-10 | 2019-01-04 | 谷歌有限责任公司 | 使用神经网络进行基于频率的音频分析 |
CN109448707A (zh) * | 2018-12-18 | 2019-03-08 | 北京嘉楠捷思信息技术有限公司 | 一种语音识别方法及装置、设备、介质 |
CN109448719A (zh) * | 2018-12-11 | 2019-03-08 | 网易(杭州)网络有限公司 | 神经网络模型建立方法及语音唤醒方法、装置、介质和设备 |
-
2019
- 2019-04-26 CN CN201910342260.XA patent/CN110070867B/zh active Active
Patent Citations (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106575379A (zh) * | 2014-09-09 | 2017-04-19 | 英特尔公司 | 用于神经网络的改进的定点整型实现方式 |
CN105760933A (zh) * | 2016-02-18 | 2016-07-13 | 清华大学 | 卷积神经网络的逐层变精度定点化方法及装置 |
CN109155006A (zh) * | 2016-05-10 | 2019-01-04 | 谷歌有限责任公司 | 使用神经网络进行基于频率的音频分析 |
CN108009625A (zh) * | 2016-11-01 | 2018-05-08 | 北京深鉴科技有限公司 | 人工神经网络定点化后的微调方法和装置 |
WO2018103736A1 (en) * | 2016-12-09 | 2018-06-14 | Beijing Horizon Information Technology Co., Ltd. | Systems and methods for data management |
CN108573708A (zh) * | 2017-03-08 | 2018-09-25 | 恩智浦有限公司 | 用于促进可靠样式检测的方法和系统 |
CN107688849A (zh) * | 2017-07-28 | 2018-02-13 | 北京深鉴科技有限公司 | 一种动态策略定点化训练方法及装置 |
CN107679618A (zh) * | 2017-07-28 | 2018-02-09 | 北京深鉴科技有限公司 | 一种静态策略定点化训练方法及装置 |
CN107679622A (zh) * | 2017-09-06 | 2018-02-09 | 清华大学 | 一种面向神经网络算法的模拟感知计算架构 |
CN108701250A (zh) * | 2017-10-16 | 2018-10-23 | 深圳市大疆创新科技有限公司 | 数据定点化方法和装置 |
CN107783960A (zh) * | 2017-10-23 | 2018-03-09 | 百度在线网络技术(北京)有限公司 | 用于抽取信息的方法、装置和设备 |
CN107808150A (zh) * | 2017-11-20 | 2018-03-16 | 珠海习悦信息技术有限公司 | 人体视频动作识别方法、装置、存储介质及处理器 |
CN107993651A (zh) * | 2017-12-29 | 2018-05-04 | 深圳和而泰数据资源与云技术有限公司 | 一种语音识别方法、装置、电子设备及存储介质 |
CN108596328A (zh) * | 2018-04-26 | 2018-09-28 | 北京市商汤科技开发有限公司 | 一种定点化方法及装置、计算机设备 |
CN109036385A (zh) * | 2018-10-19 | 2018-12-18 | 北京旋极信息技术股份有限公司 | 一种语音指令识别方法、装置及计算机存储介质 |
CN109448719A (zh) * | 2018-12-11 | 2019-03-08 | 网易(杭州)网络有限公司 | 神经网络模型建立方法及语音唤醒方法、装置、介质和设备 |
CN109448707A (zh) * | 2018-12-18 | 2019-03-08 | 北京嘉楠捷思信息技术有限公司 | 一种语音识别方法及装置、设备、介质 |
Non-Patent Citations (3)
Title |
---|
BIKANG PENG: ""A face Detection framework based on deep cascaded full convolutional neural networks"", 《2019 IEEE 4TH ICCCS》 * |
MUHAMMAD SHAHNAWAZ: ""Studying the effects of feature extraction settings on the accuracy and memory requirements of neural networks for keyword spotting"", 《2018 IEEE ICCE-BERLIN》 * |
李雪莲: ""基于三维可分离卷积神经网络的动态手势识别技术研究"", 《万方数据库》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110718211A (zh) * | 2019-09-26 | 2020-01-21 | 东南大学 | 一种基于混合压缩卷积神经网络的关键词识别系统 |
CN111583940A (zh) * | 2020-04-20 | 2020-08-25 | 东南大学 | 极低功耗关键词唤醒神经网络电路 |
CN112185360A (zh) * | 2020-09-28 | 2021-01-05 | 苏州科达科技股份有限公司 | 语音数据识别方法、多人会议的语音激励方法及相关设备 |
CN113611289A (zh) * | 2021-08-06 | 2021-11-05 | 上海汽车集团股份有限公司 | 一种语音识别方法和装置 |
CN113409773A (zh) * | 2021-08-18 | 2021-09-17 | 中科南京智能技术研究院 | 一种二值化神经网络语音唤醒方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
CN110070867B (zh) | 2022-03-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110070867A (zh) | 语音指令识别方法、计算机装置及计算机可读存储介质 | |
CN109871532B (zh) | 文本主题提取方法、装置及存储介质 | |
Yuan et al. | High performance CNN accelerators based on hardware and algorithm co-optimization | |
CN111553406B (zh) | 基于改进yolo-v3的目标检测系统、方法及终端 | |
CN110050267A (zh) | 用于数据管理的系统和方法 | |
CN107679082A (zh) | 问答搜索方法、装置以及电子设备 | |
CN110136744A (zh) | 一种音频指纹生成方法、设备及存储介质 | |
CN103942571B (zh) | 一种基于遗传规划算法的图形图像分类方法 | |
CN111105017B (zh) | 神经网络量化方法、装置及电子设备 | |
CN111080654B (zh) | 图像的病变区域分割方法、装置及服务器 | |
CN112163601A (zh) | 图像分类方法、系统、计算机设备及存储介质 | |
CN111062854A (zh) | 检测水印的方法、装置、终端及存储介质 | |
CN114783021A (zh) | 一种口罩佩戴智能检测方法、装置、设备及介质 | |
CN110765843B (zh) | 人脸验证方法、装置、计算机设备及存储介质 | |
CN115223042A (zh) | 基于YOLOv5网络模型的目标识别方法及装置 | |
CN113361567B (zh) | 图像处理方法、装置、电子设备和存储介质 | |
CN116227573B (zh) | 分割模型训练方法、图像分割方法、装置及相关介质 | |
CN113299298A (zh) | 残差单元及网络及目标识别方法及系统及装置及介质 | |
CN116386803A (zh) | 一种基于图的细胞病理报告生成方法 | |
CN116524352A (zh) | 一种遥感图像水体提取方法及装置 | |
CN116166993A (zh) | 电力线路故障类型识别方法及装置、电力系统、存储介质 | |
CN112183725B (zh) | 提供神经网络的方法、计算装置和计算机可读存储介质 | |
CN115266141A (zh) | 基于gru-c网络的点焊质量检测方法、装置及存储介质 | |
CN112132269B (zh) | 模型处理方法、装置、设备及存储介质 | |
CN111767710B (zh) | 印尼语的情感分类方法、装置、设备及介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190730 Assignee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Assignor: ZHUHAI SPACETOUCH Ltd. Contract record no.: X2022980021423 Denomination of invention: Speech instruction recognition method, computer device and computer readable storage medium Granted publication date: 20220311 License type: Exclusive License Record date: 20221115 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: Speech instruction recognition method, computer device and computer readable storage medium Effective date of registration: 20221118 Granted publication date: 20220311 Pledgee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Pledgor: ZHUHAI SPACETOUCH Ltd. Registration number: Y2022980022393 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20231228 Granted publication date: 20220311 Pledgee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Pledgor: ZHUHAI SPACETOUCH Ltd. Registration number: Y2022980022393 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
EC01 | Cancellation of recordation of patent licensing contract |
Assignee: Hengqin Financial Investment International Finance Leasing Co.,Ltd. Assignor: ZHUHAI SPACETOUCH Ltd. Contract record no.: X2022980021423 Date of cancellation: 20240103 |
|
EC01 | Cancellation of recordation of patent licensing contract |