CN111833856B - 基于深度学习的语音关键信息标定方法 - Google Patents
基于深度学习的语音关键信息标定方法 Download PDFInfo
- Publication number
- CN111833856B CN111833856B CN202010682482.9A CN202010682482A CN111833856B CN 111833856 B CN111833856 B CN 111833856B CN 202010682482 A CN202010682482 A CN 202010682482A CN 111833856 B CN111833856 B CN 111833856B
- Authority
- CN
- China
- Prior art keywords
- voice
- information
- layer
- voice signal
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 18
- 238000013135 deep learning Methods 0.000 title claims abstract description 11
- 238000013527 convolutional neural network Methods 0.000 claims abstract description 26
- 230000004913 activation Effects 0.000 claims abstract description 24
- 238000013145 classification model Methods 0.000 claims abstract description 21
- 238000012549 training Methods 0.000 claims abstract description 13
- 238000013507 mapping Methods 0.000 claims abstract description 7
- 239000013598 vector Substances 0.000 claims description 40
- 238000011176 pooling Methods 0.000 claims description 23
- 238000004364 calculation method Methods 0.000 claims description 14
- 238000000605 extraction Methods 0.000 claims description 3
- 238000004458 analytical method Methods 0.000 abstract description 4
- 230000000694 effects Effects 0.000 description 4
- 230000002411 adverse Effects 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 230000008034 disappearance Effects 0.000 description 2
- 238000009825 accumulation Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/16—Speech classification or search using artificial neural networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02T—CLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
- Y02T10/00—Road transport of goods or passengers
- Y02T10/10—Internal combustion engine [ICE] based vehicles
- Y02T10/40—Engine management systems
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Computation (AREA)
- Life Sciences & Earth Sciences (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Evolutionary Biology (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Machine Translation (AREA)
- Telephonic Communication Services (AREA)
Abstract
Description
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010682482.9A CN111833856B (zh) | 2020-07-15 | 2020-07-15 | 基于深度学习的语音关键信息标定方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010682482.9A CN111833856B (zh) | 2020-07-15 | 2020-07-15 | 基于深度学习的语音关键信息标定方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111833856A CN111833856A (zh) | 2020-10-27 |
CN111833856B true CN111833856B (zh) | 2023-10-24 |
Family
ID=72922856
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010682482.9A Active CN111833856B (zh) | 2020-07-15 | 2020-07-15 | 基于深度学习的语音关键信息标定方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111833856B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114324580A (zh) * | 2021-12-03 | 2022-04-12 | 西安交通大学 | 一种结构缺陷的智能敲击检测方法及系统 |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1512402A (zh) * | 2002-12-31 | 2004-07-14 | 程松林 | 一种语音检索方法及采用该方法的音像信息检索系统 |
CN107578775A (zh) * | 2017-09-07 | 2018-01-12 | 四川大学 | 一种基于深度神经网络的多任务语音分类方法 |
CN108305617A (zh) * | 2018-01-31 | 2018-07-20 | 腾讯科技(深圳)有限公司 | 语音关键词的识别方法和装置 |
CN109599126A (zh) * | 2018-12-29 | 2019-04-09 | 广州丰石科技有限公司 | 一种基于mel能量谱和卷积神经网络的声音故障识别方法 |
CN109979440A (zh) * | 2019-03-13 | 2019-07-05 | 广州市网星信息技术有限公司 | 关键词样本确定方法、语音识别方法、装置、设备和介质 |
CN110378480A (zh) * | 2019-06-14 | 2019-10-25 | 平安科技(深圳)有限公司 | 模型训练方法、装置及计算机可读存储介质 |
CN110490154A (zh) * | 2019-08-23 | 2019-11-22 | 集美大学 | 一种多维泄漏信息检测方法、终端设备及存储介质 |
CN110717415A (zh) * | 2019-09-24 | 2020-01-21 | 上海数创医疗科技有限公司 | 基于特征选取的st段分类卷积神经网络及其使用方法 |
CN110909819A (zh) * | 2019-12-02 | 2020-03-24 | 集美大学 | 基于时域的电磁信息泄漏检测方法、终端设备及存储介质 |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9715660B2 (en) * | 2013-11-04 | 2017-07-25 | Google Inc. | Transfer learning for deep neural network based hotword detection |
US10360901B2 (en) * | 2013-12-06 | 2019-07-23 | Nuance Communications, Inc. | Learning front-end speech recognition parameters within neural network training |
US20190147854A1 (en) * | 2017-11-16 | 2019-05-16 | Microsoft Technology Licensing, Llc | Speech Recognition Source to Target Domain Adaptation |
-
2020
- 2020-07-15 CN CN202010682482.9A patent/CN111833856B/zh active Active
Patent Citations (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1512402A (zh) * | 2002-12-31 | 2004-07-14 | 程松林 | 一种语音检索方法及采用该方法的音像信息检索系统 |
CN107578775A (zh) * | 2017-09-07 | 2018-01-12 | 四川大学 | 一种基于深度神经网络的多任务语音分类方法 |
CN108305617A (zh) * | 2018-01-31 | 2018-07-20 | 腾讯科技(深圳)有限公司 | 语音关键词的识别方法和装置 |
CN110444195A (zh) * | 2018-01-31 | 2019-11-12 | 腾讯科技(深圳)有限公司 | 语音关键词的识别方法和装置 |
CN109599126A (zh) * | 2018-12-29 | 2019-04-09 | 广州丰石科技有限公司 | 一种基于mel能量谱和卷积神经网络的声音故障识别方法 |
CN109979440A (zh) * | 2019-03-13 | 2019-07-05 | 广州市网星信息技术有限公司 | 关键词样本确定方法、语音识别方法、装置、设备和介质 |
CN110378480A (zh) * | 2019-06-14 | 2019-10-25 | 平安科技(深圳)有限公司 | 模型训练方法、装置及计算机可读存储介质 |
CN110490154A (zh) * | 2019-08-23 | 2019-11-22 | 集美大学 | 一种多维泄漏信息检测方法、终端设备及存储介质 |
CN110717415A (zh) * | 2019-09-24 | 2020-01-21 | 上海数创医疗科技有限公司 | 基于特征选取的st段分类卷积神经网络及其使用方法 |
CN110909819A (zh) * | 2019-12-02 | 2020-03-24 | 集美大学 | 基于时域的电磁信息泄漏检测方法、终端设备及存储介质 |
Non-Patent Citations (3)
Title |
---|
《Internal Calibration System Using Learning Algorithm With Gradient Descent》;Chan-Yong Jung et, al.;《 IEEE Geoscience and Remote Sensing Letters 》;第17卷(第9期);1503 - 1507 * |
Dong Yu et,al..《Word confidence calibration using a maximum entropy model with constraints on confidence and word distributions》.《2010 IEEE International Conference on Acoustics, Speech and Signal Processing》.2010,4446-4449. * |
面向汽车电子控制的嵌入式语音识别系统设计;操太伟;《中国优秀硕士学位论文全文数据库 工程科技Ⅱ辑》;C035-75 * |
Also Published As
Publication number | Publication date |
---|---|
CN111833856A (zh) | 2020-10-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111680706B (zh) | 一种基于编码和解码结构的双通道输出轮廓检测方法 | |
CN111562108A (zh) | 一种基于cnn和fcmc的滚动轴承智能故障诊断方法 | |
CN111724770B (zh) | 一种基于深度卷积生成对抗网络的音频关键词识别方法 | |
CN110161388B (zh) | 一种高压设备的故障类型识别方法及其系统 | |
CN102915729B (zh) | 语音关键词检出系统、创建用于其的词典的系统和方法 | |
CN111199202B (zh) | 基于循环注意力网络的人体动作识别方法及识别装置 | |
CN111986699B (zh) | 基于全卷积网络的声音事件检测方法 | |
US20220238100A1 (en) | Voice data processing based on deep learning | |
CN112860183B (zh) | 基于高阶矩匹配的多源蒸馏-迁移机械故障智能诊断方法 | |
CN110991422A (zh) | 基于多元时移多尺度排列熵的滚动轴承故障诊断方法 | |
CN116167010B (zh) | 具有智能迁移学习能力的电力系统异常事件快速识别方法 | |
CN115580445A (zh) | 一种未知攻击入侵检测方法、装置和计算机可读存储介质 | |
CN111833856B (zh) | 基于深度学习的语音关键信息标定方法 | |
CN110289004B (zh) | 一种基于深度学习的人工合成声纹检测系统及方法 | |
CN115588112A (zh) | 一种基于rfef-yolo目标检测方法 | |
CN115345255A (zh) | 一种故障诊断方法、控制装置、终端及存储介质 | |
CN115457982A (zh) | 情感预测模型的预训练优化方法、装置、设备及介质 | |
CN111883177B (zh) | 基于深度学习的语音关键信息分离方法 | |
CN106057196B (zh) | 车载语音数据解析识别方法 | |
CN116738332A (zh) | 一种结合注意力机制的飞行器多尺度信号分类识别与故障检测方法 | |
CN116090449B (zh) | 一种质量问题分析报告的实体关系抽取方法及系统 | |
CN107871113B (zh) | 一种情感混合识别检测的方法和装置 | |
CN115457966A (zh) | 基于改进ds证据理论多分类器融合的猪咳嗽声识别方法 | |
CN115249329A (zh) | 一种基于深度学习的苹果叶片病害检测方法 | |
CN115375959A (zh) | 一种车辆图像识别模型建立及识别方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20230921 Address after: 361000 4th floor, No. 319, Huoju Road, Huoju Park, Huoju high tech Zone, Xiamen, Fujian Province Applicant after: XIAMEN HEROCHEER ELECTRONIC TECHNOLOGY CO.,LTD. Applicant after: Xiamen Xiquan Digital Technology Co.,Ltd. Applicant after: Shanghai Xizhong Technology Co.,Ltd. Address before: Room 621, South Building, torch Plaza, No. 56-58, torch garden, torch hi tech Zone, Xiamen City, Fujian Province, 361000 Applicant before: XIAMEN HEROCHEER ELECTRONIC TECHNOLOGY CO.,LTD. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant |