CN111916083B - A voice command recognition algorithm for smart devices based on big data collection - Google Patents
- Publication number
- CN111916083B (application number CN202010842396.XA)
- Authority
- CN
- China
- Prior art keywords
- waveform diagram
- big data
- voice
- waveform
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F2218/00—Aspects of pattern recognition specially adapted for signal processing
- G06F2218/02—Preprocessing
- G06F2218/04—Denoising
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02P—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
- Y02P90/00—Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
- Y02P90/02—Total factory control, e.g. smart factories, flexible manufacturing systems [FMS] or integrated manufacturing systems [IMS]
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Library & Information Science (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Electrically Operated Instructional Devices (AREA)
Claims (5)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010842396.XA (CN111916083B, zh) | 2020-08-20 | 2020-08-20 | A voice command recognition algorithm for smart devices based on big data collection |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111916083A (zh) | 2020-11-10 |
CN111916083B (zh) | 2023-08-22 |
Family
ID=73279214
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010842396.XA (Active; CN111916083B, zh) | A voice command recognition algorithm for smart devices based on big data collection | 2020-08-20 | 2020-08-20 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111916083B (zh) |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
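CN101067928A (zh) * | 2007-07-10 | 2007-11-07 | 章森 | A new method for measuring speech waveform similarity |
KR20090063566A (ko) * | 2007-12-14 | 2009-06-18 | 송옥기 | Voice recognition game device |
CN106251868A (zh) * | 2016-08-09 | 2016-12-21 | 江门雷斯诺照明有限公司 | Voice recognition control method for lamps with intelligent noise reduction |
CN107220292A (zh) * | 2017-04-25 | 2017-09-29 | 上海庆科信息技术有限公司 | Intelligent dialogue device, and feedback-based intelligent voice control system and method |
CN107825433A (zh) * | 2017-10-27 | 2018-03-23 | 安徽硕威智能科技有限公司 | Card robot for recognizing children's voice commands |
CN109285556A (zh) * | 2018-09-29 | 2019-01-29 | 百度在线网络技术(北京)有限公司 | Audio processing method, apparatus, device, and storage medium |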
GB201909950D0 (en) * | 2018-07-11 | 2019-08-28 | Premium Loudspeakers Hui Zhou Co Ltd | Method for providing vui particular response and application thereof to intelligent sound box |
Also Published As
Publication number | Publication date |
---|---|
CN111916083A (zh) | 2020-11-10 |
Similar Documents
Publication | Title |
---|---|
US10515292B2 (en) | Joint acoustic and visual processing |
Versteegh et al. | The zero resource speech challenge 2015: Proposed approaches and results |
WO2021000408A1 (zh) | Interview scoring method, apparatus, device, and storage medium |
US8195459B1 (en) | Augmentation and calibration of output from non-deterministic text generators by modeling its characteristics in specific environments |
US6836760B1 (en) | Use of semantic inference and context-free grammar with speech recognition system |
CN106297776A (zh) | Voice keyword retrieval method based on audio templates |
CN109192194A (zh) | Voice data annotation method, apparatus, computer device, and storage medium |
CN112397054B (zh) | Speech recognition method for power dispatching |
Basak et al. | Challenges and Limitations in Speech Recognition Technology: A Critical Review of Speech Signal Processing Algorithms, Tools and Systems. |
JP2016099507A (ja) | Acoustic feature conversion device, acoustic model adaptation device, acoustic feature conversion method, acoustic model adaptation method, and program |
KR20090060631A (ko) | System and method for indirect data-driven pronunciation variation modeling to improve speech recognition performance on non-native speakers' speech |
CN112015874A (zh) | Student mental health companion dialogue system |
Elakkiya et al. | Implementation of speech to text conversion using hidden markov model |
Ballard et al. | A multimodal learning interface for word acquisition |
CN111916083B (zh) | A voice command recognition algorithm for smart devices based on big data collection |
Mohanty et al. | Isolated Odia digit recognition using HTK: an implementation view |
JP2010277036A (ja) | Voice data retrieval device |
CN110807370B (zh) | Multimodal method for unobtrusive confirmation of conference speaker identity |
Mukherjee et al. | Identification of top-3 spoken Indian languages: an ensemble learning-based approach |
Liu et al. | Supra-Segmental Feature Based Speaker Trait Detection. |
Hussein et al. | Arabic speaker recognition using HMM |
Alashban et al. | Language effect on speaker gender classification using deep learning |
Liao et al. | Towards the Development of Automatic Speech Recognition for Bikol and Kapampangan |
Therese et al. | Optimisation of training samples in recognition of overlapping speech and identification of speaker in a two speakers situation |
Hacine-Gharbi et al. | Automatic Classification of French Spontaneous Oral Speech into Injunction and No-injunction Classes. |
Legal Events
Code | Title | Description |
---|---|---|
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 2023-07-21. Address after: 100000 No. 10, 1st floor, building 6, No. 108 Beiyuan Road B, Chaoyang District, Beijing. Applicant after: Beijing Jizhi Technology Co.,Ltd. Address before: No. 287, Baiyang village, Anchang street, Keqiao District, Shaoxing City, Zhejiang Province. Applicant before: Shaoxing maimang Intelligent Technology Co.,Ltd. |
GR01 | Patent grant | |
TR01 | Transfer of patent right | Effective date of registration: 2023-09-11. Address after: 610, 6th Floor, Building A, No. 2 Lize Zhong'er Road, Chaoyang District, Beijing, 100000. Patentee after: Zhongguancun Technology Leasing Co.,Ltd. Address before: 100000 No. 10, 1st floor, building 6, No. 108 Beiyuan Road B, Chaoyang District, Beijing. Patentee before: Beijing Jizhi Technology Co.,Ltd. |