CN112017676B - 音频处理方法、装置和计算机可读存储介质 - Google Patents
音频处理方法、装置和计算机可读存储介质 Download PDFInfo
- Publication number
- CN112017676B CN112017676B CN201910467088.0A CN201910467088A CN112017676B CN 112017676 B CN112017676 B CN 112017676B CN 201910467088 A CN201910467088 A CN 201910467088A CN 112017676 B CN112017676 B CN 112017676B
- Authority
- CN
- China
- Prior art keywords
- probability
- audio
- effective
- processed
- frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 19
- 238000010801 machine learning Methods 0.000 claims abstract description 17
- 230000000875 corresponding effect Effects 0.000 claims description 28
- 238000012545 processing Methods 0.000 claims description 21
- 230000002596 correlated effect Effects 0.000 claims description 8
- 238000013527 convolutional neural network Methods 0.000 claims description 7
- 238000013528 artificial neural network Methods 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 4
- 125000004122 cyclic group Chemical group 0.000 claims description 2
- 238000000034 method Methods 0.000 abstract description 15
- 238000005516 engineering process Methods 0.000 abstract description 5
- 239000010410 layer Substances 0.000 description 12
- 238000010586 diagram Methods 0.000 description 8
- 238000012549 training Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 4
- 230000003993 interaction Effects 0.000 description 3
- 230000000306 recurrent effect Effects 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000004891 communication Methods 0.000 description 2
- 238000003058 natural language processing Methods 0.000 description 2
- 238000010606 normalization Methods 0.000 description 2
- 239000002356 single layer Substances 0.000 description 2
- 238000003786 synthesis reaction Methods 0.000 description 2
- 206010011224 Cough Diseases 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000012805 post-processing Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
- 238000010200 validation analysis Methods 0.000 description 1
Abstract
Description
Claims (8)
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910467088.0A CN112017676B (zh) | 2019-05-31 | 音频处理方法、装置和计算机可读存储介质 | |
US17/611,741 US20220238104A1 (en) | 2019-05-31 | 2020-05-18 | Audio processing method and apparatus, and human-computer interactive system |
PCT/CN2020/090853 WO2020238681A1 (zh) | 2019-05-31 | 2020-05-18 | 音频处理方法、装置和人机交互系统 |
JP2021569116A JP2022534003A (ja) | 2019-05-31 | 2020-05-18 | 音声処理方法、音声処理装置およびヒューマンコンピュータインタラクションシステム |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910467088.0A CN112017676B (zh) | 2019-05-31 | 音频处理方法、装置和计算机可读存储介质 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112017676A CN112017676A (zh) | 2020-12-01 |
CN112017676B true CN112017676B (zh) | 2024-07-16 |
Family
ID=
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108389575A (zh) * | 2018-01-11 | 2018-08-10 | 苏州思必驰信息科技有限公司 | 音频数据识别方法及系统 |
CN108877775A (zh) * | 2018-06-04 | 2018-11-23 | 平安科技(深圳)有限公司 | 语音数据处理方法、装置、计算机设备及存储介质 |
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108389575A (zh) * | 2018-01-11 | 2018-08-10 | 苏州思必驰信息科技有限公司 | 音频数据识别方法及系统 |
CN108877775A (zh) * | 2018-06-04 | 2018-11-23 | 平安科技(深圳)有限公司 | 语音数据处理方法、装置、计算机设备及存储介质 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110838289B (zh) | 基于人工智能的唤醒词检测方法、装置、设备及介质 | |
WO2021208287A1 (zh) | 用于情绪识别的语音端点检测方法、装置、电子设备及存储介质 | |
CN106683680B (zh) | 说话人识别方法及装置、计算机设备及计算机可读介质 | |
US20180158449A1 (en) | Method and device for waking up via speech based on artificial intelligence | |
CN112185352B (zh) | 语音识别方法、装置及电子设备 | |
CN111402891B (zh) | 语音识别方法、装置、设备和存储介质 | |
JP5932869B2 (ja) | N−gram言語モデルの教師無し学習方法、学習装置、および学習プログラム | |
US11398219B2 (en) | Speech synthesizer using artificial intelligence and method of operating the same | |
US11417313B2 (en) | Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium | |
CN111833845A (zh) | 多语种语音识别模型训练方法、装置、设备及存储介质 | |
CN110491375B (zh) | 一种目标语种检测的方法和装置 | |
CN114550703A (zh) | 语音识别系统的训练方法和装置、语音识别方法和装置 | |
CN113628612A (zh) | 语音识别方法、装置、电子设备及计算机可读存储介质 | |
CN115312033A (zh) | 基于人工智能的语音情感识别方法、装置、设备及介质 | |
WO2024093578A1 (zh) | 语音识别方法、装置、电子设备、存储介质及计算机程序产品 | |
US20220238104A1 (en) | Audio processing method and apparatus, and human-computer interactive system | |
CN112201275A (zh) | 声纹分割方法、装置、设备及可读存储介质 | |
CN112017676B (zh) | 音频处理方法、装置和计算机可读存储介质 | |
CN112199498A (zh) | 一种养老服务的人机对话方法、装置、介质及电子设备 | |
US11393447B2 (en) | Speech synthesizer using artificial intelligence, method of operating speech synthesizer and computer-readable recording medium | |
KR102642617B1 (ko) | 인공 지능을 이용한 음성 합성 장치, 음성 합성 장치의 동작 방법 및 컴퓨터로 판독 가능한 기록 매체 | |
CN114566156A (zh) | 一种关键词的语音识别方法及装置 | |
CN113571051A (zh) | 一种唇部语音活动检测和结果纠错的语音识别系统和方法 | |
CN113674745A (zh) | 语音识别方法及装置 | |
CN112951214B (zh) | 一种抗对抗样本攻击的语音识别模型训练方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
CB02 | Change of applicant information |
Address after: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176 Applicant after: Jingdong Technology Holding Co.,Ltd. Address before: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176 Applicant before: Jingdong Digital Technology Holding Co.,Ltd. Address after: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176 Applicant after: Jingdong Digital Technology Holding Co.,Ltd. Address before: Room 221, 2 / F, block C, 18 Kechuang 11th Street, Daxing District, Beijing, 100176 Applicant before: JINGDONG DIGITAL TECHNOLOGY HOLDINGS Co.,Ltd. |
|
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant |