CN110246489A - Speech recognition method and system for children - Google Patents
Speech recognition method and system for children
- Publication number
- CN110246489A
- Authority
- CN
- China
- Prior art keywords
- children
- adult
- speech
- acoustic feature
- audio
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
Claims (10)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910516503.7A CN110246489B (zh) | 2019-06-14 | 2019-06-14 | Speech recognition method and system for children |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910516503.7A CN110246489B (zh) | 2019-06-14 | 2019-06-14 | Speech recognition method and system for children |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110246489A (zh) | 2019-09-17 |
CN110246489B (zh) | 2021-07-13 |
Family
ID=67887219
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910516503.7A Active CN110246489B (zh) | Speech recognition method and system for children | 2019-06-14 | 2019-06-14 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110246489B (zh) |
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
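CN102203852A (zh) * | 2008-09-12 | 2011-09-28 | Rosetta Stone Ltd. | Method for creating a speech model |
KR20180065761A (ko) * | 2016-12-08 | 2018-06-18 | Electronics and Telecommunications Research Institute (ETRI) | User-adaptive speech recognition system and method based on digital voice genetic elements |
CN109036387A (zh) * | 2018-07-16 | 2018-12-18 | Minzu University of China | Video speech recognition method and system |
CN109637551A (zh) * | 2018-12-26 | 2019-04-16 | Mobvoi Information Technology Co., Ltd. | Voice conversion method, apparatus, device and storage medium |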
Non-Patent Citations (4)
Title |
---|
Joachim Fainberg et al.: "Improving Children’s Speech Recognition through Out-of-Domain Data Augmentation", INTERSPEECH 2016 * |
S. Shahnawazuddin et al.: "Pitch-Adaptive Front-end Features for Robust Children’s ASR", INTERSPEECH 2016 * |
Souvik Kundu et al.: "Joint acoustic factor learning for robust deep neural network based automatic speech recognition", ICASSP 2016 * |
Chen Wei et al.: "A preliminary study of children's speech databases and children's speech recognition technology", 8th National Conference on Man-Machine Speech Communication * |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
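CN110706692A (zh) * | 2019-10-21 | 2020-01-17 | Shanghai Jiao Tong University | Training method and system for a children's speech recognition model |
CN110706692B (zh) * | 2019-10-21 | 2021-12-14 | Sipic Technology Co.,Ltd. | Training method and system for a children's speech recognition model |
CN111161728A (zh) * | 2019-12-26 | 2020-05-15 | Gree Electric Appliances, Inc. of Zhuhai | Wake-up method, apparatus, device and medium for a smart device |
CN111370024A (zh) * | 2020-02-21 | 2020-07-03 | Tencent Technology (Shenzhen) Co., Ltd. | Audio adjustment method, device and computer-readable storage medium |
CN111370024B (zh) * | 2020-02-21 | 2023-07-04 | Tencent Technology (Shenzhen) Co., Ltd. | Audio adjustment method, device and computer-readable storage medium |
CN112634860A (zh) * | 2020-12-29 | 2021-04-09 | Suzhou AISpeech Information Technology Co., Ltd. | Method for screening training corpora for a children's speech recognition model |
CN112634860B (zh) * | 2020-12-29 | 2022-05-03 | Sipic Technology Co.,Ltd. | Method for screening training corpora for a children's speech recognition model |
CN115312031A (zh) * | 2022-07-22 | 2022-11-08 | Northeastern University (China) | Natural language processing method and system based on deep learning |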
Also Published As
Publication number | Publication date |
---|---|
CN110246489B (zh) | 2021-07-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110246489A (zh) | | Speech recognition method and system for children |
CN109949783B (zh) | | Song synthesis method and system |
Boril et al. | | Unsupervised equalization of Lombard effect for speech recognition in noisy adverse environments |
Wali et al. | | Generative adversarial networks for speech processing: A review |
CN109767778B (zh) | | Voice conversion method fusing Bi-LSTM and WaveNet |
Shahnawazuddin et al. | | Creating speaker independent ASR system through prosody modification based data augmentation |
CN104700843A (zh) | | Age recognition method and apparatus |
CN108766445A (zh) | | Voiceprint recognition method and system |
CN110246488A (zh) | | Voice conversion method and apparatus using a semi-optimized CycleGAN model |
CN1967657B (zh) | | System and method for automatic speaker voice tracking and pitch shifting in program production |
Yılmaz et al. | | Articulatory features for ASR of pathological speech |
WO2023221345A1 (zh) | | Emotional speech synthesis method and synthesis apparatus |
US20150348535A1 (en) | | Method for forming the excitation signal for a glottal pulse model based parametric speech synthesis system |
CN112382301A (zh) | | Noisy-speech gender recognition method and system based on a lightweight neural network |
Chakroun et al. | | Robust features for text-independent speaker recognition with short utterances |
CN110232928A (zh) | | Text-independent speaker verification method and apparatus |
Maity et al. | | A pitch and noise robust keyword spotting system using SMAC features with prosody modification |
Li et al. | | Prosody usage optimization for children speech recognition with zero resource children speech. |
CN105895079A (zh) | | Speech data processing method and apparatus |
CN114708857A (zh) | | Speech recognition model training method, speech recognition method and corresponding apparatus |
Kinnunen | | Optimizing spectral feature based text-independent speaker recognition |
CN109086387A (zh) | | Audio stream scoring method, apparatus, device and storage medium |
Liu et al. | | A novel unified framework for speech enhancement and bandwidth extension based on jointly trained neural networks |
Wang et al. | | Improve GAN-based neural vocoder using pointwise relativistic least-square GAN |
Rafi et al. | | Relative Significance of Speech Sounds in Speaker Verification Systems |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | PB01 | Publication | |
 | SE01 | Entry into force of request for substantive examination | |
 | TA01 | Transfer of patent application right | Effective date of registration: 2020-06-16. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Ltd.; SHANGHAI JIAO TONG University |
 | TA01 | Transfer of patent application right | Effective date of registration: 2020-10-23. Address after: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant after: AI SPEECH Ltd. Address before: 215123 14 Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou, Jiangsu. Applicant before: AI SPEECH Ltd.; Shanghai Jiaotong University Intellectual Property Management Co.,Ltd. |
 | CB02 | Change of applicant information | Address after: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant after: Sipic Technology Co.,Ltd. Address before: 215123 building 14, Tengfei Innovation Park, 388 Xinping street, Suzhou Industrial Park, Suzhou City, Jiangsu Province. Applicant before: AI SPEECH Ltd. |
 | GR01 | Patent grant | |