CN106653048B - 基于人声模型的单通道声音分离方法 - Google Patents
基于人声模型的单通道声音分离方法 Download PDFInfo
- Publication number
- CN106653048B CN106653048B CN201611237076.1A CN201611237076A CN106653048B CN 106653048 B CN106653048 B CN 106653048B CN 201611237076 A CN201611237076 A CN 201611237076A CN 106653048 B CN106653048 B CN 106653048B
- Authority
- CN
- China
- Prior art keywords
- power
- voice
- model
- frequency
- formula
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
Abstract
Description
Claims (8)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611237076.1A CN106653048B (zh) | 2016-12-28 | 2016-12-28 | 基于人声模型的单通道声音分离方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611237076.1A CN106653048B (zh) | 2016-12-28 | 2016-12-28 | 基于人声模型的单通道声音分离方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106653048A CN106653048A (zh) | 2017-05-10 |
CN106653048B true CN106653048B (zh) | 2019-10-15 |
Family
ID=58832394
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611237076.1A Active CN106653048B (zh) | 2016-12-28 | 2016-12-28 | 基于人声模型的单通道声音分离方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106653048B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107680611B (zh) * | 2017-09-13 | 2020-06-16 | 电子科技大学 | 基于卷积神经网络的单通道声音分离方法 |
CN109801644B (zh) * | 2018-12-20 | 2021-03-09 | 北京达佳互联信息技术有限公司 | 混合声音信号的分离方法、装置、电子设备和可读介质 |
CN112259120B (zh) * | 2020-10-19 | 2021-06-29 | 南京硅基智能科技有限公司 | 基于卷积循环神经网络的单通道人声与背景声分离方法 |
CN113314140A (zh) * | 2021-05-31 | 2021-08-27 | 哈尔滨理工大学 | 一种端到端时域多尺度卷积神经网络的音源分离算法 |
CN113393857A (zh) * | 2021-06-10 | 2021-09-14 | 腾讯音乐娱乐科技(深圳)有限公司 | 一种音乐信号的人声消除方法、设备及介质 |
CN113593604A (zh) * | 2021-07-22 | 2021-11-02 | 腾讯音乐娱乐科技(深圳)有限公司 | 检测音频质量方法、装置及存储介质 |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1349148A1 (en) * | 2000-12-28 | 2003-10-01 | NEC Corporation | Noise removing method and device |
CN1523573A (zh) * | 2003-09-12 | 2004-08-25 | 中国科学院声学研究所 | 一种采用后置滤波器的多通道语音增强方法 |
DE60304859D1 (de) * | 2003-08-21 | 2006-06-01 | Bernafon Ag Bern | Verfahren zur Verarbeitung von Audiosignalen |
CN101589430A (zh) * | 2007-08-10 | 2009-11-25 | 松下电器产业株式会社 | 声音分离装置、声音合成装置及音质变换装置 |
CN102402977A (zh) * | 2010-09-14 | 2012-04-04 | 无锡中星微电子有限公司 | 从立体声音乐中提取伴奏、人声的方法及其装置 |
CN102982801A (zh) * | 2012-11-12 | 2013-03-20 | 中国科学院自动化研究所 | 一种用于鲁棒语音识别的语音特征提取方法 |
CN103000174A (zh) * | 2012-11-26 | 2013-03-27 | 河海大学 | 语音识别系统中基于快速噪声估计的特征补偿方法 |
CN105719657A (zh) * | 2016-02-23 | 2016-06-29 | 惠州市德赛西威汽车电子股份有限公司 | 基于单麦克风的人声提取方法及装置 |
-
2016
- 2016-12-28 CN CN201611237076.1A patent/CN106653048B/zh active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1349148A1 (en) * | 2000-12-28 | 2003-10-01 | NEC Corporation | Noise removing method and device |
DE60304859D1 (de) * | 2003-08-21 | 2006-06-01 | Bernafon Ag Bern | Verfahren zur Verarbeitung von Audiosignalen |
CN1523573A (zh) * | 2003-09-12 | 2004-08-25 | 中国科学院声学研究所 | 一种采用后置滤波器的多通道语音增强方法 |
CN101589430A (zh) * | 2007-08-10 | 2009-11-25 | 松下电器产业株式会社 | 声音分离装置、声音合成装置及音质变换装置 |
CN102402977A (zh) * | 2010-09-14 | 2012-04-04 | 无锡中星微电子有限公司 | 从立体声音乐中提取伴奏、人声的方法及其装置 |
CN102982801A (zh) * | 2012-11-12 | 2013-03-20 | 中国科学院自动化研究所 | 一种用于鲁棒语音识别的语音特征提取方法 |
CN103000174A (zh) * | 2012-11-26 | 2013-03-27 | 河海大学 | 语音识别系统中基于快速噪声估计的特征补偿方法 |
CN105719657A (zh) * | 2016-02-23 | 2016-06-29 | 惠州市德赛西威汽车电子股份有限公司 | 基于单麦克风的人声提取方法及装置 |
Also Published As
Publication number | Publication date |
---|---|
CN106653048A (zh) | 2017-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106653048B (zh) | 基于人声模型的单通道声音分离方法 | |
Gabbay et al. | Visual speech enhancement | |
WO2019214047A1 (zh) | 建立声纹模型的方法、装置、计算机设备和存储介质 | |
Rivet et al. | Audiovisual speech source separation: An overview of key methodologies | |
Iseli et al. | Age, sex, and vowel dependencies of acoustic measures related to the voice source | |
Patel et al. | Speech recognition and verification using MFCC & VQ | |
Le Cornu et al. | Reconstructing intelligible audio speech from visual speech features. | |
Dua et al. | Performance evaluation of Hindi speech recognition system using optimized filterbanks | |
Chang et al. | Spectro-temporal features for noise-robust speech recognition using power-law nonlinearity and power-bias subtraction | |
de-La-Calle-Silos et al. | Synchrony-based feature extraction for robust automatic speech recognition | |
Wang et al. | Attention-based fusion for bone-conducted and air-conducted speech enhancement in the complex domain | |
Cheyne et al. | Talker-to-listener distance effects on speech production and perception | |
CN109272996A (zh) | 一种降噪方法及系统 | |
Milner et al. | Reconstructing intelligible audio speech from visual speech features | |
Bunton et al. | Identification of synthetic vowels based on a time-varying model of the vocal tract area function | |
JP4381404B2 (ja) | 音声合成システム、音声合成方法、音声合成プログラム | |
Wu et al. | Robust target feature extraction based on modified cochlear filter analysis model | |
Ferreira | On the possibility of speaker discrimination using a glottal pulse phase-related feature | |
Zheng et al. | A spectra-based equalization-generation combined framework for throat microphone speech enhancement | |
Koolagudi et al. | Spectral features for emotion classification | |
CN111968627A (zh) | 一种基于联合字典学习和稀疏表示的骨导语音增强方法 | |
Gupta et al. | Morse wavelet transform-based features for voice liveness detection | |
Marković et al. | Recognition of the Multimodal Speech Based on the GFCC features | |
Kuo et al. | Auditory-based robust speech recognition system for ambient assisted living in smart home | |
Armani et al. | Weighted autocorrelation-based f0 estimation for distant-talking interaction with a distributed microphone network |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20170929 Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03 Applicant after: Cloud known sound (Shanghai) Technology Co. Ltd. Address before: 200233 Shanghai, Qinzhou, North Road, No. 82, building 2, layer 1198, Applicant before: SHANGHAI YUZHIYI INFORMATION TECHNOLOGY CO., LTD. |
|
TA01 | Transfer of patent application right | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right |
Effective date of registration: 20200416 Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03 Co-patentee after: Xiamen yunzhixin Intelligent Technology Co., Ltd Patentee after: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY Co.,Ltd. Address before: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03 Patentee before: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY Co.,Ltd. |
|
TR01 | Transfer of patent right |