CN108766450B - 一种基于谐波冲激分解的语音转换方法 - Google Patents
一种基于谐波冲激分解的语音转换方法 Download PDFInfo
- Publication number
- CN108766450B CN108766450B CN201810335633.6A CN201810335633A CN108766450B CN 108766450 B CN108766450 B CN 108766450B CN 201810335633 A CN201810335633 A CN 201810335633A CN 108766450 B CN108766450 B CN 108766450B
- Authority
- CN
- China
- Prior art keywords
- signal
- voice
- harmonic
- speech
- impulse
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000006243 chemical reaction Methods 0.000 title claims abstract description 42
- 238000000354 decomposition reaction Methods 0.000 title claims abstract description 24
- 238000000034 method Methods 0.000 title claims abstract description 24
- 238000001228 spectrum Methods 0.000 claims abstract description 60
- 239000013598 vector Substances 0.000 claims description 28
- 239000011159 matrix material Substances 0.000 claims description 26
- 230000005284 excitation Effects 0.000 claims description 14
- 238000012549 training Methods 0.000 claims description 13
- 238000010586 diagram Methods 0.000 claims description 4
- 230000017105 transposition Effects 0.000 claims description 3
- 230000001105 regulatory effect Effects 0.000 claims 1
- 238000012545 processing Methods 0.000 abstract description 8
- 230000009286 beneficial effect Effects 0.000 abstract description 3
- 230000008569 process Effects 0.000 abstract description 3
- 238000001914 filtration Methods 0.000 description 2
- 238000013507 mapping Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000001755 vocal effect Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 230000007704 transition Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810335633.6A CN108766450B (zh) | 2018-04-16 | 2018-04-16 | 一种基于谐波冲激分解的语音转换方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810335633.6A CN108766450B (zh) | 2018-04-16 | 2018-04-16 | 一种基于谐波冲激分解的语音转换方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108766450A CN108766450A (zh) | 2018-11-06 |
CN108766450B true CN108766450B (zh) | 2023-02-17 |
Family
ID=64010844
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810335633.6A Active CN108766450B (zh) | 2018-04-16 | 2018-04-16 | 一种基于谐波冲激分解的语音转换方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108766450B (zh) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995030983A1 (en) * | 1994-05-04 | 1995-11-16 | Georgia Tech Research Corporation | Audio analysis/synthesis system |
WO2002062120A2 (en) * | 2001-02-02 | 2002-08-15 | Motorola, Inc. | Method and apparatus for speech reconstruction in a distributed speech recognition system |
TW201001396A (en) * | 2008-06-26 | 2010-01-01 | Univ Nat Taiwan Science Tech | Method for synthesizing speech |
CN101751921A (zh) * | 2009-12-16 | 2010-06-23 | 南京邮电大学 | 一种在训练数据量极少条件下的实时语音转换方法 |
CN102063899A (zh) * | 2010-10-27 | 2011-05-18 | 南京邮电大学 | 一种非平行文本条件下的语音转换方法 |
CN102664003A (zh) * | 2012-04-24 | 2012-09-12 | 南京邮电大学 | 基于谐波加噪声模型的残差激励信号合成及语音转换方法 |
CN102750955A (zh) * | 2012-07-20 | 2012-10-24 | 中国科学院自动化研究所 | 基于残差信号频谱重构的声码器 |
CN103345920A (zh) * | 2013-05-29 | 2013-10-09 | 河海大学常州校区 | 基于Mel-KSVD稀疏表示的自适应内插加权谱模型的语音转换及重构方法 |
CN107221321A (zh) * | 2017-03-27 | 2017-09-29 | 杭州电子科技大学 | 一种用于任意源和目标语音之间的语音转换方法 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2853125A1 (fr) * | 2003-03-27 | 2004-10-01 | France Telecom | Procede d'analyse d'informations de frequence fondamentale et procede et systeme de conversion de voix mettant en oeuvre un tel procede d'analyse. |
-
2018
- 2018-04-16 CN CN201810335633.6A patent/CN108766450B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1995030983A1 (en) * | 1994-05-04 | 1995-11-16 | Georgia Tech Research Corporation | Audio analysis/synthesis system |
WO2002062120A2 (en) * | 2001-02-02 | 2002-08-15 | Motorola, Inc. | Method and apparatus for speech reconstruction in a distributed speech recognition system |
TW201001396A (en) * | 2008-06-26 | 2010-01-01 | Univ Nat Taiwan Science Tech | Method for synthesizing speech |
CN101751921A (zh) * | 2009-12-16 | 2010-06-23 | 南京邮电大学 | 一种在训练数据量极少条件下的实时语音转换方法 |
CN102063899A (zh) * | 2010-10-27 | 2011-05-18 | 南京邮电大学 | 一种非平行文本条件下的语音转换方法 |
CN102664003A (zh) * | 2012-04-24 | 2012-09-12 | 南京邮电大学 | 基于谐波加噪声模型的残差激励信号合成及语音转换方法 |
CN102750955A (zh) * | 2012-07-20 | 2012-10-24 | 中国科学院自动化研究所 | 基于残差信号频谱重构的声码器 |
CN103345920A (zh) * | 2013-05-29 | 2013-10-09 | 河海大学常州校区 | 基于Mel-KSVD稀疏表示的自适应内插加权谱模型的语音转换及重构方法 |
CN107221321A (zh) * | 2017-03-27 | 2017-09-29 | 杭州电子科技大学 | 一种用于任意源和目标语音之间的语音转换方法 |
Non-Patent Citations (3)
Title |
---|
一种基于声调规范模型的声调变换方法;薛健等;《计算机工程与应用》;20051001(第10期);全文 * |
一种改进的语音二项式正弦脉冲激励方案;邓立新等;《南京邮电学院学报》;20050330(第01期);全文 * |
基于STRAIGHT算法的汉语语音morphing方法;甘振业等;《西北师范大学学报(自然科学版)》;20080915(第05期);全文 * |
Also Published As
Publication number | Publication date |
---|---|
CN108766450A (zh) | 2018-11-06 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111785261B (zh) | 基于解纠缠和解释性表征的跨语种语音转换方法及系统 | |
Défossez et al. | Music source separation in the waveform domain | |
Kaneko et al. | Generative adversarial network-based postfilter for STFT spectrograms | |
JP2956548B2 (ja) | 音声帯域拡大装置 | |
JP6783001B2 (ja) | 逆離散コサイン変換のケプストラム係数の動的分割に基づく音声特徴抽出アルゴリズム | |
WO2020015270A1 (zh) | 语音信号分离方法、装置、计算机设备以及存储介质 | |
Jeong et al. | Singing voice separation using RPCA with weighted-norm | |
CN108369803B (zh) | 用于形成基于声门脉冲模型的参数语音合成系统的激励信号的方法 | |
CN113744715A (zh) | 声码器语音合成方法、装置、计算机设备及存储介质 | |
CN106782599A (zh) | 基于高斯过程输出后滤波的语音转换方法 | |
Okamoto et al. | Noise level limited sub-modeling for diffusion probabilistic vocoders | |
Saleem et al. | Spectral phase estimation based on deep neural networks for single channel speech enhancement | |
CN114283822A (zh) | 一种基于伽马通频率倒谱系数的多对一语音转换方法 | |
CN108766450B (zh) | 一种基于谐波冲激分解的语音转换方法 | |
CN113782044A (zh) | 一种语音增强方法及装置 | |
Hossain et al. | Dual-transform source separation using sparse nonnegative matrix factorization | |
Toda et al. | Statistical approach to vocal tract transfer function estimation based on factor analyzed trajectory HMM | |
CN112863477B (zh) | 一种语音合成方法、装置及存储介质 | |
Ernawan et al. | Efficient discrete tchebichef on spectrum analysis of speech recognition | |
Xie et al. | Pitch transformation in neural network based voice conversion | |
CN115862590A (zh) | 一种基于特征金字塔的文本驱动语音合成方法 | |
Li et al. | Weighted robust principal component analysis with gammatone auditory filterbank for singing voice separation | |
CN104282300A (zh) | 一种非周期成分音节模型建立、及语音合成的方法和设备 | |
TWI409802B (zh) | 音頻特徵處理方法及其裝置 | |
Wang et al. | Improve gan-based neural vocoder using pointwise relativistic leastsquare gan |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20240605 Address after: Room 329, Building 2, No. 26 Longquan Road, Cangqian Street, Yuhang District, Hangzhou City, Zhejiang Province, 310000 Patentee after: Jinma Intelligent Technology (Hangzhou) Co.,Ltd. Country or region after: China Address before: 310018 no.1158, No.2 street, Baiyang street, Hangzhou Economic and Technological Development Zone, Zhejiang Province Patentee before: HANGZHOU DIANZI University Country or region before: China |