CN109671422A - 一种获取纯净语音的录音方法 - Google Patents
一种获取纯净语音的录音方法 Download PDFInfo
- Publication number
- CN109671422A CN109671422A CN201910017762.5A CN201910017762A CN109671422A CN 109671422 A CN109671422 A CN 109671422A CN 201910017762 A CN201910017762 A CN 201910017762A CN 109671422 A CN109671422 A CN 109671422A
- Authority
- CN
- China
- Prior art keywords
- coefficient
- frame
- frequency spectrum
- recording
- spectrum energy
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000001228 spectrum Methods 0.000 claims abstract description 26
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 12
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 12
- 230000009466 transformation Effects 0.000 claims abstract description 10
- 238000000034 method Methods 0.000 claims abstract description 8
- 238000001514 detection method Methods 0.000 claims abstract description 5
- 239000002131 composite material Substances 0.000 claims abstract description 4
- 238000005070 sampling Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
- 230000003595 spectral effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/04—Details of speech synthesis systems, e.g. synthesiser structure or memory management
- G10L13/047—Architecture of speech synthesisers
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G10L13/033—Voice editing, e.g. manipulating the voice of the synthesiser
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Circuit For Audible Band Transducer (AREA)
- Electrophonic Musical Instruments (AREA)
Abstract
Description
Claims (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910017762.5A CN109671422B (zh) | 2019-01-09 | 2019-01-09 | 一种获取纯净语音的录音方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910017762.5A CN109671422B (zh) | 2019-01-09 | 2019-01-09 | 一种获取纯净语音的录音方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109671422A true CN109671422A (zh) | 2019-04-23 |
CN109671422B CN109671422B (zh) | 2022-06-17 |
Family
ID=66149428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910017762.5A Active CN109671422B (zh) | 2019-01-09 | 2019-01-09 | 一种获取纯净语音的录音方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109671422B (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110246502A (zh) * | 2019-06-26 | 2019-09-17 | 广东小天才科技有限公司 | 语音降噪方法、装置及终端设备 |
CN112652315A (zh) * | 2020-08-03 | 2021-04-13 | 李�昊 | 基于深度学习的汽车引擎声实时合成系统及方法 |
CN113838453A (zh) * | 2021-08-17 | 2021-12-24 | 北京百度网讯科技有限公司 | 语音处理方法、装置、设备和计算机存储介质 |
US11996084B2 (en) | 2021-08-17 | 2024-05-28 | Beijing Baidu Netcom Science Technology Co., Ltd. | Speech synthesis method and apparatus, device and computer storage medium |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0193796A (ja) * | 1987-10-06 | 1989-04-12 | Nippon Hoso Kyokai <Nhk> | 声質変換方法 |
JPH056198A (ja) * | 1991-06-26 | 1993-01-14 | Yamaha Corp | フオルマント合成装置 |
US5459813A (en) * | 1991-03-27 | 1995-10-17 | R.G.A. & Associates, Ltd | Public address intelligibility system |
US5649058A (en) * | 1990-03-31 | 1997-07-15 | Gold Star Co., Ltd. | Speech synthesizing method achieved by the segmentation of the linear Formant transition region |
US20040158470A1 (en) * | 2003-01-30 | 2004-08-12 | Yamaha Corporation | Tone generator of wave table type with voice synthesis capability |
CN101067929A (zh) * | 2007-06-05 | 2007-11-07 | 南京大学 | 使用共振峰增强提取话音共振峰轨迹的方法 |
CN101359473A (zh) * | 2007-07-30 | 2009-02-04 | 国际商业机器公司 | 自动进行语音转换的方法和装置 |
CN106057192A (zh) * | 2016-07-07 | 2016-10-26 | Tcl集团股份有限公司 | 一种实时语音转换方法和装置 |
CN108682413A (zh) * | 2018-04-24 | 2018-10-19 | 上海师范大学 | 一种基于语音转换的情感疏导系统 |
-
2019
- 2019-01-09 CN CN201910017762.5A patent/CN109671422B/zh active Active
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0193796A (ja) * | 1987-10-06 | 1989-04-12 | Nippon Hoso Kyokai <Nhk> | 声質変換方法 |
US5649058A (en) * | 1990-03-31 | 1997-07-15 | Gold Star Co., Ltd. | Speech synthesizing method achieved by the segmentation of the linear Formant transition region |
US5459813A (en) * | 1991-03-27 | 1995-10-17 | R.G.A. & Associates, Ltd | Public address intelligibility system |
JPH056198A (ja) * | 1991-06-26 | 1993-01-14 | Yamaha Corp | フオルマント合成装置 |
US20040158470A1 (en) * | 2003-01-30 | 2004-08-12 | Yamaha Corporation | Tone generator of wave table type with voice synthesis capability |
CN101067929A (zh) * | 2007-06-05 | 2007-11-07 | 南京大学 | 使用共振峰增强提取话音共振峰轨迹的方法 |
CN101359473A (zh) * | 2007-07-30 | 2009-02-04 | 国际商业机器公司 | 自动进行语音转换的方法和装置 |
CN106057192A (zh) * | 2016-07-07 | 2016-10-26 | Tcl集团股份有限公司 | 一种实时语音转换方法和装置 |
CN108682413A (zh) * | 2018-04-24 | 2018-10-19 | 上海师范大学 | 一种基于语音转换的情感疏导系统 |
Non-Patent Citations (2)
Title |
---|
王坤赤等: "一种基于语音频谱的基频和共振峰提取算法", 《信息技术》 * |
罗兰娥等: "歌唱艺术嗓音中声学参数的应用", 《山西电子技术》 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110246502A (zh) * | 2019-06-26 | 2019-09-17 | 广东小天才科技有限公司 | 语音降噪方法、装置及终端设备 |
CN112652315A (zh) * | 2020-08-03 | 2021-04-13 | 李�昊 | 基于深度学习的汽车引擎声实时合成系统及方法 |
CN113838453A (zh) * | 2021-08-17 | 2021-12-24 | 北京百度网讯科技有限公司 | 语音处理方法、装置、设备和计算机存储介质 |
US11996084B2 (en) | 2021-08-17 | 2024-05-28 | Beijing Baidu Netcom Science Technology Co., Ltd. | Speech synthesis method and apparatus, device and computer storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN109671422B (zh) | 2022-06-17 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109671422A (zh) | 一种获取纯净语音的录音方法 | |
Wise et al. | Maximum likelihood pitch estimation | |
Alku et al. | Formant frequency estimation of high-pitched vowels using weighted linear prediction | |
Bresch et al. | Synchronized and noise-robust audio recordings during realtime magnetic resonance imaging scans | |
US9111526B2 (en) | Systems, method, apparatus, and computer-readable media for decomposition of a multichannel music signal | |
Iseli et al. | Age, sex, and vowel dependencies of acoustic measures related to the voice source | |
EP1064648B1 (en) | Wideband speech synthesis from a narrowband speech signal | |
US8706496B2 (en) | Audio signal transforming by utilizing a computational cost function | |
JP2009042716A (ja) | 周期信号処理方法、周期信号変換方法および周期信号処理装置ならびに周期信号の分析方法 | |
CN108701465A (zh) | 音频信号解码 | |
Ganapathy et al. | Temporal envelope compensation for robust phoneme recognition using modulation spectrum | |
CN106653048B (zh) | 基于人声模型的单通道声音分离方法 | |
CN108172210B (zh) | 一种基于歌声节奏的演唱和声生成方法 | |
JP2010210758A (ja) | 音声を含む信号の処理方法及び装置 | |
Kotnik et al. | Evaluation of pitch detection algorithms in adverse conditions | |
Benetos et al. | Auditory spectrum-based pitched instrument onset detection | |
Cosi et al. | Lyon's auditory model inversion: a tool for sound separation and speech enhancement | |
Yim et al. | Computationally efficient algorithm for time scale modification (GLS-TSM) | |
CN105336320A (zh) | 一种弹簧混响模型 | |
KR20030031936A (ko) | 피치변경법을 이용한 단일 음성 다중 목소리 합성기 | |
Alku et al. | Linear predictive method for improved spectral modeling of lower frequencies of speech with small prediction orders | |
Ternström | Hi-Fi voice: observations on the distribution of energy in the singing voice spectrum above 5 kHz | |
CN107919115A (zh) | 一种基于非线性谱变换的特征补偿方法 | |
CN109697985B (zh) | 语音信号处理方法、装置及终端 | |
Flanagan | Parametric representation of speech signals [dsp history] |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190423 Assignee: Lingqi Internet of Things Technology (Hangzhou) Co.,Ltd. Assignor: JIANG University OF TECHNOLOGY Contract record no.: X2022330000931 Denomination of invention: A recording method for obtaining pure speech Granted publication date: 20220617 License type: Common License Record date: 20221229 Application publication date: 20190423 Assignee: Zhejiang Yu'an Information Technology Co.,Ltd. Assignor: JIANG University OF TECHNOLOGY Contract record no.: X2022330000897 Denomination of invention: A recording method for obtaining pure speech Granted publication date: 20220617 License type: Common License Record date: 20221228 Application publication date: 20190423 Assignee: Hangzhou Ruiboqifan Enterprise Management Co.,Ltd. Assignor: JIANG University OF TECHNOLOGY Contract record no.: X2022330000903 Denomination of invention: A recording method for obtaining pure speech Granted publication date: 20220617 License type: Common License Record date: 20221228 Application publication date: 20190423 Assignee: Hangzhou Anfeng Jiyue Cultural Creativity Co.,Ltd. Assignor: JIANG University OF TECHNOLOGY Contract record no.: X2022330000901 Denomination of invention: A recording method for obtaining pure speech Granted publication date: 20220617 License type: Common License Record date: 20221228 |
|
EE01 | Entry into force of recordation of patent licensing contract | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 20190423 Assignee: Taizhou Linhai Xinxing Safety Technology Training Co.,Ltd. Assignor: JIANG University OF TECHNOLOGY Contract record no.: X2023980047386 Denomination of invention: A Recording Method for Obtaining Pure Speech Granted publication date: 20220617 License type: Common License Record date: 20231117 |
|
EE01 | Entry into force of recordation of patent licensing contract |