CN1835074B - A speaker conversion method combining high-level description information and model adaptation - Google Patents
A speaker conversion method combining high-level description information and model adaptation
- Publication number
- CN1835074B (application CN200610039680A)
- Authority
- CN
- China
- Prior art keywords
- model
- speaker
- parameter
- voice
- state
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Electrically Operated Instructional Devices (AREA)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200610039680A CN1835074B (zh) | 2006-04-07 | 2006-04-07 | A speaker conversion method combining high-level description information and model adaptation |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN200610039680A CN1835074B (zh) | 2006-04-07 | 2006-04-07 | A speaker conversion method combining high-level description information and model adaptation |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1835074A (zh) | 2006-09-20 |
CN1835074B (zh) | 2010-05-12 |
Family
ID=37002789
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN200610039680A Active CN1835074B (zh) | A speaker conversion method combining high-level description information and model adaptation | 2006-04-07 | 2006-04-07 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN1835074B (zh) |
Families Citing this family (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102982809B (zh) * | 2012-12-11 | 2014-12-10 | 中国科学技术大学 | A speaker voice conversion method |
GB2517503B (en) * | 2013-08-23 | 2016-12-28 | Toshiba Res Europe Ltd | A speech processing system and method |
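CN104766602B (zh) * | 2014-01-06 | 2019-01-18 | 科大讯飞股份有限公司 | Method and system for generating fundamental frequency synthesis parameters in a singing synthesis system |
CN105023574B (zh) * | 2014-04-30 | 2018-06-15 | 科大讯飞股份有限公司 | A method and system for enhancing synthesized speech |
WO2017046887A1 (ja) * | 2015-09-16 | 2017-03-23 | 株式会社東芝 | Speech synthesis device, speech synthesis method, speech synthesis program, speech synthesis model learning device, speech synthesis model learning method, and speech synthesis model learning program |
CN105304080B (zh) * | 2015-09-22 | 2019-09-03 | 科大讯飞股份有限公司 | Speech synthesis device and method |
CN105654942A (zh) * | 2016-01-04 | 2016-06-08 | 北京时代瑞朗科技有限公司 | A statistical-parameter-based speech synthesis method for interrogative and exclamatory sentences |
CN105845125B (zh) * | 2016-05-18 | 2019-05-03 | 百度在线网络技术(北京)有限公司 | Speech synthesis method and speech synthesis device |
CN107705802B (zh) * | 2017-09-11 | 2021-01-29 | 厦门美图之家科技有限公司 | Voice conversion method and device, electronic device, and readable storage medium |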
US20220013106A1 (en) * | 2018-12-11 | 2022-01-13 | Microsoft Technology Licensing, Llc | Multi-speaker neural text-to-speech synthesis |
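CN112242134B (zh) * | 2019-07-01 | 2024-07-16 | 北京邮电大学 | Speech synthesis method and device |
CN111292718A (zh) * | 2020-02-10 | 2020-06-16 | 清华大学 | Voice conversion processing method and device, electronic device, and storage medium |
CN111192566B (zh) * | 2020-03-03 | 2022-06-24 | 云知声智能科技股份有限公司 | English speech synthesis method and device |
CN112365877A (zh) * | 2020-11-27 | 2021-02-12 | 北京百度网讯科技有限公司 | Speech synthesis method and device, electronic device, and storage medium |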
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
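CN1127898A (zh) * | 1995-01-26 | 1996-07-31 | 李琳山 | Intelligent Mandarin speech input method and Mandarin dictation machine |
CN1342967A (zh) * | 2000-09-13 | 2002-04-03 | 中国科学院自动化研究所 | Unified recognition method for multiple speech working modes |
CN1607576A (zh) * | 2002-11-15 | 2005-04-20 | 中国科学院声学研究所 | A speech recognition system |
CN1615508A (zh) * | 2001-12-17 | 2005-05-11 | 旭化成株式会社 | Speech recognition method, remote controller, information terminal, telephone communication terminal, and speech recognizer |
JP2005157354A (ja) * | 2003-11-26 | 2005-06-16 | Microsoft Corp | Method and apparatus for multi-sensory speech enhancement |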
2006
- 2006-04-07: CN application CN200610039680A, patent CN1835074B (zh), status Active
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
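CN1127898A (zh) * | 1995-01-26 | 1996-07-31 | 李琳山 | Intelligent Mandarin speech input method and Mandarin dictation machine |
CN1342967A (zh) * | 2000-09-13 | 2002-04-03 | 中国科学院自动化研究所 | Unified recognition method for multiple speech working modes |
CN1615508A (zh) * | 2001-12-17 | 2005-05-11 | 旭化成株式会社 | Speech recognition method, remote controller, information terminal, telephone communication terminal, and speech recognizer |
CN1607576A (zh) * | 2002-11-15 | 2005-04-20 | 中国科学院声学研究所 | A speech recognition system |
JP2005157354A (ja) * | 2003-11-26 | 2005-06-16 | Microsoft Corp | Method and apparatus for multi-sensory speech enhancement |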
Also Published As
Publication number | Publication date |
---|---|
CN1835074A (zh) | 2006-09-20 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
CN1835074B (zh) | A speaker conversion method combining high-level description information and model adaptation | |
EP3895159B1 (en) | Multi-speaker neural text-to-speech synthesis | |
US11514888B2 (en) | Two-level speech prosody transfer | |
Huang et al. | Generspeech: Towards style transfer for generalizable out-of-domain text-to-speech | |
Wang et al. | Uncovering latent style factors for expressive speech synthesis | |
CN1222924C (zh) | Voice-personalized speech synthesizer | |
Morgan | Deep and wide: Multiple layers in automatic speech recognition | |
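KR100815115B1 (ko) | Acoustic model conversion method based on pronunciation characteristics for improving the performance of a speech recognition system on speech from speakers of other languages, and apparatus using the same | |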
Kim et al. | Real-time emotion detection system using speech: Multi-modal fusion of different timescale features | |
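CN108831435B (zh) | An emotional speech synthesis method based on multi-emotion speaker adaptation | |
CN109887484A (zh) | A dual-learning-based speech recognition and speech synthesis method and device | |
CN1835075B (zh) | A speech synthesis method combining natural sample selection and acoustic parameter modeling | |
JP2002328695A (ja) | Method for generating personalized speech from text | |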
Qian et al. | Improved prosody generation by maximizing joint probability of state and longer units | |
Choi et al. | Sequence-to-sequence emotional voice conversion with strength control | |
Yamagishi et al. | The HTS-2008 system: Yet another evaluation of the speaker-adaptive HMM-based speech synthesis system in the 2008 Blizzard Challenge | |
CN101178895A (zh) | Model adaptation method based on minimizing the perceptual error of generated parameters | |
Secujski et al. | Speaker/Style-Dependent Neural Network Speech Synthesis Based on Speaker/Style Embedding. | |
Chen et al. | Polyglot speech synthesis based on cross-lingual frame selection using auditory and articulatory features | |
Toman et al. | Unsupervised and phonologically controlled interpolation of Austrian German language varieties for speech synthesis | |
Toda et al. | Trajectory training considering global variance for HMM-based speech synthesis | |
Reddy et al. | Improved HMM-based mixed-language (Telugu–Hindi) polyglot speech synthesis | |
Qin et al. | HMM-based emotional speech synthesis using average emotion model | |
Ding | A Systematic Review on the Development of Speech Synthesis | |
Ueda et al. | Individuality-preserving voice reconstruction for articulation disorders using text-to-speech synthesis |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
C56 | Change in the name or address of the patentee | Owner name: ANHUI USTC IFLYTEK CO., LTD. Free format text: FORMER NAME: ZHONGKEDA XUNFEI INFORMATION SCIENCE & TECHNOLOGY CO., LTD., ANHUI PROV. |
CP01 | Change in the name or title of a patent holder | Address after: 230088 No. 616, Mount Huangshan Road, Hefei, Anhui; Patentee after: Anhui USTC iFLYTEK Co., Ltd.; Address before: 230088 No. 616, Mount Huangshan Road, Hefei, Anhui; Patentee before: Zhongkeda Xunfei Information Science & Technology Co., Ltd., Anhui Prov. |
C56 | Change in the name or address of the patentee | Owner name: IFLYTEK CO., LTD. Free format text: FORMER NAME: ANHUI USTC IFLYTEK CO., LTD. |
CP03 | Change of name, title or address | Address after: 230088 No. 666, Wangjiang Road, High-tech Development Zone, Hefei, Anhui; Patentee after: Iflytek Co., Ltd.; Address before: 230088 No. 616, Mount Huangshan Road, Hefei, Anhui; Patentee before: Anhui USTC iFLYTEK Co., Ltd. |