CN106531150A - Emotion synthesis method based on deep neural network model - Google Patents
- Publication number
- CN106531150A
- Authority
- CN
- China
- Prior art keywords
- speaker
- emotion
- neutral
- model
- neural network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000008451 emotion Effects 0.000 title claims abstract description 163
- 238000003062 neural network model Methods 0.000 title claims abstract description 85
- 238000001308 synthesis method Methods 0.000 title abstract 5
- 230000007935 neutral effect Effects 0.000 claims abstract description 162
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 104
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 102
- 230000002996 emotional effect Effects 0.000 claims abstract description 42
- 238000010189 synthetic method Methods 0.000 claims description 38
- 238000012549 training Methods 0.000 claims description 25
- 230000009466 transformation Effects 0.000 claims description 20
- 238000001228 spectrum Methods 0.000 claims description 19
- 238000000034 method Methods 0.000 claims description 17
- 230000006870 function Effects 0.000 claims description 14
- 230000005284 excitation Effects 0.000 claims description 6
- 239000011159 matrix material Substances 0.000 claims description 3
- 238000013528 artificial neural network Methods 0.000 claims description 2
- 230000008901 benefit Effects 0.000 abstract description 6
- 238000006243 chemical reaction Methods 0.000 abstract 3
- 238000005516 engineering process Methods 0.000 description 9
- 230000008859 change Effects 0.000 description 5
- 230000033764 rhythmic process Effects 0.000 description 5
- 230000004048 modification Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 238000011161 development Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 3
- 238000010586 diagram Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 230000001537 neural effect Effects 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000008439 repair process Effects 0.000 description 1
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/02—Methods for producing synthetic speech; Speech synthesisers
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
- G10L13/08—Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611201686.6A CN106531150B (en) | 2016-12-23 | 2016-12-23 | Emotion synthesis method based on deep neural network model |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611201686.6A CN106531150B (en) | 2016-12-23 | 2016-12-23 | Emotion synthesis method based on deep neural network model |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106531150A (en) | 2017-03-22 |
CN106531150B (en) | 2020-02-07 |
Family
ID=58337400
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611201686.6A Active CN106531150B (en) | 2016-12-23 | 2016-12-23 | Emotion synthesis method based on deep neural network model |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106531150B (en) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103900A (en) * | 2017-06-06 | 2017-08-29 | 西北师范大学 | Cross-language emotional speech synthesis method and system |
CN108305641A (en) * | 2017-06-30 | 2018-07-20 | 腾讯科技(深圳)有限公司 | Method and device for determining emotion information |
CN108364631A (en) * | 2017-01-26 | 2018-08-03 | 北京搜狗科技发展有限公司 | Speech synthesis method and device |
CN108447470A (en) * | 2017-12-28 | 2018-08-24 | 中南大学 | Emotional speech conversion method based on vocal tract and prosodic features |
CN108597492A (en) * | 2018-05-02 | 2018-09-28 | 百度在线网络技术(北京)有限公司 | Speech synthesis method and device |
CN108763190A (en) * | 2018-04-12 | 2018-11-06 | 平安科技(深圳)有限公司 | Voice-based mouth shape animation synthesis device, method and readable storage medium |
CN109036370A (en) * | 2018-06-06 | 2018-12-18 | 安徽继远软件有限公司 | Adaptive training method for speaker voice |
CN109102796A (en) * | 2018-08-31 | 2018-12-28 | 北京未来媒体科技股份有限公司 | Speech synthesis method and device |
WO2019218773A1 (en) * | 2018-05-15 | 2019-11-21 | 中兴通讯股份有限公司 | Voice synthesis method and device, storage medium, and electronic device |
CN110853616A (en) * | 2019-10-22 | 2020-02-28 | 武汉水象电子科技有限公司 | Speech synthesis method, system and storage medium based on neural network |
WO2020073944A1 (en) * | 2018-10-10 | 2020-04-16 | 华为技术有限公司 | Speech synthesis method and device |
WO2020098269A1 (en) * | 2018-11-15 | 2020-05-22 | 华为技术有限公司 | Speech synthesis method and speech synthesis device |
CN111599338A (en) * | 2020-04-09 | 2020-08-28 | 云知声智能科技股份有限公司 | Stable and controllable end-to-end speech synthesis method and device |
CN111613224A (en) * | 2020-04-10 | 2020-09-01 | 云知声智能科技股份有限公司 | Personalized voice synthesis method and device |
US11538455B2 (en) | 2018-02-16 | 2022-12-27 | Dolby Laboratories Licensing Corporation | Speech style transfer |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101064104A (en) * | 2006-04-24 | 2007-10-31 | 中国科学院自动化研究所 | Emotion voice creating method based on voice conversion |
CN101308652A (en) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | Synthesizing method of personalized singing voice |
CN102005205A (en) * | 2009-09-03 | 2011-04-06 | 株式会社东芝 | Emotional speech synthesizing method and device |
US20140067397A1 (en) * | 2012-08-29 | 2014-03-06 | Nuance Communications, Inc. | Using emoticons for contextual text-to-speech expressivity |
CN105206258A (en) * | 2015-10-19 | 2015-12-30 | 百度在线网络技术(北京)有限公司 | Generation method and device of acoustic model as well as voice synthetic method and device |
EP3046053A2 (en) * | 2015-01-19 | 2016-07-20 | Samsung Electronics Co., Ltd | Method and apparatus for training language model, and method and apparatus for recognizing language |
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101064104A (en) * | 2006-04-24 | 2007-10-31 | 中国科学院自动化研究所 | Emotion voice creating method based on voice conversion |
CN101308652A (en) * | 2008-07-17 | 2008-11-19 | 安徽科大讯飞信息科技股份有限公司 | Synthesizing method of personalized singing voice |
CN102005205A (en) * | 2009-09-03 | 2011-04-06 | 株式会社东芝 | Emotional speech synthesizing method and device |
US20140067397A1 (en) * | 2012-08-29 | 2014-03-06 | Nuance Communications, Inc. | Using emoticons for contextual text-to-speech expressivity |
EP3046053A2 (en) * | 2015-01-19 | 2016-07-20 | Samsung Electronics Co., Ltd | Method and apparatus for training language model, and method and apparatus for recognizing language |
CN105206258A (en) * | 2015-10-19 | 2015-12-30 | 百度在线网络技术(北京)有限公司 | Generation method and device of acoustic model as well as voice synthetic method and device |
Cited By (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108364631A (en) * | 2017-01-26 | 2018-08-03 | 北京搜狗科技发展有限公司 | Speech synthesis method and device |
CN108364631B (en) * | 2017-01-26 | 2021-01-22 | 北京搜狗科技发展有限公司 | Speech synthesis method and device |
CN107103900A (en) * | 2017-06-06 | 2017-08-29 | 西北师范大学 | Cross-language emotional speech synthesis method and system |
CN108305641A (en) * | 2017-06-30 | 2018-07-20 | 腾讯科技(深圳)有限公司 | Method and device for determining emotion information |
CN108305641B (en) * | 2017-06-30 | 2020-04-07 | 腾讯科技(深圳)有限公司 | Method and device for determining emotion information |
CN108447470A (en) * | 2017-12-28 | 2018-08-24 | 中南大学 | Emotional speech conversion method based on vocal tract and prosodic features |
US11538455B2 (en) | 2018-02-16 | 2022-12-27 | Dolby Laboratories Licensing Corporation | Speech style transfer |
CN108763190A (en) * | 2018-04-12 | 2018-11-06 | 平安科技(深圳)有限公司 | Voice-based mouth shape animation synthesis device, method and readable storage medium |
CN108763190B (en) * | 2018-04-12 | 2019-04-02 | 平安科技(深圳)有限公司 | Voice-based mouth shape animation synthesis device, method and readable storage medium |
WO2019196306A1 (en) * | 2018-04-12 | 2019-10-17 | 平安科技(深圳)有限公司 | Device and method for speech-based mouth shape animation blending, and readable storage medium |
CN108597492A (en) * | 2018-05-02 | 2018-09-28 | 百度在线网络技术(北京)有限公司 | Speech synthesis method and device |
WO2019218773A1 (en) * | 2018-05-15 | 2019-11-21 | 中兴通讯股份有限公司 | Voice synthesis method and device, storage medium, and electronic device |
CN110556092A (en) * | 2018-05-15 | 2019-12-10 | 中兴通讯股份有限公司 | Speech synthesis method and device, storage medium and electronic device |
CN109036370A (en) * | 2018-06-06 | 2018-12-18 | 安徽继远软件有限公司 | Adaptive training method for speaker voice |
CN109036370B (en) * | 2018-06-06 | 2021-07-20 | 安徽继远软件有限公司 | Adaptive training method for speaker voice |
CN109102796A (en) * | 2018-08-31 | 2018-12-28 | 北京未来媒体科技股份有限公司 | Speech synthesis method and device |
WO2020073944A1 (en) * | 2018-10-10 | 2020-04-16 | 华为技术有限公司 | Speech synthesis method and device |
US11361751B2 (en) | 2018-10-10 | 2022-06-14 | Huawei Technologies Co., Ltd. | Speech synthesis method and device |
US11282498B2 (en) | 2018-11-15 | 2022-03-22 | Huawei Technologies Co., Ltd. | Speech synthesis method and speech synthesis apparatus |
WO2020098269A1 (en) * | 2018-11-15 | 2020-05-22 | 华为技术有限公司 | Speech synthesis method and speech synthesis device |
CN111192568A (en) * | 2018-11-15 | 2020-05-22 | 华为技术有限公司 | Speech synthesis method and speech synthesis device |
CN110853616A (en) * | 2019-10-22 | 2020-02-28 | 武汉水象电子科技有限公司 | Speech synthesis method, system and storage medium based on neural network |
CN111599338A (en) * | 2020-04-09 | 2020-08-28 | 云知声智能科技股份有限公司 | Stable and controllable end-to-end speech synthesis method and device |
CN111613224A (en) * | 2020-04-10 | 2020-09-01 | 云知声智能科技股份有限公司 | Personalized voice synthesis method and device |
Also Published As
Publication number | Publication date |
---|---|
CN106531150B (en) | 2020-02-07 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106531150A (en) | Emotion synthesis method based on deep neural network model | |
CN101578659B (en) | Voice tone converting device and voice tone converting method | |
CN111667812B (en) | Speech synthesis method, device, equipment and storage medium | |
CN105118498B (en) | Training method and device for speech synthesis model | |
CN105185372B (en) | Training method for multiple personalized acoustic models, and voice synthesis method and voice synthesis device | |
CN102231278B (en) | Method and system for realizing automatic addition of punctuation marks in speech recognition | |
CN101064104B (en) | Emotion voice creating method based on voice conversion | |
CN107464559A (en) | Joint prediction model construction method and system based on Chinese prosodic structure and stress | |
CN108447486A (en) | Voice translation method and device | |
CN104916284B (en) | Prosody and acoustics joint modeling method and device for voice synthesis system | |
CN107958433A (en) | Online education human-computer interaction method and system based on artificial intelligence | |
CN106653052A (en) | Virtual human face animation generation method and device | |
CN106601228A (en) | Sample marking method and device based on artificial intelligence prosody prediction | |
CN111048062A (en) | Speech synthesis method and apparatus | |
CN106057192A (en) | Real-time voice conversion method and apparatus | |
CN106971709A (en) | Statistical parameter model building method and device, speech synthesis method and device | |
CN106128450A (en) | The bilingual method across language voice conversion and system thereof hidden in a kind of Chinese | |
CN105654939A (en) | Voice synthesis method based on voice vector textual characteristics | |
CN107452379A (en) | Dialect recognition technology and virtual reality teaching method and system | |
Malcangi | Text-driven avatars based on artificial neural networks and fuzzy logic | |
CN107871496A (en) | Audio recognition method and device | |
CN110010136A (en) | Training and text analysis method, apparatus, medium and device for prosody prediction model | |
Schröder et al. | Synthesis of emotional speech | |
CN101887719A (en) | Speech synthesis method, system and mobile terminal equipment with speech synthesis function | |
CN110459201B (en) | Speech synthesis method for generating new tone |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
SE01 | Entry into force of request for substantive examination | |
TA01 | Transfer of patent application right | Effective date of registration: 20170929; Address after: 200233 Shanghai City, Xuhui District Guangxi 65 No. 1 Jinglu room 702 unit 03; Applicant after: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Address before: 200233 Shanghai, Qinzhou, North Road, No. 82, building 2, layer 1198; Applicant before: SHANGHAI YUZHIYI INFORMATION TECHNOLOGY Co.,Ltd. |
GR01 | Patent grant | |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: An emotion synthesis method based on deep neural network model; Effective date of registration: 20201201; Granted publication date: 20200207; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY Co.,Ltd.; Registration number: Y2020310000047 |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Date of cancellation: 20220307; Granted publication date: 20200207; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2020310000047 |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: An emotion synthesis method based on deep neural network model; Effective date of registration: 20230210; Granted publication date: 20200207; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
PC01 | Cancellation of the registration of the contract for pledge of patent right | Granted publication date: 20200207; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2023310000028 |
PE01 | Entry into force of the registration of the contract for pledge of patent right | Denomination of invention: A sentiment synthesis method based on deep neural network models; Granted publication date: 20200207; Pledgee: Bank of Hangzhou Limited by Share Ltd. Shanghai branch; Pledgor: YUNZHISHENG (SHANGHAI) INTELLIGENT TECHNOLOGY CO.,LTD.; Registration number: Y2024310000165 |