CN102820030B - 发音器官可视语音合成系统 - Google Patents
发音器官可视语音合成系统 Download PDFInfo
- Publication number
- CN102820030B CN102820030B CN201210265448.7A CN201210265448A CN102820030B CN 102820030 B CN102820030 B CN 102820030B CN 201210265448 A CN201210265448 A CN 201210265448A CN 102820030 B CN102820030 B CN 102820030B
- Authority
- CN
- China
- Prior art keywords
- model
- motion
- module
- synthesis system
- parameter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 210000000056 organ Anatomy 0.000 title claims abstract description 78
- 230000015572 biosynthetic process Effects 0.000 title claims abstract description 45
- 238000003786 synthesis reaction Methods 0.000 title claims abstract description 45
- 230000001755 vocal effect Effects 0.000 title claims abstract description 27
- 238000004458 analytical method Methods 0.000 claims abstract description 39
- 238000013507 mapping Methods 0.000 claims abstract description 38
- 238000001228 spectrum Methods 0.000 claims abstract description 38
- 239000000203 mixture Substances 0.000 claims abstract description 8
- 230000000007 visual effect Effects 0.000 claims description 34
- 238000012549 training Methods 0.000 claims description 26
- 238000006073 displacement reaction Methods 0.000 claims description 23
- 238000006243 chemical reaction Methods 0.000 claims description 16
- 238000009499 grossing Methods 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 13
- 238000005516 engineering process Methods 0.000 claims description 12
- 210000000988 bone and bone Anatomy 0.000 claims description 9
- 210000001061 forehead Anatomy 0.000 claims description 8
- 210000000214 mouth Anatomy 0.000 claims description 6
- 238000007781 pre-processing Methods 0.000 claims description 6
- 238000004422 calculation algorithm Methods 0.000 claims description 5
- 230000009467 reduction Effects 0.000 claims description 5
- 210000003128 head Anatomy 0.000 claims description 4
- 238000012545 processing Methods 0.000 claims description 3
- 230000003595 spectral effect Effects 0.000 claims 2
- 230000002146 bilateral effect Effects 0.000 claims 1
- 238000010586 diagram Methods 0.000 description 10
- 230000037433 frameshift Effects 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 230000008569 process Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000035945 sensitivity Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005094 computer simulation Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000004088 simulation Methods 0.000 description 1
- 210000005182 tip of the tongue Anatomy 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
- 210000000216 zygoma Anatomy 0.000 description 1
Images
Landscapes
- Processing Or Creating Images (AREA)
Abstract
Description
Claims (11)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210265448.7A CN102820030B (zh) | 2012-07-27 | 2012-07-27 | 发音器官可视语音合成系统 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210265448.7A CN102820030B (zh) | 2012-07-27 | 2012-07-27 | 发音器官可视语音合成系统 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102820030A CN102820030A (zh) | 2012-12-12 |
CN102820030B true CN102820030B (zh) | 2014-03-26 |
Family
ID=47304115
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210265448.7A Active CN102820030B (zh) | 2012-07-27 | 2012-07-27 | 发音器官可视语音合成系统 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102820030B (zh) |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103258340B (zh) * | 2013-04-17 | 2015-12-09 | 中国科学技术大学 | 富有情感表达能力的三维可视化中文普通话发音词典的发音方法 |
CN103218841B (zh) * | 2013-04-26 | 2016-01-27 | 中国科学技术大学 | 结合生理模型和数据驱动模型的三维发音器官动画方法 |
US9607609B2 (en) * | 2014-09-25 | 2017-03-28 | Intel Corporation | Method and apparatus to synthesize voice based on facial structures |
CN105390133A (zh) * | 2015-10-09 | 2016-03-09 | 西北师范大学 | 藏语ttvs系统的实现方法 |
CN106875955A (zh) * | 2015-12-10 | 2017-06-20 | 掌赢信息科技(上海)有限公司 | 一种声音动画的制作方法及电子设备 |
CN111161368A (zh) * | 2019-12-13 | 2020-05-15 | 天津大学 | 通过输入语音实时合成人体发声器官运动图像的方法 |
CN111554318B (zh) * | 2020-04-27 | 2023-12-05 | 天津大学 | 一种手机端发音可视化系统的实现方法 |
CN115393945A (zh) * | 2022-10-27 | 2022-11-25 | 科大讯飞股份有限公司 | 基于语音的图像驱动方法、装置、电子设备及存储介质 |
CN116012505A (zh) * | 2022-12-29 | 2023-04-25 | 上海师范大学天华学院 | 基于关键点自检测与风格迁徙的发音动画生成方法及系统 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1466104A (zh) * | 2002-07-03 | 2004-01-07 | 中国科学院计算技术研究所 | 基于统计与规则结合的语音驱动人脸动画方法 |
WO2005031654A1 (en) * | 2003-09-30 | 2005-04-07 | Koninklijke Philips Electronics, N.V. | System and method for audio-visual content synthesis |
CN101488346A (zh) * | 2009-02-24 | 2009-07-22 | 深圳先进技术研究院 | 语音可视化系统及语音可视化方法 |
-
2012
- 2012-07-27 CN CN201210265448.7A patent/CN102820030B/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1466104A (zh) * | 2002-07-03 | 2004-01-07 | 中国科学院计算技术研究所 | 基于统计与规则结合的语音驱动人脸动画方法 |
WO2005031654A1 (en) * | 2003-09-30 | 2005-04-07 | Koninklijke Philips Electronics, N.V. | System and method for audio-visual content synthesis |
CN101488346A (zh) * | 2009-02-24 | 2009-07-22 | 深圳先进技术研究院 | 语音可视化系统及语音可视化方法 |
Non-Patent Citations (2)
Title |
---|
《基于混合映射模型的语音转换算法研究》;康永国等;《声学学报》;20061130;第31卷(第6期);555-562 * |
康永国等.《基于混合映射模型的语音转换算法研究》.《声学学报》.2006,第31卷(第6期),555-562. |
Also Published As
Publication number | Publication date |
---|---|
CN102820030A (zh) | 2012-12-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102820030B (zh) | 发音器官可视语音合成系统 | |
Morishima et al. | A media conversion from speech to facial image for intelligent man-machine interface | |
US7136818B1 (en) | System and method of providing conversational visual prosody for talking heads | |
US7353177B2 (en) | System and method of providing conversational visual prosody for talking heads | |
CA2375350C (en) | Method of animating a synthesised model of a human face driven by an acoustic signal | |
Kuratate et al. | Kinematics-based synthesis of realistic talking faces | |
Kuratate et al. | Audio-visual synthesis of talking faces from speech production correlates. | |
JP2518683B2 (ja) | 画像合成方法及びその装置 | |
US20120130717A1 (en) | Real-time Animation for an Expressive Avatar | |
US20120191460A1 (en) | Synchronized gesture and speech production for humanoid robots | |
JPH10312467A (ja) | 像合成のための自動スピーチ整列方法 | |
JP2003529861A5 (zh) | ||
Barker et al. | Evidence of correlation between acoustic and visual features of speech | |
JPH08235384A (ja) | 音響支援画像処理 | |
Yehia et al. | Facial animation and head motion driven by speech acoustics | |
JP4381404B2 (ja) | 音声合成システム、音声合成方法、音声合成プログラム | |
Waters et al. | DECface: A system for synthetic face applications | |
Morishima et al. | Real-time facial action image synthesis system driven by speech and text | |
JP2974655B1 (ja) | アニメーションシステム | |
Csapó | Extending text-to-speech synthesis with articulatory movement prediction using ultrasound tongue imaging | |
Morishima et al. | Speech-to-image media conversion based on VQ and neural network | |
Akdemir et al. | Bimodal automatic speech segmentation based on audio and visual information fusion | |
GB2328849A (en) | System for animating virtual actors using linguistic representations of speech for visual realism. | |
GB2346526A (en) | System for providing virtual actors using neural network and text-to-linguistics | |
Savran et al. | Speaker-independent 3D face synthesis driven by speech and text |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20170421 Address after: 100085 Beijing East Road, No. 35, building No. 1, floor 3, 1-312-316, Patentee after: Extreme element (Beijing) intelligent Polytron Technologies Inc Address before: 100190 Zhongguancun East Road, Beijing, No. 95, No. Patentee before: Institute of Automation, Chinese Academy of Sciences |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee after: Limit element (Hangzhou) intelligent Polytron Technologies Inc Address before: 100085 1-312-316, 3 floor, 1 building, 35 hospital, Shanghai East Road, Haidian District, Beijing. Patentee before: Extreme element (Beijing) intelligent Polytron Technologies Inc |
|
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee after: Zhongke extreme element (Hangzhou) Intelligent Technology Co., Ltd Address before: 310019 1105, 11 / F, 4 building, 9 Ring Road, Jianggan District nine, Hangzhou, Zhejiang. Patentee before: Limit element (Hangzhou) intelligent Polytron Technologies Inc. |