CN100369469C - 语音驱动头部图像合成影音文件的方法 - Google Patents
语音驱动头部图像合成影音文件的方法 Download PDFInfo
- Publication number
- CN100369469C CN100369469C CNB200510093269XA CN200510093269A CN100369469C CN 100369469 C CN100369469 C CN 100369469C CN B200510093269X A CNB200510093269X A CN B200510093269XA CN 200510093269 A CN200510093269 A CN 200510093269A CN 100369469 C CN100369469 C CN 100369469C
- Authority
- CN
- China
- Prior art keywords
- voice
- frames
- image
- frame
- speech
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 230000014509 gene expression Effects 0.000 claims abstract description 29
- 230000003068 static effect Effects 0.000 claims abstract description 7
- 210000003128 head Anatomy 0.000 claims description 32
- 238000012545 processing Methods 0.000 claims description 22
- 230000001815 facial effect Effects 0.000 claims description 15
- 230000002194 synthesizing effect Effects 0.000 claims description 9
- 238000012937 correction Methods 0.000 claims description 8
- 210000000697 sensory organ Anatomy 0.000 claims description 7
- 230000004397 blinking Effects 0.000 claims description 4
- 230000001755 vocal effect Effects 0.000 claims description 4
- 241001465754 Metazoa Species 0.000 abstract description 4
- 230000001007 puffing effect Effects 0.000 abstract 2
- 238000004458 analytical method Methods 0.000 description 13
- 230000000694 effects Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 4
- 238000000605 extraction Methods 0.000 description 4
- 230000015572 biosynthetic process Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 3
- 238000001228 spectrum Methods 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000000007 visual effect Effects 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 2
- 238000006243 chemical reaction Methods 0.000 description 2
- 230000006835 compression Effects 0.000 description 2
- 238000007906 compression Methods 0.000 description 2
- 238000013075 data extraction Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000008921 facial expression Effects 0.000 description 2
- 238000001914 filtration Methods 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000003044 adaptive effect Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 239000003086 colorant Substances 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000010295 mobile communication Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000007781 pre-processing Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Landscapes
- Processing Or Creating Images (AREA)
Abstract
Description
Claims (9)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB200510093269XA CN100369469C (zh) | 2005-08-23 | 2005-08-23 | 语音驱动头部图像合成影音文件的方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB200510093269XA CN100369469C (zh) | 2005-08-23 | 2005-08-23 | 语音驱动头部图像合成影音文件的方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1731833A CN1731833A (zh) | 2006-02-08 |
CN100369469C true CN100369469C (zh) | 2008-02-13 |
Family
ID=35964119
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB200510093269XA Active CN100369469C (zh) | 2005-08-23 | 2005-08-23 | 语音驱动头部图像合成影音文件的方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100369469C (zh) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101482976B (zh) * | 2009-01-19 | 2010-10-27 | 腾讯科技(深圳)有限公司 | 语音驱动嘴唇形状变化的方法、获取嘴唇动画的方法及装置 |
CN104869326B (zh) * | 2015-05-27 | 2018-09-11 | 网易(杭州)网络有限公司 | 一种配合音频的图像显示方法和设备 |
CN105187736B (zh) * | 2015-07-28 | 2018-07-06 | 广东欧珀移动通信有限公司 | 一种将静止人脸图片转化为视频的方法、系统及移动终端 |
CN105761559A (zh) * | 2016-04-29 | 2016-07-13 | 东北电力大学 | 一种基于先入为主的反向共鸣的外语学习方法 |
CN107623622A (zh) * | 2016-07-15 | 2018-01-23 | 掌赢信息科技(上海)有限公司 | 一种发送语音动画的方法及电子设备 |
CN106447750A (zh) * | 2016-09-30 | 2017-02-22 | 长春市机器侠科技有限公司 | 一种深度写真影像重构表情同步视频生成方法 |
CN106777204B (zh) * | 2016-12-23 | 2020-08-07 | 北京安云世纪科技有限公司 | 图片数据的处理方法、装置及移动终端 |
CN106653052B (zh) * | 2016-12-29 | 2020-10-16 | Tcl科技集团股份有限公司 | 虚拟人脸动画的生成方法及装置 |
CN109087651B (zh) * | 2018-09-05 | 2021-01-19 | 广州势必可赢网络科技有限公司 | 一种基于视频与语谱图的声纹鉴定方法、系统及设备 |
CN110072047B (zh) * | 2019-01-25 | 2020-10-09 | 北京字节跳动网络技术有限公司 | 图像形变的控制方法、装置和硬件装置 |
CN110636323B (zh) * | 2019-10-15 | 2021-11-23 | 博科达(北京)科技有限公司 | 一种基于云平台的全球直播及视频点播系统及方法 |
CN112992120A (zh) * | 2019-12-02 | 2021-06-18 | 泛太丝亚企业管理顾问(上海)有限公司 | 语音转换虚拟脸部图像的方法 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1466104A (zh) * | 2002-07-03 | 2004-01-07 | 中国科学院计算技术研究所 | 基于统计与规则结合的语音驱动人脸动画方法 |
CN1492711A (zh) * | 2002-10-26 | 2004-04-28 | 乐金电子(中国)研究开发中心有限公 | 移动可视电话中基于语音的图像帧频控制装置及方法 |
US20040120554A1 (en) * | 2002-12-21 | 2004-06-24 | Lin Stephen Ssu-Te | System and method for real time lip synchronization |
-
2005
- 2005-08-23 CN CNB200510093269XA patent/CN100369469C/zh active Active
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1466104A (zh) * | 2002-07-03 | 2004-01-07 | 中国科学院计算技术研究所 | 基于统计与规则结合的语音驱动人脸动画方法 |
CN1492711A (zh) * | 2002-10-26 | 2004-04-28 | 乐金电子(中国)研究开发中心有限公 | 移动可视电话中基于语音的图像帧频控制装置及方法 |
US20040120554A1 (en) * | 2002-12-21 | 2004-06-24 | Lin Stephen Ssu-Te | System and method for real time lip synchronization |
Non-Patent Citations (1)
Title |
---|
基于数据挖掘的语音驱动三维人脸动画合成. 陈益文,高文,王兆其,姜大龙,左力.系统仿真学报,第14卷第4期. 2002 * |
Also Published As
Publication number | Publication date |
---|---|
CN1731833A (zh) | 2006-02-08 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100369469C (zh) | 语音驱动头部图像合成影音文件的方法 | |
CN113192161B (zh) | 一种虚拟人形象视频生成方法、系统、装置及存储介质 | |
WO2020007185A1 (zh) | 图像处理方法、装置、存储介质和计算机设备 | |
US7123262B2 (en) | Method of animating a synthesized model of a human face driven by an acoustic signal | |
CN112001992A (zh) | 基于深度学习的语音驱动3d虚拟人表情音画同步方法及系统 | |
WO2022100691A1 (zh) | 音频识别方法和装置 | |
WO2023035969A1 (zh) | 语音与图像同步性的衡量方法、模型的训练方法及装置 | |
Jachimski et al. | A comparative study of English viseme recognition methods and algorithms | |
CN115455136A (zh) | 智能数字人营销交互方法、装置、计算机设备及存储介质 | |
CN114581812A (zh) | 视觉语言识别方法、装置、电子设备及存储介质 | |
JP4774820B2 (ja) | 電子透かし埋め込み方法 | |
Matthews | Features for audio-visual speech recognition | |
CN112330579B (zh) | 视频背景更换方法、装置、计算机设备及计算机可读介质 | |
CN117409121A (zh) | 基于音频和单幅图像驱动的细粒度情感控制说话人脸视频生成方法、系统、设备及介质 | |
US20240265606A1 (en) | Method and apparatus for generating mouth shape by using deep learning network | |
JP4011844B2 (ja) | 翻訳装置、翻訳方法および媒体 | |
WO2007076279A2 (en) | Method for classifying speech data | |
KR100849027B1 (ko) | 음성 신호에 대한 립싱크 동기화 방법 및 장치 | |
Chen et al. | Lip synchronization in talking head video utilizing speech information | |
JP4177751B2 (ja) | 声質モデル生成方法、声質変換方法、並びにそれらのためのコンピュータプログラム、当該プログラムを記録した記録媒体、及び当該プログラムによりプログラムされたコンピュータ | |
TWI398853B (zh) | 人臉說話模擬系統及方法 | |
CN115410061B (zh) | 一种基于自然语言处理的图文情感分析系统 | |
Belete | College of Natural Sciences | |
CN115880737B (zh) | 一种基于降噪自学习的字幕生成方法、系统、设备及介质 | |
CN114581832B (zh) | 一种语音增强方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: WANG WEIGUO Free format text: FORMER OWNER: SUN DAN; APPLICANT Effective date: 20070420 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20070420 Address after: Beijing North 100044 North Xizhimen Avenue, No. 41 days trillion homes 4C501 Applicant after: Wang Weiguo Address before: 100044 Beijing city Xizhimen North Street No. 41 days trillion homes 4C501 Applicant before: Sun Dan Co-applicant before: Wang Weiguo |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: GUANGZHOU CITY YIFENG COMMUNICATION SCIENCE CO., L Free format text: FORMER OWNER: WANG WEIGUO Effective date: 20090703 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20090703 Address after: F8, 11 floor, No. 689 Tianhe North Road, Guangzhou, Tianhe District Patentee after: GUANGZHOU EAPHONE TECHNOLOGY Co.,Ltd. Address before: Beijing City, Xizhimen North Street, No. 41 days trillion homes 4C501 Patentee before: Wang Weiguo |
|
C56 | Change in the name or address of the patentee | ||
CP03 | Change of name, title or address |
Address after: 510620 Tianhe District, Guangdong, No. five road, No. 246, Patentee after: Guangzhou Yifeng Health Technology Co.,Ltd. Address before: F8, 11 floor, No. 689 Tianhe North Road, Guangzhou, Tianhe District Patentee before: GUANGZHOU EAPHONE TECHNOLOGY Co.,Ltd. |
|
CP03 | Change of name, title or address | ||
CP03 | Change of name, title or address |
Address after: Room 601-2, No. 246, 248, and 250 Wushan Road, Tianhe District, Guangzhou City, Guangdong Province, 510000 Patentee after: Guangzhou Yifeng Communication Technology Co.,Ltd. Country or region after: China Address before: No. 246, Wushan Road, Tianhe District, Guangzhou, Guangdong 510620 Patentee before: Guangzhou Yifeng Health Technology Co.,Ltd. Country or region before: China |