CN1214784A - 图象合成 - Google Patents

图象合成 Download PDF

Info

Publication number
CN1214784A
CN1214784A CN97193348A CN97193348A CN1214784A CN 1214784 A CN1214784 A CN 1214784A CN 97193348 A CN97193348 A CN 97193348A CN 97193348 A CN97193348 A CN 97193348A CN 1214784 A CN1214784 A CN 1214784A
Authority
CN
China
Prior art keywords
consonant
vowel
phoneme
lip
rounding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN97193348A
Other languages
English (en)
Chinese (zh)
Inventor
安德鲁·保罗·布林
埃马·简·鲍尔斯
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
British Telecommunications PLC
Original Assignee
British Telecommunications PLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by British Telecommunications PLC filed Critical British Telecommunications PLC
Publication of CN1214784A publication Critical patent/CN1214784A/zh
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/40Three-dimensional [3D] animation of characters, e.g. humans, animals or virtual beings
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/20Three-dimensional [3D] animation
    • G06T13/205Three-dimensional [3D] animation driven by audio data
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Processing Or Creating Images (AREA)
CN97193348A 1996-03-26 1997-03-24 图象合成 Pending CN1214784A (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP96302060.7 1996-03-26
EP96302060 1996-03-26

Publications (1)

Publication Number Publication Date
CN1214784A true CN1214784A (zh) 1999-04-21

Family

ID=8224860

Family Applications (1)

Application Number Title Priority Date Filing Date
CN97193348A Pending CN1214784A (zh) 1996-03-26 1997-03-24 图象合成

Country Status (8)

Country Link
EP (1) EP0890168B1 (https=)
JP (1) JP4037455B2 (https=)
KR (1) KR20000005183A (https=)
CN (1) CN1214784A (https=)
AU (1) AU2167097A (https=)
CA (1) CA2249016C (https=)
DE (1) DE69715175T2 (https=)
WO (1) WO1997036288A1 (https=)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108133709A (zh) * 2016-12-01 2018-06-08 奥林巴斯株式会社 语音识别装置和语音识别方法
CN108847234A (zh) * 2018-06-28 2018-11-20 广州华多网络科技有限公司 唇语合成方法、装置、电子设备及存储介质
CN111260761A (zh) * 2020-01-15 2020-06-09 北京猿力未来科技有限公司 一种生成动画人物口型的方法及装置

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2346527B (en) * 1997-07-25 2001-02-14 Motorola Inc Virtual actor with set of speaker profiles
AU2998099A (en) * 1998-03-11 1999-09-27 Entropic, Inc. Face synthesis system and methodology
WO1999046732A1 (fr) * 1998-03-11 1999-09-16 Mitsubishi Denki Kabushiki Kaisha Dispositif de generation d'images en mouvement et dispositif d'apprentissage via reseau de controle d'images
IT1314671B1 (it) * 1998-10-07 2002-12-31 Cselt Centro Studi Lab Telecom Procedimento e apparecchiatura per l'animazione di un modellosintetizzato di volto umano pilotata da un segnale audio.
SG87837A1 (en) 1998-10-08 2002-04-16 Sony Computer Entertainment Inc Portable toy, portable information terminal, intertainment system, and recording medium
KR20010072936A (ko) * 1999-06-24 2001-07-31 요트.게.아. 롤페즈 정보 스트림의 포스트-동기화
KR100395491B1 (ko) * 1999-08-16 2003-08-25 한국전자통신연구원 아바타 기반 음성 언어 번역 시스템에서의 화상 통신 방법
US6766299B1 (en) * 1999-12-20 2004-07-20 Thrillionaire Productions, Inc. Speech-controlled animation system
GB0008537D0 (en) 2000-04-06 2000-05-24 Ananova Ltd Character animation
GB0030148D0 (en) * 2000-12-11 2001-01-24 20 20 Speech Ltd Audio and video synthesis method and system
JP4067762B2 (ja) 2000-12-28 2008-03-26 ヤマハ株式会社 歌唱合成装置
US6661418B1 (en) 2001-01-22 2003-12-09 Digital Animations Limited Character animation system
DE10214431B4 (de) * 2002-03-30 2005-11-10 Ralf Dringenberg Verfahren und Vorrichtung zur Visualisierung von Audiodaten
KR100754430B1 (ko) * 2004-10-08 2007-08-31 비쥬텍쓰리디(주) 음성 기반 자동 립싱크 애니메이션 장치와 방법 및 기록매체
CN1991982A (zh) * 2005-12-29 2007-07-04 摩托罗拉公司 一种使用语音数据激励图像的方法
GB2468140A (en) 2009-02-26 2010-09-01 Dublin Inst Of Technology A character animation tool which associates stress values with the locations of vowels
AU2021204758A1 (en) * 2020-11-20 2022-06-16 Soul Machines Autonomous animation in embodied agents
CN113205797B (zh) * 2021-04-30 2024-03-05 平安科技(深圳)有限公司 虚拟主播生成方法、装置、计算机设备及可读存储介质

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4913539A (en) * 1988-04-04 1990-04-03 New York Institute Of Technology Apparatus and method for lip-synching animation
JP2518683B2 (ja) * 1989-03-08 1996-07-24 国際電信電話株式会社 画像合成方法及びその装置
US5313522A (en) * 1991-08-23 1994-05-17 Slager Robert P Apparatus for generating from an audio signal a moving visual lip image from which a speech content of the signal can be comprehended by a lipreader
US5608839A (en) * 1994-03-18 1997-03-04 Lucent Technologies Inc. Sound-synchronized video system

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108133709A (zh) * 2016-12-01 2018-06-08 奥林巴斯株式会社 语音识别装置和语音识别方法
CN108133709B (zh) * 2016-12-01 2021-09-14 奥林巴斯株式会社 语音识别装置和语音识别方法
CN108847234A (zh) * 2018-06-28 2018-11-20 广州华多网络科技有限公司 唇语合成方法、装置、电子设备及存储介质
CN111260761A (zh) * 2020-01-15 2020-06-09 北京猿力未来科技有限公司 一种生成动画人物口型的方法及装置
CN111260761B (zh) * 2020-01-15 2023-05-09 北京猿力未来科技有限公司 一种生成动画人物口型的方法及装置

Also Published As

Publication number Publication date
DE69715175T2 (de) 2003-05-15
KR20000005183A (ko) 2000-01-25
DE69715175D1 (de) 2002-10-10
JP2000507377A (ja) 2000-06-13
EP0890168B1 (en) 2002-09-04
CA2249016A1 (en) 1997-10-02
AU2167097A (en) 1997-10-17
EP0890168A1 (en) 1999-01-13
JP4037455B2 (ja) 2008-01-23
CA2249016C (en) 2002-12-03
WO1997036288A1 (en) 1997-10-02

Similar Documents

Publication Publication Date Title
CN1214784A (zh) 图象合成
CN112184858B (zh) 基于文本的虚拟对象动画生成方法及装置、存储介质、终端
CN112333179B (zh) 虚拟视频的直播方法、装置、设备及可读存储介质
KR102035596B1 (ko) 인공지능 기반의 가상 캐릭터의 페이셜 애니메이션 자동 생성 시스템 및 방법
US9082400B2 (en) Video generation based on text
US20100082345A1 (en) Speech and text driven hmm-based body animation synthesis
CN113987269B (zh) 数字人视频生成方法、装置、电子设备和存储介质
US6208356B1 (en) Image synthesis
CN116958342A (zh) 虚拟形象的动作生成方法、动作库的构建方法及装置
JP2014519082A5 (https=)
KR100300962B1 (ko) 음성합성을위한립싱크방법및그장치
EP2772906A1 (en) Multilingual speech system and method of character
CN114219880B (zh) 一种生成表情动画的方法和装置
CN107092664A (zh) 一种内容解释方法及装置
CN120153418A (zh) 用于文本转语音的大规模多语言语音-文本联合半监督学习
CN121753095A (zh) 利用对已发现的数据进行零监督来扩展多语言语音合成
CN120640052B (zh) 一种音素时间轴驱动的高清视频口型自动合成方法
CN114255737B (zh) 语音生成方法、装置、电子设备
KR20240083590A (ko) 동시조음 규칙을 조합하여 스피치 애니메이션을 자동으로 생성하는 방법
CN106708789A (zh) 一种文本处理方法及装置
KR102813533B1 (ko) 입모양 애니메이션 생성방법 및 장치
Bear et al. Some observations on computer lip-reading: moving from the dream to the reality
CN115797515A (zh) 一种语音生成和表情驱动方法、客户端及服务端
KR101196116B1 (ko) 리얼 타임 토킹 리얼리티 방법 및 장치
Theobald et al. 2.5 D Visual Speech Synthesis Using Appearance Models.

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication