CA2249016C - Image synthesis - Google Patents

Info

Publication number
CA2249016C
Authority
CA
Canada
Prior art keywords
vowel
consonant
group
phonetic
representations
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CA002249016A
Other languages
English (en)
French (fr)
Other versions
CA2249016A1 (en)
Inventor
Andrew Paul Breen
Emma Jane Bowers
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
British Telecommunications PLC
Original Assignee
British Telecommunications PLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1996-03-26
Filing date
1997-03-24
Publication date
2002-12-03
Application filed by British Telecommunications PLC
Publication of CA2249016A1
Application granted
Publication of CA2249016C
Anticipated expiration
Expired - Fee Related

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/20 Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 Three-dimensional [3D] animation
    • G06T13/40 Three-dimensional [3D] animation of characters, e.g. humans, animals or virtual beings
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 Three-dimensional [3D] animation
    • G06T13/205 Three-dimensional [3D] animation driven by audio data
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06 Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10 Transforming into visible information
    • G10L2021/105 Synthesis of the lips movements from speech, e.g. for talking heads
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Processing Or Creating Images (AREA)
CA002249016A 1996-03-26 1997-03-24 Image synthesis Expired - Fee Related CA2249016C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
EP96302060.7 1996-03-26
EP96302060 1996-03-26
PCT/GB1997/000818 WO1997036288A1 (en) 1996-03-26 1997-03-24 Image synthesis

Publications (2)

Publication Number Publication Date
CA2249016A1 (en) 1997-10-02
CA2249016C (en) 2002-12-03

Family

ID=8224860

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002249016A Expired - Fee Related CA2249016C (en) 1996-03-26 1997-03-24 Image synthesis

Country Status (8)

Country Link
EP (1) EP0890168B1
JP (1) JP4037455B2
KR (1) KR20000005183A
CN (1) CN1214784A
AU (1) AU2167097A
CA (1) CA2249016C
DE (1) DE69715175T2
WO (1) WO1997036288A1

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2346527B (en) * 1997-07-25 2001-02-14 Motorola Inc Virtual actor with set of speaker profiles
AU2998099A (en) * 1998-03-11 1999-09-27 Entropic, Inc. Face synthesis system and methodology
WO1999046732A1 (fr) * 1998-03-11 1999-09-16 Mitsubishi Denki Kabushiki Kaisha Moving picture generating device and image control network learning device
IT1314671B1 (it) * 1998-10-07 2002-12-31 Cselt Centro Studi Lab Telecom Method and apparatus for animating a synthesized model of a human face driven by an audio signal
SG87837A1 (en) 1998-10-08 2002-04-16 Sony Computer Entertainment Inc Portable toy, portable information terminal, entertainment system, and recording medium
KR20010072936A (ko) * 1999-06-24 2001-07-31 요트.게.아. 롤페즈 Post-synchronization of an information stream
KR100395491B1 (ko) * 1999-08-16 2003-08-25 한국전자통신연구원 Video communication method in an avatar-based spoken language translation system
US6766299B1 (en) * 1999-12-20 2004-07-20 Thrillionaire Productions, Inc. Speech-controlled animation system
GB0008537D0 (en) 2000-04-06 2000-05-24 Ananova Ltd Character animation
GB0030148D0 (en) * 2000-12-11 2001-01-24 20 20 Speech Ltd Audio and video synthesis method and system
JP4067762B2 (ja) 2000-12-28 2008-03-26 ヤマハ株式会社 Singing synthesis apparatus
US6661418B1 (en) 2001-01-22 2003-12-09 Digital Animations Limited Character animation system
DE10214431B4 (de) * 2002-03-30 2005-11-10 Ralf Dringenberg Method and device for visualizing audio data
KR100754430B1 (ko) * 2004-10-08 2007-08-31 비쥬텍쓰리디(주) Voice-based automatic lip-sync animation apparatus, method, and recording medium
CN1991982A (zh) * 2005-12-29 2007-07-04 摩托罗拉公司 Method for animating an image using speech data
GB2468140A (en) 2009-02-26 2010-09-01 Dublin Inst Of Technology A character animation tool which associates stress values with the locations of vowels
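JP2018091954A (ja) * 2016-12-01 2018-06-14 オリンパス株式会社 Speech recognition device and speech recognition method
CN108847234B (zh) * 2018-06-28 2020-10-30 广州华多网络科技有限公司 Lip language synthesis method and apparatus, electronic device, and storage medium
CN111260761B (zh) * 2020-01-15 2023-05-09 北京猿力未来科技有限公司 Method and apparatus for generating mouth shapes of an animated character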
AU2021204758A1 (en) * 2020-11-20 2022-06-16 Soul Machines Autonomous animation in embodied agents
CN113205797B (zh) * 2021-04-30 2024-03-05 平安科技(深圳)有限公司 Virtual anchor generation method and apparatus, computer device, and readable storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4913539A (en) * 1988-04-04 1990-04-03 New York Institute Of Technology Apparatus and method for lip-synching animation
JP2518683B2 (ja) * 1989-03-08 1996-07-24 国際電信電話株式会社 Image synthesis method and apparatus therefor
US5313522A (en) * 1991-08-23 1994-05-17 Slager Robert P Apparatus for generating from an audio signal a moving visual lip image from which a speech content of the signal can be comprehended by a lipreader
US5608839A (en) * 1994-03-18 1997-03-04 Lucent Technologies Inc. Sound-synchronized video system

Also Published As

Publication number Publication date
DE69715175T2 (de) 2003-05-15
CN1214784A (zh) 1999-04-21
KR20000005183A (ko) 2000-01-25
DE69715175D1 (de) 2002-10-10
JP2000507377A (ja) 2000-06-13
EP0890168B1 (en) 2002-09-04
CA2249016A1 (en) 1997-10-02
AU2167097A (en) 1997-10-17
EP0890168A1 (en) 1999-01-13
JP4037455B2 (ja) 2008-01-23
WO1997036288A1 (en) 1997-10-02

Similar Documents

Publication Publication Date Title
CA2249016C (en) Image synthesis
US6208356B1 (en) Image synthesis
Ezzat et al. Miketalk: A talking facial display based on morphing visemes
US5657426A (en) Method and apparatus for producing audio-visual synthetic speech
JP2518683B2 (ja) Image synthesis method and apparatus therefor
US20020024519A1 (en) System and method for producing three-dimensional moving picture authoring tool supporting synthesis of motion, facial expression, lip synchronizing and lip synchronized voice of three-dimensional character
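CN110880315A (zh) Personalized speech and video generation system based on phoneme posterior probability
CN112001992A (zh) Deep-learning-based method and system for voice-driven 3D virtual human expression and audio-visual synchronization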
US6014625A (en) Method and apparatus for producing lip-movement parameters in a three-dimensional-lip-model
GB2516965A (en) Synthetic audiovisual storyteller
KR101153736B1 (ko) Apparatus and method for generating articulatory organ animation
KR100300962B1 (ko) Lip-sync method and apparatus for speech synthesis
US6839672B1 (en) Integration of talking heads and text-to-speech synthesizers for visual TTS
US6332123B1 (en) Mouth shape synthesizing
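JPH08235384A (ja) Sound-assisted image processing
JP4617500B2 (ja) Lip-sync animation creation device, computer program, and face model generation device
KR100754430B1 (ko) Voice-based automatic lip-sync animation apparatus, method, and recording medium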
Breen et al. An investigation into the generation of mouth shapes for a talking head
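KR20080018408A (ko) Computer-readable recording medium storing a program for changing facial expressions using a voice sound source
JP3755503B2 (ja) Animation production system
JP2003058908A (ja) Face image control method and apparatus, computer program, and recording medium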
EP0982684A1 (en) Moving picture generating device and image control network learning device
JP4631077B2 (ja) Animation creation device
EP0144731B1 (en) Speech synthesizer
CN115797515A (zh) Speech generation and expression driving method, client, and server

Legal Events

Date Code Title Description
EEER Examination request
MKLA Lapsed

Effective date: 2012-03-26