NO318698B1 - Device and method for prosody generation at visual synthesis

Info

Publication number
NO318698B1
Authority
NO
Norway
Prior art keywords
face
accordance
speech
movement pattern
words
Prior art date
Application number
NO19994599A
Other languages
English (en)
Norwegian (no)
Other versions
NO994599L (no)
NO994599D0 (no)
Inventor
Bertil Lyberg
Original Assignee
Teliasonera Ab Publ
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1997-03-25
Filing date
1999-09-22
Publication date
2005-04-25
Application filed by Teliasonera Ab Publ
Publication of NO994599D0
Publication of NO994599L
Publication of NO318698B1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 3D [Three Dimensional] animation
    • G06T13/205 3D [Three Dimensional] animation driven by audio data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 3D [Three Dimensional] animation
    • G06T13/40 3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/06 Elementary speech units used in speech synthesisers; Concatenation rules
    • G10L13/07 Concatenation rules
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/02 Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04 Details of speech synthesis systems, e.g. synthesiser structure or memory management
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06 Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10 Transforming into visible information
    • G10L2021/105 Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Machine Translation (AREA)
  • Processing Or Creating Images (AREA)
  • Steroid Compounds (AREA)
NO19994599A 1997-03-25 1999-09-22 Device and method for prosody generation at visual synthesis NO318698B1 (no)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE9701101A SE520065C2 (sv) 1997-03-25 1997-03-25 Device and method for prosody generation in visual speech synthesis
PCT/SE1998/000506 WO1998043235A2 (en) 1997-03-25 1998-03-20 Device and method for prosody generation at visual synthesis

Publications (3)

Publication Number Publication Date
NO994599D0 NO994599D0 (no) 1999-09-22
NO994599L NO994599L (no) 1999-12-14
NO318698B1 NO318698B1 (no) 2005-04-25

Family

ID=20406308

Family Applications (1)

Application Number Title Priority Date Filing Date
NO19994599A NO318698B1 (no) 1997-03-25 1999-09-22 Device and method for prosody generation at visual synthesis

Country Status (9)

Country Link
US (1) US6389396B1 (sv)
EP (1) EP0970465B1 (sv)
JP (1) JP2001517326A (sv)
DE (1) DE69816049T2 (sv)
DK (1) DK0970465T3 (sv)
EE (1) EE03883B1 (sv)
NO (1) NO318698B1 (sv)
SE (1) SE520065C2 (sv)
WO (1) WO1998043235A2 (sv)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6947044B1 (en) * 1999-05-21 2005-09-20 Kulas Charles J Creation and playback of computer-generated productions using script-controlled rendering engines
US20020194006A1 (en) * 2001-03-29 2002-12-19 Koninklijke Philips Electronics N.V. Text to visual speech system and method incorporating facial emotions
CN1159702C (zh) 2001-04-11 2004-07-28 International Business Machines Corporation Speech-to-speech translation system and method with emotion
US7076430B1 (en) 2002-05-16 2006-07-11 At&T Corp. System and method of providing conversational visual prosody for talking heads
US20060009978A1 (en) * 2004-07-02 2006-01-12 The Regents Of The University Of Colorado Methods and systems for synthesis of accurate visible speech via transformation of motion capture data
JP4985714B2 (ja) * 2009-06-12 2012-07-25 Casio Computer Co., Ltd. Voice display output control device and voice display output control processing program
US8571870B2 (en) * 2010-02-12 2013-10-29 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
US8949128B2 (en) * 2010-02-12 2015-02-03 Nuance Communications, Inc. Method and apparatus for providing speech output for speech-enabled applications
US8447610B2 (en) * 2010-02-12 2013-05-21 Nuance Communications, Inc. Method and apparatus for generating synthetic speech with contrastive stress
AU2012100262B4 (en) * 2011-12-15 2012-05-24 Nguyen, Phan Thi My Ngoc Ms Speech visualisation tool
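JP2012098753A (ja) * 2012-01-27 2012-05-24 Casio Comput Co Ltd Voice display output control device, image display control device, voice display output control processing program, and image display control processing program
CN112100352A (zh) * 2020-09-14 2020-12-18 Beijing Baidu Netcom Science and Technology Co., Ltd. Method, device, client, and storage medium for dialogue with a virtual object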

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2518683B2 (ja) * 1989-03-08 1996-07-24 Kokusai Denshin Denwa Co., Ltd. Image synthesis method and apparatus therefor
GB9019829D0 (en) 1990-09-11 1990-10-24 British Telecomm Speech analysis and image synthesis
US6122616A (en) * 1993-01-21 2000-09-19 Apple Computer, Inc. Method and apparatus for diphone aliasing
US5878396A (en) * 1993-01-21 1999-03-02 Apple Computer, Inc. Method and apparatus for synthetic speech in facial animation
SE9301596L (sv) * 1993-05-10 1994-05-24 Televerket Device for increasing speech comprehension when translating speech from a first language to a second language
SE516526C2 (sv) 1993-11-03 2002-01-22 Telia Ab Method and device for automatic extraction of prosodic information
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
CA2162199A1 (en) 1994-11-07 1996-05-08 Homer H. Chen Acoustic-assisted image processing
SE519244C2 (sv) * 1995-12-06 2003-02-04 Telia Ab Device and method for speech synthesis
SE9600959L (sv) 1996-03-13 1997-09-14 Telia Ab Method and device for speech-to-speech translation

Also Published As

Publication number Publication date
NO994599L (no) 1999-12-14
SE9701101L (sv) 1998-09-26
WO1998043235A2 (en) 1998-10-01
EP0970465B1 (en) 2003-07-02
DK0970465T3 (da) 2003-10-27
DE69816049T2 (de) 2004-04-22
NO994599D0 (no) 1999-09-22
DE69816049D1 (de) 2003-08-07
EE03883B1 (et) 2002-10-15
JP2001517326A (ja) 2001-10-02
US6389396B1 (en) 2002-05-14
SE9701101D0 (sv) 1997-03-25
EE9900419A (et) 2000-04-17
WO1998043235A3 (en) 1998-12-23
EP0970465A2 (en) 2000-01-12
SE520065C2 (sv) 2003-05-20

Similar Documents

Publication Publication Date Title
KR102306844B1 (ko) Video translation and lip-sync method and system
Choi Phonetic underspecification and target-interpolation: an acoustic study of Marshallese vowel allophony
NO311546B1 (no) Device and method for speech synthesis
Granström et al. Prosodic cues in multimodal speech perception
JPH0830287A (ja) Text-to-speech conversion system
JP2006106741A (ja) Method and apparatus for preventing speech comprehension by an interactive voice response system
NO318698B1 (no) Device and method for prosody generation at visual synthesis
Batliner et al. Prosodic models, automatic speech understanding, and speech synthesis: Towards the common ground?
El-Imam et al. Text-to-speech conversion of standard Malay
US6385580B1 (en) Method of speech synthesis
Black Speech synthesis for educational technology
Roehling et al. Towards expressive speech synthesis in English on a robotic platform
Qian et al. HMM-based mixed-language (Mandarin-English) speech synthesis
Ouni et al. Internationalization of a talking head
Hinterleitner et al. Speech synthesis
Leonardo et al. A general approach to TTS reading of mixed-language texts
Pagel et al. How to achieve a prominence GOAL! in different speaking styles
Granström et al. Eyebrow movements as a cue to prominence
Hirschberg et al. Voice response systems: Technologies and applications
Campbell Multi-lingual concatenative speech synthesis
Karjalainen Review of speech synthesis technology
Safabakhsh et al. AUT-Talk: a Farsi talking head
Keller Quality improvement of (wireless) phone-based teleservices using advanced speech synthesis techniques
Van Santen Phonetic knowledge in text-to-speech synthesis
O'Cinneide et al. A brief introduction to speech synthesis and voice modification