ZA200305593B - Generating visual representation of speech by any individuals of a population. - Google Patents

Generating visual representation of speech by any individuals of a population. Download PDF

Info

Publication number
ZA200305593B
ZA200305593B ZA200305593A ZA200305593A ZA200305593B ZA 200305593 B ZA200305593 B ZA 200305593B ZA 200305593 A ZA200305593 A ZA 200305593A ZA 200305593 A ZA200305593 A ZA 200305593A ZA 200305593 B ZA200305593 B ZA 200305593B
Authority
ZA
South Africa
Prior art keywords
visual
viseme
speech
profile
operative
Prior art date
Application number
ZA200305593A
Other languages
English (en)
Inventor
Nachshon Margaliot
Gad Blilious
Original Assignee
Speechview Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Speechview Ltd filed Critical Speechview Ltd
Publication of ZA200305593B publication Critical patent/ZA200305593B/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • User Interface Of Digital Computer (AREA)
  • Toys (AREA)
  • Telephonic Communication Services (AREA)
ZA200305593A 2000-12-19 2003-07-18 Generating visual representation of speech by any individuals of a population. ZA200305593B (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US25660600P 2000-12-19 2000-12-19

Publications (1)

Publication Number Publication Date
ZA200305593B true ZA200305593B (en) 2004-10-04

Family

ID=22972875

Family Applications (1)

Application Number Title Priority Date Filing Date
ZA200305593A ZA200305593B (en) 2000-12-19 2003-07-18 Generating visual representation of speech by any individuals of a population.

Country Status (6)

Country Link
US (1) US20040107106A1 (fr)
EP (1) EP1356460A4 (fr)
AU (1) AU2002216345A1 (fr)
CA (1) CA2432021A1 (fr)
WO (1) WO2002050813A2 (fr)
ZA (1) ZA200305593B (fr)

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB0229678D0 (en) * 2002-12-20 2003-01-29 Koninkl Philips Electronics Nv Telephone adapted to display animation corresponding to the audio of a telephone call
US20050204286A1 (en) * 2004-03-11 2005-09-15 Buhrke Eric R. Speech receiving device and viseme extraction method and apparatus
US20060009978A1 (en) * 2004-07-02 2006-01-12 The Regents Of The University Of Colorado Methods and systems for synthesis of accurate visible speech via transformation of motion capture data
US7643822B2 (en) * 2004-09-30 2010-01-05 Google Inc. Method and system for processing queries initiated by users of mobile devices
TWI454955B (zh) * 2006-12-29 2014-10-01 Nuance Communications Inc 使用模型檔產生動畫的方法及電腦可讀取的訊號承載媒體
CA2717992C (fr) * 2008-03-12 2018-01-16 E-Lane Systems Inc. Procede et systeme de comprehension de la parole
US8884982B2 (en) * 2009-12-15 2014-11-11 Deutsche Telekom Ag Method and apparatus for identifying speakers and emphasizing selected objects in picture and video messages
US8878773B1 (en) 2010-05-24 2014-11-04 Amazon Technologies, Inc. Determining relative motion as input
US20110311144A1 (en) * 2010-06-17 2011-12-22 Microsoft Corporation Rgb/depth camera for improving speech recognition
JP2012085009A (ja) * 2010-10-07 2012-04-26 Sony Corp 情報処理装置および情報処理方法
US8667548B2 (en) * 2012-03-11 2014-03-04 Broadcom Corporation Audio/video channel bonding architecture
US9094576B1 (en) * 2013-03-12 2015-07-28 Amazon Technologies, Inc. Rendered audiovisual communication
CN104424955B (zh) * 2013-08-29 2018-11-27 国际商业机器公司 生成音频的图形表示的方法和设备、音频搜索方法和设备
US9070409B1 (en) 2014-08-04 2015-06-30 Nathan Robert Yntema System and method for visually representing a recorded audio meeting
US20170099981A1 (en) * 2015-10-08 2017-04-13 Michel Abou Haidar Callisto integrated tablet computer in hot and cold dispensing machine
US20170099980A1 (en) * 2015-10-08 2017-04-13 Michel Abou Haidar Integrated tablet computer in hot and cold dispensing machine
US10460732B2 (en) * 2016-03-31 2019-10-29 Tata Consultancy Services Limited System and method to insert visual subtitles in videos
US10770092B1 (en) * 2017-09-22 2020-09-08 Amazon Technologies, Inc. Viseme data generation
US11030291B2 (en) * 2018-09-14 2021-06-08 Comcast Cable Communications, Llc Methods and systems for user authentication
US20220108510A1 (en) * 2019-01-25 2022-04-07 Soul Machines Limited Real-time generation of speech animation
US11860925B2 (en) * 2020-04-17 2024-01-02 Accenture Global Solutions Limited Human centered computing based digital persona generation
CN115174826A (zh) * 2022-07-07 2022-10-11 云知声智能科技股份有限公司 一种音视频合成方法及装置

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4012848A (en) * 1976-02-19 1977-03-22 Elza Samuilovna Diament Audio-visual teaching machine for speedy training and an instruction center on the basis thereof
US4884972A (en) * 1986-11-26 1989-12-05 Bright Star Technology, Inc. Speech synchronized animation
US4921427A (en) * 1989-08-21 1990-05-01 Dunn Jeffery W Educational device
US5278943A (en) * 1990-03-23 1994-01-11 Bright Star Technology, Inc. Speech animation and inflection system
JPH04237394A (ja) * 1991-01-21 1992-08-25 Ricoh Co Ltd マルチメディア名刺情報装置
US5613056A (en) * 1991-02-19 1997-03-18 Bright Star Technology, Inc. Advanced tools for speech synchronized animation
US5313522A (en) * 1991-08-23 1994-05-17 Slager Robert P Apparatus for generating from an audio signal a moving visual lip image from which a speech content of the signal can be comprehended by a lipreader
US5878396A (en) * 1993-01-21 1999-03-02 Apple Computer, Inc. Method and apparatus for synthetic speech in facial animation
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
US6232965B1 (en) * 1994-11-30 2001-05-15 California Institute Of Technology Method and apparatus for synthesizing realistic animations of a human speaking using a computer
US5734794A (en) * 1995-06-22 1998-03-31 White; Tom H. Method and system for voice-activated cell animation
JPH09200712A (ja) * 1996-01-12 1997-07-31 Sharp Corp 音声・画像伝送装置
US5923337A (en) * 1996-04-23 1999-07-13 Image Link Co., Ltd. Systems and methods for communicating through computer animated images
US5884267A (en) * 1997-02-24 1999-03-16 Digital Equipment Corporation Automated speech alignment for image synthesis
US6363380B1 (en) * 1998-01-13 2002-03-26 U.S. Philips Corporation Multimedia computer system with story segmentation capability and operating program therefor including finite automation video parser
US6250928B1 (en) * 1998-06-22 2001-06-26 Massachusetts Institute Of Technology Talking facial display method and apparatus
US6017260A (en) * 1998-08-20 2000-01-25 Mattel, Inc. Speaking toy having plural messages and animated character face
US6085242A (en) * 1999-01-05 2000-07-04 Chandra; Rohit Method for managing a repository of user information using a personalized uniform locator
US6219640B1 (en) * 1999-08-06 2001-04-17 International Business Machines Corporation Methods and apparatus for audio-visual speaker recognition and utterance verification
US6366885B1 (en) * 1999-08-27 2002-04-02 International Business Machines Corporation Speech driven lip synthesis using viseme based hidden markov models

Also Published As

Publication number Publication date
AU2002216345A1 (en) 2002-07-01
EP1356460A2 (fr) 2003-10-29
US20040107106A1 (en) 2004-06-03
EP1356460A4 (fr) 2006-01-04
CA2432021A1 (fr) 2002-06-27
WO2002050813A2 (fr) 2002-06-27
WO2002050813A3 (fr) 2002-11-07

Similar Documents

Publication Publication Date Title
US20040107106A1 (en) Apparatus and methods for generating visual representations of speech verbalized by any of a population of personas
US10163111B2 (en) Virtual photorealistic digital actor system for remote service of customers
US11222632B2 (en) System and method for intelligent initiation of a man-machine dialogue based on multi-modal sensory inputs
US11468894B2 (en) System and method for personalizing dialogue based on user's appearances
CN103650002B (zh) 基于文本的视频生成
Cox et al. Tessa, a system to aid communication with deaf people
US20150287403A1 (en) Device, system, and method of automatically generating an animated content-item
Cosatto et al. Lifelike talking faces for interactive services
US20110131041A1 (en) Systems And Methods For Synthesis Of Motion For Animation Of Virtual Heads/Characters Via Voice Processing In Portable Devices
US20100085363A1 (en) Photo Realistic Talking Head Creation, Content Creation, and Distribution System and Method
CN104144108B (zh) 一种消息响应方法、装置及系统
WO2022089224A1 (fr) Procédé et appareil de communication de vidéo, dispositif électronique, support de stockage lisible par ordinateur, et produit programme informatique
JP2003521750A (ja) スピーチシステム
KR100733772B1 (ko) 이동통신 가입자를 위한 립싱크 서비스 제공 방법 및 이를위한 시스템
JP4077656B2 (ja) 発言者特定映像装置
CN112669846A (zh) 交互系统、方法、装置、电子设备及存储介质
Hassid et al. More than words: In-the-wild visually-driven prosody for text-to-speech
CN111160051B (zh) 数据处理方法、装置、电子设备及存储介质
CN115393484A (zh) 虚拟形象动画的生成方法、装置、电子设备和存储介质
Verma et al. Animating expressive faces across languages
KR20100134022A (ko) 실사 토킹 헤드 생성, 콘텐트 생성, 분배 시스템 및 방법
KR20040076524A (ko) 애니메이션 캐릭터 제작 방법 및 애니메이션 캐릭터를이용한 인터넷 서비스 시스템
JP7496128B2 (ja) 仮想人物対話システム、映像生成方法、映像生成プログラム
US9633505B2 (en) System and method for on-demand delivery of audio content for use with entertainment creatives
US20240290024A1 (en) Dynamic synthetic video chat agent replacement