WO2008031955A3 - Method and system for animating an avatar in real time using the voice of a speaker - Google Patents

Method and system for animating an avatar in real time using the voice of a speaker

Info

Publication number
WO2008031955A3
WO2008031955A3 PCT/FR2007/001495 FR2007001495W WO2008031955A3 WO 2008031955 A3 WO2008031955 A3 WO 2008031955A3 FR 2007001495 W FR2007001495 W FR 2007001495W WO 2008031955 A3 WO2008031955 A3 WO 2008031955A3
Authority
WO
WIPO (PCT)
Prior art keywords
avatar
real time
speaker
animating
voice
Prior art date
Application number
PCT/FR2007/001495
Other languages
English (en)
Other versions
WO2008031955A2 (fr)
Inventor
Laurent Ach
Serge Vieillescaze
Benoit Morel
Original Assignee
Cantoche Production S A
Laurent Ach
Serge Vieillescaze
Benoit Morel
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2006-09-15
Filing date
2007-09-14
Publication date
2008-06-05
Application filed by Cantoche Production S A, Laurent Ach, Serge Vieillescaze, Benoit Morel
Priority to US12/441,293 (US20090278851A1)
Priority to EP07848234A (EP2059926A2)
Publication of WO2008031955A2
Publication of WO2008031955A3

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 3D [Three Dimensional] animation
    • G06T13/205 3D [Three Dimensional] animation driven by audio data
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00 Animation
    • G06T13/20 3D [Three Dimensional] animation
    • G06T13/40 3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06 Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10 Transforming into visible information
    • G10L2021/105 Synthesis of the lips movements from speech, e.g. for talking heads

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • Telephone Function (AREA)

Abstract

The invention relates to a method and a system for animating, on a screen (3, 3', 3'') of a mobile device (4, 4', 4''), an avatar (2, 2', 2'') provided with a mouth (5, 5'), from a sound input signal (6) corresponding to the voice (7) of a speaker (8) in a telephone communication. The sound input signal is transformed in real time into an audio and video stream in which the movements of the avatar's mouth are synchronized with the phonemes detected in said sound input signal, and the avatar is animated consistently with said signal through changes of attitude and movements derived from analysis of said signal, so that the avatar appears to speak in real time, or substantially in real time, in place of the speaker.
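The synchronization described in the abstract can be pictured with a small sketch. It is not the method claimed in the patent: the phoneme-to-mouth-shape table, the loudness thresholds, the attitude names and every identifier below (`PHONEME_TO_VISEME`, `Phoneme`, `viseme_at`, `attitude_for`, `build_frames`) are assumptions made for illustration, and phoneme detection itself is taken as already done upstream.

```python
from dataclasses import dataclass

# Hypothetical phoneme-to-mouth-shape table (illustrative, not from the patent).
PHONEME_TO_VISEME = {
    "a": "open", "o": "round", "u": "round", "i": "wide", "e": "wide",
    "m": "closed", "b": "closed", "p": "closed", "f": "teeth", "v": "teeth",
}
REST = "rest"  # mouth shape used when no phoneme is active


@dataclass(frozen=True)
class Phoneme:
    """A phoneme assumed to have been detected in the input signal."""
    symbol: str
    start_ms: int
    end_ms: int


def viseme_at(phonemes, t_ms):
    """Return the mouth shape for the phoneme active at time t_ms."""
    for p in phonemes:
        if p.start_ms <= t_ms < p.end_ms:
            return PHONEME_TO_VISEME.get(p.symbol, REST)
    return REST


def attitude_for(level):
    """Pick a coarse attitude from a normalized loudness level in [0, 1].

    The thresholds are arbitrary placeholders.
    """
    if level >= 0.7:
        return "animated"
    if level >= 0.3:
        return "neutral"
    return "calm"


def build_frames(phonemes, levels, duration_ms, fps=10):
    """Build (time_ms, mouth_shape, attitude) tuples, one per video frame.

    `levels` holds one loudness value per frame; the last value is reused
    if the list is shorter than the number of frames.
    """
    step_ms = 1000 // fps
    frames = []
    for i, t in enumerate(range(0, duration_ms, step_ms)):
        level = levels[min(i, len(levels) - 1)] if levels else 0.0
        frames.append((t, viseme_at(phonemes, t), attitude_for(level)))
    return frames


if __name__ == "__main__":
    detected = [Phoneme("m", 0, 100), Phoneme("a", 100, 300)]
    for frame in build_frames(detected, [0.1, 0.5, 0.9, 0.9], 400):
        print(frame)
```

Keeping the mouth shape (driven by phonemes) separate from the attitude (driven by a coarser property of the signal) mirrors the two kinds of animation the abstract distinguishes; a real-time system would produce these frames incrementally as audio arrives rather than from a complete list.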
PCT/FR2007/001495 2006-09-15 2007-09-14 Method and system for animating an avatar in real time using the voice of a speaker WO2008031955A2 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
US12/441,293 US20090278851A1 (en) 2006-09-15 2007-09-14 Method and system for animating an avatar in real time using the voice of a speaker
EP07848234A EP2059926A2 (fr) 2006-09-15 2007-09-14 Method and system for animating an avatar in real time using the voice of a speaker

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
FR0608078A FR2906056B1 (fr) 2006-09-15 2006-09-15 Method and system for animating an avatar in real time using the voice of a speaker
FR0608078 2006-09-15

Publications (2)

Publication Number Publication Date
WO2008031955A2 (fr) 2008-03-20
WO2008031955A3 (fr) 2008-06-05

Family

ID=37882253

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/FR2007/001495 WO2008031955A2 (fr) 2006-09-15 2007-09-14 Method and system for animating an avatar in real time using the voice of a speaker

Country Status (4)

Country Link
US (1) US20090278851A1 (fr)
EP (1) EP2059926A2 (fr)
FR (1) FR2906056B1 (fr)
WO (1) WO2008031955A2 (fr)

Families Citing this family (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2468140A (en) * 2009-02-26 2010-09-01 Dublin Inst Of Technology A character animation tool which associates stress values with the locations of vowels
US9665563B2 (en) * 2009-05-28 2017-05-30 Samsung Electronics Co., Ltd. Animation system and methods for generating animation based on text-based data and user information
US20120058747A1 (en) * 2010-09-08 2012-03-08 James Yiannios Method For Communicating and Displaying Interactive Avatar
US20120069028A1 (en) * 2010-09-20 2012-03-22 Yahoo! Inc. Real-time animations of emoticons using facial recognition during a video chat
US8948893B2 (en) 2011-06-06 2015-02-03 International Business Machines Corporation Audio media mood visualization method and system
WO2013076359A1 (fr) * 2011-11-24 2013-05-30 Nokia Corporation Method, apparatus and computer program product for generating an animated image associated with multimedia content
RU2481640C1 (ru) * 2011-12-01 2013-05-10 Корпорация "Самсунг Электроникс Ко., Лтд" Method and system for generating animated art effects on static images
US9035955B2 (en) 2012-05-16 2015-05-19 Microsoft Technology Licensing, Llc Synchronizing virtual actor's performances to a speaker's voice
US9325809B1 (en) * 2012-09-07 2016-04-26 Mindmeld, Inc. Audio recall during voice conversations
GB201301981D0 (en) * 2013-02-04 2013-03-20 Headcast Ltd Presenting audio/visual animations
GB201315142D0 (en) * 2013-08-23 2013-10-09 Ucl Business Plc Audio-Visual Dialogue System and Method
US20150287403A1 (en) * 2014-04-07 2015-10-08 Neta Holzer Zaslansky Device, system, and method of automatically generating an animated content-item
US11289077B2 (en) * 2014-07-15 2022-03-29 Avaya Inc. Systems and methods for speech analytics and phrase spotting using phoneme sequences
US10291597B2 (en) 2014-08-14 2019-05-14 Cisco Technology, Inc. Sharing resources across multiple devices in online meetings
US10542126B2 (en) 2014-12-22 2020-01-21 Cisco Technology, Inc. Offline virtual participation in an online conference meeting
US9948786B2 (en) 2015-04-17 2018-04-17 Cisco Technology, Inc. Handling conferences using highly-distributed agents
US10592867B2 (en) 2016-11-11 2020-03-17 Cisco Technology, Inc. In-meeting graphical user interface display using calendar information and system
US10516707B2 (en) 2016-12-15 2019-12-24 Cisco Technology, Inc. Initiating a conferencing meeting using a conference room device
US10440073B2 (en) 2017-04-11 2019-10-08 Cisco Technology, Inc. User interface for proximity based teleconference transfer
US10375125B2 (en) 2017-04-27 2019-08-06 Cisco Technology, Inc. Automatically joining devices to a video conference
US10375474B2 (en) 2017-06-12 2019-08-06 Cisco Technology, Inc. Hybrid horn microphone
US10477148B2 (en) 2017-06-23 2019-11-12 Cisco Technology, Inc. Speaker anticipation
US10516709B2 (en) 2017-06-29 2019-12-24 Cisco Technology, Inc. Files automatically shared at conference initiation
US10706391B2 (en) 2017-07-13 2020-07-07 Cisco Technology, Inc. Protecting scheduled meeting in physical room
US10091348B1 (en) 2017-07-25 2018-10-02 Cisco Technology, Inc. Predictive model for voice/video over IP calls
US10812430B2 (en) * 2018-02-22 2020-10-20 Mercury Universe, LLC Method and system for creating a mercemoji
US10580187B2 (en) * 2018-05-01 2020-03-03 Enas TARAWNEH System and method for rendering of an animated avatar
KR20210117066A (ko) * 2020-03-18 2021-09-28 라인플러스 주식회사 Method and apparatus for sound-based avatar motion control
CN111988658B (zh) * 2020-08-28 2022-12-06 网易(杭州)网络有限公司 Video generation method and device
EP4216167A4 (fr) * 2021-01-13 2024-05-01 Samsung Electronics Co Ltd Electronic device and method for operating an avatar video service

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6539354B1 (en) * 2000-03-24 2003-03-25 Fluent Speech Technologies, Inc. Methods and devices for producing and using synthetic visual speech based on natural coarticulation
US6839672B1 (en) * 1998-01-30 2005-01-04 At&T Corp. Integration of talking heads and text-to-speech synthesizers for visual TTS
GB2423905A (en) * 2005-03-03 2006-09-06 Sean Smith Animated messaging

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1345179A3 (fr) * 2002-03-13 2004-01-21 Matsushita Electric Industrial Co., Ltd. Method and device for animating computer-generated images
AU2003218320A1 (en) * 2002-03-21 2003-10-08 U.S. Army Medical Research And Materiel Command Methods and systems for detecting, measuring, and monitoring stress in speech
US7136818B1 (en) * 2002-05-16 2006-11-14 At&T Corp. System and method of providing conversational visual prosody for talking heads
US7983910B2 (en) * 2006-03-03 2011-07-19 International Business Machines Corporation Communicating across voice and text channels with emotion preservation

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6839672B1 (en) * 1998-01-30 2005-01-04 At&T Corp. Integration of talking heads and text-to-speech synthesizers for visual TTS
US6539354B1 (en) * 2000-03-24 2003-03-25 Fluent Speech Technologies, Inc. Methods and devices for producing and using synthetic visual speech based on natural coarticulation
GB2423905A (en) * 2005-03-03 2006-09-06 Sean Smith Animated messaging

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
POLZIN, WAIBEL: "Detecting Emotions in Speech", PROCEEDINGS OF THE CSC, 31 December 1998 (1998-12-31), pages 1 - 7, XP002427820 *

Also Published As

Publication number Publication date
FR2906056B1 (fr) 2009-02-06
WO2008031955A2 (fr) 2008-03-20
FR2906056A1 (fr) 2008-03-21
US20090278851A1 (en) 2009-11-12
EP2059926A2 (fr) 2009-05-20

Similar Documents

Publication Publication Date Title
WO2008031955A3 (fr) Method and system for animating an avatar in real time using the voice of a speaker
NL2021308B1 (en) Methods for a voice processing system
RS49875B (sr) System and method for hands-free voice communication using a microphone array
WO2006028587A3 (fr) Headset for separating speech signals in a noisy environment
US8265292B2 (en) Removing noise from audio
WO2008036950A3 (fr) Audio processing to improve usage
US8553906B2 (en) Apparatus for enabling karaoke
WO2006071420A3 (fr) Apparatus and method for receiving inputs from a user
CN109348338A (zh) Earphone and playback method thereof
WO2007123946A3 (fr) System and method for producing location-specific sound in a telepresence system
CN105960794A (zh) Smart Bluetooth headset for voice commands
WO2007013075A3 (fr) Synchronized voice/data system
TW200601865A (en) Sound pickup apparatus and method of the same
WO2006073501A3 (fr) IP videophone
DE602005021546D1 (en) Hung
CN107845386B (zh) Sound signal processing method, mobile terminal and server
WO2008078624A1 (fr) Video output device
WO2020139724A1 (fr) Context-based speech synthesis
CN101931816A (zh) Method and system for transferring 3G mobile phone video to a television
CN104333649B (zh) Method and device for presenting voice messages on a communication terminal
JP2024505944A (ja) System and method for handling speech audio stream interruptions
EP2741526A3 (fr) Audio apparatus and audio signal processing method
WO2007095413A3 (fr) Method and apparatus for detecting affect in speech
Vaughan et al. Designing and implementing a platform for collecting multi-modal data of human-robot interaction
JP2006005440A (ja) Call transmission and reception method and call terminal

Legal Events

Date Code Title Description
NENP Non-entry into the national phase (Ref country code: DE)
WWE WIPO information: entry into national phase (Ref document number: 2007848234; Country of ref document: EP)
WWE WIPO information: entry into national phase (Ref document number: 12441293; Country of ref document: US)
Country of ref document: US