WO2008031955A3 - Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur - Google Patents
Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur Download PDFInfo
- Publication number
- WO2008031955A3 WO2008031955A3 PCT/FR2007/001495 FR2007001495W WO2008031955A3 WO 2008031955 A3 WO2008031955 A3 WO 2008031955A3 FR 2007001495 W FR2007001495 W FR 2007001495W WO 2008031955 A3 WO2008031955 A3 WO 2008031955A3
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- avatar
- real time
- speaker
- animating
- voice
- Prior art date
Links
- 230000005236 sound signal Effects 0.000 abstract 3
- 230000001360 synchronised effect Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/205—3D [Three Dimensional] animation driven by audio data
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T13/00—Animation
- G06T13/20—3D [Three Dimensional] animation
- G06T13/40—3D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- Processing Or Creating Images (AREA)
- Telephone Function (AREA)
Abstract
II s'agit d'un procédé et d'un système d'animation sur un écran (3, 3', 3'') d'appareil mobile (4, 4', 4'') d'un avatar (2, 2', 2'') muni d'une bouche (5, 5') à partir d'un signal d'entrée sonore (6) correspondant à la voix (7) d'un interlocuteur (8) de communication téléphonique. On transforme en temps réel le signal d'entrée sonore en un flux audio et vidéo dans lequel on synchronise les mouvements de la bouche de l'avatar avec les phonèmes détectés dans ledit signal d'entrée sonore, et on anime l'avatar de façon cohérente avec ledit signal par des changements d'attitudes et des mouvements par analyse dudit signal, de sorte que l'avatar semble parler en temps réel ou sensiblement en temps réel à la place de l'interlocuteur.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US12/441,293 US20090278851A1 (en) | 2006-09-15 | 2007-09-14 | Method and system for animating an avatar in real time using the voice of a speaker |
EP07848234A EP2059926A2 (fr) | 2006-09-15 | 2007-09-14 | Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
FR0608078A FR2906056B1 (fr) | 2006-09-15 | 2006-09-15 | Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur. |
FR0608078 | 2006-09-15 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2008031955A2 WO2008031955A2 (fr) | 2008-03-20 |
WO2008031955A3 true WO2008031955A3 (fr) | 2008-06-05 |
Family
ID=37882253
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/FR2007/001495 WO2008031955A2 (fr) | 2006-09-15 | 2007-09-14 | Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur |
Country Status (4)
Country | Link |
---|---|
US (1) | US20090278851A1 (fr) |
EP (1) | EP2059926A2 (fr) |
FR (1) | FR2906056B1 (fr) |
WO (1) | WO2008031955A2 (fr) |
Families Citing this family (30)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2468140A (en) * | 2009-02-26 | 2010-09-01 | Dublin Inst Of Technology | A character animation tool which associates stress values with the locations of vowels |
US9665563B2 (en) * | 2009-05-28 | 2017-05-30 | Samsung Electronics Co., Ltd. | Animation system and methods for generating animation based on text-based data and user information |
US20120058747A1 (en) * | 2010-09-08 | 2012-03-08 | James Yiannios | Method For Communicating and Displaying Interactive Avatar |
US20120069028A1 (en) * | 2010-09-20 | 2012-03-22 | Yahoo! Inc. | Real-time animations of emoticons using facial recognition during a video chat |
US8948893B2 (en) | 2011-06-06 | 2015-02-03 | International Business Machines Corporation | Audio media mood visualization method and system |
WO2013076359A1 (fr) * | 2011-11-24 | 2013-05-30 | Nokia Corporation | Procédé, appareil et produit programme d'ordinateur pour produire une image animée associée à un contenu multimédia |
RU2481640C1 (ru) * | 2011-12-01 | 2013-05-10 | Корпорация "Самсунг Электроникс Ко., Лтд" | Способ и система генерации анимированных художественных эффектов на статичных изображениях |
US9035955B2 (en) | 2012-05-16 | 2015-05-19 | Microsoft Technology Licensing, Llc | Synchronizing virtual actor's performances to a speaker's voice |
US9325809B1 (en) * | 2012-09-07 | 2016-04-26 | Mindmeld, Inc. | Audio recall during voice conversations |
GB201301981D0 (en) * | 2013-02-04 | 2013-03-20 | Headcast Ltd | Presenting audio/visual animations |
GB201315142D0 (en) * | 2013-08-23 | 2013-10-09 | Ucl Business Plc | Audio-Visual Dialogue System and Method |
US20150287403A1 (en) * | 2014-04-07 | 2015-10-08 | Neta Holzer Zaslansky | Device, system, and method of automatically generating an animated content-item |
US11289077B2 (en) * | 2014-07-15 | 2022-03-29 | Avaya Inc. | Systems and methods for speech analytics and phrase spotting using phoneme sequences |
US10291597B2 (en) | 2014-08-14 | 2019-05-14 | Cisco Technology, Inc. | Sharing resources across multiple devices in online meetings |
US10542126B2 (en) | 2014-12-22 | 2020-01-21 | Cisco Technology, Inc. | Offline virtual participation in an online conference meeting |
US9948786B2 (en) | 2015-04-17 | 2018-04-17 | Cisco Technology, Inc. | Handling conferences using highly-distributed agents |
US10592867B2 (en) | 2016-11-11 | 2020-03-17 | Cisco Technology, Inc. | In-meeting graphical user interface display using calendar information and system |
US10516707B2 (en) | 2016-12-15 | 2019-12-24 | Cisco Technology, Inc. | Initiating a conferencing meeting using a conference room device |
US10440073B2 (en) | 2017-04-11 | 2019-10-08 | Cisco Technology, Inc. | User interface for proximity based teleconference transfer |
US10375125B2 (en) | 2017-04-27 | 2019-08-06 | Cisco Technology, Inc. | Automatically joining devices to a video conference |
US10375474B2 (en) | 2017-06-12 | 2019-08-06 | Cisco Technology, Inc. | Hybrid horn microphone |
US10477148B2 (en) | 2017-06-23 | 2019-11-12 | Cisco Technology, Inc. | Speaker anticipation |
US10516709B2 (en) | 2017-06-29 | 2019-12-24 | Cisco Technology, Inc. | Files automatically shared at conference initiation |
US10706391B2 (en) | 2017-07-13 | 2020-07-07 | Cisco Technology, Inc. | Protecting scheduled meeting in physical room |
US10091348B1 (en) | 2017-07-25 | 2018-10-02 | Cisco Technology, Inc. | Predictive model for voice/video over IP calls |
US10812430B2 (en) * | 2018-02-22 | 2020-10-20 | Mercury Universe, LLC | Method and system for creating a mercemoji |
US10580187B2 (en) * | 2018-05-01 | 2020-03-03 | Enas TARAWNEH | System and method for rendering of an animated avatar |
KR20210117066A (ko) * | 2020-03-18 | 2021-09-28 | 라인플러스 주식회사 | 음향 기반 아바타 모션 제어 방법 및 장치 |
CN111988658B (zh) * | 2020-08-28 | 2022-12-06 | 网易(杭州)网络有限公司 | 视频生成方法及装置 |
EP4216167A4 (fr) * | 2021-01-13 | 2024-05-01 | Samsung Electronics Co Ltd | Dispositif électronique et procédé de fonctionnement d'un service vidéo d'avatar |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6539354B1 (en) * | 2000-03-24 | 2003-03-25 | Fluent Speech Technologies, Inc. | Methods and devices for producing and using synthetic visual speech based on natural coarticulation |
US6839672B1 (en) * | 1998-01-30 | 2005-01-04 | At&T Corp. | Integration of talking heads and text-to-speech synthesizers for visual TTS |
GB2423905A (en) * | 2005-03-03 | 2006-09-06 | Sean Smith | Animated messaging |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1345179A3 (fr) * | 2002-03-13 | 2004-01-21 | Matsushita Electric Industrial Co., Ltd. | Procédé et dispositif pour l'animation des images de synthèse |
AU2003218320A1 (en) * | 2002-03-21 | 2003-10-08 | U.S. Army Medical Research And Materiel Command | Methods and systems for detecting, measuring, and monitoring stress in speech |
US7136818B1 (en) * | 2002-05-16 | 2006-11-14 | At&T Corp. | System and method of providing conversational visual prosody for talking heads |
US7983910B2 (en) * | 2006-03-03 | 2011-07-19 | International Business Machines Corporation | Communicating across voice and text channels with emotion preservation |
-
2006
- 2006-09-15 FR FR0608078A patent/FR2906056B1/fr not_active Expired - Fee Related
-
2007
- 2007-09-14 EP EP07848234A patent/EP2059926A2/fr not_active Withdrawn
- 2007-09-14 US US12/441,293 patent/US20090278851A1/en not_active Abandoned
- 2007-09-14 WO PCT/FR2007/001495 patent/WO2008031955A2/fr active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6839672B1 (en) * | 1998-01-30 | 2005-01-04 | At&T Corp. | Integration of talking heads and text-to-speech synthesizers for visual TTS |
US6539354B1 (en) * | 2000-03-24 | 2003-03-25 | Fluent Speech Technologies, Inc. | Methods and devices for producing and using synthetic visual speech based on natural coarticulation |
GB2423905A (en) * | 2005-03-03 | 2006-09-06 | Sean Smith | Animated messaging |
Non-Patent Citations (1)
Title |
---|
POLZIN, WAIBEL: "Detecting Emotions in Speech", PROCEEDINGS OF THE CSC, 31 December 1998 (1998-12-31), pages 1 - 7, XP002427820 * |
Also Published As
Publication number | Publication date |
---|---|
FR2906056B1 (fr) | 2009-02-06 |
WO2008031955A2 (fr) | 2008-03-20 |
FR2906056A1 (fr) | 2008-03-21 |
US20090278851A1 (en) | 2009-11-12 |
EP2059926A2 (fr) | 2009-05-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2008031955A3 (fr) | Procede et systeme d'animation d'un avatar en temps reel a partir de la voix d'un interlocuteur | |
NL2021308B1 (en) | Methods for a voice processing system | |
RS49875B (sr) | Sistem i postupak za slobodnu govornu komunikaciju pomoću mikrofonskog niza | |
WO2006028587A3 (fr) | Casque destine a separer des signaux vocaux dans un environnement bruyant | |
US8265292B2 (en) | Removing noise from audio | |
WO2008036950A3 (fr) | Traitement audio permettant d'améliorer l'utilisation | |
US8553906B2 (en) | Apparatus for enabling karaoke | |
WO2006071420A3 (fr) | Appareil et procede permettant de recevoir des entrees en provenance d'un utilisateur | |
CN109348338A (zh) | 一种耳机及其播放方法 | |
WO2007123946A3 (fr) | Système et procédé pour produire un son spécifique d'un emplacement dans un système de téléprésence | |
CN105960794A (zh) | 用于语音命令的智能蓝牙耳机 | |
WO2007013075A3 (fr) | Systeme voix/donnes synchronise | |
TW200601865A (en) | Sound pickup apparatus and method of the same | |
WO2006073501A3 (fr) | Visiophone ip | |
DE602005021546D1 (en) | Hung | |
CN107845386B (zh) | 声音信号处理方法、移动终端和服务器 | |
WO2008078624A1 (fr) | Dispositif de production en sortie de vidéo | |
WO2020139724A1 (fr) | Synthèse vocale basée sur le contexte | |
CN101931816A (zh) | 将3g手机视频转移至电视上的方法及系统 | |
CN104333649B (zh) | 在通信终端呈现语音消息的方法及设备 | |
JP2024505944A (ja) | 音声オーディオストリーム中断を処理するシステムおよび方法 | |
EP2741526A3 (fr) | Appareil audio et procédé de traitement de signal audio | |
WO2007095413A3 (fr) | Procede et appareil pour detecter des affects dans un discours | |
Vaughan et al. | Designing and implementing a platform for collecting multi-modal data of human-robot interaction | |
JP2006005440A (ja) | 通話送受信方法および通話端末 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2007848234 Country of ref document: EP |
|
WWE | Wipo information: entry into national phase |
Ref document number: 12441293 Country of ref document: US |