MX9504648A - Metodo y aparato para el procesamiento de imagenes, asistido por acustica. - Google Patents

Metodo y aparato para el procesamiento de imagenes, asistido por acustica.

Info

Publication number
MX9504648A
MX9504648A MX9504648A MX9504648A MX9504648A MX 9504648 A MX9504648 A MX 9504648A MX 9504648 A MX9504648 A MX 9504648A MX 9504648 A MX9504648 A MX 9504648A MX 9504648 A MX9504648 A MX 9504648A
Authority
MX
Mexico
Prior art keywords
rate
viseme sequence
response
speaker
audio
Prior art date
Application number
MX9504648A
Other languages
English (en)
Inventor
Homer H Chen
Wu Chou
Original Assignee
At & T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by At & T Corp filed Critical At & T Corp
Publication of MX9504648A publication Critical patent/MX9504648A/es

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/2053D [Three Dimensional] animation driven by audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T13/00Animation
    • G06T13/203D [Three Dimensional] animation
    • G06T13/403D [Three Dimensional] animation of characters, e.g. humans, animals or virtual beings
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/20Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video object coding

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • Image Processing (AREA)

Abstract

Se describe el procesamiento de imagen auxiliado mediante acustica. El procesamiento de imagen auxiliado mediante acustica se logra, de acuerdo con la invencion, mediante un nuevo método y aparato, en los cuales una señal de audio es obtenida por muestreo a una velocidad de muestreo en el dominio de audio; una primera secuencia de viseme es generada a una primera velocidad, en respuesta a la señal de audio muestreada, la primera velocidad corresponde a una velocidad de muestreo en el dominio de audio; la primera secuencia de viseme es transformada a una segunda secuencia de viseme a una segunda velocidad con el uso de un conjunto predeterminado de criterios, de transformacion, la segunda velocidad corresponde a una velocidad de cuadro en el dominio de video; y una imagen es procesada en respuesta a la segunda secuencia de viseme. En un ejemplo ilustrativo de la invencion, una imagen de video de una cara de un personaje humano que habla es animada con el uso de un modelo facial de cuadro alámbrico tridimensional, después de lo cual una textura superficial es representada o trazada. El modelo facial de cuadro alámbrico tridimensional es deformado estructuralmente en respuesta a una secuencia de viseme de velocidad transformada, extraída a partir de una señal hablada, de tal manera que la region de la boca de la imagen de video se mueve en correspondencia con el habla. Ventajosamente, la animacion es llevada a cabo en tiempo real, trabaja con cualquier altavoz, y no tiene limitaciones sobre el vocabulario, ni requiere ninguna accion especial por parte de la persona que habla.
MX9504648A 1994-11-07 1995-11-06 Metodo y aparato para el procesamiento de imagenes, asistido por acustica. MX9504648A (es)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US33528594A 1994-11-07 1994-11-07

Publications (1)

Publication Number Publication Date
MX9504648A true MX9504648A (es) 1997-02-28

Family

ID=23311104

Family Applications (1)

Application Number Title Priority Date Filing Date
MX9504648A MX9504648A (es) 1994-11-07 1995-11-06 Metodo y aparato para el procesamiento de imagenes, asistido por acustica.

Country Status (7)

Country Link
EP (1) EP0710929A3 (es)
JP (1) JPH08235384A (es)
KR (1) KR960018988A (es)
AU (1) AU3668095A (es)
CA (1) CA2162199A1 (es)
MX (1) MX9504648A (es)
TW (1) TW307090B (es)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0990973A (ja) * 1995-09-22 1997-04-04 Nikon Corp 音声処理装置
US6014625A (en) * 1996-12-30 2000-01-11 Daewoo Electronics Co., Ltd Method and apparatus for producing lip-movement parameters in a three-dimensional-lip-model
SE519679C2 (sv) 1997-03-25 2003-03-25 Telia Ab Metod vid talsyntes
SE520065C2 (sv) * 1997-03-25 2003-05-20 Telia Ab Anordning och metod för prosodigenerering vid visuell talsyntes
SE511927C2 (sv) * 1997-05-27 1999-12-20 Telia Ab Förbättringar i, eller med avseende på, visuell talsyntes
EP0960389B1 (en) * 1997-09-01 2005-04-27 Koninklijke Philips Electronics N.V. A method and apparatus for synchronizing a computer-animated model with an audio wave output
WO1999046734A1 (en) * 1998-03-11 1999-09-16 Entropic, Inc. Face synthesis system and methodology
IT1314671B1 (it) * 1998-10-07 2002-12-31 Cselt Centro Studi Lab Telecom Procedimento e apparecchiatura per l'animazione di un modellosintetizzato di volto umano pilotata da un segnale audio.
JP2003503925A (ja) * 1999-06-24 2003-01-28 コーニンクレッカ フィリップス エレクトロニクス エヌ ヴィ 情報ストリームのポスト同期
IT1320002B1 (it) * 2000-03-31 2003-11-12 Cselt Centro Studi Lab Telecom Procedimento per l'animazione di un modello sintetizzato di voltoumano pilotata da un segnale audio.
KR20020022504A (ko) * 2000-09-20 2002-03-27 박종만 3차원 캐릭터의 동작, 얼굴 표정, 립싱크 및 립싱크된음성 합성을 지원하는 3차원 동영상 저작 도구의 제작시스템 및 방법
US6662154B2 (en) * 2001-12-12 2003-12-09 Motorola, Inc. Method and system for information signal coding using combinatorial and huffman codes
EP1912175A1 (en) * 2006-10-09 2008-04-16 Muzlach AG System and method for generating a video signal
FR3033660A1 (fr) * 2015-03-12 2016-09-16 Univ De Lorraine Dispositif de traitement d'image
CA3151412A1 (en) * 2019-09-17 2021-03-25 Carl Adrian Woffenden System and method for talking avatar
EP3866117A4 (en) * 2019-12-26 2022-05-04 Zhejiang University VOICE CONTROLLED FACE ANIMATION GENERATION PROCESS

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4913539A (en) * 1988-04-04 1990-04-03 New York Institute Of Technology Apparatus and method for lip-synching animation
GB9019829D0 (en) * 1990-09-11 1990-10-24 British Telecomm Speech analysis and image synthesis
MY109854A (en) * 1992-12-21 1997-09-30 Casio Computer Co Ltd Object image display devices

Also Published As

Publication number Publication date
KR960018988A (ko) 1996-06-17
CA2162199A1 (en) 1996-05-08
EP0710929A2 (en) 1996-05-08
EP0710929A3 (en) 1996-07-03
AU3668095A (en) 1996-05-16
TW307090B (es) 1997-06-01
JPH08235384A (ja) 1996-09-13

Similar Documents

Publication Publication Date Title
MX9504648A (es) Metodo y aparato para el procesamiento de imagenes, asistido por acustica.
US5608839A (en) Sound-synchronized video system
EP1203352B1 (en) Method of animating a synthesised model of a human face driven by an acoustic signal
US5884267A (en) Automated speech alignment for image synthesis
US7433490B2 (en) System and method for real time lip synchronization
US5657426A (en) Method and apparatus for producing audio-visual synthetic speech
CA2285158A1 (en) A method and an apparatus for the animation, driven by an audio signal, of a synthesised model of human face
WO2007076278A2 (en) Method for animating a facial image using speech data
King et al. Creating speech-synchronized animation
KR20020022504A (ko) 3차원 캐릭터의 동작, 얼굴 표정, 립싱크 및 립싱크된음성 합성을 지원하는 3차원 동영상 저작 도구의 제작시스템 및 방법
AU4669201A (en) Character animation
US7257538B2 (en) Generating animation from visual and audio input
KR950035447A (ko) 음성 분석 자동화를 이용하는 비디오 신호 처리 시스템 및 그 방법
CN106570473A (zh) 基于机器人的聋哑人手语识别交互系统
Yargıç et al. A lip reading application on MS Kinect camera
Kalberer et al. Face animation based on observed 3d speech dynamics
Hong et al. iFACE: a 3D synthetic talking face
Sui et al. A 3D audio-visual corpus for speech recognition
CN117315102A (zh) 虚拟主播处理方法、装置、计算设备及存储介质
Morishima et al. Real-time facial action image synthesis system driven by speech and text
Lin et al. A speech driven talking head system based on a single face image
CN108109614A (zh) 一种新型的机器人带噪音语音识别装置及方法
Morishima et al. Speech-to-image media conversion based on VQ and neural network
EP0056507A1 (en) Apparatus and method for creating visual images of lip movements
JPH01190187A (ja) 画像伝送方式