SE9701977L - Förbättringar i, eller med avseende på, optisk talsyntes - Google Patents

Förbättringar i, eller med avseende på, optisk talsyntes

Info

Publication number
SE9701977L
SE9701977L SE9701977A SE9701977A SE9701977L SE 9701977 L SE9701977 L SE 9701977L SE 9701977 A SE9701977 A SE 9701977A SE 9701977 A SE9701977 A SE 9701977A SE 9701977 L SE9701977 L SE 9701977L
Authority
SE
Sweden
Prior art keywords
speaker
units
facial
mouth
around
Prior art date
Application number
SE9701977A
Other languages
English (en)
Other versions
SE511927C2 (sv
SE9701977D0 (sv
Inventor
Mats Ljungqvist
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia Ab filed Critical Telia Ab
Priority to SE9701977A priority Critical patent/SE511927C2/sv
Publication of SE9701977D0 publication Critical patent/SE9701977D0/sv
Priority to DE69816078T priority patent/DE69816078T2/de
Priority to EEP199900542A priority patent/EE03634B1/xx
Priority to EP98917918A priority patent/EP0983575B1/en
Priority to PCT/SE1998/000710 priority patent/WO1998054696A1/en
Priority to DK98917918T priority patent/DK0983575T3/da
Publication of SE9701977L publication Critical patent/SE9701977L/sv
Priority to NO19995673A priority patent/NO317598B1/no
Publication of SE511927C2 publication Critical patent/SE511927C2/sv

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/24Speech recognition using non-acoustical features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168Feature extraction; Face representation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10Transforming into visible information
    • G10L2021/105Synthesis of the lips movements from speech, e.g. for talking heads
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M2201/00Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M3/00Automatic or semi-automatic exchanges
    • H04M3/42Systems providing special services or facilities to subscribers
    • H04M3/56Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/567Multimedia conference systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • Photoreceptors In Electrophotography (AREA)
SE9701977A 1997-05-27 1997-05-27 Förbättringar i, eller med avseende på, visuell talsyntes SE511927C2 (sv)

Priority Applications (7)

Application Number Priority Date Filing Date Title
SE9701977A SE511927C2 (sv) 1997-05-27 1997-05-27 Förbättringar i, eller med avseende på, visuell talsyntes
DE69816078T DE69816078T2 (de) 1997-05-27 1998-04-20 Verbesserungen im bezug auf visuelle sprachsynthese
EEP199900542A EE03634B1 (et) 1997-05-27 1998-04-20 Visuaalse kõnesünteesi alased või sellega seotud täiustused
EP98917918A EP0983575B1 (en) 1997-05-27 1998-04-20 Improvements in, or relating to, visual speech synthesis
PCT/SE1998/000710 WO1998054696A1 (en) 1997-05-27 1998-04-20 Improvements in, or relating to, visual speech synthesis
DK98917918T DK0983575T3 (da) 1997-05-27 1998-04-20 Forbedringer af eller vedrørende visuel talesyntese
NO19995673A NO317598B1 (no) 1997-05-27 1999-11-19 Fremgangsmate og apparat for frembringelse av visuell talesyntese

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9701977A SE511927C2 (sv) 1997-05-27 1997-05-27 Förbättringar i, eller med avseende på, visuell talsyntes

Publications (3)

Publication Number Publication Date
SE9701977D0 SE9701977D0 (sv) 1997-05-27
SE9701977L true SE9701977L (sv) 1998-11-28
SE511927C2 SE511927C2 (sv) 1999-12-20

Family

ID=20407101

Family Applications (1)

Application Number Title Priority Date Filing Date
SE9701977A SE511927C2 (sv) 1997-05-27 1997-05-27 Förbättringar i, eller med avseende på, visuell talsyntes

Country Status (7)

Country Link
EP (1) EP0983575B1 (sv)
DE (1) DE69816078T2 (sv)
DK (1) DK0983575T3 (sv)
EE (1) EE03634B1 (sv)
NO (1) NO317598B1 (sv)
SE (1) SE511927C2 (sv)
WO (1) WO1998054696A1 (sv)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101268507A (zh) 2005-07-11 2008-09-17 皇家飞利浦电子股份有限公司 用于通信的方法以及通信设备
CN101304655B (zh) 2005-11-10 2014-12-10 巴斯夫欧洲公司 杀真菌混合物
US9956407B2 (en) 2014-08-04 2018-05-01 Cochlear Limited Tonal deafness compensation in an auditory prosthesis system
US10534955B2 (en) * 2016-01-22 2020-01-14 Dreamworks Animation L.L.C. Facial capture analysis and training system
CN106067989B (zh) * 2016-04-28 2022-05-17 江苏大学 一种人像语音视频同步校准装置及方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5621858A (en) * 1992-05-26 1997-04-15 Ricoh Corporation Neural network acoustic and visual speech recognition system training method and apparatus
US5482048A (en) * 1993-06-30 1996-01-09 University Of Pittsburgh System and method for measuring and quantitating facial movements
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
KR960018988A (ko) * 1994-11-07 1996-06-17 엠, 케이. 영 음향 보조 영상 처리 방법 및 장치
SE519244C2 (sv) * 1995-12-06 2003-02-04 Telia Ab Anordning och metod vid talsyntes

Also Published As

Publication number Publication date
NO317598B1 (no) 2004-11-22
DK0983575T3 (da) 2003-10-27
EP0983575B1 (en) 2003-07-02
EP0983575A1 (en) 2000-03-08
DE69816078D1 (de) 2003-08-07
SE511927C2 (sv) 1999-12-20
EE9900542A (et) 2000-06-15
NO995673L (no) 2000-01-25
DE69816078T2 (de) 2004-05-13
EE03634B1 (et) 2002-02-15
SE9701977D0 (sv) 1997-05-27
NO995673D0 (no) 1999-11-19
WO1998054696A1 (en) 1998-12-03

Similar Documents

Publication Publication Date Title
Munhall et al. The moving face during speech communication
Brooke et al. Analysis, synthesis, and perception of visible articulatory movements
Attina et al. A pilot study of temporal organization in Cued Speech production of French syllables: rules for a Cued Speech synthesizer
Rosenblum et al. An audiovisual test of kinematic primitives for visual speech perception.
CN106205633B (zh) 一种模仿、表演练习打分系统
Recasens Long range coarticulation effects for tongue dorsum contact in VCVCV sequences
Kuratate et al. Kinematics-based synthesis of realistic talking faces
EP0860811A3 (en) Automated speech alignment for image synthesis
WO1997010573A3 (en) Method and system for displaying a graphic image of a person modeling a garment
SE9504367D0 (sv) Anordning och metod vid talsyntes
Jones et al. BEHAVIORAL ENGINEERING: STUTTERING AS A FUNCTION OF STIMULUS DURATION DURING SPEECH SYNCHRONIZATION 1
SE9701977L (sv) Förbättringar i, eller med avseende på, optisk talsyntes
Lee et al. MAScreen: Augmenting Speech with Visual Cues of Lip Motions, Facial Expressions, and Text Using a Wearable Display
Erber et al. Voice/mouth synthesis and tactual/visual perception of/pa, ba, ma
Guiard-Marigny et al. 3D models of the lips and jaw for visual speech synthesis
KR102202654B1 (ko) 안면 움직임 정보 추출 방법 및 장치
JP3059022B2 (ja) 動画像表示装置
Kolesnik Conducting gesture recognition, analysis and performance system
Hietanen et al. Does audiovisual speech perception use information about facial configuration?
Caldognetto et al. Automatic analysis of lips and jaw kinematics in VCV sequences.
Hall et al. Using optical flow analysis on ultrasound of the tongue to examine phonological relationships
Huet et al. Shape retrieval by inexact graph matching
Le Goff et al. Analysis-synthesis and intelligibility of a talking face
Barbosa et al. Temporal characterization of auditory-visual coupling in speech
JP2002143112A (ja) 三次元画像解析システム