NO317598B1 - Method and apparatus for producing visual speech synthesis - Google Patents

Method and apparatus for producing visual speech synthesis

Publication number
NO317598B1
Authority
NO
Norway
Prior art keywords
acoustic
mouth
speaker
speech
constituent element
Prior art date
Application number
NO19995673A
Other languages
English (en)
Norwegian (no)
Other versions
NO995673L (no)
NO995673D0 (no)
Inventor
Mats Ljungqvist
Original Assignee
Teliasonera Ab Publ
Priority date
1997-05-27
Filing date
1999-11-19
Publication date
2004-11-22
Application filed by Teliasonera Ab Publ
Publication of NO995673D0
Publication of NO995673L
Publication of NO317598B1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/24 Speech recognition using non-acoustical features
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00 Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10 Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • G06V40/16 Human faces, e.g. facial parts, sketches or expressions
    • G06V40/168 Feature extraction; Face representation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/06 Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
    • G10L21/10 Transforming into visible information
    • G10L2021/105 Synthesis of the lips movements from speech, e.g. for talking heads
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M2201/00 Electronic components, circuits, software, systems or apparatus used in telephone systems
    • H04M2201/40 Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/567 Multimedia conference systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Physics & Mathematics (AREA)
  • Oral & Maxillofacial Surgery (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Theoretical Computer Science (AREA)
  • Processing Or Creating Images (AREA)
  • Photoreceptors In Electrophotography (AREA)
NO19995673A 1997-05-27 1999-11-19 Method and apparatus for producing visual speech synthesis NO317598B1 (no)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
SE9701977A SE511927C2 (sv) 1997-05-27 1997-05-27 Improvements in, or relating to, visual speech synthesis
PCT/SE1998/000710 WO1998054696A1 (en) 1997-05-27 1998-04-20 Improvements in, or relating to, visual speech synthesis

Publications (3)

Publication Number Publication Date
NO995673D0 NO995673D0 (no) 1999-11-19
NO995673L NO995673L (no) 2000-01-25
NO317598B1 NO317598B1 (no) 2004-11-22

Family

ID=20407101

Family Applications (1)

Application Number Title Priority Date Filing Date
NO19995673A NO317598B1 (no) 1997-05-27 1999-11-19 Method and apparatus for producing visual speech synthesis

Country Status (7)

Country Link
EP (1) EP0983575B1 (de)
DE (1) DE69816078T2 (de)
DK (1) DK0983575T3 (de)
EE (1) EE03634B1 (de)
NO (1) NO317598B1 (de)
SE (1) SE511927C2 (de)
WO (1) WO1998054696A1 (de)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101268507A (zh) 2005-07-11 2008-09-17 Koninklijke Philips Electronics N.V. Method for communication and communication device
CN101304655B (zh) 2005-11-10 2014-12-10 BASF SE Fungicidal mixtures
US9956407B2 (en) 2014-08-04 2018-05-01 Cochlear Limited Tonal deafness compensation in an auditory prosthesis system
US10534955B2 (en) * 2016-01-22 2020-01-14 Dreamworks Animation L.L.C. Facial capture analysis and training system
CN106067989B (zh) * 2016-04-28 2022-05-17 Jiangsu University Device and method for synchronous calibration of portrait voice and video

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5621858A (en) * 1992-05-26 1997-04-15 Ricoh Corporation Neural network acoustic and visual speech recognition system training method and apparatus
US5482048A (en) * 1993-06-30 1996-01-09 University Of Pittsburgh System and method for measuring and quantitating facial movements
US5657426A (en) * 1994-06-10 1997-08-12 Digital Equipment Corporation Method and apparatus for producing audio-visual synthetic speech
KR960018988A (ko) * 1994-11-07 1996-06-17 M. K. Young Sound-assisted image processing method and apparatus
SE519244C2 (sv) * 1995-12-06 2003-02-04 Telia Ab Device and method for speech synthesis

Also Published As

Publication number Publication date
DK0983575T3 (da) 2003-10-27
EP0983575B1 (de) 2003-07-02
EP0983575A1 (de) 2000-03-08
DE69816078D1 (de) 2003-08-07
SE9701977L (sv) 1998-11-28
SE511927C2 (sv) 1999-12-20
EE9900542A (et) 2000-06-15
NO995673L (no) 2000-01-25
DE69816078T2 (de) 2004-05-13
EE03634B1 (et) 2002-02-15
SE9701977D0 (sv) 1997-05-27
NO995673D0 (no) 1999-11-19
WO1998054696A1 (en) 1998-12-03

Similar Documents

Publication Publication Date Title
Rosenblum et al. An audiovisual test of kinematic primitives for visual speech perception.
Munhall et al. The moving face during speech communication
Hanson et al. Towards models of phonation
JP3893763B2 (ja) Voice detection device
Cvejic et al. Prosody off the top of the head: Prosodic contrasts can be discriminated by head motion
Livingstone et al. Head movements encode emotions during speech and song.
CN107301863A (zh) Speech disorder rehabilitation method and rehabilitation training system for deaf-mute children
Campbell The lateralization of lip-read sounds: A first look
Vatakis et al. Assessing the effect of physical differences in the articulation of consonants and vowels on audiovisual temporal perception
Freitas et al. An introduction to silent speech interfaces
Smith et al. Infant-directed visual prosody: Mothers’ head movements and speech acoustics
NO317598B1 (no) Method and apparatus for producing visual speech synthesis
JP2007018006A (ja) Speech synthesis system, speech synthesis method, and speech synthesis program
Bicevskis et al. Effects of mouthing and interlocutor presence on movements of visible vs. non-visible articulators
Yip Phonetic effects on the timing of gestural coordination in Modern Greek consonant clusters
Zellou Similarity and enhancement: Nasality from Moroccan Arabic pharyngeals and nasals
Beskow et al. Visualization of speech and audio for hearing impaired persons
Öster Computer-based speech therapy using visual feedback with focus on children with profound hearing impairments
McGarr et al. Ephphatha: Opening Inroads to Understanding Articulatory Organization in Persons with Hearing Impairment
WO2022024355A1 (ja) Emotion analysis system
Lavagetto Multimedia Telephone for Hearing-Impaired People
Wu et al. Development and evaluation of on/off control for electrolaryngeal speech via artificial neural network based on visual information of lips
Dahmani et al. Some consideration on expressive audiovisual speech corpus acquisition using a multimodal platform
Chen et al. Investigating the relationship between glottal area waveform shape and harmonic magnitudes through computational modeling and laryngeal high-speed videoendoscopy.
Rothenberg Rethinking nasalance and nasal emission