SE511927C2 - Förbättringar i, eller med avseende på, visuell talsyntes (Improvements in, or relating to, visual speech synthesis)
- Publication number: SE511927C2
- Authority: SE (Sweden)
- Prior art keywords: acoustic, mouth, speaker, points, units
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/24—Speech recognition using non-acoustical features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transforming into visible information
- G10L2021/105—Synthesis of the lips movements from speech, e.g. for talking heads
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M2201/00—Electronic components, circuits, software, systems or apparatus used in telephone systems
- H04M2201/40—Electronic components, circuits, software, systems or apparatus used in telephone systems using speech recognition
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
- H04M3/567—Multimedia conference systems
Landscapes
- Engineering & Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Physics & Mathematics (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Theoretical Computer Science (AREA)
- Processing Or Creating Images (AREA)
- Photoreceptors In Electrophotography (AREA)
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9701977A SE511927C2 (sv) | 1997-05-27 | 1997-05-27 | Förbättringar i, eller med avseende på, visuell talsyntes |
PCT/SE1998/000710 WO1998054696A1 (en) | 1997-05-27 | 1998-04-20 | Improvements in, or relating to, visual speech synthesis |
DE69816078T DE69816078T2 (de) | 1997-05-27 | 1998-04-20 | Verbesserungen im bezug auf visuelle sprachsynthese |
DK98917918T DK0983575T3 (da) | 1997-05-27 | 1998-04-20 | Forbedringer af eller vedrørende visuel talesyntese |
EEP199900542A EE03634B1 (et) | 1997-05-27 | 1998-04-20 | Visuaalse kõnesünteesi alased või sellega seotud täiustused |
EP98917918A EP0983575B1 (de) | 1997-05-27 | 1998-04-20 | Verbesserungen im bezug auf visuelle sprachsynthese |
NO19995673A NO317598B1 (no) | 1997-05-27 | 1999-11-19 | Fremgangsmate og apparat for frembringelse av visuell talesyntese |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9701977A SE511927C2 (sv) | 1997-05-27 | 1997-05-27 | Förbättringar i, eller med avseende på, visuell talsyntes |
Publications (3)
Publication Number | Publication Date |
---|---|
SE9701977D0 (sv) | 1997-05-27 |
SE9701977L (sv) | 1998-11-28 |
SE511927C2 (sv) | 1999-12-20 |
Family ID: 20407101
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9701977A SE511927C2 (sv) | 1997-05-27 | 1997-05-27 | Förbättringar i, eller med avseende på, visuell talsyntes |
Country Status (7)
Country | Link |
---|---|
EP (1) | EP0983575B1 (de) |
DE (1) | DE69816078T2 (de) |
DK (1) | DK0983575T3 (da) |
EE (1) | EE03634B1 (et) |
NO (1) | NO317598B1 (no) |
SE (1) | SE511927C2 (sv) |
WO (1) | WO1998054696A1 (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007007228A2 (en) | 2005-07-11 | 2007-01-18 | Philips Intellectual Property & Standards Gmbh | Method for communication and communication device |
CA2632742C (en) | 2005-11-10 | 2013-10-15 | Basf Se | Fungicidal mixtures comprising a ternary combination of triticonazole, pyraclostrobin and metalaxyl-m |
US9956407B2 (en) | 2014-08-04 | 2018-05-01 | Cochlear Limited | Tonal deafness compensation in an auditory prosthesis system |
US10534955B2 (en) * | 2016-01-22 | 2020-01-14 | Dreamworks Animation L.L.C. | Facial capture analysis and training system |
CN106067989B (zh) * | 2016-04-28 | 2022-05-17 | 江苏大学 | 一种人像语音视频同步校准装置及方法 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5621858A (en) * | 1992-05-26 | 1997-04-15 | Ricoh Corporation | Neural network acoustic and visual speech recognition system training method and apparatus |
US5482048A (en) * | 1993-06-30 | 1996-01-09 | University Of Pittsburgh | System and method for measuring and quantitating facial movements |
US5657426A (en) * | 1994-06-10 | 1997-08-12 | Digital Equipment Corporation | Method and apparatus for producing audio-visual synthetic speech |
AU3668095A (en) * | 1994-11-07 | 1996-05-16 | At & T Corporation | Acoustic-assisted image processing |
SE519244C2 (sv) * | 1995-12-06 | 2003-02-04 | Telia Ab | Anordning och metod vid talsyntes |
- 1997-05-27: SE, application SE9701977A, patent SE511927C2 (sv), status unknown
- 1998-04-20: DK, application DK98917918T, patent DK0983575T3 (da), active
- 1998-04-20: EP, application EP98917918A, patent EP0983575B1 (de), not active (Expired - Lifetime)
- 1998-04-20: WO, application PCT/SE1998/000710, patent WO1998054696A1 (en), active (IP Right Grant)
- 1998-04-20: DE, application DE69816078T, patent DE69816078T2 (de), not active (Expired - Fee Related)
- 1998-04-20: EE, application EEP199900542A, patent EE03634B1 (et), not active (IP Right Cessation)
- 1999-11-19: NO, application NO19995673A, patent NO317598B1 (no), status unknown
Also Published As
Publication number | Publication date |
---|---|
SE9701977L (sv) | 1998-11-28 |
DE69816078T2 (de) | 2004-05-13 |
EP0983575B1 (de) | 2003-07-02 |
EE9900542A (et) | 2000-06-15 |
EP0983575A1 (de) | 2000-03-08 |
WO1998054696A1 (en) | 1998-12-03 |
DE69816078D1 (de) | 2003-08-07 |
NO995673L (no) | 2000-01-25 |
EE03634B1 (et) | 2002-02-15 |
NO317598B1 (no) | 2004-11-22 |
NO995673D0 (no) | 1999-11-19 |
SE9701977D0 (sv) | 1997-05-27 |
DK0983575T3 (da) | 2003-10-27 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
US7676372B1 (en) | Prosthetic hearing device that transforms a detected speech into a speech of a speech form assistive in understanding the semantic meaning in the detected speech | |
Rosenblum et al. | An audiovisual test of kinematic primitives for visual speech perception. | |
Lavagetto | Converting speech into lip movements: A multimedia telephone for hard of hearing people | |
Jiang et al. | On the relationship between face movements, tongue movements, and speech acoustics | |
Tran et al. | Improvement to a NAM-captured whisper-to-speech system | |
JP3670180B2 (ja) | 補聴器 | |
Kim et al. | Hearing speech in noise: Seeing a loud talker is better | |
Barker et al. | Evidence of correlation between acoustic and visual features of speech | |
Salvi et al. | SynFace—speech-driven facial animation for virtual speech-reading support | |
SE511927C2 (sv) | Förbättringar i, eller med avseende på, visuell talsyntes | |
JP4381404B2 (ja) | 音声合成システム、音声合成方法、音声合成プログラム | |
Patel et al. | Teachable interfaces for individuals with dysarthric speech and severe physical disabilities | |
Olives et al. | Audio-visual speech synthesis for finnish | |
Adjoudani et al. | A multimedia platform for audio-visual speech processing | |
Lavagetto | Multimedia Telephone for Hearing-Impaired People | |
Bastanfard et al. | A comprehensive audio-visual corpus for teaching sound persian phoneme articulation | |
Beskow et al. | Visualization of speech and audio for hearing impaired persons | |
Agelfors et al. | Synthetic visual speech driven from auditory speech | |
Beautemps et al. | Telma: Telephony for the hearing-impaired people. from models to user tests | |
Kumar et al. | Real time detection and conversion of gestures to text and speech to sign system | |
KR20150075502A (ko) | 발음 학습 지원 시스템 및 그 시스템의 발음 학습 지원 방법 | |
Goecke | A stereo vision lip tracking algorithm and subsequent statistical analyses of the audio-video correlation in Australian English | |
Hatzis et al. | Optical logo-therapy (OLT): a computer-based real time visual feedback application for speech training. | |
Beskow et al. | Analysis and synthesis of multimodal verbal and non-verbal interaction for animated interface agents | |
Engwall et al. | Are real tongue movements easier to speech read than synthesized? |