SE514684C2 - Metod vid tal-till-textomvandling - Google Patents
Metod vid tal-till-textomvandlingInfo
- Publication number
- SE514684C2 SE514684C2 SE9502202A SE9502202A SE514684C2 SE 514684 C2 SE514684 C2 SE 514684C2 SE 9502202 A SE9502202 A SE 9502202A SE 9502202 A SE9502202 A SE 9502202A SE 514684 C2 SE514684 C2 SE 514684C2
- Authority
- SE
- Sweden
- Prior art keywords
- accent
- information
- speech
- words
- sentences
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 12
- 238000006243 chemical reaction Methods 0.000 title claims abstract description 6
- 239000000284 extract Substances 0.000 claims 1
- 238000013507 mapping Methods 0.000 description 3
- 230000005540 biological transmission Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000007619 statistical method Methods 0.000 description 2
- 238000010586 diagram Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000001228 spectrum Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1807—Speech classification or search using natural language modelling using prosody or stress
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Priority Applications (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9502202A SE514684C2 (sv) | 1995-06-16 | 1995-06-16 | Metod vid tal-till-textomvandling |
DE69618503T DE69618503T2 (de) | 1995-06-16 | 1996-06-04 | Spracherkennung für Tonsprachen |
DK96850108T DK0749109T3 (da) | 1995-06-16 | 1996-06-04 | Talegenkendelse for tonesprog |
EP96850108A EP0749109B1 (en) | 1995-06-16 | 1996-06-04 | Speech recognition for tonal languages |
NO19962463A NO316847B1 (no) | 1995-06-16 | 1996-06-12 | Fremgangsmate og anordning ved omvandling av tale til tekst |
JP8175484A JPH0922297A (ja) | 1995-06-16 | 1996-06-14 | 音声‐テキスト変換のための方法および装置 |
US08/665,728 US5806033A (en) | 1995-06-16 | 1996-06-17 | Syllable duration and pitch variation to determine accents and stresses for speech recognition |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
SE9502202A SE514684C2 (sv) | 1995-06-16 | 1995-06-16 | Metod vid tal-till-textomvandling |
Publications (3)
Publication Number | Publication Date |
---|---|
SE9502202D0 SE9502202D0 (sv) | 1995-06-16 |
SE9502202L SE9502202L (sv) | 1996-12-17 |
SE514684C2 true SE514684C2 (sv) | 2001-04-02 |
Family
ID=20398649
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
SE9502202A SE514684C2 (sv) | 1995-06-16 | 1995-06-16 | Metod vid tal-till-textomvandling |
Country Status (7)
Country | Link |
---|---|
US (1) | US5806033A (ja) |
EP (1) | EP0749109B1 (ja) |
JP (1) | JPH0922297A (ja) |
DE (1) | DE69618503T2 (ja) |
DK (1) | DK0749109T3 (ja) |
NO (1) | NO316847B1 (ja) |
SE (1) | SE514684C2 (ja) |
Families Citing this family (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH1039895A (ja) * | 1996-07-25 | 1998-02-13 | Matsushita Electric Ind Co Ltd | 音声合成方法および装置 |
KR100238189B1 (ko) * | 1997-10-16 | 2000-01-15 | 윤종용 | 다중 언어 tts장치 및 다중 언어 tts 처리 방법 |
JP4267101B2 (ja) | 1997-11-17 | 2009-05-27 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声識別装置、発音矯正装置およびこれらの方法 |
US7283973B1 (en) | 1998-10-07 | 2007-10-16 | Logic Tree Corporation | Multi-modal voice-enabled content access and delivery system |
US6941273B1 (en) | 1998-10-07 | 2005-09-06 | Masoud Loghmani | Telephony-data application interface apparatus and method for multi-modal access to data applications |
US6377927B1 (en) | 1998-10-07 | 2002-04-23 | Masoud Loghmani | Voice-optimized database system and method of using same |
WO2001003112A1 (en) * | 1999-07-06 | 2001-01-11 | James Quest | Speech recognition system and method |
AU763362B2 (en) * | 1999-07-06 | 2003-07-17 | James Quest | Speech recognition system and method |
US6526382B1 (en) | 1999-12-07 | 2003-02-25 | Comverse, Inc. | Language-oriented user interfaces for voice activated services |
US20080147404A1 (en) * | 2000-05-15 | 2008-06-19 | Nusuara Technologies Sdn Bhd | System and methods for accent classification and adaptation |
US7200142B1 (en) | 2001-02-08 | 2007-04-03 | Logic Tree Corporation | System for providing multi-phased, multi-modal access to content through voice and data devices |
US6948129B1 (en) | 2001-02-08 | 2005-09-20 | Masoud S Loghmani | Multi-modal, multi-path user interface for simultaneous access to internet data over multiple media |
US8000320B2 (en) * | 2001-02-08 | 2011-08-16 | Logic Tree Corporation | System for providing multi-phased, multi-modal access to content through voice and data devices |
ATE310302T1 (de) * | 2001-09-28 | 2005-12-15 | Cit Alcatel | Kommunikationsvorrichtung und verfahren zum senden und empfangen von sprachsignalen unter kombination eines spracherkennungsmodules mit einer kodiereinheit |
GB2388738B (en) | 2001-11-03 | 2004-06-02 | Dremedia Ltd | Time ordered indexing of audio data |
GB2381688B (en) | 2001-11-03 | 2004-09-22 | Dremedia Ltd | Time ordered indexing of audio-visual data |
US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
US7280968B2 (en) * | 2003-03-25 | 2007-10-09 | International Business Machines Corporation | Synthetically generated speech responses including prosodic characteristics of speech inputs |
US20050055197A1 (en) * | 2003-08-14 | 2005-03-10 | Sviatoslav Karavansky | Linguographic method of compiling word dictionaries and lexicons for the memories of electronic speech-recognition devices |
JP4264841B2 (ja) * | 2006-12-01 | 2009-05-20 | ソニー株式会社 | 音声認識装置および音声認識方法、並びに、プログラム |
US8315870B2 (en) * | 2007-08-22 | 2012-11-20 | Nec Corporation | Rescoring speech recognition hypothesis using prosodic likelihood |
US8401856B2 (en) * | 2010-05-17 | 2013-03-19 | Avaya Inc. | Automatic normalization of spoken syllable duration |
US9009049B2 (en) * | 2012-06-06 | 2015-04-14 | Spansion Llc | Recognition of speech with different accents |
US9966064B2 (en) | 2012-07-18 | 2018-05-08 | International Business Machines Corporation | Dialect-specific acoustic language modeling and speech recognition |
KR102084646B1 (ko) * | 2013-07-04 | 2020-04-14 | 삼성전자주식회사 | 음성 인식 장치 및 음성 인식 방법 |
US10468050B2 (en) | 2017-03-29 | 2019-11-05 | Microsoft Technology Licensing, Llc | Voice synthesized participatory rhyming chat bot |
US11809958B2 (en) * | 2020-06-10 | 2023-11-07 | Capital One Services, Llc | Systems and methods for automatic decision-making with user-configured criteria using multi-channel data inputs |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0356736B2 (ja) * | 1979-05-28 | 1991-08-29 | ||
JPH05197389A (ja) * | 1991-08-13 | 1993-08-06 | Toshiba Corp | 音声認識装置 |
SE500277C2 (sv) * | 1993-05-10 | 1994-05-24 | Televerket | Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk |
SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |
SE504177C2 (sv) * | 1994-06-29 | 1996-12-02 | Telia Ab | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk |
-
1995
- 1995-06-16 SE SE9502202A patent/SE514684C2/sv unknown
-
1996
- 1996-06-04 DE DE69618503T patent/DE69618503T2/de not_active Expired - Fee Related
- 1996-06-04 EP EP96850108A patent/EP0749109B1/en not_active Expired - Lifetime
- 1996-06-04 DK DK96850108T patent/DK0749109T3/da active
- 1996-06-12 NO NO19962463A patent/NO316847B1/no unknown
- 1996-06-14 JP JP8175484A patent/JPH0922297A/ja active Pending
- 1996-06-17 US US08/665,728 patent/US5806033A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
DE69618503D1 (de) | 2002-02-21 |
EP0749109A2 (en) | 1996-12-18 |
US5806033A (en) | 1998-09-08 |
EP0749109A3 (en) | 1998-04-29 |
SE9502202D0 (sv) | 1995-06-16 |
EP0749109B1 (en) | 2002-01-16 |
NO316847B1 (no) | 2004-06-01 |
SE9502202L (sv) | 1996-12-17 |
JPH0922297A (ja) | 1997-01-21 |
NO962463L (no) | 1996-12-17 |
DK0749109T3 (da) | 2002-03-25 |
NO962463D0 (no) | 1996-06-12 |
DE69618503T2 (de) | 2002-08-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
SE514684C2 (sv) | Metod vid tal-till-textomvandling | |
EP0683483B1 (en) | A method and arrangement for speech to text conversion | |
Norris et al. | The possible-word constraint in the segmentation of continuous speech | |
US7937262B2 (en) | Method, apparatus, and computer program product for machine translation | |
US7962341B2 (en) | Method and apparatus for labelling speech | |
CN106297800B (zh) | 一种自适应的语音识别的方法和设备 | |
JP2559998B2 (ja) | 音声認識装置及びラベル生成方法 | |
Warnke et al. | Integrated dialog act segmentation and classification using prosodic features and language models. | |
JPH0423799B2 (ja) | ||
CN104464751B (zh) | 发音韵律问题的检测方法及装置 | |
JP2001100781A (ja) | 音声処理装置および音声処理方法、並びに記録媒体 | |
ATE389225T1 (de) | Spracherkennung | |
KR20060052663A (ko) | 음운 기반의 음성 인식 시스템 및 방법 | |
EP1095371A1 (en) | Language independent speech recognition | |
US5694520A (en) | Method and device for speech recognition | |
US8870575B2 (en) | Language learning system, language learning method, and computer program product thereof | |
Conkie et al. | Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events | |
KR100930714B1 (ko) | 음성인식 장치 및 방법 | |
JPH06110494A (ja) | 発音学習装置 | |
CN115424604B (zh) | 一种基于对抗生成网络的语音合成模型的训练方法 | |
Taylor et al. | Using prosodic information to constrain language models for spoken dialogue | |
NO318557B1 (no) | Fremgangsmate og system for tale-til-taleomforming | |
SE519273C2 (sv) | Förbättringar av , eller med avseende på, tal-till-tal- omvandling | |
Holmes et al. | Why have HMMs been so successful for automatic speech recognition and how might they be improved | |
O'Brien | Knowledge-based systems in speech recognition: a survey |