SE504177C2 - Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk - Google Patents
Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språkInfo
- Publication number
- SE504177C2 SE504177C2 SE9402284A SE9402284A SE504177C2 SE 504177 C2 SE504177 C2 SE 504177C2 SE 9402284 A SE9402284 A SE 9402284A SE 9402284 A SE9402284 A SE 9402284A SE 504177 C2 SE504177 C2 SE 504177C2
- Authority
- SE
- Sweden
- Prior art keywords
- model
- speech
- language
- information
- fundamental tone
- Prior art date
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1807—Speech classification or search using natural language modelling using prosody or stress
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/90—Pitch determination of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Priority Applications (7)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| SE9402284A SE504177C2 (sv) | 1994-06-29 | 1994-06-29 | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk |
| ES95925191T ES2152411T3 (es) | 1994-06-29 | 1995-06-13 | Metodo y dispositivo para adaptar un equipo de reconocimiento del habla a las variantes dialectales de una lengua. |
| EP95925191A EP0767950B1 (de) | 1994-06-29 | 1995-06-13 | Verfahren und vorrichtung zur anpassung eines spracherkenners an dialektische sprachvarianten |
| JP8503055A JPH10504404A (ja) | 1994-06-29 | 1995-06-13 | 音声認識のための方法および装置 |
| PCT/SE1995/000710 WO1996000962A2 (en) | 1994-06-29 | 1995-06-13 | Method and device for adapting a speech recognition equipment for dialectal variations in a language |
| US08/532,823 US5694520A (en) | 1994-06-29 | 1995-06-13 | Method and device for speech recognition |
| DE69519229T DE69519229T2 (de) | 1994-06-29 | 1995-06-13 | Verfahren und vorrichtung zur anpassung eines spracherkenners an dialektische sprachvarianten |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| SE9402284A SE504177C2 (sv) | 1994-06-29 | 1994-06-29 | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| SE9402284D0 SE9402284D0 (sv) | 1994-06-29 |
| SE9402284L SE9402284L (sv) | 1995-12-30 |
| SE504177C2 true SE504177C2 (sv) | 1996-12-02 |
Family
ID=20394556
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| SE9402284A SE504177C2 (sv) | 1994-06-29 | 1994-06-29 | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk |
Country Status (7)
| Country | Link |
|---|---|
| US (1) | US5694520A (de) |
| EP (1) | EP0767950B1 (de) |
| JP (1) | JPH10504404A (de) |
| DE (1) | DE69519229T2 (de) |
| ES (1) | ES2152411T3 (de) |
| SE (1) | SE504177C2 (de) |
| WO (1) | WO1996000962A2 (de) |
Families Citing this family (18)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |
| SE514684C2 (sv) * | 1995-06-16 | 2001-04-02 | Telia Ab | Metod vid tal-till-textomvandling |
| SE9601811L (sv) * | 1996-05-13 | 1997-11-03 | Telia Ab | Metod och system för tal-till-tal-omvandling med extrahering av prosodiinformation |
| SE519273C2 (sv) * | 1996-05-13 | 2003-02-11 | Telia Ab | Förbättringar av , eller med avseende på, tal-till-tal- omvandling |
| EP1051701B1 (de) * | 1998-02-03 | 2002-11-06 | Siemens Aktiengesellschaft | Verfahren zum übermitteln von sprachdaten |
| US6343270B1 (en) * | 1998-12-09 | 2002-01-29 | International Business Machines Corporation | Method for increasing dialect precision and usability in speech recognition and text-to-speech systems |
| EP1096470B1 (de) * | 1999-10-29 | 2005-04-06 | Matsushita Electric Industrial Co., Ltd. | Normalisierung der Grundfrequenz zur Spracherkennung |
| CN1159702C (zh) | 2001-04-11 | 2004-07-28 | 国际商业机器公司 | 具有情感的语音-语音翻译系统和方法 |
| US20040266337A1 (en) * | 2003-06-25 | 2004-12-30 | Microsoft Corporation | Method and apparatus for synchronizing lyrics |
| US7940897B2 (en) | 2005-06-24 | 2011-05-10 | American Express Travel Related Services Company, Inc. | Word recognition system and method for customer and employee assessment |
| JP4264841B2 (ja) * | 2006-12-01 | 2009-05-20 | ソニー株式会社 | 音声認識装置および音声認識方法、並びに、プログラム |
| JP4882899B2 (ja) * | 2007-07-25 | 2012-02-22 | ソニー株式会社 | 音声解析装置、および音声解析方法、並びにコンピュータ・プログラム |
| US8077836B2 (en) | 2008-07-30 | 2011-12-13 | At&T Intellectual Property, I, L.P. | Transparent voice registration and verification method and system |
| JP2015087649A (ja) * | 2013-10-31 | 2015-05-07 | シャープ株式会社 | 発話制御装置、方法、発話システム、プログラム、及び発話装置 |
| CN104464423A (zh) * | 2014-12-19 | 2015-03-25 | 科大讯飞股份有限公司 | 一种口语考试评测的校标优化方法及系统 |
| CN107170454B (zh) * | 2017-05-31 | 2022-04-05 | Oppo广东移动通信有限公司 | 语音识别方法及相关产品 |
| US11545132B2 (en) | 2019-08-28 | 2023-01-03 | International Business Machines Corporation | Speech characterization using a synthesized reference audio signal |
| CN110716523A (zh) * | 2019-11-06 | 2020-01-21 | 中水三立数据技术股份有限公司 | 一种基于语音识别的泵站智能决策系统及方法 |
Family Cites Families (5)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE13680C1 (de) | 1902-02-01 | |||
| SE12386C1 (de) | 1901-05-04 | |||
| US5268990A (en) * | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
| SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |
| JP3450411B2 (ja) * | 1994-03-22 | 2003-09-22 | キヤノン株式会社 | 音声情報処理方法及び装置 |
-
1994
- 1994-06-29 SE SE9402284A patent/SE504177C2/sv unknown
-
1995
- 1995-06-13 DE DE69519229T patent/DE69519229T2/de not_active Expired - Fee Related
- 1995-06-13 JP JP8503055A patent/JPH10504404A/ja not_active Ceased
- 1995-06-13 EP EP95925191A patent/EP0767950B1/de not_active Expired - Lifetime
- 1995-06-13 WO PCT/SE1995/000710 patent/WO1996000962A2/en not_active Ceased
- 1995-06-13 ES ES95925191T patent/ES2152411T3/es not_active Expired - Lifetime
- 1995-06-13 US US08/532,823 patent/US5694520A/en not_active Expired - Lifetime
Also Published As
| Publication number | Publication date |
|---|---|
| WO1996000962A3 (en) | 1996-02-22 |
| JPH10504404A (ja) | 1998-04-28 |
| EP0767950A2 (de) | 1997-04-16 |
| US5694520A (en) | 1997-12-02 |
| DE69519229D1 (de) | 2000-11-30 |
| EP0767950B1 (de) | 2000-10-25 |
| DE69519229T2 (de) | 2001-05-23 |
| WO1996000962A2 (en) | 1996-01-11 |
| SE9402284D0 (sv) | 1994-06-29 |
| SE9402284L (sv) | 1995-12-30 |
| ES2152411T3 (es) | 2001-02-01 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| EP0683483B1 (de) | Verfahren und Anordnung für die Umwandlung von Sprache in Text | |
| SE504177C2 (sv) | Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk | |
| CN109255113B (zh) | 智能校对系统 | |
| US5806033A (en) | Syllable duration and pitch variation to determine accents and stresses for speech recognition | |
| US5170432A (en) | Method of speaker adaptive speech recognition | |
| US7962341B2 (en) | Method and apparatus for labelling speech | |
| JPH06110494A (ja) | 発音学習装置 | |
| JPH07181997A (ja) | 韻律学的情報を自動的に抽出する方法および装置 | |
| SE506003C2 (sv) | Metod och system för tal-till-tal-omvandling med extrahering av prosodiinformation | |
| Tjalve et al. | Pronunciation variation modelling using accent features | |
| SE519273C2 (sv) | Förbättringar av , eller med avseende på, tal-till-tal- omvandling | |
| Fujisawa et al. | Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique. | |
| Chen et al. | Application of allophonic and lexical constraints in continuous digit recognition | |
| Samoulian | Knowledge based approach to speech recognition | |
| Mary et al. | Consonant-vowel based features for language identification | |
| JPH04350699A (ja) | テキスト音声合成装置 | |
| Hoge et al. | Syllable-based acoustic-phonetic decoding and wordhypotheses generation in fluently spoken speech | |
| JPH03217900A (ja) | テキスト音声合成装置 | |
| JP2737122B2 (ja) | 音声辞書作成装置 | |
| JPS61121167A (ja) | 区切り発声に基づく音声ワ−ドプロセツサ | |
| JPS6180298A (ja) | 音声認識装置 | |
| Wiese | The role of phonology in speech processing | |
| GB2328056A (en) | Generating context dependent sub-syllable models to recognize a tonal language | |
| Samsudin et al. | Constructing a Reusable Linguistic Resource for a Polyglot Speech Synthesis | |
| JPS60225271A (ja) | 音声入力仮名漢字変換装置 |