SE504177C2 - Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk - Google Patents

Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk

Info

Publication number
SE504177C2
SE504177C2 SE9402284A SE9402284A SE504177C2 SE 504177 C2 SE504177 C2 SE 504177C2 SE 9402284 A SE9402284 A SE 9402284A SE 9402284 A SE9402284 A SE 9402284A SE 504177 C2 SE504177 C2 SE 504177C2
Authority
SE
Sweden
Prior art keywords
model
speech
language
information
fundamental tone
Prior art date
Application number
SE9402284A
Other languages
English (en)
Swedish (sv)
Other versions
SE9402284D0 (sv
SE9402284L (sv
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia Ab filed Critical Telia Ab
Priority to SE9402284A priority Critical patent/SE504177C2/sv
Publication of SE9402284D0 publication Critical patent/SE9402284D0/xx
Priority to ES95925191T priority patent/ES2152411T3/es
Priority to EP95925191A priority patent/EP0767950B1/de
Priority to JP8503055A priority patent/JPH10504404A/ja
Priority to PCT/SE1995/000710 priority patent/WO1996000962A2/en
Priority to US08/532,823 priority patent/US5694520A/en
Priority to DE69519229T priority patent/DE69519229T2/de
Publication of SE9402284L publication Critical patent/SE9402284L/xx
Publication of SE504177C2 publication Critical patent/SE504177C2/sv

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1807Speech classification or search using natural language modelling using prosody or stress
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • G10L2015/025Phonemes, fenemes or fenones being the recognition units
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
SE9402284A 1994-06-29 1994-06-29 Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk SE504177C2 (sv)

Priority Applications (7)

Application Number Priority Date Filing Date Title
SE9402284A SE504177C2 (sv) 1994-06-29 1994-06-29 Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk
ES95925191T ES2152411T3 (es) 1994-06-29 1995-06-13 Metodo y dispositivo para adaptar un equipo de reconocimiento del habla a las variantes dialectales de una lengua.
EP95925191A EP0767950B1 (de) 1994-06-29 1995-06-13 Verfahren und vorrichtung zur anpassung eines spracherkenners an dialektische sprachvarianten
JP8503055A JPH10504404A (ja) 1994-06-29 1995-06-13 音声認識のための方法および装置
PCT/SE1995/000710 WO1996000962A2 (en) 1994-06-29 1995-06-13 Method and device for adapting a speech recognition equipment for dialectal variations in a language
US08/532,823 US5694520A (en) 1994-06-29 1995-06-13 Method and device for speech recognition
DE69519229T DE69519229T2 (de) 1994-06-29 1995-06-13 Verfahren und vorrichtung zur anpassung eines spracherkenners an dialektische sprachvarianten

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9402284A SE504177C2 (sv) 1994-06-29 1994-06-29 Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk

Publications (3)

Publication Number Publication Date
SE9402284D0 SE9402284D0 (sv) 1994-06-29
SE9402284L SE9402284L (sv) 1995-12-30
SE504177C2 true SE504177C2 (sv) 1996-12-02

Family

ID=20394556

Family Applications (1)

Application Number Title Priority Date Filing Date
SE9402284A SE504177C2 (sv) 1994-06-29 1994-06-29 Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk

Country Status (7)

Country Link
US (1) US5694520A (de)
EP (1) EP0767950B1 (de)
JP (1) JPH10504404A (de)
DE (1) DE69519229T2 (de)
ES (1) ES2152411T3 (de)
SE (1) SE504177C2 (de)
WO (1) WO1996000962A2 (de)

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE516526C2 (sv) * 1993-11-03 2002-01-22 Telia Ab Metod och anordning vid automatisk extrahering av prosodisk information
SE514684C2 (sv) * 1995-06-16 2001-04-02 Telia Ab Metod vid tal-till-textomvandling
SE9601811L (sv) * 1996-05-13 1997-11-03 Telia Ab Metod och system för tal-till-tal-omvandling med extrahering av prosodiinformation
SE519273C2 (sv) * 1996-05-13 2003-02-11 Telia Ab Förbättringar av , eller med avseende på, tal-till-tal- omvandling
EP1051701B1 (de) * 1998-02-03 2002-11-06 Siemens Aktiengesellschaft Verfahren zum übermitteln von sprachdaten
US6343270B1 (en) * 1998-12-09 2002-01-29 International Business Machines Corporation Method for increasing dialect precision and usability in speech recognition and text-to-speech systems
EP1096470B1 (de) * 1999-10-29 2005-04-06 Matsushita Electric Industrial Co., Ltd. Normalisierung der Grundfrequenz zur Spracherkennung
CN1159702C (zh) 2001-04-11 2004-07-28 国际商业机器公司 具有情感的语音-语音翻译系统和方法
US20040266337A1 (en) * 2003-06-25 2004-12-30 Microsoft Corporation Method and apparatus for synchronizing lyrics
US7940897B2 (en) 2005-06-24 2011-05-10 American Express Travel Related Services Company, Inc. Word recognition system and method for customer and employee assessment
JP4264841B2 (ja) * 2006-12-01 2009-05-20 ソニー株式会社 音声認識装置および音声認識方法、並びに、プログラム
JP4882899B2 (ja) * 2007-07-25 2012-02-22 ソニー株式会社 音声解析装置、および音声解析方法、並びにコンピュータ・プログラム
US8077836B2 (en) 2008-07-30 2011-12-13 At&T Intellectual Property, I, L.P. Transparent voice registration and verification method and system
JP2015087649A (ja) * 2013-10-31 2015-05-07 シャープ株式会社 発話制御装置、方法、発話システム、プログラム、及び発話装置
CN104464423A (zh) * 2014-12-19 2015-03-25 科大讯飞股份有限公司 一种口语考试评测的校标优化方法及系统
CN107170454B (zh) * 2017-05-31 2022-04-05 Oppo广东移动通信有限公司 语音识别方法及相关产品
US11545132B2 (en) 2019-08-28 2023-01-03 International Business Machines Corporation Speech characterization using a synthesized reference audio signal
CN110716523A (zh) * 2019-11-06 2020-01-21 中水三立数据技术股份有限公司 一种基于语音识别的泵站智能决策系统及方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE13680C1 (de) 1902-02-01
SE12386C1 (de) 1901-05-04
US5268990A (en) * 1991-01-31 1993-12-07 Sri International Method for recognizing speech using linguistically-motivated hidden Markov models
SE516526C2 (sv) * 1993-11-03 2002-01-22 Telia Ab Metod och anordning vid automatisk extrahering av prosodisk information
JP3450411B2 (ja) * 1994-03-22 2003-09-22 キヤノン株式会社 音声情報処理方法及び装置

Also Published As

Publication number Publication date
WO1996000962A3 (en) 1996-02-22
JPH10504404A (ja) 1998-04-28
EP0767950A2 (de) 1997-04-16
US5694520A (en) 1997-12-02
DE69519229D1 (de) 2000-11-30
EP0767950B1 (de) 2000-10-25
DE69519229T2 (de) 2001-05-23
WO1996000962A2 (en) 1996-01-11
SE9402284D0 (sv) 1994-06-29
SE9402284L (sv) 1995-12-30
ES2152411T3 (es) 2001-02-01

Similar Documents

Publication Publication Date Title
EP0683483B1 (de) Verfahren und Anordnung für die Umwandlung von Sprache in Text
SE504177C2 (sv) Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk
CN109255113B (zh) 智能校对系统
US5806033A (en) Syllable duration and pitch variation to determine accents and stresses for speech recognition
US5170432A (en) Method of speaker adaptive speech recognition
US7962341B2 (en) Method and apparatus for labelling speech
JPH06110494A (ja) 発音学習装置
JPH07181997A (ja) 韻律学的情報を自動的に抽出する方法および装置
SE506003C2 (sv) Metod och system för tal-till-tal-omvandling med extrahering av prosodiinformation
Tjalve et al. Pronunciation variation modelling using accent features
SE519273C2 (sv) Förbättringar av , eller med avseende på, tal-till-tal- omvandling
Fujisawa et al. Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique.
Chen et al. Application of allophonic and lexical constraints in continuous digit recognition
Samoulian Knowledge based approach to speech recognition
Mary et al. Consonant-vowel based features for language identification
JPH04350699A (ja) テキスト音声合成装置
Hoge et al. Syllable-based acoustic-phonetic decoding and wordhypotheses generation in fluently spoken speech
JPH03217900A (ja) テキスト音声合成装置
JP2737122B2 (ja) 音声辞書作成装置
JPS61121167A (ja) 区切り発声に基づく音声ワ−ドプロセツサ
JPS6180298A (ja) 音声認識装置
Wiese The role of phonology in speech processing
GB2328056A (en) Generating context dependent sub-syllable models to recognize a tonal language
Samsudin et al. Constructing a Reusable Linguistic Resource for a Polyglot Speech Synthesis
JPS60225271A (ja) 音声入力仮名漢字変換装置