SE514684C2 - Metod vid tal-till-textomvandling - Google Patents

Metod vid tal-till-textomvandling

Info

Publication number
SE514684C2
SE514684C2 SE9502202A SE9502202A SE514684C2 SE 514684 C2 SE514684 C2 SE 514684C2 SE 9502202 A SE9502202 A SE 9502202A SE 9502202 A SE9502202 A SE 9502202A SE 514684 C2 SE514684 C2 SE 514684C2
Authority
SE
Sweden
Prior art keywords
accent
information
speech
words
sentences
Prior art date
Application number
SE9502202A
Other languages
English (en)
Swedish (sv)
Other versions
SE9502202D0 (sv
SE9502202L (sv
Inventor
Bertil Lyberg
Original Assignee
Telia Ab
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telia Ab filed Critical Telia Ab
Priority to SE9502202A priority Critical patent/SE514684C2/sv
Publication of SE9502202D0 publication Critical patent/SE9502202D0/xx
Priority to DE69618503T priority patent/DE69618503T2/de
Priority to DK96850108T priority patent/DK0749109T3/da
Priority to EP96850108A priority patent/EP0749109B1/en
Priority to NO19962463A priority patent/NO316847B1/no
Priority to JP8175484A priority patent/JPH0922297A/ja
Priority to US08/665,728 priority patent/US5806033A/en
Publication of SE9502202L publication Critical patent/SE9502202L/xx
Publication of SE514684C2 publication Critical patent/SE514684C2/sv

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1807Speech classification or search using natural language modelling using prosody or stress

Landscapes

  • Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
SE9502202A 1995-06-16 1995-06-16 Metod vid tal-till-textomvandling SE514684C2 (sv)

Priority Applications (7)

Application Number Priority Date Filing Date Title
SE9502202A SE514684C2 (sv) 1995-06-16 1995-06-16 Metod vid tal-till-textomvandling
DE69618503T DE69618503T2 (de) 1995-06-16 1996-06-04 Spracherkennung für Tonsprachen
DK96850108T DK0749109T3 (da) 1995-06-16 1996-06-04 Talegenkendelse for tonesprog
EP96850108A EP0749109B1 (en) 1995-06-16 1996-06-04 Speech recognition for tonal languages
NO19962463A NO316847B1 (no) 1995-06-16 1996-06-12 Fremgangsmate og anordning ved omvandling av tale til tekst
JP8175484A JPH0922297A (ja) 1995-06-16 1996-06-14 音声‐テキスト変換のための方法および装置
US08/665,728 US5806033A (en) 1995-06-16 1996-06-17 Syllable duration and pitch variation to determine accents and stresses for speech recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
SE9502202A SE514684C2 (sv) 1995-06-16 1995-06-16 Metod vid tal-till-textomvandling

Publications (3)

Publication Number Publication Date
SE9502202D0 SE9502202D0 (sv) 1995-06-16
SE9502202L SE9502202L (sv) 1996-12-17
SE514684C2 true SE514684C2 (sv) 2001-04-02

Family

ID=20398649

Family Applications (1)

Application Number Title Priority Date Filing Date
SE9502202A SE514684C2 (sv) 1995-06-16 1995-06-16 Metod vid tal-till-textomvandling

Country Status (7)

Country Link
US (1) US5806033A (ja)
EP (1) EP0749109B1 (ja)
JP (1) JPH0922297A (ja)
DE (1) DE69618503T2 (ja)
DK (1) DK0749109T3 (ja)
NO (1) NO316847B1 (ja)
SE (1) SE514684C2 (ja)

Families Citing this family (28)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1039895A (ja) * 1996-07-25 1998-02-13 Matsushita Electric Ind Co Ltd 音声合成方法および装置
KR100238189B1 (ko) * 1997-10-16 2000-01-15 윤종용 다중 언어 tts장치 및 다중 언어 tts 처리 방법
JP4267101B2 (ja) 1997-11-17 2009-05-27 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声識別装置、発音矯正装置およびこれらの方法
US7283973B1 (en) 1998-10-07 2007-10-16 Logic Tree Corporation Multi-modal voice-enabled content access and delivery system
US6941273B1 (en) 1998-10-07 2005-09-06 Masoud Loghmani Telephony-data application interface apparatus and method for multi-modal access to data applications
US6377927B1 (en) 1998-10-07 2002-04-23 Masoud Loghmani Voice-optimized database system and method of using same
WO2001003112A1 (en) * 1999-07-06 2001-01-11 James Quest Speech recognition system and method
AU763362B2 (en) * 1999-07-06 2003-07-17 James Quest Speech recognition system and method
US6526382B1 (en) 1999-12-07 2003-02-25 Comverse, Inc. Language-oriented user interfaces for voice activated services
US20080147404A1 (en) * 2000-05-15 2008-06-19 Nusuara Technologies Sdn Bhd System and methods for accent classification and adaptation
US7200142B1 (en) 2001-02-08 2007-04-03 Logic Tree Corporation System for providing multi-phased, multi-modal access to content through voice and data devices
US6948129B1 (en) 2001-02-08 2005-09-20 Masoud S Loghmani Multi-modal, multi-path user interface for simultaneous access to internet data over multiple media
US8000320B2 (en) * 2001-02-08 2011-08-16 Logic Tree Corporation System for providing multi-phased, multi-modal access to content through voice and data devices
ATE310302T1 (de) * 2001-09-28 2005-12-15 Cit Alcatel Kommunikationsvorrichtung und verfahren zum senden und empfangen von sprachsignalen unter kombination eines spracherkennungsmodules mit einer kodiereinheit
GB2388738B (en) 2001-11-03 2004-06-02 Dremedia Ltd Time ordered indexing of audio data
GB2381688B (en) 2001-11-03 2004-09-22 Dremedia Ltd Time ordered indexing of audio-visual data
US20030115169A1 (en) * 2001-12-17 2003-06-19 Hongzhuan Ye System and method for management of transcribed documents
US6990445B2 (en) * 2001-12-17 2006-01-24 Xl8 Systems, Inc. System and method for speech recognition and transcription
US7280968B2 (en) * 2003-03-25 2007-10-09 International Business Machines Corporation Synthetically generated speech responses including prosodic characteristics of speech inputs
US20050055197A1 (en) * 2003-08-14 2005-03-10 Sviatoslav Karavansky Linguographic method of compiling word dictionaries and lexicons for the memories of electronic speech-recognition devices
JP4264841B2 (ja) * 2006-12-01 2009-05-20 ソニー株式会社 音声認識装置および音声認識方法、並びに、プログラム
US8315870B2 (en) * 2007-08-22 2012-11-20 Nec Corporation Rescoring speech recognition hypothesis using prosodic likelihood
US8401856B2 (en) * 2010-05-17 2013-03-19 Avaya Inc. Automatic normalization of spoken syllable duration
US9009049B2 (en) * 2012-06-06 2015-04-14 Spansion Llc Recognition of speech with different accents
US9966064B2 (en) 2012-07-18 2018-05-08 International Business Machines Corporation Dialect-specific acoustic language modeling and speech recognition
KR102084646B1 (ko) * 2013-07-04 2020-04-14 삼성전자주식회사 음성 인식 장치 및 음성 인식 방법
US10468050B2 (en) 2017-03-29 2019-11-05 Microsoft Technology Licensing, Llc Voice synthesized participatory rhyming chat bot
US11809958B2 (en) * 2020-06-10 2023-11-07 Capital One Services, Llc Systems and methods for automatic decision-making with user-configured criteria using multi-channel data inputs

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0356736B2 (ja) * 1979-05-28 1991-08-29
JPH05197389A (ja) * 1991-08-13 1993-08-06 Toshiba Corp 音声認識装置
SE500277C2 (sv) * 1993-05-10 1994-05-24 Televerket Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk
SE516526C2 (sv) * 1993-11-03 2002-01-22 Telia Ab Metod och anordning vid automatisk extrahering av prosodisk information
SE504177C2 (sv) * 1994-06-29 1996-12-02 Telia Ab Metod och anordning att adaptera en taligenkänningsutrustning för dialektala variationer i ett språk

Also Published As

Publication number Publication date
DE69618503D1 (de) 2002-02-21
EP0749109A2 (en) 1996-12-18
US5806033A (en) 1998-09-08
EP0749109A3 (en) 1998-04-29
SE9502202D0 (sv) 1995-06-16
EP0749109B1 (en) 2002-01-16
NO316847B1 (no) 2004-06-01
SE9502202L (sv) 1996-12-17
JPH0922297A (ja) 1997-01-21
NO962463L (no) 1996-12-17
DK0749109T3 (da) 2002-03-25
NO962463D0 (no) 1996-06-12
DE69618503T2 (de) 2002-08-29

Similar Documents

Publication Publication Date Title
SE514684C2 (sv) Metod vid tal-till-textomvandling
EP0683483B1 (en) A method and arrangement for speech to text conversion
Norris et al. The possible-word constraint in the segmentation of continuous speech
US7937262B2 (en) Method, apparatus, and computer program product for machine translation
US7962341B2 (en) Method and apparatus for labelling speech
CN106297800B (zh) 一种自适应的语音识别的方法和设备
JP2559998B2 (ja) 音声認識装置及びラベル生成方法
Warnke et al. Integrated dialog act segmentation and classification using prosodic features and language models.
JPH0423799B2 (ja)
CN104464751B (zh) 发音韵律问题的检测方法及装置
JP2001100781A (ja) 音声処理装置および音声処理方法、並びに記録媒体
ATE389225T1 (de) Spracherkennung
KR20060052663A (ko) 음운 기반의 음성 인식 시스템 및 방법
EP1095371A1 (en) Language independent speech recognition
US5694520A (en) Method and device for speech recognition
US8870575B2 (en) Language learning system, language learning method, and computer program product thereof
Conkie et al. Prosody recognition from speech utterances using acoustic and linguistic based models of prosodic events
KR100930714B1 (ko) 음성인식 장치 및 방법
JPH06110494A (ja) 発音学習装置
CN115424604B (zh) 一种基于对抗生成网络的语音合成模型的训练方法
Taylor et al. Using prosodic information to constrain language models for spoken dialogue
NO318557B1 (no) Fremgangsmate og system for tale-til-taleomforming
SE519273C2 (sv) Förbättringar av , eller med avseende på, tal-till-tal- omvandling
Holmes et al. Why have HMMs been so successful for automatic speech recognition and how might they be improved
O'Brien Knowledge-based systems in speech recognition: a survey