SE513456C2 - Method and arrangement for speech to text conversion (Metod och anordning vid tal- till textomvandling)
- Publication number: SE513456C2
- Authority: SE (Sweden)
- Prior art keywords: words, speech, model, phrases, language
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L15/00—Speech recognition
        - G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
          - G10L2015/025—Phonemes, fenemes or fenones being the recognition units
        - G10L15/08—Speech classification or search
          - G10L15/18—Speech classification or search using natural language modelling
            - G10L15/1807—Speech classification or search using natural language modelling using prosody or stress
            - G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
              - G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
              - G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
Landscapes
- Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Priority Applications (6)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| SE9401613A SE513456C2 (sv) | 1994-05-10 | 1994-05-10 | Metod och anordning vid tal- till textomvandling |
| ES95850082T ES2153021T3 (es) | 1994-05-10 | 1995-04-27 | Procedimiento y disposicion para la conversion del habla a texto. |
| EP95850082A EP0683483B1 (de) | 1994-05-10 | 1995-04-27 | Verfahren und Anordnung für die Umwandlung von Sprache in Text |
| DE69519328T DE69519328T2 (de) | 1994-05-10 | 1995-04-27 | Verfahren und Anordnung für die Umwandlung von Sprache in Text |
| US08/432,062 US5752227A (en) | 1994-05-10 | 1995-05-01 | Method and arrangement for speech to text conversion |
| JP7137215A JPH0850498A (ja) | 1994-05-10 | 1995-05-10 | 音声をテキストに変換するための方法および装置 |
Applications Claiming Priority (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| SE9401613A SE513456C2 (sv) | 1994-05-10 | 1994-05-10 | Metod och anordning vid tal- till textomvandling |
Publications (3)
| Publication Number | Publication Date |
|---|---|
| SE9401613D0 SE9401613D0 (sv) | 1994-05-10 |
| SE9401613L SE9401613L (sv) | 1995-11-11 |
| SE513456C2 (sv) | 2000-09-18 |
Family
ID=20393956
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| SE9401613A SE513456C2 (sv) | 1994-05-10 | 1994-05-10 | Metod och anordning vid tal- till textomvandling |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US5752227A (en) |
| EP (1) | EP0683483B1 (de) |
| JP (1) | JPH0850498A (ja) |
| DE (1) | DE69519328T2 (de) |
| ES (1) | ES2153021T3 (es) |
| SE (1) | SE513456C2 (sv) |
Families Citing this family (64)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |
| US6067520A (en) * | 1995-12-29 | 2000-05-23 | Lee And Li | System and method of recognizing continuous mandarin speech utilizing chinese hidden markou models |
| WO1998011494A1 (en) * | 1996-09-16 | 1998-03-19 | Advanced Research Solutions, Llc | Data correlation and analysis tool |
| JPH10162065A (ja) * | 1996-11-28 | 1998-06-19 | Hitachi Ltd | 配送管理システム |
| DE19721008A1 (de) * | 1997-05-20 | 1998-11-26 | Hanjo Dr Kreitz | Sprechschreibmaschine |
| US6490561B1 (en) * | 1997-06-25 | 2002-12-03 | Dennis L. Wilson | Continuous speech voice transcription |
| US6064957A (en) * | 1997-08-15 | 2000-05-16 | General Electric Company | Improving speech recognition through text-based linguistic post-processing |
| US6603835B2 (en) | 1997-09-08 | 2003-08-05 | Ultratec, Inc. | System for text assisted telephony |
| US6219641B1 (en) * | 1997-12-09 | 2001-04-17 | Michael V. Socaciu | System and method of transmitting speech at low line rates |
| US6157905A (en) * | 1997-12-11 | 2000-12-05 | Microsoft Corporation | Identifying language and character set of data representing text |
| US6754631B1 (en) | 1998-11-04 | 2004-06-22 | Gateway, Inc. | Recording meeting minutes based upon speech recognition |
| DE19857070A1 (de) * | 1998-12-10 | 2000-06-15 | Michael Mende | Verfahren und Vorrichtung zur Ermittlung einer orthographischen Wiedergabe eines Textes |
| JP2000196730A (ja) * | 1998-12-25 | 2000-07-14 | Nec Saitama Ltd | 無線通信機 |
| CA2366057C (en) * | 1999-03-05 | 2009-03-24 | Canon Kabushiki Kaisha | Database annotation and retrieval |
| US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
| US6882970B1 (en) | 1999-10-28 | 2005-04-19 | Canon Kabushiki Kaisha | Language recognition using sequence frequency |
| DE60036486T2 (de) | 1999-10-28 | 2008-06-12 | Canon K.K. | Methode und apparat zum prüfen von musterübereinstimmungen |
| US6789060B1 (en) * | 1999-11-01 | 2004-09-07 | Gene J. Wolfe | Network based speech transcription that maintains dynamic templates |
| JP2001166789A (ja) * | 1999-12-10 | 2001-06-22 | Matsushita Electric Ind Co Ltd | 初頭/末尾の音素類似度ベクトルによる中国語の音声認識方法及びその装置 |
| US20060074664A1 (en) * | 2000-01-10 | 2006-04-06 | Lam Kwok L | System and method for utterance verification of chinese long and short keywords |
| GB0011798D0 (en) * | 2000-05-16 | 2000-07-05 | Canon Kk | Database annotation and retrieval |
| GB0015233D0 (en) | 2000-06-21 | 2000-08-16 | Canon Kk | Indexing method and apparatus |
| US7075671B1 (en) * | 2000-09-14 | 2006-07-11 | International Business Machines Corp. | System and method for providing a printing capability for a transcription service or multimedia presentation |
| GB0023930D0 (en) | 2000-09-29 | 2000-11-15 | Canon Kk | Database annotation and retrieval |
| GB0027178D0 (en) * | 2000-11-07 | 2000-12-27 | Canon Kk | Speech processing system |
| GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
| US8416925B2 (en) | 2005-06-29 | 2013-04-09 | Ultratec, Inc. | Device independent text captioned telephone service |
| US20030050777A1 (en) * | 2001-09-07 | 2003-03-13 | Walker William Donald | System and method for automatic transcription of conversations |
| EP1430474B1 (de) * | 2001-09-17 | 2005-11-30 | Koninklijke Philips Electronics N.V. | Korrektur eines von einer spracherkennung erkannten textes mittels vergleich der phonemfolgen des erkannten textes mit einer phonetischen transkription eines manuell eingegebenen korrekturwortes |
| US20030115169A1 (en) * | 2001-12-17 | 2003-06-19 | Hongzhuan Ye | System and method for management of transcribed documents |
| US6990445B2 (en) * | 2001-12-17 | 2006-01-24 | Xl8 Systems, Inc. | System and method for speech recognition and transcription |
| CA2530899C (en) * | 2002-06-28 | 2013-06-25 | Conceptual Speech, Llc | Multi-phoneme streamer and knowledge representation speech recognition system and method |
| US7614880B2 (en) * | 2002-10-03 | 2009-11-10 | James Bennett | Method and apparatus for a phoneme playback system for enhancing language learning skills |
| US7412392B1 (en) | 2003-04-14 | 2008-08-12 | Sprint Communications Company L.P. | Conference multi-tasking system and method |
| US7275032B2 (en) | 2003-04-25 | 2007-09-25 | Bvoice Corporation | Telephone call handling center where operators utilize synthesized voices generated or modified to exhibit or omit prescribed speech characteristics |
| JP4713111B2 (ja) * | 2003-09-19 | 2011-06-29 | 株式会社エヌ・ティ・ティ・ドコモ | 発話区間検出装置、音声認識処理装置、送信システム、信号レベル制御装置、発話区間検出方法 |
| US8515024B2 (en) | 2010-01-13 | 2013-08-20 | Ultratec, Inc. | Captioned telephone service |
| US20050221142A1 (en) * | 2004-03-23 | 2005-10-06 | Narayanan Sekharipuram R | Composite polymer electrolytes based on organosilica hybrid proton conductors for fuel cells |
| JP2005326677A (ja) * | 2004-05-14 | 2005-11-24 | Toshiba Tec Corp | 音声メモプリンタ |
| JP4544933B2 (ja) * | 2004-07-29 | 2010-09-15 | 東芝テック株式会社 | 音声メモプリンタ |
| US20060092291A1 (en) * | 2004-10-28 | 2006-05-04 | Bodie Jeffrey C | Digital imaging system |
| KR101100191B1 (ko) * | 2005-01-28 | 2011-12-28 | 엘지전자 주식회사 | 멀티미디어 재생장치와 이를 이용한 멀티미디어 자료검색방법 |
| US11258900B2 (en) | 2005-06-29 | 2022-02-22 | Ultratec, Inc. | Device independent text captioned telephone service |
| WO2007129316A2 (en) | 2006-05-07 | 2007-11-15 | Varcode Ltd. | A system and method for improved quality management in a product logistic chain |
| US7562811B2 (en) | 2007-01-18 | 2009-07-21 | Varcode Ltd. | System and method for improved quality management in a product logistic chain |
| EP2156369B1 (de) | 2007-05-06 | 2015-09-02 | Varcode Ltd. | System und verfahren zur qualitätsverwaltung unter verwendung von strichcodeindikatoren |
| WO2009016631A2 (en) | 2007-08-01 | 2009-02-05 | Ginger Software, Inc. | Automatic context sensitive language correction and enhancement using an internet corpus |
| US8595642B1 (en) | 2007-10-04 | 2013-11-26 | Great Northern Research, LLC | Multiple shell multi faceted graphical user interface |
| WO2009063465A2 (en) | 2007-11-14 | 2009-05-22 | Varcode Ltd. | A system and method for quality management utilizing barcode indicators |
| US8856003B2 (en) * | 2008-04-30 | 2014-10-07 | Motorola Solutions, Inc. | Method for dual channel monitoring on a radio device |
| US11704526B2 (en) | 2008-06-10 | 2023-07-18 | Varcode Ltd. | Barcoded indicators for quality management |
| US9015036B2 (en) * | 2010-02-01 | 2015-04-21 | Ginger Software, Inc. | Automatic context sensitive language correction using an internet corpus particularly for small keyboard devices |
| US8807422B2 (en) | 2012-10-22 | 2014-08-19 | Varcode Ltd. | Tamper-proof quality management barcode indicators |
| US10389876B2 (en) | 2014-02-28 | 2019-08-20 | Ultratec, Inc. | Semiautomated relay method and apparatus |
| US10878721B2 (en) | 2014-02-28 | 2020-12-29 | Ultratec, Inc. | Semiautomated relay method and apparatus |
| US12482458B2 (en) | 2014-02-28 | 2025-11-25 | Ultratec, Inc. | Semiautomated relay method and apparatus |
| US20180034961A1 (en) | 2014-02-28 | 2018-02-01 | Ultratec, Inc. | Semiautomated Relay Method and Apparatus |
| US20180270350A1 (en) | 2014-02-28 | 2018-09-20 | Ultratec, Inc. | Semiautomated relay method and apparatus |
| JP6649472B2 (ja) | 2015-05-18 | 2020-02-19 | バーコード リミティド | 活性化可能な品質表示ラベルのための熱変色性インク証印 |
| US10697837B2 (en) | 2015-07-07 | 2020-06-30 | Varcode Ltd. | Electronic quality indicator |
| US11539900B2 (en) | 2020-02-21 | 2022-12-27 | Ultratec, Inc. | Caption modification and augmentation systems and methods for use by hearing assisted user |
| CN111862954B (zh) * | 2020-05-29 | 2024-03-01 | 北京捷通华声科技股份有限公司 | 一种语音识别模型的获取方法及装置 |
| US12190871B1 (en) * | 2021-09-07 | 2025-01-07 | Amazon Technologies, Inc. | Deep learning-based automatic detection and labeling of dynamic advertisements in long-form audio content |
| US12056457B2 (en) * | 2022-03-22 | 2024-08-06 | Charles University, Faculty Of Mathematics And Physics | Computer-implemented method of real time speech translation and a computer system for carrying out the method |
Family Cites Families (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US3704345A (en) * | 1971-03-19 | 1972-11-28 | Bell Telephone Labor Inc | Conversion of printed text into synthetic speech |
| JPS5919358B2 (ja) * | 1978-12-11 | 1984-05-04 | 株式会社日立製作所 | 音声内容伝送方式 |
| FR2547146B1 (fr) * | 1983-06-02 | 1987-03-20 | Texas Instruments France | Procede et dispositif pour l'audition de messages parles synthetises et pour la visualisation de messages graphiques correspondants |
| US4802223A (en) * | 1983-11-03 | 1989-01-31 | Texas Instruments Incorporated | Low data rate speech encoding employing syllable pitch patterns |
| US4797930A (en) * | 1983-11-03 | 1989-01-10 | Texas Instruments Incorporated | constructed syllable pitch patterns from phonological linguistic unit string data |
| US4695962A (en) * | 1983-11-03 | 1987-09-22 | Texas Instruments Incorporated | Speaking apparatus having differing speech modes for word and phrase synthesis |
| US4977599A (en) * | 1985-05-29 | 1990-12-11 | International Business Machines Corporation | Speech recognition employing a set of Markov models that includes Markov models representing transitions to and from silence |
| US4829580A (en) * | 1986-03-26 | 1989-05-09 | Telephone And Telegraph Company, At&T Bell Laboratories | Text analysis system with letter sequence recognition and speech stress assignment arrangement |
| US5384701A (en) * | 1986-10-03 | 1995-01-24 | British Telecommunications Public Limited Company | Language translation system |
| US4852170A (en) * | 1986-12-18 | 1989-07-25 | R & D Associates | Real time computer speech recognition system |
| US5231670A (en) * | 1987-06-01 | 1993-07-27 | Kurzweil Applied Intelligence, Inc. | Voice controlled system and method for generating text from a voice controlled input |
| US5146405A (en) * | 1988-02-05 | 1992-09-08 | At&T Bell Laboratories | Methods for part-of-speech determination and usage |
| US5220639A (en) * | 1989-12-01 | 1993-06-15 | National Science Council | Mandarin speech input method for Chinese computers and a mandarin speech recognition machine |
| US5268990A (en) * | 1991-01-31 | 1993-12-07 | Sri International | Method for recognizing speech using linguistically-motivated hidden Markov models |
| SE9301596L (sv) * | 1993-05-10 | 1994-05-24 | Televerket | Anordning för att öka talförståelsen vid översätttning av tal från ett första språk till ett andra språk |
| SE516526C2 (sv) * | 1993-11-03 | 2002-01-22 | Telia Ab | Metod och anordning vid automatisk extrahering av prosodisk information |

- 1994-05-10: SE application SE9401613A, patent SE513456C2 (sv), status unknown
- 1995-04-27: DE application DE69519328T, patent DE69519328T2 (de), not active (Expired - Fee Related)
- 1995-04-27: EP application EP95850082A, patent EP0683483B1 (de), not active (Expired - Lifetime)
- 1995-04-27: ES application ES95850082T, patent ES2153021T3 (es), not active (Expired - Lifetime)
- 1995-05-01: US application US08/432,062, patent US5752227A (en), not active (Expired - Lifetime)
- 1995-05-10: JP application JP7137215A, patent JPH0850498A (ja), active (Pending)
Also Published As
| Publication number | Publication date |
|---|---|
| SE9401613D0 (sv) | 1994-05-10 |
| ES2153021T3 (es) | 2001-02-16 |
| SE9401613L (sv) | 1995-11-11 |
| DE69519328D1 (de) | 2000-12-14 |
| EP0683483A3 (de) | 1997-08-27 |
| EP0683483B1 (de) | 2000-11-08 |
| JPH0850498A (ja) | 1996-02-20 |
| EP0683483A2 (de) | 1995-11-22 |
| US5752227A (en) | 1998-05-12 |
| DE69519328T2 (de) | 2001-05-23 |
Similar Documents
| Publication | Title |
|---|---|
| SE513456C2 (sv) | Metod och anordning vid tal- till textomvandling |
| Ananthakrishnan et al. | Automatic prosodic event detection using acoustic, lexical, and syntactic evidence |
| US5806033A (en) | Syllable duration and pitch variation to determine accents and stresses for speech recognition |
| US7937262B2 (en) | Method, apparatus, and computer program product for machine translation |
| CN113571037B (zh) | 一种汉语盲文语音合成方法及系统 |
| US11817079B1 (en) | GAN-based speech synthesis model and training method |
| US5694520A (en) | Method and device for speech recognition |
| Kadambe et al. | Language identification with phonological and lexical models |
| Chen et al. | How prosody improves word recognition |
| EP0919052B1 (de) | Verfahren und system zur sprache-in-sprache-umsetzung |
| Akinwonm | Development of a prosodic read speech syllabic corpus of the yoruba language |
| Fosler-Lussier | A tutorial on pronunciation modeling for large vocabulary speech recognition |
| Chen et al. | A maximum likelihood prosody recognizer |
| SE519273C2 (sv) | Förbättringar av , eller med avseende på, tal-till-tal- omvandling |
| Berkling | Automatic language identification with sequences of language-independent phoneme clusters |
| Hamid et al. | Automatic generation of hypotheses for automatic diagnosis of pronunciation errors |
| Waibel | Towards very large vocabulary word recognition |
| Teich et al. | Matching a tone-based and tune-based approach to English intonation for concept-to-speech generation |
| Hoge et al. | Syllable-based acoustic-phonetic decoding and wordhypotheses generation in fluently spoken speech |
| Külekci | Statistical morphological disambiguation with application to disambiguation of pronunciations in Turkish |
| JPS61121167A (ja) | 区切り発声に基づく音声ワ−ドプロセツサ |
| JP2005534968A (ja) | 漢字語の読みの決定 |
| Weibin et al. | Duration Modeling For Chinese Systhesis from C-ToBI Labeled Corpus |
| Togawa et al. | Voice-activated word processor with automatic learning for dynamic optimization of syllable-templates |
| Wang | An interactive open-vocabulary chinese name input system using syllable spelling and character description recognition modules for error correction |