DE69220825D1 - Method and system for speech recognition (Verfahren und System zur Spracherkennung) - Google Patents
- Publication number
- DE69220825D1
- Authority
- DE
- Germany
- Prior art keywords
- speech recognition
- speech
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/04—Speaking
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G10L15/144—Training of HMMs
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
- G10L2015/025—Phonemes, fenemes or fenones being the recognition units
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/12—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being prediction coefficients
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP3058797A (JP3050934B2, ja) | 1991-03-22 | 1991-03-22 | Speech recognition method |
Publications (2)
Publication Number | Publication Date |
---|---|
DE69220825D1 (de) | 1997-08-21 |
DE69220825T2 (de) | 1998-02-19 |
Family ID: 13094576
Family Applications (1)
Application Number | Status | Priority Date | Filing Date | Title |
---|---|---|---|---|
DE69220825T (DE69220825T2, de) | Expired - Fee Related | 1991-03-22 | 1992-03-20 | Method and system for speech recognition |
Country Status (4)
Country | Link |
---|---|
US (1) | US5649056A (de) |
EP (1) | EP0504927B1 (de) |
JP (1) | JP3050934B2 (de) |
DE (1) | DE69220825T2 (de) |
Families Citing this family (35)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0772840B2 (ja) * | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | Method for constructing a speech model, speech recognition method, speech recognition apparatus, and method for training a speech model |
GB9223066D0 (en) * | 1992-11-04 | 1992-12-16 | Secr Defence | Children's speech training aid |
US5440662A (en) * | 1992-12-11 | 1995-08-08 | At&T Corp. | Keyword/non-keyword classification in isolated word speech recognition |
EP0681729B1 (de) * | 1993-01-30 | 1999-09-08 | Korea Telecommunications Authority | System for speech synthesis and speech recognition |
US5794198A (en) * | 1994-10-28 | 1998-08-11 | Nippon Telegraph And Telephone Corporation | Pattern recognition method |
GB9509831D0 (en) | 1995-05-15 | 1995-07-05 | Gerzon Michael A | Lossless coding method for waveform data |
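JPH0981183A (ja) * | 1995-09-14 | 1997-03-28 | Pioneer Electron Corp | Method for creating a speech model and speech recognition apparatus using the same |
JPH10260692A (ja) * | 1997-03-18 | 1998-09-29 | Toshiba Corp | Speech recognition-synthesis coding/decoding method and speech coding/decoding system |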
US6076055A (en) * | 1997-05-27 | 2000-06-13 | Ameritech | Speaker verification method |
US7630895B2 (en) * | 2000-01-21 | 2009-12-08 | At&T Intellectual Property I, L.P. | Speaker verification method |
FR2769117B1 (fr) * | 1997-09-29 | 2000-11-10 | Matra Comm | Training method in a speech recognition system |
US6092039A (en) * | 1997-10-31 | 2000-07-18 | International Business Machines Corporation | Symbiotic automatic speech recognition and vocoder |
US6347297B1 (en) * | 1998-10-05 | 2002-02-12 | Legerity, Inc. | Matrix quantization with vector quantization error compensation and neural network postprocessing for robust speech recognition |
US6219642B1 (en) | 1998-10-05 | 2001-04-17 | Legerity, Inc. | Quantization using frequency and mean compensated frequency input data for robust speech recognition |
JP2001166789A (ja) * | 1999-12-10 | 2001-06-22 | Matsushita Electric Ind Co Ltd | Method and apparatus for Chinese speech recognition using initial/final phoneme similarity vectors |
TW521266B (en) * | 2000-07-13 | 2003-02-21 | Verbaltek Inc | Perceptual phonetic feature speech recognition system and method |
JP2002189487A (ja) * | 2000-12-20 | 2002-07-05 | Mitsubishi Electric Corp | Speech recognition apparatus and speech recognition method |
US6711544B2 (en) | 2001-01-25 | 2004-03-23 | Harcourt Assessment, Inc. | Speech therapy system and method |
US6714911B2 (en) | 2001-01-25 | 2004-03-30 | Harcourt Assessment, Inc. | Speech transcription and analysis system and method |
WO2002059856A2 (en) * | 2001-01-25 | 2002-08-01 | The Psychological Corporation | Speech transcription, therapy, and analysis system and method |
US6732076B2 (en) | 2001-01-25 | 2004-05-04 | Harcourt Assessment, Inc. | Speech analysis and therapy system and method |
US20020143550A1 (en) * | 2001-03-27 | 2002-10-03 | Takashi Nakatsuyama | Voice recognition shopping system |
TW556152B (en) * | 2002-05-29 | 2003-10-01 | Labs Inc L | Interface of automatically labeling phonic symbols for correcting user's pronunciation, and systems and methods |
US7089185B2 (en) * | 2002-06-27 | 2006-08-08 | Intel Corporation | Embedded multi-layer coupled hidden Markov model |
US7231019B2 (en) * | 2004-02-12 | 2007-06-12 | Microsoft Corporation | Automatic identification of telephone callers based on voice characteristics |
US7970613B2 (en) | 2005-11-12 | 2011-06-28 | Sony Computer Entertainment Inc. | Method and system for Gaussian probability data bit reduction and computation |
US7778831B2 (en) | 2006-02-21 | 2010-08-17 | Sony Computer Entertainment Inc. | Voice recognition with dynamic filter bank adjustment based on speaker categorization determined from runtime pitch |
US8010358B2 (en) * | 2006-02-21 | 2011-08-30 | Sony Computer Entertainment Inc. | Voice recognition with parallel gender and age normalization |
US8442833B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Speech processing with source location estimation using signals from two or more microphones |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8788256B2 (en) * | 2009-02-17 | 2014-07-22 | Sony Computer Entertainment Inc. | Multiple language voice recognition |
US9153235B2 (en) | 2012-04-09 | 2015-10-06 | Sony Computer Entertainment Inc. | Text dependent speaker recognition with long-term feature based on functional data analysis |
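JP6495850B2 (ja) * | 2016-03-14 | 2019-04-03 | 株式会社東芝 | Information processing apparatus, information processing method, program, and recognition system |
CN112786050B (zh) * | 2019-11-07 | 2024-02-02 | 王皓 | Speech recognition method, apparatus, and device |
CN111508498B (zh) * | 2020-04-09 | 2024-01-30 | 携程计算机技术(上海)有限公司 | Conversational speech recognition method, system, electronic device, and storage medium |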
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6024994B2 (ja) * | 1980-04-21 | 1985-06-15 | シャープ株式会社 | Pattern similarity calculation method |
JPS59226400A (ja) * | 1983-06-07 | 1984-12-19 | 松下電器産業株式会社 | Speech recognition apparatus |
US5131043A (en) * | 1983-09-05 | 1992-07-14 | Matsushita Electric Industrial Co., Ltd. | Method of and apparatus for speech recognition wherein decisions are made based on phonemes |
JPH0760318B2 (ja) * | 1986-09-29 | 1995-06-28 | 株式会社東芝 | Continuous speech recognition method |
US5027406A (en) * | 1988-12-06 | 1991-06-25 | Dragon Systems, Inc. | Method for interactive speech recognition and training |
JPH0636156B2 (ja) * | 1989-03-13 | 1994-05-11 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Speech recognition apparatus |
JPH0833739B2 (ja) * | 1990-09-13 | 1996-03-29 | 三菱電機株式会社 | Pattern representation model learning apparatus |
-
1991
- 1991-03-22 JP JP3058797A patent/JP3050934B2/ja not_active Expired - Fee Related
-
1992
- 1992-03-20 EP EP92104898A patent/EP0504927B1/de not_active Expired - Lifetime
- 1992-03-20 DE DE69220825T patent/DE69220825T2/de not_active Expired - Fee Related
-
1994
- 1994-02-14 US US08/195,845 patent/US5649056A/en not_active Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
US5649056A (en) | 1997-07-15 |
EP0504927A2 (de) | 1992-09-23 |
EP0504927B1 (de) | 1997-07-16 |
EP0504927A3 (en) | 1993-06-02 |
JP3050934B2 (ja) | 2000-06-12 |
JPH04293096A (ja) | 1992-10-16 |
DE69220825T2 (de) | 1998-02-19 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DE69220825T2 (de) | Method and system for speech recognition | |
DE69324629D1 (de) | Method and apparatus for speech recognition | |
DE69324428D1 (de) | Method for speech shaping and device for speech recognition | |
DE69031284D1 (de) | Method and arrangement for speech recognition | |
DE69420400D1 (de) | Method and device for speaker recognition | |
DE69328450D1 (de) | Method and apparatus for speech coding | |
DE69228973T2 (de) | Method and device for character recognition | |
DE69232493D1 (de) | Method and device for character recognition | |
DE69518705T2 (de) | Method and apparatus for speech recognition | |
DE69524829T2 (de) | Method and apparatus for speech recognition | |
DE69127961D1 (de) | Method for speech recognition | |
DE69428475T2 (de) | Method and device for automatic speech recognition | |
DE69321656D1 (de) | Method for speech recognition | |
DE69625950T2 (de) | Method and apparatus for speech recognition, and translation system | |
DE69230092D1 (de) | Method and device for character recognition | |
DE69431445D1 (de) | Method and apparatus for speech coding | |
DE69517829D1 (de) | Apparatus and method for speech recognition | |
DE69620304D1 (de) | Apparatus and method for speech recognition | |
DE69030548D1 (de) | Method and arrangement for speech recognition | |
DE69518674T2 (de) | Method and device for speech recognition | |
DE69518291D1 (de) | System and method for speech recognition with reduced response time | |
DE69132130D1 (de) | Device and method for information recognition | |
DE69315638D1 (de) | Apparatus for speech decoding and method for decoding | |
DE69230166D1 (de) | Method and device for character recognition | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | 8364 | No opposition during term of opposition | |
 | 8339 | Ceased/non-payment of the annual fee | |