CA2077728A1 - Appareil de codage vocal a prototypes dependant du locuteurscrees a partir de donnees de reference - Google Patents
Appareil de codage vocal a prototypes dependant du locuteurscrees a partir de donnees de referenceInfo
- Publication number
- CA2077728A1 CA2077728A1 CA2077728A CA2077728A CA2077728A1 CA 2077728 A1 CA2077728 A1 CA 2077728A1 CA 2077728 A CA2077728 A CA 2077728A CA 2077728 A CA2077728 A CA 2077728A CA 2077728 A1 CA2077728 A1 CA 2077728A1
- Authority
- CA
- Canada
- Prior art keywords
- vector signals
- prototype
- feature
- signal
- values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 230000001419 dependent effect Effects 0.000 title abstract 2
- 238000000034 method Methods 0.000 abstract 2
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/063—Training
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/02—Feature extraction for speech recognition; Selection of recognition unit
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Artificial Intelligence (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Complex Calculations (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US07/802,678 | 1991-12-05 | ||
US07/802,678 US5278942A (en) | 1991-12-05 | 1991-12-05 | Speech coding apparatus having speaker dependent prototypes generated from nonuser reference data |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2077728A1 true CA2077728A1 (fr) | 1993-06-06 |
CA2077728C CA2077728C (fr) | 1996-08-06 |
Family
ID=25184402
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002077728A Expired - Fee Related CA2077728C (fr) | 1991-12-05 | 1992-09-08 | Appareil de codage vocal a prototypes dependant du locuteurscrees a partir de donnees de reference |
Country Status (4)
Country | Link |
---|---|
US (1) | US5278942A (fr) |
EP (1) | EP0545083A2 (fr) |
JP (1) | JP2691109B2 (fr) |
CA (1) | CA2077728C (fr) |
Families Citing this family (41)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2684212A1 (fr) * | 1991-11-26 | 1993-05-28 | Trt Telecom Radio Electr | Dispositif de mise en óoeuvre d'un traitement d'information impliquant une methode des moindres carres. |
JPH0772840B2 (ja) * | 1992-09-29 | 1995-08-02 | 日本アイ・ビー・エム株式会社 | 音声モデルの構成方法、音声認識方法、音声認識装置及び音声モデルの訓練方法 |
US5497447A (en) * | 1993-03-08 | 1996-03-05 | International Business Machines Corporation | Speech coding apparatus having acoustic prototype vectors generated by tying to elementary models and clustering around reference vectors |
US5544277A (en) * | 1993-07-28 | 1996-08-06 | International Business Machines Corporation | Speech coding apparatus and method for generating acoustic feature vector component values by combining values of the same features for multiple time intervals |
US5522011A (en) * | 1993-09-27 | 1996-05-28 | International Business Machines Corporation | Speech coding apparatus and method using classification rules |
US5745649A (en) * | 1994-07-07 | 1998-04-28 | Nynex Science & Technology Corporation | Automated speech recognition using a plurality of different multilayer perception structures to model a plurality of distinct phoneme categories |
JP2768274B2 (ja) * | 1994-09-08 | 1998-06-25 | 日本電気株式会社 | 音声認識装置 |
DE19516106C2 (de) * | 1995-05-05 | 2003-04-03 | Philips Corp Intellectual Pty | Verfahren zum Bestimmen von Referenzwerten |
EP0788648B1 (fr) * | 1995-08-28 | 2000-08-16 | Koninklijke Philips Electronics N.V. | Procede et systeme de reconnaissance de motifs bases sur la construction dynamique d'un sous-ensemble de vecteurs de reference |
US5737433A (en) * | 1996-01-16 | 1998-04-07 | Gardner; William A. | Sound environment control apparatus |
US5963903A (en) * | 1996-06-28 | 1999-10-05 | Microsoft Corporation | Method and system for dynamically adjusted training for speech recognition |
US5835890A (en) * | 1996-08-02 | 1998-11-10 | Nippon Telegraph And Telephone Corporation | Method for speaker adaptation of speech models recognition scheme using the method and recording medium having the speech recognition method recorded thereon |
US6151575A (en) * | 1996-10-28 | 2000-11-21 | Dragon Systems, Inc. | Rapid adaptation of speech models |
US5915001A (en) | 1996-11-14 | 1999-06-22 | Vois Corporation | System and method for providing and using universally accessible voice and speech data files |
US6212498B1 (en) | 1997-03-28 | 2001-04-03 | Dragon Systems, Inc. | Enrollment in speech recognition |
US6023673A (en) * | 1997-06-04 | 2000-02-08 | International Business Machines Corporation | Hierarchical labeler in a speech recognition system |
AU8995798A (en) * | 1997-09-05 | 1999-03-29 | Idioma Ltd. | Interactive system for teaching speech pronunciation and reading |
US6343267B1 (en) * | 1998-04-30 | 2002-01-29 | Matsushita Electric Industrial Co., Ltd. | Dimensionality reduction for speaker normalization and speaker and environment adaptation using eigenvoice techniques |
US6163768A (en) * | 1998-06-15 | 2000-12-19 | Dragon Systems, Inc. | Non-interactive enrollment in speech recognition |
US6253181B1 (en) * | 1999-01-22 | 2001-06-26 | Matsushita Electric Industrial Co., Ltd. | Speech recognition and teaching apparatus able to rapidly adapt to difficult speech of children and foreign speakers |
US8290768B1 (en) | 2000-06-21 | 2012-10-16 | International Business Machines Corporation | System and method for determining a set of attributes based on content of communications |
US6408277B1 (en) | 2000-06-21 | 2002-06-18 | Banter Limited | System and method for automatic task prioritization |
US9699129B1 (en) | 2000-06-21 | 2017-07-04 | International Business Machines Corporation | System and method for increasing email productivity |
US6795804B1 (en) * | 2000-11-01 | 2004-09-21 | International Business Machines Corporation | System and method for enhancing speech and pattern recognition using multiple transforms |
US7644057B2 (en) | 2001-01-03 | 2010-01-05 | International Business Machines Corporation | System and method for electronic communication management |
US20020010715A1 (en) * | 2001-07-26 | 2002-01-24 | Garry Chinn | System and method for browsing using a limited display device |
US7571097B2 (en) * | 2003-03-13 | 2009-08-04 | Microsoft Corporation | Method for training of subspace coded gaussian models |
US7389230B1 (en) | 2003-04-22 | 2008-06-17 | International Business Machines Corporation | System and method for classification of voice signals |
US8495002B2 (en) * | 2003-05-06 | 2013-07-23 | International Business Machines Corporation | Software tool for training and testing a knowledge base |
US20050187913A1 (en) | 2003-05-06 | 2005-08-25 | Yoram Nelken | Web-based customer service interface |
JP4328698B2 (ja) * | 2004-09-15 | 2009-09-09 | キヤノン株式会社 | 素片セット作成方法および装置 |
CN101657666B (zh) * | 2006-08-21 | 2011-05-18 | 西斜坡公用事业公司 | 管道修复安装的系统和方法 |
US8219404B2 (en) * | 2007-08-09 | 2012-07-10 | Nice Systems, Ltd. | Method and apparatus for recognizing a speaker in lawful interception systems |
US8543398B1 (en) | 2012-02-29 | 2013-09-24 | Google Inc. | Training an automatic speech recognition system using compressed word frequencies |
JP5612014B2 (ja) * | 2012-03-29 | 2014-10-22 | 株式会社東芝 | モデル学習装置、モデル学習方法、及びプログラム |
US8374865B1 (en) | 2012-04-26 | 2013-02-12 | Google Inc. | Sampling training data for an automatic speech recognition system based on a benchmark classification distribution |
US8805684B1 (en) * | 2012-05-31 | 2014-08-12 | Google Inc. | Distributed speaker adaptation |
US8571859B1 (en) | 2012-05-31 | 2013-10-29 | Google Inc. | Multi-stage speaker adaptation |
US8554559B1 (en) | 2012-07-13 | 2013-10-08 | Google Inc. | Localized speech recognition with offload |
US9123333B2 (en) | 2012-09-12 | 2015-09-01 | Google Inc. | Minimum bayesian risk methods for automatic speech recognition |
US9135911B2 (en) * | 2014-02-07 | 2015-09-15 | NexGen Flight LLC | Automated generation of phonemic lexicon for voice activated cockpit management systems |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS58129684A (ja) * | 1982-01-29 | 1983-08-02 | Toshiba Corp | パタ−ン認識装置 |
US4980918A (en) * | 1985-05-09 | 1990-12-25 | International Business Machines Corporation | Speech recognition system with efficient storage and rapid assembly of phonological graphs |
US4751737A (en) * | 1985-11-06 | 1988-06-14 | Motorola Inc. | Template generation method in a speech recognition system |
JPS62231993A (ja) * | 1986-03-25 | 1987-10-12 | インタ−ナシヨナル ビジネス マシ−ンズ コ−ポレ−シヨン | 音声認識方法 |
US4817156A (en) * | 1987-08-10 | 1989-03-28 | International Business Machines Corporation | Rapidly training a speech recognizer to a subsequent speaker given training data of a reference speaker |
JPH02265000A (ja) * | 1989-04-06 | 1990-10-29 | Canon Inc | 音声対話装置 |
-
1991
- 1991-12-05 US US07/802,678 patent/US5278942A/en not_active Expired - Fee Related
-
1992
- 1992-09-08 CA CA002077728A patent/CA2077728C/fr not_active Expired - Fee Related
- 1992-10-05 JP JP4265717A patent/JP2691109B2/ja not_active Expired - Lifetime
- 1992-11-03 EP EP92118815A patent/EP0545083A2/fr not_active Withdrawn
Also Published As
Publication number | Publication date |
---|---|
EP0545083A3 (fr) | 1994-02-23 |
CA2077728C (fr) | 1996-08-06 |
JPH05241589A (ja) | 1993-09-21 |
JP2691109B2 (ja) | 1997-12-17 |
US5278942A (en) | 1994-01-11 |
EP0545083A2 (fr) | 1993-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA2077728A1 (fr) | Appareil de codage vocal a prototypes dependant du locuteurscrees a partir de donnees de reference | |
CA2073991A1 (fr) | Appareil de reconnaissance vocale a codeur de paroles produisant des signaux prototypes | |
CA2089786C (fr) | Appareil de reconnaissance de la parole contextuel utilisant une estimation du mot suivant | |
US6535852B2 (en) | Training of text-to-speech systems | |
US4661915A (en) | Allophone vocoder | |
CA2072721A1 (fr) | Dispositif de codage de paroles a prototyques acoustiques unidimensionnels pour appareil de reconnaissance vocale | |
CA2020242C (fr) | Methode et appareil pour extraire d'un signal des parties contenant des informations pour reconnaitre des variantes de configurations similaires | |
EP0302663B1 (fr) | Procédé et dispositif économiques pour la reconnaissance de la parole | |
CA2126380A1 (fr) | Minimisation du taux d'erreur dans les modeles de chaine combines | |
CA2085895A1 (fr) | Systeme de traitement de paroles continues | |
CA2158847A1 (fr) | Methode et appareil de reconnaissance vocale | |
US4424415A (en) | Formant tracker | |
EP0241768A3 (en) | Synthesizing word baseforms used in speech recognition | |
Cole et al. | Feature-based speaker-independent recognition of isolated English letters | |
EP1005019A3 (fr) | Procédé de mesure de la similarité pour la reconnaissance de la parole basé sur un découpage en segments | |
CA2068041A1 (fr) | Algorithme rapide de derivation de prototypes acoustiques pour la reconnaissance vocale automatique | |
EP0071716B1 (fr) | Vocodeur allophonique | |
ATE122171T1 (de) | Spracherkennung. | |
EP0042590B1 (fr) | Dispositif d'extraction de phonèmes | |
JP2002236494A (ja) | 音声区間判別装置、音声認識装置、プログラム及び記録媒体 | |
EP0465639A1 (fr) | Apprentissage par association de series chronologiques | |
US5544277A (en) | Speech coding apparatus and method for generating acoustic feature vector component values by combining values of the same features for multiple time intervals | |
Blomberg | Adaptation to a speaker's voice in a speech recognition system based on synthetic phoneme references | |
GB2231698A (en) | Speech recognition | |
JP2502493B2 (ja) | 規則合成音のリズム制御方法 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |