ES2164870T3 - Reconocimiento del habla. - Google Patents
Reconocimiento del habla.Info
- Publication number
- ES2164870T3 ES2164870T3 ES96904973T ES96904973T ES2164870T3 ES 2164870 T3 ES2164870 T3 ES 2164870T3 ES 96904973 T ES96904973 T ES 96904973T ES 96904973 T ES96904973 T ES 96904973T ES 2164870 T3 ES2164870 T3 ES 2164870T3
- Authority
- ES
- Spain
- Prior art keywords
- recognition
- speech recognition
- pruning
- probability
- probability values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Lifetime
Links
- 238000000034 method Methods 0.000 abstract 2
- 238000013138 pruning Methods 0.000 abstract 2
- GEYOCULIXLDCMW-UHFFFAOYSA-N 1,2-phenylenediamine Chemical compound NC1=CC=CC=C1N GEYOCULIXLDCMW-UHFFFAOYSA-N 0.000 abstract 1
- 230000002028 premature Effects 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/01—Assessment or evaluation of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Machine Translation (AREA)
- Fittings On The Vehicle Exterior For Carrying Loads, And Devices For Holding Or Mounting Articles (AREA)
- Navigation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Telephonic Communication Services (AREA)
- Image Analysis (AREA)
- Selective Calling Equipment (AREA)
- Character Discrimination (AREA)
- Document Processing Apparatus (AREA)
- Computer And Data Communications (AREA)
- Feedback Control In General (AREA)
Abstract
SE PROPORCIONA UN RECONOCEDOR CON UNOS VALORES DE PROBABILIDAD A PRIORI (POR EJEMPLO, DE ALGUN RECONOCIMIENTO ANTERIOR) INDICANDO QUE PROBABILIDADES TIENEN LAS DIFERENTES PALABRAS DEL VOCABULARIO DEL RECONOCEDOR DE OCURRIR EN EL CONTEXTO PARTICULAR, Y A ESTOS VALORES SE LES DAN "PUNTOS" DE RECONOCIMIENTO ANTES DE QUE SE ELIJA UN RESULTADO (O RESULTADOS). EL RECONOCEDOR TAMBIEN EMPLEA "PODA" DE FORMA QUE RESULTADOS PARCIALES DE BAJA PUNTUACION SE DESCARTAN PARA ACELERAR EL PROCESO DE RECONOCIMIENTO. PARA EVITAR LA PODA PREMATURA DE LAS PALABRAS MAS PROBABLES, LOS VALORES DE PROBABILIDAD SE APLICAN ANTES DE QUE SE TOMEN LAS DECISIONES DE PODA. SE DESCRIBE UN METODO PARA APLICAR ESTOS VALORES DE PROBABILIDAD.
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP95301477 | 1995-03-07 |
Publications (1)
Publication Number | Publication Date |
---|---|
ES2164870T3 true ES2164870T3 (es) | 2002-03-01 |
Family
ID=8221113
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
ES96904973T Expired - Lifetime ES2164870T3 (es) | 1995-03-07 | 1996-03-07 | Reconocimiento del habla. |
Country Status (13)
Country | Link |
---|---|
US (1) | US5999902A (es) |
EP (1) | EP0813735B1 (es) |
JP (1) | JP4180110B2 (es) |
KR (1) | KR100406604B1 (es) |
CN (1) | CN1150515C (es) |
AU (1) | AU702903B2 (es) |
CA (1) | CA2211636C (es) |
DE (1) | DE69615667T2 (es) |
ES (1) | ES2164870T3 (es) |
MX (1) | MX9706407A (es) |
NO (1) | NO974097L (es) |
NZ (1) | NZ302748A (es) |
WO (1) | WO1996027872A1 (es) |
Families Citing this family (66)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3061114B2 (ja) * | 1996-11-25 | 2000-07-10 | 日本電気株式会社 | 音声認識装置 |
GB9723214D0 (en) * | 1997-11-03 | 1998-01-07 | British Telecomm | Pattern recognition |
US6411929B1 (en) * | 1997-11-27 | 2002-06-25 | Hitachi, Ltd. | Speech recognition method and system |
US7937260B1 (en) * | 1998-06-15 | 2011-05-03 | At&T Intellectual Property Ii, L.P. | Concise dynamic grammars using N-best selection |
US6574596B2 (en) * | 1999-02-08 | 2003-06-03 | Qualcomm Incorporated | Voice recognition rejection scheme |
JP2002539528A (ja) * | 1999-03-05 | 2002-11-19 | キヤノン株式会社 | データベース注釈付け及び検索 |
US20050149462A1 (en) * | 1999-10-14 | 2005-07-07 | The Salk Institute For Biological Studies | System and method of separating signals |
US6424960B1 (en) * | 1999-10-14 | 2002-07-23 | The Salk Institute For Biological Studies | Unsupervised adaptation and classification of multiple classes and sources in blind signal separation |
CN1329861C (zh) * | 1999-10-28 | 2007-08-01 | 佳能株式会社 | 模式匹配方法和装置 |
US7310600B1 (en) | 1999-10-28 | 2007-12-18 | Canon Kabushiki Kaisha | Language recognition using a similarity measure |
US6882970B1 (en) | 1999-10-28 | 2005-04-19 | Canon Kabushiki Kaisha | Language recognition using sequence frequency |
AU1767600A (en) * | 1999-12-23 | 2001-07-09 | Intel Corporation | Speech recognizer with a lexical tree based n-gram language model |
US6920421B2 (en) * | 1999-12-28 | 2005-07-19 | Sony Corporation | Model adaptive apparatus for performing adaptation of a model used in pattern recognition considering recentness of a received pattern data |
GB0011798D0 (en) * | 2000-05-16 | 2000-07-05 | Canon Kk | Database annotation and retrieval |
GB0015233D0 (en) | 2000-06-21 | 2000-08-16 | Canon Kk | Indexing method and apparatus |
GB0023930D0 (en) | 2000-09-29 | 2000-11-15 | Canon Kk | Database annotation and retrieval |
GB0027178D0 (en) | 2000-11-07 | 2000-12-27 | Canon Kk | Speech processing system |
GB0028277D0 (en) * | 2000-11-20 | 2001-01-03 | Canon Kk | Speech processing system |
DE60233561D1 (de) * | 2001-04-19 | 2009-10-15 | British Telecomm | Sprachantwortsystem |
EP1397797B1 (en) * | 2001-04-19 | 2007-09-12 | BRITISH TELECOMMUNICATIONS public limited company | Speech recognition |
US20030018451A1 (en) * | 2001-07-16 | 2003-01-23 | Level 3 Communications, Inc. | System, method and computer program product for rating enterprise metrics |
JP2003108187A (ja) * | 2001-09-28 | 2003-04-11 | Fujitsu Ltd | 類似性評価方法及び類似性評価プログラム |
KR100450396B1 (ko) * | 2001-10-22 | 2004-09-30 | 한국전자통신연구원 | 트리탐색기반 음성 인식 방법 및 이를 이용한 대용량 연속음성 인식 시스템 |
US7356466B2 (en) * | 2002-06-28 | 2008-04-08 | Samsung Electronics Co., Ltd. | Method and apparatus for performing observation probability calculations |
EP1387232A1 (fr) * | 2002-07-29 | 2004-02-04 | Centre National De La Recherche Scientifique | Procédé de détermination de la valeur à donner à différents paramètres d'un système |
US7228275B1 (en) * | 2002-10-21 | 2007-06-05 | Toyota Infotechnology Center Co., Ltd. | Speech recognition system having multiple speech recognizers |
US7805299B2 (en) * | 2004-03-01 | 2010-09-28 | Coifman Robert E | Method and apparatus for improving the transcription accuracy of speech recognition software |
US7852993B2 (en) * | 2003-08-11 | 2010-12-14 | Microsoft Corporation | Speech recognition enhanced caller identification |
US7899671B2 (en) * | 2004-02-05 | 2011-03-01 | Avaya, Inc. | Recognition results postprocessor for use in voice recognition systems |
US7869588B2 (en) | 2004-05-03 | 2011-01-11 | Somatek | System and method for providing particularized audible alerts |
US9117460B2 (en) * | 2004-05-12 | 2015-08-25 | Core Wireless Licensing S.A.R.L. | Detection of end of utterance in speech recognition system |
US20080004881A1 (en) * | 2004-12-22 | 2008-01-03 | David Attwater | Turn-taking model |
US8200495B2 (en) * | 2005-02-04 | 2012-06-12 | Vocollect, Inc. | Methods and systems for considering information about an expected response when performing speech recognition |
US7865362B2 (en) | 2005-02-04 | 2011-01-04 | Vocollect, Inc. | Method and system for considering information about an expected response when performing speech recognition |
US20090024183A1 (en) | 2005-08-03 | 2009-01-22 | Fitchmun Mark I | Somatic, auditory and cochlear communication system and method |
KR100748720B1 (ko) * | 2006-02-09 | 2007-08-13 | 삼성전자주식회사 | 다중 계층 중심 어휘 목록에 기초하여 대규모 단어 음성인식 방법 및 그 장치 |
WO2007142102A1 (ja) * | 2006-05-31 | 2007-12-13 | Nec Corporation | 言語モデル学習システム、言語モデル学習方法、および言語モデル学習用プログラム |
US7899251B2 (en) * | 2006-06-05 | 2011-03-01 | Microsoft Corporation | Balancing out-of-dictionary and in-dictionary recognition scores |
CN101105894B (zh) * | 2006-07-12 | 2011-08-10 | 陈修志 | 多功能语言学习机 |
KR100925479B1 (ko) * | 2007-09-19 | 2009-11-06 | 한국전자통신연구원 | 음성 인식 방법 및 장치 |
GB2453366B (en) * | 2007-10-04 | 2011-04-06 | Toshiba Res Europ Ltd | Automatic speech recognition method and apparatus |
US7437291B1 (en) | 2007-12-13 | 2008-10-14 | International Business Machines Corporation | Using partial information to improve dialog in automatic speech recognition systems |
US20090198490A1 (en) * | 2008-02-06 | 2009-08-06 | International Business Machines Corporation | Response time when using a dual factor end of utterance determination technique |
US20090307003A1 (en) * | 2008-05-16 | 2009-12-10 | Daniel Benyamin | Social advertisement network |
US8086631B2 (en) * | 2008-12-12 | 2011-12-27 | Microsoft Corporation | Search result diversification |
KR101217525B1 (ko) | 2008-12-22 | 2013-01-18 | 한국전자통신연구원 | 비터비 디코더와 이를 이용한 음성 인식 방법 |
FI20086260A (fi) * | 2008-12-31 | 2010-09-02 | Teknillinen Korkeakoulu | Menetelmä hahmon löytämiseksi ja tunnistamiseksi |
US8442829B2 (en) * | 2009-02-17 | 2013-05-14 | Sony Computer Entertainment Inc. | Automatic computation streaming partition for voice recognition on multiple processors with limited memory |
US8504550B2 (en) * | 2009-05-15 | 2013-08-06 | Citizennet Inc. | Social network message categorization systems and methods |
US8306191B2 (en) * | 2009-06-12 | 2012-11-06 | Avaya Inc. | Caller recognition by voice messaging system |
US8380697B2 (en) * | 2009-10-21 | 2013-02-19 | Citizennet Inc. | Search and retrieval methods and systems of short messages utilizing messaging context and keyword frequency |
US8554854B2 (en) * | 2009-12-11 | 2013-10-08 | Citizennet Inc. | Systems and methods for identifying terms relevant to web pages using social network messages |
US8615434B2 (en) | 2010-10-19 | 2013-12-24 | Citizennet Inc. | Systems and methods for automatically generating campaigns using advertising targeting information based upon affinity information obtained from an online social network |
US8612293B2 (en) | 2010-10-19 | 2013-12-17 | Citizennet Inc. | Generation of advertising targeting information based upon affinity information obtained from an online social network |
US9063927B2 (en) | 2011-04-06 | 2015-06-23 | Citizennet Inc. | Short message age classification |
US9002892B2 (en) | 2011-08-07 | 2015-04-07 | CitizenNet, Inc. | Systems and methods for trend detection using frequency analysis |
US9053497B2 (en) | 2012-04-27 | 2015-06-09 | CitizenNet, Inc. | Systems and methods for targeting advertising to groups with strong ties within an online social network |
CN103544952A (zh) * | 2012-07-12 | 2014-01-29 | 百度在线网络技术(北京)有限公司 | 语音自适应方法、装置及系统 |
US10055767B2 (en) * | 2015-05-13 | 2018-08-21 | Google Llc | Speech recognition for keywords |
CN105356935B (zh) * | 2015-11-27 | 2017-10-31 | 天津光电通信技术有限公司 | 一种实现同步数字体系高阶交叉的交叉板及实现方法 |
JP6618884B2 (ja) * | 2016-11-17 | 2019-12-11 | 株式会社東芝 | 認識装置、認識方法およびプログラム |
US10565320B1 (en) | 2018-09-28 | 2020-02-18 | International Business Machines Corporation | Dynamic multilingual speech recognition |
RU2744063C1 (ru) | 2018-12-18 | 2021-03-02 | Общество С Ограниченной Ответственностью "Яндекс" | Способ и система определения говорящего пользователя управляемого голосом устройства |
KR20220010259A (ko) * | 2020-07-17 | 2022-01-25 | 삼성전자주식회사 | 음성 신호 처리 방법 및 장치 |
CN112786007B (zh) * | 2021-01-20 | 2024-01-26 | 北京有竹居网络技术有限公司 | 语音合成方法、装置、可读介质及电子设备 |
CN117166996B (zh) * | 2023-07-27 | 2024-03-22 | 中国地质大学(北京) | 地质参数门槛值的确定方法、装置、设备及存储介质 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4860358A (en) * | 1983-09-12 | 1989-08-22 | American Telephone And Telegraph Company, At&T Bell Laboratories | Speech recognition arrangement with preselection |
US4783803A (en) * | 1985-11-12 | 1988-11-08 | Dragon Systems, Inc. | Speech recognition apparatus and method |
US5202952A (en) * | 1990-06-22 | 1993-04-13 | Dragon Systems, Inc. | Large-vocabulary continuous speech prefiltering and processing system |
JP2974387B2 (ja) * | 1990-09-05 | 1999-11-10 | 日本電信電話株式会社 | ワードスポッティング音声認識方法 |
KR920013250A (ko) * | 1990-12-28 | 1992-07-28 | 이헌조 | 음성인식 시스템의 변별적 특성을 이용한 숫자음 인식방법 |
US5267345A (en) * | 1992-02-10 | 1993-11-30 | International Business Machines Corporation | Speech recognition apparatus which predicts word classes from context and words from word classes |
JPH06175685A (ja) * | 1992-12-09 | 1994-06-24 | Matsushita Electric Ind Co Ltd | パタン認識装置及びヒドゥンマルコフモデル作成装置 |
US5699456A (en) * | 1994-01-21 | 1997-12-16 | Lucent Technologies Inc. | Large vocabulary connected speech recognition system and method of language representation using evolutional grammar to represent context free grammars |
JP2775140B2 (ja) * | 1994-03-18 | 1998-07-16 | 株式会社エイ・ティ・アール人間情報通信研究所 | パターン認識方法、音声認識方法および音声認識装置 |
-
1996
- 1996-03-07 JP JP52671596A patent/JP4180110B2/ja not_active Expired - Lifetime
- 1996-03-07 ES ES96904973T patent/ES2164870T3/es not_active Expired - Lifetime
- 1996-03-07 CN CNB961923768A patent/CN1150515C/zh not_active Expired - Fee Related
- 1996-03-07 AU AU48876/96A patent/AU702903B2/en not_active Ceased
- 1996-03-07 NZ NZ302748A patent/NZ302748A/en unknown
- 1996-03-07 US US08/875,070 patent/US5999902A/en not_active Expired - Lifetime
- 1996-03-07 MX MX9706407A patent/MX9706407A/es not_active Application Discontinuation
- 1996-03-07 WO PCT/GB1996/000531 patent/WO1996027872A1/en active IP Right Grant
- 1996-03-07 EP EP96904973A patent/EP0813735B1/en not_active Expired - Lifetime
- 1996-03-07 CA CA002211636A patent/CA2211636C/en not_active Expired - Fee Related
- 1996-03-07 KR KR1019970706130A patent/KR100406604B1/ko not_active IP Right Cessation
- 1996-03-07 DE DE69615667T patent/DE69615667T2/de not_active Expired - Lifetime
-
1997
- 1997-09-05 NO NO974097A patent/NO974097L/no unknown
Also Published As
Publication number | Publication date |
---|---|
US5999902A (en) | 1999-12-07 |
JPH11501410A (ja) | 1999-02-02 |
DE69615667T2 (de) | 2002-06-20 |
KR19980702723A (ko) | 1998-08-05 |
EP0813735B1 (en) | 2001-10-04 |
NO974097D0 (no) | 1997-09-05 |
CN1178023A (zh) | 1998-04-01 |
DE69615667D1 (de) | 2001-11-08 |
JP4180110B2 (ja) | 2008-11-12 |
CA2211636C (en) | 2002-01-22 |
CA2211636A1 (en) | 1996-09-12 |
AU702903B2 (en) | 1999-03-11 |
AU4887696A (en) | 1996-09-23 |
CN1150515C (zh) | 2004-05-19 |
NO974097L (no) | 1997-09-08 |
WO1996027872A1 (en) | 1996-09-12 |
NZ302748A (en) | 1999-04-29 |
KR100406604B1 (ko) | 2004-02-18 |
EP0813735A1 (en) | 1997-12-29 |
MX9706407A (es) | 1997-11-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
ES2164870T3 (es) | Reconocimiento del habla. | |
Rickford | Social contact and linguistic diffusion: Hiberno-English and new world Black English | |
CA2089786A1 (en) | Context-dependent speech recognizer using estimated next word context | |
ES2153021T3 (es) | Procedimiento y disposicion para la conversion del habla a texto. | |
EP1128361A3 (en) | Language models for speech recognition | |
CA2275774A1 (en) | Selection of superwords based on criteria relevant to both speech recognition and understanding | |
EP0984428A3 (en) | Method and system for automatically determining phonetic transciptions associated with spelled words | |
BR0013407A (pt) | Método para retirar um acabamento de piso de cura por luz ultravioleta, acabamento de piso curável por luz ultravioleta retirável e piso acabado removìvel | |
EP0664535A3 (en) | Word vocabulary recognition system with extended vocabulary and language representation method using an evolutionary grammar as a non-contextual grammar. | |
EP1220197A3 (en) | Speech recognition method and system | |
EP0978823A3 (en) | Speech recognition | |
DE3779170D1 (de) | Erzeugung von wortgrundstrukturen zur spracherkennung. | |
FR2433767A1 (fr) | Dispositif a eclairage incident pour microscope | |
ATE232638T1 (de) | Verfahren zur spracherkennung | |
ES2141743T3 (es) | Procedimiento de tratamiento ulterior de residuos tecnicos de la industria manipuladora de melazas. | |
BR0103860A (pt) | Sistema de diálogo de fala, e, método para extrair uma sub-sequência de palavra significante de um resultado de reconhecimento produzido por uma unidade de reconhecimento de fala de um sistema de diálogo de fala | |
Scharenborg et al. | A two-pass strategy for handling OOVs in a large vocabulary recognition task | |
ES2044542T3 (es) | Util de corte y procedimiento para su fabricacion. | |
ES2080507T3 (es) | Deformacion superplastica de estructuras de aluminio unidas por difusion. | |
ATE345321T1 (de) | Verfahren zur herstellung von hexafluoraceton und dessen hydrat | |
Sheppard | Electra again | |
ATE212009T1 (de) | Verfahren zur herstellung von gesättigten alkoholen | |
ES2175051T3 (es) | Uso de un pliholosido en una composicion limpiadora o desmaquillante y composicion que lo incluye. | |
Savino | Non-finality and pre-finality in bari Italian intonation: a preliminary account. | |
Choi et al. | Lexical tree decoding with a class-based language model for Chinese speech recognition |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
FG2A | Definitive protection |
Ref document number: 813735 Country of ref document: ES |