CN1230764C - Apparatus, method, and computer system for speech recognition - Google Patents
- Publication number
- CN1230764C
- Authority
- CN
- China
- Prior art keywords
- word
- model
- probability
- dictionary
- good
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Machine Translation (AREA)
Claims (19)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP370413/1999 | 1999-12-27 | | |
JP37041399A JP3426176B2 (ja) | 1999-12-27 | 1999-12-27 | Speech recognition apparatus, method, computer system, and storage medium |
JP370413/99 | 1999-12-27 | | |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1302025A (zh) | 2001-07-04 |
CN1230764C (zh) | 2005-12-07 |
Family
ID=18496852
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB00135969XA (CN1230764C, zh; Expired - Fee Related) | 1999-12-27 | 2000-12-19 | Apparatus, method, and computer system for speech recognition |
Country Status (3)
Country | Link |
---|---|
US (1) | US6917910B2 (zh) |
JP (1) | JP3426176B2 (zh) |
CN (1) | CN1230764C (zh) |
Families Citing this family (24)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7533020B2 (en) * | 2001-09-28 | 2009-05-12 | Nuance Communications, Inc. | Method and apparatus for performing relational speech recognition |
US7308404B2 (en) * | 2001-09-28 | 2007-12-11 | Sri International | Method and apparatus for speech recognition using a dynamic vocabulary |
US8161069B1 (en) | 2007-02-01 | 2012-04-17 | Eighty-Three Degrees, Inc. | Content sharing using metadata |
US9286935B1 (en) * | 2007-01-29 | 2016-03-15 | Start Project, LLC | Simplified data entry |
US7912700B2 (en) * | 2007-02-08 | 2011-03-22 | Microsoft Corporation | Context based word prediction |
US7809719B2 (en) * | 2007-02-08 | 2010-10-05 | Microsoft Corporation | Predicting textual candidates |
WO2008151465A1 (en) * | 2007-06-14 | 2008-12-18 | Google Inc. | Dictionary word and phrase determination |
CN101779200B (zh) | 2007-06-14 | 2013-03-20 | 谷歌股份有限公司 | Dictionary word and phrase determination method and apparatus |
JP2009230068A (ja) * | 2008-03-25 | 2009-10-08 | Denso Corp | Speech recognition device and navigation system |
US9043209B2 (en) * | 2008-11-28 | 2015-05-26 | Nec Corporation | Language model creation device |
US9424246B2 (en) | 2009-03-30 | 2016-08-23 | Touchtype Ltd. | System and method for inputting text into electronic devices |
US10191654B2 (en) | 2009-03-30 | 2019-01-29 | Touchtype Limited | System and method for inputting text into electronic devices |
GB0905457D0 (en) * | 2009-03-30 | 2009-05-13 | Touchtype Ltd | System and method for inputting text into electronic devices |
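WO2012093451A1 (ja) * | 2011-01-07 | 2012-07-12 | 日本電気株式会社 | Speech recognition system, speech recognition method, and speech recognition program |
CN103578465B (zh) * | 2013-10-18 | 2016-08-17 | 威盛电子股份有限公司 | Speech recognition method and electronic device |
CN105531758B (zh) | 2014-07-17 | 2019-10-01 | 微软技术许可有限责任公司 | Speech recognition using foreign word grammar |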
EP3264413B1 (en) * | 2015-02-23 | 2020-10-21 | Sony Corporation | Information processing system and method |
CN105869624B (zh) * | 2016-03-29 | 2019-05-10 | 腾讯科技(深圳)有限公司 | Method and apparatus for constructing a speech decoding network in digital speech recognition |
GB201610984D0 (en) | 2016-06-23 | 2016-08-10 | Microsoft Technology Licensing Llc | Suppression of input images |
CN106254696A (zh) * | 2016-08-02 | 2016-12-21 | 北京京东尚科信息技术有限公司 | Method, apparatus, and system for determining outbound call results |
KR102455067B1 (ko) * | 2017-11-24 | 2022-10-17 | 삼성전자주식회사 | Electronic device and control method thereof |
US11295732B2 (en) * | 2019-08-01 | 2022-04-05 | Soundhound, Inc. | Dynamic interpolation for hybrid language models |
CN111326160A (zh) * | 2020-03-11 | 2020-06-23 | 南京奥拓电子科技有限公司 | Speech recognition method, system, and storage medium for correcting noisy text |
CN112397059B (zh) * | 2020-11-10 | 2024-02-06 | 武汉天有科技有限公司 | Method and apparatus for detecting speech fluency |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS6152698A (ja) | 1984-08-23 | 1986-03-15 | 日立電子エンジニアリング株式会社 | Speech recognition device |
US5261009A (en) * | 1985-10-15 | 1993-11-09 | Palantir Corporation | Means for resolving ambiguities in text passed upon character context |
JP3240691B2 (ja) | 1992-07-07 | 2001-12-17 | 日本電信電話株式会社 | Speech recognition method |
JPH07104782A (ja) | 1993-10-04 | 1995-04-21 | Atr Onsei Honyaku Tsushin Kenkyusho:Kk | Speech recognition device |
US5797123A (en) * | 1996-10-01 | 1998-08-18 | Lucent Technologies Inc. | Method of key-phase detection and verification for flexible speech understanding |
US6212498B1 (en) * | 1997-03-28 | 2001-04-03 | Dragon Systems, Inc. | Enrollment in speech recognition |
US6018708A (en) * | 1997-08-26 | 2000-01-25 | Nortel Networks Corporation | Method and apparatus for performing speech recognition utilizing a supplementary lexicon of frequently used orthographies |
US6125345A (en) * | 1997-09-19 | 2000-09-26 | At&T Corporation | Method and apparatus for discriminative utterance verification using multiple confidence measures |
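CN1159662C (zh) * | 1998-05-13 | 2004-07-28 | 国际商业机器公司 | Apparatus and method for automatically generating punctuation marks in continuous speech recognition |
JP3004254B2 (ja) * | 1998-06-12 | 2000-01-31 | 株式会社エイ・ティ・アール音声翻訳通信研究所 | Statistical sequence model generation device, statistical language model generation device, and speech recognition device |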
US6292778B1 (en) * | 1998-10-30 | 2001-09-18 | Lucent Technologies Inc. | Task-independent utterance verification with subword-based minimum verification error training |
US6374217B1 (en) * | 1999-03-12 | 2002-04-16 | Apple Computer, Inc. | Fast update implementation for efficient latent semantic language modeling |
US6385579B1 (en) * | 1999-04-29 | 2002-05-07 | International Business Machines Corporation | Methods and apparatus for forming compound words for use in a continuous speech recognition system |
MXPA02005387A (es) * | 1999-12-02 | 2004-04-21 | Thomson Licensing Sa | Process and device for speech recognition using disjointed language models |
1999
- 1999-12-27: JP application JP37041399A (patent JP3426176B2), Expired - Fee Related
2000
- 2000-12-19: CN application CNB00135969XA (patent CN1230764C), Expired - Fee Related
- 2000-12-26: US application US09/748,542 (patent US6917910B2), Expired - Lifetime
Also Published As
Publication number | Publication date |
---|---|
US20010014859A1 (en) | 2001-08-16 |
JP3426176B2 (ja) | 2003-07-14 |
JP2001188558A (ja) | 2001-07-10 |
US6917910B2 (en) | 2005-07-12 |
CN1302025A (zh) | 2001-07-04 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1230764C (zh) | | Apparatus, method, and computer system for speech recognition |
US11037553B2 (en) | | Learning-type interactive device |
US10672391B2 (en) | | Improving automatic speech recognition of multilingual named entities |
US20180075343A1 (en) | | Processing sequences using convolutional neural networks |
CN110164447B (zh) | | Spoken language scoring method and apparatus |
CN1227657A (zh) | | Natural language parser using dictionary-based part-of-speech probabilities |
JP2002524806A (ja) | | Interactive user interface for networks using speech recognition and natural language processing |
WO2003010754A1 (fr) | | Voice input search system |
WO2008023470A1 (fr) | | Sentence search method, sentence search engine, computer program, recording medium, and document storage |
US10909972B2 (en) | | Spoken language understanding using dynamic vocabulary |
WO2018057166A1 (en) | | Technologies for improved keyword spotting |
CN105551485A (zh) | | Speech file retrieval method and system |
CN111192572A (zh) | | Semantic recognition method, apparatus, and system |
CN114120985A (zh) | | Soothing interaction method, system, device, and storage medium for an intelligent voice terminal |
EP4295358A1 (en) | | Lookup-table recurrent language model |
CN116483979A (zh) | | Artificial-intelligence-based dialogue model training method, apparatus, device, and medium |
JP2013072887A (ja) | | Dialogue device |
CN116052655A (zh) | | Audio processing method, apparatus, electronic device, and readable storage medium |
Matsubara et al. | | Stochastic dependency parsing of spontaneous Japanese spoken language |
JP6147836B2 (ja) | | Dialogue device |
JP2000075892A (ja) | | Method and apparatus for creating a statistical language model for speech recognition |
CN113077792B (zh) | | Buddhist subject term recognition method, apparatus, device, and storage medium |
US20240144917A1 (en) | | Exporting modular encoder features for streaming and deliberation asr |
CN1043490C (zh) | | Reduplicated word conversion method and Chinese character conversion device |
CN1043542C (zh) | | Chinese character conversion device |
Legal Events
Code | Title | Description |
---|---|---|
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
C06 | Publication | |
PB01 | Publication | |
C14 | Grant of patent or utility model | |
GR01 | Patent grant | |
ASS | Succession or assignment of patent right | Owner name: WEICHA COMMUNICATION CO.,LTD.; former owner: INTERNATIONAL BUSINESS MACHINE CORP.; effective date: 2009-09-04 |
C41 | Transfer of patent application or patent right or utility model | |
TR01 | Transfer of patent right | Effective date of registration: 2009-09-04; address after: Massachusetts, USA; patentee after: Nuance Communications Inc.; address before: New York, USA; patentee before: International Business Machines Corp. |
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 2005-12-07; termination date: 2016-12-19 |