KR101679445B1 - 컴퓨터구현 음성 방법 및 시스템 - Google Patents
컴퓨터구현 음성 방법 및 시스템 Download PDFInfo
- Publication number
- KR101679445B1 KR101679445B1 KR1020117022845A KR20117022845A KR101679445B1 KR 101679445 B1 KR101679445 B1 KR 101679445B1 KR 1020117022845 A KR1020117022845 A KR 1020117022845A KR 20117022845 A KR20117022845 A KR 20117022845A KR 101679445 B1 KR101679445 B1 KR 101679445B1
- Authority
- KR
- South Korea
- Prior art keywords
- context
- term memory
- computer
- constraint
- word
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/187—Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
- G06F3/0237—Character input methods using prediction or retrieval techniques
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/126—Character encoding
- G06F40/129—Handling non-Latin characters, e.g. kana-to-kanji conversion
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/205—Parsing
- G06F40/216—Parsing using statistical methods
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
- G10L15/19—Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
- G10L15/197—Probabilistic grammars, e.g. word n-grams
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- General Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US12/413,606 | 2009-03-30 | ||
| US12/413,606 US8798983B2 (en) | 2009-03-30 | 2009-03-30 | Adaptation for statistical language model |
| PCT/US2010/028932 WO2010117688A2 (en) | 2009-03-30 | 2010-03-26 | Adaptation for statistical language model |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| KR20120018114A KR20120018114A (ko) | 2012-02-29 |
| KR101679445B1 true KR101679445B1 (ko) | 2016-11-24 |
Family
ID=42785345
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| KR1020117022845A Expired - Fee Related KR101679445B1 (ko) | 2009-03-30 | 2010-03-26 | 컴퓨터구현 음성 방법 및 시스템 |
Country Status (6)
| Country | Link |
|---|---|
| US (1) | US8798983B2 (https=) |
| JP (1) | JP2012522278A (https=) |
| KR (1) | KR101679445B1 (https=) |
| CN (1) | CN102369567B (https=) |
| TW (1) | TWI484476B (https=) |
| WO (1) | WO2010117688A2 (https=) |
Families Citing this family (12)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US8688454B2 (en) * | 2011-07-06 | 2014-04-01 | Sri International | Method and apparatus for adapting a language model in response to error correction |
| KR101478146B1 (ko) * | 2011-12-15 | 2015-01-02 | 한국전자통신연구원 | 화자 그룹 기반 음성인식 장치 및 방법 |
| US8918408B2 (en) * | 2012-08-24 | 2014-12-23 | Microsoft Corporation | Candidate generation for predictive input using input history |
| CN102968986B (zh) * | 2012-11-07 | 2015-01-28 | 华南理工大学 | 基于长时特征和短时特征的重叠语音与单人语音区分方法 |
| US10726831B2 (en) * | 2014-05-20 | 2020-07-28 | Amazon Technologies, Inc. | Context interpretation in natural language processing using previous dialog acts |
| US9703394B2 (en) * | 2015-03-24 | 2017-07-11 | Google Inc. | Unlearning techniques for adaptive language models in text entry |
| CN108241440B (zh) * | 2016-12-27 | 2023-02-17 | 北京搜狗科技发展有限公司 | 一种候选词展示方法和装置 |
| US10535342B2 (en) * | 2017-04-10 | 2020-01-14 | Microsoft Technology Licensing, Llc | Automatic learning of language models |
| CN109981328B (zh) * | 2017-12-28 | 2022-02-25 | 中国移动通信集团陕西有限公司 | 一种故障预警方法及装置 |
| CN112508197B (zh) * | 2020-11-27 | 2024-02-20 | 高明昕 | 人工智能设备的控制方法、控制装置和人工智能设备 |
| CN117313790A (zh) * | 2023-09-26 | 2023-12-29 | 山东新一代信息产业技术研究院有限公司 | 一种增强大模型上下文方法及系统 |
| CN119293191A (zh) * | 2024-12-09 | 2025-01-10 | 北京罗克维尔斯科技有限公司 | 基于记忆系统的交互方法、装置、设备、存储介质及车辆 |
Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2002366190A (ja) | 2001-06-07 | 2002-12-20 | Nippon Hoso Kyokai <Nhk> | 統計的言語モデル生成装置および統計的言語モデル生成プログラム |
| US20050060138A1 (en) | 1999-11-05 | 2005-03-17 | Microsoft Corporation | Language conversion and display |
Family Cites Families (33)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| TW283774B (en) * | 1994-12-31 | 1996-08-21 | Lin-Shan Lii | Intelligently vocal chinese input method and chinese dictation machine |
| DE19708183A1 (de) * | 1997-02-28 | 1998-09-03 | Philips Patentverwaltung | Verfahren zur Spracherkennung mit Sprachmodellanpassung |
| CN1311881A (zh) | 1998-06-04 | 2001-09-05 | 松下电器产业株式会社 | 语言变换规则产生装置、语言变换装置及程序记录媒体 |
| US6848080B1 (en) * | 1999-11-05 | 2005-01-25 | Microsoft Corporation | Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors |
| US7107204B1 (en) * | 2000-04-24 | 2006-09-12 | Microsoft Corporation | Computer-aided writing system and method with cross-language writing wizard |
| US7013258B1 (en) * | 2001-03-07 | 2006-03-14 | Lenovo (Singapore) Pte. Ltd. | System and method for accelerating Chinese text input |
| US7103534B2 (en) | 2001-03-31 | 2006-09-05 | Microsoft Corporation | Machine learning contextual approach to word determination for text input via reduced keypad keys |
| JP4215418B2 (ja) * | 2001-08-24 | 2009-01-28 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 単語予測方法、音声認識方法、その方法を用いた音声認識装置及びプログラム |
| US20050043948A1 (en) * | 2001-12-17 | 2005-02-24 | Seiichi Kashihara | Speech recognition method remote controller, information terminal, telephone communication terminal and speech recognizer |
| US20040003392A1 (en) | 2002-06-26 | 2004-01-01 | Koninklijke Philips Electronics N.V. | Method and apparatus for finding and updating user group preferences in an entertainment system |
| TWI225640B (en) * | 2002-06-28 | 2004-12-21 | Samsung Electronics Co Ltd | Voice recognition device, observation probability calculating device, complex fast fourier transform calculation device and method, cache device, and method of controlling the cache device |
| US20050027534A1 (en) | 2003-07-30 | 2005-02-03 | Meurs Pim Van | Phonetic and stroke input methods of Chinese characters and phrases |
| US7542907B2 (en) * | 2003-12-19 | 2009-06-02 | International Business Machines Corporation | Biasing a speech recognizer based on prompt context |
| US8019602B2 (en) * | 2004-01-20 | 2011-09-13 | Microsoft Corporation | Automatic speech recognition learning using user corrections |
| US7478033B2 (en) | 2004-03-16 | 2009-01-13 | Google Inc. | Systems and methods for translating Chinese pinyin to Chinese characters |
| US7406416B2 (en) * | 2004-03-26 | 2008-07-29 | Microsoft Corporation | Representation of a deleted interpolation N-gram language model in ARPA standard format |
| KR100718147B1 (ko) | 2005-02-01 | 2007-05-14 | 삼성전자주식회사 | 음성인식용 문법망 생성장치 및 방법과 이를 이용한 대화체음성인식장치 및 방법 |
| US7379870B1 (en) | 2005-02-03 | 2008-05-27 | Hrl Laboratories, Llc | Contextual filtering |
| US8117540B2 (en) | 2005-05-18 | 2012-02-14 | Neuer Wall Treuhand Gmbh | Method and device incorporating improved text input mechanism |
| JP4769031B2 (ja) * | 2005-06-24 | 2011-09-07 | マイクロソフト コーポレーション | 言語モデルを作成する方法、かな漢字変換方法、その装置、コンピュータプログラムおよびコンピュータ読み取り可能な記憶媒体 |
| JP4197344B2 (ja) | 2006-02-20 | 2008-12-17 | インターナショナル・ビジネス・マシーンズ・コーポレーション | 音声対話システム |
| CN101034390A (zh) | 2006-03-10 | 2007-09-12 | 日电(中国)有限公司 | 用于语言模型切换和自适应的装置和方法 |
| US7912700B2 (en) | 2007-02-08 | 2011-03-22 | Microsoft Corporation | Context based word prediction |
| US7809719B2 (en) | 2007-02-08 | 2010-10-05 | Microsoft Corporation | Predicting textual candidates |
| US8028230B2 (en) | 2007-02-12 | 2011-09-27 | Google Inc. | Contextual input method |
| JP4852448B2 (ja) * | 2007-02-28 | 2012-01-11 | 日本放送協会 | 誤り傾向学習音声認識装置及びコンピュータプログラム |
| US20090030687A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Adapting an unstructured language model speech recognition system based on usage |
| CN101286094A (zh) * | 2007-04-10 | 2008-10-15 | 谷歌股份有限公司 | 多模式输入法编辑器 |
| KR101465770B1 (ko) | 2007-06-25 | 2014-11-27 | 구글 인코포레이티드 | 단어 확률 결정 |
| US8010465B2 (en) * | 2008-02-26 | 2011-08-30 | Microsoft Corporation | Predicting candidates using input scopes |
| EP2329492A1 (en) * | 2008-09-19 | 2011-06-08 | Dolby Laboratories Licensing Corporation | Upstream quality enhancement signal processing for resource constrained client devices |
| JP5054711B2 (ja) * | 2009-01-29 | 2012-10-24 | 日本放送協会 | 音声認識装置および音声認識プログラム |
| US8386249B2 (en) * | 2009-12-11 | 2013-02-26 | International Business Machines Corporation | Compressing feature space transforms |
-
2009
- 2009-03-30 US US12/413,606 patent/US8798983B2/en not_active Expired - Fee Related
-
2010
- 2010-02-23 TW TW099105182A patent/TWI484476B/zh not_active IP Right Cessation
- 2010-03-26 WO PCT/US2010/028932 patent/WO2010117688A2/en not_active Ceased
- 2010-03-26 CN CN2010800158015A patent/CN102369567B/zh not_active Expired - Fee Related
- 2010-03-26 JP JP2012503537A patent/JP2012522278A/ja active Pending
- 2010-03-26 KR KR1020117022845A patent/KR101679445B1/ko not_active Expired - Fee Related
Patent Citations (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20050060138A1 (en) | 1999-11-05 | 2005-03-17 | Microsoft Corporation | Language conversion and display |
| JP2002366190A (ja) | 2001-06-07 | 2002-12-20 | Nippon Hoso Kyokai <Nhk> | 統計的言語モデル生成装置および統計的言語モデル生成プログラム |
Also Published As
| Publication number | Publication date |
|---|---|
| US8798983B2 (en) | 2014-08-05 |
| WO2010117688A2 (en) | 2010-10-14 |
| KR20120018114A (ko) | 2012-02-29 |
| CN102369567B (zh) | 2013-07-17 |
| WO2010117688A3 (en) | 2011-01-13 |
| US20100250251A1 (en) | 2010-09-30 |
| TWI484476B (zh) | 2015-05-11 |
| TW201035968A (en) | 2010-10-01 |
| CN102369567A (zh) | 2012-03-07 |
| JP2012522278A (ja) | 2012-09-20 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| KR101679445B1 (ko) | 컴퓨터구현 음성 방법 및 시스템 | |
| JP5901001B1 (ja) | 音響言語モデルトレーニングのための方法およびデバイス | |
| US7953692B2 (en) | Predicting candidates using information sources | |
| CN102150156B (zh) | 优化用于机器翻译的参数 | |
| US10402493B2 (en) | System and method for inputting text into electronic devices | |
| US9189472B2 (en) | System and method for inputting text into small screen devices | |
| AU2010346493B2 (en) | Speech correction for typed input | |
| CN101833547B (zh) | 基于个人语料库进行短语级预测输入的方法 | |
| US9659002B2 (en) | System and method for inputting text into electronic devices | |
| JP5462001B2 (ja) | 文脈上の入力方法 | |
| US20100235780A1 (en) | System and Method for Identifying Words Based on a Sequence of Keyboard Events | |
| EP2542951A2 (en) | System and method for inputting text into electronic devices | |
| JPWO2014073206A1 (ja) | 情報処理装置、及び、情報処理方法 | |
| Liu et al. | Building neural network language model with POS-based negative sampling and stochastic conjugate gradient descent | |
| Heidel et al. | Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm. | |
| CN106030568A (zh) | 自然语言处理系统、自然语言处理方法、以及自然语言处理程序 | |
| US20120284016A1 (en) | Text mining method, text mining device and text mining program | |
| CN118761389B (zh) | 一种藏语机翻系统及藏语文本自动分段方法 | |
| US20130110491A1 (en) | Discriminative learning of feature functions of generative type in speech translation | |
| KR20100069555A (ko) | 음성 인식 시스템 및 방법 | |
| Singh | On-Device User-Adaptive Next Word Prediction System | |
| JP6588933B2 (ja) | 言語モデル構築装置、その方法、及びプログラム | |
| CN111813891A (zh) | 语言模型的训练、预测词的出现概率的方法和装置 | |
| Kong et al. | Research for Uyghur-Chinese Neural Machine Translation | |
| Toselli et al. | Interactive Text Generation |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| PA0105 | International application |
St.27 status event code: A-0-1-A10-A15-nap-PA0105 |
|
| PG1501 | Laying open of application |
St.27 status event code: A-1-1-Q10-Q12-nap-PG1501 |
|
| P22-X000 | Classification modified |
St.27 status event code: A-2-2-P10-P22-nap-X000 |
|
| PN2301 | Change of applicant |
St.27 status event code: A-3-3-R10-R13-asn-PN2301 St.27 status event code: A-3-3-R10-R11-asn-PN2301 |
|
| A201 | Request for examination | ||
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| PA0201 | Request for examination |
St.27 status event code: A-1-2-D10-D11-exm-PA0201 |
|
| N231 | Notification of change of applicant | ||
| PN2301 | Change of applicant |
St.27 status event code: A-3-3-R10-R13-asn-PN2301 St.27 status event code: A-3-3-R10-R11-asn-PN2301 |
|
| E902 | Notification of reason for refusal | ||
| PE0902 | Notice of grounds for rejection |
St.27 status event code: A-1-2-D10-D21-exm-PE0902 |
|
| P11-X000 | Amendment of application requested |
St.27 status event code: A-2-2-P10-P11-nap-X000 |
|
| P13-X000 | Application amended |
St.27 status event code: A-2-2-P10-P13-nap-X000 |
|
| E701 | Decision to grant or registration of patent right | ||
| PE0701 | Decision of registration |
St.27 status event code: A-1-2-D10-D22-exm-PE0701 |
|
| GRNT | Written decision to grant | ||
| PR0701 | Registration of establishment |
St.27 status event code: A-2-4-F10-F11-exm-PR0701 |
|
| PR1002 | Payment of registration fee |
St.27 status event code: A-2-2-U10-U12-oth-PR1002 Fee payment year number: 1 |
|
| PG1601 | Publication of registration |
St.27 status event code: A-4-4-Q10-Q13-nap-PG1601 |
|
| FPAY | Annual fee payment |
Payment date: 20191016 Year of fee payment: 4 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 4 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 5 |
|
| PR1001 | Payment of annual fee |
St.27 status event code: A-4-4-U10-U11-oth-PR1001 Fee payment year number: 6 |
|
| PC1903 | Unpaid annual fee |
St.27 status event code: A-4-4-U10-U13-oth-PC1903 Not in force date: 20221119 Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE |
|
| PC1903 | Unpaid annual fee |
St.27 status event code: N-4-6-H10-H13-oth-PC1903 Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE Not in force date: 20221119 |