JP2012522278A - 統計的言語モデルへの適応 - Google Patents

統計的言語モデルへの適応 Download PDF

Info

Publication number
JP2012522278A
JP2012522278A JP2012503537A JP2012503537A JP2012522278A JP 2012522278 A JP2012522278 A JP 2012522278A JP 2012503537 A JP2012503537 A JP 2012503537A JP 2012503537 A JP2012503537 A JP 2012503537A JP 2012522278 A JP2012522278 A JP 2012522278A
Authority
JP
Japan
Prior art keywords
term memory
word
context
candidate
short
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2012503537A
Other languages
English (en)
Japanese (ja)
Other versions
JP2012522278A5 (https=
Inventor
克年 大附
孝史 梅岡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of JP2012522278A publication Critical patent/JP2012522278A/ja
Publication of JP2012522278A5 publication Critical patent/JP2012522278A5/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233Character input methods
    • G06F3/0237Character input methods using prediction or retrieval techniques
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/183Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197Probabilistic grammars, e.g. word n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
JP2012503537A 2009-03-30 2010-03-26 統計的言語モデルへの適応 Pending JP2012522278A (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US12/413,606 2009-03-30
US12/413,606 US8798983B2 (en) 2009-03-30 2009-03-30 Adaptation for statistical language model
PCT/US2010/028932 WO2010117688A2 (en) 2009-03-30 2010-03-26 Adaptation for statistical language model

Publications (2)

Publication Number Publication Date
JP2012522278A true JP2012522278A (ja) 2012-09-20
JP2012522278A5 JP2012522278A5 (https=) 2013-05-09

Family

ID=42785345

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2012503537A Pending JP2012522278A (ja) 2009-03-30 2010-03-26 統計的言語モデルへの適応

Country Status (6)

Country Link
US (1) US8798983B2 (https=)
JP (1) JP2012522278A (https=)
KR (1) KR101679445B1 (https=)
CN (1) CN102369567B (https=)
TW (1) TWI484476B (https=)
WO (1) WO2010117688A2 (https=)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8688454B2 (en) * 2011-07-06 2014-04-01 Sri International Method and apparatus for adapting a language model in response to error correction
KR101478146B1 (ko) * 2011-12-15 2015-01-02 한국전자통신연구원 화자 그룹 기반 음성인식 장치 및 방법
US8918408B2 (en) * 2012-08-24 2014-12-23 Microsoft Corporation Candidate generation for predictive input using input history
CN102968986B (zh) * 2012-11-07 2015-01-28 华南理工大学 基于长时特征和短时特征的重叠语音与单人语音区分方法
US10726831B2 (en) * 2014-05-20 2020-07-28 Amazon Technologies, Inc. Context interpretation in natural language processing using previous dialog acts
US9703394B2 (en) * 2015-03-24 2017-07-11 Google Inc. Unlearning techniques for adaptive language models in text entry
CN108241440B (zh) * 2016-12-27 2023-02-17 北京搜狗科技发展有限公司 一种候选词展示方法和装置
US10535342B2 (en) * 2017-04-10 2020-01-14 Microsoft Technology Licensing, Llc Automatic learning of language models
CN109981328B (zh) * 2017-12-28 2022-02-25 中国移动通信集团陕西有限公司 一种故障预警方法及装置
CN112508197B (zh) * 2020-11-27 2024-02-20 高明昕 人工智能设备的控制方法、控制装置和人工智能设备
CN117313790A (zh) * 2023-09-26 2023-12-29 山东新一代信息产业技术研究院有限公司 一种增强大模型上下文方法及系统
CN119293191A (zh) * 2024-12-09 2025-01-10 北京罗克维尔斯科技有限公司 基于记忆系统的交互方法、装置、设备、存储介质及车辆

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10240288A (ja) * 1997-02-28 1998-09-11 Philips Electron Nv 言語モデル適合による音声認識方法
JP2002366190A (ja) * 2001-06-07 2002-12-20 Nippon Hoso Kyokai <Nhk> 統計的言語モデル生成装置および統計的言語モデル生成プログラム
JP2005208643A (ja) * 2004-01-20 2005-08-04 Microsoft Corp ユーザ訂正を用いた自動音声認識学習のためのシステムおよび方法
JP2008216341A (ja) * 2007-02-28 2008-09-18 Nippon Hoso Kyokai <Nhk> 誤り傾向学習音声認識装置及びコンピュータプログラム
JP2010175765A (ja) * 2009-01-29 2010-08-12 Nippon Hoso Kyokai <Nhk> 音声認識装置および音声認識プログラム

Family Cites Families (30)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW283774B (en) * 1994-12-31 1996-08-21 Lin-Shan Lii Intelligently vocal chinese input method and chinese dictation machine
CN1311881A (zh) 1998-06-04 2001-09-05 松下电器产业株式会社 语言变换规则产生装置、语言变换装置及程序记录媒体
US7403888B1 (en) * 1999-11-05 2008-07-22 Microsoft Corporation Language input user interface
US6848080B1 (en) * 1999-11-05 2005-01-25 Microsoft Corporation Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
US7107204B1 (en) * 2000-04-24 2006-09-12 Microsoft Corporation Computer-aided writing system and method with cross-language writing wizard
US7013258B1 (en) * 2001-03-07 2006-03-14 Lenovo (Singapore) Pte. Ltd. System and method for accelerating Chinese text input
US7103534B2 (en) 2001-03-31 2006-09-05 Microsoft Corporation Machine learning contextual approach to word determination for text input via reduced keypad keys
JP4215418B2 (ja) * 2001-08-24 2009-01-28 インターナショナル・ビジネス・マシーンズ・コーポレーション 単語予測方法、音声認識方法、その方法を用いた音声認識装置及びプログラム
US20050043948A1 (en) * 2001-12-17 2005-02-24 Seiichi Kashihara Speech recognition method remote controller, information terminal, telephone communication terminal and speech recognizer
US20040003392A1 (en) 2002-06-26 2004-01-01 Koninklijke Philips Electronics N.V. Method and apparatus for finding and updating user group preferences in an entertainment system
TWI225640B (en) * 2002-06-28 2004-12-21 Samsung Electronics Co Ltd Voice recognition device, observation probability calculating device, complex fast fourier transform calculation device and method, cache device, and method of controlling the cache device
US20050027534A1 (en) 2003-07-30 2005-02-03 Meurs Pim Van Phonetic and stroke input methods of Chinese characters and phrases
US7542907B2 (en) * 2003-12-19 2009-06-02 International Business Machines Corporation Biasing a speech recognizer based on prompt context
US7478033B2 (en) 2004-03-16 2009-01-13 Google Inc. Systems and methods for translating Chinese pinyin to Chinese characters
US7406416B2 (en) * 2004-03-26 2008-07-29 Microsoft Corporation Representation of a deleted interpolation N-gram language model in ARPA standard format
KR100718147B1 (ko) 2005-02-01 2007-05-14 삼성전자주식회사 음성인식용 문법망 생성장치 및 방법과 이를 이용한 대화체음성인식장치 및 방법
US7379870B1 (en) 2005-02-03 2008-05-27 Hrl Laboratories, Llc Contextual filtering
US8117540B2 (en) 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
JP4769031B2 (ja) * 2005-06-24 2011-09-07 マイクロソフト コーポレーション 言語モデルを作成する方法、かな漢字変換方法、その装置、コンピュータプログラムおよびコンピュータ読み取り可能な記憶媒体
JP4197344B2 (ja) 2006-02-20 2008-12-17 インターナショナル・ビジネス・マシーンズ・コーポレーション 音声対話システム
CN101034390A (zh) 2006-03-10 2007-09-12 日电(中国)有限公司 用于语言模型切换和自适应的装置和方法
US7912700B2 (en) 2007-02-08 2011-03-22 Microsoft Corporation Context based word prediction
US7809719B2 (en) 2007-02-08 2010-10-05 Microsoft Corporation Predicting textual candidates
US8028230B2 (en) 2007-02-12 2011-09-27 Google Inc. Contextual input method
US20090030687A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Adapting an unstructured language model speech recognition system based on usage
CN101286094A (zh) * 2007-04-10 2008-10-15 谷歌股份有限公司 多模式输入法编辑器
KR101465770B1 (ko) 2007-06-25 2014-11-27 구글 인코포레이티드 단어 확률 결정
US8010465B2 (en) * 2008-02-26 2011-08-30 Microsoft Corporation Predicting candidates using input scopes
EP2329492A1 (en) * 2008-09-19 2011-06-08 Dolby Laboratories Licensing Corporation Upstream quality enhancement signal processing for resource constrained client devices
US8386249B2 (en) * 2009-12-11 2013-02-26 International Business Machines Corporation Compressing feature space transforms

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10240288A (ja) * 1997-02-28 1998-09-11 Philips Electron Nv 言語モデル適合による音声認識方法
JP2002366190A (ja) * 2001-06-07 2002-12-20 Nippon Hoso Kyokai <Nhk> 統計的言語モデル生成装置および統計的言語モデル生成プログラム
JP2005208643A (ja) * 2004-01-20 2005-08-04 Microsoft Corp ユーザ訂正を用いた自動音声認識学習のためのシステムおよび方法
JP2008216341A (ja) * 2007-02-28 2008-09-18 Nippon Hoso Kyokai <Nhk> 誤り傾向学習音声認識装置及びコンピュータプログラム
JP2010175765A (ja) * 2009-01-29 2010-08-12 Nippon Hoso Kyokai <Nhk> 音声認識装置および音声認識プログラム

Also Published As

Publication number Publication date
US8798983B2 (en) 2014-08-05
WO2010117688A2 (en) 2010-10-14
KR20120018114A (ko) 2012-02-29
CN102369567B (zh) 2013-07-17
WO2010117688A3 (en) 2011-01-13
US20100250251A1 (en) 2010-09-30
TWI484476B (zh) 2015-05-11
TW201035968A (en) 2010-10-01
CN102369567A (zh) 2012-03-07
KR101679445B1 (ko) 2016-11-24

Similar Documents

Publication Publication Date Title
JP2012522278A (ja) 統計的言語モデルへの適応
JP5901001B1 (ja) 音響言語モデルトレーニングのための方法およびデバイス
KR102596446B1 (ko) 모바일 디바이스들에서의 모달리티 학습
CN102150156B (zh) 优化用于机器翻译的参数
US9202461B2 (en) Sampling training data for an automatic speech recognition system based on a benchmark classification distribution
US9697819B2 (en) Method for building a speech feature library, and method, apparatus, device, and computer readable storage media for speech synthesis
WO2020163422A1 (en) Enhancing hybrid self-attention structure with relative-position-aware bias for speech synthesis
CN112926306B (zh) 文本纠错方法、装置、设备以及存储介质
US20100235780A1 (en) System and Method for Identifying Words Based on a Sequence of Keyboard Events
WO2015169134A1 (en) Method and apparatus for phonetically annotating text
KR20100135819A (ko) 스케일된 확률들을 사용한 단어들의 분절
KR20090109585A (ko) 문맥적 입력 방법
CN103854643B (zh) 用于合成语音的方法和装置
JP2015094848A (ja) 情報処理装置、情報処理方法、およびプログラム
JP2013148697A (ja) 情報処理装置、大語彙連続音声認識方法及びプログラム
CN110352423A (zh) 序列转换神经网络
JP4974470B2 (ja) Arpa標準フォーマットによる、削除補間nグラム言語モデルの表現
JP2024546500A (ja) ラティス音声補正
WO2018232591A1 (en) Sequence recognition processing
US9135326B2 (en) Text mining method, text mining device and text mining program
CN109597881B (zh) 匹配度确定方法、装置、设备和介质
WO2020163157A1 (en) Unsupervised automatic speech recognition
CN112966513A (zh) 用于实体链接的方法和装置
US20150058011A1 (en) Information processing apparatus, information updating method and computer-readable storage medium
JP7212596B2 (ja) 学習装置、学習方法および学習プログラム

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130318

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20130318

RD03 Notification of appointment of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7423

Effective date: 20130701

RD04 Notification of resignation of power of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7424

Effective date: 20130718

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20140326

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20140625

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20150216

A711 Notification of change in applicant

Free format text: JAPANESE INTERMEDIATE CODE: A711

Effective date: 20150514

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20150518

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20160104