KR101679445B1 - Computer-implemented phonetic method and system - Google Patents

Computer-implemented phonetic method and system

Info

Publication number
KR101679445B1
Authority
KR
South Korea
Prior art keywords
context
term memory
computer
constraint
word
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
KR1020117022845A
Other languages
English (en)
Korean (ko)
Other versions
KR20120018114A (ko)
Inventor
Katsutoshi Ohtsuki
Takashi Umeoka
Original Assignee
Microsoft Technology Licensing, LLC
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing, LLC
Publication of KR20120018114A
Application granted
Publication of KR101679445B1
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/06 Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/187 Phonemic context, e.g. pronunciation rules, phonotactical constraints or phoneme n-grams
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 Character input methods
    • G06F3/0237 Character input methods using prediction or retrieval techniques
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/126 Character encoding
    • G06F40/129 Handling non-Latin characters, e.g. kana-to-kanji conversion
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/205 Parsing
    • G06F40/216 Parsing using statistical methods
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/14 Speech classification or search using statistical models, e.g. Hidden Markov Models [HMMs]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/183 Speech classification or search using natural language modelling using context dependencies, e.g. language models
    • G10L15/19 Grammatical context, e.g. disambiguation of the recognition hypotheses based on word sequence rules
    • G10L15/197 Probabilistic grammars, e.g. word n-grams

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Artificial Intelligence (AREA)
  • General Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • General Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)
KR1020117022845A 2009-03-30 2010-03-26 Computer-implemented phonetic method and system Expired - Fee Related KR101679445B1 (ko)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US12/413,606 2009-03-30
US12/413,606 US8798983B2 (en) 2009-03-30 2009-03-30 Adaptation for statistical language model
PCT/US2010/028932 WO2010117688A2 (en) 2009-03-30 2010-03-26 Adaptation for statistical language model

Publications (2)

Publication Number Publication Date
KR20120018114A (ko) 2012-02-29
KR101679445B1 (ko) 2016-11-24

Family

ID=42785345

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020117022845A Computer-implemented phonetic method and system Expired - Fee Related KR101679445B1 (ko) 2009-03-30 2010-03-26

Country Status (6)

Country Link
US (1) US8798983B2
JP (1) JP2012522278A
KR (1) KR101679445B1
CN (1) CN102369567B
TW (1) TWI484476B
WO (1) WO2010117688A2

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8688454B2 (en) * 2011-07-06 2014-04-01 Sri International Method and apparatus for adapting a language model in response to error correction
KR101478146B1 (ko) * 2011-12-15 2015-01-02 Electronics and Telecommunications Research Institute (ETRI) Apparatus and method for speech recognition based on speaker groups
US8918408B2 (en) * 2012-08-24 2014-12-23 Microsoft Corporation Candidate generation for predictive input using input history
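CN102968986B (zh) * 2012-11-07 2015-01-28 South China University of Technology Method for distinguishing overlapped speech from single-speaker speech based on long-term and short-term features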
US10726831B2 (en) * 2014-05-20 2020-07-28 Amazon Technologies, Inc. Context interpretation in natural language processing using previous dialog acts
US9703394B2 (en) * 2015-03-24 2017-07-11 Google Inc. Unlearning techniques for adaptive language models in text entry
CN108241440B (zh) * 2016-12-27 2023-02-17 Beijing Sogou Technology Development Co., Ltd. Candidate word display method and device
US10535342B2 (en) * 2017-04-10 2020-01-14 Microsoft Technology Licensing, Llc Automatic learning of language models
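CN109981328B (zh) * 2017-12-28 2022-02-25 China Mobile Communications Group Shaanxi Co., Ltd. Fault early-warning method and device
CN112508197B (zh) * 2020-11-27 2024-02-20 Gao Mingxin Control method and control device for an artificial intelligence device, and artificial intelligence device
CN117313790A (zh) * 2023-09-26 2023-12-29 Shandong New Generation Information Industry Technology Research Institute Co., Ltd. Method and system for enhancing large-model context
CN119293191A (zh) * 2024-12-09 2025-01-10 Beijing Rockwell Technology Co., Ltd. Memory-system-based interaction method, apparatus, device, storage medium, and vehicle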

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002366190A (ja) 2001-06-07 2002-12-20 Nippon Hoso Kyokai <Nhk> Statistical language model generation device and statistical language model generation program
US20050060138A1 (en) 1999-11-05 2005-03-17 Microsoft Corporation Language conversion and display

Family Cites Families (33)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW283774B (en) * 1994-12-31 1996-08-21 Lin-Shan Lii Intelligently vocal chinese input method and chinese dictation machine
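DE19708183A1 (de) * 1997-02-28 1998-09-03 Philips Patentverwaltung Method for speech recognition with language model adaptation
CN1311881A (zh) 1998-06-04 2001-09-05 Matsushita Electric Industrial Co., Ltd. Language conversion rule generation device, language conversion device, and program recording medium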
US6848080B1 (en) * 1999-11-05 2005-01-25 Microsoft Corporation Language input architecture for converting one text form to another text form with tolerance to spelling, typographical, and conversion errors
US7107204B1 (en) * 2000-04-24 2006-09-12 Microsoft Corporation Computer-aided writing system and method with cross-language writing wizard
US7013258B1 (en) * 2001-03-07 2006-03-14 Lenovo (Singapore) Pte. Ltd. System and method for accelerating Chinese text input
US7103534B2 (en) 2001-03-31 2006-09-05 Microsoft Corporation Machine learning contextual approach to word determination for text input via reduced keypad keys
JP4215418B2 (ja) * 2001-08-24 2009-01-28 International Business Machines Corporation Word prediction method, speech recognition method, and speech recognition apparatus and program using the methods
US20050043948A1 (en) * 2001-12-17 2005-02-24 Seiichi Kashihara Speech recognition method remote controller, information terminal, telephone communication terminal and speech recognizer
US20040003392A1 (en) 2002-06-26 2004-01-01 Koninklijke Philips Electronics N.V. Method and apparatus for finding and updating user group preferences in an entertainment system
TWI225640B (en) * 2002-06-28 2004-12-21 Samsung Electronics Co Ltd Voice recognition device, observation probability calculating device, complex fast fourier transform calculation device and method, cache device, and method of controlling the cache device
US20050027534A1 (en) 2003-07-30 2005-02-03 Meurs Pim Van Phonetic and stroke input methods of Chinese characters and phrases
US7542907B2 (en) * 2003-12-19 2009-06-02 International Business Machines Corporation Biasing a speech recognizer based on prompt context
US8019602B2 (en) * 2004-01-20 2011-09-13 Microsoft Corporation Automatic speech recognition learning using user corrections
US7478033B2 (en) 2004-03-16 2009-01-13 Google Inc. Systems and methods for translating Chinese pinyin to Chinese characters
US7406416B2 (en) * 2004-03-26 2008-07-29 Microsoft Corporation Representation of a deleted interpolation N-gram language model in ARPA standard format
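KR100718147B1 (ko) 2005-02-01 2007-05-14 Samsung Electronics Co., Ltd. Apparatus and method for generating a grammar network for speech recognition, and apparatus and method for conversational speech recognition using the same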
US7379870B1 (en) 2005-02-03 2008-05-27 Hrl Laboratories, Llc Contextual filtering
US8117540B2 (en) 2005-05-18 2012-02-14 Neuer Wall Treuhand Gmbh Method and device incorporating improved text input mechanism
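JP4769031B2 (ja) * 2005-06-24 2011-09-07 Microsoft Corporation Method for creating a language model, kana-kanji conversion method, apparatus therefor, computer program, and computer-readable storage medium
JP4197344B2 (ja) 2006-02-20 2008-12-17 International Business Machines Corporation Spoken dialogue system
CN101034390A (zh) 2006-03-10 2007-09-12 NEC (China) Co., Ltd. Apparatus and method for language model switching and adaptation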
US7912700B2 (en) 2007-02-08 2011-03-22 Microsoft Corporation Context based word prediction
US7809719B2 (en) 2007-02-08 2010-10-05 Microsoft Corporation Predicting textual candidates
US8028230B2 (en) 2007-02-12 2011-09-27 Google Inc. Contextual input method
JP4852448B2 (ja) * 2007-02-28 2012-01-11 Nippon Hoso Kyokai (NHK) Error-tendency-learning speech recognition device and computer program
US20090030687A1 (en) * 2007-03-07 2009-01-29 Cerra Joseph P Adapting an unstructured language model speech recognition system based on usage
CN101286094A (zh) * 2007-04-10 2008-10-15 Google Inc. Multi-mode input method editor
KR101465770B1 (ko) 2007-06-25 2014-11-27 Google Inc. Word probability determination
US8010465B2 (en) * 2008-02-26 2011-08-30 Microsoft Corporation Predicting candidates using input scopes
EP2329492A1 (en) * 2008-09-19 2011-06-08 Dolby Laboratories Licensing Corporation Upstream quality enhancement signal processing for resource constrained client devices
JP5054711B2 (ja) * 2009-01-29 2012-10-24 Nippon Hoso Kyokai (NHK) Speech recognition device and speech recognition program
US8386249B2 (en) * 2009-12-11 2013-02-26 International Business Machines Corporation Compressing feature space transforms

Also Published As

Publication number Publication date
US8798983B2 (en) 2014-08-05
WO2010117688A2 (en) 2010-10-14
KR20120018114A (ko) 2012-02-29
CN102369567B (zh) 2013-07-17
WO2010117688A3 (en) 2011-01-13
US20100250251A1 (en) 2010-09-30
TWI484476B (zh) 2015-05-11
TW201035968A (en) 2010-10-01
CN102369567A (zh) 2012-03-07
JP2012522278A (ja) 2012-09-20

Similar Documents

Publication Publication Date Title
KR101679445B1 (ko) Computer-implemented phonetic method and system
JP5901001B1 (ja) Method and device for acoustic language model training
US7953692B2 (en) Predicting candidates using information sources
CN102150156B (zh) Optimizing parameters for machine translation
US10402493B2 (en) System and method for inputting text into electronic devices
US9189472B2 (en) System and method for inputting text into small screen devices
AU2010346493B2 (en) Speech correction for typed input
CN101833547B (zh) Method for phrase-level predictive input based on a personal corpus
US9659002B2 (en) System and method for inputting text into electronic devices
JP5462001B2 (ja) Contextual input method
US20100235780A1 (en) System and Method for Identifying Words Based on a Sequence of Keyboard Events
EP2542951A2 (en) System and method for inputting text into electronic devices
JPWO2014073206A1 (ja) Information processing apparatus and information processing method
Liu et al. Building neural network language model with POS-based negative sampling and stochastic conjugate gradient descent
Heidel et al. Language model adaptation using latent dirichlet allocation and an efficient topic inference algorithm.
CN106030568A (zh) Natural language processing system, natural language processing method, and natural language processing program
US20120284016A1 (en) Text mining method, text mining device and text mining program
CN118761389B (zh) Tibetan machine translation system and method for automatic segmentation of Tibetan text
US20130110491A1 (en) Discriminative learning of feature functions of generative type in speech translation
KR20100069555A (ko) Speech recognition system and method
Singh On-Device User-Adaptive Next Word Prediction System
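JP6588933B2 (ja) Language model construction device, method therefor, and program
CN111813891A (zh) Method and device for training a language model and predicting the occurrence probability of words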
Kong et al. Research for Uyghur-Chinese Neural Machine Translation
Toselli et al. Interactive Text Generation

Legal Events

Date Code Title Description
PA0105 International application

St.27 status event code: A-0-1-A10-A15-nap-PA0105

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

P22-X000 Classification modified

St.27 status event code: A-2-2-P10-P22-nap-X000

PN2301 Change of applicant

St.27 status event code: A-3-3-R10-R13-asn-PN2301

St.27 status event code: A-3-3-R10-R11-asn-PN2301

A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

N231 Notification of change of applicant
PN2301 Change of applicant

St.27 status event code: A-3-3-R10-R13-asn-PN2301

St.27 status event code: A-3-3-R10-R11-asn-PN2301

E902 Notification of reason for refusal
PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U12-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

FPAY Annual fee payment

Payment date: 20191016

Year of fee payment: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 5

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 6

PC1903 Unpaid annual fee

St.27 status event code: A-4-4-U10-U13-oth-PC1903

Not in force date: 20221119

Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

PC1903 Unpaid annual fee

St.27 status event code: N-4-6-H10-H13-oth-PC1903

Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

Not in force date: 20221119