JP2017515141A - 言語モデルカスタマイズのためのフレキシブルスキーマ - Google Patents
言語モデルカスタマイズのためのフレキシブルスキーマ Download PDFInfo
- Publication number
- JP2017515141A JP2017515141A JP2016559328A JP2016559328A JP2017515141A JP 2017515141 A JP2017515141 A JP 2017515141A JP 2016559328 A JP2016559328 A JP 2016559328A JP 2016559328 A JP2016559328 A JP 2016559328A JP 2017515141 A JP2017515141 A JP 2017515141A
- Authority
- JP
- Japan
- Prior art keywords
- hint
- language modeling
- modeling components
- list
- domains
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 claims description 21
- 238000004891 communication Methods 0.000 description 13
- 238000010586 diagram Methods 0.000 description 13
- 230000006870 function Effects 0.000 description 7
- 230000008569 process Effects 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 4
- 238000012545 processing Methods 0.000 description 4
- 238000013500 data storage Methods 0.000 description 2
- 230000005055 memory storage Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009471 action Effects 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000004883 computer application Methods 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000003032 molecular docking Methods 0.000 description 1
- 230000006855 networking Effects 0.000 description 1
- 230000002085 persistent effect Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
- 239000000758 substrate Substances 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/02—Digital computers in general; Data processing equipment in general manually operated with input through keyboard and computation using a built-in program, e.g. pocket calculators
- G06F15/0225—User interface arrangements, e.g. keyboard, display; Interfaces to other computer systems
- G06F15/0233—User interface arrangements, e.g. keyboard, display; Interfaces to other computer systems with printing provisions
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/183—Speech classification or search using natural language modelling using context dependencies, e.g. language models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/268—Morphological analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/30—Semantic analysis
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Acoustics & Sound (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Computer Hardware Design (AREA)
- Computing Systems (AREA)
- Information Transfer Between Computers (AREA)
- User Interface Of Digital Computer (AREA)
- Stored Programmes (AREA)
- Machine Translation (AREA)
Abstract
Description
Claims (10)
- 言語モデリングコンポーネントをカスタマイズする方法であって:
コンピューティングデバイスが、言語モデリングコンポーネントのリストを提示するステップ;
前記コンピューティングデバイスが、前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信するステップであって、前記ヒントは複数のドメインのうちの1つ以上に基づく、ステップ;及び
前記コンピューティングデバイスが、前記ヒントに基づく前記複数の言語モデリングコンポーネントのカスタマイズされた組み合わせを受信するステップ;
を有する方法。 - 前記複数の言語モデリングコンポーネントのうちの1つ以上と前記ヒントとの間のコネクションを維持するステップを更に有する請求項1に記載の方法。
- 前記コンピューティングデバイスが、前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信することが、前記複数のドメインのうちの1つ以上に基づいて予めコンパイルされた言語モデルの選択肢を送信することを含む、請求項1に記載の方法。
- 前記コンピューティングデバイスが、前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信することが、前記複数のドメインのうちの1つ以上に基づいて、前記複数の言語モデリングコンポーネントについての固定ウェイトの組み合わせの選択肢を送信することを含む、請求項1に記載の方法。
- 言語モデリングコンポーネントをカスタマイズするシステムであって:
実行可能なプログラムコードを保存するメモリ;及び
前記メモリに機能的に結合されるプロセッサ;
を有し、前記プロセッサは、前記プログラムコードに含まれるコンピュータ実行可能な命令に応じて動作を行い、前記動作は:
言語モデリングコンポーネントのリストを提示するステップ;
前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信するステップであって、前記ヒントは複数のドメインのうちの1つ以上に基づく、ステップ;
前記ヒントに基づく前記複数の言語モデリングコンポーネントのカスタマイズされた組み合わせを受信するステップ;及び
前記複数の言語モデリングコンポーネントのうちの1つ以上と前記ヒントとの間のコネクションを維持するステップ;
を有することを特徴とするシステム。 - 前記プロセッサは、前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信する際に、前記複数のドメインのうちの1つ以上に基づいて予めコンパイルされた言語モデルの選択肢を送信するように動作する、請求項5に記載のシステム。
- 前記プロセッサは、前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信する際に、前記複数のドメインのうちの1つ以上に基づいて、前記複数の言語モデリングコンポーネントについての固定ウェイトの組み合わせの選択肢を送信するように動作する、請求項5に記載のシステム。
- コンピュータ実行可能な命令を保存するコンピュータ読み取り可能な記録媒体であって、前記命令は、コンピュータにより実行されると、言語モデリングコンポーネントをカスタマイズする方法をコンピュータに実行させ、前記方法は:
言語モデリングコンポーネントのリストを提示するステップ;
前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信するステップであって、前記ヒントは複数のドメインのうちの1つ以上に基づいて、前記複数のドメインのうちの1つ以上は、音声サーチドメイン及びショートメッセージ口述ドメインのうちの1つ以上を含む、ステップ;
前記ヒントに基づく前記複数の言語モデリングコンポーネントのカスタマイズされた組み合わせを受信するステップ;及び
前記複数の言語モデリングコンポーネントのうちの1つ以上と前記ヒントとの間のコネクションを維持するステップ;
を有することを特徴とする記録媒体。 - 前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信することが、前記複数のドメインのうちの1つ以上に基づいて予めコンパイルされた言語モデルの選択肢を送信することを含む、請求項8に記載の記録媒体。
- 前記リストのうちの複数の言語モデリングコンポーネントを組み合わせるためのヒントを送信することが、前記複数のドメインのうちの1つ以上に基づいて、前記複数の言語モデリングコンポーネントについての固定ウェイトの組み合わせの選択肢を送信することを含む、請求項8に記載の記録媒体。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US14/227,492 | 2014-03-27 | ||
US14/227,492 US9529794B2 (en) | 2014-03-27 | 2014-03-27 | Flexible schema for language model customization |
PCT/US2015/021921 WO2015148333A1 (en) | 2014-03-27 | 2015-03-23 | Flexible schema for language model customization |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2017515141A true JP2017515141A (ja) | 2017-06-08 |
JP2017515141A5 JP2017515141A5 (ja) | 2018-04-05 |
JP6571106B2 JP6571106B2 (ja) | 2019-09-04 |
Family
ID=53039568
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2016559328A Active JP6571106B2 (ja) | 2014-03-27 | 2015-03-23 | 言語モデルカスタマイズのための方法、システム、コンピュータプログラム及び記憶媒体 |
Country Status (10)
Country | Link |
---|---|
US (2) | US9529794B2 (ja) |
EP (1) | EP3123467B1 (ja) |
JP (1) | JP6571106B2 (ja) |
KR (1) | KR102315104B1 (ja) |
CN (1) | CN106133826B (ja) |
AU (1) | AU2015236417B2 (ja) |
CA (1) | CA2940430C (ja) |
MX (2) | MX2016012195A (ja) |
RU (1) | RU2689203C2 (ja) |
WO (1) | WO2015148333A1 (ja) |
Families Citing this family (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8181205B2 (en) | 2002-09-24 | 2012-05-15 | Russ Samuel H | PVR channel and PVR IPG information |
US9728184B2 (en) | 2013-06-18 | 2017-08-08 | Microsoft Technology Licensing, Llc | Restructuring deep neural network acoustic models |
US9311298B2 (en) | 2013-06-21 | 2016-04-12 | Microsoft Technology Licensing, Llc | Building conversational understanding systems using a toolset |
US9589565B2 (en) | 2013-06-21 | 2017-03-07 | Microsoft Technology Licensing, Llc | Environmentally aware dialog policies and response generation |
CN104281626B (zh) * | 2013-07-12 | 2018-01-19 | 阿里巴巴集团控股有限公司 | 基于图片化处理的网页展示方法及网页展示装置 |
US9324321B2 (en) | 2014-03-07 | 2016-04-26 | Microsoft Technology Licensing, Llc | Low-footprint adaptation and personalization for a deep neural network |
US9529794B2 (en) | 2014-03-27 | 2016-12-27 | Microsoft Technology Licensing, Llc | Flexible schema for language model customization |
US9614724B2 (en) | 2014-04-21 | 2017-04-04 | Microsoft Technology Licensing, Llc | Session-based device configuration |
US9520127B2 (en) | 2014-04-29 | 2016-12-13 | Microsoft Technology Licensing, Llc | Shared hidden layer combination for speech recognition systems |
US9384335B2 (en) | 2014-05-12 | 2016-07-05 | Microsoft Technology Licensing, Llc | Content delivery prioritization in managed wireless distribution networks |
US10111099B2 (en) | 2014-05-12 | 2018-10-23 | Microsoft Technology Licensing, Llc | Distributing content in managed wireless distribution networks |
US9874914B2 (en) | 2014-05-19 | 2018-01-23 | Microsoft Technology Licensing, Llc | Power management contracts for accessory devices |
US10037202B2 (en) | 2014-06-03 | 2018-07-31 | Microsoft Technology Licensing, Llc | Techniques to isolating a portion of an online computing service |
US9367490B2 (en) | 2014-06-13 | 2016-06-14 | Microsoft Technology Licensing, Llc | Reversible connector for accessory devices |
US9717006B2 (en) | 2014-06-23 | 2017-07-25 | Microsoft Technology Licensing, Llc | Device quarantine in a wireless network |
CN110111780B (zh) * | 2018-01-31 | 2023-04-25 | 阿里巴巴集团控股有限公司 | 数据处理方法和服务器 |
US11182565B2 (en) | 2018-02-23 | 2021-11-23 | Samsung Electronics Co., Ltd. | Method to learn personalized intents |
US11314940B2 (en) | 2018-05-22 | 2022-04-26 | Samsung Electronics Co., Ltd. | Cross domain personalized vocabulary learning in intelligent assistants |
CN110908667B (zh) * | 2019-11-18 | 2021-11-16 | 北京迈格威科技有限公司 | 神经网络联合编译的方法、装置和电子设备 |
CN111161739B (zh) * | 2019-12-28 | 2023-01-17 | 科大讯飞股份有限公司 | 语音识别方法及相关产品 |
KR20240076977A (ko) * | 2022-11-24 | 2024-05-31 | 고려대학교 산학협력단 | 개체 유형 및 관계 정보에 대한 프롬프트 및 빈칸 추론을 이용한 대화 관계 추출 방법 및 장치 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002091477A (ja) * | 2000-09-14 | 2002-03-27 | Mitsubishi Electric Corp | 音声認識システム、音声認識装置、音響モデル管理サーバ、言語モデル管理サーバ、音声認識方法及び音声認識プログラムを記録したコンピュータ読み取り可能な記録媒体 |
JP2003280683A (ja) * | 2002-03-20 | 2003-10-02 | Toshiba Corp | 音声認識装置、音声認識装置における音声認識制御方法、音声処理に関する辞書管理装置 |
JP2005266192A (ja) * | 2004-03-18 | 2005-09-29 | Matsushita Electric Ind Co Ltd | 音声認識装置および音声認識方法 |
JP2007264128A (ja) * | 2006-03-27 | 2007-10-11 | Toshiba Corp | 音声認識装置及びその方法 |
JP2009075582A (ja) * | 2007-08-29 | 2009-04-09 | Advanced Media Inc | 端末装置、言語モデル作成装置、および分散型音声認識システム |
JP2009230068A (ja) * | 2008-03-25 | 2009-10-08 | Denso Corp | 音声認識装置及びナビゲーションシステム |
Family Cites Families (130)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2631864B2 (ja) | 1988-06-13 | 1997-07-16 | 大成建設株式会社 | 偏平トンネルの施工方法 |
US5170499A (en) | 1989-03-06 | 1992-12-08 | Motorola, Inc. | Method and apparatus for adjusting the volume level of a radio |
DE69126983T2 (de) | 1991-08-19 | 1998-03-05 | Lernout & Hauspie Speechprod | Einrichtung zur mustererkennung mit einem kuenstlichen neuronalen netzwerk fuer kontextabhaengige modellierung |
US5233681A (en) | 1992-04-24 | 1993-08-03 | International Business Machines Corporation | Context-dependent speech recognizer using estimated next word context |
US6405132B1 (en) | 1997-10-22 | 2002-06-11 | Intelligent Technologies International, Inc. | Accident avoidance system |
US6167377A (en) | 1997-03-28 | 2000-12-26 | Dragon Systems, Inc. | Speech recognition language models |
KR100241901B1 (ko) * | 1997-08-28 | 2000-02-01 | 윤종용 | 핸드셋과 핸즈프리킷 공용 음성인식기의 등록 엔트리 관리방법 |
ITTO980383A1 (it) | 1998-05-07 | 1999-11-07 | Cselt Centro Studi Lab Telecom | Procedimento e dispositivo di riconoscimento vocale con doppio passo di riconoscimento neurale e markoviano. |
US20050091057A1 (en) | 1999-04-12 | 2005-04-28 | General Magic, Inc. | Voice application development methodology |
US6647270B1 (en) | 1999-09-10 | 2003-11-11 | Richard B. Himmelstein | Vehicletalk |
US7392185B2 (en) | 1999-11-12 | 2008-06-24 | Phoenix Solutions, Inc. | Speech based learning/training system using semantic decoding |
US6263308B1 (en) | 2000-03-20 | 2001-07-17 | Microsoft Corporation | Methods and apparatus for performing speech recognition using acoustic models which are improved through an interactive process |
US7788602B2 (en) | 2000-06-06 | 2010-08-31 | Microsoft Corporation | Method and system for providing restricted actions for recognized semantic categories |
ATE261137T1 (de) | 2000-06-29 | 2004-03-15 | Aspen Technology Inc | Rechnerverfahren und gerät zur beschränkung einer nicht-linearen gleichungsnäherung eines empirischen prozesses |
US6807536B2 (en) | 2000-11-16 | 2004-10-19 | Microsoft Corporation | Methods and systems for computing singular value decompositions of matrices and low rank approximations of matrices |
US6622136B2 (en) | 2001-02-16 | 2003-09-16 | Motorola, Inc. | Interactive tool for semi-automatic creation of a domain model |
US20050234727A1 (en) | 2001-07-03 | 2005-10-20 | Leo Chiu | Method and apparatus for adapting a voice extensible markup language-enabled voice system for natural speech recognition and system response |
US6970947B2 (en) | 2001-07-18 | 2005-11-29 | International Business Machines Corporation | Method and apparatus for providing a flexible and scalable context service |
US20030149566A1 (en) | 2002-01-02 | 2003-08-07 | Esther Levin | System and method for a spoken language interface to a large database of changing records |
US7006972B2 (en) | 2002-03-20 | 2006-02-28 | Microsoft Corporation | Generating a task-adapted acoustic model from one or more different corpora |
US7191119B2 (en) | 2002-05-07 | 2007-03-13 | International Business Machines Corporation | Integrated development tool for building a natural language understanding application |
US7548847B2 (en) | 2002-05-10 | 2009-06-16 | Microsoft Corporation | System for automatically annotating training data for a natural language understanding system |
US7398209B2 (en) | 2002-06-03 | 2008-07-08 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7756531B2 (en) | 2002-09-04 | 2010-07-13 | Farhad John Aminzadeh | Method and apparatus for avoiding call disturbances and facilitating urgent calls based on a caller's decision |
US7274741B2 (en) * | 2002-11-01 | 2007-09-25 | Microsoft Corporation | Systems and methods for generating a comprehensive user attention model |
JP2004227468A (ja) | 2003-01-27 | 2004-08-12 | Canon Inc | 情報提供装置、情報提供方法 |
US20040176083A1 (en) | 2003-02-25 | 2004-09-09 | Motorola, Inc. | Method and system for reducing distractions of mobile device users |
US7366655B1 (en) | 2003-04-02 | 2008-04-29 | At&T Corp. | Method of generating a labeling guide for spoken dialog services |
US7835910B1 (en) | 2003-05-29 | 2010-11-16 | At&T Intellectual Property Ii, L.P. | Exploiting unlabeled utterances for spoken language understanding |
CA2473195C (en) * | 2003-07-29 | 2014-02-04 | Microsoft Corporation | Head mounted multi-sensory audio input system |
EP1654728A1 (en) | 2003-08-01 | 2006-05-10 | Philips Intellectual Property & Standards GmbH | Method for driving a dialog system |
US20050065789A1 (en) | 2003-09-23 | 2005-03-24 | Sherif Yacoub | System and method with automated speech recognition engines |
US7774196B2 (en) | 2003-10-01 | 2010-08-10 | Dictaphone Corporation | System and method for modifying a language model and post-processor information |
JP2005157494A (ja) | 2003-11-20 | 2005-06-16 | Aruze Corp | 会話制御装置及び会話制御方法 |
WO2005050621A2 (en) | 2003-11-21 | 2005-06-02 | Philips Intellectual Property & Standards Gmbh | Topic specific models for text formatting and speech recognition |
CN100539763C (zh) | 2003-11-27 | 2009-09-09 | 国际商业机器公司 | 控制来自移动车辆的无线通信的方法 |
US8412521B2 (en) | 2004-08-20 | 2013-04-02 | Multimodal Technologies, Llc | Discriminative training of document transcription system |
US7693713B2 (en) | 2005-06-17 | 2010-04-06 | Microsoft Corporation | Speech models generated using competitive training, asymmetric training, and data boosting |
US7640160B2 (en) | 2005-08-05 | 2009-12-29 | Voicebox Technologies, Inc. | Systems and methods for responding to natural language speech utterance |
US7620549B2 (en) | 2005-08-10 | 2009-11-17 | Voicebox Technologies, Inc. | System and method of supporting adaptive misrecognition in conversational speech |
US8321220B1 (en) | 2005-11-30 | 2012-11-27 | At&T Intellectual Property Ii, L.P. | System and method of semi-supervised learning for spoken language understanding using semantic role labeling |
US20070128979A1 (en) | 2005-12-07 | 2007-06-07 | J. Shackelford Associates Llc. | Interactive Hi-Tech doll |
US7835911B2 (en) | 2005-12-30 | 2010-11-16 | Nuance Communications, Inc. | Method and system for automatically building natural language understanding models |
US7603330B2 (en) | 2006-02-01 | 2009-10-13 | Honda Motor Co., Ltd. | Meta learning for question classification |
DE102006006551B4 (de) | 2006-02-13 | 2008-09-11 | Siemens Ag | Verfahren und System zum Bereitstellen von Sprachdialoganwendungen sowie mobiles Endgerät |
IL174522A0 (en) | 2006-03-23 | 2006-08-01 | Jonathan Agmon | Method for predictive typing |
US7627536B2 (en) | 2006-06-13 | 2009-12-01 | Microsoft Corporation | Dynamic interaction menus from natural language representations |
US7716049B2 (en) | 2006-06-30 | 2010-05-11 | Nokia Corporation | Method, apparatus and computer program product for providing adaptive language model scaling |
US9318108B2 (en) | 2010-01-18 | 2016-04-19 | Apple Inc. | Intelligent automated assistant |
WO2008081543A1 (ja) | 2006-12-28 | 2008-07-10 | Fujitsu Limited | 携帯端末装置、その通話制御プログラム、その通話制御プログラムを格納した記録媒体、及びその通話制御方法 |
US7912700B2 (en) | 2007-02-08 | 2011-03-22 | Microsoft Corporation | Context based word prediction |
TW200836893A (en) | 2007-03-01 | 2008-09-16 | Benq Corp | Interactive home entertainment robot and method of controlling the same |
US8838457B2 (en) * | 2007-03-07 | 2014-09-16 | Vlingo Corporation | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
US20090030697A1 (en) * | 2007-03-07 | 2009-01-29 | Cerra Joseph P | Using contextual information for delivering results generated from a speech recognition facility using an unstructured language model |
US20070150428A1 (en) | 2007-03-20 | 2007-06-28 | Brandyn Webb | Inference engine for discovering features and making predictions using generalized incremental singular value decomposition |
JP2008233678A (ja) | 2007-03-22 | 2008-10-02 | Honda Motor Co Ltd | 音声対話装置、音声対話方法、及び音声対話用プログラム |
US8301757B2 (en) | 2007-06-11 | 2012-10-30 | Enghouse Interactive Inc. | System and method for obtaining in-use statistics for voice applications in interactive voice response systems |
US8275615B2 (en) | 2007-07-13 | 2012-09-25 | International Business Machines Corporation | Model weighting, selection and hypotheses combination for automatic speech recognition and machine translation |
CN101415039A (zh) | 2007-10-17 | 2009-04-22 | 宏达国际电子股份有限公司 | 通话管理方法 |
US8229729B2 (en) | 2008-03-25 | 2012-07-24 | International Business Machines Corporation | Machine translation in continuous space |
US8332394B2 (en) | 2008-05-23 | 2012-12-11 | International Business Machines Corporation | System and method for providing question and answers with deferred type evaluation |
US8364481B2 (en) | 2008-07-02 | 2013-01-29 | Google Inc. | Speech recognition with parallel recognition tasks |
US8412529B2 (en) | 2008-10-29 | 2013-04-02 | Verizon Patent And Licensing Inc. | Method and system for enhancing verbal communication sessions |
US20100114890A1 (en) | 2008-10-31 | 2010-05-06 | Purediscovery Corporation | System and Method for Discovering Latent Relationships in Data |
JP5475795B2 (ja) * | 2008-11-05 | 2014-04-16 | グーグル・インコーポレーテッド | カスタム言語モデル |
RU2509350C2 (ru) | 2008-11-07 | 2014-03-10 | Матрокс Профешнл Инк | Способ семантической обработки естественного языка с использованием графического языка-посредника |
US20100128863A1 (en) | 2008-11-21 | 2010-05-27 | Robert Bosch Gmbh | Context aware voice communication proxy |
US8447608B1 (en) * | 2008-12-10 | 2013-05-21 | Adobe Systems Incorporated | Custom language models for audio content |
CA2751557A1 (en) | 2009-02-16 | 2010-08-19 | Comverse, Ltd. | Context-aware communications |
US8930179B2 (en) | 2009-06-04 | 2015-01-06 | Microsoft Corporation | Recognition using re-recognition and statistical classification |
US9177557B2 (en) | 2009-07-07 | 2015-11-03 | General Motors Llc. | Singular value decomposition for improved voice recognition in presence of multi-talker background noise |
US8886641B2 (en) * | 2009-10-15 | 2014-11-11 | Yahoo! Inc. | Incorporating recency in network search using machine learning |
US8571866B2 (en) | 2009-10-23 | 2013-10-29 | At&T Intellectual Property I, L.P. | System and method for improving speech recognition accuracy using textual context |
KR101622111B1 (ko) | 2009-12-11 | 2016-05-18 | 삼성전자 주식회사 | 대화 시스템 및 그의 대화 방법 |
US8315597B2 (en) | 2009-12-21 | 2012-11-20 | Julia Olincy | “I am driving/busy” automatic response system for mobile phones |
US8249627B2 (en) | 2009-12-21 | 2012-08-21 | Julia Olincy | “I am driving/busy” automatic response system for mobile phones |
EP2339576B1 (en) | 2009-12-23 | 2019-08-07 | Google LLC | Multi-modal input on an electronic device |
US8400332B2 (en) | 2010-02-09 | 2013-03-19 | Ford Global Technologies, Llc | Emotive advisory system including time agent |
JP2012038239A (ja) | 2010-08-11 | 2012-02-23 | Sony Corp | 情報処理装置、情報処理方法、及び、プログラム |
US8972253B2 (en) | 2010-09-15 | 2015-03-03 | Microsoft Technology Licensing, Llc | Deep belief network for large vocabulary continuous speech recognition |
FR2965377A1 (fr) * | 2010-09-24 | 2012-03-30 | Univ D Avignon Et Des Pays De Vaucluse | Procede de classification de donnees biometriques |
JP2012075047A (ja) | 2010-09-29 | 2012-04-12 | Toshiba Corp | Ip交換システム及びip交換装置 |
US8812321B2 (en) | 2010-09-30 | 2014-08-19 | At&T Intellectual Property I, L.P. | System and method for combining speech recognition outputs from a plurality of domain-specific speech recognizers via machine learning |
JP5704692B2 (ja) * | 2010-11-30 | 2015-04-22 | 独立行政法人情報通信研究機構 | パターン分類装置の学習装置及びそのためのコンピュータプログラム |
US8352245B1 (en) | 2010-12-30 | 2013-01-08 | Google Inc. | Adjusting language models |
JP5861649B2 (ja) | 2011-02-03 | 2016-02-16 | 日本電気株式会社 | モデル適応化装置、モデル適応化方法およびモデル適応化用プログラム |
US9081760B2 (en) * | 2011-03-08 | 2015-07-14 | At&T Intellectual Property I, L.P. | System and method for building diverse language models |
US9679561B2 (en) * | 2011-03-28 | 2017-06-13 | Nuance Communications, Inc. | System and method for rapid customization of speech recognition models |
US10642934B2 (en) | 2011-03-31 | 2020-05-05 | Microsoft Technology Licensing, Llc | Augmented conversational understanding architecture |
US8489529B2 (en) | 2011-03-31 | 2013-07-16 | Microsoft Corporation | Deep convex network with joint use of nonlinear random projection, Restricted Boltzmann Machine and batch-based parallelizable optimization |
EP2691885A4 (en) | 2011-03-31 | 2015-09-30 | Microsoft Technology Licensing Llc | INCREASED CONVERSATIONAL UNDERSTANDING ARCHITECTURE |
US9244984B2 (en) | 2011-03-31 | 2016-01-26 | Microsoft Technology Licensing, Llc | Location based conversational understanding |
US8260615B1 (en) | 2011-04-25 | 2012-09-04 | Google Inc. | Cross-lingual initialization of language models |
US20120290293A1 (en) | 2011-05-13 | 2012-11-15 | Microsoft Corporation | Exploiting Query Click Logs for Domain Detection in Spoken Language Understanding |
US8918352B2 (en) | 2011-05-23 | 2014-12-23 | Microsoft Corporation | Learning processes for single hidden layer neural networks with linear output units |
US20130031476A1 (en) | 2011-07-25 | 2013-01-31 | Coin Emmett | Voice activated virtual assistant |
KR20130022513A (ko) | 2011-08-24 | 2013-03-07 | 한국전자통신연구원 | 결합 쌍일차 변환 공간 기반의 화자 적응 방법 및 장치 |
JP5698203B2 (ja) | 2011-09-30 | 2015-04-08 | アップル インコーポレイテッド | バーチャルアシスタントのコマンド処理を容易にするためのコンテクスト情報の使用 |
US8698621B2 (en) | 2011-11-22 | 2014-04-15 | Verizon Patent And Licensing Inc. | Method and system for providing notifications of a mobile device in motion to determine call treatment |
US9235799B2 (en) * | 2011-11-26 | 2016-01-12 | Microsoft Technology Licensing, Llc | Discriminative pretraining of deep neural networks |
US9082402B2 (en) | 2011-12-08 | 2015-07-14 | Sri International | Generic virtual personal assistant platform |
US9324323B1 (en) * | 2012-01-13 | 2016-04-26 | Google Inc. | Speech recognition using topic-specific language models |
US9263040B2 (en) | 2012-01-17 | 2016-02-16 | GM Global Technology Operations LLC | Method and system for using sound related vehicle information to enhance speech recognition |
JP2012128440A (ja) | 2012-02-06 | 2012-07-05 | Denso Corp | 音声対話装置 |
CN102609264A (zh) | 2012-02-14 | 2012-07-25 | 深圳市同洲视讯传媒有限公司 | 一种调用应用程序编程接口生成调用代码的方法及装置 |
US9524730B2 (en) * | 2012-03-30 | 2016-12-20 | Ohio State Innovation Foundation | Monaural speech filter |
US8346563B1 (en) | 2012-04-10 | 2013-01-01 | Artificial Solutions Ltd. | System and methods for delivering advanced natural language interaction applications |
GB201208373D0 (en) | 2012-05-14 | 2012-06-27 | Touchtype Ltd | Mechanism for synchronising devices,system and method |
US8600525B1 (en) | 2012-05-31 | 2013-12-03 | Honeywell Asca Inc. | Efficient quadratic programming (QP) solver for process control and optimization |
US9053708B2 (en) | 2012-07-18 | 2015-06-09 | International Business Machines Corporation | System, method and program product for providing automatic speech recognition (ASR) in a shared resource environment |
US9424840B1 (en) | 2012-08-31 | 2016-08-23 | Amazon Technologies, Inc. | Speech recognition platforms |
US8527276B1 (en) * | 2012-10-25 | 2013-09-03 | Google Inc. | Speech synthesis using deep neural networks |
US10282419B2 (en) | 2012-12-12 | 2019-05-07 | Nuance Communications, Inc. | Multi-domain natural language processing architecture |
KR101559124B1 (ko) | 2013-02-28 | 2015-10-12 | 한양대학교 산학협력단 | 리튬황전지용 양극, 이를 포함하는 리튬황전지 및 이의 제조 방법 |
US9177550B2 (en) | 2013-03-06 | 2015-11-03 | Microsoft Technology Licensing, Llc | Conservatively adapting a deep neural network in a recognition system |
US9728184B2 (en) | 2013-06-18 | 2017-08-08 | Microsoft Technology Licensing, Llc | Restructuring deep neural network acoustic models |
US9311298B2 (en) | 2013-06-21 | 2016-04-12 | Microsoft Technology Licensing, Llc | Building conversational understanding systems using a toolset |
US9589565B2 (en) | 2013-06-21 | 2017-03-07 | Microsoft Technology Licensing, Llc | Environmentally aware dialog policies and response generation |
CN103456299B (zh) * | 2013-08-01 | 2016-06-15 | 百度在线网络技术(北京)有限公司 | 一种控制语音识别的方法和装置 |
CN103400577B (zh) * | 2013-08-01 | 2015-09-16 | 百度在线网络技术(北京)有限公司 | 多语种语音识别的声学模型建立方法和装置 |
US9280968B2 (en) | 2013-10-04 | 2016-03-08 | At&T Intellectual Property I, L.P. | System and method of using neural transforms of robust audio features for speech processing |
US9721561B2 (en) | 2013-12-05 | 2017-08-01 | Nuance Communications, Inc. | Method and apparatus for speech recognition using neural networks with speaker adaptation |
US9373324B2 (en) | 2013-12-06 | 2016-06-21 | International Business Machines Corporation | Applying speaker adaption techniques to correlated features |
US9400955B2 (en) | 2013-12-13 | 2016-07-26 | Amazon Technologies, Inc. | Reducing dynamic range of low-rank decomposition matrices |
KR101937655B1 (ko) | 2013-12-31 | 2019-01-11 | 코오롱인더스트리 주식회사 | 복합 중공사막 및 그 제조방법 |
US10339920B2 (en) | 2014-03-04 | 2019-07-02 | Amazon Technologies, Inc. | Predicting pronunciation in speech recognition |
US9324321B2 (en) | 2014-03-07 | 2016-04-26 | Microsoft Technology Licensing, Llc | Low-footprint adaptation and personalization for a deep neural network |
US9529794B2 (en) | 2014-03-27 | 2016-12-27 | Microsoft Technology Licensing, Llc | Flexible schema for language model customization |
US9520127B2 (en) | 2014-04-29 | 2016-12-13 | Microsoft Technology Licensing, Llc | Shared hidden layer combination for speech recognition systems |
US20150325236A1 (en) | 2014-05-08 | 2015-11-12 | Microsoft Corporation | Context specific language model scale factors |
-
2014
- 2014-03-27 US US14/227,492 patent/US9529794B2/en active Active
-
2015
- 2015-03-23 AU AU2015236417A patent/AU2015236417B2/en active Active
- 2015-03-23 MX MX2016012195A patent/MX2016012195A/es unknown
- 2015-03-23 EP EP15719880.5A patent/EP3123467B1/en active Active
- 2015-03-23 WO PCT/US2015/021921 patent/WO2015148333A1/en active Application Filing
- 2015-03-23 KR KR1020167026586A patent/KR102315104B1/ko active IP Right Grant
- 2015-03-23 CA CA2940430A patent/CA2940430C/en active Active
- 2015-03-23 RU RU2016138130A patent/RU2689203C2/ru active
- 2015-03-23 JP JP2016559328A patent/JP6571106B2/ja active Active
- 2015-03-23 CN CN201580016605.2A patent/CN106133826B/zh active Active
-
2016
- 2016-09-20 MX MX2021008012A patent/MX2021008012A/es unknown
- 2016-12-22 US US15/389,088 patent/US10497367B2/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2002091477A (ja) * | 2000-09-14 | 2002-03-27 | Mitsubishi Electric Corp | 音声認識システム、音声認識装置、音響モデル管理サーバ、言語モデル管理サーバ、音声認識方法及び音声認識プログラムを記録したコンピュータ読み取り可能な記録媒体 |
JP2003280683A (ja) * | 2002-03-20 | 2003-10-02 | Toshiba Corp | 音声認識装置、音声認識装置における音声認識制御方法、音声処理に関する辞書管理装置 |
JP2005266192A (ja) * | 2004-03-18 | 2005-09-29 | Matsushita Electric Ind Co Ltd | 音声認識装置および音声認識方法 |
JP2007264128A (ja) * | 2006-03-27 | 2007-10-11 | Toshiba Corp | 音声認識装置及びその方法 |
JP2009075582A (ja) * | 2007-08-29 | 2009-04-09 | Advanced Media Inc | 端末装置、言語モデル作成装置、および分散型音声認識システム |
JP2009230068A (ja) * | 2008-03-25 | 2009-10-08 | Denso Corp | 音声認識装置及びナビゲーションシステム |
Also Published As
Publication number | Publication date |
---|---|
CA2940430A1 (en) | 2015-10-01 |
RU2689203C2 (ru) | 2019-05-24 |
EP3123467B1 (en) | 2019-09-11 |
CN106133826B (zh) | 2019-12-17 |
WO2015148333A1 (en) | 2015-10-01 |
RU2016138130A (ru) | 2018-04-27 |
US9529794B2 (en) | 2016-12-27 |
MX2016012195A (es) | 2017-01-05 |
US10497367B2 (en) | 2019-12-03 |
JP6571106B2 (ja) | 2019-09-04 |
MX2021008012A (es) | 2021-08-05 |
AU2015236417A1 (en) | 2016-09-08 |
RU2016138130A3 (ja) | 2018-10-19 |
US20170103753A1 (en) | 2017-04-13 |
US20150278191A1 (en) | 2015-10-01 |
KR20160138424A (ko) | 2016-12-05 |
KR102315104B1 (ko) | 2021-10-19 |
CN106133826A (zh) | 2016-11-16 |
AU2015236417B2 (en) | 2019-12-19 |
EP3123467A1 (en) | 2017-02-01 |
CA2940430C (en) | 2022-05-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP6571106B2 (ja) | 言語モデルカスタマイズのための方法、システム、コンピュータプログラム及び記憶媒体 | |
US9324321B2 (en) | Low-footprint adaptation and personalization for a deep neural network | |
US9942358B2 (en) | Recommending applications | |
CN107636613B (zh) | 到第三方应用的数字助理可扩展性 | |
RU2667717C2 (ru) | Диалоговые политики на основе параметров окружающей среды и генерация ответа | |
US20150325236A1 (en) | Context specific language model scale factors | |
US20130110992A1 (en) | Electronic device management using interdomain profile-based inferences | |
US9171099B2 (en) | System and method for providing calculation web services for online documents | |
US20170300090A1 (en) | Accommodating sensors and touch in a unified experience | |
WO2018085123A1 (en) | Contextual canvases for a collaborative workspace environment | |
US11301345B2 (en) | Desktop sound source discovery | |
US10805358B2 (en) | Universal casting service | |
US10404765B2 (en) | Re-homing embedded web content via cross-iframe signaling | |
KR102368945B1 (ko) | 외부 콘텐츠 아이템과의 인코딩된 연관을 제공하는 기법 | |
CN113646057A (zh) | 用于增强游戏体验的跨设备附件输入和输出 | |
KR101532909B1 (ko) | 메신저 서비스의 첨부파일 관리 방법, 이를 위한 시스템 및 이를 위한 단말 장치 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20180220 |
|
A621 | Written request for application examination |
Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20180220 |
|
A977 | Report on retrieval |
Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20181206 |
|
A131 | Notification of reasons for refusal |
Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20181218 |
|
A521 | Request for written amendment filed |
Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20190312 |
|
TRDD | Decision of grant or rejection written | ||
A01 | Written decision to grant a patent or to grant a registration (utility model) |
Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20190709 |
|
A61 | First payment of annual fees (during grant procedure) |
Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20190807 |
|
R150 | Certificate of patent or registration of utility model |
Ref document number: 6571106 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |
|
R250 | Receipt of annual fees |
Free format text: JAPANESE INTERMEDIATE CODE: R250 |