JP2006053906A - コンピューティングデバイスへの入力を提供するための効率的なマルチモーダル方法 - Google Patents

コンピューティングデバイスへの入力を提供するための効率的なマルチモーダル方法 Download PDF

Info

Publication number
JP2006053906A
JP2006053906A JP2005204325A JP2005204325A JP2006053906A JP 2006053906 A JP2006053906 A JP 2006053906A JP 2005204325 A JP2005204325 A JP 2005204325A JP 2005204325 A JP2005204325 A JP 2005204325A JP 2006053906 A JP2006053906 A JP 2006053906A
Authority
JP
Japan
Prior art keywords
data
computer
speech
collection
input
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2005204325A
Other languages
English (en)
Japanese (ja)
Other versions
JP2006053906A5 (enExample
Inventor
Eric I-Chao Chang
イ−チャオ チャン エリック
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Corp
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Publication of JP2006053906A publication Critical patent/JP2006053906A/ja
Publication of JP2006053906A5 publication Critical patent/JP2006053906A5/ja
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)
  • Machine Translation (AREA)
  • Communication Control (AREA)
JP2005204325A 2004-07-13 2005-07-13 コンピューティングデバイスへの入力を提供するための効率的なマルチモーダル方法 Pending JP2006053906A (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US10/889,822 US20060036438A1 (en) 2004-07-13 2004-07-13 Efficient multimodal method to provide input to a computing device

Publications (2)

Publication Number Publication Date
JP2006053906A true JP2006053906A (ja) 2006-02-23
JP2006053906A5 JP2006053906A5 (enExample) 2008-08-28

Family

ID=35094176

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2005204325A Pending JP2006053906A (ja) 2004-07-13 2005-07-13 コンピューティングデバイスへの入力を提供するための効率的なマルチモーダル方法

Country Status (7)

Country Link
US (1) US20060036438A1 (enExample)
EP (1) EP1617409B1 (enExample)
JP (1) JP2006053906A (enExample)
KR (1) KR101183340B1 (enExample)
CN (1) CN1758211A (enExample)
AT (1) ATE506674T1 (enExample)
DE (1) DE602005027522D1 (enExample)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7912699B1 (en) 2004-08-23 2011-03-22 At&T Intellectual Property Ii, L.P. System and method of lattice-based search for spoken utterance retrieval
US8065316B1 (en) * 2004-09-30 2011-11-22 Google Inc. Systems and methods for providing search query refinements
US8942985B2 (en) 2004-11-16 2015-01-27 Microsoft Corporation Centralized method and system for clarifying voice commands
US7902447B1 (en) * 2006-10-03 2011-03-08 Sony Computer Entertainment Inc. Automatic composition of sound sequences using finite state automata
US8615388B2 (en) * 2008-03-28 2013-12-24 Microsoft Corporation Intra-language statistical machine translation
WO2010014093A1 (en) * 2008-07-31 2010-02-04 Hewlett-Packard Development Company, L.P. Capturing internet content
US8589157B2 (en) * 2008-12-05 2013-11-19 Microsoft Corporation Replying to text messages via automated voice search techniques
US20100153112A1 (en) * 2008-12-16 2010-06-17 Motorola, Inc. Progressively refining a speech-based search
US8494852B2 (en) 2010-01-05 2013-07-23 Google Inc. Word-level correction of speech input
US8660847B2 (en) 2011-09-02 2014-02-25 Microsoft Corporation Integrated local and cloud based speech recognition
US8972263B2 (en) * 2011-11-18 2015-03-03 Soundhound, Inc. System and method for performing dual mode speech recognition
US9330659B2 (en) 2013-02-25 2016-05-03 Microsoft Technology Licensing, Llc Facilitating development of a spoken natural language interface
DE102013007964B4 (de) 2013-05-10 2022-08-18 Audi Ag Kraftfahrzeug-Eingabevorrichtung mit Zeichenerkennung
EP3089159B1 (en) 2015-04-28 2019-08-28 Google LLC Correcting voice recognition using selective re-speak
US10410635B2 (en) 2017-06-09 2019-09-10 Soundhound, Inc. Dual mode speech recognition

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04364523A (ja) * 1991-06-11 1992-12-16 Brother Ind Ltd 音声認識結果表示装置
JPH11272662A (ja) * 1998-03-20 1999-10-08 Sharp Corp 音声情報処理装置及び方法並びにその制御プログラムを記憶した媒体
WO2003042975A1 (en) * 2001-11-16 2003-05-22 Koninklijke Philips Electronics N.V. Device to edit a text in predefined windows
JP2003202886A (ja) * 2001-12-28 2003-07-18 Toshiba Corp テキスト入力処理装置及び方法並びにプログラム

Family Cites Families (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
US5632002A (en) * 1992-12-28 1997-05-20 Kabushiki Kaisha Toshiba Speech recognition interface system suitable for window systems and speech mail systems
US6125347A (en) * 1993-09-29 2000-09-26 L&H Applications Usa, Inc. System for controlling multiple user application programs by spoken input
WO1995025326A1 (en) * 1994-03-17 1995-09-21 Voice Powered Technology International, Inc. Voice/pointer operated system
US5642502A (en) * 1994-12-06 1997-06-24 University Of Central Florida Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text
DE69622565T2 (de) * 1995-05-26 2003-04-03 Speechworks International, Inc. Verfahren und vorrichtung zur dynamischen anpassung eines spracherkennungssystems mit grossem wortschatz und zur verwendung von einschränkungen aus einer datenbank in einem spracherkennungssystem mit grossem wortschatz
US5852801A (en) * 1995-10-04 1998-12-22 Apple Computer, Inc. Method and apparatus for automatically invoking a new word module for unrecognized user input
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5995921A (en) * 1996-04-23 1999-11-30 International Business Machines Corporation Natural language help interface
US6311182B1 (en) * 1997-11-17 2001-10-30 Genuity Inc. Voice activated web browser
US6078914A (en) * 1996-12-09 2000-06-20 Open Text Corporation Natural language meta-search system and method
US6085159A (en) * 1998-03-26 2000-07-04 International Business Machines Corporation Displaying voice commands with multiple variables
US7720682B2 (en) * 1998-12-04 2010-05-18 Tegic Communications, Inc. Method and apparatus utilizing voice input to resolve ambiguous manually entered text input
US6192343B1 (en) * 1998-12-17 2001-02-20 International Business Machines Corporation Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms
US7206747B1 (en) * 1998-12-16 2007-04-17 International Business Machines Corporation Speech command input recognition system for interactive computer display with means for concurrent and modeless distinguishing between speech commands and speech queries for locating commands
KR100310339B1 (ko) * 1998-12-30 2002-01-17 윤종용 이동전화 단말기의 음성인식 다이얼링 방법
US6591236B2 (en) * 1999-04-13 2003-07-08 International Business Machines Corporation Method and system for determining available and alternative speech commands
DE69942663D1 (de) * 1999-04-13 2010-09-23 Sony Deutschland Gmbh Zusammenfügen von Sprachschnittstellen zur gleichzeitigen Benützung von Vorrichtungen und Anwendungen
US7069220B2 (en) * 1999-08-13 2006-06-27 International Business Machines Corporation Method for determining and maintaining dialog focus in a conversational speech system
EP1158799A1 (en) * 2000-05-18 2001-11-28 Deutsche Thomson-Brandt Gmbh Method and receiver for providing subtitle data in several languages on demand
GB0015233D0 (en) * 2000-06-21 2000-08-16 Canon Kk Indexing method and apparatus
US7130790B1 (en) * 2000-10-24 2006-10-31 Global Translations, Inc. System and method for closed caption data translation
US20020094512A1 (en) * 2000-11-29 2002-07-18 International Business Machines Corporation Computer controlled speech word recognition display dictionary providing user selection to clarify indefinite detection of speech words
EP1215661A1 (en) * 2000-12-14 2002-06-19 TELEFONAKTIEBOLAGET L M ERICSSON (publ) Mobile terminal controllable by spoken utterances
US7085723B2 (en) * 2001-01-12 2006-08-01 International Business Machines Corporation System and method for determining utterance context in a multi-context speech application
US7174294B2 (en) * 2002-06-21 2007-02-06 Microsoft Corporation Speech platform architecture
US7197494B2 (en) * 2002-10-15 2007-03-27 Microsoft Corporation Method and architecture for consolidated database search for input recognition systems
JP4107093B2 (ja) * 2003-01-30 2008-06-25 株式会社日立製作所 対話型端末装置及び対話アプリケーション提供方法
US20040243415A1 (en) * 2003-06-02 2004-12-02 International Business Machines Corporation Architecture for a speech input method editor for handheld portable devices
US20050027539A1 (en) * 2003-07-30 2005-02-03 Weber Dean C. Media center controller system and method
US20050075857A1 (en) * 2003-10-02 2005-04-07 Elcock Albert F. Method and system for dynamically translating closed captions
US20050108026A1 (en) * 2003-11-14 2005-05-19 Arnaud Brierre Personalized subtitle system
CN1697515A (zh) * 2004-05-14 2005-11-16 创新科技有限公司 字幕翻译引擎
US20060136195A1 (en) * 2004-12-22 2006-06-22 International Business Machines Corporation Text grouping for disambiguation in a speech application

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH04364523A (ja) * 1991-06-11 1992-12-16 Brother Ind Ltd 音声認識結果表示装置
JPH11272662A (ja) * 1998-03-20 1999-10-08 Sharp Corp 音声情報処理装置及び方法並びにその制御プログラムを記憶した媒体
WO2003042975A1 (en) * 2001-11-16 2003-05-22 Koninklijke Philips Electronics N.V. Device to edit a text in predefined windows
JP2003202886A (ja) * 2001-12-28 2003-07-18 Toshiba Corp テキスト入力処理装置及び方法並びにプログラム

Also Published As

Publication number Publication date
CN1758211A (zh) 2006-04-12
DE602005027522D1 (de) 2011-06-01
KR20060050139A (ko) 2006-05-19
US20060036438A1 (en) 2006-02-16
KR101183340B1 (ko) 2012-09-14
EP1617409B1 (en) 2011-04-20
ATE506674T1 (de) 2011-05-15
EP1617409A1 (en) 2006-01-18

Similar Documents

Publication Publication Date Title
US11016968B1 (en) Mutation architecture for contextual data aggregator
US10909969B2 (en) Generation of language understanding systems and methods
US10037758B2 (en) Device and method for understanding user intent
CN108847241B (zh) 将会议语音识别为文本的方法、电子设备及存储介质
TWI437449B (zh) 多重模式輸入方法及輸入方法編輯器系統
JP3962763B2 (ja) 対話支援装置
CN111710333B (zh) 用于生成语音转录的方法和系统
JP4829901B2 (ja) マニュアルでエントリされた不確定なテキスト入力を音声入力を使用して確定する方法および装置
TWI266280B (en) Multimodal disambiguation of speech recognition
US7912700B2 (en) Context based word prediction
US6910012B2 (en) Method and system for speech recognition using phonetically similar word alternatives
CN100568223C (zh) 用于表意语言的多模式输入的方法和设备
JP5703491B2 (ja) 言語モデル・音声認識辞書作成装置及びそれらにより作成された言語モデル・音声認識辞書を用いた情報処理装置
CN101415259A (zh) 嵌入式设备上基于双语语音查询的信息检索系统及方法
JP2002116796A (ja) 音声処理装置、音声処理方法及び記憶媒体
JP2006053906A (ja) コンピューティングデバイスへの入力を提供するための効率的なマルチモーダル方法
WO2016008128A1 (en) Speech recognition using foreign word grammar
TW200538969A (en) Handwriting and voice input with automatic correction
JP5231484B2 (ja) 音声認識装置、音声認識方法、プログラム、及びプログラムを配信する情報処理装置
US7860707B2 (en) Compound word splitting for directory assistance services
CN112560493B (zh) 命名实体纠错方法、装置、计算机设备和存储介质
JP4839291B2 (ja) 音声認識装置およびコンピュータプログラム
CN112017647A (zh) 一种结合语义的语音识别方法、装置和系统
JP5293607B2 (ja) 略語生成装置およびプログラム、並びに、略語生成方法
JP5008248B2 (ja) 表示処理装置、表示処理方法、表示処理プログラム、および記録媒体

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20080714

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20080714

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20110426

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20110726

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20110819

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20111118

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20120106