KR101183340B1 - 컴퓨팅 장치에 입력을 제공하기 위한 효율적인 멀티모달방법 - Google Patents

컴퓨팅 장치에 입력을 제공하기 위한 효율적인 멀티모달방법 Download PDF

Info

Publication number
KR101183340B1
KR101183340B1 KR1020050063343A KR20050063343A KR101183340B1 KR 101183340 B1 KR101183340 B1 KR 101183340B1 KR 1020050063343 A KR1020050063343 A KR 1020050063343A KR 20050063343 A KR20050063343 A KR 20050063343A KR 101183340 B1 KR101183340 B1 KR 101183340B1
Authority
KR
South Korea
Prior art keywords
data
computer
speech
input
phrases
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
KR1020050063343A
Other languages
English (en)
Korean (ko)
Other versions
KR20060050139A (ko
Inventor
에릭 아이차오 창
Original Assignee
마이크로소프트 코포레이션
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 마이크로소프트 코포레이션 filed Critical 마이크로소프트 코포레이션
Publication of KR20060050139A publication Critical patent/KR20060050139A/ko
Application granted granted Critical
Publication of KR101183340B1 publication Critical patent/KR101183340B1/ko
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)
  • Communication Control (AREA)
  • Machine Translation (AREA)
KR1020050063343A 2004-07-13 2005-07-13 컴퓨팅 장치에 입력을 제공하기 위한 효율적인 멀티모달방법 Expired - Fee Related KR101183340B1 (ko)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/889,822 US20060036438A1 (en) 2004-07-13 2004-07-13 Efficient multimodal method to provide input to a computing device
US10/889,822 2004-07-13

Publications (2)

Publication Number Publication Date
KR20060050139A KR20060050139A (ko) 2006-05-19
KR101183340B1 true KR101183340B1 (ko) 2012-09-14

Family

ID=35094176

Family Applications (1)

Application Number Title Priority Date Filing Date
KR1020050063343A Expired - Fee Related KR101183340B1 (ko) 2004-07-13 2005-07-13 컴퓨팅 장치에 입력을 제공하기 위한 효율적인 멀티모달방법

Country Status (7)

Country Link
US (1) US20060036438A1 (enExample)
EP (1) EP1617409B1 (enExample)
JP (1) JP2006053906A (enExample)
KR (1) KR101183340B1 (enExample)
CN (1) CN1758211A (enExample)
AT (1) ATE506674T1 (enExample)
DE (1) DE602005027522D1 (enExample)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7912699B1 (en) * 2004-08-23 2011-03-22 At&T Intellectual Property Ii, L.P. System and method of lattice-based search for spoken utterance retrieval
US8065316B1 (en) * 2004-09-30 2011-11-22 Google Inc. Systems and methods for providing search query refinements
US8942985B2 (en) 2004-11-16 2015-01-27 Microsoft Corporation Centralized method and system for clarifying voice commands
US7902447B1 (en) * 2006-10-03 2011-03-08 Sony Computer Entertainment Inc. Automatic composition of sound sequences using finite state automata
US8615388B2 (en) * 2008-03-28 2013-12-24 Microsoft Corporation Intra-language statistical machine translation
CN102113004B (zh) * 2008-07-31 2014-03-12 惠普开发有限公司 采集因特网内容
US8589157B2 (en) * 2008-12-05 2013-11-19 Microsoft Corporation Replying to text messages via automated voice search techniques
US20100153112A1 (en) * 2008-12-16 2010-06-17 Motorola, Inc. Progressively refining a speech-based search
US8494852B2 (en) 2010-01-05 2013-07-23 Google Inc. Word-level correction of speech input
US8660847B2 (en) 2011-09-02 2014-02-25 Microsoft Corporation Integrated local and cloud based speech recognition
US8972263B2 (en) 2011-11-18 2015-03-03 Soundhound, Inc. System and method for performing dual mode speech recognition
US9330659B2 (en) 2013-02-25 2016-05-03 Microsoft Technology Licensing, Llc Facilitating development of a spoken natural language interface
DE102013007964B4 (de) 2013-05-10 2022-08-18 Audi Ag Kraftfahrzeug-Eingabevorrichtung mit Zeichenerkennung
EP3089159B1 (en) 2015-04-28 2019-08-28 Google LLC Correcting voice recognition using selective re-speak
US10410635B2 (en) 2017-06-09 2019-09-10 Soundhound, Inc. Dual mode speech recognition

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020052870A1 (en) * 2000-06-21 2002-05-02 Charlesworth Jason Peter Andrew Indexing method and apparatus
US20020091511A1 (en) * 2000-12-14 2002-07-11 Karl Hellwig Mobile terminal controllable by spoken utterances

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3106550B2 (ja) * 1991-06-11 2000-11-06 ブラザー工業株式会社 音声認識結果表示装置
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
EP0607615B1 (en) * 1992-12-28 1999-09-15 Kabushiki Kaisha Toshiba Speech recognition interface system suitable for window systems and speech mail systems
US6125347A (en) * 1993-09-29 2000-09-26 L&H Applications Usa, Inc. System for controlling multiple user application programs by spoken input
WO1995025326A1 (en) * 1994-03-17 1995-09-21 Voice Powered Technology International, Inc. Voice/pointer operated system
US5642502A (en) * 1994-12-06 1997-06-24 University Of Central Florida Method and system for searching for relevant documents from a text database collection, using statistical ranking, relevancy feedback and small pieces of text
AU5738296A (en) * 1995-05-26 1996-12-11 Applied Language Technologies Method and apparatus for dynamic adaptation of a large vocab ulary speech recognition system and for use of constraints f rom a database in a large vocabulary speech recognition syst em
US5852801A (en) * 1995-10-04 1998-12-22 Apple Computer, Inc. Method and apparatus for automatically invoking a new word module for unrecognized user input
US5799276A (en) * 1995-11-07 1998-08-25 Accent Incorporated Knowledge-based speech recognition system and methods having frame length computed based upon estimated pitch period of vocalic intervals
US5995921A (en) * 1996-04-23 1999-11-30 International Business Machines Corporation Natural language help interface
US6311182B1 (en) * 1997-11-17 2001-10-30 Genuity Inc. Voice activated web browser
US6078914A (en) * 1996-12-09 2000-06-20 Open Text Corporation Natural language meta-search system and method
JPH11272662A (ja) * 1998-03-20 1999-10-08 Sharp Corp 音声情報処理装置及び方法並びにその制御プログラムを記憶した媒体
US6085159A (en) * 1998-03-26 2000-07-04 International Business Machines Corporation Displaying voice commands with multiple variables
US7720682B2 (en) * 1998-12-04 2010-05-18 Tegic Communications, Inc. Method and apparatus utilizing voice input to resolve ambiguous manually entered text input
US7206747B1 (en) * 1998-12-16 2007-04-17 International Business Machines Corporation Speech command input recognition system for interactive computer display with means for concurrent and modeless distinguishing between speech commands and speech queries for locating commands
US6192343B1 (en) * 1998-12-17 2001-02-20 International Business Machines Corporation Speech command input recognition system for interactive computer display with term weighting means used in interpreting potential commands from relevant speech terms
KR100310339B1 (ko) * 1998-12-30 2002-01-17 윤종용 이동전화 단말기의 음성인식 다이얼링 방법
DE69942663D1 (de) * 1999-04-13 2010-09-23 Sony Deutschland Gmbh Zusammenfügen von Sprachschnittstellen zur gleichzeitigen Benützung von Vorrichtungen und Anwendungen
US6591236B2 (en) * 1999-04-13 2003-07-08 International Business Machines Corporation Method and system for determining available and alternative speech commands
US7069220B2 (en) * 1999-08-13 2006-06-27 International Business Machines Corporation Method for determining and maintaining dialog focus in a conversational speech system
EP1158799A1 (en) * 2000-05-18 2001-11-28 Deutsche Thomson-Brandt Gmbh Method and receiver for providing subtitle data in several languages on demand
US7130790B1 (en) * 2000-10-24 2006-10-31 Global Translations, Inc. System and method for closed caption data translation
US20020094512A1 (en) * 2000-11-29 2002-07-18 International Business Machines Corporation Computer controlled speech word recognition display dictionary providing user selection to clarify indefinite detection of speech words
US7085723B2 (en) * 2001-01-12 2006-08-01 International Business Machines Corporation System and method for determining utterance context in a multi-context speech application
WO2003042975A1 (en) * 2001-11-16 2003-05-22 Koninklijke Philips Electronics N.V. Device to edit a text in predefined windows
JP3762300B2 (ja) * 2001-12-28 2006-04-05 株式会社東芝 テキスト入力処理装置及び方法並びにプログラム
US7174294B2 (en) * 2002-06-21 2007-02-06 Microsoft Corporation Speech platform architecture
US7197494B2 (en) * 2002-10-15 2007-03-27 Microsoft Corporation Method and architecture for consolidated database search for input recognition systems
JP4107093B2 (ja) * 2003-01-30 2008-06-25 株式会社日立製作所 対話型端末装置及び対話アプリケーション提供方法
US20040243415A1 (en) * 2003-06-02 2004-12-02 International Business Machines Corporation Architecture for a speech input method editor for handheld portable devices
US20050027539A1 (en) * 2003-07-30 2005-02-03 Weber Dean C. Media center controller system and method
US20050075857A1 (en) * 2003-10-02 2005-04-07 Elcock Albert F. Method and system for dynamically translating closed captions
US20050108026A1 (en) * 2003-11-14 2005-05-19 Arnaud Brierre Personalized subtitle system
CN1697515A (zh) * 2004-05-14 2005-11-16 创新科技有限公司 字幕翻译引擎
US20060136195A1 (en) * 2004-12-22 2006-06-22 International Business Machines Corporation Text grouping for disambiguation in a speech application

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020052870A1 (en) * 2000-06-21 2002-05-02 Charlesworth Jason Peter Andrew Indexing method and apparatus
US20020091511A1 (en) * 2000-12-14 2002-07-11 Karl Hellwig Mobile terminal controllable by spoken utterances

Also Published As

Publication number Publication date
EP1617409A1 (en) 2006-01-18
CN1758211A (zh) 2006-04-12
KR20060050139A (ko) 2006-05-19
DE602005027522D1 (de) 2011-06-01
EP1617409B1 (en) 2011-04-20
JP2006053906A (ja) 2006-02-23
US20060036438A1 (en) 2006-02-16
ATE506674T1 (de) 2011-05-15

Similar Documents

Publication Publication Date Title
US7286978B2 (en) Creating a language model for a language processing system
KR101183340B1 (ko) 컴퓨팅 장치에 입력을 제공하기 위한 효율적인 멀티모달방법
KR101532447B1 (ko) 아시아어 문자를 생성하는 인식 아키텍처
EP2477186B1 (en) Information retrieving apparatus, information retrieving method and navigation system
JP4987682B2 (ja) 音声チャットシステム、情報処理装置、音声認識方法およびプログラム
US20090112593A1 (en) System for recognizing speech for searching a database
JP2004005600A (ja) データベースに格納された文書をインデックス付け及び検索する方法及びシステム
WO2008124368A1 (en) Method and apparatus for distributed voice searching
CN101636732A (zh) 用于语言独立语音索引和搜索的方法和装置
CN101415259A (zh) 嵌入式设备上基于双语语音查询的信息检索系统及方法
CN100592385C (zh) 用于对多语言的姓名进行语音识别的方法和系统
JP2004133880A (ja) インデックス付き文書のデータベースとで使用される音声認識器のための動的語彙を構成する方法
US7401019B2 (en) Phonetic fragment search in speech data
US20070016420A1 (en) Dictionary lookup for mobile devices using spelling recognition
CN111489742A (zh) 声学模型训练方法、语音识别方法、装置及电子设备
JP4839291B2 (ja) 音声認識装置およびコンピュータプログラム
JP2009163358A (ja) 情報処理装置、情報処理方法、プログラムおよび音声チャットシステム
JP5293607B2 (ja) 略語生成装置およびプログラム、並びに、略語生成方法
JP2005084829A (ja) 情報検索方法、情報検索装置、情報検索プログラム、及びプログラム記録媒体
Hahn et al. Optimizing CRFs for SLU tasks in various languages using modified training criteria.
JP5583230B2 (ja) 情報検索装置及び情報検索方法
KR101132469B1 (ko) 초성 자동완성 및 그에 따른 검색 결과 제공 방법 및시스템
KR100910302B1 (ko) 멀티모달 기반의 정보 검색 장치 및 방법
US20250045509A1 (en) Entity name audio-to-text translation
JP4622861B2 (ja) 音声入力システム、音声入力方法、および、音声入力用プログラム

Legal Events

Date Code Title Description
PA0109 Patent application

St.27 status event code: A-0-1-A10-A12-nap-PA0109

PG1501 Laying open of application

St.27 status event code: A-1-1-Q10-Q12-nap-PG1501

A201 Request for examination
P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

PA0201 Request for examination

St.27 status event code: A-1-2-D10-D11-exm-PA0201

R17-X000 Change to representative recorded

St.27 status event code: A-3-3-R10-R17-oth-X000

PE0902 Notice of grounds for rejection

St.27 status event code: A-1-2-D10-D21-exm-PE0902

E13-X000 Pre-grant limitation requested

St.27 status event code: A-2-3-E10-E13-lim-X000

P11-X000 Amendment of application requested

St.27 status event code: A-2-2-P10-P11-nap-X000

P13-X000 Application amended

St.27 status event code: A-2-2-P10-P13-nap-X000

E701 Decision to grant or registration of patent right
PE0701 Decision of registration

St.27 status event code: A-1-2-D10-D22-exm-PE0701

GRNT Written decision to grant
PR0701 Registration of establishment

St.27 status event code: A-2-4-F10-F11-exm-PR0701

PR1002 Payment of registration fee

St.27 status event code: A-2-2-U10-U11-oth-PR1002

Fee payment year number: 1

PG1601 Publication of registration

St.27 status event code: A-4-4-Q10-Q13-nap-PG1601

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R13-asn-PN2301

St.27 status event code: A-5-5-R10-R11-asn-PN2301

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R11-asn-PN2301

PN2301 Change of applicant

St.27 status event code: A-5-5-R10-R14-asn-PN2301

FPAY Annual fee payment

Payment date: 20150819

Year of fee payment: 4

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 4

FPAY Annual fee payment

Payment date: 20160818

Year of fee payment: 5

PR1001 Payment of annual fee

St.27 status event code: A-4-4-U10-U11-oth-PR1001

Fee payment year number: 5

LAPS Lapse due to unpaid annual fee
PC1903 Unpaid annual fee

St.27 status event code: A-4-4-U10-U13-oth-PC1903

Not in force date: 20170911

Payment event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

R18-X000 Changes to party contact information recorded

St.27 status event code: A-5-5-R10-R18-oth-X000

PC1903 Unpaid annual fee

St.27 status event code: N-4-6-H10-H13-oth-PC1903

Ip right cessation event data comment text: Termination Category : DEFAULT_OF_REGISTRATION_FEE

Not in force date: 20170911