JP4667138B2 - Speech recognition method and speech recognition apparatus - Google Patents
Speech recognition method and speech recognition apparatus
- Publication number
- JP4667138B2 (application JP2005191538A)
- Authority
- JP
- Japan
- Prior art keywords
- item
- speech recognition
- displayed
- grammar
- recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/226—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics
- G10L2015/228—Procedures used during a speech recognition process, e.g. man-machine dialogue using non-speech characteristics of application context
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- User Interface Of Digital Computer (AREA)
- Digital Computer Display Output (AREA)
- Mobile Radio Communication Systems (AREA)
Description
Claims (6)
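- An information processing method for setting data in each of a plurality of items, comprising:
a recognition step of recognizing received speech information using a speech recognition grammar corresponding to items that are not displayed, when the instruction given by an instruction means for instructing the start of speech recognition is an instruction to enable items not displayed on the display screen; and
a setting step of making a setting for the item using the result recognized in the recognition step.
- The information processing method according to claim 1, wherein the speech recognition grammar corresponding to the item that is not displayed is more restricted than the speech recognition grammar used when that item is displayed.
- The information processing method according to claim 2, wherein the recognition step recognizes the received speech information using a speech recognition grammar corresponding to displayed items when the instruction given by the instruction means for instructing the start of speech recognition is an instruction to enable items displayed on the display screen.
- The information processing method according to claim 3, wherein the instruction means for instructing the start of speech recognition are buttons, comprising at least two buttons: a button that enables items displayed on the display screen and a button that enables items not displayed on the display screen.
- A control program for causing a computer to execute the information processing method according to any one of claims 1 to 4.
- An information processing apparatus for setting data in each of a plurality of items, comprising:
detection means for detecting items that are not displayed on the display screen;
recognition means for recognizing received speech information using a speech recognition grammar corresponding to the non-displayed items detected by the detection means, when the instruction given by an instruction means for instructing the start of speech recognition is an instruction to enable items not displayed on the display screen; and
setting means for making a setting for the item using the result recognized by the recognition means.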
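The flow described in the claims can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Item` class, the `build_grammar` and `recognize_and_set` functions, and the button names `"displayed"` and `"hidden"` are all hypothetical, and speech recognition is simulated by matching a text transcript against the phrases of a grammar. The claims only state that the grammar for non-displayed items is more restricted; the sketch assumes one possible restriction, namely that the item name must be spoken together with the value.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Item:
    """A settable item; `displayed` says whether it is currently on screen."""
    name: str
    values: tuple
    displayed: bool
    setting: Optional[str] = None


def build_grammar(item: Item, restricted: bool) -> dict:
    """Map each accepted phrase to the value it sets.

    Assumption: the restricted grammar accepts only "<item name> <value>",
    while the unrestricted grammar also accepts the bare value.
    """
    grammar = {f"{item.name} {v}": v for v in item.values}
    if not restricted:
        grammar.update({v: v for v in item.values})
    return grammar


def recognize_and_set(items, button: str, utterance: str) -> Optional[Item]:
    """Pick grammars according to the pressed button, then set the matched item.

    `button` is "displayed" (enable on-screen items) or "hidden"
    (enable items that are not on screen). Returns the item that was
    set, or None when the utterance is outside the active grammars.
    """
    if button not in ("displayed", "hidden"):
        raise ValueError(f"unknown button: {button!r}")
    want_hidden = button == "hidden"
    phrase = utterance.strip().lower()
    for item in items:
        # Detection step: keep only items whose visibility matches the button.
        if item.displayed == want_hidden:
            continue
        grammar = build_grammar(item, restricted=want_hidden)
        value = grammar.get(phrase)
        if value is not None:
            item.setting = value  # setting step
            return item
    return None


items = [
    Item("paper", ("a4", "a3"), displayed=True),
    Item("copies", ("1", "2"), displayed=False),
]
recognize_and_set(items, "displayed", "a4")       # sets paper to "a4"
recognize_and_set(items, "hidden", "2")           # rejected: bare value not in restricted grammar
recognize_and_set(items, "hidden", "copies 2")    # sets copies to "2"
```

Keeping the grammar for off-screen items narrower means fewer phrases compete during recognition when the user cannot see what is being set, which is the trade-off the dependent claim on restricted grammars points at.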
Priority Applications (7)
Application Number | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|
JP2005191538A | JP4667138B2 (ja) | 2005-06-30 | 2005-06-30 | Speech recognition method and speech recognition apparatus |
US11/472,908 | US7668719B2 (en) | 2005-06-30 | 2006-06-22 | Speech recognition method and speech recognition apparatus |
EP06253332A | EP1739656B1 (en) | 2005-06-30 | 2006-06-27 | Speech recognition method and speech recognition apparatus |
DE602006007062T | DE602006007062D1 (de) | 2005-06-30 | 2006-06-27 | Apparatus and method for speech recognition |
AT06253332T | ATE433180T1 (de) | 2005-06-30 | 2006-06-27 | Apparatus and method for speech recognition |
KR1020060059540A | KR100815731B1 (ko) | 2005-06-30 | 2006-06-29 | Speech recognition method and speech recognition apparatus |
CN2006100907781A | CN1892819B (zh) | 2005-06-30 | 2006-06-30 | Speech recognition method and speech recognition device |
Applications Claiming Priority (1)
Application Number | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|
JP2005191538A | JP4667138B2 (ja) | 2005-06-30 | 2005-06-30 | Speech recognition method and speech recognition apparatus |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2007010971A (ja) | 2007-01-18 |
JP4667138B2 (ja) | 2011-04-06 |
Family
ID=37067634
Family Applications (1)
Application Number | Status | Publication | Priority Date | Filing Date | Title |
---|---|---|---|---|---|
JP2005191538A | Expired - Fee Related | JP4667138B2 (ja) | 2005-06-30 | 2005-06-30 | Speech recognition method and speech recognition apparatus |
Country Status (7)
Country | Link |
---|---|
US (1) | US7668719B2 (ja) |
EP (1) | EP1739656B1 (ja) |
JP (1) | JP4667138B2 (ja) |
KR (1) | KR100815731B1 (ja) |
CN (1) | CN1892819B (ja) |
AT (1) | ATE433180T1 (ja) |
DE (1) | DE602006007062D1 (ja) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP4878471B2 (ja) * | 2005-11-02 | 2012-02-15 | キヤノン株式会社 | Information processing apparatus and control method therefor |
US7822608B2 (en) * | 2007-02-27 | 2010-10-26 | Nuance Communications, Inc. | Disambiguating a speech recognition grammar in a multimodal application |
WO2008136081A1 (ja) * | 2007-04-20 | 2008-11-13 | Mitsubishi Electric Corporation | User interface device and user interface design device |
US8306810B2 (en) * | 2008-02-12 | 2012-11-06 | Ezsav Inc. | Systems and methods to enable interactivity among a plurality of devices |
US9519353B2 (en) * | 2009-03-30 | 2016-12-13 | Symbol Technologies, Llc | Combined speech and touch input for observation symbol mappings |
KR101597289B1 (ko) * | 2009-07-31 | 2016-03-08 | 삼성전자주식회사 | Apparatus and method for recognizing speech according to a dynamic screen |
DE102009059792A1 (de) * | 2009-12-21 | 2011-06-22 | Continental Automotive GmbH, 30165 | Method and device for operating technical equipment, in particular a motor vehicle |
KR101207435B1 (ko) | 2012-07-09 | 2012-12-04 | 다이알로이드(주) | Interactive speech recognition server, interactive speech recognition client, and interactive speech recognition method |
CN103204100B (zh) * | 2013-04-08 | 2015-08-05 | 浙江海联电子股份有限公司 | Voice control system for a taxi roof light |
US9430186B2 (en) * | 2014-03-17 | 2016-08-30 | Google Inc | Visual indication of a recognized voice-initiated action |
CN106098066B (zh) * | 2016-06-02 | 2020-01-17 | 深圳市智物联网络有限公司 | Speech recognition method and device |
US10515625B1 (en) | 2017-08-31 | 2019-12-24 | Amazon Technologies, Inc. | Multi-modal natural language processing |
KR102640327B1 (ko) | 2018-10-19 | 2024-02-22 | 삼성에스디아이 주식회사 | Large battery module |
CN110569017A (zh) * | 2019-09-12 | 2019-12-13 | 四川长虹电器股份有限公司 | Voice-based text input method |
US11967306B2 (en) | 2021-04-14 | 2024-04-23 | Honeywell International Inc. | Contextual speech recognition methods and systems |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH10222337A (ja) * | 1997-02-13 | 1998-08-21 | Meidensha Corp | Computer system |
JP2001042890A (ja) * | 1999-07-30 | 2001-02-16 | Toshiba Tec Corp | Speech recognition device |
JP2003157095A (ja) * | 2001-11-22 | 2003-05-30 | Canon Inc | Speech recognition apparatus, method therefor, and program |
JP2004219728A (ja) * | 2003-01-15 | 2004-08-05 | Matsushita Electric Ind Co Ltd | Speech recognition device |
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5386494A (en) * | 1991-12-06 | 1995-01-31 | Apple Computer, Inc. | Method and apparatus for controlling a speech recognition function using a cursor control device |
JP3286339B2 (ja) * | 1992-03-25 | 2002-05-27 | 株式会社リコー | Window screen control device |
US5890122A (en) | 1993-02-08 | 1999-03-30 | Microsoft Corporation | Voice-controlled computer simulateously displaying application menu and list of available commands |
CA2115210C (en) * | 1993-04-21 | 1997-09-23 | Joseph C. Andreshak | Interactive computer system recognizing spoken commands |
US5897618A (en) * | 1997-03-10 | 1999-04-27 | International Business Machines Corporation | Data processing system and method for switching between programs having a same title using a voice command |
US6182046B1 (en) | 1998-03-26 | 2001-01-30 | International Business Machines Corp. | Managing voice commands in speech applications |
US6499013B1 (en) * | 1998-09-09 | 2002-12-24 | One Voice Technologies, Inc. | Interactive user interface using speech recognition and natural language processing |
WO2000021232A2 (en) | 1998-10-02 | 2000-04-13 | International Business Machines Corporation | Conversational browser and conversational systems |
US8275617B1 (en) * | 1998-12-17 | 2012-09-25 | Nuance Communications, Inc. | Speech command input recognition system for interactive computer display with interpretation of ancillary relevant speech query terms into commands |
JP2000268046A (ja) | 1999-03-17 | 2000-09-29 | Sharp Corp | Information processing device |
JP2002062213A (ja) * | 2000-08-22 | 2002-02-28 | Airec Engineering Corp | Optical fiber wetness sensor and wetness measuring device using the sensor |
JP3774698B2 (ja) | 2000-10-11 | 2006-05-17 | キヤノン株式会社 | Information processing apparatus, information processing method, and storage medium |
CN1156751C (zh) * | 2001-02-02 | 2004-07-07 | 国际商业机器公司 | Method and system for automatically generating voice XML files |
JP4056711B2 (ja) * | 2001-03-19 | 2008-03-05 | 日産自動車株式会社 | Speech recognition device |
JP2003202890A (ja) | 2001-12-28 | 2003-07-18 | Canon Inc | Speech recognition apparatus, method therefor, and program |
KR100567828B1 (ko) | 2003-08-06 | 2006-04-05 | 삼성전자주식회사 | Improved speech recognition apparatus and method |
-
2005
- 2005-06-30 JP JP2005191538A patent/JP4667138B2/ja not_active Expired - Fee Related
-
2006
- 2006-06-22 US US11/472,908 patent/US7668719B2/en not_active Expired - Fee Related
- 2006-06-27 EP EP06253332A patent/EP1739656B1/en not_active Not-in-force
- 2006-06-27 DE DE602006007062T patent/DE602006007062D1/de not_active Expired - Fee Related
- 2006-06-27 AT AT06253332T patent/ATE433180T1/de not_active IP Right Cessation
- 2006-06-29 KR KR1020060059540A patent/KR100815731B1/ko not_active IP Right Cessation
- 2006-06-30 CN CN2006100907781A patent/CN1892819B/zh not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
EP1739656A3 (en) | 2007-02-28 |
CN1892819A (zh) | 2007-01-10 |
DE602006007062D1 (de) | 2009-07-16 |
JP2007010971A (ja) | 2007-01-18 |
KR20070003640A (ko) | 2007-01-05 |
CN1892819B (zh) | 2010-04-21 |
US7668719B2 (en) | 2010-02-23 |
ATE433180T1 (de) | 2009-06-15 |
EP1739656B1 (en) | 2009-06-03 |
US20070005371A1 (en) | 2007-01-04 |
EP1739656A2 (en) | 2007-01-03 |
KR100815731B1 (ko) | 2008-03-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP4667138B2 (ja) | | Speech recognition method and speech recognition apparatus |
JP4416643B2 (ja) | | Multimodal input method |
JP3728304B2 (ja) | | Information processing method, information processing apparatus, program, and storage medium |
JP4878471B2 (ja) | | Information processing apparatus and control method therefor |
US7330868B2 (en) | | Data input apparatus and method |
JP4574390B2 (ja) | | Speech recognition method |
US20120123781A1 (en) | | Touch screen device for allowing blind people to operate objects displayed thereon and object operating method in the touch screen device |
EP2017828A1 (en) | | Techniques for disambiguating speech input using multimodal interfaces |
JP2006515073A (ja) | | Method, system, and programming for performing speech recognition |
JP2007171809A (ja) | | Information processing apparatus and information processing method |
JP2020003925A (ja) | | Control method for a dialogue system, dialogue system, and program |
EP3540565A1 (en) | | Control method for translation device, translation device, and program |
JP2008145693A (ja) | | Information processing apparatus and information processing method |
JP2005525603A (ja) | | Voice commands and speech recognition for handheld devices |
JP3813132B2 (ja) | | Presentation program and presentation apparatus |
JP2008051883A (ja) | | Speech synthesis control method and apparatus |
US7970617B2 (en) | | Image processing apparatus and image processing method with speech registration |
JP4702081B2 (ja) | | Character input device |
US7761731B2 (en) | | Information processing apparatus and information processing method |
JP2006235040A (ja) | | Image forming apparatus, program, and recording medium |
JP2005182168A (ja) | | Content processing apparatus, content processing method, content processing program, and recording medium |
WO2018185716A1 (en) | | Method and device for proofreading text |
JP2023118279A (ja) | | Reading position notification device for electronic text |
JP2014127040A (ja) | | Information processing apparatus, information processing method, and program |
JP2020118872A (ja) | | Information input system and method |
Legal Events
Code | Title | Description |
---|---|---|
A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20080624 |
RD04 | Notification of resignation of power of attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7424 Effective date: 20100201 |
RD01 | Notification of change of attorney | Free format text: JAPANESE INTERMEDIATE CODE: A7421 Effective date: 20100630 |
A977 | Report on retrieval | Free format text: JAPANESE INTERMEDIATE CODE: A971007 Effective date: 20100929 |
A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20101019 |
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20101213 |
TRDD | Decision of grant or rejection written | |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20110105 |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 |
A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20110111 |
FPAY | Renewal fee payment (event date is renewal date of database) | Free format text: PAYMENT UNTIL: 20140121 Year of fee payment: 3 |
R150 | Certificate of patent or registration of utility model | Free format text: JAPANESE INTERMEDIATE CODE: R150 |
LAPS | Cancellation because of no payment of annual fees | |