JP2004294945A

JP2004294945A - Speech recognition apparatus

Info

Publication number: JP2004294945A
Application number: JP2003089699A
Authority: JP
Inventors: Yutaka Numajiri; 裕沼尻
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 2003-03-28
Filing date: 2003-03-28
Publication date: 2004-10-21

Abstract

<P>PROBLEM TO BE SOLVED: To provide a speech recognition apparatus that can be made to start speech recognition without any special operation and is suitably applied to mobile equipment. <P>SOLUTION: The speech recognition apparatus is equipped with a main body 1, a microphone 3 which is built in the main body 1 and picks up sound, a speech recognition part 9 for recognizing the sound picked up by the microphone 3, an angle sensor 6 which detects a tilt of the main body 1, and a control part 7 which makes the speech recognition part 9 start speech recognizing operation according to the detection result of the angle sensor 6. <P>COPYRIGHT: (C)2005,JPO&NCIPI

Description

【０００１】
【発明の属する技術分野】
本発明は、音声認識装置に関する。
【０００２】
【従来の技術】
従来から様々な音声認識装置が開発されているが、音声認識の開始は所定のキー操作をきっかけに行うものが一般的であった。例えば特許文献１では、カメラを用い、操作者が特定の動作を行ったことを検知して音声認識を開始するものが開示されている。しかしながら携帯型機器にて使用するには不適当なものであった。また、特許文献２では、発話者の顔に光を投光し、発話者の顔がマイクの方向を向いている期間だけ音声認識処理を行うものであるが、やはり携帯型機器にて使用するには不適当なものであった。
【０００３】
【特許文献１】特開２０００−３３８９９５号公報
【特許文献２】特開２０００−１８７４９９号公報
【０００４】
【発明が解決しようとする課題】
以上述べたように、従来、音声認識の開始は所定のキー操作をきっかけに行ったり、特殊な動作を行うことでなされていたが、操作が面倒であったり特殊な動作を必要としており、特に携帯型機器に搭載するには不適当であるという問題があった。
【０００５】
本発明はこのような問題点を解決するためになされたもので、特別な操作を行うことなしに音声認識動作を開始させることが出来、携帯型機器に適用して好適な音声認識装置を提供することを目的とする。
【０００６】
【課題を解決するための手段】
請求項１にかかる音声認識装置は、本体と、この本体に内蔵され音声を集音する手段と、この集音手段にて集音された音声を認識するための音声認識手段と、前記本体の傾きを検出する手段と、この傾き検出手段の検出結果に基づき前記音声認識手段における音声認識動作を開始させる制御手段とを具備したことを特徴とするものである。
【０００７】
請求項２にかかる音声認識装置は、請求項１記載の音声認識装置において、前記制御手段は前記傾き検出手段により前記本体が特定の角度範囲に入ったことを検出して前記音声認識手段における音声認識動作を開始させるようにしたことを特徴とするものである。
【０００８】
請求項３にかかる音声認識装置は、請求項１乃至２記載の音声認識装置において、前記本体は薄箱状であり、前記集音手段は前記本体の端部に内蔵されていることを特徴とするものである。
【０００９】
請求項４にかかる音声認識装置は、請求項１乃至３記載の音声認識装置において、本体に設けられたスイッチを具備し、前記制御手段は、このスイッチのＯＮ及び前記傾き検出手段の検出結果に基づき前記音声認識手段における音声認識動作を開始させるよう構成されたことを特徴とするものである。
【００１０】
請求項５にかかる音声認識装置は、本体と、この本体に内蔵され音声を集音する手段と、この集音手段にて集音された音声を認識するための音声認識手段と、前記本体の傾きを検出する第１の検出手段と、前記本体がユーザにより保持されている場合に左右どちらの手で保持されているかを検出する第２の検出手段と、前記第１の検出手段の検出結果及び第２の検出手段の検出結果に基づき前記音声認識手段における音声認識動作を開始させる制御手段とを具備したことを特徴とするものである。
【００１１】
請求項６にかかる音声認識装置は、請求項５記載の音声認識装置において、前記制御手段は前記第２の検出手段での検出結果が右手保持か左手保持かに応じて、前記第１の検出手段による傾き検出範囲を異ならせるようにしたことを特徴とするものである。
【００１２】
請求項７にかかる音声認識装置は、請求項５乃至６記載の音声認識装置において、前記本体は薄箱状であり、前記集音手段は前記本体の端部に内蔵されていることを特徴とするものである。
【００１３】
請求項８にかかる音声認識装置は、請求項７記載の音声認識装置において、前記第２の検出手段は薄箱状の本体の一面及びこの面と対向する面のそれぞれに設けられた圧力センサを具備したことを特徴とするものである。
【００１４】
請求項９にかかる音声認識装置は、請求項５乃至８記載の音声認識装置において、本体に設けられたスイッチを具備し、前記制御手段は、このスイッチのＯＮ及び前記傾き検出手段の検出結果に基づき前記音声認識手段における音声認識動作を開始させるように構成されたことを特徴とするものである。
【００１５】
【発明の実施の形態】
以下、本発明になる音声認識装置の実施の形態について図面を用いて説明する。
【００１６】
図１（Ａ）は、本発明になる音声認識装置の一実施の形態を示す外観図であり、１は薄箱状の携帯型音声認識装置本体、２はディスプレイ、３はディスプレイ２の左上端部近くに集音口を有する本体１に内蔵されたマイクロフォンである。４は電源スイッチを含む各種の操作キー群、５はスピーカ、６は本体１に内蔵された角度センサである。この角度センサ６は、ディスプレイ２の配設面の水平面（または水平面と垂直な面）に対する傾き角を検出するものである。
【００１７】
図１（Ｂ）は、同図（Ａ）に示された装置本体１をユーザ１１が右手１２で保持している様子を示すもので、この場合、ユーザ１１は装置本体１のマイクロフォン３に対して声を発している状態を示している。
【００１８】
一方、図２は、図１（Ａ）に示された装置１の内部構成を示すブロック図である。なお、図１と同一のものには同一番号を付している。角度センサ６には制御部７が接続され、角度センサ６から本体１の傾き角度に関する情報が入力される。８は制御部７に接続されたメモリであり、本体１の傾き角度に関する設定情報を記憶している。９は制御部７及びマイクロフォン３に接続された音声認識部である。
【００１９】
次に図２に示した装置の動作について、図３のフローチャートも用いて説明する。まず音声認識装置本体１を使用しようとするときには、本体の電源スイッチをオンにする（ステップＳ１）。これにより音声認識部９内で音声認識ソフトウェアが起動する（ステップＳ２）。次に、角度センサ６から本体１の傾き角度に関する情報を制御部７が得る（ステップＳ３）。ここで、角度センサ６から得た傾き角度が予め定めておいた角度範囲内にあるかを判断する（ステップＳ４）。すなわち、制御部７はメモリ８に記憶されている角度と角度センサ６から得た傾き角度とを比較する。ユーザ１１（図１（Ｂ）参照）が音声認識を行なわせようとすると、マイクロフォン３に対して発声を行なうために、マイクロフォン３を口に近づけるので、本体１（のディスプレイ２取り付け面）は通常保持されている角度（例えば水平面から６０度の角度）からより垂直に近い角度範囲（例えば、水平面に対し８０度から９０度の角度）に移動することになる（図１（Ｂ）参照）。このような通常とは異なる角度範囲内にあれば（ステップＳ４でＹＥＳ）、ユーザ１１が音声認識を行ないたい状態にあるとして、音声認識部９での音声認識を開始する（ステップＳ５）。範囲外で有れば（ステップＳ４でＮＯ）、ステップＳ３に戻り、角度センサ６から本体１の傾き角度に関する情報を再び得ることになる。
【００２０】
本実施の形態によれば、特別な操作を行うことなしに素早く音声認識動作を開始させることが出来、携帯型機器に適用して好適な音声認識装置を提供することが出来る。
【００２１】
図４は、本発明になる音声認識装置の他の実施の形態を示す外観図であり、先の実施の形態とは、後述する圧力センサが設けられている点で大きく相違する。すなわち、２１は薄箱状の携帯型音声認識装置本体、２２はディスプレイ、２３はディスプレイ２２の左上端部近くに集音口が設けられた本体２１内蔵のマイクロフォンである。２４は電源スイッチを含む各種の操作キー群、２５はスピーカである。さらに２６は本体２１に内蔵された角度センサである。この角度センサ２６は２次元的な角度センサであり、ディスプレイ２２の配設面の水平面（あるいは水平面に対し垂直な面に対する）傾き角及びディスプレイ２２の配設面と直交する面に対する傾き角の両方を検出する（図１の角度センサ６を２組用いても良い）。２７、２８はユーザが携帯型音声認識装置本体２１を保持した場合に、ユーザの手２９が当接する本体側面３０、３１の位置に設けられた圧力センサである。これら圧力センサ２７、２８は、それぞれ所定の間隔を空けた数点での圧力を検出可能なものとなっている。
【００２２】
図５は、図４にて外観が示された装置の内部構成を示すブロック図である。なお、図４と同一のものには同一番号を付している。角度センサ２６には制御部３２が接続され、角度センサ２６から本体２１の２方向にかかる傾き角度の情報が入力される。同様にして、圧力センサ２７、２８の出力端にも制御部３２が接続され、圧力センサ２７、２８から本体２１を保持する圧力点の数に関する情報が入力される。２３は制御部２２に接続されたメモリであり、本体１１が右手で保持された場合及び左手で保持された場合のそれぞれの傾き角度に関する設定情報を記憶している。３４は制御部３２及びマイクロフォン２２に接続された音声認識部である。
【００２３】
次に図５に示した装置の動作について、図６のフローチャートも用いて説明する。まず音声認識装置本体２１を使用しようとするときには、本体２１の電源スイッチをオンにする（ステップＳ１１）。これにより音声認識部３４内で音声認識ソフトウェアが起動する（ステップＳ１２）。次に、圧力センサ２７、２８からの本体２１の両側面３０、３１の圧力情報を制御部３２にて入手する（ステップＳ１３）。２つのセンサ２７、２８からの圧力情報の比較により、制御部３２は本体２１がユーザの右手で保持されているのか、左手で保持されているのかを判断する（ステップＳ１４）。すなわち、例えば右手で本体２１が保持されていれば、ディスプレイ２２の右側に位置するセンサ２８によって認識される圧力点は例えば１点で、ディスプレイ２２の左側に位置するセンサ２７によって認識される圧力点は複数（例えば、４点）である。従って、センサ２７にて認識される圧力点数の方がセンサ２８のそれよりも多い。従って、制御部３２は本体２１が右手で保持されていることを認識できる。逆に、左手によって装置本体２１が保持されている場合は、センサ２７にて認識される圧力点の数の方がセンサ２８にて認識される圧力点の数よりも少ない。これにより、制御部３２は本体２１が左手で保持されていることを認識できる。
【００２４】
ステップＳ１４にて右手によって保持されていることが分かれば（ステップＳ１４でＹＥＳ）、後述するように角度センサ２６からの傾き角度情報に基づき本体２１が所定の角度範囲にあるかを判断する際の上記角度範囲を右手での保持に対応させた角度範囲に設定する（ステップＳ１５）。この角度範囲に関する情報はメモリ３３に記憶されている。左手によって保持されているのであれば、同様にして左手での保持に対応させた角度範囲（同じくこの角度範囲に関する情報はメモリ３３に記憶されている）に設定する（ステップＳ１６）。
【００２５】
次に角度センサ２６から本体２１の傾き角度に関する情報を制御部３２が得る（ステップＳ１７）。ここで、角度センサ２６から得た傾き角度が予め定めておいた角度範囲内にあるかを判断する（ステップＳ１８）。すなわち、制御部３２はメモリ３３に記憶されている角度と角度センサ２６から得た傾き角度とを比較する。ユーザが音声認識を行おうとすると、マイクロフォン２２を口に近づけることになるので、本体２１（のディスプレイ２２取り付け面）は通常保持されている角度（例えば水平面から６０度の角度）からより垂直に近い角度範囲（例えば水平面に対し８０度から９０度の角度）に移動することになる。さらに、ユーザが右手で本体２１を保持している場合は、やや右傾きとなる（ディスプレイ２２の面と垂直な面に対してディスプレイ２２長手方向中心軸が所定角度右にずれる）（図４参照）。同様にして、ユーザが左手で本体２１を保持している場合は、本体２１が通常保持されている角度（例えば水平面から６０度の角度）からより垂直に近い角度範囲（例えば水平面に対し８０度から９０度の角度）に移動すると共に、やや左傾きとなる。
【００２６】
従って、本体２１が持ち手の種類毎に設定された通常とは異なる所定の角度範囲内（上述した２方向におけるそれぞれ定められた角度内）にあれば（ステップＳ１８でＹＥＳ）、ユーザが音声認識を行ないたい状態にあるとして、音声認識回路３３での音声認識を開始する（ステップＳ１９）。範囲外であれば（ステップＳ１８でＮＯ）、ステップＳ１７に戻り、角度センサ２６から本体２１の傾き角度に関する情報を再び得ることになる。
【００２７】
本実施の形態によれば、ユーザが音声認識を開始することを欲した場合に、先の実施の形態よりもさらに正確にその要求を検出して音声認識動作を開始させることが出来、携帯型機器に適用して好適な音声認識装置を提供することが出来る。
【００２８】
なお、この発明は上述した実施の形態に限定されるものではなく、種々の変形が可能であることは言うまでもない。例えば、メインスイッチとの併用による誤動作防止を行ったり、音声認識装置を立って保持している時や座って保持している時の使用形態を想定して、所定の傾き角度範囲を設定すること等で、より細かな音声認識動作の開始制御を行うことが可能となる。
【００２９】
【発明の効果】
以上説明したように、この発明によれば、特別な操作を行うことなしに音声認識動作を開始させることが出来、携帯型機器に適用して好適な音声認識装置を提供することができる。
【図面の簡単な説明】
【図１】本発明になる音声認識装置の実施の形態の一例を示す外観斜視図。
【図２】図１に示される装置のブロック図。
【図３】図２の装置の動作を説明するためのフローチャート。
【図４】本発明になる音声認識装置の他の実施の形態を示す外観斜視図。
【図５】図４に示される装置のブロック図。
【図６】図５の装置の動作を説明するためのフローチャート。
【符号の説明】
１：本体
３：マイクロフォン
６：角度センサ
７：制御部
８：メモリ
９：音声認識部[0001]
TECHNICAL FIELD OF THE INVENTION
The present invention relates to a speech recognition device.
[0002]
[Prior art]
Conventionally, various voice recognition devices have been developed, but voice recognition is generally started by a predetermined key operation. For example, Patent Literature 1 discloses a technology that uses a camera, detects that an operator has performed a specific operation, and starts voice recognition. However, it was unsuitable for use in portable devices. In Patent Document 2, light is projected on the face of a speaker, and voice recognition processing is performed only during a period in which the face of the speaker is facing the microphone. However, the technique is also used in a portable device. Was inappropriate.
[0003]
[Patent Document 1] JP-A-2000-338995 [Patent Document 2] JP-A-2000-187499
[Problems to be solved by the invention]
As described above, conventionally, the start of voice recognition has been performed by performing a predetermined key operation or performing a special operation.However, the operation is troublesome or requires a special operation. There is a problem that it is not suitable for mounting on a portable device.
[0005]
The present invention has been made to solve such a problem, and can provide a voice recognition device that can start a voice recognition operation without performing a special operation and is suitable for a portable device. The purpose is to do.
[0006]
[Means for Solving the Problems]
The voice recognition device according to claim 1 includes: a main body; a unit built in the main body for collecting voice; a voice recognition unit for recognizing the voice collected by the sound collection unit; The apparatus is characterized by comprising: means for detecting a tilt; and control means for starting a voice recognition operation in the voice recognition means based on a detection result of the tilt detection means.
[0007]
According to a second aspect of the present invention, in the voice recognition apparatus according to the first aspect, the control unit detects that the main body enters a specific angle range by the tilt detection unit and outputs the voice in the voice recognition unit. The recognition operation is started.
[0008]
According to a third aspect of the present invention, in the voice recognition apparatus according to the first or second aspect, the main body has a thin box shape, and the sound collection unit is built in an end of the main body. Is what you do.
[0009]
According to a fourth aspect of the present invention, there is provided the voice recognition apparatus according to any one of the first to third aspects, further comprising a switch provided on the main body, wherein the control unit determines whether the switch is turned on and the detection result of the tilt detection unit. The voice recognition unit is configured to start a voice recognition operation based on the voice recognition.
[0010]
The voice recognition device according to claim 5, wherein the main body, a means built in the main body for collecting voice, a voice recognition means for recognizing the voice collected by the voice collecting means, First detecting means for detecting an inclination, second detecting means for detecting which of the right and left hands is held when the main body is held by a user, and a detection result of the first detecting means And control means for starting a voice recognition operation in the voice recognition means based on a detection result of the second detection means.
[0011]
According to a sixth aspect of the present invention, in the voice recognition apparatus according to the fifth aspect, the control unit determines the first detection based on whether a detection result of the second detection unit is a right-hand holding or a left-hand holding. The tilt detection range of the means is made different.
[0012]
According to a seventh aspect of the present invention, in the voice recognition apparatus according to the fifth or sixth aspect, the main body has a thin box shape, and the sound collecting means is built in an end of the main body. Is what you do.
[0013]
According to an eighth aspect of the present invention, in the voice recognition apparatus according to the seventh aspect, the second detecting means includes a pressure sensor provided on one surface of the thin box-shaped main body and a surface opposed to this surface. It is characterized by having.
[0014]
According to a ninth aspect of the present invention, there is provided the voice recognition apparatus according to the fifth to eighth aspects, further comprising a switch provided on the main body, wherein the control unit determines whether the switch is turned on and the detection result of the inclination detection unit. The speech recognition unit is configured to start a speech recognition operation based on the speech recognition unit.
[0015]
BEST MODE FOR CARRYING OUT THE INVENTION
Hereinafter, embodiments of a speech recognition device according to the present invention will be described with reference to the drawings.
[0016]
FIG. 1A is an external view showing an embodiment of a speech recognition apparatus according to the present invention, wherein 1 is a thin box-shaped main body of the portable speech recognition apparatus, 2 is a display, and 3 is an upper left end of the display 2. This is a microphone built in the main body 1 having a sound collection port near the unit. 4 is a group of various operation keys including a power switch, 5 is a speaker, and 6 is an angle sensor built in the main body 1. The angle sensor 6 detects an inclination angle of a surface on which the display 2 is disposed with respect to a horizontal plane (or a plane perpendicular to the horizontal plane).
[0017]
FIG. 1B shows a state in which the user 11 holds the apparatus main body 1 shown in FIG. 1A with the right hand 12. In this case, the user 11 moves the microphone 3 of the apparatus main body 1. Uttering a voice.
[0018]
FIG. 2 is a block diagram showing the internal configuration of the device 1 shown in FIG. The same components as those in FIG. 1 are denoted by the same reference numerals. The controller 7 is connected to the angle sensor 6, and information about the tilt angle of the main body 1 is input from the angle sensor 6. Reference numeral 8 denotes a memory connected to the control unit 7, which stores setting information relating to the tilt angle of the main body 1. Reference numeral 9 denotes a voice recognition unit connected to the control unit 7 and the microphone 3.
[0019]
Next, the operation of the apparatus shown in FIG. 2 will be described with reference to the flowchart of FIG. First, when trying to use the voice recognition device main body 1, the power switch of the main body is turned on (step S1). Thereby, the voice recognition software is activated in the voice recognition unit 9 (step S2). Next, the controller 7 obtains information on the tilt angle of the main body 1 from the angle sensor 6 (step S3). Here, it is determined whether the tilt angle obtained from the angle sensor 6 is within a predetermined angle range (step S4). That is, the control unit 7 compares the angle stored in the memory 8 with the tilt angle obtained from the angle sensor 6. When the user 11 (see FIG. 1 (B)) attempts to perform voice recognition, the microphone 3 is brought close to the mouth in order to utter the microphone 3, so that the main body 1 (the mounting surface of the display 2) is usually It moves from the held angle (for example, an angle of 60 degrees from the horizontal plane) to a more perpendicular angle range (for example, an angle of 80 to 90 degrees with respect to the horizontal plane) (see FIG. 1B). If it is within such an unusual angle range (YES in step S4), it is determined that the user 11 wants to perform voice recognition, and the voice recognition unit 9 starts voice recognition (step S5). If it is out of the range (NO in step S4), the process returns to step S3, and information on the tilt angle of the main body 1 is obtained again from the angle sensor 6.
[0020]
According to the present embodiment, a speech recognition operation can be started quickly without performing a special operation, and a speech recognition device suitable for application to a portable device can be provided.
[0021]
FIG. 4 is an external view showing another embodiment of the speech recognition apparatus according to the present invention, which is largely different from the previous embodiment in that a pressure sensor described later is provided. That is, 21 is a thin box-shaped portable speech recognition device main body, 22 is a display, and 23 is a built-in microphone provided with a sound collection port near the upper left end of the display 22. 24 is a group of various operation keys including a power switch, and 25 is a speaker. An angle sensor 26 is built in the main body 21. The angle sensor 26 is a two-dimensional angle sensor, and has both a tilt angle of a horizontal plane (or a plane perpendicular to the horizontal plane) of the arrangement surface of the display 22 and a tilt angle with respect to a plane orthogonal to the arrangement surface of the display 22. (Two sets of angle sensors 6 in FIG. 1 may be used). Reference numerals 27 and 28 denote pressure sensors provided at the positions of the main body side surfaces 30 and 31 with which the user's hand 29 abuts when the user holds the portable voice recognition device main body 21. These pressure sensors 27 and 28 are each capable of detecting pressure at several points at predetermined intervals.
[0022]
FIG. 5 is a block diagram showing the internal configuration of the device whose appearance is shown in FIG. The same components as those in FIG. 4 are denoted by the same reference numerals. The controller 32 is connected to the angle sensor 26, and information on the tilt angles of the main body 21 in two directions is input from the angle sensor 26. Similarly, the control unit 32 is connected to the output terminals of the pressure sensors 27 and 28, and information on the number of pressure points holding the main body 21 is input from the pressure sensors 27 and 28. Reference numeral 23 denotes a memory connected to the control unit 22, which stores setting information relating to the respective tilt angles when the main body 11 is held by the right hand and when it is held by the left hand. Reference numeral 34 denotes a voice recognition unit connected to the control unit 32 and the microphone 22.
[0023]
Next, the operation of the apparatus shown in FIG. 5 will be described with reference to the flowchart of FIG. First, when trying to use the voice recognition device main body 21, the power switch of the main body 21 is turned on (step S11). Thereby, the voice recognition software is activated in the voice recognition unit 34 (Step S12). Next, pressure information of both side surfaces 30, 31 of the main body 21 from the pressure sensors 27, 28 is obtained by the control unit 32 (step S13). By comparing the pressure information from the two sensors 27 and 28, the control unit 32 determines whether the main body 21 is held by the right hand or the left hand of the user (step S14). That is, for example, if the main body 21 is held by the right hand, the pressure point recognized by the sensor 28 located on the right side of the display 22 is, for example, one point, and the pressure point recognized by the sensor 27 located on the left side of the display 22. Is a plurality (for example, 4 points). Therefore, the number of pressure points recognized by the sensor 27 is larger than that of the sensor 28. Therefore, the control unit 32 can recognize that the main body 21 is held by the right hand. Conversely, when the apparatus main body 21 is held by the left hand, the number of pressure points recognized by the sensor 27 is smaller than the number of pressure points recognized by the sensor 28. Thereby, the control unit 32 can recognize that the main body 21 is held by the left hand.
[0024]
If it is determined in step S14 that the main body 21 is held by the right hand (YES in step S14), it is determined whether or not the main body 21 is within a predetermined angle range based on the tilt angle information from the angle sensor 26 as described later. The angle range is set to an angle range corresponding to holding with the right hand (step S15). Information on the angle range is stored in the memory 33. If it is held by the left hand, it is similarly set to an angle range corresponding to holding by the left hand (similarly, information on this angle range is stored in the memory 33) (step S16).
[0025]
Next, the controller 32 obtains information on the tilt angle of the main body 21 from the angle sensor 26 (step S17). Here, it is determined whether the inclination angle obtained from the angle sensor 26 is within a predetermined angle range (step S18). That is, the control unit 32 compares the angle stored in the memory 33 with the tilt angle obtained from the angle sensor 26. When the user attempts to perform speech recognition, the microphone 22 is brought closer to the mouth, so that the main body 21 (the surface on which the display 22 is mounted) is closer to a more perpendicular angle from the normally held angle (for example, an angle of 60 degrees from the horizontal plane). It will move in an angle range (for example, an angle of 80 to 90 degrees with respect to the horizontal plane). Further, when the user holds the main body 21 with the right hand, the main body 21 is inclined slightly to the right (the longitudinal axis of the display 22 is shifted rightward by a predetermined angle with respect to a plane perpendicular to the surface of the display 22) (see FIG. 4). ). Similarly, when the user is holding the main body 21 with the left hand, the angle range in which the main body 21 is normally held (for example, an angle of 60 degrees from the horizontal plane) to a more vertical angle (for example, 80 degrees with respect to the horizontal plane) (An angle of 90 degrees from).
[0026]
Therefore, if the main body 21 is within a predetermined angle range different from the normal set for each type of the handle (within the respective angles determined in the two directions described above) (YES in step S18), the user performs voice recognition. , The speech recognition circuit 33 starts speech recognition (step S19). If it is out of the range (NO in step S18), the process returns to step S17, and information on the tilt angle of the main body 21 is obtained again from the angle sensor 26.
[0027]
According to the present embodiment, when the user wants to start voice recognition, the request can be detected more accurately than in the previous embodiment, and the voice recognition operation can be started. It is possible to provide a speech recognition device suitable for application to a device.
[0028]
Note that the present invention is not limited to the above-described embodiment, and it goes without saying that various modifications are possible. For example, it is necessary to set a predetermined tilt angle range in consideration of a malfunction prevention by using the main switch together with the main switch, and assuming a use form when the voice recognition device is standing and held or sitting and held. Thus, it is possible to perform more detailed control of the start of the voice recognition operation.
[0029]
【The invention's effect】
As described above, according to the present invention, a voice recognition operation can be started without performing a special operation, and a voice recognition device suitable for application to a portable device can be provided.
[Brief description of the drawings]
FIG. 1 is an external perspective view showing an example of an embodiment of a speech recognition device according to the present invention.
FIG. 2 is a block diagram of the device shown in FIG.
FIG. 3 is a flowchart for explaining the operation of the apparatus in FIG. 2;
FIG. 4 is an external perspective view showing another embodiment of the speech recognition apparatus according to the present invention.
FIG. 5 is a block diagram of the device shown in FIG.
FIG. 6 is a flowchart for explaining the operation of the apparatus in FIG. 5;
[Explanation of symbols]
1: body 3: microphone 6: angle sensor 7: control unit 8: memory 9: voice recognition unit

Claims

A main body, means for collecting sound incorporated in the main body, voice recognition means for recognizing the sound collected by the sound collecting means, means for detecting a tilt of the main body, and detecting the tilt Control means for starting a voice recognition operation in the voice recognition means based on a detection result of the means.

2. The voice recognition device according to claim 1, wherein the control means detects that the main body enters a specific angle range by the inclination detection means and starts a voice recognition operation in the voice recognition means. Characteristic speech recognition device.

3. A speech recognition apparatus according to claim 1, wherein said main body is formed in a thin box shape, and said sound collecting means is built in an end of said main body.

4. The voice recognition device according to claim 1, further comprising a switch provided on a main body, wherein the control unit performs a voice recognition operation in the voice recognition unit based on a result of ON of the switch and a detection result of the inclination detection unit. A speech recognition device characterized by being configured to start.

A main body, a unit built in the main body for collecting sound, a voice recognizing unit for recognizing the sound collected by the sound collecting unit, and a first detecting unit for detecting a tilt of the main body. A second detecting means for detecting whether the main body is held by the right or left hand when the main body is held by a user, and a detection result of the first detecting means and a detection result of the second detecting means. Control means for starting a voice recognition operation in said voice recognition means based on said voice recognition means.

6. The voice recognition device according to claim 5, wherein the control unit changes the inclination detection range of the first detection unit according to whether the detection result by the second detection unit is right-hand holding or left-hand holding. A speech recognition device characterized by the following.

7. The speech recognition apparatus according to claim 5, wherein said main body is in a thin box shape, and said sound collecting means is built in an end of said main body.

8. A speech recognition apparatus according to claim 7, wherein said second detecting means includes pressure sensors provided on one surface of the thin box-shaped main body and a surface facing the one surface. .

9. The voice recognition device according to claim 5, further comprising a switch provided on the main body, wherein the control unit performs a voice recognition operation in the voice recognition unit based on the ON of the switch and a detection result of the inclination detection unit. A speech recognition device characterized by being configured to start.