JP2009025715A

JP2009025715A - In-vehicle device and speech recognition method

Info

Publication number: JP2009025715A
Application number: JP2007190866A
Authority: JP
Inventors: Tsuguo Sumizawa; 紹男住沢
Original assignee: Xanavi Informatics Corp
Current assignee: Faurecia Clarion Electronics Co Ltd
Priority date: 2007-07-23
Filing date: 2007-07-23
Publication date: 2009-02-05

Abstract

<P>PROBLEM TO BE SOLVED: To provide an in-vehicle device 20 in which, according to a position in a vehicle where a passenger is seated, an operation command which is permitted for the crew who is seated at the position can be set, in the in-vehicle device 20 for performing processing according to the operation command which is input by speech. <P>SOLUTION: The in-vehicle device 20 of the invention 20 in which an operation command permitted for each seat position is determined beforehand, specifies the speech generation position of speech corresponding to the speech signal from the speech signal collected via a plurality of microphones 11, when the operation command is input by speech, and performs processing corresponding to the operation command, when the operation command by the speech is the one permitted at the generation position. <P>COPYRIGHT: (C)2009,JPO&INPIT

Description

本発明は、車両に搭載され、音声により入力された操作コマンドに応じて処理を実行する装置に関する。 The present invention relates to an apparatus that is mounted on a vehicle and executes processing in accordance with an operation command input by voice.

特許文献１には、車両が停止中の場合には全ての操作を許可し、車両が走行中の場合には予め定められた操作のみを許可する車載用ナビゲーション装置が開示されている。これにより、画面を注視しながら操作する必要がある操作を、車両の走行中に、運転者に操作させることを防止することができ、運転者を運転に集中させることができる。また、特許文献２には、車両に搭載され、音声により入力された操作コマンドに応じて処理を行う音声認識装置が開示されている。 Patent Document 1 discloses an in-vehicle navigation device that permits all operations when the vehicle is stopped, and permits only predetermined operations when the vehicle is running. Thereby, it is possible to prevent the driver from operating an operation that needs to be performed while gazing at the screen while the vehicle is traveling, and the driver can be concentrated on driving. Patent Document 2 discloses a voice recognition device that is mounted on a vehicle and performs processing according to an operation command input by voice.

特開平７−２７０１７３号公報JP-A-7-270173 特開平１１−１５４９４号公報Japanese Patent Laid-Open No. 11-15494

ところで、車両の走行中に、運転者以外の同乗者が、車載装置の機能を利用したい場合がある。同乗者は、運転していないため、画面を注視しながら操作する必要がある操作を行っても、運転の妨げとなることはない。しかし、従来は、車両が走行中の場合には、予め定められた操作のみが許可され、同乗者にとっては利便性の低いものとなっていた。これは、音声により操作コマンドを入力する場合においても同様であった。 By the way, during traveling of the vehicle, a passenger other than the driver may want to use the function of the in-vehicle device. Since the passenger is not driving, even if he / she performs an operation that needs to be performed while gazing at the screen, it does not hinder driving. However, conventionally, when the vehicle is running, only predetermined operations are permitted, which is inconvenient for passengers. This was the same when inputting an operation command by voice.

本発明は上記事情を鑑みてなされたものであり、本発明の目的は、音声により入力された操作コマンドに応じて処理を行う車載装置において、乗員が座っている車両内の位置に応じて、当該位置に座っている乗員に対して許可する操作コマンドを設定することができるようにすることにある。 The present invention has been made in view of the above circumstances, and an object of the present invention is an in-vehicle device that performs processing according to an operation command input by voice, depending on the position in the vehicle where the occupant is sitting, It is to be able to set an operation command to be permitted for a passenger sitting at the position.

上記課題を解決するために、本発明の車載装置は、座席位置毎に許可される操作コマンドが予め定められており、音声により操作コマンドが入力された場合に、複数のマイクロフォンを介して収集された音声信号から当該音声信号に対応する音声の発生位置を特定し、入力された操作コマンドが当該発生位置において許可されている操作コマンドである場合に、当該操作コマンドに対応する処理を実行する。 In order to solve the above-described problem, the in-vehicle device of the present invention has a predetermined operation command for each seat position, and is collected via a plurality of microphones when the operation command is input by voice. The voice generation position corresponding to the voice signal is identified from the voice signal, and when the input operation command is an operation command permitted at the generation position, processing corresponding to the operation command is executed.

例えば、本発明の第一の態様は、車両に搭載され、音声により入力された操作コマンドに応じて処理を実行する車載装置であって、車両内の座席の領域を示す情報に対応付けて、当該座席に座った人に対して許可する操作コマンドを格納する許可コマンド格納手段と、操作コマンドの入力開始の指示をユーザから受け付ける音声認識開始受付手段と、音声認識開始受付手段が操作コマンドの入力開始の指示をユーザから受け付けた後に、複数のマイクロフォンのそれぞれを介して収集された音声信号から当該音声信号に対応する音声の発生位置を特定し、特定した発生位置を、当該発生位置から発生した音声に対応する音声信号と共に出力する音源位置特定手段と、音源位置特定手段から出力された音声信号から操作コマンドを認識する音声認識手段と、音源位置特定手段から出力された音声信号に対応する音声の発生位置において許可されている操作コマンドを許可コマンド格納手段から抽出し、音声認識手段によって認識された操作コマンドが、当該抽出した操作コマンドのいずれかと同一である場合に、音声認識手段によって認識された操作コマンドに対応する処理を実行するコマンド処理手段とを備えることを特徴とする車載装置を提供する。 For example, the first aspect of the present invention is an in-vehicle device that is mounted on a vehicle and executes processing in response to an operation command input by voice, in association with information indicating a seat area in the vehicle, The permission command storage means for storing the operation command permitted for the person sitting in the seat, the voice recognition start receiving means for receiving an instruction to start the input of the operation command from the user, and the voice recognition start receiving means for inputting the operation command After receiving the start instruction from the user, the sound generation position corresponding to the sound signal is identified from the sound signals collected via each of the plurality of microphones, and the identified position is generated from the position. Sound source position specifying means output together with a sound signal corresponding to the sound, and sound for recognizing an operation command from the sound signal output from the sound source position specifying means And the operation command permitted at the sound generation position corresponding to the sound signal output from the sound source position specifying means is extracted from the permission command storage means, and the operation command recognized by the voice recognition means is extracted. And a command processing unit that executes a process corresponding to the operation command recognized by the voice recognition unit when the operation command is the same as any one of the operation commands.

また、本発明の第二の態様は、車両に搭載され、音声により入力された操作コマンドに応じて処理を実行する車載装置における音声認識方法であって、車載装置は、車両内の座席の領域を示す情報に対応付けて、当該座席に座った人に対して許可する操作コマンドを許可コマンド格納手段に格納する許可コマンド格納ステップと、操作コマンドの入力開始の指示をユーザから受け付ける音声認識開始受付ステップと、音声認識開始受付ステップにおいて操作コマンドの入力開始の指示をユーザから受け付けた後に、複数のマイクロフォンのそれぞれを介して収集された音声信号から当該音声信号に対応する音声の発生位置を特定し、特定した発生位置を、当該発生位置から発生した音声に対応する音声信号と共に出力する音源位置特定ステップと、音源位置特定ステップにおいて出力した音声信号から操作コマンドを認識する音声認識ステップと、音源位置特定ステップにおいて出力した音声信号に対応する音声の発生位置において許可されている操作コマンドを許可コマンド格納手段から抽出し、音声認識ステップにおいて認識した操作コマンドが、当該抽出した操作コマンドのいずれかと同一である場合に、音声認識ステップにおいて認識した操作コマンドに対応する処理を実行するコマンド処理ステップとを実行することを特徴とする音声認識方法を提供する。 The second aspect of the present invention is a speech recognition method in an in-vehicle device that is mounted on a vehicle and executes processing in accordance with an operation command input by voice, wherein the in-vehicle device is a seat area in the vehicle. In correspondence with the information indicating the permission command storage step for storing the operation command permitted for the person sitting on the seat in the permission command storage means, and the instruction for starting the input of the operation command is received from the user. After receiving an operation command input start instruction from the user in the step and the voice recognition start reception step, the voice generation position corresponding to the voice signal is specified from the voice signals collected through each of the plurality of microphones. The sound source position specifying step for outputting the specified generation position together with the audio signal corresponding to the sound generated from the generation position. A voice recognition step for recognizing an operation command from the voice signal output in the sound source position specifying step, and a permission command storage means for the operation command permitted in the sound generation position corresponding to the voice signal output in the sound source position specifying step. A command processing step for executing a process corresponding to the operation command recognized in the voice recognition step when the operation command extracted from the voice recognition step is the same as one of the extracted operation commands. A speech recognition method is provided.

本発明の車載装置によれば、音声により入力された操作コマンドに応じて処理を行う車載装置において、乗員が座っている車両内の位置に応じて、当該位置に座っている乗員に対して許可する操作コマンドを設定することができる。 According to the vehicle-mounted device of the present invention, in the vehicle-mounted device that performs processing according to an operation command input by voice, permission is given to a passenger sitting at the position according to the position in the vehicle where the passenger is sitting. Operation commands to be set can be set.

まず、本発明の第一の実施形態について、図面を参照しながら説明する。 First, a first embodiment of the present invention will be described with reference to the drawings.

図１は、本発明の第一の実施形態に係る車載システム１０の構成を示すシステム構成図である。車載システム１０は、複数のマイクロフォン１１、音声認識開始ボタン１２、表示装置１３、センサ１４、および車載装置２０を備える。車載装置２０は、音源位置特定部２１、開始指示受付部２２、音声認識部２３、コマンド処理部２４、および許可コマンド格納部２５を有する。 FIG. 1 is a system configuration diagram showing the configuration of an in-vehicle system 10 according to the first embodiment of the present invention. The in-vehicle system 10 includes a plurality of microphones 11, a voice recognition start button 12, a display device 13, a sensor 14, and an in-vehicle device 20. The in-vehicle device 20 includes a sound source position specifying unit 21, a start instruction receiving unit 22, a voice recognition unit 23, a command processing unit 24, and a permission command storage unit 25.

開始指示受付部２２は、ユーザによって音声認識開始ボタン１２が押下された場合に、音声信号に対応する音声の発生位置の特定を音源位置特定部２１に指示する。音源位置特定部２１は、開始指示受付部２２から音声信号の発生位置の特定を指示された場合に、車両内に設けられた複数のマイクロフォン１１のそれぞれを介して収集された音声信号の遅延量や振幅の差等に基づいて、当該音声信号に対応する音声の発生位置を特定する。そして、音源位置特定部２１は、特定した発生位置を、当該発生位置から発生した音声に対応する音声信号と共に音声認識部２３へ出力する。 The start instruction receiving unit 22 instructs the sound source position specifying unit 21 to specify the sound generation position corresponding to the sound signal when the voice recognition start button 12 is pressed by the user. When the sound source position specifying unit 21 is instructed to specify the generation position of the audio signal from the start instruction receiving unit 22, the delay amount of the audio signal collected through each of the plurality of microphones 11 provided in the vehicle. Based on the difference in amplitude and the like, the generation position of the sound corresponding to the sound signal is specified. Then, the sound source position specifying unit 21 outputs the specified generation position to the voice recognition unit 23 together with a voice signal corresponding to the voice generated from the generation position.

また、複数のマイクロフォン１１のそれぞれを介して収集された信号に、異なる位置から発生した複数の音声に対応する音声信号が含まれている場合、音源位置特定部２１は、発生位置毎に、当該発生位置から発生した音声に対応する音声信号を、対応する発生位置を示す情報と共に音声認識部２３へ出力する。音源位置特定部２１から音声認識部２３へ出力されるデータ４０には、例えば図２に示すように、車両内での音声の発生位置を示す音源位置４１に対応付けて、当該音源位置４１から発生した音声に対応する音声信号４２が格納される。 In addition, when the signals collected via each of the plurality of microphones 11 include sound signals corresponding to a plurality of sounds generated from different positions, the sound source position specifying unit 21 performs the corresponding process for each occurrence position. A voice signal corresponding to the voice generated from the generation position is output to the voice recognition unit 23 together with information indicating the corresponding generation position. For example, as shown in FIG. 2, the data 40 output from the sound source position specifying unit 21 to the voice recognition unit 23 is associated with the sound source position 41 indicating the sound generation position in the vehicle, from the sound source position 41. An audio signal 42 corresponding to the generated audio is stored.

本実施形態において、音源位置４１には、車両内の所定の高さにおける水平面をｘｙ平面とした場合のｘｙ平面上の座標が格納される。また、他の例として、音源位置４１には、車両内の位置を示す三次元座標が格納されていてもよい。 In the present embodiment, the sound source position 41 stores coordinates on the xy plane when the horizontal plane at a predetermined height in the vehicle is the xy plane. As another example, the sound source position 41 may store three-dimensional coordinates indicating a position in the vehicle.

音声認識部２３は、音源位置特定部２１から出力されたそれぞれの音声信号から公知の音声認識技術を用いて、操作コマンドを認識し、認識した操作コマンドを、当該操作コマンドの元となった音声信号に対応する音声の発生位置を示す情報と共にコマンド処理部２４へ出力する。 The voice recognition unit 23 recognizes an operation command from each voice signal output from the sound source position specifying unit 21 using a known voice recognition technology, and the recognized operation command is used as the voice that is the basis of the operation command. It outputs to the command processing part 24 with the information which shows the generation | occurrence | production position of the audio | voice corresponding to a signal.

許可コマンド格納部２５には、例えば図３に示すように、車両内の座席の領域を示す座席領域２５０に対応付けて、当該座席領域２５０で示される座席の属性２５１、当該座席に座った人に対して車両の走行時に許可する複数の操作コマンドを示す走行中許可コマンド２５２、および、当該座席に座った人に対して停車中に許可する複数の操作コマンドを示す停止中許可コマンド２５３が予め格納されている。 In the permission command storage unit 25, for example, as shown in FIG. 3, a seat attribute 251 indicated by the seat area 250 in association with a seat area 250 indicating a seat area in the vehicle, a person sitting in the seat A running permission command 252 indicating a plurality of operation commands permitted when the vehicle is traveling, and a stopping permission command 253 indicating a plurality of operation commands permitted to the person sitting in the seat while the vehicle is stopped Stored.

座席領域２５０には、例えば図４に示すように、車両内の所定の高さにおける水平面をｘｙ平面とした場合のｘｙ平面上において、それぞれの座席位置を囲む矩形領域の対向する頂点の座標が格納される。図４は、車両の内部を上空から見た図を模式的に表したものであり、ハンドル１５近傍の領域３０は運転席を示し、領域３１は助手席を示し、領域３２は後部座席を示している。なお、図４に示す例において、複数のマイクロフォン１１および表示装置１３は、ダッシュボード付近に設けられる。 In the seat area 250, for example, as shown in FIG. 4, the coordinates of the vertices facing each other in the rectangular area surrounding each seat position on the xy plane when the horizontal plane at a predetermined height in the vehicle is the xy plane. Stored. FIG. 4 schematically shows the interior of the vehicle as viewed from above. An area 30 near the handle 15 indicates a driver seat, an area 31 indicates a passenger seat, and an area 32 indicates a rear seat. ing. In the example shown in FIG. 4, the plurality of microphones 11 and the display device 13 are provided near the dashboard.

走行中許可コマンド２５２または停止中許可コマンド２５３において、全ての操作コマンドが許可される場合、全ての操作コマンドが許可される旨を示す「ＡＬＬ」が格納される。 When all the operation commands are permitted in the travel permission command 252 or the stop permission command 253, “ALL” indicating that all the operation commands are permitted is stored.

コマンド処理部２４は、音声認識部２３から、音声信号および当該音声信号に対応する音声の発生位置を示す情報を受信した場合に、許可コマンド格納部２５を参照して、当該発生位置が含まれる座席領域を特定する。そして、コマンド処理部２４は、ＧＰＳ（Global Positioning System）受信機や方位センサ、距離センサ等のセンサ１４から受信した測定信号に基づいて車両が走行中か否かを判定する。 When the command processing unit 24 receives a voice signal and information indicating the voice generation position corresponding to the voice signal from the voice recognition unit 23, the command processing unit 24 refers to the permission command storage unit 25 and includes the generation position. Identify the seating area. The command processing unit 24 determines whether or not the vehicle is traveling based on a measurement signal received from a sensor 14 such as a GPS (Global Positioning System) receiver, a direction sensor, or a distance sensor.

車両が走行中である場合、コマンド処理部２４は、特定した座席領域に対応付けられており、走行中許可コマンド２５２に格納されている複数の操作コマンドを抽出する。一方、車両が停止中である場合、コマンド処理部２４は、特定した座席領域に対応付けられており、停止中許可コマンド２５３に格納されている複数の操作コマンドを抽出する。 When the vehicle is traveling, the command processing unit 24 extracts a plurality of operation commands that are associated with the specified seat area and stored in the traveling permission command 252. On the other hand, when the vehicle is stopped, the command processing unit 24 extracts a plurality of operation commands that are associated with the specified seat area and stored in the stop permission command 253.

そして、コマンド処理部２４は、音声認識部２３から受信した操作コマンドが、許可コマンド格納部２５から抽出した複数の操作コマンドのいずれかに該当する場合に、当該操作コマンドの元となった音声信号に対応する音声の発生位置が含まれる座席領域を、例えば図５に示すように表示装置１３に表示して、音声認識部２３から受信した操作コマンドに対応する処理を実行する。 Then, when the operation command received from the voice recognition unit 23 corresponds to one of a plurality of operation commands extracted from the permission command storage unit 25, the command processing unit 24 generates a voice signal that is the source of the operation command. For example, as shown in FIG. 5, the seat area including the voice generation position corresponding to is displayed on the display device 13, and processing corresponding to the operation command received from the voice recognition unit 23 is executed.

図５に示した例において、コマンド処理部２４は、表示装置１３の画面内に音声信号の発生位置を示すアイコン５０を表示している。アイコン５０には、助手席を示す領域５１、運転席を示す領域５２、および後部座席を示す領域５３が含まれる。図５に示した例では、運転席から発生した音声による操作コマンドに対応する処理が実行された旨が表示されている。 In the example shown in FIG. 5, the command processing unit 24 displays an icon 50 indicating the generation position of the audio signal in the screen of the display device 13. The icon 50 includes an area 51 indicating a passenger seat, an area 52 indicating a driver seat, and an area 53 indicating a rear seat. In the example shown in FIG. 5, it is displayed that processing corresponding to an operation command by voice generated from the driver's seat has been executed.

また、他の例として、図６に示すように、画面のふちに沿って、助手席から発生した音声による操作コマンドに対応する処理が実行された旨を表示する領域５５、運転席から発生した音声による操作コマンドに対応する処理が実行された旨を表示する領域５６、および後部座席から発生した音声による操作コマンドに対応する処理が実行された旨を表示する領域５７を表示するようにしてもよい。 As another example, as shown in FIG. 6, an area 55 indicating that a process corresponding to an operation command by voice generated from the passenger seat has been executed along the edge of the screen is generated from the driver's seat. An area 56 for displaying that the process corresponding to the voice operation command has been executed and an area 57 for displaying that the process corresponding to the voice operation command generated from the rear seat has been executed may be displayed. Good.

なお、音声認識部２３から複数の音声信号およびそれぞれの音声信号に対応する音声の発生位置を示す情報を受信した場合、コマンド処理部２４は、それぞれの音声信号に対応する音声の発生位置に基づいて許可コマンド格納部２５を参照し、運転席の領域から発生した音声に対応する音声信号による操作コマンドを優先して処理する。運転席から発生した音声に対応する音声信号による操作コマンドがなかった場合、コマンド処理部２４は、例えば助手席、後部座席の順に優先して操作コマンドを実行する。 When receiving information indicating a plurality of voice signals and voice generation positions corresponding to the respective voice signals from the voice recognition unit 23, the command processing unit 24 is based on the voice generation positions corresponding to the respective voice signals. The permission command storage unit 25 is referred to, and the operation command based on the audio signal corresponding to the audio generated from the driver's seat area is preferentially processed. When there is no operation command by an audio signal corresponding to the voice generated from the driver's seat, the command processing unit 24 executes the operation command with priority in the order of the passenger seat and the rear seat, for example.

図７は、車載装置２０の動作の一例を示すフローチャートである。例えば車両のエンジンが始動する等の所定のタイミングで、車載装置２０は、本フローチャートに示す動作を開始する。 FIG. 7 is a flowchart illustrating an example of the operation of the in-vehicle device 20. For example, the vehicle-mounted device 20 starts the operation shown in this flowchart at a predetermined timing such as when the vehicle engine is started.

まず、開始指示受付部２２は、音声認識開始ボタン１２が押下されたいか否かを判定する（Ｓ１００）。音声認識開始ボタン１２が押下されていない場合（Ｓ１００：Ｎｏ）、開始指示受付部２２は、音声認識開始ボタン１２が押下されるまでステップＳ１００に示した処理を繰り返す。 First, the start instruction receiving unit 22 determines whether or not the voice recognition start button 12 is desired to be pressed (S100). When the voice recognition start button 12 is not pressed (S100: No), the start instruction receiving unit 22 repeats the process shown in step S100 until the voice recognition start button 12 is pressed.

音声認識開始ボタン１２が押下された場合（Ｓ１００：Ｙｅｓ）、開始指示受付部２２は、音声信号に対応する音声の発生位置の特定を音源位置特定部２１に指示する。音源位置特定部２１は、車両内に設けられた複数のマイクロフォン１１のそれぞれを介して収集された音声信号の遅延量や振幅の差等に基づいて、当該音声信号に対応する音声の発生位置を特定し（Ｓ１０１）、特定した発生位置を、当該発生位置から発生した音声に対応する音声信号と共に音声認識部２３へ出力する。 When the voice recognition start button 12 is pressed (S100: Yes), the start instruction receiving unit 22 instructs the sound source position specifying unit 21 to specify the sound generation position corresponding to the sound signal. The sound source position specifying unit 21 determines the sound generation position corresponding to the sound signal based on the delay amount or difference in amplitude of the sound signal collected via each of the plurality of microphones 11 provided in the vehicle. The specified generation position is output to the voice recognition unit 23 together with a voice signal corresponding to the voice generated from the generation position.

次に、音声認識部２３は、音源位置特定部２１から出力されたそれぞれの音声信号から公知の音声認識技術を用いて、操作コマンドを認識し（Ｓ１０２）、認識した操作コマンドを、当該操作コマンドの元となった音声信号に対応する音声の発生位置を示す情報と共にコマンド処理部２４へ出力する。 Next, the voice recognition unit 23 recognizes an operation command from each voice signal output from the sound source position specifying unit 21 using a known voice recognition technique (S102), and the recognized operation command is converted into the operation command. Are output to the command processing unit 24 together with information indicating the sound generation position corresponding to the sound signal that is the source of the above.

次に、コマンド処理部２４は、音声認識部２３から出力された操作コマンドを参照して、複数の操作コマンドが認識されたか否かを判定する（Ｓ１０３）。複数の操作コマンドが認識された場合（Ｓ１０３：Ｙｅｓ）、コマンド処理部２４は、操作コマンドと共に音声認識部２３から出力された音声の発生位置に基づいて許可コマンド格納部２５を参照し、運転席の領域から発生した音声に対応する音声信号による操作コマンドを優先して処理する（Ｓ１０４）。 Next, the command processing unit 24 refers to the operation command output from the voice recognition unit 23 and determines whether or not a plurality of operation commands have been recognized (S103). When a plurality of operation commands are recognized (S103: Yes), the command processing unit 24 refers to the permission command storage unit 25 based on the voice generation position output from the voice recognition unit 23 together with the operation command, and the driver's seat An operation command based on an audio signal corresponding to the audio generated from the area is preferentially processed (S104).

次に、コマンド処理部２４は、処理した操作コマンドに対応する音声の発生位置が含まれる座席位置を表示装置１３に表示し（Ｓ１０５）、開始指示受付部２２は、ステップＳ１００に示した処理を実行する。 Next, the command processing unit 24 displays the seat position including the sound generation position corresponding to the processed operation command on the display device 13 (S105), and the start instruction receiving unit 22 performs the process shown in step S100. Execute.

ステップＳ１０３において、単一の操作コマンドが認識された場合（Ｓ１０３：Ｎｏ）、コマンド処理部２４は、当該認識された操作コマンドに対応する処理を実行し（Ｓ１０６）、ステップＳ１０５に示した処理を実行する。 When a single operation command is recognized in step S103 (S103: No), the command processing unit 24 executes a process corresponding to the recognized operation command (S106), and performs the process shown in step S105. Execute.

以上、本発明の第一の実施形態について説明した。 The first embodiment of the present invention has been described above.

上記説明から明らかなように、本実施形態の車載システム１０によれば、音声により入力された操作コマンドに応じて処理を行う車載装置２０において、乗員が座っている車両内の位置に応じて、当該位置に座っている乗員に対して許可する操作コマンドを設定することができる。 As is clear from the above description, according to the in-vehicle system 10 of the present embodiment, in the in-vehicle device 20 that performs processing according to the operation command input by voice, according to the position in the vehicle where the occupant is sitting, It is possible to set an operation command to be permitted for a passenger sitting at the position.

次に、本発明の第二の実施形態について説明する。 Next, a second embodiment of the present invention will be described.

図８は、本発明の第二実施形態に係る車載システム１０の構成を示すシステム構成図である。車載システム１０は、複数のマイクロフォン１１、音声認識開始ボタン１２、複数の表示装置１３、センサ１４、および車載装置２０を備える。車載装置２０は、音源位置特定部２１、開始指示受付部２２、音声認識部２３、コマンド処理部２４、および許可コマンド格納部２５を有する。なお、以下に説明する点を除き、図８において、図１と同じ符号を付した構成は、図１における構成と同一または同様の機能を有するため説明を省略する。 FIG. 8 is a system configuration diagram showing the configuration of the in-vehicle system 10 according to the second embodiment of the present invention. The in-vehicle system 10 includes a plurality of microphones 11, a voice recognition start button 12, a plurality of display devices 13, a sensor 14, and an in-vehicle device 20. The in-vehicle device 20 includes a sound source position specifying unit 21, a start instruction receiving unit 22, a voice recognition unit 23, a command processing unit 24, and a permission command storage unit 25. Except for the points described below, in FIG. 8, the components denoted by the same reference numerals as those in FIG. 1 have the same or similar functions as those in FIG.

それぞれの表示装置１３は、例えば図９に示すように、車両内の異なる位置に設けられる。表示装置１３−１は例えば運転席の前に設けられ、運転席に座った者に見せる画面を表示する。表示装置１３−２は例えば助手席の前に設けられ、助手席に座った者に見せる画面を表示する。表示装置１３−３は、例えば後部座席の前に設けられたルーフモニタであり、後部座席に座った者に見せる画面を表示する。 Each display device 13 is provided at a different position in the vehicle, for example, as shown in FIG. The display device 13-1 is provided in front of the driver's seat, for example, and displays a screen to be shown to a person sitting in the driver's seat. The display device 13-2 is provided in front of the passenger seat, for example, and displays a screen to be shown to a person sitting in the passenger seat. The display device 13-3 is, for example, a roof monitor provided in front of the rear seat, and displays a screen to be shown to a person sitting on the rear seat.

許可コマンド格納部２５には、例えば図１０に示すように、座席領域２５０に対応付けて、属性２５１、走行中許可コマンド２５２、停止中許可コマンド２５３、および、座席領域２５０で示される座席から発生した音声の音声信号に対応する操作コマンドによる処理結果を反映させる表示装置１３を識別する表示装置ＩＤ２５４が予め格納されている。 In the permission command storage unit 25, for example, as shown in FIG. 10, it is generated from the seat indicated by the attribute 251, the permission command 252 during travel, the permission command 253 during stop, and the seat region 250 in association with the seat region 250. The display device ID 254 for identifying the display device 13 that reflects the processing result of the operation command corresponding to the sound signal of the sound is stored in advance.

コマンド処理部２４は、音声認識部２３から、音声信号および当該音声信号に対応する音声の発生位置を示す情報を受信した場合に、許可コマンド格納部２５を参照して、当該発生位置が含まれる座席領域を特定する。そして、コマンド処理部２４は、マイクロフォン１１から受信した測定信号に基づいて車両が走行中か否かを判定する。 When the command processing unit 24 receives a voice signal and information indicating the voice generation position corresponding to the voice signal from the voice recognition unit 23, the command processing unit 24 refers to the permission command storage unit 25 and includes the generation position. Identify the seating area. Then, the command processing unit 24 determines whether or not the vehicle is traveling based on the measurement signal received from the microphone 11.

そして、コマンド処理部２４は、音声認識部２３から受信した操作コマンドが、許可コマンド格納部２５から抽出した複数の操作コマンドのいずれかに該当する場合に、当該操作コマンドに対応する処理を実行する。そして、コマンド処理部２４は、許可コマンド格納部２５を参照して、当該操作コマンドの元となった音声信号に対応する音声の発生位置が含まれる座席領域２５０に対応付けられている表示装置ＩＤを特定する。そして、コマンド処理部２４は、実行結果を、特定した表示装置ＩＤに対応する表示装置１３に反映させる。 Then, when the operation command received from the voice recognition unit 23 corresponds to one of a plurality of operation commands extracted from the permission command storage unit 25, the command processing unit 24 executes processing corresponding to the operation command. . Then, the command processing unit 24 refers to the permission command storage unit 25 and displays the display device ID associated with the seat region 250 including the sound generation position corresponding to the sound signal that is the source of the operation command. Is identified. Then, the command processing unit 24 reflects the execution result on the display device 13 corresponding to the specified display device ID.

以上、本発明の第二の実施形態について説明した。 The second embodiment of the present invention has been described above.

本実施形態の車載システム１０においても、音声により入力された操作コマンドに応じて処理を行う車載装置２０において、乗員が座っている車両内の位置に応じて、当該位置に座っている乗員に対して許可する操作コマンドを設定することができる。さらに、操作コマンドによる処理を、当該操作コマンドの元となる音声が発せされた位置に座っている者が見る表示装置１３に反映させることができる。 Also in the in-vehicle system 10 of the present embodiment, in the in-vehicle device 20 that performs processing in accordance with an operation command input by voice, depending on the position in the vehicle where the occupant is sitting, the occupant sitting at the position Operation commands allowed. Furthermore, the processing based on the operation command can be reflected on the display device 13 viewed by the person sitting at the position where the sound that is the source of the operation command is emitted.

なお、上記第一または第二の実施形態における車載装置２０は、例えば図１１に示すような構成のコンピュータ６０によって実現される。図１１は、車載装置２０の機能を実現するコンピュータ６０のハードウェア構成の一例を示すハードウェア構成図である。コンピュータ６０は、ＣＰＵ（Central Processing Unit）６１、ＲＡＭ（Random Access Memory）６２、ＲＯＭ（Read Only Memory）６３、ＨＤＤ（Hard Disk Drive）６４、入力インターフェイス（Ｉ／Ｆ）６５、出力インターフェイス（Ｉ／Ｆ）６６、およびメディアインターフェイス（Ｉ／Ｆ）６７を備える。 The in-vehicle device 20 in the first or second embodiment is realized by a computer 60 configured as shown in FIG. 11, for example. FIG. 11 is a hardware configuration diagram illustrating an example of a hardware configuration of the computer 60 that realizes the functions of the in-vehicle device 20. The computer 60 includes a CPU (Central Processing Unit) 61, a RAM (Random Access Memory) 62, a ROM (Read Only Memory) 63, an HDD (Hard Disk Drive) 64, an input interface (I / F) 65, an output interface (I / F). F) 66 and a media interface (I / F) 67.

ＣＰＵ６１は、ＲＯＭ６３またはＨＤＤ６４に格納されたプログラムに基づいて動作し、各部の制御を行う。ＲＯＭ６３は、コンピュータ６０の起動時にＣＰＵ６１が実行するブートプログラムや、コンピュータ６０のハードウェアに依存するプログラム等を格納する。ＨＤＤ６４は、ＣＰＵ６１によって実行されるプログラムを格納する。 The CPU 61 operates based on a program stored in the ROM 63 or the HDD 64 and controls each unit. The ROM 63 stores a boot program executed by the CPU 61 when the computer 60 is started up, a program depending on the hardware of the computer 60, and the like. The HDD 64 stores a program executed by the CPU 61.

入力インターフェイス６５は、マイクロフォン１１、音声認識開始ボタン１２、またはセンサ１４からの信号を受信してＣＰＵ６１へ送る。ＣＰＵ６１は、入力インターフェイス６５を介して、マイクロフォン１１、音声認識開始ボタン１２、およびセンサ１４を制御し、入力インターフェイス６５を介して、マイクロフォン１１、音声認識開始ボタン１２、またはセンサ１４から信号を取得する。出力インターフェイス６６は、ＣＰＵ６１から取得したデータを、表示装置１３へ送る。ＣＰＵ６１は、出力インターフェイス６６を介して、表示装置１３を制御し、生成したデータを、出力インターフェイス６６を介して表示装置１３へ出力する。 The input interface 65 receives a signal from the microphone 11, the voice recognition start button 12, or the sensor 14 and sends it to the CPU 61. The CPU 61 controls the microphone 11, the voice recognition start button 12, and the sensor 14 through the input interface 65, and acquires a signal from the microphone 11, the voice recognition start button 12, or the sensor 14 through the input interface 65. . The output interface 66 sends the data acquired from the CPU 61 to the display device 13. The CPU 61 controls the display device 13 via the output interface 66 and outputs the generated data to the display device 13 via the output interface 66.

メディアインターフェイス６７は、記録媒体６８に格納されたプログラムまたはデータを読み取り、ＲＡＭ６２に提供する。ＲＡＭ６２を介してＣＰＵ６１に提供されるプログラムは、記録媒体６８に格納されている。当該プログラムは、記録媒体６８から読み出されて、ＲＡＭ６２を介してコンピュータ６０にインストールされ、ＣＰＵ６１によって実行される。記録媒体６８は、例えばＤＶＤ（Digital Versatile Disk）、ＰＤ（Phase change rewritable Disk）等の光学記録媒体、ＭＯ（Magneto-Optical disk）等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。 The media interface 67 reads a program or data stored in the recording medium 68 and provides it to the RAM 62. A program provided to the CPU 61 via the RAM 62 is stored in the recording medium 68. The program is read from the recording medium 68, installed in the computer 60 via the RAM 62, and executed by the CPU 61. The recording medium 68 is, for example, an optical recording medium such as a DVD (Digital Versatile Disk) or PD (Phase change rewritable disk), a magneto-optical recording medium such as an MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. Etc.

コンピュータ６０にインストールされて実行されるプログラムは、コンピュータ６０を、音源位置特定部２１、開始指示受付部２２、音声認識部２３、コマンド処理部２４、および許可コマンド格納部２５として機能させる。コンピュータ６０は、これらのプログラムを、記録媒体６８から読み取って実行するが、他の例として、コンピュータ６０に通信機能を設け、通信回線を介してこれらのプログラムを取得するようにしてもよい。 A program installed and executed on the computer 60 causes the computer 60 to function as the sound source position specifying unit 21, the start instruction receiving unit 22, the voice recognition unit 23, the command processing unit 24, and the permission command storage unit 25. The computer 60 reads these programs from the recording medium 68 and executes them. However, as another example, the computer 60 may be provided with a communication function to acquire these programs via a communication line.

また、本発明は、上記した各実施形態に限定されるものではなく、その要旨の範囲内で数々の変形が可能である。 The present invention is not limited to the above-described embodiments, and various modifications can be made within the scope of the gist.

例えば、上記した第二の実施形態において、車載システム１０は、複数の座席のそれぞれに対応する複数の表示装置１３を備えるが、他の形態として、車載システム１０は、複数の座席のそれぞれの対応する画面を表示する１台の表示装置１３を備えていてもよい。この場合、コマンド処理部２４は、操作コマンドに対応する処理の実行結果を、当該操作コマンドに対応する音声信号の発生位置が含まれる座席領域２５０用の画面に反映させる。 For example, in the second embodiment described above, the in-vehicle system 10 includes a plurality of display devices 13 corresponding to the plurality of seats, but as another form, the in-vehicle system 10 corresponds to each of the plurality of seats. One display device 13 that displays a screen to be displayed may be provided. In this case, the command processing unit 24 reflects the execution result of the process corresponding to the operation command on the screen for the seat area 250 including the generation position of the audio signal corresponding to the operation command.

複数の座席のそれぞれの対応する画面を表示する表示装置１３としては、例えば、パネルの前面にスリットを設け、運転席側用の画像と助手席側用の画像とを水平方向に交互に並べ、上記スリットでバックライトの光を左右に分離することにより運転席側と助手席側とで異なる画像を表示することができる液晶ディスプレイであるデュアルディスプレイが好ましい。 As the display device 13 for displaying the corresponding screen of each of the plurality of seats, for example, a slit is provided on the front surface of the panel, and images for the driver's seat and passenger's side are alternately arranged in the horizontal direction, A dual display which is a liquid crystal display capable of displaying different images on the driver's seat side and the passenger's seat side by separating the light of the backlight left and right with the slit is preferable.

本発明の第一実施形態に係る車載システム１０の構成を示すシステム構成図である。It is a system configuration figure showing the composition of in-vehicle system 10 concerning a first embodiment of the present invention. 音源位置特定部２１から出力されるデータ４０の構造の一例を示す図である。It is a figure which shows an example of the structure of the data 40 output from the sound source position specific | specification part. 第一の実施形態において許可コマンド格納部２５に格納されるデータの構造の一例を示す図である。It is a figure which shows an example of the structure of the data stored in the permission command storage part 25 in 1st embodiment. 座席領域を説明するための概念図である。It is a conceptual diagram for demonstrating a seat area | region. 表示装置１３に表示される画面の一例を示す概念図である。4 is a conceptual diagram illustrating an example of a screen displayed on the display device 13. FIG. 表示装置１３に表示される画面の他の例を示す概念図である。12 is a conceptual diagram illustrating another example of a screen displayed on the display device 13. FIG. 車載装置２０の動作の一例を示すフローチャートである。4 is a flowchart showing an example of the operation of the in-vehicle device 20. 本発明の第二実施形態に係る車載システム１０の構成を示すシステム構成図である。It is a system configuration figure showing the composition of in-vehicle system 10 concerning a second embodiment of the present invention. 第二実施形態における表示装置１３の配置を説明するための概念図である。It is a conceptual diagram for demonstrating arrangement | positioning of the display apparatus 13 in 2nd embodiment. 第二の実施形態において許可コマンド格納部２５に格納されるデータの構造の一例を示す図である。It is a figure which shows an example of the structure of the data stored in the permission command storage part 25 in 2nd embodiment. 車載装置２０の機能を実現するコンピュータ５０の構成の一例を示すハードウェア構成図である。It is a hardware block diagram which shows an example of a structure of the computer 50 which implement | achieves the function of the vehicle equipment.

Explanation of symbols

１０・・・車載システム、１１・・・マイクロフォン、１２・・・音声認識開始ボタン、１３・・・表示装置、１４・・・センサ、１５・・・ハンドル、２０・・・車載装置、２１・・・音源位置特定部、２２・・・開始指示受付部、２３・・・音声認識部、２４・・・コマンド処理部、２５・・・許可コマンド格納部、３０・・・領域、３１・・・領域、３２・・・領域、４０・・・データ、４１・・・音源位置、４２・・・音声信号、５０・・・アイコン、５１・・・領域、５２・・・領域、５３・・・領域、５５・・・領域、５６・・・領域、５７・・・領域、６０・・・コンピュータ、６１・・・ＣＰＵ、６２・・・ＲＡＭ、６３・・・ＲＯＭ、６４・・・ＨＤＤ、６５・・・入力インターフェイス、６６・・・出力インターフェイス、６７・・・メディアインターフェイス、６８・・・記録媒体 DESCRIPTION OF SYMBOLS 10 ... In-vehicle system, 11 ... Microphone, 12 ... Voice recognition start button, 13 ... Display device, 14 ... Sensor, 15 ... Handle, 20 ... In-vehicle device, 21. .. Sound source position specifying unit, 22... Start instruction receiving unit, 23... Voice recognition unit, 24... Command processing unit, 25.・ Area 32 ... Area 40 ... Data 41 ... Sound source position 42 ... Audio signal 50 ... Icon 51 ... Area 52 ... Area 53 ... -Area, 55 ... Area, 56 ... Area, 57 ... Area, 60 ... Computer, 61 ... CPU, 62 ... RAM, 63 ... ROM, 64 ... HDD 65 ... Input interface 66 ... Output interface , 67 ... media interface, 68 ... recording medium

Claims

An in-vehicle device that is mounted on a vehicle and executes processing according to an operation command input by voice,
A permission command storage means for storing an operation command to be permitted for a person sitting on the seat in association with information indicating a seat area in the vehicle;
Voice recognition start accepting means for accepting an operation command input start instruction from the user;
After the voice recognition start accepting means accepts an instruction to start inputting an operation command from the user, the voice generation position corresponding to the voice signal is identified from the voice signals collected through each of the plurality of microphones, and identified. Sound source position specifying means for outputting the generated position together with an audio signal corresponding to the sound generated from the generated position;
Voice recognition means for recognizing an operation command from the voice signal output from the sound source position specifying means;
The operation command permitted at the sound generation position corresponding to the sound signal output from the sound source position specifying means is extracted from the permission command storage means, and the operation command recognized by the sound recognition means is extracted. An in-vehicle device comprising: a command processing unit that executes a process corresponding to the operation command recognized by the voice recognition unit when the operation command is the same as one of the operation commands.

The in-vehicle device according to claim 1,
In the permission command storage means,
Information indicating the area of the driver's seat is included as information indicating the area of the seat in the vehicle. The information indicating the area of the driver's seat includes an operation command that is permitted while the vehicle is traveling, and the vehicle is stopped. Are associated with permitted operation commands,
The command processing means includes
If the sound generation position corresponding to the sound signal output from the sound source position specifying means is included in the area of the driver's seat, if the vehicle is running, the operation commands permitted while the vehicle is running are An on-vehicle apparatus characterized in that if the vehicle is stopped and extracted from the permission command storage means, an operation command permitted while the vehicle is stopped is extracted from the permission command storage means.

The in-vehicle device according to claim 1 or 2,
The command processing means includes
An in-vehicle device characterized in that information indicating a sound generation position corresponding to an operation command to be processed is displayed on a display device.

The in-vehicle device according to any one of claims 1 to 3,
The sound source position specifying means includes
After the voice recognition start receiving means receives an instruction to start inputting an operation command from the user, the voice corresponding to each voice signal is obtained from the mixed signal including a plurality of voice signals collected through each of the plurality of microphones. For each occurrence position, the sound signal of the sound generated from the position is separated, and the separated sound signal is output together with information indicating the sound generation position corresponding to the sound signal,
The voice recognition means
When a plurality of audio signals are output from the sound source position specifying means, an operation command is recognized for each audio signal, and the recognized operation command is combined with information indicating the generation position of the sound that is the basis of the operation command. Output,
The command processing means includes
An in-vehicle apparatus that preferentially processes an operation command by voice generated from a driver's seat area when a plurality of operation commands are recognized by the voice recognition means.

The in-vehicle device according to any one of claims 1 to 3,
The sound source position specifying means includes
After the voice recognition start receiving means receives an instruction to start inputting an operation command from the user, the voice corresponding to each voice signal is obtained from the mixed signal including a plurality of voice signals collected through each of the plurality of microphones. For each occurrence position, the sound signal of the sound generated from the position is separated, and the separated sound signal is output together with information indicating the sound generation position corresponding to the sound signal,
The voice recognition means
When a plurality of audio signals are output from the sound source position specifying means, an operation command is recognized for each audio signal, and the recognized operation command is combined with information indicating the generation position of the sound that is the basis of the operation command. Output,
The command processing means includes
When a plurality of operation commands are recognized by the voice recognition unit, for each operation command, an operation command that is permitted at a voice generation position that is a source of the operation command is extracted from the permission command storage unit, If the operation command by voice generated from the generation position and recognized by the voice recognition means is the same as any one of the extracted operation commands, the processing result corresponding to the operation command is An on-vehicle device characterized by displaying on a display device corresponding to an area of a seat including a sound generation position corresponding to the operation command among display devices provided for each.

The in-vehicle device according to any one of claims 1 to 3,
The sound source position specifying means includes
After the voice recognition start receiving means receives an instruction to start inputting an operation command from the user, the voice corresponding to each voice signal is obtained from the mixed signal including a plurality of voice signals collected through each of the plurality of microphones. For each occurrence position, the sound signal of the sound generated from the position is separated, and the separated sound signal is output together with information indicating the position where the sound signal is generated,
The voice recognition means
When a plurality of audio signals are output from the sound source position specifying means, an operation command is recognized for each audio signal, and the recognized operation command is combined with information indicating the generation position of the sound that is the basis of the operation command. Output,
The command processing means includes
When a plurality of operation commands are recognized by the voice recognition unit, for each operation command, an operation command that is permitted at a voice generation position that is a source of the operation command is extracted from the permission command storage unit, If the operation command by voice generated from the generation position and recognized by the voice recognition means is the same as any one of the extracted operation commands, the processing result corresponding to the operation command is An in-vehicle device, wherein a display device capable of different display is reflected in a display corresponding to a seat area including a sound generation position corresponding to the operation command.

A voice recognition method in an in-vehicle device that is mounted on a vehicle and executes processing according to an operation command input by voice,
The in-vehicle device is
A permission command storage step of storing, in the permission command storage means, an operation command to be permitted to a person sitting on the seat in association with information indicating a seat area in the vehicle;
A voice recognition start acceptance step for accepting an operation command input start instruction from a user;
After receiving an instruction to start inputting an operation command from the user in the voice recognition start receiving step, the voice generation position corresponding to the voice signal is specified from the voice signals collected through each of the plurality of microphones, and specified. A sound source position specifying step for outputting the generated position together with an audio signal corresponding to the sound generated from the generated position;
A voice recognition step of recognizing an operation command from the voice signal output in the sound source position specifying step;
The operation command permitted at the sound generation position corresponding to the sound signal output in the sound source position specifying step is extracted from the permission command storage means, and the operation command recognized in the sound recognition step is the extracted operation command. And a command processing step of executing a process corresponding to the operation command recognized in the voice recognition step.