JP2016099479A

JP2016099479A - Voice control system, voice control method, and voice control program

Info

Publication number: JP2016099479A
Application number: JP2014235916A
Authority: JP
Inventors: 宏也八代; Hiroya Yashiro
Original assignee: Aisin AW Co Ltd
Current assignee: Aisin AW Co Ltd
Priority date: 2014-11-20
Filing date: 2014-11-20
Publication date: 2016-05-30
Anticipated expiration: 2034-11-20
Also published as: JP6536018B2

Abstract

PROBLEM TO BE SOLVED: To provide a voice control system, a voice control method, and a voice control program capable of easily performing the desired control of a user.SOLUTION: A voice control system includes: a microphone 3 for receiving the input of a voice; and a control part 8 for continuing the same control or switching the type of control in accordance with the length of the ending of the voice received by the microphone 3. Especially, the control part 8 includes a steadiness determination control part 8a for determining whether or not the voice is steadily input by the microphone 3, and the control part 8 continues the same control since it is determined that the voice is steadily input by the steadiness determination control part 8a until it is determined that the voice is not steadily input by the steadiness determination control part 8a.SELECTED DRAWING: Figure 1

Description

本発明は、音声制御システム、音声制御方法、及び音声制御プログラムに関する。 The present invention relates to a voice control system, a voice control method, and a voice control program.

従来、車載用のナビゲーション装置において、地図等の表示情報におけるスクロールの速度調整等の指示を受け付けるシステムが提案されている。このような従来のシステムでは、スクロールボタンをユーザが指で継続的に押圧している時間や、タッチパネルをユーザがフリックした長さ等に応じて、スクロールの速度調整を行っていた。しかし、この従来のシステムにおいては、指での操作が煩わしく、また、両手が塞がっている場合には操作が出来なかった。そこで、スクロールの速度調整等をより容易に行うことを可能とするシステムとして、ユーザの音声を認識してスクロール操作を行うシステムが提案されていた（例えば、特許文献１参照）。具体的には、このシステムでは、速度が複数の段階（例えば、「スピード極高」、「スピード高」、「スピード中」、「スピード低」、「スピード極低」）に区分されており、ユーザが、十字キーやタッチパネル等を操作して地図をスクロールしている間に、「速く」という音声を発した場合には、システムがこの音声を認識して解析し、当該解析した音声に応じて、スクロールの速度を一段階（例えば、「スピード中」から「スピード高」へ）速くすることを可能としていた。 Conventionally, in a vehicle-mounted navigation device, a system that accepts an instruction such as scroll speed adjustment in display information such as a map has been proposed. In such a conventional system, the scroll speed is adjusted according to the time during which the user continuously presses the scroll button with a finger, the length of the user flicking the touch panel, and the like. However, in this conventional system, the operation with a finger is troublesome, and the operation cannot be performed when both hands are closed. In view of this, a system that recognizes a user's voice and performs a scroll operation has been proposed as a system that enables easier scrolling speed adjustment and the like (see, for example, Patent Document 1). Specifically, in this system, the speed is divided into a plurality of stages (for example, “high speed high”, “high speed”, “medium speed”, “low speed”, “low speed”), When the user utters “fast” while scrolling the map by operating the cross key or touch panel, the system recognizes and analyzes this sound and responds to the analyzed sound. Thus, the scrolling speed can be increased by one step (for example, from “medium speed” to “high speed”).

特開２００５−２２１２４４号公報JP 2005-221244 A

しかし、特許文献１のような音声による入力システムでは、指での操作による煩わしさが解消されるという利点や、両手が塞がっている場合であっても操作が可能になるという利点があるが、操作対象のパラメータの種類によっては、ユーザが所望の制御を容易に行えない可能性があった。すなわち、特許文献１に係る技術では、速度等のパラメータを複数の段階に区分する必要があるが、スクロールのスピードやスクロールの移動量のような連続性のあるパラメータを複数の段階に区分してしまうと、パラメータの連続性が損なわれてしまう結果、ユーザが所望の制御を容易に行えない可能性があった。例えば、特許文献１に係る技術において、速度を比較的多くの段階に区分すると、ユーザの所望の速さとするまでに何度も音声入力しなければならないために容易な入力が行えず、一方、速度を比較的少ない段階に区分すると、ユーザの所望の速さに設定する事が出来ない可能性があった。そこで、音声入力の利点を活かしつつ、ユーザの所望の制御を容易に行う事が可能なシステムが要望されていた。 However, the voice input system as in Patent Document 1 has the advantage that the troublesomeness caused by the operation with the finger is eliminated, and the operation is possible even when both hands are closed, Depending on the type of parameter to be operated, the user may not be able to easily perform desired control. That is, in the technique according to Patent Document 1, it is necessary to divide parameters such as speed into a plurality of stages. However, continuous parameters such as scroll speed and scroll movement amount are classified into a plurality of stages. As a result, the continuity of the parameters is impaired, and there is a possibility that the user cannot easily perform the desired control. For example, in the technology according to Patent Document 1, if the speed is divided into a relatively large number of stages, it is not possible to perform an easy input because it is necessary to input a voice many times until the user achieves a desired speed. If the speed is divided into relatively small stages, there is a possibility that the speed desired by the user cannot be set. Therefore, there has been a demand for a system that can easily perform user-desired control while taking advantage of voice input.

本発明は、上記に鑑みてなされたものであって、音声入力によって、ユーザの所望の制御を容易に行う事が可能な音声制御システム、音声制御方法、及び音声制御プログラムを提供することを目的とする。 The present invention has been made in view of the above, and an object of the present invention is to provide a voice control system, a voice control method, and a voice control program capable of easily performing a user's desired control by voice input. And

上述した課題を解決し、目的を達成するために、本発明に係る音声制御システムは、音声の入力を受け付ける音声入力受付手段と、前記音声入力受付手段にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御手段と、を備える。 In order to solve the above-described problems and achieve the object, a voice control system according to the present invention includes a voice input receiving unit that receives a voice input, and a length of a voice ending that is received by the voice input receiving unit. And control means for continuing the same control or switching the type of control.

また、本発明に係る音声制御方法は、音声の入力を受け付ける音声入力受付工程と、前記音声入力受付工程にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御工程と、を含む。 In addition, the voice control method according to the present invention continues the same control according to the voice input reception step of receiving voice input and the length of the ending of the voice received in the voice input reception step, or A control step of switching the type of control.

また、本発明に係る音声制御プログラムは、音声の入力を受け付ける音声入力受付工程と、前記音声入力受付工程にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御工程と、をコンピュータに実行させる。 In addition, the voice control program according to the present invention continues the same control according to the voice input acceptance process for accepting voice input and the length of the ending of the voice accepted in the voice input acceptance process, or And causing the computer to execute a control step of switching the type of control.

本発明に係る音声制御システム、音声制御方法、及び音声制御プログラムによれば、音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換えるので、音声入力によって、ユーザの所望の制御を容易に行う事が可能となる。特に、スクロールのスピードやスクロールの移動量のような連続性のあるパラメータを制御する場合においても、パラメータを複数の段階に区分する必要がないので、パラメータの連続性を損ねることがなく、ユーザの所望の制御を容易に行う事が可能となる。 According to the voice control system, the voice control method, and the voice control program according to the present invention, the same control is continued or the control type is switched according to the length of the voice ending. It is possible to easily perform the desired control. In particular, even in the case of controlling continuity parameters such as scroll speed and scroll movement amount, it is not necessary to divide the parameters into a plurality of stages. Desired control can be easily performed.

本発明の実施の形態に係る音声制御システムを例示するブロック図である。It is a block diagram which illustrates the voice control system concerning an embodiment of the invention. 制御音声テーブルに格納されている制御音声情報を示す表である。It is a table | surface which shows the control audio | voice information stored in the control audio | voice table. 音声制御処理のフローチャートである。It is a flowchart of an audio | voice control process.

以下、本発明に係る音声制御システム、音声制御方法、及び音声制御プログラムの実施の形態について図面を参照しつつ詳細に説明する。ただし、実施の形態によって本発明が限定されるものではない。 Hereinafter, embodiments of a voice control system, a voice control method, and a voice control program according to the present invention will be described in detail with reference to the drawings. However, the present invention is not limited to the embodiments.

〔実施の形態の基本的概念〕
まず、実施の形態の基本的概念を説明する。この実施の形態は、概略的に、ユーザの音声入力を受け付けて制御を行う音声制御システムに関する。なお、当該音声制御システムは、ユーザの音声入力に基づいて制御可能な様々な機器に適用できるが、本実施の形態では、車両に搭載された車載用ナビゲーション装置（以下、車載装置）に適用されるものとして説明する。ただし、例えば、スマートフォン、携帯用ナビゲーション装置、空調機器、映像機器、又は音響機器のような車載装置とは一切異なる分野の機器に対しても同様の音声制御システムを好適に適用する事ができる。 [Basic concept of the embodiment]
First, the basic concept of the embodiment will be described. This embodiment generally relates to a voice control system that receives and controls a user's voice input. The voice control system can be applied to various devices that can be controlled based on a user's voice input. However, in the present embodiment, the voice control system is applied to a vehicle-mounted navigation device (hereinafter referred to as a vehicle-mounted device). It will be described as a thing. However, for example, the same voice control system can be suitably applied to devices in a field completely different from on-vehicle devices such as smartphones, portable navigation devices, air conditioning devices, video devices, and audio devices.

ここで、本実施の形態では、ユーザが制御内容の音声入力を行う際において、語尾を伸ばして発声を行う場合があるが、この場合には伸ばされた語尾の部分を「〜」と表記して説明する。ここで、「語尾」とは、ユーザが発声した最後の語（音節）であるが、最後の語が複数の音（例えば、子音や母音）によって構成される場合には、これら複数の音のうちの最後の音を意味する。例えば、日本語の単語の場合には、１つの語の最後の音は母音であるため、この母音が「語尾」になる。具体的には、ユーザが日本語の「右」の語尾を伸ばして発声を行った場合、「右」に対応する音は「みぎ」であり、ユーザが発声した最後の語（音節）は「ぎ」である。そして、この「ぎ」の最後の音は母音「い」であるため、語尾は「い」になる。この場合には「右〜」と表記する。 Here, in the present embodiment, when the user performs voice input of the control content, there is a case where the utterance is extended and the utterance is uttered. In this case, the extended ending portion is expressed as “˜”. I will explain. Here, the “end of word” is the last word (syllable) uttered by the user, but when the last word is composed of a plurality of sounds (for example, consonants and vowels), It means the last sound. For example, in the case of a Japanese word, since the last sound of one word is a vowel, this vowel becomes the “end of word”. Specifically, when the user utters a Japanese “right” ending, the sound corresponding to “right” is “Migi”, and the last word (syllable) uttered by the user is “ It is "gi". And since the last sound of this “gi” is the vowel “I”, the ending is “I”. In this case, it is written as “right”.

〔実施の形態の具体的内容〕
次に、実施の形態の具体的内容について説明する。 [Specific contents of the embodiment]
Next, specific contents of the embodiment will be described.

（構成）
本実施の形態では、車載装置に音声制御プログラムをインストールすることにより、車載装置が音声制御システムとして機能する場合について説明する。なお、上述したように、この他にも、例えば、スマートフォン、携帯用ナビゲーション装置、空調機器、映像機器、又は音響機器を含む任意の装置に音声制御プログラムをインストールすることによって音声制御システムを構成してもよい。また、音声制御システムにおける車載装置としての機能については、公知の車載装置と同様の構成により得ることができるので、その説明は省略することとし、以下では、特に音声制御を達成するための構成について説明する。なお、以下では、この音声制御システムを搭載した特定の車両（車載装置を操作するユーザが搭乗する車両）を単に「車両」と称して説明する。なお、「車両」には、自動四輪車、自動二輪車、及び自転車が含まれるが、以下では、車両が自動四輪車である場合について説明する。 (Constitution)
In the present embodiment, a case where the in-vehicle device functions as a voice control system by installing a voice control program in the in-vehicle device will be described. In addition, as described above, for example, a voice control system is configured by installing a voice control program in any device including, for example, a smartphone, a portable navigation device, an air conditioner, a video device, or an audio device. May be. Moreover, since the function as the vehicle-mounted device in the voice control system can be obtained by the same configuration as that of a known vehicle-mounted device, the description thereof will be omitted, and in the following, particularly the configuration for achieving the voice control explain. In the following description, a specific vehicle (a vehicle on which a user operating the in-vehicle device is mounted) equipped with this voice control system will be simply referred to as “vehicle”. “Vehicle” includes an automobile, a motorcycle, and a bicycle. Hereinafter, a case where the vehicle is an automobile will be described.

（構成）
最初に、車載装置１の構成を説明する。図１は、本実施の形態に係る音声制御システムを例示するブロック図である。図１に示すように、車載装置１は、概略的に、スピーカ２、マイク３、タッチパネル４、ディスプレイ５、現在位置取得部６、通信部７、制御部８、及びデータ記録部９を備えている。 (Constitution)
First, the configuration of the in-vehicle device 1 will be described. FIG. 1 is a block diagram illustrating a voice control system according to this embodiment. As shown in FIG. 1, the in-vehicle device 1 generally includes a speaker 2, a microphone 3, a touch panel 4, a display 5, a current position acquisition unit 6, a communication unit 7, a control unit 8, and a data recording unit 9. Yes.

（構成−スピーカ）
スピーカ２は、制御部８の制御に基づいて情報を音声にて出力する音声出力手段である。このスピーカ２から出力される音声の具体的な態様は任意であり、必要に応じて生成された合成音声や、予め録音された音声を出力することができる。 (Configuration-Speaker)
The speaker 2 is sound output means for outputting information by sound based on the control of the control unit 8. The specific form of the sound output from the speaker 2 is arbitrary, and it is possible to output a synthesized sound generated as necessary and a sound recorded in advance.

（構成−マイク）
マイク３は、各種の入力を受け付ける複数の入力手段のうちの１つであって、音声の入力を受け付ける音声入力受付手段である。このマイク３としては、公知のマイクロフォンを用いることができる。 (Configuration-microphone)
The microphone 3 is one of a plurality of input means for receiving various inputs, and is a voice input receiving means for receiving voice input. As the microphone 3, a known microphone can be used.

（構成−タッチパネル）
タッチパネル４は、ユーザの指等で押圧されることにより、当該ユーザから各種手動入力を受け付けるものである。このタッチパネル４は、透明又は半透明状に形成され、ディスプレイ５の前面において当該ディスプレイ５の表示面と重畳するように設けられている。このタッチパネル４としては、例えば、抵抗膜方式や静電容量方式等による操作位置検出手段を備えた公知のタッチパネルを使用することができる。 (Configuration-touch panel)
The touch panel 4 receives various manual inputs from the user when pressed by a user's finger or the like. The touch panel 4 is formed to be transparent or translucent, and is provided on the front surface of the display 5 so as to overlap the display surface of the display 5. As this touch panel 4, for example, a publicly known touch panel provided with operation position detecting means by a resistance film method or a capacitance method can be used.

（構成−ディスプレイ）
ディスプレイ５は、音声制御システムによって案内された画像を表示する表示手段であり、特に、後述する地図データベース（以下、データベースを「ＤＢ」と称する）９ａに格納された地図情報に基づいて地図を表示する表示手段である。このディスプレイ５の具体的な構成は任意であり、公知の液晶ディスプレイや有機ＥＬディスプレイの如きフラットパネルディスプレイを使用することができる。 (Configuration-Display)
The display 5 is a display means for displaying an image guided by the voice control system, and in particular, displays a map based on map information stored in a map database (hereinafter referred to as “DB”) 9a described later. Display means. The specific configuration of the display 5 is arbitrary, and a flat panel display such as a known liquid crystal display or organic EL display can be used.

（構成−現在位置取得部）
現在位置取得部６は、車両の現在位置を取得する現在位置取得手段である。例えば、現在位置取得部６は、ＧＰＳ、地磁気センサ、距離センサ、又はジャイロセンサ（いずれも図示省略）の少なくとも一つにより検出した現在の車載装置１の位置（座標）及び方位等を、公知の方法にて取得する。 (Configuration-Current position acquisition unit)
The current position acquisition unit 6 is current position acquisition means for acquiring the current position of the vehicle. For example, the current position acquisition unit 6 knows the current position (coordinates) and direction of the in-vehicle device 1 detected by at least one of a GPS, a geomagnetic sensor, a distance sensor, or a gyro sensor (all not shown). Get by the method.

（構成−通信部）
通信部７は、センター装置（図示省略）との間でネットワークを介した通信を行う通信手段である。この通信手段の具体的な種類や構成は任意であるが、例えば、公知の移動体無線通信手段や、ＦＭ多重放送やビーコンを介した公知のＶＩＣＳ（登録商標）システム用の無線通信手段を用いることができる。 (Configuration-Communication Department)
The communication unit 7 is a communication unit that performs communication with a center device (not shown) via a network. The specific type and configuration of the communication means are arbitrary. For example, a known mobile wireless communication means or a known VICS (registered trademark) system wireless communication means via FM multiplex broadcasting or beacon is used. be able to.

（構成−制御部）
制御部８は、車載装置１を制御する制御手段であり、特に、マイク３にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御手段である。具体的には、ＣＰＵ、当該ＣＰＵ上で解釈実行される各種のプログラム（ＯＳなどの基本制御プログラムや、ＯＳ上で起動され特定機能を実現するアプリケーションプログラムを含む）、及びプログラムや各種のデータを格納するためのＲＡＭの如き内部メモリを備えて構成されるコンピュータである。特に、本実施の形態に係る音声制御プログラムは、任意の記録媒体又はネットワークを介して車載装置１にインストールされることで、制御部８の各部を実質的に構成する。 (Configuration-control unit)
The control unit 8 is a control unit that controls the in-vehicle device 1, and in particular, a control unit that continues the same control or switches the type of control according to the length of the ending of the voice received by the microphone 3. It is. Specifically, the CPU, various programs that are interpreted and executed on the CPU (including basic control programs such as an OS and application programs that are activated on the OS to realize specific functions), programs, and various data It is a computer configured with an internal memory such as a RAM for storing. In particular, the voice control program according to the present embodiment is substantially installed in the in-vehicle device 1 via an arbitrary recording medium or network, thereby substantially configuring each unit of the control unit 8.

この制御部８は、機能概念的に、定常判断制御部８ａ、及び音声解析部８ｂを備えて構成されている。定常判断制御部８ａは、マイク３にて音声が定常的に入力されているか否かを判断する定常判断制御手段である。音声解析部８ｂは、マイク３にて受け付けられた音声を解析する音声解析手段である。なお、これら制御部８の各部により行われる具体的な処理については後述する。 The control unit 8 includes a steady state determination control unit 8a and a voice analysis unit 8b in terms of functional concept. The steady state determination control unit 8a is a steady state determination control unit that determines whether or not sound is constantly input from the microphone 3. The voice analysis unit 8 b is a voice analysis unit that analyzes the voice received by the microphone 3. Specific processing performed by each unit of the control unit 8 will be described later.

（構成−データ記録部）
データ記録部９は、車載装置１の動作に必要なプログラム及び各種のデータを記録する記録手段であり、例えば、外部記録装置としてのハードディスク（図示省略）を用いて構成されている。ただし、ハードディスクに代えてあるいはハードディスクと共に、磁気ディスクの如き磁気的記録媒体、又はＤＶＤやブルーレイディスクの如き光学的記録媒体を含む、その他の任意の記録媒体を用いることができる。このデータ記録部９は、地図ＤＢ９ａ、発話例ＤＢ９ｂ、及び、制御音声テーブル９ｃを備えている。 (Configuration-Data recording part)
The data recording unit 9 is a recording unit that records a program and various data necessary for the operation of the in-vehicle device 1, and is configured using, for example, a hard disk (not shown) as an external recording device. However, any other recording medium including a magnetic recording medium such as a magnetic disk or an optical recording medium such as a DVD or a Blu-ray disk can be used instead of or together with the hard disk. The data recording unit 9 includes a map DB 9a, an utterance example DB 9b, and a control voice table 9c.

地図ＤＢ９ａは、地図情報を格納する地図情報格納手段である。ここで、「地図情報」とは、道路、道路構造物、施設等を含む各種の位置の特定に必要な情報であり、例えば、道路上に設定された各ノードに関するノードデータ（ノード番号、座標）や、道路上に設定された各リンクに関するリンクデータ（リンクＩＤ、リンク名、始点側接続ノード番号、終点側接続ノード番号、道路座標、道路種別（例えば、有料道路、一般道路等）、道路情報、地物データ（信号機、道路標識、ガードレール、施設等）、及び地形データ等を含んで構成されている。 The map DB 9a is map information storage means for storing map information. Here, “map information” is information necessary for specifying various positions including roads, road structures, facilities, and the like. For example, node data (node numbers, coordinates, etc.) relating to each node set on the road. ), Link data related to each link set on the road (link ID, link name, start side connection node number, end side connection node number, road coordinates, road type (for example, toll road, general road, etc.), road It includes information, feature data (signals, road signs, guardrails, facilities, etc.), and terrain data.

発話例ＤＢ９ｂは、ユーザの入力音声の内容を特定するための発話例情報を格納する発話例格納手段である。具体的に、この発話例ＤＢ９ｂは、制御音声のスペクトル情報と、各スペクトル情報を一意に特定するスペクトルＩＤとを相互に関連付けて格納している。ここで、「スペクトル情報」とは、音声解析部８ｂの音声解析に使用される情報であって、例えば、音声情報（ＷＡＶ情報）をフーリエ解析して導出された情報である。また、「制御音声」とは、車載装置１に対する詳細な制御の内容（例えば、「上」、「下」等。以下、「制御内容」）を含む、語尾を伸ばした音声であり、例えば、「上〜」、「下〜」、「右〜」、「左〜」、「拡大〜」、「縮小〜」、「左回り〜」、「右回り〜」、「左回転〜」、「右回転〜」等の言葉が該当する。すなわち、従来技術においては、通常ユーザは語尾を伸ばして発声することはないので、ユーザの入力音声を特定するために語尾を伸ばさない音声のスペクトル情報を格納しているが、本実施の形態においては、ユーザが語尾を伸ばして発声した際の入力音声の内容を特定するので、このように語尾を伸ばした音声のスペクトル情報を格納している。なお、入力音声の内容を特定する具体的な方法については後述する。また、「スペクトルＩＤ」は、各制御音声（「上〜」、「下〜」、「右〜」等）に対してそれぞれ割り当てられた識別情報であり、例えば、「０００１」、「０００２」、「０００３」等といった通し番号である。なお、これらの発話例情報を発話例ＤＢ９ｂに格納するタイミングは任意で、例えば、工場出荷時に予め格納しても良いし、プログラムのアップデート時等に送信センター（図示省略）から通信部７を介して情報を受信して格納しても良い。また、ユーザによる音声入力を参照して学習し、発話例ＤＢ９ｂに格納されたスペクトル情報をユーザに適した情報に修正したり、新たなスペクトル情報を追加したりしても良い。 The utterance example DB 9b is utterance example storage means for storing utterance example information for specifying the content of the user's input voice. Specifically, this utterance example DB 9b stores the spectrum information of the control voice and the spectrum ID that uniquely identifies each spectrum information in association with each other. Here, the “spectrum information” is information used for voice analysis of the voice analysis unit 8b, for example, information derived by Fourier analysis of voice information (WAV information). In addition, the “control voice” is a voice with an extended ending including the detailed control contents (for example, “up”, “lower”, etc., hereinafter “control contents”) for the in-vehicle device 1. “Up”, “Down”, “Right”, “Left”, “Enlarged”, “Reduced”, “Left”, “Right”, “Left”, “Right” The term “rotation” is applicable. That is, in the prior art, since the user usually does not utter the utterance, the spectrum information of the voice that does not extend the utterance is stored in order to specify the user's input voice. Specifies the content of the input voice when the user utters with the word ending extended, and thus stores the spectrum information of the voice with the word ending extended in this way. A specific method for specifying the content of the input voice will be described later. “Spectrum ID” is identification information assigned to each control voice (“Up”, “Down”, “Right”, etc.), for example, “0001”, “0002”, A serial number such as “0003”. The timing of storing these utterance example information in the utterance example DB 9b is arbitrary. For example, the utterance example information may be stored in advance at the time of factory shipment, or from the transmission center (not shown) via the communication unit 7 at the time of program update or the like. Information may be received and stored. Further, learning may be performed with reference to voice input by the user, and the spectrum information stored in the utterance example DB 9b may be corrected to information suitable for the user, or new spectrum information may be added.

制御音声テーブル９ｃは、制御音声情報を格納する制御音声格納手段である。図２は、制御音声テーブル９ｃに格納されている制御音声情報を示す表である。この図２に示すように、制御音声テーブル９ｃには、項目「コマンド名称」、項目「スペクトルＩＤ」、及び項目「制御音声」に対応する情報が相互に関連付けられて格納されている。 The control sound table 9c is control sound storage means for storing control sound information. FIG. 2 is a table showing the control sound information stored in the control sound table 9c. As illustrated in FIG. 2, information corresponding to the item “command name”, the item “spectrum ID”, and the item “control voice” is stored in the control voice table 9 c in association with each other.

ここで、項目「コマンド名称」に対応して格納される情報は、音声制御システムが実行する制御の種類（コマンド）を示す情報であって、図２に示すように、ディスプレイ５に表示された地図を特定の方向へスクロールさせるコマンドである「地図スクロール」、ディスプレイ５に表示された地図の縮尺を変化させるコマンドである「地図縮尺」、ディスプレイ５に表示された地図の方位を回転させるコマンドである「地図方位」、目的地候補等のリストをスクロールさせるコマンドである「リストスクロール」、及びスピーカ２のボリュームを変化させるコマンドである「ボリューム」が格納されている。 Here, the information stored corresponding to the item “command name” is information indicating the type (command) of control executed by the voice control system, and is displayed on the display 5 as shown in FIG. “Map Scroll” which is a command for scrolling the map in a specific direction, “Map Scale” which is a command for changing the scale of the map displayed on the display 5, and a command for rotating the orientation of the map displayed on the display 5 A “map direction”, a “list scroll” that is a command for scrolling a list of destination candidates, and a “volume” that is a command for changing the volume of the speaker 2 are stored.

また、項目「スペクトルＩＤ」に対応して格納される情報は、発話例ＤＢ９ｂに格納されたスペクトルＩＤのうち項目「コマンド名称」に対応する複数のスペクトルＩＤである。 The information stored corresponding to the item “spectrum ID” is a plurality of spectrum IDs corresponding to the item “command name” among the spectrum IDs stored in the utterance example DB 9b.

また、項目「制御音声」に対応して格納される情報は、各スペクトルＩＤにより一意に特定される制御音声である。なお、これらの制御音声情報を制御音声テーブル９ｃに格納するタイミングは任意で、例えば、工場出荷時に予め格納しても良いし、プログラムのアップデート時等に送信センター（図示省略）から通信部７を介して情報を受信して格納しても良い。 Further, the information stored corresponding to the item “control voice” is a control voice uniquely specified by each spectrum ID. The timing for storing the control voice information in the control voice table 9c is arbitrary. For example, the control voice information may be stored in advance at the time of factory shipment, or the communication unit 7 may be connected from a transmission center (not shown) at the time of program update or the like. The information may be received and stored via

（音声制御処理）
次に、このように構成される音声制御システムによって実行される音声制御処理について説明する。 (Voice control processing)
Next, a voice control process executed by the voice control system configured as described above will be described.

この音声制御処理は、概略的に、ユーザが入力した音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える処理であって、本実施の形態では特に、語尾の長さに応じて車載装置１のディスプレイ５の表示内容を制御する処理について説明する。なお、この音声制御処理を実行するタイミングは任意であり、例えば、本実施の形態では車載装置１の電源がオンとなり、ユーザによって特定のコマンドモードに設定された際に、自動的に実行されるものとして説明する。なお、このコマンドモードとは、特定のコマンドを実行するためのモードであって、例えば、制御音声テーブル９ｃの項目「コマンド名称」に対応する５つのモードである「スクロールモード」、「地図縮尺モード」、「地図方位モード」、「リストスクロールモード」、及び「ボリュームモード」の中からユーザに選択されて設定される。このコマンドモードを設定する方法は任意で、例えば、ユーザがタッチパネル４を指で操作して設定しても良い。また、ユーザがマイク３を介して「スクロールモード」、又は「地図縮尺モード」等を音声で入力し、入力された音声を制御部８が解析することにより、特定されたコマンドモードに設定しても良い。なお、このようにコマンドモードの設定を行うための音声解析の具体的な構成や方法については公知であるため、詳細な説明を省略する。以下では、コマンドモードが「スクロールモード」に設定されているものとして説明を行う。 This voice control process is generally a process of continuing the same control or switching the type of control according to the length of the voice ending input by the user. A process for controlling the display content of the display 5 of the in-vehicle device 1 according to the length of the vehicle will be described. The timing for executing this voice control process is arbitrary. For example, in the present embodiment, the on-vehicle apparatus 1 is automatically turned on when the vehicle-mounted device 1 is turned on and set to a specific command mode by the user. It will be explained as a thing. The command mode is a mode for executing a specific command. For example, five modes corresponding to the item “command name” of the control voice table 9c are “scroll mode”, “map scale mode”. ”,“ Map orientation mode ”,“ List scroll mode ”, and“ Volume mode ”are selected and set by the user. The method for setting the command mode is arbitrary. For example, the user may set the command mode by operating the touch panel 4 with a finger. Further, the user inputs “scroll mode” or “map scale mode” or the like via voice through the microphone 3, and the input voice is set by the control unit 8 to analyze the input voice. Also good. In addition, since the specific structure and method of the voice analysis for setting the command mode in this manner are known, detailed description thereof will be omitted. In the following description, it is assumed that the command mode is set to “scroll mode”.

図３は、音声制御処理のフローチャートである。まず、ＳＡ１において制御部８は、マイク３に入力された音声の振幅が閾値以上となったか否かを判定する。この判定の具体的な方法は任意であるが、例えば、車載装置１に公知のデジタル振動センサを設け、このデジタル振動センサにてマイク３を介して入力された音の振幅を測定しても良い。なお、閾値は任意の値に設定できるが、例えば、人の通常の会話時における音声の振幅と同程度の振幅に設定しても良い。 FIG. 3 is a flowchart of the voice control process. First, in SA1, the control unit 8 determines whether or not the amplitude of the sound input to the microphone 3 is equal to or greater than a threshold value. Although the specific method of this determination is arbitrary, for example, a known digital vibration sensor may be provided in the in-vehicle device 1 and the amplitude of sound input via the microphone 3 may be measured by this digital vibration sensor. . Note that the threshold value can be set to an arbitrary value, but may be set to an amplitude comparable to that of a voice during a normal conversation of a person, for example.

そして、振幅が閾値以上でない場合（ＳＡ１、Ｎｏ）、ユーザによる音声入力が無いものとし、ＳＡ１を繰り返すことにより、振幅が閾値以上となるまで待機する。また、振幅が閾値以上である場合（ＳＡ１、Ｙｅｓ）、ユーザによる音声入力が有ったものとし、ＳＡ２に移行する。 If the amplitude is not greater than or equal to the threshold value (SA1, No), it is assumed that there is no voice input by the user, and the process waits until the amplitude exceeds the threshold value by repeating SA1. If the amplitude is equal to or greater than the threshold (SA1, Yes), it is assumed that there is a voice input by the user, and the process proceeds to SA2.

そして、ＳＡ２において制御部８は、音声入力記録を開始する。具体的には、ユーザによってマイク３を介して入力された音声を、データ記録部９に随時記録する。この記録の具体的な方法は任意であるが、例えば、音声情報（例えば、ＷＡＶ情報）をデータ記録部９に記録する。 In SA2, the control unit 8 starts voice input recording. Specifically, the voice input through the microphone 3 by the user is recorded in the data recording unit 9 as needed. Although the specific method of this recording is arbitrary, for example, audio information (for example, WAV information) is recorded in the data recording unit 9.

次に、ＳＡ３において定常判断制御部８ａは、ユーザの音声が定常となったか否かを判定する。「定常」とは、略同一の音が基準時間（例えば、０．５秒）を超えて連続で繰り返されている状態の事を指し、例えば、「上〜」というように音声の語尾が基準時間伸ばされている状態が該当する。この判定には公知の方法を採用でき、例えば、データ記録部９に記録された音声情報をフーリエ解析してスペクトル情報を導出し、当該導出したスペクトル情報に同一音声のスペクトルが現在時刻から直近の基準時間連続で繰り返されている場合に、音声の語尾が基準時間伸ばされている（すなわち、ユーザの音声が定常となった）と判定してもよい。なお、基準時間の具体的な決定方法や数値は任意であるが、例えば、音声の語尾を伸ばすことを意図することなくユーザが音声を発した場合において、同一音声が連続で繰り返される最長時間を実験等で求め、この最長時間を超える時間を基準時間として設定する。また、上記のスペクトル情報以外の情報に基づいて、定常となったか否かの判定を行っても良く、例えばフーリエ解析される以前の音声情報（例えば、ＷＡＶ情報）に基づいて判定を行っても良い。例えば、音声情報そのものの振幅が例えば基準時間（例えば、０．５秒）収束状態にある場合、定常となったと判定しても良い。 Next, in SA3, the stationary determination control unit 8a determines whether or not the user's voice has become stationary. “Stationary” refers to a state in which substantially the same sound is repeated continuously over a reference time (for example, 0.5 seconds). This is the case where the time has been extended. For this determination, a known method can be adopted. For example, the speech information recorded in the data recording unit 9 is Fourier-analyzed to derive spectrum information, and the spectrum of the same speech in the derived spectrum information is the latest from the current time. When the reference time is repeated continuously, it may be determined that the end of the voice is extended by the reference time (that is, the user's voice has become steady). In addition, although the specific determination method and numerical value of reference | standard time are arbitrary, for example, when a user utters a voice without intending to extend the ending of the voice, the longest time that the same voice is repeated continuously is determined. It is obtained by experiment etc., and the time exceeding this maximum time is set as the reference time. Further, it may be determined based on information other than the above-described spectrum information, for example, whether it is steady or not, for example, based on audio information before Fourier analysis (for example, WAV information). good. For example, when the amplitude of the audio information itself is in a convergence state, for example, for a reference time (for example, 0.5 seconds), it may be determined that the sound information has become steady.

そして、定常となっていない場合（ＳＡ３、Ｎｏ）、例えば、「上〜」という音声においては「う」や「え」の音声が入力されている場合等は、ＳＡ３を繰り返すことにより、定常となるまで待機する。一方、定常となった場合（ＳＡ３、Ｙｅｓ）、例えば、「上〜」という音声において「〜」という語尾の部分の音声が入力されている場合には、ＳＡ４に移行する。 And when it is not steady (SA3, No), for example, when “U” or “E” is input in the voice “up to”, etc., by repeating SA3, Wait until On the other hand, when it becomes steady (SA3, Yes), for example, when the voice of the ending part “˜” is inputted in the voice “up”, the process proceeds to SA4.

ＳＡ４において音声解析部８ｂは、データ記録部９に記録された音声情報を解析し、ユーザによる入力音声を特定する。具体的には、音声解析部８ｂは、マイク３を介して入力された音声情報を解析してスペクトル情報を求め、このスペクトル情報における語頭の部分（音声入力が開始されてから、ＳＡ３にて定常となったと判断されるまでの部分）と、発話例ＤＢ９ｂに格納されている制御音声のスペクトル情報とを比較し、略一致する音声を探索することにより、入力音声を特定する。このように、従来は、音声が入力されてから音声が途切れるまでの部分を解析するのが通常であったのに対し、本実施の形態においては音声が入力されてから音声が定常となるまでの部分を解析するが、発話例ＤＢ９ｂには語尾を伸ばした制御音声のスペクトル情報が格納されているので、好適に入力音声を特定する事ができる。なお、音声情報の解析は、上述のようにスペクトル情報の比較ではなく、ＷＡＶ情報の比較によって行ってもよい。 In SA4, the voice analysis unit 8b analyzes the voice information recorded in the data recording unit 9, and specifies the voice input by the user. More specifically, the voice analysis unit 8b analyzes the voice information input via the microphone 3 to obtain spectrum information, and the initial part of the spectrum information (a steady input at SA3 after the voice input is started). The portion up to the point where it is determined that the input speech is determined) is compared with the spectrum information of the control speech stored in the utterance example DB 9b, and the input speech is specified by searching for a speech that substantially matches. As described above, conventionally, it is normal to analyze a portion from when the sound is input until the sound is interrupted, but in the present embodiment, until the sound becomes steady after the sound is input. However, since the utterance example DB 9b stores the spectrum information of the control voice with the ending, the input voice can be suitably specified. Note that the analysis of voice information may be performed not by comparing spectrum information as described above but by comparing WAV information.

次に、ＳＡ５において音声解析部８ｂは、ユーザによる入力音声が、設定されたコマンドに対応する制御音声であるか否かを判定する。具体的には、まず音声解析部８ｂは、発話例ＤＢ９ｂを参照して、ＳＡ４にて特定した音声のスペクトルＩＤを特定する。次に音声解析部８ｂは、制御音声テーブル９ｃを参照し、設定されたモード（本実施の形態ではスクロールモード）のコマンドに対応するスクロールＩＤとして、上述のように特定したスペクトルＩＤが含まれるか否かを判定する。そして、音声解析部８ｂは、設定されたコマンドに対応するスペクトルＩＤが含まれないと判定した場合、ユーザによる入力音声が、設定されたコマンドに対応する制御音声でないものとし（ＳＡ５、Ｎｏ）、音声制御処理を終了する。また、音声解析部８ｂは、設定されたコマンドに対応するスペクトルＩＤが含まれると判定した場合、ユーザによる入力音声が、設定されたコマンドに対応する制御音声であるものとし（ＳＡ５、Ｙｅｓ）、ＳＡ６に移行する。 Next, in SA5, the voice analysis unit 8b determines whether or not the voice input by the user is a control voice corresponding to the set command. Specifically, first, the speech analysis unit 8b refers to the utterance example DB 9b and identifies the spectrum ID of the speech identified in SA4. Next, the voice analysis unit 8b refers to the control voice table 9c, and whether the spectrum ID specified as described above is included as the scroll ID corresponding to the command of the set mode (scroll mode in the present embodiment). Determine whether or not. When the voice analysis unit 8b determines that the spectrum ID corresponding to the set command is not included, the voice input by the user is not a control voice corresponding to the set command (SA5, No). The voice control process ends. When the voice analysis unit 8b determines that the spectrum ID corresponding to the set command is included, the voice input by the user is assumed to be a control voice corresponding to the set command (SA5, Yes). Move to SA6.

ＳＡ６において制御部８は、制御を実行する。具体的には制御部８は、設定されたコマンドモード（本実施の形態では「スクロールモード」）を特定し、ＳＡ４にて特定した制御音声が示す制御内容（本実施の形態では「上」）を特定し、これらの２つに基づいて具体的な制御を実行する。例えば、本実施の形態では、制御部８は、ディスプレイ５に表示された地図を、基準の速度で、基準の量だけ、上方向にスクロールさせる。 In SA6, the control unit 8 executes control. Specifically, the control unit 8 identifies the set command mode (“scroll mode” in the present embodiment), and the control content (“up” in the present embodiment) indicated by the control voice identified in SA4. And specific control is executed based on these two. For example, in the present embodiment, the control unit 8 scrolls the map displayed on the display 5 upward by a reference amount at a reference speed.

次に、ＳＡ７において定常判断制御部８ａは、ユーザの音声が未だに定常であるか否かを判定する。例えば、「上〜」という音声の語尾の部分が、未だに定常的に繰り返されているか否かを判定する。ただし、この判定の具体的な方法については、ＳＡ３と同様に説明できるので、説明を省略する。そして、定常であると判定した場合（ＳＡ７、Ｙｅｓ）、ＳＡ６において再度同様の制御を実行した後、ＳＡ７において再度定常であるか否かの判定を行う。すなわち、ユーザが語尾を伸ばし続けている限り、ＳＡ６及びＳＡ７の処理を繰り返し実行することによって、ＳＡ６において同様の制御を連続的に実行し続ける。このことにより、例えば、ディスプレイ５に表示された地図を上方向へと移動し続ける事ができる。なお、本実施の形態においては２回目以降のスクロールも、１回目のスクロールと同様の速度で移動させるが、これに限らず、２回目以降は制御を重ねる度に速度を上昇又は下降させていっても構わない。以上にて、音声制御処理の説明を終了する。 Next, in SA7, the stationary determination control unit 8a determines whether or not the user's voice is still stationary. For example, it is determined whether or not the ending part of the voice “upper” is still repeated regularly. However, a specific method of this determination can be described in the same manner as SA3, and thus the description is omitted. And when it determines with it being steady (SA7, Yes), after performing the same control again in SA6, it is determined whether it is steady again in SA7. That is, as long as the user continues to extend the ending, the same control is continuously executed in SA6 by repeatedly executing the processes in SA6 and SA7. As a result, for example, the map displayed on the display 5 can continue to move upward. In this embodiment, the second and subsequent scrolls are also moved at the same speed as the first scroll. However, the present invention is not limited to this, and the second and subsequent scrolls are increased or decreased each time control is repeated. It doesn't matter. This is the end of the description of the voice control process.

〔実施の形態に対する変形例〕
以上、本発明に係る実施の形態について説明したが、本発明の具体的な構成及び手段は、特許請求の範囲に記載した本発明の技術的思想の範囲内において、任意に改変及び改良することができる。以下、このような変形例について説明する。 [Modifications to Embodiment]
Although the embodiments of the present invention have been described above, the specific configuration and means of the present invention may be arbitrarily modified and improved within the scope of the technical idea of the present invention described in the claims. Can do. Hereinafter, such a modification will be described.

（解決しようとする課題や発明の効果について）
まず、発明が解決しようとする課題や発明の効果は、上述の内容に限定されるものではなく、発明の実施環境や構成の細部に応じて異なる可能性があり、上述した課題の一部のみを解決したり、上述した効果の一部のみを奏することがある。例えば、音声入力によって、ユーザの所望の制御を容易に行う事が出来ない場合であっても、従来と異なる技術によりユーザの所望の制御を行う事が出来ている場合には、本願発明の課題が解決されている。 (About problems to be solved and effects of the invention)
First, the problems to be solved by the invention and the effects of the invention are not limited to the above contents, and may vary depending on the implementation environment and details of the configuration of the invention. May be solved, or only some of the effects described above may be achieved. For example, even if the user's desired control cannot be easily performed by voice input, the user's desired control can be performed by a technique different from the conventional technique. Has been resolved.

（分散や統合について）
また、上述した各電気的構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各部の分散や統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散又は統合して構成できる。例えば、車載装置１を、相互に通信可能に構成された複数の装置に分散して構成し、これら複数の装置の一部に定常判断制御部８ａを設けると共に、これら複数の装置の他の一部に音声解析部８ｂを設けてもよい。 (About distribution and integration)
Further, each of the electrical components described above is functionally conceptual and does not necessarily need to be physically configured as illustrated. In other words, the specific forms of distribution and integration of each unit are not limited to those shown in the drawings, and all or a part thereof may be functionally or physically distributed or integrated in arbitrary units according to various loads or usage conditions. Can be configured. For example, the in-vehicle device 1 is configured to be distributed among a plurality of devices configured to be able to communicate with each other, and a steady state determination control unit 8a is provided in a part of the plurality of devices, and another one of the plurality of devices. A voice analysis unit 8b may be provided in the unit.

（形状、数値、構造、時系列について）
実施の形態や図面において例示した構成要素に関して、形状、数値、又は複数の構成要素の構造若しくは時系列の相互関係については、本発明の技術的思想の範囲内において、任意に改変及び改良することができる。 (About shape, numerical value, structure, time series)
Regarding the constituent elements exemplified in the embodiment and the drawings, the shape, numerical value, or the structure of a plurality of constituent elements or the mutual relationship in time series may be arbitrarily modified and improved within the scope of the technical idea of the present invention. Can do.

（制御内容について）
本実施の形態では、制御内容として上方向のスクロールを行うものとして説明したが、他の制御内容についても同様に、ユーザの入力音声の語尾の長さに応じて処理を継続する事ができる。例えば、下方向、左方向、右方向、右上方向、左上方向等についても実施の形態と同様に説明する事ができる。また、他のコマンドについても同様に説明する事ができ、例えば、地図縮尺コマンドについては、ユーザの「拡大〜」や「縮小〜」という音声入力があった場合に、語尾の長さに応じて一定の速度で地図縮尺を拡大や縮小していく制御を行う事ができる。なお、この際に、語尾の長さに応じて拡大していく速度や縮小していく速度を上昇させていっても構わない。また、音声制御システムとして構成される装置（本実施の形態では、車載装置１）以外の装置を制御するシステムとして構成しても構わない。例えば、車載装置１と車両に搭載された空調機器とを相互にリンクさせて、ユーザの「上げて〜」という入力音声の語尾の長さに応じて、一定の速度で空調機器の送風温度や風量を上昇させていく制御を行う事としても良い。 (About control details)
Although the present embodiment has been described as performing the upward scrolling as the control content, the processing can be continued for the other control content according to the length of the ending of the input voice of the user. For example, the downward direction, the left direction, the right direction, the upper right direction, the upper left direction, and the like can be described in the same manner as in the embodiment. In addition, other commands can be explained in the same manner. For example, in the case of a map scale command, when there is a voice input such as “enlargement to” or “reduction to” by the user, it depends on the length of the ending. It is possible to control to enlarge or reduce the map scale at a constant speed. At this time, the speed of enlarging or reducing may be increased according to the length of the ending. Moreover, you may comprise as a system which controls apparatuses other than the apparatus (this embodiment in-vehicle apparatus 1) comprised as an audio | voice control system. For example, the in-vehicle device 1 and the air conditioner mounted on the vehicle are linked to each other, and the air temperature of the air conditioner is fixed at a constant speed according to the length of the ending of the input voice of the user “raise up”. It is also possible to perform control to increase the air volume.

また、本実施の形態では、５種類のコマンドのみを明記したが、その他のコマンドについても、適宜制御音声テーブル９ｃに追加して、同様に説明する事が可能である。例えば、「上〜」、「下〜」等の音声入力に応じてディスプレイ５の輝度を調節する「輝度調節コマンド」や、「暑い〜」、「寒い〜」等の音声入力に応じて空調機器の風量を調節する「風量調節コマンド」や、「開けて〜」、「閉めて〜」等の音声入力に応じて車両の窓の開度を調節する「窓調節コマンド」等を適用しても構わない。 In the present embodiment, only five types of commands are specified, but other commands can be added to the control voice table 9c as appropriate and described in the same manner. For example, a “brightness adjustment command” that adjusts the brightness of the display 5 in response to a voice input such as “upper” or “lower” or an air conditioner in response to a voice input such as “hot” or “cold” Even if you apply “air volume adjustment command” that adjusts the air volume of the vehicle, or “window adjustment command” that adjusts the opening of the vehicle window according to the voice input such as “open ~” or “close it ~” I do not care.

また、本実施の形態では、ユーザによる音声入力の語尾の長さに応じて、同一の制御を継続するものとして説明したが、これに限らず、音声入力の語尾の長さに応じて制御の種類を切換えるものとしても良い。例えば、「モード〜」という音声入力に応じて、設定されたコマンドモードを所定時刻間隔で切換えていっても構わない。すなわち、連続性のあるパラメータを制御する場合に限定されず、非連続的なパラメータやコマンドを制御しても良い。 Further, in the present embodiment, it has been described that the same control is continued according to the ending length of the voice input by the user. However, the present invention is not limited to this, and the control is performed according to the ending length of the voice input. It is good also as what switches a kind. For example, the set command mode may be switched at a predetermined time interval in response to a voice input “mode˜”. That is, the present invention is not limited to controlling continuous parameters, and non-continuous parameters and commands may be controlled.

（制御音声について）
また、本実施の形態ではコマンドモードの設定を行った後に、音声入力を行うものとしたが、これらを同時に行うものとしても良い。具体的には、発話例ＤＢ９ｂに制御音声として「スクロール上〜」や「スクロール下〜」といった音声を格納しておき、ユーザによって同様の音声入力が行われた場合には、スクロールモードに設定しつつ、上方向へのスクロールを語尾の長さに応じて継続しても良い。このような制御によれば、コマンドモードの設定を省略する事が可能となる。 (About control voice)
In this embodiment, voice input is performed after the command mode is set, but these may be performed simultaneously. Specifically, voices such as “scrolling up” and “scrolling down” are stored as control voices in the utterance example DB 9b, and when a similar voice input is made by the user, the scroll mode is set. However, the upward scrolling may be continued according to the length of the ending. According to such control, setting of the command mode can be omitted.

また、本実施の形態では、制御音声として「上〜」、「下〜」等といった制御内容の語尾を伸ばした音声を適用したが、これに限られない。例えば、制御内容を含まない単なる音声「あ〜」や「い〜」等の語尾に応じて制御を継続したり制御の種類を切換えたりしても良い。具体的には、「あ〜」という音声が入力された場合、語尾の長さに応じて上方向に地図をスクロールするように構成し、「い〜」という音声が入力された場合、語尾の長さに応じて下方向に地図をスクロールするように構成しても良い。 Further, in the present embodiment, the voice with the ending of the control content such as “upper”, “lower”, etc. is applied as the control voice, but is not limited thereto. For example, the control may be continued or the type of control may be switched according to the ending of a simple voice “A” or “I” that does not include control content. Specifically, when the voice "A ~" is input, the map is scrolled upward according to the length of the ending, and when the voice "I ~" is input, You may comprise so that a map may be scrolled below according to length.

〔実施の形態の特徴と効果の一部〕
最後に、これまでに説明した実施の形態の特徴と効果の一部を、以下に例示する。ただし、実施の形態の特徴と効果は、以下の内容に限定されず、以下の特徴の一部のみを具備することによって以下の効果の一部のみを奏する場合や、以下の特徴以外の他の特徴を具備することによって以下の効果以外の他の効果を奏する場合がある。 [Characteristics and effects of the embodiment]
Finally, some of the features and effects of the embodiments described so far are exemplified below. However, the features and effects of the embodiment are not limited to the following contents, and only some of the following effects are achieved by including only a part of the following features, or other than the following features. By providing the characteristics, there may be other effects than the following effects.

実施の形態の１つの側面１に係る音声制御システムは、音声の入力を受け付ける音声入力受付手段と、前記音声入力受付手段にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御手段と、を備える。 The voice control system according to one aspect 1 of the embodiment performs the same control according to the voice input receiving unit that receives voice input and the length of the ending of the voice received by the voice input receiving unit. Control means for continuing or switching the type of control.

上記側面１に係る音声制御システムによれば、音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換えるので、音声入力によって、ユーザが所望の制御を容易に行う事が可能となる。特に、スクロールのスピードやスクロールの移動量のような連続性のあるパラメータを制御する場合においても、パラメータを複数の段階に区分する必要がないので、パラメータの連続性を損ねることがなく、ユーザの所望の制御を容易に行う事が可能となる。 According to the voice control system according to aspect 1, the same control is continued or the type of control is switched according to the length of the voice ending, so that the user can easily perform desired control by voice input. Things will be possible. In particular, even in the case of controlling continuity parameters such as scroll speed and scroll movement amount, it is not necessary to divide the parameters into a plurality of stages. Desired control can be easily performed.

実施の形態の他の側面２に係る音声制御システムは、上記側面１に係る音声制御システムにおいて、前記制御手段は、前記音声入力受付手段にて前記音声が定常的に入力されているか否かを判断する定常判断制御手段を備え、前記制御手段は、前記定常判断制御手段にて前記音声が定常的に入力されていると判断されてから、前記定常判断制御手段にて前記音声が定常的に入力されていないと判断されるまでの間、前記同一の制御を継続する。 In the voice control system according to another aspect 2 of the embodiment, in the voice control system according to the aspect 1, the control unit determines whether the voice is constantly input by the voice input reception unit. A stationary judgment control means for judging, wherein the control means judges that the voice is steadily input by the steady judgment control means after the voice is steadily inputted by the stationary judgment control means. The same control is continued until it is determined that no input has been made.

上記側面２に係る音声制御システムによれば、制御内容を含む制御音声が入力された場合に、当該制御音声が示す制御内容を、当該制御音声の語尾の長さに応じて継続するので、ユーザが制御内容を他の手段で入力する手間等を省略する事ができ、より簡素に制御内容を指示する事が可能となる。 According to the voice control system according to the aspect 2, when a control voice including the control contents is input, the control contents indicated by the control voice are continued according to the length of the end of the control voice. However, the trouble of inputting the control contents by other means can be omitted, and the control contents can be instructed more simply.

実施の形態の他の側面３に係る音声制御システムは、上記側面１又は側面２に係る音声制御システムにおいて、前記音声入力受付手段にて受け付けられた音声を解析する音声解析手段を備え、前記制御手段は、前記音声解析手段にて前記音声が制御内容を含む制御音声であると解析された場合、当該制御音声が示す制御内容を、当該制御音声の語尾の長さに応じて継続する。 The voice control system according to another aspect 3 of the embodiment is the voice control system according to the side face 1 or the side face 2, further comprising voice analysis means for analyzing the voice received by the voice input acceptance means, When the voice analysis unit analyzes that the voice is a control voice including the control content, the means continues the control content indicated by the control voice according to the length of the ending of the control voice.

上記側面３に係る音声制御システムによれば、音声が定常的に入力されていると判断されてから、音声が定常的に入力されていないと判断されるまでの間、同一の制御を継続するので、音声が定常的に繰り返されている時間に応じて同一の制御を継続する事ができ、ユーザにとって容易な制御が可能となる。 According to the voice control system according to aspect 3 described above, the same control is continued from when it is determined that the voice is steadily input until it is determined that the voice is not steadily input. Therefore, it is possible to continue the same control according to the time during which the sound is regularly repeated, and easy control for the user is possible.

実施の形態の他の側面４に係る音声制御システムは、上記側面１から側面４のいずれかに係る音声制御システムにおいて、前記制御手段は、車両に搭載された車載装置を制御する。 In the voice control system according to another aspect 4 of the embodiment, in the voice control system according to any one of the side face 1 to the side face 4, the control means controls an in-vehicle device mounted on the vehicle.

上記側面４に係る音声制御システムによれば、制御手段は、車両に搭載された車載装置を制御するので、車両の運転時等の手が離せない場合等においても、ユーザが音声を発する事で、語尾の長さに応じた容易な操作が可能となる。 According to the voice control system according to the above aspect 4, since the control means controls the in-vehicle device mounted on the vehicle, even when the hand cannot be released such as when driving the vehicle, the user can make a voice. Easy operation according to the length of the ending is possible.

実施の形態の他の側面５に係る音声制御方法は、音声の入力を受け付ける音声入力受付工程と、前記音声入力受付工程にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御工程と、を含む。 In the voice control method according to the other aspect 5 of the embodiment, the same control is performed according to the voice input acceptance process for accepting voice input and the length of the ending of the voice accepted in the voice input acceptance process. And a control step for switching the type of control.

上記側面５に係る音声制御方法によれば、音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換えるので、音声入力によって、ユーザの所望の制御を容易に行う事が可能となる。特に、スクロールのスピードやスクロールの移動量のような連続性のあるパラメータを制御する場合においても、パラメータを複数の段階に区分する必要がないので、パラメータの連続性を損ねることがなく、ユーザの所望の制御を容易に行う事が可能となる。 According to the voice control method according to the aspect 5, the same control is continued or the type of control is switched according to the length of the voice ending, so that the user's desired control can be easily performed by voice input. Things will be possible. In particular, even in the case of controlling continuity parameters such as scroll speed and scroll movement amount, it is not necessary to divide the parameters into a plurality of stages. Desired control can be easily performed.

実施の形態の他の側面６に係る音声制御プログラムは、音声の入力を受け付ける音声入力受付工程と、前記音声入力受付工程にて受け付けられた音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換える制御工程と、をコンピュータに実行させる。 The voice control program according to the other aspect 6 of the embodiment performs the same control according to the voice input acceptance process for accepting voice input and the length of the ending of the voice accepted in the voice input acceptance process. A control step of continuing or switching the type of control.

上記側面６に係る音声制御プログラムによれば、音声の語尾の長さに応じて、同一の制御を継続し、又は制御の種類を切換えるので、音声入力によって、ユーザの所望の制御を容易に行う事が可能となる。特に、スクロールのスピードやスクロールの移動量のような連続性のあるパラメータを制御する場合においても、パラメータを複数の段階に区分する必要がないので、パラメータの連続性を損ねることがなく、ユーザの所望の制御を容易に行う事が可能となる。 According to the voice control program according to the above aspect 6, the same control is continued or the type of control is switched according to the length of the voice ending, so that the user's desired control can be easily performed by voice input. Things will be possible. In particular, even in the case of controlling continuity parameters such as scroll speed and scroll movement amount, it is not necessary to divide the parameters into a plurality of stages. Desired control can be easily performed.

１車載装置
２スピーカ
３マイク
４タッチパネル
５ディスプレイ
６現在位置取得部
７通信部
８制御部
８ａ定常判断制御部
８ｂ音声解析部
９データ記録部
９ａ地図ＤＢ
９ｂ発話例ＤＢ
９ｃ制御音声テーブル
DESCRIPTION OF SYMBOLS 1 In-vehicle apparatus 2 Speaker 3 Microphone 4 Touch panel 5 Display 6 Current position acquisition part 7 Communication part 8 Control part 8a Steady state determination control part 8b Voice analysis part 9 Data recording part 9a Map DB
9b Utterance example DB
9c Control voice table

Claims

Voice input receiving means for receiving voice input;
Control means for continuing the same control or switching the type of control according to the length of the ending of the voice received by the voice input receiving means,
Voice control system.

The control means includes
A stationary judgment control means for judging whether or not the voice is constantly inputted by the voice input receiving means;
The control means until the steady judgment control means judges that the voice is not constantly inputted after the steady judgment control means judges that the voice is constantly inputted. The same control is continued during
The voice control system according to claim 1.

Voice analysis means for analyzing the voice received by the voice input reception means,
When the voice analysis unit analyzes that the voice is a control voice including control content, the control means continues the control content indicated by the control voice according to the length of the ending of the control voice. ,
The voice control system according to claim 1 or 2.

The control means controls an in-vehicle device mounted on the vehicle.
The voice control system according to any one of claims 1 to 3.

A voice input receiving process for receiving voice input;
A control step of continuing the same control or switching the type of control according to the length of the ending of the voice received in the voice input reception step,
Voice control method.

A voice input receiving process for receiving voice input;
In accordance with the length of the speech ending received in the voice input reception step, the same control is continued, or the control step of switching the type of control,
Voice control program for causing a computer to execute.