JP2011013564A

JP2011013564A - Information presenting device and information presenting method

Info

Publication number: JP2011013564A
Application number: JP2009159024A
Authority: JP
Inventors: Hirosuke Hamaguchi; 弘介濱口
Original assignee: Nissan Motor Co Ltd
Current assignee: Nissan Motor Co Ltd
Priority date: 2009-07-03
Filing date: 2009-07-03
Publication date: 2011-01-20

Abstract

PROBLEM TO BE SOLVED: To provide an information presenting system capable of effectively preventing the occurrence of user's waiting time due to voice synthesis when information is presented with voice.SOLUTION: The information presenting system includes: a means for obtaining a plurality of pieces of information for presenting to a user; a means for calculating a creating time period which is a time period required for creating a voice data from the information; a means for calculating a voice output time period which is a time period required for outputting the voice data created based on the information; and a means for setting a processing order for sequentially creating the voice data from the information, and for sequentially outputting the created voice data, based on the voice output time period and the calculated creating time period. When the processing order is set by the setting means of the information processing system, the information for creating the voice data is set as information for creating and outputting the voice data following the information in which the processing order is already set, before finishing creation and outputting of all voice data regarding the information in which the processing order is already set.

Description

本発明は、情報提示装置および情報提示方法に関するものである。 The present invention relates to an information presentation apparatus and an information presentation method.

ユーザにより予め設定された提示順序に基づいて、複数の情報を、ユーザが所望する順序で、ユーザに対して音声で提示する技術が知られている（特許文献１）。 There is known a technique for presenting a plurality of information to a user by voice in an order desired by the user based on a presentation order preset by the user (Patent Document 1).

特開２００１−３４３９７９号公報JP 2001-343979 A

しかしながら、従来技術では、情報に基づいて音声合成を行うことにより、情報を音声で出力しているが、次に音声出力するための情報について音声合成している間に、既に音声合成された情報の音声出力が終了してしまい、次に音声出力するための情報についての音声合成が終了するまで、次に音声出力するための情報の音声出力が開始されず、ユーザの待ち時間が発生する場合があった。 However, in the prior art, information is output as speech by performing speech synthesis based on information, but information that has already been speech-synthesized while speech synthesis is performed on information for next speech output. If the voice output for the next time is finished, the voice output of the information for the next voice output is not started until the voice synthesis for the information for the next voice output is finished, and there is a waiting time for the user was there.

本発明が解決しようとする課題は、ユーザに情報を音声で提示する際に、音声合成によるユーザの待ち時間の発生を有効に防止できる情報提示装置を提供することである。 The problem to be solved by the present invention is to provide an information presentation device that can effectively prevent the occurrence of a user's waiting time due to speech synthesis when presenting information to the user by voice.

本発明は、複数の情報を取得し、取得した情報に基づいて音声データを順次生成し、かつ、生成した音声データを順次出力するための処理順序を設定するに際し、処理順序が既に設定された情報についての音声データの生成および出力が全て終了するよりも前に、音声データを生成できる情報を、処理順序が既に設定された情報に続いて音声データを生成し、出力するための情報として設定することで、上記課題を解決する。 The present invention acquires a plurality of information, sequentially generates audio data based on the acquired information, and sets the processing order for sequentially outputting the generated audio data, the processing order is already set Information that can generate audio data is set as information for generating and outputting audio data following the information whose processing order has already been set before the generation and output of the audio data for the information is completed. This solves the above problem.

本発明によれば、次に音声データを生成するための情報として、既に生成されている音声データの音声出力が終了するよりも前に、音声データを生成することが可能な情報を設定することができるため、ユーザの待ち時間の発生を有効に防止することができる。 According to the present invention, as information for generating sound data next, information capable of generating sound data is set before the sound output of the already generated sound data is finished. Therefore, it is possible to effectively prevent the waiting time of the user from occurring.

本実施形態に係る情報提示システムの構成図である。It is a lineblock diagram of the information presentation system concerning this embodiment. 本実施形態に係る情報提示処理を示すフローチャートである。It is a flowchart which shows the information presentation process which concerns on this embodiment. ユーザの現在位置周辺に存在するスポットに関するＰＯＩ情報の一例を示す図である。It is a figure which shows an example of POI information regarding the spot which exists around a user's present position. ステップＳ１０４の処理順序設定処理の内容を示すフローチャートである。It is a flowchart which shows the content of the process order setting process of step S104. 図３に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。FIG. 4 is a diagram for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 3. 図３に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。FIG. 4 is a diagram for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 3. 図３に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。It is a figure for demonstrating the relationship between the audio | voice production | generation total time and audio | voice output total time in the example of a scene shown in FIG. 図３に示す場面例において、ＰＯＩ情報の処理順序を設定した結果の一例を示す図である。FIG. 4 is a diagram illustrating an example of a result of setting the processing order of POI information in the scene example illustrated in FIG. 3. ステップＳ１０４の処理順序設定処理の他の場面例を説明するための図である。It is a figure for demonstrating the other scene example of the process order setting process of step S104. 図９に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。FIG. 10 is a diagram for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 9. 図９に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。FIG. 10 is a diagram for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 9. 図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。It is a figure for demonstrating the relationship of the audio | voice total generation time and audio | voice output total time in the example of a scene shown in FIG. 図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。It is a figure for demonstrating the relationship of the audio | voice total generation time and audio | voice output total time in the example of a scene shown in FIG. 図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。It is a figure for demonstrating the relationship of the audio | voice total generation time and audio | voice output total time in the example of a scene shown in FIG. 図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。It is a figure for demonstrating the relationship of the audio | voice total generation time and audio | voice output total time in the example of a scene shown in FIG. 図９に示す場面例において、ＰＯＩ情報の処理順序を設定した結果の一例を示す図である。FIG. 10 is a diagram illustrating an example of a result of setting the processing order of POI information in the scene example illustrated in FIG. 9.

以下、本発明の実施形態を図面に基づいて、本実施形態の情報提示システムについて説明する。図１は、本実施形態に係る情報提示システムの構成を示す図である。図１に示すように、本実施形態の情報提示システムは、車両に搭載される車載装置１００と、車載装置１００の外部に設置され、車載装置１００と情報の授受が可能な外部サーバ２００とから構成される。 Hereinafter, an information presentation system according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 is a diagram illustrating a configuration of an information presentation system according to the present embodiment. As shown in FIG. 1, the information presentation system of this embodiment includes an in-vehicle device 100 mounted on a vehicle and an external server 200 that is installed outside the in-vehicle device 100 and can exchange information with the in-vehicle device 100. Composed.

本実施形態に係る情報提示システムでは、ユーザの現在位置の周辺に存在する施設あるいは観光地などのスポットに関するＰＯＩ情報の取得を要求する要求情報を、車載装置１００から外部サーバ２００に送信し、外部サーバ２００において、車載装置１００から送信された要求情報に応じて、ユーザの現在位置の周辺に存在するスポットに関するＰＯＩ情報を取得する。そして、外部サーバ２００では、音声データの生成による音声出力の停止時間が発生しないように、取得されたＰＯＩ情報に基づく音声データの処理順序を設定し、処理順序が設定されたＰＯＩ情報を車載装置１００に送信することで、車載装置１００において、ＰＯＩ情報に基づいて、処理順序に従い、音声データを生成し、生成された音声データをユーザに提示する。 In the information presentation system according to the present embodiment, request information for requesting acquisition of POI information related to a spot such as a facility or sightseeing spot existing around the current position of the user is transmitted from the in-vehicle device 100 to the external server 200, and the external In the server 200, POI information regarding spots existing around the current position of the user is acquired in accordance with the request information transmitted from the in-vehicle device 100. The external server 200 sets the processing order of the voice data based on the acquired POI information so that the voice output stop time due to the generation of the voice data does not occur, and the POI information in which the processing order is set is set as the in-vehicle device. By transmitting to 100, the in-vehicle device 100 generates audio data according to the processing order based on the POI information, and presents the generated audio data to the user.

まず、車載装置１００の構成について説明する。車載装置１００は、図１示すように、通信装置１１０、入力装置１２０、車載装置用コントローラ１３０、ＧＰＳユニット１４０、スピーカ１５０、およびディスプレイ１６０から構成される。 First, the configuration of the in-vehicle device 100 will be described. As shown in FIG. 1, the in-vehicle device 100 includes a communication device 110, an input device 120, an in-vehicle device controller 130, a GPS unit 140, a speaker 150, and a display 160.

通信装置１１０は、特定の通信プロトコルを用いてネットワークに接続し、外部サーバ２００に対して、車載装置用コントローラ１３０から送信されたユーザの現在位置とＰＯＩ情報の取得を要求するための要求情報とを送信する。また、通信装置１１０は、外部サーバ２００から要求情報に応じた複数のＰＯＩ情報を受信し、受信されたＰＯＩ情報を車載装置用コントローラ１３０に送信する。 The communication device 110 is connected to a network using a specific communication protocol, and request information for requesting the external server 200 to acquire the current position of the user and POI information transmitted from the in-vehicle device controller 130. Send. In addition, the communication device 110 receives a plurality of POI information corresponding to the request information from the external server 200 and transmits the received POI information to the in-vehicle device controller 130.

入力装置１２０は、ユーザにより操作され、ユーザからの入力指示を受け付ける。入力装置１２０としては、例えば、ディスプレイ画面上に配置されるタッチパネルまたはジョイスティックなどのユーザの手操作による入力が可能な装置であってもよいし、あるいは、マイクなどのユーザの発話音声による入力が可能な装置であってもよい。入力装置１２０は、ユーザからの入力指示を、車載装置用コントローラ１３０に送信する。 The input device 120 is operated by a user and receives an input instruction from the user. The input device 120 may be, for example, a device that allows input by a user's manual operation such as a touch panel or a joystick arranged on a display screen, or input by a user's uttered voice such as a microphone. It may be a simple device. The input device 120 transmits an input instruction from the user to the in-vehicle device controller 130.

ＧＰＳユニット１４０は、図示しない複数の衛星通信から送信される電波を検出して、ユーザの位置情報を取得する。取得されたユーザの位置情報は、車載装置用コントローラ１３０を介して、通信装置１１０から外部サーバ２００に送信される。なお、ＧＰＳユニット１４０からユーザの位置情報を取得した車載装置用コントローラ１３０は、図示しないジャイロセンサから取得した角度変化情報および車速センサから取得した車速に基づいて、ユーザの現在位置を補正し、補正したユーザの位置情報を外部サーバ２００に送信してもよい。 The GPS unit 140 detects radio waves transmitted from a plurality of satellite communications (not shown), and acquires user position information. The acquired user location information is transmitted from the communication device 110 to the external server 200 via the in-vehicle device controller 130. The in-vehicle device controller 130 that acquired the user's position information from the GPS unit 140 corrects the user's current position based on the angle change information acquired from the gyro sensor (not shown) and the vehicle speed acquired from the vehicle speed sensor, and The user's location information may be transmitted to the external server 200.

スピーカ１５０は、車載装置用コントローラ１３０から送信されたＰＯＩ情報に基づく音声データを、ユーザに対して音声で出力する。また、ディスプレイ１６０は、車載装置用コントローラ１３０から送信されたＰＯＩ情報を、テキスト情報として、ディスプレイ１６０が備える画面上に表示する。スピーカ１５０およびディスプレイ１６０を介してＰＯＩ情報を提示することで、ユーザに、ユーザの現在位置周辺に存在するスポットに関するＰＯＩ情報を把握させることができる。 The speaker 150 outputs audio data based on the POI information transmitted from the in-vehicle device controller 130 to the user. The display 160 displays the POI information transmitted from the in-vehicle device controller 130 as text information on a screen included in the display 160. By presenting the POI information via the speaker 150 and the display 160, the user can be made aware of the POI information related to the spots existing around the current position of the user.

車載装置用コントローラ１３０は、外部サーバ２００から送信された複数のＰＯＩ情報に基づいて音声データを生成し、出力するためのプログラムを格納したＲＯＭ（Read Only Memory）と、このＲＯＭに格納されたプログラムを実行するＣＰＵ（Central Processing Unit）と、外部サーバ２００から送信されたＰＯＩ情報、またはユーザの操作履歴などを記憶し、アクセス可能な記憶装置として機能するＲＡＭ（Random Access Memory）とから構成される。なお、動作回路としては、ＣＰＵ（Central Processing Unit）に代えて又はこれとともに、ＭＰＵ（Micro Processing Unit）、ＤＳＰ（Digital Signal Processor）、ＡＳＩＣ（Application Specific Integrated Circuit）、ＦＰＧＡ（Field Programmable Gate Array）などを用いることができる。 The in-vehicle device controller 130 includes a ROM (Read Only Memory) that stores a program for generating and outputting audio data based on a plurality of POI information transmitted from the external server 200, and a program stored in the ROM. And a RAM (Random Access Memory) that stores POI information transmitted from the external server 200 or a user operation history and functions as an accessible storage device. . As an operation circuit, instead of or in addition to a CPU (Central Processing Unit), an MPU (Micro Processing Unit), a DSP (Digital Signal Processor), an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array), etc. Can be used.

車載装置用コントローラ１３０は、ＲＯＭに格納されたプログラムをＣＰＵにより実行することにより取得機能および生成機能を実現する。以下、車載装置用コントローラ１３０が有する各機能について説明する。 The in-vehicle device controller 130 implements an acquisition function and a generation function by executing a program stored in the ROM by the CPU. Hereinafter, each function of the in-vehicle device controller 130 will be described.

車載装置用コントローラ１３０の取得機能は、車載装置１００の通信装置１１０を介して、外部サーバ２００から送信された複数のＰＯＩ情報を取得する。 The acquisition function of the in-vehicle device controller 130 acquires a plurality of POI information transmitted from the external server 200 via the communication device 110 of the in-vehicle device 100.

車載装置用コントローラ１３０の生成機能は、外部サーバ２００から送信された各ＰＯＩ情報に基づいて、音声データを生成する。具体的には、ＰＯＩ情報にはスポットの名称および詳細を音声で出力するためのテキストデータ（以下、音声出力用のテキストデータとも言う）が含まれおり、ＰＯＩ情報に含まれる音声出力用のテキストデータを用いて、波形接続方式やＨＭＭ（ＨｉｄｄｅｎＭａｒｋｏｖＭｏｄｅｌ）など既知の音声合成方法などにより、ＰＯＩ情報に含まれる音声出力用のテキストデータから音声出力用のデジタル信号である音声データを生成する。生成された音声データは、図示しないＤ／Ａコンバータでアナログ信号に変換された後、スピーカ１５０に送信される。 The generation function of the in-vehicle device controller 130 generates audio data based on each POI information transmitted from the external server 200. Specifically, the POI information includes text data for outputting the spot name and details by voice (hereinafter also referred to as voice output text data), and the voice output text included in the POI information. Using the data, voice data that is a digital signal for voice output is generated from text data for voice output included in the POI information by a known voice synthesis method such as a waveform connection method or HMM (Hidden Markov Model). The generated audio data is converted to an analog signal by a D / A converter (not shown) and then transmitted to the speaker 150.

次に外部サーバ２００について説明する。外部サーバ２００は、例えば、ＨＴＴＰなどの通信プロトコルで、車載装置１００から送信されたＰＯＩ情報取得の要求情報を受信し、要求情報に応じたＰＯＩ情報を車載装置１００に送信するＷｅｂサーバなどである。さらに、外部サーバ２００は、後述するように、データベース２３０から取得した複数のＰＯＩ情報について、音声データを生成し、出力するために適した処理順序を設定し、処理順序が設定されたＰＯＩ情報を車載装置１００に送信する。なお、外部サーバ２００が用いる通信プロトコルはＨＴＴＰに限られず、一般に用いられている通信プロトコルを用いることができる。 Next, the external server 200 will be described. The external server 200 is, for example, a web server that receives POI information acquisition request information transmitted from the in-vehicle device 100 using a communication protocol such as HTTP, and transmits POI information corresponding to the request information to the in-vehicle device 100. . Further, as will be described later, the external server 200 sets a processing order suitable for generating and outputting audio data for a plurality of POI information acquired from the database 230, and sets the POI information in which the processing order is set. It transmits to the vehicle equipment 100. Note that the communication protocol used by the external server 200 is not limited to HTTP, and a commonly used communication protocol can be used.

図１に示すように、外部サーバ２００は、通信装置２１０、サーバコントローラ２２０、データベース２３０を備える。以下、外部サーバ２００の各構成について説明する。 As shown in FIG. 1, the external server 200 includes a communication device 210, a server controller 220, and a database 230. Hereinafter, each configuration of the external server 200 will be described.

通信装置２１０は、車載装置１００から送信されたユーザの位置情報、およびＰＯＩ情報取得の要求情報を受信し、サーバコントローラ２２０に送信する。また、通信装置２１０は、サーバコントローラ２２０から送信される要求情報に応じたＰＯＩ情報を車載装置１００に送信する。 The communication device 210 receives the user location information and the POI information acquisition request information transmitted from the in-vehicle device 100 and transmits them to the server controller 220. In addition, the communication device 210 transmits POI information corresponding to the request information transmitted from the server controller 220 to the in-vehicle device 100.

データベース２３０は、複数のＰＯＩ情報を格納している。ＰＯＩ情報は、上述したように、施設および観光地などのスポットに関する情報であり、施設や観光地などのスポットの位置情報に加えて、これらスポットの名称およびスポットの詳細情報を含んでいる。さらに、上述したように、ＰＯＩ情報には、スポットの名称および詳細を音声で出力するためのテキストデータが含まれており、この音声出力用のテキストデータもデータベース２３０に格納されている。 The database 230 stores a plurality of POI information. As described above, the POI information is information related to spots such as facilities and sightseeing spots, and includes the name of the spots and detailed information of the spots in addition to the position information of spots such as facilities and sightseeing spots. Further, as described above, the POI information includes text data for outputting the name and details of the spot by voice. This text data for voice output is also stored in the database 230.

サーバコントローラ２２０は、車載装置１００の車載装置用コントローラ１３０と同様に、ＲＯＭ、ＣＰＵ、およびＲＡＭとから構成される。サーバコントローラ２２０が有するＲＯＭには、車載装置１００から送信されたＰＯＩ情報取得の要求情報に応じて、データベース２３０から複数のＰＯＩ情報を取得し、取得された複数のＰＯＩ情報の処理順序を設定するためのプログラムが格納されており、サーバコントローラ２２０のＣＰＵは、ＲＯＭに格納されたプログラムを実行することで、取得機能、生成時間算出機能、出力時間算出機能、および設定機能の各機能を実現する。以下、サーバコントローラ２２０が備える各機能について説明する。 The server controller 220 includes a ROM, a CPU, and a RAM, like the in-vehicle device controller 130 of the in-vehicle device 100. In the ROM of the server controller 220, a plurality of POI information is acquired from the database 230 in accordance with the POI information acquisition request information transmitted from the in-vehicle device 100, and the processing order of the acquired plurality of POI information is set. The CPU of the server controller 220 realizes each function of an acquisition function, a generation time calculation function, an output time calculation function, and a setting function by executing the program stored in the ROM. . Hereinafter, each function with which the server controller 220 is provided is demonstrated.

サーバコントローラ２２０の取得機能は、車載装置１００から送信されたユーザの現在位置およびＰＯＩ情報の要求情報に基づいて、ユーザの現在位置の周辺に存在する施設や観光地などのスポットに関するＰＯＩ情報を、データベース２３０から取得する。 Based on the user's current location and POI information request information transmitted from the in-vehicle device 100, the server controller 220 obtains POI information related to spots such as facilities and tourist spots existing around the user's current location. Obtain from the database 230.

サーバコントローラ２２０の設定機能は、取得機能により取得された複数のＰＯＩ情報に基づいて、音声データを生成し、出力するための処理順序を設定する。設定機能により処理順序を設定する手法については、後述する。 The setting function of the server controller 220 sets a processing order for generating and outputting audio data based on a plurality of POI information acquired by the acquisition function. A method for setting the processing order by the setting function will be described later.

サーバコントローラ２２０の生成時間算出機能は、取得機能により取得されたＰＯＩ情報ごとに、該ＰＯＩ情報に基づいて音声データを生成するのに要する時間である音声生成時間を算出する。ここで、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長（音声出力用のテキストデータのバイト数）と、音声生成時間とは一定の対応関係を有し、音声出力用のテキストデータのデータ長が大きいほど、音声生成時間は長くなる。生成時間算出機能は、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長に基づいて、該ＰＯＩ情報の音声生成時間を算出する。 The generation time calculation function of the server controller 220 calculates, for each POI information acquired by the acquisition function, a sound generation time that is a time required to generate sound data based on the POI information. Here, the data length of the text data for voice output included in the POI information (the number of bytes of text data for voice output) and the voice generation time have a certain correspondence relationship, and the text data for voice output The longer the data length, the longer the voice generation time. The generation time calculation function calculates the voice generation time of the POI information based on the data length of the text data for voice output included in the POI information.

サーバコントローラ２２０の出力時間算出機能は、取得機能により取得されたＰＯＩ情報ごとに、該ＰＯＩ情報に基づいて生成された音声データを音声出力するのに要する時間である音声出力時間を算出する。ここで、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長と、ＰＯＩ情報に基づいて生成された音声データを音声出力するのに要する音声出力時間とは一定の対応関係を有し、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長が大きいほど、音声出力時間は長くなる。出力時間算出機能は、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長に基づいて、ＰＯＩ情報の音声出力時間を算出する。 The output time calculation function of the server controller 220 calculates, for each POI information acquired by the acquisition function, an audio output time that is a time required for outputting audio data generated based on the POI information. Here, the data length of the text data for voice output included in the POI information and the voice output time required to output the voice data generated based on the POI information have a certain correspondence relationship. The longer the data length of the text data for voice output included in the information, the longer the voice output time. The output time calculation function calculates the voice output time of the POI information based on the data length of the text data for voice output included in the POI information.

次に、図２を参照して、本実施形態の情報提示処理について説明する。図２は、本実施形態に係る情報提示処理を示すフローチャートである。 Next, the information presentation process of this embodiment will be described with reference to FIG. FIG. 2 is a flowchart showing information presentation processing according to the present embodiment.

ステップＳ１０１では、車載装置１００の車載装置用コントローラ１３０により、ＰＯＩ情報取得の要求情報があるか判断される。ユーザにより、車載装置１００の入力装置１２０を介して、ＰＯＩ情報取得の要求指示が入力されると、ＰＯＩ情報取得の要求情報が入力装置１２０から車載装置用コントローラ１３０に送信される。車載装置用コントローラ１３０によりＰＯＩ情報取得の要求情報が取得されると、ＰＯＩ情報取得の要求情報があると判断され、ステップＳ１０２に進む。一方、ＰＯＩ情報取得の要求情報がないと判断された場合は、一定時間経過後にステップＳ１０１を繰り返し、再度、ＰＯＩ情報取得の要求情報があるか判断される。 In step S <b> 101, the in-vehicle device controller 130 of the in-vehicle device 100 determines whether there is POI information acquisition request information. When a user inputs a POI information acquisition request instruction via the input device 120 of the in-vehicle device 100, the POI information acquisition request information is transmitted from the input device 120 to the in-vehicle device controller 130. If the POI information acquisition request information is acquired by the in-vehicle device controller 130, it is determined that there is POI information acquisition request information, and the process proceeds to step S102. On the other hand, if it is determined that there is no POI information acquisition request information, step S101 is repeated after a certain period of time, and it is determined again whether there is POI information acquisition request information.

ステップＳ１０２では、ＰＯＩ情報取得の要求情報が、通信装置１１０を介して、車載装置１００から外部サーバ２００に送信される。また、車載装置１００から外部サーバ２００にＰＯＩ情報取得の要求情報を送信する際に、車載装置用コントローラ１３０は、ＧＰＳユニット１４０からユーザの現在位置を取得し、ＰＯＩ情報取得の要求情報とともに、ユーザの現在位置を、外部サーバ２００に送信する。 In step S <b> 102, POI information acquisition request information is transmitted from the in-vehicle device 100 to the external server 200 via the communication device 110. Further, when transmitting the POI information acquisition request information from the in-vehicle device 100 to the external server 200, the in-vehicle device controller 130 acquires the current position of the user from the GPS unit 140, and together with the POI information acquisition request information, the user Is transmitted to the external server 200.

ステップＳ１０３では、外部サーバ２００のサーバコントローラ２２０により、通信装置２１０を介して、車載装置１００から送信されたユーザの現在位置とＰＯＩ情報の要求情報とが取得される。そして、サーバコントローラ２２０の取得機能は、車載装置１００から送信されたユーザの現在位置およびＰＯＩ情報取得の要求情報に基づいて、データベース２３０に格納された複数のＰＯＩ情報の中から、ユーザの現在位置の周辺に存在するスポットに関するＰＯＩ情報を、予め設定された数だけ取得する。 In step S103, the server controller 220 of the external server 200 acquires the user's current location and POI information request information transmitted from the in-vehicle device 100 via the communication device 210. Then, the acquisition function of the server controller 220 is based on the current position of the user transmitted from the in-vehicle device 100 and the POI information acquisition request information, and the current position of the user is selected from the plurality of POI information stored in the database 230. A predetermined number of POI information relating to spots existing in the vicinity of is acquired.

図３は、ユーザの現在位置周辺に存在するスポットと、これらスポットに関するＰＯＩ情報の一例を示す図である。図３においては、ユーザの現在位置の周辺に存在するスポットに関する各ＰＯＩ情報を、ユーザの現在位置に近い順から、ＰＯＩ情報１〜ＰＯＩ情報７として表している。なお、図３に示すバイト数は、ＰＯＩ情報１〜ＰＯＩ情報７に含まれる音声出力用のテキストデータのデータ長（バイト数）を表している。例えば、サーバコントローラ２２０の取得機能により取得されるＰＯＩ情報の数が『５』に設定されている場合、取得機能は、車載装置１００から送信されたユーザの現在位置と、データベース２３０に格納された各ＰＯＩ情報が有する位置情報とに基づいて、ユーザの現在位置から近い５つのＰＯＩ情報、すなわちＰＯＩ情報１〜ＰＯＩ情報５を取得する。 FIG. 3 is a diagram illustrating an example of spots existing around the current position of the user and POI information relating to these spots. In FIG. 3, each POI information regarding spots existing around the current position of the user is represented as POI information 1 to POI information 7 in order from the closest to the current position of the user. The number of bytes shown in FIG. 3 represents the data length (number of bytes) of the text data for voice output included in the POI information 1 to POI information 7. For example, when the number of POI information acquired by the acquisition function of the server controller 220 is set to “5”, the acquisition function is stored in the database 230 and the current position of the user transmitted from the in-vehicle device 100. Based on the position information of each POI information, five pieces of POI information close to the current position of the user, that is, POI information 1 to POI information 5 are acquired.

ステップＳ１０４では、サーバコントローラ２２０により、データベース２３０から取得された所定数のＰＯＩ情報の処理順序が設定される。図４は、ステップＳ１０４の処理順序設定処理の内容を示すフローチャートである。以下、図４を参照して、ステップＳ１０４の処理順序設定処理について説明する。なお、以下においては、図３に示すユーザの現在位置の周辺に存在するスポットに関するＰＯＩ情報１〜ＰＯＩ情報５を抽出した場面を例示して説明する。 In step S104, the server controller 220 sets the processing order of a predetermined number of POI information acquired from the database 230. FIG. 4 is a flowchart showing the contents of the processing order setting process in step S104. Hereinafter, the processing order setting processing in step S104 will be described with reference to FIG. In the following, a scene in which POI information 1 to POI information 5 relating to spots existing around the current position of the user shown in FIG. 3 is extracted will be described as an example.

まず、ステップＳ２０１では、データベース２３０から取得されたＰＯＩ情報のうち、音声生成時間が最も短いＰＯＩ情報が選択され、選択されたＰＯＩ情報が音声データを最初に生成し、出力するためのＰＯＩ情報として設定される。なお、上述したように、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長と、ＰＯＩ情報に基づいて音声データを生成するための音声生成時間とが一定の対応関係を有しており、サーバコントローラ２２０の設定機能は、取得された所定数のＰＯＩ情報のうち、音声出力用のテキストデータのデータ長が最も小さいＰＯＩ情報を、音声生成時間が最も短いＰＯＩ情報であるとして、音声データを最初に生成し、出力するためのＰＯＩ情報として設定する。 First, in step S201, POI information having the shortest voice generation time is selected from the POI information acquired from the database 230, and the selected POI information is used as POI information for generating and outputting voice data first. Is set. As described above, the data length of the text data for voice output included in the POI information and the voice generation time for generating the voice data based on the POI information have a certain correspondence relationship. The setting function of the server controller 220 determines that the POI information having the shortest data length of the text data for voice output is the POI information having the shortest voice generation time out of the predetermined number of acquired POI information. First, it is set as POI information to be generated and output.

図５および図６は、図３に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。なお、図５および図６に示す各図において、横軸は各ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長（バイト数）を表しており、縦軸はＰＯＩ情報に基づいて音声データが生成され、出力される処理順序を表している。なお縦軸の処理順序は、上に位置するほど先に音声データが生成され、出力されるＰＯＩ情報であることを表している。また、図５に示す各図おいて、処理順序が設定されているＰＯＩ情報を黒色で、また後述する次候補に選択されているＰＯＩ情報を斜線のハッチングで表している。例えば、図５（Ｃ）において、ＰＯＩ情報４は処理順序が設定されており、またＰＯＩ情報２は次候補に選択されていることを示す。 5 and 6 are diagrams for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 5 and FIG. 6, the horizontal axis represents the data length (number of bytes) of the text data for voice output included in each POI information, and the vertical axis represents the voice data based on the POI information. Represents the order of processing to be generated and output. Note that the processing order on the vertical axis indicates POI information that is generated and output earlier as the position is higher. Further, in each diagram shown in FIG. 5, the POI information for which the processing order is set is represented in black, and the POI information selected as the next candidate to be described later is represented by hatching. For example, in FIG. 5C, the POI information 4 indicates that the processing order is set, and the POI information 2 is selected as the next candidate.

ステップＳ１０３において、データベース２３０からＰＯＩ情報を取得した時点においては、図５（Ａ）に示すように、データベース２３０から取得された各ＰＯＩ情報は、ユーザの現在位置から近い順序で並んでいる。ステップＳ２０１では、設定機能により、これらのＰＯＩ情報のうち、音声出力用のテキストデータのデータ長が最も小さいＰＯＩ情報であるＰＯＩ情報４が選択され、最初に音声データを生成し、出力するためのＰＯＩ情報として設定される。その結果、図５（Ｂ）に示すように、最初に生成されるＰＯＩ情報としてＰＯＩ情報４が設定され、ＰＯＩ情報４の次以降に、残りのＰＯＩ情報が、ユーザの現在位置から近い順序で並ぶ。 In step S103, when the POI information is acquired from the database 230, as shown in FIG. 5A, the POI information acquired from the database 230 is arranged in the order close to the current position of the user. In step S201, the POI information 4 that is the POI information having the smallest data length of the text data for voice output is selected from the POI information by the setting function, and voice data is first generated and output. Set as POI information. As a result, as shown in FIG. 5B, the POI information 4 is set as the POI information generated first, and after the POI information 4, the remaining POI information is arranged in the order close to the current position of the user. line up.

続いて、ステップＳ２０２では、取得機能によりデータベース２３０から取得された複数のＰＯＩ情報のうち、処理順序が設定されていないＰＯＩ情報が、音声出力用のテキストデータのデータ長の降順に並び替えられる。例えば、図３に示す場面例では、データベース２３０から取得されるＰＯＩ情報１からＰＯＩ情報５までのＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長は、ＰＯＩ情報１が７００バイト、ＰＯＩ情報２が９００バイト、ＰＯＩ情報３が８００バイト、ＰＯＩ情報４が５００バイト、およびＰＯＩ情報５が６００バイトである。そこで、設定機能は、図５（Ｃ）に示すように、処理順序が設定されているＰＯＩ情報４に続けて、音声出力用のテキストデータのデータ長の大きい順に、ＰＯＩ情報２（９００バイト）、ＰＯＩ情報３（８００バイト）、ＰＯＩ情報１（７００バイト）、ＰＯＩ情報５（５００バイト）と並べ替える。 Subsequently, in step S202, among the plurality of POI information acquired from the database 230 by the acquisition function, the POI information for which the processing order is not set is rearranged in descending order of the data length of the text data for voice output. For example, in the scene example shown in FIG. 3, the data length of the text data for voice output included in the POI information from POI information 1 to POI information 5 acquired from the database 230 is 700 bytes for POI information 1 and POI information. 2 is 900 bytes, POI information 3 is 800 bytes, POI information 4 is 500 bytes, and POI information 5 is 600 bytes. Therefore, as shown in FIG. 5 (C), the setting function is the POI information 2 (900 bytes) in the descending order of the data length of the text data for voice output following the POI information 4 in which the processing order is set. , POI information 3 (800 bytes), POI information 1 (700 bytes), and POI information 5 (500 bytes).

次に、ステップＳ２０３では、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も長いＰＯＩ情報が、音声順序が既に設定されているＰＯＩ情報の次に音声データが生成されるＰＯＩ情報の候補、すなわち次候補として選択される。例えば、図５（Ｃ）に示す例では、処理順序が設定されていないＰＯＩ情報１〜ＰＯＩ情報３およびＰＯＩ情報５のうち、音声生成時間が最も長いＰＯＩ情報であるＰＯＩ情報２が次候補として選択される。なお、音声生成時間が最も長いＰＯＩ情報を選択する際には、ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長が最も大きいＰＯＩ情報を、音声生成時間が最も長いＰＯＩ情報であるとして選択すればよい。 Next, in step S203, among the POI information for which the processing order is not set, the POI information having the longest voice generation time is the POI information for which voice data is generated next to the POI information for which the voice order has already been set. Selected as the next candidate. For example, in the example shown in FIG. 5C, among the POI information 1 to POI information 3 and POI information 5 in which the processing order is not set, POI information 2 that is POI information with the longest voice generation time is set as the next candidate. Selected. When selecting the POI information having the longest voice generation time, the POI information having the longest data length of the text data for voice output included in the POI information is selected as the POI information having the longest voice generation time. do it.

ステップＳ２０４では、サーバコントローラ２２０の生成時間算出機能により、音声データの処理順序が設定されている全てのＰＯＩ情報の音声生成時間と、次候補に選択されているＰＯＩ情報の音声生成時間とが算出され、算出された処理順序が設定されている全てのＰＯＩ情報の音声生成時間と、次候補に選択されているＰＯＩ情報の音声生成時間との合計時間である音声生成合計時間が算出される。 In step S204, the generation time calculation function of the server controller 220 calculates the voice generation time of all POI information for which the processing order of the voice data is set and the voice generation time of the POI information selected as the next candidate. Then, the total voice generation time that is the total time of the voice generation times of all the POI information for which the calculated processing order is set and the voice generation time of the POI information selected as the next candidate is calculated.

図７は、図３に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。図７に示すように、生成時間算出機能は、処理順序が設定されているＰＯＩ情報４および次候補に選択されているＰＯＩ情報２に含まれる音声出力用のテキストデータのデータ長に基づいて、ＰＯＩ情報４の音声生成時間とＰＯＩ情報２の音声生成時間とを算出し、算出されたＰＯＩ情報４の音声生成時間とＰＯＩ情報２の音声生成時間との合計時間を音声生成合計時間として算出する。 FIG. 7 is a diagram for explaining the relationship between the total voice generation time and the total voice output time in the scene example shown in FIG. As shown in FIG. 7, the generation time calculation function is based on the data length of the text data for voice output included in the POI information 4 in which the processing order is set and the POI information 2 selected as the next candidate. The voice generation time of the POI information 4 and the voice generation time of the POI information 2 are calculated, and the total time of the calculated voice generation time of the POI information 4 and the voice generation time of the POI information 2 is calculated as the voice generation total time. .

続くステップＳ２０５では、サーバコントローラ２２０の出力時間算出機能により、既に音声データの処理順序が設定されている全てのＰＯＩ情報の音声出力時間がそれぞれ算出され、処理順序が既に設定されている全てのＰＯＩ情報の音声出力時間と、生成時間算出機能により算出された最初に音声データが生成され、出力されるＰＯＩ情報の音声生成時間との合計時間が音声出力合計時間として算出される。 In subsequent step S205, the output time calculation function of the server controller 220 calculates the sound output times of all the POI information for which the processing order of the sound data has already been set, and all the POIs for which the processing order has already been set. The total time of the voice output time of the information and the voice generation time of the POI information to be output is calculated as the voice output total time firstly calculated by the generation time calculation function.

例えば、図３に示す場面例では、図７に示すように、ＰＯＩ情報２およびＰＯＩ情報４に含まれる音声出力用のテキストデータのデータ長に基づいて、既に処理順序が設定されているＰＯＩ情報４の音声出力時間と、最初に音声データが生成され、出力されるＰＯＩ情報４の音声生成時間とが算出され、算出されたＰＯＩ情報４の音声出力時間とＰＯＩ情報４の音声生成時間との合計時間である音声出力合計時間が算出される。 For example, in the scene example shown in FIG. 3, as shown in FIG. 7, the POI information whose processing order has already been set based on the data length of the text data for audio output included in the POI information 2 and the POI information 4 4 and the voice generation time of the POI information 4 that is first generated and output, and the calculated voice output time of the POI information 4 and the voice generation time of the POI information 4 are calculated. The total audio output time, which is the total time, is calculated.

ステップＳ２０６では、サーバコントローラ２２０の設定機能により、ステップＳ２０４で算出された音声生成合計時間と、ステップＳ２０５で算出された音声出力合計時間とが比較され、音声出力合計時間が音声生成合計時間よりも長いか判断される。すなわち、既に処理順序が設定されているＰＯＩ情報に基づく音声データの音声出力が終了する前に、次候補に選択されているＰＯI情報に基づく音声データを生成することができるか判断される。音声出力合計時間が音声生成合計時間よりも長いと判断された場合は、ステップＳ２０７に進み、一方、音声出力合計時間が音声生成合計時間以下と判断された場合は、ステップＳ２０９に進む。 In step S206, the setting function of the server controller 220 compares the total voice generation time calculated in step S204 with the total voice output time calculated in step S205, and the total voice output time is shorter than the total voice generation time. It is judged whether it is long. That is, it is determined whether or not the sound data based on the POI information selected as the next candidate can be generated before the sound output of the sound data based on the POI information whose processing order has already been set ends. When it is determined that the total voice output time is longer than the total voice generation time, the process proceeds to step S207. On the other hand, when it is determined that the total voice output time is equal to or less than the total voice generation time, the process proceeds to step S209.

図３に示す場面例では、図７に示すように、最初に音声データが生成され、出力されるＰＯＩ情報４の音声生成時間と既に処理順序が設定されているＰＯＩ情報４の音声出力時間との合計時間である音声出力合計時間は、既に処理順序が設定されたＰＯＩ情報４の音声生成時間と次候補に選択されたＰＯＩ情報２の音声生成時間との合計時間である音声生成合計時間よりも長いため、ステップＳ２０７に進む。 In the scene example shown in FIG. 3, as shown in FIG. 7, the voice generation time of the POI information 4 that is first generated and output, the voice output time of the POI information 4 for which the processing order has already been set, The total voice output time is the total voice generation time which is the total time of the voice generation time of the POI information 4 whose processing order has already been set and the voice generation time of the POI information 2 selected as the next candidate. Since it is long, the process proceeds to step S207.

ステップＳ２０７では、サーバコントローラ２２０の設定機能により、次候補に選択されたＰＯＩ情報の処理順序が設定される。具体的には、次候補に選択されたＰＯＩ情報の処理順序が、既に処理順序が設定されているＰＯＩ情報の次に設定される。これにより、次候補に選択されたＰＯＩ情報に基づく音声データは、処理順序が既に設定されているＰＯＩ情報に続いて生成され、出力されることになる。図３に示す場面例では、図７に示すように、最初に音声データが生成され、出力されるＰＯＩ情報４の音声生成時間と既に処理順序が設定されているＰＯＩ情報４の音声出力時間との合計時間である音声出力合計時間が、既に処理順序が設定されているＰＯＩ情報４の音声生成時間と次候補に選択されたＰＯＩ情報２の音声生成時間との合計時間である音声生成合計時間よりも長いため、図５（Ｃ）に示すように、次候補に選択されているＰＯＩ情報２の処理順序が、図６（Ａ）に示すように、既に処理順序が設定されているＰＯＩ情報４の次に設定される。 In step S207, the processing function of the POI information selected as the next candidate is set by the setting function of the server controller 220. Specifically, the processing order of the POI information selected as the next candidate is set next to the POI information for which the processing order has already been set. As a result, the audio data based on the POI information selected as the next candidate is generated and output following the POI information whose processing order has already been set. In the scene example shown in FIG. 3, as shown in FIG. 7, the voice generation time of the POI information 4 that is first generated and output, the voice output time of the POI information 4 for which the processing order has already been set, The total voice output time is the total time of the voice generation time of the POI information 4 whose processing order has already been set and the voice generation time of the POI information 2 selected as the next candidate. As shown in FIG. 5 (C), the processing order of the POI information 2 selected as the next candidate is POI information whose processing order has already been set as shown in FIG. 6 (A). 4 is set next.

ステップＳ２０８では、サーバコントローラ２２０の設定機能により、全てのＰＯＩ情報について、処理順序が設定されたか判断される。全てのＰＯＩ情報について処理順序が設定されていない場合は、ステップＳ２０９に進み、一方、全てのＰＯＩ情報について処理順序が設定された場合は、ステップＳ１０４の処理順序設定処理を終了する。 In step S208, it is determined by the setting function of the server controller 220 whether the processing order has been set for all POI information. If the processing order has not been set for all POI information, the process proceeds to step S209. On the other hand, if the processing order has been set for all POI information, the processing order setting process in step S104 ends.

図３に示す場面例では、図６（Ａ）に示す時点において、データベース２３０から取得されたＰＯＩ情報１〜ＰＯＩ情報５のうち、ＰＯＩ情報４およびＰＯＩ情報２について処理順序が設定されているが、ＰＯＩ情報３、ＰＯＩ情報１、およびＰＯＩ情報５について処理順序が設定されていない。そのため、ステップＳ２０８では、全てのＰＯＩ情報の処理順序が設定されているものではないと判断され、ステップＳ２０９に進む。 In the example of the scene shown in FIG. 3, the processing order is set for POI information 4 and POI information 2 out of POI information 1 to POI information 5 acquired from the database 230 at the time shown in FIG. No processing order is set for POI information 3, POI information 1, and POI information 5. Therefore, in step S208, it is determined that the processing order of all POI information is not set, and the process proceeds to step S209.

ステップＳ２０９では、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も長いＰＯＩ情報が次候補として選択される。図３に示す場面例では、図６（Ａ）に示すように、ＰＯＩ情報２の処理順序が設定された後は、処理順序が設定されてないＰＯＩ情報１〜ＰＯＩ情報３のうち、音声生成時間が最も長いＰＯＩ情報３が、新たな次候補として設定される。 In step S209, POI information with the longest voice generation time is selected as the next candidate among the POI information for which the processing order is not set. In the scene example shown in FIG. 3, as shown in FIG. 6A, after the processing order of POI information 2 is set, voice generation is performed among POI information 1 to POI information 3 in which the processing order is not set. The POI information 3 with the longest time is set as a new next candidate.

ステップＳ２０９において、処理順序が設定されてないＰＯＩ情報のうち音声生成時間が最も長いＰＯＩ情報が次候補に選択された後は、図４に示すように、ステップＳ２０４に戻り、新たに次候補として選択されたＰＯＩ情報に基づいて、ステップＳ２０４からステップＳ２０６の処理が行われる。ここで、図８は、図３に示す場面例において、ＰＯＩ情報の処理順序を設定した結果の一例を示す図である。図３に示す場面例では、図８に示すように、ＰＯＩ情報３を新たに次候補とした場合も、最初に音声データに生成されるＰＯＩ情報４の音声生成時間と既に処理順序が設定されているＰＯＩ情報４およびＰＯＩ情報２の音声出力時間との合計時間である音声出力合計時間が、既に処理順序が設定されているＰＯＩ情報４およびＰＯＩ情報２の音声生成時間と次候補に選択されたＰＯＩ情報３の音声生成時間との合計時間である音声生成合計時間よりも長くなり、図６（Ｂ）に示すように、ＰＯＩ情報３の処理順序が、既に処理順序が設定されているＰＯＩ情報４およびＰＯＩ情報２の次に設定される。 After the POI information having the longest voice generation time is selected as the next candidate among the POI information for which the processing order is not set in step S209, the process returns to step S204 as shown in FIG. Based on the selected POI information, the processing from step S204 to step S206 is performed. Here, FIG. 8 is a diagram illustrating an example of a result of setting the processing order of POI information in the example of the scene illustrated in FIG. In the scene example shown in FIG. 3, as shown in FIG. 8, even when the POI information 3 is newly set as the next candidate, the voice generation time and the processing order of the POI information 4 generated first in the voice data are already set. The total voice output time, which is the total time of the POI information 4 and the POI information 2 that have been output, is selected as the voice generation time and the next candidate of the POI information 4 and POI information 2 for which the processing order has already been set. As shown in FIG. 6B, the processing order of the POI information 3 is already set as the processing order of the POI information 3. It is set next to information 4 and POI information 2.

そして、さらにＰＯＩ情報１を次候補として、再度、ステップＳ２０４からステップＳ２０６の処理が行われ、ＰＯＩ情報１についても、ＰＯＩ情報３と同様に、処理順序が設定され、最終的に、図６（Ｃ）のように、ＰＯＩ情報の処理順序が設定される。このように、図３に示す場面例では、図８に示すように、データベース２３０から取得されたＰＯＩ情報を、ＰＯＩ情報４、ＰＯＩ情報２、ＰＯＩ情報３、ＰＯＩ情報１、ＰＯＩ情報５の処理順序に設定する。このように処理順序を設定することで、これらＰＯＩ情報に基づいて音声データを生成し、出力する場合に、音声データの生成による音声出力の待ち時間が発生しないことが分かる。 Further, the processing from step S204 to step S206 is performed again with POI information 1 as the next candidate, and the processing order is set for POI information 1 as well as POI information 3, and finally FIG. As in (C), the processing order of POI information is set. In this way, in the example of the scene shown in FIG. 3, as shown in FIG. 8, the POI information acquired from the database 230 is processed as POI information 4, POI information 2, POI information 3, POI information 1, and POI information 5. Set to order. By setting the processing order in this way, it can be seen that there is no waiting time for voice output due to the generation of voice data when voice data is generated and output based on these POI information.

続いて、図９に示す場面例における処理順序設定処理について説明する。図９は、処理順序設定処理の他の場面例を説明するための図である。図９に示す場面例において、ステップＳ１０３では、サーバコントローラ２２０の取得機能により、ユーザの現在位置の周辺に存在するスポットに関するＰＯＩ情報１１〜ＰＯＩ情報１７のうち、ユーザの現在位置から近い５つのＰＯＩ情報１１〜ＰＯＩ情報１５が、データベース２３０から取得される。 Next, the processing order setting process in the scene example shown in FIG. 9 will be described. FIG. 9 is a diagram for explaining another example of the process order setting process. In the scene example shown in FIG. 9, in step S103, the POI information 11 to POI information 17 related to spots existing around the current position of the user, among the POI information 11 to POI information 17 related to the spot existing around the current position of the user, is acquired in step S103. Information 11 to POI information 15 is acquired from the database 230.

図１０および図１１は、図９に示す場面例において、ＰＯＩ情報に基づく音声データの処理順序を設定する手法例を説明するための図である。なお、図１０および図１１に示す各図においては、図５と同様に、横軸は各ＰＯＩ情報に含まれる音声出力用のテキストデータのデータ長（バイト数）を表しており、縦軸はＰＯＩ情報に基づく音声データが生成され、出力される順序を表している。また、図１０および図１１において、処理順序が設定されたＰＯＩ情報を黒色で、また次候補に選択されたＰＯＩ情報を斜線のハッチングで表している。例えば、図１０（Ｂ）において、ＰＯＩ情報１４は処理順序が設定されており、またＰＯＩ情報１２は次候補に選択されていることを表している。ステップＳ１０３においてデータベース２３０からＰＯＩ情報を取得した時点においては、図１０（Ａ）に示すように、ＰＯＩ情報１１〜ＰＯＩ情報１５は、ユーザの現在位置から近い順序で並ぶことになる。 10 and 11 are diagrams for explaining an example technique for setting the processing order of audio data based on POI information in the scene example shown in FIG. 10 and FIG. 11, as in FIG. 5, the horizontal axis represents the data length (number of bytes) of the text data for audio output included in each POI information, and the vertical axis represents This represents the order in which audio data based on POI information is generated and output. In FIGS. 10 and 11, the POI information for which the processing order is set is represented in black, and the POI information selected as the next candidate is represented by hatching. For example, in FIG. 10B, the POI information 14 indicates that the processing order is set, and the POI information 12 is selected as the next candidate. When POI information is acquired from the database 230 in step S103, as shown in FIG. 10A, the POI information 11 to POI information 15 are arranged in the order close to the current position of the user.

そして、図１０（Ｂ）に示すように、これらＰＯＩ情報１１〜ＰＯＩ情報１５のうち、音声生成時間が最も短いＰＯＩ情報、すなわち音声出力用のテキストデータのデータ長が最も小さいＰＯＩ情報であるＰＯＩ情報１４が、最初に音声データが生成され、出力されるＰＯＩ情報として設定され（ステップＳ２０１）、続いて、処理順序が設定されていないＰＯＩ情報が音声出力用のテキストデータのデータ長の降順に並び替えられ（ステップＳ２０２）、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も長いＰＯＩ情報であるＰＯＩ情報１２が次候補として設定される（ステップＳ２０３）。 10B, among these POI information 11 to POI information 15, POI information having the shortest voice generation time, that is, POI information having the shortest data length of voice output text data. The information 14 is first set as POI information to be generated and output as voice data (step S201). Subsequently, the POI information for which the processing order is not set is in descending order of the data length of the text data for voice output. The POI information 12 that is the POI information with the longest voice generation time is set as the next candidate among the POI information that has been rearranged (step S202) and the processing order is not set (step S203).

図１２は、図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。図１０（Ｂ）に示すように、ＰＯＩ情報１４について処理順序が設定され、ＰＯＩ情報１２が次候補として設定されている場合、図１２に示すように、ＰＯＩ情報１４の音声生成時間とＰＯＩ情報１４の音声出力時間との合計時間である音声出力合計時間と、ＰＯＩ情報１４の音声生成時間とＰＯＩ情報１２の音声生成時間との合計時間である音声生成合計時間とが算出される（ステップＳ２０４、ステップＳ２０５）。そして、算出された音声出力合計時間と音声生成合計時間とが比較され、音声出力合計時間が音声生成合計時間よりも長いか判断される（ステップＳ２０６）。ここで、図１２に示すように、音声出力合計時間は音声生成合計時間よりも短いため（ステップＳ２０６＝ＮＯ）、図３に示す場面例とは異なり、ステップＳ２１０に進む。 FIG. 12 is a diagram for explaining the relationship between the total voice generation time and the total voice output time in the scene example shown in FIG. As shown in FIG. 10B, when the processing order is set for the POI information 14 and the POI information 12 is set as the next candidate, the voice generation time of the POI information 14 and the POI information are shown in FIG. The total voice output time that is the total time of the voice output time of 14 and the total voice generation time that is the total time of the voice generation time of the POI information 14 and the voice generation time of the POI information 12 are calculated (step S204). Step S205). Then, the calculated total voice output time is compared with the total voice generation time, and it is determined whether the total voice output time is longer than the total voice generation time (step S206). Here, as shown in FIG. 12, since the total audio output time is shorter than the total audio generation time (step S206 = NO), the process proceeds to step S210 unlike the scene example shown in FIG.

ステップＳ２１０では、サーバコントローラ２２０の設定機能により、処理順序が設定されていない全てのＰＯＩ情報が次候補として選択されたか判断される。処理順序が設定されていない全てのＰＯＩ情報が次候補として選択された場合は、処理順序が設定されていないＰＯＩ情報の中に、ステップＳ２０６の条件を満たすＰＯＩ情報がないものと判断される。処理順序が設定されていない全てのＰＯＩ情報が次候補として選択されたと判断された場合には、ステップＳ２１１に進み、一方、処理順序が設定されていないＰＯＩ情報の中に、次候補として選択されていないＰＯＩ情報があると判断された場合には、ステップＳ２１２に進む。 In step S210, it is determined by the setting function of the server controller 220 whether all POI information for which the processing order is not set has been selected as the next candidate. If all the POI information for which the processing order is not set is selected as the next candidate, it is determined that there is no POI information satisfying the condition of step S206 in the POI information for which the processing order is not set. If it is determined that all the POI information for which the processing order is not set is selected as the next candidate, the process proceeds to step S211, while the POI information for which the processing order is not set is selected as the next candidate. If it is determined that there is not POI information, the process proceeds to step S212.

図９に示す場面例において、図１０（Ｂ）に示す時点においては、処理順序が設定されていないＰＯＩ情報のうち、次候補として選択されたＰＯＩ情報はＰＯＩ情報１２だけであるため、処理順序が設定されていないＰＯＩ情報の中に、次候補として選択されていないＰＯＩ情報があると判断され、ステップＳ２１２に進む。 In the scene example shown in FIG. 9, at the time shown in FIG. 10B, the POI information selected as the next candidate among the POI information for which the processing order is not set is only the POI information 12. It is determined that there is POI information that is not selected as the next candidate in the POI information for which is not set, and the process proceeds to step S212.

ステップＳ２１２では、処理順序が設定されていないＰＯＩ情報のうち、現在、次候補として選択されているＰＯＩ情報の次に音声生成時間が長いＰＯＩ情報が、新たな次候補として選択される。図９に示す場面例では、図１０（Ｂ）に示すように、ＰＯＩ情報１２が次候補として選択されている。そこで、ステップＳ２１２では、図１０（Ｃ）に示すように、現在次候補として選択されているＰＯＩ情報１２の次に音声生成時間が長いＰＯＩ情報１３を、新たな次候補として設定する。 In step S212, among the POI information for which the processing order is not set, POI information having the longest voice generation time after the POI information currently selected as the next candidate is selected as a new next candidate. In the scene example shown in FIG. 9, as shown in FIG. 10B, POI information 12 is selected as the next candidate. Therefore, in step S212, as shown in FIG. 10C, POI information 13 having the longest voice generation time after POI information 12 currently selected as the next candidate is set as a new next candidate.

ここで、図４に示すように、ステップＳ２１２の処理後は、再度、ステップＳ２０４に戻る。そのため、図９に示す場面例では、図１０（Ｃ）に示すように、ＰＯＩ情報１３を次候補として選択した状態で、ステップＳ２０４からステップＳ２０６の処理が再度行われる。図１３は、図９に示す場面例において、ＰＯＩ情報１１、ＰＯＩ情報１３、およびＰＯＩ情報１５を次候補とした場合の音声生成時間と音声出力時間との関係を説明するための図である。図１３に示すように、ＰＯＩ情報１３が次候補に選択されている場合であっても、ＰＯＩ情報１４の音声生成時間とＰＯＩ情報１４の音声出力時間との合計時間である音声出力合計時間は、ＰＯＩ情報１４の音声生成時間と次候補に選択されたＰＯＩ情報１３の音声生成時間との合計時間である音声生成合計時間１よりも短い（ステップＳ２０６＝ＮＯ）。 Here, as shown in FIG. 4, after the process of step S212, the process returns to step S204 again. Therefore, in the scene example shown in FIG. 9, the processing from step S204 to step S206 is performed again with the POI information 13 selected as the next candidate, as shown in FIG. 10C. FIG. 13 is a diagram for explaining the relationship between the voice generation time and the voice output time when the POI information 11, the POI information 13, and the POI information 15 are the next candidates in the scene example shown in FIG. As shown in FIG. 13, even when the POI information 13 is selected as the next candidate, the total audio output time that is the total time of the audio generation time of the POI information 14 and the audio output time of the POI information 14 is The total voice generation time 1 which is the total time of the voice generation time of the POI information 14 and the voice generation time of the POI information 13 selected as the next candidate is shorter (step S206 = NO).

そのため、処理順序が設定されていないＰＯＩ情報のうち、次候補に選択されているＰＯＩ情報１３の次に音声生成時間が長いＰＯＩ情報１１が、新たな次候補として選択される（ステップＳ２１２）。しかしながら、図１３に示すように、ＰＯＩ情報１１が次候補に選択されている場合であっても、ＰＯＩ情報１４の音声生成時間とＰＯＩ情報１４の音声出力時間との合計時間である音声出力合計時間は、ＰＯＩ情報１４の音声生成時間と次候補に選択されたＰＯＩ情報１１の音声生成時間との合計時間である音声生成合計時間２よりも短い（ステップＳ２０６＝ＮＯ）。そこで、さらに、処理順序が設定されていないＰＯＩ情報のうち、次候補に選択されているＰＯＩ情報１１の次に音声生成時間が長いＰＯＩ情報１５が、新たな次候補として選択される（ステップＳ２１２）。 Therefore, among the POI information for which the processing order is not set, the POI information 11 having the longest voice generation time after the POI information 13 selected as the next candidate is selected as a new next candidate (step S212). However, as shown in FIG. 13, even when the POI information 11 is selected as the next candidate, the audio output total that is the total time of the audio generation time of the POI information 14 and the audio output time of the POI information 14 The time is shorter than the total voice generation time 2 that is the total time of the voice generation time of the POI information 14 and the voice generation time of the POI information 11 selected as the next candidate (step S206 = NO). Therefore, among the POI information for which the processing order is not set, the POI information 15 having the longest voice generation time after the POI information 11 selected as the next candidate is selected as a new next candidate (step S212). ).

図１３に示すように、ＰＯＩ情報１５が次候補に選択されている場合には、ＰＯＩ情報１４の音声生成時間とＰＯＩ情報１４の音声出力時間との合計時間である音声出力合計時間は、ＰＯＩ情報１４の音声生成時間と次候補に選択されたＰＯＩ情報１５の音声生成時間との合計時間である音声生成合計時間３よりも長くなる。そのため、ステップＳ２０７に進み、次候補に選択されているＰＯＩ情報１５の処理順序が設定される。すなわち、図１１（Ａ）に示すように、ＰＯＩ情報１５の処理順序が、ＰＯＩ情報１４の次に設定される。 As shown in FIG. 13, when the POI information 15 is selected as the next candidate, the total voice output time, which is the total time of the voice generation time of the POI information 14 and the voice output time of the POI information 14, is POI. It is longer than the total voice generation time 3, which is the total time of the voice generation time of the information 14 and the voice generation time of the POI information 15 selected as the next candidate. Therefore, the process proceeds to step S207, and the processing order of the POI information 15 selected as the next candidate is set. That is, as shown in FIG. 11A, the processing order of the POI information 15 is set next to the POI information 14.

ステップＳ２０７で、ＰＯＩ情報１５の処理順序が設定された後は、ステップＳ２０８に進み、全てのＰＯＩ情報について処理順序が設定されたか判断される。図９に示す場面例において、図１１（Ａ）に示す時点においては、まだ、ＰＯＩ情報１２、ＰＯＩ情報１３、およびＰＯＩ情報１１について、処理順序が設定されていないため、ステップＳ２０９に進み、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も長いＰＯＩ情報１２が、次候補として設定される。 After the processing order of the POI information 15 is set in step S207, the process proceeds to step S208, and it is determined whether the processing order has been set for all POI information. In the scene example shown in FIG. 9, at the time shown in FIG. 11A, the processing order is not yet set for the POI information 12, the POI information 13, and the POI information 11, and thus the process proceeds to step S209. Of the POI information for which the order is not set, the POI information 12 having the longest voice generation time is set as the next candidate.

図１４に示すように、ＰＯＩ情報１２が次候補に選択されている場合、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間と、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声出力時間の合計時間である音声出力合計時間は、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声生成時間と次候補に選択されたＰＯＩ情報１２の音声生成時間との合計時間である音声生成合計時間４よりも短い（ステップＳ２０６＝ＮＯ）。そのため、処理順序が設定されていないＰＯＩ情報のうち、次候補に選択されているＰＯＩ情報１２の次に音声生成時間が長いＰＯＩ情報１３が新たな次候補として選択される（ステップＳ２１２）。 As shown in FIG. 14, when the POI information 12 is selected as the next candidate, the voice generation time of the POI information 14 that is first generated and outputted, and the POI information for which the processing order has already been set 14 and the POI information 15 is the sum of the voice output times of the POI information 14 and the POI information 15 for which the processing order has already been set and the POI information 12 selected as the next candidate. It is shorter than the total voice generation time 4, which is the total time with the voice generation time (step S206 = NO). Therefore, among the POI information for which the processing order is not set, the POI information 13 having the longest voice generation time after the POI information 12 selected as the next candidate is selected as a new next candidate (step S212).

しかし、図１４に示すように、次候補がＰＯＩ情報１３の場合であっても、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間と、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声出力時間の合計時間である音声出力合計時間は、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声生成時間と次候補に選択されたＰＯＩ情報１３の音声生成時間との合計時間である音声生成合計時間５よりも短い（ステップＳ２０６＝ＮＯ）。そのため、処理順序が設定されていないＰＯＩ情報のうち、次候補に選択されているＰＯＩ情報１３の次に音声生成時間が長いＰＯＩ情報１１が新たな次候補として選択される（ステップＳ２１２）。 However, as shown in FIG. 14, even when the next candidate is the POI information 13, the voice data is first generated and the voice generation time of the output POI information 14 and the processing order are already set. The total voice output time that is the total voice output time of the POI information 14 and the POI information 15 is the voice generation time of the POI information 14 and the POI information 15 for which the processing order has already been set and the POI information selected as the next candidate. It is shorter than the total voice generation time 5, which is the total time of 13 voice generation times (step S206 = NO). Therefore, among the POI information for which the processing order is not set, the POI information 11 having the longest voice generation time after the POI information 13 selected as the next candidate is selected as a new next candidate (step S212).

しかしながら、次候補としてＰＯＩ情報１１が選択されている場合であっても、ＰＯＩ情報１２またはＰＯＩ情報１３が次候補に選択されている場合と同様に、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間と、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声出力時間の合計時間である音声出力合計時間は、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の音声生成時間と次候補に選択されたＰＯＩ情報１１の音声生成時間との合計時間である音声生成合計時間６よりも短い（ステップＳ２０６＝ＮＯ）。そのため、図４に示すようにステップＳ２１０に進む。ここで、ＰＯＩ情報１１が次候補として選択されることで、処理順序が設定されていない全てのＰＯＩ情報について次候補が選択されたこととなるため、ステップＳ２１０において、処理順序が設定されていない全てのＰＯＩ情報が次候補として選択されたと判断され、ステップＳ２１１に進む。 However, even when the POI information 11 is selected as the next candidate, the voice data is first generated and output as in the case where the POI information 12 or the POI information 13 is selected as the next candidate. The voice output total time which is the total time of the voice generation time of the POI information 14 and the voice output time of the POI information 14 and the POI information 15 for which the processing order has already been set is the POI information 14 for which the processing order has already been set. And the voice generation total time 6 which is the total time of the voice generation time of the POI information 15 and the voice generation time of the POI information 11 selected as the next candidate (step S206 = NO). Therefore, the process proceeds to step S210 as shown in FIG. Here, since the POI information 11 is selected as the next candidate, the next candidate is selected for all the POI information for which the processing order is not set. Therefore, in step S210, the processing order is not set. It is determined that all POI information has been selected as the next candidate, and the process proceeds to step S211.

ステップＳ２１１では、設定機能により、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も短いＰＯＩ情報の処理順序が、既に処理順序が設定されているＰＯＩ情報の次に設定される。例えば、図９に示す場面例において、図１１（Ａ）に示すように、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も短いＰＯＩ情報であるＰＯＩ情報１１の処理順序が、図１１（Ｂ）に示すように、既に処理順序が設定されているＰＯＩ情報１４およびＰＯＩ情報１５の次に設定される。 In step S211, the setting function sets the processing order of the POI information having the shortest voice generation time among the POI information for which the processing order is not set, next to the POI information for which the processing order has already been set. For example, in the scene example shown in FIG. 9, as shown in FIG. 11A, the processing order of the POI information 11 that is the POI information with the shortest voice generation time among the POI information for which the processing order is not set is As shown in FIG. 11B, it is set next to the POI information 14 and the POI information 15 for which the processing order has already been set.

図１５は、図９に示す場面例における音声生成合計時間および音声出力合計時間の関係を説明するための図である。図１１（Ｂ）に示すように、ＰＯＩ情報１１の処理順序が設定された後は、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も長いＰＯＩ情報１２が、次候補として設定される（ステップＳ２０９）。そして、ステップ２０６において、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間、および、既に処理順序が設定されているＰＯＩ情報１４、ＰＯＩ情報１５、ＰＯＩ情報１１の音声出力時間との合計時間である音声出力合計時間と、既に処理順序が設定されているＰＯＩ情報１４、ＰＯＩ情報１５、ＰＯＩ情報１２の音声生成時間、および、次候補に選択されたＰＯＩ情報１２の音声生成時間との合計時間である音声生成合計時間とが比較され、音声出力合計時間が音声生成合計時間よりも長くなるか判断される。 FIG. 15 is a diagram for explaining the relationship between the total voice generation time and the total voice output time in the scene example shown in FIG. As shown in FIG. 11B, after the processing order of the POI information 11 is set, the POI information 12 with the longest voice generation time is set as the next candidate among the POI information for which the processing order is not set. (Step S209). In step 206, the voice generation time of the POI information 14 that is first generated and output, and the voice output time of the POI information 14, the POI information 15, and the POI information 11 for which the processing order has already been set. And the voice generation time of the POI information 14, the POI information 15, and the POI information 12 for which the processing order has already been set, and the voice generation of the POI information 12 selected as the next candidate The total voice generation time, which is the total time with the time, is compared to determine whether the total voice output time is longer than the total voice generation time.

ここで、図９に示す場面例では、ＰＯＩ情報１５の処理順序が設定された後に、次候補として選択されたＰＯＩ情報１１からＰＯＩ情報１３までの中に、ステップＳ２０６の関係を満たすＰＯＩ情報が存在しないため、ＰＯＩ情報１５に基づく音声データの出力後に、ＰＯＩ情報１５に基づく音声データの終了後からＰＯＩ情報１５の次に生成されるＰＯＩ情報１１の音声データが生成されるまでの時間が出力停止時間として発生する。このように、出力停止時間が生じる場合には、出力停止時間を考慮して音声出力合計時間を算出する。例えば、図１５に示す例では、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間、既に処理順序が設定されているＰＯＩ情報１４、ＰＯＩ情報１５、ＰＯＩ情報１１の音声出力時間、および出力停止時間の合計時間が音声出力合計時間として算出される。 Here, in the scene example shown in FIG. 9, after the processing order of the POI information 15 is set, the POI information satisfying the relationship of step S206 is included in the POI information 11 to POI information 13 selected as the next candidate. Since it does not exist, after the audio data based on the POI information 15 is output, the time from the end of the audio data based on the POI information 15 to the generation of the audio data of the POI information 11 generated next to the POI information 15 is output. Occurs as stop time. Thus, when the output stop time occurs, the audio output total time is calculated in consideration of the output stop time. For example, in the example shown in FIG. 15, voice data is first generated and the voice generation time of the POI information 14 to be output and the voice output of the POI information 14, the POI information 15, and the POI information 11 whose processing order has already been set. The total time of the time and the output stop time is calculated as the total audio output time.

図１５に示すように、最初に音声データが生成され、出力されるＰＯＩ情報１４の音声生成時間、既に処理順序が設定されているＰＯＩ情報１４、ＰＯＩ情報１５、ＰＯＩ情報１２の音声出力時間、および出力停止時間の合計時間である音声出力合計時間は、既に処理順序が設定されているＰＯＩ情報１４、ＰＯＩ情報１５、ＰＯＩ情報１１の音声生成時間、および次候補に選択されたＰＯＩ情報１２の音声生成時間との合計時間である音声生成合計時間よりも長いため（ステップＳ２０６＝ＹＥＳ）、ステップＳ２０７に進み、ＰＯＩ情報１２の処理順序が設定される。そして、残ったＰＯＩ情報１３についても、ＰＯＩ情報１２と同様に、処理順序が設定される。これにより、ステップＳ２０８で、全てのＰＯＩ情報について処理順序が設定されたと判断され、処理順序設定処理が終了する。 As shown in FIG. 15, the voice generation time of the POI information 14 that is first generated and output, the voice output time of the POI information 14, the POI information 15, and the POI information 12 for which the processing order has already been set, The voice output total time, which is the total time of the output stop time, is the voice generation time of the POI information 14, POI information 15, POI information 11 for which the processing order has already been set, and the POI information 12 selected as the next candidate. Since it is longer than the total voice generation time, which is the total time with the voice generation time (step S206 = YES), the process proceeds to step S207, and the processing order of the POI information 12 is set. As for the remaining POI information 13, the processing order is set similarly to the POI information 12. As a result, in step S208, it is determined that the processing order has been set for all the POI information, and the processing order setting process ends.

図１６は、図９に示す場面例において、ＰＯＩ情報の処理順序を設定した結果の一例を示す図である。サーバコントローラ２２０は、図９に示すＰＯＩ情報をデータベース２３０から取得し、図１６に示すように、処理順序を設定する。図１６に示すように、図９に示す場面例では、ステップＳ２０６の関係を満たすＰＯＩ情報が存在せずに、音声データの生成により音声出力が一時的に停止する出力停止時間が発生するが、処理順序が設定されていないＰＯＩ情報のうち音声生成時間が最も短いＰＯＩ情報の処理順序を、処理順序が既に設定されたＰＯＩ情報の次に設定することで、出力停止時間の増大を防止できる。 FIG. 16 is a diagram illustrating an example of a result of setting the processing order of POI information in the example of the scene illustrated in FIG. The server controller 220 acquires the POI information shown in FIG. 9 from the database 230, and sets the processing order as shown in FIG. As shown in FIG. 16, in the example of the scene shown in FIG. 9, there is no POI information that satisfies the relationship of step S <b> 206, and an output stop time occurs when the audio output is temporarily stopped due to the generation of the audio data. By setting the processing order of the POI information with the shortest voice generation time among the POI information for which the processing order is not set next to the POI information for which the processing order has already been set, it is possible to prevent the output stop time from increasing.

以上のように、ステップＳ１０４の処理順序設定処理は行われる。 As described above, the processing order setting process in step S104 is performed.

続いて、図２に戻り、ステップＳ１０５では、ステップＳ１０４で処理順序が設定されたＰＯＩ情報が、外部サーバ２００から車載装置１００に送信される。車載装置１００の車載装置用コントローラ１３０の取得機能は、車載装置１００の通信装置１１０を介して、外部サーバ２００から送信される複数のＰＯＩ情報を取得する。 Subsequently, returning to FIG. 2, in step S105, the POI information whose processing order is set in step S104 is transmitted from the external server 200 to the in-vehicle device 100. The acquisition function of the in-vehicle device controller 130 of the in-vehicle device 100 acquires a plurality of POI information transmitted from the external server 200 via the communication device 110 of the in-vehicle device 100.

ステップＳ１０６では、車載装置用コントローラ１３０の生成機能により、ステップＳ１０４で設定されたＰＯＩ情報の処理順序に従い、ＰＯＩ情報に含まれる音声出力用のテキストデータに基づいて音声データの生成が行われ、ＰＯＩ情報に基づく音声データが連続的に生成される。 In step S106, the generation function of the in-vehicle device controller 130 generates voice data based on the text data for voice output included in the POI information in accordance with the processing order of the POI information set in step S104. Audio data based on information is continuously generated.

ステップＳ１０７では、ステップＳ１０４で設定されたＰＯＩ情報の処理順序に従い、ステップＳ１０６で生成されたＰＯＩ情報に基づく音声データが、車載装置用コントローラ１３０からスピーカ１５０に送信され、スピーカ１５０において、ＰＯＩ情報に基づく音声データが音声としてユーザに出力される。また、車載装置用コントローラ１３０は、外部サーバ２００から取得したＰＯＩ情報を、ディスプレイ１６０に送信し、ディスプレイ１６０の画面上にＰＯＩ情報を表示する。なお、ステップＳ１０７の処理は、ステップＳ１０６と並行して行われる。すなわち、ステップＳ１０６では、ＰＯＩ情報の処理順序に従って、ＰＯＩ情報に基づく音声データを連続的に生成するとともに、ステップＳ１０７において、ＰＯＩ情報の処理順序に従って、生成された音声データを音声として連続的に出力する。 In step S107, the voice data based on the POI information generated in step S106 is transmitted from the in-vehicle device controller 130 to the speaker 150 in accordance with the processing order of the POI information set in step S104. The voice data based is output to the user as voice. The in-vehicle device controller 130 transmits the POI information acquired from the external server 200 to the display 160, and displays the POI information on the screen of the display 160. Note that the process of step S107 is performed in parallel with step S106. That is, in step S106, voice data based on POI information is continuously generated according to the POI information processing order, and in step S107, the generated voice data is continuously output as voice according to the POI information processing order. To do.

そして、ステップＳ１０８では、全てのＰＯＩ情報に基づく音声データの音声出力が行われたか判断される。全てのＰＯＩ情報に基づく音声出力が行われていないと判断された場合は、ステップＳ１０６に戻り、ＰＯＩ情報に基づく音声データの生成と、ＰＯＩ情報に基づいて生成された音声データの音声出力が継続される。一方、全てのＰＯＩ情報に基づく音声データの音声出力が行われたと判断された場合は、本実施形態の情報提示処理を終了する。 In step S108, it is determined whether audio output of audio data based on all POI information has been performed. If it is determined that the voice output based on all the POI information is not performed, the process returns to step S106, and the generation of the voice data based on the POI information and the voice output of the voice data generated based on the POI information are continued. Is done. On the other hand, when it is determined that the voice output of the voice data based on all the POI information has been performed, the information presentation process of the present embodiment is terminated.

以上のように本実施形態によれば、既に処理順序が設定されている全てのＰＯＩ情報の音声生成時間および次候補に選択されているＰＯＩ情報の音声生成時間との合計時間である音声生成合計時間と、既に音声データの処理順序が設定されている全てのＰＯＩ情報の音声出力時間および最初に音声データが生成され、出力されるＰＯＩ情報の音声生成時間との合計時間である音声出力合計時間とを比較し、音声出力合計時間が音声生成合計時間よりも長くなるかを判断する。すなわち、既に処理順序が設定されているＰＯＩ情報に基づく音声データの音声出力が終了する前に、次候補に選択されているＰＯＩ情報に基づいて音声データを生成することができるかを判断する。そして、音声出力合計時間が音声生成合計時間よりも長くなる場合には、次候補に選択されているＰＯＩ情報の処理順序を、既に処理順序が設定されているＰＯＩ情報の次に設定する。これにより、ＰＯＩ情報に基づいて既に生成された音声データの音声出力が終了する前に、次に音声出力すべき音声データの生成を終了することができ、音声データの生成による音声出力の停止を防ぎ、ユーザの待ち時間の発生を有効に防止することができる。 As described above, according to the present embodiment, the total voice generation is the total time of the voice generation time of all the POI information for which the processing order has already been set and the voice generation time of the POI information selected as the next candidate. Total audio output time, which is the total time of the time and the audio output time of all POI information for which the audio data processing order has already been set and the audio generation time of the POI information that is first generated and output To determine whether the total audio output time is longer than the total audio generation time. That is, before the voice output of the voice data based on the POI information for which the processing order has already been set ends, it is determined whether the voice data can be generated based on the POI information selected as the next candidate. If the total voice output time is longer than the total voice generation time, the processing order of the POI information selected as the next candidate is set next to the POI information whose processing order has already been set. Thereby, before the voice output of the voice data already generated based on the POI information is finished, the generation of the voice data to be outputted next can be finished, and the voice output is stopped by the generation of the voice data. It is possible to prevent the occurrence of waiting time of the user effectively.

また、本実施形態によれば、データベース２３０から取得された複数のＰＯＩ情報のうち、音声生成時間が最も短いＰＯＩ情報を、最初に音声データを生成し、出力するためのＰＯＩ情報として設定する。最初の音声データが生成されるまでは、音声出力は全く開始されないため、最初の音声データが生成されまでの時間がユーザの待ち時間となる。そこで、本実施形態では、最初に音声データを生成し、出力するためのＰＯＩ情報を、音声生成時間が最も短いＰＯＩ情報とすることで、音声出力が開始されるまでの時間を短縮することができ、ユーザの待ち時間の増大を有効に防止することができる。 Further, according to the present embodiment, among the plurality of POI information acquired from the database 230, POI information having the shortest voice generation time is set as POI information for generating and outputting voice data first. Until the first audio data is generated, the audio output is not started at all. Therefore, the time until the first audio data is generated becomes the waiting time of the user. Therefore, in this embodiment, the POI information for generating and outputting voice data first is the POI information having the shortest voice generation time, thereby shortening the time until voice output is started. It is possible to effectively prevent an increase in the waiting time of the user.

さらに、本実施形態によれば、処理順序が設定されていないＰＯＩ情報の中に、ステップＳ２０６の関係を満たすＰＯＩ情報が存在しない場合には、処理順序が設定されていないＰＯＩ情報のうち、音声生成時間が最も短いＰＯＩ情報の処理順序を、既に処理順序が設定されているＰＯＩ情報の次に設定する。例えば、図１５または図１６に示すように、処理順序が既に設定されたＰＯＩ情報に基づく音声データの音声出力が終了する前に、音声データを生成できるＰＯＩ情報がない場合には、処理順序が既に設定されたＰＯＩ情報に基づく音声データの音声出力後、次に音声出力される音声データが生成されるまでに、音声データの生成により音声出力が一時的に停止する出力停止時間が発生してしまう。しかしながら、処理順序が設定されていないＰＯＩ情報のうち音声生成時間が最も短いＰＯＩ情報の処理順序を、処理順序が既に設定されたＰＯＩ情報の次に設定することで、出力停止時間の増大を有効に防止することができる。 Furthermore, according to the present embodiment, if there is no POI information satisfying the relationship of step S206 in the POI information for which the processing order is not set, among the POI information for which the processing order is not set, the voice The processing order of the POI information with the shortest generation time is set next to the POI information for which the processing order has already been set. For example, as shown in FIG. 15 or FIG. 16, if there is no POI information that can generate voice data before the voice output of voice data based on the POI information for which the processing order has already been set, the processing order is changed. After the voice data is output based on the POI information that has already been set, the output stop time occurs when the voice output is temporarily stopped due to the generation of the voice data until the next voice data to be output is generated. End up. However, increasing the output stop time is effective by setting the processing order of the POI information with the shortest voice generation time among the POI information for which the processing order is not set next to the POI information for which the processing order has already been set. Can be prevented.

以上説明した実施形態は、本発明の理解を容易にするために記載されたものであって、本発明を限定するために記載されたものではない。したがって、上記の実施形態に開示された各要素は、本発明の技術的範囲に属する全ての設計変更や均等物をも含む趣旨である。 The embodiment described above is described for facilitating understanding of the present invention, and is not described for limiting the present invention. Therefore, each element disclosed in the above embodiment is intended to include all design changes and equivalents belonging to the technical scope of the present invention.

例えば、本実施形態では、車載装置１００は、外部サーバ２００と通信して、処理順序が設定されたＰＯＩ情報を取得しているが、これに限定されず、例えば、車載装置１００は、外部サーバ２００と通信することなく、車載装置１００が備えるデータベースを参照して、ＰＯＩ情報を取得し、車載装置１００の車載装置用コントローラ１３０で、ＰＯＩ情報の処理順序を設定する処理を行う構成としてもよい。または、車載装置１００は、外部サーバ２００から処理順序が設定されていないＰＯＩ情報を受信し、車載装置１００の車載装置用コントローラ１３０で、受信したＰＯＩ情報の処理順序を設定する構成としてもよい。 For example, in the present embodiment, the in-vehicle device 100 communicates with the external server 200 to acquire the POI information in which the processing order is set. However, the present invention is not limited to this. For example, the in-vehicle device 100 includes the external server It is good also as a structure which refers to the database with which the vehicle equipment 100 is provided, without communicating with 200, acquires POI information, and performs the process which sets the processing order of POI information with the controller 130 for vehicle equipment of the vehicle equipment 100. . Alternatively, the in-vehicle device 100 may receive POI information for which the processing order is not set from the external server 200, and the in-vehicle device controller 130 of the in-vehicle device 100 may set the processing order of the received POI information.

さらに、外部サーバ２００のサーバコントローラ２２０により、ＰＯＩ情報の処理順序が設定され、車載装置１００は、サーバコントローラ２２０で処理順序が設定されたＰＯＩ情報を、処理順序が設定されたタイミングで、外部サーバ２００から順次ダウンロードする構成としてもよい。この場合、音声生成時間は、外部コントローラ２００から処理順序が設定されたＰＯＩ情報をダウンロードするための時間と、取得されたＰＯＩ情報に基づいて音声データを生成するのに要する時間との合計時間として算出することが好適である。 Furthermore, the POI information processing order is set by the server controller 220 of the external server 200, and the in-vehicle device 100 converts the POI information whose processing order is set by the server controller 220 to the external server at the timing when the processing order is set. The configuration may be such that the files are sequentially downloaded from 200. In this case, the voice generation time is a total time of the time for downloading the POI information for which the processing order is set from the external controller 200 and the time required for generating the voice data based on the acquired POI information. It is preferable to calculate.

さらに、本実施形態では、ＰＯＩ情報に基づく音声データの処理順序を設定する例について説明したが、処理順序が設定される情報はＰＯＩ情報に限られず、例えば、ニュースやメールなどの情報であってもよい。 Furthermore, in the present embodiment, an example of setting the processing order of audio data based on POI information has been described. However, the information for which the processing order is set is not limited to POI information, for example, information such as news and mail. Also good.

また、本実施形態では、音声出力合計時間が音声生成合計時間よりも長い場合（ステップＳ２０６＝ＹＥＳ）に、次候補に選択されているＰＯＩ情報を、既に音声データが生成されたＰＯＩ情報の次に音声出力するＰＯＩ情報として設定しているが、これに限定されず、例えば、音声出力合計時間が音声生成合計時間よりも所定時間だけ短い場合にも、次候補に選択されているＰＯＩ情報を、既に音声データが生成されたＰＯＩ情報の次に音声出力するＰＯＩ情報として設定してよい。音声出力合計時間が音声生成合計時間よりも短い場合でも、音声出力合計時間と音声生成合計時間との差がわずかな時間であれば、ユーザは発生した待ち時間を認識せず、または発生した待ち時間により不快感を受けることがないものと想定されるためである。 In this embodiment, when the total voice output time is longer than the total voice generation time (step S206 = YES), the POI information selected as the next candidate is changed to the next POI information for which voice data has already been generated. However, the present invention is not limited to this. For example, when the total voice output time is shorter than the total voice generation time by a predetermined time, the POI information selected as the next candidate is set as the POI information. Alternatively, it may be set as POI information to be output as a voice next to the POI information for which voice data has already been generated. Even if the total audio output time is shorter than the total audio generation time, if the difference between the total audio output time and the total audio generation time is small, the user does not recognize the waiting time that has occurred or has waited for it to occur. This is because it is assumed that there will be no discomfort due to time.

さらに、本実施形態では、ステップＳ２０２において、ＰＯＩ情報を、音声出力用のテキストデータのデータ長の降順に並べ、ステップＳ２０６の関係を満たさない場合に、ステップＳ２１２において、次候補に選択されているＰＯＩ情報の次に音声生成時間の長いＰＯＩ情報が、新たな次候補として選択されるが、これに限定されるものではなく、例えば、ステップＳ２０２において、ＰＯＩ情報を、ユーザの現在位置から近い順に並べ、またステップＳ２０６の関係を満たさない場合には、ステップＳ２１２において、次候補に選択されているＰＯＩ情報の次にユーザの現在位置から近い位置に存在するスポットに関するＰＯＩ情報を新たな次候補として選択してもよい。 Furthermore, in this embodiment, when the POI information is arranged in descending order of the data length of the text data for voice output in step S202 and the relationship in step S206 is not satisfied, it is selected as the next candidate in step S212. The POI information having the longest voice generation time after the POI information is selected as a new next candidate. However, the present invention is not limited to this. For example, in step S202, the POI information is sorted in order from the current position of the user. If not, the POI information related to the spot that is located next to the current position of the user next to the POI information selected as the next candidate is set as a new next candidate in step S212. You may choose.

なお、上述した実施形態の外部サーバ２００のサーバコントローラ２２０の取得機能は本発明の取得手段に、同じくサーバコントローラ２２０の生成時間算出機能は本発明の生成時間算出手段に、同じくサーバコントローラ２２０の出力時間算出機能は本発明の音声出力時間算出手段に、同じくサーバコントローラ２２０の設定機能は本発明の設定手段に、車載装置１００の車載装置用コントローラ１３０の生成機能は本発明の生成手段に、車載装置１００のスピーカ１５０は本発明の出力手段にそれぞれ相当する。 Note that the acquisition function of the server controller 220 of the external server 200 of the above-described embodiment is the same as the acquisition means of the present invention, the generation time calculation function of the server controller 220 is the same as the generation time calculation means of the present invention, and the output of the server controller 220 is also the same. The time calculation function is in the voice output time calculation means of the present invention, the setting function of the server controller 220 is in the setting means of the present invention, and the generation function of the in-vehicle device controller 130 of the in-vehicle device 100 is in the in-vehicle apparatus. The speaker 150 of the apparatus 100 corresponds to the output means of the present invention.

１００…車載装置
１１０…通信装置
１２０…入力装置
１３０…車載装置用コントローラ
１４０…ＧＰＳユニット
１５０…スピーカ
１６０…ディスプレイ
２００…外部サーバ
２１０…通信装置
２２０…サーバコントローラ
２３０…データベース DESCRIPTION OF SYMBOLS 100 ... In-vehicle apparatus 110 ... Communication apparatus 120 ... Input apparatus 130 ... In-vehicle apparatus controller 140 ... GPS unit 150 ... Speaker 160 ... Display 200 ... External server 210 ... Communication apparatus 220 ... Server controller 230 ... Database

Claims

Obtaining means for obtaining a plurality of information to be presented to the user;
Generation time calculating means for calculating a generation time which is a time required for generating the voice data from the information;
Voice output time calculating means for calculating a voice output time which is a time required for voice output of the voice data generated based on the information;
Based on the voice output time calculated by the voice output time calculation means and the generation time calculated by the generation time calculation means, the voice data is sequentially generated from the information, and the generated voice data An information presentation system having setting means for setting a processing order for sequentially outputting
When setting the processing order, the setting means sets the information that can generate the audio data before the generation and output of the audio data for the information for which the processing order has already been set is completed. An information presentation system characterized in that audio data is generated following information whose order has already been set and set as information for output.

The information presentation system according to claim 1,
When the setting means sets the processing order, there is a plurality of pieces of information that can generate the audio data before the generation and output of the audio data for the information for which the processing order has already been set is completed. The information having the longest generation time among a plurality of pieces of information capable of generating the audio data is set as information for generating and outputting the audio data following the information for which the processing order has already been set. Characteristic information presentation system.

The information presentation system according to claim 1 or 2,
The setting means, when setting the processing order, if there is no information for which the processing order has already been set, from the plurality of information, the information having the shortest generation time, first generating voice data, An information presentation system characterized by being set as information for output.

The information presentation system according to any one of claims 1 to 3,
When the setting means sets the processing order, there is no information that can generate the voice data before the generation and output of the voice data for the information for which the processing order has already been set is completed. The information presenting system is characterized in that the information with the shortest generation time is set as information for generating and outputting audio data following the information for which the processing order has already been set.

An information presentation system according to any one of claims 1 to 4,
The voice output time calculating means extracts text information for voice output from the information, and calculates the voice output time based on an information amount of the extracted text information for voice output. Presentation system.

An information presentation system according to any one of claims 1 to 5,
The information presentation system, wherein the generation time calculation means calculates the generation time based on an information amount of the extracted text information for voice output.

The information presentation system according to any one of claims 1 to 6,
An information presentation system, further comprising: generating means for generating the audio data based on the information in accordance with the processing order set by the setting means.

The information presentation system according to claim 7,
The information presentation system further comprising: an output unit that outputs the voice data generated by the generation unit according to the processing order set by the setting unit.

The information presentation system according to any one of claims 1 to 8,
Information obtained by the obtaining means includes at least POI (Point Of Interest) information.

The information presentation system according to claim 9,
The POI information includes at least a spot name and details of the spot as text information.

An information presentation method for acquiring a plurality of information, sequentially generating audio data based on the acquired information, and setting a processing order for sequentially outputting the generated audio data,
When setting the processing order, the processing order is already set for information that can generate the audio data before the generation and output of the audio data for the information for which the processing order has already been set is completed. A method of presenting information, characterized in that audio data is generated following the information and set as information for output.