JP6688252B2

JP6688252B2 - Determination device, determination method, and determination program

Info

Publication number: JP6688252B2
Application number: JP2017096004A
Authority: JP
Inventors: 石川　健二
Original assignee: Yahoo Japan Corp
Current assignee: Yahoo Japan Corp
Priority date: 2017-05-12
Filing date: 2017-05-12
Publication date: 2020-04-28
Anticipated expiration: 2037-05-12
Also published as: JP2018195894A

Description

The present invention relates to a determination device, a determination method, and a determination program.

Conventionally, audio collisions have been a problem: when an automatic broadcast with a pre-registered schedule overlaps an announcement made manually at an arbitrary time, the audio becomes difficult to hear. For example, in a station, a collision occurs between an automatic broadcast announcing the arrival of a train and a cautionary announcement made by station staff.

As a technique for preventing such collisions, it is known to temporarily store announced audio and play back the stored audio in order of priority. It is also known to avoid collisions by conveying the automatic broadcast schedule to a speaker who makes announcements from a microphone device.

JP H06-276600 A (特開平６−２７６６００号公報)
JP 2007-67465 A (特開２００７−６７４６５号公報)

However, with the above conventional techniques, it is difficult to preserve the content of the information to be conveyed while appropriately avoiding audio collisions according to the situation. For example, for automatic broadcasts that announce a train's departure time or arrival, the time of the broadcast matters, so temporarily storing the audio and playing it back later is not suitable. Furthermore, even if the schedule is conveyed to the speaker making the announcement, the free time in which audio does not collide may be extremely short, for example in a station where trains come and go frequently. The speaker may therefore be unable to announce everything that needs to be conveyed, or may have to wait a long time for free time to occur.

The present application has been made in view of the above, and an object thereof is to provide a determination device, a determination method, and a determination program capable of preserving the content of the information to be conveyed while appropriately avoiding audio collisions according to the situation.

The determination device according to the present application includes: an acquisition unit that acquires audio information; an assignment unit that assigns a priority to the audio information acquired by the acquisition unit; a detection unit that detects a blank time, which is a time period in which pieces of audio information do not collide when the audio information is output; and a determination unit that determines a mode of outputting the audio information based on the blank time detected by the detection unit and the priority assigned by the assignment unit.

According to one aspect of the embodiment, the content of the information to be conveyed can be preserved while audio collisions are appropriately avoided according to the situation.

FIG. 1 is a diagram for explaining an example of the effects achieved by the determination device according to the embodiment.
FIG. 2 is a diagram illustrating an example of the functional configuration of the determination device according to the embodiment.
FIG. 3 is a diagram showing an example of the automatic voice table according to the embodiment.
FIG. 4 is a diagram showing an example of the announcement table according to the embodiment.
FIG. 5 is a diagram showing an example of the schedule table according to the embodiment.
FIG. 6 is a diagram showing an example of the output waiting table according to the embodiment.
FIG. 7 is a flowchart (1) showing an example of the determination process according to the embodiment.
FIG. 8 is a flowchart (2) showing an example of the determination process according to the embodiment.
FIG. 9 is a diagram illustrating an example of the functional configuration of the determination device according to a modification.

Hereinafter, modes for carrying out the determination device, determination method, and determination program according to the present application (hereinafter referred to as "embodiments") will be described in detail with reference to the drawings. The determination device, determination method, and determination program according to the present application are not limited by these embodiments. In the following embodiments, the same parts are denoted by the same reference numerals, and redundant description is omitted.

[1. Concept of the determination process]
First, the concept of the determination process executed by the determination device 100 will be described with reference to FIG. 1. FIG. 1 is a diagram for explaining an example of the effects achieved by the determination device 100 according to the embodiment. The determination processing system 1 shown in FIG. 1 includes the determination device 100, a microphone 20A, a microphone 20B, and an output device 30. The number of each device included in the determination processing system 1 is not limited to the number illustrated.

The determination device 100 according to the embodiment is realized by one or more information processing devices, such as a server device or a cloud system. The determination device 100 is managed by, for example, a train operator and controls audio input and output within a station. Via a network N (see FIG. 2), such as a mobile communication network, various wired cables, or a wireless LAN (Local Area Network), the determination device 100 can acquire audio information (for example, automatic voice for conveying train operations) from a predetermined external device, and can communicate with input devices used for audio input (microphones 20A and 20B in the example of FIG. 1), the output device 30 that outputs audio, and the like.

The microphones 20A and 20B are audio input devices used by station staff. They accept audio uttered by station staff and transmit audio information generated from that audio to the determination device 100. In the following, when the microphones 20A and 20B need not be distinguished, they are collectively referred to as "microphone 20".

The output device 30 is an audio output device such as a loudspeaker installed on a station platform. The output device 30 may be a directional speaker that directs sound only toward a specific platform. Information equipment such as a predetermined amplifier or audio adjustment device may be connected to the microphone 20 and the output device 30.

In the example of FIG. 1, the determination device 100 controls the input and output of automatic voice announcements (hereinafter, "automatic voice") that announce train arrivals and departures, based on schedule information describing train operations. The content of automatic voice is not limited to train arrivals and departures; it also includes cautions about walking on the platform, reminders that smoking is prohibited, advertising, and the like. The determination device 100 also controls the input and output of announcements that station staff enter via the microphone 20. In the embodiment, input/output control includes accepting audio input, transmitting audio signals to the output device 30, and determining the mode of output (playback) when the output device 30 outputs (plays back) audio information.

On a station platform where the determination device 100 controls input and output in this way, various kinds of audio are output from the output device 30, so audio collisions may occur. Specifically, a station employee may alert passengers via the microphone 20 at the same moment that automatic voice announcing a train's arrival is output. In that case, the output of one of the sounds may be interrupted, or the sounds may be output mixed together. As a result, information that should reach passengers is lost, or hard-to-hear audio is played back; either way, the situation is undesirable for passengers. The determination device 100 therefore uses the determination process described below to preserve the content of the audio information to be conveyed while appropriately avoiding audio collisions according to the situation. An overview of the determination process according to the embodiment is given below, step by step, with reference to FIG. 1.

FIG. 1 illustrates an audio collision at 7:00:00 on a station platform where the determination device 100 controls audio input and output. For example, the microphone 20A accepts from station employee A an announcement reminding passengers on the platform of good manners (hereinafter, "audio information 61"), such as "Thank you for always using our service. Smoking is prohibited on the premises, so please refrain from smoking." In addition, based on the train operation schedule information, an automatic voice announcement about train operations (hereinafter, "audio information 62"), such as "A train bound for Shibuya will soon arrive on Track 1," is about to be output. Furthermore, the microphone 20B accepts from station employee B an announcement for keeping passengers on the platform out of danger (hereinafter, "audio information 63"), such as "Passengers, please step back!"

The determination device 100 acquires these pieces of audio information (step S11) and stores them in the audio information storage unit 121. The determination device 100 does not necessarily need to acquire audio information in real time. For example, it may acquire automatic voice such as the audio information 62 in advance.

The determination device 100 then assigns a priority to each piece of acquired audio information (step S12). Specifically, it assigns a priority expressed as a number from 1 to 10, where a smaller number indicates a higher priority.

In the embodiment, the determination device 100 assigns a priority to audio information based on, for example, a priority tag attached to the audio information at input time. The microphone 20 is assumed to be able to accept, when it receives audio, a selection of whether the announcement is a "normal" announcement or an "emergency" announcement. For example, when one of the buttons on the microphone 20 is pressed and audio is received, the microphone 20 tags the audio as a "normal" announcement. When another of the buttons is pressed and audio is received, the microphone 20 tags the audio as an "emergency" announcement. In this way, the microphone 20 can attach a priority tag when it transmits audio information to the determination device 100.

In the example of FIG. 1, station employee A is assumed to have entered audio while pressing the button on the microphone 20A that applies the "normal" announcement tag. In this case, the determination device 100 tags the audio information 61 with a priority of, for example, "3". Station employee B is assumed to have entered audio while pressing the button on the microphone 20B that applies the "emergency" announcement tag. In this case, the determination device 100 tags the audio information 63 with a priority of, for example, "1". The priority need not be selected with a button or the like on the microphone 20; it may instead be preset for each individual microphone 20.

For automatic voice that announces train operations, the determination device 100 assigns, for example, a priority set in advance by the operation manager. For example, the determination device 100 assigns a priority of "2" to the audio information 62, which is automatic voice announcing train operations.
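
The priority assignment in step S12 can be sketched as follows. This is a minimal illustration only: the names `AudioInfo`, `TAG_PRIORITY`, `AUTOMATIC_VOICE_PRIORITY`, and `assign_priority` are hypothetical, and the numeric values are the example values from FIG. 1 ("normal" maps to 3, "emergency" to 1, automatic voice to 2), not fixed values of the embodiment.

```python
from dataclasses import dataclass
from typing import Optional

# Example values from the FIG. 1 walkthrough; an actual system would configure these.
TAG_PRIORITY = {"emergency": 1, "normal": 3}
AUTOMATIC_VOICE_PRIORITY = 2  # preset by the operation manager in the example


@dataclass
class AudioInfo:
    audio_id: str
    source: str                     # "microphone" or "automatic" (hypothetical labels)
    tag: Optional[str] = None       # "normal" / "emergency" for microphone input
    priority: Optional[int] = None  # filled in by assign_priority


def assign_priority(audio: AudioInfo) -> AudioInfo:
    """Assign a priority from 1 (highest) to 10 (lowest)."""
    if audio.source == "automatic":
        audio.priority = AUTOMATIC_VOICE_PRIORITY
    else:
        audio.priority = TAG_PRIORITY[audio.tag]
    if not 1 <= audio.priority <= 10:
        raise ValueError("priority must be between 1 and 10")
    return audio
```

With these assumed values, the audio information 61, 62, and 63 receive priorities 3, 2, and 1 respectively, matching the example in the text.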

Because three pieces of audio information to be output have been acquired as of 7:00:00, the determination device 100 detects that an audio collision has occurred. In this case, the determination device 100 performs processing that allows the three pieces of audio information to be output one after another. First, the determination device 100 decides to output the audio information with the highest priority ahead of the others. At this time, the determination device 100 temporarily records any audio information that does not have the highest priority and has not yet been recorded. In the example of FIG. 1, the audio information 62 is automatic voice and was recorded in advance, so the determination device 100 does not need to record it anew. The audio information 61, on the other hand, is not recorded audio. The determination device 100 therefore temporarily stores the audio information 61, an announcement with a lower priority than the audio information 63, in the output information storage unit 125 (step S13).

Of the three pieces of audio information whose outputs collide, the determination device 100 decides to output the audio information 63, which has the highest priority. In this case, the determination device 100 outputs the audio information 63 regardless of the schedule on which the audio information 62, which is automatic voice, was due to be output. In other words, announcements made by station staff in an emergency are set to take precedence even if they cut into the schedule. The determination device 100 therefore gives precedence to the interrupt announcement with the highest priority among the colliding audio information (the audio information 63) and outputs it (step S14).
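
Steps S13 and S14 can be illustrated with a short sketch. The `Pending` type and `handle_collision` function are hypothetical names introduced here; the sketch assumes that each colliding item carries its priority and a flag saying whether it is already recorded, and that ties in priority are broken by list order (the text does not specify tie-breaking).

```python
from dataclasses import dataclass


@dataclass
class Pending:
    audio_id: str
    priority: int      # 1 is the highest priority
    prerecorded: bool  # True for automatic voice, False for live announcements


def handle_collision(colliding):
    """Pick the audio to output now and list the audio that must be recorded first."""
    # The item with the smallest priority number is output immediately (step S14).
    winner = min(colliding, key=lambda a: a.priority)
    waiting = [a for a in colliding if a is not winner]
    # Live announcements that lose the collision are recorded temporarily (step S13).
    to_record = [a.audio_id for a in waiting if not a.prerecorded]
    return winner, waiting, to_record
```

Applied to the FIG. 1 example, the audio information 63 is output first, and only the audio information 61 needs temporary recording, since the audio information 62 is already recorded.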

After the audio information 63 has been output, the determination device 100 performs processing to detect a blank time, which is a time period in which pieces of audio information do not collide (step S15). Specifically, the determination device 100 refers to the train schedule information held in advance in the output information storage unit 125. It then detects the blank time between the end of the output of the audio information 63 and the time at which the next automatic voice is to be output.
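
One way to picture the blank time detection in step S15 is as the gap between the current time and the next scheduled automatic voice. The function name `detect_blank_time` and the representation of the schedule as a list of start times are assumptions made for illustration; the embodiment's schedule table may hold the information differently.

```python
from datetime import datetime, timedelta
from typing import List, Optional


def detect_blank_time(now: datetime, scheduled_starts: List[datetime]) -> Optional[timedelta]:
    """Return the gap between `now` and the next scheduled automatic voice.

    Returns None when nothing further is scheduled, i.e. the gap is open-ended.
    """
    upcoming = [t for t in scheduled_starts if t >= now]
    if not upcoming:
        return None
    return min(upcoming) - now
```

For instance, if the interrupt announcement ends at 7:00:10 and the next automatic voice is scheduled for 7:00:30 (hypothetical times), the detected blank time is 20 seconds.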

At this time, the determination device 100 determines the mode in which audio information is output, based on the detected blank time and the priority assigned to each piece of audio information. The output mode includes the order in which audio information is output and the way it is played back when output. The way it is played back includes the playback speed and the selection of which parts of the audio information are played back.

First, the determination device 100 orders the audio information that has had to wait for output (hereinafter sometimes called "waiting audio") according to priority. In the example of FIG. 1, among the waiting audio, the determination device 100 ranks the audio information 62, which has the higher priority, above the audio information 61.

The determination device 100 then refers to the playback time, that is, the time required to play back the audio information 62. If the detected blank time is longer than that playback time, the determination device 100 decides to output the audio information 62.

Even when the detected blank time is shorter than the playback time, the determination device 100 may still be able to output the audio information 62. For example, the determination device 100 is assumed to hold, for each piece of audio information, information on the playback time when the audio information is shortened (hereinafter, "shortened playback time"). The shortened playback time is, for example, the playback time when omissible parts of the audio information are omitted, or the playback time when the audio information is played back slightly faster (for example, at 1.3 times normal speed).

That is, when the detected blank time is shorter than the playback time of the audio information 62, whose output was preempted by the audio information 63, but longer than its shortened playback time, the determination device 100 decides to change the audio information 62 into a mode that can be output (step S16). Specifically, the determination device 100 determines the output mode so that the audio information 62 is output with some parts omitted or at a faster playback speed.
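
The comparison described above (blank time against playback time, then against shortened playback time) can be sketched as follows. `OutputMode` and `decide_output_mode` are hypothetical names. Two points are assumptions of this sketch rather than statements of the text: a blank time exactly equal to a playback time is treated as not fitting, and the case where even the shortened form does not fit is modeled as waiting for a later blank time.

```python
from enum import Enum


class OutputMode(Enum):
    FULL = "full"            # play back as recorded
    SHORTENED = "shortened"  # omit parts or raise the playback speed
    WAIT = "wait"            # does not fit; wait for a later blank time (assumption)


def decide_output_mode(blank_s: float, play_s: float, short_s: float) -> OutputMode:
    """Choose a mode from the blank time, playback time, and shortened playback time (seconds)."""
    if blank_s > play_s:
        return OutputMode.FULL
    if blank_s > short_s:
        return OutputMode.SHORTENED
    return OutputMode.WAIT
```

As a worked example, audio with a playback time of 8 seconds played at 1.3 times normal speed takes about 6.2 seconds, so a blank time of 7 seconds would allow the shortened form but not the full one.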

In this way, the determination device 100 outputs the audio information 62 (automatic voice) in a form that fits the length of the blank time (step S17). After outputting the audio information 62, the determination device 100 detects a further blank time (step S18). Depending on whether a blank time in which the audio information 61 can be output is detected, the determination device 100 determines the output timing of the waiting audio (the audio information 61) (step S19). Then, as with the audio information 62, the determination device 100 outputs the audio information 61 in a form that fits the length of the blank time (step S20).
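
Steps S15 to S20 as a whole can be pictured as a loop over the waiting audio in priority order. The sketch below is a simplified model under stated assumptions: `plan_outputs` is a hypothetical function, each detected blank time hosts at most one piece of audio, and the playback times and blank times in the usage note are invented for illustration.

```python
def plan_outputs(waiting, blank_times):
    """Return a list of (audio id, mode) pairs in output order.

    waiting: dicts with keys "id", "priority", "play_s", "short_s".
    blank_times: blank times in seconds, in the order they are detected.
    """
    # Waiting audio is ordered by priority; a smaller number goes first.
    queue = sorted(waiting, key=lambda a: a["priority"])
    plan = []
    for blank in blank_times:
        if not queue:
            break
        head = queue[0]
        if blank > head["play_s"]:
            plan.append((head["id"], "full"))
        elif blank > head["short_s"]:
            plan.append((head["id"], "shortened"))
        else:
            continue  # the head does not fit; wait for the next blank time
        queue.pop(0)
    return plan
```

With audio information 62 (playback 5 s, shortened 4 s) and audio information 61 (playback 9 s, shortened 7 s) waiting, and successive blank times of 4.5 s, 3 s, and 12 s, this model outputs 62 in shortened form, skips the 3-second gap, and then outputs 61 in full.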

As described above, the determination device 100 according to the embodiment acquires audio information and assigns a priority to the acquired audio information. When outputting audio information, the determination device 100 detects a blank time, which is a time period in which pieces of audio information do not collide. The determination device 100 then determines the mode of outputting the audio information based on the detected blank time and the assigned priority. This allows the determination device 100 to preserve the content of the information to be conveyed while appropriately avoiding audio collisions according to the situation. An example of the functional configuration and effects of the determination device 100 that implements the determination process is described below with reference to the drawings.

[2. Example of functional configuration]
First, the configuration of the determination device 100 according to the embodiment will be described with reference to FIG. 2. FIG. 2 is a diagram illustrating an example of the functional configuration of the determination device 100 according to the embodiment. As shown in FIG. 2, the determination device 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The determination device 100 may also include an input unit (for example, a keyboard or a mouse) that accepts various operations from an administrator or the like of the determination device 100, and a display unit (for example, a liquid crystal display) for displaying various information.

The communication unit 110 is realized by, for example, a NIC (Network Interface Card). The communication unit 110 is connected to the network N by wire or wirelessly, and transmits and receives various information to and from the microphone 20 and the output device 30.

The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory, or a storage device such as a hard disk or an optical disk. As shown in FIG. 2, the storage unit 120 according to the embodiment includes an audio information storage unit 121 and an output information storage unit 125. An example of the information registered in the audio information storage unit 121 and the output information storage unit 125 is described below with reference to FIGS. 3 to 6.

The audio information storage unit 121 stores information about audio information. As shown in FIG. 2, the audio information storage unit 121 has, as data tables, an automatic voice table 122 and an announcement table 123. Each data table is described in turn below.

The automatic voice table 122 stores information about automatic voice among the audio information. FIG. 3 is a diagram showing an example of the automatic voice table 122 according to the embodiment. As shown in FIG. 3, the automatic voice table 122 has the items "automatic voice type", "schedule", "automatic voice ID", "conveyed content", "priority", "output timing", "playback time", and "shortened playback time".

"Automatic voice type" indicates the type of the automatic voice. The type is the category of the automatic voice: for example, whether it is operation guidance announcing train arrivals and departures, a caution to passengers on the platform, or advertising information such as promotions. Automatic voice may be assigned a priority according to its automatic voice type. "Schedule" indicates the dates on which playback of the automatic voice is planned.

"Automatic voice ID" indicates identification information that identifies each automatic voice. "Conveyed content" indicates the specific content conveyed by the automatic voice. "Priority" indicates the priority assigned to the automatic voice. "Output timing" indicates the specific time at which output of the automatic voice is scheduled. Some automatic voice, such as operation guidance, is scheduled for output at a specific time, while other automatic voice, such as cautions and advertising information, is not scheduled for a specific time.

"Playback time" indicates the basic playback time of the automatic voice. "Shortened playback time" indicates the playback time when the automatic voice is played back in shortened form.
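
A row of the automatic voice table 122 could be modeled as a simple record type. The class and field names below are hypothetical, the units (seconds) are an assumption, and the check that the shortened playback time does not exceed the playback time is an added sanity constraint, not something stated in the description.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AutomaticVoiceRecord:
    voice_type: str               # "automatic voice type", e.g. operation guidance
    schedule: str                 # "schedule": dates on which playback is planned
    voice_id: str                 # "automatic voice ID"
    content: str                  # "conveyed content"
    priority: int                 # 1 (highest) to 10 (lowest)
    output_timing: Optional[str]  # None for voice not tied to a specific time
    play_s: float                 # "playback time" in seconds (unit assumed)
    short_s: float                # "shortened playback time" in seconds (unit assumed)

    def __post_init__(self):
        # Sanity check added for this sketch only.
        if self.short_s > self.play_s:
            raise ValueError("shortened playback time exceeds playback time")
```

Leaving `output_timing` as `None` reflects the distinction in the text between operation guidance, which has a scheduled time, and cautions or advertising, which do not.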

Next, the announcement table 123 will be described. The announcement table 123 stores information about announcements, among the audio information, that station staff or others have entered via the microphone 20. FIG. 4 is a diagram showing an example of the announcement table 123 according to the embodiment. As shown in FIG. 4, the announcement table 123 has the items "input device ID", "announcement type", "announcement ID", "acquisition date and time", "conveyed content", "priority", "recording time", and "shortened playback time".

"Input device ID" indicates identification information that identifies the device through which the audio information of the announcement was entered. "Announcement type" indicates the type of the announcement. As described above, the announcement type may be set by a selection operation at the time of input to the microphone 20, or may be preset for each microphone 20. A normal announcement is an announcement entered with the "normal" setting. An emergency announcement is an announcement entered with the "emergency" setting. A recorded announcement is not an announcement to be output immediately, but one that station staff or others have entered in order to record promotional messages and the like in advance. "Announcement ID" indicates identification information that identifies the audio information of the announcement.

"Acquisition date and time" indicates the date and time when the announcement was acquired. "Communication content" indicates the content of the announcement. The content may be, for example, a tag that the station staff attaches at the time of input using a function provided in the microphone 20, or it may be assigned automatically from the result of analyzing the announcement audio (for example, the content of the resulting text data).

"Priority" indicates the priority of the voice information of the announcement. "Recording time" indicates the length of time for which the announcement was recorded, that is, the playback time of the voice information of the announcement. "Shortened playback time" indicates the playback time when the voice information of the announcement is played back in shortened form.

Next, the output information storage unit 125 will be described. The output information storage unit 125 stores the schedule on which automatic voice is output and the voice information that is waiting to be output. As shown in FIG. 2, the output information storage unit 125 has, as data tables, a schedule table 126 and an output waiting table 127. Each data table is described in turn below.

The schedule table 126 stores the schedule of the automatic voice. FIG. 5 is a diagram showing an example of the schedule table 126 according to the embodiment. As shown in FIG. 5, the schedule table 126 has items such as "voice output schedule", "automatic voice ID", "playback time", and "blank time".

"Voice output schedule" indicates the scheduled time at which the automatic voice is output. "Automatic voice ID" indicates identification information that identifies the automatic voice. "Playback time" indicates the playback time of the automatic voice. "Blank time" is the time in the schedule from the end of playback of one automatic voice to the start of playback of the next, that is, the time during which the output device 30 outputs nothing. When output of an automatic voice is delayed, for example by an interrupting announcement, the blank time may be updated according to the delay.
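The relationship between the scheduled output time, the playback time, and the blank time can be illustrated with a minimal sketch. The class `ScheduleEntry`, the functions `blank_times` and `delay_entry`, the IDs, and the use of seconds from an arbitrary reference point are all assumptions made for illustration; the patent does not specify a data format.

```python
from dataclasses import dataclass


@dataclass
class ScheduleEntry:
    start_s: float       # scheduled output time, in seconds from a reference point
    auto_voice_id: str   # hypothetical automatic voice ID
    playback_s: float    # playback time in seconds


def blank_times(entries):
    """For each entry except the last, return (ID, gap until the next entry starts)."""
    ordered = sorted(entries, key=lambda e: e.start_s)
    gaps = []
    for cur, nxt in zip(ordered, ordered[1:]):
        gap = nxt.start_s - (cur.start_s + cur.playback_s)
        gaps.append((cur.auto_voice_id, max(gap, 0.0)))  # overlapping entries leave no gap
    return gaps


def delay_entry(entry, delay_s):
    """Return a copy of the entry whose output was pushed back by delay_s seconds."""
    return ScheduleEntry(entry.start_s + delay_s, entry.auto_voice_id, entry.playback_s)


entries = [
    ScheduleEntry(0.0, "A01", 20.0),
    ScheduleEntry(60.0, "A02", 15.0),
    ScheduleEntry(100.0, "A03", 10.0),
]
print(blank_times(entries))
```

Recomputing the gaps after `delay_entry` corresponds to updating the blank time when an interrupting announcement delays an automatic voice.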

The output waiting table 127 stores information about waiting voices, that is, voices that are waiting to be output. FIG. 6 is a diagram showing an example of the output waiting table 127 according to the embodiment. As shown in FIG. 6, the output waiting table 127 has items such as "voice collision date and time", "waiting voice ID", "priority", "playback time", and "output mode".

"Voice collision date and time" indicates the date and time when the collision of voice information occurred. "Waiting voice ID" indicates identification information that identifies voice information that, at the time the collision occurred, was not output and is waiting to be output. The identification information used as the waiting voice ID is shared with the automatic voice ID in FIG. 3 and the announcement ID in FIG. 4. "Priority" indicates the priority assigned to the waiting voice. "Playback time" indicates the playback time of the waiting voice. "Output mode" indicates the mode in which the waiting voice is scheduled to be output. Because a waiting voice is played only after a blank time is detected, or later than its original schedule, it may be output in a mode that finishes sooner than usual. Specifically, the output mode may shorten the playback time by playing only part of the voice information or by increasing the playback speed. For example, the determination device 100 compares the total playback time of the waiting voices with the length of the detected blank time and, if the blank time is shorter, determines an output mode that plays the waiting voices in less time, so that they are output as early as possible.

Returning to FIG. 2, the description continues. The control unit 130 is a controller. It is realized when, for example, a CPU (Central Processing Unit), an MPU (Micro Processing Unit), an ASIC (Application Specific Integrated Circuit), or an FPGA (Field Programmable Gate Array) executes various programs stored in a storage device inside the determination device 100 (for example, the determination program according to the embodiment), using a storage area such as a RAM as a work area. In the example shown in FIG. 2, the control unit 130 includes an acquisition unit 131, an assigning unit 132, a detection unit 133, a determination unit 134, and a transmission unit 135 (hereinafter sometimes referred to collectively as the processing units 131 to 135).

The connections among the processing units 131 to 135 of the control unit 130 are not limited to those shown in FIG. 2 and may be different. The processing units 131 to 135 realize and execute the functions and operations of the determination process described below (for example, FIG. 1), but they are functional units organized for the sake of explanation and need not correspond to actual hardware elements or software modules. That is, as long as the functions and operations of the determination process described below can be realized and executed, the determination device 100 may realize and execute the process in any functional units.

[3. Example of Operation and Effects in the Determination Process]
The determination process executed and realized by the processing units 131 to 135 is described below using the flowcharts shown in FIGS. 7 and 8. FIG. 7 is a flowchart (1) showing an example of the determination process according to the embodiment. FIG. 7 shows an example of the procedure of the determination process when pieces of voice information other than automatic voice, such as announcements, collide with each other.

First, the acquisition unit 131 determines whether voice information has been acquired (step S101). The acquisition unit 131 acquires voice information by accepting voice input from a speaker such as a station staff member through an input device such as the microphone 20. If no voice information has been acquired (step S101; No), the acquisition unit 131 waits until it is acquired. If the acquisition unit 131 has acquired voice information (step S101; Yes), the assigning unit 132 assigns a priority to the acquired voice information. For example, the assigning unit 132 assigns the priority based on a setting made at the microphone 20 or the like, such as the selection of "normal" or "emergency" (step S102).

At this point, the detection unit 133 determines whether the acquired voice information causes a voice collision (step S103). The detection unit 133 need not treat only the case where some voice information is being output at that moment as a collision. It may also, for example, treat a situation in which an automatic voice is scheduled to be output a few seconds later as a collision.
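The collision check in step S103 can be sketched as follows. This is only an illustration: the function name `is_collision`, the `lookahead_s` parameter, and its default of 3 seconds are assumptions, since the text says only "a few seconds".

```python
def is_collision(now_s, currently_playing, next_auto_start_s, lookahead_s=3.0):
    """Return True if newly acquired voice would collide with other output.

    A collision is reported when something is playing right now, or when
    the next automatic voice is due within lookahead_s seconds.
    """
    if currently_playing:
        return True
    if next_auto_start_s is None:  # nothing scheduled
        return False
    return 0.0 <= next_auto_start_s - now_s <= lookahead_s


print(is_collision(100.0, False, 102.0))  # automatic voice due in 2 s
```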

If no voice collision has occurred (step S103; No), the acquisition unit 131 continues the process of acquiring voice information. Although not shown in FIG. 7, when no voice collision occurs the determination unit 134 may determine that the voice information acquired in step S101 is output to the output device 30 as is.

If a voice collision has occurred (step S103; Yes), the acquisition unit 131 determines whether the colliding voice information includes any voice that has not been recorded (step S104). If there is unrecorded voice (step S104; Yes), the acquisition unit 131 records it (step S105). For example, when the acquisition unit 131 acquires two or more pieces of voice information that were input separately by multiple speakers and were each assigned a priority by the assigning unit 132, it temporarily records all of them except the one with the highest priority. In other words, the acquisition unit 131 stores the acquired voice information in the voice information storage unit 121. If the colliding voice information has already been recorded (step S104; No), the acquisition unit 131 does not need to record it.

Then, the determination unit 134 determines that the voice with the highest priority among the colliding voice information is to be output (step S106). Next, the determination unit 134 reorders the waiting voices (step S107). That is, after the voice information with the highest rank (highest priority) has been output, the determination unit 134 moves the waiting voice with the next highest rank to the top of the waiting voices.
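Steps S106 and S107 amount to a priority queue over the waiting voices. The sketch below uses Python's standard `heapq` module; the class `WaitingQueue`, the convention that a larger number means a higher priority, the arrival-order tie-break, and the sample IDs are all assumptions not stated in the patent.

```python
import heapq
import itertools


class WaitingQueue:
    """Waiting voices ordered by priority, then by arrival order."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker that preserves arrival order

    def push(self, voice_id, priority):
        # heapq is a min-heap, so the priority is negated to pop the highest first.
        heapq.heappush(self._heap, (-priority, next(self._counter), voice_id))

    def pop_highest(self):
        """Remove and return the ID of the highest-priority waiting voice."""
        return heapq.heappop(self._heap)[2]

    def __len__(self):
        return len(self._heap)


queue = WaitingQueue()
queue.push("N01", 1)   # normal announcement (hypothetical ID)
queue.push("E01", 5)   # emergency announcement (hypothetical ID)
queue.push("N02", 1)
print(queue.pop_highest())
```

After each pop, the next highest-ranked waiting voice is automatically at the top, which mirrors the reordering in step S107.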

Then, the detection unit 133 determines whether it has detected a blank time in which the waiting voice can be output (step S108). A blank time in which the waiting voice can be output is, for example, a blank time longer than the shortened playback time of the waiting voice, so that the waiting voice can be played if it is shortened. If no such blank time is detected (step S108; No), the detection unit 133 waits until one is detected. If such a blank time is detected (step S108; Yes), the detection unit 133 further determines whether the detected blank time is longer than the playback time of the voice information scheduled to be output next (step S109).

If the detected blank time is longer than the voice information scheduled to be output next (step S109; Yes), the determination unit 134 determines that the scheduled voice information is to be output (step S110). If the detected blank time is not longer (step S109; No), the determination unit 134 determines an output mode that fits the blank time (step S111).

For example, the determination unit 134 determines the playback speed of the voice information to be output according to the length of the blank time. Alternatively, the determination unit 134 selects a portion of the voice information to play according to the length of the blank time and determines that only the selected portion is to be output.
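One possible way to combine steps S108 to S111 is sketched below. It is an illustration under stated assumptions, not the patented procedure: the function name `choose_output_mode`, the mode labels, the order in which speed-up and partial playback are tried, and the `max_speed` cap of 1.5 are all hypothetical.

```python
def choose_output_mode(playback_s, shortened_s, blank_s, max_speed=1.5):
    """Pick an output mode for a waiting voice given the detected blank time.

    Returns (mode, speed), where mode is "normal", "speed_up", "partial", or "wait".
    """
    if blank_s <= 0:
        return ("wait", 1.0)
    if blank_s > playback_s:
        return ("normal", 1.0)        # step S110: the voice fits as is
    speed = playback_s / blank_s      # speed needed to finish within the blank time
    if speed <= max_speed:
        return ("speed_up", speed)    # step S111: adjust the playback speed
    if blank_s > shortened_s:
        return ("partial", 1.0)       # step S111: play only a selected portion
    return ("wait", 1.0)              # no usable blank time yet (step S108; No)


print(choose_output_mode(30.0, 12.0, 24.0))
```

Capping the speed reflects the practical concern that audio played too fast becomes hard to understand; the patent itself does not state a limit.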

Then, the determination unit 134 causes the scheduled voice information to be output in the determined mode (step S112). That is, the transmission unit 135 transmits the voice information to the output device 30 in the mode determined by the determination unit 134.

After the transmission unit 135 has transmitted the voice information to the output device 30, the determination unit 134 determines whether any further waiting voice exists (step S113). If a waiting voice exists (step S113; Yes), the determination unit 134 reorders the waiting voices (step S107). If no waiting voice exists (step S113; No), the determination unit 134 ends the processing that began at step S103.

Next, the determination process executed and realized by the processing units 131 to 135 is described with reference to FIG. 8. FIG. 8 is a flowchart (2) showing an example of the determination process according to the embodiment. FIG. 8 shows an example of the procedure of the determination process when an automatic voice collides with voice information such as an announcement.

In the example of FIG. 8, the acquisition unit 131 holds in advance the voice information of the automatic voice and the schedule information for outputting (playing) the automatic voice. The determination unit 134 refers to the schedule information held in the output information storage unit 125 and determines whether the time to play the automatic voice has arrived (step S201). If the time has not arrived (step S201; No), the determination unit 134 waits until it does.

If the time to play the automatic voice has arrived (step S201; Yes), the determination unit 134 causes the automatic voice to be played (step S202). The acquisition unit 131 determines whether new voice information has been acquired during playback of the automatic voice (step S203). Specifically, the acquisition unit 131 determines whether input of voice information, such as an announcement through the microphone 20, has been accepted as new voice information. In other words, the acquisition unit 131 determines whether a voice collision has occurred. If no new voice information is acquired during playback of the automatic voice (step S203; No), the determination unit 134 waits until the time to play the next automatic voice arrives.

If new voice information is acquired during playback of the automatic voice (step S203; Yes), the assigning unit 132 assigns a priority to the new voice information. The determination unit 134 then determines whether the acquired voice information has a higher priority than the automatic voice being played (step S204).

If the acquired voice information has a higher priority than the automatic voice being played (step S204; Yes), the determination unit 134 stores the current playback position of the automatic voice and stops its playback (step S205). The determination unit 134 then determines that the acquired voice is to be output from the output device 30 (step S206).

This means that when the assigning unit 132 assigns voice information a priority higher than a predetermined threshold, the determination unit 134 determines that the voice information is to be output preferentially regardless of the blank time. It also means that when voice information with a higher priority than the automatic voice is acquired, the determination unit 134 determines that the higher-priority voice information is to be output regardless of the schedule. Specifically, when an emergency announcement for avoiding danger, which has a higher priority than the automatic voice, is input, the determination unit 134 interrupts with that announcement and outputs it even while the automatic voice is playing or when the blank time is short (for example, a few seconds before the next automatic voice is due to be output).

As shown in step S205, when the determination unit 134 has determined that voice information with a higher priority than the automatic voice is to be output while other voice information, including automatic voice, is in the middle of being output, it determines that the higher-priority voice information is output preferentially and that output of the other voice information is stopped. The determination unit 134 then stores the position at which the voice was stopped. As a result, when the voice information involved in the collision is played again, the determination unit 134 can start playback from the stopped position or from near it. "Near the stopped position" means, for example, a position a few seconds before the stopped position. That is, when stopped voice information is played again, the determination unit 134 may determine an output mode that starts playback a few seconds before the stopped position rather than exactly at it.
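The priority comparison in step S204 and the interruption in steps S205 and S206 can be sketched as follows. The lower-priority branch corresponds to the temporary recording in step S221. The names `Playback` and `handle_new_voice`, the dictionary layout of the result, and the numeric priorities are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Playback:
    voice_id: str      # ID of the voice currently being output
    priority: int      # larger value means higher priority (assumption)
    position_s: float  # current playback position in seconds


def handle_new_voice(current, new_id, new_priority):
    """Decide what to do when new voice arrives while `current` is playing."""
    if new_priority > current.priority:
        # Steps S205 and S206: remember where playback stopped, then switch.
        return {
            "action": "interrupt",
            "play": new_id,
            "paused": (current.voice_id, current.position_s),
        }
    # Step S221: keep playing and record the new voice for later output.
    return {"action": "record", "play": current.voice_id, "queued": new_id}


current = Playback("A01", 5, 12.0)
print(handle_new_voice(current, "E01", 10))
```

The stored `paused` tuple is what allows playback to resume later from the stopped position or slightly before it.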

After that, the determination unit 134 determines whether output of the acquired voice information has finished (step S207). If it has not finished (step S207; No), the determination unit 134 continues to output the higher-priority voice information. If it has finished (step S207; Yes), the detection unit 133 determines whether it has detected a blank time in which the stopped automatic voice can be played again (step S208).

For example, when the acquisition unit 131 has acquired automatic voice, that is, voice information whose output timing is preset according to a schedule, the detection unit 133 refers to the schedule information indicating when the automatic voice is output. The detection unit 133 then detects the blank time based on the schedule set for the automatic voice. Specifically, the detection unit 133 refers to the current time and the blank time recorded in the schedule table 126 shown in FIG. 5, and detects a blank time in which the stopped automatic voice can be played again. In this process, the detection unit 133 may update the information stored in the schedule table 126 as appropriate. For example, when an emergency announcement has interrupted and delayed the output timing of the automatic voice, the detection unit 133 updates the voice output schedule according to the delay.

If no usable blank time is detected (step S208; No), the detection unit 133 waits until one is detected. If a usable blank time is detected (step S208; Yes), the detection unit 133 further determines whether the detected blank time is longer than a predetermined time, for example the original playback time of the stopped automatic voice or its remaining playback time (step S209).

If the detected blank time is longer than the predetermined time (step S209; Yes), specifically if it is longer than the original playback time of the stopped automatic voice, the determination unit 134 determines a mode in which the stopped automatic voice is played from the beginning (step S210). This is because, when possible, playing the automatic voice from the beginning conveys the intended information to passengers more reliably than resuming from the stopped position.

Even when the detected blank time is longer than the original playback time of the stopped automatic voice, the determination unit 134 does not necessarily have to play the automatic voice from the beginning. For example, when a predetermined time (for example, several tens of seconds) or more has passed since the originally scheduled playback timing, it may be preferable to finish playback quickly for automatic voice whose timing is set precisely, such as operation information. That is, the determination unit 134 may determine the output mode of the automatic voice according to, for example, the automatic voice type shown in FIG. 3. Specifically, for automatic voice relating to operation guidance, when the predetermined time or more has passed since the scheduled playback timing, the determination unit 134 may determine that playback resumes from the stopped position instead of starting again from the beginning. Conversely, for automatic voice such as alerts or advertisements, the determination unit 134 may determine an output mode that plays the automatic voice again from the beginning even when the predetermined time or more has passed since the scheduled playback timing.

If the detected blank time is not longer than the predetermined time, for example the original playback time of the stopped automatic voice or its remaining playback time (step S209; No), the determination unit 134 determines the mode in which the automatic voice is output (step S211). For example, when the detected blank time is shorter than the playback time of the stopped automatic voice from the beginning but longer than its remaining playback time, the determination unit 134 determines that, after output of the higher-priority voice information finishes, the automatic voice is output again from near the position where its output was stopped.

Alternatively, when the detected blank time is not longer than even the remaining playback time of the stopped automatic voice, but the automatic voice can still be played by adjusting its output mode (for example, its playback speed or the portion played), the determination unit 134 determines that it is output in the adjusted mode. In this way, the determination unit 134 causes the stopped automatic voice to be output again in the determined mode (step S212).
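The choices made in steps S209 to S211 can be summarized in a short sketch. It is illustrative only: the function `decide_resume`, the `rewind_s` default of 3 seconds, the `max_speed` cap, and the `restart_allowed` flag (standing in for the type-dependent rule for operation guidance) are assumptions.

```python
def decide_resume(blank_s, total_s, stopped_at_s,
                  rewind_s=3.0, max_speed=1.5, restart_allowed=True):
    """Decide how to replay a stopped automatic voice within a blank time.

    Returns a dict with the start position and speed, or None if the voice
    cannot be fitted and another blank time must be awaited.
    """
    resume_from = max(0.0, stopped_at_s - rewind_s)  # a few seconds before the stop
    remaining_s = total_s - resume_from
    if restart_allowed and blank_s > total_s:
        return {"start_s": 0.0, "speed": 1.0}          # step S210: from the beginning
    if blank_s > remaining_s:
        return {"start_s": resume_from, "speed": 1.0}  # resume near the stopped position
    if blank_s > 0 and remaining_s / blank_s <= max_speed:
        return {"start_s": resume_from, "speed": remaining_s / blank_s}  # adjusted speed
    return None


print(decide_resume(blank_s=25.0, total_s=30.0, stopped_at_s=12.0))
```

Passing `restart_allowed=False` models the case where timing-sensitive guidance has already been delayed and should resume rather than restart.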

On the other hand, if the newly acquired voice information has a lower priority than the automatic voice (step S204; No), the acquisition unit 131 temporarily records the acquired voice information (step S221).

In this case, the determination unit 134 continues playback of the automatic voice (step S222). The determination unit 134 then determines whether playback of the automatic voice has finished (step S223). If it has not finished (step S223; No), the determination unit 134 continues playing the automatic voice. If it has finished (step S223; Yes), the detection unit 133 determines whether it has detected a blank time in which the recorded voice information can be played (step S224).

Specifically, the detection unit 133 detects the blank time by referring to the schedule information. If no blank time is detected (step S224; No), the detection unit 133 waits until one is detected. If a blank time is detected (step S224; Yes), the detection unit 133 further determines whether the detected blank time is longer than a predetermined time, for example the recording time of the acquired voice information (step S225). If the detected blank time is longer than the predetermined time (step S225; Yes), the determination unit 134 determines a mode in which the acquired voice information is played at normal speed (step S226).

If the detected blank time is not longer than the predetermined time (step S225; No), the determination unit 134 determines an output mode for the acquired voice information that fits the blank time (step S227). The determination unit 134 then causes the acquired voice information to be output in the determined mode (step S228). After that, the transmission unit 135 transmits information about the mode determined by the determination unit 134, together with the voice information, to the output device 30.

[4. Modifications]
The determination device 100 according to the embodiment described above may be implemented in various forms other than that embodiment. Other embodiments of the determination device 100 are therefore described below. Descriptions of points that are the same as in the embodiment are omitted where appropriate.

[4-1. Generation of Notification Information]
The determination device 100 may be configured to generate notification information for notifying a speaker (for example, a station staff member) of information such as the schedule of automatic voice, blank times, and recorded announcements. FIG. 9 shows a configuration example of a determination device 200 according to such a modification. FIG. 9 is a diagram illustrating an example of the functional configuration of the determination device 200 according to the modification. The determination device 200 includes a generation unit 136 in addition to the configuration of the determination device 100.

The generation unit 136 generates notification information that reports information on the blank time detected by the detection unit 133, the playback time of predetermined voice information scheduled to be output after the blank time elapses, and the playback content of that voice information.

For example, when a station staff member is about to make an announcement using the microphone 20, the notification information generated by the generation unit 136 is displayed on the microphone 20 or on an information device installed near the microphone 20. Specifically, the notification information includes the length of the blank time until the next automatic voice is output (for example, the number of seconds remaining until that output), information tagged to the next automatic voice that indicates what it conveys, and the playback time of that automatic voice. The notification information is not limited to information on automatic voices. It may also include, for example, information on voice information that has been temporarily recorded and is waiting to be output.

That is, by checking the notification information, the station staff member can avoid a collision between an automatic voice and the announcement. The staff member can also recognize when the intended announcement duplicates the content of the automatic voice to be output next. The notification information generated by the generation unit 136 can thus prevent collisions with automatic voices and with recorded voice information.
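The notification described in this section can be illustrated with a minimal Python sketch. It is not taken from the patent: the `ScheduledVoice` record, the `build_notification` function, and the dictionary keys are hypothetical names, and the sketch assumes that the current time falls inside a blank time and that the schedule holds start times and durations in seconds.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ScheduledVoice:
    """Hypothetical record for one scheduled automatic voice."""
    start: float       # scheduled output time, in seconds from a common origin
    duration: float    # playback time, in seconds
    content_tag: str   # tag describing what the voice conveys


def build_notification(now, schedule):
    """Return notification fields for the next automatic voice, or None.

    Assumes `now` lies in a blank time, i.e. no automatic voice is playing.
    """
    upcoming = [v for v in schedule if v.start >= now]
    if not upcoming:
        return None
    nxt = min(upcoming, key=lambda v: v.start)
    return {
        "seconds_until_next": nxt.start - now,  # remaining blank time
        "next_duration": nxt.duration,          # playback time of the next voice
        "next_content": nxt.content_tag,        # what the next voice will say
    }
```

A display near the microphone could render these three fields so that the speaker sees how long the gap is and whether the planned announcement would duplicate the next automatic voice.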

[4-2. Variations of the output mode]
In the embodiment described above, the determination unit 134 determines the output mode by adjusting the speed of the voice information or by adjusting the portion to be played back. The determination unit 134 may also adopt other kinds of output mode.

For example, the acquisition unit 131 acquires an automatic voice in which a priority is assigned to each clause. The determination unit 134 then determines the output mode so that clauses are extracted, in descending order of playback order, from among the clauses that are included in the voice information and for which a playback order is set in advance, and the extracted clauses are output in combination. The clauses may be set arbitrarily by, for example, the person who creates the automatic voice. Specifically, suppose the acquisition unit 131 acquires an automatic voice with the content "A train bound for Shibuya will soon arrive at Track 1" (「まもなく１番線に渋谷行きの電車が参ります。」), in which the clauses "at Track 1" (「１番線に」), "bound for Shibuya" (「渋谷行き」), and "will arrive" (「参ります」) have relatively high priority, and the clauses "soon" (「まもなく」) and "a train" (「の電車が」) have relatively low priority. In this case, when, for example, the blank time is shorter than the predetermined time, the determination unit 134 determines to output the automatic voice with the low-priority clauses omitted. In this example, the determination unit 134 omits the clauses "soon" and "a train" and determines to output the automatic voice with the content "Bound for Shibuya, arriving at Track 1" (「１番線に渋谷行き参ります。」).

In this way, the determination unit 134 may determine the portion to be played back according to the priority of each clause. This allows the determination unit 134 to shorten the playback time while still outputting the voice information without losing the content to be conveyed. The clauses to be omitted do not necessarily have to be set in advance. For example, the determination unit 134 may apply machine learning to the automatic voice and automatically assign a low priority to portions whose omission does not impair the meaning.
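The clause-based shortening above can be sketched in Python as follows. This is only an illustration under stated assumptions: the `Clause` record, the per-clause durations, the convention that a larger number means a higher priority, and the greedy fitting rule in `select_clauses` are all hypothetical and are not specified in the patent.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Clause:
    text: str
    priority: int    # larger means more important (assumed convention)
    seconds: float   # assumed playback duration of this clause


def select_clauses(clauses, blank_seconds):
    """Pick clauses in descending priority while they fit in the blank time,
    then return the kept clauses in their original spoken order."""
    by_priority = sorted(range(len(clauses)), key=lambda i: -clauses[i].priority)
    kept, total = set(), 0.0
    for i in by_priority:
        if total + clauses[i].seconds <= blank_seconds:
            kept.add(i)
            total += clauses[i].seconds
    return [clauses[i] for i in sorted(kept)]


# The announcement from the text, with made-up durations.
announcement = [
    Clause("まもなく", 1, 0.8),
    Clause("１番線に", 3, 1.0),
    Clause("渋谷行き", 3, 1.0),
    Clause("の電車が", 1, 0.9),
    Clause("参ります", 3, 0.9),
]
shortened = "".join(c.text for c in select_clauses(announcement, 3.0))
```

With a 3-second blank time only the three high-priority clauses fit, so `shortened` reproduces the abbreviated announcement from the example. With a long enough blank time all clauses are kept.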

The determination unit 134 may also decide whether to shorten voices for playback based on the overall share of time occupied by automatic voices. Taking a station platform such as the one in FIG. 1 as an example, the share of time occupied by automatic voices is expected to be high in time slots such as the morning and the evening. In other words, blank times are expected to be short overall in those time slots. The determination unit 134 may therefore determine, for example, that all voices are shortened for playback in time slots in which automatic voices occupy a large share of the time. This lets the determination unit 134 secure as much blank time as possible, so that collisions of voice information can be prevented overall.
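The occupancy-based decision can be sketched as below. The function names, the `(start, duration)` schedule format, and the 0.3 threshold are assumptions made for illustration; the patent does not give a formula or a threshold value.

```python
def occupancy_ratio(schedule, window_start, window_end):
    """Fraction of [window_start, window_end) covered by automatic voices.

    schedule: iterable of (start, duration) pairs in seconds,
    assumed not to overlap one another.
    """
    if window_end <= window_start:
        raise ValueError("window must have positive length")
    covered = 0.0
    for start, duration in schedule:
        lo = max(start, window_start)            # clip to the window
        hi = min(start + duration, window_end)
        if hi > lo:
            covered += hi - lo
    return covered / (window_end - window_start)


def should_shorten(schedule, window_start, window_end, threshold=0.3):
    """Shorten all voices when automatic voices occupy more than `threshold`
    of the window (the threshold value is an arbitrary example)."""
    return occupancy_ratio(schedule, window_start, window_end) > threshold
```

For instance, a 10-minute rush-hour window with 230 seconds of scheduled automatic voice has an occupancy of about 0.38 and would trigger shortening under this example threshold.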

[4-3. Applications of the determination process]
The embodiment described above shows an example of preventing collisions of voice information on a station platform. The determination process shown in that embodiment, however, can be applied to a variety of situations.

For example, the determination process according to the embodiment can be used in meetings in which multiple people participate. In a meeting, a speaker speaks into a microphone, and the participants listen to the speech through earphones, loudspeakers, or the like. For example, the determination device 100 assigns the highest priority to the chairperson's microphone and, when other speakers speak, determines the output mode so that voices are output in descending order of each speaker's priority. The determination device 100 may also change each speaker's priority dynamically, for example according to the number of utterances. For example, the determination device 100 may gradually lower the priority of a speaker who speaks often, so that utterances are not concentrated on a single speaker.
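One possible way to lower a frequent speaker's priority is sketched below. The linear penalty, the floor at zero, and the names `effective_priority` and `output_order` are hypothetical; the patent only states that priority may change with the number of utterances and that the chairperson has the highest priority.

```python
def effective_priority(base_priority, utterance_count, penalty=1, is_chair=False):
    """Priority after accounting for how often the speaker has already spoken.

    The chairperson always outranks everyone; other speakers lose `penalty`
    per past utterance, never dropping below zero (assumed rule).
    """
    if is_chair:
        return float("inf")
    return max(0, base_priority - penalty * utterance_count)


def output_order(speakers):
    """Order simultaneous speakers from highest to lowest effective priority.

    speakers: list of dicts with keys "name", "base_priority", "utterances",
    and optionally "is_chair".
    """
    ranked = sorted(
        speakers,
        key=lambda s: -effective_priority(
            s["base_priority"], s["utterances"], is_chair=s.get("is_chair", False)
        ),
    )
    return [s["name"] for s in ranked]
```

Under this rule a participant with a high base priority who has already spoken several times can be ranked below a quieter participant, which matches the stated aim of spreading utterances across speakers.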

The determination process according to the embodiment may also be used for online calls in which multiple people participate. With this process, even when multiple speakers input voice to their terminal devices at the same time, the utterances are output one after another. Because collisions of voices are prevented, participants can hear each voice more easily. In addition, because what each speaker says is recorded, a speaker can speak without waiting for a turn. Speakers may also be allowed to select their own priority.

The determination process according to the embodiment may also be used for the speech processing of, for example, IoT (Internet of Things) devices, smart devices, and robots. When used for a robot's speech, for example, the robot holds low-priority information, such as periodic maintenance information, as automatic voices to be spoken periodically. The robot also holds high-priority notification information (such as a failure notice) as automatic voices to be spoken in an emergency. If the time to speak the periodic maintenance information arrives in the middle of a conversation and the conversation has the higher priority, the robot delays speaking the maintenance information. In contrast, if high-priority information such as a failure arrives in the middle of a conversation, the robot interrupts the conversation and speaks the failure notice.

The determination process according to the embodiment may also be applied to, for example, automatic announcements used in the home. In that case, a high priority is set for announcements of urgent situations, such as food burning on the stove, and a low priority is set for non-urgent announcements, such as a parcel arriving within the next few hours. With the determination process according to the embodiment, even when two such announcements collide, the urgent announcement is selected automatically and output preferentially.

The determination process according to the embodiment may also be used in navigation software such as a car navigation application. In that case, a high priority is set for announcements of urgent situations, such as an instruction to turn right at the next intersection or a warning that the vehicle may run out of gasoline, and a low priority is set for non-urgent announcements, such as a good restaurant nearby. Even if two announcements are set to be output when the vehicle passes a certain point, the determination process according to the embodiment outputs the urgent announcement, such as the instruction to turn right at the next intersection, preferentially.

In the determination process according to the embodiment, the priority may also be set according to whether the content to be conveyed is addressed to everyone or to an individual, for example in a relatively large place such as an airport. For example, a high priority may be set for announcements addressed to all passengers, and a relatively low priority may be set for announcements addressed to an individual.

[5. Other embodiments]
The embodiment described above is merely an example, and the present invention includes the embodiments exemplified below as well as other embodiments. For example, the functional configurations, data structures, and the order and content of the processes shown in the flowcharts in this application are merely examples, and the presence or absence of each element, its arrangement, the order of process execution, and the specific content can be changed as appropriate. For example, besides being implemented by the determination device 100 as illustrated in the embodiment, the determination process described above can also be implemented as a device, method, or program in a cloud system.

It is also common to implement each of the processing units 131 to 135 of the determination device 100, and each of the processing units 131 to 136 of the determination device 200, as separate, independent devices. Similarly, the configuration of the present invention can be changed flexibly, for example by implementing the means shown in the embodiment through calls to an external platform via an API (application program interface) or network computing (so-called cloud computing). Furthermore, each element of the present invention, such as the means described above, may be implemented not only by the arithmetic control unit of a computer but also by another information processing mechanism such as a physical electronic circuit.

[6. Effects]
As described above, the determination device 100 acquires voice information and assigns a priority to the acquired voice information. When outputting voice information, the determination device 100 detects a blank time, which is a time period in which pieces of voice information do not collide. The determination device 100 then determines the mode of outputting the voice information based on the detected blank time and the assigned priority. As a result, the determination device 100 can preserve the content of the information to be conveyed while appropriately avoiding collisions of voices according to the situation.

The determination device 100 also determines the playback speed of the voice information to be output according to the length of the blank time. As a result, the determination device 100 can output the voice information without a collision even when the blank time is shorter than the voice information to be output.
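The speed adjustment can be expressed as a simple ratio, sketched below. The function name `playback_speed` and the upper limit of 1.5 times normal speed are assumptions chosen for illustration; the patent does not state a maximum speed.

```python
def playback_speed(duration, blank_time, max_speed=1.5):
    """Return the speed factor needed to fit `duration` seconds of voice into
    `blank_time` seconds, or None if that would exceed `max_speed`.

    max_speed is an assumed intelligibility limit, not a value from the patent.
    """
    if duration <= 0 or blank_time <= 0:
        raise ValueError("duration and blank_time must be positive")
    if duration <= blank_time:
        return 1.0                      # fits at normal speed
    speed = duration / blank_time       # e.g. 12 s into 10 s needs 1.2x
    return speed if speed <= max_speed else None
```

When `None` is returned, a caller could fall back to another output mode, such as selecting only part of the voice information.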

The determination device 100 also selects a playback portion of the voice information according to the length of the blank time and determines to output only the selected portion. As a result, the determination device 100 can output the voice information without a collision even when the blank time is shorter than the voice information to be output.

The determination device 100 also extracts clauses, in descending order of playback order, from among the clauses that are included in the voice information and for which a playback order is set in advance, and determines to output the extracted clauses in combination. As a result, even when the blank time is shorter than the voice information to be output, the determination device 100 can output the voice information without a collision and without losing the content to be conveyed.

When a priority higher than a predetermined threshold is assigned to voice information, the determination device 100 determines to output that voice information preferentially regardless of the blank time. As a result, even in a situation where voices would collide, the determination device 100 can preferentially output time-critical content such as an urgent announcement without letting it collide with other voice information.
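The interplay between the priority threshold and the blank time can be summarized in a small decision function. This is a simplified sketch: the function name, the returned labels, the threshold value of 8, and the comparison of the blank time against the voice duration are all assumptions made for illustration.

```python
def decide_output(priority, duration, blank_time, urgent_threshold=8):
    """Choose an output mode for one piece of voice information.

    priority above `urgent_threshold` -> output regardless of the blank time;
    otherwise output as-is if it fits, or adapt it (speed, portion, clauses).
    """
    if priority > urgent_threshold:
        return "output_immediately"
    if blank_time >= duration:
        return "output_as_is"
    return "adapt_to_blank_time"
```

The "adapt_to_blank_time" branch corresponds to the speed change, portion selection, or clause selection described in this section.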

The determination device 100 also acquires an automatic voice, which is voice information whose output timing is preset based on a schedule, and detects the blank time based on the schedule set for the automatic voice. As a result, the determination device 100 can grasp the blank time accurately and can therefore appropriately determine the mode of outputting the acquired voice information.
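Detecting blank times from a schedule amounts to finding the gaps between scheduled intervals. The sketch below assumes the schedule is a list of `(start, duration)` pairs in seconds and that blank times are sought within a given window; the function name and data layout are hypothetical.

```python
def find_blank_times(schedule, window_start, window_end):
    """Return (start, end) gaps in [window_start, window_end) that are not
    covered by any scheduled automatic voice.

    schedule: iterable of (start, duration) pairs in seconds; entries may
    overlap or be unsorted.
    """
    gaps = []
    cursor = window_start                # earliest time not yet known to be covered
    for start, duration in sorted(schedule):
        end = start + duration
        if end <= cursor:
            continue                     # entirely before the cursor
        if start >= window_end:
            break                        # entirely after the window
        if start > cursor:
            gaps.append((cursor, start)) # uncovered stretch before this voice
        cursor = max(cursor, end)
    if cursor < window_end:
        gaps.append((cursor, window_end))
    return gaps
```

Each returned gap can then be compared with the duration of a waiting announcement to decide how it should be output.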

When voice information having a higher priority than an automatic voice is acquired, the determination device 100 determines to output the higher-priority voice information regardless of the schedule. As a result, the determination device 100 can avoid situations in which time-critical content such as an urgent announcement becomes hard to hear because of a collision of voices.

When the determination device 100 has determined to output voice information having a higher priority than an automatic voice while other voice information is being output, it determines to output the higher-priority voice information preferentially and to stop the output of the other voice information. As a result, the determination device 100 can avoid situations in which time-critical content such as an urgent announcement becomes hard to hear because of a collision.

When the output of other voice information has been stopped, the determination device 100 determines to output that voice information again after the output of the higher-priority voice information ends, either from the vicinity of the playback position at which it was stopped or from its beginning. As a result, even when an urgent announcement or the like interrupts, the determination device 100 can output the interrupted automatic voice or other voice information without losing the content it was originally meant to convey.

The determination device 100 also acquires voice information by accepting voice input from a speaker via an input device, and assigns a priority to the voice information based on a setting in the input device. As a result, the determination device 100 can appropriately assign a priority even to voice information that is acquired irregularly rather than as an automatic voice.

The determination device 100 also stores, in the storage unit 120, voice information that has been input by a speaker and to which a priority has been assigned. When a blank time is detected, the determination device 100 determines the mode of outputting the voice information stored in the storage unit 120 based on the priority. By temporarily recording voices in this way, the determination device 100 can output the voice information in an appropriate order even when a collision of voices occurs.

When the determination device 100 acquires two or more pieces of voice information that were input separately by multiple speakers and to which priorities have been assigned, it stores the pieces other than the one with the highest priority in the storage unit. The determination device 100 then determines to output the piece with the highest priority and, after that output ends, to output one of the pieces stored in the storage unit. In this way, the determination device 100 may output high-priority voice information immediately and record the rest. As a result, the determination device 100 can output the voice information in an appropriate order.
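The store-and-output behavior for simultaneous inputs can be sketched with a priority queue. The `WaitingVoices` class and `handle_simultaneous` function are hypothetical names; the sketch uses Python's standard `heapq` module and assumes that among waiting voices the highest priority is output first, with earlier arrivals winning ties.

```python
import heapq
import itertools


class WaitingVoices:
    """Holds recorded voice information until a blank time is available."""

    def __init__(self):
        self._heap = []
        self._counter = itertools.count()  # tie-breaker: earlier arrival first

    def store(self, priority, voice):
        # heapq is a min-heap, so negate the priority to pop the highest first.
        heapq.heappush(self._heap, (-priority, next(self._counter), voice))

    def next_voice(self):
        """Return the highest-priority waiting voice, or None if empty."""
        if not self._heap:
            return None
        return heapq.heappop(self._heap)[2]


def handle_simultaneous(voices, waiting):
    """Output the highest-priority voice now and store the rest.

    voices: list of (priority, voice) pairs received at the same time.
    Returns the voice to output immediately, or None if `voices` is empty.
    """
    if not voices:
        return None
    best = max(range(len(voices)), key=lambda i: voices[i][0])
    for i, (priority, voice) in enumerate(voices):
        if i != best:
            waiting.store(priority, voice)
    return voices[best][1]
```

After the immediate output ends, calling `next_voice()` on each detected blank time drains the stored voices in priority order.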

The determination device 200 also generates notification information that reports information on the detected blank time, the playback time of predetermined voice information scheduled to be output after the blank time elapses, and the playback content of that voice information. As a result, the determination device 200 can keep a speaker from making an announcement with duplicate content, and can thus avoid collisions with automatic voices and recorded voice information.

Among the processes described in the embodiment and modifications above, all or part of the processes described as being performed automatically can also be performed manually, and all or part of the processes described as being performed manually can also be performed automatically by known methods. In addition, the processing procedures, specific names, and information including various data and parameters shown in the above description and in the drawings can be changed arbitrarily unless otherwise specified. For example, the various kinds of information shown in each drawing are not limited to the information illustrated.

The components of each illustrated device are functional and conceptual, and do not necessarily have to be physically configured as illustrated. That is, the specific form of distribution and integration of each device is not limited to the illustrated one, and all or part of each device can be functionally or physically distributed or integrated in arbitrary units according to various loads, usage conditions, and the like.

The embodiment and modifications described above can be combined as appropriate to the extent that their processing does not conflict.

The term "unit" (section, module, unit) used above can be read as "means" or "circuit". For example, the acquisition unit can be read as acquisition means or an acquisition circuit.

1 Determination processing system
100 Determination device
121 Voice information storage unit
122 Automatic voice table
123 Announcement table
125 Output information storage unit
126 Schedule table
127 Output waiting table
130 Control unit
131 Acquisition unit
132 Assigning unit
133 Detection unit
134 Determination unit
135 Transmission unit
136 Generation unit
N Network

Claims

A determination device comprising:
an acquisition unit that acquires voice information including an automatic voice whose output timing is preset based on a schedule;
an assigning unit that assigns a priority to the voice information acquired by the acquisition unit;
a detection unit that detects, based on the schedule set for the automatic voice, a blank time, which is a time period in which pieces of voice information do not collide when the voice information is output; and
a determination unit that determines a mode of outputting the voice information based on the blank time detected by the detection unit and the priority assigned by the assigning unit.

The determination device according to claim 1, wherein
the determination unit determines a playback speed of the voice information to be output in accordance with the length of the blank time.

The determination device according to claim 1 or 2, wherein
the determination unit selects a playback portion of the voice information in accordance with the length of the blank time and determines to output only the selected playback portion.

The determination device according to claim 3, wherein
the determination unit extracts clauses, in descending order of playback order, from among clauses that are included in the voice information and for which a playback order is set in advance, and determines to output the extracted clauses in combination.

The determination device according to any one of claims 1 to 4, wherein
when a priority higher than a predetermined threshold is assigned to the voice information by the assigning unit, the determination unit determines to output the voice information preferentially regardless of the blank time.

The determination device according to any one of claims 1 to 5, wherein
when voice information having a higher priority than the automatic voice is acquired, the determination unit determines to output the higher-priority voice information regardless of the schedule.

The determination device according to claim 6, wherein
when the determination unit has determined to output the voice information having a higher priority than the automatic voice while other voice information is being output, the determination unit determines to output the higher-priority voice information preferentially and to stop the output of the other voice information.

The determination device according to claim 7, wherein
when the output of the other voice information has been stopped, the determination unit determines to output the other voice information again, after the output of the higher-priority voice information ends, either from the vicinity of the playback position at which the output of the other voice information was stopped or from the beginning of the other voice information.

The determination device according to any one of claims 1 to 8, wherein
the acquisition unit acquires the voice information by accepting voice input from a speaker via an input device, and
the assigning unit assigns the priority to the voice information based on a setting in the input device.

The determination device according to claim 9, wherein
the acquisition unit stores, in a storage unit, the voice information that is input from the speaker and to which the priority has been assigned by the assigning unit, and
the determination unit determines, when the blank time is detected, a mode of outputting the voice information stored in the storage unit based on the priority.

The determination device according to claim 10, wherein
when the acquisition unit acquires two or more pieces of voice information that are input separately from a plurality of speakers and to which priorities have been assigned by the assigning unit, the acquisition unit stores the pieces of voice information other than the piece with the highest priority in the storage unit, and
the determination unit determines to output the piece of voice information with the highest priority among the two or more pieces and, after the output of that piece ends, to output one of the pieces of voice information stored in the storage unit.

The determination device according to any one of claims 1 to 11, further comprising
a generation unit that generates notification information reporting information on the blank time detected by the detection unit, a playback time of predetermined voice information scheduled to be output after the blank time elapses, and playback content of the predetermined voice information.

A determination method executed by a computer, the method comprising:
an acquisition step of acquiring voice information including an automatic voice whose output timing is preset based on a schedule;
an assigning step of assigning a priority to the voice information acquired in the acquisition step;
a detection step of detecting, based on the schedule set for the automatic voice, a blank time, which is a time period in which pieces of voice information do not collide when the voice information is output; and
a determination step of determining a mode of outputting the voice information based on the blank time detected in the detection step and the priority assigned in the assigning step.

A determination program that causes a computer to execute:
an acquisition procedure of acquiring voice information including an automatic voice whose output timing is preset based on a schedule;
an assigning procedure of assigning a priority to the voice information acquired in the acquisition procedure;
a detection procedure of detecting, based on the schedule set for the automatic voice, a blank time, which is a time period in which pieces of voice information do not collide when the voice information is output; and
a determination procedure of determining a mode of outputting the voice information based on the blank time detected in the detection procedure and the priority assigned in the assigning procedure.