JP2014170134A

JP2014170134A - Outdoor environmental sound transmitting device, and outdoor environmental sound transmitting system

Info

Publication number: JP2014170134A
Application number: JP2013042328A
Authority: JP
Inventors: Yoichi Suzuki; 陽一鈴木; Shuichi Sakamoto; 修一坂本; Masayuki Morimoto; 政之森本; Zhenglie Cui; 正烈崔; Hayato Sato; 逸人佐藤
Original assignee: Tohoku University NUC
Current assignee: Tohoku University NUC
Priority date: 2013-03-04
Filing date: 2013-03-04
Publication date: 2014-09-18

Abstract

PROBLEM TO BE SOLVED: To provide an outdoor environmental sound transmitting device for transmitting a sound message that is easy to hear for local residents outdoors.SOLUTION: An outdoor environmental sound transmitting device 100 for transmitting a sound message for disaster prevention to one or more branch stations 200 laid in an area comprises: sound message obtaining means for obtaining the sound message; message division means 33 for dividing the sound message into meaningful symbol strings or obtaining the sound message divided into the meaningful symbol strings; symbol string difficulty level determination means 34 for determining a difficulty level in people's understanding of meaning of the symbol strings for each of the symbol strings; message creating means 36 for converting a high difficulty level symbol string determined, by the symbol string difficulty level determination means, to have a difficulty level of threshold or more, into a synonym having a difficulty level less than the threshold, to create a post-conversion message; and message transmission means 17 for transmitting the post-conversion message to the branch stations.

Description

本発明は、防災用の音声メッセージを地域に敷設された１つ以上の子局に伝達する屋外環境音声伝達装置に関する。 The present invention relates to an outdoor environment voice transmission device that transmits a voice message for disaster prevention to one or more slave stations laid in an area.

自然現象や人為的原因により様々な災害が生じている。行政や地域団体などは人命に影響を及ぼすおそれがある災害の発生の有無を監視しており、災害が発生した場合には地域住民を適切に避難誘導する。災害の発生や避難要請は、テレビ放送、ラジオ放送、移動体通信、及び、屋外拡声器などで地域住民に伝達される（例えば、特許文献１参照。）。 Various disasters are caused by natural phenomena and human factors. Governments and local organizations monitor the occurrence of disasters that could affect human lives, and appropriately evacuate local residents when disasters occur. Occurrence of disasters and evacuation requests are transmitted to local residents through television broadcasting, radio broadcasting, mobile communication, outdoor loudspeakers, and the like (see, for example, Patent Document 1).

伝達される情報として発生した災害の内容や避難先などを伝達する音声メッセージが付随する場合がある。例えば、地震速報では「気象庁緊急地震速報です。地震が発生しました。身の安全を確保してください。」という音声メッセージが伝達され、津波警報では「津波による被害が発生します。沿岸部や川沿いにいる人はただちに高台や避難ビルなど安全な場所へ避難してください。」という音声メッセージが伝達され、津波注意報では「海の中や海岸付近は危険です。海の中にいる人はただちに海から上がって、海岸から離れてください。」という音声メッセージが伝達される。地域住民は音声メッセージの内容を把握することで速やかに適切な行動を取ることができる。 There may be a case where a voice message that conveys the contents of a disaster that occurred or the evacuation destination is accompanied as information to be transmitted. For example, in the earthquake bulletin, the voice message “Meteorological Agency emergency earthquake bulletin. Earthquake has occurred. Please ensure your safety.” The tsunami warning says “Tsunami damage will occur. A voice message saying “People along the river should evacuate to safe places such as hills and evacuation buildings immediately.” Is transmitted, and the tsunami warning says, “The sea and the coast are dangerous. People in the sea. You should get up from the sea and leave the coast immediately. " Local residents can quickly take appropriate actions by grasping the contents of voice messages.

しかしながら、伝達手法の１つである屋外拡声器による音声メッセージは、地域住民の環境によっては地域住民が聴き取れない場合があるという問題がある。例えば、地域住民が屋内に滞在してテレビなども視聴していない場合、音声メッセージが遮断されるため、地域住民は音声メッセージの内容を把握できない。このような状況の地域住民に対しては、従来から、屋外拡声器による音声メッセージの伝達を補足する技術が考案されている（例えば、特許文献２参照。）。特許文献２には、住宅内に設置された住警器が親機と通信して、住宅内の住民に緊急地震速報等を報知する防災警報連携システムが開示されている。 However, there is a problem that the voice message by the outdoor loudspeaker, which is one of the transmission methods, may not be heard by the local resident depending on the local resident environment. For example, when a local resident stays indoors and does not watch TV, the voice message is blocked, and the local resident cannot grasp the content of the voice message. For local residents in such a situation, a technique for supplementing the transmission of voice messages by an outdoor loudspeaker has been conventionally devised (see, for example, Patent Document 2). Patent Document 2 discloses a disaster prevention alarm linkage system in which a home guard installed in a house communicates with a master unit to notify an inhabitant in the house of an earthquake early warning or the like.

特開２００１−２７３４５５号公報JP 2001-273455 A 特開２０１３−０２９９４４号公報JP 2013-029944 A

しかしながら、地域住民が屋外にいる場合でも、普段の会話などとは異なる拡声音が伝達されるため、地域住民が音声メッセージを聴き取れない場合があるという問題がある。また、地域側の環境にも影響され、例えば、反響が大きい地域や騒音が多い地域では、地域住民が音声メッセージを聴き取れない場合があるという問題がある。 However, even when the local residents are outdoors, there is a problem that the local residents may not be able to hear the voice message because a loud sound different from the usual conversation is transmitted. In addition, there is a problem that local residents may not be able to listen to voice messages in areas where the response is large or where there is a lot of noise.

本発明は、上記課題に鑑み、屋外の地域住民が聴き取りやすい音声メッセージを伝達する屋外環境音声伝達装置を提供することを目的とする。 In view of the above problems, an object of the present invention is to provide an outdoor environment audio transmission device that transmits an audio message that can be easily heard by an outdoor local resident.

本発明は、防災用の音声メッセージを地域に敷設された１つ以上の子局に伝達する屋外環境音声伝達装置であって、前記音声メッセージを取得する音声メッセージ取得手段と、前記音声メッセージ取得手段が取得した前記音声メッセージを意味のある記号列に分割するか、又は、外部に前記音声メッセージの前記記号列への分割を依頼するメッセージ分割手段と、前記メッセージ分割手段が分割した各記号列毎に、人が意味を理解する際の難易度を決定する記号列難易度決定手段と、前記記号列難易度決定手段により閾値以上の難易度に決定された高難易度記号列を、難易度が閾値未満の同意語に置き換えることで変換後メッセージを作成するメッセージ作成手段と、前記変換後メッセージを子局に送信するメッセージ送信手段と、を有することを特徴とする。 The present invention is an outdoor environment voice transmission device that transmits a voice message for disaster prevention to one or more slave stations laid in an area, the voice message acquisition means for acquiring the voice message, and the voice message acquisition means Dividing the voice message acquired by the message into meaningful symbol strings or requesting the outside to divide the voice message into the symbol strings, and for each symbol string divided by the message dividing means In addition, symbol string difficulty determining means for determining the difficulty level when a person understands the meaning, and a high difficulty symbol string determined by the symbol string difficulty determining means to have a difficulty level equal to or higher than a threshold, Message creation means for creating a converted message by replacing with a synonym less than a threshold, and message transmission means for sending the converted message to a slave station And wherein the door.

屋外の地域住民が聴き取りやすい音声メッセージを伝達する屋外環境音声伝達装置を提供することができる。 It is possible to provide an outdoor environment audio transmission device that transmits an audio message that can be easily heard by local residents outside.

本実施形態の防災無線システムの概略的な動作を説明する図の一例である。It is an example of the figure explaining schematic operation | movement of the disaster prevention radio | wireless system of this embodiment. 防災無線の全体的な仕組みの構成例を示す図である。It is a figure which shows the structural example of the whole mechanism of a disaster prevention radio | wireless. 地方公共団体に配置されている防災無線システムのブロック図の一例である。It is an example of the block diagram of the disaster prevention radio system arrange | positioned at a local public entity. 音声変換装置のハードウェア構成図の一例である。It is an example of the hardware block diagram of a speech converter. 音声変換装置の機能ブロック図の一例である。It is an example of the functional block diagram of a voice converter. 難易度別単語データベース、難易度別同意語データベースの一例を示す図である。It is a figure which shows an example of the word database classified by difficulty and a synonym database classified by difficulty. 音声変換装置の機能ブロック図の別の一例である。It is another example of the functional block diagram of a speech converter. 音声変換装置の動作手順を示すフローチャート図の一例である。It is an example of the flowchart figure which shows the operation | movement procedure of a speech converter. 防災無線システムの概略的な動作を説明する図の一例である（実施例２）。(Example 2) which is an example of the figure explaining schematic operation | movement of a disaster prevention radio | wireless system. 防災無線システムのブロック図の一例である（実施例２）。It is an example of the block diagram of a disaster prevention radio system (Example 2). 制御部の動作手順を示すフローチャート図の一例である（実施例２）。It is an example of the flowchart figure which shows the operation | movement procedure of a control part (Example 2). 防災無線システムの概略的な動作を説明する図の一例である（実施例３）。(Example 3) which is an example of the figure explaining schematic operation | movement of a disaster prevention radio | wireless system. エコーによる音声の了解度試験の実験結果を示す図である。It is a figure which shows the experimental result of the speech intelligibility test by echo. 防災無線システムのブロック図の一例である（実施例３）。It is an example of the block diagram of a disaster prevention radio system (Example 3). 子局におけるエコーの観測結果を説明する図の一例である。It is an example of the figure explaining the observation result of the echo in a sub_station | mobile_unit. 制御部の動作手順を示すフローチャート図の一例である（実施例３）。FIG. 10 is an example of a flowchart illustrating an operation procedure of a control unit (third embodiment).

以下、図面を参照して本発明の実施形態について説明する。しかしながら、本発明の技術的範囲が、本実施の形態に限定されるものではない。 Hereinafter, embodiments of the present invention will be described with reference to the drawings. However, the technical scope of the present invention is not limited to this embodiment.

図１は、本実施形態の防災無線システムの概略的な動作を説明する図の一例である。地方公共団体には防災無線システム１００（特許請求の範囲の屋外環境音声伝達装置の一例である）が配置されており、防災無線システム１００と子局２００は主に無線通信により通信可能な状態が維持されている。 FIG. 1 is an example of a diagram illustrating a schematic operation of the disaster prevention radio system according to the present embodiment. The local government has a disaster prevention radio system 100 (which is an example of an outdoor environment audio transmission device in the scope of claims), and the disaster prevention radio system 100 and the slave station 200 can communicate with each other mainly by radio communication. Maintained.

災害の発生時には、防災無線システム１００は発生した災害内容に応じた音声メッセージを選択したり、ひな形を編集したりするなどして音声メッセージを作成する。この音声メッセージをオリジナルメッセージと称することとする。本実施例の防災無線システム１００は、オリジナルメッセージを人間が聴き取りやすい音声メッセージに変換することが特徴の１つとなっている。
(i) 災害が発生し「高台に避難してください」というオリジナルメッセージが用意された。
(ii)音声変換装置３０は、オリジナルメッセージから難易語を検出する。難易語とは人間が聴き取りにくい単語又はフレーズである。ここでは「高台」「避難し」という難易語が検出されたものとする。
(iii)音声変換装置３０は、検出された難易語を、同意義の非難易語に置換する。例えば、「高台」と同意義の非難易語として「高い所」が、「避難し」と同意義の非難易語として「逃げて」が取得できたとする。音声変換装置３０は、「高台」「避難し」をそれぞれ以下のように変換する。
「高台」→「高い所」
「避難し」→「逃げ」
(iv) 音声変換装置３０は、オリジナルメッセージの難易語が非難易語に置換された変換後メッセージを作成する。したがって、変換後メッセージは「高い所に逃げてください」となる。
(v) 防災無線システム１００が子局２００に変換後メッセージを送信すると、子局２００がスピーカ５３から「高い所に逃げてください」という音声メッセージを拡声する。 When a disaster occurs, the disaster prevention radio system 100 creates a voice message by selecting a voice message according to the content of the disaster that occurred or editing a template. This voice message is referred to as an original message. One feature of the disaster prevention wireless system 100 according to the present embodiment is that the original message is converted into a voice message that can be easily heard by humans.
(i) A disaster occurred and an original message “Please evacuate to the hill” was prepared.
(ii) The voice conversion device 30 detects a difficult word from the original message. Difficult words are words or phrases that are difficult for humans to hear. Here, it is assumed that the difficult words “high ground” and “evacuate” are detected.
(iii) The voice conversion device 30 replaces the detected difficult word with a non-difficult word having the same meaning. For example, it is assumed that “high place” can be acquired as an easy word of the same meaning as “high ground” and “escape” can be acquired as an easy word of the same meaning as “evacuation”. The voice conversion device 30 converts “high ground” and “evacuation” as follows.
"High ground" → "High place"
"Evacuate" → "Escape"
(iv) The voice conversion device 30 creates a post-conversion message in which the difficult word of the original message is replaced with a non-easy word. Therefore, the message after conversion is "Please escape to a high place".
(v) When the disaster prevention radio system 100 transmits the converted message to the slave station 200, the slave station 200 amplifies a voice message “Please escape to a high place” from the speaker 53.

したがって、地域住民は、屋外にいても聴き取りやすい音声メッセージを聴覚で知覚することができ、周囲の騒音が大きくても、また、反響によるエコーが発生する場所にいても、音声メッセージの内容を把握し速やかに適切な行動を取ることができる。 Therefore, local residents can perceive a voice message that is easy to hear even when outdoors, and the contents of the voice message can be read even if the surrounding noise is high or the echo is caused by echo. You can grasp and take appropriate action promptly.

なお、音声メッセージの言語は日本語である必要はなく、本実施形態の防災無線システム１００は英語、中国語、韓国語など言語に依存せず適用できる。 Note that the language of the voice message need not be Japanese, and the disaster prevention wireless system 100 of the present embodiment can be applied without depending on the language such as English, Chinese, or Korean.

図２は、防災無線の全体的な仕組みの構成例を示す図である。このような仕組みは、屋外環境音声伝達システムや防災行政無線システムとよばれる。国は、気象庁や消防庁などが入手した早急に対処すべき事態に対する情報を市区町村などの地方公共団体に伝達するシステムを構築している。国と地方公共団体はインターネットや人工衛星通信を介して通信可能である。気象庁などが検出し伝達する情報としては、緊急地震速報、津波警報、津波注意報、噴火警報、噴火予報、東海地震予知情報、震度速報、気象情報、土砂災害警戒情報、竜巻注意情報、記録的短時間大雨情報、指定河川洪水予報、テロやミサイルなどの国民保護情報等がある。以下、これらを区別せずに「緊急情報」という。 FIG. 2 is a diagram illustrating a configuration example of the overall mechanism of the disaster prevention radio. Such a mechanism is called an outdoor environment audio transmission system or a disaster prevention radio system. The national government has established a system for transmitting information to the local governments, such as municipalities, on information that should be dealt with immediately, such as those obtained by the Japan Meteorological Agency or the Fire and Disaster Management Agency. National and local governments can communicate via the Internet and satellite communications. Information detected and transmitted by the Japan Meteorological Agency includes emergency earthquake warnings, tsunami warnings, tsunami warnings, eruption warnings, eruption forecasts, Tokai earthquake prediction information, seismic intensity warnings, weather information, landslide disaster warning information, tornado warning information, record There are short-term heavy rain information, designated river flood forecasts, national protection information such as terrorism and missiles. Hereinafter, these are referred to as “emergency information” without distinction.

地方公共団体は、人工衛星１０１から電波を受信する受信アンテナ１０２やインターネット（後述するネットワーク２４）に接続するための通信装置を有しており、人が介在しなくても緊急情報を常に受信できる。 The local public entity has a receiving antenna 102 for receiving radio waves from the artificial satellite 101 and a communication device for connecting to the Internet (a network 24 described later), and can always receive emergency information without any human intervention. .

地方公共団体は、気象庁などから送信された音声メッセージを受信したり、事態に適切な音声メッセージを選択したりして、同報系の子局２００に伝達する（移動系の子局が同様に送信される）。子局２００は、地域住民が漏れなく音声メッセージを聴覚により知覚できるように地域に計画的に敷設されている。 The local government receives a voice message transmitted from the Japan Meteorological Agency, etc., or selects a voice message appropriate for the situation, and transmits it to the broadcast slave station 200 (the mobile slave station is also the same). Sent). The slave station 200 is systematically laid in the area so that local residents can perceive voice messages by hearing without omission.

なお、地方公共団体は、河川の水位の計測器や地震計などのテレメータ３００を有しており、テレメータ３００から送信されたデータを独自の基準で判別し、子局２００から音声メッセージを拡声することも可能である。 The local public entity has a telemeter 300 such as a river water level measuring instrument or a seismometer. The data transmitted from the telemeter 300 is discriminated based on its own standard, and the voice message is amplified from the slave station 200. It is also possible.

図３は、地方公共団体に配置されている防災無線システムのブロック図の一例を示す。操作卓２０は、ユーザ（地方公共団体の防災関係者）が音声メッセージを送信する際の操作を受け付け、親局通信装置２１を介して送信する音声メッセージを制御する。また、操作卓２０はネットワーク２４を介して気象庁などが送信する緊急情報又は音声メッセージを受信し、ユーザが操作しなくても親局通信装置２１を介して音声メッセージを送信する。 FIG. 3 shows an example of a block diagram of a disaster prevention radio system arranged in a local public entity. The console 20 receives an operation when a user (a disaster prevention person in a local public organization) transmits a voice message, and controls the voice message transmitted via the master station communication device 21. Further, the console 20 receives emergency information or a voice message transmitted by the Japan Meteorological Agency or the like via the network 24, and transmits a voice message via the master station communication device 21 without any user operation.

操作卓２０は、マイク１１、表示部１２、操作部１３、データ記憶部１４、制御部１５、及び、入出力Ｉ／Ｆ１６を有しており、また、制御部１５は親局通信装置２１と接続されている。 The console 20 includes a microphone 11, a display unit 12, an operation unit 13, a data storage unit 14, a control unit 15, and an input / output I / F 16, and the control unit 15 is connected to the master station communication device 21. It is connected.

制御部１５は、ＣＰＵ、ＲＡＭ、ＲＯＭ、Ｉ／Ｏなどを備えたマイコンなどの情報処理装置である。制御部１５は、ＣＰＵが実行するＯＳ(Operating System)やプログラムにより操作卓２０の全体を制御している。 The control unit 15 is an information processing apparatus such as a microcomputer provided with a CPU, RAM, ROM, I / O, and the like. The control unit 15 controls the entire console 20 by an OS (Operating System) or a program executed by the CPU.

マイク１１は、ユーザが発声した音声を集音し電気信号に変換する。電気信号は制御部１５により、所定の方式で符号化され音声信号に変換される。符号化方式は特に規定がないが標準的な符号加速度として２６．６ｋｂｐｓ、６．４ｋｂｐｓなどがある。音声信号はデータ記憶部１４に記憶される。 The microphone 11 collects the voice uttered by the user and converts it into an electrical signal. The electric signal is encoded by the control unit 15 by a predetermined method and converted into an audio signal. The encoding method is not particularly specified, but there are 26.6 kbps, 6.4 kbps, and the like as standard code acceleration. The audio signal is stored in the data storage unit 14.

表示部１２は、液晶などのフラットパネルディスプレイであり、タッチパネルを一体に装備していていることが好ましい。表示部１２には、操作メニューの他、外部から取り込んだ地図、カメラが撮影した映像などが表示される。 The display unit 12 is a flat panel display such as a liquid crystal, and is preferably equipped with a touch panel. In addition to the operation menu, the display unit 12 displays a map captured from the outside, a video taken by the camera, and the like.

操作部１３は、キーボードなどのハードキーの他、タッチパネルに表示されたソフトキー、及び、ポインティングデバイスなどである。操作部１３は、例えば子局２００から送信する音声メッセージの選択などを受け付ける。 The operation unit 13 is a hard key such as a keyboard, a soft key displayed on the touch panel, and a pointing device. The operation unit 13 accepts selection of a voice message transmitted from the slave station 200, for example.

データ記憶部１４はＨＤＤ（Hard Disk Drive）やＳＤＤ（Solid State Drive）であり、音声信号、音声メッセージのテキストデータ、プログラムなどが記憶される。 The data storage unit 14 is an HDD (Hard Disk Drive) or an SDD (Solid State Drive), and stores voice signals, text data of voice messages, programs, and the like.

入出力Ｉ／Ｆ１６は、周辺機器と通信するためのインタフェースであり、例えばイーサネット（登録商標）カード、ＵＳＢホストなどである。入出力Ｉ／Ｆ１６にはＦＡＸ装置２２、音声合成装置２３、音声変換装置３０、及び、ネットワーク２４が接続されている。 The input / output I / F 16 is an interface for communicating with peripheral devices, and is, for example, an Ethernet (registered trademark) card, a USB host, or the like. The input / output I / F 16 is connected to a FAX device 22, a voice synthesizer 23, a voice converter 30, and a network 24.

ＦＡＸ装置２２は、電話回線網やネットワーク２４から画像データを受信して印刷したり、操作卓２０から送信依頼されたテキストを指定された宛先に送信する。音声合成装置２３は、テキストデータを音声に変換する装置である。操作卓２０から依頼された緊急情報の音声メッセージを音声に変換することで、ユーザが音声メッセージを読み上げなくても子局２００に音声信号を送信できる。なお、音声合成装置２３が作成した音声は、音声合成装置２３又は制御部１５により音声信号に変換されデータ記憶部１４に記録される。 The FAX apparatus 22 receives image data from the telephone line network or the network 24 and prints it, or transmits a text requested to be transmitted from the console 20 to a designated destination. The speech synthesizer 23 is a device that converts text data into speech. By converting the voice message of the emergency information requested from the console 20 into voice, the voice signal can be transmitted to the slave station 200 without the user reading out the voice message. The voice created by the voice synthesizer 23 is converted into a voice signal by the voice synthesizer 23 or the control unit 15 and recorded in the data storage unit 14.

音声変換装置３０は、ＰＣ（Personal Computer）、マイコンなどの情報処理装置である。音声変換装置３０の機能については後述するが、音声変換装置３０の機能は制御部１５が有するように構成することもできる。 The voice conversion device 30 is an information processing device such as a PC (Personal Computer) or a microcomputer. Although the function of the voice conversion device 30 will be described later, the function of the voice conversion device 30 may be configured to be included in the control unit 15.

ネットワーク２４は主にＬＡＮであるが、ＬＡＮを経由してインターネットなどＩＰ通信網に接続されている。このため、J-ALERT（全国瞬時警報システム）などの外部からの情報を受信できる。 The network 24 is mainly a LAN, but is connected to an IP communication network such as the Internet via the LAN. For this reason, information from outside such as J-ALERT (National Instantaneous Warning System) can be received.

制御部１５は、データ記憶部１４に記憶された音声信号を親局通信装置２１から無線送信する。防災無線システム１００では無線通信の標準規格（例えば、市町村デジタル同報通信システム標準規格ＡＲＩＢＳＴＤ−Ｔ８６）が定められている。すなわち、送信部１７は、音声信号をπ／４シフトＱＰＳＫで変調し、ＴＤＭＡ−ＴＤＤ（Time Division Multiple Access‐Time Division Duplex：時分割多元接続）方式によって、１つのフレームを６つにタイムスロットに分割する。制御用のスロットを除き、５つのスロットに音声信号を格納できる。変調された音声信号は例えば６０ＭＨｚの搬送波でアンテナ２５から送信される。 The control unit 15 wirelessly transmits the audio signal stored in the data storage unit 14 from the master station communication device 21. In the disaster prevention wireless system 100, a wireless communication standard (for example, a municipal digital broadcast communication system standard ARIB STD-T86) is defined. That is, the transmission unit 17 modulates a voice signal with π / 4 shift QPSK, and converts one frame into six time slots by a TDMA-TDD (Time Division Multiple Access-Time Division Duplex) method. To divide. Except for the control slot, audio signals can be stored in five slots. The modulated audio signal is transmitted from the antenna 25 with a carrier of 60 MHz, for example.

なお、同規格では、親局通信装置２１の送信部１７から子局２００に対しページングチャネルを使用した通報開始指示を行う。ページングチャネルに流れるフレームには都道府県コードとユーザコード（子局を指定する）が含まれており、親局通信装置２１は任意のユーザコードを設定することで、子局（又は子局の群）２００を個別呼び出しすることができる。 In the same standard, the transmitter 17 of the master station communication device 21 instructs the slave station 200 to start reporting using the paging channel. A frame flowing in the paging channel includes a prefecture code and a user code (specifying a slave station), and the master station communication device 21 sets an arbitrary user code, whereby a slave station (or group of slave stations) 200 is set. Can be called individually.

なお、この通信規格は地方公共団体などが定める通信方法の一例に過ぎず、携帯電話網、ＰＨＳ網、ＷｉＭＡＸ網、無線ＬＡＮ網などで、親局通信装置２１と子局２００が通信してもよい。また、ＺｉｇＢｅｅ（登録商標）のように子局同士が通信する通信ネットワークを構築し、子局２００と親局通信装置２１が通信してもよい。また、無線でなく、一部地域又は全ての地域で有線通信を採用してもよい。 Note that this communication standard is only an example of a communication method determined by a local public body, and the like, even if the master station communication device 21 and the slave station 200 communicate with each other via a mobile phone network, a PHS network, a WiMAX network, a wireless LAN network, or the like. Good. Further, a communication network in which slave stations communicate with each other like ZigBee (registered trademark) may be constructed, and the slave station 200 and the master station communication device 21 may communicate with each other. Moreover, you may employ | adopt wired communication in one part area or all the areas instead of a radio | wireless.

また、子局２００は、親局通信装置２１から音声メッセージを受信して、地域住民に対して拡声放送を行う。子局２００は、受信部５１、制御部５２、マイク５４、及び、スピーカ５３等を有している。この他、不図示であるが操作部、表示部などを有していてもよい。また、子局２００のみで、擬似サイレン音、チャイム音などの自局放送を行うことができる。 In addition, the slave station 200 receives a voice message from the master station communication device 21 and broadcasts it to the local residents. The slave station 200 includes a receiving unit 51, a control unit 52, a microphone 54, a speaker 53, and the like. In addition, although not shown, an operation unit, a display unit, and the like may be included. Further, only the slave station 200 can broadcast its own station such as a pseudo siren sound and a chime sound.

子局２００の受信部５１は、アンテナ５５が受信した搬送波を復調して音声信号を取り出し制御部５２に出力する。制御部５２は、音声信号を電圧に変換した後、増幅してスピーカ５３から出力する。なお、マイク５４は、子局２００の周囲の音を集音する。この音声信号を子局２００から親局通信装置２１に送信することも可能である。 The receiving unit 51 of the slave station 200 demodulates the carrier wave received by the antenna 55, extracts an audio signal, and outputs it to the control unit 52. The control unit 52 converts the audio signal into a voltage, amplifies it, and outputs it from the speaker 53. The microphone 54 collects sounds around the slave station 200. It is also possible to transmit this audio signal from the slave station 200 to the master station communication device 21.

〔音声変換装置〕
図４は、音声変換装置３０のハードウェア構成図の一例を示す。音声変換装置３０は、バスに接続された、ＣＰＵ３０１、ＲＯＭ３０２、ＲＡＭ３０３、ＨＤＤ３０４、ディスプレイ３２０が接続されたグラフィックボード３０５、キーボード・マウス３０６、メディアドライブ３０７、及び、ネットワーク通信部３０８を有する。ＣＰＵ３０１はＨＤＤ３０４に記憶されたプログラム３１０をＲＡＭ３０３に展開して実行し、各部品を制御して入出力を行ったり、データの加工を行ったりする。ＲＯＭ３０２にはＢＩＯＳや、ブートストラップローダをＨＤＤ３０４からＲＡＭ３０３に読み出すスタートプログラムが記憶されている。ブートストラップローダは、ＯＳをＨＤＤ３０４からＲＡＭ３０３に読み出す。 [Voice converter]
FIG. 4 shows an example of a hardware configuration diagram of the audio conversion device 30. The audio conversion device 30 includes a CPU 301, a ROM 302, a RAM 303, an HDD 304, a graphic board 305 to which a display 320 is connected, a keyboard / mouse 306, a media drive 307, and a network communication unit 308 connected to a bus. The CPU 301 expands and executes the program 310 stored in the HDD 304 in the RAM 303, and controls each component to perform input / output and process data. The ROM 302 stores a BIOS and a start program for reading the bootstrap loader from the HDD 304 to the RAM 303. The bootstrap loader reads the OS from the HDD 304 to the RAM 303.

ＨＤＤ３０４は、不揮発性のメモリであればＳＳＤ（Solid State Drive）などでもよい。ＨＤＤ３０４はＯＳ、デバイスドライバ、及び、後述する機能を提供するプログラム３１０を記憶している。なお、ＯＳとしては、Windows（登録商標）系、ＬＩＮＵＸ（登録商標）、ＵＮＩＸ（登録商標）などがある。ディスプレイ３２０にはプログラムが指示し、グラフィックボード３０５が作成したＧＵＩ画面が表示される。 The HDD 304 may be an SSD (Solid State Drive) or the like as long as it is a non-volatile memory. The HDD 304 stores an OS, a device driver, and a program 310 that provides functions to be described later. The OS includes Windows (registered trademark), LINUX (registered trademark), UNIX (registered trademark), and the like. The display 320 displays a GUI screen generated by the graphic board 305 as instructed by the program.

キーボード・マウス３０６はユーザの操作を受け付ける入力装置である。なお、キーボード・マウス３０６とディスプレイは、ユーザが直接、音声変換装置３０を操作しない場合は備えていなくてもよい。 A keyboard / mouse 306 is an input device that receives user operations. Note that the keyboard / mouse 306 and the display may not be provided when the user does not directly operate the audio conversion device 30.

メディアドライブ３０７はコンパクトディスク、ＤＶＤ及びブルーレイディスクなどの光学メディアにデータを読み書きする。また、フラッシュメモリなどのメモリカードにデータを読み書きしてもよい。 The media drive 307 reads and writes data to and from optical media such as compact discs, DVDs, and Blu-ray discs. Further, data may be read from and written to a memory card such as a flash memory.

ネットワーク通信部３０８は、例えばＬＡＮに接続するためのイーサネット（登録商標）カードである。ＴＣＰ／ＩＰ（ＵＤＰ／ＩＰ）やアプリケーション層のプロトコルの処理はＯＳやプログラム３１０が行う。アプリケーション層のプロトコルは各種あるが、一般的なプロトコルに対応している（例えば、ＳＭＢ（Server Message Block）、ＨＴＴＰ、ＦＴＰ、ＳＮＭＰ（Simple Network Management Protocol）等）。 The network communication unit 308 is, for example, an Ethernet (registered trademark) card for connecting to a LAN. The OS and program 310 perform TCP / IP (UDP / IP) and application layer protocol processing. There are various application layer protocols, but they are compatible with general protocols (for example, SMB (Server Message Block), HTTP, FTP, SNMP (Simple Network Management Protocol), etc.).

プログラム３１０は、インストール可能な形式又は実行可能な形式のファイルで、コンピュータで読み取り可能な記録メディアに記録して配布される。また、プログラム３１０は、不図示のサーバからインストール可能な形式又は実行可能な形式のファイルで配布される。 The program 310 is a file in an installable or executable format, and is distributed by being recorded on a computer-readable recording medium. The program 310 is distributed in a file that can be installed or executed from a server (not shown).

図５は、音声変換装置３０の機能ブロック図の一例を示す。操作卓２０のデータ記憶部１４に記憶された音声信号は、入出力Ｉ／Ｆ１６を介して音声信号取得部３１が取得する。音声信号取得部３１では発声された音声メッセージの音声信号が取得されているので、音声認識部３２がテキストデータ（例えば、「高台に避難してください」）に変換する。音声認識部３２については公知のソフトウェアを利用すればよく、一般には音響モデルと言語モデルを使用して認識処理を施しテキストデータに変換する。また、ネットワーク２４を介して音声信号を送信し、外部のサーバ（例えばクラウドサービス）を利用して音声認識してもよい。音声認識部３２はテキストデータを単語分割部３３に送出する。 FIG. 5 shows an example of a functional block diagram of the voice conversion device 30. The audio signal stored in the data storage unit 14 of the console 20 is acquired by the audio signal acquisition unit 31 via the input / output I / F 16. Since the voice signal of the voice message uttered is acquired by the voice signal acquisition unit 31, the voice recognition unit 32 converts it into text data (for example, “Please evacuate to the hill”). For the speech recognition unit 32, publicly known software may be used. Generally, recognition processing is performed using an acoustic model and a language model, and converted into text data. Further, a voice signal may be transmitted via the network 24 and voice recognition may be performed using an external server (for example, a cloud service). The voice recognition unit 32 sends the text data to the word division unit 33.

単語分割部３３はテキストデータを単語に分割する。英語などの分かち書き言語（文章が単語毎に分かれている言語）では、テキストデータに変換された時点で単語に分割されている。また、日本語などのように分かち書きでない言語では、以下のようにして単語に分割する。単語に分割する簡易な方法として、日本語など（中国語、韓国語、ベトナム語など）のように漢字への変換が可能な言語では、かな漢字変換することが有効である。かな漢字変換は、一般的なＩＭＥ（Input Method Editor）で可能であり、また、外部のサーバを利用することもできる。 The word dividing unit 33 divides the text data into words. In a divided language such as English (a language in which sentences are divided for each word), it is divided into words when converted into text data. In a language such as Japanese that is not separated, it is divided into words as follows. As a simple method of dividing into words, Kana-Kanji conversion is effective for languages that can be converted to Kanji such as Japanese (Chinese, Korean, Vietnamese, etc.). Kana-Kanji conversion can be performed by a general IME (Input Method Editor), and an external server can also be used.

かな漢字変換されたテキストデータのうち、漢字から次の漢字までを１つの単語と推定する。例えば、かな漢字変換され「高台に避難して下さい」というテキストデータが得られた場合、「高台に」「避難して」「下さい」が各単語である。なお、文法上、「に」「て」「い」なども単語（付属語）であり、「高台に」「避難して」「下さい」のような文字列は「単語（自立語）＋付属語」である。したがって、本実施例の単語とは文法上の単語に限られず、意味がある（人が理解する）記号列を含めばよい。当然ながら文字以外にも数字やアルファベットを含んでよい。 From the kana-kanji converted text data, the kanji to the next kanji are estimated as one word. For example, when Kana-Kanji conversion is performed and text data “Please evacuate to high ground” is obtained, “To high,” “Evacuate” and “Please” are the words. In terms of grammar, “ni”, “te”, “i”, etc. are also words (attached words), and strings such as “on the hill”, “evacuate”, “please” are “words (independent words) + attached” Word ". Therefore, the word in the present embodiment is not limited to a grammatical word, but may include a meaningful symbol string (which a person understands). Of course, numbers and alphabets may be included in addition to letters.

また、単語に分割する方法として形態素解析を施すことが有効である。形態素解析とは、自然言語で書かれた文を、文法上の意味を持つ最小単位に分割する処理であり、日本語の形態素解析では、品詞レベルまで分割できる。本実施例ではテキストデータの助詞や副助詞まで特定する必要性は低いので、例えば、名詞、代名詞、動詞など、変換対象となる形態素だけ特定してもよい。「高台に避難してください」というテキストデータでは、「高台」「避難し」という形態素が抽出される。抽出する品詞や活用形は、ユーザが設定できる。なお、単語分割も外部のサーバを利用することができる。 It is also effective to perform morphological analysis as a method of dividing into words. Morphological analysis is a process of dividing a sentence written in a natural language into the smallest units having grammatical meaning. In Japanese morphological analysis, it can be divided up to the part of speech level. In the present embodiment, since it is not necessary to specify particles and adverbs of text data, for example, only morphemes to be converted such as nouns, pronouns, and verbs may be specified. In the text data “Please evacuate to the hill”, morphemes “Elevation” and “Evacuate” are extracted. The user can set the part of speech and usage to be extracted. Note that an external server can also be used for word division.

このようにして単語に分割されたテキストデータは単語難易度算出部３４に送出される。単語難易度算出部３４は、難易度別単語データベース３８を参照して、テキストデータに含まれる単語の難易度を算出する。 The text data divided into words in this way is sent to the word difficulty level calculation unit 34. The word difficulty level calculation unit 34 refers to the difficulty level word database 38 and calculates the difficulty level of words included in the text data.

図６（ａ）は難易度別単語データベース３８の一例を模式的に説明する図である。難易度とは、地域住民が聴覚で感知した場合の単語の聴き取りにくさを表す程度である。図では１〜５の５段階で難易度を表し、数値が高いほど難易度が高いとした。 FIG. 6A is a diagram schematically illustrating an example of the difficulty-based word database 38. The difficulty level is a level representing difficulty in listening to a word when a local resident senses it. In the figure, the difficulty level is expressed in five stages of 1 to 5, and the higher the numerical value, the higher the difficulty level.

難易度は、親密度や出現頻度などから算出することができる。親密度は、ある単語がどの程度なじみがあると感じられるかを表した指標である。親密度は、各単語を複数の人に聴かせたり見せるなどして、そのなじみの程度を１から７までの数字で答えてもらった場合の平均値である。親密度のデータは辞書（例えば、「日本語の語畳特性」など）に掲載されている。また、出現頻度は、新聞、雑誌、インターネットなどで、出現する頻度を単語毎に算出した値であり、公知のデータを利用できる。 The difficulty level can be calculated from the familiarity and the appearance frequency. Intimacy is an index that expresses how familiar a word is. The familiarity is an average value when the degree of familiarity is answered by a number from 1 to 7 by listening to or showing each word to a plurality of people. The intimacy data is published in a dictionary (for example, “Japanese tatami characteristics”). The appearance frequency is a value obtained by calculating the appearance frequency for each word in newspapers, magazines, the Internet, etc., and publicly known data can be used.

親密度又は出現頻度をそのまま難易度としてもよいし、親密度又は出現頻度を重みづけして難易度を算出してもよい。なじみが高い又は出現頻度が高い単語ほど、難易度が低いと考えられる。 The familiarity or appearance frequency may be used as the difficulty level as it is, or the difficulty level may be calculated by weighting the familiarity or appearance frequency. Words with higher familiarity or higher appearance frequency are considered to be less difficult.

なお、難易度別単語データベース３８は、音声変換装置３０が有している必要はなく、音声変換装置３０がアクセス可能なネットワーク２４上に存在すればよい。また、図６（ａ）では、難易度を５段階で示したが、５段階以上でもよく（例えば親密度の７段階を転用する）、難易度（低）又は難易度（高）の２段階の難易度が登録されていてもよい。 The difficulty level word database 38 does not need to be included in the voice conversion device 30 and may be present on the network 24 accessible by the voice conversion device 30. In FIG. 6A, the difficulty level is shown in five levels, but may be five levels or more (for example, diversion of seven levels of intimacy), two levels of difficulty (low) or difficulty (high). The difficulty level of may be registered.

図５に戻り、単語難易度算出部３４は、各単語に難易度を添付して、難易度判別部３５に送出する。例えば、「高台に」「避難して」「下さい」の３単語では、「高台に（４）」「避難して（３）」「下さい（２）」のように、かっこ内に難易度が添付される。 Returning to FIG. 5, the word difficulty level calculation unit 34 attaches the difficulty level to each word and sends it to the difficulty level determination unit 35. For example, the three words “on the hill”, “evacuate” and “please” indicate the difficulty level in parentheses, such as “on the hill (4)” “evacuate (3)” and “please (2)” Attached.

難易度判別部３５は、予め設定されている閾値と難易度を比較して、閾値以上の単語（以下、難易度（高）の単語という）を単語変換部３６に、閾値未満の単語（以下、難易度（低）の単語という）を文章合成部３７にそれぞれ送出する。ここでは一例として閾値を「３」とする。したがって、「高台に（４）」と「避難して（３）」は単語変換部３６に送出され、「下さい（２）」は文章合成部３７に送出される。 The difficulty level determination unit 35 compares the threshold value set in advance with the difficulty level, and causes the word conversion unit 36 to transmit words that are equal to or higher than the threshold value (hereinafter referred to as difficulty (high) words) to the word conversion unit 36. , Difficulty (low) words) are sent to the sentence composition unit 37. Here, as an example, the threshold value is “3”. Therefore, “to the hill (4)” and “evacuate (3)” are sent to the word conversion unit 36, and “please (2)” is sent to the sentence composition unit 37.

また、難易度の判別により難易度は不要になったので、難易度判別部３５は難易度を削除する。また、後にテキストデータを元の音声メッセージに合成するため、単語の順番を付与する。例えば、「高台に」は１番目、「避難して」は２番目、「下さい」は３番目である。 Further, since the difficulty level is no longer necessary due to the determination of the difficulty level, the difficulty level determination unit 35 deletes the difficulty level. Further, in order to synthesize the text data into the original voice message later, the order of words is given. For example, “To hill” is first, “Evacuate” is second, and “Please” is third.

単語変換部３６は、難易度別同意語データベース３９を参照して難易度（高）の単語を、難易度が低いとされる単語に変換する。 The word conversion unit 36 refers to the difficulty level synonym database 39 to convert a word having a difficulty level (high) into a word having a low difficulty level.

図６（ｂ）は難易度別同意語データベースを模式的に説明する図である。難易度（高）の単語に、置換語が対応づけられている。置換語は、難易度判別部の閾値より難易度が低く、かつ、同意義の単語である。例えば、「高台」の置換語は「高い所」であり、「避難し」の置換語は「逃げ」である。 FIG. 6B schematically illustrates the synonym database classified by difficulty. A substitution word is associated with a word of difficulty (high). The replacement word is a word having a difficulty level lower than the threshold value of the difficulty level determination unit and having the same meaning. For example, the replacement word for “high ground” is “high place”, and the replacement word for “evacuation” is “escape”.

本実施形態では、防災無線で拡声される緊急情報の音声メッセージを主要な適用例としているので、音声メッセージに含まれる単語は限定的である。したがって、比較的容易に難易度（高）の単語と置換語を網羅的にリストアップ可能である。また、活用形を考慮して、難易度別単語データベース３８に難易度（高）と対応する置換語を登録しておくことも可能である。例えば、「避難し」だけでなく、「避難する」という難易度（高）に「逃げる」という置換語を登録しておく。これにより、難易度（高）を置換語で変換した場合に、自然な音声メッセージを容易に作成できる。 In the present embodiment, the voice message of emergency information that is amplified by the disaster prevention radio is used as a main application example, so the words included in the voice message are limited. Therefore, it is possible to comprehensively list words and replacement words of difficulty (high) relatively easily. In addition, in consideration of the utilization form, it is also possible to register a replacement word corresponding to the difficulty level (high) in the difficulty level word database 38. For example, not only “evacuation” but also a substitution word “escape” is registered in the difficulty level (high) of “evacuation”. Thereby, when the difficulty level (high) is converted by the replacement word, a natural voice message can be easily created.

なお、難易度（低）かつ同意語というだけでなく、可能な限り短い置換語を用意しておくことが好ましい。短いとは発声時間（モーラ長）が短いことを言う。発声時間が短い置換語であれば、緊急性が高い防災無線システムの音声メッセージを短時間で伝達できるためである。すなわち、難易度別同意語データベースの少なくとも１つ以上の難易度（高）の単語には、難易度（高）の単語よりモーラ長が短い置換語が対応づけられている。したがって、本実施形態の防災無線システムは聴き取りやすさだけでなく、伝達時間を短縮できるという効果がある。 In addition, it is preferable to prepare not only the difficulty level (low) and synonyms but also the shortest possible replacement words. “Short” means that the utterance time (mora length) is short. This is because a substitution word with a short utterance time can transmit a voice message of a disaster prevention radio system with high urgency in a short time. That is, at least one difficulty (high) word in the synonym database classified by difficulty is associated with a replacement word having a shorter mora length than the difficulty (high) word. Therefore, the disaster prevention wireless system of the present embodiment has an effect that not only the listening ability but also the transmission time can be shortened.

また、当然ながら、難易度（低）かつ同意語で、元の単語よりモーラ長が長い置換語に変換してもよい。 Naturally, it may be converted into a substitution word having a difficulty level (low) and a synonym and having a longer mora length than the original word.

図５に戻り、単語変換部３６は、難易度（高）の単語を、置換語に変換して、文章合成部３７に送出する。したがって、「高い所に」「逃げて」という置換語が送出される。単語の順番は付与されたままである。 Returning to FIG. 5, the word conversion unit 36 converts the word of difficulty (high) into a replacement word, and sends it to the text synthesis unit 37. Therefore, the replacement words “elevate” and “run away” are sent out. The order of the words remains given.

文章合成部３７は、難易度判別部３５と単語変換部３６からそれぞれ送出される単語を合成してテキストデータを作成する。すなわち、３「下さい」、１「高い所に」、２「逃げて」の単語に対し、順番を参照し、順番どおりに単語を並び替える。以上により、「高い所に逃げて下さい」という変換後メッセージが作成される。 The text synthesizing unit 37 synthesizes words sent from the difficulty level determining unit 35 and the word converting unit 36 to create text data. That is, with respect to the words “3”, “1”, “2”, “2”, refer to the order and rearrange the words in the order. As a result, a post-conversion message “Please escape to a high place” is created.

音声変換装置３０は変換後メッセージを操作卓２０に送信する。操作卓２０は、変換後メッセージを、再度、音声信号に変換する。例えば、表示部に表示したり印刷するなどして、ユーザが再度、発声し、マイク１１から集音してもよい。また、音声合成装置２３に送信し、音声合成装置２３から音声信号を取得してもよい。 The voice conversion device 30 transmits the converted message to the console 20. The console 20 converts the converted message into an audio signal again. For example, the user may speak again and collect sound from the microphone 11 by displaying on the display unit or printing. Alternatively, the speech signal may be acquired from the speech synthesizer 23 by being transmitted to the speech synthesizer 23.

送信部１７は、難易語（高）が含まれない音声信号を子局２００に送信するので、子局２００はスピーカ５３から変換後メッセージを拡声することができる。 Since the transmission unit 17 transmits an audio signal that does not include the difficult word (high) to the slave station 200, the slave station 200 can amplify the converted message from the speaker 53.

〔変形例〕
図５では、音声信号に変換されたオリジナルメッセージを変換後メッセージに変換したが、変換後メッセージを作成するタイミングは、実用上、適切なタイミングとすればよい。 [Modification]
In FIG. 5, the original message converted into the audio signal is converted into the converted message, but the timing for creating the converted message may be set to an appropriate timing in practice.

図７は、音声変換装置３０の機能ブロック図の別の一例を示す。図７（ａ）は、音声メッセージがテキストデータ（電子データ又は電子ファイル）として与えられた場合の、機能ブロック図である。図７（ａ）では、ユーザが発話した音声信号でなくテキストデータが与えられるので、音声信号取得部３１の代わりに音声メッセージ取得部４１がある。音声メッセージ取得部４１は、操作卓２０又はネットワーク２４から緊急情報に適した音声メッセージを取得する。操作卓２０から取得する場合、ユーザが選択した音声メッセージのテキストデータ、テレメータ３００からの通知に応じて自動的に選択されたテキストデータ、又は、J-Alertから指示されたテキストデータなどがある。 FIG. 7 shows another example of a functional block diagram of the voice conversion device 30. FIG. 7A is a functional block diagram when a voice message is given as text data (electronic data or electronic file). In FIG. 7A, text data is given instead of the voice signal uttered by the user, so that there is a voice message acquisition unit 41 instead of the voice signal acquisition unit 31. The voice message acquisition unit 41 acquires a voice message suitable for emergency information from the console 20 or the network 24. When acquiring from the console 20, there are text data of a voice message selected by the user, text data automatically selected according to a notification from the telemeter 300, text data instructed by J-Alert, and the like.

また、音声メッセージがテキストデータであれば、音声認識も不要となるため、音声認識部３２はなくてよい。すなわち、音声メッセージ取得部４１が取得した音声メッセージのテキストデータは、直接、単語分割部３３に送出される。以降の手順は図５と同様である。 If the voice message is text data, voice recognition is not necessary, and the voice recognition unit 32 is not necessary. That is, the text data of the voice message acquired by the voice message acquisition unit 41 is directly sent to the word division unit 33. The subsequent procedure is the same as in FIG.

図７（ｂ）は、音声メッセージが用紙に印刷された状態として与えられた場合の、機能ブロック図である。音声メッセージの内容がＦＡＸで送信されたり、いったん打ち出された状態でユーザに手渡されることがある。この場合、図５のようにユーザが発声することで音声信号に変換してもよいが、音声認識などの必要が生じ、また、認識ミスが発声するおそれがある。このため、音声メッセージが用紙に印刷された状態として与えられた場合、以下のように処理することが有効な場合がある。 FIG. 7B is a functional block diagram when a voice message is given as a printed state on paper. The content of the voice message may be transmitted by FAX or handed to the user once it has been launched. In this case, it may be converted into a voice signal by the user uttering as shown in FIG. 5, but there is a need for voice recognition or the like, and a recognition error may be uttered. For this reason, when a voice message is given as being printed on paper, it may be effective to perform the following processing.

図７（ｂ）では、音声メッセージが原稿として与えられるので、光学的に原稿を読み取るスキャナ装置４２が使用される。スキャナ装置４２は、音声変換装置３０に接続されていてもよいし、操作卓２０に接続されていてもよい。いずれにしても音声変換装置３０は原稿の画像データを取得できる。また、スキャナ装置４２は一般に撮像素子が一次元に並んだラインセンサを有するが、いわゆるカメラ（二次元撮像素子）で原稿を撮影し画像データを作成してもよい。 In FIG. 7B, since a voice message is given as a document, a scanner device 42 that optically reads the document is used. The scanner device 42 may be connected to the audio conversion device 30 or may be connected to the console 20. In any case, the audio conversion device 30 can acquire image data of a document. The scanner device 42 generally has a line sensor in which image sensors are arranged one-dimensionally. However, a document may be photographed with a so-called camera (two-dimensional image sensor) to create image data.

また、音声変換装置３０はＯＣＲ（Optical Character Reader）部４３を有しており、画像データに記述されている音声メッセージをテキストデータに変換する。したがって、図７（ａ）と同様にテキストデータが得られる。以降の処理は、図７（ａ）と同様である。 The voice conversion device 30 has an OCR (Optical Character Reader) unit 43, and converts a voice message described in the image data into text data. Therefore, text data is obtained in the same manner as in FIG. The subsequent processing is the same as in FIG.

なお、この他、音声メッセージ（音声信号、テキストデータ、原稿）を外国語に翻訳してから、同様の処理を行うことも可能である。子局２００が敷設された地域に外国人が多く居住している場合、外国人の母国語で音声メッセージを拡声することができる。 In addition, it is also possible to perform the same processing after translating a voice message (voice signal, text data, manuscript) into a foreign language. When many foreigners live in the area where the slave station 200 is laid, the voice message can be expanded in the native language of the foreigner.

以上のように、音声メッセージが音声変換装置３０にどのように入力されるかに制限はなく、入力された音声メッセージをテキストデータに変換すれば、同様に本実施形態の処理を行うことができる。 As described above, there is no restriction on how the voice message is input to the voice conversion device 30. If the input voice message is converted into text data, the processing of the present embodiment can be similarly performed. .

〔動作手順〕
図８は、音声変換装置３０の動作手順を示すフローチャート図の一例である。
まず、音声変換装置３０は緊急情報の内容を伝える音声メッセージをテキストデータの形態で取得する（Ｓ１０）。すなわち、音声信号取得部３１が取得した音声信号を音声認識部３２が音声認識する、音声メッセージ取得部４１がテキストデータそのものを受信する、又は、スキャナが原稿をスキャンしてＯＣＲ部４３がテキストデータに変換する、などにより取得する。 [Operation procedure]
FIG. 8 is an example of a flowchart showing an operation procedure of the voice conversion device 30.
First, the voice conversion device 30 acquires a voice message conveying the contents of emergency information in the form of text data (S10). That is, the voice recognition unit 32 recognizes the voice signal acquired by the voice signal acquisition unit 31, the voice message acquisition unit 41 receives the text data itself, or the scanner scans the document and the OCR unit 43 sets the text data. Obtained by converting to.

次に、単語分割部３３がテキストデータを単語に分割する（Ｓ２０）。 Next, the word dividing unit 33 divides the text data into words (S20).

単語難易度算出部３４は、難易度別単語データベース３８を参照して各単語の難易度を算出する（Ｓ３０）。これにより、各単語には難易度が添付される。 The word difficulty level calculation unit 34 calculates the difficulty level of each word with reference to the word database 38 by difficulty level (S30). Thereby, the difficulty level is attached to each word.

次に、難易度判別部３５は各単語を難易度（高）と難易度（低）に仕分ける（Ｓ４０）。 Next, the difficulty level determination unit 35 classifies each word into a difficulty level (high) and a difficulty level (low) (S40).

そして、単語変換部３６は、難易度別同意語データベース３９を参照して難易度（高）の単語を置換語に変換する（Ｓ５０）。難易度（高）の単語がなければ、変換する必要はない。 And the word conversion part 36 converts the word of difficulty (high) into a replacement word with reference to the synonym database 39 according to difficulty (S50). If there is no word of difficulty (high), there is no need to convert.

そして、文章合成部３７は、難易度（高）の単語から変換された置換語と、難易度（低）の単語を、オリジナルメッセージにおける単語の元の出現順にしたがって合成する（Ｓ６０）。 Then, the text synthesizing unit 37 synthesizes the replacement word converted from the word of difficulty (high) and the word of difficulty (low) according to the original appearance order of the words in the original message (S60).

以上説明したように、本実施例の防災無線システム１００は、地域住民が聴き取りやすい音声メッセージを拡声することができる。 As described above, the disaster prevention radio system 100 according to the present embodiment can amplify a voice message that can be easily heard by local residents.

本実施例では、地域の状況に応じて地域ごとに音声メッセージを振り分けることが可能な防災無線システム１００について説明する。 A present Example demonstrates the disaster prevention radio | wireless system 100 which can distribute a voice message for every area according to the condition of an area.

図９は、本実施例の防災無線システム１００の概略的な動作を説明する図の一例である。なお本実施例において、実施例１において同一の符号を付した構成要素は同様の機能を果たすので、主に本実施例の主要な構成要素についてのみ説明する場合がある。 FIG. 9 is an example of a diagram illustrating a schematic operation of the disaster prevention radio system 100 according to the present embodiment. In the present embodiment, the constituent elements denoted by the same reference numerals in the first embodiment perform similar functions, and therefore, only the main constituent elements of the present embodiment may be mainly described.

子局２００は地域に散在するため、地域によって拡声音の聴き取りやすさに関する状況が異なる場合がある。図９では、地域Ａは繁華街、交通機関、道路などがあるため聴き取りにくい状況になることが多い。これに対し、地域Ｂは閑静な住宅街なので聴き取りにくい状況になることが少ない。したがって、地域Ａに拡声される音声メッセージのみ変換後メッセージに変換すれば充分な可能性がある。また、地域Ｂではオリジナルメッセージが拡声されるので、変換することなく早期に緊急情報を拡声することができる。 Since the slave stations 200 are scattered in the area, the situation regarding the ease of listening to the loud sound may vary depending on the area. In FIG. 9, the area A is often difficult to hear because there are busy streets, transportation facilities, roads, and the like. On the other hand, since area B is a quiet residential area, it is rarely difficult to hear. Therefore, it may be sufficient to convert only the voice message that is output to the area A into the converted message. Further, since the original message is louded in the region B, the emergency information can be loudly voiced early without being converted.

各子局はマイク５４を有しており、周囲の騒音の有無を定期的に又は防災無線システム１００から指示された時に測定し、子局２００の識別情報と共に親局通信装置２１に送信している。防災無線システム１００は、騒音テーブル１８を有しており、子局２００の識別情報に対応づけて騒音有り、騒音なしを更新して登録する。 Each slave station has a microphone 54, and measures the presence or absence of ambient noise regularly or when instructed by the disaster prevention radio system 100, and transmits it to the master station communication device 21 together with the identification information of the slave station 200. Yes. The disaster prevention radio system 100 has a noise table 18, and updates and registers the presence of noise and the absence of noise in association with the identification information of the slave station 200.

防災無線システム１００は、緊急情報を報知する状況になると、騒音テーブル１８を参照して、騒音なしの子局２００にはオリジナルメッセージの音声信号を送信する。したがって、静かな環境の地域には早期にオリジナルメッセージを拡声することができる。一方、騒音ありの子局２００には変換後メッセージの音声信号を送信する。したがって、拡声音を聴き取りにくい環境の地域には、聴き取りやすい単語に変換された変換後メッセージを拡声することができる。 When the emergency wireless system 100 is in a state of notifying emergency information, the disaster prevention radio system 100 refers to the noise table 18 and transmits the voice signal of the original message to the slave station 200 without noise. Therefore, the original message can be amplified early in a quiet environment area. On the other hand, the converted message voice signal is transmitted to the slave station 200 with noise. Therefore, in an environment where it is difficult to hear a loud sound, the converted message converted into a word that is easy to hear can be loud.

音声変換装置３０の機能ブロック図は実施例１（図５、７）と同様でよい。一方、操作卓側は、子局２００に応じてオリジナルメッセージ又は変換後メッセージのどちらを送信するかを制御する。 The functional block diagram of the voice conversion device 30 may be the same as that of the first embodiment (FIGS. 5 and 7). On the other hand, the console side controls whether the original message or the converted message is transmitted according to the slave station 200.

図１０は、本実施例の防災無線システムのブロック図の一例を示す。本実施例では、操作卓２０が騒音テーブル１８を有し、適宜更新している。なお、騒音の度合いが変化しない地域では、騒音テーブル１８を更新する必要はなく、担当者などが、一度、静的な騒音テーブル１８を作っておけばよい。 FIG. 10 shows an example of a block diagram of the disaster prevention wireless system of the present embodiment. In this embodiment, the console 20 has a noise table 18 and is updated as appropriate. In an area where the degree of noise does not change, it is not necessary to update the noise table 18, and a person in charge or the like may create the static noise table 18 once.

制御部１５は、騒音テーブル１８を参照して騒音有りが登録された子局２００に変換後メッセージを送信部１７に送信させる。騒音なしが登録された子局２００にオリジナルメッセージを送信部１７に送信させる。送信部１７は、ページングチャネルに指示された子局のユーザコードを設定して、オリジナルメッセージ又は変換後メッセージの音声信号を子局２００に送信する。 The control unit 15 refers to the noise table 18 and causes the transmission unit 17 to transmit the converted message to the slave station 200 in which presence of noise is registered. An original message is transmitted to the transmission unit 17 to the slave station 200 in which no noise is registered. The transmission unit 17 sets the user code of the slave station instructed to the paging channel, and transmits the voice signal of the original message or the converted message to the slave station 200.

図１１（ａ）は、制御部１５の動作手順を示すフローチャート図の一例である。図１１の動作手順は、例えば子局２００に送信すべき音声メッセージが決定された場合に実行され、すでに変換後メッセージの作成が開始されている。 FIG. 11A is an example of a flowchart illustrating an operation procedure of the control unit 15. The operation procedure of FIG. 11 is executed, for example, when a voice message to be transmitted to the slave station 200 is determined, and the creation of a post-conversion message has already been started.

制御部１５は、例えば騒音テーブル１８に登録されている送信先の子局２００を順番に参照し、「騒音有り」が登録されているか否かを判定する（Ｓ１１０）。 For example, the control unit 15 refers to the transmission destination slave stations 200 registered in the noise table 18 in order, and determines whether or not “with noise” is registered (S110).

「騒音有り」が登録されていない場合（Ｓ１１０のＮｏ）、子局２００が敷設された地域は静かなので、制御部１５はオリジナルメッセージを親局通信装置２１から子局２００に送信する（Ｓ１２０）。 When “with noise” is not registered (No in S110), since the area where the slave station 200 is laid is quiet, the control unit 15 transmits the original message from the master station communication device 21 to the slave station 200 (S120). .

「騒音有り」が登録されている場合（Ｓ１１０のＹｅｓ）、制御部１５は変換後メッセージを作成して送信する（Ｓ１４０）。この処理については図１１（ｂ）にて説明する。 When “with noise” is registered (Yes in S110), the control unit 15 creates and transmits a converted message (S140). This process will be described with reference to FIG.

次いで、制御部１５は、騒音テーブル１８の最後の子局２００まで処理したか否かを判定する（Ｓ１３０）。最後の子局２００まで処理していない場合（Ｓ１３０のＮｏ）、ステップＳ１１０からの処理を繰り返す。したがって、騒音がない子局２００の場合、早期にオリジナルメッセージを送信することができる。 Next, the control unit 15 determines whether or not processing has been performed up to the last slave station 200 of the noise table 18 (S130). If the last slave station 200 has not been processed (No in S130), the processing from Step S110 is repeated. Therefore, in the case of the slave station 200 without noise, the original message can be transmitted early.

図１１（ｂ）はステップＳ１４０の処理手順を示すフローチャート図の一例である。制御部は対変換後メッセージが作成できたか否かを判定する（Ｓ１４１）。 FIG. 11B is an example of a flowchart showing the processing procedure of step S140. The control unit determines whether a post-conversion message has been created (S141).

変換後メッセージが作成された場合（Ｓ１４１のＹｅｓ）、制御部１５は変換後メッセージを親局通信装置２１から子局２００に送信する（Ｓ１４２）。 When the post-conversion message is created (Yes in S141), the control unit 15 transmits the post-conversion message from the master station communication device 21 to the slave station 200 (S142).

変換後メッセージが作成されていない場合（Ｓ１４１のＮｏ）、制御部１５は変換後メッセージが作成されるまで待機する（Ｓ１４３）。 When the post-conversion message has not been created (No in S141), the control unit 15 waits until the post-conversion message is created (S143).

なお、図１１のような送信手順は一例であって、制御部１５は、全ての子局２００にオリジナルメッセージを送信し、その後、騒音テーブル１８に騒音有りが登録された子局２００にのみ、変換後メッセージを送信してもよい。この場合、全ての地域に早期にオリジナルメッセージが送信されるので、騒音のある地域でも一部の地域住民は拡声音を聴き取ることができる。また、この後、騒音のある地域では変換後メッセージが拡声されるので、騒音のある地域の地域住民は拡声音を聴き取る機会を得ることができる。 Note that the transmission procedure as shown in FIG. 11 is an example, and the control unit 15 transmits the original message to all the slave stations 200, and then only to the slave stations 200 registered as having noise in the noise table 18. You may send the message after conversion. In this case, since the original message is transmitted to all areas at an early stage, some local residents can hear loud sounds even in areas with noise. Further, since the converted message is amplified in a noisy area thereafter, local residents in the noisy area can get an opportunity to listen to the loud sound.

また、子局毎に順番に送信するのでなく、騒音テーブル１８から「騒音有り」の子局２００と「騒音なし」の子局２００をリストアップして、群放送を利用してそれぞれの子局２００に一斉送信してもよい。 In addition, instead of transmitting in order for each slave station, the slave table 200 “with noise” and the slave station 200 “without noise” are listed from the noise table 18, and each slave station is used using group broadcasting. 200 may be transmitted simultaneously.

以上説明したように、本実施例の防災無線システム１００によれば、音声メッセージを変換する必要性が低い地域には早期に音声メッセージを拡声し、騒音により聴き取りにくい地域には変換後メッセージを拡声できる。 As described above, according to the disaster prevention radio system 100 of the present embodiment, the voice message is amplified early in an area where the necessity of converting the voice message is low, and the converted message is displayed in an area that is difficult to hear due to noise. Can speak loudly.

実施例２では、騒音に応じて地域ごとに音声メッセージを振り分ける防災無線システム１００について説明したが、本実施例ではエコーの有無に応じて音声メッセージを振り分ける防災無線システム１００について説明する。 In the second embodiment, the disaster prevention radio system 100 that distributes voice messages for each area according to noise has been described, but in this embodiment, the disaster prevention radio system 100 that distributes voice messages according to the presence or absence of echoes will be described.

図１２は、本実施例の防災無線システム１００の概略的な動作を説明する図の一例である。拡声音は大音量であるため遠方まで伝達可能である反面、地域の地形によってはエコー（山彦、反響音）を生じさせ、聴き取りにくい状況になることがある（詳しくは後述する）。図１２の地域Ａは山間地方の集落であり、拡声音が山に反響して地域住民にとって聴き取りにくい。これに対し、地域Ｂは、エコーを生じさせる地形が少なく、聴き取りにくい状況になることが少ない。したがって、実施例２と同様に、地域Ａに拡声される音声メッセージのみ変換後メッセージに変換すれば、充分な可能性がある。また、地域Ｂではオリジナルメッセージが拡声されるので、変換するよりも早期に緊急情報を拡声することができる。 FIG. 12 is an example of a diagram illustrating a schematic operation of the disaster prevention radio system 100 according to the present embodiment. The loud sound is loud and can be transmitted far away. On the other hand, depending on the geographical features of the area, echo (Yamahiko, reverberation sound) may be generated, and it may be difficult to hear (details will be described later). Area A in FIG. 12 is a village in a mountainous region, and the loud sound echoes in the mountains and is difficult for local residents to hear. On the other hand, in the region B, there are few terrains that cause echoes, and it is unlikely to be difficult to hear. Therefore, as in the case of the second embodiment, there is a sufficient possibility that only the voice message that is amplified in the area A is converted into the converted message. Further, since the original message is louded in the region B, the emergency information can be louded out earlier than the conversion.

〔エコー環境における単語変換の有効性〕
出願人は、エコー環境において、難易度（高）の単語を置換語に置き換えることの有効性について実験を行った。なお、以下では、難易度として親密度を採用している。 [Effectiveness of word conversion in echo environment]
The applicant experimented on the effectiveness of replacing words of high difficulty with replacement words in an echo environment. In the following, intimacy is adopted as the difficulty level.

１．実験内容
提示音源として，FW07データベース[近藤他, 信学技報, SP2007-157, 2008.]に収録された話者myaの音声を用いた。単語親密度は、４ランクのうちの上位２種類（親密度高：7.0-5.5，親密度低：5.5-4.0）を使用した。本実験では、後続音が１個、若しくは２個存在する環境を模擬した。 1. The voice of the speaker mya recorded in the FW07 database [Kondo et al., IEICE Tech. Bulletin, SP2007-157, 2008.] was used as the sound source for the experiment contents. For the word familiarity, the top two of the four ranks (high familiarity: 7.0-5.5, low familiarity: 5.5-4.0) were used. In this experiment, an environment with one or two subsequent sounds was simulated.

図１３（ａ）はエコーにより生じる後続音のタイムパターンである。図示のように、４連単語を1つの聴取音とし、そこに指定した遅延時間で後続音を１〜２個付け加えて作成した。図中の○□△×は，４つの異なる単語を表す。１つの単語が４モーラで長さがおよそ750msであったため、1/2単語長の時間ずれは375msになる。なお、後続音は原音と同様の音声を使用した。FW07データベースの性質上、各聴取者が1つの親密度条件の４００単語すべてを聴取する必要があったため、５つの音声セットを作り、各セットに８０個ずつの音声単語（２０個の４連単語）をランダムに振り分けた。後続音の時間ずれが７種類（図１３（ａ）の条件Ａ〜Ｇ）であり、既に聴取された４連単語を別の条件で再び聴取することを避けるため，最低７人の聴取者が必要であった。このため、各条件に５人の異なる聴取者を振り分けた。７人の聴取者は親密度高低の両方の実験に参加した。聴取者は、防音室のイスに座り、１つの４連単語が提示された後、１５秒間で回答用紙に４連単語を聴こえたままに記入するように指示された。音声は，ヘッドホン（SennheiserHDA-200）を経由して両耳に提示した。 FIG. 13A shows the time pattern of the subsequent sound generated by the echo. As shown in the figure, a quadruple word was used as one listening sound, and one or two subsequent sounds were added with a delay time specified there. ○ □ △ × in the figure represents four different words. Since one word is 4 mora and its length is about 750 ms, the time difference of 1/2 word length is 375 ms. The subsequent sound used was the same sound as the original sound. Because of the nature of the FW07 database, each listener needed to listen to all 400 words of one familiarity condition, so five speech sets were created, and 80 speech words (20 quadruple words in each set). ) Was randomly distributed. There are seven types of time lags of the following sounds (conditions A to G in FIG. 13A), and at least seven listeners are required to avoid listening again to the four-word words that have already been heard under different conditions. It was necessary. For this reason, five different listeners were assigned to each condition. Seven listeners participated in both high and low intimacy experiments. The listener was instructed to sit on the chair in the soundproof room and fill in the answer sheet on the answer sheet in 15 seconds after one quad word was presented. The sound was presented to both ears via headphones (SennheiserHDA-200).

２．結果と考察
図１３（ｂ）は親密度高低の各実験条件での平均正答を示す図である。図中の黒い棒は高親密度、白い棒は低親密度条件を指す。７個の時間ずれ条件と２つの親密度条件を被験者内要因として、２要因の分散分析を行った。分析の結果、時間ずれ条件の主効果（F(6,
594)=141.63, p <.001），親密度条件の主効果（F(1, 99) = 85.98, p <.001）、時間ずれと親密度の交互作用（F(6, 594) = 2.15, p <.05），いずれにも有意差が認められた。また、原音のみの条件以外は，親密度が高い場合の正答率が有意に高かった。ロングパスエコーが存在する屋外環境下でも、親密度が語彙の難易度統制指標として有効であることが示唆される。親密度別に分析を行ったところ、高親密度条件では、後続音が２個の場合の平均正解率（Ｅ〜Ｇ間）に有意差が認められなかったものの、後続音が１個の場合は、後続音の遅延時間が１単語長の条件間に有意差が認められた（ＢとＤ）。後続音が２個の場合は、正解率が全体的に低かったため、後続音の遅延時間による影響が見られなかったと考えられる。低親密度条件の場合も同様の傾向が見られたものの、後続音の数が２個の場合も後続音の遅延時間が１単語長の条件間に有意差が認められた（ＢとＤ，ＥとＧ）。この理由については、今後、出現位置ごとの単語了解度を算出し、詳細な検討を行っていく予定である。 2. Results and discussion
FIG. 13B is a diagram showing the average correct answer under each experimental condition of intimacy. Black bars in the figure indicate high familiarity and white bars indicate low familiarity conditions. Two-factor analysis of variance was performed with seven time-shift conditions and two intimacy conditions as factors within the subject. As a result of the analysis, the main effect of the time lag condition (F (6,
594) = 141.63, p <.001), main effect of intimacy condition (F (1, 99) = 85.98, p <.001), interaction between time shift and intimacy (F (6, 594) = 2.15 , p <.05), both were significantly different. In addition, except for the condition of only the original sound, the correct answer rate when the familiarity was high was significantly high. This suggests that intimacy is an effective vocabulary difficulty control index even in outdoor environments where long-path echo is present. When the analysis was performed according to intimacy, there was no significant difference in the average accuracy rate (between E and G) when there were two subsequent sounds under the high familiarity condition, but there was only one subsequent sound. A significant difference was observed between the conditions in which the delay time of the subsequent sound was one word length (B and D). When the number of subsequent sounds is two, the accuracy rate is generally low, so it is considered that the influence of the delay time of the subsequent sounds was not observed. Although the same tendency was observed in the low familiarity condition, a significant difference was observed between the conditions in which the delay time of the subsequent sound was one word length even when the number of subsequent sounds was two (B and D, E and G). In the future, we plan to calculate the word intelligibility for each appearance position and conduct a detailed study.

３．実験内容（実験２（異なる親密度の場合））
実験２では４連単語全て同じ親密度の単語を用いた。実験２では、４連単語を異なる親密度の単語で作成し、先行音と後続音の親密度の関係が了解度に及ぼす影響を調べた。後続音の遅延時間は、実験１の条件ＡとＣ，Ｇの３つに絞り、４連単語は親密度高低を交互に入れ替えるバターンに改めた。更に、親密度高から始まる条件（高低順番）と親密度低から始まる（低高順番）２条件を用意し、各条件に３人ずつの聴取者を割り当てた。 3. Experiment contents (Experiment 2 (in case of different intimacy))
In Experiment 2, all four consecutive words used the same familiarity. In Experiment 2, quadruple words were created with words having different intimacy, and the influence of the relationship between the intimacy of the preceding sound and the succeeding sound on the intelligibility was examined. The delay time of the succeeding sound was limited to the three conditions A, C, and G in Experiment 1, and the quadrant was changed to a pattern in which the intimacy level was alternately switched. Furthermore, two conditions were prepared starting from high intimacy (high to low order) and two conditions starting from low intimacy (low to high order), and three listeners were assigned to each condition.

４．結果と考察
高低順番の場合と低高順番の場合の音声了解度を比較するために、４連単語の各単語の正解率を計算した。図１３（ｃ）は、条件ＣとＧの各単語別の平均正解率を示す。図中の黒い棒は高親密度、白い棒は低親密度の単語を表す。例えば、２番目の「低」は低親密度の先行音に高親密度の後続音が重なった場合を示す。分散分析の結果、後続音の遅延時間の主効果（F(2,398) = 422.14, p <.001）、４連単語の主効果（F(3, 597) = 439.87, p <.001）、親密度高低の主効果（F(1, 99) = 7.3, p <.01）ともに有意差が認められた。また、３つの要因の交互作用にも有意差が認められた（F(6, 1194) = 5.83,p <.001）。下位検定と多重比較の結果、低親密度の先行音に高親密度の後続音が重なった場合の正答率が、高親密度の先行音に低親密度の後続音が重なった場合より、有意に低いことが分かった（条件Ｃの２番目と条件Ｃの４番目、条件Ｇの２番目）。このことは、ターゲットとなる単語が低親密度の場合、高親密度がエコーによって重畳されると聴き取りづらくなることを示しており、文章中の単語親密度の統制が重要であることを示唆している。 4). Results and Discussion In order to compare the speech intelligibility between the case of high and low order and the case of low and high order, the correct answer rate of each word of the quadruple word was calculated. FIG. 13C shows the average correct answer rate for each word of the conditions C and G. The black bars in the figure represent words with high familiarity, and the white bars represent words with low familiarity. For example, the second “low” indicates a case where a low-intimacy preceding sound overlaps a high-intimacy following sound. As a result of analysis of variance, the main effect of delay time of subsequent sound (F (2,398) = 422.14, p <.001), the main effect of quadruple words (F (3, 597) = 439.87, p <.001), parent Significant differences were observed in the main effects of density (F (1, 99) = 7.3, p <.01). There was also a significant difference in the interaction of the three factors (F (6, 1194) = 5.83, p <.001). As a result of the sub-test and multiple comparisons, the correct answer rate when the following sound with high familiarity overlaps the preceding sound with low familiarity is more significant than when the following sound with low familiarity overlaps with the preceding sound with high familiarity. (The second condition C, the fourth condition C, the second condition G). This indicates that when the target word is low intimacy, it is difficult to hear when high intimacy is superimposed by echo, suggesting that control of word intimacy in the sentence is important. doing.

５．まとめ
ロングパスエコー存在下において親密度が単語了解度に及ぼす影響について調べた。実験の結果、親密度が低い方がエコーの影響を強く受けることが明らかになった。すなわち、例えば図１３（ｂ）では、後続音の遅延時間が条件Ｅまでの間、高親密度の単語の方が低親密度の単語よりも正解率が高い。また、図１３（ｃ）では、低親密度の単語に、エコーによって高親密度の単語が重なると、特に低親密度の単語の正解率が低下する。したがって、低親密度の単語を高親密度の単語に変換することで、正解率が大幅に向上することが期待できる。例えば、図１３（ｂ）では条件ＣＤＥにおいて、遅延時間は同じでも高親密度の正解率は１５〜２０％近く高くなっているので、低親密度の単語を高親密度の単語に変換すれば、正解率が１５〜２０％近く高くなることが期待できる。 5. Summary The effect of intimacy on word intelligibility in the presence of longpass echo was investigated. As a result of experiments, it became clear that the one with lower intimacy is strongly affected by echo. That is, for example, in FIG. 13B, during the delay time of the subsequent sound up to the condition E, the high familiarity word has a higher accuracy rate than the low familiarity word. Further, in FIG. 13C, when the high familiarity word overlaps the low familiarity word by echo, the correct answer rate of the low familiarity word is lowered. Therefore, it can be expected that the correct answer rate is greatly improved by converting the low familiarity word into the high familiarity word. For example, in FIG. 13B, in the condition CDE, even if the delay time is the same, the accuracy rate of the high familiarity is nearly 15 to 20% high, so if the low familiarity word is converted into the high familiarity word, The accuracy rate can be expected to increase by 15 to 20%.

以上の実験結果から、本実施例の音声変換装置３０による単語の変換が有効であることが確認できる。 From the above experimental results, it can be confirmed that the conversion of words by the speech conversion apparatus 30 of this embodiment is effective.

図１４は、本実施例の防災無線システムのブロック図の一例を示す。本実施例では、操作卓２０がエコーテーブル１９を有し、適宜更新している。エコーの有無や程度は、地形など比較的、変化しにくい環境に起因するので、エコーテーブル１９を更新する必要性は低い。したがって、エコーテーブル１９は、担当者が地域で拡声音を聴いて確認したり、地域住民からの聴き取りにより作成することができる。 FIG. 14 shows an example of a block diagram of the disaster prevention radio system of the present embodiment. In this embodiment, the console 20 has an echo table 19 and is updated as appropriate. Since the presence / absence and degree of echo are caused by an environment that is relatively difficult to change, such as terrain, the necessity of updating the echo table 19 is low. Therefore, the echo table 19 can be created by the person in charge confirming the sound by listening to the loud sound in the area or listening from the local residents.

また、以下のようにして作成してもよい。
図１５は子局２００におけるエコーの観測結果を説明する図の一例である。この図では，とある地点で３つの子局から同時に発せられた音声の状態を表している。横軸は時間、縦軸は相対性振幅（Relative Amplitude）である。子局２００の制御部５２は、子局２００から音を拡声した時（時刻ｔ０）を基準に、時刻ｔ１でマイク５４が別の子局から発せられた音声（特許請求の範囲の反響強度の一例である）を検出するまでの遅延時間Δｔを測定する。時刻ｔ２は別の子局からの音声が測定された時刻である。なお，この図では他の子局からの音を用いてΔｔを決定しているが，他の子局の影響を受けないような環境で、近隣に山や大きな建物といった障害物が存在する場合には、それぞれの障害物からの反射音をもってΔｔと決める場合がある。図１２（ａ）（ｂ）の実験結果から、正解率は遅延時間が長いほど低下する傾向にあるので、Δｔが閾値Ｔ以上の場合にエコー有りと判断すべきである。また、反射音の相対性振幅が閾値Ｒ以上でないとエコーと認識されない。したがって、測定結果のΔｔが閾値Ｔ以上、かつ、相対性振幅が閾値Ｒ以上の場合に、エコー有りと判断できる。 Moreover, you may create as follows.
FIG. 15 is an example of a diagram for explaining an observation result of echoes in the slave station 200. In this figure, the state of voices simultaneously emitted from three slave stations at a certain point is shown. The horizontal axis is time, and the vertical axis is relative amplitude (Relative Amplitude). The control unit 52 of the slave station 200 uses the reference to the time when the sound is amplified from the slave station 200 (time t0), and the sound that the microphone 54 emits from another slave station at time t1 (the echo intensity of the claims) Measure delay time Δt until it is detected as an example. Time t2 is the time when the voice from another slave station was measured. In this figure, Δt is determined using sound from other slave stations, but there are obstacles such as mountains and large buildings in the environment that are not affected by other slave stations. In some cases, Δt is determined by the reflected sound from each obstacle. From the experimental results shown in FIGS. 12 (a) and 12 (b), the correct answer rate tends to decrease as the delay time increases. Therefore, when Δt is equal to or greater than the threshold T, it should be determined that there is an echo. Further, if the relative amplitude of the reflected sound is not greater than or equal to the threshold value R, it is not recognized as an echo. Therefore, it can be determined that there is an echo when Δt of the measurement result is equal to or greater than the threshold value T and the relative amplitude is equal to or greater than the threshold value R.

各子局は、防災無線システム１００からの要求によりΔｔと相対性振幅の測定を行い、防災無線システム１００に送信する。防災無線システム１００の制御部１５は、Δｔと閾値Ｔ、相対性振幅と閾値Ｒを比較して、エコーの有無を判定する。なお、判定は子局２００が行ってもよい。 Each slave station measures Δt and relative amplitude in response to a request from the disaster prevention radio system 100 and transmits it to the disaster prevention radio system 100. The control unit 15 of the disaster prevention radio system 100 compares Δt with the threshold value T, the relative amplitude and the threshold value R, and determines the presence or absence of an echo. Note that the slave station 200 may make the determination.

また、本実施例では、難易度別同意語データベースの置換語は、エコーに対する頑健性が保証されたものであることが好ましい。すなわち、エコー環境下で行われた感応試験で、聴き取りやすさが閾値以上であると判断された、難易度（高）の同意語が登録されている。したがって、一般的な辞書により難易度が低いことが保証されるだけでなく（又は保証されていなくても）、単語変換部はロングパスエコーに対して聴き取りやすい置換語に置き換えることができる。 In the present embodiment, it is preferable that the substitution word in the synonym database classified by difficulty is guaranteed to be robust against echo. That is, a synonym of difficulty (high), in which it is determined that the ease of listening is equal to or higher than a threshold value in a sensitivity test performed in an echo environment, is registered. Therefore, not only is it guaranteed that the difficulty level is low by a general dictionary (or even if it is not guaranteed), the word conversion unit can be replaced with a replacement word that is easy to hear for a long pass echo.

図１６は、制御部の動作手順を示すフローチャート図の一例である。図１６の動作手順は、図１１のステップＳ１１０の判定が、エコーテーブル１９に登録されている送信先の子局２００に「騒音有り」が登録されているか否かを判定するとなっている（Ｓ１１５）。以降の処理は、図１１と同様である。 FIG. 16 is an example of a flowchart illustrating an operation procedure of the control unit. In the operation procedure of FIG. 16, the determination in step S110 of FIG. 11 determines whether or not “with noise” is registered in the transmission destination slave station 200 registered in the echo table 19 (S115). ). The subsequent processing is the same as in FIG.

以上説明したように、本実施例の防災無線システム１００によれば、音声メッセージを変換する必要性が低い地域には早期に音声メッセージを拡声し、エコーが生じるため聴き取りにくい地域には変換後メッセージを拡声できる。 As described above, according to the disaster prevention radio system 100 of the present embodiment, the voice message is amplified early in an area where the necessity of converting the voice message is low, and after conversion to an area that is difficult to hear because of an echo. Can amplify the message.

１１マイク
２０操作卓
２１親局通信装置
２４ネットワーク
３０音声変換装置
３３単語分割部
３４単語難易度算出部
３５難易度判別部
３６単語変換部
５３スピーカ
１００防災無線システム
２００子局
DESCRIPTION OF SYMBOLS 11 Microphone 20 Console 21 Master station communication apparatus 24 Network 30 Voice converter 33 Word division part 34 Word difficulty calculation part 35 Difficulty determination part 36 Word conversion part 53 Speaker 100 Disaster prevention radio system 200 Child station

Claims

An outdoor environmental voice transmission device that transmits voice messages for disaster prevention to one or more slave stations laid in the area,
Voice message acquisition means for acquiring the voice message;
Message dividing means for dividing the voice message acquired by the voice message acquiring means into a meaningful symbol string or requesting the outside to divide the voice message into the symbol string;
For each symbol string divided by the message dividing means, a symbol string difficulty determining means for determining a difficulty level when a person understands the meaning;
A message creating means for creating a converted message by replacing the high difficulty symbol string determined by the symbol string difficulty determining means with a difficulty level equal to or higher than a threshold value with a synonym having a difficulty level less than the threshold value;
Message transmitting means for transmitting the converted message to a slave station;
An outdoor environment audio transmission device comprising:

Echo determination information in which the presence or absence of echo in the area where the slave station is laid is associated with the slave station identification information, and
The slave station registered to have an echo in the echo determination information causes the message transmission means to transmit the converted message,
Control means for transmitting the voice message to the message transmitting means to a slave station registered as having no echo in the echo determination information;
The outdoor environment audio transmission apparatus according to claim 1, comprising:

Noise judgment information in which the presence or absence of noise in the area where the slave station is laid is associated with the slave station identification information, and
The slave station registered as having noise in the noise determination information causes the message transmission means to transmit the converted message,
Control means for transmitting the voice message to the message transmitting means to a slave station registered as having no noise in the noise judgment information;
The outdoor environment audio transmission apparatus according to claim 1, comprising:

The control means causes the message sending means to send the voice message to all the slave stations, and then sends the converted message only to the slave stations registered as having no echo in the echo determination information. Send
The outdoor environment audio transmission apparatus according to claim 2.

The control means determines in any order whether an echo or no echo is registered in the slave station registered in the echo determination information,
For the slave station registered without echo, let the message transmission means transmit the voice message,
For the slave station registered as having an echo, as soon as the post-conversion message is created, the message transmitting means transmits the post-conversion message.
The outdoor environment audio transmission apparatus according to claim 2.

Corresponding to a high difficulty symbol string, a low difficulty symbol string that has a difficulty level less than the threshold, is synonymous with the high difficulty symbol string, and has a shorter utterance time than the high difficulty symbol string Has a synonym database,
The message creation means replaces the high difficulty symbol string determined by the symbol string difficulty determination means with a difficulty level equal to or higher than a threshold with a low difficulty symbol string registered in the synonym database, Creating the converted message with a shorter utterance time than the message;
The outdoor environment audio transmission apparatus according to any one of claims 1 to 5, wherein

There is a synonym database that is synonymous with the high difficulty symbol string and is associated with the low difficulty symbol string that is synonymous with the high difficulty symbol string and that is easy to hear in the echo environment. And
The message creation means replaces the high difficulty symbol string determined by the symbol string difficulty determination means with a difficulty level equal to or higher than a threshold with a low difficulty symbol string registered in the synonym database. Composing a message,
The outdoor environment audio transmission apparatus according to claim 6.

An outdoor environment audio transmission device that transmits a voice message for disaster prevention to one or more slave stations laid in the area, and an outdoor environment audio transmission system having one or more slave stations,
Voice message acquisition means for acquiring the voice message;
Message dividing means for dividing the voice message acquired by the voice message acquiring means into a meaningful symbol string or requesting the outside to divide the voice message into the symbol string;
For each symbol string divided by the message dividing means, a symbol string difficulty determining means for determining a difficulty level when a person understands the meaning;
A message creating means for creating a converted message by replacing the high difficulty symbol string determined by the symbol string difficulty determining means with a difficulty level equal to or higher than a threshold value with a synonym having a difficulty level less than the threshold value;
Message transmitting means for transmitting the converted message to a slave station,
The slave station is
Message receiving means for receiving the converted message;
Amplifying means for amplifying the converted message,
An outdoor environment audio transmission system characterized by that.