JP2015175772A

JP2015175772A - Sound identification condition setting support device and sound identification condition setting support method

Info

Publication number: JP2015175772A
Application number: JP2014053466A
Authority: JP
Inventors: 勝哉梶山; Katsuya Kajiyama
Original assignee: Chugoku Electric Power Co Inc
Current assignee: Chugoku Electric Power Co Inc
Priority date: 2014-03-17
Filing date: 2014-03-17
Publication date: 2015-10-05
Anticipated expiration: 2034-03-17
Also published as: JP5937127B2

Abstract

PROBLEM TO BE SOLVED: To alleviate a processing load and accelerate the process when performing an allophone feature extraction by computer in a spectrogram of a picked-up sound.SOLUTION: When a sound identification condition setting support device divides a predetermined frequency range of spectrogram data for an allophone sample sound with allophone and no allophone sample sound without allophone into a plurality of blocks, and generates an identification function by use of the histogram data for each block, a sound strength of which average order of degree among the histogram data of the allophone sample sounds in the same frequency band is high is selected as target strength, and an identification function is generated by using only the target strength. A plurality of identification functions are generated while changing number of strength selected as the target strength so as to select the identification function having the highest identification accuracy rate among them.

Description

本発明は、採取音中の異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援装置および音識別条件設定支援方法に関する。 The present invention relates to a sound identification condition setting support device and a sound identification condition setting support method that support the setting of an identification condition for identifying the presence or absence of an abnormal sound in a collected sound.

例えば、送電線から発せられる異音を除去する方策を検討する場合や、機器から発せられる異音に基づいて機器の故障診断を行う場合に、送電線または機器に近い場所で採取した音の中に異音が含まれているか否かを、パターン認識手法を用いて判定する。具体的には、多くの場合、採取音をスペクトル解析し、採取音または異音の周波数の特徴に基づき、採取音中の異音の有無を識別する（特許文献１参照）。 For example, when considering measures to remove abnormal noise emitted from power transmission lines, or when diagnosing equipment malfunctions based on abnormal noise emitted from equipment, of the sounds collected near the power transmission line or equipment. It is determined by using a pattern recognition method whether or not there is an abnormal sound. Specifically, in many cases, the collected sound is subjected to spectrum analysis, and the presence or absence of the abnormal sound in the collected sound is identified based on the characteristics of the collected sound or the frequency of the abnormal sound (see Patent Document 1).

しかしながら、異音の周波数または音の強度が時間的に変化する場合、採取音または異音の周波数の静的な特徴に基づくだけでは、異音の有無を高精度に識別することが難しいことがある。このような場合には、採取音のスペクトログラムを分析し、採取音または異音の周波数または音の強度の時間的変化を勘案し、採取音中の異音の有無を識別することが望ましい。これにより、異音の周波数または音の強度が時間的に変化する場合でも、異音の有無を高精度に識別することができる。 However, if the frequency or intensity of the abnormal sound changes over time, it may be difficult to accurately identify the presence or absence of the abnormal noise based solely on the static characteristics of the sampled sound or the abnormal frequency. is there. In such a case, it is desirable to analyze the spectrogram of the sampled sound and identify the presence or absence of the abnormal sound in the sampled sound in consideration of the temporal change in the frequency or intensity of the sampled sound or abnormal sound. Thereby, even when the frequency of abnormal noise or the intensity of sound changes with time, the presence or absence of abnormal noise can be identified with high accuracy.

特開平１１−２４１９４５号公報JP-A-11-241945

採取音のスペクトログラムを用い、パターン認識手法により採取音中における異音の有無の識別を行うには、スペクトログラムとして表現された当該異音の特徴を抽出することが必要である。この特徴抽出の優劣が識別精度を大きく左右する。 In order to identify the presence or absence of an abnormal sound in the collected sound by the pattern recognition method using the spectrogram of the collected sound, it is necessary to extract the characteristic of the abnormal sound expressed as the spectrogram. The superiority or inferiority of the feature extraction greatly affects the identification accuracy.

異音の特徴抽出方法については、一般的または統一的な方法は知られていない。従来、作業者は、採取音のスペクトログラムをコンピュータのモニタに表示し、それを目で見ることにより異音のスペクトログラムの特徴を見出すといった方法で異音の特徴抽出を行っている。 There is no known general or unified method for extracting abnormal sound features. 2. Description of the Related Art Conventionally, an operator performs feature extraction of an abnormal sound by a method of displaying a spectrogram of a collected sound on a computer monitor and finding the characteristic of the spectrogram of the abnormal sound by visually observing the spectrogram.

しかしながら、異音の高精度な識別を可能にする異音の特徴を、コンピュータのモニタに表示されたスペクトログラムを見ることにより見出す作業は、長い時間がかかり、大きな負担を作業者に強いる。また、異音の高精度な識別を可能にする異音の特徴を見出せるか否かが作業者の経験に依存する場合が多いため、練達した作業者が存在しない場合には高精度な識別を実現することが困難である。 However, it takes a long time to find the characteristics of the abnormal sound that enables high-accuracy identification of the abnormal sound by looking at the spectrogram displayed on the computer monitor, which imposes a heavy burden on the operator. In addition, since it is often dependent on the experience of the operator whether or not the feature of the abnormal sound that enables high-precision identification of abnormal noise can be found, high-precision identification is performed when there is no skilled worker. It is difficult to realize.

そこで、現在、異音の特徴抽出をコンピュータにより自動的に行う方法の開発が進められている。異音の特徴抽出をコンピュータにより自動的に行うことが可能になれば、異音の特徴抽出を容易にかつ確実に実施することができ、当該処理に関する作業者の負担を削減することができる。 Therefore, development of a method for automatically extracting features of abnormal sounds by a computer is now underway. If abnormal noise feature extraction can be performed automatically by a computer, abnormal noise feature extraction can be performed easily and reliably, and the burden on the operator regarding the processing can be reduced.

しかしながら、異音の高精度な識別を可能にする異音の特徴を抽出するためには、採取音の多数のサンプルをコンピュータによる処理の対象にしなければならず、処理すべきデータの量が膨大になることがある。このため、一般的なパーソナルコンピュータで異音の特徴抽出の処理を迅速に行うことが難しく、異音の特徴抽出の処理を迅速に行うためには、高価で大型な専門的なコンピュータを用意しなければならない。 However, in order to extract abnormal noise features that enable highly accurate identification of abnormal noise, a large number of samples of collected sound must be processed by a computer, and the amount of data to be processed is enormous. May be. For this reason, it is difficult to perform the feature extraction process for abnormal sounds quickly with a general personal computer. To perform the feature extraction process for abnormal sounds quickly, an expensive and large specialized computer must be prepared. There must be.

本発明は例えば上述したような問題に鑑みなされたものであり、本発明の課題は、採取音のスペクトログラムにおいて異音の特徴抽出またはそれに相当する処理をコンピュータにより行うに当たり、コンピュータにおける処理の負担を軽減でき、処理の迅速化を図ることができる音識別条件設定支援装置および音識別条件設定支援方法を提供することにある。 The present invention has been made in view of, for example, the above-described problems. The object of the present invention is to reduce the burden of processing on a computer when performing feature extraction of abnormal sounds or processing corresponding thereto in a spectrogram of collected sound. An object of the present invention is to provide a sound identification condition setting support device and a sound identification condition setting support method that can be reduced and speed up the processing.

上記課題を解決するために、本発明の第１の音識別条件設定支援装置は、異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援装置であって、異音を含む複数の設定用有異音サンプル音のスペクトログラムをそれぞれ示す設定用有異音スペクトログラムデータを受け取り、当該各設定用有異音スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す設定用有異音ヒストグラムデータを生成する設定用有異音ヒストグラム生成部と、前記設定用有異音ヒストグラム生成部により設定用有異音ヒストグラムデータが生成された前記複数の設定用有異音スペクトログラムデータにおける同一周波数帯のブロックのヒストグラムデータにおいて、それぞれの音の強度の度数の順位の平均値または累積値を算定し、当該平均値または累積値において順位の高い順に所定の選択数の音の強度を特定し、これら特定した音の強度を対象強度として選択する強度選択部と、異音を含む生成用有異音サンプル音および異音を含まない生成用無異音サンプル音を含む複数の生成用サンプル音のスペクトログラムをそれぞれ示す生成用スペクトログラムデータを受け取り、当該各生成用スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す生成用ヒストグラムデータを生成する生成用ヒストグラム生成部と、前記各生成用ヒストグラムデータにおける前記対象強度の度数を用い、異音の有無を識別するための識別関数を生成する識別関数生成部と、異音を含む検証用有異音サンプル音および異音を含まない検証用無異音サンプル音を含む複数の検証用サンプル音のスペクトログラムをそれぞれ示す検証用スペクトログラムデータを受け取り、当該各検証用スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す検証用ヒストグラムデータを生成する検証用ヒストグラム生成部と、前記各検証用ヒストグラムデータにおける前記対象強度の度数を前記識別関数に入力して異音の有無の識別を実行し、前記識別関数が異音の有無を正しく識別する確率である識別正解率を算定する正解率算定部とを備えていることを特徴とする。 In order to solve the above-described problem, a first sound identification condition setting support device according to the present invention is a sound identification condition setting support device that supports setting of an identification condition for identifying the presence or absence of an abnormal sound. Receiving the setting noise spectrogram data indicating the spectrograms of the plurality of setting noise sample sounds including each, dividing the predetermined frequency range into a plurality of blocks in each setting noise spectrogram data, Anomalous noise histogram generator for setting for generating anomalous noise histogram for setting indicating a histogram of sound intensity distribution in each block, and anomalous noise histogram data for setting is generated by the anomalous noise histogram generator for setting In the histogram data of the blocks in the same frequency band in the plurality of setting different sound spectrogram data set , Calculate the average or cumulative value of the ranks of the frequency of each sound intensity, specify the intensity of a predetermined number of sounds in descending order of the average value or cumulative value, and determine the intensity of these specified sounds. A spectrogram for generation showing a spectrogram of a plurality of sample sounds for generation including an intensity sampler for generation including abnormal sound and a non-noise sample sound for generation including no abnormal sound A generation histogram generation unit that receives data, divides a predetermined frequency range into a plurality of blocks in each of the generation spectrogram data, and generates generation histogram data indicating a histogram of a sound intensity distribution in each block; For identifying the presence or absence of abnormal noise using the frequency of the target intensity in each generation histogram data A discriminant function generator for generating another function, and a verification sample showing spectrograms of a plurality of verification sample sounds including a verification abnormal sound sample sound including abnormal sound and a verification non-sound sample sound including no abnormal sound A verification histogram generator that receives spectrogram data, divides a predetermined frequency range into a plurality of blocks in each verification spectrogram data, and generates verification histogram data indicating a histogram of sound intensity distribution in each block; , The frequency of the target intensity in each verification histogram data is input to the discrimination function to identify the presence / absence of abnormal noise, and the discrimination accuracy rate, which is the probability that the discrimination function correctly identifies the presence / absence of abnormal noise, And a correct rate calculation unit for calculating.

また、本発明の第２の音識別条件設定支援装置は、上述した本発明の第１の音識別条件設定支援装置において、前記選択数の設定を変更しつつ、前記強度選択部、前記生成用ヒストグラム生成部、前記識別関数生成部、前記検証用ヒストグラム生成部および前記正解率算定部による処理を複数回実行し、当該処理を複数回実行した結果、前記識別関数生成部により生成された複数の識別関数のうち前記識別正解率が最も高い識別関数および当該識別関数を生成する際に選択された前記対象強度を前記識別条件として選定する識別条件選定部を備えていることを特徴とする。 Further, the second sound identification condition setting support device of the present invention is the above-described first sound identification condition setting support device of the present invention described above, while changing the setting of the selection number, the intensity selection unit, the generation A process by the histogram generation unit, the discrimination function generation unit, the verification histogram generation unit, and the accuracy rate calculation unit is executed a plurality of times, and the process is executed a plurality of times. An identification condition selecting unit that selects the identification function having the highest classification accuracy rate among the identification functions and the target strength selected when generating the identification function as the identification conditions is provided.

また、本発明の第３の音識別条件設定支援装置は、前記各スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割する際のブロック数および前記選択数の設定を変更しつつ、前記設定用有異音ヒストグラム生成部、前記強度選択部、前記生成用ヒストグラム生成部、前記識別関数生成部、前記検証用ヒストグラム生成部および前記正解率算定部による処理を複数回実行し、当該処理を複数回実行した結果、前記識別関数生成部により生成された複数の識別関数のうち前記識別正解率が最も高い識別関数、当該識別関数を生成する際に設定された前記ブロック数、および当該識別関数を生成する際に選択された前記対象強度を前記識別条件として選定する識別条件選定部を備えていることを特徴とする。 Further, the third sound identification condition setting support device of the present invention changes the setting of the number of blocks and the selection number when dividing a predetermined frequency range in each spectrogram data into a plurality of blocks, The processing by the noise histogram generation unit, the intensity selection unit, the generation histogram generation unit, the discrimination function generation unit, the verification histogram generation unit, and the accuracy rate calculation unit is executed a plurality of times, and the processing is performed a plurality of times. As a result of execution, the discrimination function having the highest discrimination accuracy rate among the plurality of discrimination functions generated by the discrimination function generation unit, the number of blocks set when generating the discrimination function, and the discrimination function are generated. An identification condition selection unit is provided that selects the target intensity selected when performing the identification as the identification condition.

また、本発明の第１の音識別条件設定支援方法は、異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援方法であって、異音を含む複数の設定用有異音サンプル音のスペクトログラムをそれぞれ示す設定用有異音スペクトログラムデータを受け取り、当該各設定用有異音スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す設定用有異音ヒストグラムデータを生成する設定用有異音ヒストグラム生成工程と、前記設定用有異音ヒストグラム生成工程において設定用有異音ヒストグラムデータが生成された前記複数の設定用有異音スペクトログラムデータにおける同一周波数帯のブロックのヒストグラムデータにおいて、それぞれの音の強度の度数の順位の平均値または累積値を算定し、当該平均値または累積値において順位の高い順に所定の選択数の音の強度を特定し、これら特定した音の強度を対象強度として選択する強度選択工程と、異音を含む生成用有異音サンプル音および異音を含まない生成用無異音サンプル音を含む複数の生成用サンプル音のスペクトログラムをそれぞれ示す生成用スペクトログラムデータを受け取り、当該各生成用スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す生成用ヒストグラムデータを生成する生成用ヒストグラム生成工程と、前記各生成用ヒストグラムデータにおける前記対象強度の度数を用い、異音の有無を識別するための識別関数を生成する識別関数生成工程と、異音を含む検証用有異音サンプル音および異音を含まない検証用無異音サンプル音を含む複数の検証用サンプル音のスペクトログラムをそれぞれ示す検証用スペクトログラムデータを受け取り、当該各検証用スペクトログラムデータにおいて、所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示す検証用ヒストグラムデータを生成する検証用ヒストグラム生成工程と、前記各検証用ヒストグラムデータにおける前記対象強度の度数を前記識別関数に入力して異音の有無の識別を実行し、前記識別関数が異音の有無を正しく識別する確率である識別正解率を算定する正解率算定工程とを備え、前記選択数の設定を変更しつつ、前記強度選択工程、前記生成用ヒストグラム生成工程、前記識別関数生成工程、前記検証用ヒストグラム生成工程および前記正解率算定工程による処理を複数回実行し、当該処理を複数回実行した結果、前記識別関数生成工程において生成された複数の識別関数のうち前記識別正解率が最も高い識別関数および当該識別関数を生成する際に選択された前記対象強度を前記識別条件として選定することを特徴とする。 The first sound identification condition setting support method of the present invention is a sound identification condition setting support method that supports setting of an identification condition for identifying the presence or absence of an abnormal sound, and is used for a plurality of settings including abnormal sound. The setting specific sound spectrogram data indicating the spectrogram of the abnormal sound sample is received, the predetermined frequency range is divided into a plurality of blocks in each setting specific sound spectrogram data, and the sound intensity in each block A setting noise histogram generation step for generating setting noise histogram data indicating a distribution histogram, and the plurality of settings in which the setting noise histogram data is generated in the setting noise histogram generation step In the histogram data of the block of the same frequency band in the abnormal sound spectrogram data for each, Calculate the average or cumulative value of the rank of the frequency of the intensity, specify the intensity of the selected number of sounds in descending order of the average value or cumulative value, and select the specified sound intensity as the target intensity Receiving a spectrogram data for generation indicating a spectrogram of each of a plurality of generation sample sounds including a generation noise sample sound including generation noise and a generation non-sound sample sound including no generation noise, In each of the generation spectrogram data, a generation histogram generation step of dividing a predetermined frequency range into a plurality of blocks and generating generation histogram data indicating a histogram of a sound intensity distribution in each block; Using the frequency of the target intensity in the histogram data, an identification function for identifying the presence or absence of abnormal noise And a spectrogram data for verification showing a spectrogram of a plurality of sample sounds for verification including a sample sound for verification including abnormal sounds and a non-noise sample sound for verification that does not include abnormal sound. Receiving each verification spectrogram data, dividing a predetermined frequency range into a plurality of blocks, and generating a verification histogram data indicating a histogram of a sound intensity distribution in each block; The correct answer for inputting the frequency of the target intensity in the verification histogram data to the discrimination function to identify the presence / absence of abnormal noise, and calculating the discrimination accuracy rate that is the probability that the discrimination function correctly identifies the presence / absence of abnormal noise A rate calculation step, changing the setting of the number of selections, the strength selection step, and the generating hiss A plurality of processes generated in the discrimination function generation process are performed as a result of executing the process by the program generation process, the discrimination function generation process, the verification histogram generation process, and the accuracy rate calculation process a plurality of times. The discriminant function having the highest discrimination accuracy rate among discriminant functions and the target strength selected when generating the discriminant function are selected as the discriminating conditions.

また、本発明の第２の音識別条件設定支援方法は、上述した本発明の第１の音識別条件設定支援方法において、前記各スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割する際のブロック数および前記選択数の設定を変更しつつ、前記設定用有異音ヒストグラム生成工程、前記強度選択工程、前記生成用ヒストグラム生成工程、前記識別関数生成工程、前記検証用ヒストグラム生成工程および前記正解率算定工程による処理を複数回実行し、当該処理を複数回実行した結果、前記識別関数生成工程において生成された複数の識別関数のうち前記識別正解率が最も高い識別関数、当該識別関数を生成する際に設定された前記ブロック数、および当該識別関数を生成する際に選択された前記対象強度を前記識別条件として選定することを特徴とする。 Further, the second sound identification condition setting support method of the present invention is the above-described first sound identification condition setting support method of the present invention described above, wherein the predetermined frequency range in each spectrogram data is divided into a plurality of blocks. While changing the setting of the number of blocks and the number of selections, the setting noise histogram generation step, the intensity selection step, the generation histogram generation step, the discrimination function generation step, the verification histogram generation step, and the correct answer As a result of executing the process by the rate calculation step a plurality of times and executing the processing a plurality of times, the discrimination function having the highest discrimination accuracy rate among the plurality of discrimination functions generated in the discrimination function generation step, and the discrimination function are generated As the identification condition, the number of blocks set when performing and the target intensity selected when generating the identification function Characterized in that it constant.

本発明によれば、採取音のスペクトログラムにおいて異音の特徴抽出またはそれに相当する処理をコンピュータにより行うに当たり、コンピュータにおける処理の負担を軽減でき、処理の迅速化を図ることができる。 According to the present invention, when a computer performs a feature extraction of an abnormal sound or a process corresponding thereto in a spectrogram of a collected sound, the processing burden on the computer can be reduced, and the process can be speeded up.

本発明の実施形態による音識別条件設定支援装置を示すブロック図である。It is a block diagram which shows the sound identification condition setting assistance apparatus by embodiment of this invention. 本発明の実施形態による音識別条件設定支援装置による音識別条件設定支援処理を示すフローチャートである。It is a flowchart which shows the sound identification condition setting assistance process by the sound identification condition setting assistance apparatus by embodiment of this invention. 図２に続く音識別条件設定支援処理を示すフローチャートである。It is a flowchart which shows the sound identification condition setting assistance process following FIG. 図３に続く音識別条件設定支援処理を示すフローチャートである。It is a flowchart which shows the sound identification condition setting assistance process following FIG. 有異音サンプル音の波形図である。It is a wave form diagram of a strange sound sample sound. 有異音サンプル音のスペクトログラムを示すグラフである。It is a graph which shows the spectrogram of an unusual sound sample sound. 有異音サンプル音のスペクトログラムの１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in one block of the spectrogram of the unusual sound sample. 有異音サンプル音のスペクトログラムの他の１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in other one block of the spectrogram of an unusual sound sample sound. 有異音サンプル音のスペクトログラムのさらに他の１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in another one block of the spectrogram of a strange sound sample sound. サンプル音のスペクトログラムを２０個のブロック数のブロックに分割した状態を示す説明図である。It is explanatory drawing which shows the state which divided | segmented the spectrogram of the sample sound into the block of 20 blocks. サンプル音のスペクトログラムを３０個のブロック数のブロックに分割した状態を示す説明図である。It is explanatory drawing which shows the state which divided | segmented the spectrogram of the sample sound into the block of the number of 30 blocks. １つの有異音サンプル音のスペクトログラムデータにおける各周波数帯の最大度数強度等を示す説明図である。It is explanatory drawing which shows the maximum frequency intensity | strength etc. of each frequency band in the spectrogram data of one unusual sound sample. 有異音サンプル音のスペクトログラムデータにおいて同一周波数帯の候補ブロックのカウント値等を示す説明図である。It is explanatory drawing which shows the count value etc. of the candidate block of the same frequency band in the spectrogram data of a noise sample sound. 有異音サンプル音の１つのヒストグラムデータにおける音の強度の順位を示す説明図である。It is explanatory drawing which shows the order | rank of the intensity | strength of the sound in one histogram data of a strange sound sample sound. 有異音サンプル音の複数のヒストグラムデータにおける音の強度の順位の平均値を示す説明図である。It is explanatory drawing which shows the average value of the intensity | strength order | rank of the sound intensity in the several histogram data of a noise sample sound. 識別関数の識別正解率の算定方法を示す説明図である。It is explanatory drawing which shows the calculation method of the identification correct rate of an identification function. ブロック数、周波数帯の選択数ｃおよび音の強度の選択数ｄのそれぞれの設定に対応する識別関数の識別正解率を示す説明図である。It is explanatory drawing which shows the discrimination | determination accuracy rate of the discrimination function corresponding to each setting of the number of blocks, the selection number c of a frequency band, and the selection number d of a sound intensity. 本発明の実施形態による異音判定装置を示すブロック図である。It is a block diagram which shows the abnormal sound determination apparatus by embodiment of this invention. 本発明の実施形態による異音判定装置による異音判定処理を示すフローチャートである。It is a flowchart which shows the noise determination process by the noise determination apparatus by embodiment of this invention.

以下、本発明の実施の形態につき、図面を参照しながら説明する。 Embodiments of the present invention will be described below with reference to the drawings.

（音識別条件設定支援装置による処理の概要）
図１は本発明の実施形態による音識別条件設定支援装置を示している。図１において、本発明の実施形態による音識別条件設定支援装置１は、採取音中の異音の有無を識別するための識別条件の設定を支援する装置である。採取音は、例えば、送電線または機器等に近い場所で採取した音である。異音は、例えば、送電線の周囲に住む人が不快を訴える音、または機器が正常である場合には発生しない音等である。 (Outline of processing by the sound identification condition setting support device)
FIG. 1 shows a sound identification condition setting support apparatus according to an embodiment of the present invention. In FIG. 1, a sound identification condition setting support apparatus 1 according to an embodiment of the present invention is an apparatus that supports the setting of an identification condition for identifying the presence or absence of abnormal sounds in a collected sound. The collected sound is, for example, a sound collected at a place near a power transmission line or equipment. The abnormal noise is, for example, a sound in which people living around the power transmission line complain of discomfort or a sound that does not occur when the device is normal.

音識別条件設定支援装置１により行われる識別条件の設定を支援する処理（音識別条件設定支援処理）は、大まかに述べると、採取音中の異音の有無を識別するための識別関数を生成ないし選択する処理である。この処理は、実質的にみて、異音の特徴を抽出する処理に相当する。 The process for supporting the setting of the identification condition performed by the sound identification condition setting support device 1 (sound identification condition setting support process) roughly generates a discrimination function for identifying the presence or absence of abnormal sounds in the collected sound. Or a process to select. This process is substantially equivalent to a process for extracting features of abnormal sounds.

音識別条件設定支援装置１は、音識別条件設定支援処理として、概ね次の処理を行う。まず、処理の準備として、作業者は、有異音サンプル音の音声データおよび無異音サンプル音の音声データを用意する。有異音サンプル音とは、異音を含む採取音のサンプルである。無異音サンプル音とは、異音を含まない採取音のサンプルである。音声データは、例えば、ＷＡＶ、ＭＰ３（ＭＰＥＧＡｕｄｉｏＬａｙｅｒー３）、ＡＡＣ（ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）等の音声ファイルフォーマットを有するデジタルデータである。 The sound identification condition setting support device 1 generally performs the following process as the sound identification condition setting support process. First, as preparation for processing, an operator prepares audio data of abnormal sound samples and audio data of non-natural sound samples. The abnormal sound sample sound is a sample of the collected sound including the abnormal sound. An allophone sample sound is a sample of a sampled sound that does not contain an allophone. The audio data is digital data having an audio file format such as WAV, MP3 (MPEG Audio Layer-3), AAC (Advanced Audio Coding), and the like.

作業者は、例えば次のような方法で、有異音サンプル音の音声データと無異音サンプル音の音声データをそれぞれ複数用意する。すなわち、作業者は、同じ場所で複数回録音を実行することにより複数の音声を採取し、採取した複数の音声を順次再生し、それらを聴き、各音声中の異音の有無を判断する。そして、作業者は、異音が含まれている音声に対応する音声データを有異音サンプル音の音声データに分類し、異音が含まれていない音声に対応する音声データを無異音サンプル音の音声データに分類する。 For example, the operator prepares a plurality of voice data of abnormal sample sounds and a plurality of voice data of non-sound sample sounds by the following method. In other words, the operator collects a plurality of sounds by executing recording at the same place, reproduces the collected sounds sequentially, listens to them, and determines the presence or absence of abnormal sounds in each sound. Then, the operator classifies the audio data corresponding to the sound including the abnormal sound into the sound data of the abnormal sound sample sound, and the sound data corresponding to the sound not including the abnormal sound is sampled without the abnormal sound. Classify into sound data.

次に、作業者は、複数の有異音サンプル音の音声データと、複数の無異音サンプル音の音声データを音識別条件設定支援装置１に入力する。音識別条件設定支援装置１は、これら入力された音声データのそれぞれにつき、音声のスペクトログラムを示すスペクトログラムデータを生成する。 Next, the worker inputs sound data of a plurality of abnormal sound samples and sound data of a plurality of sound samples without sound to the sound identification condition setting support device 1. The sound identification condition setting support device 1 generates spectrogram data indicating a spectrogram of speech for each of the input speech data.

続いて、音識別条件設定支援装置１は、（１）対象周波数帯の選択、（２）対象強度の選択、（３）識別関数の生成、（４）識別関数の識別正解率の算定、（５）確定識別関数等の選定を行う。（１）〜（５）の処理の概要は次の通りである。 Subsequently, the sound identification condition setting support device 1 (1) selection of the target frequency band, (2) selection of the target intensity, (3) generation of the discrimination function, (4) calculation of the discrimination accuracy rate of the discrimination function, ( 5) Select a deterministic identification function. The outline of the processes (1) to (5) is as follows.

（１）対象周波数帯の選択
まず、対象周波数帯の選択の処理は概ね次の通りである。すなわち、音識別条件設定支援装置１は、有異音サンプル音および無異音サンプル音を用いて識別関数の生成等を行うが、識別関数の生成等を、有異音サンプル音および無異音サンプル音の全周波数帯を用いて行うのではなく、異音の周波数を含むことが予想される限られた周波数帯のみを用いて行う。これを実現するために、音識別条件設定支援装置１は、異音の周波数を含むことが予想される周波数帯を特定し、その周波数帯を「対象周波数帯」として選択する。 (1) Selection of target frequency band First, the process of selecting a target frequency band is as follows. That is, the sound identification condition setting support device 1 generates a discrimination function using the abnormal sound sample and the non-noise sample sound, and the discrimination function is generated using the abnormal sound sample and the non-noise sound. This is not performed using the entire frequency band of the sample sound, but only using a limited frequency band that is expected to include the frequency of the abnormal sound. In order to realize this, the sound identification condition setting support device 1 specifies a frequency band that is expected to include an abnormal noise frequency, and selects the frequency band as a “target frequency band”.

具体的には、音識別条件設定支援装置１は、まず、所定の規則に従って「ブロック数」を設定する。続いて、音識別条件設定支援装置１は、有異音サンプル音のスペクトログラムデータのそれぞれにつき、所定の周波数範囲を、ブロック数が示す個数のブロックに分割し、ブロックのそれぞれにつき、音の強度分布のヒストグラムを示すヒストグラムデータを生成する。上記所定の周波数範囲は、例えば一般的な可聴域である２０Ｈｚ程度から２００００Ｈｚ程度までの範囲である。 Specifically, the sound identification condition setting support device 1 first sets the “number of blocks” according to a predetermined rule. Subsequently, the sound identification condition setting support device 1 divides the predetermined frequency range into the number of blocks indicated by the number of blocks for each spectrogram data of the abnormal sound sample, and the sound intensity distribution for each of the blocks Histogram data indicating the histogram of the current is generated. The predetermined frequency range is, for example, a range from about 20 Hz to about 20000 Hz, which is a general audible range.

続いて、音識別条件設定支援装置１は、有異音サンプル音のスペクトログラムデータのそれぞれにつき、周波数が高くなったにもかかわらず、音の強度が増加したブロックを「候補ブロック」として選択する。すなわち、異音が含まれていない採取音においては、通常、周波数が高くなるに伴い音の強度が減少すると考えられる。これに対し、異音が含まれている採取音においては、その異音の周波数の強度がその前後の周波数の強度よりも大きくなる。したがって、採取音において、周波数が高くなったにもかかわらず、音の強度が増加する周波数帯が存在する場合には、その採取音には異音が含まれており、当該周波数帯に異音の周波数が含まれている可能性が高い。そこで、音識別条件設定支援装置１は、周波数が高くなったにもかかわらず、音の強度が増加する周波数帯のブロックを候補ブロックとして選択する。 Subsequently, the sound identification condition setting support device 1 selects, as the “candidate block”, a block whose sound intensity has increased despite the frequency being increased for each spectrogram data of the abnormal sound sample. That is, in the collected sound that does not include the abnormal sound, it is generally considered that the intensity of the sound decreases as the frequency increases. On the other hand, in the sampled sound including the abnormal sound, the intensity of the frequency of the abnormal sound is larger than the intensity of the frequencies before and after the abnormal sound. Therefore, if there is a frequency band in which the intensity of the sound increases in the sampled sound even though the frequency has increased, the sampled sound contains the abnormal sound, and the noise is included in the frequency band. It is highly possible that the frequency is included. Therefore, the sound identification condition setting support device 1 selects, as a candidate block, a block in a frequency band in which the sound intensity increases even though the frequency is increased.

ここで、候補ブロックの選択についてより具体的に説明する。音の強度分布のヒストグラムデータにおいて、度数が最大となる音の強度を「最大度数強度」というとすると、音識別条件設定支援装置１は、有異音サンプル音のスペクトログラムデータのそれぞれにおいて、最も周波数帯の低いブロックを除くすべてのブロックにつき、互いに隣り合う２つのブロックのヒストグラムデータの最大度数強度を比較し、周波数帯が高い方のブロックのヒストグラムデータの最大度数強度が、周波数帯が低い方のブロックのヒストグラムデータの最大度数強度よりも大きい場合には、周波数帯が高い方のブロックを候補ブロックとして選択する。 Here, the selection of candidate blocks will be described more specifically. In the histogram data of the sound intensity distribution, if the intensity of the sound having the maximum frequency is referred to as “maximum frequency intensity”, the sound identification condition setting support apparatus 1 has the highest frequency in each spectrogram data of the sampled sound with abnormal sound. For all the blocks except the low band block, compare the maximum frequency intensity of the histogram data of the two adjacent blocks, and the maximum frequency intensity of the histogram data of the block with the higher frequency band is the lower frequency band When the maximum frequency intensity of the histogram data of the block is larger, the block with the higher frequency band is selected as a candidate block.

続いて、音識別条件設定支援装置１は、所定の規則に従って「選択数ｃ」を設定する。続いて、音識別条件設定支援装置１は、生成した有異音サンプル音のスペクトログラムデータのすべてにおいて、同一周波数帯の候補ブロックの個数を、ブロックの周波数帯ごとにそれぞれカウントする。続いて、音識別条件設定支援装置１は、カウント値の大きい順に、選択数ｃが示す個数の周波数帯を特定し、これら特定した周波数帯および当該周波数帯よりも低い隣の周波数帯を対象周波数帯として選択する。 Subsequently, the sound identification condition setting support device 1 sets the “number of selections c” according to a predetermined rule. Subsequently, the sound identification condition setting support device 1 counts the number of candidate blocks in the same frequency band for each of the frequency bands of the blocks in all the spectrogram data of the generated abnormal sound samples. Subsequently, the sound identification condition setting support device 1 specifies the number of frequency bands indicated by the selection number c in descending order of the count value, and sets the specified frequency band and the adjacent frequency band lower than the frequency band as the target frequency. Select as a belt.

すなわち、有異音サンプル音のそれぞれには、同一の周波数を有する異音が含まれており、その異音が識別すべき対象であると考えられる。音識別条件設定支援装置１により生成された有異音サンプル音のスペクトログラムデータのうち、多くのスペクトログラムデータに、同一周波数帯の候補ブロックが含まれている場合には、その候補ブロックの周波数帯に異音の周波数が含まれている可能性が高い。また、異音の周波数が含まれる周波数帯の強度分布と、その周波数帯よりも低い隣の周波数帯の強度分布との相違により、異音の強度分布が特徴付けられることがある。このような考え方に基づき、音識別条件設定支援装置１は、生成した複数の有異音サンプル音のスペクトログラムデータにおける上記所定の周波数範囲において、相対的にみて多数の候補ブロックが存在する周波数帯と、当該周波数帯よりも低い隣の周波数帯を対象周波数帯として選択する。 In other words, each of the abnormal sound samples includes an abnormal sound having the same frequency, and it is considered that the abnormal sound is a target to be identified. Among the spectrogram data of abnormal sample sounds generated by the sound identification condition setting support device 1, when many spectrogram data include candidate blocks of the same frequency band, the frequency bands of the candidate blocks are included in the frequency bands of the candidate blocks. There is a high possibility that an abnormal noise frequency is included. Further, the intensity distribution of the abnormal noise may be characterized by the difference between the intensity distribution of the frequency band including the frequency of the abnormal noise and the intensity distribution of the adjacent frequency band lower than the frequency band. Based on such a concept, the sound identification condition setting support device 1 includes a frequency band in which a relatively large number of candidate blocks exist in the predetermined frequency range in the generated spectrogram data of a plurality of abnormal sound samples. The adjacent frequency band lower than the frequency band is selected as the target frequency band.

（２）対象強度の選択
次に、対象強度の選択の処理は概ね次の通りである。すなわち、音識別条件設定支援装置１は、有異音サンプル音および無異音サンプル音のスペクトログラムデータのそれぞれにつき、所定の周波数範囲を、設定されたブロック数が示す個数のブロックに分割し、ブロックのそれぞれにつき、音の強度分布のヒストグラムを示すヒストグラムデータを生成し、これらヒストグラムデータを用いて識別関数の生成等を行うが、ヒストグラムデータの全強度範囲を用いて識別関数の生成を行うのではなく、異音の存在により度数が大きくなったことが予想される限られた強度範囲のみを用いて識別関数の生成を行う。これを実現するために、音識別条件設定支援装置１は、異音の存在により度数が大きくなったことが予想される強度範囲を特定し、その強度範囲を「対象強度」として選択する。 (2) Selection of target strength Next, processing for selecting target strength is generally as follows. That is, the sound identification condition setting support device 1 divides a predetermined frequency range into the number of blocks indicated by the set number of blocks for each of spectrogram data of abnormal sound samples and non-noise sample sounds. Histogram data showing a histogram of the sound intensity distribution is generated for each of the above, and the discrimination function is generated using these histogram data, but the discrimination function is generated using the entire intensity range of the histogram data. Instead, the discriminant function is generated using only a limited intensity range in which the frequency is expected to increase due to the presence of abnormal noise. In order to realize this, the sound identification condition setting support device 1 specifies an intensity range in which the frequency is expected to increase due to the presence of an abnormal sound, and selects the intensity range as the “target intensity”.

すなわち、有異音サンプル音において、限られた周波数帯において限られた強度範囲の音が高い頻度で発生している場合、その音は異音である可能性が高い。したがって、有異音サンプル音のスペクトログラムデータにおける対象周波数帯のヒストグラムデータにおいて、強度の度数が周囲よりも大幅に大きい部分が存在する場合、その部分は、異音の存在により度数が大きくなった部分に当たる可能性が高い。このような考え方に基づき、音識別条件設定支援装置１は、有異音サンプル音のスペクトログラムデータにおける同一の対象周波数帯のヒストグラムデータにおいて、相対的にみて度数が大きい強度（例えば、度数の順位が第１位、第２位、第３位の強度など）を対象強度として選択する。 In other words, in the sampled abnormal sound, if a sound in a limited intensity range is generated with a high frequency in a limited frequency band, the sound is likely to be abnormal. Therefore, in the histogram data of the target frequency band in the spectrogram data of the abnormal sound sample sound, if there is a part where the intensity frequency is significantly larger than the surrounding area, that part is the part where the frequency is increased due to the presence of the abnormal sound Is likely to hit. Based on such an idea, the sound identification condition setting support device 1 has a relatively high intensity (for example, the rank of the frequency in the histogram data of the same target frequency band in the spectrogram data of the abnormal sound sample sound). The first, second, and third strengths) are selected as target strengths.

具体的には、音識別条件設定支援装置１は、所定の規則に従って「選択数ｄ」を設定する。続いて、音識別条件設定支援装置１は、生成した有異音サンプル音のスペクトログラムデータのすべてにおいて、同一の対象周波数帯のブロックのヒストグラムデータにおけるそれぞれの音の強度の度数の平均順位を算定し、当該平均順位の高い順に、選択数ｄが示す個数の音の強度を特定し、これら特定した音の強度を対象強度として選択する。 Specifically, the sound identification condition setting support device 1 sets the “number of selections d” according to a predetermined rule. Subsequently, the sound identification condition setting support device 1 calculates the average rank of the frequency of each sound in the histogram data of the blocks of the same target frequency band in all the spectrogram data of the generated abnormal sound samples. The intensity of the number of sounds indicated by the selection number d is specified in descending order of the average rank, and the intensity of the specified sounds is selected as the target intensity.

（３）識別関数の生成
次に、識別関数の生成処理は概ね次の通りである。すなわち、音識別条件設定支援装置１は、入力された複数の有異音サンプル音のうちの一部、例えば半数の有異音サンプル音と、入力した複数の無異音サンプル音のうちの一部、例えば半数の無異音サンプル音とを用い、識別関数を生成する。具体的には、音識別条件設定支援装置１は、入力された複数の有異音サンプル音のうちの半数の有異音サンプル音のスペクトログラムデータと、入力された複数の無異音サンプル音のうちの半数の無異音サンプル音のスペクトログラムデータのそれぞれにつき、上記所定の周波数範囲を、設定されたブロック数が示す個数のブロックに分割し、分割したブロックのうち、対象周波数帯のブロックについてのみ、音の強度分布のヒストグラムを示すヒストグラムデータを生成する。なお、上述した対象周波数帯の選択の処理を行うために、入力された有異音サンプル音のスペクトログラムデータの各ブロックにつきヒストグラムデータをすでに生成している場合には、識別関数の生成処理を行うためにヒストグラムデータを改めて生成する必要はなく、対象周波数帯の選択の処理を行うために生成したヒストグラムデータの中から、対象周波数帯のヒストグラムデータを選択して用いればよい。 (3) Generation of Discriminant Function Next, discriminant function generation processing is generally as follows. That is, the sound identification condition setting support device 1 includes a part of a plurality of input abnormal sound samples, for example, half of the abnormal sound samples and one of the input non-natural sound samples. A discriminant function is generated using a part, for example, half of all sound samples. Specifically, the sound identification condition setting support device 1 includes spectrogram data of half of the different sampled sounds out of the plurality of input sampled sounds, and a plurality of input non-sampled sounds. For each of the spectrogram data of half of the non-sound sample sound, the predetermined frequency range is divided into the number of blocks indicated by the set number of blocks, and only the blocks in the target frequency band among the divided blocks Histogram data indicating a histogram of sound intensity distribution is generated. If the histogram data has already been generated for each block of the spectrogram data of the input abnormal sound sample in order to perform the above-described target frequency band selection processing, the discrimination function generation processing is performed. Therefore, it is not necessary to newly generate the histogram data, and the histogram data of the target frequency band may be selected and used from the histogram data generated to perform the process of selecting the target frequency band.

続いて、音識別条件設定支援装置１は、生成した対象周波数帯のヒストグラムデータにおける対象強度の度数のみを用い、識別関数を生成する。 Subsequently, the sound identification condition setting support device 1 generates an identification function using only the frequency of the target intensity in the generated histogram data of the target frequency band.

（４）識別関数の識別正解率の算定
次に、識別関数の識別正解率を算定する処理は概ね次の通りである。すなわち、音識別条件設定支援装置１は、入力された複数の有異音サンプル音のうち、識別関数の生成処理に用いていない有異音サンプル音と、入力された複数の無異音サンプル音のうち、識別関数の生成処理に用いていない無異音サンプル音を用い、識別関数を実行して異音の有無の識別を行い、識別の正否判定を行う。具体的には、音識別条件設定支援装置１は、入力された複数の有異音サンプル音のうち、識別関数の生成処理に用いていない有異音サンプル音のスペクトログラムデータと、入力された複数の無異音サンプル音のうち、識別関数の生成処理に用いていない無異音サンプル音のスペクトログラムデータのそれぞれにつき、上記所定の周波数範囲を、設定されたブロック数が示す個数のブロックに分割し、分割したブロックのうち、対象周波数帯のブロックについてのみ、音の強度分布のヒストグラムを示すヒストグラムデータを生成する。なお、上述した対象周波数帯の選択の処理を行うために、入力された有異音サンプル音のスペクトログラムデータの各ブロックにつきヒストグラムデータをすでに生成している場合には、識別関数による識別の正否判定を行うためにヒストグラムデータを改めて生成する必要はなく、対象周波数帯の選択の処理を行うために生成したヒストグラムデータの中から、識別関数の生成処理に用いていない有異音サンプル音のスペクトログラムデータにおける対象周波数帯のヒストグラムデータを選択して用いればよい。 (4) Calculation of discrimination correct rate of discrimination function Next, the process of calculating the discrimination correct rate of discrimination function is as follows. That is, the sound identification condition setting support device 1 includes a plurality of input sampled sounds that are not used for the discrimination function generation process, and a plurality of input soundless sampled sounds. Among them, a non-sound sample sound that is not used in the generation process of the discrimination function is used, the discrimination function is executed to discriminate the presence or absence of the abnormal sound, and the correctness of the discrimination is determined. Specifically, the sound identification condition setting support device 1 includes spectrogram data of the different sampled sounds that are not used in the generation process of the discrimination function among the plurality of input different sampled sounds, and the plurality of input For each spectrogram data of the non-sound sample sound that is not used in the discrimination function generation process, the predetermined frequency range is divided into the number of blocks indicated by the set number of blocks. Of the divided blocks, histogram data indicating a histogram of the sound intensity distribution is generated only for the block in the target frequency band. If the histogram data has already been generated for each block of the spectrogram data of the input abnormal sound sample in order to perform the process of selecting the target frequency band described above, it is determined whether the identification function is correct or not. It is not necessary to generate histogram data anew to perform the process, and from the histogram data generated to select the target frequency band, the spectrogram data of the sampled sound that is not used for the discrimination function generation process What is necessary is just to select and use the histogram data of the target frequency band.

続いて、音識別条件設定支援装置１は、生成した対象周波数帯のヒストグラムデータにおける対象強度の度数を識別関数に順次入力し、入力するごとに、当該識別関数により異音の識別を実行し、識別結果の正否を判定する。続いて、音識別条件設定支援装置１は、得られた判定結果に基づき、当該識別関数につき、異音の有無を正しく識別する確率である識別正解率を算定する。 Subsequently, the sound identification condition setting support device 1 sequentially inputs the frequency of the target intensity in the generated histogram data of the target frequency band to the identification function, and each time it is input, the noise identification is performed by the identification function. The correctness of the identification result is determined. Subsequently, the sound identification condition setting support device 1 calculates an identification accuracy rate that is a probability of correctly identifying presence / absence of an abnormal sound for the identification function based on the obtained determination result.

音識別条件設定支援装置１は、ブロック数の設定、選択数ｃの設定、および選択数ｄの設定を変更しながら、（１）対象周波数帯の選択、（２）対象強度の選択、（３）識別関数の生成、（４）識別関数の識別正解率の算定を複数回行う。これにより、複数の識別関数が生成され、これら複数の識別関数のそれぞれについて識別の正否判定が行われ、その結果から、これら複数の識別関数のそれぞれについて識別正解率が算定される。 The sound identification condition setting support device 1 changes the setting of the number of blocks, the setting of the number of selections c, and the setting of the number of selections d, (1) selection of the target frequency band, (2) selection of the target intensity, (3 ) Generate discriminant function, and (4) calculate the discriminant accuracy rate of discriminant function multiple times. As a result, a plurality of discriminant functions are generated, a discrimination correctness determination is performed for each of the plurality of discriminant functions, and a discrimination correct answer rate is calculated for each of the plurality of discriminant functions from the result.

（５）確定識別関数等の選定
次に、確定識別関数等を選定する処理は概ね次の通りである。すなわち、音識別条件設定支援装置１は、上述したように生成された複数の識別関数のうち、識別正解率が最も高い識別関数を選定し、当該識別関数、当該識別関数に対応するブロック数、当該識別関数に対応する対象周波数帯、および当該識別関数に対応する対象強度を、確定識別関数、確定ブロック数、確定対象周波数帯、および確定対象強度として記憶部２１に記憶する。このようにして音識別条件設定支援装置１により選定・記憶された、確定識別関数、確定ブロック数、確定対象周波数帯、および確定対象強度は、採取音中の異音の有無を異音判定装置３０（図１８参照）により判定する際に用いられる。 (5) Selection of deterministic discriminant function etc. Next, the process of selecting the deterministic discriminant function etc. is generally as follows. That is, the sound identification condition setting support device 1 selects the identification function having the highest classification accuracy rate from among the plurality of identification functions generated as described above, the identification function, the number of blocks corresponding to the identification function, The target frequency band corresponding to the discriminant function and the target intensity corresponding to the discriminant function are stored in the storage unit 21 as a definitive discriminant function, the number of definite blocks, a definitive target frequency band, and a definitive target intensity. Thus, the sound identification condition setting support device 1 selects and stores the definite identification function, the number of definite blocks, the definite target frequency band, and the deterministic target intensity. 30 (see FIG. 18).

ここで、音のスペクトログラムおよび強度分布のヒストグラムについて図５ないし図９を参照しながら説明する。図５は有異音サンプル音の波形の具体例を示す。図５において横軸が時間であり、縦軸が音の強さである。図６は、図５に示す有異音サンプル音のスペクトログラムを示している。図６において横軸が時間であり、縦軸が周波数であり、色が音の強さを示している。なお、図６においては、音の強さが黒と白の２段階で表されており、白よりも黒の方が音の強さが大きいことを示しているが、実際のスペクトログラムのグラフは、音の強さが多段階（例えば２５６段階）の色（例えば色の明度）の違いで表される。 Here, the spectrogram of sound and the histogram of intensity distribution will be described with reference to FIGS. FIG. 5 shows a specific example of the waveform of the abnormal sound sample. In FIG. 5, the horizontal axis represents time, and the vertical axis represents sound intensity. FIG. 6 shows a spectrogram of the abnormal sample sound shown in FIG. In FIG. 6, the horizontal axis represents time, the vertical axis represents frequency, and the color represents the intensity of sound. In FIG. 6, the sound intensity is represented in two stages of black and white, and black indicates that the sound intensity is greater than white, but the actual spectrogram graph is The intensity of sound is represented by a difference in color (for example, lightness of color) in multiple levels (for example, 256 levels).

図７は、図６中のブロックＦ１における音の強度分布のヒストグラムを示し、図８は、図６中のブロックＦ２における音の強度分布のヒストグラムを示し、図９は、図６中のブロックＦ３における音の強度分布のヒストグラムを示している。図７ないし図９のそれぞれにおいて、縦軸が度数／全度数であり、横軸が音の強さ（強度）である。ここで、有異音サンプル音と無異音サンプル音とは、両者の長さ（開始時点から終了時点までの時間）が互いに同じであり、有異音サンプル音または無異音サンプル音の長さ（例えば１時間）を所定の単位時間（例えば１秒）で割った値が「全度数」に相当する。また、有異音サンプル音または無異音サンプル音の開始時点から終了時点までの間において、上記所定の単位時間を単位にし、音の強さが同じ程度となった回数をカウントすることにより得られた値が「度数」に相当する。 7 shows a histogram of sound intensity distribution in block F1 in FIG. 6, FIG. 8 shows a histogram of sound intensity distribution in block F2 in FIG. 6, and FIG. 9 shows block F3 in FIG. The histogram of the sound intensity distribution in is shown. In each of FIGS. 7 to 9, the vertical axis represents frequency / total power, and the horizontal axis represents sound intensity (intensity). Here, the noise sample sound and the non-noise sample sound have the same length (time from the start time to the end time), and the length of the sound sample sound or the sound sample sound without noise is the same. A value obtained by dividing (for example, 1 hour) by a predetermined unit time (for example, 1 second) corresponds to “total frequency”. In addition, it is obtained by counting the number of times that the sound intensity is the same in the predetermined unit time from the start point to the end point of the abnormal sound sample or non-noise sample sound. The obtained value corresponds to “frequency”.

図６に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ１では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが大きい状態がほぼ連続している。このブロックＦ１における音の強度分布では、図７に示すように、度数／全度数が最大（ピーク）となる音の強さ、すなわち最大度数強度がＰ１であり、分布がＰ１に集中し、分布の範囲が狭い。 In the spectrogram of the abnormal sound sample shown in FIG. 6, in the block F1, the state where the sound intensity is high is continuous between the start time and the end time of the abnormal sound sample. In the sound intensity distribution in this block F1, as shown in FIG. 7, the sound intensity at which the frequency / total frequency is maximum (peak), that is, the maximum frequency intensity is P1, the distribution is concentrated on P1, and the distribution The range of is narrow.

また、図６に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ２では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが小さい状態がほぼ連続している。このブロックＦ２における音の強度分布では、図８に示すように、最大度数強度がＰ２であり、Ｐ２はＰ１よりも小さい値である。また、ブロックＦ２における音の強度分布では、分布がＰ２に集中し、分布の範囲が狭い。 Further, in the spectrogram of the abnormal sound sample shown in FIG. 6, in the block F2, the state in which the sound intensity is low is continuous from the start time to the end time of the abnormal sound sample. In the sound intensity distribution in the block F2, as shown in FIG. 8, the maximum frequency intensity is P2, and P2 is a value smaller than P1. Further, in the sound intensity distribution in the block F2, the distribution is concentrated on P2, and the distribution range is narrow.

また、図６に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ３では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが大きい状態と音の強さが小さい状態とが混在している。このブロックＦ３における音の強度分布では、図８に示すように、分布の範囲が比較的広い。 Further, in the spectrogram of the abnormal sound sample shown in FIG. 6, in block F3, the sound intensity is high and the sound intensity is low between the start time and the end time of the noise sample sound. Are mixed. In the sound intensity distribution in the block F3, as shown in FIG. 8, the range of the distribution is relatively wide.

このように、有異音サンプル音のスペクトログラムにおけるブロックごとの強度分布のヒストグラムによれば、有異音サンプル音におけるブロックごとの音の強さの時間的変化を表すことができる。したがって、有異音サンプル音、無異音サンプル音、後述の被判定音といった採取音のそれぞれにつき、スペクトログラムにおけるブロックごとの音の強度分布のヒストグラムを用いることにより、採取音および採取音に含まれる異音の周波数または音の強さの時間的変化を勘案しつつ、採取音中の異音の有無を高精度に識別可能な識別関数を生成することができる。 As described above, according to the histogram of the intensity distribution for each block in the spectrogram of the abnormal sound sample, the temporal change in the sound intensity for each block of the abnormal sound sample can be expressed. Therefore, by using the histogram of the sound intensity distribution for each block in the spectrogram for each sampled sound, such as abnormal sound sample, non-natural sound sample sound, to-be-determined sound described later, it is included in the sampled sound and sampled sound. It is possible to generate a discriminant function that can accurately identify the presence or absence of an abnormal sound in a sampled sound while taking into account temporal changes in the frequency or intensity of the abnormal sound.

（音識別条件設定支援装置の構成）
図１に示すように、音識別条件設定支援装置１は演算処理部１１を備え、演算処理部１１は、音声データからスペクトログラムデータを生成するスペクトログラム生成部１２と、スペクトログラムデータを複数のブロックに分割し、各ブロックにおける音の強度分布のヒストグラムデータを生成するヒストグラム生成部１３と、有異音サンプル音の各スペクトログラムデータの複数のブロックから候補ブロックを選択するブロック選択部１４と、有異音サンプル音のスペクトログラムデータにつき、対象周波数帯を選択する周波数帯選択部１５と、有異音サンプル音のスペクトログラムデータにつき、対象強度を選択する強度選択部１６と、対象周波数帯のブロックにおける音の強度分布のヒストグラムデータにおける対象強度の度数を用いて識別関数を生成する識別関数生成部１７と、識別関数の識別正解率を算定する正解率算定部１８と、識別正解率の最も高い識別関数、当該識別関数に対応するブロック数、当該識別関数に対応する対象周波数帯、および当該識別関数に対応する対象強度を選定する識別条件選定部１９と、上記ブロック数および選択数を設定または変更する処理、および音識別条件設定支援処理の流れの制御等を行う制御部２０とを備えている。さらに、音識別条件設定支援装置１は、確定識別関数、確定ブロック数、確定対象周波数帯、確定対象強度および音識別条件設定支援処理中に保持すべき種々のデータを記憶する記憶部２１と、有異音サンプル音または無異音サンプル音の音声データの入力、操作入力、処理の進捗状況等の表示等を行う入出力部２２とを備えている。 (Configuration of sound identification condition setting support device)
As shown in FIG. 1, the sound identification condition setting support device 1 includes an arithmetic processing unit 11. The arithmetic processing unit 11 divides the spectrogram data into a plurality of blocks, and a spectrogram generation unit 12 that generates spectrogram data from speech data. A histogram generation unit 13 that generates histogram data of sound intensity distribution in each block, a block selection unit 14 that selects a candidate block from a plurality of blocks of each spectrogram data of the abnormal sound sample sound, and an abnormal sound sample A frequency band selection unit 15 that selects a target frequency band for spectrogram data of sound, an intensity selection unit 16 that selects target intensity for spectrogram data of abnormal sound samples, and a sound intensity distribution in a block of the target frequency band Using the frequency of object intensity in the histogram data of A discriminant function generating unit 17 for generating a discriminant function, a correct rate calculating unit 18 for calculating a discriminant accuracy rate of the discriminant function, a discriminant function having the highest discriminant correct rate, the number of blocks corresponding to the discriminant function, and the discriminant function Identification condition selection unit 19 for selecting a corresponding target frequency band and a target intensity corresponding to the identification function, a process for setting or changing the number of blocks and the number of selections, a flow control of a sound identification condition setting support process, etc. The control part 20 which performs is provided. Furthermore, the sound identification condition setting support device 1 includes a storage unit 21 that stores a definite identification function, the number of definite blocks, a definite target frequency band, a deterministic target strength, and various data to be held during the sound identification condition setting support process; And an input / output unit 22 that performs input of voice data of abnormal sampled sound or non-sound sampled sound, operation input, display of progress of processing, and the like.

このような構成を有する音識別条件設定支援装置１は、例えばパーソナルコンピュータにより実現することができる。具体的に述べると、演算処理部１１は、パーソナルコンピュータに内蔵されたＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｇＵｎｉｔ）により実現することができる。また、スペクトログラム生成部１２、ヒストグラム生成部１３、ブロック選択部１４、周波数帯選択部１５、強度選択部１６、識別関数生成部１７、正解率算定部１８、識別条件選定部１９、および制御部２０は、パーソナルコンピュータを音識別条件設定支援装置１として機能させるためのコンピュータプログラムをＣＰＵに読み込ませて実行させることにより実現することができる。また、記憶部２１は、パーソナルコンピュータが有する記憶装置（例えばハードディスクドライブ装置、フラッシュメモリ等）により実現することができる。また、入出力部２２は、パーソナルコンピュータが有する入出力装置（例えばキーボード、マウス、モニタ等）により実現することができる。 The sound identification condition setting support device 1 having such a configuration can be realized by a personal computer, for example. More specifically, the arithmetic processing unit 11 can be realized by a CPU (Central Processing Unit) built in a personal computer. Further, the spectrogram generation unit 12, the histogram generation unit 13, the block selection unit 14, the frequency band selection unit 15, the intensity selection unit 16, the discrimination function generation unit 17, the accuracy rate calculation unit 18, the discrimination condition selection unit 19, and the control unit 20 Can be realized by causing a CPU to read and execute a computer program for causing a personal computer to function as the sound identification condition setting support device 1. The storage unit 21 can be realized by a storage device (for example, a hard disk drive device, a flash memory, etc.) included in the personal computer. The input / output unit 22 can be realized by an input / output device (for example, a keyboard, a mouse, a monitor, etc.) included in the personal computer.

なお、ヒストグラム生成部１３が、設定用有異音ヒストグラム生成部、生成用ヒストグラム生成部、および検証用ヒストグラム生成部の具体例である。 The histogram generation unit 13 is a specific example of a setting abnormal sound histogram generation unit, a generation histogram generation unit, and a verification histogram generation unit.

（音識別条件設定支援処理）
図２ないし図４は、音識別条件設定支援装置１により行われる音識別条件設定支援処理の具体的な流れを示している。これより、図２ないし図４等を参照しつつ、先に概説した音識別条件設定支援処理につき、より具体的に説明する。 (Sound identification condition setting support processing)
2 to 4 show a specific flow of the sound identification condition setting support process performed by the sound identification condition setting support device 1. The sound identification condition setting support process outlined above will be described more specifically with reference to FIGS. 2 to 4 and the like.

まず、作業者は、処理の準備として、例えば、１００個の有異音サンプル音Ａ１〜Ａ１００の音声データと、１００個の無異音サンプル音Ｂ１〜Ｂ１００の音声データを用意する。 First, as a preparation for processing, for example, the worker prepares audio data of 100 abnormal sound samples A1 to A100 and audio data of 100 non-natural sound samples B1 to B100.

続いて、作業者は、音識別条件設定支援装置１の入出力部２２を操作し、有異音サンプル音Ａ１〜Ａ１００の音声データおよび無異音サンプル音Ｂ１〜Ｂ１００の音声データを音識別条件設定支援装置１に入力する。なお、これら音声データを音識別条件設定支援装置１に入力する際には、音識別条件設定支援装置１が、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００の音声データを１つ１つ識別可能となるようにすると共に、音声データが有異音サンプル音の音声データであるのか無異音サンプル音の音声データであるのかを識別可能となるようにする。これを実現する方法として、例えば、各音声データに、各音声データを識別するための識別情報、および各音声データが有異音サンプル音の音声データであるのか無異音サンプル音の音声データであるのかを示す分類情報を付す方法がある。この方法は例えば音声データのファイル名の付け方を工夫することで容易に実現することができる。 Subsequently, the operator operates the input / output unit 22 of the sound identification condition setting support device 1 to obtain the sound data of the abnormal sound samples A1 to A100 and the sound data of the non-natural sound samples B1 to B100 as sound identification conditions. Input to the setting support apparatus 1. When these sound data are input to the sound identification condition setting support device 1, the sound identification condition setting support device 1 receives the sound data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100. In addition to being able to identify each one, it is possible to identify whether the sound data is the sound data of an abnormal sound sample or the sound data of an allophone sample sound. As a method for realizing this, for example, each voice data is identified with identification information for identifying each voice data, and whether each voice data is voice data of abnormal sample sound or voice data of non-sound sample sound. There is a method of attaching classification information indicating whether or not there is. This method can be easily realized, for example, by devising a method for naming the file name of the audio data.

続いて、作業者は、音識別条件設定支援処理を開始する旨の指示を音識別条件設定支援装置１に入力する。これに応じ、音識別条件設定支援装置１は、図２ないし図４に示す音識別条件設定支援処理を開始する。 Subsequently, the worker inputs an instruction to start the sound identification condition setting support process to the sound identification condition setting support device 1. In response to this, the sound identification condition setting support device 1 starts the sound identification condition setting support process shown in FIGS.

音識別条件設定支援処理において、図２に示すように、まず、スペクトログラム生成部１２が、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００の音声データのそれぞれについてスペクトログラムデータを生成し、生成した各スペクトログラムデータを記憶部２１に記憶する（ステップＳ１）。 In the sound identification condition setting support process, as shown in FIG. 2, first, the spectrogram generation unit 12 generates spectrogram data for each of the sound data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100. Then, the generated spectrogram data is stored in the storage unit 21 (step S1).

続いて、音識別条件設定支援装置１は対象周波数帯を選択する処理を行う（ステップＳ２〜Ｓ６）。すなわち、まず、制御部２０が、スペクトログラムデータの上記所定の周波数範囲を複数のブロックに分割する際のブロック数を、所定の規則に従って設定する（ステップＳ２）。例えば、本実施形態では、上記所定の規則として、ブロック数の最小値を２０とし、最大値を８０とし、初回に設定するブロック数を上記最小値とし、ブロック数の設定の変更を１回行うごとにブロック数を５増加させるといった規則を採用する。 Subsequently, the sound identification condition setting support device 1 performs a process of selecting a target frequency band (steps S2 to S6). That is, first, the control unit 20 sets the number of blocks when the predetermined frequency range of the spectrogram data is divided into a plurality of blocks according to a predetermined rule (step S2). For example, in the present embodiment, as the predetermined rule, the minimum number of blocks is set to 20, the maximum value is set to 80, the number of blocks set for the first time is set to the minimum value, and the setting of the number of blocks is changed once. A rule of increasing the number of blocks by 5 every time is adopted.

ここで、図１０はブロック数を２０に設定し、採取音のスペクトログラムの所定の周波数範囲をブロックＧに分割した状態を示し、図１１はブロック数を３０に設定し、採取音のスペクトログラムの所定の周波数範囲をブロックＨに分割した状態を示している。図１０および図１１を見るとわかる通り、ブロック数が多くなると、各ブロックの幅（周波数帯の範囲）が小さくなる。 Here, FIG. 10 shows a state in which the number of blocks is set to 20 and a predetermined frequency range of the spectrogram of the collected sound is divided into blocks G, and FIG. 11 shows a state in which the number of blocks is set to 30 and the spectrogram of the collected sound is predetermined. The frequency range is divided into blocks H. As can be seen from FIGS. 10 and 11, as the number of blocks increases, the width (frequency band range) of each block decreases.

続いて、図２に示すように、制御部２０が、各有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータを記憶部２１から読み出す。そして、ヒストグラム生成部１３が、読み出された各有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数が示す個数のブロックに分割し、分割した各ブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部２１に記憶する（ステップＳ３）。これにより、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのそれぞれにつき、強度分布のヒストグラムデータがブロックごとに生成され、記憶される。 Subsequently, as shown in FIG. 2, the control unit 20 reads spectrogram data of each of the abnormal sound samples A1 to A100 from the storage unit 21. Then, the histogram generation unit 13 divides the predetermined frequency range of the spectrogram data of each of the abnormal sound samples A1 to A100 that has been read out into the number of blocks indicated by the number of blocks set in step S2. The generated histogram data of the intensity distribution of each block is generated, and the generated histogram data is stored in the storage unit 21 (step S3). Thereby, histogram data of intensity distribution is generated and stored for each block for the spectrogram data of the abnormal sound samples A1 to A100.

続いて、ブロック選択部１４が、各有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの各ヒストグラムデータを記憶部２１から読み出し、読み出された各スペクトログラムデータにおいて、最も周波数帯の低いブロックを除くすべてのブロックにつき、互いに隣り合う２つのブロックのヒストグラムデータの最大度数強度を比較し、周波数帯が高い方のブロックのヒストグラムデータの最大度数強度が、周波数帯が低い方のブロックのヒストグラムデータの最大度数強度よりも大きい場合には、周波数帯が高い方のブロックを候補ブロックとして選択する（ステップＳ４）。これにより、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのそれぞれにつき、候補ブロックが選択される。 Subsequently, the block selection unit 14 reads out the histogram data of the spectrogram data of each abnormal sound sample A1 to A100 from the storage unit 21, and in each read spectrogram data, all except the block having the lowest frequency band. For each block, the maximum frequency strength of the histogram data of two adjacent blocks is compared, and the maximum frequency strength of the histogram data of the block with the higher frequency band is the maximum frequency strength of the histogram data of the block with the lower frequency band If it is greater than the intensity, the block with the higher frequency band is selected as a candidate block (step S4). Thereby, a candidate block is selected for each of the spectrogram data of the abnormal sound samples A1 to A100.

ここで、図１２は、有異音サンプル音Ａ１のスペクトログラムデータの各ブロックにつき、その周波数帯と、ヒストグラムデータの最大度数強度と、候補ブロックに選択された旨のフラグを示している。説明の便宜のために、各ブロックに＃１〜＃ｎのブロック番号を付している。図１２において、スペクトログラムデータの所定の周波数範囲はαｋＨｚ〜βｋＨｚであり、ブロック数はｎである。例えば、ブロック＃２の最大度数強度は１×１０^-12であり、ブロック＃２よりも周波数帯が低い隣のブロック＃１の最大度数強度１×１０^-13よりも大きい。したがって、ブロック＃２は候補ブロックとして選択されている。同様に、ブロック＃３、＃６，＃７も候補ブロックとして選択されている。 Here, FIG. 12 shows the frequency band, the maximum frequency intensity of the histogram data, and the flag indicating that the candidate block is selected for each block of the spectrogram data of the abnormal sound sample A1. For convenience of explanation, block numbers # 1 to #n are assigned to the respective blocks. In FIG. 12, the predetermined frequency range of the spectrogram data is α kHz to β kHz, and the number of blocks is n. For example, the maximum power intensity of the block # 2 is 1 × 10 ^−12, which is higher than the maximum power intensity 1 × 10 ^{−13 of the} adjacent block # 1 whose frequency band is lower than that of the block # 2. Therefore, block # 2 is selected as a candidate block. Similarly, blocks # 3, # 6, and # 7 are also selected as candidate blocks.

続いて、図２に示すように、制御部２０が、所定の規則に従って、周波数帯の選択に関する選択数ｃを設定する（ステップＳ５）。例えば、本実施形態では、上記所定の規則として、選択数ｃの最小値を３とし、最大値を１５とし、初回に設定する選択数ｃを上記最小値とし、選択数ｃの設定の変更を１回行うごとに選択数ｃを１増加させるといった規則を採用する。 Subsequently, as shown in FIG. 2, the control unit 20 sets the selection number c relating to the selection of the frequency band according to a predetermined rule (step S5). For example, in the present embodiment, as the predetermined rule, the minimum value of the selection number c is 3, the maximum value is 15, the selection number c set for the first time is the minimum value, and the setting of the selection number c is changed. A rule is adopted in which the number of selections c is increased by 1 each time it is performed.

続いて、周波数帯選択部１５が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータにおいて、同一周波数帯の候補ブロックの個数をそれぞれカウントし、カウント値の大きい順に、設定された選択数ｃが示す個数の周波数帯を特定し、これら特定した周波数帯およびそれよりも低い隣の周波数帯を対象周波数帯として選択する（ステップＳ６）。 Subsequently, the frequency band selection unit 15 counts the number of candidate blocks in the same frequency band in the spectrogram data of the abnormal sound samples A1 to A100, and the set selection number c indicates in descending order of the count value. The number of frequency bands are specified, and the specified frequency band and the adjacent frequency band lower than the specified frequency band are selected as target frequency bands (step S6).

ここで、図１３は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータにおける候補ブロックの選択状況と、同一周波数帯の候補ブロックのカウント値を示している。また、図１２と同様に、説明の便宜のために、各ブロックに＃１〜＃ｎのブロック番号を付している。図１３において、ブロック＃３の周波数帯は、カウント値が３０であり、カウント値が最も大きいので、対象周波数帯として選択されている。また、ブロック＃３よりも周波数帯が低い隣のブロック＃２の周波数帯も対象周波数帯として選択されている。また、ブロック＃７の周波数帯は、カウント値が２８であり、カウント値が２番目に大きいので、対象周波数帯として選択されている。また、ブロック＃７よりも周波数帯が低い隣のブロック＃６の周波数帯も対象周波数帯として選択されている。 Here, FIG. 13 shows the selection status of candidate blocks in the spectrogram data of abnormal sound samples A1 to A100 and the count values of candidate blocks in the same frequency band. Similarly to FIG. 12, for convenience of explanation, block numbers # 1 to #n are assigned to the respective blocks. In FIG. 13, the frequency band of block # 3 is selected as the target frequency band because the count value is 30 and the count value is the largest. Further, the frequency band of the adjacent block # 2 whose frequency band is lower than that of the block # 3 is also selected as the target frequency band. The frequency band of block # 7 is selected as the target frequency band because the count value is 28 and the count value is the second largest. Further, the frequency band of the adjacent block # 6 whose frequency band is lower than that of the block # 7 is also selected as the target frequency band.

続いて、音識別条件設定支援装置１は対象強度を選択する処理を行う（図２中のステップＳ７〜Ｓ９）。すなわち、まず、制御部２０が、所定の規則に従って、強度の選択に関する選択数ｄを設定する（ステップＳ７）。例えば、本実施形態では、上記所定の規則として、選択数ｄの最小値を３とし、最大値を８とし、初回に設定する選択数ｄを上記最小値とし、選択数ｄの設定の変更を１回行うごとに選択数ｄを１増加させるといった規則を採用する。 Subsequently, the sound identification condition setting support device 1 performs a process of selecting a target intensity (steps S7 to S9 in FIG. 2). That is, first, the control unit 20 sets the selection number d related to the selection of intensity according to a predetermined rule (step S7). For example, in the present embodiment, as the predetermined rule, the minimum value of the selection number d is set to 3, the maximum value is set to 8, the selection number d set for the first time is set to the minimum value, and the setting of the selection number d is changed. A rule is adopted in which the number of selections d is increased by 1 each time it is performed.

続いて、制御部２０が、各有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータを記憶部２１から読み出す。そして、強度選択部１６が、読み出された各スペクトログラムデータにおけるヒストグラムデータのうち、ステップＳ６で選択された対象周波数帯のブロックのヒストグラムデータを特定し、当該ヒストグラムデータにおけるそれぞれの音の強度の度数に順位を付ける（ステップＳ８）。すなわち、１つのヒストグラムデータ内において、最大度数強度が第１位、次に度数の大きい強度が第２位といった順位付けを、対象周波数帯のそれぞれのヒストグラムデータについて行う。 Subsequently, the control unit 20 reads spectrogram data of each of the unusual sound samples A1 to A100 from the storage unit 21. Then, the intensity selection unit 16 specifies the histogram data of the block of the target frequency band selected in step S6 from the histogram data in each read spectrogram data, and the frequency of the intensity of each sound in the histogram data Is ranked (step S8). That is, in one histogram data, ranking is performed for each histogram data of the target frequency band such that the highest frequency intensity is first and the next highest intensity is second.

続いて、強度選択部１６は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータにおいて、同一の対象周波数帯のヒストグラムデータにおける音の強度の度数の順位の平均（平均順位）を算定し、平均順位の高い順に、ステップＳ７で設定した選択数ｄが示す個数の強度を特定し、これら特定した強度を対象強度として選択する（ステップＳ９）。複数の対象周波数帯が選択されている場合には、ステップＳ９における平均順位の算定と対象強度の選択を対象周波数帯ごとに行う。 Subsequently, the intensity selection unit 16 calculates the average rank (average rank) of the frequency of sound intensity in the histogram data of the same target frequency band in the spectrogram data of the abnormal sound samples A1 to A100. In the descending order, the number of intensities indicated by the selection number d set in step S7 is specified, and these specified intensities are selected as target intensities (step S9). When a plurality of target frequency bands are selected, the calculation of the average rank and the selection of the target strength in step S9 are performed for each target frequency band.

ここで、図１４は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうちの１つのヒストグラムデータのそれぞれの強度と、これらの強度にそれぞれ付された順位を示している。図１４において、説明の便宜のために、各強度に＃１〜＃ｍの強度番号を付している。図１４によれば、＃７の強度が第１位であり、＃６の強度が第２位である。なお、図１４を見るとわかる通り、強度選択部１６が選択する「強度」は、厳密には一定の幅を持った強度範囲である。例えば、＃１の強度は、１×１０^-20〜１×１０^-19といった幅を有する。 Here, FIG. 14 shows the respective intensities of one histogram data among the spectrogram data of the abnormal sound samples A1 to A100, and the rankings assigned to these intensities. In FIG. 14, for the convenience of explanation, the strength numbers # 1 to #m are assigned to the strengths. According to FIG. 14, the intensity of # 7 is first and the intensity of # 6 is second. As can be seen from FIG. 14, the “intensity” selected by the intensity selection unit 16 is strictly an intensity range having a certain width. For example, the intensity of # 1 has a width of 1 × 10 ⁻²⁰ to 1 × 10 ⁻¹⁹ .

また、図１５は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータにおいて、同一の対象周波数帯のブロックのヒストグラムデータにおける音の強度の度数の平均順位を示している。図１５において、＃７の強度の平均順位が１．３であり、＃６の強度の平均順位が２．２である。 FIG. 15 shows the average rank of the sound intensity frequencies in the histogram data of the blocks of the same target frequency band in the spectrogram data of the abnormal sound samples A1 to A100. In FIG. 15, the average rank of intensity of # 7 is 1.3, and the average rank of intensity of # 6 is 2.2.

続いて、音識別条件設定支援装置１は識別関数を生成する処理を行う（ステップＳ１０〜Ｓ１２）。すなわち、まず、図２に示すように、制御部２０が、有異音サンプル音のスペクトログラムデータをランダムに５０個選び、選んだ各スペクトログラムデータにおいて対象周波数帯のブロックのヒストグラムデータを記憶部２１から読み出す（ステップＳ１０）。続いて、制御部２０は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中からどの有異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部２１に記憶する。この履歴情報は、制御部２０がこの後にステップＳ１３において未選択のスペクトログラムデータを特定するために用いられる。 Subsequently, the sound identification condition setting support device 1 performs a process of generating an identification function (steps S10 to S12). That is, first, as shown in FIG. 2, the control unit 20 randomly selects 50 spectrogram data of sampled abnormal sounds, and the histogram data of the block of the target frequency band in each selected spectrogram data from the storage unit 21. Read (step S10). Subsequently, the control unit 20 stores, in the storage unit 21, history information indicating which abnormal sound sample data is selected from the spectrogram data of the abnormal sound samples A1 to A100. This history information is used by the control unit 20 to specify spectrogram data that has not been selected in step S13.

続いて、図３に示すように、制御部２０が、無異音サンプル音のスペクトログラムデータをランダムに５０個選び、選んだ各スペクトログラムデータを記憶部２１から読み出す。そして、ヒストグラム生成部１３が、読み出された各スペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数が示す個数のブロックに分割し、分割したブロックのうち、対象周波数帯のブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部２１に記憶する（ステップＳ１１）。これにより、ランダムに選ばれた５０個の無異音サンプル音のスペクトログラムデータのそれぞれにつき、対象周波数帯のブロックの強度分布のヒストグラムデータが生成され、記憶される。続いて、制御部２０は、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中からどの無異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部２１に記憶する。この履歴情報は、制御部２０がこの後にステップＳ１７において未選択のスペクトログラムデータを特定するために用いられる。 Subsequently, as shown in FIG. 3, the control unit 20 randomly selects 50 spectrogram data of the allophone sample sound, and reads each selected spectrogram data from the storage unit 21. Then, the histogram generation unit 13 divides the predetermined frequency range of each read spectrogram data into the number of blocks indicated by the number of blocks set in step S2, and among the divided blocks, the target frequency band Histogram data of the intensity distribution of the block is generated, and each generated histogram data is stored in the storage unit 21 (step S11). As a result, histogram data of the intensity distribution of the block in the target frequency band is generated and stored for each of the spectrogram data of 50 randomly selected sample sounds. Subsequently, the control unit 20 stores, in the storage unit 21, history information indicating which of the non-sound sample sounds the spectrogram data has been selected from the spectrogram data of the non-sound sample sounds B 1 to B 100. This history information is used by the control unit 20 to specify spectrogram data that has not been selected in step S17.

続いて、識別関数生成部１７が、ステップＳ１０で読み出された５０個の有異音サンプル音のスペクトログラムデータにおける対象周波数帯のヒストグラムデータと、ステップＳ１１で生成された５０個の無異音サンプル音のスペクトログラムデータにおける対象周波数帯のヒストグラムデータとを用い、識別関数を生成する。このとき、識別関数生成部１７は、各ヒストグラムデータにおける対象強度の度数のみを用いて識別関数を生成する（ステップＳ１２）。このように、ステップＳ１２では、有異音サンプル音Ａ１〜１００のスペクトログラムデータのうちランダムに選ばれた５０個のスペクトログラムデータと、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうちランダムに選ばれた５０個のスペクトログラムデータとにより識別関数が生成され、このとき、識別関数の生成に用いられるヒストグラムデータは、対象周波数帯のブロックのヒストグラムデータにおける対象強度の度数のみである。 Subsequently, the discriminant function generation unit 17 performs histogram data of the target frequency band in the spectrogram data of the 50 abnormal sound samples read out in step S10 and the 50 non-noise samples generated in step S11. Using the histogram data of the target frequency band in the sound spectrogram data, a discrimination function is generated. At this time, the discriminant function generation unit 17 generates the discriminant function using only the frequency of the target intensity in each histogram data (step S12). As described above, in step S12, 50 spectrogram data randomly selected from the spectrogram data of the abnormal sound samples A1 to 100 and the spectrogram data of the non-sound sample sounds B1 to B100 are selected at random. The discriminant function is generated from the 50 spectrogram data. At this time, the histogram data used for generating the discriminant function is only the frequency of the target intensity in the histogram data of the block in the target frequency band.

また、識別関数の生成には、例えばサポートベクターマシン等の学習モデルを用いたパターン認識手法（例えば特開２００５−３５２９９７号公報参照）を採用する。なお、ニューラルネットワーク等の他の学習モデルを識別関数の生成に採用してもよい。 In addition, for generating the discriminant function, for example, a pattern recognition method using a learning model such as a support vector machine (see, for example, JP-A-2005-352997) is adopted. Note that another learning model such as a neural network may be employed for generating the discrimination function.

続いて、識別関数生成部１７は、生成した識別関数と、当該識別関数に対応するブロック数（例えば、当該識別関数の生成時に設定されているブロック数を示す数値等の情報）と、当該識別関数に対応する選択数ｃ（例えば、当該識別関数の生成時に設定されている選択数ｃを示す数値等の情報）と、当該識別関数に対応する選択数ｄ（例えば、当該識別関数の生成時に設定されている選択数ｄを示す数値等の情報）と、当該識別関数に対応する対象周波数帯（例えば、当該識別関数の生成時に選択されている対象周波数帯を示す情報）と、当該識別関数に対応する対象強度（例えば、当該識別関数の生成時に選択されている対象強度を示す情報）とを記憶部２１に記憶する。 Subsequently, the identification function generation unit 17 generates the generated identification function, the number of blocks corresponding to the identification function (for example, information such as a numerical value indicating the number of blocks set when the identification function is generated), and the identification The selection number c corresponding to the function (for example, information such as a numerical value indicating the selection number c set when the identification function is generated) and the selection number d corresponding to the identification function (for example, when the identification function is generated) Information such as a numerical value indicating the set selection number d), a target frequency band corresponding to the identification function (for example, information indicating a target frequency band selected when the identification function is generated), and the identification function Is stored in the storage unit 21 (for example, information indicating the target intensity selected when the identification function is generated).

続いて、音識別条件設定支援装置１は識別関数の識別正解率を算定する処理を行う（ステップＳ１３〜Ｓ２２）。すなわち、図３に示すように、まず、制御部２０が、未選択の有異音サンプル音のスペクトログラムデータ、すなわち、ステップＳ１０で選択されていない有異音サンプル音のスペクトログラムデータをランダムに１つ選び、選んだスペクトログラムデータにおいて、対象周波数帯のブロックのヒストグラムデータを記憶部２１から読み出す（ステップＳ１３）。 Subsequently, the sound identification condition setting support device 1 performs processing for calculating the identification accuracy rate of the identification function (steps S13 to S22). That is, as shown in FIG. 3, first, the control unit 20 randomly selects spectrogram data of unselected abnormal sound samples, that is, spectrogram data of abnormal sound samples not selected in step S10. In the selected spectrogram data, the histogram data of the block in the target frequency band is read from the storage unit 21 (step S13).

続いて、正解率算定部１８が、ステップＳ１３で読み出された対象周波数帯のヒストグラムデータにおける対象強度の度数を、ステップＳ１２で生成された識別関数に入力し、当該識別関数により有異音サンプル音中の異音の有無の識別を実行する（ステップＳ１４）。 Subsequently, the accuracy rate calculation unit 18 inputs the frequency of the target intensity in the histogram data of the target frequency band read out in step S13 to the discrimination function generated in step S12, and the abnormal sound sample is input by the discrimination function. The presence / absence of abnormal noise in the sound is identified (step S14).

続いて、正解率算定部１８は、当該識別関数による識別の正否を判定する（ステップＳ１５）。ステップＳ１５の判定では、当該識別関数による識別の結果が「異音あり」である場合に正解であり、そうでない場合には不正解である。続いて、正解率算定部１８は、当該判定結果（例えば判定結果を示すフラグ等の情報）を判定結果情報として記憶部２１に記憶する。 Subsequently, the correct rate calculation unit 18 determines whether or not the identification function is correct (step S15). In the determination in step S15, the answer is correct when the result of the identification by the identification function is “abnormal noise”, and the answer is incorrect otherwise. Subsequently, the correct rate calculation unit 18 stores the determination result (for example, information such as a flag indicating the determination result) in the storage unit 21 as determination result information.

続いて、制御部２０が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち、未選択の有異音サンプル音のスペクトログラムデータ、すなわち、ステップＳ１０でもステップＳ１３でも選択されていない有異音サンプル音のスペクトログラムデータがあるか否かを判断する（ステップＳ１６）。そして、未選択の有異音サンプル音のスペクトログラムデータがある場合には（ステップＳ１６：ＹＥＳ）、制御部２０は処理をステップＳ１３に戻す。一方、未選択の有異音サンプル音のスペクトログラムデータがない場合には（ステップＳ１６：ＮＯ）、制御部２０は処理をステップＳ１７へ移行させる。このように、ステップＳ１３〜Ｓ１６により、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータうち、識別関数の生成に用いられなかった有異音サンプル音のスペクトログラムデータを用い、識別関数による識別の正否が判定される。このとき、判定に用いられるヒストグラムデータは対象周波数帯のヒストグラムデータにおける対象強度の度数のみである。 Subsequently, the control unit 20 uses the spectrogram data of the unselected phantom sample sound among the spectrogram data of the phantom sample sounds A1 to A100, that is, the phantom sample that has not been selected in step S10 or step S13. It is determined whether or not there is sound spectrogram data (step S16). Then, when there is spectrogram data of the unselected abnormal sound sample (step S16: YES), the control unit 20 returns the process to step S13. On the other hand, when there is no spectrogram data of the unselected abnormal sound sample (step S16: NO), the control unit 20 shifts the process to step S17. As described above, by steps S13 to S16, among the spectrogram data of the abnormal sound samples A1 to A100, the spectrogram data of the abnormal sound samples that were not used for generating the discriminant function is used, and whether or not the discrimination by the discriminant function is correct. Determined. At this time, the histogram data used for the determination is only the frequency of the target intensity in the histogram data of the target frequency band.

続いて、制御部２０が、未選択の無異音サンプル音のスペクトログラムデータ、すなわち、ステップＳ１１で選択されていない無異音サンプル音のスペクトログラムデータをランダムに１つ選び、選んだスペクトログラムデータを記憶部２１から読み出す。そして、ヒストグラム生成部１３が、読み出されたスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数が示す個数のブロックに分割し、分割したブロックのうち、対象周波数帯のブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部２１に記憶する（ステップＳ１７）。 Subsequently, the control unit 20 randomly selects one of the spectrogram data of the unselected sound sample sound, that is, the spectrogram data of the sound sample sound not selected in step S11, and stores the selected spectrogram data. Read from the unit 21. Then, the histogram generation unit 13 divides the predetermined frequency range of the read spectrogram data into the number of blocks indicated by the number of blocks set in step S2, and among the divided blocks, blocks in the target frequency band The histogram data of the intensity distribution is generated, and each generated histogram data is stored in the storage unit 21 (step S17).

続いて、正解率算定部１８が、ステップＳ１７で生成された対象周波数帯のブロックのヒストグラムデータにおける対象強度の度数を、ステップＳ１２で生成された識別関数に入力し、当該識別関数により無異音サンプル音中の異音の有無の識別を実行する（ステップＳ１８）。 Subsequently, the correct answer rate calculation unit 18 inputs the frequency of the target intensity in the histogram data of the block of the target frequency band generated in step S17 to the discrimination function generated in step S12, and the allophone is generated by the discrimination function. The presence / absence of abnormal noise in the sample sound is identified (step S18).

続いて、正解率算定部１８は、当該識別関数による識別の正否を判定する（ステップＳ１９）。ステップＳ１６の判定では、当該識別関数による識別の結果が「異音なし」である場合に正解であり、そうでない場合には不正解である。続いて、正解率算定部１８は、当該判定結果を判定結果情報として記憶部２１に記憶する。 Subsequently, the correct answer rate calculation unit 18 determines whether the identification function is correct or not (step S19). In the determination in step S 16, the answer is correct when the result of the identification by the identification function is “no abnormal sound”, otherwise it is incorrect. Subsequently, the correct answer rate calculation unit 18 stores the determination result in the storage unit 21 as determination result information.

続いて、制御部２０が、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち、未選択の無異音サンプル音のスペクトログラムデータ、すなわちステップＳ１１でもステップＳ１７でも選択されていない無異音サンプル音のスペクトログラムデータがあるか否かを判断する（ステップＳ２０）。そして、未選択の無異音サンプル音のスペクトログラムデータがある場合には（ステップＳ２０：ＹＥＳ）、制御部２０は処理をステップＳ１７に戻す。一方、未選択の無異音サンプル音のスペクトログラムデータがない場合には（ステップＳ２０：ＮＯ）、制御部２０は処理をステップＳ２１へ移行させる。このように、ステップＳ１７〜Ｓ２０により、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータうち、識別関数の生成に用いられなかった無異音サンプル音のスペクトログラムデータを用い、識別関数による識別の正否が判定される。このとき、判定に用いられるヒストグラムデータは対象周波数帯のヒストグラムデータにおける対象強度の度数のみである。 Subsequently, the control unit 20 sets the spectrogram data of the unselected non-sound sample sound among the spectrogram data of the non-sound sample sounds B1 to B100, that is, the non-sound sample sound that is not selected in step S11 or step S17. It is determined whether there is any spectrogram data (step S20). If there is spectrogram data of an unselected sound sample sound that has not been selected (step S20: YES), the control unit 20 returns the process to step S17. On the other hand, when there is no spectrogram data of the unselected sound sample sound that has not been selected (step S20: NO), the control unit 20 shifts the process to step S21. As described above, by steps S17 to S20, the spectrogram data of the non-sound sample sound that was not used for the generation of the discrimination function among the spectrogram data of the non-sound sample sounds B1 to B100 is used to determine whether the discrimination function is correct or not. Determined. At this time, the histogram data used for the determination is only the frequency of the target intensity in the histogram data of the target frequency band.

続いて、図４中のステップＳ２１において、制御部２０が、識別関数の生成および識別関数による識別の正否判定（ステップＳ１０〜Ｓ２０）を所定回数、例えば２００回繰り返したか否かを判断する。識別関数の生成および識別関数による識別の正否判定を２００回繰り返していない場合には（ステップＳ２１：ＮＯ）、制御部２０は処理をステップＳ１０に戻し、スペクトログラムデータのランダム選択を初めからやり直すために、スペクトログラムデータの選択の履歴を示す上記履歴情報をすべて消去（初期化）する。一方、識別関数の生成および識別関数による識別の正否判定を２００回繰り返した場合には（ステップＳ２１：ＹＥＳ）、制御部２０は処理をステップＳ２２に移行させる。 Subsequently, in step S21 in FIG. 4, the control unit 20 determines whether or not the discrimination function generation and the discrimination correctness determination by the discrimination function (steps S10 to S20) have been repeated a predetermined number of times, for example, 200 times. If the generation of the discriminant function and the determination of correctness of discrimination by the discriminant function have not been repeated 200 times (step S21: NO), the control unit 20 returns the process to step S10 to redo the random selection of spectrogram data from the beginning. All the history information indicating the history of selection of spectrogram data is erased (initialized). On the other hand, when the generation of the discriminant function and the determination of correctness of discrimination by the discriminant function are repeated 200 times (step S21: YES), the control unit 20 shifts the process to step S22.

以上のステップＳ１０〜Ｓ２１の処理により、識別関数の生成に用いられる５０個の有異音サンプル音の組合せ、および識別関数の生成に用いられる５０個の無異音サンプル音の組合せがそれぞれ２００通り生成され、これら２００通りの組合せを用いて２００個の識別関数が生成されたことになる。さらに、ステップＳ１０〜Ｓ２１の処理により、識別関数による識別の正否判定に用いられる５０個の有異音サンプル音の組合せ、および識別関数による識別の正否判定に用いられる５０個の無異音サンプル音の組合せがそれぞれ２００通り生成され、これら２００通りの組合せを用い、各識別関数につき識別の正否判定が行われたことになる。ステップＳ１０〜Ｓ２１を２００回実行し終えたとき、記憶部２１には、２００００個の判定結果情報が記憶されている。このように、識別関数の生成および識別関数による識別の正否判定を繰り返し行い、多数の判定結果情報を得ることで、後にステップＳ２２で算定される識別正解率の信頼性を高めることができる。 Through the processes in steps S10 to S21 described above, there are 200 combinations of 50 different sample sounds used for generating the discrimination function and 50 combinations of 50 non-sampled sounds used for generating the discrimination function. Thus, 200 discriminant functions are generated using these 200 combinations. Further, by the processing of steps S10 to S21, a combination of 50 different sampled sounds used for determination of correctness of discrimination by the discriminant function and 50 non-sound sample sounds used for determination of correctness of discrimination by the discriminant function. 200 combinations are generated. Using these 200 combinations, it is determined whether each identification function is correct or not. When Steps S10 to S21 have been executed 200 times, the storage unit 21 stores 20000 pieces of determination result information. In this way, by repeatedly generating the discrimination function and determining whether the discrimination function is correct or not, and obtaining a large number of pieces of determination result information, the reliability of the discrimination correct answer rate calculated later in step S22 can be improved.

続いて、ステップＳ２２において、正解率算定部１８が、ステップＳ１０〜Ｓ２１において得られた２００００個の判定結果情報を用い、１つのブロック数、１つの選択数ｃおよび１つの選択数ｄが設定されている現時点の状態の下で生成された識別関数につき、識別正解率を算定し、算定した識別正解率を記憶部２１に記憶する。 Subsequently, in step S22, the correct answer rate calculation unit 18 uses the 20000 pieces of determination result information obtained in steps S10 to S21 to set one block number, one selection number c, and one selection number d. The discrimination accuracy rate is calculated for the discrimination function generated under the current state, and the calculated discrimination accuracy rate is stored in the storage unit 21.

ここで、図１６は、ステップＳ２２における識別正解率の算定方法の一例を示している。図１６において、「○」は、ステップＳ１５またはＳ１９における判定結果が正解であったことを示し、「×」は、ステップＳ１５またはＳ１９における判定結果が不正解であったことを示す。「−」は、そのサンプル音のスペクトログラムデータが識別関数の生成に用いられ、ステップＳ１５またはＳ１９における識別関数による識別の正否判定には用いられなかったことを示す。 Here, FIG. 16 shows an example of the method of calculating the identification correct rate in step S22. In FIG. 16, “◯” indicates that the determination result in step S15 or S19 is correct, and “X” indicates that the determination result in step S15 or S19 is incorrect. “-” Indicates that the spectrogram data of the sample sound is used for generating the discriminant function, and is not used for the discrimination correctness judgment by the discriminant function in step S15 or S19.

図１６に示すように、ステップＳ２２における識別正解率の算定処理において、まず、サンプル音Ａ１〜Ａ１００およびＢ１〜Ｂ１００の各スペクトログラムデータについて識別正解率を算定する。１つのスペクトログラムデータについての識別正解率は、当該スペクトログラムデータのヒストグラムデータを識別関数に入力して得られた識別結果において、正解の個数と不正解の個数とを合わせた値で、正解の個数を割ることにより求めることができる。次に、サンプル音Ａ１〜Ａ１００およびＢ１〜Ｂ１００のスペクトログラムデータの識別正解率の平均値を算定し、この平均値を、１つのブロック数、１つの選択数ｃおよび１つの選択数ｄが設定されている現時点の状態の下で生成された識別関数の識別正解率として用いる。 As shown in FIG. 16, in the discrimination correct rate calculation process in step S22, first, the discrimination correct rate is calculated for the spectrogram data of the sample sounds A1 to A100 and B1 to B100. The classification accuracy rate for one spectrogram data is the sum of the number of correct answers and the number of incorrect answers in the identification result obtained by inputting the histogram data of the spectrogram data into the identification function. It can be determined by dividing. Next, an average value of the identification accuracy rates of the spectrogram data of the sample sounds A1 to A100 and B1 to B100 is calculated, and one block number, one selection number c, and one selection number d are set as the average value. It is used as the discrimination accuracy rate of the discriminant function generated under the current state.

続いて、図４に示すように、制御部２０が、選択数ｄのすべての設定についてステップＳ７〜Ｓ２２の処理の実行を終えたか否かを判断する（ステップＳ２３）。選択数ｄのすべての設定についてステップＳ７〜Ｓ２２の処理の実行を終えていない場合には（ステップＳ２３：ＮＯ）、制御部２０は処理を図２中のステップ７に戻す。そして、ステップＳ７において、制御部２０が上記所定の規則に従って選択数ｄを変更する。そして、これに続き、ステップＳ８〜Ｓ２２の処理が実行される。一方、選択数ｄのすべての設定についてステップＳ７〜Ｓ２２の処理の実行を終えた場合には（ステップＳ２３：ＹＥＳ）、制御部２０は処理をステップＳ２４へ移行させる。 Subsequently, as illustrated in FIG. 4, the control unit 20 determines whether or not the processing of steps S 7 to S 22 has been completed for all settings of the selection number d (step S 23). When the execution of the processing of steps S7 to S22 has not been completed for all the settings of the selection number d (step S23: NO), the control unit 20 returns the processing to step 7 in FIG. In step S7, the control unit 20 changes the selection number d according to the predetermined rule. Subsequently, the processes of steps S8 to S22 are executed. On the other hand, when the execution of the processing of steps S7 to S22 is completed for all the settings of the selection number d (step S23: YES), the control unit 20 shifts the processing to step S24.

上述した選択数ｄの設定についての所定の規則によれば、選択数ｄの設定は全部で６通りであるので、選択数ｄが変更されながらステップＳ７〜Ｓ２２の処理が６回繰り返され、この結果、６通りの選択数ｄにより生成された識別関数の識別正解率が算定される。 According to the predetermined rule for setting the selection number d described above, since the selection number d is set in six ways in total, the processes of steps S7 to S22 are repeated six times while the selection number d is changed. As a result, the discrimination correct answer rate of the discrimination function generated by the six selection numbers d is calculated.

続いて、制御部２０は、選択数ｃのすべての設定についてステップＳ５〜Ｓ２３の処理の実行を終えたか否かを判断する（ステップＳ２４）。選択数ｃのすべての設定についてステップＳ５〜Ｓ２３の処理の実行を終えていない場合には（ステップＳ２４：ＮＯ）、制御部２０は処理を図２中のステップ５に戻す。そして、ステップＳ５において、制御部２０が上記所定の規則に従って選択数ｃを変更する。そして、これに続き、ステップＳ６〜Ｓ２３の処理が実行される。一方、選択数ｃのすべての設定についてステップＳ５〜Ｓ２３の処理の実行を終えた場合には（ステップＳ２４：ＹＥＳ）、制御部２０は処理をステップＳ２５へ移行させる。 Subsequently, the control unit 20 determines whether or not the processing of steps S5 to S23 has been completed for all settings of the selection number c (step S24). If the execution of the processing of steps S5 to S23 has not been completed for all the settings of the selection number c (step S24: NO), the control unit 20 returns the processing to step 5 in FIG. In step S5, the control unit 20 changes the selection number c according to the predetermined rule. Following this, the processes of steps S6 to S23 are executed. On the other hand, when the execution of the processes of steps S5 to S23 is finished for all the settings of the selection number c (step S24: YES), the control unit 20 shifts the process to step S25.

上述した選択数ｃの設定についての所定の規則によれば、選択数ｃの設定は全部で１３通りであるので、選択数ｃが変更されながらステップＳ５〜Ｓ２３の処理が１３回繰り返され、この結果、１３通りの選択数ｃにより生成された識別関数の識別正解率が算定される。 According to the predetermined rule for setting the selection number c described above, since the selection number c is set in 13 ways in total, the processes of steps S5 to S23 are repeated 13 times while the selection number c is changed. As a result, the discrimination accuracy rate of the discrimination function generated by the 13 selection numbers c is calculated.

続いて、制御部２０が、ブロック数のすべての設定についてステップＳ２〜Ｓ２４の処理の実行を終えたか否かを判断する（ステップＳ２５）。ブロック数のすべての設定についてステップＳ２〜Ｓ２４の処理の実行を終えていない場合には（ステップＳ２５：ＮＯ）、制御部２０は処理を図２中のステップＳ２に戻す。そして、ステップＳ２において、制御部２０が上記所定の規則に従ってブロック数を変更する。そして、これに続き、ステップＳ３〜Ｓ２４の処理が実行される。一方、ブロック数のすべての設定についてステップＳ２〜Ｓ２４の処理の実行を終えた場合には（ステップＳ２５：ＹＥＳ）、制御部２０は処理をステップＳ２６に移行させる。 Subsequently, the control unit 20 determines whether or not the processing of steps S2 to S24 has been completed for all settings of the number of blocks (step S25). If the execution of the processes of steps S2 to S24 has not been completed for all settings of the number of blocks (step S25: NO), the control unit 20 returns the process to step S2 in FIG. In step S2, the control unit 20 changes the number of blocks according to the predetermined rule. Subsequently, the processes of steps S3 to S24 are executed. On the other hand, when the execution of the processes of steps S2 to S24 is completed for all settings of the number of blocks (step S25: YES), the control unit 20 shifts the process to step S26.

上述したブロック数の設定についての所定の規則によれば、ブロック数の設定は全部で１３通りであるので、ブロック数が変更されながらステップＳ２〜Ｓ２４の処理が１３回繰り返される。また、ステップＳ２〜Ｓ２４の各ループにおいて、ステップＳ５〜Ｓ２３の処理が１３回繰り返され、さらに、ステップＳ５〜Ｓ２３の各ループにおいて、ステップＳ７〜Ｓ２２の処理が６回繰り返されるので、この結果、１３通りのブロック数と１３通りの選択数ｃと６通りの選択数ｄとの組合せの下で生成された識別関数の識別正解率が算定される。１３通りのブロック数の設定についてステップＳ２〜Ｓ２５の処理の実行を終えたとき、記憶部２１には、１３通りのブロック数、１３通りの選択数ｃおよび６通りの選択数ｄの組合せにそれぞれ対応する１０１４個の識別正解率が記憶されている。 According to the predetermined rule for setting the number of blocks described above, there are 13 types of setting of the number of blocks. Therefore, the processes of steps S2 to S24 are repeated 13 times while the number of blocks is changed. Further, in each loop of steps S2 to S24, the processing of steps S5 to S23 is repeated 13 times, and further, in each loop of steps S5 to S23, the processing of steps S7 to S22 is repeated 6 times. The discrimination accuracy rate of the discrimination function generated under the combination of the 13 block numbers, the 13 selection numbers c, and the 6 selection numbers d is calculated. When the execution of the processing of steps S2 to S25 is finished for the setting of 13 types of blocks, the storage unit 21 stores combinations of 13 types of blocks, 13 types of selections c, and 6 types of selections d, respectively. Corresponding 1014 identification correct answer rates are stored.

ここで、図１７は、上記所定の規則に従って設定されたブロック数、選択数ｃおよび選択数ｄの組合せにそれぞれ対応する識別関数の識別正解率の例を示している。 Here, FIG. 17 shows an example of the identification correct answer rate of the identification function corresponding to the combination of the number of blocks, the number of selections c, and the number of selections d set according to the predetermined rule.

続いて、音識別条件設定支援装置１は確定識別関数等を選定する処理を行う（ステップＳ２６〜Ｓ２７）。すなわち、図４中のステップＳ２６において、識別条件選定部１９が、記憶部２１に記憶された複数の識別正解率を読み出し、それらの中から最も高い識別正解率を特定する。続いて、識別条件選定部１９は、特定した識別正解率を有する識別関数に対応するブロック数、対象周波数帯および対象強度を選定し、選定したブロック数、対象周波数帯および対象強度を確定ブロック数、確定対象周波数帯および確定対象強度として記憶部２１に記憶する。 Subsequently, the sound identification condition setting support device 1 performs a process of selecting a definite identification function or the like (steps S26 to S27). That is, in step S26 in FIG. 4, the identification condition selection unit 19 reads a plurality of identification accuracy rates stored in the storage unit 21 and identifies the highest identification accuracy rate from among them. Subsequently, the identification condition selection unit 19 selects the number of blocks, the target frequency band, and the target intensity corresponding to the identification function having the identified identification accuracy rate, and determines the selected block number, the target frequency band, and the target intensity as the number of determined blocks. Then, it is stored in the storage unit 21 as the confirmation target frequency band and the confirmation target intensity.

続いて、識別関数生成部１７が、確定ブロック数として選定されたブロック数の設定の下で生成された、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００のすべてのスペクトログラムデータにおける確定対象周波数帯のヒストグラムデータにおいて、確定対象強度の度数を用い、改めて識別関数を生成し、生成した識別関数を確定識別関数として記憶部２１に記憶する（ステップＳ２７）。上記確定ブロック数、確定対象周波数帯、確定対象強度および確定識別関数は、後述する異音判定装置３０による異音判定処理に用いられる。 Subsequently, all the spectrogram data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100 generated under the setting of the number of blocks selected as the determined number of blocks by the discriminant function generation unit 17. In the histogram data of the frequency band to be determined at, an identification function is generated again using the frequency of the intensity of the determination target, and the generated identification function is stored in the storage unit 21 as the determined identification function (step S27). The number of determined blocks, the frequency band to be determined, the strength to be determined, and the determination function are used for abnormal sound determination processing by the abnormal sound determination device 30 described later.

以上説明した通り、本発明の実施形態による音識別条件設定支援装置１によれば、採取音中の異音の有無を高精度に識別することができる識別関数を迅速かつ容易に少ない作業負担で生成することができる。すなわち、音識別条件設定支援装置１による識別関数の生成は、音声データを入力しさえすれば自動的に行われるので、識別関数を生成する作業、すなわち異音の特徴抽出に相当する作業を、コンピュータのモニタに表示された採取音のスペクトログラムを見ながら異音の特徴を見出すといった従来の特徴抽出作業と比較し、大幅に簡単化することができる。したがって、作業時間を短縮することができ、また、作業者の負担を軽減することができる。また、サンプル音を用意すれば、音識別条件設定支援装置１により識別関数を自動的に生成することができるので、作業者の経験の多少を問わず、高精度な識別能力を有する識別関数を生成することができる。 As described above, according to the sound identification condition setting support device 1 according to the embodiment of the present invention, an identification function that can accurately identify the presence or absence of an abnormal sound in a collected sound can be quickly and easily performed with a small work load. Can be generated. That is, since the generation of the discrimination function by the sound discrimination condition setting support device 1 is automatically performed as long as voice data is input, the task of generating the discrimination function, that is, the task corresponding to the feature extraction of abnormal noise, Compared with the conventional feature extraction operation of finding the features of abnormal sounds while looking at the spectrogram of the collected sound displayed on the computer monitor, it can be greatly simplified. Therefore, the working time can be shortened and the burden on the worker can be reduced. Also, if a sample sound is prepared, the sound identification condition setting support device 1 can automatically generate a discrimination function, so that a discrimination function having a high accuracy discrimination ability can be obtained regardless of the experience of the operator. Can be generated.

また、本発明の実施形態による音識別条件設定支援処理では、有異音サンプル音のスペクトログラムデータにおいて対象周波数帯および対象強度を選択し、それ以降の処理、すなわち、各無異音サンプル音のスペクトログラムデータのヒストグラムデータを生成する処理、識別関数を生成する処理、および識別関数による識別の正否を判定する処理を、対象周波数帯のブロックのヒストグラムデータにおける対象強度の度数のみを用いて行う。したがって、これらの処理をすべてのブロックを用いて行う場合、またはこれらの処理をヒストグラムデータのすべての強度の度数を用いて行う場合と比較し、処理量、および処理において取り扱うデータの量を大幅に減らすことができる。よって、本発明の実施形態による音識別条件設定支援処理によれば、採取音中の異音の有無を高精度に識別することができる識別関数の生成・選定を、一般的なパーソナルコンピュータを用いて迅速に行うことが可能になる。 In the sound identification condition setting support processing according to the embodiment of the present invention, the target frequency band and the target intensity are selected in the spectrogram data of the abnormal sound sample, and the subsequent processing, that is, the spectrogram of each non-noise sample sound. The process of generating the histogram data of the data, the process of generating the discrimination function, and the process of determining whether the discrimination by the discrimination function is correct or not are performed using only the frequency of the target intensity in the histogram data of the block in the target frequency band. Therefore, compared to the case where these processes are performed using all blocks or the case where these processes are performed using all the frequencies of histogram data, the processing amount and the amount of data handled in the processing are greatly increased. Can be reduced. Therefore, according to the sound identification condition setting support processing according to the embodiment of the present invention, a general personal computer is used to generate and select an identification function that can accurately identify the presence or absence of abnormal sounds in the collected sound. Can be done quickly.

また、本発明の実施形態による音識別条件設定支援処理では、スペクトログラムデータにおける所定の周波数範囲の分割数（ブロック数）を変更し、対象周波数帯として選択すべき周波数帯の個数（選択数ｃ）を変更し、かつ対象強度として選択すべき強度の個数（選択数ｄ）を変更しながら、複数の識別関数を生成し、それらの識別関数の中から識別正解率の最も高い識別関数を選定するので、異音の特徴に応じた高精度な識別能力を有する識別関数を選定することができる。 In the sound identification condition setting support processing according to the embodiment of the present invention, the number of divisions (number of blocks) of a predetermined frequency range in the spectrogram data is changed, and the number of frequency bands to be selected as the target frequency band (number of selections c). And a plurality of discriminant functions are generated while changing the number of intensities to be selected as the target intensities (selection number d), and the discriminant function having the highest discrimination accuracy rate is selected from these discriminant functions. Therefore, it is possible to select a discriminant function having a high-accuracy discriminating capability according to the characteristics of abnormal sounds.

なお、上述した音識別条件設定支援装置１による音識別条件設定支援処理では、スペクトログラムデータの所定の周波数範囲をブロックに分割する際のブロック数と、対象周波数帯として選択すべき周波数帯の個数を定める選択数ｃと、対象強度として選択すべき音の強度の個数を定める選択数ｄのそれぞれを変更しながら複数の識別関数を生成する場合を例にあげたが、本発明はこれに限らない。音識別条件設定支援処理において当該処理の開始時にブロック数を設定した後、ブロック数を変更しないようにしてもよい。また、音識別条件設定支援処理において当該処理の開始時に選択数ｃを設定した後、選択数ｃを変更しないようにしてもよい。また、音識別条件設定支援処理において、対象周波数帯の選択を行わないこととしてもよい。 In the sound identification condition setting support process by the sound identification condition setting support device 1 described above, the number of blocks when the predetermined frequency range of the spectrogram data is divided into blocks and the number of frequency bands to be selected as the target frequency band are calculated. An example has been given of the case where a plurality of discriminant functions are generated while changing the selection number c to be determined and the selection number d to determine the number of sound intensities to be selected as the target intensity, but the present invention is not limited to this. . In the sound identification condition setting support process, after the number of blocks is set at the start of the process, the number of blocks may not be changed. Further, after the selection number c is set at the start of the process in the sound identification condition setting support process, the selection number c may not be changed. In the sound identification condition setting support process, the target frequency band may not be selected.

また、上述した音識別条件設定支援装置１による音識別条件設定支援処理では、有異音サンプル音のスペクトログラムデータにおいて、同一の対象周波数帯のヒストグラムデータにおけるそれぞれの音の強度の順位の平均値を算定し、当該平均値において順位の高い順に音の強度を対象強度として選択する場合を例にあげた（図２中のステップＳ９）。しかしながら、本発明はこれに限らない。有異音サンプル音のスペクトログラムデータにおいて、同一の対象周波数帯のヒストグラムデータにおけるそれぞれの音の強度の順位の累積値を算定し、当該累積値において順位の高い順に音の強度を対象強度として選択してもよい。 Further, in the sound identification condition setting support processing by the sound identification condition setting support device 1 described above, in the spectrogram data of the abnormal sound sample sound, the average value of the ranks of the respective sound intensities in the histogram data of the same target frequency band is obtained. As an example, the calculation is performed and the sound intensity is selected as the target intensity in descending order of the average value (step S9 in FIG. 2). However, the present invention is not limited to this. In the spectrogram data of the sampled noise, calculate the cumulative value of each sound intensity rank in the histogram data of the same target frequency band, and select the sound intensity as the target intensity in descending order of the accumulated value. May be.

また、上述した実施形態における音識別条件設定支援処理では、識別関数の生成および識別関数による識別の正否判定（ステップＳ１０〜Ｓ２０）を例えば２００回繰り返す場合を例にあげたが、本発明はこれに限らず、識別関数の生成および識別関数による識別の正否判定を１回ないし１９９回、または２０１回以上行うこととしてもよい。 Further, in the sound identification condition setting support process in the above-described embodiment, the case where the generation of the identification function and the determination of whether the identification function is correct (steps S10 to S20) is repeated 200 times, for example. Not limited to this, the generation of the discrimination function and the determination of whether the discrimination function is correct or not may be performed once to 199 times, or 201 times or more.

また、上述した実施形態における音識別条件設定支援処理では、確定ブロック数、確定対象周波数帯および確定対象強度を選定した後、確定ブロック数として選定されたブロック数の設定の下で生成された、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００のすべてのスペクトログラムデータにおける確定対象周波数帯のヒストグラムデータにおける確定対象強度の度数を用いて改めて識別関数を生成し、当該生成した識別関数を確定識別関数として選定する場合を例にあげたが、本発明はこれに限らず、確定ブロック数として選定されたブロック数の設定の下で図３中のステップＳ１２において生成された識別関数を確定識別関数として選定してもよい。 Further, in the sound identification condition setting support process in the above-described embodiment, after selecting the number of confirmed blocks, the frequency band to be confirmed and the strength to be confirmed, the sound identification condition setting support process is generated under the setting of the number of blocks selected as the number of confirmed blocks. A discrimination function is generated anew using the frequency of the determination target intensity in the histogram data of the determination target frequency band in all the spectrogram data of the abnormal sound samples A1 to A100 and the non-sound sample sounds B1 to B100, and the generated identification The case where a function is selected as a deterministic discriminating function has been described as an example. However, the present invention is not limited to this, and the discriminating function generated in step S12 in FIG. 3 under the setting of the number of blocks selected as the deterministic block number. May be selected as the deterministic identification function.

また、上述した実施形態では、対象周波数帯および対象強度を選択するための設定用有異音スペクトログラムデータとして、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのすべてを用い、識別関数を生成するための生成用スペクトログラムデータとして、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうちの半数を用い、識別正解率を算定するための検証用スペクトログラムデータとして、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうちの残りの半数を用いる場合を例にあげた。しかしながら、用意した複数の有異音サンプル音のスペクトログラムデータを、設定用有異音スペクトログラムデータ用、生成用スペクトログラムデータ用、および検証用スペクトログラムデータ用としてどのように分配するかは、適宜自由に決めることができる。例えば、１００個の有異音サンプル音のスペクトログラムデータのうちの半数を設定用有異音スペクトログラムデータとして用いてもよい。また、１５０個の有異音サンプル音のスペクトログラムデータを用意し、これらのうちの５０個を設定用有異音スペクトログラムとして用い、他の５０個を生成用スペクトログラムデータとして用い、残りの５０個を検証用スペクトログラムデータとしても用いてもよい。また、１８０個の有異音サンプル音のスペクトログラムデータを用意し、そのうちの１２０個の有異音サンプル音のスペクトログラムデータを設定用有異音スペクトログラムデータおよび生成用スペクトログラムデータとして用い、残りの６０個の有異音サンプル音のスペクトログラムデータを検証用スペクトログラムデータとして用いてもよい。無異音サンプル音のスペクトログラムデータを生成用スペクトログラムデータまたは検証用スペクトログラムデータとして用いる場合も、その分配比率は適宜自由に決めることができる。 In the above-described embodiment, all the spectrogram data of the abnormal sound samples A1 to A100 are used as the abnormal sound spectrogram data for setting for selecting the target frequency band and the target intensity, and the discrimination function is generated. The half of the spectrogram data of the abnormal sound samples A1 to A100 is used as the spectrogram data for generating the sound, and the spectrogram data of the abnormal sound samples A1 to A100 is used as the verification spectrogram data for calculating the discrimination accuracy rate. As an example, the remaining half of the above is used. However, how to distribute the spectrogram data of a plurality of abnormal sound samples prepared for distribution of setting abnormal sound spectrogram data, generation spectrogram data, and verification spectrogram data can be freely determined as appropriate. be able to. For example, half of the spectrogram data of 100 different sample sounds may be used as the setting specific sound spectrogram data. Also, spectrogram data of 150 different sound samples is prepared, and 50 of them are used as setting different sound spectrograms, the other 50 are used as generation spectrogram data, and the remaining 50 are used. It may be used as spectrogram data for verification. Also, spectrogram data of 180 abnormal sound samples is prepared, and the spectrogram data of 120 abnormal sound samples is used as the setting abnormal sound spectrogram data and generation spectrogram data, and the remaining 60 sound samples are generated. The spectrogram data of the different sampled sounds may be used as the spectrogram data for verification. Even when the spectrogram data of the allophone sample sound is used as the spectrogram data for generation or the spectrogram data for verification, the distribution ratio can be freely determined as appropriate.

（異音判定装置の構成）
図１８は、本発明の実施形態による音識別条件設定支援装置１によって生成された確定識別関数、確定ブロック数、確定対象周波数帯、および確定対象強度を用いて採取音中の異音の有無を判定する異音判定装置を示している。図１８において、異音判定装置３０は演算処理部４０を備え、演算処理部４０は、音声データからスペクトログラムデータを生成するスペクトログラム生成部４１と、スペクトログラムデータを複数のブロックに分割し、分割した複数のブロックのうち、確定対象周波数帯のブロックにつき、音の強度分布のヒストグラムデータを生成するヒストグラム生成部４２と、確定対象周波数帯のブロックのヒストグラムデータにおける確定対象強度の度数を識別関数に入力し、異音の有無の識別を行う異音識別部４３と、異音判定処理の流れの制御等を行う制御部４４とを備えている。さらに、異音判定装置３０は、異音判定処理中に保持すべき種々のデータを記憶する記憶部４５と、被判定音の音声データの入力、操作入力、処理結果の表示等を行う入出力部４６とを備えている。 (Configuration of abnormal sound determination device)
FIG. 18 shows the presence / absence of abnormal noise in the collected sound by using the confirmed identification function, the number of confirmed blocks, the confirmed target frequency band, and the confirmed target intensity generated by the sound identification condition setting support device 1 according to the embodiment of the present invention. The abnormal sound determination apparatus which determines is shown. In FIG. 18, the abnormal sound determination device 30 includes an arithmetic processing unit 40. The arithmetic processing unit 40 divides the spectrogram data into a plurality of blocks by dividing the spectrogram data into a plurality of blocks by generating spectrogram data from the speech data. Among these blocks, the histogram generating unit 42 that generates histogram data of the sound intensity distribution for the block in the determination target frequency band, and the frequency of the determination target intensity in the histogram data of the block in the determination target frequency band are input to the identification function. , An abnormal sound identification unit 43 for identifying presence / absence of abnormal noise, and a control unit 44 for controlling the flow of abnormal noise determination processing and the like. Further, the abnormal sound determination device 30 has a storage unit 45 for storing various data to be held during the abnormal sound determination processing, and input / output for performing input of sound data of the sound to be determined, operation input, display of processing results, and the like. Part 46.

異音判定装置３０は例えばパーソナルコンピュータにより実現することができる。具体的には、演算処理部４０は、パーソナルコンピュータに内蔵されたＣＰＵにより実現することができる。また、スペクトログラム生成部４１、ヒストグラム生成部４２、異音識別部４３、および制御部４４は、ＣＰＵにコンピュータプログラムを読み込ませて実行することにより実現することができる。また、記憶部４５は、パーソナルコンピュータが有するハードディスクドライブ装置等の記憶装置により実現することができる。また、入出力部４６は、パーソナルコンピュータが有するキーボード、モニタ等の入出力装置により実現することができる。なお、音識別条件設定支援装置１と異音判定装置３０とを１台のパーソナルコンピュータで同時に動作させるようにしてもよい。 The abnormal sound determination device 30 can be realized by a personal computer, for example. Specifically, the arithmetic processing unit 40 can be realized by a CPU built in a personal computer. The spectrogram generation unit 41, the histogram generation unit 42, the abnormal sound identification unit 43, and the control unit 44 can be realized by causing a CPU to read and execute a computer program. The storage unit 45 can be realized by a storage device such as a hard disk drive device of a personal computer. The input / output unit 46 can be realized by an input / output device such as a keyboard or a monitor included in the personal computer. Note that the sound identification condition setting support device 1 and the abnormal sound determination device 30 may be simultaneously operated by one personal computer.

（異音判定処理）
図１９は異音判定装置３０による異音判定処理を示している。異音判定装置３０は、例えば送電線または機器等に近い場所で採取した比較的長時間（例えば２４時間）に及ぶ採取音につき、異音の有無を判定することができる。この異音判定の対象となる採取音の全体を被判定音という。また、異音判定装置３０による異音判定処理では、被判定音の音声データを比較的短時間（例えば１時間）のいくつかのセクションに分け、セクションごとに異音の有無の識別を行う。 (Abnormal sound judgment processing)
FIG. 19 shows an abnormal sound determination process performed by the abnormal sound determination device 30. The abnormal sound determination device 30 can determine the presence or absence of an abnormal sound for a collected sound over a relatively long time (for example, 24 hours) collected in a place close to a power transmission line or a device, for example. The entire collected sound that is the target of the abnormal sound determination is referred to as a sound to be determined. In the abnormal sound determination processing by the abnormal sound determination device 30, the sound data of the sound to be determined is divided into several sections of a relatively short time (for example, 1 hour), and the presence or absence of the abnormal sound is identified for each section.

異音判定処理の準備として、作業者は、音識別条件設定支援装置１により生成された確定識別関数、確定ブロック数、確定対象周波数帯、および確定対象強度を異音判定装置３０に入力し、これらを異音判定装置３０の記憶部４５に記憶させる。また、作業者は、被判定音の音声データを異音判定装置３０に入力し、これを異音判定装置３０の記憶部４５に記憶させる。続いて、作業者が異音判定処理を開始する旨の指示を異音判定装置３０に入力すると、異音判定装置３０は異音判定処理を開始する。 As preparation for the abnormal sound determination process, the operator inputs the confirmed identification function, the number of confirmed blocks, the determined frequency band, and the determined target intensity generated by the sound identification condition setting support device 1 to the abnormal sound determination device 30. These are stored in the storage unit 45 of the abnormal sound determination device 30. Further, the worker inputs the sound data of the sound to be determined to the abnormal sound determination device 30 and stores it in the storage unit 45 of the abnormal sound determination device 30. Subsequently, when the operator inputs an instruction to start the abnormal sound determination process to the abnormal sound determination apparatus 30, the abnormal sound determination apparatus 30 starts the abnormal sound determination process.

図１９に示すように、異音判定処理において、まず、制御部４４が、確定識別関数、確定ブロック数、確定対象周波数帯、および確定対象強度を記憶部４５から読み出す（ステップＳ４１）。 As shown in FIG. 19, in the abnormal sound determination process, first, the control unit 44 reads a definite identification function, the number of definite blocks, a definite target frequency band, and a definite target intensity from the storage unit 45 (step S41).

続いて、制御部４４は、被判定音の音声データにおいて、異音の有無の識別をまだ実行していないセクションのうちの先頭のセクション（被判定音の音声データの開始時点に最も近いセクション）に対応する音声データを抽出する（ステップＳ４２）。 Subsequently, the control unit 44, in the sound data of the sound to be determined, among the sections that have not yet been identified for the presence or absence of the abnormal sound (the section closest to the start time of the sound data of the sound to be determined) Is extracted (step S42).

続いて、スペクトログラム生成部４１が、ステップＳ４２で抽出された音声データのスペクトログラムデータを生成する（ステップＳ４３）。 Subsequently, the spectrogram generation unit 41 generates spectrogram data of the voice data extracted in step S42 (step S43).

続いて、ヒストグラム生成部４２が、ステップＳ４３で生成されたスペクトログラムデータにつき、所定の周波数範囲を確定ブロック数のブロックに分割し、分割した複数のブロックのうち、確定対象周波数帯の各ブロックにつき、音の強度分布のヒストグラムデータを生成する（ステップＳ４４）。 Subsequently, for the spectrogram data generated in step S43, the histogram generation unit 42 divides a predetermined frequency range into blocks of the determined number of blocks, and among each of the divided blocks, for each block of the determined target frequency band, Histogram data of the sound intensity distribution is generated (step S44).

続いて、異音識別部４３が、確定対象周波数の各ブロックのヒストグラムデータにおける確定対象強度の度数を確定識別関数に入力し、異音の有無の識別を実行し、その識別結果を記憶部４５に記憶する（ステップＳ４５）。 Subsequently, the abnormal sound identification unit 43 inputs the frequency of the determination target intensity in the histogram data of each block of the determination target frequency to the determination identification function, identifies the presence or absence of abnormal noise, and stores the identification result in the storage unit 45. (Step S45).

続いて、制御部４４が、被判定音の音声データのすべてのセクションにつき、異音の有無の識別を実行し終えたか否かを判断する（ステップＳ４６）。被判定音の音声データのすべてのセクションにつき異音の有無の識別を実行し終えていない場合には（ステップＳ４６：ＮＯ）、ステップＳ４２に戻り、制御部４４が、次のセクションに対応する音声データを被判定音の音声データから抽出し、スペクトログラム生成部４１、ヒストグラム生成部４２および異音識別部４３がステップＳ４３〜Ｓ４５の処理を実行する。 Subsequently, the control unit 44 determines whether or not the identification of the presence or absence of abnormal sounds has been completed for all sections of the sound data of the sound to be determined (step S46). If the identification of the presence / absence of abnormal sound has not been performed for all sections of the sound data of the sound to be determined (step S46: NO), the process returns to step S42, and the control unit 44 performs the sound corresponding to the next section. Data is extracted from the sound data of the sound to be determined, and the spectrogram generation unit 41, the histogram generation unit 42, and the abnormal sound identification unit 43 execute the processes of steps S43 to S45.

一方、被判定音の音声データのすべてのセクションにつき異音の有無の識別を実行し終えた場合には（ステップＳ４６：ＹＥＳ）、制御部４４が、被判定音についての異音の有無の識別結果をセクションごとにモニタ等に表示する（ステップＳ４７）。作業者はモニタを見ることにより、被判定音中の異音の有無を即座に把握することができる。また、被判定音中の異音の有無をセクションごとに把握することができるので、被判定音として用いる採取音が比較的長時間に及び、かつその中に含まれる異音が比較的短時間である場合でも、当該採取音中の異音を迅速に発見することができる。 On the other hand, when the identification of the presence / absence of abnormal sound has been completed for all sections of the sound data of the sound to be determined (step S46: YES), the control unit 44 identifies the presence / absence of abnormal sound for the sound to be determined. The result is displayed on a monitor or the like for each section (step S47). By looking at the monitor, the operator can immediately grasp the presence or absence of abnormal noise in the sound to be determined. In addition, since the presence / absence of abnormal noise in the judged sound can be grasped for each section, the collected sound used as the judged sound is relatively long, and the abnormal sound included therein is relatively short. Even in this case, it is possible to quickly find an abnormal sound in the collected sound.

また、異音判定装置３０によれば、被判定音のスペクトログラムデータにおいて、確定対象周波数帯のブロックについてのみヒストグラムデータの生成を行い、また、確定対象周波数帯のブロックのヒストグラムデータにおける確定対象強度の度数についてのみ識別関数による識別を実行するので、被判定音のスペクトログラムデータのすべてのブロックについてヒストグラムデータの生成を行う場合や、被判定音のスペクトログラムデータにおけるヒストグラムデータのすべての強度について識別関数による識別を実行する場合と比較し、大幅な処理の迅速化を図ることができる。 In addition, according to the abnormal sound determination device 30, in the spectrogram data of the sound to be determined, histogram data is generated only for the block in the fixed frequency band, and the determined target intensity in the histogram data of the block in the fixed frequency band is determined. Since discrimination using the discrimination function is performed only for the frequency, when generating histogram data for all blocks of the spectrogram data of the sound to be judged, or using the discrimination function for all intensities of the histogram data in the spectrogram data of the sound to be judged Compared with the case of executing the above, it is possible to greatly speed up the processing.

なお、本発明は、請求の範囲および明細書全体から読み取ることのできる発明の要旨または思想に反しない範囲で適宜変更可能であり、そのような変更を伴う音識別条件設定支援装置および音識別条件設定支援方法もまた本発明の技術思想に含まれる。 It should be noted that the present invention can be appropriately changed without departing from the spirit or idea of the invention that can be read from the claims and the entire specification, and the sound identification condition setting support device and the sound identification condition accompanying such a change. A setting support method is also included in the technical idea of the present invention.

１音識別条件設定支援装置
１１演算処理部
１２スペクトログラム生成部
１３ヒストグラム生成部
１４ブロック選択部
１５周波数帯選択部
１６強度選択部
１７識別関数生成部
１８正解率算定部
１９識別条件選定部
２０制御部
２１記憶部
２２入出力部
３０異音判定装置
４０演算処理部
４１スペクトログラム生成部
４２ヒストグラム生成部
４３異音識別部
４４制御部
４５記憶部
４６入出力部 DESCRIPTION OF SYMBOLS 1 Sound identification condition setting assistance apparatus 11 Arithmetic processing part 12 Spectrogram generation part 13 Histogram generation part 14 Block selection part 15 Frequency band selection part 16 Intensity selection part 17 Discrimination function generation part 18 Correct answer rate calculation part 19 Identification condition selection part 20 Control part DESCRIPTION OF SYMBOLS 21 Memory | storage part 22 Input / output part 30 Abnormal sound determination apparatus 40 Arithmetic processing part 41 Spectrogram generation part 42 Histogram generation part 43 Abnormal sound identification part 44 Control part 45 Storage part 46 Input / output part

Claims

A sound identification condition setting support device that supports setting of an identification condition for identifying the presence or absence of abnormal noise,
It receives setting noise spectrogram data showing the spectrograms of multiple setting abnormal sound samples including abnormal sounds, and divides the predetermined frequency range into multiple blocks in each setting abnormal sound spectrogram data. A setting noise histogram generator for generating setting noise histogram data indicating a histogram of the sound intensity distribution in each block;
In the histogram data of the blocks in the same frequency band in the plurality of setting abnormal sound spectrogram data in which the setting abnormal sound histogram data is generated by the setting abnormal sound histogram generation unit, the frequency of the intensity of each sound An intensity selection unit that calculates an average value or cumulative value of ranks, specifies a predetermined selection number of sound in descending order of the average value or cumulative value, and selects the specified sound intensity as a target intensity; ,
Receiving generation spectrogram data indicating spectrograms of a plurality of generation sample sounds including generation abnormal sound samples including abnormal sounds and generation non-abnormal sound samples including no abnormal sounds, and generating each spectrogram data A generation histogram generation unit that divides a predetermined frequency range into a plurality of blocks and generates generation histogram data indicating a histogram of the sound intensity distribution in each block;
An identification function generator for generating an identification function for identifying the presence or absence of abnormal noise using the frequency of the target intensity in each of the generation histogram data;
It receives spectrogram data for verification indicating spectrograms of a plurality of sample sounds for verification including abnormal sound samples for verification including abnormal sounds and non-noise sample sounds for verification not including abnormal sounds, and each of the spectrogram data for verification A verification histogram generation unit that divides a predetermined frequency range into a plurality of blocks and generates verification histogram data indicating a histogram of sound intensity distribution in each block;
The frequency of the target intensity in each verification histogram data is input to the discrimination function to identify the presence / absence of abnormal noise, and the discrimination accuracy rate that is the probability that the discrimination function correctly identifies the presence / absence of abnormal noise is calculated. A sound identification condition setting support device, comprising:

While changing the setting of the number of selections, the processing by the intensity selection unit, the generation histogram generation unit, the discriminant function generation unit, the verification histogram generation unit, and the accuracy rate calculation unit is executed a plurality of times. As a result of executing the identification function a plurality of times, the identification function having the highest identification accuracy rate among the plurality of identification functions generated by the identification function generation unit and the target strength selected when generating the identification function are determined as the identification condition. The sound identification condition setting support apparatus according to claim 1, further comprising: an identification condition selection unit that selects the sound identification condition as a sound identification condition selection unit.

While changing the setting of the number of blocks and the selection number when dividing a predetermined frequency range in each spectrogram data into a plurality of blocks, the setting unusual sound histogram generation unit, the intensity selection unit, and the generation histogram A plurality of identifications generated by the discrimination function generation unit as a result of executing the processing by the generation unit, the discrimination function generation unit, the verification histogram generation unit, and the accuracy rate calculation unit a plurality of times and executing the processing a plurality of times. The discrimination function having the highest discrimination accuracy rate among the functions, the number of blocks set when generating the discrimination function, and the target strength selected when generating the discrimination function are selected as the discrimination conditions. The sound identification condition setting support device according to claim 1, further comprising an identification condition selection unit.

A sound identification condition setting support method for supporting setting of an identification condition for identifying the presence or absence of abnormal noise,
It receives setting noise spectrogram data showing the spectrograms of multiple setting abnormal sound samples including abnormal sounds, and divides the predetermined frequency range into multiple blocks in each setting abnormal sound spectrogram data. A setting noise histogram generation step for generating setting noise histogram data indicating a histogram of the sound intensity distribution in each block;
In the histogram data of the blocks of the same frequency band in the plurality of setting noise spectrogram data for which the setting noise histogram data has been generated in the setting noise histogram generation step, the frequency of the intensity of each sound A strength selection step of calculating an average value or cumulative value of ranks, specifying a predetermined number of sound intensities in descending order of the average value or cumulative value, and selecting the identified sound strengths as target strengths; ,
Receiving generation spectrogram data indicating spectrograms of a plurality of generation sample sounds including generation abnormal sound samples including abnormal sounds and generation non-abnormal sound samples including no abnormal sounds, and generating each spectrogram data A generation histogram generation step of dividing a predetermined frequency range into a plurality of blocks and generating generation histogram data indicating a histogram of the sound intensity distribution in each block;
An identification function generating step for generating an identification function for identifying the presence or absence of abnormal noise using the frequency of the target intensity in each of the generation histogram data;
It receives spectrogram data for verification indicating spectrograms of a plurality of sample sounds for verification including abnormal sound samples for verification including abnormal sounds and non-noise sample sounds for verification not including abnormal sounds, and each of the spectrogram data for verification A verification histogram generating step of dividing a predetermined frequency range into a plurality of blocks and generating verification histogram data indicating a histogram of the sound intensity distribution in each block;
The frequency of the target intensity in each verification histogram data is input to the discrimination function to identify the presence / absence of abnormal noise, and the discrimination accuracy rate that is the probability that the discrimination function correctly identifies the presence / absence of abnormal noise is calculated. A correct answer rate calculation process,
While changing the setting of the selection number, the processing by the intensity selection step, the generation histogram generation step, the discriminant function generation step, the verification histogram generation step and the accuracy rate calculation step is executed a plurality of times. As a result of executing a plurality of times, the discriminant function having the highest discrimination accuracy rate among the plurality of discriminant functions generated in the discriminant function generation step and the target strength selected when generating the discriminant function are set as the discriminant condition. Sound identification condition setting support method characterized by selecting as

While changing the setting of the number of blocks and the number of selections when dividing a predetermined frequency range in each spectrogram data into a plurality of blocks, the setting abnormal sound histogram generation step, the intensity selection step, and the generation histogram The identification function generation step, the verification histogram generation step, the verification histogram generation step, and the accuracy rate calculation step are executed a plurality of times. As a result of executing the processing a plurality of times, a plurality of identifications generated in the discrimination function generation step The discrimination function having the highest discrimination accuracy rate among the functions, the number of blocks set when generating the discrimination function, and the target strength selected when generating the discrimination function are selected as the discrimination conditions. The sound identification condition setting support method according to claim 4.