JP5937125B2

JP5937125B2 - Sound identification condition setting support apparatus and sound identification condition setting support method

Info

Publication number: JP5937125B2
Application number: JP2014053464A
Authority: JP
Inventors: 勝哉梶山
Original assignee: Chugoku Electric Power Co Inc
Current assignee: Chugoku Electric Power Co Inc
Priority date: 2014-03-17
Filing date: 2014-03-17
Publication date: 2016-06-22
Anticipated expiration: 2034-03-17
Also published as: JP2015175770A

Description

本発明は、採取音中の異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援装置および音識別条件設定支援方法に関する。 The present invention relates to a sound identification condition setting support device and a sound identification condition setting support method that support the setting of an identification condition for identifying the presence or absence of an abnormal sound in a collected sound.

例えば、送電線から発せられる異音を除去する方策を検討する場合や、機器から発せられる異音に基づいて機器の故障診断を行う場合に、送電線または機器に近い場所で採取した音の中に異音が含まれているか否かを、パターン認識手法を用いて判定する。具体的には、多くの場合、採取音をスペクトル解析し、採取音または異音の周波数の特徴に基づき、採取音中の異音の有無を識別する（特許文献１参照）。 For example, when considering measures to remove abnormal noise emitted from power transmission lines, or when diagnosing equipment malfunctions based on abnormal noise emitted from equipment, of the sounds collected near the power transmission line or equipment. It is determined by using a pattern recognition method whether or not there is an abnormal sound. Specifically, in many cases, the collected sound is subjected to spectrum analysis, and the presence or absence of the abnormal sound in the collected sound is identified based on the characteristics of the collected sound or the frequency of the abnormal sound (see Patent Document 1).

しかしながら、異音の周波数または音の強さが時間的に変化する場合、採取音または異音の周波数の静的な特徴に基づくだけでは、異音の有無を高精度に識別することが難しいことがある。このような場合には、採取音のスペクトログラムを分析し、採取音または異音の周波数または音の強さの時間的変化を勘案し、採取音中の異音の有無を識別することが望ましい。これにより、異音の周波数または音の強さが時間的に変化する場合でも、異音の有無を高精度に識別することができる。 However, if the frequency or intensity of the abnormal sound changes over time, it is difficult to accurately identify the presence or absence of the abnormal noise based solely on the static characteristics of the sampled sound or the abnormal noise frequency. There is. In such a case, it is desirable to analyze the spectrogram of the collected sound, and to identify the presence or absence of the abnormal sound in the collected sound in consideration of the temporal change in the frequency or intensity of the collected sound or the abnormal sound. Thereby, even when the frequency or intensity of the abnormal sound changes with time, the presence or absence of the abnormal sound can be identified with high accuracy.

特開平１１−２４１９４５号公報JP-A-11-241945

採取音のスペクトログラムを用い、パターン認識手法により採取音中における異音の有無の識別を行うには、スペクトログラムとして表現された当該異音の特徴を抽出することが必要である。この特徴抽出の優劣が識別精度を大きく左右する。 In order to identify the presence or absence of an abnormal sound in the collected sound by the pattern recognition method using the spectrogram of the collected sound, it is necessary to extract the characteristic of the abnormal sound expressed as the spectrogram. The superiority or inferiority of the feature extraction greatly affects the identification accuracy.

異音の特徴抽出方法については、一般的または統一的な方法は知られていない。従来、作業者は、採取音のスペクトログラムをコンピュータのモニタに表示し、それを目で見ることにより異音のスペクトログラムの特徴を見出すといった方法で異音の特徴抽出を行っている。 There is no known general or unified method for extracting abnormal sound features. 2. Description of the Related Art Conventionally, an operator performs feature extraction of an abnormal sound by a method of displaying a spectrogram of a collected sound on a computer monitor and finding the characteristic of the spectrogram of the abnormal sound by visually observing the spectrogram.

しかしながら、異音の高精度な識別を可能にする異音の特徴を、コンピュータのモニタに表示されたスペクトログラムを見ることにより見出す作業は、長い時間がかかり、大きな負担を作業者に強いる。また、異音の高精度な識別を可能にする異音の特徴を見出せるか否かが作業者の経験に依存する場合が多いため、練達した作業者が存在しない場合には高精度な識別を実現することが困難である。 However, it takes a long time to find the characteristics of the abnormal sound that enables high-accuracy identification of the abnormal sound by looking at the spectrogram displayed on the computer monitor, which imposes a heavy burden on the operator. In addition, since it is often dependent on the experience of the operator whether or not the feature of the abnormal sound that enables high-precision identification of abnormal noise can be found, high-precision identification is performed when there is no skilled worker. It is difficult to realize.

また、例えば送電線から発せられる異音を除去する方法を検討する場面では、異音の発生時期やどのような異音が発生するのかを予測することが困難であると共に、実際に異音が発生した場合には、その異音を除去する方策を短時間で講じなければならない。このような場面では、異音の特徴抽出を迅速にかつ確実に行わなければならないが、上述した通り、従来技術によってこれを実現することは難しい。 In addition, for example, when considering a method for removing abnormal noise emitted from a transmission line, it is difficult to predict when an abnormal noise will occur and what kind of abnormal noise will occur, If it occurs, measures must be taken in a short time to remove the noise. In such a scene, it is necessary to quickly and reliably perform the feature extraction of abnormal noise, but as described above, it is difficult to realize this by the conventional technology.

本発明は例えば上述したような問題に鑑みなされたものであり、本発明の課題は、採取音のスペクトログラムにおいて異音の特徴抽出またはそれに相当する処理を容易にかつ確実に行うことができ、当該処理に関する作業者の負担を削減することができると共に、異音の有無の高精度な識別を実現することができる音識別条件設定支援装置および音識別条件設定支援方法を提供することにある。 The present invention has been made in view of, for example, the above-described problems, and the problem of the present invention is that it is possible to easily and reliably perform abnormal sound feature extraction or processing corresponding thereto in a spectrogram of collected sound. An object of the present invention is to provide a sound identification condition setting support device and a sound identification condition setting support method that can reduce the burden on an operator regarding processing and can realize highly accurate identification of the presence or absence of abnormal noise.

上記課題を解決するために、本発明の音識別条件設定支援装置は、異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援装置であって、異音を含む複数の有異音サンプル音および異音を含まない複数の無異音サンプル音のスペクトログラムをそれぞれ示すスペクトログラムデータを受け取り、当該各スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示すヒストグラムデータを生成するヒストグラム生成部と、前記複数の有異音サンプル音のうちのいくつかの有異音サンプル音についての各ブロックのヒストグラムデータ、および前記複数の無異音サンプル音のうちのいくつかの無異音サンプル音についての各ブロックのヒストグラムデータを用い、異音の有無を識別するための識別関数を生成する識別関数生成部と、前記複数の有異音サンプル音のうちの他のいくつかの有異音サンプル音についての各ブロックのヒストグラムデータ、および前記複数の無異音サンプル音のうちの他のいくつかの無異音サンプル音についての各ブロックのヒストグラムデータを前記識別関数に入力して異音の有無の識別を実行し、前記識別関数が異音の有無を正しく識別する確率である識別正解率を算定する正解率算定部と、前記各スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割する際のブロック数の設定を変更しつつ、前記ヒストグラム生成部、前記識別関数生成部および前記正解率算定部による処理を複数回実行し、当該処理を複数回実行した結果、前記識別関数生成部により生成された複数の識別関数のうち前記識別正解率が最も高い識別関数および当該識別関数を生成する際に設定された前記ブロック数を前記識別条件として選定する識別条件選定部と
を備えていることを特徴とする。 In order to solve the above-described problem, a sound identification condition setting support device according to the present invention is a sound identification condition setting support device that supports setting of an identification condition for identifying the presence or absence of an abnormal sound, and includes a plurality of sound identification condition setting support devices. The spectrogram data indicating the spectrograms of the abnormal sample sound and the non-noise sample sound that does not include the abnormal sound are received, the predetermined frequency range in each spectrogram data is divided into a plurality of blocks, A histogram generator for generating histogram data indicating a histogram of the intensity distribution of the sound; histogram data of each block for several different sampled sounds of the plurality of different sampled sounds; and Histogram of each block for some non-sound sample sounds of all sound samples A discriminant function generator for generating a discriminant function for discriminating the presence or absence of abnormal noise, and each block for some other abnormal sound samples of the plurality of abnormal sound samples Histogram data of each block and the histogram data of each block for some other non-sound sample sounds of the plurality of non-sound sample sounds are input to the discrimination function to identify the presence or absence of allophones A correct rate calculation unit for calculating the correct rate of discrimination, which is a probability that the discrimination function correctly identifies the presence or absence of abnormal noise, and setting of the number of blocks when dividing a predetermined frequency range in each spectrogram data into a plurality of blocks As a result of executing the processing by the histogram generation unit, the discrimination function generation unit and the accuracy rate calculation unit a plurality of times, and executing the processing a plurality of times, And said identification accuracy rate selecting the number of blocks set in generating the highest classification function and the identification function as the identification condition identification condition selecting section among the plurality of discriminant function generated by the function generator It is characterized by having.

また、本発明の第２の音識別条件設定支援装置は、上述した本発明の第１の音識別条件設定支援装置において、前記有異音サンプル音または前記無異音サンプル音のスペクトログラムを示すスペクトログラムデータを生成するスペクトログラム生成部を備えていることを特徴とする。 A second sound identification condition setting support device according to the present invention is a spectrogram showing a spectrogram of the abnormal sound sample or the non-natural sound sample sound in the first sound identification condition setting support device of the present invention described above. A spectrogram generation unit for generating data is provided.

上記課題を解決するために、本発明の音識別条件設定支援方法は、異音の有無を識別するための識別条件の設定を支援する音識別条件設定支援方法であって、異音を含む複数の有異音サンプル音および異音を含まない複数の無異音サンプル音のスペクトログラムをそれぞれ示すスペクトログラムデータのそれぞれにつき、当該スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割し、当該各ブロックにおける音の強度分布のヒストグラムを示すヒストグラムデータを生成するヒストグラム生成工程と、前記複数の有異音サンプル音のうちのいくつかの有異音サンプル音についての各ブロックのヒストグラムデータ、および前記複数の無異音サンプル音のうちのいくつかの無異音サンプル音についての各ブロックのヒストグラムデータを用い、異音の有無を識別するための識別関数を生成する識別関数生成工程と、前記複数の有異音サンプル音のうちの他のいくつかの有異音サンプル音についての各ブロックのヒストグラムデータ、および前記複数の無異音サンプル音のうちの他のいくつかの無異音サンプル音についての各ブロックのヒストグラムデータを前記識別関数に入力して異音の有無の識別を実行し、前記識別関数が異音の有無を正しく識別する確率である識別正解率を算定する正解率算定工程とを備え、前記各スペクトログラムデータにおける所定の周波数範囲を複数のブロックに分割する際のブロック数の設定を変更しつつ、前記ヒストグラム生成工程、前記識別関数生成工程および前記正解率算定工程を複数回実行し、これらの工程を複数回実行した結果、前記識別関数生成工程において生成された複数の識別関数のうち前記識別正解率が最も高い識別関数および当該識別関数を生成する際に設定された前記ブロック数を前記識別条件として選定することを特徴とする。 In order to solve the above-described problem, a sound identification condition setting support method according to the present invention is a sound identification condition setting support method that supports setting of an identification condition for identifying the presence or absence of abnormal noise, and includes a plurality of sound identification condition setting support methods. For each of the spectrogram data indicating the spectrograms of the abnormal sample sound and the non-noise sample sound that does not include the abnormal sound, the predetermined frequency range in the spectrogram data is divided into a plurality of blocks, and A histogram generating step for generating histogram data indicating a histogram of a sound intensity distribution; histogram data of each block for several different sampled sounds of the plurality of different sampled sounds; Hist of each block for some non-sound sample sounds of all sound samples A discriminant function generating step for generating a discriminant function for discriminating the presence or absence of abnormal noise using the ram data, and each block for some other abnormal sound samples of the plurality of abnormal sound samples Histogram data of each block and the histogram data of each block for some other non-sound sample sounds of the plurality of non-sound sample sounds are input to the discrimination function to identify the presence or absence of allophones A correct answer rate calculating step of calculating a correct answer rate that is a probability that the discriminating function correctly identifies the presence or absence of abnormal noise, and the number of blocks when dividing a predetermined frequency range in each spectrogram data into a plurality of blocks The histogram generation step, the discriminant function generation step and the accuracy rate calculation step are executed a plurality of times while changing the setting of As a result, among the plurality of discriminant functions generated in the discriminant function generation step, the discriminant function having the highest discriminant accuracy rate and the number of blocks set when generating the discriminant function are selected as the discriminating conditions. It is characterized by.

本発明によれば、採取音のスペクトログラムにおいて異音の特徴抽出またはそれに相当する処理を容易にかつ確実に行うことができ、当該処理に関する作業者の負担を削減することができる。 According to the present invention, it is possible to easily and reliably perform abnormal noise feature extraction or processing corresponding thereto in a spectrogram of collected sound, and reduce the burden on the operator regarding the processing.

本発明の実施形態による音識別条件設定支援装置を示すブロック図である。It is a block diagram which shows the sound identification condition setting assistance apparatus by embodiment of this invention. 本発明の実施形態による音識別条件設定支援装置による音識別条件設定支援処理を示すフローチャートである。It is a flowchart which shows the sound identification condition setting assistance process by the sound identification condition setting assistance apparatus by embodiment of this invention. 図２に続く音識別条件設定支援処理を示すフローチャートである。It is a flowchart which shows the sound identification condition setting assistance process following FIG. 有異音サンプル音の波形図である。It is a wave form diagram of a strange sound sample sound. 有異音サンプル音のスペクトログラムを示すグラフである。It is a graph which shows the spectrogram of an unusual sound sample sound. 有異音サンプル音のスペクトログラムの１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in one block of the spectrogram of the unusual sound sample. 有異音サンプル音のスペクトログラムの他の１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in other one block of the spectrogram of an unusual sound sample sound. 有異音サンプル音のスペクトログラムのさらに他の１つのブロックにおける音の強度分布のヒストグラムを示すグラフである。It is a graph which shows the histogram of the intensity distribution of the sound in another one block of the spectrogram of a strange sound sample sound. サンプル音のスペクトログラムを２０個のブロック数のブロックに分割した状態を示す説明図である。It is explanatory drawing which shows the state which divided | segmented the spectrogram of the sample sound into the block of 20 blocks. サンプル音のスペクトログラムを３０個のブロック数のブロックに分割した状態を示す説明図である。It is explanatory drawing which shows the state which divided | segmented the spectrogram of the sample sound into the block of the number of 30 blocks. 識別関数の識別正解率の算定方法を示す説明図である。It is explanatory drawing which shows the calculation method of the identification correct rate of an identification function. ブロック数の各設定に対応する識別関数の識別正解率を示す説明図である。It is explanatory drawing which shows the discrimination | determination accuracy rate of the discrimination function corresponding to each setting of the number of blocks. 本発明の実施形態による異音判定装置を示すブロック図である。It is a block diagram which shows the abnormal sound determination apparatus by embodiment of this invention. 本発明の実施形態による異音判定装置による異音判定処理を示すフローチャートである。It is a flowchart which shows the noise determination process by the noise determination apparatus by embodiment of this invention.

以下、本発明の実施の形態につき、図面を参照しながら説明する。 Embodiments of the present invention will be described below with reference to the drawings.

（音識別条件設定支援装置による処理の概要）
図１は本発明の実施形態による音識別条件設定支援装置を示している。図１において、本発明の実施形態による音識別条件設定支援装置１は、採取音中の異音の有無を識別するための識別条件の設定を支援する装置である。採取音は、例えば、送電線または機器等に近い場所で採取した音である。異音は、例えば、送電線の周囲に住む人が不快を訴える音、または機器が正常である場合には発生しない音等である。 (Outline of processing by the sound identification condition setting support device)
FIG. 1 shows a sound identification condition setting support apparatus according to an embodiment of the present invention. In FIG. 1, a sound identification condition setting support apparatus 1 according to an embodiment of the present invention is an apparatus that supports the setting of an identification condition for identifying the presence or absence of abnormal sounds in a collected sound. The collected sound is, for example, a sound collected at a place near a power transmission line or equipment. The abnormal noise is, for example, a sound in which people living around the power transmission line complain of discomfort or a sound that does not occur when the device is normal.

音識別条件設定支援装置１により行われる識別条件の設定を支援する処理（音識別条件設定支援処理）は、大まかに述べると、採取音中の異音の有無を識別することができる識別関数を生成ないし選定する処理である。この処理は、実質的にみて、異音の特徴を抽出する処理に相当する。 The process for supporting the setting of the identification condition performed by the sound identification condition setting support device 1 (sound identification condition setting support process) can be roughly described as an identification function that can identify the presence or absence of abnormal sounds in the collected sound. It is a process to generate or select. This process is substantially equivalent to a process for extracting features of abnormal sounds.

音識別条件設定支援装置１は、音識別条件設定支援処理として、概ね次の処理を行う。まず、処理の準備として、作業者は、有異音サンプル音の音声データおよび無異音サンプル音の音声データを用意する。有異音サンプル音とは、異音を含む採取音のサンプルである。無異音サンプル音とは、異音を含まない採取音のサンプルである。音声データは、例えば、ＷＡＶ、ＭＰ３（ＭＰＥＧＡｕｄｉｏＬａｙｅｒー３）、ＡＡＣ（ＡｄｖａｎｃｅｄＡｕｄｉｏＣｏｄｉｎｇ）等の音声ファイルフォーマットを有するデジタルデータである。 The sound identification condition setting support device 1 generally performs the following process as the sound identification condition setting support process. First, as preparation for processing, an operator prepares audio data of abnormal sound samples and audio data of non-natural sound samples. The abnormal sound sample sound is a sample of the collected sound including the abnormal sound. An allophone sample sound is a sample of a sampled sound that does not contain an allophone. The audio data is digital data having an audio file format such as WAV, MP3 (MPEG Audio Layer-3), AAC (Advanced Audio Coding), and the like.

作業者は、例えば次のような方法で、有異音サンプル音の音声データと無異音サンプル音の音声データをそれぞれ複数用意する。すなわち、作業者は、同じ場所で複数回録音を実行することにより複数の音声を採取し、採取した複数の音声を順次再生し、それらを聴き、各音声中の異音の有無を判断する。そして、作業者は、異音が含まれている音声に対応する音声データを有異音サンプル音の音声データに分類し、異音が含まれていない音声に対応する音声データを無異音サンプル音の音声データに分類する。 For example, the operator prepares a plurality of voice data of abnormal sample sounds and a plurality of voice data of non-sound sample sounds by the following method. In other words, the operator collects a plurality of sounds by executing recording at the same place, reproduces the collected sounds sequentially, listens to them, and determines the presence or absence of abnormal sounds in each sound. Then, the operator classifies the audio data corresponding to the sound including the abnormal sound into the sound data of the abnormal sound sample sound, and the sound data corresponding to the sound not including the abnormal sound is sampled without the abnormal sound. Classify into sound data.

次に、作業者は、有異音サンプル音の音声データおよび無異音サンプル音の音声データを音識別条件設定支援装置１に入力する。 Next, the operator inputs the sound data of the abnormal sample sound and the sound data of the non-sound sample sound to the sound identification condition setting support device 1.

音識別条件設定支援装置１は、入力された複数の有異音サンプル音の音声データおよび複数の無異音サンプル音の音声データのそれぞれにつき、音声のスペクトログラムを示すスペクトログラムデータを生成する。 The sound identification condition setting support device 1 generates spectrogram data indicating a spectrogram of speech for each of the input speech data of a plurality of abnormal sample sounds and the sound data of a plurality of non-sound sample sounds.

続いて、音識別条件設定支援装置１は、複数の有異音サンプル音のスペクトログラムデータのうち、例えば半数の有異音サンプル音のスペクトログラムデータを識別関数生成用の有異音サンプル音のスペクトログラムデータとして選択する。また、音識別条件設定支援装置１は、複数の無異音サンプル音のスペクトログラムデータのうち、例えば半数の無異音サンプル音のスペクトログラムデータを識別関数生成用の無異音サンプル音のスペクトログラムデータとして選択する。 Subsequently, the sound identification condition setting support device 1 uses, for example, spectrogram data of a half-tone sample sound among a plurality of peculiar sound sample spectrogram data, and the spectrogram data of the phantom sample sound for generating a discrimination function. Choose as. Further, the sound identification condition setting support device 1 uses, for example, spectrogram data of half of the non-sound sample sound as spectrogram data of the non-sound sample sound for generating the discrimination function among the spectrogram data of the plurality of non-sound sample sounds. select.

続いて、音識別条件設定支援装置１は、選択したスペクトログラムデータのそれぞれにつき、所定の周波数範囲を、所定の規則に従って設定したブロック数のブロックに分割し、ブロックのそれぞれにつき、音の強度分布のヒストグラムを示すヒストグラムデータを生成する。上記所定の周波数範囲は、例えば一般的な可聴域である２０Ｈｚ程度から２００００Ｈｚ程度までの範囲である。なお、異音の周波数範囲の予想がつく場合などには、上記所定の周波数範囲を、異音の周波数範囲が含まれる、より狭い範囲に限定してもよい。 Subsequently, the sound identification condition setting support device 1 divides the predetermined frequency range for each of the selected spectrogram data into blocks having the number of blocks set according to a predetermined rule, and the sound intensity distribution of each block is determined. Histogram data indicating a histogram is generated. The predetermined frequency range is, for example, a range from about 20 Hz to about 20000 Hz, which is a general audible range. When the abnormal frequency range is predicted, the predetermined frequency range may be limited to a narrower range including the abnormal frequency range.

続いて、音識別条件設定支援装置１は、識別関数生成用の有異音サンプル音のスペクトログラムデータまたは識別関数生成用の無異音サンプル音のスペクトログラムデータとしてそれぞれ選択された複数のスペクトログラムデータにおけるヒストグラムデータを用いて識別関数を生成し、生成した識別関数と、この時点で設定されているブロック数を記憶部１８に記憶する。 Subsequently, the sound identification condition setting support device 1 has histograms in a plurality of spectrogram data respectively selected as spectrogram data of anomalous sample sound for discriminating function generation or spectrogram data of an allogeneic sample sound for discriminant function generation. An identification function is generated using the data, and the generated identification function and the number of blocks set at this time are stored in the storage unit 18.

続いて、音識別条件設定支援装置１は、複数の有異音サンプル音のスペクトログラムデータのうち、例えば残りの半数の有異音サンプル音のスペクトログラムデータを識別関数検証用の有異音サンプル音のスペクトログラムデータとして選択する。また、音識別条件設定支援装置１は、複数の無異音サンプル音のスペクトログラムデータのうち、例えば残りの半数の無異音サンプル音のスペクトログラムデータを識別関数検証用の無異音サンプル音のスペクトログラムデータとして選択する。続いて、音識別条件設定支援装置１は、選択したスペクトログラムデータのそれぞれにつき、所定の周波数範囲を、所定の規則に従って設定したブロック数のブロックに分割し、ブロックのそれぞれにつき、音の強度分布のヒストグラムを示すヒストグラムデータを生成する。 Subsequently, the sound identification condition setting support device 1 uses, for example, the spectrogram data of the remaining half-tone sample sounds among the spectrogram data of the plurality of different-tone sample sounds, for example, Select as spectrogram data. In addition, the sound identification condition setting support apparatus 1 uses, for example, the spectrogram data of the remaining half-sampled sound samples of the plurality of non-sampled sample sounds, and the spectrogram of the non-sampled sample sounds for discrimination function verification. Select as data. Subsequently, the sound identification condition setting support device 1 divides the predetermined frequency range for each of the selected spectrogram data into blocks having the number of blocks set according to a predetermined rule, and the sound intensity distribution of each block is determined. Histogram data indicating a histogram is generated.

続いて、音識別条件設定支援装置１は、識別関数検証用の有異音サンプル音のスペクトログラムデータまたは識別関数検証用の無異音サンプル音のスペクトログラムデータとしてそれぞれ選択された複数のスペクトログラムデータにおけるヒストグラムデータを、生成された識別関数に入力し、１つのヒストグラムデータが入力されるごとに、当該識別関数による識別を実行し、識別結果の正否を判定する。この結果、当該識別関数に入力されたヒストグラムデータの個数に対応する個数の判定結果が得られる。続いて、音識別条件設定支援装置１は、得られた複数の判定結果を用い、当該識別関数の識別正解率を算定し、算定した識別正解率を記憶部１８に記憶する。識別正解率とは、識別関数が異音の有無を正しく識別する確率である。 Subsequently, the sound identification condition setting support device 1 performs histograms in a plurality of spectrogram data respectively selected as spectrogram data of the abnormal sound sample for discrimination function verification or spectrogram data of the non-sound sample sound for verification of the discrimination function. Data is input to the generated discriminant function, and each time one piece of histogram data is input, discriminating by the discriminant function is executed to determine whether the discrimination result is correct or not. As a result, the number of determination results corresponding to the number of histogram data input to the discrimination function is obtained. Subsequently, the sound identification condition setting support device 1 calculates the identification accuracy rate of the identification function using the obtained plurality of determination results, and stores the calculated identification accuracy rate in the storage unit 18. The discrimination correct answer rate is the probability that the discrimination function correctly discriminates the presence or absence of abnormal sounds.

続いて、音識別条件設定支援装置１は、所定の規則に従ってブロック数の設定を変更しながら、上述した識別関数の生成および識別正解率の算定を複数回実行する。これにより、音識別条件設定支援装置１の記憶部１８には、複数のブロック数、それぞれのブロック数の設定の下で生成されたヒストグラムデータを用いて生成された複数の識別関数、これら識別関数の識別正解率が記憶される。 Subsequently, the sound identification condition setting support device 1 performs the above-described generation of the identification function and calculation of the identification accuracy rate a plurality of times while changing the setting of the number of blocks according to a predetermined rule. Accordingly, the storage unit 18 of the sound identification condition setting support device 1 stores a plurality of identification functions generated using the plurality of blocks, histogram data generated under the setting of the number of blocks, and these identification functions. Is stored.

続いて、音識別条件設定支援装置１は、記憶部１８に記憶された識別正解率に基づき、記憶部１８に記憶された複数の識別関数のうち、識別正解率が最も高い識別関数およびそれに対応するブロック数を選定し、これらを確定識別関数および確定ブロック数として記憶部１８に記憶する。このようにして音識別条件設定支援装置１により選定・記憶された、確定識別関数および確定ブロック数は、採取音中の異音の有無を異音判定装置２１（図１３参照）により判定する際に用いられる。 Subsequently, the sound identification condition setting support device 1 is based on the identification accuracy rate stored in the storage unit 18 and the identification function having the highest identification accuracy rate among the plurality of identification functions stored in the storage unit 18 and corresponding to it. The number of blocks to be selected is selected, and these are stored in the storage unit 18 as a definite identification function and a definite block number. The determined identification function and the number of determined blocks selected and stored by the sound identification condition setting support device 1 in this way are used when the abnormal sound determination device 21 (see FIG. 13) determines the presence or absence of abnormal noise in the collected sound. Used for.

ここで、音のスペクトログラムおよび強度分布のヒストグラムについて図４ないし図８を参照しながら説明する。図４は有異音サンプル音の波形の具体例を示す。図４において横軸が時間であり、縦軸が音の強さである。図５は、図４に示す有異音サンプル音のスペクトログラムを示している。図５において横軸が時間であり、縦軸が周波数であり、色が音の強さを示している。なお、図５においては、音の強さが黒と白の２段階で表されており、白よりも黒の方が音の強さが大きいことを示しているが、実際のスペクトログラムのグラフは、音の強さが多段階（例えば２５６段階）の色（例えば色の明度）の違いで表される。 Here, the spectrogram of sound and the histogram of intensity distribution will be described with reference to FIGS. FIG. 4 shows a specific example of the waveform of the abnormal sound sample. In FIG. 4, the horizontal axis represents time, and the vertical axis represents sound intensity. FIG. 5 shows a spectrogram of the abnormal sample sound shown in FIG. In FIG. 5, the horizontal axis is time, the vertical axis is frequency, and the color indicates the intensity of sound. In FIG. 5, the sound intensity is represented in two stages of black and white, and black indicates that the sound intensity is larger than white, but the actual spectrogram graph is The intensity of sound is represented by a difference in color (for example, lightness of color) in multiple levels (for example, 256 levels).

図６は、図５中のブロックＦ１における音の強度分布のヒストグラムを示し、図７は、図５中のブロックＦ２における音の強度分布のヒストグラムを示し、図８は、図５中のブロックＦ３における音の強度分布のヒストグラムを示している。図６ないし図８のそれぞれにおいて、縦軸が度数／全度数であり、横軸が音の強さ（強度）である。ここで、有異音サンプル音と無異音サンプル音とは、両者の長さ（開始時点から終了時点までの時間）が互いに同じであり、有異音サンプル音または無異音サンプル音の長さ（例えば１時間）を所定の単位時間（例えば１秒）で割った値が「全度数」に相当する。また、有異音サンプル音または無異音サンプル音の開始時点から終了時点までの間において、上記所定の単位時間を単位にし、音の強さが同じ程度となった回数をカウントすることにより得られた値が「度数」に相当する。 6 shows a histogram of sound intensity distribution in block F1 in FIG. 5, FIG. 7 shows a histogram of sound intensity distribution in block F2 in FIG. 5, and FIG. 8 shows block F3 in FIG. The histogram of the sound intensity distribution in is shown. In each of FIG. 6 to FIG. 8, the vertical axis represents frequency / total frequency, and the horizontal axis represents sound intensity (intensity). Here, the noise sample sound and the non-noise sample sound have the same length (time from the start time to the end time), and the length of the sound sample sound or the sound sample sound without noise is the same. A value obtained by dividing (for example, 1 hour) by a predetermined unit time (for example, 1 second) corresponds to “total frequency”. In addition, it is obtained by counting the number of times that the sound intensity is the same in the predetermined unit time from the start point to the end point of the abnormal sound sample or non-noise sample sound. The obtained value corresponds to “frequency”.

図５に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ１では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが大きい状態がほぼ連続している。このブロックＦ１における音の強度分布では、図６に示すように、度数／全度数がピークとなる音の強さがＰ１であり、分布がＰ１に集中し、分布の範囲が狭い。 In the spectrogram of the abnormal sound sample shown in FIG. 5, in the block F1, the state where the sound intensity is high is continuous from the start time to the end time of the abnormal sound sample. In the sound intensity distribution in the block F1, as shown in FIG. 6, the sound intensity at which the frequency / total frequency has a peak is P1, the distribution is concentrated on P1, and the distribution range is narrow.

また、図５に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ２では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが小さい状態がほぼ連続している。このブロックＦ２における音の強度分布では、図７に示すように、度数／全度数がピークとなる音の強さがＰ２であり、Ｐ２はＰ１よりも小さい値である。また、ブロックＦ２における音の強度分布では、分布がＰ２に集中し、分布の範囲が狭い。 Further, in the spectrogram of the abnormal sound sample shown in FIG. 5, in the block F2, the state where the sound intensity is low is continuous from the start time to the end time of the abnormal sound sample. In the sound intensity distribution in the block F2, as shown in FIG. 7, the sound intensity at which the frequency / total frequency reaches its peak is P2, and P2 is a value smaller than P1. Further, in the sound intensity distribution in the block F2, the distribution is concentrated on P2, and the distribution range is narrow.

また、図５に示す有異音サンプル音のスペクトログラムにおいて、ブロックＦ３では、有異音サンプル音の開始時点から終了時点までの間において、音の強さが大きい状態と音の強さが小さい状態とが混在している。このブロックＦ３における音の強度分布では、図８に示すように、分布の範囲が比較的広い。 Further, in the spectrogram of the noise sample sound shown in FIG. 5, in the block F3, the sound intensity is high and the sound intensity is low between the start time and the end time of the noise sample sound. Are mixed. In the sound intensity distribution in the block F3, as shown in FIG. 8, the range of the distribution is relatively wide.

このように、有異音サンプル音のスペクトログラムにおけるブロックごとの強度分布のヒストグラムによれば、有異音サンプル音におけるブロックごとの音の強さの時間的変化を表すことができる。したがって、有異音サンプル音、無異音サンプル音、被判定音といった採取音のそれぞれにつき、スペクトログラムにおけるブロックごとの音の強度分布のヒストグラムを用いることにより、採取音および採取音に含まれる異音の周波数または音の強さの時間的変化を勘案しつつ、採取音中の異音の有無を高精度に識別可能な識別関数を生成することができる。 As described above, according to the histogram of the intensity distribution for each block in the spectrogram of the abnormal sound sample, the temporal change in the sound intensity for each block of the abnormal sound sample can be expressed. Therefore, by using a histogram of the sound intensity distribution for each block in the spectrogram for each sampled sound such as abnormal sound sample, non-sound sample sound, judgment sound, abnormal sound contained in the sampled sound and the collected sound It is possible to generate a discriminant function capable of accurately discriminating the presence or absence of abnormal sounds in the collected sound while taking into account temporal changes in the frequency or sound intensity.

（音識別条件設定支援装置の構成）
図１に示すように、音識別条件設定支援装置１は演算処理部１１を備え、演算処理部１１は、音声データからスペクトログラムデータを生成するスペクトログラム生成部１２と、スペクトログラムデータを複数のブロックに分割し、各ブロックにおける音の強度分布のヒストグラムデータを生成するヒストグラム生成部１３と、各ブロックにおける音の強度分布のヒストグラムデータを用いて識別関数を生成する識別関数生成部１４と、識別関数の識別正解率を算定する正解率算定部１５と、識別正解率の最も高い識別関数およびそれに対応するブロック数を選定する識別条件選定部１６と、スペクトログラムデータの所定の周波数範囲を複数のブロックに分割する際のブロック数（すなわち所定の周波数範囲の分割数）を設定または変更する処理、および音識別条件設定支援処理の流れの制御等を行う制御部１７とを備えている。さらに、音識別条件設定支援装置１は、確定識別関数、確定ブロック数、および音識別条件設定支援処理中に保持すべき種々のデータを記憶する記憶部１８と、有異音サンプル音または無異音サンプル音の音声データの入力、操作入力、処理の進捗状況等の表示等を行う入出力部１９とを備えている。 (Configuration of sound identification condition setting support device)
As shown in FIG. 1, the sound identification condition setting support device 1 includes an arithmetic processing unit 11. The arithmetic processing unit 11 divides the spectrogram data into a plurality of blocks, and a spectrogram generation unit 12 that generates spectrogram data from speech data. A histogram generating unit 13 that generates histogram data of the sound intensity distribution in each block, a discrimination function generating unit 14 that generates a discrimination function using the histogram data of the sound intensity distribution in each block, and discrimination of the discrimination function A correct rate calculation unit 15 that calculates the correct rate, an identification function selection unit 16 that selects an identification function with the highest identification accuracy rate and the number of blocks corresponding thereto, and a predetermined frequency range of spectrogram data is divided into a plurality of blocks. Set or change the number of blocks (ie the number of divisions in a given frequency range) It comprises processing and sound identification condition setting support processing and control unit 17 for control of flow. Furthermore, the sound identification condition setting support device 1 includes a storage unit 18 that stores a definite identification function, the number of definite blocks, and various data that should be held during the sound identification condition setting support process, and a sampled noise or non-uniform sound. An input / output unit 19 is provided for inputting sound data of sound sample sounds, inputting operations, displaying the progress of processing, and the like.

このような構成を有する音識別条件設定支援装置１は、例えばパーソナルコンピュータにより実現することができる。具体的に述べると、演算処理部１１は、パーソナルコンピュータに内蔵されたＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｇＵｎｉｔ）により実現することができる。また、スペクトログラム生成部１２、ヒストグラム生成部１３、識別関数生成部１４、正解率算定部１５、識別条件選定部１６、および制御部１７は、パーソナルコンピュータを音識別条件設定支援装置１として機能させるためのコンピュータプログラムをＣＰＵに読み込ませて実行させることにより実現することができる。また、記憶部１８は、パーソナルコンピュータが有する記憶装置（例えばハードディスクドライブ装置、フラッシュメモリ等）により実現することができる。また、入出力部１９は、パーソナルコンピュータが有する入出力装置（例えばキーボード、マウス、モニタ等）により実現することができる。 The sound identification condition setting support device 1 having such a configuration can be realized by a personal computer, for example. More specifically, the arithmetic processing unit 11 can be realized by a CPU (Central Processing Unit) built in a personal computer. Further, the spectrogram generation unit 12, the histogram generation unit 13, the discrimination function generation unit 14, the accuracy rate calculation unit 15, the identification condition selection unit 16, and the control unit 17 make the personal computer function as the sound identification condition setting support device 1. This can be realized by causing the CPU to read and execute the computer program. The storage unit 18 can be realized by a storage device (for example, a hard disk drive device, a flash memory, or the like) included in the personal computer. The input / output unit 19 can be realized by an input / output device (for example, a keyboard, a mouse, a monitor, etc.) included in the personal computer.

（音識別条件設定支援処理）
図２および図３は、音識別条件設定支援装置１により行われる音識別条件設定支援処理を示している。作業者は、処理の準備として、例えば、１００個の有異音サンプル音Ａ１〜Ａ１００の音声データと、１００個の無異音サンプル音Ｂ１〜Ｂ１００の音声データを用意する。 (Sound identification condition setting support processing)
2 and 3 show a sound identification condition setting support process performed by the sound identification condition setting support device 1. FIG. As a preparation for processing, for example, the worker prepares audio data of 100 abnormal sound samples A1 to A100 and audio data of 100 non-natural sound samples B1 to B100.

続いて、作業者は、音識別条件設定支援装置１の入出力部１９を操作し、有異音サンプル音Ａ１〜Ａ１００の音声データおよび無異音サンプル音Ｂ１〜Ｂ１００の音声データを音識別条件設定支援装置１に入力する。なお、これら音声データを音識別条件設定支援装置１に入力する際には、音識別条件設定支援装置１が、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００の音声データを１つ１つ識別可能となるようにすると共に、音声データが有異音サンプル音の音声データであるのか無異音サンプル音の音声データであるのかを識別可能となるようにする。これを実現する方法として、例えば、各音声データに、各音声データを識別するための識別情報、および各音声データが有異音サンプル音の音声データであるのか無異音サンプル音の音声データであるのかを示す分類情報を付す方法がある。この方法は例えば音声データのファイル名の付け方を工夫することで容易に実現することができる。 Subsequently, the operator operates the input / output unit 19 of the sound identification condition setting support device 1 to set the sound data of the abnormal sound samples A1 to A100 and the sound data of the non-natural sound samples B1 to B100 as sound recognition conditions. Input to the setting support apparatus 1. When these sound data are input to the sound identification condition setting support device 1, the sound identification condition setting support device 1 receives the sound data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100. In addition to being able to identify each one, it is possible to identify whether the sound data is the sound data of an abnormal sound sample or the sound data of an allophone sample sound. As a method for realizing this, for example, each voice data is identified with identification information for identifying each voice data, and whether each voice data is voice data of abnormal sample sound or voice data of non-sound sample sound. There is a method of attaching classification information indicating whether or not there is. This method can be easily realized, for example, by devising a method for naming the file name of the audio data.

続いて、作業者は、音識別条件設定支援処理を開始する旨の指示を音識別条件設定支援装置１に入力する。これに応じ、音識別条件設定支援装置１は、図２および図３に示す音識別条件設定支援処理を開始する。 Subsequently, the worker inputs an instruction to start the sound identification condition setting support process to the sound identification condition setting support device 1. In response to this, the sound identification condition setting support device 1 starts the sound identification condition setting support process shown in FIGS.

音識別条件設定支援処理において、図２に示すように、まず、スペクトログラム生成部１２が、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００の音声データのそれぞれについてスペクトログラムデータを生成し、生成した各スペクトログラムデータを記憶部１８に記憶する（ステップＳ１）。 In the sound identification condition setting support process, as shown in FIG. 2, first, the spectrogram generation unit 12 generates spectrogram data for each of the sound data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100. Then, the generated spectrogram data is stored in the storage unit 18 (step S1).

続いて、制御部１７が、所定の規則に従い、スペクトログラムデータの上記所定の周波数範囲を複数のブロックに分割する際のブロック数を設定する（ステップＳ２）。例えば、本実施形態では、上記所定の規則として、ブロック数の最小値を２０とし、最大値を８０とし、初回に設定するブロック数を上記最小値とし、ブロック数の設定の変更を１回行うごとにブロック数を５増加させるといった規則を採用する。 Subsequently, the control unit 17 sets the number of blocks when dividing the predetermined frequency range of the spectrogram data into a plurality of blocks according to a predetermined rule (step S2). For example, in the present embodiment, as the predetermined rule, the minimum number of blocks is set to 20, the maximum value is set to 80, the number of blocks set for the first time is set to the minimum value, and the setting of the number of blocks is changed once. A rule of increasing the number of blocks by 5 every time is adopted.

ここで、図９はブロック数を２０に設定し、採取音のスペクトログラムの所定の周波数範囲をブロックＧに分割した状態を示し、図１０はブロック数を３０に設定し、採取音のスペクトログラムの所定の周波数範囲をブロックＨに分割した状態を示している。図９および図１０を見るとわかる通り、ブロック数が多くなると、各ブロックの幅（周波数範囲）が小さくなる。 Here, FIG. 9 shows a state in which the number of blocks is set to 20, and a predetermined frequency range of the spectrogram of the collected sound is divided into blocks G. FIG. 10 shows a state in which the number of blocks is set to 30 and the spectrogram of the collected sound is predetermined. The frequency range is divided into blocks H. As can be seen from FIGS. 9 and 10, as the number of blocks increases, the width (frequency range) of each block decreases.

続いて、図２に示すように、制御部１７が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中から、未選択のスペクトログラムデータをランダムに１つ選択し、当該スペクトログラムデータを記憶部１８から読み出す（ステップＳ３）。続いて、制御部１７は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中からどの有異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部１８に記憶する。この履歴情報は、制御部１７がこの後にステップＳ３またはＳ１０において未選択のスペクトログラムデータを特定するために用いられる。 Subsequently, as shown in FIG. 2, the control unit 17 randomly selects one unselected spectrogram data from the spectrogram data of the abnormal sound samples A1 to A100, and stores the spectrogram data in the storage unit 18. (Step S3). Subsequently, the control unit 17 stores, in the storage unit 18, history information indicating which abnormal sound sample data is selected from the spectrogram data of the abnormal sound samples A1 to A100. This history information is used by the control unit 17 to specify spectrogram data that has not been selected in step S3 or S10.

続いて、ヒストグラム生成部１３が、選択されたスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数のブロックに分割し、分割した各ブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部１８に記憶する（ステップＳ４）。 Subsequently, the histogram generation unit 13 divides the predetermined frequency range of the selected spectrogram data into blocks having the number of blocks set in step S2, and generates histogram data of intensity distribution of each divided block. Each generated histogram data is stored in the storage unit 18 (step S4).

続いて、制御部１７が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち例えば５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えたか否かを判断する（ステップＳ５）。有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えていない場合（ステップＳ５：ＮＯ）、制御部１７は処理をステップＳ３に戻す。 Subsequently, the control unit 17 determines whether or not the processing for generating the histogram data for, for example, 50 spectrogram data among the spectrogram data of the abnormal sound samples A1 to A100 is finished (step S5). When the process of generating histogram data for 50 spectrogram data of the spectrogram data of the noise sample sounds A1 to A100 has not been completed (step S5: NO), the control unit 17 returns the process to step S3.

一方、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えた場合には（ステップＳ５：ＹＥＳ）、制御部１７が、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中から、未選択のスペクトログラムデータをランダムに１つ選択し、当該スペクトログラムデータを記憶部１８から読み出す（ステップＳ６）。続いて、制御部１７は、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中からどの無異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部１８に記憶する。この履歴情報は、制御部１７がこの後のステップＳ６またはＳ１５において未選択のスペクトログラムデータを特定するために用いられる。 On the other hand, when the process of generating the histogram data for 50 spectrogram data among the spectrogram data of the abnormal sound samples A1 to A100 is completed (step S5: YES), the control unit 17 performs the non-noise sample sound. One spectrogram data that has not been selected is selected at random from the spectrogram data of B1 to B100, and the spectrogram data is read from the storage unit 18 (step S6). Subsequently, the control unit 17 stores, in the storage unit 18, history information that indicates which spectrogram data of which anomalous sample sound is selected from the spectrogram data of the allomalous sample sounds B1 to B100. This history information is used by the control unit 17 to specify spectrogram data that has not been selected in the subsequent step S6 or S15.

続いて、ヒストグラム生成部１３が、選択されたスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数のブロックに分割し、分割した各ブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部１８に記憶する（ステップＳ７）。 Subsequently, the histogram generation unit 13 divides the predetermined frequency range of the selected spectrogram data into blocks having the number of blocks set in step S2, and generates histogram data of intensity distribution of each divided block. Each generated histogram data is stored in the storage unit 18 (step S7).

続いて、制御部１７が、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち例えば５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えたか否かを判断する（ステップＳ８）。無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えていない場合（ステップＳ８：ＮＯ）、制御部１７は処理をステップＳ６に戻す。 Subsequently, the control unit 17 determines whether or not the processing for generating the histogram data for, for example, 50 spectrogram data among the spectrogram data of the allophone sample sounds B1 to B100 is finished (step S8). When the process of generating the histogram data for 50 spectrogram data of the spectrogram data of the allophone sample sounds B1 to B100 has not been completed (step S8: NO), the control unit 17 returns the process to step S6.

一方、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち５０個のスペクトログラムデータについてヒストグラムデータを生成する処理を終えた場合には（ステップＳ８：ＹＥＳ）、識別関数生成部１４が、ステップＳ４で生成された５０個の有異音サンプル音における各ブロックのヒストグラムデータと、ステップＳ７で生成された５０個の無異音サンプル音における各ブロックのヒストグラムデータを記憶部１８から読み出し、これらのヒストグラムデータを用い、識別関数を生成する（ステップＳ９）。識別関数の生成には、例えばサポートベクターマシン等の学習モデルを用いたパターン認識手法（例えば特開２００５−３５２９９７号公報参照）を採用する。なお、ニューラルネットワーク等の他の学習モデルを識別関数の生成に採用してもよい。続いて、識別関数生成部１４は、生成した識別関数を、それに対応するブロック数（例えば、現在設定されているブロック数を示す数値等の情報）と共に記憶部１８に記憶する。 On the other hand, when the process of generating the histogram data for 50 spectrogram data among the spectrogram data of the allophone sample sounds B1 to B100 is finished (step S8: YES), the discriminant function generation unit 14 performs step S4. The histogram data of each block in the generated 50 abnormal sound samples and the histogram data of each block in the 50 sound sample sounds generated in step S7 are read from the storage unit 18, and these histogram data are read out. Is used to generate a discrimination function (step S9). For the generation of the discriminant function, for example, a pattern recognition method using a learning model such as a support vector machine (see, for example, JP-A-2005-352997) is employed. Note that another learning model such as a neural network may be employed for generating the discrimination function. Subsequently, the discriminant function generation unit 14 stores the generated discriminant function in the storage unit 18 together with the number of blocks corresponding thereto (for example, information such as a numerical value indicating the number of blocks currently set).

続いて、制御部１７が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中から、未選択のスペクトログラムデータをランダムに１つ選択し、当該スペクトログラムデータを記憶部１８から読み出す（ステップＳ１０）。続いて、制御部１７は、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中からどの有異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部１８に記憶する。この履歴情報は、制御部１７がこの後のステップＳ３またはＳ１０において未選択のスペクトログラムデータを特定するために用いられる。 Subsequently, the control unit 17 randomly selects one non-selected spectrogram data from the spectrogram data of the abnormal sound samples A1 to A100, and reads the spectrogram data from the storage unit 18 (step S10). Subsequently, the control unit 17 stores, in the storage unit 18, history information indicating which abnormal sound sample data is selected from the spectrogram data of the abnormal sound samples A1 to A100. The history information is used by the control unit 17 to specify unselected spectrogram data in the subsequent step S3 or S10.

続いて、ヒストグラム生成部１３が、選択されたスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数のブロックに分割し、分割した各ブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部１８に記憶する（ステップＳ１１）。 Subsequently, the histogram generation unit 13 divides the predetermined frequency range of the selected spectrogram data into blocks having the number of blocks set in step S2, and generates histogram data of intensity distribution of each divided block. Each generated histogram data is stored in the storage unit 18 (step S11).

続いて、図３に示すように、正解率算定部１５が、ステップＳ１１で生成されたヒストグラムデータを、ステップＳ９で生成された識別関数に入力し、当該識別関数により有異音サンプル音中の異音の有無の識別を実行する（ステップＳ１２）。 Subsequently, as shown in FIG. 3, the accuracy rate calculation unit 15 inputs the histogram data generated in step S11 to the discrimination function generated in step S9, and the discrimination function uses the discrimination function to detect the noise sample sound. The presence / absence of abnormal noise is identified (step S12).

続いて、正解率算定部１５は、当該識別関数による識別の正否を判定する（ステップＳ１３）。ステップＳ１３の判定では、当該識別関数による識別の結果が「異音あり」である場合に正解であり、そうでない場合には不正解である。続いて、正解率算定部１５は、当該判定結果（例えば判定結果を示すフラグ等の情報）を判定結果情報として記憶部１８に記憶する。 Subsequently, the correct answer rate calculation unit 15 determines whether the identification function is correct or not (step S13). In the determination in step S13, the answer is correct when the result of the identification by the identification function is “abnormal noise”, otherwise it is incorrect. Subsequently, the correct answer rate calculation unit 15 stores the determination result (for example, information such as a flag indicating the determination result) in the storage unit 18 as determination result information.

続いて、制御部１７が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち例えば５０個のスペクトログラムデータを用いて識別関数による識別を実行し、当該識別関数の実行結果を判定する処理を終えたか否かを判断する（ステップＳ１４）。有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち５０個のスペクトログラムデータを用いた識別関数による識別の実行とその結果の判定を終えていない場合には（ステップＳ１４：ＮＯ）、制御部１７は処理を図２中のステップＳ１０に戻す。 Subsequently, the control unit 17 executes discrimination by the discrimination function using, for example, 50 spectrogram data among the spectrogram data of the abnormal sound samples A1 to A100, and finishes the process of determining the execution result of the discrimination function. It is determined whether or not (step S14). When the execution of discrimination by the discrimination function using 50 spectrogram data among the spectrogram data of the abnormal sound samples A1 to A100 and the determination of the result are not finished (step S14: NO), the control unit 17 The process returns to step S10 in FIG.

一方、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうち５０個のスペクトログラムデータを用いた識別関数による識別の実行とその結果の判定を終えた場合には（ステップＳ１４：ＹＥＳ）、制御部１７が、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中から、未選択のスペクトログラムデータをランダムに１つ選択し、当該スペクトログラムデータを記憶部１８から読み出す（ステップＳ１５）。続いて、制御部１７は、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中からどの無異音サンプル音のスペクトログラムデータを選択したかを示す履歴情報を記憶部１８に記憶する。この履歴情報は、制御部１７がこの後のステップＳ６またはＳ１５において未選択のスペクトログラムデータを特定するために用いられる。 On the other hand, when the execution of the discrimination by the discrimination function using 50 spectrogram data of the spectrogram data of the unusual sound samples A1 to A100 and the determination of the result are finished (step S14: YES), the control unit 17 However, one non-selected spectrogram data is randomly selected from the spectrogram data of the allophone sample sounds B1 to B100, and the spectrogram data is read from the storage unit 18 (step S15). Subsequently, the control unit 17 stores, in the storage unit 18, history information that indicates which spectrogram data of which anomalous sample sound is selected from the spectrogram data of the allomalous sample sounds B1 to B100. This history information is used by the control unit 17 to specify spectrogram data that has not been selected in the subsequent step S6 or S15.

続いて、ヒストグラム生成部１３が、選択されたスペクトログラムデータの上記所定の周波数範囲を、ステップＳ２において設定されたブロック数のブロックに分割し、分割した各ブロックの強度分布のヒストグラムデータを生成し、生成した各ヒストグラムデータを記憶部１８に記憶する（ステップＳ１６）。 Subsequently, the histogram generation unit 13 divides the predetermined frequency range of the selected spectrogram data into blocks having the number of blocks set in step S2, and generates histogram data of intensity distribution of each divided block. Each generated histogram data is stored in the storage unit 18 (step S16).

続いて、正解率算定部１５が、ステップＳ１６で生成されたヒストグラムデータを、ステップＳ９で生成された識別関数に入力し、当該識別関数により無異音サンプル音中の異音の有無の識別を実行する（ステップＳ１７）。 Subsequently, the correct answer rate calculation unit 15 inputs the histogram data generated in step S16 to the discrimination function generated in step S9, and uses the discrimination function to identify the presence or absence of abnormal sounds in the non-abnormal sound sample. Execute (Step S17).

続いて、正解率算定部１５は、当該識別関数による識別の正否を判定する（ステップＳ１８）。ステップＳ１８の判定では、当該識別関数による識別の結果が「異音なし」である場合に正解であり、そうでない場合には不正解である。続いて、正解率算定部１５は、当該判定結果を判定結果情報として記憶部１８に記憶する。 Subsequently, the correct answer rate calculation unit 15 determines whether the discrimination function is correct or not (step S18). In the determination in step S18, the answer is correct when the result of the identification by the identification function is “no abnormal sound”, otherwise it is incorrect. Subsequently, the correct answer rate calculation unit 15 stores the determination result in the storage unit 18 as determination result information.

続いて、制御部１７が、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち例えば５０個のスペクトログラムデータを用いて識別関数による識別を実行し、当該識別関数による識別を判定する処理を終えたか否かを判断する（ステップＳ１９）。無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち５０個のスペクトログラムデータを用いた識別関数による識別の実行とその結果の判定を終えていない場合には（ステップＳ１９：ＮＯ）、制御部１７は処理をステップＳ１５に戻す。 Subsequently, the control unit 17 has performed identification using an identification function using, for example, 50 spectrogram data among the spectrogram data of the allophone sample sounds B1 to B100, and has finished the process of determining identification based on the identification function It is determined whether or not (step S19). When the execution of the discrimination by the discriminant function using the 50 spectrogram data of the spectrogram data of the allophone sample sounds B1 to B100 and the determination of the result are not finished (step S19: NO), the control unit 17 The process returns to step S15.

一方、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうち５０個のスペクトログラムデータを用いた識別関数による識別の実行とその結果の判定を終えた場合には（ステップＳ１９：ＹＥＳ）、制御部１７は処理をステップＳ２０へ移行させる。 On the other hand, when the execution of the discrimination by the discrimination function using the 50 spectrogram data of the spectrogram data of the allophone sample sounds B1 to B100 and the determination of the result are finished (step S19: YES), the control unit 17 Shifts the process to step S20.

以上のステップＳ３〜Ｓ１９の処理により、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータの中からランダムに選択された半数のスペクトログラムデータと、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータの中からランダムに選択された半数のスペクトログラムデータを用いて識別関数が生成され、当該識別関数の実行結果の正否の判定が、有異音サンプル音Ａ１〜Ａ１００のスペクトログラムデータのうちの残りの半数のスペクトログラムデータと、無異音サンプル音Ｂ１〜Ｂ１００のスペクトログラムデータのうちの残りの半数のスペクトログラムデータを用いて行われたこととなる。 By the processes in steps S3 to S19, half of the spectrogram data randomly selected from the spectrogram data of the abnormal sound samples A1 to A100 and the spectrogram data of the allophone sample sounds B1 to B100 are randomly selected. The discriminant function is generated using the half of the spectrogram data selected in the above, and whether or not the execution result of the discriminant function is correct is determined by determining the remaining half of the spectrogram data of the sampled sounds A1 to A100 and the remaining half of the spectrogram data. That is, the other half of the spectrogram data of the non-sound sample sounds B1 to B100 is used.

続いて、ステップＳ２０において、制御部１７が、識別関数の生成およびその実行結果の判定（ステップＳ３〜Ｓ１９）を複数回（例えば２００回）繰り返したか否かを判断する。識別関数の生成およびその実行結果の判定を２００回繰り返していない場合には（ステップＳ２０：ＮＯ）、制御部１７は処理をステップＳ３に戻し、スペクトログラムデータのランダム選択を初めからやり直すために、スペクトログラムデータの選択の履歴を示す上記履歴情報をすべて消去（初期化）する。 Subsequently, in step S20, the control unit 17 determines whether or not the generation of the identification function and the determination of the execution result (steps S3 to S19) have been repeated a plurality of times (for example, 200 times). If the generation of the discriminant function and the determination of the execution result have not been repeated 200 times (step S20: NO), the control unit 17 returns the process to step S3 and repeats the spectrogram data random selection from the beginning. All the history information indicating the data selection history is erased (initialized).

一方、識別関数の生成およびその実行結果の判定を２００回繰り返した場合には（ステップＳ２０：ＹＥＳ）、制御部１７は処理をステップＳ２１に移行させる。 On the other hand, when the generation of the identification function and the determination of the execution result are repeated 200 times (step S20: YES), the control unit 17 shifts the process to step S21.

以上のステップＳ３〜Ｓ２０の処理により、識別関数の生成に用いられる５０個の有異音サンプル音の組合せ、および識別関数の生成に用いられる５０個の無異音サンプル音の組合せがそれぞれ２００通り生成され、これら２００通りの組合せを用いて２００個の識別関数が生成されたことになる。さらに、ステップＳ３〜Ｓ２０の処理により、識別関数の実行結果の判定に用いられる５０個の有異音サンプル音の組合せ、および識別関数の実行結果の判定に用いられる５０個の無異音サンプル音の組合せがそれぞれ２００通り生成され、これら２００通りの組合せを用い、各識別関数につき識別結果の判定が行われたことになる。ステップＳ３〜Ｓ２０を２００回実行し終えたとき、記憶部１８には、２００００個の判定結果情報が記憶されている。このように、識別関数の生成およびその実行結果の判定を繰り返し行い、多数の判定結果情報を得ることで、後にステップＳ２１で算定される識別正解率の信頼性を高めることができる。 Through the processes in steps S3 to S20, 200 combinations of 50 different sample sounds used for generating the discriminant function and 200 combinations of 50 noise samples used for generating the discriminant function are provided. Thus, 200 discriminant functions are generated using these 200 combinations. Further, by the processing of steps S3 to S20, the combination of 50 different sampled sounds used for determining the execution result of the discriminant function, and 50 non-sound sampled sounds used for determining the execution result of the discriminant function. 200 combinations are generated, and the identification results are determined for each identification function using these 200 combinations. When Steps S3 to S20 have been executed 200 times, the storage unit 18 stores 20000 pieces of determination result information. Thus, by repeatedly generating the identification function and determining the execution result and obtaining a large amount of determination result information, it is possible to improve the reliability of the identification accuracy rate calculated in step S21 later.

続いて、ステップＳ２１において、正解率算定部１５が、ステップＳ３〜Ｓ２０において得られた２００００個の判定結果情報を用い、現在設定されている１つのブロック数により生成された識別関数につき、１つの識別正解率を算定し、算定した識別正解率を記憶部１８に記憶する。 Subsequently, in step S21, the accuracy rate calculation unit 15 uses 20,000 pieces of determination result information obtained in steps S3 to S20, and uses one discriminant function generated with one currently set number of blocks. The identification correct answer rate is calculated, and the calculated identification correct answer rate is stored in the storage unit 18.

ここで、図１１は、ステップＳ２１における識別正解率の算定方法の一例を示している。図１１において、「○」は、ステップＳ１３またはＳ１８における判定結果が正解であったことを示し、「×」は、ステップＳ１３またはＳ１８における判定結果が不正解であったことを示す。「−」は、そのサンプル音のスペクトログラムデータが識別関数の生成に用いられ、ステップＳ１３またはＳ１８における識別関数による識別の正否判定には用いられなかったことを示す。図１１に示すように、ステップＳ２１における識別正解率の算定処理において、まず、サンプル音Ａ１〜Ａ１００およびＢ１〜Ｂ１００の各スペクトログラムデータについて識別正解率を算定する。１つのスペクトログラムデータについての識別正解率は、当該スペクトログラムデータのヒストグラムデータを識別関数に入力して得られた識別結果において、正解の個数と不正解の個数とを合わせた値で、正解の個数を割ることにより求めることができる。次に、サンプル音Ａ１〜Ａ１００およびＢ１〜Ｂ１００のスペクトログラムデータの識別正解率の平均値を算定し、この平均値を、現在設定されている１つのブロック数により生成された識別関数の識別正解率として用いる。 Here, FIG. 11 shows an example of the method of calculating the identification correct rate in step S21. In FIG. 11, “◯” indicates that the determination result in step S13 or S18 is correct, and “X” indicates that the determination result in step S13 or S18 is incorrect. “-” Indicates that the spectrogram data of the sample sound is used for generating the discriminant function, and has not been used for discriminating correctness by the discriminant function in step S13 or S18. As shown in FIG. 11, in the process of calculating the identification correct rate in step S21, first, the identification correct rate is calculated for the spectrogram data of the sample sounds A1 to A100 and B1 to B100. The classification accuracy rate for one spectrogram data is the sum of the number of correct answers and the number of incorrect answers in the identification result obtained by inputting the histogram data of the spectrogram data into the identification function. It can be determined by dividing. Next, the average value of the identification accuracy rates of the spectrogram data of the sample sounds A1 to A100 and B1 to B100 is calculated, and this average value is used as the identification accuracy rate of the identification function generated by the number of one block currently set. Used as

続いて、図３に示すように、制御部１７が、ブロック数のすべての設定についてステップＳ２〜Ｓ２１の処理の実行を終えたか否かを判断する（ステップＳ２２）。ブロック数のすべての設定についてステップＳ２〜Ｓ２１の処理の実行を終えていない場合には（ステップＳ２２：ＮＯ）、制御部１７は処理を図２中のステップＳ２に戻す。そして、ステップＳ２において、制御部１７が上記所定の規則に従ってブロック数を変更し、続いて、制御部１７、ヒストグラム生成部１３および正解率算定部１５がステップＳ３〜Ｓ２１の処理を実行する。 Subsequently, as illustrated in FIG. 3, the control unit 17 determines whether or not the processing of steps S2 to S21 has been completed for all the settings of the number of blocks (step S22). When the execution of steps S2 to S21 has not been completed for all settings of the number of blocks (step S22: NO), the control unit 17 returns the processing to step S2 in FIG. In step S2, the control unit 17 changes the number of blocks in accordance with the predetermined rule, and then the control unit 17, the histogram generation unit 13, and the correct answer rate calculation unit 15 execute the processes of steps S3 to S21.

一方、ブロック数のすべての設定についてステップＳ２〜Ｓ２１の処理の実行を終えた場合には（ステップＳ２２：ＹＥＳ）、制御部１７は処理をステップＳ２３に移行させる。上述したブロック数の設定についての所定の規則によれば、ブロック数の設定は全部で１３通りであるので、ブロック数が変更されながらステップＳ２〜Ｓ２１の処理が１３回繰り返され、この結果、１３通りのブロック数により生成された識別関数の識別正解率が算定される。１３通りのブロック数の設定についてステップＳ２〜Ｓ２１の処理の実行を終えたとき、記憶部１８には、１３通りにブロック数にそれぞれ対応する１３個の識別正解率が記憶されている。 On the other hand, when the execution of the processing of steps S2 to S21 is completed for all the settings of the number of blocks (step S22: YES), the control unit 17 shifts the processing to step S23. According to the predetermined rule regarding the setting of the number of blocks described above, there are 13 types of setting of the number of blocks. Therefore, the processing of steps S2 to S21 is repeated 13 times while the number of blocks is changed. The discrimination accuracy rate of the discrimination function generated by the number of street blocks is calculated. When the execution of the processing of steps S2 to S21 is completed for the setting of 13 types of blocks, the storage unit 18 stores 13 identification accuracy rates respectively corresponding to the number of blocks.

ここで、図１２は、上記所定の規則に従って設定されたブロック数にそれぞれ対応する識別関数の識別正解率の例を示している。図１２によれば、ブロック数が５０の識別正解率が９７％で最も高い。 Here, FIG. 12 shows an example of the identification accuracy rate of the identification function corresponding to the number of blocks set according to the predetermined rule. According to FIG. 12, the identification accuracy rate of 50 blocks is the highest at 97%.

続いて、図３中のステップＳ２３において、識別条件選定部１６が、記憶部１８に記憶された複数の識別正解率を読み出し、それらの中から最も高い識別正解率を特定する。続いて、識別条件選定部１６は、特定した識別正解率を有する識別関数に対応するブロック数を選定し、選定したブロック数を確定ブロック数として記憶部１８に記憶する。 Subsequently, in step S23 in FIG. 3, the identification condition selection unit 16 reads out a plurality of identification accuracy rates stored in the storage unit 18 and identifies the highest identification accuracy rate from among them. Subsequently, the identification condition selection unit 16 selects the number of blocks corresponding to the identification function having the identified identification accuracy rate, and stores the selected number of blocks in the storage unit 18 as the determined number of blocks.

続いて、識別関数生成部１４が、確定ブロック数として記憶されたブロック数の設定の下で生成された、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００のすべてのスペクトログラムデータにおけるすべてのヒストグラムデータを用い、改めて識別関数を生成し、生成した識別関数を確定識別関数として記憶部１８に記憶する（ステップＳ２４）。上記確定ブロック数および確定識別関数は、後述する異音判定装置２１による異音判定処理に用いられる。 Subsequently, all the spectrogram data of the abnormal sound samples A1 to A100 and the non-natural sound samples B1 to B100 generated under the setting of the number of blocks stored as the determined number of blocks. The discrimination function is newly generated using all the histogram data in, and the generated discrimination function is stored in the storage unit 18 as a definite discrimination function (step S24). The determined block number and the determined identification function are used for abnormal sound determination processing by the abnormal sound determination device 21 described later.

以上説明した通り、本発明の実施形態による音識別条件設定支援装置１によれば、採取音中の異音の有無を高精度に識別することができる識別関数を迅速かつ容易に少ない作業負担で生成することができる。すなわち、音識別条件設定支援装置１による識別関数の生成は、音声データを入力しさえすれば自動的に行われるので、識別関数を生成する作業、すなわち異音の特徴抽出に相当する作業を、コンピュータのモニタに表示された採取音のスペクトログラムを見ながら異音の特徴を見出すといった従来の特徴抽出作業と比較し、大幅に簡単化することができる。したがって、作業時間を短縮することができ、また、作業者の負担を軽減することができる。また、サンプル音を用意すれば、音識別条件設定支援装置１により識別関数を自動的に生成することができるので、作業者の経験の多少を問わず、高精度な識別能力を有する識別関数を生成することができる。 As described above, according to the sound identification condition setting support device 1 according to the embodiment of the present invention, an identification function that can accurately identify the presence or absence of an abnormal sound in a collected sound can be quickly and easily performed with a small work load. Can be generated. That is, since the generation of the discrimination function by the sound discrimination condition setting support device 1 is automatically performed as long as voice data is input, the task of generating the discrimination function, that is, the task corresponding to the feature extraction of abnormal noise, Compared with the conventional feature extraction operation of finding the features of abnormal sounds while looking at the spectrogram of the collected sound displayed on the computer monitor, it can be greatly simplified. Therefore, the working time can be shortened and the burden on the worker can be reduced. Also, if a sample sound is prepared, the sound identification condition setting support device 1 can automatically generate a discrimination function, so that a discrimination function having a high accuracy discrimination ability can be obtained regardless of the experience of the operator. Can be generated.

なお、上述した実施形態における音識別条件設定支援処理では、識別関数の生成および識別関数の識別結果の判定（ステップＳ３〜Ｓ２０）を例えば２００回繰り返す場合を例にあげたが、本発明はこれに限らず、識別関数の生成および識別関数の識別結果の判定を１回ないし１９９回、または２０１回以上行うこととしてもよい。 In the sound identification condition setting support process in the above-described embodiment, the case where the generation of the identification function and the determination of the identification result of the identification function (steps S3 to S20) are repeated 200 times, for example, is described in the present invention. Not limited to this, the generation of the discrimination function and the determination of the discrimination result of the discrimination function may be performed once to 199 times or 201 times or more.

また、上述した実施形態における音識別条件設定支援処理では、確定ブロック数を選定した後、確定ブロック数として選定されたブロック数の設定の下で生成された、有異音サンプル音Ａ１〜Ａ１００および無異音サンプル音Ｂ１〜Ｂ１００のすべてのスペクトログラムデータにおけるすべてのヒストグラムデータを用いて改めて識別関数を生成し、当該生成した識別関数を確定識別関数として選定する場合（図３中のステップＳ２４）を例にあげたが、本発明はこれに限らず、確定ブロック数として選定されたブロック数の設定の下で図２中のステップＳ９において生成された識別関数を確定識別関数として選定してもよい。 Further, in the sound identification condition setting support process in the above-described embodiment, after selecting the number of confirmed blocks, the sampled sound samples A1 to A100 generated under the setting of the number of blocks selected as the number of confirmed blocks and A case where a discrimination function is generated again using all histogram data in all spectrogram data of the allophone sample sounds B1 to B100, and the generated discrimination function is selected as a definite discrimination function (step S24 in FIG. 3). As an example, the present invention is not limited to this, and the discriminant function generated in step S9 in FIG. 2 may be selected as the deterministic discriminant function under the setting of the number of blocks selected as the deterministic block number. .

また、上述した実施形態における音識別条件設定支援処理では、識別関数の生成に用いるサンプル音のスペクトログラムデータおよび識別関数の識別結果の判定に用いるサンプル音のスペクトログラムデータをそれぞれランダムに選択する場合を例にあげたが、本発明はこれに限らない。例えば、音識別条件設定支援処理を始める前に、作業者が、採取した複数のサンプル音を、識別関数の生成に用いるサンプル音と、識別関数による識別結果の判定に用いるサンプル音に分類し、音識別条件設定支援装置１は、このような作業者の分類に従い、識別関数の生成に用いるサンプル音のスペクトログラムデータおよび識別関数の識別結果の判定に用いるサンプル音のスペクトログラムデータをそれぞれ選択するようにしてもよい。 Further, in the sound identification condition setting support processing in the above-described embodiment, an example in which the spectrogram data of the sample sound used for generating the discrimination function and the spectrogram data of the sample sound used for determining the discrimination result of the discrimination function are respectively selected at random. However, the present invention is not limited to this. For example, before starting the sound identification condition setting support process, the operator classifies the collected sample sounds into a sample sound used for generating the identification function and a sample sound used for determining the identification result by the identification function, The sound identification condition setting support device 1 selects the spectrogram data of the sample sound used for generating the discrimination function and the spectrogram data of the sample sound used for the determination of the discrimination result of the discrimination function according to the classification of the operator. May be.

（異音判定装置の構成）
図１３は、本発明の実施形態による音識別条件設定支援装置１によって生成された確定識別関数および確定ブロック数を用いて採取音中の異音の有無を判定する異音判定装置を示している。図１３において、異音判定装置２１は演算処理部３０を備え、演算処理部３０は、音声データからスペクトログラムデータを生成するスペクトログラム生成部３１と、スペクトログラムデータを複数のブロックに分割し、各ブロックにおける音の強度分布のヒストグラムデータを生成するヒストグラム生成部３２と、各ブロックにおけるヒストグラムデータを識別関数に入力し、異音の有無の識別を行う異音識別部３３と、異音判定処理の流れの制御等を行う制御部３４とを備えている。さらに、異音判定装置２１は、異音判定処理中に保持すべき種々のデータを記憶する記憶部３５と、被判定音の音声データの入力、操作入力、処理結果の表示等を行う入出力部３６とを備えている。 (Configuration of abnormal sound determination device)
FIG. 13 shows an abnormal sound determination device that determines the presence or absence of abnormal noise in the collected sound using the determined identification function and the number of determined blocks generated by the sound identification condition setting support device 1 according to the embodiment of the present invention. . In FIG. 13, the abnormal sound determination device 21 includes an arithmetic processing unit 30. The arithmetic processing unit 30 divides the spectrogram data into a plurality of blocks, a spectrogram generation unit 31 that generates spectrogram data from audio data, The histogram generation unit 32 that generates histogram data of the sound intensity distribution, the histogram data in each block is input to the identification function, the noise identification unit 33 that identifies the presence or absence of abnormal noise, and the flow of abnormal noise determination processing And a control unit 34 that performs control and the like. Further, the abnormal sound determination device 21 has a storage unit 35 for storing various data to be held during the abnormal sound determination processing, and input / output for performing input of sound data of the sound to be determined, operation input, display of processing results, and the like. Part 36.

異音判定装置２１は例えばパーソナルコンピュータにより実現することができる。具体的には、演算処理部３０は、パーソナルコンピュータに内蔵されたＣＰＵにより実現することができる。また、スペクトログラム生成部３１、ヒストグラム生成部３２、異音識別部３３、および制御部３４は、ＣＰＵにコンピュータプログラムを読み込ませて実行することにより実現することができる。また、記憶部３５は、パーソナルコンピュータが有するハードディスクドライブ装置等の記憶装置により実現することができる。また、入出力部３６は、パーソナルコンピュータが有するキーボード、モニタ等の入出力装置により実現することができる。なお、音識別条件設定支援装置１と異音判定装置２１とを１台のパーソナルコンピュータで同時に動作させるようにしてもよい。 The abnormal sound determination device 21 can be realized by a personal computer, for example. Specifically, the arithmetic processing unit 30 can be realized by a CPU built in a personal computer. The spectrogram generation unit 31, the histogram generation unit 32, the abnormal sound identification unit 33, and the control unit 34 can be realized by causing the CPU to read and execute a computer program. The storage unit 35 can be realized by a storage device such as a hard disk drive device of a personal computer. The input / output unit 36 can be realized by an input / output device such as a keyboard or a monitor included in the personal computer. Note that the sound identification condition setting support device 1 and the abnormal sound determination device 21 may be operated simultaneously by a single personal computer.

（異音判定処理）
図１４は異音判定装置２１による異音判定処理を示している。異音判定装置２１は、例えば送電線または機器等に近い場所で採取した比較的長時間（例えば２４時間）に及ぶ採取音につき、異音の有無を判定することができる。この異音判定の対象となる採取音の全体を被判定音という。また、異音判定装置２１による異音判定処理では、被判定音の音声データを比較的短時間（例えば１時間）のいくつかのセクションに分け、セクションごとに異音の有無の識別を行う。 (Abnormal sound judgment processing)
FIG. 14 shows an abnormal sound determination process performed by the abnormal sound determination device 21. The abnormal sound determination device 21 can determine the presence or absence of abnormal noise, for example, for a collected sound over a relatively long period of time (for example, 24 hours) collected at a location close to a power transmission line or equipment. The entire collected sound that is the target of the abnormal sound determination is referred to as a sound to be determined. In the abnormal sound determination processing by the abnormal sound determination device 21, the sound data of the sound to be determined is divided into several sections of a relatively short time (for example, 1 hour), and the presence or absence of the abnormal sound is identified for each section.

異音判定処理の準備として、作業者は、音識別条件設定支援装置１により生成された確定識別関数および確定ブロック数を異音判定装置２１に入力し、これらを異音判定装置２１の記憶部３５に記憶させる。また、作業者は、被判定音の音声データを異音判定装置２１に入力し、これを異音判定装置２１の記憶部３５に記憶させる。続いて、作業者が異音判定処理を開始する旨の指示を異音判定装置２１に入力すると、異音判定装置２１は異音判定処理を開始する。 In preparation for the abnormal sound determination process, the operator inputs the confirmed identification function and the number of determined blocks generated by the sound identification condition setting support device 1 to the abnormal sound determination device 21 and stores them in the storage unit of the abnormal sound determination device 21. 35. Further, the worker inputs the sound data of the sound to be determined to the abnormal sound determination device 21 and stores it in the storage unit 35 of the abnormal sound determination device 21. Subsequently, when the operator inputs an instruction to start the abnormal sound determination process to the abnormal sound determination apparatus 21, the abnormal sound determination apparatus 21 starts the abnormal sound determination process.

図１４に示すように、異音判定処理において、まず、制御部３４が、確定識別関数および確定ブロック数を記憶部３５から読み出す（ステップＳ３１）。 As shown in FIG. 14, in the abnormal sound determination process, first, the control unit 34 reads the definite identification function and the definite block number from the storage unit 35 (step S <b> 31).

続いて、制御部３４は、被判定音の音声データにおいて、異音の有無の識別をまだ実行していないセクションのうちの先頭のセクション（被判定音の音声データの開始時点に最も近いセクション）に対応する音声データを抽出する（ステップＳ３２）。 Subsequently, the control unit 34, in the sound data of the sound to be determined, the first section (section closest to the start time of the sound data of the sound to be determined) among the sections that have not yet been identified for the presence or absence of abnormal sound. Is extracted (step S32).

続いて、スペクトログラム生成部３１が、ステップＳ３２で抽出した音声データのスペクトログラムデータを生成する（ステップＳ３３）。 Subsequently, the spectrogram generation unit 31 generates spectrogram data of the voice data extracted in step S32 (step S33).

続いて、ヒストグラム生成部３２が、ステップＳ３３で生成したスペクトログラムデータにつき、所定の周波数範囲を確定ブロック数のブロックに分割し、各ブロックにつき、音の強度分布のヒストグラムデータを生成する（ステップＳ３４）。 Subsequently, for the spectrogram data generated in step S33, the histogram generation unit 32 divides a predetermined frequency range into blocks of the determined number of blocks, and generates histogram data of sound intensity distribution for each block (step S34). .

続いて、異音識別部３３が、各ブロックのヒストグラムデータを確定識別関数に入力し、異音の有無の識別を実行し、その識別結果を記憶部３５に記憶する（ステップＳ３５）。 Subsequently, the abnormal sound identification unit 33 inputs the histogram data of each block to the definite identification function, executes the identification of the presence or absence of abnormal noise, and stores the identification result in the storage unit 35 (step S35).

続いて、制御部３４が、被判定音の音声データのすべてのセクションにつき、異音の有無の識別を実行し終えたか否かを判断する（ステップＳ３６）。被判定音の音声データのすべてのセクションにつき異音の有無の識別を実行し終えていない場合には（ステップＳ３６：ＮＯ）、ステップＳ３２に戻り、制御部３４が、次のセクションに対応する音声データを被判定音の音声データから抽出し、スペクトログラム生成部３１、ヒストグラム生成部３２および異音識別部３３がステップＳ３３〜Ｓ３５の処理を実行する。 Subsequently, the control unit 34 determines whether or not the identification of the presence or absence of abnormal sounds has been completed for all sections of the sound data of the sound to be determined (step S36). If the identification of the presence / absence of abnormal sound has not been performed for all sections of the sound data of the sound to be determined (step S36: NO), the process returns to step S32, and the control unit 34 performs the sound corresponding to the next section. Data is extracted from the sound data of the sound to be determined, and the spectrogram generating unit 31, the histogram generating unit 32, and the abnormal sound identifying unit 33 execute the processes of steps S33 to S35.

一方、被判定音の音声データのすべてのセクションにつき異音の有無の識別を実行し終えた場合には（ステップＳ３６：ＹＥＳ）、制御部３４が、被判定音についての異音の有無の識別結果をセクションごとにモニタ等に表示する（ステップＳ３７）。作業者はモニタを見ることにより、被判定音中の異音の有無を即座に把握することができる。また、被判定音中の異音の有無をセクションごとに把握することができるので、被判定音として用いる採取音が比較的長時間に及び、かつその中に含まれる異音が比較的短時間である場合でも、当該採取音中の異音を迅速に発見することができる。 On the other hand, when the identification of the presence / absence of abnormal sound has been completed for all sections of the sound data of the sound to be determined (step S36: YES), the control unit 34 identifies the presence / absence of abnormal sound for the sound to be determined. The result is displayed on a monitor or the like for each section (step S37). By looking at the monitor, the operator can immediately grasp the presence or absence of abnormal noise in the sound to be determined. In addition, since the presence / absence of abnormal noise in the judged sound can be grasped for each section, the collected sound used as the judged sound is relatively long, and the abnormal sound included therein is relatively short. Even in this case, it is possible to quickly find an abnormal sound in the collected sound.

なお、本発明は、請求の範囲および明細書全体から読み取ることのできる発明の要旨または思想に反しない範囲で適宜変更可能であり、そのような変更を伴う音識別条件設定支援装置および音識別条件設定支援方法もまた本発明の技術思想に含まれる。 It should be noted that the present invention can be appropriately changed without departing from the spirit or idea of the invention that can be read from the claims and the entire specification, and the sound identification condition setting support device and the sound identification condition accompanying such a change. A setting support method is also included in the technical idea of the present invention.

１音識別条件設定支援装置
１１演算処理部
１２スペクトログラム生成部
１３ヒストグラム生成部
１４識別関数生成部
１５正解率算定部
１６識別条件選定部
１７制御部
１８記憶部
１９入出力部
２１異音判定装置
３０演算処理部
３１スペクトログラム生成部
３２ヒストグラム生成部
３３異音識別部
３４制御部
３５記憶部
３６入出力部 DESCRIPTION OF SYMBOLS 1 Sound identification condition setting assistance apparatus 11 Operation processing part 12 Spectrogram generation part 13 Histogram generation part 14 Discrimination function generation part 15 Correct answer rate calculation part 16 Discrimination condition selection part 17 Control part 18 Storage part 19 Input / output part 21 Abnormal sound determination apparatus 30 Arithmetic processing unit 31 Spectrogram generation unit 32 Histogram generation unit 33 Abnormal sound identification unit 34 Control unit 35 Storage unit 36 Input / output unit

Claims

A sound identification condition setting support device that supports setting of an identification condition for identifying the presence or absence of abnormal noise,
It receives spectrogram data showing the spectrograms of multiple abnormal sample sounds that contain abnormal sounds and non-noisy sample sounds that do not contain abnormal sounds, and divides the predetermined frequency range in each spectrogram data into multiple blocks. A histogram generation unit that generates histogram data indicating a histogram of the sound intensity distribution in each block;
Histogram data of each block for some noise sample sounds of the plurality of noise sample sounds, and each of the noise data of some noise samples of the plurality of noise sample sounds An identification function generator for generating an identification function for identifying the presence or absence of abnormal noise using block histogram data;
Histogram data of each block for some other abnormal sound samples of the plurality of abnormal sound samples, and some other non-noise samples of the non-natural sound samples Accurate rate calculation that inputs the histogram data of each block for sound into the discrimination function to identify the presence or absence of abnormal noise, and calculates the discrimination accuracy rate that is the probability that the discrimination function correctly identifies the presence or absence of abnormal noise And
While changing the setting of the number of blocks when dividing the predetermined frequency range in each spectrogram data into a plurality of blocks, the processing by the histogram generation unit, the discrimination function generation unit and the accuracy rate calculation unit is executed a plurality of times. As a result of executing the process a plurality of times, among the plurality of discriminant functions generated by the discriminant function generator, the discriminant function having the highest discrimination accuracy rate and the number of blocks set when generating the discriminant function are obtained. A sound identification condition setting support device, comprising: an identification condition selection unit that selects the identification condition .

The sound identification condition setting support apparatus according to claim 1, further comprising a spectrogram generation unit that generates spectrogram data indicating a spectrogram of the abnormal sound sample sound or the non-natural sound sample sound.

A sound identification condition setting support method for supporting setting of an identification condition for identifying the presence or absence of abnormal noise,
For each piece of spectrogram data showing the spectrograms of a plurality of abnormal sample sounds including abnormal sounds and a plurality of non-noise sample sounds not including abnormal sounds, the predetermined frequency range in the spectrogram data is divided into a plurality of blocks. A histogram generation step of generating histogram data indicating a histogram of the sound intensity distribution in each block;
Histogram data of each block for some noise sample sounds of the plurality of noise sample sounds, and each of the noise data of some noise samples of the plurality of noise sample sounds An identification function generating step for generating an identification function for identifying the presence or absence of abnormal noise using the histogram data of the block;
Histogram data of each block for some other abnormal sound samples of the plurality of abnormal sound samples, and some other non-noise samples of the non-natural sound samples Accurate rate calculation that inputs the histogram data of each block for sound into the discrimination function to identify the presence or absence of abnormal noise, and calculates the discrimination accuracy rate that is the probability that the discrimination function correctly identifies the presence or absence of abnormal noise A process,
While changing the setting of the number of blocks when dividing the predetermined frequency range in each spectrogram data into a plurality of blocks, the histogram generation step, the discrimination function generation step and the accuracy rate calculation step are executed a plurality of times, As a result of executing the process of a plurality of times, among the plurality of discriminant functions generated in the discriminant function generating step, the discriminant function having the highest discrimination accuracy rate and the number of blocks set when generating the discriminant function are A sound identification condition setting support method characterized by selecting as an identification condition.