JP2000347689A

JP2000347689A - Method and system for retrieving sound data, and recording medium recorded with sound data retrieving program

Info

Publication number: JP2000347689A
Application number: JP11157045A
Authority: JP
Inventors: Toshihisa Sekida; 俊久関田; Katsuhide Honma; 克英本間
Original assignee: NEC Software Niigata Ltd
Current assignee: NEC Software Niigata Ltd
Priority date: 1999-06-03
Filing date: 1999-06-03
Publication date: 2000-12-15

Abstract

PROBLEM TO BE SOLVED: To reduce useless collation processing to enhance efficiency for retrieval, by determining discriminatingly a portion necessary to be retrieved from an unnecessary portion out of input sound data. SOLUTION: This system is provided with a separator detecting means 201 for detecting a data consistent with separator information stored in a separator information storing part 207 based on a sound data accumulated in an input sound data accumulating part 206 to divide the sound data in a coinciding point, and for storing addresses of the two separator informations existing in a front side and a rear side of the divided sound information data into a collation-objective sound data storing part 209, and a data length comparing means 202 for calculating a data lenght of the divided sound data based on the addresses of the two separator informations stored in the data storing part 209 so as to confirm the consistency with a resistered data length stored in sound data storing part 208. Collation processing is conducted when the consistency is confirmed as a result of the confirmation by the data length comparing means 202.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、音データ検索方
法，音データ検索装置および音データ検索プログラムを
記録した記録媒体に関し、特に入力された音データと同
じ音データを照合して検索するにあたり、無駄な照合処
理を削減して検索の効率化をはかる音データ検索方法，
音データ検索装置および音データ検索プログラムを記録
した記録媒体に関する。なお、音データとは、音声のみ
ならずミュージック，チャイムの音等のあらゆる音を含
んだものとする。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a sound data search method, a sound data search device, and a recording medium on which a sound data search program is recorded. Sound data search method to reduce unnecessary collation processing and improve search efficiency,
The present invention relates to a sound data search device and a recording medium storing a sound data search program. It is assumed that the sound data includes not only voices but also all sounds such as music and chime sounds.

【０００２】[0002]

【従来の技術】従来、入力された音データと同じ音デー
タを照合して検索する場合、入力された音データを一定
時間毎（例えば、１／１００秒毎）の音データに分割し
た後、分割した音データをスペクトルデータに変換し
て、予め格納されている全ての候補音データと照合して
いた。この場合、候補音データ数が多くなると、照合処
理量もそれに伴って増加するため照合処理時間も増加
し、例えば１／１００秒間の音データを１／１００秒以
内で処理できなくなって処理遅延が蓄積し検索不能に至
る場合も発生し得るという問題があった（以下、第１の
従来の技術と記す）。2. Description of the Related Art Conventionally, when collating and searching the same sound data as input sound data, the input sound data is divided into sound data at regular intervals (for example, every 1/100 second), and then divided. The divided sound data is converted into spectrum data and collated with all the candidate sound data stored in advance. In this case, when the number of candidate sound data increases, the matching processing amount also increases, so that the matching processing time also increases. For example, sound data of 1/100 second cannot be processed within 1/100 second, and processing delay is reduced. There has been a problem that the data may be accumulated and search may not be possible (hereinafter referred to as a first conventional technique).

【０００３】そして、以上の第１の従来の技術における
問題を解決する技術として、特開平５−２０４３９６号
公報により開示されている音声認識方法および装置（以
下、第２の従来の技術と記す）がある。この第２の従来
の技術は、一定時間間隔で入力音データの特徴ベクトル
を求めて候補音データの予備選択を行い、この候補音デ
ータの選択数が予め定めた数を下回った時点で照合処理
を開始することにより、照合処理量を低減させている。[0003] As a technique for solving the above-mentioned problem in the first conventional technique, a speech recognition method and apparatus disclosed in Japanese Patent Laid-Open No. 5-204396 (hereinafter referred to as a second conventional technique). There is. According to the second conventional technique, preliminary selection of candidate sound data is performed by obtaining a feature vector of input sound data at predetermined time intervals, and a collation process is performed when the number of selected candidate sound data falls below a predetermined number. , The amount of collation processing is reduced.

【０００４】[0004]

【発明が解決しようとする課題】上述した第１および第
２の従来の技術においては、入力された全ての音データ
について照合処理を行っており、第２の従来の技術にお
いては、候補音データの数の方を絞り込むことによって
照合処理量を低減してはいるものの、候補音データの数
とも関連してさほど大きな低減は望めない。In the first and second prior arts described above, the collation processing is performed on all the input sound data. In the second prior art, the candidate sound data is compared. Although the amount of collation processing is reduced by narrowing down the number, the amount of collation processing cannot be expected to be so large in relation to the number of candidate sound data.

【０００５】本発明は、入力された音データに着目し、
この音データのうち検索が必要な部分と不要な部分とを
判定することにより、無駄な照合処理を削減して検索の
効率化をはかることを目的とする。The present invention focuses on input sound data,
It is an object of the present invention to determine a part of the sound data that requires a search and a part that does not need a search, thereby reducing unnecessary collation processing and improving the efficiency of the search.

【０００６】[0006]

【課題を解決するための手段】本発明の第１の音データ
検索方法は、入力された音データと予め登録，記憶され
た音データとを比較，照合して前記入力された音データ
と同じ音データを前記登録，記憶された音データの中か
ら検索する音データ検索方法であって、前記入力された
音データを分割するための分割点となるセパレータ情報
と少なくとも１つの指定時間値とを予め登録，記憶して
おき、前記入力された音データを前記セパレータ情報に
より分割し、この分割された音データの長さが前記指定
時間値のいずれかと一致した場合に、前記登録，記憶さ
れた音データとの比較，照合を行うことを特徴とする。According to a first sound data search method of the present invention, input sound data is compared with previously registered and stored sound data, and is compared with the inputted sound data. A sound data retrieval method for retrieving sound data from the registered and stored sound data, the method comprising: separating separator information serving as a division point for dividing the input sound data; and at least one designated time value. Registered and stored in advance, the input sound data is divided by the separator information, and when the length of the divided sound data matches one of the designated time values, the registered and stored sound data is stored. It is characterized in that comparison and collation with sound data are performed.

【０００７】本発明の第２の音データ検索方法は、入力
された音データを分割するための分割点となるセパレー
タ情報と少なくとも１つの指定時間値とを予め登録，記
憶しておく第１のステップと、前記入力された音データ
を蓄積する第２のステップと、前記入力された音データ
を前記セパレータ情報により分割する第３のステップ
と、前記第３のステップで分割された音データの長さを
算出する第４のステップと、前記第４のステップで算出
された音データの長さと、前記指定時間値のいずれかと
が一致するかを比較，判定する第５のステップと、前記
第５のステップの結果、一致した場合には、前記分割さ
れた音データを周波数成分のスペクトルデータに変換
し、予め登録，記憶されたスペクトルデータとの照合を
行って同じ音データを検索する第６のステップと、前記
第５のステップの結果、一致しなかった場合には、前記
分割された音データの照合を行わないで処理を終了し、
前記第３のステップに戻る第７のステップとを有するこ
とを特徴とする。In a second sound data search method according to the present invention, a separator information serving as a dividing point for dividing inputted sound data and at least one designated time value are registered and stored in advance. Step, a second step of storing the input sound data, a third step of dividing the input sound data by the separator information, and a length of the sound data divided in the third step. A fourth step of calculating the length of the sound data, a fifth step of comparing and determining whether the length of the sound data calculated in the fourth step matches any of the designated time values, As a result of the above step, if they match, the divided sound data is converted to frequency component spectrum data, and the same sound data is collated with previously registered and stored spectrum data. A sixth step of search, the fifth step results if they do not match, the process ends without performing the matching of the divided sound data,
And a seventh step of returning to the third step.

【０００８】本発明の第３の音データ検索方法は、本発
明の第１または第２の音データ検索方法において、前記
セパレータ情報は、一定時間以上継続した無音部である
ことを特徴とする。[0008] A third sound data search method according to the present invention is characterized in that, in the first or second sound data search method according to the present invention, the separator information is a silent portion that has continued for a predetermined time or more.

【０００９】本発明の第４の音データ検索方法は、本発
明の第１または第２の音データ検索方法において、前記
セパレータ情報は、ベルの音，時報，チャイム，競技の
スタートピストルの音，楽曲の一部等の一定の特性を持
つ音データであることを特徴とする。In a fourth sound data search method according to the present invention, in the first or second sound data search method according to the present invention, the separator information includes a bell sound, a time signal, a chime, a game start pistol sound, It is characterized by being sound data having a certain characteristic such as a part of a music piece.

【００１０】本発明の第１の音データ検索装置は、入力
された音データと予め登録，記憶された音データとを比
較，照合して前記入力された音データと同じ音データを
前記登録，記憶された音データの中から検索する音デー
タ検索装置であって、前記入力された音データを分割す
るための分割点となるセパレータ情報と少なくとも１つ
の指定時間値とを予め登録，記憶する手段と、前記入力
された音データを前記セパレータ情報により分割する手
段と、この分割された音データの長さが前記指定時間値
のいずれかと一致するかを判定する手段と、一致した場
合に前記登録，記憶された音データとの比較，照合を行
う手段とを有することを特徴とする。The first sound data search device of the present invention compares and compares input sound data with previously registered and stored sound data, and registers the same sound data as the inputted sound data. What is claimed is: 1. A sound data search device for searching among stored sound data, wherein means for pre-registering and storing at least one designated time value and separator information serving as a dividing point for dividing the inputted sound data. Means for dividing the input sound data according to the separator information; means for determining whether the length of the divided sound data matches any of the designated time values. And means for comparing and collating with the stored sound data.

【００１１】本発明の第２の音データ検索装置は、外部
から入力された音データを蓄積する入力音データ蓄積部
と、音データを抽出するための分割点となるセパレータ
情報を予め登録され記憶しているセパレータ情報記憶部
と、照合処理を行う対象の入力音データの長さを予め登
録，記憶している音データ長記憶部と、前記音データ長
記憶部に記憶された音データ長と一致する長さを持つ分
割後の音データおよびこの音データの前後にある２つの
セパレータ情報のアドレスを記憶する照合対象音データ
記憶部と、前記照合対象音データ記憶部に記憶されてい
る音データを周波数成分に変換したスペクトルデータを
抽出スペクトルデータとして記憶しているスペクトルデ
ータ記憶部と、照合の対象となるデータ群を、周波数成
分の候補スペクトルデータとして識別番号にそれぞれ対
応付けて予め記憶している候補スペクトルデータ蓄積部
と、前記候補スペクトルデータ蓄積部に記憶されている
各候補スペクトルデータに対応するタイトルや音データ
等の検索結果情報を前記候補スペクトルデータ蓄積部の
識別番号にそれぞれ対応付けて予め記憶している検索結
果情報蓄積部と、前記入力音データ蓄積部に蓄積された
音データを基に、前記セパレータ情報記憶部に記憶され
ているセパレータ情報と一致するデータを検出して一致
点で音データを分割し、この分割された音データの前後
にある２つのセパレータ情報のアドレスを前記照合対象
音データ記憶部に格納するとともに、データ長比較手段
にセパレータ情報が検出された旨通知するセパレータ検
出手段と、前記セパレータ検出手段で分割された音デー
タのデータ長を、前記照合対象音データ記憶部に格納さ
れた２つのセパレータ情報のアドレスから算出し、前記
音データ長記憶部に記憶されている登録データ長と一致
するかを確認し、一致した場合はその音データを前記照
合対象音データ記憶部に格納するとともに、スペクトル
変換手段に対して照合対象音データがあった旨通知する
データ長比較手段と、前記照合対象音データ記憶部に記
憶されている音データを周波数成分のスペクトルデータ
に変換し、抽出スペクトルデータとして前記スペクトル
データ記憶部に格納するとともに、音データ照合手段２
０４を起動するスペクトル変換手段と、前記スペクトル
データ記憶部に記憶されている抽出スペクトルデータと
一致するデータを、前記候補スペクトルデータ蓄積部に
蓄積されている候補スペクトルデータから検索し、この
検索された候補スペクトルデータの識別番号を検索結果
管理手段に通知する音データ照合手段と、前記音データ
照合手段により検索された１つの候補スペクトルデータ
に対応する音データやコマーシャルのタイトル等の検索
結果情報を、前記検索結果情報蓄積部から識別番号によ
り検索し、外部の検索結果出力装置に出力する検索結果
管理手段とを有することを特徴とする。A second sound data retrieval apparatus according to the present invention pre-registers and stores an input sound data storage section for storing sound data inputted from the outside, and separator information serving as division points for extracting sound data. A separator information storage unit, a sound data length storage unit that pre-registers and stores the length of input sound data to be subjected to the matching processing, and a sound data length stored in the sound data length storage unit. A collation target sound data storage unit for storing divided sound data having a matching length and addresses of two pieces of separator information before and after the sound data, and sound data stored in the collation target sound data storage unit A spectrum data storage unit that stores spectrum data obtained by converting the spectrum data into frequency components as extracted spectrum data, and a data group to be compared is stored in a frequency component candidate spectrum. A candidate spectrum data storage unit stored in advance in association with an identification number as data, and search result information such as a title and sound data corresponding to each candidate spectrum data stored in the candidate spectrum data storage unit. A search result information storage unit stored in advance in association with the identification number of the candidate spectrum data storage unit, and sound data stored in the input sound data storage unit, based on the sound data stored in the separator information storage unit. The sound data is divided at the coincidence point by detecting the data that matches the separator information, and the addresses of two pieces of separator information before and after the divided sound data are stored in the matching target sound data storage unit. Separator detecting means for notifying the length comparing means that separator information has been detected, and the separator detecting means The data length of the divided sound data is calculated from the addresses of the two pieces of separator information stored in the comparison target sound data storage unit, and whether the registered data length matches the registered data length stored in the sound data length storage unit is determined. Confirming, if matched, storing the sound data in the matching target sound data storage unit, and a data length comparing means for notifying the spectrum converting means that there is the matching target sound data; and The sound data stored in the storage unit is converted into spectrum data of frequency components and stored as extracted spectrum data in the spectrum data storage unit.
04, and a search for data matching the extracted spectrum data stored in the spectrum data storage unit from the candidate spectrum data stored in the candidate spectrum data storage unit. Sound data matching means for notifying the search result management means of the identification number of the candidate spectrum data, and search result information such as sound data or commercial title corresponding to one candidate spectrum data searched by the sound data matching means, A search result management unit that searches by the identification number from the search result information storage unit and outputs the search result to an external search result output device.

【００１２】本発明の第３の音データ検索装置は、本発
明の第１または第２の音データ検索装置において、前記
セパレータ情報は、一定時間以上継続した無音部である
ことを特徴とする。According to a third sound data search device of the present invention, in the first or second sound data search device of the present invention, the separator information is a silent portion that has continued for a predetermined time or more.

【００１３】本発明の第４の音データ検索装置は、本発
明の第１または第２の音データ検索装置において、前記
セパレータ情報は、ベルの音，時報，チャイム，競技の
スタートピストルの音，楽曲の一部等の一定の特性を持
つ音データであることを特徴とする。A fourth sound data search device according to the present invention is the first or second sound data search device according to the present invention, wherein the separator information includes a bell sound, a time signal, a chime, a start pistol sound of a game, It is characterized by being sound data having a certain characteristic such as a part of a music piece.

【００１４】本発明の記録媒体は、入力された音データ
を分割するための分割点となるセパレータ情報と少なく
とも１つの指定時間値とを予め登録，記憶しておく第１
の処理と、前記入力された音データを蓄積する第２の処
理と、前記入力された音データを前記セパレータ情報に
より分割する第３の処理と、前記第３の処理で分割され
た音データの長さを算出する第４の処理と、前記第４の
処理で算出された音データの長さと、前記指定時間値の
いずれかとが一致するかを比較，判定する第５の処理
と、前記第５の処理の結果、一致した場合には、前記分
割された音データを周波数成分のスペクトルデータに変
換し、予め登録，記憶されたスペクトルデータとの照合
を行って同じ音データを検索する第６の処理と、前記第
５の処理の結果、一致しなかった場合には、前記分割さ
れた音データの照合を行わないで処理を終了し、前記第
３の処理に戻る第７の処理とをコンピュータに実行させ
るための音データ検索プログラムを記録したことを特徴
とする。According to the recording medium of the present invention, first, separator information as a division point for dividing inputted sound data and at least one designated time value are registered and stored in advance.
Processing, a second processing for accumulating the input sound data, a third processing for dividing the input sound data by the separator information, and a processing of the sound data divided in the third processing. A fourth process for calculating the length, a fifth process for comparing and determining whether the length of the sound data calculated in the fourth process matches any of the designated time values, If the result of the processing in step 5 is a coincidence, the divided sound data is converted into spectrum data of frequency components, and the same sound data is searched by collating with the previously registered and stored spectrum data. And, if the result of the fifth process does not match, the seventh process that ends the process without collating the divided sound data and returns to the third process. Sound data detection to be executed by the computer And characterized by recording a program.

【００１５】[0015]

【発明の実施の形態】本発明の実施の形態について図面
を参照して説明する。Embodiments of the present invention will be described with reference to the drawings.

【００１６】図１は、本発明の一実施の形態の構成を示
すブロック図であり、テレビチューナ等の音データ入力
装置１と、音データ検索装置２と、スピーカ，サウンド
ボード，ＣＲＴ等の検索結果出力装置３とが示されてい
る。なお、ここで、音データとは、音声のみならずミュ
ージック，チャイムの音等のあらゆる音を含んだものと
する。FIG. 1 is a block diagram showing the configuration of an embodiment of the present invention. A sound data input device 1 such as a TV tuner, a sound data search device 2, and a search for a speaker, a sound board, a CRT, etc. The result output device 3 is shown. Here, the sound data includes not only voices but also all sounds such as music and chime sounds.

【００１７】音データ検索装置２は、セパレータ検出手
段２０１と、データ長比較手段２０２と、スペクトル変
換手段２０３と、音データ照合手段２０４と、検索結果
管理手段２０５と、入力音データ蓄積部２０６と、セパ
レータ情報記憶部２０７と、音データ長記憶部２０８
と、照合対象音データ記憶部２０９と、スペクトルデー
タ記憶部２１０と、候補スペクトルデータ蓄積部２１１
と、検索結果情報蓄積部２１２とを備えている。The sound data search device 2 includes a separator detection means 201, a data length comparison means 202, a spectrum conversion means 203, a sound data comparison means 204, a search result management means 205, an input sound data storage section 206, , Separator information storage unit 207 and sound data length storage unit 208
, A matching target sound data storage unit 209, a spectrum data storage unit 210, and a candidate spectrum data storage unit 211.
And a search result information storage unit 212.

【００１８】音データ検索装置２に備えられている２０
１〜２１２の手段，蓄積部，記憶部のうち、まず、２０
６〜２１２の蓄積部，記憶部について説明する。20 provided in the sound data search device 2
Of the means 1-22, the storage unit and the storage unit,
The storage units and storage units 6 to 212 will be described.

【００１９】入力音データ蓄積部２０６は、音データ入
力装置１から入力された音データを蓄積するものであ
り、少なくとも音データ長記憶部２０８に登録された最
長時間分の音データを記憶する容量（図４に示すよう
に、音データ長記憶部２０８に、”１４．５秒〜１５．
５秒”，”２９．５秒〜３０．５秒”の２種類の音デー
タ長が登録，記憶されているような場合は、無音部分や
誤差等を考慮して、例えば４０秒程度の容量とする）を
もち、音データ入力装置１から音データが順次入力され
るに伴ってそれを記憶していく。なお、記憶容量が限ら
れていることから記憶容量の範囲内で、時間的に先に記
憶された音データを消しながら、新しく入力された音デ
ータを順次記憶する。The input sound data storage unit 206 stores the sound data input from the sound data input device 1, and has a capacity for storing at least the longest sound data registered in the sound data length storage unit 208. (As shown in FIG. 4, “14.5 seconds to 15.
In the case where two types of sound data lengths of “5 seconds” and “29.5 seconds to 30.5 seconds” are registered and stored, for example, a capacity of about 40 seconds is considered in consideration of silence parts and errors. ) Is stored as the sound data is sequentially input from the sound data input device 1. Since the storage capacity is limited, the storage time is limited within the range of the storage capacity. Newly input sound data is sequentially stored while deleting previously stored sound data.

【００２０】セパレータ情報記憶部２０７は、音データ
を抽出するための分割点となるセパレータ情報を予め登
録され記憶している。例えば、図３に示すように、”Ａ
秒以上の無音”がセパレータ情報として登録されていれ
ば、”Ａ秒以上の無音が分割点である”ということを意
味している。つまり、入力された音データにＡ秒以上継
続する無音部が検出されたら、その無音部を区切りとし
音データを分割して抽出することとする。The separator information storage unit 207 pre-registers and stores separator information serving as division points for extracting sound data. For example, as shown in FIG.
If “silence for more than seconds” is registered as separator information, it means “silence for more than A seconds is a division point.” That is, a silent part that continues for more than A seconds in the input sound data. Is detected, sound data is divided and extracted with the silent part as a delimiter.

【００２１】音データ長記憶部２０８は、照合処理を行
う対象の入力音データの長さを予め登録，記憶してい
る。例えば、図４に示すように、”１４．５秒〜１５．
５秒”，”２９．５秒〜３０．５秒”の２つの音データ
長を記憶している。つまり、本実施の形態においては、
抽出したいコマーシャルの音データのデータ長が１５秒
または３０秒であるため、多少の誤差を考慮して１４．
５秒〜１５．５秒，２９．５秒〜３０．５秒としてい
る。従って、音データのデータ長が１４．５秒〜１５．
５秒または２９．５秒〜３０．５秒のときだけ、その音
データが照合処理を行う対象の音データであるとして認
識する。The sound data length storage unit 208 previously registers and stores the length of input sound data to be subjected to the collation processing. For example, as shown in FIG.
Two sound data lengths of “5 seconds” and “29.5 to 30.5 seconds” are stored.
13. Since the data length of the sound data of the commercial to be extracted is 15 seconds or 30 seconds, some errors are taken into account.
5 seconds to 15.5 seconds, and 29.5 seconds to 30.5 seconds. Therefore, the data length of the sound data is from 14.5 seconds to 15.
Only in the case of 5 seconds or 29.5 to 30.5 seconds, the sound data is recognized as the sound data to be subjected to the matching processing.

【００２２】照合対象音データ記憶部２０９は、音デー
タ長記憶部２０８に記憶された音データ長と一致する長
さを持つ分割後の音データ（照合処理を行う対象の音デ
ータ）およびこの音データの前後にある２つのセパレー
タ情報のアドレスをを記憶する。The collation target sound data storage unit 209 stores the divided sound data (the sound data to be subjected to the collation processing) having the length corresponding to the sound data length stored in the sound data length storage unit 208 and this sound. The addresses of two pieces of separator information before and after the data are stored.

【００２３】スペクトルデータ記憶部２１０は、照合対
象音データ記憶部２０９に記憶されている音データにフ
ーリエ変換を適用し、周波数成分に変換したスペクトル
データを抽出スペクトルデータとして記憶する。The spectrum data storage unit 210 applies Fourier transform to the sound data stored in the comparison target sound data storage unit 209, and stores the spectrum data converted into frequency components as extracted spectrum data.

【００２４】候補スペクトルデータ蓄積部２１１は、照
合の対象となるデータ群を、各々周波数成分の候補スペ
クトルデータとして予め記憶している。例えば、図５に
示すように、第１〜第４の４つのテレビコマーシャルの
音データをそれぞれ周波数成分に変換した候補スペクト
ルデータを、識別番号０１〜０４に対応付けて記憶して
いる。The candidate spectrum data storage unit 211 stores in advance data groups to be collated as candidate spectrum data of frequency components. For example, as shown in FIG. 5, candidate spectrum data obtained by converting sound data of first to fourth television commercials into frequency components are stored in association with identification numbers 01 to 04, respectively.

【００２５】検索結果情報蓄積部２１２は、候補スペク
トルデータ蓄積部２１１に記憶されている各スペクトル
データに対応する検索結果情報を予め記憶している。例
えば、図６に示すように、第１〜第４のコマーシャルの
タイトルや音データ等を、候補スペクトルデータ蓄積部
２１１の識別番号０１〜０４にそれぞれ対応付けて記憶
している。The search result information storage unit 212 stores search result information corresponding to each spectrum data stored in the candidate spectrum data storage unit 211 in advance. For example, as shown in FIG. 6, titles and sound data of the first to fourth commercials are stored in association with identification numbers 01 to 04 of the candidate spectrum data storage unit 211, respectively.

【００２６】次に、音データ検索装置２の２０１〜２０
５の各手段について説明する。Next, 201 to 20 of the sound data search device 2
5 will be described.

【００２７】セパレータ検出手段２０１は、入力音デー
タ蓄積部２０６に蓄積された音データを基に、セパレー
タ情報記憶部２０７に記憶されているセパレータ情報と
一致するデータを検出し、一致点で音データを分割す
る。本実施の形態においては、無音部分を検出し、無音
部分を境界として音データを分割する。The separator detecting means 201 detects data matching the separator information stored in the separator information storage section 207 based on the sound data stored in the input sound data storage section 206, and detects the sound data at the matching point. Split. In the present embodiment, a silent part is detected, and sound data is divided using the silent part as a boundary.

【００２８】データ長比較手段２０２は、セパレータ検
出手段２０１で分割された音データのデータ長が、音デ
ータ長記憶部２０８に記憶されている登録データ長と一
致するかを確認し、一致した場合はその音データを照合
対象音データ記憶部２０９に記憶する。本実施の形態に
おいては、分割された音データのデータ長が１４．５秒
〜１５．５秒または２９．５秒〜３０．５秒であった場
合に、照合対象音データ記憶部２０９に照合対象音デー
タとして記憶する。The data length comparing means 202 checks whether the data length of the sound data divided by the separator detecting means 201 matches the registered data length stored in the sound data length storage section 208, and if the data lengths match. Stores the sound data in the matching target sound data storage unit 209. In the present embodiment, when the data length of the divided sound data is 14.5 seconds to 15.5 seconds or 29.5 seconds to 30.5 seconds, the matching sound data is stored in the matching target sound data storage unit 209. It is stored as target sound data.

【００２９】スペクトル変換手段２０３は、照合対象音
データ記憶部２０９に記憶されている音データを周波数
成分のスペクトルデータに変換し、抽出スペクトルデー
タとしてスペクトルデータ記憶部２１０に記憶する。The spectrum converting means 203 converts the sound data stored in the matching target sound data storage section 209 into frequency component spectrum data, and stores it in the spectrum data storage section 210 as extracted spectrum data.

【００３０】音データ照合手段２０４は、スペクトルデ
ータ記憶部２１０に記憶されている抽出スペクトルデー
タと一致するデータを、候補スペクトルデータ蓄積部２
１１に蓄積されている候補スペクトルデータから検索す
る。The sound data collating means 204 stores data matching the extracted spectrum data stored in the spectrum data storage section 210 into the candidate spectrum data storage section 2.
A search is performed from the candidate spectrum data stored in No. 11.

【００３１】検索結果管理手段２０５は、音データ照合
手段２０４により検索された１つの候補スペクトルデー
タに対応する音データやコマーシャルのタイトルを、検
索結果情報蓄積部２１２から識別番号により検索し、検
索結果出力装置３に出力する。The search result management unit 205 searches the search result information storage unit 212 for the sound data or commercial title corresponding to one candidate spectrum data searched by the sound data matching unit 204 by using the identification number. Output to the output device 3.

【００３２】図２は、本発明の一実施の形態の動作を示
すフローチャートである。図３はセパレータ情報記憶部
２０７に登録されるセパレータ情報の一例を、図４は音
データ長記憶部２０８に登録されるデータ長の一例を、
図５は候補スペクトルデータ蓄積部２１１に登録される
候補スペクトルデータの一例を、図６は検索結果情報蓄
積部２１２に登録される検索結果情報の一例を、それぞ
れ示す図である。FIG. 2 is a flowchart showing the operation of the embodiment of the present invention. FIG. 3 shows an example of the separator information registered in the separator information storage unit 207, and FIG. 4 shows an example of the data length registered in the sound data length storage unit 208.
FIG. 5 is a diagram illustrating an example of candidate spectrum data registered in the candidate spectrum data storage unit 211, and FIG. 6 is a diagram illustrating an example of search result information registered in the search result information storage unit 212, respectively.

【００３３】図７は、入力される音データの一例を示す
図であり、入力音データには、ＣＭ１（１回目のテレビ
コマーシャル）が１５秒間、ＥＴＣ１（ＣＭではない部
分）が３００秒間、ＣＭ２（２回目のテレビコマーシャ
ル）が３０秒間、の各音データが含まれており、各音デ
ータの境界にはセパレータ情報となる無音部１，無音部
２，無音部３，無音部４も含まれている。図８は、入力
される音データの波形の概念図である。FIG. 7 is a diagram showing an example of input sound data. The input sound data includes CM1 (first television commercial) for 15 seconds, ETC1 (non-CM part) for 300 seconds, and CM2. (Second television commercial) includes sound data for 30 seconds, and a boundary of each sound data includes a silent portion 1, a silent portion 2, a silent portion 3, and a silent portion 4 serving as separator information. ing. FIG. 8 is a conceptual diagram of a waveform of input sound data.

【００３４】次に、本発明の一実施の形態の動作につい
て図１〜図８を参照して詳細に説明する。Next, the operation of the embodiment of the present invention will be described in detail with reference to FIGS.

【００３５】本実施の形態は、図示していない時間設
定，管理手段により、予め設定された第１の時刻になる
と起動され、予め設定された第２の時刻になると終了す
るものとして説明する。The present embodiment will be described on the assumption that the time setting and management means (not shown) is activated at a preset first time and ends at a preset second time.

【００３６】まず、開始時刻になると、音データ入力装
置１、例えばテレビチューナから音データが入力音デー
タ蓄積部２０６に供給されるとともに、セパレータ検出
手段２０１が起動され、セパレータ検出手段２０１は、
セパレータ情報記憶部２０７からセパレータ情報、つま
り音データの分割点に関する区切り情報を取得する。本
実施の形態においては、図３に示すように、”Ａ秒以上
の無音”がセパレータ情報としてセパレータ情報記憶部
２０７に登録，記憶されているため、セパレータ検出手
段２０１は、Ａ秒以上の無音がセパレータ情報であると
いうことを取得することになる。そして、セパレータ検
出手段２０１は、入力音データ蓄積部２０６に記憶され
た音データ内の無音部を検索する。このとき、データ先
頭から検索し、Ａ秒以上無音が継続した無音部（図７の
無音部１）を検出する。（図２のステップＡ１）。First, at the start time, sound data is supplied from the sound data input device 1, for example, a television tuner, to the input sound data storage unit 206, and the separator detecting means 201 is activated.
From the separator information storage unit 207, separator information, that is, delimiter information relating to sound data division points is obtained. In the present embodiment, as shown in FIG. 3, "silence for more than A seconds" is registered and stored as separator information in the separator information storage unit 207. Is the separator information. Then, the separator detection unit 201 searches for a silent part in the sound data stored in the input sound data storage unit 206. At this time, a search is made from the beginning of the data, and a silent portion (silent portion 1 in FIG. 7) in which silence continues for A seconds or more is detected. (Step A1 in FIG. 2).

【００３７】ステップＡ１の検索の結果、セパレータ情
報すなわち無音部が２つ検出された場合、セパレータ検
出手段２０１は、検出された２つの無音部（例えば、図
７の無音部１と無音部２の１組）のアドレスを照合対象
音データ記憶部２０９に格納するとともに、データ長比
較手段２０２に無音部が検出された旨通知する。なお、
このとき、照合対象音データ記憶部２０９に同じ１組の
アドレスが既に格納されていた場合には、セパレータ検
出手段２０１は、異なる１組のアドレスの無音部を検出
するまで、照合対象音データ記憶部２０９へのアドレス
格納およびデータ長比較手段２０２への無音部が検出さ
れた旨の通知は行わない。As a result of the search in step A1, if two pieces of separator information, that is, two silent sections are detected, the separator detecting means 201 determines whether the two detected silent sections (for example, the silent section 1 and the silent section 2 in FIG. (1 set) are stored in the comparison target sound data storage unit 209, and the data length comparing unit 202 is notified that a silent section has been detected. In addition,
At this time, if the same set of addresses has already been stored in the collation target sound data storage unit 209, the separator detection unit 201 stores the collation target sound data storage unit until a silence part having a different set of addresses is detected. The address storage to the section 209 and the notification to the data length comparing means 202 that the silent section has been detected are not performed.

【００３８】無音部のアドレスについては、無音部の先
頭アドレスでも末尾アドレスでも中間アドレスでもよ
く、特に限定はしないが、どの部分にするかを予めセパ
レータ検出手段２０１に設定しておくものとする。ま
た、この２つ（１組）の無音部、例えば図７の無音部１
と無音部２のアドレスが照合対象音データ記憶部２０９
に格納されて検索が終了し、次の無音部検出を行って図
７の無音部３と無音部４とが検出された場合は、この無
音部３と無音部４の１組のアドレスが、前回の無音部検
出時に格納された無音部１と無音部２の１組のアドレス
に上書きされる結果、照合対象音データ記憶部２０９に
格納されているアドレスは最新の１組となる（ステップ
Ａ２→Ａ３）。The address of the silent section may be the start address, the end address, or the intermediate address of the silent section. There is no particular limitation, but it is assumed that which section is to be set in the separator detecting means 201 in advance. In addition, the two (one set) silence parts, for example, the silence part 1 in FIG.
And the address of the silent section 2 are stored in the comparison target sound data storage section 209.
When the search is completed and the next silent part detection is performed to detect the silent part 3 and the silent part 4 in FIG. 7, a set of addresses of the silent part 3 and the silent part 4 is As a result of overwriting the set of the silent part 1 and the silent part 2 stored at the time of the previous detection of the silent part, the address stored in the matching target sound data storage unit 209 becomes the latest set (step A2). → A3).

【００３９】ステップＡ１の検索の結果、セパレータ情
報すなわち無音部が２つ（１組）検出されなかった場合
には、セパレータ検出手段２０１は引き続き無音部の検
索を続行する。例えば、図７に示すＥＴＣ１が入力音デ
ータ蓄積部２０６に入力されているような場合には、入
力音データ蓄積部２０６のメモリ容量が上述したように
４０秒程度であり、図７に示すＥＴＣ１の音データ長は
３００秒であるため、無音部２と無音部３とが１組とし
て検出されることはあり得ず、このような場合は、セパ
レータ検出手段２０１は引き続き無音部の検索を続行す
る。すなわち、本実施の形態では、入力音データの中か
ら１５秒または３０秒のコマーシャルを抽出して検索す
るために、音データ長記憶部２０８に”１４．５秒〜１
５．５秒”，”２９．５秒〜３０．５秒”と登録，記憶
させており、３０．５秒を超えるデータ長は抽出されな
い（ステップＡ２→Ａ１）。As a result of the search in step A1, if no separator information, that is, two (one set) of silence parts are not detected, the separator detection means 201 continues to search for silence parts. For example, when the ETC1 shown in FIG. 7 is input to the input sound data storage unit 206, the memory capacity of the input sound data storage unit 206 is about 40 seconds as described above, and the ETC1 shown in FIG. Since the sound data length is 300 seconds, the silent part 2 and the silent part 3 cannot be detected as one set. In such a case, the separator detecting means 201 continues the search for the silent part. I do. That is, in the present embodiment, in order to extract and retrieve a commercial of 15 seconds or 30 seconds from the input sound data, the sound data length storage unit 208 stores "14.5 to 1".
“5.5 seconds” and “29.5 seconds to 30.5 seconds” are registered and stored, and a data length exceeding 30.5 seconds is not extracted (step A2 → A1).

【００４０】セパレータ検出手段２０１から無音部が検
出された旨の通知を受けることにより、データ長比較手
段２０２は、音データ長記憶部２０８に登録，記憶され
ているデータ長、すなわち、”１４．５秒〜１５．５
秒”，”２９．５秒〜３０．５秒”を取得するととも
に、照合対象音データ記憶部２０９に格納されている１
組のアドレス（例えば、図７に示す無音部１と無音部２
のアドレス）を取得し、取得した両アドレスを基に両ア
ドレスに挟まれた音データ部分（図７に示すＣＭ１）の
データ長を算出する。そして、この算出された音データ
長と”１４．５秒〜１５．５秒”，”２９．５秒〜３
０．５秒”とを比較する。ここで、算出された音データ
長が１５秒であった場合は、音データ長記憶部２０８に
登録，記憶されている”１４．５秒〜１５．５秒”と一
致するため、データ長比較手段２０２は、入力音データ
蓄積部から対応する音データを抽出し、照合対象音デー
タ記憶部２０９に格納するとともに、スペクトル変換手
段２０３に対して照合対象音データがあった旨通知す
る。（ステップＡ４）。Upon receiving a notification from the separator detecting means 201 that a silent section has been detected, the data length comparing means 202 causes the data length registered and stored in the sound data length storing section 208, ie, "14. 5 seconds to 15.5
Second "," 29.5 to 30.5 seconds ", and the 1 stored in the matching target sound data storage unit 209.
A set of addresses (for example, silence 1 and silence 2 shown in FIG. 7)
And the data length of the sound data portion (CM1 shown in FIG. 7) sandwiched between both addresses is calculated based on both the obtained addresses. Then, the calculated sound data length and “14.5 seconds to 15.5 seconds”, “29.5 seconds to 3 seconds”
0.5 seconds ". If the calculated sound data length is 15 seconds," 14.5 seconds to 15.5 "registered and stored in the sound data length storage unit 208. Seconds ”, the data length comparison unit 202 extracts the corresponding sound data from the input sound data storage unit, stores it in the comparison target sound data storage unit 209, and sends the comparison target sound data to the spectrum conversion unit 203. A notification that there is data is provided (step A4).

【００４１】ステップＡ４の比較の結果、算出された音
データ長と”１４．５秒〜１５．５秒”，”２９．５秒
〜３０．５秒”とが一致しなかった場合には、ステップ
Ａ１に戻り、セパレータ検出手段２０１は引き続き無音
部の検索を続行する（ステップＡ４→Ａ１）。As a result of the comparison in step A4, if the calculated sound data length does not match "14.5 seconds to 15.5 seconds" or "29.5 seconds to 30.5 seconds", Returning to step A1, the separator detecting means 201 continues to search for a silent part (step A4 → A1).

【００４２】データ長比較手段２０２から照合対象音デ
ータがあった旨通知を受けることにより、スペクトル変
換手段２０３は、照合対象音データ記憶部２０９に記憶
されている抽出音データを取得し、フーリエ変換を適用
することにより、周波数成分のデータであるスペクトル
データに変換し（図８のスペクトルデータ１）、これを
抽出スペクトルデータとしてスペクトルデータ記憶部２
１０に格納するとともに、音データ照合手段２０４を起
動する（ステップＡ５）。Upon receiving a notification from the data length comparing means 202 that there is sound data to be compared, the spectrum converting means 203 acquires the extracted sound data stored in the sound data storing section 209 and performs Fourier transform. Is converted into spectrum data which is data of frequency components (spectral data 1 in FIG. 8), and this is used as extracted spectrum data as a spectrum data storage unit 2.
10 and activate the sound data collating means 204 (step A5).

【００４３】音データ照合手段２０４は、スペクトルデ
ータ記憶部２１０に記憶されている抽出スペクトルデー
タを取得し、候補スペクトルデータ蓄積部２１１に予め
記憶されている１番目の候補スペクトルデータ（図５の
１個目のコマーシャルのスペクトルデータ）と比較する
（ステップＡ６）。The sound data collating means 204 acquires the extracted spectrum data stored in the spectrum data storage section 210, and stores the first candidate spectrum data (1 in FIG. 5) stored in the candidate spectrum data storage section 211 in advance. (The spectrum data of the commercial) (step A6).

【００４４】ステップＡ６において、スペクトルデータ
が一致した場合には、１番目の候補スペクトルデータで
一致したという情報、すなわち、図５に示す識別番号”
０１”を検索結果管理手段２０５に供給する。検索結果
管理手段２０５は、検索結果情報蓄積部２１２に記憶さ
れている、図６に示す識別番号”０１”の音データとタ
イトル（図６の第１のコマーシャルのタイトル，音デー
タ）を取得し、検索結果出力装置３に供給する。検索結
果出力装置３は、供給されたタイトルをＣＲＴ画面等に
表示し、音データを再生する（ステップＡ７→Ａ８）。In step A6, when the spectrum data matches, information indicating that the first candidate spectrum data matches, that is, the identification number shown in FIG.
"01" is supplied to the search result management means 205. The search result management means 205 stores the sound data and title (identification number "01" in FIG. 6) stored in the search result information storage unit 212 and having the identification number "01" shown in FIG. 1 and outputs the title to the search result output device 3. The search result output device 3 displays the supplied title on a CRT screen or the like, and reproduces the sound data (step A7 → A8).

【００４５】ステップＡ６において、１番目の候補スペ
クトルデータと一致しなかった場合には、２番目の候補
スペクトルデータ（図５の第２のコマーシャルのスペク
トルデータ）と比較する。そして、２番目の候補スペク
トルデータとも一致しなかった場合は、３番目，４番目
と順次ステップＡ７→Ａ９→Ａ１０を繰り返し、候補ス
ペクトルデータ蓄積部２１１に記憶されている全ての候
補スペクトルデータ（図５に示す第１〜第４のコマーシ
ャルのスペクトルデータ）と比較しても一致しなかった
場合は、ステップＡ１に戻り、セパレータ検出手段２０
１は引き続き無音部の検索を続行し（ステップＡ９→Ａ
１）、途中で一致する候補スペクトルデータが見つかっ
た場合は、上述したステップＡ７→Ａ８と同様の動作を
行う（ステップＡ７→Ａ８）。If the data does not match the first candidate spectrum data at step A6, it is compared with the second candidate spectrum data (the second commercial spectrum data in FIG. 5). If it does not match the second candidate spectrum data, the steps A7 → A9 → A10 are repeated sequentially from the third and fourth steps, and all the candidate spectrum data stored in the candidate spectrum data storage unit 211 (FIG. 5 does not match even if compared with the first to fourth commercial spectrum data shown in FIG.
1 continues the search for the silent part (step A9 → A
1) If a matching candidate spectrum data is found on the way, the same operation as the above-described step A7 → A8 is performed (step A7 → A8).

【００４６】次に、本発明の他の実施の形態について図
９を参照して詳細に説明する。Next, another embodiment of the present invention will be described in detail with reference to FIG.

【００４７】図９は、本発明の他の実施の形態の構成を
示すブロック図であり、音データ入力装置１と、検索結
果出力装置３と、コンピュータ４と、記録媒体５とを備
えている。コンピュータ４の構成は、図１の本発明の一
実施の形態で説明した音データ検索装置２と基本的に同
じである。記録媒体５は、音データ検索プログラムを記
録している。この記録媒体５は、磁気ディスク，光記録
ディスク，半導体メモリその他の記録媒体であってもよ
い。音データ検索プログラムは、記録媒体５からコンピ
ュータ４に読み込まれ、本発明の一実施の形態と同様の
動作を制御する。FIG. 9 is a block diagram showing the configuration of another embodiment of the present invention, which comprises a sound data input device 1, a search result output device 3, a computer 4, and a recording medium 5. . The configuration of the computer 4 is basically the same as the sound data search device 2 described in the embodiment of the present invention in FIG. The recording medium 5 stores a sound data search program. The recording medium 5 may be a magnetic disk, an optical recording disk, a semiconductor memory, or another recording medium. The sound data search program is read from the recording medium 5 into the computer 4, and controls the same operation as in the embodiment of the present invention.

【００４８】なお、以上の説明において、音データ入力
装置としてテレビチューナを使用した場合について説明
したが、マイク，サウンドボード，ラジオチューナを使
用してもよいし、また、ネットワーク，電話回線等を介
した音データを使用しても、本発明が適用できることは
明らかであり、音データの種類に関する制限はない。In the above description, the case where the TV tuner is used as the sound data input device has been described. However, a microphone, a sound board, a radio tuner may be used, or a network, a telephone line, or the like may be used. It is clear that the present invention can be applied even if the sound data is used, and there is no restriction on the type of the sound data.

【００４９】また、以上の説明において、セパレータ情
報として無音部を使用したが、これに限定されず、ベル
の音，時報，チャイム，競技のスタートピストルの音，
楽曲の一部等のように、一定の特性を持つ音データなら
ば使用可能である。In the above description, a silent portion is used as separator information. However, the present invention is not limited to this. The sound of a bell, a time signal, a chime, the sound of a start pistol of a competition,
Any sound data, such as a part of a music piece, having certain characteristics can be used.

【００５０】さらに、以上の説明においては、入力音デ
ータ蓄積部２０６のメモリ容量が４０秒程度として説明
したが、このメモリ容量の値はいくらであってもよい。
メモリ容量を、例えば６００秒とすれば、図７に示すＥ
ＴＣ１（３００秒）のデータも入力音データ蓄積部２０
６に蓄積でき、無音部２，無音部３の一組の無音部を検
出して、音データ長記憶部２０８に登録されている時間
値と一致するかの判定を行うことになる。Further, in the above description, the memory capacity of the input sound data storage unit 206 has been described as about 40 seconds, but the value of this memory capacity may be any value.
Assuming that the memory capacity is, for example, 600 seconds, E shown in FIG.
The data of TC1 (300 seconds) is also stored in the input sound data storage unit 20.
6, a set of silence portions 2 and 3 is detected, and it is determined whether or not the set value matches the time value registered in the sound data length storage portion 208.

【００５１】[0051]

【発明の効果】本発明は、入力音データの分割点となる
セパレータ情報と検索すべき音データの長さとを予め設
定しておき、セパレータ情報により分割された入力音デ
ータの長さが予め設定された値と一致したときに、検索
が必要な音データと判定して照合処理を行うようにした
ため、無駄な照合処理を削減して検索の効率化をはかる
ことができるという効果を有する。According to the present invention, the length of the input sound data divided by the separator information is set in advance by setting the separator information as the division point of the input sound data and the length of the sound data to be searched. Since the matching process is performed by determining that the sound data needs to be searched when the value matches the searched value, it is possible to reduce unnecessary use of the matching process and improve the efficiency of the search.

[Brief description of the drawings]

【図１】本発明の一実施の形態の構成を示すブロック図
である。FIG. 1 is a block diagram showing a configuration of an embodiment of the present invention.

【図２】本発明の一実施の形態の動作を示すフローチャ
ートである。FIG. 2 is a flowchart showing the operation of the embodiment of the present invention.

【図３】セパレータ情報記憶部２０７に登録されるセパ
レータ情報の一例を示す図である。FIG. 3 is a diagram illustrating an example of separator information registered in a separator information storage unit 207.

【図４】音データ長記憶部２０８に登録されるデータ長
の一例を示す図である。FIG. 4 is a diagram showing an example of a data length registered in a sound data length storage unit 208.

【図５】候補スペクトルデータ蓄積部２１１に登録され
る候補スペクトルデータの一例を示す図である。FIG. 5 is a diagram showing an example of candidate spectrum data registered in a candidate spectrum data storage unit 211.

【図６】検索結果情報蓄積部２１２に登録される検索結
果情報の一例を示す図である。FIG. 6 is a diagram illustrating an example of search result information registered in a search result information storage unit 212;

【図７】入力される音データの一例を示す図である。FIG. 7 is a diagram showing an example of input sound data.

【図８】入力される音データの波形の概念図である。FIG. 8 is a conceptual diagram of a waveform of input sound data.

【図９】本発明の他の実施の形態の構成を示すブロック
図である。FIG. 9 is a block diagram showing a configuration of another embodiment of the present invention.

[Explanation of symbols]

１音データ入力装置２音データ検索装置２０１セパレータ検出手段２０２データ長比較手段２０３スペクトル変換手段２０４音データ照合手段２０５検索結果管理手段２０６入力音データ蓄積部２０７セパレータ情報記憶部２０８音データ長記憶部２０９照合対象音データ記憶部２１０スペクトルデータ記憶部２１１候補スペクトルデータ蓄積部２１２検索結果情報蓄積部３検索結果出力装置４コンピュータ５記録媒体 Reference Signs List 1 sound data input device 2 sound data search device 201 separator detection means 202 data length comparison means 203 spectrum conversion means 204 sound data collation means 205 search result management means 206 input sound data storage section 207 separator information storage section 208 sound data length storage section 209 Reference target sound data storage unit 210 Spectrum data storage unit 211 Candidate spectrum data storage unit 212 Search result information storage unit 3 Search result output device 4 Computer 5 Recording medium

Claims

[Claims]

1. A method of comparing input sound data with previously registered and stored sound data and comparing the same to search for the same sound data from the registered and stored sound data. In the sound data search method, separator information serving as a division point for dividing the input sound data and at least one designated time value are registered and stored in advance, and the input sound data is stored in the sound data search method. Sound data divided according to separator information, and when the length of the divided sound data matches any of the designated time values, comparison and collation with the registered and stored sound data are performed. retrieval method.

2. A first step of pre-registering and storing separator information serving as a division point for dividing input sound data and at least one designated time value; A second step of accumulating; a third step of dividing the input sound data based on the separator information; a fourth step of calculating a length of the sound data divided in the third step; A fifth step of comparing and determining whether the length of the sound data calculated in the fourth step matches one of the designated time values; and, if the result of the fifth step is a match, A sixth step of converting the divided sound data into spectrum data of frequency components, searching for the same sound data by comparing the spectrum data with pre-registered and stored spectrum data, If the result of the fifth step does not match,
And a seventh step of terminating the process without checking the divided sound data and returning to the third step.

3. The apparatus according to claim 1, wherein the separator information is a silent portion that has continued for a predetermined time or more.
The described sound data search method.

4. The apparatus according to claim 1, wherein the separator information is sound data having certain characteristics such as a bell sound, a time signal, a chime, a start pistol sound of a game, and a part of a music piece. The described sound data search method.

5. The registered sound data is compared with the previously registered and stored sound data, and the same sound data as the inputted sound data is searched from the registered and stored sound data. A sound data search device for pre-registering and storing separator information serving as a division point for dividing the input sound data and at least one designated time value; Means for dividing according to the separator information, means for determining whether or not the length of the divided sound data matches any of the designated time values. A sound data search device comprising: means for performing collation.

6. An input sound data storage unit for storing sound data input from the outside, a separator information storage unit for pre-registering and storing separator information as a division point for extracting sound data, The length of the input sound data to be processed is registered in advance,
A stored sound data length storage unit, divided sound data having a length corresponding to the sound data length stored in the sound data length storage unit, and addresses of two separator information before and after the sound data. A comparison target sound data storage unit that stores the spectrum data obtained by converting the sound data stored in the comparison target sound data storage unit into frequency components as extracted spectrum data; A candidate spectrum data storage unit in which a target data group is stored in advance as frequency component candidate spectrum data in association with an identification number, and corresponding to each candidate spectrum data stored in the candidate spectrum data storage unit. Search result information such as title and sound data to be assigned to the identification number of the candidate spectrum data storage unit, respectively. Based on the search result information storage unit pre-stored in association with the sound data stored in the input sound data storage unit,
Data matching the separator information stored in the separator information storage unit is detected, sound data is divided at the coincidence point, and the addresses of two pieces of separator information before and after the divided sound data are used as the matching target sound. Along with storing the data length in the data storage unit, the separator length detection unit that notifies the data length comparison unit that the separator information has been detected, and the data length of the sound data divided by the separator detection unit is stored in the matching target sound data storage unit. It is calculated from the addresses of the two stored pieces of separator information, and it is checked whether the registered data length matches the registered data length stored in the sound data length storage section. Data length comparing means for notifying the spectrum conversion means that there is sound data to be compared, A spectrum conversion unit that converts sound data stored in the sound data storage unit into spectrum data of frequency components, stores the extracted spectrum data in the spectrum data storage unit, and activates a sound data comparison unit 204; Data matching the extracted spectrum data stored in the data storage unit is searched from the candidate spectrum data stored in the candidate spectrum data storage unit, and the identification number of the searched candidate spectrum data is used as a search result management unit. A search result information such as a sound data or a commercial title corresponding to one candidate spectrum data searched by the sound data matching means by an identification number from the search result information storage unit. Search output to an external search result output device The sound data retrieval apparatus characterized by having a fruit management means.

7. The apparatus according to claim 5, wherein the separator information is a silent portion that has continued for a predetermined time or more.
The described sound data search device.

8. The apparatus according to claim 5, wherein the separator information is sound data having certain characteristics such as a bell sound, a time signal, a chime, a start pistol sound of a competition, and a part of a music piece. The described sound data search device.

9. A first process of pre-registering and storing separator information serving as a dividing point for dividing inputted sound data and at least one designated time value, and A second processing to accumulate; a third processing to divide the input sound data by the separator information; a fourth processing to calculate the length of the sound data divided in the third processing; A fifth process of comparing and determining whether the length of the sound data calculated in the fourth process matches any of the designated time values, and a result of the fifth process, A sixth process of converting the divided sound data into spectrum data of frequency components, collating the spectrum data with pre-registered and stored spectrum data to search for the same sound data, and a result of the fifth process. , If not matched Ends the processing without performing the comparison of the divided sound data,
A recording medium on which a sound data search program for causing a computer to execute a seventh process returning to the third process is recorded.