WO2008127052A1 - System and method for searching audio fingerprint by index information - Google Patents

System and method for searching audio fingerprint by index information Download PDF

Info

Publication number
WO2008127052A1
WO2008127052A1 PCT/KR2008/002085 KR2008002085W WO2008127052A1 WO 2008127052 A1 WO2008127052 A1 WO 2008127052A1 KR 2008002085 W KR2008002085 W KR 2008002085W WO 2008127052 A1 WO2008127052 A1 WO 2008127052A1
Authority
WO
WIPO (PCT)
Prior art keywords
fingerprint
audio
index
information
searching
Prior art date
Application number
PCT/KR2008/002085
Other languages
French (fr)
Inventor
Seungjae Lee
Jin Soo Seo
Sang Kwang Lee
Wonyoung Yoo
Young Suk Yoon
Yong Seok Seo
Weon Geun Oh
Young Ho Suh
Original Assignee
Electronics And Telecommunications Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electronics And Telecommunications Research Institute filed Critical Electronics And Telecommunications Research Institute
Priority to CN2008800126394A priority Critical patent/CN101663708B/en
Publication of WO2008127052A1 publication Critical patent/WO2008127052A1/en

Links

Classifications

    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing

Definitions

  • the present invention relates to an audio fingerprint search technology, and more particularly, to a system and method for searching audio fingerprint by index information to improve recognition performance and to increase a search speed by indexing audio fingerprints, searching a predetermined audio fingerprint based on the indexing, and verifying the searched audio fingerprint.
  • the object of an audio fingerprint system is to recognize a predetermined audio by receiving an audio signal and searching a corresponding audio through a previously built audio fingerprint database.
  • the audio fingerprint system has been used to broadcasting monitor, CF recognition, and file filtering.
  • a high recognition rate and a fast search speed are required even under various distortions.
  • the recognition speed is one of the most important factors for real-time processing in broadcasting monitoring and file filtering fields that operate based on the large capacity audio fingerprint database.
  • the audio fingerprint system Furthermore, it is also required for the audio fingerprint system to have a high recognition performance although audio data is deformed through re-sampling, filtering, equalization, and compression as well as the fast recognition speed according to the application fields of the audio finger print system.
  • a search method was introduced in Korean Patent Publication No. 2003-7001489 entitled "Method for search in audio database".
  • a landmark and a fingerprint are extracted, and predetermined audio data is searched using a corresponding relation of a land mark and a fingerprint.
  • a land mark is calculated beside a fingerprint, the calculated land mark is stored as an index, and a candidate list between a landmark and a music ID using a fingerprint in a landmark position.
  • the audio is recognized based on a linear relation thereof.
  • the characteristics thereof were not considered although the audio signal was searched based on the fingerprint in the method.
  • the method needed a landmark to recognize a predetermined audio as a supplementary feature.
  • the present invention is directed to a system and method for searching audio fingerprint by index information which substantially obviates one or more problems due to limitations and disadvantages of the related art.
  • a system for searching an audio fingerprint including: a DB group for generating an index based on statistical characteristics of an audio fingerprint for an audio file and consecutively matching the index, the audio fingerprint, and music information; and an audio fingerprint searching apparatus for generating a new index based on statistical characteristic of an audio fingerprint for a new input audio file and searching corresponding music information for the new input audio file by searching the new index from the DB group.
  • a method for searching an audio fingerprint using index information including the steps of: a) generating an index based on statistical characteristics of an audio fingerprint for an audio file, and preparing a DB group for storing position information that consecutively matches the generated index, audio fingerprint, and music information; b) generating an index based on statistical characteristics of an audio fingerprint for a new input audio file; and c) searching corresponding music information for the new input audio file by searching the generated index, which is generated at the step b), from the DB group.
  • the system and method for searching audio fingerprint by index information according to the present invention generates an index using the statistical characteristics of an audio fingerprint and searches audio fingerprint based on the generated index. Therefore, the system and method for searching audio fingerprint by index information according to the present invention can sustain a fast search time and can be applied to the filtering and monitoring of files in a large capacity database. Furthermore, the system and method for searching audio fingerprint by index information creates candidate indexes that include an index bit of a variable position in order to compensate distortion because the recognition rate is abruptly degraded due to the distortion if the index is directly used to search without the compensation. Therefore, the system and method for searching audio fingerprint by index information can improve the recognition rate by correcting error that may be generated due to the bit index.
  • FIG. 1 is a block diagram illustrating a system for searching audio fingerprint according to an embodiment of the present invention
  • FIG. 2 is a block diagram illustrating an index processor according to an embodiment of the present invention
  • FIG. 3 is a diagram illustrating a relation in DB files used in a system for searching an audio fingerprint according to an embodiment of the present invention
  • FIG. 4 is a diagram illustrating probability distribution used for generating a fingerprint index
  • FIG. 5 is a diagram illustrating a procedure of generating a fingerprint index using a fingerprint extracted from an audio search process and searching a predetermined audio based on the generated fingerprint index;
  • FIG. 6 is a diagram illustrating a procedure of generating a candidate index in an audio search process.
  • FIG. 7 is a diagram illustrating a procedure of searching a final result using the candidate index generated at an audio search process.
  • FIG. 1 is a block diagram illustrating a system for searching audio fingerprint according to an embodiment of the present invention.
  • the system for searching audio fingerprint includes an audio fingerprint search apparatus 1.
  • the audio fingerprint search apparatus 1 includes a fingerprint extracting unit 11 for extracting an audio fingerprint for an audio file, a candidate index searching unit 12 for generating candidate indexes in consideration of a variable position by sorting values of the extracted fingerpting in an ascending order of absolute values of differences between the extracted audio fingerprint value and a mean value that is used when an index is generated, a fingerprint matching unit 13 for matching an audio fingerprint to the extracted audio fingerprint corresponding to a candidate index, and a result verifying unit 14 for verifying that a search result is corresponding music information if a distance between audio fingerprints is in a predetermined value range.
  • the system for searching a fingerprint further includes a DB group 2 for storing audio fingerprints with corresponding indexes that are matched with the audio fingerprints.
  • a DB group 2 for storing audio fingerprints with corresponding indexes that are matched with the audio fingerprints.
  • the DB group 2 it is required to build a related database first. Therefore, it is preferable to form the DB group 2 to have a fingerprint DB 21, a music-information DB 22, and a fingerprint index DB 23.
  • the system for searching audio fingerprint according to the present embodiment is divided into a DB generating area for generating fingerprint indexes and building a database thereof and a DB searching area for searching through indexing. That is, related information is stored in the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23 in a DB generating step. Fingerprint extraction, candidate search and match through indexing, and result verification are performed in a step of searching based on an index.
  • the system for searching audio fingerprint according to the present embodiment is divided into two areas, it is obvious to those skilled in the art that the two areas can be performed in one area.
  • FIG. 2 is a block diagram illustrating an index processor according to an embodiment of the present invention.
  • the index processor 3 includes a fingerprint extractor 31, a fingerprint statistical analyzer 32, a fingerprint binarizor 33, and a fingerprint indexer 34.
  • the fingerprint extractor 31 extracts an audio fingerprint from an input audio file using a fingerprint extraction algorithm, and the fingerprint statistical analyzer 32 analyzes statistical characteristics of an audio fingerprint and calculates a probability distribution. That is, the fingerprint statistical analyzer 32 approximates the extracted audio fingerprint to a probability model by calculating a mean and audio fingerprints and fingerprint distribution. Then, the fingerprint binarizor 33 binarizes the fingerprint to have probabilistic identical distribution by analyzing the statistical characteristics of audio fingerprint, and the fingerprint indexer 34 generates an index.
  • the fingerprint extraction algorithm may be one of zero crossing rate (ZCR), energy difference, spectral flatness, mel frequency cepstral coefficients (MFCC), and frequency centroide.
  • ZCR zero crossing rate
  • MFCC mel frequency cepstral coefficients
  • FIG. 3 is a diagram illustrating a relation with the structure of a DB file used in a system for searching an audio fingerprint according to an embodiment of the present invention.
  • the DB group 2 includes the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23.
  • the fingerprint DB 21 stores fingerprint audio values and position information in music.
  • the music information DB 22 stores information about music ID and the number of fingerprints.
  • the fingerprint index DB 23 stores information about position in the fingerprint DB 21 according to the binarized fingerprint value.
  • the system for searching audio fingerprint When the system for searching audio fingerprint according to the present embodiment receives a request of searching a predetermined audio file after each of the DBs stores corresponding information, the system generates candidate indexes through the indexing step shown in FIG. 2, searches the generated candidate index from the fingerprint index DB 23, and detects fingerprint position information of the candidate index. Then, the system detects fingerprint information stored in the fingerprint DB 21 corresponding to the fingerprint position information and outputs the music information stored in the music information DB 22 corresponding to the position information of music.
  • FIG. 4 is graphs illustrating probability distribution used for generating a fingerprint index.
  • FIG. 5 is a diagram illustrating a procedure of generating a fingerprint index using a fingerprint extracted from an audio search process and searching a predetermined audio based on the generated fingerprint index.
  • three DB files are prepared from an audio file.
  • the audio fingerprint search apparatus 1 performs a search service using the three DB files.
  • a step of extracting an audio fingerprint when a predetermined audio file inputs, a step of extracting an audio fingerprint, a step of calculating a candidate fingerprint based on an index obtained from the extracted fingerprint, a step of matching the extracted audio fingerprint to an audio fingerprint corresponding to the calculated candidate fingerprint index , and a step of verifying a search result using the matching result are sequentially performed as the same method used for generating a DB from the audio file.
  • an index is generated by extracting an audio fingerprint using the same method. That is, the fingerprint extractor 11 extracts an audio fingerprint from input audio files using a fingerprint extraction algorithm, and the fingerprint statistical analyzer 32 analyzes the statistical characteristics of the audio fingerprint and calculates probability distribution having the probabilistically identical distribution. That is, the fingerprint statistical analyzer 32 approximates a fingerprint to a probabilistic model by calculating the mean and distribution of the audio fingerprints. Then, the fingerprint binarizor 33 analyzes the statistical characteristics of the audio fingerprint and binarizes the audio fingerprint to have the probabilistically identical distribution. Then, the fingerprint indexer 34 generates an index.
  • the fingerprint index DB 23 After generating the candidate index, the fingerprint index DB 23 obtains the information about a position having the corresponding candidate index value in the fin- gerprint DB 21. Then, the result of searching corresponding music information through the steps of matching and verifying a fingerprint.
  • FIG. 6 is a diagram illustrating a procedure of generating a candidate index in an audio search process.
  • N dimensional fingerprint values are present, the N dimensional fingerprint values are arranged in an ascending order based on the absolute values of differences with a mean value used for generating an index.
  • a threshold value is decided according to the probability distribution shape, a variable position is decided, and candidate indexes are generated in consideration of the variable position.
  • FIG. 7 is a diagram illustrating a step of searching a final result using the generated candidate index in the audio search step.
  • a fingerprint value is called, which is matched with the generated candidate index value, and a distance between a target audio fingerprint to search and a position having the called fingerprint value is calculated. Then, the calculated distance is compared with the predetermined threshold value. If the minimum value is smaller than the threshold value, one result is stored. In order to provide the high reliable result, the above mentioned steps are repeatedly performed at a fingerprint in a different position. Then, the final result is outputted through verifying the result.
  • N dimensional fingerprint can be expressed N binary numbers using the mean value of '0' as shown in Equation 1.
  • a 16 dimensional audio fingerprint is expressed as one value between 0 to 65535, and this value is used as an index in a database.
  • the audio fingerprint system generates three database files for audio search. As shown in FIG. 3, the three database files are formed as the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23.
  • the fingerprint DB 21 stores extracted fingerprint values. That is, the fingerprint DB
  • the music information DB 22 stores information about music from which a fingerprint is extracted based on information provided when a fingerprint is generated.
  • the music information DB 22 may store various information such as a music ID, copyright information, a length of a fingerprint.
  • the fingerprint index DB 23 transforms a fingerprint to an index through Equation 1 and Equation 2 and stores the fingerprint values, which are indexes, according to the position information in the fingerprint DB 21.
  • fingerprints are sequentially stored with position information as shown in FIG. 3.
  • the music information and the fingerprint information are also stored with them.
  • the fingerprint index is used for audio search by being stored with the position information of a fingerprint having a corresponding index value shown in FIG. 3.
  • the audio fingerprint system After preparing the three DB files from an audio file, the audio fingerprint system performs a search service using the prepared DB files. That is, if a predetermined audio file is inputted, a step of extracting an audio fingerprint using the same method used for generating the DB from the audio file, a step of calculating candidate fingerprints by calculating indexes from extracted fingerprints, a step of matching candidate fingerprints, and a step of verifying using a matching result are sequentially performed. Such steps will be described in more detail as follows.
  • an audio fingerprint is extracted using the audio fingerprint extraction method, and an index is generated based on the extracted audio fingerprint.
  • an N-dimensional value is arranged in an order of closest distances from a mean value of a probabilistic distribution for generating an index, for example a mean value of '0' in the present embodiment.
  • positions having the large probability of changing according to the probabilistic distribution can be sequentially calculated.
  • Positions in a predetermined distance range through probabilistic distribution can be selected.
  • a predetermined number of positions can be selected without any condition.
  • a threshold value having a bell shaped probabilistic distribution is decided as a constant number in a previously used audio fingerprint, information about positions changeable according to the fingerprint can be obtained. After the position is decided, an index is generated using the index generation method used in the step of extracting a fingerprint. In addition, all possible indexes are generated corresponding to variable positions.
  • candidate fingerprints are obtained from a corresponding index with reference to the position information in the fingerprint DB 21 and the obtained candidate fingerprints are arranged by comparing distances to a target audio fingerprint to search according to the position information.
  • the value of Music ID is calculated using position information in the fingerprint index DB 23. Since the music information DB 22 stores the number of fingerprints of each music, the value of a position is larger than the sum of the fingerprint numbers up to (m-l) th music if the m th Music ID is a result. Also, the value of a position is smaller than the sum of the fingerprint numbers of music up to the (m+l)" 1 music. Using this fact, the value of Music ID is calculated.
  • a general system may perform the searching step several times in order to improve the reliability of search. After calculating a candidate index at a predetermined position, a candidate index may be searched again at another position, and a step of searching a candidate fingerprint is performed repeatedly so as to obtain a result.
  • Such a result is decided based on a parameter value that is selected by a system, and the search results are stored as many as the number of times of performing the searching step. The stored results are outputted as a final searching result after the verification step.
  • fingerprints extracted from a predetermined audio signal to search are consecutive values in a time domain. That is, fingerprints are sequentially extracted in time. Similarly, the fingerprints are sequentially extracted and stored in time at the generated fingerprint DB 21.
  • i f Musi cID/n + p7 Musi cID/ny, p - 1 ⁇ Pos i t ion [n + p] - Pos i t ion [n] ⁇ p + 1
  • a test database is generated for 27,000 audio files each having a length of 40 seconds for verification. Then, 100 audio files, which are compressed to 32kbps-MP3 audio files each having a length of 20 seconds, are searched from the test database.
  • a fingerprint described in the present embodiment is extracted for 16-dimensions and the extracted fingerprint is used. In order to compare distances, 52 fingerprints are used. Also, the search step is performed five times for verification. In order to compare performances, a sequential search is performed under the same condition. In the sequential search, all fingerprints are searched in a DB and a result having a minimum value by comparing distances is determined as a final result.
  • the search speed of the index bases search according to the present embodiment is much faster than that of the sequential search. Also, the deterioration of the recognition rate for the bit index value can be overcome by adjusting a parameter value that decides a candidate index although the total search time extends little bit.
  • the recognition rate can be improved from 87% to 96% although the total search time extends as long as about 10 seconds.
  • the search time can be reduced up to 1/9 of the total search time of the sequential search.
  • the system and method for searching audio fingerprint by index information according to the present embodiment can be applied to the filtering and monitoring of files in a large capacity database based on the fast search time and the high recognition rate.
  • the system and method for searching audio fingerprint by index information according to the present embodiment can be applied to the file filtering to solve the copyright problem in a user created content (UCC) field or a P2P field.
  • UCC user created content
  • P2P P2P field

Abstract

Provided are a system and method for searching audio fingerprint by index information. The system includes a DB group for generating an index based on statistical characteristics of an audio fingerprint for an audio file and consecutively matching the index, the audio fingerprint, and music information, and an audio fingerprint searching apparatus for generating a new index based on statistical characteristic of an audio fingerprint for a new input audio file and searching corresponding music information for the new input audio file by searching the new index from the DB group.

Description

Description
SYSTEM AND METHOD FOR SEARCHING AUDIO FINGERPRINT BY INDEX INFORMATION
Technical Field
[1] The present invention relates to an audio fingerprint search technology, and more particularly, to a system and method for searching audio fingerprint by index information to improve recognition performance and to increase a search speed by indexing audio fingerprints, searching a predetermined audio fingerprint based on the indexing, and verifying the searched audio fingerprint.
[2] This work was supported by the Information Technology (IT) Research and Development Program of MIC (the Korean Ministry of Information and Communication ) / IITA (the Korean Institute for Information Technology Advancement) [2007-S-017-01, "Development of user-centric contents protection and distribution technology"].
[3]
Background Art
[4] The object of an audio fingerprint system is to recognize a predetermined audio by receiving an audio signal and searching a corresponding audio through a previously built audio fingerprint database. According to application fields, the audio fingerprint system has been used to broadcasting monitor, CF recognition, and file filtering. In order to effectively use the audio fingerprint system in the application fields, a high recognition rate and a fast search speed are required even under various distortions. Particularly, in order to filter files in P2P or UCC fields, it is required to quickly and accurately search an audio fingerprint data formed of several hundred thousand audio files each having own copyrights. The recognition speed is one of the most important factors for real-time processing in broadcasting monitoring and file filtering fields that operate based on the large capacity audio fingerprint database.
[5] Furthermore, it is also required for the audio fingerprint system to have a high recognition performance although audio data is deformed through re-sampling, filtering, equalization, and compression as well as the fast recognition speed according to the application fields of the audio finger print system.
[6] A search method according to the related art was introduced in Korean Patent Publication No. 2003-7001489 entitled "Method for search in audio database". In the method for search in audio database, a landmark and a fingerprint are extracted, and predetermined audio data is searched using a corresponding relation of a land mark and a fingerprint. In the method, a land mark is calculated beside a fingerprint, the calculated land mark is stored as an index, and a candidate list between a landmark and a music ID using a fingerprint in a landmark position. Then, the audio is recognized based on a linear relation thereof. However, the characteristics thereof were not considered although the audio signal was searched based on the fingerprint in the method. Also, the method needed a landmark to recognize a predetermined audio as a supplementary feature.
[7] An audio search system according to the related art was introduced in Korea Patent
Publication No. 2007-0031765 entitled "Fingerprint producing method and audio fingerprinting system based on normalized spectral subband centroids". The fingerprint producing method and audio fingerprinting system generate a fingerprint based on a normalized spectrum subband centroid and searches a predetermined audio by comparing distances of fingerprints. The fingerprint producing method and audio fingerprinting system did not consider the characteristics of a fingerprint for audio search although the fingerprint producing method and audio fingerprinting system had better recognition performance than MFCC and Tonality of typical fingerprints of MP3, equalization, and random start.
[8]
Disclosure of Invention Technical Problem
[9] Accordingly, the present invention is directed to a system and method for searching audio fingerprint by index information which substantially obviates one or more problems due to limitations and disadvantages of the related art.
[10] It is an object of the present invention to provide a system and method for searching audio fingerprint using index information to improve audio recognition performance and to increase a search speed by generating an index using the statistical characteristics of audio fingerprint feature information and searching a predetermined audio using the generated index.
[H]
Technical Solution
[12] To achieve these objects and other advantages and in accordance with the purpose of the invention, as embodied and broadly described herein, there is provided a system for searching an audio fingerprint including: a DB group for generating an index based on statistical characteristics of an audio fingerprint for an audio file and consecutively matching the index, the audio fingerprint, and music information; and an audio fingerprint searching apparatus for generating a new index based on statistical characteristic of an audio fingerprint for a new input audio file and searching corresponding music information for the new input audio file by searching the new index from the DB group.
[13] In accordance with another purpose of the invention, there is provided a method for searching an audio fingerprint using index information, including the steps of: a) generating an index based on statistical characteristics of an audio fingerprint for an audio file, and preparing a DB group for storing position information that consecutively matches the generated index, audio fingerprint, and music information; b) generating an index based on statistical characteristics of an audio fingerprint for a new input audio file; and c) searching corresponding music information for the new input audio file by searching the generated index, which is generated at the step b), from the DB group.
[14]
Advantageous Effects
[15] The system and method for searching audio fingerprint by index information according to the present invention generates an index using the statistical characteristics of an audio fingerprint and searches audio fingerprint based on the generated index. Therefore, the system and method for searching audio fingerprint by index information according to the present invention can sustain a fast search time and can be applied to the filtering and monitoring of files in a large capacity database. Furthermore, the system and method for searching audio fingerprint by index information creates candidate indexes that include an index bit of a variable position in order to compensate distortion because the recognition rate is abruptly degraded due to the distortion if the index is directly used to search without the compensation. Therefore, the system and method for searching audio fingerprint by index information can improve the recognition rate by correcting error that may be generated due to the bit index.
[16]
Brief Description of the Drawings
[17] The accompanying drawings, which are included to provide a further understanding of the invention, are incorporated in and constitute a part of this application, illustrate embodiments of the invention and together with the description serve to explain the principle of the invention. In the drawings:
[18] FIG. 1 is a block diagram illustrating a system for searching audio fingerprint according to an embodiment of the present invention;
[19] FIG. 2 is a block diagram illustrating an index processor according to an embodiment of the present invention;
[20] FIG. 3 is a diagram illustrating a relation in DB files used in a system for searching an audio fingerprint according to an embodiment of the present invention; [21] FIG. 4 is a diagram illustrating probability distribution used for generating a fingerprint index;
[22] FIG. 5 is a diagram illustrating a procedure of generating a fingerprint index using a fingerprint extracted from an audio search process and searching a predetermined audio based on the generated fingerprint index;
[23] FIG. 6 is a diagram illustrating a procedure of generating a candidate index in an audio search process; and
[24] FIG. 7 is a diagram illustrating a procedure of searching a final result using the candidate index generated at an audio search process.
[25]
Best Mode for Carrying Out the Invention
[26] Reference will now be made in detail to the preferred embodiments of the present invention, examples of which are illustrated in the accompanying drawings.
[27] FIG. 1 is a block diagram illustrating a system for searching audio fingerprint according to an embodiment of the present invention.
[28] Referring to FIG. 1, the system for searching audio fingerprint according to the present embodiment includes an audio fingerprint search apparatus 1. The audio fingerprint search apparatus 1 includes a fingerprint extracting unit 11 for extracting an audio fingerprint for an audio file, a candidate index searching unit 12 for generating candidate indexes in consideration of a variable position by sorting values of the extracted fingerpting in an ascending order of absolute values of differences between the extracted audio fingerprint value and a mean value that is used when an index is generated, a fingerprint matching unit 13 for matching an audio fingerprint to the extracted audio fingerprint corresponding to a candidate index, and a result verifying unit 14 for verifying that a search result is corresponding music information if a distance between audio fingerprints is in a predetermined value range.
[29] Since an index is used to search, the system for searching a fingerprint further includes a DB group 2 for storing audio fingerprints with corresponding indexes that are matched with the audio fingerprints. In order to operate the audio fingerprint search apparatus 1 according to the present invention, it is required to build a related database first. Therefore, it is preferable to form the DB group 2 to have a fingerprint DB 21, a music-information DB 22, and a fingerprint index DB 23. Also, in order to match and store audio fingerprints with corresponding indexes, it is required to perform a preprocess for an audio file. It is preferable to further includes an index processor 3 for receiving audio files and music ID information and matching audio fingerprints to corresponding indexes in the preprocess.
[30] As described above, the system for searching audio fingerprint according to the present embodiment is divided into a DB generating area for generating fingerprint indexes and building a database thereof and a DB searching area for searching through indexing. That is, related information is stored in the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23 in a DB generating step. Fingerprint extraction, candidate search and match through indexing, and result verification are performed in a step of searching based on an index. Although the system for searching audio fingerprint according to the present embodiment is divided into two areas, it is obvious to those skilled in the art that the two areas can be performed in one area.
[31] FIG. 2 is a block diagram illustrating an index processor according to an embodiment of the present invention.
[32] Referring to FIG. 2, the index processor 3 includes a fingerprint extractor 31, a fingerprint statistical analyzer 32, a fingerprint binarizor 33, and a fingerprint indexer 34.
[33] The fingerprint extractor 31 extracts an audio fingerprint from an input audio file using a fingerprint extraction algorithm, and the fingerprint statistical analyzer 32 analyzes statistical characteristics of an audio fingerprint and calculates a probability distribution. That is, the fingerprint statistical analyzer 32 approximates the extracted audio fingerprint to a probability model by calculating a mean and audio fingerprints and fingerprint distribution. Then, the fingerprint binarizor 33 binarizes the fingerprint to have probabilistic identical distribution by analyzing the statistical characteristics of audio fingerprint, and the fingerprint indexer 34 generates an index.
[34] Here, the fingerprint extraction algorithm may be one of zero crossing rate (ZCR), energy difference, spectral flatness, mel frequency cepstral coefficients (MFCC), and frequency centroide.
[35] FIG. 3 is a diagram illustrating a relation with the structure of a DB file used in a system for searching an audio fingerprint according to an embodiment of the present invention.
[36] Referring to FIG. 3, the DB group 2 according to the present embodiment includes the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23.
[37] The fingerprint DB 21 stores fingerprint audio values and position information in music. The music information DB 22 stores information about music ID and the number of fingerprints. The fingerprint index DB 23 stores information about position in the fingerprint DB 21 according to the binarized fingerprint value.
[38] When the system for searching audio fingerprint according to the present embodiment receives a request of searching a predetermined audio file after each of the DBs stores corresponding information, the system generates candidate indexes through the indexing step shown in FIG. 2, searches the generated candidate index from the fingerprint index DB 23, and detects fingerprint position information of the candidate index. Then, the system detects fingerprint information stored in the fingerprint DB 21 corresponding to the fingerprint position information and outputs the music information stored in the music information DB 22 corresponding to the position information of music.
[39] FIG. 4 is graphs illustrating probability distribution used for generating a fingerprint index.
[40] Referring to FIG. 4, the graphs show histogram distribution of normalized frequency centroid values. The graphs clearly show the mean value is close to 0.
[41] FIG. 5 is a diagram illustrating a procedure of generating a fingerprint index using a fingerprint extracted from an audio search process and searching a predetermined audio based on the generated fingerprint index.
[42] Like in FIG. 3, three DB files are prepared from an audio file. The audio fingerprint search apparatus 1 performs a search service using the three DB files.
[43] Referring to FIG. 5, when a predetermined audio file inputs, a step of extracting an audio fingerprint, a step of calculating a candidate fingerprint based on an index obtained from the extracted fingerprint, a step of matching the extracted audio fingerprint to an audio fingerprint corresponding to the calculated candidate fingerprint index , and a step of verifying a search result using the matching result are sequentially performed as the same method used for generating a DB from the audio file. These steps will be described as follows.
[44] If an audio file is inputted, an index is generated by extracting an audio fingerprint using the same method. That is, the fingerprint extractor 11 extracts an audio fingerprint from input audio files using a fingerprint extraction algorithm, and the fingerprint statistical analyzer 32 analyzes the statistical characteristics of the audio fingerprint and calculates probability distribution having the probabilistically identical distribution. That is, the fingerprint statistical analyzer 32 approximates a fingerprint to a probabilistic model by calculating the mean and distribution of the audio fingerprints. Then, the fingerprint binarizor 33 analyzes the statistical characteristics of the audio fingerprint and binarizes the audio fingerprint to have the probabilistically identical distribution. Then, the fingerprint indexer 34 generates an index.
[45] In order to obtain a candidate fingerprint value for audio search, information about a position in the fingerprint DB 21, which has a corresponding index value of the fingerprint index DB 23, is obtained. Meanwhile, if audio is distorted, a fingerprint extracted therefrom may be also distorted. Accordingly, the index value of a fingerprint may change. The index value may vary due to noise, equalization, compression, analog-digital conversion, and digital- analog conversion. Candidate indexes are generated for the index value variation.
[46] After generating the candidate index, the fingerprint index DB 23 obtains the information about a position having the corresponding candidate index value in the fin- gerprint DB 21. Then, the result of searching corresponding music information through the steps of matching and verifying a fingerprint.
[47] FIG. 6 is a diagram illustrating a procedure of generating a candidate index in an audio search process.
[48] As shown in FIG. 6, if N dimensional fingerprint values are present, the N dimensional fingerprint values are arranged in an ascending order based on the absolute values of differences with a mean value used for generating an index. A threshold value is decided according to the probability distribution shape, a variable position is decided, and candidate indexes are generated in consideration of the variable position.
[49] FIG. 7 is a diagram illustrating a step of searching a final result using the generated candidate index in the audio search step.
[50] As shown in FIG. 7, a fingerprint value is called, which is matched with the generated candidate index value, and a distance between a target audio fingerprint to search and a position having the called fingerprint value is calculated. Then, the calculated distance is compared with the predetermined threshold value. If the minimum value is smaller than the threshold value, one result is stored. In order to provide the high reliable result, the above mentioned steps are repeatedly performed at a fingerprint in a different position. Then, the final result is outputted through verifying the result.
[51] In the present embodiment, it is assumed that an extracted audio fingerprint has a floating-point real number value and that the estimated probabilistic model of a fingerprint has a bell shaped distribution which has a mean value of '0' as shown in FIG. 4. Here, N dimensional fingerprint can be expressed N binary numbers using the mean value of '0' as shown in Equation 1.
[52] [Equation 1]
Figure imgf000009_0001
[54] The fingerprint expressed in a binary number is converted to a decimal number through Equation 2. The decimal fingerprint number is used as an index for a database. [55] [Equation 2]
[56]
Index/"i V = ∑ AFB J m] ■ 2 N"ra
[57] For example, in case of N is 16, a 16 dimensional audio fingerprint is expressed as one value between 0 to 65535, and this value is used as an index in a database.
[58] The audio fingerprint system according to the present embodiment generates three database files for audio search. As shown in FIG. 3, the three database files are formed as the fingerprint DB 21, the music information DB 22, and the fingerprint index DB 23.
[59] The fingerprint DB 21 stores extracted fingerprint values. That is, the fingerprint DB
21 stores the extracted fingerprint value as it is. The music information DB 22 stores information about music from which a fingerprint is extracted based on information provided when a fingerprint is generated. For example, the music information DB 22 may store various information such as a music ID, copyright information, a length of a fingerprint. The fingerprint index DB 23 transforms a fingerprint to an index through Equation 1 and Equation 2 and stores the fingerprint values, which are indexes, according to the position information in the fingerprint DB 21.
[60] For example, in case of 16-dimensional fingerprint, fingerprints are sequentially stored with position information as shown in FIG. 3. The music information and the fingerprint information are also stored with them. The fingerprint index is used for audio search by being stored with the position information of a fingerprint having a corresponding index value shown in FIG. 3.
[61] After preparing the three DB files from an audio file, the audio fingerprint system performs a search service using the prepared DB files. That is, if a predetermined audio file is inputted, a step of extracting an audio fingerprint using the same method used for generating the DB from the audio file, a step of calculating candidate fingerprints by calculating indexes from extracted fingerprints, a step of matching candidate fingerprints, and a step of verifying using a matching result are sequentially performed. Such steps will be described in more detail as follows.
[62] When an audio file is inputted, an audio fingerprint is extracted using the audio fingerprint extraction method, and an index is generated based on the extracted audio fingerprint.
[63] In order to obtain a candidate fingerprint value for audio search, information about a position in the fingerprint DB 21, which has a corresponding index value of the fingerprint index DB 23, is obtained. Meanwhile, if audio is distorted, a fingerprint extracted therefrom may be also distorted. Accordingly, the index value of a fingerprint may change. The index value may vary due to noise, equalization, compression, analog-digital conversion, and digital- analog conversion. Candidate indexes are generated for the index value variation.
[64] After generating the candidate indexes, position information in the fingerprint DB
21, which has a corresponding candidate index value in the fingerprint index DB 23, is obtained corresponding to the candidate indexes. Then, a search result of corresponding music information is outputted after the fingerprint matching and verifying steps.
[65] Hereinafter, the step of generating a candidate index will described in more detail. In case of N-dimensional fingerprint, an N-dimensional value is arranged in an order of closest distances from a mean value of a probabilistic distribution for generating an index, for example a mean value of '0' in the present embodiment. Here, positions having the large probability of changing according to the probabilistic distribution can be sequentially calculated. Positions in a predetermined distance range through probabilistic distribution can be selected. Also, a predetermined number of positions can be selected without any condition.
[66] [Equation 3]
[67] sortascend{| F— meanJ}
[68] If a threshold value having a bell shaped probabilistic distribution is decided as a constant number in a previously used audio fingerprint, information about positions changeable according to the fingerprint can be obtained. After the position is decided, an index is generated using the index generation method used in the step of extracting a fingerprint. In addition, all possible indexes are generated corresponding to variable positions.
[69] For example, if a 4-dimensional audio finger value is (-0.2, 0.1, 0.4, 0.2), if a value deciding an index is 0, and if the 2nd position has the large probability to change, an index thereof is 0111 and a candidate index is 0011 because the 2nd position may change.
[70] After calculating the candidate indexes as described above, candidate fingerprints are obtained from a corresponding index with reference to the position information in the fingerprint DB 21 and the obtained candidate fingerprints are arranged by comparing distances to a target audio fingerprint to search according to the position information.
[71] Herein, redundancy is removed based on values corresponding to reference positions as a reference, and distances from K predetermined audio fingerprints having a predetermined length to a fingerprint value of a fingerprint DB 21 are calculated. For example, a Euclidian distance is calculated, and the calculated Euclidian distance is compared with a threshold value. If the calculated is smaller than the threshold value, music information is searched in the music information DB 22 and the search result is outputted. If not, a basic information value indicating that music is not searched is outputted as a result. The above mentioned steps were well described with reference to FIG. 7. The result value is formed of Music ID denoting music information in a data base, Position denoting a temporal position, and Distance denoting distance difference that is reliability as shown in Equation 4.
[72] [Equation 4]
[73] Rf ny = { MusicID[nj, Posit ion[nj,Di stance/" n]}
[74] When the value of Music ID is smaller than a threshold value, the value of Music ID is calculated using position information in the fingerprint index DB 23. Since the music information DB 22 stores the number of fingerprints of each music, the value of a position is larger than the sum of the fingerprint numbers up to (m-l)th music if the m th Music ID is a result. Also, the value of a position is smaller than the sum of the fingerprint numbers of music up to the (m+l)"1 music. Using this fact, the value of Music ID is calculated.
[75] [Equation 5]
[76] m-l m+1
∑ feat_num [k] < pos i t i on/^ny < ^ feat_num [kj k= l k = l
[77] A general system may perform the searching step several times in order to improve the reliability of search. After calculating a candidate index at a predetermined position, a candidate index may be searched again at another position, and a step of searching a candidate fingerprint is performed repeatedly so as to obtain a result.
[78] Such a result is decided based on a parameter value that is selected by a system, and the search results are stored as many as the number of times of performing the searching step. The stored results are outputted as a final searching result after the verification step.
[79] In the verification step, fingerprints extracted from a predetermined audio signal to search are consecutive values in a time domain. That is, fingerprints are sequentially extracted in time. Similarly, the fingerprints are sequentially extracted and stored in time at the generated fingerprint DB 21.
[80] That is, a result of searching using a fingerprint at a predetermined position and a result of searching using a fingerprint at a next position have the temporally identical distance difference. Based on this fact, the searching result is verified using Equation 6.
[81] [Equation 6]
[82] i f Musi cID/n + p7 = Musi cID/ny, p - 1 < Pos i t ion [n + p] - Pos i t ion [n] < p + 1
[83] That is, if searching results of p positions are the same, the difference of position information must be larger than p-1 and smaller than p+1. Based on this fact, repeatedly obtained results are verified. If the condition is satisfied, the results are outputted as a final result.
[84] A test database is generated for 27,000 audio files each having a length of 40 seconds for verification. Then, 100 audio files, which are compressed to 32kbps-MP3 audio files each having a length of 20 seconds, are searched from the test database. A fingerprint described in the present embodiment is extracted for 16-dimensions and the extracted fingerprint is used. In order to compare distances, 52 fingerprints are used. Also, the search step is performed five times for verification. In order to compare performances, a sequential search is performed under the same condition. In the sequential search, all fingerprints are searched in a DB and a result having a minimum value by comparing distances is determined as a final result.
[85] Table 1 [Table 1] [Table ]
Figure imgf000013_0001
[86] As shown in Table 1, the search speed of the index bases search according to the present embodiment is much faster than that of the sequential search. Also, the deterioration of the recognition rate for the bit index value can be overcome by adjusting a parameter value that decides a candidate index although the total search time extends little bit.
[87] That is, the recognition rate can be improved from 87% to 96% although the total search time extends as long as about 10 seconds. The search time can be reduced up to 1/9 of the total search time of the sequential search.
[88] The system and method for searching audio fingerprint by index information according to the present embodiment can be applied to the filtering and monitoring of files in a large capacity database based on the fast search time and the high recognition rate. Particularly, the system and method for searching audio fingerprint by index information according to the present embodiment can be applied to the file filtering to solve the copyright problem in a user created content (UCC) field or a P2P field. [89] It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention. Thus, it is intended that the present invention covers the modifications and variations of this invention provided they come within the scope of the appended claims and their equivalents.

Claims

Claims
[1] A system for searching an audio fingerprint comprising: a DB group for generating an index based on statistical characteristics of an audio fingerprint for an audio file and consecutively matching the index, the audio fingerprint, and music information; and an audio fingerprint searching apparatus for generating a new index based on statistical characteristic of an audio fingerprint for a new input audio file and searching corresponding music information for the new input audio file by searching the new index from the DB group.
[2] The system of claim 1, wherein the DB group includes: a fingerprint DB for storing audio fingerprints for the audio file and position information of the music information; a music information DB for storing music ID for the music information and information about the number of fingerprints; and a fingerprint index DB for storing information about audio fingerprint position in the fingerprint DB corresponding to the index.
[3] The system of claim 2, further comprising an index processor for transferring audio fingerprints and music information, which are extracted by extracting an audio fingerprint from an audio file, to a corresponding DB.
[4] The system of claim 3, wherein the index processor includes: a fingerprint extractor for extracting an audio fingerprint using a fingerprint extraction algorithm; a fingerprint statistic analyzer for approximating a fingerprint to a probabilistic model by obtaining a mean and distribution of audio fingerprints for the extracted audio fingerprint; a fingerprint binarizor for by analyzing statistical characteristics of the audio fingerprint and performing binarization to have a probabilistically identical distribution; and a fingerprint indexer for matching the binarization result to an index.
[5] The system of claim 4, wherein the fingerprint extraction algorithm is one of
Zero Crossing Rate (ZCR), Energy Difference, Spectral flatness, Mel Frequency Cepstral Coefficients (MFCC), and Frequency Centroids.
[6] The system of claim 4, wherein the binarization is performed based on a mean value among the statistical characteristics.
[7] The system of claim 1, wherein the audio fingerprint searching apparatus includes: a fingerprint extractor for extracting an audio fingerprint for the new audio file; a candidate index searching unit for generating a candidate index by sorting values of the extracted audio fingerprint in an ascending order of absolute values of differences between the extracted audio fingerprint value and a mean value, which is used for generating the new index, and in consideration of a variable position; a fingerprint matching unit for matching an audio fingerprint to the extracted audio fingerprint corresponding to the candidate index; and a result verifying unit for measuring a distance between the audio fingerprints and verifying a result through time information if the measured distance is in a predetermined value range.
[8] The system of claim 7, wherein the variable position is decided by a threshold value setting in a probabilistic distribution shape.
[9] A method for searching an audio fingerprint using index information, comprising the steps of: a) generating an index based on statistical characteristics of an audio fingerprint for an audio file, and preparing a DB group for storing position information that consecutively matches the generated index, audio fingerprint, and music information; b) generating an index based on statistical characteristics of an audio fingerprint for a new input audio file; and c) searching corresponding music information for the new input audio file by searching the generated index, which is generated at the step b), from the DB group.
[10] The method of claim 9, wherein the index generation in the steps a) and b) includes the steps of: extracting an audio fingerprint using a fingerprint extraction algorithm; approximating a fingerprint to a probabilistic model by calculating a mean value and distribution of an audio fingerprint for the extracted audio fingerprint; and generating an index by analyzing the statistical characteristics of the audio fingerprint and performing binarization to have a probabilistically identical distribution.
[11] The method of claim 9, wherein the step a) includes the steps of: storing an audio fingerprint for the audio file and position information for the music information at a fingerprint DB; storing an Music ID that is an unique ID for the music information and information including the number of fingerprints in a music information DB; and storing information about audio fingerprint position in a fingerprint DB corresponding to the index at a fingerprint index DB.
[12] The method of claim 9, wherein the step c) includes the steps of: generating a candidate index by sorting values of the extracted audio fingerprint in an ascending order of absolute values of differences between the extracted audio fingerprint value and a mean value used for generating the index in the step b) and in consideration of a variable position; matching an audio fingerprint corresponding to the candidate index to the extracted audio fingerprint; and verifying a result through time information by measuring a distance between the audio fingerprints and the measured distance is in a predetermined value range.
[13] The method of claim 12, wherein a candidate index is generated after absolute values of differences with a mean value, which is used for generating the index, are sorted in an ascending order and a bit value of a corresponding position changes by deciding a position of a dimension close to a mean value according to a threshold value.
[14] The method of claim 13, wherein Euclidian distances between fingerprint position information calculated from the candidate index and a fingerprint obtained from the new input audio file are calculated as many as a predetermined number with reference to a fingerprint DB, and music information having a minimum distance is searched.
[15] The method of claim 14, wherein music information of a fingerprint position having a minimum distance is outputted as a result if the measured distance is in a threshold value range in the candidate index.
[16] The method of claim 15, wherein the music information search is outputted as a final result using fingerprint of other positions if a distance difference between fingerprints in the new input audio file is identical to position information difference of a result.
PCT/KR2008/002085 2007-04-17 2008-04-14 System and method for searching audio fingerprint by index information WO2008127052A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2008800126394A CN101663708B (en) 2007-04-17 2008-04-14 System and method for searching audio fingerprint by index information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR1020070037399A KR100862616B1 (en) 2007-04-17 2007-04-17 Searching system and method of audio fingerprint by index information
KR10-2007-0037399 2007-04-17

Publications (1)

Publication Number Publication Date
WO2008127052A1 true WO2008127052A1 (en) 2008-10-23

Family

ID=39864101

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2008/002085 WO2008127052A1 (en) 2007-04-17 2008-04-14 System and method for searching audio fingerprint by index information

Country Status (3)

Country Link
KR (1) KR100862616B1 (en)
CN (1) CN101663708B (en)
WO (1) WO2008127052A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2487795A (en) * 2011-02-07 2012-08-08 Slowink Ltd Indexing media files based on frequency content
CN103995890A (en) * 2014-05-30 2014-08-20 杭州智屏软件有限公司 Method for updating and searching for data of real-time audio fingerprint search library
US9558272B2 (en) 2014-08-14 2017-01-31 Yandex Europe Ag Method of and a system for matching audio tracks using chromaprints with a fast candidate selection routine
US9881083B2 (en) 2014-08-14 2018-01-30 Yandex Europe Ag Method of and a system for indexing audio tracks using chromaprints
CN110322886A (en) * 2018-03-29 2019-10-11 北京字节跳动网络技术有限公司 A kind of audio-frequency fingerprint extracting method and device
EP2795913B1 (en) * 2011-12-20 2019-11-27 Oath Inc. Audio fingerprint for content identification
US20220019697A1 (en) * 2020-07-16 2022-01-20 Humanscape Inc. System for embedding digital verification fingerprint and method thereof
EP2638516B1 (en) * 2010-11-12 2024-03-13 Google LLC Syndication including melody recognition and opt out

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101833579B (en) * 2010-05-11 2012-09-05 同方知网(北京)技术有限公司 Method and system for automatically detecting academic misconduct literature
US8584197B2 (en) * 2010-11-12 2013-11-12 Google Inc. Media rights management using melody identification
CN102314875B (en) * 2011-08-01 2016-04-27 北京音之邦文化科技有限公司 Audio file identification method and device
CN103179430A (en) * 2011-12-20 2013-06-26 中国电信股份有限公司 Method, device and server for audio and video content transcoding on basis of cloud computing
US9552607B2 (en) 2012-03-21 2017-01-24 Beatport, LLC Systems and methods for selling sounds
CN105138541B (en) * 2015-07-08 2018-02-06 广州酷狗计算机科技有限公司 The method and apparatus of audio-frequency fingerprint matching inquiry
KR101661666B1 (en) * 2015-11-20 2016-09-30 광운대학교 산학협력단 Hybrid audio fingerprinting apparatus and method
KR102037221B1 (en) 2017-11-06 2019-10-29 주식회사 아이티밥 Audio finger print matching method
KR102037220B1 (en) 2017-11-06 2019-10-29 주식회사 아이티밥 Audio finger print matching system
CN113536026B (en) * 2020-04-13 2024-01-23 阿里巴巴集团控股有限公司 Audio searching method, device and equipment

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003007185A1 (en) * 2001-07-10 2003-01-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method and device for producing a fingerprint and method and device for identifying an audio signal
KR20050061594A (en) * 2002-11-01 2005-06-22 코닌클리케 필립스 일렉트로닉스 엔.브이. Improved audio data fingerprint searching
WO2005101243A1 (en) * 2004-04-13 2005-10-27 Matsushita Electric Industrial Co. Ltd. Method and apparatus for identifying audio such as music

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19930518C1 (en) * 1999-07-05 2000-10-12 Thyssenkrupp Stahl Ag Production of a non grain-oriented electric sheet used as core material in motors and generators comprises producing a hot strip from a steel pre-material, hot rolling and spooling
KR100473163B1 (en) * 2002-01-15 2005-03-08 주식회사 에듀미디어텍 A storage medium storing multimedia contents and apparatus and method for reproducing the same

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2003007185A1 (en) * 2001-07-10 2003-01-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Method and device for producing a fingerprint and method and device for identifying an audio signal
KR20050061594A (en) * 2002-11-01 2005-06-22 코닌클리케 필립스 일렉트로닉스 엔.브이. Improved audio data fingerprint searching
WO2005101243A1 (en) * 2004-04-13 2005-10-27 Matsushita Electric Industrial Co. Ltd. Method and apparatus for identifying audio such as music

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2638516B1 (en) * 2010-11-12 2024-03-13 Google LLC Syndication including melody recognition and opt out
GB2487795A (en) * 2011-02-07 2012-08-08 Slowink Ltd Indexing media files based on frequency content
EP2795913B1 (en) * 2011-12-20 2019-11-27 Oath Inc. Audio fingerprint for content identification
CN103995890A (en) * 2014-05-30 2014-08-20 杭州智屏软件有限公司 Method for updating and searching for data of real-time audio fingerprint search library
US9558272B2 (en) 2014-08-14 2017-01-31 Yandex Europe Ag Method of and a system for matching audio tracks using chromaprints with a fast candidate selection routine
US9881083B2 (en) 2014-08-14 2018-01-30 Yandex Europe Ag Method of and a system for indexing audio tracks using chromaprints
CN110322886A (en) * 2018-03-29 2019-10-11 北京字节跳动网络技术有限公司 A kind of audio-frequency fingerprint extracting method and device
US20220019697A1 (en) * 2020-07-16 2022-01-20 Humanscape Inc. System for embedding digital verification fingerprint and method thereof
US11836274B2 (en) * 2020-07-16 2023-12-05 Humanscape Inc. System for embedding digital verification fingerprint and method thereof

Also Published As

Publication number Publication date
CN101663708A (en) 2010-03-03
KR100862616B1 (en) 2008-10-09
CN101663708B (en) 2012-10-10

Similar Documents

Publication Publication Date Title
WO2008127052A1 (en) System and method for searching audio fingerprint by index information
AU2020200997B2 (en) Optimization of audio fingerprint search
KR100838674B1 (en) Audio fingerprinting system and method
EP2580750B1 (en) System and method for audio media recognition
EP2659480B1 (en) Repetition detection in media data
EP2791935B1 (en) Low complexity repetition detection in media data
JP2013077025A (en) Method for deriving set of feature on audio input signal
KR100733145B1 (en) Fingerprint Producing Method and Audio Fingerprinting System Based on Normalized Spectral Subband Centroids
WO2005101243A1 (en) Method and apparatus for identifying audio such as music
WO2016189307A1 (en) Audio identification method
Du et al. Large-scale signature matching using multi-stage hashing
CN112967734B (en) Music data identification method, device, equipment and storage medium based on multiple sound parts
CN113420178A (en) Data processing method and equipment
CN113515662A (en) Similar song retrieval method, device, equipment and storage medium
CN113066512A (en) Buddhism music recognition method, device, equipment and storage medium
US20220335082A1 (en) Method for audio track data retrieval, method for identifying audio clip, and mobile device
Seo Pairwise Similarity Normalization Based on a Hubness Score for Improving Cover Song Retrieval Accuracy
CN115691553A (en) Copyright identification method for video background music
Gramaglia A binary auditory words model for audio content identification
Yoon et al. Automatic classification of western music in digital library
Yoon et al. Robust music information retrieval on mobile network based on multi-feature clustering
LUSARDI Robust cover identification approach based on local spectrogram and chromagram image descriptors
Yoon et al. Robust music information retrieval in mobile environment
NZ722874B2 (en) Optimization of audio fingerprint search
Park Classification of Audio Data Using a Centroid Neural Network

Legal Events

Date Code Title Description
WWE Wipo information: entry into national phase

Ref document number: 200880012639.4

Country of ref document: CN

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 08741330

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 08741330

Country of ref document: EP

Kind code of ref document: A1