CN103995890A - Method for updating and searching for data of real-time audio fingerprint search library - Google Patents

Publication number
CN103995890A
Authority
CN
China
Prior art keywords
data
real
time
fingerprint
audio fingerprint
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410244730.6A
Other languages
Chinese (zh)
Inventor
王磊
邱戴飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HANGZHOU SMART SCREEN SOFTWARE CO Ltd
Original Assignee
HANGZHOU SMART SCREEN SOFTWARE CO Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HANGZHOU SMART SCREEN SOFTWARE CO Ltd
Priority to CN201410244730.6A
Publication of CN103995890A

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61 Indexing; Data structures therefor; Storage structures
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Abstract

The invention discloses a method for updating and searching data in a real-time audio fingerprint search library. The library receives multiple audio fingerprint streams in real time, and each stream carries paired audio fingerprints and reference (parent) meta-information. The library stores data in a circular queue that has a head pointer, a tail pointer, a maximum capacity, and a current item count. Data are added at the head of the queue, and the head pointer moves forward each time data are added successfully. The data between the tail pointer and the head pointer are the currently valid data, and a search reads data starting from the tail of the queue. The data of each fingerprint stream are assigned an expiration time when they are stored, and stale data are invalidated by moving the tail pointer. The method keeps the audio fingerprint data up to date in real time, gives all audio fingerprint data a validity period, and automatically deletes expired audio fingerprint data.

Description

A method for updating and searching data in a real-time audio fingerprint search library
Technical field
The present invention relates to the field of audio fingerprint search, and in particular to a method for updating and searching data in a real-time audio fingerprint search library.
Background
An audio fingerprint is a set of hash values computed from an audio signal. Audio fingerprinting is currently widely used in fields such as song search and piracy detection. With the development of the mobile Internet, it is also being applied to real-time interaction with television programs. An audio fingerprint search library is the store that holds the reference (parent) audio fingerprints and their corresponding source information.
A song search library holds a large volume of data that is rarely updated once it has been loaded. It is an offline library, and its main concerns are system memory consumption and search speed. A television program search library holds a relatively small volume of data, but because television is broadcast live, the database must be updated in real time, and each fingerprint segment needs a limited lifetime after which it expires automatically. It is an online library. Compared with an offline library, an online library must address not only memory and search speed but also real-time behavior, a fingerprint timeout mechanism, and the timestamp overflow problem.
Summary of the invention
To solve the problem of storing and searching audio fingerprints in an online library, the present invention provides a method for updating and searching data in a real-time audio fingerprint search library that offers real-time behavior, a fingerprint validity period, and deletion of expired fingerprints.
To achieve this object, the present invention adopts the following technical solution: a method for updating and searching data in a real-time audio fingerprint search library, in which the library receives multiple audio fingerprint streams in real time, each stream contains paired audio fingerprints and reference meta-information, and the library stores data in a circular queue. The circular queue has a head pointer, a tail pointer, a maximum capacity, and a current item count. Data are added at the head of the queue, and the head pointer moves forward each time data are added successfully. The data between the tail pointer and the head pointer are the currently valid data, and a search reads data starting from the tail of the queue.
In this scheme, the audio fingerprint search library receives multiple audio fingerprint streams in real time, and each stream contains paired (key, value) data. The key is the audio fingerprint, and the value contains the reference meta-information (for example, an index identifying the channel name or the program content). The library stores the meta-information in a hash table, which gives a lookup complexity close to O(1). Each key corresponds to one queue, which holds the meta-information of the audio fingerprints that share that hash value. Because entries in the fingerprint library expire automatically, each queue is a circular queue with a head pointer, a tail pointer, a maximum capacity, and a current item count. Data are added at the head of the queue, the head pointer moves forward each time data are added successfully, the data between the tail pointer and the head pointer are the currently valid data, and a search reads data starting from the tail of the queue.
In the fingerprint search library, real-time updates of fingerprint data and searches run concurrently. Guaranteeing strong consistency between inserts and reads would require locking, which in a system that is updated in real time would inevitably degrade search performance. In this scheme, fingerprint data are added at the head of the queue and expire at the tail, and reads start from the tail. The probability of consecutive updates hitting the same queue during insertion is low, and both insert and search operations are very fast, so in practice inserts and searches are performed without locks. Because an audio fingerprint stream carries a large number of fingerprints per second, this approach does not cause a noticeable loss of accuracy.
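The storage structure described above can be illustrated with a minimal Python sketch. It is not the patent's implementation: the names `RingQueue`, `FingerprintIndex`, and `queue_capacity` are hypothetical, fingerprints are modeled as plain integers, and overwriting the oldest entry when a queue is full is an assumption, since the text does not say what happens at maximum capacity. The sketch is single-threaded and does not model the lock-free concurrent access.

```python
from typing import Any, Dict, List, Optional


class RingQueue:
    """Fixed-capacity circular queue with head/tail pointers, capacity and count."""

    def __init__(self, capacity: int) -> None:
        self.capacity = capacity
        self.slots: List[Optional[Any]] = [None] * capacity
        self.head = 0   # index where the next item will be written
        self.tail = 0   # index of the oldest valid item
        self.count = 0  # number of currently valid items

    def push(self, item: Any) -> None:
        """Add at the head and move the head pointer forward."""
        if self.count == self.capacity:
            # Assumption: when full, drop the oldest item by advancing the tail.
            self.tail = (self.tail + 1) % self.capacity
            self.count -= 1
        self.slots[self.head] = item
        self.head = (self.head + 1) % self.capacity
        self.count += 1

    def items(self) -> List[Any]:
        """Return the valid items, reading from the tail toward the head."""
        return [self.slots[(self.tail + i) % self.capacity]
                for i in range(self.count)]


class FingerprintIndex:
    """Hash table mapping a fingerprint (key) to a queue of meta-information."""

    def __init__(self, queue_capacity: int = 4) -> None:
        self.queue_capacity = queue_capacity
        self.table: Dict[int, RingQueue] = {}

    def insert(self, fingerprint: int, meta: Any) -> None:
        queue = self.table.get(fingerprint)
        if queue is None:
            queue = RingQueue(self.queue_capacity)
            self.table[fingerprint] = queue
        queue.push(meta)

    def lookup(self, fingerprint: int) -> List[Any]:
        queue = self.table.get(fingerprint)
        return queue.items() if queue is not None else []
```

Looking up a fingerprint is a single dictionary access followed by a walk over at most `queue_capacity` entries, which is where the near-O(1) search cost comes from.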
Preferably, the data of each fingerprint stream are assigned an expiration time when they are stored, and stale data are invalidated by moving the tail pointer. To make fingerprints expire, each time audio fingerprints are inserted the library first deletes expired fingerprints starting from the tail of the queue, according to the expiration time configured for that audio fingerprint stream. This deletion does not actually erase any data; it only requires moving the tail pointer.
In an offline audio fingerprint library, every audio file has a finite length, from a few minutes to several hours. In a real-time streaming system, however, an audio fingerprint stream is in theory endless, while the number of bits used to represent time information in the stream is finite. A real-time audio fingerprint library therefore has to handle time overflow: both the expiry check and fingerprint retrieval must account for the timestamp jump caused by the time field overflowing. If this case is ignored, the system becomes unavailable every time the time field overflows.
A real-time audio fingerprint search library needs an expiry mechanism for fingerprint data in order to support real-time applications and to ensure that the capacity of the library does not keep growing, and its performance does not degrade, as time passes.
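One possible way to combine tail-pointer expiry with overflow-safe timestamps is sketched below. The text does not give the timestamp width or the comparison rule, so the 32-bit width (`TS_BITS`), the modular `elapsed` function, and the class name `ExpiringRing` are all assumptions made for illustration. The modular difference is only correct while the true age of an entry is smaller than the timestamp range.

```python
from typing import List, Optional, Tuple

TS_BITS = 32            # assumed width of the timestamp field
TS_MOD = 1 << TS_BITS   # timestamps wrap around at this value


def elapsed(now: int, then: int) -> int:
    """Age of an entry, computed modulo the timestamp range so a wrap is harmless."""
    return (now - then) % TS_MOD


class ExpiringRing:
    """Circular queue whose stale entries are dropped by moving the tail pointer."""

    def __init__(self, capacity: int, ttl: int) -> None:
        self.capacity = capacity
        self.ttl = ttl  # expiration time configured for this stream
        self.slots: List[Optional[Tuple[int, str]]] = [None] * capacity
        self.head = 0
        self.tail = 0
        self.count = 0

    def expire(self, now: int) -> None:
        """Invalidate stale entries from the tail; no slot is actually erased."""
        while self.count > 0:
            ts, _meta = self.slots[self.tail]
            if elapsed(now, ts) <= self.ttl:
                break
            self.tail = (self.tail + 1) % self.capacity
            self.count -= 1

    def push(self, ts: int, meta: str) -> None:
        """Expire from the tail first, then add the new entry at the head."""
        self.expire(ts)
        if self.count == self.capacity:
            # Assumption: when full, drop the oldest entry.
            self.tail = (self.tail + 1) % self.capacity
            self.count -= 1
        self.slots[self.head] = (ts, meta)
        self.head = (self.head + 1) % self.capacity
        self.count += 1

    def valid(self) -> List[str]:
        """Meta-information of the entries between the tail and the head."""
        return [self.slots[(self.tail + i) % self.capacity][1]
                for i in range(self.count)]
```

With a naive comparison such as `now - ts > ttl`, an entry stamped just before the wrap would have a negative age after the wrap and would never expire; the modular difference avoids that case.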
Preferably, when searching for an audio fingerprint, the library checks whether each piece of reference meta-information it obtains is within its validity period, and returns only the search results that are within the validity period. Expiring data only when new fingerprint data arrive cannot guarantee that all data in a queue are within the validity period: if a fingerprint stream fails and no updates arrive in time, the data in its queues are never expired. To solve this problem, during a search the fingerprint library checks whether each piece of meta-information obtained is within its validity period and returns only the results that are.
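The search-time validity check can be sketched as follows. This is an illustrative simplification, not the patent's code: a plain Python list stands in for each circular queue, timestamps are assumed not to wrap, and the entry layout `(stream_id, timestamp, meta)` and the `ttl_by_stream` mapping are hypothetical names introduced here.

```python
from typing import Dict, List, Tuple

# Hypothetical entry layout: (stream id, timestamp, meta-information).
Entry = Tuple[str, int, str]


def search(index: Dict[int, List[Entry]],
           ttl_by_stream: Dict[str, int],
           fingerprint: int,
           now: int) -> List[str]:
    """Return meta-information only for entries still within their validity period."""
    results: List[str] = []
    for stream_id, ts, meta in index.get(fingerprint, []):
        ttl = ttl_by_stream[stream_id]  # expiration time configured per stream
        if 0 <= now - ts <= ttl:
            results.append(meta)
    return results


# Stream "ch1" stalled at t=100, so insert-time expiry never removed its entry.
index = {0xABC: [("ch1", 100, "news"), ("ch2", 195, "sports")]}
ttl_by_stream = {"ch1": 10, "ch2": 10}
print(search(index, ttl_by_stream, 0xABC, now=200))
```

Here the entry from the stalled stream is still physically present in the queue, but the check at search time keeps it out of the results.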
The present invention therefore has the following beneficial effects: 1) it keeps the audio fingerprint data up to date in real time; 2) all audio fingerprint data have a validity period; 3) expired audio fingerprint data are deleted automatically.
Embodiment
The present invention is further described below.
A method for updating and searching data in a real-time audio fingerprint search library: the library receives multiple audio fingerprint streams in real time, and each stream contains paired audio fingerprints and reference meta-information. The library stores data in a circular queue that has a head pointer, a tail pointer, a maximum capacity, and a current item count. Data are added at the head of the queue, and the head pointer moves forward each time data are added successfully. The data between the tail pointer and the head pointer are the currently valid data, and a search reads data starting from the tail of the queue. The data of each fingerprint stream are assigned an expiration time when they are stored, and stale data are invalidated by moving the tail pointer. When searching for an audio fingerprint, the library checks whether each piece of reference meta-information it obtains is within its validity period and returns only the search results that are within the validity period.

Claims (3)

1. A method for updating and searching data in a real-time audio fingerprint search library, characterized in that the real-time audio fingerprint search library receives multiple audio fingerprint streams in real time; each fingerprint stream contains paired audio fingerprints and reference meta-information; the library stores data in a circular queue; the circular queue comprises a head pointer, a tail pointer, a maximum capacity, and current item count information; data are added at the head of the queue, and the head pointer moves forward each time data are added successfully; the data between the tail pointer and the head pointer are set as the currently valid data; and a search reads data starting from the tail of the queue.
2. The method for updating and searching data in a real-time audio fingerprint search library according to claim 1, characterized in that the data of each fingerprint stream are assigned an expiration time when they are stored, and stale data are invalidated by moving the tail pointer.
3. The method for updating and searching data in a real-time audio fingerprint search library according to claim 1, characterized in that, when searching for an audio fingerprint, the library checks whether each piece of reference meta-information it obtains is within its validity period, and the library returns the search results that are within the validity period.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410244730.6A CN103995890A (en) 2014-05-30 2014-05-30 Method for updating and searching for data of real-time audio fingerprint search library

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410244730.6A CN103995890A (en) 2014-05-30 2014-05-30 Method for updating and searching for data of real-time audio fingerprint search library

Publications (1)

Publication Number Publication Date
CN103995890A true CN103995890A (en) 2014-08-20

Family

ID=51310055

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410244730.6A Pending CN103995890A (en) 2014-05-30 2014-05-30 Method for updating and searching for data of real-time audio fingerprint search library

Country Status (1)

Country Link
CN (1) CN103995890A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101055591A (en) * 2007-05-25 2007-10-17 中兴通讯股份有限公司 Data access method for all-memory database
CN101105786A (en) * 2006-07-14 2008-01-16 中兴通讯股份有限公司 Double CPU communication method based on shared memory
WO2008127052A1 (en) * 2007-04-17 2008-10-23 Electronics And Telecommunications Research Institute System and method for searching audio fingerprint by index information
CN101673262A (en) * 2008-09-12 2010-03-17 未序网络科技(上海)有限公司 Method for searching audio content
CN102289518A (en) * 2011-09-13 2011-12-21 盛乐信息技术(上海)有限公司 Method and system for updating audio fingerprint search library
CN102567428A (en) * 2010-12-30 2012-07-11 中国移动通信集团浙江有限公司 Method and device for managing life cycle of online data
CN103152423A (en) * 2013-03-15 2013-06-12 安科智慧城市技术(中国)有限公司 Cloud storage system and data access method thereof
CN103440330A (en) * 2013-09-03 2013-12-11 网易(杭州)网络有限公司 Music program information acquisition method and equipment

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101740271B1 (en) 2015-02-13 2017-05-26 레이 왕 Method and device for constructing on-line real-time updating of massive audio fingerprint database
CN105224658A (en) * 2015-09-30 2016-01-06 北京京东尚科信息技术有限公司 A kind of Query method in real time of large data and system
CN105224658B (en) * 2015-09-30 2018-11-30 北京京东尚科信息技术有限公司 A kind of Query method in real time and system of big data
CN107731220A (en) * 2017-10-18 2018-02-23 北京达佳互联信息技术有限公司 Audio identification methods, device and server
WO2019076313A1 (en) * 2017-10-18 2019-04-25 北京达佳互联信息技术有限公司 Audio recognition method, device and server
US11482242B2 (en) 2017-10-18 2022-10-25 Beijing Dajia Internet Information Technology Co., Ltd. Audio recognition method, device and server
CN109657093A (en) * 2018-11-27 2019-04-19 腾讯音乐娱乐科技(深圳)有限公司 Audio search method, device and storage medium
CN111309728A (en) * 2020-02-07 2020-06-19 联想(北京)有限公司 Flow statistical method, device, system and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140820