CN103995890A - Method for updating and searching for data of real-time audio fingerprint search library - Google Patents
Method for updating and searching for data of real-time audio fingerprint search library Download PDFInfo
- Publication number
- CN103995890A CN103995890A CN201410244730.6A CN201410244730A CN103995890A CN 103995890 A CN103995890 A CN 103995890A CN 201410244730 A CN201410244730 A CN 201410244730A CN 103995890 A CN103995890 A CN 103995890A
- Authority
- CN
- China
- Prior art keywords
- data
- real
- time
- fingerprint
- audio fingerprint
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/61—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
Abstract
The invention discloses a method for updating and searching for data of a real-time audio fingerprint search library. The real-time audio fingerprint search library can receive multiple paths of audio fingerprint streams in real time, each fingerprint stream in the multiple paths of audio fingerprint streams comprises paired audio fingerprints and female metainformation, an annular queue is adopted by the real-time audio fingerprint search library to store the data, the annular queue comprises a head pointer, a tail pointer, maximum capacity and current number information, the data are added from the head portion of the queue, the head pointer moves forwards every time the data are added successfully, the data between the tail pointer and the head pointer are set to be current valid data, the data are started to be acquired from the tail of the queue in the search process, each fingerprint stream in the multiple paths of audio fingerprint streams has expiration time when the data are stored, and stale data are invalid by moving the tail pointer. According to the method, the real-time performance of the audio fingerprint data is guaranteed, all the audio fingerprint data have the validity periods, and the function of automatically deleting the overtime audio fingerprint data is achieved.
Description
Technical field
The present invention relates to audio-frequency fingerprint searching field, especially relate to a kind of real-time audio fingerprint search library Data Update and search method.
Background technology
Audio-frequency fingerprint (audio fingerprint) is the one group of cryptographic hash calculating according to sound signal, current audio fingerprint technique is applied to song search widely, the fields such as pirate detection, along with the development of mobile Internet, audio fingerprint techniques is also applied to TV programme carries out in real-time interactive, audio fingerprint search library is for preserving the storehouse of maternal audio-frequency fingerprint and corresponding raw information, the feature of song search search library is that data volume is large, once having put in storage substantially not can frequent updating, belong to off-line storehouse, what pay close attention to is Installed System Memory consumption and retrieval rate, the feature in TV program retrieving storehouse is that data volume is relatively little, but due to the feature of live telecast, database needs real-time update, and every section of fingerprint need to have certain life span, having spent this time ceases to be in force automatically, belong to online storehouse, with respect to off-line storehouse, online storehouse is except needs are paid close attention to internal memory and retrieval rate, also need to pay close attention to real-time, fingerprint timeout mechanism, and settling time stab overflow problem.
Summary of the invention
The present invention is in order to solve the problem of the audio-frequency fingerprint storage of online storehouse and retrieval, a kind of real-time audio fingerprint search library Data Update and search method that possesses real-time, the fingerprint term of validity, overtime fingerprint delete function to be provided.
To achieve these goals, the present invention is by the following technical solutions: a kind of real-time audio fingerprint search library Data Update and search method, described real-time audio fingerprint search library can receive multichannel voice frequency fingerprint stream in real time, in described each fingerprint stream of multichannel voice frequency fingerprint stream, all comprise paired audio-frequency fingerprint, maternal metamessage, described real-time audio fingerprint search library adopts circle queue storage data, described circle queue comprises head pointer, tail pointer, max cap., current quantity information, described data are added from the head of queue, successfully add data at every turn, described head pointer is toward reach, described tail pointer and the data between head pointer are made as current valid data, during retrieval, from tail of the queue, start to obtain data.This programme audio fingerprint search library receives multichannel voice frequency fingerprint stream in real time, in each fingerprint stream, comprise paired (key, value) data, key is wherein exactly audio-frequency fingerprint, value comprises maternal metamessage (for example, for identifying the index of channel name or programme content), in audio-frequency fingerprint storehouse, mode with hash table is stored metamessage data, the mode of hash can provide the retrieval complexity that approaches 0 (1), the corresponding queue of each key, deposit the metamessage corresponding to audio-frequency fingerprint of identical hash value, because fingerprint base has expired feature automatically, so adopt circle queue to store, queue has head pointer, tail pointer, max cap., and the information such as current quantity, data are added from the head of queue, successfully add data at every turn, head pointer is toward reach, between tail pointer and head pointer, be current valid data, during retrieval, from tail of the queue, start to obtain data, in fingerprint search library, the real-time update of finger print data and concurrent the carrying out of retrieval, if needed, guarantee to insert, the strong consistency reading, need to add latching operation, system for real-time update will inevitably cause the performance of retrieval low like this, because finger print data in this programme is to upgrade from queue head, afterbody is overtime, while obtaining, from tail of the queue, start to obtain, the probability upgrading in same queue continuously during warehouse-in is lower, and warehouse-in and search operaqtion are all the operations that speed is a lot, so put in storage in practice, search operaqtion does not lock, owing to there being a large amount of finger print datas p.s. in audio fingerprint stream, so this processing mode can not cause obvious precise decreasing.
As preferably, the described every road of multichannel voice frequency fingerprint stream fingerprint stream data are equipped with expired time while depositing in, by described tail pointer, moved stale data was lost efficacy.In order to realize fingerprint base expire, in audio-frequency fingerprint warehouse-in, the expired time that can first arrange according to every road audio fingerprint stream, starts expired fingerprint to delete from tail of the queue, the deletion here does not need real deletion data operation, only needs mobile rear pointer once; In the audio-frequency fingerprint storehouse of off-line, the length of each audio file is limited, a few minutes at least, several hours at most, but audio fingerprint stream is endless in theory in real-time streaming system, and the bit number of temporal information in expression audio fingerprint stream is limited, so need the problem that the consideration time overflows in real-time audio fingerprint base, check overtime and obtain fingerprint in need to consider because time figure place is overflowed the timestamp the causing situation of beating, if do not consider this situation, in the time of can causing so each time to overflow, system is unavailable; For real-time audio fingerprint search library, need to consider the expire mechanism of finger print data, to support some to apply in real time, and guarantee that the capacity in storehouse does not constantly increase along with the growth of time, performance does not decline along with the growth of time.
As preferably, whether the maternal metamessage data that described real-time audio fingerprint search library can judge each acquisition in audio-frequency fingerprint in retrieval are within effective time scope, and described real-time audio fingerprint search library returns to the result for retrieval within effective time.The mode of upgrading when each finger print data arrives, can not guarantee that data in queue are all 100% within effective time scope, because if a certain road fingerprint stream goes wrong, in absence about upgrading in time, data in queue just can not be overtime, in order to address this problem, in retrieval audio-frequency fingerprint, whether the metamessage data that fingerprint base can judge each acquisition within effective time scope, only return to the result for retrieval within effective time.
Therefore, the present invention has following beneficial effect: the real-time that 1) guarantees audio-frequency fingerprint data; 2) all audio-frequency fingerprint data are equipped with the term of validity; 3) possesses the automatic delete function of overtime audio-frequency fingerprint data.
Embodiment
Below the present invention will be further described.
A kind of real-time audio fingerprint search library Data Update and search method, real-time audio fingerprint search library can receive multichannel voice frequency fingerprint stream in real time, in each fingerprint stream of multichannel voice frequency fingerprint stream, all comprise paired audio-frequency fingerprint, maternal metamessage, real-time audio fingerprint search library adopts circle queue storage data, circle queue comprises head pointer, tail pointer, max cap., current quantity information, data are added from the head of queue, successfully add data at every turn, head pointer is toward reach, data between tail pointer and head pointer are made as current valid data, during retrieval, from tail of the queue, start to obtain data, the every road of multichannel voice frequency fingerprint stream fingerprint stream data are equipped with expired time while depositing in, by tail pointer, move stale data was lost efficacy, whether the maternal metamessage data that real-time audio fingerprint search library can judge each acquisition in audio-frequency fingerprint in retrieval are within effective time scope, real-time audio fingerprint search library only returns to the result for retrieval within effective time.
Claims (3)
1. a real-time audio fingerprint search library Data Update and search method, it is characterized in that, described real-time audio fingerprint search library can receive multichannel voice frequency fingerprint stream in real time, in described each fingerprint stream of multichannel voice frequency fingerprint stream, all comprise paired audio-frequency fingerprint, maternal metamessage, described real-time audio fingerprint search library adopts circle queue storage data, described circle queue comprises head pointer, tail pointer, max cap., current quantity information, described data are added from the head of queue, successfully add data at every turn, described head pointer is toward reach, described tail pointer and the data between head pointer are made as current valid data, during retrieval, from tail of the queue, start to obtain data.
2. a kind of real-time audio fingerprint search library Data Update according to claim 1 and search method, it is characterized in that, the described every road of multichannel voice frequency fingerprint stream fingerprint stream data are equipped with expired time while depositing in, by described tail pointer, moved stale data was lost efficacy.
3. a kind of real-time audio fingerprint search library Data Update according to claim 1 and search method, it is characterized in that, whether the maternal metamessage data that described real-time audio fingerprint search library can judge each acquisition in audio-frequency fingerprint in retrieval are within effective time scope, and described real-time audio fingerprint search library returns to the result for retrieval within effective time.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410244730.6A CN103995890A (en) | 2014-05-30 | 2014-05-30 | Method for updating and searching for data of real-time audio fingerprint search library |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410244730.6A CN103995890A (en) | 2014-05-30 | 2014-05-30 | Method for updating and searching for data of real-time audio fingerprint search library |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103995890A true CN103995890A (en) | 2014-08-20 |
Family
ID=51310055
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410244730.6A Pending CN103995890A (en) | 2014-05-30 | 2014-05-30 | Method for updating and searching for data of real-time audio fingerprint search library |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103995890A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105224658A (en) * | 2015-09-30 | 2016-01-06 | 北京京东尚科信息技术有限公司 | A kind of Query method in real time of large data and system |
KR101740271B1 (en) | 2015-02-13 | 2017-05-26 | 레이 왕 | Method and device for constructing on-line real-time updating of massive audio fingerprint database |
CN107731220A (en) * | 2017-10-18 | 2018-02-23 | 北京达佳互联信息技术有限公司 | Audio identification methods, device and server |
CN109657093A (en) * | 2018-11-27 | 2019-04-19 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio search method, device and storage medium |
CN111309728A (en) * | 2020-02-07 | 2020-06-19 | 联想(北京)有限公司 | Flow statistical method, device, system and storage medium |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101055591A (en) * | 2007-05-25 | 2007-10-17 | 中兴通讯股份有限公司 | Data access method for all-memory database |
CN101105786A (en) * | 2006-07-14 | 2008-01-16 | 中兴通讯股份有限公司 | Double CPU communication method based on shared memory |
WO2008127052A1 (en) * | 2007-04-17 | 2008-10-23 | Electronics And Telecommunications Research Institute | System and method for searching audio fingerprint by index information |
CN101673262A (en) * | 2008-09-12 | 2010-03-17 | 未序网络科技(上海)有限公司 | Method for searching audio content |
CN102289518A (en) * | 2011-09-13 | 2011-12-21 | 盛乐信息技术(上海)有限公司 | Method and system for updating audio fingerprint search library |
CN102567428A (en) * | 2010-12-30 | 2012-07-11 | 中国移动通信集团浙江有限公司 | Method and device for managing life cycle of online data |
CN103152423A (en) * | 2013-03-15 | 2013-06-12 | 安科智慧城市技术(中国)有限公司 | Cloud storage system and data access method thereof |
CN103440330A (en) * | 2013-09-03 | 2013-12-11 | 网易(杭州)网络有限公司 | Music program information acquisition method and equipment |
-
2014
- 2014-05-30 CN CN201410244730.6A patent/CN103995890A/en active Pending
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101105786A (en) * | 2006-07-14 | 2008-01-16 | 中兴通讯股份有限公司 | Double CPU communication method based on shared memory |
WO2008127052A1 (en) * | 2007-04-17 | 2008-10-23 | Electronics And Telecommunications Research Institute | System and method for searching audio fingerprint by index information |
CN101055591A (en) * | 2007-05-25 | 2007-10-17 | 中兴通讯股份有限公司 | Data access method for all-memory database |
CN101673262A (en) * | 2008-09-12 | 2010-03-17 | 未序网络科技(上海)有限公司 | Method for searching audio content |
CN102567428A (en) * | 2010-12-30 | 2012-07-11 | 中国移动通信集团浙江有限公司 | Method and device for managing life cycle of online data |
CN102289518A (en) * | 2011-09-13 | 2011-12-21 | 盛乐信息技术(上海)有限公司 | Method and system for updating audio fingerprint search library |
CN103152423A (en) * | 2013-03-15 | 2013-06-12 | 安科智慧城市技术(中国)有限公司 | Cloud storage system and data access method thereof |
CN103440330A (en) * | 2013-09-03 | 2013-12-11 | 网易(杭州)网络有限公司 | Music program information acquisition method and equipment |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR101740271B1 (en) | 2015-02-13 | 2017-05-26 | 레이 왕 | Method and device for constructing on-line real-time updating of massive audio fingerprint database |
CN105224658A (en) * | 2015-09-30 | 2016-01-06 | 北京京东尚科信息技术有限公司 | A kind of Query method in real time of large data and system |
CN105224658B (en) * | 2015-09-30 | 2018-11-30 | 北京京东尚科信息技术有限公司 | A kind of Query method in real time and system of big data |
CN107731220A (en) * | 2017-10-18 | 2018-02-23 | 北京达佳互联信息技术有限公司 | Audio identification methods, device and server |
WO2019076313A1 (en) * | 2017-10-18 | 2019-04-25 | 北京达佳互联信息技术有限公司 | Audio recognition method, device and server |
US11482242B2 (en) | 2017-10-18 | 2022-10-25 | Beijing Dajia Internet Information Technology Co., Ltd. | Audio recognition method, device and server |
CN109657093A (en) * | 2018-11-27 | 2019-04-19 | 腾讯音乐娱乐科技(深圳)有限公司 | Audio search method, device and storage medium |
CN111309728A (en) * | 2020-02-07 | 2020-06-19 | 联想(北京)有限公司 | Flow statistical method, device, system and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103995890A (en) | Method for updating and searching for data of real-time audio fingerprint search library | |
CN102521405B (en) | Massive structured data storage and query methods and systems supporting high-speed loading | |
CN103559018B (en) | Character string matching method and the system calculated based on GPU | |
EP2898430B1 (en) | Mail indexing and searching using hierarchical caches | |
CN103853727B (en) | Improve the method and system of big data quantity query performance | |
CN102521406B (en) | Distributed query method and system for complex task of querying massive structured data | |
US9619512B2 (en) | Memory searching system and method, real-time searching system and method, and computer storage medium | |
CN104123238A (en) | Data storage method and device | |
CN101673192B (en) | Method for time-sequence data processing, device and system therefor | |
CN103116634A (en) | System for supporting high concurrent cache task queue and asynchronous batch operation method thereof | |
CN103020204A (en) | Method and system for carrying out multi-dimensional regional inquiry on distribution type sequence table | |
US20140172823A1 (en) | Using temporary performance objects for enhanced query performance | |
CN109857898A (en) | A kind of method and system of mass digital audio-frequency fingerprint storage and retrieval | |
CN104750740A (en) | Data renewing method and device | |
CN103744913A (en) | Database retrieval method based on search engine technology | |
CN112286941A (en) | Big data synchronization method and device based on Binlog + HBase + Hive | |
CN108874930A (en) | File attribute information statistical method, device, system, equipment and storage medium | |
CN102622353B (en) | Fixed audio retrieval method | |
US20090248620A1 (en) | Interacting methods of data extraction | |
US10789234B2 (en) | Method and apparatus for storing data | |
CN102073690B (en) | Method for constructing memory database supporting historical Key information | |
CN102724301B (en) | Cloud database system and method and equipment for reading and writing cloud data | |
CN109688422A (en) | A kind of method and device of video processing | |
WO2023246259A1 (en) | Video identification method and apparatus, computer device, and storage medium | |
CN103246567A (en) | Queuing method for target tracking internal memory management |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20140820 |