CN107066581A - Distributed traffic monitor video data storage and quick retrieval system - Google Patents

Distributed traffic monitor video data storage and quick retrieval system Download PDF

Info

Publication number
CN107066581A
CN107066581A CN201710242755.6A CN201710242755A CN107066581A CN 107066581 A CN107066581 A CN 107066581A CN 201710242755 A CN201710242755 A CN 201710242755A CN 107066581 A CN107066581 A CN 107066581A
Authority
CN
China
Prior art keywords
video data
semantic
distributed
data
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710242755.6A
Other languages
Chinese (zh)
Other versions
CN107066581B (en
Inventor
傅慧源
马华东
车进
方瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications filed Critical Beijing University of Posts and Telecommunications
Priority to CN201710242755.6A priority Critical patent/CN107066581B/en
Publication of CN107066581A publication Critical patent/CN107066581A/en
Application granted granted Critical
Publication of CN107066581B publication Critical patent/CN107066581B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • G06F16/24578Query processing with adaptation to user needs using ranking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a kind of distributed traffic monitor video data storage and quick retrieval system, including:Video data memory module, is connected with the distributed columnar databases of HBase, for video data to be stored into the distributed columnar databases of HBase;Distributed video data semantic retrieval module, for setting up the structured index model based on internal memory to video data semanteme;Data communication module, for the data communication between multiple video data producers, multiple video data memory modules and the distributed video data semantic retrieval module.The present invention quickly can be carried out data retrieval according to video semanteme information and retrieved video data can be obtained with rapid batch, and recall precision is obviously improved while video data storage is realized.

Description

Distributed traffic monitor video data storage and quick retrieval system
Technical field
The present invention relates to video data storage and retrieval technical field, a kind of distributed traffic monitor video number is particularly related to According to storage and quick retrieval system.
Background technology
In recent years, with the raising of people's quality of the life, automobile has no longer been unreachable object, is increasingly becoming every family The essential consumer goods of each household.Fast development and country along with auto manufacturing for promoting automobile industry to send out energetically The policy of exhibition, China's car ownership rises year by year.Video data is generally stored in point by traditional monitor video storage system In cloth database, extensive video data can be provided by distributed data base redundant storage and the characteristic of deblocking high The stable and high storage and retrieval service handled up.
But the distributed data base of storage large-scale data can only support the data, services of OLAP modes, they are for multiple Miscellaneous inquiry request can not be answered in time, and the Millisecond response that major key can only be supported to inquire about.Traditional monitor video data are deposited Storage system generally requires to carry out a large amount of Optimization Works to improve the recall precision of system, nevertheless, recall precision is still relatively low.
Presently, there are some for video data store researchs, or but its most of be only absorbed in a specific field Optimization of Information Retrieval, the retrieval of such as geographic information;Focus on the optimization to particular memory system, it is impossible to meet video counts Can not be completely fitted for the demand of performance and with monitor video data according to storage and retrieval.And more existing be absorbed in , there is retrieval performance issue in the system of video data storage, it is impossible to the need for being competent to video data quick-searching.Also one A little searching systems based on HBase do not possess the function of retrieval video semanteme, and some provide the system of video semanteme search function HBase index structure is not optimized.In addition, the method requirement extracted characteristics of image using MapReduce tasks and retrieved It is image to input information, it is impossible to directly to text semantic information retrieval.
The content of the invention
In view of this, it is an object of the invention to propose a kind of distributed traffic monitor video data storage and quick-searching System, quickly can carry out data retrieval according to video semanteme information and can obtain the video figure retrieved with rapid batch As data.
A kind of distributed traffic monitor video data storage and quick retrieval system provided based on the above-mentioned purpose present invention, Including:
Video data memory module, is connected with the distributed columnar databases of HBase, for video data to be stored into HBase distribution columnar databases;
Distributed video data semantic retrieval module, for setting up the structured index based on internal memory to video data semanteme Model;
Data communication module, for multiple video data producers, multiple video data memory modules and described point Data communication between cloth video data semantic retrieval module.
In some embodiments, the video data memory module incorporates video semanteme information in RowKey, makes many Data access requests of the secondary HBase based on RowKey is converted to the request of one or more RowKey range scans.
In some embodiments, the distributed video data semantic retrieval module stores same using skip list structure The timestamp information of the semantic all video datas of camera same type;For the video data of image type, to image Time of origin is ranked up;For the video data of video type, the initial time to video is ranked up.
In some embodiments, the Probability Choice Model of the skip list structure based on random function, the random of skip list is put down Equal search length is C (log1/pN-1)=(log1/pN-1)/p, wherein n are skip list data storage number, and p is selection probability;Should Function obtains minimum near p=0.5, chooses Probability p and is set to 0.5.
In some embodiments, filtered using bit array come the queried access to the skip list structure.
In some embodiments, the semantic information with timestamp is mapped, each semanteme of each camera One bit array of correspondence, and the value that the semantic time of occurrence minimum time semantic with this subtracts each other is stored in for the video data The subscript of bit array, when there is video data in some timestamp, is then 1 by its corresponding bit array position, is otherwise 0.
In some embodiments, trigger-type compression is carried out to the bit array using the lower calibration method of compression, when than Special array capacity exceedes the threshold value set, and bit array will be compressed.
In some embodiments, using cuckoo Hash hash function to the distributed video data semantic retrieval mould Hash table in block is optimized, first, and the data of character string type are passed through into mapping functionReflect Penetrate as integer type, then pass through mapping functionMapping is carried out to obtain Take the subscript position hashed twice.
In some embodiments, the data communication module is serialized using binary mode to message;Institute State data communication module and the video data producer and the distributed video number are also transmitted using the subscribing mode based on Redis According to the message between semantic retrieval module;The data communication module is also by video data by the way of history message persistence All message durations communicated between the producer and the distributed video data semantic retrieval module into ordered queue, when When having new node addition or need to rebuild index, the data pulled in history message queue carry out retrieval structure.
From the above it can be seen that the distributed traffic monitor video data storage of the invention provided and quick-searching system System, quickly can carry out data retrieval and can retrieve with quick obtaining video and image type to tie according to video semanteme information Fruit collection data, recall precision is obviously improved while video data storage is realized.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing There is the accompanying drawing used required in technology description to be briefly described, it should be apparent that, drawings in the following description are only this Some embodiments of invention, for those of ordinary skill in the art, on the premise of not paying creative work, can be with Other accompanying drawings are obtained according to these accompanying drawings.
Distributed traffic monitor video data storages and quick retrieval system structural representation of the Fig. 1 for the embodiment of the present invention Figure;
Fig. 2 is video data memory module RowKey conceptual schematic drawing schematic diagrames in the embodiment of the present invention;
Fig. 3 is the skip list structural representation in the embodiment of the present invention;
Fig. 4 changes point diagram for test skip list structure average length of search in the embodiment of the present invention with random chance;
Fig. 5 is the bit array schematic diagram in the embodiment of the present invention;
Fig. 6 uses the hash structure schematic diagram for improving cuckoo hashing algorithm in implementing for the present invention;
Fig. 7 is the data communication module structure chart in present invention implementation;
Fig. 8 is the RowKey retrieval flow schematic diagrames in present invention implementation.
Embodiment
For the object, technical solutions and advantages of the present invention are more clearly understood, below in conjunction with specific embodiment, and reference Accompanying drawing, the present invention is described in more detail.
The embodiments of the invention provide a kind of distributed traffic monitor video data storage and quick retrieval system.With reference to figure 1, it is the distributed traffic monitor video data storage and quick retrieval system structural representation of the embodiment of the present invention.
The distributed traffic monitor video data storage and quick retrieval system, including:With the distributed column numbers of HBase The video data memory module connected according to storehouse, the distributed video data semantic retrieval module based on internal memory and data communication mould Block.Wherein:
Video data memory module is used to video data being stored into the distributed columnar databases of HBase, and ensures to deposit Storage and the high efficiency accessed.
Distributed video data semantic retrieval module, for setting up the structured index based on internal memory to video data semanteme Model, accelerates the efficiency based on semantic retrieval.
Data communication module, for multiple video data producers, multiple video data memory modules and described point Data communication between cloth video data semantic retrieval module.
Specifically, video data memory module includes following operation content:
(11) to break random RowKey and being incremented by inefficiencies of the RowKey design methods on based on semantic retrieval, use Based on semantic RowKey design methods, semantic information is dissolved among RowKey so that obtain RowKey from retrieval module When list, these RowKey are physically data access requests of adjacent, the so multiple HBase based on RowKey The request of one or several RowKey range scans can be converted to, reduction database loads lift retrieval performance simultaneously.The module RowKey designs are as shown in Figure 2.Such RowKey designs cause the identical semantic video data of identical camera It is distributed in sequentially in time in HBase databases, so each retrieval request is physically to connect by retrieve module acquisition Continuous RowKey scopes, so only need to obtain the video data in the range of this successively, it is only necessary to carry out once or several Secondary database access, the process is as shown in Figure 8.
Specifically, distributed video data semantic retrieval module includes following operation content:
(21) to accelerate semantic information time range effectiveness of retrieval, while taking into account when semantic information data volume is huge The expense of system maintenance is waited, the semantic all video datas of same camera same type are stored using orderly skip list structure Timestamp information, for the video data of image type, is ranked up to the time of origin of image, and regarding for video type Frequency evidence, the initial time to video is ranked up.Image and video type video data skip list structure storage schematic diagram such as Fig. 3 It is shown.
(22) because hash table is unknown and in the case that data volume is larger in the data structure of storage, there is storage efficiency low Lower the problem of, to improve storage efficiency and recall precision to a large amount of categorical datas, using cuckoo Hash hash function to dissipating List is optimized.
Wherein, the operation content (21) includes following operation content:
(21a) is relevant due to the average length of search of skip list structure and the selection Probability p setting of skip list, in order to ensure skip list Recall precision it is optimal, it is necessary to choose Probability p optimize setting.Probability choosing of the used skip list structure based on random function Model is selected, the stochastic averagina search length of skip list is C (log1/pN-1)=(log1/pN-1)/p, wherein n are skip list data storage Number, p is selection probability.It is as shown in Figure 4 that skip list structure average length of search changes point diagram with random chance.The function is in p= 0.5 nearby obtains minimum, chooses Probability p and is set to 0.5.
(21b) can cause to inquire about idle running twice, consumption systematicness because skip list structure is when data are not present in query context Can, it is empty retrieval scene search efficiency for query structure collection to accelerate skip list structure, using bit array come to skip list The queried access of structure is filtered.Bit array schematic diagram is as shown in Figure 5.
Further, the operation content (21b) includes following operation content:
(21ba) is a kind of binary structure of arrays due to bit array, is believed in order that being stabbed with bit array storage time Breath, maps the semantic information with timestamp, each semanteme of each camera can correspond to a bit array, And the value that the semantic time of occurrence subtracts each other with the semantic minimum time is stored in the subscript of bit array for the video data, when There is video data in some timestamp, be then 1 by its corresponding bit array position, be otherwise then 0.
(21bb) has the problem of storage space-consuming is larger, balance to solve bit array when data volume is larger High efficiency and space efficiency are retrieved, trigger-type compression is carried out to bit array using the lower calibration method of compression, when bit array is held Amount will be compressed more than the threshold value set to bit array, the time range increase that each in bit array is represented, such as Each can only represent this second either with or without data before, and each represents two seconds either with or without data now.Each is represented Scope increase and be twice, the space-consuming of bit array declines one times.
Wherein, the operation content (22) includes following operation content:
(22a) is unknown and in the case that data volume is larger in the data structure of storage due to hash table, exists The problem of storage efficiency is low, to improve storage efficiency and recall precision to a large amount of categorical datas, is breathed out using cuckoo Uncommon hash function is optimized to hash table.Because hash structure stores character string type data, cuckoo Hash is calculated Method requirement has two hash function calculating elements storage locations, by the way of mapping twice, first by character string class The data of type pass through mapping functionIt is mapped as integer type.Next, passing through mapping functionCarry out mapping and obtain the subscript position hashed twice.Use improvement The hash structure of cuckoo hashing algorithm is as shown in Figure 6.
Specifically, data communication module includes following operation content:
(31) in order to ensure the high efficiency communicated between data producer and retrieval module, offseted using binary mode Breath is serialized so that small volume is adapted to network transmission after message sequence.Use Protocol Buffer data exchanges Agreement is used as underlying protocol.
(32) because data producer and retrieval module are the relations of multi-to-multi, in order to ensure the video data producer with dividing Space between cloth video data semantic retrieval module is non-coupled and synchronous non-coupled, using the subscribing mode based on Redis To transmit the message between the video data producer and distributed video data semantic retrieval module.The video data producer is with dividing Cloth video data semantic retrieval module is by subscribing to designated key and sending the message to reach the purpose of communication to designated key.
(33) in order to ensure that the time between data producer and retrieval module is non-coupled, using history message persistence Mode is by all message durations communicated between the video data producer and distributed video data semantic retrieval module to having In sequence queue, when having new node to add or need to rebuild index, the data pulled in history message queue are entered Row retrieval is built.Data communication module structure chart is as shown in Figure 7.
Those of ordinary skills in the art should understand that:The discussion of any of the above embodiment is exemplary only, not It is intended to imply that the scope of the present disclosure (including claim) is limited to these examples;Under the thinking of the present invention, above example Or can also not be combined between the technical characteristic in be the same as Example, step can be realized with random order, and be existed such as Many other changes of upper described different aspect of the invention, for simplicity, they are provided not in details.
Elaborating detail (such as color and car plate type semantic data) with describe the present invention exemplary embodiment In the case of, it will be apparent to those skilled in the art that can in the case of these details or These details implement the present invention in the case of changing.Therefore, these descriptions are considered as illustrative rather than limit Property processed.
Although having been incorporated with specific embodiment of the invention, invention has been described, according to retouching above State, many replacements of these embodiments, modifications and variations will be apparent for those of ordinary skills.Example Such as, store other kinds of semantic data and retrieved by these semantic datas (for example, the type of vehicle, vehicle Size) discussed embodiment can be used.
Embodiments of the invention be intended to fall within the broad range of appended claims it is all it is such replace, Modifications and variations.Therefore, within the spirit and principles of the invention, any omission, modification, equivalent substitution, the improvement made Deng should be included in the scope of the protection.

Claims (10)

1. a kind of distributed traffic monitor video data storage and quick retrieval system, it is characterised in that including:
Video data memory module, is connected with the distributed columnar databases of HBase, for video data to be stored into HBase points Cloth columnar database;
Distributed video data semantic retrieval module, for setting up the structured index mould based on internal memory to video data semanteme Type;
Data communication module, for multiple video data producers, multiple video data memory modules and distributed video Data communication between data semantic retrieval module.
2. system according to claim 1, it is characterised in that the video data memory module incorporates semantic information In RowKey, data access requests of the multiple HBase based on RowKey is set to be converted to a RowKey range scans request.
3. system according to claim 1, it is characterised in that the distributed video data semantic retrieval module is using jump Table structure stores the timestamp information of the semantic all video datas of same camera same type;For regarding for image type Frequency evidence, is ranked up to the time of origin of image;For the video data of video type, the initial time to video is arranged Sequence.
4. system according to claim 3, it is characterised in that probability selection mould of the skip list structure based on random function Type, the stochastic averagina search length of skip list isWherein n is skip list data storage Number, p is selection probability;The function obtains minimum near p=0.5, chooses Probability p and is set to 0.5.
5. system according to claim 3, it is characterised in that visited using bit array the inquiry of the skip list structure Ask and filtered.
6. system according to claim 5, it is characterised in that map the semantic information with timestamp, each One bit array of each semantic correspondence of camera, and the value that the semantic time of occurrence minimum time semantic with this subtracts each other is The video data is stored in the subscript of bit array, when there is video data in some timestamp, then by its corresponding bit array Position is 1, is otherwise 0.
7. system according to claim 5, it is characterised in that carried out using the lower calibration method of compression to the bit array Trigger-type is compressed, and when bit array capacity exceedes the threshold value set, bit array will be compressed.
8. system according to claim 1, it is characterised in that using cuckoo hash function to the distributed video number Optimized according to the hash table in semantic retrieval module.
9. system according to claim 8, it is characterised in that the data of character string type are passed through into mapping functionInteger type need to be mapped as;Pass through mapping function Carry out mapping and obtain the subscript position hashed twice.
10. system according to claim 1, it is characterised in that the data communication module uses binary mode pair Message is serialized;The data communication module also transmits the video data producer using the subscribing mode based on Redis With the message between the distributed video data semantic retrieval module;The data communication module is also lasting using history message The mode of change holds all message communicated between the video data producer and the distributed video data semantic retrieval module Longization when having new node to add or need to rebuild index, is pulled in history message queue into ordered queue Data carry out retrieval structure.
CN201710242755.6A 2017-04-14 2017-04-14 The storage of distributed traffic monitor video data and quick retrieval system Active CN107066581B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710242755.6A CN107066581B (en) 2017-04-14 2017-04-14 The storage of distributed traffic monitor video data and quick retrieval system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710242755.6A CN107066581B (en) 2017-04-14 2017-04-14 The storage of distributed traffic monitor video data and quick retrieval system

Publications (2)

Publication Number Publication Date
CN107066581A true CN107066581A (en) 2017-08-18
CN107066581B CN107066581B (en) 2019-10-08

Family

ID=59600596

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710242755.6A Active CN107066581B (en) 2017-04-14 2017-04-14 The storage of distributed traffic monitor video data and quick retrieval system

Country Status (1)

Country Link
CN (1) CN107066581B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679185A (en) * 2017-09-30 2018-02-09 深圳市晟达机械设计有限公司 A kind of Intelligent traffic video searching system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103984745A (en) * 2014-05-23 2014-08-13 何震宇 Distributed video vertical searching method and system
CN104850576A (en) * 2015-03-02 2015-08-19 武汉烽火众智数字技术有限责任公司 Fast characteristic extraction system based on mass videos

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103984745A (en) * 2014-05-23 2014-08-13 何震宇 Distributed video vertical searching method and system
CN104850576A (en) * 2015-03-02 2015-08-19 武汉烽火众智数字技术有限责任公司 Fast characteristic extraction system based on mass videos

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵晓萌: "云环境下监控视频结构化分析方法研究与实现", 《中国优秀硕士学位论文全文数据库信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679185A (en) * 2017-09-30 2018-02-09 深圳市晟达机械设计有限公司 A kind of Intelligent traffic video searching system

Also Published As

Publication number Publication date
CN107066581B (en) 2019-10-08

Similar Documents

Publication Publication Date Title
CN106528773B (en) Map computing system and method based on Spark platform supporting spatial data management
CN103714134B (en) Network flow data index method and system
CN102163195B (en) Query optimization method based on unified view of distributed heterogeneous database
CN103067461B (en) A kind of metadata management system of file and metadata management method
CN109299113B (en) Range query method with storage-aware mixed index
CN107169083A (en) Public security bayonet socket magnanimity vehicle data storage and retrieval method and device, electronic equipment
CN104424258A (en) Multidimensional data query method and system, query server and column storage server
CN106503243A (en) Electric power big data querying method and system based on HBase secondary indexs
CN107506464A (en) A kind of method that HBase secondary indexs are realized based on ES
CN107766445B (en) Efficient and rapid data retrieval method supporting multi-dimensional retrieval
CN103617276A (en) Method for storing distributed hierarchical RDF data
CN101594319A (en) List item lookup method and device
CN104111924A (en) Database system
CN103326925B (en) A kind of information push method and device
Von der Weth et al. Multiterm keyword search in NoSQL systems
CN113811928B (en) Distributed memory space data storage for K nearest neighbor search
CN105635278A (en) Method for managing metadata of storage system and metadata server
CN102737061A (en) Distributed ticket query management system and method
CN103425694B (en) The searching method of relational data and device
CN107066581B (en) The storage of distributed traffic monitor video data and quick retrieval system
CN110716933B (en) Novel urban rail train big data-oriented high-flexibility distributed index method
CN110365728A (en) A kind of grouping cooperative caching system of Information central site network
CN114338725B (en) Distributed storage scheduling method for improving upper limit of large-scale cluster rendering
CN103020300B (en) Method and device for information retrieval
Qin et al. Massive AIS data management based on HBase and Spark

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant