CN107066581B - Distributed traffic surveillance video data storage and fast retrieval system - Google Patents

Distributed traffic surveillance video data storage and fast retrieval system

Info

Publication number
CN107066581B
Authority
CN
China
Prior art keywords
video data
data
module
distributed
semantic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710242755.6A
Other languages
Chinese (zh)
Other versions
CN107066581A (en)
Inventor
傅慧源
马华东
车进
方瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing University of Posts and Telecommunications
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications
Priority to CN201710242755.6A
Publication of CN107066581A
Application granted
Publication of CN107066581B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2457 Query processing with adaptation to user needs
    • G06F16/24578 Query processing with adaptation to user needs using ranking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70 Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a distributed traffic surveillance video data storage and fast retrieval system, comprising: a video data storage module, connected to an HBase distributed column-oriented database, for storing video data into the HBase distributed column-oriented database; a distributed video data semantic retrieval module, for building an in-memory structured index model over the semantics of the video data; and a data communication module, for data communication among multiple video data producers, multiple video data storage modules, and the distributed video data semantic retrieval module. While storing video data, the invention can retrieve data quickly by video semantic information and fetch the retrieved video data quickly in batches, which significantly improves retrieval efficiency.

Description

Distributed traffic surveillance video data storage and fast retrieval system
Technical field
The present invention relates to the technical field of video data storage and retrieval, and in particular to a distributed traffic surveillance video data storage and fast retrieval system.
Background art
In recent years, as living standards have risen, cars are no longer out of reach and are becoming an everyday consumer good for every household. With the rapid growth of the automobile manufacturing industry and national policies that strongly promote the automotive industry, car ownership in China has risen year by year. Traditional surveillance video storage systems usually store video data in a distributed database, whose redundant storage and data partitioning provide highly stable, high-throughput storage and retrieval services for large-scale video data.
However, distributed databases that store large-scale data can only support OLAP-style data services: they cannot answer complex queries in time, and only primary-key queries receive millisecond-level responses. Traditional surveillance video data storage systems generally need extensive optimization to improve retrieval efficiency, and even then retrieval efficiency remains low.
Some research on video data storage exists, but most of it either focuses only on optimizing information retrieval in one specific field, such as geographic information retrieval, or focuses on optimizing one particular storage system. Such work neither meets the performance demands of video data storage and retrieval nor fits surveillance video data well. Some existing systems dedicated to video data storage have retrieval performance problems and cannot meet the need for fast video data retrieval. Some HBase-based retrieval systems cannot retrieve video semantics, and some systems that do offer video semantic search have not optimized the HBase index structure. In addition, methods that use MapReduce tasks to extract image features and retrieve on them require an image as input and cannot retrieve directly on textual semantic information.
Summary of the invention
In view of this, the object of the invention is to propose a distributed traffic surveillance video data storage and fast retrieval system that can retrieve data quickly by video semantic information and fetch the retrieved video and image data quickly in batches.
To this end, the distributed traffic surveillance video data storage and fast retrieval system provided by the invention comprises:
a video data storage module, connected to an HBase distributed column-oriented database, for storing video data into the HBase distributed column-oriented database;
a distributed video data semantic retrieval module, for building an in-memory structured index model over the semantics of the video data;
a data communication module, for data communication among multiple video data producers, multiple video data storage modules, and the distributed video data semantic retrieval module.
In some embodiments, the video data storage module embeds video semantic information in the RowKey, so that multiple RowKey-based HBase data access requests are converted into one or more RowKey range scan requests.
In some embodiments, the distributed video data semantic retrieval module uses a skip list to store the timestamps of all video data of the same camera and the same semantic type. Image-type video data is sorted by the image's time of occurrence; video-type video data is sorted by the start time of the video.
In some embodiments, the skip list is based on a probabilistic selection model driven by a random function, and its expected search length is $C(\log_{1/p} n - 1) = (\log_{1/p} n - 1)/p$, where $n$ is the number of items stored in the skip list and $p$ is the selection probability. The function reaches its minimum near $p = 0.5$, so the selection probability $p$ is set to 0.5.
In some embodiments, query access to the skip list is filtered with a bit array.
In some embodiments, the timestamped semantic information is mapped as follows: each semantic of each camera corresponds to one bit array, and the subscript at which a piece of video data is recorded in the bit array is the time at which the semantic occurred minus the minimum time of that semantic. If video data exists at a timestamp, the corresponding bit is set to 1; otherwise it is 0.
In some embodiments, the bit array is compressed by subscript compression, triggered by a threshold: when the capacity of the bit array exceeds the set threshold, the bit array is compressed.
In some embodiments, the hash table in the distributed video data semantic retrieval module is optimized with cuckoo hashing. First, string data is mapped to an integer type by a first mapping function; then a second mapping function maps the result to obtain the subscript positions of the two hashes. (The formulas of both mapping functions are missing from this text.)
In some embodiments, the data communication module serializes messages in binary form. The data communication module also uses a Redis-based subscription mode to transmit messages between the video data producers and the distributed video data semantic retrieval module. The data communication module further uses history message persistence: all messages exchanged between the video data producers and the distributed video data semantic retrieval module are persisted to an ordered queue, and when a new node joins or an index must be rebuilt, data is pulled from the history message queue to build the retrieval index.
As can be seen from the above, the distributed traffic surveillance video data storage and fast retrieval system provided by the invention can retrieve data quickly by video semantic information and fetch the result sets of video-type and image-type retrievals quickly, which significantly improves retrieval efficiency while video data is being stored.
Brief description of the drawings
To explain the embodiments of the invention or the technical solutions of the prior art more clearly, the drawings needed to describe the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a structural diagram of the distributed traffic surveillance video data storage and fast retrieval system of an embodiment of the invention;
Fig. 2 is a schematic diagram of the RowKey design of the video data storage module in an embodiment of the invention;
Fig. 3 is a schematic diagram of the skip list structure in an embodiment of the invention;
Fig. 4 is a plot of the measured average search length of the skip list against the random probability in an embodiment of the invention;
Fig. 5 is a schematic diagram of the bit array in an embodiment of the invention;
Fig. 6 is a schematic diagram of the hash structure that uses the improved cuckoo hashing algorithm in an embodiment of the invention;
Fig. 7 is a structure diagram of the data communication module in an embodiment of the invention;
Fig. 8 is a schematic diagram of the RowKey retrieval flow in an embodiment of the invention.
Detailed description of the embodiments
To make the objects, technical solutions, and advantages of the invention clearer, the invention is described in more detail below in conjunction with specific embodiments and with reference to the drawings.
An embodiment of the invention provides a distributed traffic surveillance video data storage and fast retrieval system. Fig. 1 is a structural diagram of the system of this embodiment.
The distributed traffic surveillance video data storage and fast retrieval system comprises a video data storage module connected to an HBase distributed column-oriented database, an in-memory distributed video data semantic retrieval module, and a data communication module, wherein:
The video data storage module stores video data into the HBase distributed column-oriented database and keeps storage and access efficient.
The distributed video data semantic retrieval module builds an in-memory structured index model over the semantics of the video data, which speeds up semantic retrieval.
The data communication module handles data communication among multiple video data producers, multiple video data storage modules, and the distributed video data semantic retrieval module.
Specifically, the video data storage module includes the following operation:
(11) To overcome the inefficiency of random RowKey and incrementing RowKey designs for semantic retrieval, a semantics-based RowKey design is used. Semantic information is embedded in the RowKey, so that the RowKeys in a list obtained from the retrieval module are physically adjacent. Multiple RowKey-based HBase data access requests can then be converted into one or a few RowKey range scan requests, which lowers database load and improves retrieval performance. The RowKey design of this module is shown in Fig. 2. With this design, video data of the same camera and the same semantics is laid out in the HBase database in time order, so each retrieval request obtains from the retrieval module a physically contiguous RowKey range. Fetching the video data in that range in order takes only one or a few database accesses. The process is shown in Fig. 8.
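The idea in operation (11) can be sketched as follows. This is a minimal illustration only: the field order (camera, semantic, time), the `|` separator, the zero-padded widths, and the names `RowKey` and `to_range_scan` are assumptions made for the example, since the actual layout is defined by Fig. 2, which is not reproduced in this text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RowKey:
    """Hypothetical semantic RowKey: camera id, semantic label, timestamp."""
    camera_id: int
    semantic: str      # e.g. "color=red" (assumed encoding of a semantic)
    timestamp: int     # seconds since the epoch

    def encode(self) -> bytes:
        # Fixed-width, zero-padded numbers make byte order match numeric
        # order, so rows of one camera and one semantic sort by time and
        # sit next to each other.
        return f"{self.camera_id:06d}|{self.semantic}|{self.timestamp:010d}".encode()


def to_range_scan(camera_id: int, semantic: str, t_start: int, t_end: int):
    """Turn a time-range query into one (start_row, stop_row) scan range.

    The stop row is exclusive, so it is built from t_end + 1.
    """
    start = RowKey(camera_id, semantic, t_start).encode()
    stop = RowKey(camera_id, semantic, t_end + 1).encode()
    return start, stop
```

With keys built this way, a query such as "camera 7, color=red, between two times" becomes a single start/stop pair that a range scan can consume, instead of one point lookup per RowKey.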
Specifically, the distributed video data semantic retrieval module includes the following operations:
(21) To speed up time-range retrieval over semantic information while keeping system maintenance overhead in check when the volume of semantic information is huge, an ordered skip list stores the timestamps of all video data of the same camera and the same semantic type. Image-type video data is sorted by the image's time of occurrence, and video-type video data is sorted by the start time of the video. Fig. 3 shows how the skip lists store image-type and video-type video data.
(22) When the data structures stored in a hash table are not known in advance and the data volume is large, the hash table stores data inefficiently. To improve storage and retrieval efficiency for large amounts of categorical data, the hash table is optimized with cuckoo hashing.
Operation (21) includes the following operations:
(21a) Because the average search length of a skip list depends on its selection probability p, p must be tuned so that the skip list retrieves as efficiently as possible. The skip list used is based on a probabilistic selection model driven by a random function, and its expected search length is $C(\log_{1/p} n - 1) = (\log_{1/p} n - 1)/p$, where $n$ is the number of items stored in the skip list and $p$ is the selection probability. Fig. 4 plots the average search length of the skip list against the random probability. The function reaches its minimum near $p = 0.5$, so the selection probability $p$ is set to 0.5.
(21b) When no data exists in the queried range, the skip list performs two wasted lookups, which costs system performance. To speed up the skip list in the case where the query result set is empty, a bit array filters query access to the skip list. The bit array is shown in Fig. 5.
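The ordered skip list of operations (21) and (21a) can be sketched as below, with the selection probability set to 0.5 as the text states. The class and method names, the `max_level` cap, and the `seed` parameter are assumptions for illustration; one such list would exist per camera and semantic type, with timestamps as keys and RowKeys as values.

```python
import random


class _Node:
    __slots__ = ("key", "value", "forward")

    def __init__(self, key, value, level):
        self.key = key
        self.value = value
        self.forward = [None] * level   # next node on each level


class SkipList:
    """Timestamp-ordered skip list (illustrative sketch, not the patented code)."""

    def __init__(self, p=0.5, max_level=16, seed=None):
        self.p = p                      # selection probability from (21a)
        self.max_level = max_level      # assumed cap on tower height
        self.level = 1
        self.head = _Node(None, None, max_level)
        self._rng = random.Random(seed)

    def _random_level(self):
        # Promote a node one level higher with probability p each time.
        level = 1
        while level < self.max_level and self._rng.random() < self.p:
            level += 1
        return level

    def insert(self, timestamp, row_key):
        update = [self.head] * self.max_level
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] is not None and node.forward[i].key < timestamp:
                node = node.forward[i]
            update[i] = node            # last node before the insert point
        level = self._random_level()
        self.level = max(self.level, level)
        new = _Node(timestamp, row_key, level)
        for i in range(level):
            new.forward[i] = update[i].forward[i]
            update[i].forward[i] = new

    def range(self, t_start, t_end):
        """Return (timestamp, row_key) pairs with t_start <= timestamp <= t_end."""
        node = self.head
        for i in range(self.level - 1, -1, -1):
            while node.forward[i] is not None and node.forward[i].key < t_start:
                node = node.forward[i]
        node = node.forward[0]
        out = []
        while node is not None and node.key <= t_end:
            out.append((node.key, node.value))
            node = node.forward[0]
        return out
```

A time-range query descends the levels to the first entry at or after `t_start` and then walks the bottom level, so the result comes back already sorted by time.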
Further, operation (21b) includes the following operations:
(21ba) A bit array is a binary array. To store timestamp information in it, the timestamped semantic information is mapped: each semantic of each camera corresponds to one bit array, and the subscript at which a piece of video data is recorded in the bit array is the time at which the semantic occurred minus the minimum time of that semantic. If video data exists at a timestamp, the corresponding bit is set to 1; otherwise it is 0.
(21bb) To address the large storage footprint of the bit array when the data volume is large, and to balance retrieval efficiency against space efficiency, the bit array is compressed by subscript compression, triggered by a threshold. When the capacity of the bit array exceeds the set threshold, the bit array is compressed and the time range that each bit represents grows. For example, where each bit previously indicated whether data existed within one second, each bit now indicates whether data exists within two seconds. Each time the range covered by one bit doubles, the space occupied by the bit array is halved.
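A minimal sketch of operations (21ba) and (21bb) follows. It assumes that the minimum time of the semantic is known when the array is created and that no later timestamp precedes it; the names `TimeBitmap`, `max_bits`, and `may_contain` are hypothetical, and one byte per flag is used instead of packed bits purely to keep the example short.

```python
class TimeBitmap:
    """Per-camera, per-semantic presence filter over timestamps (sketch)."""

    def __init__(self, min_time, max_bits=1024):
        self.min_time = min_time        # minimum time of this semantic
        self.seconds_per_bit = 1        # time range covered by one bit
        self.max_bits = max_bits        # assumed compression threshold
        self.bits = bytearray()         # one byte per flag, for readability

    def _index(self, timestamp):
        # Subscript = occurrence time minus minimum time, scaled by the
        # current resolution.
        return (timestamp - self.min_time) // self.seconds_per_bit

    def add(self, timestamp):
        if timestamp < self.min_time:
            raise ValueError("timestamp precedes min_time")
        idx = self._index(timestamp)
        while idx >= self.max_bits:     # threshold exceeded: compress
            self._compress()
            idx = self._index(timestamp)
        if idx >= len(self.bits):
            self.bits.extend(bytes(idx + 1 - len(self.bits)))
        self.bits[idx] = 1

    def _compress(self):
        # Merge neighbouring pairs: each bit now covers twice the time
        # range and the array is half as long.
        merged = bytearray((len(self.bits) + 1) // 2)
        for i, flag in enumerate(self.bits):
            if flag:
                merged[i // 2] = 1
        self.bits = merged
        self.seconds_per_bit *= 2

    def may_contain(self, t_start, t_end):
        """False means the range is certainly empty; True means 'maybe'."""
        lo = max(self._index(t_start), 0)
        hi = min(self._index(t_end), len(self.bits) - 1)
        if hi < lo:
            return False
        return any(self.bits[lo:hi + 1])
```

The filter is consulted before the skip list: a `False` answer lets the query return an empty result without touching the skip list, while compression trades some precision (more "maybe" answers) for a smaller array.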
Operation (22) includes the following operation:
(22a) When the data structures stored in a hash table are not known in advance and the data volume is large, the hash table stores data inefficiently. To improve storage and retrieval efficiency for large amounts of categorical data, the hash table is optimized with cuckoo hashing. Because the hash structure stores string data and the cuckoo hashing algorithm needs two hash functions to compute an element's storage position, a two-step mapping is used. First, string data is mapped to an integer type by a first mapping function. Next, a second mapping function maps the result to obtain the subscript positions of the two hashes. (The formulas of both mapping functions are missing from this text.) The hash structure that uses the improved cuckoo hashing algorithm is shown in Fig. 6.
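The two-step mapping of operation (22a) can be illustrated with a small cuckoo hash table. Because the patent's mapping formulas are missing from this text, both functions below are stand-ins chosen for the example: `_to_int` is an ordinary polynomial string hash and `_slots` derives two positions from the resulting integer. The names `CuckooHashTable`, `max_kicks`, and the growth policy are likewise assumptions.

```python
class CuckooHashTable:
    """Two-table cuckoo hash for string keys (illustrative sketch)."""

    def __init__(self, capacity=11, max_kicks=32):
        self.capacity = capacity
        self.max_kicks = max_kicks      # displacement limit before growing
        self.tables = [[None] * capacity, [None] * capacity]

    @staticmethod
    def _to_int(key: str) -> int:
        # Step 1 (stand-in): map the string to a 32-bit integer.
        h = 0
        for byte in key.encode("utf-8"):
            h = (h * 31 + byte) & 0xFFFFFFFF
        return h

    def _slots(self, key):
        # Step 2 (stand-in): derive one position per table from the integer.
        n = self._to_int(key)
        return n % self.capacity, (n // self.capacity) % self.capacity

    def get(self, key, default=None):
        # A lookup inspects at most two slots.
        for table, idx in zip(self.tables, self._slots(key)):
            entry = table[idx]
            if entry is not None and entry[0] == key:
                return entry[1]
        return default

    def put(self, key, value):
        for table, idx in zip(self.tables, self._slots(key)):
            entry = table[idx]
            if entry is not None and entry[0] == key:
                table[idx] = (key, value)       # update in place
                return
        entry = (key, value)
        which = 0
        for _ in range(self.max_kicks):
            idx = self._slots(entry[0])[which]
            # Place the entry and pick up whatever occupied the slot.
            entry, self.tables[which][idx] = self.tables[which][idx], entry
            if entry is None:
                return
            which ^= 1                          # evicted entry tries the other table
        self._grow()
        self.put(*entry)

    def _grow(self):
        old = [e for table in self.tables for e in table if e is not None]
        self.capacity = self.capacity * 2 + 1
        self.tables = [[None] * self.capacity, [None] * self.capacity]
        for k, v in old:
            self.put(k, v)
```

The appeal of this layout for an index is that every lookup checks a fixed, small number of slots, whatever the table size; the cost is moved to insertion, where an entry may displace others.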
Specifically, the data communication module includes the following operations:
(31) To keep communication between data producers and the retrieval module efficient, messages are serialized in binary form, so that serialized messages are small and suited to network transmission. The Protocol Buffers data interchange protocol is used as the underlying protocol.
(32) Data producers and retrieval modules are in a many-to-many relationship. To decouple the video data producers and the distributed video data semantic retrieval module in space and in synchronization, a Redis-based subscription mode transmits the messages between them. The video data producers and the distributed video data semantic retrieval module communicate by subscribing to a designated topic and sending messages to that topic.
(33) To decouple data producers and the retrieval module in time, history message persistence is used: all messages exchanged between the video data producers and the distributed video data semantic retrieval module are persisted to an ordered queue. When a new node joins or an index must be rebuilt, data is pulled from the history message queue to build the retrieval index. The structure of the data communication module is shown in Fig. 7.
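The three communication operations can be pictured together with an in-memory stand-in. This sketch does not use Redis or Protocol Buffers: the `Broker` class only imitates topic-based publish/subscribe with an ordered history log, and `encode`/`decode` use Python's `struct` module with an assumed message layout (camera id, timestamp, semantic label) to show what compact binary serialization looks like. All names and the message fields are hypothetical.

```python
import struct
from collections import defaultdict

_HEADER = "!IQH"   # camera id (uint32), timestamp (uint64), label length (uint16)


def encode(camera_id: int, timestamp: int, semantic: str) -> bytes:
    """Serialize one semantic message into a compact binary payload."""
    raw = semantic.encode("utf-8")
    return struct.pack(_HEADER, camera_id, timestamp, len(raw)) + raw


def decode(payload: bytes):
    """Inverse of encode()."""
    camera_id, timestamp, length = struct.unpack_from(_HEADER, payload)
    offset = struct.calcsize(_HEADER)
    return camera_id, timestamp, payload[offset:offset + length].decode("utf-8")


class Broker:
    """In-memory stand-in for topic-based pub/sub with persisted history."""

    def __init__(self):
        self.subscribers = defaultdict(list)   # topic -> callbacks
        self.history = defaultdict(list)       # topic -> ordered message log

    def subscribe(self, topic, callback, replay=False):
        # A node that joins late (or rebuilds its index) first replays the
        # persisted history, then receives live messages.
        if replay:
            for payload in self.history[topic]:
                callback(payload)
        self.subscribers[topic].append(callback)

    def publish(self, topic, payload):
        self.history[topic].append(payload)    # persist before delivery
        for callback in self.subscribers[topic]:
            callback(payload)
```

Producers and retrieval nodes only share a topic name, which is the space and synchronization decoupling of operation (32); the replayable log is what gives a late-joining retrieval node the time decoupling of operation (33).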
Those of ordinary skill in the art should understand that the discussion of any embodiment above is exemplary only and is not intended to imply that the scope of the disclosure (including the claims) is limited to these examples. Within the idea of the invention, technical features of the above embodiment or of different embodiments may also be combined, steps may be carried out in any order, and many other variations of the different aspects of the invention described above exist, which are not provided in detail for the sake of brevity.
Where specific details (such as color and license plate type semantic data) are set out to describe exemplary embodiments of the invention, it will be apparent to those skilled in the art that the invention can be practiced without these specific details or with these details changed. These descriptions should therefore be regarded as illustrative rather than restrictive.
Although the invention has been described in conjunction with specific embodiments, many replacements, modifications, and variations of these embodiments will be apparent to those of ordinary skill in the art in light of the foregoing description. For example, the embodiments discussed can be used to store other kinds of semantic data (for example, vehicle type or vehicle size) and to retrieve by them.
The embodiments of the invention are intended to cover all such replacements, modifications, and variations that fall within the broad scope of the appended claims. Therefore, any omission, modification, equivalent replacement, or improvement made within the spirit and principles of the invention shall be included in the scope of protection of the invention.

Claims (7)

1. A distributed traffic surveillance video data storage and fast retrieval system, characterized by comprising:
a video data storage module, connected to an HBase distributed column-oriented database, for storing video data into the HBase distributed column-oriented database; the video data storage module embeds semantic information in the RowKey, so that multiple RowKey-based HBase data access requests are converted into one RowKey range scan request;
a distributed video data semantic retrieval module, for building an in-memory structured index model over the semantics of the video data; the hash table in the distributed video data semantic retrieval module is optimized with cuckoo hashing; string data is mapped to an integer type by a first mapping function; a second mapping function then maps the result to obtain the subscript positions of the two hashes (the formulas of both mapping functions are missing from this text);
a data communication module, for data communication among multiple video data producers, multiple video data storage modules, and the distributed video data semantic retrieval module.
2. The system according to claim 1, characterized in that the distributed video data semantic retrieval module uses a skip list to store the timestamps of all video data of the same camera and the same semantic type; image-type video data is sorted by the image's time of occurrence; video-type video data is sorted by the start time of the video.
3. The system according to claim 2, characterized in that the skip list is based on a probabilistic selection model driven by a random function, and the expected search length of the skip list is $C(\log_{1/p} n - 1) = (\log_{1/p} n - 1)/p$, where $n$ is the number of items stored in the skip list and $p$ is the selection probability; the function reaches its minimum near $p = 0.5$, and the selection probability $p$ is set to 0.5.
4. The system according to claim 2, characterized in that query access to the skip list is filtered with a bit array.
5. The system according to claim 4, characterized in that the timestamped semantic information is mapped: each semantic of each camera corresponds to one bit array, and the subscript at which a piece of video data is recorded in the bit array is the time at which the semantic occurred minus the minimum time of that semantic; if video data exists at a timestamp, the corresponding bit is set to 1, and otherwise it is 0.
6. The system according to claim 4, characterized in that the bit array is compressed by subscript compression, triggered by a threshold: when the capacity of the bit array exceeds the set threshold, the bit array is compressed.
7. The system according to claim 1, characterized in that the data communication module serializes messages in binary form; the data communication module also uses a Redis-based subscription mode to transmit messages between the video data producers and the distributed video data semantic retrieval module; the data communication module further uses history message persistence to persist all messages exchanged between the video data producers and the distributed video data semantic retrieval module to an ordered queue, and when a new node joins or an index must be rebuilt, data is pulled from the history message queue to build the retrieval index.
CN201710242755.6A 2017-04-14 2017-04-14 Distributed traffic surveillance video data storage and fast retrieval system Active CN107066581B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710242755.6A CN107066581B (en) 2017-04-14 2017-04-14 Distributed traffic surveillance video data storage and fast retrieval system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710242755.6A CN107066581B (en) 2017-04-14 2017-04-14 Distributed traffic surveillance video data storage and fast retrieval system

Publications (2)

Publication Number Publication Date
CN107066581A CN107066581A (en) 2017-08-18
CN107066581B true CN107066581B (en) 2019-10-08

Family

ID=59600596

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710242755.6A Active CN107066581B (en) 2017-04-14 2017-04-14 Distributed traffic surveillance video data storage and fast retrieval system

Country Status (1)

Country Link
CN (1) CN107066581B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107679185A (en) * 2017-09-30 2018-02-09 深圳市晟达机械设计有限公司 A kind of Intelligent traffic video searching system

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103984745A (en) * 2014-05-23 2014-08-13 何震宇 Distributed video vertical searching method and system
CN104850576A (en) * 2015-03-02 2015-08-19 武汉烽火众智数字技术有限责任公司 Fast characteristic extraction system based on mass videos

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
云环境下监控视频结构化分析方法研究与实现;赵晓萌;《中国优秀硕士学位论文全文数据库信息科技辑》;20150830;31-47页 *

Also Published As

Publication number Publication date
CN107066581A (en) 2017-08-18

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant