CN110245267A - Multi-user's video flowing deep learning is shared to calculate multiplexing method - Google Patents

Multi-user's video flowing deep learning is shared to calculate multiplexing method Download PDF

Info

Publication number
CN110245267A
CN110245267A CN201910413748.7A CN201910413748A CN110245267A CN 110245267 A CN110245267 A CN 110245267A CN 201910413748 A CN201910413748 A CN 201910413748A CN 110245267 A CN110245267 A CN 110245267A
Authority
CN
China
Prior art keywords
deep learning
request
precision
multiplexing
frame
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910413748.7A
Other languages
Chinese (zh)
Other versions
CN110245267B (en
Inventor
汤善江
刘言杰
于策
孙超
肖健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Tianjin University
Original Assignee
Tianjin University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Tianjin University filed Critical Tianjin University
Priority to CN201910413748.7A priority Critical patent/CN110245267B/en
Publication of CN110245267A publication Critical patent/CN110245267A/en
Application granted granted Critical
Publication of CN110245267B publication Critical patent/CN110245267B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/73Querying
    • G06F16/735Filtering based on additional data, e.g. user or group profiles
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/74Browsing; Visualisation therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/70Information retrieval; Database structures therefor; File system structures therefor of video data
    • G06F16/78Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/783Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G06F16/7837Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content using objects detected or recognised in the video content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The present invention relates to computers, video processing, to realize under multi-user scene, the service of query video is provided by technologies such as target identification and target detections, the present invention, multi-user's video flowing deep learning is shared to calculate multiplexing method, firstly, when the request with detection operation or identification operation arrives, according to the relevance of Spatial Dimension, request is merged;Then, according to the relevance of time dimension, whether inquiry has suitable data for multiplexing, and the deep learning model for having configured parameter is called to be analyzed;For not reusable part, first according to the equilibrium relation of velocity accuracy, find most suitable parameter configuration in analytic process, then difference detector and deep learning model is called to carry out video analysis accordingly, analysis result is finally exported and is stored in data warehouse, hoisting module is used to be promoted in database original result precision then in order to be multiplexed to high-precision inquiry request.Present invention is mainly applied to videos to handle occasion.

Description

Multi-user's video flowing deep learning is shared to calculate multiplexing method
Technical field
The present invention relates to computers, video processing, and in particular to multi-user's video flowing deep learning is shared to calculate multiplexing side Method.
Background technique
Currently, deep learning, which has become, pushes artificial intelligence application landing and universal important engine.Especially counting Calculation machine visual field, the fast development of deep learning is that the field brings deep change, wherein the most representative It is the continuous development of the technologies such as image analysis technology such as target detection and target identification.Target detection can be used to divide target Class, such as identify whether there is dog in image, someone has desk, but can not identify the name of the people;Target identification then may be used Identify the specific identity of the people.For an image, application target detection model can quickly identify all in image Target, this brings huge variation for video analysis process.
During traditional video analysis, query service mainly is provided for user by way of labelling manually.It borrows The development for helping deep learning can automatically analyze video content by technologies such as target identification, target detections, provide more high-quality With query service abundant.For example, when user's editing video certain in video can be automatically found by Target Recognition Algorithms All segments that people occurs, this is bringing great convenience property of user.Therefore, it is analyzed using the video flowing based on deep learning It substitutes traditional manual label type query video mode and has been increasingly becoming a kind of trend.But, because deep learning model provides The features such as source demand is high, and the training time is long, it will usually allow one computing platform of multiple user sharings by the way of resource-sharing, To improve the utilization rate of deep learning system resources in computation and reduce entreprise cost.
The shortcomings that prior art
The existing Video Analysis Technology based on deep learning has the shortcomings that there are two aspects mostly: first is that query result list One, second is that not solving the locality that multi-user inquires under shared platform.Query result is single in noscope, chameleon etc. That embodies in system is particularly evident.In general, target detection model can identify thousands of kinds of different target categories, but know Other speed is slower.Therefore, to solve the problems, such as that deep learning model treatment video data is excessively slow, noscope proposition passes through training The network number of plies less proprietary model identifies vehicle, substantially increases recognition speed.But this mode abandons The versatility of deep learning model is just unable to satisfy its demand when user needs to inquire multiple targets.
On the other hand, these models of noscope, chameleon, videostorm do not solve multi-user under shared platform The limitation of inquiry.Limitation is embodied in the following aspects:
The locality of time dimension.The locality of time dimension shows that different periods inquiry has weight in inquiry data Renaturation, when it is new inquire come when, before there are some users to carry out inquiry concern to identical content.This is in some heat It is showed on door video particularly evident.The locality of time dimension is also embodied in the similitude on continuous content frame.Due to video counts According to exclusive feature itself, the continuity of video is kept between successive frame with high similitude.In addition, in some prisons The camera head monitor in video such as park is controlled, changing very little in part-time (such as night) content in the video.When passing through depth When spending the target in learning model identification video frame, the recognition result between these successive frames also can be essentially identical, therefore these Successive frame, which is brought, largely to be computed repeatedly.
The locality of Spatial Dimension.Be mainly manifested in multiple reasonings request that same one piece of data can be arrived simultaneously simultaneously into Row query processing.Often there is identical and similitudes for these inquiries.These inquiries are calculated, it can often be cut out Union operation is cut, to avoid unnecessary redundant computation.
Locality between data result logic.Be embodied in that data result between different models has a multiplexing in logic can It can property.Such as often there is multiplexing relationships in logic each other for object detection and object identification.For example, for video A certain frame, when object detection shows nobody, object recognition algorithm can not then can recognize that Zhang San, so as to avoid object The calculating of body identification people.Conversely, the result of object identification also can be directly multiplexed into object detection in any case.As led to Object recognition algorithm is crossed to show then to be shown to be someone there are Zhang San, directly avoid the object detection for people.
Summary of the invention
In order to overcome the deficiencies of the prior art, the present invention is directed to propose a kind of shared calculating of multi-user's video flowing deep learning is multiple Use method.It realizes under multi-user scene, provides the service of query video by technologies such as target identification and target detections.And needle Locality existing for multi-user's query video under shared calculating environment is optimized, so that depth can be directed between multi-user It practises the reasoning results to be shared, to improve the speed of service, solution deep learning model is spent low in processing video data speed per hour The problem of.Meanwhile solving the equilibrium problem because of shared bring speed and precision.For this reason, the technical scheme adopted by the present invention is that Multi-user's video flowing deep learning is shared to calculate multiplexing method, firstly, when the request with detection operation or identification operation arrives When, according to the relevance of Spatial Dimension, request is merged, crops the part that there is overlapping in request;Then, according to when Between dimension relevance, request retrieved into object detection database or object identification database, inquiry whether have conjunction Suitable data are for multiplexing, when reusable data are not present, according to the relevance between logic, using object identification database Or data operate detection in object detection database or identification operation is filtered reduction invalid computation, recall the good ginseng of configuration Several deep learning models are analyzed;For not reusable part, first according to the equilibrium relation of velocity accuracy, find point Then most suitable parameter configuration during analysis calls difference detector and deep learning model to carry out video analysis, most accordingly Analysis result exports and is stored in data warehouse at last, hoisting module be then used to be promoted in database original result precision in order to It is multiplexed to high-precision inquiry request.
It needs to carry out reasonable disposition to relevant parameter such as resolution ratio, the selection of deep learning model, frame-skipping rate.
The multiplexing specific steps of time dimension:
1) similarity between successive frame is detected by the way of Difference test, difference detector obtains the histogram of successive frame Scheme and calculate Histogram distance, further calculate out similarity, then judges whether data can be multiplexed according to similarity;
2) when new request arrives, two parts be split as to request first, reusable part with can not multiplexing part, The data of reusable part are present in database, directly can obtain knot by the faster database query operations of runing time Fruit;For not reusable part, processing acquisition request is carried out as a result, the knot that then will be requested yet by deep learning model Fruit feeds back to database, in order to which subsequent query is multiplexed.
The multiplexing of Spatial Dimension: request is merged in such a way that request cuts and merges, crops overlapping therein Multiple repetitive requests are reduced in part.
The multiplexing of logical dimension: being associated with the data foundation of object detection and object identification, finds between different models and exists The data of relevance are to be filtered mutually.
The multiplexing specific steps of logical dimension:
Contain m frame image in one video, shared l type objects are detectable and k personage can recognize, use Dl*m and Rk*m Two matrixes store data, and Dl*m is to represent object matrix, and Rk*m is representative figure's matrix, when new inquiring comes When, it is inquired in Dl*m and Rk*m first whether with the presence of corresponding data, if thening be used directly, Dij is 1 the i-th frame picture of expression In there are j object, indicate that the object is not present for 0;
Theorem one: if D1j is 0, as 0 < j≤m, R*j is also 0;
Theorem two: if Rij is 1, as 0 < j≤m, D1j is also 1;
Theorem one indicates, when jth frame detects nobody, then Model of Target Recognition will can't detect in jth frame Anyone;Theorem two indicates that, when identifying personage i in jth frame, target detection matrix can automatically update someone in jth frame, by This dynamically updates database data according to theorem one and theorem two when carrying out logical multiplexing, is target detection and target identification Establish association.
Balance the precision and speed specific steps of inquiry:
Step 1: being fitted to the precision of the different models under different parameters with length velocity relation, each model speed is obtained The corresponding relationship of degree and precision can select optimal parameter to carry out video analysis, when inquiring and to reach user demand;
Step 2:, by the diversity factor between the precision and two frames of former frame, adding phase according to Markov Chain rule The adjustment parameter answered constantly corrects the numerical value of adjustment parameter by confirmatory experiment to predict the precision of a later frame, so as to Accurate precision evaluation is carried out, specifically used δ diff represents the diversity factor between two frames, and the essence of i-1 frame is represented using A (fi-1) Degree, uses k as adjustment parameter, then according to Markov Chain rule, the precision of the i-th frame are as follows:
A (fi-1)=k* δ diff*A (fi-1)
Finally, the result multiplexing between reasoning request is conditional multiplexing;
Step 3: retain deep learning model as a result, to the partial results of difference detector using high-precision model into Row detects again, and sufficiently calculated result of the multiplexing difference detector about similarity between two frames, reappraise successive frame it Between precision, to improve detection accuracy on the whole.
Conditional multiplexing is specifically included using the frequency increased using deep learning model.
The features of the present invention and beneficial effect are:
The present invention has built the deep learning model of multiplicity, it is thus possible to realize under multi-user scene, be known by target The service of the technologies such as other target detection offer query video.And office existing for multi-user's query video under environment is calculated for shared It is sex-limited to optimize, so that can be shared for deep learning the reasoning results between multi-user, to improve the speed of service, solve Certainly deep learning model spends low problem in processing video data speed per hour.Meanwhile it solving because of shared bring speed and precision Equilibrium problem.
Detailed description of the invention:
Fig. 1: the request results multiplexing of same type data.
Fig. 2: the merging treatment of inquiry.
Fig. 3: the multiplexing of logical dimension.
Fig. 4: more reasoning request results are multiplexed overall architecture.
Specific embodiment
The present invention relates to computer visions and high-performance computing sector, and it is total to propose a kind of multi-user's video flowing deep learning Enjoy calculating multiplexing method.The method achieve under multi-user scene, video is provided by technologies such as target identification target detections The service of inquiry.And optimized for locality existing for multi-user's query video under shared calculating environment, so that multi-user Between can be shared for deep learning the reasoning results, to improve the speed of service, solve deep learning model and regarded in processing Frequency spends low problem according to speed per hour.Meanwhile solving the equilibrium problem because of shared bring speed and precision.
For the limitation of the prior art, it is proposed that a new video flowing analysis method, supports multiple users share ring Data sharing under border is to improve analysis speed.It is calculated in environment shared, the reasoning request content that user submits includes several A aspect: request type (object detection, object identification, object tracking etc.), request data and range, demand precision etc..It is full The sufficient diversified reasoning request content of user need to build various deep learning model in shared calculate, arrive in request in environment Suitable model is dynamically selected when coming to meet user demand.In more reasoning request results multiplying questions, research emphasis is just It is to find the connection between more reasoning requests among this diversified request content, the relevance constructed between multi-request is closed System carries out multiplexing to accelerate runing time, while solving the problems such as bring precision, to reach due to multiplexing and meet user demand Effect.
This method mainly includes two big modules, first is that the relevance between building request, second is that solving because of multiplexing bring essence The equilibrium problem of degree and speed.
Relevance between building request
Step 1, the multiplexing of time dimension.For video data, the relevance of time dimension is embodied in terms of two: one It is in video analysis, in terms of content with the similitude of height between successive frame;Second is that being had largely not for hot data Inquiry with the period can carry out retrieval analysis to it.A possibility that the two aspects bring a large amount of multiplexings.Firstly for continuous The similitude of frame, since deep learning model reasoning request time is longer, two frame similar for height can be by former frame The result of (referred to as reference frame) is multiplexed and uses to a later frame, and the detection so as to avoid deep learning model to a later frame mentions At high speed.For this purpose, we detect the similarity between successive frame by the way of Difference test, difference detector obtains successive frame Histogram and calculate Histogram distance, further calculate out similarity.Then judge whether data can carry out according to similarity Multiplexing.
Repeatability of the different periods inquiry in inquiry data equally also brings a large amount of redundant computation for system.Due to It is not associated between inquiry, therefore inquiry each time all independently can make requests analysis to video data, bring The huge wasting of resources.In response to this problem, it is stored for request results, when new finds out duplicate portion when inquiring and Divide and is directly multiplexed.As shown in Figure 1, when new request arrives, be split as two parts to request first, reusable part with not Reusable part.The data of reusable part are present in database, can directly be looked by the faster database of runing time It askes operation and obtains result.For not reusable part, processing acquisition request result is carried out yet by deep learning model.Then The result requested is fed back into database, in order to which subsequent query is multiplexed.It is assumed that contain m frame image in a video, Shared l type objects are detectable and k personage can recognize that we store data using two matrixes of Dl*m and Rk*m, When new inquiring comes, whether with the presence of corresponding data in inquiry Dl*m and Rk*m first, if thening be used directly, avoid It computes repeatedly.Dij is there are j object to indicate that the object is not present for 0 in 1 the i-th frame picture of expression.
Step 2, the multiplexing of Spatial Dimension.It is calculated in environment shared, there is a large amount of concurrent inquiry operations, when this When concurrent inquiry request is interested in same partial data a bit, just results in and largely compute repeatedly.The present invention is used and is asked The mode for asking cutting combined merges request, crops lap therein, reduces multiple repetitive requests.Such as Fig. 2 It is shown.To partial data, there is overlapping with Q2 by two request inquiry Q1 to arrive simultaneously.In such a way that inquiry merges, rationally Two requests are merged into a request Q1 ', avoid weight part from computing repeatedly.It is also one when multiple queries arrive simultaneously Sample merges multiple queries, finds the lap between multi-request to the maximum extent to reduce repetition detection.
Step 3, the multiplexing of logical dimension.Diversified analysis mode is provided in video analytic system, including in video Data carry out object detection, object identification, the various aspects such as object tracking, this just needs the deep learning model of multiplicity to provide not Same detectability.However between these different models, a possibility that there is also multiplexings mutually.Most intuitively a bit, If object detection can't detect people in certain frames, it can directly remove object identification model from the process of these frames identification people.Instead It, if object identification has recognized someone, object detection can also remove to the detection in these frames for people.As shown in figure 3, We be associated withs the data foundation of object detection and object identification, find between different models there are the data of relevance to carry out Filtering mutually.For example, when object identification model recognizes people, it, also can be by data while updating object identification database It feeds back to object detection database to be updated, will test result queue is someone.When new reasoning request inquiry whether someone When can be directly filtered according to this result.We using in D1j record jth frame whether presence of people, use Rij record the Whether there are personage i, R*j to indicate all people's object in jth frame in j frame.According to these properties, we have concluded that following two fixed Reason.
Theorem one: if D1j is 0, as 0 < j≤m, R*j is also 0.
Theorem two: if Rij is 1, as 0 < j≤m, D1j is also 1.
Theorem one indicates, when jth frame detects nobody, then Model of Target Recognition will can't detect in jth frame Anyone.Theorem two indicates that, when identifying personage i in jth frame, target detection matrix can automatically update someone in jth frame.By This, when carrying out logical multiplexing, we can dynamically update database data according to theorem one and theorem two, be target detection and Target identification establishes association.
Balance the precision and speed of inquiry
In current deep learning model, although analysis precision is stepping up, the bring with precision improvement Resource consumption is also increasing, and this requires longer runing times.In general, model with high accuracy, the speed of service is slower, The low model of precision, the speed of service are very fast.On the other hand, the photo resolution in video analysis process, the jump of difference detector Frame number etc. also can all have an impact precision and speed.This is just that parameter configuration in analytic process brings challenge.Because In practical application, different reasonings requests the requirement for precision and speed different.Component requests are as plundered kidnapping Detection needs higher precision to keep correctness, can sacrifice partial velocity thus;And the request such as traffic lights detection Higher timeliness is then needed, precision reaches suitable horizontal.For this purpose, during how reasonably selecting video analysis Model, resolution ratio, the parameters such as frame-skipping number are the matters of utmost importance faced.
Step 1: being fitted to the precision of the different models under different parameters with length velocity relation, each model speed is obtained The corresponding relationship of degree and precision.Optimal parameter can be selected to carry out video analysis, when inquiring and to reach user demand.
On the other hand, difference detector is that the measurement of precision brings uncertainty.Difference detector detect present frame with The similarity of former frame (reference frame) is directly multiplexed the analysis of former frame as a result, to avoid generation in the similar situation of height The high reasoning and calculation of valence.But directly multiplexing results in multiplexed result and actual result there are deviation, reduces and detects essence Degree.So need to carry out effective accuracy evaluation to difference detector, it is accurately full so as to calculate whole effective accuracy The accuracy requirement of sufficient user.Since difference detector is detected to continuous two frame of front and back, it is evident that the essence of a later frame Degree and diversity factor between the precision and two frames of former frame are closely related.This characteristic between successive frame meets Markov Chain rule.
Step 2:, by the diversity factor between the precision and two frames of former frame, adding phase according to Markov Chain rule The adjustment parameter answered, to predict the precision of a later frame.The numerical value that adjustment parameter is constantly corrected by a large amount of confirmatory experiment, from And it is able to carry out accurate precision evaluation.We represent the diversity factor between two frames using δ diff, represent i-1 using A (fi-1) The precision of frame, uses k as adjustment parameter, then according to Markov Chain rule, the precision of the i-th frame are as follows:
A (fi-1)=k* δ diff*A (fi-1)
Finally, the result multiplexing between reasoning request is conditional multiplexing, and it is not complete multiplexing without a moment's thought, essence The influence spent in multiplexing is even more important.For example, when one requires the request of high-precision result to arrive, already present low precision As a result it obviously can not be multiplexed and be requested to high-precision.The problem of can not being multiplexed accordingly, there exist the data of different accuracy.Obviously most intuitive Method be to abandon low precision as a result, detected again using high-precision model, but this can unquestionably bring larger hold Pin.In view of the data of multiplexing are by deep learning model and the common bring of difference detector as a result, such as deep learning mould Type detects first frame, and difference detector is for four frames after handling, and wherein deep learning model bring result precision is compared to difference Different detector bring result precision wants high.For example, for same section of video data, it is assumed that inquiry request Q1 requires 90% essence Degree, according to this requirement, precision velocity balance part selects frame-skipping step number to be detected for 5 frequency.That is, passing through depth every 5 frames Degree learning model is once identified that 5 frames skipped obtain diversity factor by difference detector.After a period of time, new inquiry Request Q2 arrival requires 95% precision, it is clear that the result of Q1 can not be directly used in Q2, at this point, learning that frame-skipping walks by calculating Number can reach accuracy requirement when being 2.Firstly, the data in multiplexing Q1, that is, be multiplexed the frame detected, to using difference detector 5 frames skipped can reach the requirement that frame-skipping step number is 2 using the 3rd frame of deep learning model inspection in this way.It has been finally reached The effect of condition multiplexing.
Step 3: retain deep learning model as a result, the partial results to difference detector are (a certain in such as rear four frame Frame) it is detected again using high-precision model.And sufficiently it is multiplexed calculating knot of the difference detector about similarity between two frames Fruit reappraises the precision between successive frame, to improve detection accuracy on the whole.
In summary several parts, it is as shown in Figure 4 for the data-reusing model general frame of more reasonings request.The system Middle to provide diversified video analysis request, there are two main classes: object identification for the deep learning model that these requests are finally used Model and object detection model.First.When the request with detection operation (or identification operation) arrives, according to Spatial Dimension Relevance, system first merge request, crop the part that there is overlapping in request.Then, the Multiplexing module in figure Part, according to the relevance of time dimension, request is retrieved in object detection database (or object identification database), is looked into It askes and whether has suitable data for multiplexing.In addition, according to the relevance between logic, using object identification database (or object Body Test database) in data reduction invalid computation is filtered to detection operation (or identification operate).When there is no reusables Data when, call and configured the deep learning model of parameter and analyzed.Before calling deep learning model, root is first had to Demand according to user about precision and speed configures relevant parameter by speed-precision balance portion, is meeting user While accuracy requirement, reach speed as high as possible.The part inference in figure, need to relevant parameter such as resolution ratio, Selection, frame-skipping rate of deep learning model etc. carry out reasonable disposition, and difference detector and deep learning model is then called to carry out Analysis result is finally exported and is stored in data warehouse by video analysis.Precision improvement module then constantly updates the knot in database Fruit precision, so that high-precision inquiry request can be multiplexed.

Claims (8)

1. a kind of multi-user's video flowing deep learning is shared to calculate multiplexing method, characterized in that firstly, when with detection operation or When the request of identification operation arrives, according to the relevance of Spatial Dimension, request is merged, crops and there is overlapping in request Part;Then, according to the relevance of time dimension, request is examined into object detection database or object identification database Whether rope, inquiry have suitable data for multiplexing, when reusable data are not present, according to the relevance between logic, It is invalid to operate or identify that operation is filtered reduction to detection using data in object identification database or object detection database It calculates, recalls and configured the deep learning model of parameter and analyzed;For not reusable part, first according to speed essence The equilibrium relation of degree finds most suitable parameter configuration in analytic process, then calls difference detector and deep learning accordingly Model carries out video analysis, finally exports and be stored in data warehouse for analysis result, hoisting module is then used to be promoted in database Original result precision is in order to being multiplexed to high-precision inquiry request.
2. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that need to phase It closes parameter such as resolution ratio, the selection of deep learning model, frame-skipping rate and carries out reasonable disposition.
3. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that time dimension Multiplexing specific steps:
1) similarity between successive frame is detected by the way of Difference test, difference detector obtains the histogram of successive frame simultaneously Histogram distance is calculated, similarity is further calculated out, then judges whether data can be multiplexed according to similarity;
2) when new request arrives, two parts be split as to request first, reusable part with can not multiplexing part, can answer It is present in database with the data of part, directly can obtain result by the faster database query operations of runing time; For not reusable part, processing acquisition request is carried out as a result, the result that then will be requested yet by deep learning model Database is fed back to, in order to which subsequent query is multiplexed.
4. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that Spatial Dimension Multiplexing: using request cut merge by the way of request is merged, crop lap therein, reduce multiple weight Multiple request.
5. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that logical dimension Multiplexing: the data foundation of object detection and object identification be associated with, find between different models there are the data of relevance thus It is filtered mutually.
6. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that logical dimension Multiplexing specific steps:
Contain m frame image in one video, shared l type objects are detectable and k personage can recognize, use Dl*m and Rk*m two Matrix stores data, and Dl*m is to represent object matrix, and Rk*m is representative figure matrix, when new inquiring comes, It is inquired in Dl*m and Rk*m first whether with the presence of corresponding data, if thening be used directly, Dij is to deposit in 1 the i-th frame picture of expression In j object, indicate that the object is not present for 0;
Theorem one: if D1j is 0, as 0 < j≤m, R*j is also 0;
Theorem two: if Rij is 1, as 0 < j≤m, D1j is also 1;
Theorem one indicate, when jth frame detects nobody, then Model of Target Recognition will can't detect in jth frame it is any People;Theorem two indicates that, when identifying personage i in jth frame, target detection matrix can automatically update someone in jth frame, as a result, When carrying out logical multiplexing, database data is dynamically updated according to theorem one and theorem two, is built for target detection and target identification Vertical association.
7. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that balance inquiry Precision and speed specific steps:
Step 1: be fitted to the precision of the different models under different parameters with length velocity relation, obtain each model velocity with The corresponding relationship of precision can select optimal parameter to carry out video analysis, when inquiring and to reach user demand;
Step 2: according to Markov Chain rule, by the diversity factor between the precision and two frames of former frame, along with corresponding Adjustment parameter is constantly corrected the numerical value of adjustment parameter by confirmatory experiment, thus allowed for predict the precision of a later frame Accurate precision evaluation, specifically used δ diff represent the diversity factor between two frames, and the precision of i-1 frame is represented using A (fi-1), Use k as adjustment parameter, then according to Markov Chain rule, the precision of the i-th frame are as follows:
A (fi-1)=k* δ diff*A (fi-1)
Finally, the result multiplexing between reasoning request is conditional multiplexing;
Step 3: retain deep learning model as a result, carrying out weight using high-precision model to the partial results of difference detector New detection, and sufficiently calculated result of the multiplexing difference detector about similarity between two frames, reappraise between successive frame Precision, to improve detection accuracy on the whole.
8. multi-user's video flowing deep learning as described in claim 1 is shared to calculate multiplexing method, characterized in that conditional Multiplexing is specifically included using the frequency increased using deep learning model.
CN201910413748.7A 2019-05-17 2019-05-17 Multi-user video stream deep learning sharing calculation multiplexing method Active CN110245267B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910413748.7A CN110245267B (en) 2019-05-17 2019-05-17 Multi-user video stream deep learning sharing calculation multiplexing method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910413748.7A CN110245267B (en) 2019-05-17 2019-05-17 Multi-user video stream deep learning sharing calculation multiplexing method

Publications (2)

Publication Number Publication Date
CN110245267A true CN110245267A (en) 2019-09-17
CN110245267B CN110245267B (en) 2023-08-11

Family

ID=67884171

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910413748.7A Active CN110245267B (en) 2019-05-17 2019-05-17 Multi-user video stream deep learning sharing calculation multiplexing method

Country Status (1)

Country Link
CN (1) CN110245267B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI776462B (en) * 2020-04-30 2022-09-01 大陸商深圳市商湯科技有限公司 Image processing method, electronic device and computer readable storage medium
US11488117B2 (en) * 2020-08-27 2022-11-01 Mitchell International, Inc. Systems and methods for managing associations between damaged parts and non-reusable parts in a collision repair estimate

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009150425A2 (en) * 2008-06-10 2009-12-17 Half Minute Media Ltd Automatic detection of repeating video sequences
CN102918531A (en) * 2010-05-28 2013-02-06 甲骨文国际公司 Systems and methods for providing multilingual support for data used with a business intelligence server
US9176987B1 (en) * 2014-08-26 2015-11-03 TCL Research America Inc. Automatic face annotation method and system
CN107480178A (en) * 2017-07-01 2017-12-15 广州深域信息科技有限公司 A kind of pedestrian's recognition methods again compared based on image and video cross-module state
CN108235001A (en) * 2018-01-29 2018-06-29 上海海洋大学 A kind of deep-sea video quality objective assessment model based on space-time characteristic
CN108549846A (en) * 2018-03-26 2018-09-18 北京航空航天大学 A kind of movement character combined and head and shoulder structure pedestrian detection and statistical method
CN108846384A (en) * 2018-07-09 2018-11-20 北京邮电大学 Merge the multitask coordinated recognition methods and system of video-aware

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2009150425A2 (en) * 2008-06-10 2009-12-17 Half Minute Media Ltd Automatic detection of repeating video sequences
CN102918531A (en) * 2010-05-28 2013-02-06 甲骨文国际公司 Systems and methods for providing multilingual support for data used with a business intelligence server
US9176987B1 (en) * 2014-08-26 2015-11-03 TCL Research America Inc. Automatic face annotation method and system
CN107480178A (en) * 2017-07-01 2017-12-15 广州深域信息科技有限公司 A kind of pedestrian's recognition methods again compared based on image and video cross-module state
CN108235001A (en) * 2018-01-29 2018-06-29 上海海洋大学 A kind of deep-sea video quality objective assessment model based on space-time characteristic
CN108549846A (en) * 2018-03-26 2018-09-18 北京航空航天大学 A kind of movement character combined and head and shoulder structure pedestrian detection and statistical method
CN108846384A (en) * 2018-07-09 2018-11-20 北京邮电大学 Merge the multitask coordinated recognition methods and system of video-aware

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
ROBERTO CAMACHO BARRANCO: "Enhancing Yelp Data with Deep Learning and Information Reuse", 《2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI)》 *
万南洋: "数据仓库事实表的建模理论与方法", 《计算机工程》 *
文孟飞等: "网络媒体大数据流异构多模态目标识别策略", 《计算机研究与发展》 *
汤善江等: "层级式可视化并行程序建模系统研究", 《计算机工程》 *
高洁等: "基于增量式Bayes的中文网页自动分类技术", 《电脑知识与技术(学术交流)》 *
高翔等: "基于视频场景深度学习的人物语义识别模型", 《计算机技术与发展》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI776462B (en) * 2020-04-30 2022-09-01 大陸商深圳市商湯科技有限公司 Image processing method, electronic device and computer readable storage medium
US11488117B2 (en) * 2020-08-27 2022-11-01 Mitchell International, Inc. Systems and methods for managing associations between damaged parts and non-reusable parts in a collision repair estimate

Also Published As

Publication number Publication date
CN110245267B (en) 2023-08-11

Similar Documents

Publication Publication Date Title
CN106295594B (en) A kind of across camera method for tracking target and device based on dynamic route tree
CN110245267A (en) Multi-user&#39;s video flowing deep learning is shared to calculate multiplexing method
CN107239203A (en) A kind of image management method and device
CN107423754B (en) Automatic radiation source identification system based on parameter multi-attribute autonomous intelligent decision
CN109117856A (en) People and object method for tracing, apparatus and system based on intelligent edge cloud
JP7143260B2 (en) Methods and systems for assisting users in creating and selecting images
CN102136147A (en) Target detecting and tracking method, system and video monitoring device
CN111739056B (en) Track tracking system
CN108287550A (en) The method of SLAM systems and construction data correlation based on data correlation and error detection
CN114153795A (en) Method and device for intelligently calling electronic archive, electronic equipment and storage medium
CN111107319A (en) Target tracking method, device and system based on regional camera
CN110232133A (en) A kind of image of clothing search method and system classified based on Fusion Features and style
CN110992124A (en) House resource recommendation method and system
CN109411093A (en) A kind of intelligent medical treatment big data analysis processing method based on cloud computing
CN114879891A (en) Multi-mode man-machine interaction method under self-supervision multi-target tracking
CN112288530B (en) Resource sharing intelligent dining method, device, system, medium and equipment
Wang et al. You only need two detectors to achieve multi-modal 3d multi-object tracking
CN111061911B (en) Target detection and tracking method, device and equipment for multi-video monitoring data
CN110059658B (en) Remote sensing satellite image multi-temporal change detection method based on three-dimensional convolutional neural network
US10956741B2 (en) System for automatic monitoring of birds and related method of use
US20230076241A1 (en) Object detection systems and methods including an object detection model using a tailored training dataset
CN115062676A (en) Data processing method, device and computer readable storage medium
CN115129659A (en) Main and auxiliary kernel computer processing platform
US11610080B2 (en) Object detection improvement based on autonomously selected training samples
CN109389053A (en) High performance vehicle detection system based on vehicle perpendicular type feature

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant