CN108965885B - Video online reconstruction and moving target detection method based on frame compression measurement


Info

Publication number
CN108965885B
CN108965885B (application CN201810564696.9A)
Authority
CN
China
Prior art keywords
video
foreground
background
frame
matrix
Prior art date
Legal status: Active
Application number
CN201810564696.9A
Other languages
Chinese (zh)
Other versions
CN108965885A (en)
Inventor
曹文飞 (Wenfei Cao)
韩国栋 (Guodong Han)
徐麟 (Lin Xu)
Current Assignee
Shaanxi Normal University
Original Assignee
Shaanxi Normal University
Priority date
Filing date
Publication date
Application filed by Shaanxi Normal University
Priority to CN201810564696.9A
Publication of CN108965885A
Application granted
Publication of CN108965885B
Legal status: Active
Anticipated expiration

Classifications

    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/42 - Methods or arrangements for coding, decoding, compressing or decompressing digital video signals characterised by implementation details or hardware specially adapted for video compression or decompression, e.g. dedicated software implementation
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 - Details of television systems
    • H04N5/14 - Picture signal circuitry for video frequency region
    • H04N5/144 - Movement detection
    • H - ELECTRICITY
    • H04 - ELECTRIC COMMUNICATION TECHNIQUE
    • H04N - PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 - Television systems
    • H04N7/18 - Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a video online reconstruction and moving target detection method based on frame compression measurement. First, a camera designed on the compressed sensing principle performs compressive measurement of the monitored scene at the source end; then, the single-frame compressed measurements are channel-coded and transmitted to a monitoring center; finally, the monitoring center decodes and collects the single-frame compressed measurement data and reconstructs the background and foreground of the video frame through a compressive reconstruction algorithm. The matrix nuclear norm models the low-rank property of the video background, the total variation (TV) function models the piecewise smoothness of the video foreground, and a Laplacian mixture distribution models the sparsity of the foreground; this refined modeling improves both the reconstruction accuracy of the video and the detection accuracy of the moving target.

Description

Video online reconstruction and moving target detection method based on frame compression measurement
Technical Field
The invention belongs to the technical field of video processing, and particularly relates to a video online reconstruction and moving target detection method based on frame compression measurement.
Background
Video moving object detection based on compressive measurements is a technique that has recently emerged in the field of video processing; it couples the two basic problems of video reconstruction and moving object detection and integrates their respective strengths. Existing video moving target detection methods focus on detection over complete video data. Because such methods can only process after a complete video frame sequence has been transmitted, they run into technical bottlenecks under today's video big data background, such as the effectiveness of mass data storage, the timeliness of transmission, and the real-time capability of processing. Fortunately, a new information processing theory, compressed sensing [E.J. Candès and M.B. Wakin. An introduction to compressive sampling. IEEE Signal Processing Magazine, 25(2):21-30, 2008], offers a way around this bottleneck. The theory states that, at the source end, it suffices to compressively measure the scene, i.e., to sample and store only its critical information; then only a small amount of measurement data is transmitted over the channel; finally, the original scene can be reconstructed with high probability at the sink end from this small number of compressed measurements. Inspired by compressed sensing, researchers have proposed video reconstruction and moving object detection methods based on compressive measurements, that is, moving object detection on compressed video data. Such methods are highly challenging. The following reviews conventional approaches to moving object detection on complete video data and on compressed video data.
1. Moving target detection method on complete video data
For ease of description, we introduce some notation. Suppose a video sequence is denoted
X = [x^(1), x^(2), …, x^(K)] ∈ R^(MN×K),
where K is the number of frames and each frame image has size M × N. The video frame sequence is assumed to decompose into a background sequence and a foreground sequence, i.e.,
X = B + S,
where B = [b^(1), b^(2), …, b^(K)] and S = [s^(1), s^(2), …, s^(K)].
The aim of such methods is to extract the foreground sequence S from the complete video sequence X. From the literature, they fall into two broad categories: the first uses a batch processing mode (processing the whole sequence at once), the second an online processing mode (frame-by-frame processing).
For batch processing methods, the popular algorithms are usually implemented by optimizing an energy function: with the background and foreground sequences as optimization variables, an energy-function model is established based on constraints such as the sparsity and low-rank property of the video frame images, and both sequences are obtained simultaneously by minimizing it. The general form of the energy-function model is:
min_{B,S} (1/2)·||X - B - S||_F^2 + λ·Ω(B) + μ·Ψ(S),
where B and S are the variables to be optimized, λ and μ are regularization parameters, and Ω(B) and Ψ(S) are regularization constraints on the background sequence and the foreground sequence, respectively, usually some form of sparsity constraint. Typical literature for such methods includes [J. Wright, A. Ganesh, S. Rao, Y. Peng, and Y. Ma. Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In NIPS, 2009], [B. Xin, Y. Tian, Y. Wang, and W. Gao. Background subtraction via generalized fused lasso foreground modeling. In CVPR, 2015], and [W. Hu, Y. Yang, W. Zhang, and Y. Xie. Moving object detection using tensor-based low-rank and saliently fused-sparse decomposition. IEEE Trans. on Image Processing, 26(2):724-737, 2017]. The first work uses a robust principal component analysis model to solve the moving object detection problem: the correlation of the background sequence is modeled by matrix low-rankness, and the foreground sequence is characterized by sparsity. The second uses a generalized fused lasso to model the foreground sequence; this modeling not only describes the sparsity of the foreground image but also accounts for the local similarity of pixel intensities, yielding a better moving target detection effect. For the foreground sequence, the third work first obtains the salient regions of the video sequence through saliency detection and then embeds the salient-region information into the generalized fused lasso model.
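As an illustrative sketch (not the algorithm of any of the cited papers), a robust-PCA-style energy minimization of this form can be approximated by alternating two proximal operators: singular-value thresholding for the low-rank background and soft thresholding for the sparse foreground. All names, dimensions, and parameter values below are assumptions for a toy example.

```python
import numpy as np

def svt(Z, tau):
    # Singular-value thresholding: proximal operator of the nuclear norm.
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    return U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt

def soft(Z, tau):
    # Soft thresholding: proximal operator of the l1 norm.
    return np.sign(Z) * np.maximum(np.abs(Z) - tau, 0.0)

def rpca_decompose(X, lam=None, mu=1.0, n_iter=100):
    """Alternately split X into a low-rank background B and a sparse
    foreground S (a naive alternating scheme, not exact RPCA)."""
    if lam is None:
        lam = 1.0 / np.sqrt(max(X.shape))
    B = np.zeros_like(X)
    S = np.zeros_like(X)
    for _ in range(n_iter):
        B = svt(X - S, 1.0 / mu)   # background step: low-rank prox
        S = soft(X - B, lam / mu)  # foreground step: sparse prox
    return B, S

# Toy data: rank-1 static background plus a small sparse foreground blob.
rng = np.random.default_rng(0)
bg = np.outer(rng.random(64), np.ones(20))      # 64 pixels x 20 frames
fg = np.zeros((64, 20)); fg[10:14, 5:8] = 1.0
B, S = rpca_decompose(bg + fg)
```

The two prox operators are exactly the building blocks that reappear, frame by frame, in the online and compressive methods discussed below.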
Compared with the former two works, the tensor-based decomposition of Hu et al. [IEEE Trans. on Image Processing, 26(2):724-737, 2017] models the background sequence more finely and thereby achieves better detection performance.
The other category is detection in the online processing mode. These methods are faster and can meet the practical requirement of detecting moving targets frame by frame. The general model appearing in the literature is:
min_{b_t, s_t} (1/2)·||x_t - b_t - s_t||_2^2 + λ·Ω(b_t) + μ·Ψ(s_t),
where Ω(b_t) and Ψ(s_t) are the regularization functions of the t-th background frame b_t and foreground frame s_t, respectively, typically some manifold constraint for the background and some sparsity constraint for the foreground. Specifically, based on incremental gradient descent on the Grassmannian manifold, [J. He, L. Balzano, and A. Szlam. Incremental gradient on the Grassmannian for online foreground and background separation in subsampled video. In CVPR, 2012] proposed an online video moving target detection method. Building on it, [J. Xu, V.K. Ithapu, L. Mukherjee, J.M. Rehg, and V. Singh. GOSUS: Grassmannian online subspace updates with structured-sparsity. In ICCV, 2013] further modeled the block sparsity of the foreground, giving an improved version of the He et al. method. [Y. Pang, L. Ye, X. Li, and J. Pan. Incremental learning with saliency map for moving object detection. IEEE Transactions on Circuits and Systems for Video Technology, 28(3):640-651, 2018] considered the salient features of video frames and, on the basis of the He et al. model, proposed an efficient moving target detection model and algorithm.
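To make the online processing mode concrete, the following is a deliberately simplified sketch in which the background regularizer Ω(b_t) is replaced by a running-mean background and Ψ(s_t) by a plain l1 term. It illustrates frame-by-frame detection only; it is not a reimplementation of the Grassmannian methods cited above, and the parameter values are assumptions.

```python
import numpy as np

def online_detect(frames, lam=0.1, alpha=0.05):
    """Frame-by-frame foreground detection against a running-mean
    background: a simplified stand-in for the online model
    min 0.5*||x_t - b_t - s_t||^2 + lam*Psi(s_t)."""
    b = frames[0].astype(float)      # initialize background from frame 1
    masks = []
    for x in frames[1:]:
        r = x - b                                           # residual w.r.t. background
        s = np.sign(r) * np.maximum(np.abs(r) - lam, 0.0)   # sparse foreground estimate
        masks.append(np.abs(s) > 0)                         # detection mask
        b = (1 - alpha) * b + alpha * (x - s)               # slow background update
    return masks

# Toy sequence: constant background with one bright moving pixel.
base = np.full(10, 0.5)
f1 = base.copy(); f1[3] = 1.5
f2 = base.copy(); f2[6] = 1.5
masks = online_detect([base, f1, f2])
```

The same per-frame loop structure carries over to the compressive setting below; only the data-fidelity term changes.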
Under the background of video big data, the above methods for detecting moving targets on complete video data usually encounter bottlenecks. For example, the large number of cameras distributed along urban traffic arteries capture enormous amounts of surveillance video that must be transmitted to a network control center for timely analysis; synchronously transmitting the complete video data may cause network congestion or even network paralysis. It is therefore particularly necessary to transmit only a small amount of non-redundant video data. To address this challenge, moving object detection methods on compressed video data came into being.
2. Moving object detection method on compressive video data
The aim of such methods is to reconstruct the video background from the compressed video data and to detect moving objects with a designed detection method. Among them, [V. Cevher, A. Sankaranarayanan, M.F. Duarte, D. Reddy, R.G. Baraniuk, and R. Chellappa. Compressive sensing for background subtraction. In ECCV, 2008] proposed an effective moving object detection method based on a typical compressive sensing recovery model. In [A.E. Waters, A.C. Sankaranarayanan, and R. Baraniuk. SpaRCS: Recovering low-rank and sparse matrices from compressive measurements. In NIPS, 2011], an optimization model modeling the background by low-rankness and the foreground by sparsity is proposed to solve the moving target detection problem. Building on that method, [W. Cao, Y. Wang, J. Sun, D. Meng, C. Yang, A. Cichocki, and Z. Xu. Total variation regularized tensor RPCA for background subtraction from compressive measurements. IEEE Transactions on Image Processing, 25(9):4075-4090, 2016] designs more refined models of the background and foreground sequences. Specifically, tensor low-rankness replaces matrix low-rankness to model the background sequence, a spatio-temporal total variation model replaces plain sparsity to characterize the foreground sequence, and an effective moving target detection method is proposed based on this refined modeling.
The other type is online processing based on frame-by-frame compressive measurements, which can meet the real-time requirements of practical video surveillance. The general form of the model in the literature is:
min_{b_t, s_t} (1/2)·||y_t - A(b_t + s_t)||_2^2 + λ·Ω(b_t) + μ·Ψ(s_t),
where A is some compression measurement matrix, and Ω(b_t) and Ψ(s_t) are regularization functions of the t-th background frame b_t and foreground frame s_t. In [J.F. Mota, N. Deligiannis, A.C. Sankaranarayanan, V. Cevher, and M.R. Rodrigues. Adaptive-rate reconstruction of time-varying signals with application in compressive foreground extraction. IEEE Transactions on Signal Processing, 64(14):3651-3666, 2016], the authors propose an online moving target detection algorithm based on an l1-l1 minimization model. In [H. Luong, N. Deligiannis, J. Seiler, S. Forchhammer, and A. Kaup. Compressive online robust principal component analysis via n-l1 minimization. IEEE Transactions on Image Processing, 2018], the authors generalize the l1-l1 minimization model to an n-l1 minimization model, convert the frame-compressive moving target detection problem into a sparse optimization problem, and design an efficient optimization method.
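The sparse-recovery core of such l1-based methods can be sketched with plain ISTA, assuming the background contribution is known and already subtracted from the measurements. The measurement matrix, dimensions, and parameter values below are illustrative assumptions, not those of the cited papers.

```python
import numpy as np

def ista_l1(A, y, lam=0.01, n_iter=500):
    """ISTA for min 0.5*||y - A s||^2 + lam*||s||_1: recover a sparse
    foreground s from compressed measurements y (background assumed
    known and already subtracted)."""
    L = np.linalg.norm(A, 2) ** 2                 # Lipschitz constant of the gradient
    s = np.zeros(A.shape[1])
    for _ in range(n_iter):
        z = s - A.T @ (A @ s - y) / L             # gradient step on the data term
        s = np.sign(z) * np.maximum(np.abs(z) - lam / L, 0.0)  # shrinkage step
    return s

rng = np.random.default_rng(1)
n, m = 100, 50                                    # 100-pixel frame, 50 measurements
A = rng.standard_normal((m, n)) / np.sqrt(m)      # random measurement matrix
s_true = np.zeros(n); s_true[[7, 42, 77]] = [1.0, -1.5, 2.0]
s_hat = ista_l1(A, A @ s_true)
```

With 50 measurements of a 100-pixel frame carrying only 3 active pixels, the support of the foreground is recovered despite the 2x undersampling, which is the basic promise compressed sensing brings to this problem.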
All of the above moving object detection methods, on complete and on compressed video data alike, have limitations. Methods on complete video data do not consider the compressibility of the data during scene acquisition, which under the video big data background easily causes storage burden at the source end and network congestion during transmission. Methods on compressed video data do consider compressive measurement during acquisition, but only simple sparse prior constraints are used to model the video background and foreground, and no finer modeling of the 3-dimensional video data is considered; the detection accuracy of the moving target at a given compressive sampling rate therefore still needs improvement.
Disclosure of Invention
In order to solve the above problems in the prior art, the present invention provides a video online reconstruction and moving object detection method based on frame compression measurement. The technical problem to be solved by the invention is realized by the following technical scheme: a video online reconstruction and moving target detection method based on frame compression measurement comprises the following steps:
step 1, collecting a video sequence X_0 as a training set, inputting X_0 into a robust principal component analysis model, and outputting a video prior background B_0 and a foreground sequence S_0; assigning the last k frames of the foreground sequence S_0 to obtain the video prior foreground ŝ_0, where the number of frames of X_0 is L, L being a positive integer;
step 2, collecting the compressed measurement y_t of the t-th frame video image of the monitored scene;
Step 3, establishing a reconstruction model, wherein the reconstruction model adopts a matrix nuclear norm to model the low rank of the video prior background, and adopts a total variation TV regular function and a negative logarithm Laplace mixed function to respectively model the fragment smoothness and the sparsity of the video prior foreground;
step 4, inputting the compressed measurement y_t of the t-th frame video image, the video prior background B_{t-1}, and the video prior foreground ŝ_{t-1} into the reconstruction model, obtaining the t-th frame background b_t and foreground s_t of the video by minimizing the model output, and then detecting the moving target from the foreground s_t by image threshold segmentation, where t is a positive integer;
step 5, updating the current video prior background to B_t according to the background b_t and the video prior background B_{t-1}, and updating the current video prior foreground to ŝ_t according to the foreground s_t and the video prior foreground ŝ_{t-1};
and step 6, repeating steps 2 to 5 in turn; when t = T, terminating the updating of the current video prior background B_t and the current video prior foreground ŝ_t, where T represents the monitoring time or the number of video frames.
Further, the reconstruction model in step 3 is:
min_{b_t, s_t} (1/2)·||y_t - A(b_t + s_t)||_2^2 + λ·Ω(b_t) + γ·TV(s_t) + μ·Ψ(s_t),
where A is the compression measurement matrix, λ and μ are regularization parameters, and γ = μ·τ; B_{t-1} represents the video prior background, ŝ_{t-1} the video prior foreground, b_t the t-th frame background of the video image, and s_t the t-th frame foreground of the video image.
Ω(b_t) = ||[B_{t-1}, b_t]||_* models the low-rank property of the video prior background via the matrix nuclear norm, where ||Z||_* = Σ_i σ_i(Z) and σ_i(Z) is the i-th singular value of the matrix Z;
Ψ(s_t) = -Σ_i log p(s_{t,i}; θ_{t-1}, π_{t-1});
the total variation (TV) regularization function and the negative-log Laplacian mixture function model the piecewise smoothness and the sparsity of the video prior foreground, respectively, where s_{t,i} is the i-th entry of the foreground frame s_t, θ_{t-1} is the variance vector of the Laplacian mixture distribution, π_{t-1} is the vector of mixing coefficients of the Laplacian mixture components, and τ is a parameter balancing the two foreground terms; TV(s_t) = ||D s_t||_1 is the anisotropic total variation function of the image, with D = [D_h; D_v] composed of the difference operators in the horizontal and vertical directions of the image;
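The anisotropic total variation TV(s_t) = ||D s_t||_1, with D built from the horizontal and vertical difference operators, can be computed as follows. This is a minimal sketch; forward differences with the last row/column dropped are an assumed boundary convention.

```python
import numpy as np

def anisotropic_tv(img):
    """TV(s) = ||D s||_1 with D = [D_h; D_v]: the sum of absolute
    horizontal and vertical forward differences of the image."""
    dh = np.abs(np.diff(img, axis=1)).sum()   # horizontal differences D_h
    dv = np.abs(np.diff(img, axis=0)).sum()   # vertical differences D_v
    return float(dh + dv)

flat = np.ones((4, 4))                        # constant image: TV = 0
step = np.zeros((4, 4)); step[:, 2:] = 1.0    # one vertical edge: TV = 4
```

A constant image has zero TV while a single sharp edge contributes one unit per row it crosses, which is why this term favors piecewise-smooth foregrounds.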
the Laplacian mixture probability distribution is:
p(x; θ, π) = Σ_{j=1}^{J} π_j · (1/(2θ_j)) · exp(-|x|/θ_j),
and the parameters θ and π are estimated by an expectation-maximization algorithm.
Further, in step 4, the specific steps for obtaining the t-th frame background b_t and foreground s_t of the video by minimizing the reconstruction model output are as follows: optimize the reconstruction model with a proximal gradient method and an expectation-maximization algorithm, where the proximal gradient function linearizes the data-fidelity term f(b_t, s_t) = (1/2)·||y_t - A(b_t + s_t)||_2^2 at the current iterate:
(b_t^{k+1}, s_t^{k+1}) = argmin_{b_t, s_t} λ·Ω(b_t) + γ·TV(s_t) + μ·Ψ(s_t) + (ρ/2)·||b_t - (b_t^k - ∇_{b_t} f / ρ)||_2^2 + (ρ/2)·||s_t - (s_t^k - ∇_{s_t} f / ρ)||_2^2,
where ∇_{b_t} f = ∇_{s_t} f = A^T(A(b_t^k + s_t^k) - y_t) are the gradients and ρ is a step-size parameter. By an alternating optimization method, the proximal gradient function is split into the b_t-subproblem, the s_t-subproblem, and the (θ_t, π_t) parameter subproblem, which are iterated; when the relative change is less than 1e-5, the iteration stops and the t-th frame background b_t and foreground s_t of the video are output.
The (θ_t, π_t) subproblem obtains the update expressions of the parameters through the expectation-maximization algorithm.
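The alternating scheme of step 4 can be sketched as follows. This is not the patent's exact algorithm: the combined TV-plus-mixture proximal step is replaced here by plain soft thresholding, and the step size, dimensions, and parameter values are illustrative assumptions.

```python
import numpy as np

def soft(z, tau):
    return np.sign(z) * np.maximum(np.abs(z) - tau, 0.0)

def svt_last_col(B_prev, b, tau):
    """Proximal step for lam*||[B_{t-1}, b]||_* with respect to b:
    threshold the singular values of the concatenation and return
    the updated last column."""
    Z = np.column_stack([B_prev, b])
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    Z = U @ np.diag(np.maximum(s - tau, 0.0)) @ Vt
    return Z[:, -1]

def reconstruct_frame(A, y, B_prev, lam=0.1, gamma=0.05, n_iter=500, tol=1e-5):
    """Alternating proximal-gradient sketch for one frame (b_t, s_t)."""
    n = A.shape[1]
    b, s = B_prev.mean(axis=1), np.zeros(n)    # warm-start from the prior background
    rho = np.linalg.norm(A, 2) ** 2            # step-size parameter
    for _ in range(n_iter):
        b_old, s_old = b.copy(), s.copy()
        g = A.T @ (A @ (b + s) - y)            # shared gradient of the data term
        b = svt_last_col(B_prev, b - g / rho, lam / rho)
        s = soft(s - g / rho, gamma / rho)
        denom = np.linalg.norm(b_old) + np.linalg.norm(s_old) + 1e-12
        if (np.linalg.norm(b - b_old) + np.linalg.norm(s - s_old)) / denom < tol:
            break                              # relative change below 1e-5
    return b, s

rng = np.random.default_rng(4)
n, m = 40, 30
c = rng.random(n)
B_prev = np.tile(c[:, None], (1, 5))           # rank-1 prior background
s_true = np.zeros(n); s_true[[5, 20]] = [2.0, -2.0]
A = rng.standard_normal((m, n)) / np.sqrt(m)
b_hat, s_hat = reconstruct_frame(A, A @ (c + s_true), B_prev)
```

Note the single gradient g serves both subproblems, since the data term depends on b_t and s_t only through their sum.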
Further, the specific steps of step 5 are:
step 5.1, the update formula of the video prior background B_t is:
[U, S, V] = SVD([B_{t-1}, b_t])   formula (1)
B_t = U(:, 1:L) · S(1:L, 1:L) · V(:, 1:L)^T   formula (2)
where formula (1) performs singular value decomposition of the matrix [B_{t-1}, b_t], and the video prior background B_t is then derived by formula (2); SVD denotes the singular value decomposition of a matrix, U its left singular vectors, V its right singular vectors, and S its singular values;
step 5.2, the current video prior foreground ŝ_t is updated from the foreground s_t and the previous video prior foreground ŝ_{t-1}.
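The truncated-SVD update of step 5.1 can be sketched directly from formulas (1) and (2); the toy dimensions below are assumptions.

```python
import numpy as np

def update_prior_background(B_prev, b_new, L):
    """B_t = U(:,1:L) * S(1:L,1:L) * V(:,1:L)^T for the concatenation
    [B_{t-1}, b_t], i.e. a rank-L truncated SVD of the augmented
    background matrix."""
    Z = np.column_stack([B_prev, b_new])
    U, s, Vt = np.linalg.svd(Z, full_matrices=False)
    return U[:, :L] @ np.diag(s[:L]) @ Vt[:L, :]

rng = np.random.default_rng(2)
B_prev = np.outer(rng.random(30), rng.random(6))          # rank-1 prior background
b_new = B_prev[:, -1] + 0.01 * rng.standard_normal(30)    # new background frame
B_t = update_prior_background(B_prev, b_new, L=1)
```

The truncation keeps the prior background at rank at most L even as new frames are appended, which is what keeps the nuclear-norm term in the reconstruction model informative.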
further, in the step 2, a block discrete hadamard matrix which is randomly down-sampled is adopted to simulate a compression measurement matrix, the size of the discrete hadamard matrix is 32 × 32, and the sampling rate of the random matrix is set.
Compared with the prior art, the invention has the following beneficial effects. The invention provides a novel online video reconstruction and detection method over frame-compressive measurements, based on recent progress in sparsity modeling. The method models the low-rank property of the video background with the matrix nuclear norm, the piecewise smoothness of the video foreground with the total variation (TV) function, and the sparsity of the foreground with a Laplacian mixture distribution; this refined modeling improves both the reconstruction accuracy of the video and the detection accuracy of the moving target.
The method is designed on the compressed sensing theory; because sampling and compression are carried out simultaneously, it not only reduces the storage burden at the source end but also alleviates network congestion to a certain extent.
The invention can be applied to video surveillance scenarios such as traffic, campus, and military-port monitoring, and simultaneously meets the surveillance requirements of data compression and online processing.
Drawings
Fig. 1 is a schematic diagram of the whole process of video reconstruction and moving object detection according to the present invention.
FIG. 2 is a flow chart of the method of the present invention.
FIG. 3 is an alternate iteration of background, foreground, parameters of the present invention.
Fig. 4a is the real original image for video reconstruction and moving object detection of a scene at a sampling rate of 0.7 according to the present invention.
Fig. 4b is the reconstructed foreground image for video reconstruction and moving object detection of a scene at a sampling rate of 0.7 according to the present invention.
Fig. 4c is the reconstructed background image for video reconstruction and moving object detection of a scene at a sampling rate of 0.7 according to the present invention.
Fig. 5a is the real original image for video reconstruction and moving object detection of a scene at a sampling rate of 0.4 according to the present invention.
Fig. 5b is the reconstructed foreground image for video reconstruction and moving object detection of a scene at a sampling rate of 0.4 according to the present invention.
Fig. 5c is the reconstructed background image for video reconstruction and moving object detection of a scene at a sampling rate of 0.4 according to the present invention.
Fig. 6a shows the result of the present invention detecting a moving object in the 10th frame (sampling rate = 0.15).
Fig. 6b shows the result of the method proposed by Luong et al. detecting a moving object in the 10th frame (sampling rate = 0.15).
Fig. 7a shows the result of the present invention detecting a moving object in the 30th frame (sampling rate = 0.15).
Fig. 7b shows the result of the method proposed by Luong et al. detecting a moving object in the 30th frame (sampling rate = 0.15).
Fig. 8a shows the result of the present invention detecting a moving object in the 10th frame (sampling rate = 0.2).
Fig. 8b shows the result of the method proposed by Luong et al. detecting a moving object in the 10th frame (sampling rate = 0.2).
Fig. 9a shows the result of the present invention detecting a moving object in the 30th frame (sampling rate = 0.2).
Fig. 9b shows the result of the method proposed by Luong et al. detecting a moving object in the 30th frame (sampling rate = 0.2).
Detailed Description
The present invention will be described in further detail with reference to specific examples, but the embodiments of the present invention are not limited thereto.
In the description of the present invention, it is to be understood that the terms "central," "longitudinal," "lateral," "upper," "lower," "front," "rear," "left," "right," "vertical," "horizontal," "top," "bottom," "inner," "outer," and the like indicate orientations or positional relationships based on those shown in the drawings; they are used merely for convenience and simplicity of description, do not indicate or imply that the referenced device or element must have a particular orientation or be constructed and operated in a particular orientation, and are therefore not to be construed as limiting the invention.
Furthermore, the terms "first," "second," "third," and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first," "second," etc. may explicitly or implicitly include one or more of that feature. In the description of the invention, "a plurality" means two or more unless otherwise specified.
The terms "mounted," "connected," and "coupled" are to be construed broadly and may, for example, mean fixedly connected, detachably connected, or integrally connected; mechanically or electrically connected; directly connected, indirectly connected through an intervening medium, or communicating between the interiors of two elements. The specific meaning of these terms in the present invention can be understood by those of ordinary skill in the art according to the specific situation.
As shown in fig. 1, the overall process of video reconstruction and moving object detection over video frame compression measurement is as follows. First, a camera designed on the compressed sensing principle performs compressive measurement of the monitored scene at the source end; then, the single-frame compressed measurements are channel-coded and transmitted to the monitoring center; finally, the monitoring center decodes and collects the single-frame compressed measurement data and reconstructs the background and foreground of the video frame through a compressive reconstruction algorithm.
Fig. 2 shows a flow chart of the implementation of the present invention, and the specific implementation steps are as follows:
the embodiment provides a video online reconstruction and moving target detection method based on frame compression measurement, which comprises the following steps:
step 1, collecting a video sequence X_0 as a training set, inputting X_0 into a robust principal component analysis model, and outputting a video prior background B_0 and a foreground sequence S_0; assigning the last k frames of the foreground sequence S_0 to obtain the video prior foreground ŝ_0, where the number of frames of X_0 is L, L being a positive integer;
specifically, the step 1 comprises the following specific steps:
step 1.1, construct training set
Collect a video sequence X_0 of a certain surveillance scene over a period of time; the number of frames of the sequence is L, usually L = 250. This sequence is used to initialize the video prior background and the video prior foreground.
Step 1.2, input the surveillance video sequence X_0 collected in step 1.1 into a Robust Principal Component Analysis (RPCA) method, which outputs the video prior background (background sequence) B_0 and the foreground sequence S_0 of this video segment. Assign the last k = 3 frames of the video foreground sequence to the prior foreground ŝ_0, which serves as auxiliary prior information for video moving target detection. Here k may also take the value 2, 4, or be chosen as needed.
Step 2, collect the compressed measurement y_t of the t-th frame video image of the monitored scene. This embodiment adopts a randomly down-sampled block discrete Hadamard matrix to simulate the compression measurement matrix and performs compressive measurement of the scene image. Of course, in actual operation, a camera designed on the compressed sensing principle (such as a single-pixel camera) can be purchased on the market to compressively measure the scene; the invention places no limitation on the type of camera. In this embodiment, the size of the discrete Hadamard matrix is set to 32 × 32 and the sampling rate of the random matrix to 0.7, but the sampling rate may also be set to other values, for example 0.15, 0.2, or 0.4, or chosen according to actual needs.
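One plausible reading of this block-Hadamard measurement, a separable 32 × 32 transform applied to flattened image blocks with rows kept at the given sampling rate, can be sketched as follows. The separable Kronecker construction and the row-orthonormal scaling are assumptions, not the patent's exact construction.

```python
import numpy as np

def hadamard(n):
    """Sylvester construction of the n x n Hadamard matrix (n a power of 2)."""
    H = np.array([[1.0]])
    while H.shape[0] < n:
        H = np.block([[H, H], [H, -H]])
    return H

def block_hadamard_measurement(rate, block=32, seed=0):
    """Randomly row-subsampled 2-D Hadamard operator for one
    block x block image block: rows of the separable orthonormal
    transform kron(H, H) are kept at the given sampling rate."""
    H = hadamard(block) / np.sqrt(block)   # orthonormal 1-D Hadamard
    H2 = np.kron(H, H)                     # 2-D transform acting on flattened blocks
    n = block * block
    m = max(1, int(round(rate * n)))
    rows = np.random.default_rng(seed).choice(n, size=m, replace=False)
    return H2[rows, :]                     # m x n measurement matrix

A = block_hadamard_measurement(rate=0.7)   # sampling rate 0.7, as in the embodiment
```

Because the kept rows are orthonormal, A·Aᵀ is the identity, a property that simplifies the gradient steps of the reconstruction algorithm.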
Step 3, establish a reconstruction model in which the matrix nuclear norm models the low-rank property of the video prior background, and a total variation (TV) regularization function and a negative-log Laplacian mixture function model the piecewise smoothness and the sparsity of the video prior foreground, respectively.
specifically, for video background, we use matrix kernel norm to model the low rank property of the video prior background, namely:
Ω(bt)=||[Bt-1,bt]||*
here | Z | non-calculation*=∑iσi(Z),σi(Z) is the ith singular value of the matrix Z, [ B ]t-1,bt]. For a video foreground, a negative logarithm Laplace mixing function and a total variation TV regular function are utilized to respectively model sparsity and slicing smoothness of a video prior foreground, namely:
Θ(s_t) = τ·TV(s_t) − Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

Here s̃_{t-1}^i is the i-th row of the video prior foreground S̃_{t-1}, σ_{t-1} is the variance vector of the Laplace mixture distribution, π_{t-1} is the vector of proportion coefficients of the Laplace mixture components, and τ is the parameter balancing these two terms. TV(s_t) = ||D s_t||_1 is the anisotropic total variation function of the image, where D = [D_h; D_v] consists of the difference operators in the horizontal and vertical directions of the image. The Laplace mixture probability distribution is:

p(x; π, σ) = Σ_k π_k / (2σ_k) · exp(−|x| / σ_k)

Note that the parameters π_{t-1} and σ_{t-1} can be estimated by the expectation-maximization algorithm (EM algorithm).
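The two foreground terms can be written down directly. The following sketch (helper names are our own; a zero-mean Laplace mixture is assumed, as in the distribution above) evaluates the negative log-likelihood of a foreground frame and its anisotropic total variation:

```python
import numpy as np

def laplace_mixture_nll(s, pi, sigma):
    """-sum_i log p(s_i; pi, sigma) under the zero-mean Laplace mixture
    p(x) = sum_k pi_k / (2*sigma_k) * exp(-|x| / sigma_k)."""
    s = np.asarray(s, dtype=float).ravel()
    pi = np.asarray(pi, dtype=float)
    sigma = np.asarray(sigma, dtype=float)
    dens = (pi / (2.0 * sigma)) * np.exp(-np.abs(s)[:, None] / sigma)
    return -np.sum(np.log(dens.sum(axis=1)))

def tv_aniso(img):
    """Anisotropic total variation ||D s||_1: sum of absolute
    horizontal and vertical finite differences."""
    return (np.abs(np.diff(img, axis=1)).sum()
            + np.abs(np.diff(img, axis=0)).sum())
```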
Based on the above background and foreground modeling for the t-th frame, the reconstruction model of the video is obtained as:

min_{b_t, s_t} (1/2)||y_t − A(b_t + s_t)||_2^2 + λ·Ω(b_t) + γ·TV(s_t) − μ·Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

The first term of the objective is the data-fidelity term, the second term models the low-rank property of the video prior background, the third term describes the piecewise smoothness of the t-th frame video prior foreground, and the fourth term describes its sparsity. The t-th frame background b_t and foreground s_t are obtained by minimizing this model. A is the compressed measurement matrix, λ and μ are the regularization parameters, and γ = μ·τ. B_{t-1} denotes the video prior background, S̃_{t-1} denotes the video prior foreground, b_t denotes the t-th frame background, and s_t denotes the t-th frame foreground.
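Putting the four terms together, the value of the reconstruction model can be evaluated as below. This is a sketch with hypothetical argument names, for a vectorized frame, using a 1-D difference in place of the full 2-D operator D:

```python
import numpy as np

def objective(y, A, b, s, B_prev, lam, mu, tau, pi, sigma):
    """Value of the reconstruction model at (b, s):
    fidelity + low-rank + TV smoothness + Laplace-mixture sparsity."""
    fid = 0.5 * np.sum((y - A @ (b + s)) ** 2)
    lowrank = lam * np.linalg.norm(np.column_stack([B_prev, b]), 'nuc')
    smooth = (mu * tau) * np.sum(np.abs(np.diff(s)))   # gamma = mu * tau
    dens = (np.asarray(pi) / (2.0 * np.asarray(sigma))) \
           * np.exp(-np.abs(s)[:, None] / np.asarray(sigma))
    sparse = -mu * np.sum(np.log(dens.sum(axis=1)))
    return fid + lowrank + smooth + sparse
```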
Step 4, inputting the compressed measurement y_t of the t-th frame video image, the video prior background B_{t-1}, and the video prior foreground S̃_{t-1} into the reconstruction model, obtaining the t-th frame background b_t and foreground s_t of the video by minimizing the reconstruction model, and then detecting the moving target from the foreground s_t by an image threshold segmentation method; wherein t denotes a positive integer;
Specifically, the steps of obtaining the t-th frame background b_t and foreground s_t by minimizing the reconstruction model are as follows: the reconstruction model is optimized with a proximal gradient method and the expectation-maximization algorithm (EM algorithm), where the proximal objective at iterate (b^k, s^k) is:

(1/(2ρ))||b_t − (b^k − ρ·g_b^k)||_2^2 + (1/(2ρ))||s_t − (s^k − ρ·g_s^k)||_2^2 + λ·Ω(b_t) + γ·TV(s_t) − μ·Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

where g_b^k = g_s^k = A^T(A(b^k + s^k) − y_t) are the gradients of the data-fidelity term with respect to b_t and s_t, and the step-size parameter ρ is chosen no larger than 1/||A||_2^2. By the alternating optimization method, the proximal objective is converted into three sub-problems:

Sub-problem 1 (background update): minimize (1/(2ρ))||b_t − (b^k − ρ·g_b^k)||_2^2 + λ·||[B_{t-1}, b_t]||_*;

Sub-problem 2 (foreground update): minimize (1/(2ρ))||s_t − (s^k − ρ·g_s^k)||_2^2 + γ·TV(s_t) − μ·Σ_i log p(s_t^i; π_{t-1}, σ_{t-1});

Sub-problem 3 (parameter update): re-estimate the Laplace mixture parameters (π, σ).

The three sub-problems are iterated in turn; when the relative change is less than 1e-5, the iteration terminates and the obtained t-th frame background b_t and foreground s_t of the video are output. For sub-problem 3, the update expressions of the parameters are obtained through the expectation-maximization algorithm. Figure 3 shows the alternating iteration diagram of the three sub-problems of the algorithm.
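The proximal step for a nuclear-norm term has a well-known closed form, singular value thresholding, sketched below (the function name `svt` is ours):

```python
import numpy as np

def svt(Z, thresh):
    """Singular value thresholding: the proximal operator of
    thresh * ||.||_* evaluated at the matrix Z."""
    U, sv, Vt = np.linalg.svd(Z, full_matrices=False)
    sv = np.maximum(sv - thresh, 0.0)   # soft-threshold the singular values
    return (U * sv) @ Vt
```

One common way to handle the fixed block B_{t-1} in sub-problem 1 is to apply `svt` to the concatenated matrix and keep only the last column as the updated b_t; this shortcut is our own assumption, not a statement of the patented solver.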
After the t-th frame background b_t and foreground s_t of the video are obtained, the moving target is detected from the foreground s_t by an image threshold segmentation method; for example, thresholding the foreground image yields the moving target.
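A minimal threshold-segmentation sketch (the helper name and the use of a caller-supplied threshold are our own assumptions; any standard image thresholding method would do):

```python
import numpy as np

def detect_moving_target(s_t, thresh):
    """Binarize the reconstructed foreground: pixels whose magnitude
    exceeds the threshold are declared moving-target pixels."""
    return np.abs(np.asarray(s_t)) > thresh
```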
Step 5, updating the current video prior background B_t according to the background b_t and the video prior background B_{t-1}, and updating the current video prior foreground S̃_t according to the foreground s_t and the video prior foreground S̃_{t-1}.
Specifically, the steps of step 5 are:

Step 5.1, the video prior background B_t is updated by:

[U, S, V] = SVD([B_{t-1}, b_t])   formula (1)

B_t = U(:, 1:L) · S(1:L, 1:L) · V(:, 1:L)^T   formula (2)

wherein formula (1) performs a singular value decomposition of the matrix [B_{t-1}, b_t], from which the video prior background B_t is derived by formula (2); SVD denotes the singular value decomposition of a matrix, U denotes the left singular vectors of the matrix, V denotes the right singular vectors of the matrix, and S denotes the singular values of the matrix;
Step 5.2, the current video prior foreground S̃_t is updated by:

S̃_t = [S̃_{t-1}(:, 2:L), s_t]

that is, the oldest frame foreground in S̃_{t-1} is discarded and the newly reconstructed foreground s_t is appended. Combining the prior background update B_t, the prior foreground update S̃_t, and the (t+1)-th frame compressed measurement y_{t+1} of the monitored scene, the background and foreground of the (t+1)-th frame image can be reconstructed through the above step 5. The loop proceeds in sequence until the termination condition is reached.
Step 6, looping steps 2 to 5 in sequence; when t = T, the updating of the current video prior background B_t and current video prior foreground S̃_t is terminated, wherein T denotes the monitoring time or the number of video frames, and the monitoring time can be set according to need.
Fig. 4a is a real original image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.7.
Fig. 4b is the reconstructed foreground image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.7.
Fig. 4c is the reconstructed background image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.7.
Fig. 5a is a real original image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.4.
Fig. 5b is the reconstructed foreground image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.4.
Fig. 5c is the reconstructed background image for video reconstruction and moving target detection of a scene of the present invention at a sampling rate of 0.4.
To illustrate the effects of the present invention concretely, comparative experiments are described below:

As shown in the figures, at the same sampling rate the method of the invention achieves higher precision, that is, more accurate detection of the moving target. The comparative examples are shown in figs. 6a and 6b, figs. 7a and 7b, figs. 8a and 8b, and figs. 9a and 9b. These figures show the results of the method of the present invention and of the method proposed by Luong et al. [H. Luong, N. Deligiannis, J. Seiler, S. Forchhammer, and A. Kaup. Compressive online robust principal component analysis via n-ℓ1 minimization. To appear in IEEE Transactions on Image Processing, 2018]. It can be seen that at low sampling rates the method of the invention can still detect moving targets, whereas the Luong method cannot detect a clear moving target.
The foregoing is a detailed description of the invention in connection with specific preferred embodiments, and the specific implementation of the invention is not to be considered limited to these descriptions. For those skilled in the art to which the invention pertains, several simple deductions or substitutions can be made without departing from the concept of the invention, and all of these shall be considered to fall within the protection scope of the invention.

Claims (4)

1. A video online reconstruction and moving target detection method based on frame compression measurement, characterized in that the method comprises the following steps:

Step 1, collecting a video sequence X_0 as a training set, inputting the video sequence X_0 into a robust principal component analysis model, and outputting a video prior background B_0 and a foreground sequence S_0; assigning the value of the k-th frame video image of the foreground sequence S_0 to obtain the video prior foreground S̃_0, wherein the number of frames of the video sequence X_0 is L, L denoting a positive integer;
Step 2, collecting the compressed measurement y_t of the t-th frame video image of the monitored scene;

Step 3, establishing a reconstruction model, wherein the reconstruction model adopts the matrix nuclear norm to model the low-rank property of the video prior background, and adopts a total variation (TV) regular function and a negative-log Laplace mixture function to model the piecewise smoothness and the sparsity of the video prior foreground, respectively;

Step 4, inputting the compressed measurement y_t of the t-th frame video image, the video prior background B_{t-1}, and the video prior foreground S̃_{t-1} into the reconstruction model, obtaining the t-th frame background b_t and foreground s_t of the video by minimizing the reconstruction model, and then detecting the moving target from the foreground s_t by an image threshold segmentation method, wherein t is a positive integer;

Step 5, updating the current video prior background to B_t according to the background b_t and the video prior background B_{t-1}, and updating the current video prior foreground to S̃_t according to the foreground s_t and the video prior foreground S̃_{t-1};

Step 6, looping steps 2 to 5 in sequence, and when t = T, terminating the updating of the current video prior background B_t and the current video prior foreground S̃_t, wherein T denotes the monitoring time or the number of video frames;
the reconstruction model in step 3 is:

min_{b_t, s_t} (1/2)||y_t − A(b_t + s_t)||_2^2 + λ·Ω(b_t) + γ·TV(s_t) − μ·Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

wherein A is the compressed measurement matrix, λ and μ are regularization parameters, and γ = μ·τ; B_{t-1} denotes the video prior background, S̃_{t-1} denotes the video prior foreground, b_t denotes the t-th frame background of the video image, and s_t denotes the t-th frame foreground of the video image;
Ω(b_t) = ||[B_{t-1}, b_t]||_* models the low-rank property of the video prior background with the matrix nuclear norm, wherein the nuclear norm is ||Z||_* = Σ_i σ_i(Z), σ_i(Z) being the i-th singular value of the matrix Z;
the total variation (TV) regular function and the negative-log Laplace mixture function model the piecewise smoothness and the sparsity of the video prior foreground, respectively, as

τ·TV(s_t) − Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

wherein s̃_{t-1}^i is the i-th row of the video prior foreground S̃_{t-1}, σ_{t-1} is the variance vector of the Laplace mixture distribution, π_{t-1} is the vector of proportion coefficients of the Laplace mixture components, and π_{t-1} satisfies the constraints Σ_k π_{t-1,k} = 1 and π_{t-1,k} ≥ 0; τ is the parameter balancing these two terms; TV(s_t) = ||D s_t||_1 is the anisotropic total variation function of the image, and D = [D_h; D_v] consists of the difference operators in the horizontal and vertical directions of the image;
the Laplace mixture probability distribution is:

p(x; π, σ) = Σ_k π_k / (2σ_k) · exp(−|x| / σ_k)

and the parameters π_{t-1} and σ_{t-1} are estimated by the expectation-maximization algorithm.
2. The method for video online reconstruction and moving target detection based on frame compression measurement according to claim 1, characterized in that in step 4 the specific steps of obtaining the t-th frame background b_t and foreground s_t of the video by minimizing the reconstruction model are: optimizing the reconstruction model with a proximal gradient method and the expectation-maximization algorithm, wherein the proximal objective is:

(1/(2ρ))||b_t − (b^k − ρ·g_b^k)||_2^2 + (1/(2ρ))||s_t − (s^k − ρ·g_s^k)||_2^2 + λ·Ω(b_t) + γ·TV(s_t) − μ·Σ_i log p(s_t^i; π_{t-1}, σ_{t-1})

wherein g_b^k = g_s^k = A^T(A(b^k + s^k) − y_t) are the gradients of the data-fidelity term, and the step-size parameter ρ is no larger than 1/||A||_2^2; by the alternating optimization method, the proximal objective is converted into three sub-problems, namely a background update sub-problem, a foreground update sub-problem, and a Laplace mixture parameter update sub-problem; the three sub-problems are iterated, the iteration terminates when the relative change is less than 1e-5, and the t-th frame background b_t and foreground s_t of the video are output; wherein the parameter update sub-problem obtains the update expressions of the parameters through the expectation-maximization algorithm.
3. The method for video online reconstruction and moving target detection based on frame compression measurement according to claim 1 or 2, characterized in that the specific steps of step 5 are:

Step 5.1, the video prior background B_t is updated by:

[U, S, V] = SVD([B_{t-1}, b_t])   formula (1)

B_t = U(:, 1:L) · S(1:L, 1:L) · V(:, 1:L)^T   formula (2)

wherein formula (1) performs a singular value decomposition of the matrix [B_{t-1}, b_t], from which the video prior background B_t is derived by formula (2); SVD denotes the singular value decomposition of a matrix, U denotes the left singular vectors of the matrix, V denotes the right singular vectors of the matrix, and S denotes the singular values of the matrix;
Step 5.2, the current video prior foreground S̃_t is updated by:

S̃_t = [S̃_{t-1}(:, 2:L), s_t]

that is, the oldest frame foreground in S̃_{t-1} is discarded and the foreground s_t is appended.
4. The method for video online reconstruction and moving target detection based on frame compression measurement according to claim 1, characterized in that in step 2 a randomly down-sampled block discrete Hadamard matrix simulates the compressed measurement matrix, the size of the discrete Hadamard matrix is 32 × 32, and the sampling rate of the random matrix is set.
CN201810564696.9A 2018-06-04 2018-06-04 Video online reconstruction and moving target detection method based on frame compression measurement Active CN108965885B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810564696.9A CN108965885B (en) 2018-06-04 2018-06-04 Video online reconstruction and moving target detection method based on frame compression measurement


Publications (2)

Publication Number Publication Date
CN108965885A CN108965885A (en) 2018-12-07
CN108965885B true CN108965885B (en) 2020-11-10

Family

ID=64492819

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810564696.9A Active CN108965885B (en) 2018-06-04 2018-06-04 Video online reconstruction and moving target detection method based on frame compression measurement

Country Status (1)

Country Link
CN (1) CN108965885B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113329228B (en) * 2021-05-27 2024-04-26 杭州网易智企科技有限公司 Video encoding method, decoding method, device, electronic equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011091815A1 (en) * 2010-01-28 2011-08-04 Scivis Wissenschaftliche Bildverarbeitung Gmbh Tomographic imaging using poissonian detector data
CN102915562A (en) * 2012-09-27 2013-02-06 天津大学 Compressed sensing-based multi-view target tracking and 3D target reconstruction system and method
CN103745465A (en) * 2014-01-02 2014-04-23 大连理工大学 Sparse coding background modeling method
CN104599292A (en) * 2015-02-03 2015-05-06 中国人民解放军国防科学技术大学 Noise-resistant moving target detection algorithm based on low rank matrix
CN105243670A (en) * 2015-10-23 2016-01-13 北京航空航天大学 Sparse and low-rank joint expression video foreground object accurate extraction method


Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Total Variation Regularized Tensor RPCA for Background Subtraction From Compressive Measurements; Wenfei Cao et al.; IEEE Xplore; 2016-12-31; full text *
Research on compressed sensing image reconstruction methods based on prior information; Chen Yiguang; China Master's Theses Full-text Database; 2013-12-31; full text *
Research on ultrasound image reconstruction based on compressed sensing; Qin Xiaowei; China Master's Theses Full-text Database; 2014-03-31; full text *



Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant