CN106920255B

CN106920255B - Moving object extraction method and device for image sequence

Info

Publication number: CN106920255B
Application number: CN201510983350.9A
Authority: CN
Inventors: 胡懋地
Original assignee: Aisino Corp
Current assignee: Aisino Corp
Priority date: 2015-12-24
Filing date: 2015-12-24
Publication date: 2020-06-05
Anticipated expiration: 2035-12-24
Also published as: CN106920255A

Abstract

The invention relates to the technical field of computer vision, and discloses a moving target extraction method and a device for an image sequence, wherein the moving target extraction method comprises the following steps: establishing hidden Markov models of various moving targets; detecting an image sequence of a moving target to be extracted by adopting a hidden Markov model, and outputting attribute information sets of all bounding boxes, which are obtained by detecting the hidden Markov model corresponding to various moving targets in all frame images; and searching out a set of moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm. The hidden Markov model is adopted to model the moving target, the appearance of the target under different postures is accurately represented, and the target with complex change can be processed. And (3) searching an optimal subsequence in a bounding box set obtained by detection in the image sequence by combining a forward probability propagation method and a backward backtracking method, so that the moving target can be quickly and accurately extracted.

Description

Moving object extraction method and device for image sequence

Technical Field

The invention relates to the technical field of computer vision, in particular to a moving target extraction method and device for an image sequence.

Background

In the field of computer vision, two applications of similar target extraction and target tracking are carried out on a specific moving target in an image sequence, and the difference is that the target extraction is to extract the position and the size of the target in each frame from the existing image sequence; and the target tracking is to find the position and the size of a known target in a new image according to an existing image sequence when a new frame of image arrives, such as missile guidance.

In the technology, the target extraction process generally assumes that each frame of image is known and can be used for searching and matching, and a common searching method has a dynamic programming algorithm, but the algorithm efficiency is low; generally, target tracking models known targets in latest multi-frame images, predicts position and size changes of the targets, finds the known targets in a new frame image as much as possible, updates a model after finding the targets, and makes new predictions.

In addition, for target extraction and target tracking, there are also methods for extracting and tracking multiple types of moving targets in the prior art, for example, a component analysis method is used to perform multiple types of target matching, determine the attributes of the moving targets, recognize the moving targets, and perform temporal association to realize tracking of the moving targets. For another example, motion trajectory information of each motion target is extracted from video data, a motion mode of multi-motion target motion behaviors is modeled on each level, and recognition of the multi-motion target motion behaviors in the video is achieved by using a classifier. However, these methods have a drawback of low efficiency when classifying and extracting a specific moving object in an image sequence.

Disclosure of Invention

The invention aims to provide a moving object extraction method and a moving object extraction device for an image sequence, which are used for solving the problem of low efficiency of a classification extraction scheme of a specific moving object in the existing image sequence.

In order to achieve the above object, the present invention provides a moving object extraction method for an image sequence, the moving object extraction method comprising: establishing hidden Markov models of various moving targets; detecting an image sequence of a moving target to be extracted by adopting a hidden Markov model, and outputting attribute information sets of all bounding boxes, which are obtained by detecting the hidden Markov model corresponding to various moving targets in all frame images; and searching out a set of moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm.

Preferably, the outputting the attribute information set of each bounding box detected by the hidden markov model corresponding to each moving target in each frame image includes: and for any bounding box of each frame of image, if the probability that the image features of the bounding box are any states of any kinds of moving objects is greater than a first set threshold value, outputting the attribute information set of the bounding box.

Preferably, the attribute information includes an abscissa, an ordinate, a width, a height of the bounding box, and a probability that the bounding box is an arbitrary state of an arbitrary kind of moving object.

Preferably, the searching out a set of moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a backward backtracking algorithm includes: for each moving target, combining the attribute information set, and calculating the optimal probability of taking the image sequence at the end of each surrounding frame of each frame of image as the moving target by adopting a forward probability propagation algorithm; and searching out a set of moving objects to be extracted from the attribute information set by adopting a reverse backtracking algorithm in combination with the optimal probability.

Preferably, the calculating the optimal probability of the moving object by using the forward probability propagation algorithm with the image sequence at the end of each bounding box of each frame image comprises: v for t frame image_tA bounding box, a previous bounding box v of the bounding box is calculated_t-1And the v th_tThe surrounding frames are the same moving object, and the v-th frame of the image_tProbability E (v) of each bounding box being state i_t-1I); from all the probabilities calculated, E (v)_t-1I) selecting the maximum value E', and corresponding v to E_t-1And i are set to v ″, respectively_t-1And i'; if (W (k, t, v)_t)+E＇)/(L(k,t,v_t)+1)>z', wherein W (k, t, v)_t) Indicating the v-th frame for the k-th moving object_tThe probability that the image sequence at the end of the bounding box is the k-th moving object, L (k, t, v)_t) Indicating a vth in a tth frame_tThe number of frames of the image sequence at the end of each bounding box, z', is a second set threshold, then the recursive operation is performed according to the following formula: q (k, t, v)_t)＝v＇_t-1，L(k,t,v_t)＝L(k,t,v＇_t-1)+1，W(k,t,v_t)＝W(k,t,v_t) + E'; wherein, Q (k, t, v)_t) V-th frame representing the frame of t_tThe bounding box serial number of the t-1 th frame connected with the bounding box; and finally calculated W (k, t, v)_t) As for the k-th moving object, the v-th frame of the t-th frame_tThe image sequence at the end of each bounding box is the optimal probability for this kth moving object.

Preferably, the probability E (v) is calculated using the following formula_t-1,i)：E(v_t-1,i)＝P_k(a(t,v_t)|i)+αT(S(k,t,v_t-1),i)+βC(t,vt,t-1,v_t-1) (ii) a Wherein i represents the ith state, and i is 1, …, n, n is the total number of state types; c (t, v)_t,t-1,v_t-1) Is the v-th frame of the t-th frame_tA v th frame surrounding the frame and the t-1 th frame_t-1The size of the overlapping area of the bounding box accounts for the vth of the tth frame_tThe ratio of the sizes of the bounding boxes passing through t and v in the attribute information set_tCalculating corresponding attribute information; t (S (k, T, v)_t-1) I) is the S (k, t, v)_t-1) Probability of state transition of individual state to i-th state, S (k, t, v)_t-1) V-th representing a t-th frame_t-1The sequence numbers of the states corresponding to the surrounding frames, α and β are the weight values of the state transition probability and the proportion of the overlapping area respectively.

Preferably, the searching out the set of moving objects to be extracted from the attribute information set by using a reverse backtracking algorithm includes: obtaining the v th frame of the t th frame according to the calculation result of the forward probability propagation algorithm_tThe image sequence at the end of each bounding box is the maximum probability W ' of the kth moving object, and k corresponding to W ' is set as k '; let r be the v th frame to be searched_tAdding the attribute information corresponding to k' in the attribute information set into r for the image sequence at the end of each bounding box; find the v th frame of the t th frame_tThe bounding box number of the t-1 th frame of the bounding box connection is marked as v^～ _t-1K 'and v' in the attribute information set^～ _t-1Adding corresponding attribute information into r, and repeating the operation until the v th frame of the t th frame_tThe serial number of the bounding box of the t-1 th frame connected with the bounding box is 0; and combining the finally obtained r and k ' into a set { r, k ' } of the moving target to be extracted, wherein r represents the position and the size of the moving target, and k ' represents the type of the moving target.

The technical scheme of the invention also provides a moving object extraction device for the image sequence, which comprises the following steps: the model training module is used for establishing hidden Markov models of various moving targets; the target detection module is used for detecting the image sequence of the moving target to be extracted by adopting a hidden Markov model and outputting attribute information sets of all the surrounding frames, which are obtained by detecting the hidden Markov model corresponding to various moving targets in all the frames of images; and the target extraction module is used for searching a set of the moving targets to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm.

Preferably, the object detection module comprises: the sequence detection submodule is used for detecting the image sequence of the moving target to be extracted by adopting a hidden Markov model; and the selection output sub-module is used for outputting the attribute information set of the bounding box for any bounding box of each frame of image if the image characteristics of the bounding box are that the probability of any state of any kind of moving object is greater than a first set threshold value.

Preferably, the object extraction module comprises: the forward calculation submodule is used for calculating the optimal probability of taking the image sequence at the end of each enclosing frame of each frame of image as the moving target by combining the attribute information set for each moving target and adopting a forward probability propagation algorithm; and the reverse calculation submodule is used for searching out a set of the moving objects to be extracted from the attribute information set by adopting a reverse backtracking algorithm in combination with the optimal probability.

Through the technical scheme, the invention has the beneficial effects that: the hidden Markov model is adopted to model the moving target, the appearance of the target under different postures is accurately represented, and the target with complex change can be processed. And (3) searching an optimal subsequence in a bounding box set obtained by detection in the image sequence by combining a forward probability propagation method and a backward backtracking method, so that the moving target can be quickly and accurately extracted. After possible moving targets are rapidly detected through the appearance features and the moving features, sequence features are blended, and the sequence features are gradually refined from coarse to fine, so that the target extraction efficiency is improved.

Additional features and advantages of the invention will be set forth in the detailed description which follows.

Drawings

The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:

fig. 1 is a flowchart illustrating a moving object extraction method according to a first embodiment of the present invention.

Fig. 2 is a schematic structural diagram of a moving object extraction device in a second embodiment of the present invention.

Description of the reference numerals

1 model training module 2 target detection module

3 target extraction module 21 sequence detection submodule

22 selection output submodule 31 forward calculation submodule

32 inverse calculation submodule

Detailed Description

The following detailed description of embodiments of the invention refers to the accompanying drawings. It should be understood that the detailed description and specific examples, while indicating the present invention, are given by way of illustration and explanation only, not limitation.

The invention provides a scheme for extracting a target by using a Hidden Markov Model (HMM) aiming at the problem of low efficiency of a classification extraction scheme of a specific moving target in the existing image sequence. The classical usage of HMMs is for recognition, such as shooting, after modeling a series of movements of the framed object, where modeling requires framing the position of the shooting hand in each frame of training data in advance, and recognition requires knowing the position of the shooting hand in each frame of test data in advance, and then the HMM can give the probability that this is a shot sequence. Compared with the classical usage, the method mainly emphasizes the extraction of the target sequence, disassembles the HMM, and utilizes the probability and backtracking of each frame of the HMM in each frame, and the details are described below.

The moving object extraction to be realized by the invention is to select an image sequence of the moving object from a plurality of sections of specific moving objects and each frame of image by using a rectangular surrounding frame. In view of this, a first embodiment of the present invention provides a moving object extraction method, as shown in fig. 1, including:

step S1, establishing hidden Markov models of various moving targets;

step S2, detecting the image sequence of the moving target to be extracted by adopting a hidden Markov model, and outputting the attribute information set of each bounding box detected in each frame image by the hidden Markov model corresponding to each moving target; and

and step S3, searching a set of the moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm.

The above steps S1-S3 implement model building, target detection and target extraction, respectively, and the detailed implementation processes of these three parts are described in detail below.

First, model establishment

First, the image features in the bounding box in each frame image are extracted, and the embodiment preferably extracts the HoG + HOF combined features commonly used in computer vision applications. HoG + HOF is the direct connection of the HoG characteristic vector and the HOF characteristic vector, and HoG (histograms of gradients) is a gradient histogram and represents the gradient distribution of a local area in an image; HOF (histograms of Optical flow) is an Optical flow histogram, and represents the Optical flow distribution of a local area in an image.

Then, for each moving object, using the image features of the image sequences corresponding to the moving object in the training data (hereinafter also referred to as image sequences simply as sequences), training an n_s(k) HMM (Hidden MarkovModel) of a state obtains a state transition probability and an emission probability. Recording the state transition probability corresponding to the kth moving target as T_kAnd the transmission probability is denoted as P_k. Let the number of kinds of moving objects be n_aThen k is 1, …, n_a. Number of states n_s(k) Each state corresponds to a pose of the moving object, depending on the complexity of the moving object change. Probability of state transition T_k(i, j) represents the probability of the kth moving object jumping from the ith state to the jth state, and the emission probability P_k(q | i) represents the probability that the ith state of the kth moving object exhibits an image feature of q.

Second, target detection

First, a sliding window method is used to generate images in each frameAnd (4) surrounding frames with different positions and sizes, and extracting image features in each surrounding frame. Setting the image characteristic as a for any one of the surrounding frames, and calculating the probability P of the image characteristic relative to each state of each motion through the emission probability in the hidden Markov model training module_k(a|i)，k＝1,…,n_a，i＝1…n_s(k) In that respect If the attribute information set B includes attribute information such as the position, size, and probability of each bounding box detected by the hidden markov model corresponding to each moving target in each frame image, in this embodiment, after the image sequence of the moving target to be extracted is detected by the hidden markov model, it is preferable that the probability P of a certain state i is obtained by taking the image feature a as a certain motion k_k(ai) the attributes of the bounding box that are greater than a first set threshold z are saved in a set B, where the vth of the tth frame_tThe attributes of each bounding box are noted as:

B(k,t,v_t)＝(x(k,t,v_t),y(k,t,v_t),w(k,t,v_t),h(k,t,v_t),P₁(a(t,v_t)|1),…,P_na(a(t,v_t)|n_s(n_a)) (1)

wherein t is 1, …, n_f，v＝1,…,n_b(t)，a(t,v_t) Is the tth frame v_tThe image features in the surrounding frame, x, y, w, h, P and other items on the right side of the formula (1) respectively represent the v < th > frame detected by the hidden Markov model of the k < th > moving target_tThe abscissa, ordinate, width, height of each bounding box, the probability … … that the bounding box is the 1 st state of the 1 st motion, and the bounding box is the nth state_aN th of movement_s(n_a) Probability of individual state, n_b(t) represents the number of bounding boxes in the tth frame in which the probability of a certain state of a certain motion is greater than the first set threshold value z. Wherein v is_tThe order of z can be arbitrarily specified, z is the minimum value of the probability that a single bounding box can be considered as a moving object, and the value of z is an empirical value, and can be preferably the average value of a plurality of collected P values.

Third, target extraction

For the kth moving object, let W (k, t, v)_t) Watch (A)Denoted as the vth of the t frame_tThe sequence at the end of each bounding box is the probability, L (k, t, v), of the kth moving object_t) Indicating a vth in a tth frame_tNumber of sequence frames at the end of the bounding box, Q (k, t, v)_t) V-th frame representing the frame of t_tBounding box number of t-1 frame, S (k, t, v) of a bounding box connection_t) V-th representing a t-th frame_tThe second set threshold value z' represents the minimum value of the average probability of each frame that a bounding box sequence can be regarded as a moving object, preferably 1.5 times of the first set threshold value z, and R represents the finally extracted moving object set.

And step 1, for each moving target, combining the attribute information set, and calculating the optimal probability of taking the image sequence at the end of each enclosing frame of each frame image as the moving target by adopting a forward probability propagation algorithm.

For t 1 … n_f，v_t＝1…n_b(t)，k＝1…n_aPerforming steps 1.1-1.4

Step 1.1, if t is equal to 1, skipping steps 1.1-1.3, and executing step 1.4; otherwise, for v_t-1＝1…n_b(t-1) calculating the bounding box and the tth frame_tThe surrounding frames are the same moving object, and the nth frame of the tth frame is calculated by adopting the following formula_tProbability E (v) of each bounding box being state i_t-1,i)。

E(v_t-1,i)＝P_k(a(t,v_t)|i)+αT(S(k,t,v_t-1),i)+βC(t,v_t,t-1,v_t-1) (2)

Wherein i represents the ith state, and i is 1, …, n_s(k)，n_s(k) The state number corresponding to the kth motion is obtained; c (t, v)_t,t-1,v_t-1) Is the v-th frame of the t-th frame_tA v th frame surrounding the frame and the t-1 th frame_t-1The size of the overlapping area of the bounding box accounts for the vth of the tth frame_tThe ratio of the sizes of the bounding boxes passing through t and v in the attribute information set_tCalculating corresponding attribute information; t (S (k, T, v)_t-1) I) is the S (k, t, v)_t-1) Probability of state transition of individual state to i-th state, S (k, t, v)_t-1) V-th representing a t-th frame_t-1The sequence numbers of the states corresponding to the surrounding frames, α and β are the weight values of the state transition probability and the proportion of the overlapping area respectively.

Step 1.2, assign E ═ max_vt-1,iE(v_t-1I), i.e. from all the probabilities E (v) calculated_t-1I) selecting the maximum E' and the maximum v_t-1And i are each v_t-1'and i'.

Step 1.3, if (W (k, t, v)_t)+E＇)/(L(k,t,v_t)+1)>z', then assigning values according to the following rules:

Q(k,t,v_t)＝v＇_t-1(3)

L(k,t,v_t)＝L(k,t,v_t-1＇)+1 (4)

W(k,t,v_t)＝W(k,t,v_t)+E＇(5)

and after the values are assigned according to the formula (3) to the formula (5), skipping the step 1.4;

if (W (k, t, v)_t)+E＇)/(L(k,t,v_t)+1)>z' does not hold, then step 1.4 is continued.

Step 1.4, carrying out recursive operation according to the following formula:

Q(k,t,v_t)＝0， (6)

L(k,t,v_t)＝1 (7)

E＇＝max_iP_k(a(t,v_t)|i) (8)

W(k,t,v_t)＝W(k,t,v_t)+E＇ (9)

performing an optimal search through the above steps 1.1-1.4, W (k, t, v) will be finally calculated_t) As for the k-th moving object, the v-th frame of the t-th frame_tThe image sequence at the end of each bounding box is the optimal probability for this kth moving object.

And 2, searching a set of moving targets to be extracted from the attribute information set by adopting a reverse backtracking algorithm in combination with the optimal probability.

For t ═ n_f…1，v_t＝n_b(t) … 1, performing steps 2.1-2.2,

step 2.1, find out the v th frame with t th frame_tThe sequence at the end of a bounding box is the maximum probability of a moving object, i.e. of

W＇＝max_kW(k,t,v_t) (10)

Let k be k' which is the maximum value in the formula (10).

Step 2.2, let r be the v th frame to be searched_tThe sequence at the end of each bounding box is initially empty. Collecting attribute information B (k', t, v)_t) { x (k', t, v) } in_t),y(k＇,t,v_t),w(k＇,t,v_t),h(k＇,t,v_t) Add sequence r. With Q (k', t, v)_t) Find the v th frame of the t th frame_tThe bounding box number of the t-1 th frame of the bounding box connection is marked as v^～ _t-1B (k', t-1, v)^～ _t-1) { x (k', t-1, v) } of (C)^～ _t-1),y(k＇,t-1,v^～ _t-1),w(k＇,t-1,v^～ _t-1),h(k＇,t-1,v^～ _t-1) Add sequence r.

Thus cycling L (k', t, v)_t) 1 times, i.e. up to Q (k ', t-L (k', t, v)_t)-1,v_{t-L(k,t,vt)-1}) Until the value of (2) is 0. In this case, the content in r should be

{{x(k＇,t,v_t),y(k＇,t,v_t),w(k＇,t,v_t),h(k＇,t,v_t)},

{x(k＇,t-1,v^～ _t-1),y(k＇,t-1,v^～ _t-1),w(k＇,t-1,v^～ _t-1),h(k＇,t-1,v^～ _t-1),

……}。 (11)

Adding the group of R, k 'into the set R, wherein R represents the position and the size of the moving target, k' represents the type of the moving target,

in order to avoid repetition of bounding boxes among a plurality of acquired sequences, after each bounding box is acquired, whether the overlapping degree of the existing bounding box of the frame and the existing bounding box of the frame is greater than a certain threshold is checked in the found sequence set R, the threshold is preferably 50%, if so, the bounding box before the frame is considered to be other moving targets, the search is not continued, and the frame is used as the starting point of the current sequence.

Based on the same inventive idea as the moving object extracting method in the first embodiment described above, a second embodiment of the present invention proposes a moving object extracting device, as shown in fig. 2, including: the model training module 1 is used for establishing hidden Markov models of various moving targets; the target detection module 2 is configured to detect an image sequence of a moving target to be extracted by using a hidden markov model, and output attribute information sets of each bounding box detected in each frame image by the hidden markov model corresponding to each moving target; and the target extraction module 3 is used for searching a set of the moving targets to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm.

Further, the object detection module 2 includes: the sequence detection submodule 21 is configured to detect an image sequence of a moving target to be extracted by using a hidden markov model; and a selection output sub-module 22, configured to, for any bounding box of each frame of image, output the attribute information set of the bounding box if the probability that the image feature of the bounding box is any state of any type of moving object is greater than a first set threshold.

Further, the target extraction module 3 includes: the forward calculation submodule 31 is configured to calculate, for each moving object, the optimal probability of taking the image sequence at the end of each bounding box of each frame image as the moving object by using a forward probability propagation algorithm in combination with the attribute information set; and a reverse calculation submodule 32, configured to search out a set of moving objects to be extracted from the attribute information set by using a reverse backtracking algorithm in combination with the optimal probability.

It should be noted that each functional module of the moving object extracting apparatus of this embodiment corresponds to the relevant steps of the moving object extracting method of the first embodiment, and therefore the specific implementation process and the working principle of each functional module are the same as or similar to those of the first embodiment, and are not described herein again.

In summary, the moving object extraction method and apparatus in the embodiments of the present invention have the following advantages:

1. the hidden Markov model is adopted to model the moving target, the appearance of the target under different postures is accurately represented, and the target with complex change can be processed.

2. And (3) by combining the methods of a forward probability propagation algorithm and a reverse backtracking algorithm, searching the optimal subsequence in the bounding box set obtained by detection in the image sequence, and realizing the rapid and accurate extraction of the moving target.

3. After possible moving targets are rapidly detected through the appearance features and the moving features, sequence features are blended, and the sequence features are gradually refined from coarse to fine, so that the target extraction efficiency is improved.

It will be understood, however, that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions. These computer-readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer-readable program instructions may also be stored in a computer-readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer-readable medium storing the instructions comprises an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks. The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer, other programmable apparatus or other devices implement the functions/acts specified in the flowchart and/or block diagram block or blocks.

The preferred embodiments of the present invention have been described in detail with reference to the accompanying drawings, however, the present invention is not limited to the specific details of the above embodiments, and various simple modifications can be made to the technical solution of the present invention within the technical idea of the present invention, and these simple modifications are within the protective scope of the present invention.

It should be noted that the various features described in the above embodiments may be combined in any suitable manner without departing from the scope of the invention. The invention is not described in detail in order to avoid unnecessary repetition.

In addition, any combination of the various embodiments of the present invention is also possible, and the same should be considered as the disclosure of the present invention as long as it does not depart from the spirit of the present invention.

Claims

1. A moving object extraction method for an image sequence, the moving object extraction method comprising:

establishing hidden Markov models of various moving targets;

detecting an image sequence of a moving target to be extracted by adopting a hidden Markov model, and outputting attribute information sets of all bounding boxes, which are obtained by detecting the hidden Markov model corresponding to various moving targets in all frame images; and

searching out a set of moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a backward backtracking algorithm,

the outputting of the attribute information sets of the bounding boxes detected by the hidden markov models corresponding to the various moving targets in the images of the frames includes: for any bounding box of each frame of image, if the image feature of the bounding box is that the probability of any state of any kind of moving object is greater than a first set threshold, outputting the attribute information set of the bounding box;

the searching out a set of moving objects to be extracted from the attribute information set by combining a forward probability propagation algorithm and a backward backtracking algorithm comprises the following steps:

for each moving target, combining the attribute information set, and calculating the optimal probability of taking the image sequence at the end of each surrounding frame of each frame of image as the moving target by adopting a forward probability propagation algorithm; and

and searching a set of the moving objects to be extracted from the attribute information set by adopting a reverse backtracking algorithm in combination with the optimal probability.

2. The moving object extraction method according to claim 1, wherein the attribute information includes an abscissa, an ordinate, a width, a height of the bounding box, and a probability that the bounding box is an arbitrary state of an arbitrary kind of moving object.

3. The method for extracting a moving object according to claim 1, wherein the calculating the optimal probability of the moving object by using the forward probability propagation algorithm with the image sequence at the end of each bounding box of each frame image comprises:

v for t frame image_tA bounding box, a previous bounding box v of the bounding box is calculated_t-1And the v th_tThe surrounding frames are the same moving object, and the v-th frame of the image_tProbability E (v) of each bounding box being state i_t-1,i)；

From all the probabilities calculated, E (v)_t-1I) selecting the maximum value E', and corresponding v to E_t-1And i are set to v ″, respectively_t-1And i';

if (W (k, t, v)_t)+E＇)/(L(k,t,v_t)+1)>z', wherein W (k, t, v)_t) Indicating the v-th frame for the k-th moving object_tThe probability that the image sequence at the end of the bounding box is the k-th moving object, L (k, t, v)_t) Indicating a vth in a tth frame_tThe number of frames of the image sequence at the end of each bounding box, z', is a second set threshold, then the recursive operation is performed according to the following formula:

Q(k,t,v_t)＝v＇_t-1，

L(k,t,v_t)＝L(k,t,v＇_t-1)+1，

W(k,t,v_t)＝W(k,t,v_t-1)+E＇；

wherein, Q (k, t, v)_t) V-th frame representing the frame of t_tThe bounding box serial number of the t-1 th frame connected with the bounding box; and

w (k, t, v) to be finally calculated_t) As for the k-th moving object, the v-th frame of the t-th frame_tThe image sequence at the end of each bounding box is the optimal probability for this kth moving object.

4. A moving object extraction method according to claim 3, characterized in that the probability E (v) is calculated using the following formula_t-1,i)：

E(v_t-1,i)＝P_k(a(t,v_t)|i)+αT(S(k,t,v_t-1),i)+βC(t,v_t,t-1,v_t-1)

Wherein i represents the ith state, and i is 1, …, n, n is the total number of state types; c (t, v)_t,t-1,v_t-1) Is the v-th frame of the t-th frame_tA v th frame surrounding the frame and the t-1 th frame_t-1The size of the overlapping area of the bounding box accounts for the vth of the tth frame_tThe ratio of the sizes of the bounding boxes passing through t and v in the attribute information set_tCalculating corresponding attribute information; t (S (k, T, v)_t-1) I) is the S (k, t, v)_t-1) Probability of state transition of individual state to i-th state, S (k, t, v)_t-1) V-th representing a t-th frame_t-1The sequence numbers of the states corresponding to the surrounding frames, α and β are the weight values of the state transition probability and the proportion of the overlapping area respectively.

5. The method for extracting moving objects according to claim 1, wherein the searching out the set of moving objects to be extracted from the attribute information set by using a backward backtracking algorithm comprises:

obtaining the v th frame of the t th frame according to the calculation result of the forward probability propagation algorithm_tThe maximum probability W' that the image sequence at the end of the bounding box is of the kth moving objectAnd setting the k corresponding to W 'as k';

let r be the v th frame to be searched_tAdding the attribute information corresponding to k' in the attribute information set into r for the image sequence at the end of each bounding box;

find the v th frame of the t th frame_tThe bounding box number of the t-1 th frame of the bounding box connection is marked as v^～ _t-1K 'and v' in the attribute information set^～ _t-1Adding corresponding attribute information into r, and repeating the operation until the v th frame of the t th frame_tThe serial number of the bounding box of the t-1 th frame connected with the bounding box is 0; and

and (3) combining the finally obtained r and k ' into a set { r, k ') of the moving target to be extracted, wherein r represents the position and size of the moving target, and k ' represents the type of the moving target.

6. A moving object extraction apparatus for an image sequence, characterized by comprising:

the model training module is used for establishing hidden Markov models of various moving targets;

the target detection module is used for detecting the image sequence of the moving target to be extracted by adopting a hidden Markov model and outputting attribute information sets of all the surrounding frames, which are obtained by detecting the hidden Markov model corresponding to various moving targets in all the frames of images; and

a target extraction module, which is used for searching a set of the moving targets to be extracted from the attribute information set by combining a forward probability propagation algorithm and a reverse backtracking algorithm,

wherein the target detection module comprises:

the sequence detection submodule is used for detecting the image sequence of the moving target to be extracted by adopting a hidden Markov model; and

the selection output sub-module is used for outputting an attribute information set of each bounding box for any bounding box of each frame of image, if the image characteristics of the bounding box are that the probability of any state of any kind of moving target is greater than a first set threshold value;

the target extraction module includes:

the forward calculation submodule is used for calculating the optimal probability of taking the image sequence at the end of each enclosing frame of each frame of image as the moving target by combining the attribute information set for each moving target and adopting a forward probability propagation algorithm; and

and the reverse calculation submodule is used for searching out a set of the moving targets to be extracted from the attribute information set by adopting a reverse backtracking algorithm in combination with the optimal probability.