CN115376037A

CN115376037A - Station key area safety state monitoring method based on video

Info

Publication number: CN115376037A
Application number: CN202210815091.9A
Authority: CN
Inventors: 谢征宇; 吴剑凡; 秦勇; 贾利民; 闫香玲
Original assignee: Beijing Jiaotong University
Current assignee: Beijing Jiaotong University
Priority date: 2022-07-12
Filing date: 2022-07-12
Publication date: 2022-11-22

Abstract

The invention provides a station key area safety state monitoring method based on videos. The method comprises the following steps: acquiring a video image in a monitoring area under a section of track traffic station scene, and performing passenger flow target detection on the video image by using a target detection algorithm to acquire a passenger flow target detection frame; carrying out target tracking on the passenger flow target detection frame by using an improved DaSim-RPN algorithm to obtain a passenger flow target tracking frame of the current frame; judging whether the track of the passenger flow target tracking frame of the current frame exceeds the monitoring area, and if so, stopping the target tracking process; otherwise, continuing the target tracking process of the video image. According to the invention, a deep learning method is applied to a passenger flow information detection scene of a rail transit station, and the YOLOv3 and an improved DaSim-RPN algorithm are utilized to improve the passenger flow information monitoring precision, so that the target monitoring under the shielding condition can be effectively improved, the running speed is improved, and the real-time performance of the passenger flow information detection is ensured.

Description

Station key area safety state monitoring method based on video

Technical Field

The invention relates to the technical field of rail transit video monitoring, in particular to a method for monitoring the safety state of a station key area based on videos.

Background

The normal operation of the rail transit station has great significance to the operation safety of the whole urban rail transit network, and the accurate grasp of the passenger flow operation state of the station is the fundamental basis for realizing the scientific management and control of the passenger flow.

At present, a neural network method is mostly adopted in a video monitoring method in the prior art, the method is widely applied to a rail transit station scene, but is still limited by various factors of field engineering, the passenger flow state information monitoring in a specific area in a station cannot achieve a good effect, and the method cannot be known at first time when dangerous states such as large passenger flow congestion, channel retrograde motion and non-working personnel entering an operation area occur.

Therefore, how to develop an effective video-based station key area safety state monitoring method is an urgent problem to be solved.

Disclosure of Invention

The embodiment of the invention provides a method for monitoring the safety state of a station key area based on videos, so as to effectively monitor the safety of the station key area.

In order to achieve the purpose, the invention adopts the following technical scheme.

A safety state monitoring method for a station key area based on videos comprises the following steps:

acquiring a video image in a monitoring area under a section of track traffic station scene, and performing passenger flow target detection on the video image by using a target detection algorithm to acquire a passenger flow target detection frame;

carrying out target tracking on the passenger flow target detection frame by using an improved DaSim-RPN algorithm to obtain a passenger flow target tracking frame of the current frame;

judging whether the track of the passenger flow target tracking frame of the current frame exceeds the monitoring area, and if so, stopping the target tracking process; and if not, continuing the target tracking process on the video image.

Preferably, the method comprises the steps of obtaining a video image in a monitoring area under a section of track traffic station scene, carrying out passenger flow target detection on the video image by using a target detection algorithm, and obtaining a passenger flow target detection frame;

acquiring video images of passage passenger flows in a monitoring area under a section of rail transit station scene, and extracting and storing continuous frames of the video images;

and carrying out passenger flow target detection on the continuous frames of the video image by using a target detection algorithm to obtain a passenger flow target detection frame and coordinates and confidence coefficients of four vertexes of the detection frame.

Preferably, the target detection algorithm comprises a deep learning-based YOLOv3 algorithm.

Preferably, the improved DaSim-RPN algorithm is used for carrying out target tracking on the passenger flow target detection frame to obtain a passenger flow target tracking frame of the current frame, and the method comprises the following steps of;

screening the detected passenger flow target detection frames, and establishing initial tracking positions corresponding to the targets by using the screened detection frames;

in the first frame picture, a region of 127 × 127 is intercepted from the central point of the initial tracking target and is used as a template frame region; in the subsequent pictures, a 255-255 region is intercepted from the central point of the tracking target in the previous frame and is used as a search frame region;

finding k candidate region frames closest to the tracking target central point O, calculating the cross-over ratio IoU of each candidate region frame, calculating the mean value and the variance of the k IoUs, setting the mean value plus the variance as an adaptive threshold, defining the candidate region frames larger than the adaptive threshold as interferents, removing the interferents, and forming all the candidate region frames with the interferents removed into a search frame region;

template frame area andthe search frame area is sent into an improved DaSim-RPN network, the improved DaSim-RPN network outputs a tracking frame and a score of a target, the score is subjected to cosine window suppression, the score of an edge area far away from a central point is punished, the central point of the tracking frame with the highest score is taken as a new central point (A'), the width W height H of the target of the previous frame and the width W of the tracking frame are taken as _g High H _g And performing smooth weighting to obtain a new width W ' height H ', adopting a new central point A ' and the new width W ' height H ' as passenger flow target tracking frames of the current frame, and performing subsequent tracking and loop iteration in sequence.

Preferably, the method comprises the steps of judging whether the track of the passenger flow target tracking frame of the current frame exceeds the monitoring area, and if so, stopping the target tracking process; if not, continuing to perform a target tracking process on the video image, including;

judging whether the track of the passenger flow target tracking frame of the previous frame exceeds a defined monitoring area or not, and stopping the target tracking process if the track of the passenger flow target tracking frame of the previous frame exceeds the defined monitoring area; and if not, continuing the target tracking process to monitor the passenger flow safety.

According to the technical scheme provided by the embodiment of the invention, the deep learning method is applied to the passenger flow information detection scene of the rail transit station, the YOLOv3 and the improved DaSim-RPN algorithm are utilized to improve the passenger flow information monitoring precision, the target monitoring under the shielding condition can be effectively improved, the running speed is increased, and the real-time performance of the passenger flow information detection is ensured. The invention can well monitor the designated area in the station and provides effective guarantee for the passenger flow and the environmental safety of the target area.

Additional aspects and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the description below are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.

Fig. 1 is a schematic diagram illustrating an implementation of a method for monitoring a safety state of a station key area based on video according to an embodiment of the present invention;

fig. 2 is a flowchart of a method for monitoring safety status of a station key area based on video according to an embodiment of the present invention;

fig. 3 is a schematic view of an aisle experiment scene of a rail transit station according to an embodiment of the present invention.

Detailed Description

Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the accompanying drawings are illustrative only for the purpose of explaining the present invention, and are not to be construed as limiting the present invention.

As used herein, the singular forms "a", "an", "the" and "the" are intended to include the plural forms as well, unless the context clearly indicates otherwise. It will be further understood that the terms "comprises" and/or "comprising," when used in this specification, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof. It will be understood that when an element is referred to as being "connected" or "coupled" to another element, it can be directly connected or coupled to the other element or intervening elements may also be present. Further, "connected" or "coupled" as used herein may include wirelessly connected or coupled. As used herein, the term "and/or" includes any and all combinations of one or more of the associated listed items.

It will be understood by those skilled in the art that, unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the prior art and will not be interpreted in an idealized or overly formal sense unless expressly so defined herein.

For the convenience of understanding the embodiments of the present invention, the following description will be further explained by taking several specific embodiments as examples in conjunction with the drawings, and the embodiments are not to be construed as limiting the embodiments of the present invention.

The embodiment of the invention provides a method for monitoring the safety state of a station key area based on a video, which solves the problem of monitoring a user-defined area in a rail transit station scene by combining YOLOv3 and an improved DaSim-RPN algorithm.

An implementation schematic diagram of the station key area safety state monitoring method based on the video is shown in fig. 1, and a specific processing flow is shown in fig. 2, and the method includes the following processing steps:

and S10, acquiring a video image in a monitoring area in a section of track traffic station scene, and performing passenger flow target detection on the video image by using a target detection algorithm to acquire a passenger flow target detection frame.

The method comprises the steps of obtaining video images of passage passenger flow in a monitoring area under a section of rail transit station scene, extracting and storing continuous frames of the video images, and extracting 24 frames every 1 s.

And carrying out passenger flow target detection on the continuous frames of the video image by using a target detection algorithm to obtain a passenger flow target detection frame and coordinates and confidence coefficients of four vertexes of the detection frame. The target detection algorithm may be a deep learning-based YOLOv3 algorithm.

And step S20, carrying out target tracking on the passenger flow target detection frame by using an improved DaSim-RPN algorithm, and acquiring a passenger flow target tracking frame of the current frame.

And screening the detected passenger flow target detection frames, and establishing an initial tracking position corresponding to the target by using the screened detection frames, namely an initial tracking target.

In the first frame picture, a region of 127 × 127 is cut out from a central point (A) of an initial tracking target and is used as a template frame region; in the subsequent pictures, a 255 × 255 region is cut from the central point (B) of the tracking target of the previous frame, and the region is used as a search frame region.

K candidate region frames closest to the tracking target central point O, and calculating IoU of each candidate region frame

(Intersection overlapping unit), then, calculating the mean and variance of k IoUs, setting the mean plus variance as an Adaptive threshold by an ATSS (Adaptive Training Sample Selection) method, finally defining some candidate region frames larger than the Adaptive threshold as interferents, and removing the interferents. And forming a search frame region by all the candidate region frames from which the interferents are removed.

And sending a template frame region (namely a real image of the first frame) and the search frame region into a modified DaSiam-RPN network, wherein the modified DaSiam-RPN network predicts a tracking frame and a score of a target, and the template frame region is the real image of the first frame.

The improved DaSim-RPN network carries out similarity calculation on the template frame region target and the search frame region target, so that a tracking target which is most similar to a tracking target in a template frame is searched in the search frame region, and a new incremental learning method is introduced, so that the tracking frame prediction is more stable, the tracking ID jump is reduced, and a corresponding tracking frame and a corresponding score are predicted.

Performing cosine window suppression on the score, namely punishing the score of an edge area far away from a central point (A), taking the central point of a tracking frame with the highest score as a new central point (A'), and taking the width (W) and the height (H) of a target in the previous frame and the width (W) of the tracking frame _g ) High (H) _g ) The smooth weighting is performed as the new width (W ') height (H').

Adopting the new central point (A ') and the width (W ') and the height (H ') as passenger flow target tracking frames of the current frame, and sequentially and circularly iterating the subsequent tracking; and setting the weight factor of the passenger flow target tracking frame as 1/2 of the weight of the interference object. And in the proportion setting in the new increment learning, the weight factor of the tracking frame is set to be 1/2 of the weight of the interference object, so that the tracking target is more stable and the ID jump does not occur.

Step S30, judging whether the track of the latest passenger flow target tracking frame exceeds a defined monitoring area, and if so, stopping the target tracking process; and if not, continuing the target tracking process to monitor the passenger flow safety.

In order to verify the effectiveness and the accuracy of the invention, a certain passage scene of the rail transit station is selected for testing, as shown in fig. 3, a solid line in fig. 3 is a limit solid line which is manually calibrated, when a tracked target crosses the limit solid line, the tracking ID of the tracked target is recorded, and therefore, the cross-border people counting and the like can be carried out.

In summary, the embodiment of the invention provides a detection model based on deep learning, aiming at the problem of monitoring the safety state of the rail transit station in the key area. The model detects the passenger flow target through a YOLOv3 algorithm, and transmits the detection result to an improved DaSim-RPN algorithm in real time for target tracking. The target tracking missing report rate can be reduced, the tracking accuracy is improved, and the aim of monitoring the passenger flow in the designated area monitored by the video is well fulfilled.

The improved DaSim-RPN algorithm of the embodiment of the invention leads the threshold value to be more reasonably set by introducing a self-adaptive threshold value method, can further screen out interferents and inhibit the influence of the interferents, thereby generating a stable tracking frame; meanwhile, the corresponding relation between the weight factor and the weight of the interferent is changed, so that the weight factor is changed along with the change of the weight of the interferent, and the tracking precision of the target is improved. Compared with other deep learning methods, the method has the advantages of lower missing report rate and higher accuracy rate of monitoring the key area of the rail transit station, simultaneously improves the tracking efficiency, consumes less computer resources and is suitable for the field environment.

Those of ordinary skill in the art will understand that: the figures are merely schematic representations of one embodiment, and the blocks or flow diagrams in the figures are not necessarily required to practice the present invention.

From the above description of the embodiments, it is clear to those skilled in the art that the present invention can be implemented by software plus necessary general hardware platform. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which may be stored in a storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the method according to the embodiments or some parts of the embodiments.

The embodiments in the present specification are described in a progressive manner, and the same and similar parts among the embodiments are referred to each other, and each embodiment focuses on the differences from the other embodiments. In particular, for apparatus or system embodiments, since they are substantially similar to method embodiments, they are described in relative terms, as long as they are described in partial descriptions of method embodiments. The above-described embodiments of the apparatus and system are merely illustrative, and the units described as separate parts may or may not be physically separate, and the parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

The above description is only for the preferred embodiment of the present invention, but the scope of the present invention is not limited thereto, and any changes or substitutions that can be easily conceived by those skilled in the art within the technical scope of the present invention are included in the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A safety state monitoring method for a station key area based on videos is characterized by comprising the following steps:

2. The method according to claim 1, wherein the step of obtaining the video image in the monitoring area under the scene of a section of track traffic station, using a target detection algorithm to perform passenger flow target detection on the video image, and obtaining a passenger flow target detection frame comprises;

and carrying out passenger flow target detection on the continuous frames of the video image by using a target detection algorithm, and acquiring a passenger flow target detection frame and coordinates and confidence coefficients of four vertexes of the detection frame.

3. The method of claim 2, wherein the target detection algorithm comprises a deep learning based YOLOv3 algorithm.

4. The method according to claim 1, wherein the improved DaSiam-RPN algorithm is used for carrying out target tracking on the passenger flow target detection frame to obtain a passenger flow target tracking frame of the current frame, including;

in the first frame picture, a region of 127 × 127 is intercepted from the central point of the initial tracking target and is used as a template frame region; in the subsequent pictures, a 255 x 255 region is intercepted from the central point of the tracking target of the previous frame and is used as a search frame region;

sending the template frame region and the search frame region into an improved DaSim-RPN network, outputting a tracking frame and a score of a target by the improved DaSim-RPN network, performing cosine window suppression on the score, punishing the score of an edge region far away from a central point, and taking the central point of the tracking frame with the highest score as a new central point (A'), wherein the width W and the height H of the target of the previous frame and the width W and the height H of the tracking frame _g High H _g And performing smooth weighting to obtain a new width W ' height H ', adopting a new central point A ' and the new width W ' height H ' as passenger flow target tracking frames of the current frame, and performing subsequent tracking and loop iteration in sequence.

5. The method according to claim 1, wherein the method comprises the steps of judging whether the track of the passenger flow target tracking frame of the current frame exceeds the monitoring area, and if so, stopping the target tracking process; if not, continuing to perform a target tracking process on the video image, including;

judging whether the track of the passenger flow target tracking frame of the previous frame exceeds a defined monitoring area or not, and if so, stopping the target tracking process; and if not, continuing the target tracking process to monitor the passenger flow safety.