CN112686107A

CN112686107A - Tunnel invading object detection method and device

Info

Publication number: CN112686107A
Application number: CN202011522501.8A
Authority: CN
Inventors: 杨恩泽; 刘玉鑫; 刘硕研; 方凯; 王明哲; 李超; 王瑞; 徐成伟; 关则彬; 李隆; 胡昊; 李洵; 张煜山; 樊楠; 金久强
Original assignee: China Academy of Railway Sciences Corp Ltd CARS; Institute of Computing Technologies of CARS; Beijing Jingwei Information Technology Co Ltd
Current assignee: China Academy of Railway Sciences Corp Ltd CARS; Institute of Computing Technologies of CARS; Beijing Jingwei Information Technology Co Ltd
Priority date: 2020-12-21
Filing date: 2020-12-21
Publication date: 2021-04-20
Anticipated expiration: 2040-12-21
Also published as: CN112686107B

Abstract

The invention provides a tunnel intrusion object detection method and a tunnel intrusion object detection device. The method comprises the following steps: processing key frames of the tunnel monitoring video based on a lightweight feature extraction network to extract a multi-scale feature map of the tunnel monitoring video; processing the multi-scale feature map based on a multi-scale feature fusion network to obtain information of rail vehicles and pedestrians in the multi-scale feature map; processing the key frame based on a background difference algorithm to acquire position information of a moving object; and determining whether an invading object exists in the tunnel according to the information of the rail vehicles and the pedestrians and the position information of the moving object. The tunnel invading object detection method and the device provided by the invention can accurately detect the tunnel invading object, greatly reduce the misjudgment probability of the invading object in the tunnel, and provide guarantee for rapidly eliminating the potential safety hazard in the tunnel.

Description

Tunnel invading object detection method and device

Technical Field

The invention relates to the technical field of rail transit safety, in particular to a tunnel invading object detection method and a tunnel invading object detection device.

Background

For railway tunnels built between mountain lands with steep terrain, the railway tunnels are easily affected by geological disasters, and once safety accidents occur, the rescue difficulty is extremely high, so that the safety of driving and passengers in the tunnels is seriously threatened.

Due to the outbreak and unpredictability of tunnel disasters, the regular manual overhaul and safety inspection measures cannot meet the current safety requirements. With the rapid development of computer vision and artificial intelligence, researches for detecting and judging invading foreign matters based on an intelligent video analysis technology instead of manpower have been provided.

At present, a background difference mode is commonly used for analyzing a monitoring video of a tunnel. But because track vehicle frequently passes through in the tunnel, the indeterminate construction of maintainer etc. all can cause the change of tunnel environment, consequently a large amount of wrong reports can be aroused to current background difference mode, seriously influences early warning system normal work.

Therefore, how to provide a method can accurately detect the invading object in the tunnel, thereby eliminating the potential safety hazard and having very important significance.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides a tunnel intrusion object detection method and a tunnel intrusion object detection device.

The invention provides a tunnel intrusion object detection method, which comprises the following steps:

processing key frames of the tunnel monitoring video based on a lightweight feature extraction network to extract a multi-scale feature map of the tunnel monitoring video;

processing the multi-scale feature map based on a multi-scale feature fusion network to obtain information of rail vehicles and pedestrians in the multi-scale feature map;

processing the key frame based on a background difference algorithm to acquire position information of a moving object;

and determining whether an invading object exists in the tunnel according to the information of the rail vehicles and the pedestrians and the position information of the moving object.

In one embodiment, the processing, by the lightweight feature extraction-based network, the key frames of the tunnel surveillance video to extract the multi-scale feature map of the tunnel surveillance video includes:

cutting a target area in the key frame based on the lightweight feature extraction network;

performing convolution operation on the cut image to obtain the multi-scale feature map;

the multi-scale feature map comprises multi-layer images with different resolutions, and each layer of image comprises a spatial feature representing the position of an object and/or a semantic feature representing the category of the object.

In one embodiment, the processing the multi-scale feature map based on the multi-scale feature fusion network to obtain the rail vehicle and pedestrian information in the multi-scale feature map comprises:

fusing the spatial features of the images of each layer based on the multi-scale feature fusion network to determine the position information of the object in the target area;

fusing the semantic features of each layer of image to determine object category information corresponding to the position information;

and determining the rail vehicle and pedestrian information in the multi-scale feature map according to the position information and the object category information.

In one embodiment, processing the keyframes based on a background subtraction algorithm to obtain moving object information comprises:

determining a foreground and a background of the target region based on the background difference algorithm;

the foreground is filtered to determine moving object position information indicative of a position of the moving object in the target region.

In one embodiment, the determining whether there is an intruding object in the tunnel according to the rail vehicle and pedestrian information and the moving object position information comprises:

matching the positions corresponding to the rail vehicles and the pedestrians with the positions of the moving objects;

and if the moving object position which is not matched with the positions corresponding to the railway vehicle and the pedestrian exists, determining the moving object corresponding to the non-matched moving object position as the intrusion object.

The present invention also provides a tunnel intrusion object detection device, including:

the extraction module is used for processing key frames of the tunnel monitoring video based on the lightweight feature extraction network so as to extract a multi-scale feature map of the tunnel monitoring video;

the fusion module is used for processing the multi-scale feature map based on a multi-scale feature fusion network so as to obtain the information of the rail vehicles and the pedestrians in the multi-scale feature map;

the difference module is used for processing the key frame based on a background difference algorithm to acquire the position information of the moving object;

and the determining module is used for determining whether an invading object exists in the tunnel according to the information of the rail vehicles and the pedestrians and the position information of the moving object.

In one embodiment, the extraction module is specifically configured to:

In one embodiment, the fusion module is specifically configured to:

determining rail vehicle and pedestrian information in the multi-scale feature map according to the position information and the object category information

The invention further provides an electronic device, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor executes the computer program to realize the steps of any one of the above-mentioned tunnel intrusion object detection methods.

The present invention also provides a non-transitory computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of any of the above-described tunnel intrusion object detection methods.

According to the tunnel invading object detection method and device provided by the invention, the information of the rail vehicles and the pedestrians is determined by using the lightweight characteristic extraction network and the multi-scale characteristic fusion network, and the invading object in the tunnel is determined by combining the moving object position information determined by the differential algorithm, so that the tunnel invading object can be accurately detected, the misjudgment probability of the invading object in the tunnel is greatly reduced, and the guarantee is provided for quickly eliminating the potential safety hazard in the tunnel.

Drawings

In order to more clearly illustrate the technical solutions of the present invention or the prior art, the drawings needed for the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and those skilled in the art can also obtain other drawings according to the drawings without creative efforts.

Fig. 1 is a schematic flow chart of a tunnel intrusion object detection method according to the present invention;

fig. 2 is a schematic structural diagram of a tunnel intrusion object detection device provided by the present invention;

fig. 3 is a schematic structural diagram of an electronic device provided in the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is obvious that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Fig. 1 is a schematic flow chart of a tunnel intrusion object detection method provided by the present invention. Referring to fig. 1, the method for detecting an object invaded in a tunnel according to the present invention may include:

s110, processing key frames of the tunnel monitoring video based on the lightweight feature extraction network to extract a multi-scale feature map of the tunnel monitoring video;

s120, processing the multi-scale feature map based on the multi-scale feature fusion network to obtain information of the rail vehicles and the pedestrians in the multi-scale feature map;

s130, processing the key frame based on a background difference algorithm to acquire position information of the moving object;

and S140, determining whether an invading object exists in the tunnel according to the information of the rail vehicles and the pedestrians and the position information of the moving object.

The execution subject of the tunnel intrusion object detection method provided by the invention can be electronic equipment, a component in the electronic equipment, an integrated circuit or a chip. The electronic device may be a mobile electronic device or a non-mobile electronic device. By way of example, the mobile electronic device may be a mobile phone, a tablet computer, a notebook computer, a palm top computer, a vehicle-mounted electronic device, a wearable device, an ultra-mobile personal computer (UMPC), a netbook or a Personal Digital Assistant (PDA), and the like, and the non-mobile electronic device may be a server, a Network Attached Storage (NAS), a personal computer (personal computer, PC), a Television (TV), a teller machine, a self-service machine, and the like, and the present invention is not limited in particular.

Specifically, all the tunnel surveillance video data of the surveillance area may be acquired first, and the RSTP stream is decoded using FFMPEG, for example, a 25-frame image may be processed in one second.

In order to reduce the redundancy degree of processing and improve the detection efficiency, the video can be subjected to frame extraction processing: extracting key frames at 2 frames/second, wherein the key Frame set is Frame ═ X₁,X₂,X₃,……，X_i,X_n}. Then, the extracted keyframes may be preprocessed, for example, image denoising and image enhancement are performed.

After the key frame is obtained, the key frame can be processed according to the lightweight feature extraction network so as to extract the multi-scale feature map. The neural network parameters of the lightweight feature extraction network are less, so that the light-weight feature extraction network is beneficial to engineering deployment.

Meanwhile, the keyframes may be processed to determine the location information of the moving object based on a background difference algorithm.

The multi-scale feature map may then be processed using a multi-scale feature fusion network to determine rail vehicle and pedestrian information in the multi-scale feature map. The multi-scale feature fusion network has the advantage of efficiently detecting objects in a complex tunnel environment.

And finally, determining whether an invasive object exists in the tunnel by comparing the position information of the moving object determined by the background difference algorithm with the information of the rail vehicles and pedestrians determined by the multi-scale feature fusion network.

According to the tunnel invading object detection method, the information of the rail vehicles and the pedestrians is determined by using the lightweight feature extraction network and the multi-scale feature fusion network, and the moving object position information determined by the differential algorithm is combined to determine the invading object in the tunnel, so that the tunnel invading object can be accurately detected, the misjudgment probability of the invading object in the tunnel is greatly reduced, and the safety hazard in the tunnel is rapidly eliminated.

Further, in one embodiment, step S110 may include:

performing convolution operation on the cut image to obtain a multi-scale characteristic diagram;

The target area may be an area in which the rail is located, for example, an area within about 1m of the rail; but also other areas that have a significant impact on the safe operation of the rail vehicle. The present invention is not particularly limited in this regard.

In this embodiment, the lightweight feature extraction network adopted by the present invention uses separable convolution instead of the conventional convolution kernel in the normal feature extraction network, thereby achieving an accurate feature extraction effect with a small number of parameters.

For example, a separable convolution is used instead of the original conventional convolution kernel, and a similar feature extraction effect is achieved with a smaller parameter number.

For example, replacing the k × k convolution kernel with a 1 × k and k × 1 distributed convolution effectively reduces k²-a parameter quantity of 2 k.

The lightweight feature extraction network adopted by the invention also combines an attention mechanism to learn the weight distribution features of the channel layer and compress the feature channels.

For example, through training, the network learns that the probability of the rail vehicles and pedestrians appearing in a certain scene or a certain position is high, and the weight of the corresponding space coordinate is gained. The probability of the object of one type appearing is extremely low, and simultaneously, the object of another type appears, and the channel represented by the object can be merged with the object of another type.

And aiming at each feature extraction layer, obtaining an optimal unit structure under the above strategy by using neural architecture search.

After the target area is cut, the lightweight feature extraction network can perform convolution operation on the cut image to determine the multi-scale feature map.

For example, after performing convolution operation on the clipped image, a feature map set P with gradually expanding top-down scale can be generated_i(i ═ 1,2,3,4, 5). Wherein P is₁Layer comprisesThe space characteristics are more, so that the space position of a certain object can be calibrated conveniently; and P is₅The layer contains richer semantic features for determining the specific category of objects at a certain spatial location.

According to the tunnel invading object detection method, the multi-scale feature map is obtained through the lightweight feature extraction network, so that the information of rail vehicles and pedestrians can be conveniently determined according to the multi-scale feature fusion network, and the efficiency of finally detecting the tunnel invading object is improved.

Further, in one embodiment, step S120 may include:

fusing the spatial features of each layer of image based on a multi-scale feature fusion network to determine the position information of the object in the target area;

fusing semantic features of each layer of image to determine object category information corresponding to the position information;

and determining the information of the rail vehicles and the pedestrians in the multi-scale feature map according to the position information and the object category information.

It should be noted that, under a fixed camera angle, objects in the tunnel have different dimensions, and an object in a region far from the camera has a small image, so that the detection effect is poor.

Therefore, the invention adopts the multi-scale feature fusion network to effectively fuse the multi-resolution feature map generated by the feature extraction network, and improves the detection performance of small-scale objects, specifically:

after feature graphs under multiple resolutions generated by the lightweight feature extraction network are input into the multi-scale feature fusion network, the multi-scale feature fusion network learns the locally optimal feature fusion unit in a neural architecture search mode, and spatial features of a positioned object and semantic features for judging object attributes are fully expressed.

For example, for a rail vehicle with distinct pixel characteristics, P₃Or P₄The layer is sufficient to give accurate position information, and the semantic features of the rail vehicle are P₂Under the condition that the confidence of the layer is high enough, the relevance among other layers can be abandoned, and the multi-scale features can be effectively reducedAnd fusing the parameters of the network.

And sparsifying the learned fusion unit, pruning the connection with smaller contribution weight, and further reducing the parameter number of the fusion network.

And repeating the iterative fusion unit to improve the multi-scale expression capability of the features. The position and the type of the object are input into one convolution layer respectively, and the detection result of the object is output.

For example, convolution before outputting the position of a pedestrian or a rail vehicle is responsible for learning the expression of spatial position information in the feature map, and convolution before outputting the attribute of the pedestrian or the category is responsible for learning the expression of category attribute information in the feature map.

According to the tunnel invading object detection method, the information of the rail vehicles and the pedestrians in the multi-scale feature map is obtained by adopting the multi-scale feature fusion network, the defect that the object detection effect of the region far away from the camera is poor in the prior art is overcome, and the detection precision of the tunnel invading object is obviously improved.

Further, in one embodiment, step S130 may include:

determining the foreground and the background of the target area based on a background difference algorithm;

The background difference algorithm takes a key frame set as input and outputs the moving foreground regions of the front frame and the rear frame of the image. For example: moving trains, passing pedestrians or animals, and areas in tunnels where foreign objects fall.

Specifically, the input key Frame information set is Frame ═ { X ═ X₁,X₂,X₃,……，X_i,X_nDifferentiating adjacent frames to form a subframe ═ S₁,S₂,S₃,……，S_i-1,S_n-1And (4) interframe information.

Modeling the background using a mixed gaussian model:

wherein, N (x | mu)_k,Σ_k) Represents the mean value of μ_kVariance is Σ_kA gaussian distribution of (a).

Setting a threshold value according to the Gaussian mixture distribution, setting the area with the difference value between frames at the time t larger than the threshold value as a foreground, and updating the parameter pi if the area is not used as the background_j ^(t)，μ_j ^(t)，Σ_j ^(t){j＝1，2，……，K}。

And filtering the foreground area according to the target detection result to obtain the detection result of the object without the attribute. And further carrying out denoising treatment on the filtered foreground region to finally obtain a region of the tunnel invading foreign matters.

According to the tunnel invading object detection method provided by the invention, the background difference algorithm is adopted to extract the motion foreground information among the video key frames, and the motion foreground is combined with the target detection result, so that the invading object in the tunnel environment can be effectively positioned.

In one embodiment, step S140 may include:

and if the position of the moving object which is not matched with the positions corresponding to the railway vehicle and the pedestrian exists, determining the moving object corresponding to the position of the moving object which is not matched as the invading object.

Specifically, the positions of the rail vehicle and the pedestrian in the target area are acquired in step S120, and the position of the moving object in the target area is acquired in step S130.

In the case where there is no tunnel intruding object, the positions of the rail vehicles and pedestrians should be in one-to-one correspondence with the positions of the moving objects.

In the case of an intruding object in a tunnel, the position of the moving object may include the position of the intruding object in addition to the positions of the rail vehicles and pedestrians.

Therefore, by matching the positions corresponding to the rail vehicle and the pedestrian with the position of the moving object, whether the invading object exists or not and the position of the invading object can be determined according to the matching result.

According to the tunnel invading object detection method, the positions corresponding to the rail vehicles and the pedestrians are matched with the position of the moving object, whether the invading object exists or not and the position of the invading object can be rapidly and accurately judged, and the tunnel invading object detection efficiency is improved.

The invention also provides a tunnel invading object detection device, which can be correspondingly referred to the tunnel invading object detection method.

Fig. 2 is a schematic structural diagram of a tunnel intrusion object detection apparatus provided in the present invention, and as shown in fig. 2, the apparatus includes:

the extraction module 210 is configured to process the key frames of the tunnel surveillance video based on the lightweight feature extraction network to extract a multi-scale feature map of the tunnel surveillance video;

the fusion module 220 is configured to process the multi-scale feature map based on a multi-scale feature fusion network to obtain information of rail vehicles and pedestrians in the multi-scale feature map;

a difference module 230, configured to process the key frame based on a background difference algorithm to obtain position information of the moving object;

and the determining module 240 is configured to determine whether an intruding object exists in the tunnel according to the information of the rail vehicles and the pedestrians and the position information of the moving object.

According to the tunnel invading object detection device, the information of the rail vehicles and the pedestrians is determined by using the lightweight feature extraction network and the multi-scale feature fusion network, and the invading object in the tunnel is determined by combining the moving object position information determined by the differential algorithm, so that the tunnel invading object can be accurately detected, the misjudgment probability of the invading object in the tunnel is greatly reduced, and the safety hazard in the tunnel is rapidly eliminated.

In one embodiment, the extraction module 210 is specifically configured to:

According to the tunnel invading object detection device, the multi-scale feature map is obtained through the lightweight feature extraction network, so that the information of rail vehicles and pedestrians can be conveniently determined according to the multi-scale feature fusion network, and the efficiency of finally detecting the tunnel invading object is improved.

In one embodiment, the fusion module 220 is specifically configured to:

According to the tunnel invading object detection device, the information of the rail vehicles and the pedestrians in the multi-scale feature map is obtained by adopting the multi-scale feature fusion network, the defect that the object detection effect of the region far away from the camera is poor in the prior art is overcome, and the detection precision of the tunnel invading object is obviously improved.

In one embodiment, the difference module 230 is specifically configured to:

According to the tunnel invading object detection device provided by the invention, the background difference algorithm is adopted to extract the motion foreground information among the video key frames, and the motion foreground is combined with the target detection result, so that the invading object in the tunnel environment can be effectively positioned.

In one embodiment, the determining module 240 is specifically configured to:

According to the tunnel invading object detection device, the positions corresponding to the rail vehicles and the pedestrians are matched with the position of the moving object, whether the invading object exists or not and the position of the invading object can be rapidly and accurately judged, and the tunnel invading object detection efficiency is improved.

The present invention also provides an electronic device, as shown in fig. 3, the electronic device may include: a processor (processor)310, a communication interface (communication interface)320, a memory (memory)330 and a communication bus (bus)340, wherein the processor 310, the communication interface 320 and the memory 330 complete communication with each other through the communication bus 340. The processor 310 may invoke logic instructions in the memory 330 to perform the steps of the tunnel intrusion object detection method provided by the above-described method embodiments, including, for example:

processing key frames of the tunnel monitoring video based on the lightweight feature extraction network to extract a multi-scale feature map of the tunnel monitoring video;

processing the multi-scale feature map based on the multi-scale feature fusion network to obtain information of rail vehicles and pedestrians in the multi-scale feature map;

processing the key frame based on a background difference algorithm to acquire position information of the moving object;

and determining the tunnel intrusion object according to the information of the rail vehicles and the pedestrians and the position information of the moving object.

In addition, the logic instructions in the memory 330 may be implemented in the form of software functional units and stored in a computer readable storage medium when the software functional units are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

In another aspect, the present invention also provides a computer program product, which includes a computer program stored on a non-transitory computer readable storage medium, the computer program includes program instructions, and when the program instructions are executed by a computer, the computer can execute the steps of the tunnel intrusion object detection method provided by the above method embodiments, for example, the steps include:

In still another aspect, the present invention further provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps of the tunnel intrusion object detection method provided by the above method embodiments, for example, including:

The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.

Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A tunnel intrusion object detection method is characterized by comprising the following steps:

2. The method according to claim 1, wherein the processing key frames of the tunnel surveillance video by the network based on lightweight feature extraction to extract the multi-scale feature map of the tunnel surveillance video comprises:

3. The method for detecting the tunnel intrusion object according to claim 2, wherein the processing the multi-scale feature map based on the multi-scale feature fusion network to obtain the information of the rail vehicles and pedestrians in the multi-scale feature map comprises:

4. The method of claim 3, wherein processing the keyframe based on a background difference algorithm to obtain moving object information comprises:

5. The method according to claim 4, wherein the determining whether there is an intruding object in the tunnel according to the rail vehicle and pedestrian information and the moving object position information comprises:

6. A tunnel intrusion object detection device, comprising:

7. The device according to claim 6, wherein the extraction module is specifically configured to:

8. The device according to claim 7, wherein the fusion module is specifically configured to:

9. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the tunnel intrusion object detection method according to any one of claims 1 to 5 when executing the computer program.

10. A non-transitory computer readable storage medium having stored thereon a computer program, which when executed by a processor implements the steps of the tunnel intrusion object detection method according to any one of claims 1 to 5.