CN111582017B - Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application - Google Patents
Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application Download PDFInfo
- Publication number
- CN111582017B CN111582017B CN202010211669.0A CN202010211669A CN111582017B CN 111582017 B CN111582017 B CN 111582017B CN 202010211669 A CN202010211669 A CN 202010211669A CN 111582017 B CN111582017 B CN 111582017B
- Authority
- CN
- China
- Prior art keywords
- geological disaster
- video
- network
- features
- geological
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 51
- 230000002159 abnormal effect Effects 0.000 claims abstract description 32
- 238000012549 training Methods 0.000 claims abstract description 25
- 230000002265 prevention Effects 0.000 claims abstract description 5
- 230000009467 reduction Effects 0.000 claims abstract description 5
- 238000012544 monitoring process Methods 0.000 claims description 25
- 230000006870 function Effects 0.000 claims description 19
- 238000011176 pooling Methods 0.000 claims description 12
- 239000012634 fragment Substances 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 4
- 239000000284 extract Substances 0.000 claims description 3
- 238000000605 extraction Methods 0.000 claims description 3
- 238000013526 transfer learning Methods 0.000 claims description 3
- 230000001131 transforming effect Effects 0.000 claims description 3
- 238000002372 labelling Methods 0.000 claims description 2
- 241000282414 Homo sapiens Species 0.000 description 10
- 230000000694 effects Effects 0.000 description 8
- 238000010586 diagram Methods 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 239000002689 soil Substances 0.000 description 4
- 230000009471 action Effects 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000003491 array Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000006073 displacement reaction Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 239000011435 rock Substances 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 206010000117 Abnormal behaviour Diseases 0.000 description 1
- 230000002411 adverse Effects 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 239000003245 coal Substances 0.000 description 1
- 238000002485 combustion reaction Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 230000003993 interaction Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000008092 positive effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000004162 soil erosion Methods 0.000 description 1
- 230000002269 spontaneous effect Effects 0.000 description 1
- CCEKAJIANROZEO-UHFFFAOYSA-N sulfluramid Chemical group CCNS(=O)(=O)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)C(F)(F)F CCEKAJIANROZEO-UHFFFAOYSA-N 0.000 description 1
- 230000002123 temporal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/10—Terrestrial scenes
- G06V20/13—Satellite images
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N20/00—Machine learning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
- G06V20/41—Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Computing Systems (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Biophysics (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Astronomy & Astrophysics (AREA)
- Remote Sensing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Medical Informatics (AREA)
- Image Analysis (AREA)
- Alarm Systems (AREA)
Abstract
The invention belongs to the technical field of geological disaster prevention and disaster reduction, and discloses a video monitoring-oriented end-to-end geological disaster automatic identification method, a system and application, wherein a video is input into an H-C3D network to extract features, a 4096-dimensional 3D feature is extracted from each 16 frames, and the features are combined into 32 x 4096-dimensional features according to a time sequence; classifying videos in a weak supervision mode, wherein geological disaster videos are classified into one type, and non-geological disaster videos are classified into one type; building and training a geological disaster classification recognition network R-Net to obtain a training model; judging the early warning level of the geological disaster by converting the characteristics into abnormal scores; extracting 3D features from the video through an H-C3D network, combining the features into 32 x 4096 dimensions according to a time sequence, sending the combined features into a trained R-Net network, and acquiring abnormal scores in real time along with video frames; and dividing the abnormal score into five intervals to obtain the early warning level of the geological disaster. The invention realizes the automatic judgment of the early warning level of the geological disaster.
Description
Technical Field
The invention belongs to the technical field of geological disaster prevention and reduction, and particularly relates to an end-to-end geological disaster automatic identification method and system for video monitoring and application.
Background
Geological disasters refer to disastrous geological events caused by various geological actions during the development and evolution of the earth. The distribution change rule of geological disasters in time and space is not only limited by natural environment, but also related to human activities, and is often the result of interaction between human and the natural world. Natural disasters mainly caused by geological dynamic activities or abnormal geological environment. Under the action of the internal power, the external power or the artificial geological power, the earth generates abnormal energy release, material movement, deformation and displacement of rock and soil bodies, abnormal change of the environment and the like, and the phenomena or processes of harming human lives and properties, living and economic activities or destroying resources and environments on which human beings live and develop are generated. Adverse geological phenomena are commonly called geological disasters, and refer to geological events that deteriorate geological environment, reduce environmental quality, directly or indirectly harm human safety, and cause losses for social and economic construction, caused by natural geological effects and human activities. Geological disasters are geological effects (phenomena) which are formed under the action of natural or human factors and damage and lose human lives, properties and environments. Such as collapse, landslide, debris flow, ground fissure, ground subsidence, rock burst, water burst in underground tunnel, mud burst, gas burst, spontaneous combustion of coal bed, loess collapse, rock-soil expansion, sandy soil liquefaction, land freeze-thaw, water loss and soil erosion, land desertification and swampiness, soil salinization, earthquake, volcano, geothermal damage, etc. Currently, the closest prior art: the monitoring and early warning of geological disasters such as landslide and debris flow are realized by technical means such as rainfall, micro-earthquake, earth sound, infrasonic wave, earth surface displacement, mud level and mechanical monitoring. The existing geological disaster video monitoring technical means only record the occurrence process of the geological disaster, and can not automatically analyze and realize the on-site real-time monitoring and early warning of the geological disaster.
In summary, the problems of the prior art are as follows:
(1) at present, the geological disaster video monitoring technology only collects video data of a field scene, and related personnel can check the video data in a playback mode when necessary, so that the development situation of the field situation before and after the geological disaster is visually known.
(2) The C3D network can be used to extract video 3D features, but is mainly a video for human behavior and is not suitable for the field of geological disasters.
(3) If the C3D network is applied to the field of geological disasters to extract the 3D characteristics of geological disaster videos, the existing C3D network needs to be modified and optimized, a proper amount of geological disaster videos are added, and retraining is carried out to obtain the network suitable for extracting the 3D characteristics of geological disaster scenes.
The difficulty of solving the technical problems is as follows:
(1) the existing C3D network is reformed and optimized into a network suitable for extracting the 3D characteristics of the geological disaster scene.
(2) Aiming at the actual conditions of the multiple segmented feature prediction values, the loss function is redesigned by combining the thought of multi-example learning, so that the converged geological disaster classification recognition network R-Net can be optimized through traversing all the feature prediction values in one turn during training.
(3) Aiming at the problem that the loss function possibly causes the over-fitting of the parameters of the geological disaster classification recognition network R-Net and thus poor test effect along with the superposition of training times, the method adds a regular term to the loss function to limit the network parameters and prevent the over-fitting, and specifically adds a time smooth constraint and a space smooth constraint to the loss function respectively.
The significance of solving the technical problems is as follows: the invention can fully utilize the video monitoring data in the field of geological disasters, plays a role in real-time automatic monitoring and early warning, and can well serve the geological disaster prevention and reduction. Rather than the traditional mode of today: after the geological disaster occurs, calling a video to see the current situation development condition.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides a video monitoring-oriented end-to-end geological disaster automatic identification method, a system and application.
The invention is realized in such a way that the method for automatically identifying the video monitoring end-to-end geological disaster comprises the following steps:
firstly, transforming and optimizing a C3D model to obtain a network (H-C3D) suitable for extracting 3D features of geological disaster scenes, inputting videos into the H-C3D network to extract the features, extracting one 4096-dimensional 3D feature every 16 frames, and combining all the features into 32 x 4096-dimensional features according to a time sequence;
secondly, classifying the videos in a weak supervision mode, wherein the videos of the geological disasters are classified into one type, and the videos of the non-geological disasters are classified into one type; defining a multi-example learning method and a loss function thereof, building a geological disaster classification and identification network R-Net, obtaining a training model by adopting an AdaGrad gradient descent algorithm R-Net, and determining the early warning grade of the geological disaster by converting 3D characteristics into abnormal values by the model;
thirdly, when the method is applied, 3D features are extracted from the video through an H-C3D network, the features are combined into 32 x 4096 dimensions according to time series, the combined features are sent to a trained R-Net, and abnormal scores can be obtained in real time along with video frames;
and fourthly, dividing the abnormal score into five intervals according to observation statistics of a large amount of video data, respectively corresponding to five early warning grades (table 1) of the geological disaster, and obtaining the early warning grade of the geological disaster according to the interval where the abnormal score is located.
Further, the video monitoring-oriented end-to-end geological disaster automatic identification method extracts the 3D characteristics of geological disaster/non-geological disaster videos: based on a C3D network trained by a large-scale data set, a proper amount of geological disaster videos are added by using the idea of transfer learning, and the C3D network is modified and optimized to obtain a network (H-C3D) suitable for extracting the 3D characteristics of geological disaster scenes.
Further comprising: the image is cut into 128 × 117 size by taking a 16-frame sequence image as a unit, the length and width are internally adjusted to 112 × 112 size, and 50% of the total data volume (namely half of the number of images) is horizontally inverted to adapt to the jitter situation on time and space; the H-C3D network consists of 5 convolutional layers, 5 pooling layers and 1 fully-connected layer, the number of filters from 1 to 5 convolutional layers is 64, 128, 256, 512, 512, respectively, each convolutional layer is followed by a pooling layer, the size of each subsequent pooling layer is 2 x 2 by 1 step through 3D convolution kernels of 3 x 3 and the first pooling layer of 1 x 2; the output dimension of the fully connected layer is 4096 dimensions.
Further, the video monitoring-oriented end-to-end geological disaster automatic identification method utilizes 3D features extracted through an H-C3D network to train a geological disaster classification identification network R-Net: after video features are extracted by taking 16 frames as a unit, 32 features for training an R-Net network are set, and if the number of the video frames is less than 16 × 32, the previous features are repeated to the 32 th feature; if the video frame number is larger than 16 × 32, after the features are arranged in the time sequence, 32 features are obtained at equal intervals. These 32 x 4096 dimensional 3D features are fed into the R-Net for training.
Further, the multi-example learning loss function for training the geological disaster classification recognition network R-Net is as follows:
wherein k is the total number of samples, w is the network parameter of R-Net,and (3) taking the fractional value of R-Net, wherein A is a geological disaster feature fragment set, N is a non-geological disaster video feature fragment set, and then adding time smoothness constraint and space smoothness constraint:
wherein λ1,λ2N is the total number of the characteristic segments in one video, and is a hyper-parameter; training R-Net by using the loss function in combination with an AdaGrad gradient descent algorithm to obtain a trained model and parameters thereof;
another object of the present invention is to provide a video monitoring end-to-end geological disaster automatic identification method system for implementing the video monitoring end-to-end geological disaster automatic identification method, wherein the video monitoring end-to-end geological disaster automatic identification method system comprises:
the characteristic extraction module is used for obtaining a network (H-C3D) suitable for extracting the 3D characteristics of the geological disaster scene by modifying and optimizing the C3D network, extracting one 4096-dimensional 3D characteristic every 16 frames, and combining all the characteristics into 32 x 4096-dimensional characteristics according to a time sequence;
the video frame labeling module is used for classifying videos under weak supervision, wherein geological disaster videos are classified into one type, and non-geological disaster videos are classified into one type;
the video frame abnormal score acquisition module is used for constructing a geological disaster classification and identification network R-Net by defining a multi-example learning method and a loss function thereof, training the R-Net by adopting an AdaGrad gradient descent algorithm to obtain a training model, and judging the possibility of geological disaster occurrence by converting 3D characteristics into abnormal scores;
when the method is used for application, 3D features are extracted from the video through an H-C3D network, the features are combined into 32 x 4096 dimensions according to time series, and the combined features are sent to a trained R-Net, so that abnormal scores can be obtained in real time along with video frames.
According to observation statistics of a large amount of video data, dividing the abnormal score into five intervals, respectively corresponding to five early warning grades (table 1), and obtaining the early warning grade of the geological disaster according to the interval where the abnormal score is located.
The invention also aims to provide an information data processing terminal for realizing the video monitoring-oriented end-to-end geological disaster automatic identification method.
Another object of the present invention is to provide a computer-readable storage medium, which includes instructions that, when executed on a computer, cause the computer to execute the video-monitoring-oriented end-to-end geological disaster automatic identification method.
The invention also aims to provide application of the video monitoring-oriented end-to-end geological disaster automatic identification method in geological disaster prevention and reduction.
The invention also aims to provide a geological disaster information data processing platform applying the video monitoring-oriented end-to-end geological disaster automatic identification method.
In summary, the advantages and positive effects of the invention are: the multi-example learning loss function formula trains the geological disaster classification and identification network, so that the network can be converged, and the network output is closer to the expected value. Unlike the existing loss function: the invention aims at the specific problem to be faced, and redesigns the loss function by combining the idea of multi-example learning so as to optimize the geological disaster classification and identification network. The existing geological disaster video monitoring only collects video information, and can play back to see a specific scene at a glance when necessary. The video monitoring data is not truly and fully utilized. The invention aims to fully utilize video monitoring data and play a role in real-time monitoring and early warning.
At present, no technology for automatically judging whether a geological disaster occurs in real time by utilizing video monitoring exists. The invention relates to a method for automatically identifying geological disasters in real time by using camera monitoring for the first time in the field of geological disasters. The invention is based on the idea of human abnormal behavior detection, improves and optimizes the prior art (C3D network), and then is applied to the field of geological disasters to realize real-time automatic judgment of the early warning level of geological disasters.
Drawings
Fig. 1 is a flow chart of an end-to-end geological disaster automatic identification method for video monitoring according to an embodiment of the present invention.
Fig. 2 is a diagram of an H-C3D model network architecture according to an embodiment of the present invention.
FIG. 3 is a block diagram of a geological disaster classification and identification network R-Net according to an embodiment of the present invention.
Fig. 4 is a schematic diagram illustrating automatic discrimination of landslide geological disasters according to an embodiment of the present invention.
Fig. 5 is a schematic diagram illustrating automatic discrimination of a debris flow geological disaster according to an embodiment of the present invention.
Fig. 6 is a schematic diagram illustrating automatic discrimination of a non-geological disaster according to an embodiment of the present invention.
FIG. 7 is a graph of ROC provided by an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
Aiming at the problems in the prior art, the invention provides a video monitoring-oriented end-to-end geological disaster automatic identification method, a system and application thereof, and the invention is described in detail below by combining with the attached drawings.
As shown in fig. 1, the method for automatically identifying an end-to-end geological disaster facing video monitoring provided by the embodiment of the invention comprises the following steps:
s101: transforming and optimizing a C3D model to obtain a network (H-C3D) suitable for extracting 3D features of geological disaster scenes, inputting a video into the H-C3D network to extract the features, extracting one 4096-dimensional 3D feature every 16 frames, and combining all the features into 32 x 4096-dimensional features according to a time sequence;
s102: classifying videos in a weak supervision mode, wherein geological disaster videos are classified into one type, and non-geological disaster videos are classified into one type; defining a multi-example learning method and a loss function thereof, training a geological disaster classification recognition network R-Net by adopting an AdaGrad gradient descent algorithm to obtain a training model, and judging the early warning level of the geological disaster by converting 3D characteristics into abnormal values;
s103: when the method is applied, 3D features are extracted from the video through an H-C3D network, the features are combined into 32 x 4096 dimensions according to a time sequence, and the combined features are sent to a trained R-Net network, so that abnormal scores can be obtained in real time along with video frames.
S104: according to observation statistics of a large amount of video data, dividing the abnormal score into five intervals, respectively corresponding to five early warning grades (table 1), and obtaining the early warning grade of the geological disaster according to the interval where the abnormal score is located.
TABLE 1 early warning level corresponding to abnormal score interval
Anomaly score f | f<0.2 | 0.2≤f<0.4 | 0.4≤f<0.6 | 0.6≤f<0.8 | f≥0.8 |
|
1 | 2 | 3 | 4 | 5 |
Color warning | Green colour | Blue color | Yellow colour | Orange colour | Red colour |
The technical solution of the present invention is further described below with reference to the accompanying drawings.
The video monitoring-oriented end-to-end automatic geological disaster identification method provided by the embodiment of the invention specifically comprises the following steps:
(1) extracting 3D characteristics of geological disaster/non-geological disaster videos: based on a C3D network trained by a large-scale data set, a proper amount of geological disaster videos are added by using the idea of transfer learning, the C3D network is modified and optimized, a network H-C3D which is suitable for geological disaster scenes and has good 3D feature extraction capability is obtained, and the H-C3D network structure for extracting 3D features is shown in FIG. 2:
the method specifically comprises the following steps: first, the image is cut into 128 × 117 size by taking a 16-frame sequence image as a unit, the length and width of the image is adjusted to 112 × 112 size inside the image, and half of the image is horizontally inverted to adapt to the temporal and spatial jitter. The H-C3D network consists of 5 convolutional layers, 5 pooling layers and 1 fully-connected layer, the number of filters from 1 to 5 convolutional layers is 64, 128, 256, 512, 512, respectively, each convolutional layer is followed by a pooling layer, the size of each subsequent pooling layer is 2 x 2 by 1 step through 3D convolution kernels of 3 x 3 and the first pooling layer of 1 x 2; the output dimension of the fully connected layer is 4096 dimensions.
(2) Training a geological disaster classification recognition network R-Net by using the 3D features extracted by the H-C3D network: first, after extracting video features in units of 16 frames, the invention is used for R-Net training network with 32 such features, and if the number of video frames is less than 16 × 32, the previous features are repeated to the 32 th. If the video frame number is more than 16 × 32, after the features are arranged in the time sequence, 32 features are obtained at equal intervals. These 32 x 4096 dimensional 3D features are fed into a classification recognition network R-Net for training, and the block diagram of the classification recognition network R-Net is shown in fig. 3:
because the geological disaster video and the non-geological disaster video are divided into 32 segment features, it is assumed that maximum score values classified by an R-Net network are respectively taken from the 32 segment features, so that the maximum abnormal score of the geological disaster video segment is always larger than the maximum abnormal score of the non-geological disaster video segment, namely, a loss function is expressed as the following form:
wherein k is the total number of samples, w is the network parameter of R-Net,and (3) taking the fractional value of R-Net, wherein A is a geological disaster feature fragment set, N is a non-geological disaster video feature fragment set, and then adding time smoothness constraint and space smoothness constraint:
wherein λ1,λ2For hyper-parameters, n is the total number of feature segments in a video. (2) + (3) constitutes the loss function for multi-instance learning of the present invention. And training a geological disaster classification recognition network R-Net by using the function and combining an AdaGrad gradient descent algorithm to obtain a trained model and parameters thereof.
The technical effects of the present invention will be described in detail with reference to the tests below.
When testing a new video, the invention firstly extracts 3D features from the video through an H-C3D network, and then combines the features into 32 x 4096 dimensions according to a time sequence. And inputting the combined 3D features into a trained geological disaster classification and identification network R-Net for classification, outputting abnormal value scores in real time, drawing a dynamic abnormal value curve, and realizing automatic judgment of the early warning level of the geological disaster. As shown in fig. 4 (landslide) and 5 (debris flow):
the invention uses the ROC curve graph to evaluate the effectiveness of the algorithm. At present, a computer vision technology is not used for a case of automatically judging geological disasters, so the method is compared with a random guessing method, wherein the probability of judging geological disasters and non-geological disasters by the random guessing method is 50 percent respectively. The test data are divided into 200 videos including 100 geological disaster videos and 100 non-geological disaster videos, and the videos do not participate in the training. Positions of disaster occurrence frames are marked in a geological disaster type video in time sequence, and an ROC curve graph is drawn as shown in FIG. 6. The application test is carried out by utilizing the video data, and the test result shows that the method has higher accuracy rate which can reach 94%.
It should be noted that the embodiments of the present invention can be realized by hardware, software, or a combination of software and hardware. The hardware portion may be implemented using dedicated logic; the software portions may be stored in a memory for execution by a suitable instruction execution system, such as a microprocessor or specially designed hardware. Those skilled in the art will appreciate that the apparatus and methods described above may be implemented using computer executable instructions and/or embodied in processor control code, such code being provided on a carrier medium such as a disk, CD-or DVD-ROM, programmable memory such as read only memory (firmware), or a data carrier such as an optical or electronic signal carrier, for example. The apparatus and its modules of the present invention may be implemented by hardware circuits such as very large scale integrated circuits or gate arrays, semiconductors such as logic chips, transistors, or programmable hardware devices such as field programmable gate arrays, programmable logic devices, etc., or by software executed by various types of processors, or by a combination of hardware circuits and software, e.g., firmware.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.
Claims (8)
1. An end-to-end geological disaster automatic identification method facing video monitoring is characterized by comprising the following steps:
firstly, transforming and optimizing a C3D network to obtain an H-C3D network suitable for extracting geological disaster features, inputting a video into the H-C3D network to extract the features, extracting a 4096-dimensional 3D feature from each 16 frames, and combining all the features into 32 x 4096-dimensional features according to a time sequence;
secondly, classifying the videos in a weak supervision mode, wherein the videos of the geological disasters are classified into one type, and the videos of the non-geological disasters are classified into one type; defining a multi-example learning method and a loss function thereof, building a classification recognition network R-Net, training the R-Net by adopting AdaGrad to obtain a training model, and judging the early warning level of the geological disaster by converting the 3D characteristics into abnormal values by the model;
thirdly, when the method is applied, 3D features are extracted from the video through an H-C3D network, the 3D features are recombined into 32 x 4096 dimensions according to time sequence, the combined features are sent to a trained classification recognition network R-Net, and abnormal scores are obtained in real time along with video frames;
fourthly, dividing the abnormal score into five intervals based on a large number of observation statistics, respectively corresponding to five early warning grades, and obtaining an early warning level of the geological disaster according to the interval where the abnormal score is located;
the video monitoring-oriented end-to-end geological disaster automatic identification method can extract the 3D characteristics of geological disaster/non-geological disaster videos: the method comprises the steps that a C3D network trained on a large-scale data set is added with a geological disaster video by utilizing the idea of transfer learning, and the C3D network is modified and optimized to obtain a network H-C3D suitable for extracting 3D characteristics of a geological disaster scene;
further comprising: taking a 16-frame sequence image as a unit, cutting the image into a size of 128 × 117, adjusting the length and width to a size of 112 × 112 internally, and horizontally flipping 50% of the total data volume to adapt to the jitter situation on time and space; the H-C3D network consists of 5 convolutional layers, 5 pooling layers and 1 fully-connected layer, the number of filters from 1 to 5 convolutional layers is 64, 128, 256, 512, 512, respectively, each convolutional layer is followed by a pooling layer, the size of each subsequent pooling layer is 2 x 2 by 1 step through 3D convolution kernels of 3 x 3 and the first pooling layer of 1 x 2; the output dimension of the fully connected layer is 4096 dimensions.
2. The video-monitoring-oriented end-to-end geological disaster automatic identification method as claimed in claim 1, characterized in that the video-monitoring-oriented end-to-end geological disaster automatic identification method utilizes 3D features extracted through an H-C3D network to train a geological disaster classification identification network R-Net: after the video is input into the H-C3D network by taking 16 frames as a unit to extract 3D features, the training network has 32 such features, and if the number of the video frames is less than 16 x 32, the previous features are repeated to the 32 th feature; if the number of the video frames is more than 16 × 32, arranging the features according to a time sequence, and obtaining 32 features at equal intervals; and (3) feeding the 32-by-4096-dimensional 3D features into an R-Net for training.
3. The video-monitoring-oriented end-to-end geological disaster automatic identification method as claimed in claim 1, characterized in that the multi-example learning loss function for training the geological disaster classification and identification network R-Net is:
wherein k is the total number of samples, w is the network parameter of R-Net,and (3) taking the fractional value of R-Net, wherein A is a geological disaster feature fragment set, N is a non-geological disaster video feature fragment set, and then adding time smoothness constraint and space smoothness constraint:
wherein λ1,λ2N is the total number of the characteristic segments in one video, and is a hyper-parameter; and (4) training R-Net by combining the loss function with an AdaGrad gradient descent algorithm to obtain a trained model and parameters thereof.
4. An automatic identification method system for video monitoring end-to-end geological disasters, which implements the automatic identification method for video monitoring end-to-end geological disasters according to any one of claims 1 to 3, comprises the following steps:
the characteristic extraction module is used for obtaining a network H-C3D suitable for extracting 3D characteristics of geological disaster scenes by modifying and optimizing a C3D network, extracting one 4096-dimensional 3D characteristic every 16 frames, and combining all the characteristics into 32 x 4096-dimensional characteristics according to a time sequence;
the video labeling module is used for classifying videos under weak supervision, wherein geological disaster videos are classified into one type, and non-geological disaster videos are classified into one type;
the video frame classification and identification module builds a geological disaster classification and identification network R-Net by defining a multi-example learning method and a loss function thereof, trains the R-Net by adopting an AdaGrad gradient descent algorithm to obtain a training model, and judges the early warning level of the geological disaster by converting 3D characteristics into abnormal values;
when the method is applied, 3D features are extracted from a video through an H-C3D network, the features are combined into 32 x 4096 dimensions according to a time sequence, the combined features are sent to a trained R-Net, and abnormal scores can be obtained in real time along with video frames;
according to observation statistics of a large amount of video data, dividing the abnormal score into five intervals, respectively corresponding to five early warning levels, and obtaining the early warning level of the geological disaster according to the interval where the abnormal score is located.
5. An information data processing terminal for realizing the video monitoring-oriented end-to-end geological disaster automatic identification method according to any one of claims 1 to 3.
6. A computer-readable storage medium comprising instructions which, when run on a computer, cause the computer to perform the video surveillance-oriented end-to-end geological disaster automatic identification method according to any of claims 1 to 3.
7. An application of the video monitoring oriented end-to-end geological disaster automatic identification method according to any one of claims 1 to 3 in geological disaster prevention and disaster reduction.
8. A geological disaster information data processing platform applying the video monitoring end-to-end geological disaster automatic identification method according to any one of claims 1 to 3.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010211669.0A CN111582017B (en) | 2020-03-24 | 2020-03-24 | Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010211669.0A CN111582017B (en) | 2020-03-24 | 2020-03-24 | Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111582017A CN111582017A (en) | 2020-08-25 |
CN111582017B true CN111582017B (en) | 2021-03-16 |
Family
ID=72117190
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010211669.0A Active CN111582017B (en) | 2020-03-24 | 2020-03-24 | Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111582017B (en) |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113033324B (en) * | 2021-03-03 | 2024-03-08 | 广东省地质环境监测总站 | Geological disaster precursor factor identification method and device, electronic equipment and storage medium |
CN115035439A (en) * | 2022-05-30 | 2022-09-09 | 广州交信投科技股份有限公司 | Campus abnormal event monitoring system based on deep network learning |
CN116778395B (en) * | 2023-08-21 | 2023-10-24 | 成都理工大学 | Mountain torrent flood video identification monitoring method based on deep learning |
Family Cites Families (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9377546B2 (en) * | 2012-05-06 | 2016-06-28 | Schlumberger Technology Corporation | Automatic extraction and characterization of fault and fracture populations |
CN103912310B (en) * | 2014-04-15 | 2016-05-25 | 辽宁工程技术大学 | A kind of mine motive force hazard prediction method |
CN110462445B (en) * | 2017-02-09 | 2022-07-26 | 地质探索系统公司 | Deep learning of geophysical |
CN109116419A (en) * | 2018-08-16 | 2019-01-01 | 中国石油大学(北京) | Based on the geologic anomaly conformation identification method and device that frequency-shared phase guiding is relevant |
CN109284780A (en) * | 2018-09-10 | 2019-01-29 | 中山大学 | Ore mineral image automatic identification and classification method |
CN110824142B (en) * | 2019-11-13 | 2022-06-24 | 杭州鲁尔物联科技有限公司 | Geological disaster prediction method, device and equipment |
-
2020
- 2020-03-24 CN CN202010211669.0A patent/CN111582017B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN111582017A (en) | 2020-08-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111582017B (en) | Video monitoring-oriented end-to-end geological disaster automatic identification method, system and application | |
Lee et al. | Rapid prediction of peak storm surge from tropical cyclone track time series using machine learning | |
Zhu et al. | Msnet: A multilevel instance segmentation network for natural disaster damage assessment in aerial videos | |
CN111160125B (en) | Railway foreign matter intrusion detection method based on railway monitoring | |
CN110232380A (en) | Fire night scenes restored method based on Mask R-CNN neural network | |
CN111862143B (en) | Automatic monitoring method for river dike collapse | |
CN102036079A (en) | Omni-directional prediction error histogram modification-based reversible image watermarking algorithm | |
CN110706211A (en) | Convolutional neural network-based real-time detection method for railway roadbed disease radar map | |
CN104751485B (en) | GPU adaptive foreground extracting method | |
CN114255250A (en) | River protection engineering slope collapse detection method based on deep learning | |
CN109145743A (en) | A kind of image-recognizing method and device based on deep learning | |
CN114783149A (en) | Geological disaster monitoring and early warning method based on big data and image recognition | |
Lu et al. | Cascading implications of a single climate change event for fragile ecosystems on the Qinghai‐Tibetan Plateau | |
CN104184997A (en) | Quick surveillance video playback technique | |
CN114220024B (en) | Static satellite sand storm identification method based on deep learning | |
Ren et al. | Research on identification of natural and unnatural earthquake events based on AlexNet convolutional neural network | |
CN114155551A (en) | Improved pedestrian detection method and device based on YOLOv3 under complex environment | |
CN118038021A (en) | Transformer substation operation site foreign matter intrusion detection method based on improvement yolov4 | |
CN118365983A (en) | Training method of target detection model, and anti-external-damage monitoring and early warning method and equipment | |
CN116665041A (en) | Water bank abnormal condition monitoring algorithm and device based on semantic segmentation | |
CN116844016A (en) | Flame detection method based on Fire-MCANet | |
CN116563809A (en) | Road disease identification method and device, electronic equipment and storage medium | |
KR20230086233A (en) | A Reconstructed Convolutional Neural Network Using Conditional Min Pooling and Method thereof | |
CN113609952A (en) | Deep-forgery video frequency domain detection method based on dense convolutional neural network | |
CN118164196B (en) | Method and system for monitoring health state of coal conveying belt based on machine vision |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |