WO2022121130A1 - 电力目标检测方法、装置、计算机设备和存储介质 - Google Patents
电力目标检测方法、装置、计算机设备和存储介质 Download PDFInfo
- Publication number
- WO2022121130A1 WO2022121130A1 PCT/CN2021/079280 CN2021079280W WO2022121130A1 WO 2022121130 A1 WO2022121130 A1 WO 2022121130A1 CN 2021079280 W CN2021079280 W CN 2021079280W WO 2022121130 A1 WO2022121130 A1 WO 2022121130A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- power
- target
- image
- identified
- power target
- Prior art date
Links
- 238000001514 detection method Methods 0.000 title claims abstract description 75
- 238000003860 storage Methods 0.000 title claims abstract description 13
- 238000012544 monitoring process Methods 0.000 claims abstract description 150
- 238000000034 method Methods 0.000 claims abstract description 50
- 238000004590 computer program Methods 0.000 claims description 18
- 238000010586 diagram Methods 0.000 description 15
- 230000008569 process Effects 0.000 description 10
- 230000008859 change Effects 0.000 description 8
- 230000006870 function Effects 0.000 description 8
- 238000004422 calculation algorithm Methods 0.000 description 6
- 238000004519 manufacturing process Methods 0.000 description 6
- 238000004891 communication Methods 0.000 description 5
- 238000009826 distribution Methods 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 238000012549 training Methods 0.000 description 4
- 241001465754 Metazoa Species 0.000 description 3
- 238000004364 calculation method Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 238000012805 post-processing Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 230000001960 triggered effect Effects 0.000 description 2
- ORILYTVJVMAKLC-UHFFFAOYSA-N Adamantane Natural products C1C(C2)CC3CC1CC2C3 ORILYTVJVMAKLC-UHFFFAOYSA-N 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 241000699666 Mus <mouse, genus> Species 0.000 description 1
- 241000699670 Mus sp. Species 0.000 description 1
- 241000270295 Serpentes Species 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 238000013434 data augmentation Methods 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 238000009877 rendering Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/52—Surveillance or monitoring of activities, e.g. for recognising suspicious objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
- G06N3/084—Backpropagation, e.g. using gradient descent
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/20—Image preprocessing
- G06V10/25—Determination of region of interest [ROI] or a volume of interest [VOI]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V2201/00—Indexing scheme relating to image or video recognition or understanding
- G06V2201/07—Target detection
Definitions
- the present application relates to the field of computer technology, and in particular, to a method, apparatus, computer device and storage medium for detecting a power target.
- the power targets in the power production equipment area are often monitored, for example, the power targets in the collected monitoring images of the power production equipment area are identified and detected.
- a power target detection method comprising:
- the power monitoring image includes a power target to be identified
- the image acquisition device is controlled at the second zoom factor to photograph the power target to be identified, and an image of the power target is obtained; in the power target image, the power target to be identified is located in the central main image area ; the second zoom factor is greater than the first zoom factor;
- a target classification result of the power target to be identified is determined.
- the method further includes:
- the method before the step of detecting whether there is a human body in the power monitoring image, the method further includes:
- the detecting whether there is a human body in the power monitoring image includes:
- the pedestrian detection result includes at least one of presence of pedestrians and absence of pedestrians;
- the pedestrian detection result is that there is a pedestrian and the change degree of the image content is greater than a preset threshold, it is determined that there is a human body in the power monitoring image.
- the determining of the key position points of the power target to be identified in the power monitoring image further includes:
- the key position points of the to-be-identified power target in the power monitoring image are determined.
- the method further includes:
- the step of controlling the image acquisition device to photograph the to-be-identified power target at a second zoom factor according to the key position points, and obtaining an image of the power target;
- identifying the target classification result of the to-be-identified power target according to the power target image includes:
- the power target image is input into a pre-trained target classification network to generate a target classification result of the to-be-identified power target.
- a power target detection device the device includes:
- an acquisition module configured to acquire a power monitoring image captured by the image acquisition device at a first zoom factor; the power monitoring image includes a power target to be identified;
- a determination module configured to determine the key position points of the to-be-identified power target in the power monitoring image
- control module configured to control the image acquisition device to photograph the to-be-identified power target at a second zoom factor according to the key position point to obtain a power target image; in the power target image, the to-be-identified power target in the central main image area; the second zoom factor is greater than the first zoom factor;
- the identification module is configured to determine the target classification result of the power target to be identified according to the power target image.
- a computer device includes a memory and a processor, the memory stores a computer program, and the processor implements the steps of the above method when the processor executes the computer program.
- the above power target detection method, device, computer equipment and storage medium by acquiring the power monitoring image including the power target to be identified and captured by the image acquisition device at the first zoom factor, and determining the power target to be identified in the power monitoring image. key position point; then, according to the key position point, the image acquisition device is controlled at the second zoom factor to shoot the power target to be identified, and an image of the power target is obtained; in the power target image, the power target to be identified is in the central main image area; the second zoom The multiple is greater than the first zoom multiple; finally, the target classification result of the power target to be identified is determined according to the power target image; in this way, after the power monitoring image identifies the key position points of the power target to be identified in the power monitoring image, and based on the power monitoring image At this key position, the image acquisition device is controlled at the second zoom factor to shoot the power target to be identified, and a high-definition power target image is obtained, so that the power target image can contain as many features of the power target to be identified as possible, and then can
- 1 is an application environment diagram of a power target detection method in one embodiment
- FIG. 2 is a schematic flowchart of a method for detecting a power target in an embodiment
- Fig. 3 is the training flow chart of a kind of key point detection network in one embodiment
- FIG. 4 is a schematic diagram of the network structure of a key point detection network in one embodiment
- FIG. 5 is a schematic structural diagram of a double downsampling module in one embodiment
- FIG. 6a is a display diagram of a component target prediction process in one embodiment
- FIG. 6b is a diagram showing an animal target prediction process in one embodiment
- FIG. 7 is a schematic flowchart of a method for detecting a power target in another embodiment
- FIG. 8 is a block diagram of a processing flow of a method for detecting a power target in one embodiment
- FIG. 9 is a structural block diagram of a power target detection device in one embodiment.
- Figure 10 is a diagram of the internal structure of a computer device in one embodiment.
- the power target detection method provided in this application can be applied to the application environment shown in FIG. 1 .
- the computer device 110 communicates with the image acquisition device 120 through a network.
- the computer device 110 acquires the power monitoring image captured by the image acquisition device 120 at the first zoom factor; the power monitoring image includes the power target to be identified.
- the computer device 110 determines the key position points of the power target to be identified in the power monitoring image.
- the computer device 110 controls the image acquisition device to photograph the power target to be identified at the second zoom factor according to the key position points, and obtains an image of the power target; in the power target image, the power target to be identified is in the central main image area; the second zoom The magnification is greater than the first zoom magnification.
- the computer device 110 determines the target classification result of the power target to be identified according to the power target image.
- the computer device 110 can be, but is not limited to, various personal computers, notebook computers, smart phones, tablet computers and portable wearable devices.
- a method for detecting a power target is provided, which is described by taking the method applied to the computer device 110 in FIG. 1 as an example, including the following steps:
- Step S210 acquiring a power monitoring image captured by the image acquisition device at a first zoom factor; the power monitoring image includes a power target to be identified.
- the power monitoring image may refer to an image collected when monitoring the power production area. Practical applications,
- the power target may refer to various targets that need to be detected in power production. For example, in the production area of a power plant, it is necessary to detect small objects such as parts of equipment (screws, nuts, etc.) and small animals (snakes, mice, cats, etc.).
- the image acquisition device may refer to a camera.
- the camera has a zoom function.
- digital zoom function optical zoom function.
- the computer device acquires the power monitoring image captured by the image acquisition device at the first zoom factor.
- Step S220 determining the key position points of the power target to be identified in the power monitoring image.
- the computer device determines the key position points of the power target to be identified in the power monitoring image.
- the computer equipment can input the power monitoring images to the pretrained keypoint recognition model.
- the key position points of the power target to be identified in the power monitoring image are identified through the pre-trained key point identification model.
- Step S230 according to the key position point, the image acquisition device is controlled to photograph the power target to be identified at the second zoom factor, and an image of the power target is obtained.
- the power target to be identified is located in the central main image area.
- the second zoom factor is greater than the first zoom factor.
- the computer device controls the image acquisition device to photograph the power target to be identified at the second zoom factor to obtain the power target image.
- the computer device sends the key position point to the image acquisition device, and then controls the image acquisition device to focus and zoom in, and shoots the to-be-identified power target so that the to-be-identified power target is in the central main image area, so as to obtain the power target image .
- Step S240 Determine the target classification result of the power target to be identified according to the power target image.
- the computer device determines the target classification result of the power target to be identified according to the power target image.
- the computer equipment can input the power target image into a pre-trained target classification network, identify the power target image through the pre-trained target classification network, and obtain a target classification result for the power target to be identified.
- the pre-trained target classification network can adopt the MobileNetV2 structure, in which the target classification network can use data augmentation to enhance the accuracy of multi-scale target recognition, that is, during training, by adjusting the size of the target to be recognized in the screen.
- the proportion of data is enhanced to reduce the influence of the size of the object in the target frame on the accuracy of target recognition.
- the power monitoring image including the power target to be identified and captured by the image acquisition device at the first zoom factor is obtained, and the key position points of the power target to be identified in the power monitoring image are determined;
- the image acquisition device is controlled at the second zoom factor to shoot the power target to be identified, and an image of the power target is obtained; in the power target image, the power target to be identified is in the central main image area; the second zoom ratio is greater than the first zoom ratio;
- the target classification result of the power target to be identified is determined according to the power target image; in this way, after the power monitoring image identifies the key position point of the power target to be identified in the power monitoring image, and based on the key position point, the image can be controlled
- the acquisition device shoots the power target to be identified at the second zoom factor, and obtains a high-definition power target image, so that the power target image can contain as many features of the power target to be identified as possible, and then can accurately treat the power target based on the power target image.
- the method further includes: detecting whether there is a human body in the power monitoring image; if there is no human body in the power monitoring image, Then execute the step of determining the key position points of the power target to be identified in the power monitoring image; if there is a human body in the power monitoring image, return to the step of acquiring the power monitoring image captured by the image acquisition device at the first zoom factor.
- the computer device after the computer device acquires the power monitoring image captured by the image acquisition device at the first zoom factor, the computer device also needs to detect whether there is a human body in the power monitoring image; if there is no human body in the power monitoring image, then Triggering the subsequent target detection network is to perform the step of determining the key position points of the power target to be identified in the power monitoring image; if there is a human body in the power monitoring image, the subsequent target detection network is not triggered, and the image acquisition device is returned to obtain the first The step of the power monitoring image captured at the zoom factor is to input the next image frame sequence.
- the detection algorithm can be pre-positioned with a trigger mechanism, and the detection algorithm is triggered for detection when certain conditions are met, which removes the interference of human targets, reduces the amount of calculation and saves energy. consumption.
- the method before the step of detecting whether there is a human body in the power monitoring image, the method further includes: acquiring a difference between the power monitoring image and a preset historical background image; if the difference is greater than a preset difference threshold, returning to detecting Steps to monitor the presence or absence of a human body in an image.
- detecting whether there is a human body in the power monitoring image includes: inputting the power monitoring image into a pre-trained pedestrian detection network to obtain a pedestrian detection result; the pedestrian detection result includes at least one of the presence of pedestrians and the absence of pedestrians If the pedestrian detection result is that there is a pedestrian, it is determined that there is a human body in the power monitoring image.
- the computer device before the computer device detects whether there is a human body in the power monitoring image, the computer device specifically includes: the computer device can obtain the difference between the power monitoring image and the preset historical background image; if the difference is greater than the preset difference threshold, triggering again It is determined whether there is a human body in the power monitoring image.
- the computer equipment can use the mixed Gaussian model for multiple frames of power monitoring images to establish a real-time model for the background of the moving target, and then take two consecutive frames of images for difference, so as to obtain the contour of the moving target. If the contour of the moving target exceeds the preset value When the target threshold, that is, the change degree of the image content is greater than the preset threshold, it is determined that there is a human body in the power monitoring image.
- mixed Gaussian background modeling is a background representation method based on the statistical information of pixel samples, which uses statistical information such as the probability density of a large number of sample values of pixels over a long period of time (such as the number of patterns, the mean and standard deviation of each pattern) to represent the background , and then use statistical difference (such as the 3 ⁇ principle) to judge the target pixel, which can model the complex dynamic background.
- the color information between pixels is considered to be irrelevant, and the processing of each pixel is mutually independant.
- the change of its value in the sequence image can be regarded as a random process that continuously generates pixel values, that is, the Gaussian distribution is used to describe the color rendering law of each pixel.
- Each new pixel value X t is compared with the current K models according to formula (1), until a distribution model matching the new pixel value is found, that is, the mean deviation from the model is within 2.5 ⁇ ;
- X t is the RGB component of the pixel X at time t
- ⁇ i, t-1 is the standard deviation of the ith Gaussian distribution at time t-1.
- the pixel belongs to the background, otherwise it belongs to the foreground.
- ⁇ represents the learning rate
- w k,t and w k,t-1 are the weights.
- ⁇ t (1- ⁇ )* ⁇ t-1 + ⁇ *X t (4);
- ⁇ represents the learning rate and ⁇ is the parameter learning rate.
- the pattern with the smallest weight is replaced, that is, the mean of the pattern is the current pixel value, the standard deviation is the initial larger value, and the weight is the smaller value;
- the patterns are arranged in descending order according to w/ ⁇ 2 , and the patterns with large weight and small standard deviation are arranged in the front;
- the first B modes are selected as the background, and the calculation of B is as in formula (6).
- T is the weight threshold, and the value range is [0.5, 1].
- the best distribution describing the background can be selected through the setting of T.
- the image difference method is a simple and effective method to detect whether there is an abnormality in the current image by subtracting the currently acquired image and the background historical image. By subtracting images, the speed of image processing can be improved.
- the present invention adopts the Gaussian background modeling algorithm to first perform background modeling on the input image, and then differentiate, so that it is more suitable for practical application scenarios.
- the formula for calculating the difference is as follows:
- F i represents the result of the image difference, where two frames of pictures are taken to make a difference for the corresponding pixels, the absolute value is taken, and then the average value of all the pixel values after the difference is taken, F i is the current frame image, F b represents The background image obtained by the mixture Gaussian background modeling algorithm.
- the computer device determines that there is a human body in the power monitoring image according to the result of the image difference.
- ⁇ F i represents the result of image difference
- T is a threshold obtained according to the experiment
- ⁇ F i is less than the threshold, it means that the difference between the current picture and the background is low
- the flag bit flag1 is set to False
- ⁇ F i is greater than or equal to the threshold value
- the flag bit flag1 is set to True. In this way, it can be determined based on the flag bit flag1 that there is a human body in the power monitoring image.
- the computer equipment inputs the power monitoring image to a pre-trained pedestrian detection network to obtain a pedestrian detection result; the pedestrian detection result includes at least one of the presence of pedestrians and the absence of pedestrians.
- the pretrained pedestrian detection network can be the open source RFBNet300 network. If the pedestrian detection result output by the pre-trained pedestrian detection network is that there is a pedestrian, the flag bit flag2 is set to False. If the pedestrian detection result output by the pre-trained pedestrian detection network is that there is no pedestrian, the flag bit flag2 is set to True.
- the computer device performs a summation operation on the flag bit flag1 and the flag bit flag2, and if the summation result is True (true), the step of determining the key position point of the power target to be identified in the power monitoring image is executed .
- the technical solution of this embodiment is to obtain the difference between the power monitoring image and the preset historical background image; if the difference is greater than the preset difference threshold, return to the step of detecting whether there is a human body in the power monitoring image; When the difference between the output power monitoring image and the preset historical background image is greater than the preset difference threshold, the subsequent target detection processing is further performed, which reduces the redundant calculation amount and saves the power consumption.
- determining the key position points of the power target to be identified in the power monitoring image includes: inputting the power monitoring image into a pre-trained key point detection network to obtain a heat map of key points for the power target to be identified; The key point heat map determines the key position points of the power target to be identified in the power monitoring image.
- the computer equipment specifically includes: the computer equipment inputs the power monitoring image into a pre-trained key point detection network, and obtains a target for the power target to be identified.
- the key point heat map of according to the key point heat map, determine the key position points of the power target to be identified in the power monitoring image.
- Figure 3 exemplarily provides a training flow chart of a key point detection network; as shown in Figure 3, it is necessary to perform data preprocessing on the input samples, and then perform the forward operation of the neural network, calculate the loss function, and backpropagate Update the network parameters of the keypoint detection network.
- the training data is the picture marked with the target frame.
- the data is first enhanced, including random flip, random color change, random contrast change, random brightness change, random scale change, etc. Then standardize, subtract the mean and divide by the variance. Finally, a heat map of the target is generated, that is, a two-dimensional Gaussian distribution with different variances is generated at the center of each target according to the size of the target box.
- Neural network forward operation input the processed data into the network to get the output, including the predicted heat map, and the offset of each prediction center relative to the target frame.
- the loss function calculates the loss of the output heat map and the predicted heat map, use the Focal loss loss function (focus loss function), and calculate the loss of the predicted target frame offset, use the smooth L1 loss (smoothed L1 loss), the loss function is as follows :
- FIG. 4 provides a schematic diagram of the network structure of a key point detection network; in which, the concatenate operation is to connect the feature maps by channel, and the bilinear interpolation algorithm is often used for double upsampling.
- the key point detection network includes a double downsampling module.
- FIG. 5 provides a schematic structural diagram of a double downsampling module.
- Post-processing (1) perform threshold operation on the heat map, set the minimum threshold to A, then the response value is lower than the position of A; (2) perform the maximum pooling operation to extract the response extreme point; (3) take the extreme point The predicted offset of the position yields the target bounding box.
- FIG. 6a provides a display diagram of a component target prediction process
- FIG. 6b provides a display diagram of an animal target prediction process.
- the method further includes: generating a target frame surrounding the power target to be identified in the power monitoring image according to the key position points; If the area of the target frame is within the preset area range, execute the step of controlling the image acquisition device at the second zoom factor to capture the power target to be identified according to the key position points, and obtain the power target image; if the area of the target frame is not within the preset area range, return to the step of acquiring the power monitoring image captured when the image capturing device is at the first zoom factor.
- the method further includes: the computer device generates a target frame surrounding the power target to be identified in the power monitoring image according to the key position points in the power monitoring image Then, the computer equipment judges whether the area S of the target frame is in the preset area range; If the computer equipment determines that the area S of the target frame is in the preset area range, S satisfies the following formula:
- S 1 and S 2 are the set area size limiting parameters.
- secondary identification that is, to control the image acquisition device to be in the second zoom factor to shoot according to the key position points.
- a target frame surrounding the power target to be identified is generated in the power monitoring image according to the key position points; if the area of the target frame is within the preset area range, the image acquisition device is controlled according to the key position points.
- a method for detecting a power target is provided, and the method is applied to the computer device 110 in FIG. 1 as an example for illustration, including the following steps: Step S710 , acquiring an image acquisition device The power monitoring image captured at the first zoom factor; the power monitoring image includes the power target to be identified. Step S720, acquiring the difference between the power monitoring image and a preset historical background image. Step S730, if the difference is greater than a preset difference threshold, detect whether there is a human body in the power monitoring image. Step S740, if there is no human body in the power monitoring image, determine the key position points of the power target to be identified in the power monitoring image.
- Step S750 control the image acquisition device to photograph the power target to be identified at a second zoom factor, and obtain a power target image; in the power target image, the power target to be identified is in the center the main image area; the second zoom factor is greater than the first zoom factor.
- Step S760 Determine a target classification result of the power target to be identified according to the power target image.
- FIG. 8 provides a block diagram of the processing flow of a power target detection method; wherein, firstly input the image frame sequence; the computer equipment judges whether the detection requirements are met; if the detection requirements are met, the image frame sequence is input Then, the computer equipment further judges whether secondary recognition is required; if secondary recognition is required, the secondary recognition is performed; the recognition result is output, and the input next image frame sequence is returned.
- a power target detection device including:
- an acquisition module 910 configured to acquire a power monitoring image captured when the image acquisition device is at a first zoom factor; the power monitoring image includes a power target to be identified;
- a determination module 920 configured to determine the key position points of the power target to be identified in the power monitoring image
- the control module 930 is configured to control the image acquisition device to photograph the to-be-identified power target at a second zoom factor according to the key position point, to obtain a power target image; in the power target image, the to-be-identified power target The target is in the central main image area; the second zoom factor is greater than the first zoom factor;
- the identification module 940 is configured to determine a target classification result of the power target to be identified according to the power target image.
- the power target detection device further includes: a human body detection module, configured to detect whether there is a human body in the power monitoring image; if there is no human body in the power monitoring image, perform the determination The step of the key position points of the power target to be identified in the power monitoring image; if there is a human body in the power monitoring image, returning the power monitoring image captured by the acquired image acquisition device at the first zoom factor A step of.
- a human body detection module configured to detect whether there is a human body in the power monitoring image; if there is no human body in the power monitoring image, perform the determination The step of the key position points of the power target to be identified in the power monitoring image; if there is a human body in the power monitoring image, returning the power monitoring image captured by the acquired image acquisition device at the first zoom factor A step of.
- the power target detection device further includes: a difference acquisition module, configured to acquire the difference between the power monitoring image and a preset historical background image; if the difference is greater than a preset difference threshold , then return to the step of detecting whether there is a human body in the power monitoring image.
- a difference acquisition module configured to acquire the difference between the power monitoring image and a preset historical background image; if the difference is greater than a preset difference threshold , then return to the step of detecting whether there is a human body in the power monitoring image.
- the human body detection module is specifically configured to input the power monitoring image into a pre-trained pedestrian detection network to obtain a pedestrian detection result; the pedestrian detection result includes the presence of pedestrians and the absence of pedestrians. At least one; if the pedestrian detection result is that there is a pedestrian and the change degree of the image content is greater than a preset threshold, it is determined that there is a human body in the power monitoring image.
- the determining module 920 is specifically configured to input the power monitoring image into a pre-trained key point detection network to obtain a heat map of key points for the power target to be identified; according to the key points The heat map determines the key position points of the power target to be identified in the power monitoring image.
- the power target detection device further includes: a judgment module, configured to generate a target frame surrounding the to-be-identified power target in the power monitoring image according to the key position points; If the area of the target frame is within the preset area range, then execute the step of controlling the image acquisition device to photograph the to-be-identified power target at the second zoom factor according to the key position points, and obtain an image of the power target; If the area of the target frame is not within the preset area range, return to the step of acquiring the power monitoring image captured when the image capturing device is at the first zoom factor.
- a judgment module configured to generate a target frame surrounding the to-be-identified power target in the power monitoring image according to the key position points
- the identification module 940 is specifically configured to input the power target image into a pre-trained target classification network to generate a target classification result of the power target to be identified.
- All or part of the modules in the above-mentioned power target detection device can be implemented by software, hardware and combinations thereof.
- the above modules can be embedded in or independent of the processor in the computer device in the form of hardware, or stored in the memory in the computer device in the form of software, so that the processor can call and execute the operations corresponding to the above modules.
- a computer device is provided, and the computer device may be a terminal, and its internal structure diagram may be as shown in FIG. 10 .
- the computer equipment includes a processor, memory, a communication interface, a display screen, and an input device connected by a system bus.
- the processor of the computer device is used to provide computing and control capabilities.
- the memory of the computer device includes a non-volatile storage medium, an internal memory.
- the nonvolatile storage medium stores an operating system and a computer program.
- the internal memory provides an environment for the execution of the operating system and computer programs in the non-volatile storage medium.
- the communication interface of the computer device is used for wired or wireless communication with an external terminal, and the wireless communication can be realized by WIFI, operator network, NFC (Near Field Communication) or other technologies.
- the computer program when executed by a processor, implements a method for detecting an electrical target.
- the display screen of the computer equipment may be a liquid crystal display screen or an electronic ink display screen, and the input device of the computer equipment may be a touch layer covered on the display screen, or a button, a trackball or a touchpad set on the shell of the computer equipment , or an external keyboard, trackpad, or mouse.
- FIG. 10 is only a block diagram of a partial structure related to the solution of the present application, and does not constitute a limitation on the computer equipment to which the solution of the present application is applied. Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
- a computer device including a memory and a processor, the memory stores a computer program, and when the computer program is executed by the processor, the processor executes the steps of the above-mentioned method for detecting a power target.
- the steps of an electric power target detection method here may be the steps in the electric power target detection method of each of the above embodiments.
- a computer-readable storage medium storing a computer program, and when the computer program is executed by a processor, the processor causes the processor to perform the steps of the above-mentioned method for detecting a power target.
- the steps of an electric power target detection method here may be the steps in the electric power target detection method of each of the above embodiments.
- Non-volatile memory may include read-only memory (Read-Only Memory, ROM), magnetic tape, floppy disk, flash memory, or optical memory, and the like.
- Volatile memory may include random access memory (RAM) or external cache memory.
- the RAM may be in various forms, such as Static Random Access Memory (SRAM) or Dynamic Random Access Memory (DRAM).
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Molecular Biology (AREA)
- Computational Linguistics (AREA)
- Multimedia (AREA)
- Software Systems (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Mathematical Physics (AREA)
- General Health & Medical Sciences (AREA)
- Computing Systems (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (10)
- 一种电力目标检测方法,其特征在于,包括:获取图像采集设备处于第一变焦倍数时拍摄到的电力监控图像;所述电力监控图像包括有待识别电力目标;确定所述待识别电力目标在所述电力监控图像中的关键位置点;根据所述关键位置点,控制所述图像采集设备处于第二变焦倍数拍摄所述待识别电力目标,得到电力目标图像;在所述电力目标图像中,所述待识别电力目标处于中心主图区;所述第二变焦倍数大于所述第一变焦倍数;根据所述电力目标图像,确定所述待识别电力目标的目标分类结果。
- 根据权利要求1所述的方法,其特征在于,在所述获取图像采集设备处于第一变焦倍数时拍摄到的电力监控图像的步骤之后,所述方法还包括:检测所述电力监控图像中是否存在人体;若所述电力监控图像中不存在人体,则执行所述确定所述待识别电力目标在所述电力监控图像中的关键位置点的步骤;若所述电力监控图像中存在人体,则返回所述获取图像采集设备处于第一变焦倍数时拍摄到的电力监控图像的步骤。
- 根据权利要求2所述的方法,其特征在于,在所述检测所述电力监控图像中是否存在人体的步骤之前,还包括:获取所述电力监控图像与预设的历史背景图像之间的差异;若所述差异大于预设差异阈值,则返回所述检测所述电力监控图像中是否存在人体的步骤。
- 根据权利要求2所述的方法,其特征在于,所述检测所述电力监控图像中是否存在人体,包括:将所述电力监控图像输入至预训练的行人检测网络,得到行人检测结果;所述行人检测结果包括存在行人和不存在行人中的至少一种;若所述行人检测结果为存在行人,则判定所述电力监控图像中存在人体。
- 根据权利要求1所述的方法,其特征在于,所述确定所述待识别电力目标在所述电力监控图像中的关键位置点,包括:将所述电力监控图像输入预训练的关键点检测网络,得到针对所述待识别电力目标的关键点热力图;根据所述关键点热力图,确定所述待识别电力目标在所述电力监控图像中的关键位置点。
- 根据权利要求1所述的方法,其特征在于,在所述确定所述待识别电力目标在所述电力监控图像中的关键位置点的步骤之后,所述方法还包括:根据所述关键位置点,在所述电力监控图像中生成包围所述待识别电力目标的目标框;若所述目标框的面积处于预设面积范围,则执行所述根据所述关键位置点,控制所述图像采集设备处于第二变焦倍数拍摄所述待识别电力目标,得到电力目标图像的步骤;若所述目标框的面积不处于预设面积范围,则返回所述获取图像采集设备处于第一变焦倍数时拍摄到的电力监控图像的步骤。
- 根据权利要求1至6任意一项所述的方法,其特征在于,所述根据所述电力目标图像,识别出所述待识别电力目标的目标分类结果,包括:将所述电力目标图像输入至预训练的目标分类网络,生成所述待识别电力目标的目标分类结果。
- 一种电力目标检测装置,其特征在于,所述装置包括:获取模块,用于获取图像采集设备处于第一变焦倍数时拍摄到的电力监控图像;所述电力监控图像包括有待识别电力目标;确定模块,用于确定所述待识别电力目标在所述电力监控图像中的关键位置点;控制模块,用于根据所述关键位置点,控制所述图像采集设备处于第二变焦倍数拍摄所述待识别电力目标,得到电力目标图像;在所述电力目标图像中,所述待识别电力目标处于中心主图区;所述第二变焦倍数大于所述第一变焦倍数;识别模块,用于根据所述电力目标图像,确定所述待识别电力目标的目标 分类结果。
- 一种计算机设备,包括存储器和处理器,所述存储器存储有计算机程序,其特征在于,所述处理器执行所述计算机程序时实现权利要求1至7中任一项所述的方法的步骤。
- 一种计算机可读存储介质,其上存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现权利要求1至7中任一项所述的方法的步骤。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011452606.0A CN112989910A (zh) | 2020-12-12 | 2020-12-12 | 电力目标检测方法、装置、计算机设备和存储介质 |
CN202011452606.0 | 2020-12-12 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022121130A1 true WO2022121130A1 (zh) | 2022-06-16 |
Family
ID=76344935
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/079280 WO2022121130A1 (zh) | 2020-12-12 | 2021-03-05 | 电力目标检测方法、装置、计算机设备和存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN112989910A (zh) |
WO (1) | WO2022121130A1 (zh) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116451168A (zh) * | 2023-06-15 | 2023-07-18 | 北京国电通网络技术有限公司 | 异常电力信息生成方法、装置、电子设备和可读介质 |
CN116953416A (zh) * | 2023-09-19 | 2023-10-27 | 英迪格(天津)电气有限公司 | 一种铁路变配电装置运行状态的监控系统 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113989695B (zh) * | 2021-09-18 | 2022-05-20 | 北京远度互联科技有限公司 | 目标跟踪方法、装置、电子设备及存储介质 |
CN115187603A (zh) * | 2022-09-13 | 2022-10-14 | 国网浙江省电力有限公司 | 一种基于深度神经网络的电力设备检测方法及装置 |
CN117612095A (zh) * | 2023-11-27 | 2024-02-27 | 中国南方电网有限责任公司 | 一种电力安全控制方法及装置 |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150294176A1 (en) * | 2012-12-26 | 2015-10-15 | Canon Kabushiki Kaisha | Automatic tracking image pickup system |
CN105100724A (zh) * | 2015-08-13 | 2015-11-25 | 电子科技大学 | 一种基于视觉分析的智能家居远程安全监控方法及装置 |
CN107336417A (zh) * | 2017-06-13 | 2017-11-10 | 上海斐讯数据通信技术有限公司 | 一种基于机器视觉的模具保护方法及系统 |
CN107766801A (zh) * | 2017-09-28 | 2018-03-06 | 中国南方电网有限责任公司超高压输电公司检修试验中心 | 一种基于红外热点追踪的绝缘子识别方法及装置 |
CN110139037A (zh) * | 2019-05-20 | 2019-08-16 | 深圳铂石空间科技有限公司 | 对象监测方法及装置、存储介质和电子设备 |
CN110149482A (zh) * | 2019-06-28 | 2019-08-20 | Oppo广东移动通信有限公司 | 对焦方法、装置、电子设备和计算机可读存储介质 |
CN111339846A (zh) * | 2020-02-12 | 2020-06-26 | 深圳市商汤科技有限公司 | 图像识别方法及装置、电子设备和存储介质 |
-
2020
- 2020-12-12 CN CN202011452606.0A patent/CN112989910A/zh active Pending
-
2021
- 2021-03-05 WO PCT/CN2021/079280 patent/WO2022121130A1/zh active Application Filing
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150294176A1 (en) * | 2012-12-26 | 2015-10-15 | Canon Kabushiki Kaisha | Automatic tracking image pickup system |
CN105100724A (zh) * | 2015-08-13 | 2015-11-25 | 电子科技大学 | 一种基于视觉分析的智能家居远程安全监控方法及装置 |
CN107336417A (zh) * | 2017-06-13 | 2017-11-10 | 上海斐讯数据通信技术有限公司 | 一种基于机器视觉的模具保护方法及系统 |
CN107766801A (zh) * | 2017-09-28 | 2018-03-06 | 中国南方电网有限责任公司超高压输电公司检修试验中心 | 一种基于红外热点追踪的绝缘子识别方法及装置 |
CN110139037A (zh) * | 2019-05-20 | 2019-08-16 | 深圳铂石空间科技有限公司 | 对象监测方法及装置、存储介质和电子设备 |
CN110149482A (zh) * | 2019-06-28 | 2019-08-20 | Oppo广东移动通信有限公司 | 对焦方法、装置、电子设备和计算机可读存储介质 |
CN111339846A (zh) * | 2020-02-12 | 2020-06-26 | 深圳市商汤科技有限公司 | 图像识别方法及装置、电子设备和存储介质 |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116451168A (zh) * | 2023-06-15 | 2023-07-18 | 北京国电通网络技术有限公司 | 异常电力信息生成方法、装置、电子设备和可读介质 |
CN116451168B (zh) * | 2023-06-15 | 2023-09-12 | 北京国电通网络技术有限公司 | 异常电力信息生成方法、装置、电子设备和可读介质 |
CN116953416A (zh) * | 2023-09-19 | 2023-10-27 | 英迪格(天津)电气有限公司 | 一种铁路变配电装置运行状态的监控系统 |
CN116953416B (zh) * | 2023-09-19 | 2023-12-08 | 英迪格(天津)电气有限公司 | 一种铁路变配电装置运行状态的监控系统 |
Also Published As
Publication number | Publication date |
---|---|
CN112989910A (zh) | 2021-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022121130A1 (zh) | 电力目标检测方法、装置、计算机设备和存储介质 | |
US11222239B2 (en) | Information processing apparatus, information processing method, and non-transitory computer-readable storage medium | |
CN109255352B (zh) | 目标检测方法、装置及系统 | |
CN109271870B (zh) | 行人重识别方法、装置、计算机设备及存储介质 | |
CN108960163B (zh) | 手势识别方法、装置、设备和存储介质 | |
US10534957B2 (en) | Eyeball movement analysis method and device, and storage medium | |
US8805018B2 (en) | Method of detecting facial attributes | |
JP6351240B2 (ja) | 画像処理装置、画像処理方法及びプログラム | |
CN110097050B (zh) | 行人检测方法、装置、计算机设备及存储介质 | |
US11900676B2 (en) | Method and apparatus for detecting target in video, computing device, and storage medium | |
CN111126208B (zh) | 行人归档方法、装置、计算机设备及存储介质 | |
JP6756406B2 (ja) | 画像処理装置、画像処理方法および画像処理プログラム | |
CN108764143B (zh) | 图像处理方法、装置、计算机设备和存储介质 | |
CN112446379A (zh) | 一种动态大场景自适应智能处理方法 | |
CN114155365B (zh) | 模型训练方法、图像处理方法及相关装置 | |
CN111062263A (zh) | 手部姿态估计的方法、设备、计算机设备和存储介质 | |
JP2012059076A (ja) | 情報処理装置、その認識方法及びプログラム | |
KR20140095333A (ko) | 영상에서 객체 추적 방법 및 장치 | |
Ribeiro et al. | Face detection on infrared thermal image | |
CN113743426A (zh) | 一种训练方法、装置、设备以及计算机可读存储介质 | |
CN113780145A (zh) | 精子形态检测方法、装置、计算机设备和存储介质 | |
CN113706481A (zh) | 精子质量检测方法、装置、计算机设备和存储介质 | |
CN116740652A (zh) | 一种基于神经网络模型的锈斑面积扩大的监测方法与系统 | |
CN108875488B (zh) | 对象跟踪方法、对象跟踪装置以及计算机可读存储介质 | |
Mei et al. | Fast template matching in multi-modal image under pixel distribution mapping |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21901847 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21901847 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 14.12.2023) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21901847 Country of ref document: EP Kind code of ref document: A1 |