WO2020151153A1 - Image processing method and apparatus, and computer device and storage medium - Google Patents

Image processing method and apparatus, and computer device and storage medium Download PDF

Info

Publication number
WO2020151153A1
WO2020151153A1 PCT/CN2019/089196 CN2019089196W WO2020151153A1 WO 2020151153 A1 WO2020151153 A1 WO 2020151153A1 CN 2019089196 W CN2019089196 W CN 2019089196W WO 2020151153 A1 WO2020151153 A1 WO 2020151153A1
Authority
WO
WIPO (PCT)
Prior art keywords
area
pixel
sampling
feature map
pixel value
Prior art date
Application number
PCT/CN2019/089196
Other languages
French (fr)
Chinese (zh)
Inventor
王义文
张文龙
王健宗
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020151153A1 publication Critical patent/WO2020151153A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features

Definitions

  • This application relates to the field of image processing, and in particular to an image processing method, device, computer equipment and storage medium.
  • Target detection In the field of image processing, it is usually necessary to detect and analyze a certain area, which is called target detection.
  • target detection In the field of image processing, it is usually necessary to detect and analyze a certain area, which is called target detection.
  • target detection Among the many frameworks in the field of target detection, such as Fast-RCNN, Faster-RCNN, RFCN, and ROI Pooling's role is to pool the corresponding area into a fixed size in the feature map according to the preselected box Region Proposal, that is, the position coordinates of the candidate area Feature map for subsequent classification operations.
  • the position information of Region Proposal is obtained by model regression, and the obtained position information is generally in the form of floating-point numbers, and the feature map of the pooling operation requires a fixed size.
  • the process of pooling the corresponding region into a fixed-size feature map involves two rounding and quantization operations: (1) Rounding and quantizing the Region Proposal boundary to integer point coordinate values. (2) The boundary area after rounding and quantization is equally divided into N xN units, and the boundary of each unit is rounded and quantized.
  • the obtained candidate frame has a certain deviation from the initial regression position. This deviation will affect the accuracy of detection or segmentation, which is called a region misalignment problem (misalignment).
  • region misalignment region misalignment problem
  • the above-mentioned integer quantization operation is cancelled, and the image value on the pixel with the floating point coordinate is obtained by the bilinear interpolation method, so as to transform the entire feature aggregation process into a continuous operation.
  • This application provides an image processing method, device, computer equipment, and storage medium, so that the pixel value calculation result of the fixed-size feature map finally obtained is more accurate.
  • An image processing method comprising: obtaining a feature map of a candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is a feature extraction method of an input image through a convolutional neural network It is obtained that the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; dividing the candidate area feature map into NxM small areas of equal size according to a preset candidate area pooling parameter, The N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include a width parameter and a length parameter for pooling processing; each of the small regions is averaged according to the preset number of sampling points P Divide into P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the candidate area feature map; according to the candidate area feature map and the The pixel value at the center point of the sampling area is determined by the intersecting pixel points where the sampling area intersects; according to the
  • An image processing device includes: a first acquisition module, configured to acquire a feature map of a candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is obtained through a convolutional neural network
  • the input image is obtained by feature extraction, and the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm;
  • the first division module is configured to group the candidate area according to preset candidate area pooling parameters
  • the candidate region feature map acquired by the first acquiring module is divided into NxM small regions of the same size, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include those used for pooling Processed width parameters and length parameters;
  • the second division module is configured to divide the small areas divided by each of the first division modules into P sampling areas on average according to the preset number of sampling points P, where P is positive An integer and greater than or equal to 2;
  • a first determining module configured to determine the intersecting pixels in the candidate region
  • a computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor.
  • the processor executes the computer-readable instructions, the following steps are implemented: obtaining candidates The feature map of the region, the feature map of the candidate region is obtained by mapping the candidate region to the feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, and the candidate region is based on a preset
  • the target detection algorithm is obtained by detecting the target area of the input image; according to the preset candidate area pooling parameter, the candidate area feature map is divided into NxM small areas of equal size, where N and M are positive integers and greater than Or equal to 1, the preset candidate area pooling parameters include width parameters and length parameters for pooling processing; each small area is divided into P sampling areas on average according to the preset number of sampling points P, Is a positive integer and greater than or equal to 2; determine the intersection pixel point that intersects the sampling area in the candidate area feature map; determine the intersection pixel point that intersects
  • One or more non-volatile readable storage media storing computer readable instructions.
  • the one or more processors perform the following steps: obtaining A feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to a feature map, the feature map is obtained by feature extraction of an input image through a convolutional neural network, and the candidate area is based on a prediction
  • a target detection algorithm performs target region detection on the input image
  • the candidate region feature map is divided into NxM small regions of equal size, where N and M are positive integers and Greater than or equal to 1
  • the preset candidate area pooling parameters include a width parameter and a length parameter for pooling processing
  • each small area is divided into P sampling areas on average according to the preset number of sampling points P, P is a positive integer and greater than or equal to 2
  • Fig. 1 is a schematic diagram of a system frame diagram applied by the image processing method in this application;
  • FIG. 2 is a schematic flowchart of an embodiment of the image processing method in this application.
  • FIG. 3 is a schematic diagram of the sampling area of the feature map of the candidate area in the present application.
  • FIG 5 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application.
  • FIG. 6 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application.
  • FIG. 7 is a schematic flowchart of another embodiment of the image processing method in this application.
  • FIG. 8 is a schematic flowchart of another embodiment of the image processing method in this application.
  • FIG. 9 is a schematic flowchart of another embodiment of the image processing method in this application.
  • FIG. 10 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application.
  • FIG. 11 is a schematic flowchart of another embodiment of the image processing method in this application.
  • FIG. 12 is a schematic structural diagram of an embodiment of an image processing device in this application.
  • Fig. 13 is a schematic structural diagram of an embodiment of a computer device in this application.
  • the embodiment of the application provides an image processing method, which can be applied to the system framework diagram shown in Fig. 1.
  • the server can execute the processing method provided by this image processing method on the input image to obtain a fixed-size feature map after processing, where:
  • the server can be implemented by an independent server or a server cluster composed of multiple servers. The following describes the embodiments of the present application in detail:
  • an image processing method including the following steps:
  • S10 Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, the candidate region It is obtained by performing target area detection on the input image according to a preset target detection algorithm.
  • the embodiments of this application are applied to Regions with Convolutional Neural Networks (RCNN), including but not limited to Fast RCNN, Faster RCNN, and Region-based Fully Convolutional Networks (Region-based Fully Convolutional Networks, RFCN).
  • RCNN Regions with Convolutional Neural Networks
  • RFCN Region-based Fully Convolutional Networks
  • the input image is processed by the convolutional layer, the pooling layer, the region of interest pooling layer (ROI pooling), and the fully connected layer.
  • ROI pooling layer performs processing.
  • the candidate area is to find out the possible position of the target in the input image in advance, that is, the area of interest, and detect the candidate area of the input image by using feature information such as texture, edge, and color in the input image.
  • the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm.
  • the preset target detection algorithm specifically uses selective search to directly perform a selective search on the input image.
  • the preset target detection algorithm refers to the feature map of the input image after the feature extraction of the convolutional layer, and then through the regional candidate network (region proposal network, PRN) to perform the feature map
  • PRN region proposal network
  • the feature map of the candidate region refers to the image obtained after mapping the candidate region to the feature map corresponding to the input image. It can be understood that the input image passes through the convolutional layer after inputting the convolutional layer of the region-based convolutional neural network.
  • the feature extraction process can obtain the feature map corresponding to the input object.
  • the specific feature extraction process of the convolutional layer will not be repeated here.
  • the candidate region feature map corresponding to the input image can be obtained.
  • the candidate region feature map referred to in the embodiment of this application generally refers to each candidate region feature map corresponding to the input image. For ease of description, the following describes the image processing method proposed in the embodiment of the present application with a SAR map of the candidate area.
  • S20 Divide the candidate region feature map into NxM small regions of equal size according to preset candidate region pooling parameters, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters Including width parameters and length parameters.
  • the preset candidate region pooling parameters are preset parameters in the ROI pooling layer.
  • the above preset candidate region pooling parameters are intended to pool the candidate region feature maps into parameters of fixed-size feature maps, and the specific address includes the width parameter (pooled-h) and length parameter (pooled-w).
  • the input image has a candidate area with a size of 665*665
  • the candidate area is mapped to the feature map, and the result is
  • the size of the feature map of the candidate area is: 665/32 ⁇ 20.78, that is, the feature map of the candidate area is: 20.78*20.78.
  • the candidate region feature map is divided into NxM small regions of equal size according to preset candidate region pooling parameters, and the specific sizes of N and M are configured by actual application requirements.
  • the embodiment of this application does not specifically limit it.
  • the above N and M may also be 8 respectively.
  • the candidate region feature map is fixed to 8*8 after processing The size of the feature map.
  • S30 Divide each small area into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2.
  • the candidate area feature map is divided into NxM small areas of equal size according to the preset candidate area pooling parameters
  • the following processing is performed for each small area: according to the preset number of sampling points P
  • P is a positive integer and greater than or equal to 2
  • the sampling areas have the same shape type as the pixel points of the feature map of the candidate area.
  • the preset number of sampling points P is the set number of sampling points used to calculate the pixel value of each small area.
  • the preset number of sampling points may be 4, 8, etc., which is not specifically described in the embodiment of this application. limited.
  • S40 Determine an intersection pixel point that intersects the sampling area in the feature map of the candidate area.
  • each small area is divided into P sampling areas on average according to the preset number of sampling points P, and then the intersecting pixel points that intersect the sampling area in the feature map of the candidate area are determined. That is, after each sampling area corresponding to each small area is obtained through the processing of step S30, the intersecting pixel points that respectively intersect each sampling area in the feature map of the candidate area are determined.
  • the intersecting pixel points that intersect each sampling area can be determined by the coordinate position of each sampling area and the coordinate position of the pixel point of the candidate area SAR map.
  • the center point positions of the 4 sampling areas corresponding to each small area and the intersection pixel points that intersect the 4 sampling areas corresponding to the small area can be obtained.
  • the area shown in Fig. 3 is only a part of the feature map of the candidate area, including 1-16 pixels, and the area where the thick-lined boxes A, B, C and D are located
  • sampling areas A, B, C, and D constitute a small area.
  • the position of the center point of sampling area A can be determined by the bilinear difference interpolation algorithm, and the intersecting pixels that intersect with the sampling area A, namely pixels 1, 2, 5, and 6. .
  • the center point position and the intersection pixel point that intersects the sampling area can be determined.
  • the target pixel points that intersect with sampling area B are 2, 3, and 6. And 7.
  • S50 Determine the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area.
  • the pixel value of the center point of the sampling area is determined. As shown in FIG. 3, taking the sampling area A as an example, the pixel value at the center point of the sampling area A can be determined according to the intersecting pixels 1, 2, 3, and 4 that intersect the sampling area A. It can be understood that based on the same calculation method, the pixel value of the center point position of each sampling area corresponding to each small area can be obtained respectively.
  • S60 Correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to each small area.
  • the pixel value of the center point position of the sampling area of each small area in the candidate area can be obtained.
  • the pixel value of the center point position of each sampling area corresponding to the small area is obtained.
  • the pixel value corresponds to the pixel value of each small area.
  • the pixel values of the center point positions of the sampling areas A, B, C, and D corresponding to the small areas can be obtained, respectively, and the correspondingly determined in Figure 3, composed of the sampling areas A, B, C, and D
  • the pixel value of a small area Based on the same calculation method, the pixel value corresponding to each small area can be obtained, which will not be illustrated here.
  • S70 Obtain a fixed-size feature map of the candidate area according to the pixel value of each small area.
  • each small area After the pixel value of each small area is correspondingly determined according to the pixel value of the center point position of each sampling area corresponding to each small area, the processing is obtained according to the pixel value of each small area A feature map of the candidate area with a fixed size afterwards.
  • each small area is 2.97*2.97, after the previous steps S10-S70, get the pixel value corresponding to each small area, get the pixel value of 49 small areas, and output a 7*7 candidate area feature map.
  • the embodiment of the present application provides an image processing method, which effectively ensures that the corresponding candidate area is pooled into a fixed-size feature map according to the intersection relationship between the sampling area and the pixel, and because the calculation is performed based on the intersection pixel , The calculation result of the pixel value of the final sampling area is more accurate, so that the finally obtained pixel value calculation result of the fixed-size feature map is also more accurate.
  • the embodiment of the present application also provides specific information based on the intersection of the candidate area feature map and the sampling area.
  • the method for determining the pixel value of the center point position of the sampling area by the target intersecting pixel point is described below:
  • step 50 that is, according to the target intersecting pixel points that intersect the sampling area in the feature map of the candidate area, the center point position of the sampling area is determined before the pixel value of, the method further includes the following steps:
  • S80 Determine whether the size of the sampling area is greater than or equal to the size of the pixel of the feature map of the candidate area.
  • step S30 after each small area is divided into P sampling areas on average according to the preset number of sampling points P, it is determined whether the sampling area of each small area is greater than or equal to the size of the pixel point in the feature map of the candidate area. It can be understood that since the size of each pixel of the input image is the same, the size of each pixel in the feature map of the candidate area obtained by mapping is also the same, and because each sample of the candidate area obtained by the division The sizes of the regions are the same, so it can be judged whether any sample region is greater than or equal to the size of any pixel in the feature map of the candidate region.
  • the auxiliary frame corresponding to the sampling area is generated with the center point position as the center, that is, for each small area
  • Each sampling area will generate a corresponding auxiliary frame.
  • the shape of the auxiliary frame is the same as the pixel of the feature map of the candidate area, and the size of the auxiliary frame is smaller than or equal to the size of the pixel of the feature map of the candidate area .
  • the bilinear interpolation method may be used in the embodiment of the present application to determine the center point position of each sampling area, and the process of specifically calculating the center point position of each sampling area will not be repeated here.
  • the size of the sampling area is greater than or equal to the size of the pixel of the candidate area image (in FIG. 5, the auxiliary frame and the pixel are the same size as an example).
  • the sampling area D Take the sampling area D as an example.
  • the auxiliary frame d corresponding to the sampling area D can be generated at the center point of the sampling area D, as shown by the slash box in FIG. 5, where the sampling area D corresponds to
  • the shape of the auxiliary frame d is the same as the pixels of the feature map of the candidate area, and the size of the auxiliary frame d corresponding to the sampling area D is smaller than or equal to the pixels of the feature map of the candidate area.
  • the sampling area D is only taken as an example for description.
  • the corresponding auxiliary frame can be generated through the embodiment of the present application.
  • step S50 the determining the pixel value of the center point position of the sampling area according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area specifically includes the following steps:
  • S50' Determine the pixel value at the center point of the sampling area according to the intersecting pixels that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area.
  • the center point position of the sampling area is determined according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the corresponding auxiliary frame The pixel value.
  • sampling area D As an example, it can be seen that the intersecting pixels that intersect with sampling area D are pixels 6, 7, 10, and 11, and the auxiliary frame corresponding to sampling area D is auxiliary frame d, the pixel value at the center point of the sampling area D is determined according to the pixels 6, 7, 10, 11 and the auxiliary frame d.
  • the other sampling areas of the feature map of the candidate area are determined based on the pixel value calculation method of the center point position of the sampling area D.
  • the sampling area A it is visible and sampled.
  • the intersecting pixels in area A are pixels 1, 2, 5, and 6, and the auxiliary frame corresponding to sampling area A is auxiliary frame a.
  • the pixel points 1, 2, 5, 6 and auxiliary frame a are used to determine this
  • the calculation methods of the pixel value at the center point of the sampling area a and the pixel values at the center points of other sampling areas are not described here.
  • step S50' that is, according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area, and the auxiliary frame corresponding to the sampling area
  • Determining the pixel value of the center point of the sampling area specifically includes the following steps:
  • S52' Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point, and the pixel value of each pixel point.
  • S53' Use the first target pixel value corresponding to the sampling area as the pixel value at the center point of the sampling area.
  • step S51 ⁇ please continue to refer to FIG. 5, taking the sampling area D as an example, it can be seen that the auxiliary area d corresponding to the sampling area D intersects with the intersecting pixels, that is, the pixels 6, 7, 10, and 11.
  • the intersecting areas of the auxiliary frame d and the intersecting pixels 6 , 7 , 10 , and 11 can be determined, which are respectively denoted as C 6d , C 7d , C 10d and C 11d in this embodiment.
  • steps S52 ⁇ -S53 ⁇ taking the sampling area D as an example, after obtaining C 6d , C 7d , C 10d and C 11d , according to C 6d , C 7d , C 10d and C 11d , and the intersecting pixels 6, 7, 10, and 11 determine the first target pixel value corresponding to the sampling area D, and use the first target pixel value as the pixel value at the center point of the sampling area D.
  • the pixel values of the center point positions corresponding to the sampling areas can be obtained through steps S51 ⁇ -S53 ⁇ , and the details are not repeated here.
  • step S53' that is, according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel, and the pixel value of the respective pixel
  • Determining the first target pixel value corresponding to the sampling area specifically includes the following steps:
  • S531' correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel.
  • S533' Calculate the sum of the intersection area of each pixel to obtain the sum of the area of the first intersecting pixel.
  • S534' Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  • sampling area D in the feature map of the candidate area is taken as an example to describe the embodiment of the present application:
  • step S531 ⁇ after obtaining C 6d, C 7d, C 10d and C 11d, respectively, corresponding to the calculated pixel value of each pixel of C 6d, C 7d, C 10d C 11d and the intersection point of the pixel region corresponding to the sample
  • the product between, that is, the product of C 6d and the pixel value A 6 of pixel 6 is calculated separately, and it is recorded as: A 6 C 6d ; the product of C 7d and the pixel value A 7 of pixel 7 is calculated, and it is recorded as :A 7 C 7d ;
  • Calculate the product of C 10d and the pixel value A 10 of the pixel point 10 denoted as: A 10 C 10d ;
  • step S532 ⁇ after calculating A 6 C 6d , A 7 C 7d , A 10 C 10d and A 11 C 11d , the above products are added to obtain the first product sum, that is, A 6 C 6d + A 7 C 7d +A 10 C 10d +A 11 C 11d .
  • step S533' the sum of the intersecting areas of the auxiliary frame d corresponding to the sampling area D and the respective pixels, and the sum of the areas of the first intersecting pixel, that is, C 6d + C 7d + C 10d + C 11d .
  • step S234' the quotient U D between the first product sum corresponding to the sampling area D and the first intersecting pixel area sum is calculated as shown in the following formula:
  • the sampling area D is taken as an example for description.
  • the calculation can be performed by referring to the first target pixel value calculation method corresponding to D of the sampling area. , I will not repeat it here.
  • the embodiment of the present application also provides specific intersecting pixels based on the target intersecting the sampling area in the feature map of the candidate area. Point, the method for determining the pixel value at the center point of the sampling area:
  • the image processing method further includes the following steps:
  • the preset sampling points are preset configurations.
  • the size of the sampling area divided into each small area in the candidate area according to the preset number of sampling points P may be smaller than the size of the pixel points in the feature map of the candidate area, for example
  • the candidate area is a small target
  • the feature map of the candidate area corresponding to the input image will be relatively small
  • the size of each sample area divided may be smaller than the pixel size of the feature map of the candidate area.
  • sampling areas A, B, C, and D divided by the small area are smaller than the size of the pixel, then according to each intersection of the candidate area feature map and the sampling area
  • the pixel value of the pixel point and the intersection area between the sampling area and each of the intersecting pixel points determine the second target pixel value corresponding to the sampling area.
  • sampling area B as an example, the intersecting pixels that intersect with sampling area B are pixels 5 and 6, respectively.
  • the pixel value of pixel 5 is A 5
  • the pixel value of pixel 6 is A 6
  • sampling area B The intersecting areas with pixels 5 and 6 are denoted as C 5B and C 6B respectively
  • the second target pixel value corresponding to sampling area B is determined according to A 5 , A 6 , C 5B and C 6B
  • sampling area B corresponds to
  • the second target pixel value of is used as the pixel value of the center point of the sampling area B.
  • the pixel values at the positions of the heavy center points of the sampling areas can be determined based on the above-mentioned method, which will not be repeated here.
  • step S100 if it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, the The pixel value of each intersecting pixel point intersected by the sampling area and the intersecting area of the sampling area and each intersecting pixel point are determined to determine the second target pixel value corresponding to the sampling area, which specifically includes the following steps:
  • S103 Calculate the sum of the intersecting areas of the intersecting pixels to obtain a second sum of the areas of the intersecting pixels.
  • S104 Calculate the quotient between the second product and the area of the second intersecting pixel to obtain a second target pixel value corresponding to the sampling area.
  • steps S101-S104 please continue to refer to Fig. 10, taking the sampling area B in the feature map of the candidate area as an example, after obtaining C 5B and C 6B , the pixel value and the pixel value of each pixel in the intersecting pixel are calculated respectively.
  • the product between C 5B and C 6B that is, the product of C 5B and the pixel value A 5 of pixel 5 is calculated respectively, and it is recorded as: A 5 C 5B ; the pixel value A 6 of C 6B and pixel 6 is calculated
  • the product of is recorded as: A 6 C 6B .
  • the product A 5 C 5B of C 5B and the pixel value A 5 of pixel 5 is obtained, and the pixel value A 6 of C 6B and pixel 6 is calculated. After the products A 6 C 6B are multiplied, the products are added to obtain the second product sum, that is, A 5 C 5B + A 6 C 6B .
  • the sum of the intersecting areas of the intersecting pixels corresponding to the sampling area B, that is, the sum of the areas of the first intersecting pixels is: C 5B +C 6B .
  • the quotient between the second product sum and the area of the second intersecting pixel is calculated by specifically sampling the following formula:
  • the sampling area B is taken as an example for description.
  • the calculation can be performed by referring to the second target pixel value calculation method corresponding to B of the sampling area. , I will not repeat it here.
  • an image processing device corresponds to the image processing method in the above-mentioned embodiment one-to-one.
  • the image processing apparatus 10 includes a first acquisition module 101, a first division module 102, a second division module 103, a first determination module 104, a second determination module 105, a third determination module 106, and a second Obtaining module 107.
  • the detailed description of each functional module is as follows:
  • the first acquisition module is used to acquire a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network ,
  • the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm;
  • a first division module is configured to combine the data obtained by the first obtaining module according to a preset candidate area pooling parameter
  • the candidate region feature map is divided into NxM small regions of equal size, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include a width parameter and a length parameter for pooling processing;
  • the second dividing module is configured to divide the small areas divided by each of the first dividing modules into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2;
  • a determining module configured to determine the intersection pixel points in the candidate region feature map acquired by the first
  • the image processing device further includes a fourth determining module and a generating module
  • the fourth determining module is configured to determine before the second determining module determines the pixel value at the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area Whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area; the generating module is configured to determine that the size of the sampling area is greater than or equal to the candidate area if the fourth determining module determines For the size of the pixel points of the image, the auxiliary frame corresponding to the sampling area is generated with the center point position of the sampling area as the center. The shape of the auxiliary frame is the same as the pixel points of the feature map of the candidate area.
  • the size of the frame is smaller than or equal to the pixel size of the feature map of the candidate area; the second determining module is specifically configured to: according to the intersection pixel point of the feature map of the candidate area that intersects the sampling area, and The auxiliary frame corresponding to the sampling area determines the pixel value of the center point position of the sampling area.
  • the second determining module is configured to determine the center of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area
  • the pixel value of the point position includes: the second determining module is configured to: obtain the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point in the intersecting pixel; according to the auxiliary frame corresponding to the sampling area The intersection area with each pixel point and the pixel value of each pixel point determine the first target pixel value corresponding to the sampling area; take the first target pixel value corresponding to the sampling area as the The pixel value of the center point of the sampling area.
  • the second determining module is configured to determine the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point and the pixel value of each pixel point
  • the first target pixel value corresponding to the area includes: the second determining module is specifically configured to: respectively correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel; Add the product between the pixel value of the pixel and the intersection area of each pixel to obtain a first product sum; calculate the sum of the intersection area of each pixel to obtain the first intersecting pixel area sum; Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  • the image processing device further includes a fifth determining module: the fifth determining module: after determining whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area If it is determined that the size of the sampling area is smaller than the size of the pixel of the feature map of the candidate area, then the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area is determined, and the sampling The intersecting area of the area and each of the intersecting pixel points determines the second target pixel value corresponding to the sampling area; taking the second target pixel value corresponding to the sampling area as the pixel value at the center point of the sampling area .
  • the fifth determining module is used for the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area, and the difference between the sampling area and each intersecting pixel.
  • the intersection area determines the second target pixel value corresponding to the sampling area, including: a fifth determining module is configured to: respectively calculate the pixel value of each target intersection pixel point, and the intersection of each target intersection pixel point The product between the areas; adding the pixel value of each intersecting pixel to the product of the intersecting area of each intersecting pixel to obtain a second product sum; calculating the intersection of each intersecting pixel The sum of the areas is used to obtain the area sum of the second intersecting pixel; the quotient between the second product sum and the area of the second intersecting pixel is calculated to obtain the second target pixel value corresponding to the sampling area.
  • Each module in the above-mentioned image processing device may be implemented in whole or in part by software, hardware, and a combination thereof.
  • the foregoing modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the foregoing modules.
  • a computer device is provided.
  • the computer device may be a server, and its internal structure diagram may be as shown in FIG. 13.
  • the computer equipment includes a processor, a memory, a network interface and a database connected by a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities.
  • the memory of the computer device includes a non-volatile storage medium and an internal memory.
  • the non-volatile storage medium stores an operating system, computer readable instructions, and a database.
  • the internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium.
  • the database of the computer equipment is used to store feature maps and so on.
  • the network interface of the computer device is used to communicate with an external terminal through a network connection.
  • the computer-readable instructions are executed by the processor to realize an image processing method.
  • a computer device including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and the processor executes the computer-readable
  • the instruction is implemented as follows: Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to the feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network,
  • the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; dividing the candidate area feature map into NxM small areas of equal size according to preset candidate area pooling parameters, N and M are positive integers and greater than or equal to 1, the preset candidate area pooling parameters include a width parameter and a length parameter used for pooling processing; each of the small areas is divided equally into P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the feature map of the candidate area; according to the feature map of the candidate area and the sampling area Determine the
  • one or more non-volatile readable storage media storing computer readable instructions are provided.
  • the computer readable instructions are executed by one or more processors, the one or more Each processor performs the following steps: Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network
  • the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; the candidate area feature map is divided into NxM small areas of equal size according to a preset candidate area pooling parameter, so
  • the N and M are positive integers and greater than or equal to 1
  • the preset candidate area pooling parameters include a width parameter and a length parameter for pooling processing; each of the small areas is divided equally according to the preset number of sampling points P Are P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the candidate area feature map;
  • Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory.
  • ROM read only memory
  • PROM programmable ROM
  • EPROM electrically programmable ROM
  • EEPROM electrically erasable programmable ROM
  • Volatile memory may include random access memory (RAM) or external cache memory.
  • RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
  • Module completion means dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Image Analysis (AREA)

Abstract

An image processing method and apparatus, and a computer device and a storage medium, by which a pixel value calculation result of a finally obtained feature map having a fixed size is relatively accurate. The method section comprises: obtaining a candidate region feature map; dividing the candidate region feature map into NxM small regions having the same size according to preset candidate region pooling parameters; averagely dividing each small region into P sampling regions according to a preset sampling point number P; determining intersected pixel points intersected with each sampling region in the candidate region feature map (S40); determining a pixel value of a central point position of the sampling region according to the intersected pixel points; correspondingly determining the pixel value of each small region according to the pixel value of the central point position of each sampling region corresponding to the small region (S60); and obtaining the candidate region feature map having a fixed size according to the pixel value of each small region (S70).

Description

图像处理方法、装置、计算机设备及存储介质Image processing method, device, computer equipment and storage medium 技术领域Technical field
本申请涉及图像处理领域,尤其涉及一种图像处理方法、装置、计算机设备及存储介质。This application relates to the field of image processing, and in particular to an image processing method, device, computer equipment and storage medium.
本申请以2019年1月24日提交的申请号为201910067366.3,名称为“图像处理方法、This application is based on the application number 201910067366.3 submitted on January 24, 2019, and the name is "Image processing methods, 装置、计算机设备及存储介质”的中国发明专利申请为基础,并要求其优先权。"Devices, Computer Equipment and Storage Media" are based on the Chinese invention patent application and claim priority.
背景技术Background technique
在图像处理领域,通常需要检测并分析某块区域,称为目标检测。在目标检测领域的众多框架中,比如Fast-RCNN,Faster-RCNN,RFCN,ROI Pooling的作用是根据预选框Region Proposal,也即候选区域的位置坐标在特征图中将相应区域池化为固定尺寸的特征图,以便进行后续的分类操作。但是Region Proposal的位置信息是由模型回归得到的,得到的位置信息一般是浮点数的形式,而池化操作的特征图要求尺寸固定。故将相应区域池化为固定尺寸的特征图这一过程存在两次取整量化的操作:(1)将Region Proposal边界取整量化为整数点坐标值。(2)将取整量化后的边界区域平均分割成N xN个单元,对每一个单元的边界再进行取整量化。In the field of image processing, it is usually necessary to detect and analyze a certain area, which is called target detection. Among the many frameworks in the field of target detection, such as Fast-RCNN, Faster-RCNN, RFCN, and ROI Pooling's role is to pool the corresponding area into a fixed size in the feature map according to the preselected box Region Proposal, that is, the position coordinates of the candidate area Feature map for subsequent classification operations. However, the position information of Region Proposal is obtained by model regression, and the obtained position information is generally in the form of floating-point numbers, and the feature map of the pooling operation requires a fixed size. Therefore, the process of pooling the corresponding region into a fixed-size feature map involves two rounding and quantization operations: (1) Rounding and quantizing the Region Proposal boundary to integer point coordinate values. (2) The boundary area after rounding and quantization is equally divided into N xN units, and the boundary of each unit is rounded and quantized.
然而经过两次取整量化,得到的候选框已经和最开始回归出来的位置有一定的偏差,这个偏差会影响检测或者分割的准确度,称为区域不匹配问题(misalignment)。传统的方案中,是取消取了上述的整量化操作,使用双线性内插的方法获得坐标为浮点数的像素点上的图像数值,从而将整个特征聚集过程转化为一个连续的操作,具体是:先通过双线性内插值算法计算出“红叉点”,并依据红叉点的旁边固定位置,例如统一红叉点所在的像素点的左下、右下像素点计算出红叉点所在的像素的位置,然而,红叉点的旁边固定位置所在的像素点的像素不一定是接近红叉点所在的像素点的像素,因此可能会导致计算出来的结果存在一定的误差。However, after two rounding and quantification, the obtained candidate frame has a certain deviation from the initial regression position. This deviation will affect the accuracy of detection or segmentation, which is called a region misalignment problem (misalignment). In the traditional scheme, the above-mentioned integer quantization operation is cancelled, and the image value on the pixel with the floating point coordinate is obtained by the bilinear interpolation method, so as to transform the entire feature aggregation process into a continuous operation. Yes: First calculate the "red cross point" through the bilinear interpolation algorithm, and calculate the red cross point based on the fixed position beside the red cross point, for example, the lower left and lower right pixels of the pixel where the unified red cross point is located However, the pixel at the fixed position next to the red cross point is not necessarily the pixel close to the pixel point where the red cross point is located, so it may cause a certain error in the calculated result.
发明内容Summary of the invention
本申请提供了一种图像处理方法、装置、计算机设备及存储介质,使得最后得到的固定尺寸的特征图的像素值计算结果比较准确。This application provides an image processing method, device, computer equipment, and storage medium, so that the pixel value calculation result of the fixed-size feature map finally obtained is more accurate.
一种图像处理方法,包括:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;确定出所述候选区域特征图中与所述采样区域相交的相交像素点;根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。An image processing method, comprising: obtaining a feature map of a candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is a feature extraction method of an input image through a convolutional neural network It is obtained that the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; dividing the candidate area feature map into NxM small areas of equal size according to a preset candidate area pooling parameter, The N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include a width parameter and a length parameter for pooling processing; each of the small regions is averaged according to the preset number of sampling points P Divide into P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the candidate area feature map; according to the candidate area feature map and the The pixel value at the center point of the sampling area is determined by the intersecting pixel points where the sampling area intersects; according to the pixel value at the center point of each sampling area corresponding to the small area, each of the The pixel value of a small area; obtaining a fixed-size feature map of the candidate area according to the pixel value of each small area.
一种图像处理装置,包括:第一获取模块,用于获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标 区域检测所得到;第一划分模块,用于根据预设候选区域池化参数将所述第一获取模块获取的所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;第二划分模块,用于根据预设采样点数P将每个所述第一划分模块划分得到的所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;第一确定模块,用于确定出所述第一获取模块获取的所述候选区域特征图中与所述第二划分模块划分得到的所述采样区域相交的相交像素点;第二确定模块,用于根据所述第一确定模块确定的所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;第三确定模块,用于根据所述第二确定模块确定的所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;第二获取模块,用于根据所述第三确定模块确定的每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。An image processing device includes: a first acquisition module, configured to acquire a feature map of a candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is obtained through a convolutional neural network The input image is obtained by feature extraction, and the candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; the first division module is configured to group the candidate area according to preset candidate area pooling parameters The candidate region feature map acquired by the first acquiring module is divided into NxM small regions of the same size, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include those used for pooling Processed width parameters and length parameters; the second division module is configured to divide the small areas divided by each of the first division modules into P sampling areas on average according to the preset number of sampling points P, where P is positive An integer and greater than or equal to 2; a first determining module, configured to determine the intersecting pixels in the candidate region feature map acquired by the first acquiring module and the sampling region divided by the second dividing module The second determining module is configured to determine the pixel value of the center point of the sampling area according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area determined by the first determining module; Third, a determining module, configured to determine the pixel value of each small area corresponding to the pixel value of the center point of each sampling area corresponding to the small area determined by the second determining module; second An obtaining module is configured to obtain a fixed-size feature map of the candidate area according to the pixel value of each small area determined by the third determining module.
一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;确定出所述候选区域特征图中与所述采样区域相交的相交像素点;根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor. When the processor executes the computer-readable instructions, the following steps are implemented: obtaining candidates The feature map of the region, the feature map of the candidate region is obtained by mapping the candidate region to the feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, and the candidate region is based on a preset The target detection algorithm is obtained by detecting the target area of the input image; according to the preset candidate area pooling parameter, the candidate area feature map is divided into NxM small areas of equal size, where N and M are positive integers and greater than Or equal to 1, the preset candidate area pooling parameters include width parameters and length parameters for pooling processing; each small area is divided into P sampling areas on average according to the preset number of sampling points P, Is a positive integer and greater than or equal to 2; determine the intersection pixel point that intersects the sampling area in the candidate area feature map; determine the intersection pixel point that intersects the sampling area in the candidate area feature map The pixel value at the center point of the sampling area; according to the pixel value at the center point of each sampling area corresponding to the small area, the pixel value of each small area is correspondingly determined; The pixel value of the small area obtains the feature map of the candidate area of a fixed size.
一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;确定出所述候选区域特征图中与所述采样区域相交的相交像素点;根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。One or more non-volatile readable storage media storing computer readable instructions. When the computer readable instructions are executed by one or more processors, the one or more processors perform the following steps: obtaining A feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to a feature map, the feature map is obtained by feature extraction of an input image through a convolutional neural network, and the candidate area is based on a prediction Suppose that a target detection algorithm performs target region detection on the input image; according to preset candidate region pooling parameters, the candidate region feature map is divided into NxM small regions of equal size, where N and M are positive integers and Greater than or equal to 1, the preset candidate area pooling parameters include a width parameter and a length parameter for pooling processing; each small area is divided into P sampling areas on average according to the preset number of sampling points P, P is a positive integer and greater than or equal to 2; determine the intersection pixel point that intersects the sampling area in the candidate area feature map; determine the intersection pixel point that intersects the sampling area in the candidate area feature map The pixel value at the center point of the sampling area is determined; according to the pixel value at the center point of each sampling area corresponding to the small area, the pixel value of each small area is correspondingly determined; according to each The pixel value of the small area obtains the candidate area feature map of a fixed size.
本申请的一个或多个实施例的细节在下面的附图和描述中提出,本申请的其他特征和优点将从说明书、附图以及权利要求变得明显。The details of one or more embodiments of the present application are presented in the following drawings and descriptions, and other features and advantages of the present application will become apparent from the description, drawings and claims.
附图说明Description of the drawings
为了更清楚地说明本申请的技术方案,下面将对本申请的描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solution of the present application more clearly, the following will briefly introduce the drawings that need to be used in the description of the present application. Obviously, the drawings in the following description are only some embodiments of the present application. Ordinary technicians can also obtain other drawings based on these drawings without creative labor.
图1是本申请中图像处理方法所应用的系统框架图一个示意图;Fig. 1 is a schematic diagram of a system frame diagram applied by the image processing method in this application;
图2是本申请中图像处理方法一个实施例流程示意图;2 is a schematic flowchart of an embodiment of the image processing method in this application;
图3是本申请中候选区域特征图的采样区域的一个示意图;3 is a schematic diagram of the sampling area of the feature map of the candidate area in the present application;
图4是本申请中图像处理方法另一实施例流程示意图;4 is a schematic flowchart of another embodiment of the image processing method in this application;
图5是本申请中候选区域特征图的采样区域的另一示意图;5 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application;
图6是本申请中候选区域特征图的采样区域的另一示意图;6 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application;
图7是本申请中图像处理方法另一实施例流程示意图;FIG. 7 is a schematic flowchart of another embodiment of the image processing method in this application;
图8是本申请中图像处理方法另一实施例流程示意图;FIG. 8 is a schematic flowchart of another embodiment of the image processing method in this application;
图9是本申请中图像处理方法另一实施例流程示意图;9 is a schematic flowchart of another embodiment of the image processing method in this application;
图10是本申请中候选区域特征图的采样区域的另一示意图;10 is another schematic diagram of the sampling area of the feature map of the candidate area in the present application;
图11是本申请中图像处理方法另一实施例流程示意图;FIG. 11 is a schematic flowchart of another embodiment of the image processing method in this application;
图12是本申请中图像处理装置一个实施例结构示意图;FIG. 12 is a schematic structural diagram of an embodiment of an image processing device in this application;
图13是本申请中计算机设备的一个实施例结构示意图。Fig. 13 is a schematic structural diagram of an embodiment of a computer device in this application.
具体实施方式detailed description
下面将结合本申请中的附图,对本申请中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in this application will be clearly and completely described below in conjunction with the drawings in this application. Obviously, the described embodiments are part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
本申请实施例提供图像处理方法,可应用在如图1的系统框架示意图中,服务器可对输入图像执行本图像处理方法所提供的处理方式,以获得处理后得到固定尺寸的特征图,其中,服务器可以用独立的服务器或者是多个服务器组成的服务器集群来实现,下面对本申请实施例进行详细介绍:The embodiment of the application provides an image processing method, which can be applied to the system framework diagram shown in Fig. 1. The server can execute the processing method provided by this image processing method on the input image to obtain a fixed-size feature map after processing, where: The server can be implemented by an independent server or a server cluster composed of multiple servers. The following describes the embodiments of the present application in detail:
在一实施例中,如图2所示,提供一种图像处理方法,包括如下步骤:In an embodiment, as shown in FIG. 2, an image processing method is provided, including the following steps:
S10:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到。S10: Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, the candidate region It is obtained by performing target area detection on the input image according to a preset target detection algorithm.
本申请实施例应用于基于区域的卷积神经网络(Regions with Convolutional Neural Networks,RCNN)中,包括但不局限于Fast RCNN、Faster RCNN以及基于区域的完整卷积网络(Region-based Fully Convolutional Networks,RFCN)。可以理解,在上述基于区域的卷积神经网络中,均会输入图像进行卷积层、池化层、感兴趣区域池化层(region of interest pooling layer,ROI pooling)以及全连接层的处理过程,而本申请实施例是在获得输入图像的候选区域(Region Proposal),以及输入图像的特征图(也即feature maps)后,ROI pooling层进行处理。The embodiments of this application are applied to Regions with Convolutional Neural Networks (RCNN), including but not limited to Fast RCNN, Faster RCNN, and Region-based Fully Convolutional Networks (Region-based Fully Convolutional Networks, RFCN). It can be understood that in the above-mentioned region-based convolutional neural network, the input image is processed by the convolutional layer, the pooling layer, the region of interest pooling layer (ROI pooling), and the fully connected layer. However, in the embodiment of the present application, after obtaining the candidate region (Region Proposal) of the input image and the feature maps (ie feature maps) of the input image, the ROI pooling layer performs processing.
其中,候选区域,就是预先找出输入图像的中目标可能出现的位置,也即感兴趣区域,通过利用输入图像中的纹理、边缘、颜色等特征信息,检测出输入图像的候选区域,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到,其中,具体地,在Fast RCNN中,预设目标检测算法具体是采用选择性搜索(selective search)直接对输入图像进行候选区域的提取,而在Faster RCNN中,预设目标检测算法是指对输入图像进行卷积层的特征提取得到特征图后,再经过区域候选网络(region proposal network,PRN)对特征图进行候选区域的提取,具体过程这里不展开赘述,但可以理解的是,经过上述预设目标检测算法的处理,均可获得输入图像对应的候选区域。Among them, the candidate area is to find out the possible position of the target in the input image in advance, that is, the area of interest, and detect the candidate area of the input image by using feature information such as texture, edge, and color in the input image. The candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm. Specifically, in Fast RCNN, the preset target detection algorithm specifically uses selective search to directly perform a selective search on the input image. Carry out the extraction of candidate regions, and in Faster RCNN, the preset target detection algorithm refers to the feature map of the input image after the feature extraction of the convolutional layer, and then through the regional candidate network (region proposal network, PRN) to perform the feature map The specific process of the extraction of the candidate area will not be described here, but it is understandable that the candidate area corresponding to the input image can be obtained through the processing of the aforementioned preset target detection algorithm.
候选区域特征图,是指将候选区域映射至输入图像所对应的特征图后所得到的图像,可以理解,输入图像在输入基于区域的卷积神经网络的卷积层后,经过卷积层的特征提取处理,可以得到输入对象对应的特征图,具体卷积层的特征提取过程,这里也不展开赘述。在本申请实施例中,可获取到输入图像对应的候选区域特征图,需要说明的是,本申请实施例所指的候选区域特征图泛指输入图像对应的每一张候选区域特征图,为了便于描述,下文将一张候选区域特区图对本申请实施例所提出的图像处理方法进行描述。The feature map of the candidate region refers to the image obtained after mapping the candidate region to the feature map corresponding to the input image. It can be understood that the input image passes through the convolutional layer after inputting the convolutional layer of the region-based convolutional neural network. The feature extraction process can obtain the feature map corresponding to the input object. The specific feature extraction process of the convolutional layer will not be repeated here. In the embodiment of this application, the candidate region feature map corresponding to the input image can be obtained. It should be noted that the candidate region feature map referred to in the embodiment of this application generally refers to each candidate region feature map corresponding to the input image. For ease of description, the following describes the image processing method proposed in the embodiment of the present application with a SAR map of the candidate area.
S20:根据预设候选区域池化参数对所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括宽度参数以及长度参数。S20: Divide the candidate region feature map into NxM small regions of equal size according to preset candidate region pooling parameters, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters Including width parameters and length parameters.
其中,预设候选区域池化参数为ROI pooling层中预设的参数,上述预设候选区域池化参数旨在将候选区域特征图池化为固定尺寸的特征图的参数,具体地址包括宽度参数(pooled-h)以及长度参数(pooled-w)。Among them, the preset candidate region pooling parameters are preset parameters in the ROI pooling layer. The above preset candidate region pooling parameters are intended to pool the candidate region feature maps into parameters of fixed-size feature maps, and the specific address includes the width parameter (pooled-h) and length parameter (pooled-w).
示例性的,假设输入图像的尺寸为800*800,基于区域的卷积神经网络卷积层使用的是VGG16网络,feat-stride=32(表示输入图像经过卷积层处理后图片缩小为原图的1/32),也即经过VGG16网络处理后得到输入图像对应的特征图为25*25,假定输入图像有一候选区域,大小为665*665,则该候选区域映射至特征图后,得到的候选区域特征图的大小为:665/32≈20.78,即候选区域特征图为:20.78*20.78。需要说明得到是,在本申请实施例中,在ROI pooling层处理过程中,为了便于描述,通常保留小数点两位浮点数,出现浮点数时均保留小数点后两位浮点数进行说明。假设预设候选区域池化参数的宽度参数和长度参数分别为:pooled-h=7,pooled-w=7,则该候选区域特征图经过处理后固定成7*7大小的特征图,换句话说的,将在特征图上映射的20.78*20.78的候选区域划分为7*7=49个大小等同的小区域,每个小区域的大小为20.78/7≈2.97,也就是说每个小区域均为2.97*2.97。Exemplarily, assuming that the size of the input image is 800*800, the convolutional layer of the region-based convolutional neural network uses the VGG16 network, feat-stride=32 (indicating that the input image is reduced to the original image after the convolutional layer is processed 1/32), that is, the feature map corresponding to the input image obtained after VGG16 network processing is 25*25. Assuming that the input image has a candidate area with a size of 665*665, then the candidate area is mapped to the feature map, and the result is The size of the feature map of the candidate area is: 665/32≈20.78, that is, the feature map of the candidate area is: 20.78*20.78. It should be noted that, in the embodiment of the present application, in the ROI pooling layer processing process, for ease of description, two decimal points are usually reserved for floating-point numbers, and two floating-point numbers after the decimal point are reserved for description when floating-point numbers appear. Assuming that the width parameter and length parameter of the preset candidate region pooling parameters are: pooled-h=7, pooled-w=7, then the candidate region feature map is fixed into a 7*7 feature map after processing, in other words In other words, the 20.78*20.78 candidate area mapped on the feature map is divided into 7*7=49 small areas of equal size, and the size of each small area is 20.78/7≈2.97, which means that each small area Both are 2.97*2.97.
需要说明的是,在本申请实施例中,根据预设候选区域池化参数对所述候选区域特征图划分为NxM个大小等同的小区域中,N、M的具体大小由实际应用需求所配置,与预设候选区域池化参数相关,本申请实施例不做具体限定,示例性的,上述N、M还可以分别为8,此时,该候选区域特征图经过处理后固定成8*8大小的特征图。It should be noted that in this embodiment of the application, the candidate region feature map is divided into NxM small regions of equal size according to preset candidate region pooling parameters, and the specific sizes of N and M are configured by actual application requirements. , Related to the preset candidate region pooling parameters, the embodiment of this application does not specifically limit it. For example, the above N and M may also be 8 respectively. At this time, the candidate region feature map is fixed to 8*8 after processing The size of the feature map.
S30:根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2。S30: Divide each small area into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2.
本申请实施例中,在根据预设候选区域池化参数对所述候选区域特征图划分为NxM个大小等同的小区域之后,对于每个小区域作如下处理:根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2,所述采样区域与所述候选区域特征图的像素点的形状类型相同。其中,该预设采样点数P为用于计算出每个小区域的像素值的所设置的采样点数,示例性的,该预设采样点数可以为4、8等,具体本申请实施例不做限定。如前述所示,示例性的,在特征图上映射得到的20.78*20.78的候选区域划分为7*7≈49个大小等同的小区域后,若预设采样点数P为4,则将49个大小等同的小区域中的每个小区域平均划分为4个采样区域。In this embodiment of the application, after the candidate area feature map is divided into NxM small areas of equal size according to the preset candidate area pooling parameters, the following processing is performed for each small area: according to the preset number of sampling points P Each of the small areas is divided into P sampling areas on average, where P is a positive integer and greater than or equal to 2, and the sampling areas have the same shape type as the pixel points of the feature map of the candidate area. Wherein, the preset number of sampling points P is the set number of sampling points used to calculate the pixel value of each small area. Illustratively, the preset number of sampling points may be 4, 8, etc., which is not specifically described in the embodiment of this application. limited. As shown above, exemplarily, after the 20.78*20.78 candidate area obtained by mapping on the feature map is divided into 7*7≈49 small areas of the same size, if the preset number of sampling points P is 4, then 49 Each of the small areas of the same size is divided into 4 sampling areas on average.
S40:确定出所述候选区域特征图中与所述采样区域相交的相交像素点。S40: Determine an intersection pixel point that intersects the sampling area in the feature map of the candidate area.
在经过步骤30,根据预设采样点数P将每个所述小区域平均划分为P个采样区域后,确定出所述候选区域特征图中与所述采样区域相交的相交像素点。也即在经过步骤S30处理得到每个小区域对应的每个采样区域之后,确定出候选区域特征图中,分别与每个采样区域相交的相交像素点。需要说明的是,具体地,可通过每个采样区域的坐标位置以及候选区域特区图的像素点的坐标位置确定出与每个采样区域相交的相交像素点。示例性的,可以理解,经过步骤S40,可得到每个小区域对应的4个采样区域的中心点位置,以及与该小区域对应的4个采样区域相交的相交像素点。After step 30, each small area is divided into P sampling areas on average according to the preset number of sampling points P, and then the intersecting pixel points that intersect the sampling area in the feature map of the candidate area are determined. That is, after each sampling area corresponding to each small area is obtained through the processing of step S30, the intersecting pixel points that respectively intersect each sampling area in the feature map of the candidate area are determined. It should be noted that, specifically, the intersecting pixel points that intersect each sampling area can be determined by the coordinate position of each sampling area and the coordinate position of the pixel point of the candidate area SAR map. Exemplarily, it can be understood that, after step S40, the center point positions of the 4 sampling areas corresponding to each small area and the intersection pixel points that intersect the 4 sampling areas corresponding to the small area can be obtained.
示例性的,如图3所示:其中,图3所示的区域只是候选区域特征图中的一部分区域,包括1-16个像素点,粗线方框A、B、C以及D所在的区域为采样区域,采样区域A、B、C以及D构成一小区域。以采样区域A为例,可通过双线性差插值算法确定出采样区域A的中心点位置,并确定出与所述采样区域A相交的相交像素点,也就是像素点1、2、5以及6。对于每个小区域对应的采样区域,均可确定出中心点位置和与采样区域相交的相交像素点,例如,对于采样区域B,与采样区域B存在相交的目标像素点为2、3、6以及7。Exemplarily, as shown in Fig. 3: Among them, the area shown in Fig. 3 is only a part of the feature map of the candidate area, including 1-16 pixels, and the area where the thick-lined boxes A, B, C and D are located As the sampling area, sampling areas A, B, C, and D constitute a small area. Taking sampling area A as an example, the position of the center point of sampling area A can be determined by the bilinear difference interpolation algorithm, and the intersecting pixels that intersect with the sampling area A, namely pixels 1, 2, 5, and 6. . For the sampling area corresponding to each small area, the center point position and the intersection pixel point that intersects the sampling area can be determined. For example, for sampling area B, the target pixel points that intersect with sampling area B are 2, 3, and 6. And 7.
S50:根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采 样区域的中心点位置的像素值。S50: Determine the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area.
在确定所述采样区域的中心点位置,以及确定出所述候选区域特征图中与所述采样区域相交的相交像素点之后,根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值。如图3所示,以采样区域A为例,可根据与采样区域A存在相交的相交像素点1、2、3以及4确定出该采样区域A的中心点位置的像素值。可以理解,基于同样的计算方式,可分别获得每个小区域对应的每个采样区域的中心点位置的像素值。After determining the position of the center point of the sampling area, and determining the intersection pixel point that intersects the sampling area in the feature map of the candidate area, according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area Point, the pixel value of the center point of the sampling area is determined. As shown in FIG. 3, taking the sampling area A as an example, the pixel value at the center point of the sampling area A can be determined according to the intersecting pixels 1, 2, 3, and 4 that intersect the sampling area A. It can be understood that based on the same calculation method, the pixel value of the center point position of each sampling area corresponding to each small area can be obtained respectively.
S60:根据每个所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值。S60: Correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to each small area.
经过步骤S50后,可获得候选区域中,每个小区域的采样区域的中心点位置的像素值,本申请实施例中,根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值。如图3所示,可分别获得小区域对应的采样区域A、B、C以及D的中心点位置的像素值,对应确定出图3中,由采样区域A、B、C以及D所构成的小区域的像素值。基于同样的计算方式,可获得每个小区域对应的像素值,这里不一一举例说明。After step S50, the pixel value of the center point position of the sampling area of each small area in the candidate area can be obtained. In the embodiment of the present application, the pixel value of the center point position of each sampling area corresponding to the small area is obtained. The pixel value corresponds to the pixel value of each small area. As shown in Figure 3, the pixel values of the center point positions of the sampling areas A, B, C, and D corresponding to the small areas can be obtained, respectively, and the correspondingly determined in Figure 3, composed of the sampling areas A, B, C, and D The pixel value of a small area. Based on the same calculation method, the pixel value corresponding to each small area can be obtained, which will not be illustrated here.
S70:根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。S70: Obtain a fixed-size feature map of the candidate area according to the pixel value of each small area.
在根据每个所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值之后,根据每个所述小区域的像素值获得处理后固定尺寸的所述候选区域特征图。如前述所示,示例性的,在将在特征图上映射的20.78*20.78的候选区域划分为7*7≈49个大小等同的小区域后,每个小区域为2.97*2.97,经过前面步骤S10-S70,的获得每个小区域对应的像素值,得到49个小区域的像素值,从而输出得到7*7的候选区域特征图,因此,可输出得到固定尺寸的7*7的候选区域特征图,该固定尺寸的候选区域特征图可用于基于区域的卷积神经网络后续的分类和回归处理。可见,本申请实施例提供了一种图像处理方法,根据采样区域与像素点的相交关系,有效地保证了将相应候选区域池化为固定尺寸的特征图,且由于是根据相交像素点进行计算,最终采样区域的像素值的计算结果较为准确,使得最后得到的固定尺寸的特征图的像素值计算结果也比较准确。After the pixel value of each small area is correspondingly determined according to the pixel value of the center point position of each sampling area corresponding to each small area, the processing is obtained according to the pixel value of each small area A feature map of the candidate area with a fixed size afterwards. As shown above, exemplarily, after dividing the 20.78*20.78 candidate area mapped on the feature map into 7*7≈49 small areas of the same size, each small area is 2.97*2.97, after the previous steps S10-S70, get the pixel value corresponding to each small area, get the pixel value of 49 small areas, and output a 7*7 candidate area feature map. Therefore, you can output a fixed size 7*7 candidate area Feature map, the fixed-size candidate region feature map can be used for subsequent classification and regression processing of the region-based convolutional neural network. It can be seen that the embodiment of the present application provides an image processing method, which effectively ensures that the corresponding candidate area is pooled into a fixed-size feature map according to the intersection relationship between the sampling area and the pixel, and because the calculation is performed based on the intersection pixel , The calculation result of the pixel value of the final sampling area is more accurate, so that the finally obtained pixel value calculation result of the fixed-size feature map is also more accurate.
需要说明的是,结合上述实施例,根据采样区域与候选区域特征图中的像素点的尺寸的大小关系,本申请实施例还提供了具体根据所述候选区域特征图中与所述采样区域相交的目标相交像素点,确定出所述采样区域的中心点位置的像素值的实现方式,下面分别进行介绍:It should be noted that, in combination with the foregoing embodiment, according to the size relationship between the sampling area and the pixel point in the feature map of the candidate area, the embodiment of the present application also provides specific information based on the intersection of the candidate area feature map and the sampling area. The method for determining the pixel value of the center point position of the sampling area by the target intersecting pixel point is described below:
在一实施例中,如图4所示,步骤50之前,也即所述根据所述候选区域特征图中与所述采样区域相交的目标相交像素点,确定出所述采样区域的中心点位置的像素值之前,所述方法还包括如下步骤:In one embodiment, as shown in FIG. 4, before step 50, that is, according to the target intersecting pixel points that intersect the sampling area in the feature map of the candidate area, the center point position of the sampling area is determined Before the pixel value of, the method further includes the following steps:
S80:判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸。S80: Determine whether the size of the sampling area is greater than or equal to the size of the pixel of the feature map of the candidate area.
在步骤S30中,根据预设采样点数P将每个所述小区域平均划分为P个采样区域后,会判断每个小区域的采样区域是否大于或等于候选区域特征图中的像素点的尺寸,可以理解,由于输入图像的每个像素点的尺寸都是一致的,因此,映射得到的候选区域特征图的每个像素点的尺寸也是一样的,又因为划分得到的候选区域的每个采样区域的大小等同,因此可以判断任意一个采样区域是否大于或等于候选区域特征图的任意一个像素点的尺寸。In step S30, after each small area is divided into P sampling areas on average according to the preset number of sampling points P, it is determined whether the sampling area of each small area is greater than or equal to the size of the pixel point in the feature map of the candidate area It can be understood that since the size of each pixel of the input image is the same, the size of each pixel in the feature map of the candidate area obtained by mapping is also the same, and because each sample of the candidate area obtained by the division The sizes of the regions are the same, so it can be judged whether any sample region is greater than or equal to the size of any pixel in the feature map of the candidate region.
S90:若判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸。S90: If it is determined that the size of the sampling area is greater than or equal to the size of the pixel point of the candidate area image, generate an auxiliary frame corresponding to the sampling area with the center point position as the center, and the shape of the auxiliary frame is the same as The pixels of the feature map of the candidate area are the same, and the size of the auxiliary frame is smaller than or equal to the size of the pixels of the feature map of the candidate area.
当判断采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述中心 点位置为中心生成所述采样区域对应的辅助框,也就是说,对于每个小区域的每个采样区域,都会生成对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸。需要说明的是,本申请实施例中可采用双线性插值法确定出每个采样区域的中心点位置,具体地计算每个采样区域的中心点位置的过程这里不展开赘述。When it is determined that the size of the sampling area is greater than or equal to the pixel size of the candidate area image, then the auxiliary frame corresponding to the sampling area is generated with the center point position as the center, that is, for each small area Each sampling area will generate a corresponding auxiliary frame. The shape of the auxiliary frame is the same as the pixel of the feature map of the candidate area, and the size of the auxiliary frame is smaller than or equal to the size of the pixel of the feature map of the candidate area . It should be noted that the bilinear interpolation method may be used in the embodiment of the present application to determine the center point position of each sampling area, and the process of specifically calculating the center point position of each sampling area will not be repeated here.
示例性的,请参阅图5,由图5可知,采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸(图5中以辅助框和像素点大小一致为例进行说明),以采样区域D为例,本申请实施例中,可以以采样区域D的中心点位置生成该采样区域D对应的辅助框d,如图5中斜杠方框所示,其中,该采样区域D对应的辅助框d的形状与所述候选区域特征图的像素点相同,采样区域D对应的辅助框d的尺寸大小则小于或等于所述候选区域特征图的像素点。需要说明的是,这里只是以采样区域D为例进行说明,对于生成候选区域特征图中每个小区域的每个采样区域,通过本申请实施例,均可生成对应的辅助框。Exemplarily, please refer to FIG. 5. As can be seen from FIG. 5, the size of the sampling area is greater than or equal to the size of the pixel of the candidate area image (in FIG. 5, the auxiliary frame and the pixel are the same size as an example). Take the sampling area D as an example. In the embodiment of the present application, the auxiliary frame d corresponding to the sampling area D can be generated at the center point of the sampling area D, as shown by the slash box in FIG. 5, where the sampling area D corresponds to The shape of the auxiliary frame d is the same as the pixels of the feature map of the candidate area, and the size of the auxiliary frame d corresponding to the sampling area D is smaller than or equal to the pixels of the feature map of the candidate area. It should be noted that the sampling area D is only taken as an example for description. For each sampling area of each small area in the candidate area feature map, the corresponding auxiliary frame can be generated through the embodiment of the present application.
步骤S50中,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值,具体包括如下步骤:In step S50, the determining the pixel value of the center point position of the sampling area according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area specifically includes the following steps:
S50`:根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值。S50': Determine the pixel value at the center point of the sampling area according to the intersecting pixels that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area.
在生成候选区域特征图中每个采样区域的辅助框后,根据所述候选区域特征图中与所述采样区域相交的相交像素点以及对应的辅助框,确定出所述采样区域的中心点位置的像素值。After the auxiliary frame of each sampling area in the feature map of the candidate area is generated, the center point position of the sampling area is determined according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the corresponding auxiliary frame The pixel value.
为了便于理解,请再参阅图5,以采样区域D为例,可见,与采样区域D存在相交的相交像素点为像素点6、7、10以及11,采样区域D对应的辅助框为辅助框d,则根据像素点6、7、10、11,以及辅助框d确定出该采样区域D的中心点位置的像素值。For ease of understanding, please refer to Figure 5 again. Taking sampling area D as an example, it can be seen that the intersecting pixels that intersect with sampling area D are pixels 6, 7, 10, and 11, and the auxiliary frame corresponding to sampling area D is auxiliary frame d, the pixel value at the center point of the sampling area D is determined according to the pixels 6, 7, 10, 11 and the auxiliary frame d.
需要说明的是,对于候选区域特征图的其他采样区域,基于上述采样区域D的中心点位置的像素值计算方式进行确定,示例性的,请参阅图6,对于采样区域A,可见,与采样区域A存在相交的相交像素点为像素点1、2、5以及6,采样区域A对应的辅助框为辅助框a,则根据像素点1、2、5、6,以及辅助框a确定出该采样区域a的中心点位置的像素值,其他采样区域的中心点的像素值的计算方式,这里不一一说明。It should be noted that the other sampling areas of the feature map of the candidate area are determined based on the pixel value calculation method of the center point position of the sampling area D. For example, please refer to Fig. 6. For the sampling area A, it is visible and sampled. The intersecting pixels in area A are pixels 1, 2, 5, and 6, and the auxiliary frame corresponding to sampling area A is auxiliary frame a. Then the pixel points 1, 2, 5, 6 and auxiliary frame a are used to determine this The calculation methods of the pixel value at the center point of the sampling area a and the pixel values at the center points of other sampling areas are not described here.
在一实施例中,如图7所示,步骤S50`中,也即所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,具体包括如下步骤:In one embodiment, as shown in FIG. 7, in step S50', that is, according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area, and the auxiliary frame corresponding to the sampling area, Determining the pixel value of the center point of the sampling area specifically includes the following steps:
S51`:获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积。S51': Obtain the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point in the intersection pixel point.
S52`:根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值。S52': Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point, and the pixel value of each pixel point.
S53`:将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。S53': Use the first target pixel value corresponding to the sampling area as the pixel value at the center point of the sampling area.
对于步骤S51`,请继续参阅图5,以采样区域D为例,可见采样区域D对应的辅助区域d分别与相交像素点,也即像素点6、7、10以及11相交,在本步骤中,可确定辅助框d分别与相交像素点6、7、10以及11的相交面积,在本实施例中分别记为:C 6d、C 7d、C 10d以及C 11dFor step S51`, please continue to refer to FIG. 5, taking the sampling area D as an example, it can be seen that the auxiliary area d corresponding to the sampling area D intersects with the intersecting pixels, that is, the pixels 6, 7, 10, and 11. In this step , The intersecting areas of the auxiliary frame d and the intersecting pixels 6 , 7 , 10 , and 11 can be determined, which are respectively denoted as C 6d , C 7d , C 10d and C 11d in this embodiment.
对于步骤S52`-S53`,同样以采样区域D为例,在获得C 6d、C 7d、C 10d以及C 11d之后,根据C 6d、C 7d、C 10d以及C 11d,以及相交像素点6、7、10以及11确定出采样区域 D对应的第一目标像素值,并将该第一目标像素值作为采样区域D的中心点位置的像素值。需要说明的是,对于候选区域特征图的其他采样区域,通过步骤S51`-S53`均可得到采样区域对应的中心点位置的像素值,这里不一一重复赘述。 For steps S52`-S53`, taking the sampling area D as an example, after obtaining C 6d , C 7d , C 10d and C 11d , according to C 6d , C 7d , C 10d and C 11d , and the intersecting pixels 6, 7, 10, and 11 determine the first target pixel value corresponding to the sampling area D, and use the first target pixel value as the pixel value at the center point of the sampling area D. It should be noted that, for other sampling areas of the feature map of the candidate area, the pixel values of the center point positions corresponding to the sampling areas can be obtained through steps S51`-S53`, and the details are not repeated here.
在一实施例中,如图8所示,步骤S53`中,也即所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值,具体包括如下步骤:In an embodiment, as shown in FIG. 8, in step S53', that is, according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel, and the pixel value of the respective pixel, Determining the first target pixel value corresponding to the sampling area specifically includes the following steps:
S531`:分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积。S531': correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel.
S532`:将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和。S532': Add the product between the pixel value of each pixel and the intersection area of each pixel to obtain a first product sum.
S533`:计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和。S533': Calculate the sum of the intersection area of each pixel to obtain the sum of the area of the first intersecting pixel.
S534`:计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。S534': Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
这里以候选区域特征图中的采样区域D为例,对本申请实施例进行说明:Here, the sampling area D in the feature map of the candidate area is taken as an example to describe the embodiment of the present application:
对于步骤S531`,在获得C 6d、C 7d、C 10d以及C 11d之后,分别对应计算出采样区域对应的相交像素点中各个像素点的像素值与C 6d、C 7d、C 10d以及C 11d之间的乘积,也即分别计算出C 6d与像素点6的像素值A 6的乘积,记为:A 6C 6d;计算出C 7d与像素点7的像素值A 7的乘积,记为:A 7C 7d;计算出C 10d与像素点10的像素值A 10的乘积,记为:A 10C 10d;计算出C 11d与像素点11的像素值A 11的乘积,记为:A 11C 11dFor step S531`, after obtaining C 6d, C 7d, C 10d and C 11d, respectively, corresponding to the calculated pixel value of each pixel of C 6d, C 7d, C 10d C 11d and the intersection point of the pixel region corresponding to the sample The product between, that is, the product of C 6d and the pixel value A 6 of pixel 6 is calculated separately, and it is recorded as: A 6 C 6d ; the product of C 7d and the pixel value A 7 of pixel 7 is calculated, and it is recorded as :A 7 C 7d ; Calculate the product of C 10d and the pixel value A 10 of the pixel point 10, denoted as: A 10 C 10d ; Calculate the product of C 11d and the pixel value A 11 of the pixel point 11, denoted as: A 11 C 11d .
对于步骤S532`,计算得到A 6C 6d、A 7C 7d、A 10C 10d以及A 11C 11d之后,将上述乘积进行相加得到第一乘积和,也即A 6C 6d+A 7C 7d+A 10C 10d+A 11C 11dFor step S532`, after calculating A 6 C 6d , A 7 C 7d , A 10 C 10d and A 11 C 11d , the above products are added to obtain the first product sum, that is, A 6 C 6d + A 7 C 7d +A 10 C 10d +A 11 C 11d .
对于步骤S533`,采样区域D对应的辅助框d与所述各个像素点的相交面积的和,第一相交像素点面积和,也就是C 6d+C 7d+C 10d+C 11dFor step S533', the sum of the intersecting areas of the auxiliary frame d corresponding to the sampling area D and the respective pixels, and the sum of the areas of the first intersecting pixel, that is, C 6d + C 7d + C 10d + C 11d .
对于步骤S234`,计算采样区域D对应的第一乘积和,与所述第一相交像素点面积和之间的商U D,具体如下述公式所示: For step S234', the quotient U D between the first product sum corresponding to the sampling area D and the first intersecting pixel area sum is calculated as shown in the following formula:
Figure PCTCN2019089196-appb-000001
Figure PCTCN2019089196-appb-000001
需要说明的是,本申请实施例中,以采样区域D为例进行说明,对于候选区域特征图中的其他采样区域,均可参阅上述采样区域的D对应的第一目标像素值得计算方式进行计算,这里不重复赘述。另外需要说明的是,根据采样区域与候选区域特征图中的像素点的尺寸的大小关系,本申请实施例还提供了具体根据所述候选区域特征图中与所述采样区域相交的目标相交像素点,确定出所述采样区域的中心点位置的像素值的实现方式:It should be noted that, in the embodiment of the present application, the sampling area D is taken as an example for description. For other sampling areas in the feature map of the candidate area, the calculation can be performed by referring to the first target pixel value calculation method corresponding to D of the sampling area. , I will not repeat it here. In addition, it should be noted that, according to the size relationship between the sampling area and the size of the pixel points in the feature map of the candidate area, the embodiment of the present application also provides specific intersecting pixels based on the target intersecting the sampling area in the feature map of the candidate area. Point, the method for determining the pixel value at the center point of the sampling area:
如图9所示,步骤S80之后,也即所述判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸之后,该图像处理方法还包括如下步骤:As shown in FIG. 9, after step S80, that is, after determining whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area, the image processing method further includes the following steps:
S100:若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,则根据 所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值。S100: If it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, then according to the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area, and the The intersection area of the sampling area and each of the intersecting pixel points determines the second target pixel value corresponding to the sampling area.
S110:将所述采样区域对应的第二目标像素值作为所述采样区域的中心点位置的像素值。S110: Use the second target pixel value corresponding to the sampling area as the pixel value at the center point of the sampling area.
如前述,预设采样点是预设配置的,根据预设采样点数P对候选区域中的每个小区域划分出来的采样区域的尺寸,有可能小于候选区域特征图中像素点的尺寸,例如,候选区域是小目标时,映射至输入图像对应的得到的候选区域特征图将比较小,划分出的每个采样区域的尺寸可能小于候选区域特图的像素点的尺寸。As mentioned above, the preset sampling points are preset configurations. The size of the sampling area divided into each small area in the candidate area according to the preset number of sampling points P may be smaller than the size of the pixel points in the feature map of the candidate area, for example When the candidate area is a small target, the feature map of the candidate area corresponding to the input image will be relatively small, and the size of each sample area divided may be smaller than the pixel size of the feature map of the candidate area.
示例性的,如图10所示:可见,小区域划分出的采样区域A、B、C以及D小于像素点的尺寸,则根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值。以采样区域B为例,与采样区域B相交的各个相交像素点分别为像素点5和6,分别记像素点5的像素值为A 5,像素点6的像素值为A 6,采样区域B与像素点5和6的相交面积分别记为C 5B、C 6B,则根据A 5、A 6、C 5B以及C 6B确定出采样区域B对应的第二目标像素值,并将采样区域B对应的第二目标像素值作为该采样区域B的中心点位置的像素值。对于尺寸小于候选区域特征图的像素点尺寸的其他采样区域,可基于上述方式,分别确定出采样区域的重中心点位置的像素值,这里不一一重复赘述。 Exemplarily, as shown in FIG. 10: it can be seen that the sampling areas A, B, C, and D divided by the small area are smaller than the size of the pixel, then according to each intersection of the candidate area feature map and the sampling area The pixel value of the pixel point and the intersection area between the sampling area and each of the intersecting pixel points determine the second target pixel value corresponding to the sampling area. Taking sampling area B as an example, the intersecting pixels that intersect with sampling area B are pixels 5 and 6, respectively. The pixel value of pixel 5 is A 5 , the pixel value of pixel 6 is A 6 , and sampling area B The intersecting areas with pixels 5 and 6 are denoted as C 5B and C 6B respectively , and then the second target pixel value corresponding to sampling area B is determined according to A 5 , A 6 , C 5B and C 6B , and sampling area B corresponds to The second target pixel value of is used as the pixel value of the center point of the sampling area B. For other sampling areas whose size is smaller than the pixel size of the feature map of the candidate area, the pixel values at the positions of the heavy center points of the sampling areas can be determined based on the above-mentioned method, which will not be repeated here.
在一实施例中,如图11所示,步骤S100中,若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,所述根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值,具体包括如下步骤:In one embodiment, as shown in FIG. 11, in step S100, if it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, the The pixel value of each intersecting pixel point intersected by the sampling area and the intersecting area of the sampling area and each intersecting pixel point are determined to determine the second target pixel value corresponding to the sampling area, which specifically includes the following steps:
S101:分别对应计算所述各个相交像素点的像素值,与所述各个相交像素点的相交面积之间的乘积。S101: Calculate the product of the pixel value of each intersecting pixel and the intersecting area of each intersecting pixel respectively.
S102:将所述各个相交像素点的像素值,与所述各个相交像素点的相交面积之间的乘积相加,以获得第二乘积和。S102: Add the pixel value of each intersecting pixel and the product of the intersecting area of each intersecting pixel to obtain a second sum of products.
S103:计算所述各个相交像素点的相交面积之间的和,以获得第二相交像素点面积和。S103: Calculate the sum of the intersecting areas of the intersecting pixels to obtain a second sum of the areas of the intersecting pixels.
S104:计算第二乘积和与所述第二相交像素点面积之间的商,以获得所述采样区域对应的第二目标像素值。S104: Calculate the quotient between the second product and the area of the second intersecting pixel to obtain a second target pixel value corresponding to the sampling area.
对于步骤S101-S104,请继续参阅图10,以候选区域特征图中的采样区域B为例,在获得为C 5B、C 6B之后,分别对应计算出相交像素点中各个像素点的像素值与C 5B、C 6B之间的乘积,也即分别计算出C 5B与像素点5的像素值A 5的乘积,记为:A 5C 5B;计算出C 6B与像素点6的像素值A 6的乘积,记为:A 6C 6BFor steps S101-S104, please continue to refer to Fig. 10, taking the sampling area B in the feature map of the candidate area as an example, after obtaining C 5B and C 6B , the pixel value and the pixel value of each pixel in the intersecting pixel are calculated respectively. The product between C 5B and C 6B , that is, the product of C 5B and the pixel value A 5 of pixel 5 is calculated respectively, and it is recorded as: A 5 C 5B ; the pixel value A 6 of C 6B and pixel 6 is calculated The product of is recorded as: A 6 C 6B .
以候选区域特征图中的采样区域B为例,在获得计算出C 5B与像素点5的像素值A 5的乘积A 5C 5B,以及计算出C 6B与像素点6的像素值A 6的乘积A 6C 6B之后,对乘积进行相加得到第二乘积和,也即:A 5C 5B+A 6C 6BTaking the sampling area B in the feature map of the candidate area as an example, the product A 5 C 5B of C 5B and the pixel value A 5 of pixel 5 is obtained, and the pixel value A 6 of C 6B and pixel 6 is calculated. After the products A 6 C 6B are multiplied, the products are added to obtain the second product sum, that is, A 5 C 5B + A 6 C 6B .
采样区域B对应的相交像素点的相交面积的和,即第一相交像素点面积和为:C 5B+C 6BThe sum of the intersecting areas of the intersecting pixels corresponding to the sampling area B, that is, the sum of the areas of the first intersecting pixels is: C 5B +C 6B .
第二乘积和与所述第二相交像素点面积之间的商,具体采样以下公式计算:The quotient between the second product sum and the area of the second intersecting pixel is calculated by specifically sampling the following formula:
Figure PCTCN2019089196-appb-000002
Figure PCTCN2019089196-appb-000002
需要说明的是,本申请实施例中,以采样区域B为例进行说明,对于候选区域特征图中的其他采样区域,均可参阅上述采样区域的B对应的第二目标像素值得计算方式进行计算,这里不重复赘述。It should be noted that in the embodiment of the present application, the sampling area B is taken as an example for description. For other sampling areas in the feature map of the candidate area, the calculation can be performed by referring to the second target pixel value calculation method corresponding to B of the sampling area. , I will not repeat it here.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence number of each step in the foregoing embodiment does not mean the order of execution, and the execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation on the implementation process of the embodiment of the present application.
在一实施例中,提供一种图像处理装置,该图像处理装置与上述实施例中图像处理方法一一对应。如图12所示,该图像处理装置10包括第一获取模块101、第一划分模块102、第二划分模块103、第一确定模块104、第二确定模块105、第三确定模块106和第二获取模块107。各功能模块详细说明如下:In one embodiment, an image processing device is provided, and the image processing device corresponds to the image processing method in the above-mentioned embodiment one-to-one. As shown in FIG. 12, the image processing apparatus 10 includes a first acquisition module 101, a first division module 102, a second division module 103, a first determination module 104, a second determination module 105, a third determination module 106, and a second Obtaining module 107. The detailed description of each functional module is as follows:
第一获取模块,用于获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;第一划分模块,用于根据预设候选区域池化参数将所述第一获取模块获取的所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;第二划分模块,用于根据预设采样点数P将每个所述第一划分模块划分得到的所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;第一确定模块,用于确定出所述第一获取模块获取的所述候选区域特征图中与所述第二划分模块划分得到的所述采样区域相交的相交像素点;第二确定模块,用于根据所述第一确定模块确定的所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;第三确定模块,用于根据所述第二确定模块确定的所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;第二获取模块,用于根据所述第三确定模块确定的每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。The first acquisition module is used to acquire a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network , The candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; a first division module is configured to combine the data obtained by the first obtaining module according to a preset candidate area pooling parameter The candidate region feature map is divided into NxM small regions of equal size, where N and M are positive integers and greater than or equal to 1, and the preset candidate region pooling parameters include a width parameter and a length parameter for pooling processing; The second dividing module is configured to divide the small areas divided by each of the first dividing modules into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2; A determining module, configured to determine the intersection pixel points in the candidate region feature map acquired by the first acquiring module and the sampling area divided by the second dividing module; a second determining module, The pixel value of the center point of the sampling area is determined according to the intersecting pixels in the feature map of the candidate area determined by the first determining module and the sampling area; The pixel value of the center point of each sampling area corresponding to the small area determined by the second determining module determines the pixel value of each small area correspondingly; the second acquiring module is configured to determine the pixel value of each small area according to the The pixel value of each small area determined by the third determining module obtains the candidate area feature map of a fixed size.
在一实施例中,所述图像处理装置还包括第四确定模块、生成模块;In an embodiment, the image processing device further includes a fourth determining module and a generating module;
所述第四确定模块,用于所述第二确定模块根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值之前,判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸;所述生成模块,用于若所述第四确定模块判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述采样区域的中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸;所述第二确定模块,具体用于:根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值。The fourth determining module is configured to determine before the second determining module determines the pixel value at the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area Whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area; the generating module is configured to determine that the size of the sampling area is greater than or equal to the candidate area if the fourth determining module determines For the size of the pixel points of the image, the auxiliary frame corresponding to the sampling area is generated with the center point position of the sampling area as the center. The shape of the auxiliary frame is the same as the pixel points of the feature map of the candidate area. The size of the frame is smaller than or equal to the pixel size of the feature map of the candidate area; the second determining module is specifically configured to: according to the intersection pixel point of the feature map of the candidate area that intersects the sampling area, and The auxiliary frame corresponding to the sampling area determines the pixel value of the center point position of the sampling area.
在一实施例中,所述第二确定模块用于根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,包括:所述第二确定模块用于:获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积;根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值;将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。In an embodiment, the second determining module is configured to determine the center of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area The pixel value of the point position includes: the second determining module is configured to: obtain the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point in the intersecting pixel; according to the auxiliary frame corresponding to the sampling area The intersection area with each pixel point and the pixel value of each pixel point determine the first target pixel value corresponding to the sampling area; take the first target pixel value corresponding to the sampling area as the The pixel value of the center point of the sampling area.
在一实施例中,所述第二确定模块用于所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目 标像素值,包括:所述第二确定模块具体用于:分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积;将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和;计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和;计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。In an embodiment, the second determining module is configured to determine the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point and the pixel value of each pixel point The first target pixel value corresponding to the area includes: the second determining module is specifically configured to: respectively correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel; Add the product between the pixel value of the pixel and the intersection area of each pixel to obtain a first product sum; calculate the sum of the intersection area of each pixel to obtain the first intersecting pixel area sum; Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
在一实施例中,所述图像处理装置还包括第五确定模块:所述第五确定模块:所述判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸之后,若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,则根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值;将所述采样区域对应的第二目标像素值作为所述采样区域的中心点位置的像素值。In an embodiment, the image processing device further includes a fifth determining module: the fifth determining module: after determining whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area If it is determined that the size of the sampling area is smaller than the size of the pixel of the feature map of the candidate area, then the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area is determined, and the sampling The intersecting area of the area and each of the intersecting pixel points determines the second target pixel value corresponding to the sampling area; taking the second target pixel value corresponding to the sampling area as the pixel value at the center point of the sampling area .
在一实施例中,第五确定模块用于所述根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值,包括:第五确定模块用于:分别对应计算所述各个目标相交像素点的像素值,与所述各个目标相交像素点的相交面积之间的乘积;将所述各个相交像素点的像素值,与所述各个相交像素点的相交面积之间的乘积相加,以获得第二乘积和;计算所述各个相交像素点的相交面积之间的和,以获得第二相交像素点面积和;计算第二乘积和与所述第二相交像素点面积之间的商,以获得所述采样区域对应的第二目标像素值。In an embodiment, the fifth determining module is used for the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area, and the difference between the sampling area and each intersecting pixel. The intersection area determines the second target pixel value corresponding to the sampling area, including: a fifth determining module is configured to: respectively calculate the pixel value of each target intersection pixel point, and the intersection of each target intersection pixel point The product between the areas; adding the pixel value of each intersecting pixel to the product of the intersecting area of each intersecting pixel to obtain a second product sum; calculating the intersection of each intersecting pixel The sum of the areas is used to obtain the area sum of the second intersecting pixel; the quotient between the second product sum and the area of the second intersecting pixel is calculated to obtain the second target pixel value corresponding to the sampling area.
关于图像处理装置的具体限定可以参见上文中对于图像处理方法的限定,在此不再赘述。上述图像处理装置中的各个模块可全部或部分通过软件、硬件及其组合来实现。上述各模块可以硬件形式内嵌于或独立于计算机设备中的处理器中,也可以以软件形式存储于计算机设备中的存储器中,以便于处理器调用执行以上各个模块对应的操作。For the specific definition of the image processing device, please refer to the above definition of the image processing method, which will not be repeated here. Each module in the above-mentioned image processing device may be implemented in whole or in part by software, hardware, and a combination thereof. The foregoing modules may be embedded in the form of hardware or independent of the processor in the computer equipment, or may be stored in the memory of the computer equipment in the form of software, so that the processor can call and execute the operations corresponding to the foregoing modules.
在一个实施例中,提供了一种计算机设备,该计算机设备可以是服务器,其内部结构图可以如图13所示。该计算机设备包括通过系统总线连接的处理器、存储器、网络接口和数据库。其中,该计算机设备的处理器用于提供计算和控制能力。该计算机设备的存储器包括非易失性存储介质、内存储器。该非易失性存储介质存储有操作系统、计算机可读指令和数据库。该内存储器为非易失性存储介质中的操作系统和计算机可读指令的运行提供环境。该计算机设备的数据库用于存储特征图等。该计算机设备的网络接口用于与外部的终端通过网络连接通信。该计算机可读指令被处理器执行时以实现一种图像处理方法。In one embodiment, a computer device is provided. The computer device may be a server, and its internal structure diagram may be as shown in FIG. 13. The computer equipment includes a processor, a memory, a network interface and a database connected by a system bus. Among them, the processor of the computer device is used to provide calculation and control capabilities. The memory of the computer device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer readable instructions, and a database. The internal memory provides an environment for the operation of the operating system and computer-readable instructions in the non-volatile storage medium. The database of the computer equipment is used to store feature maps and so on. The network interface of the computer device is used to communicate with an external terminal through a network connection. The computer-readable instructions are executed by the processor to realize an image processing method.
在一个实施例中,提供了一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,所述处理器执行所述计算机可读指令时实现如下步骤:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;确定出所述候选区域特征图中与所述采样区域相交的相交像素点;根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。In one embodiment, a computer device is provided, including a memory, a processor, and computer-readable instructions stored in the memory and executable on the processor, and the processor executes the computer-readable The instruction is implemented as follows: Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to the feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network, The candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; dividing the candidate area feature map into NxM small areas of equal size according to preset candidate area pooling parameters, N and M are positive integers and greater than or equal to 1, the preset candidate area pooling parameters include a width parameter and a length parameter used for pooling processing; each of the small areas is divided equally into P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the feature map of the candidate area; according to the feature map of the candidate area and the sampling area Determine the pixel value of the center point of the sampling area for the intersecting pixel points; determine each small area corresponding to the pixel value of the center point of each sampling area corresponding to the small area According to the pixel value of each of the small areas, a fixed-size feature map of the candidate area is obtained.
在一个实施例中,提供了一个或多个存储有计算机可读指令的非易失性可读存储介质,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如 下步骤:获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;确定出所述候选区域特征图中与所述采样区域相交的相交像素点;根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。In one embodiment, one or more non-volatile readable storage media storing computer readable instructions are provided. When the computer readable instructions are executed by one or more processors, the one or more Each processor performs the following steps: Obtain a feature map of the candidate region, the feature map of the candidate region is obtained by mapping the candidate region to a feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network The candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm; the candidate area feature map is divided into NxM small areas of equal size according to a preset candidate area pooling parameter, so The N and M are positive integers and greater than or equal to 1, the preset candidate area pooling parameters include a width parameter and a length parameter for pooling processing; each of the small areas is divided equally according to the preset number of sampling points P Are P sampling areas, where P is a positive integer and greater than or equal to 2; determine the intersection pixel points that intersect the sampling area in the candidate area feature map; according to the candidate area feature map and the sampling The pixel value at the center point position of the sampling area is determined for the intersecting pixel points where the area intersects; according to the pixel value at the center point position of each sampling area corresponding to the small area, each small area is correspondingly determined. The pixel value of the area; obtaining a fixed-size feature map of the candidate area according to the pixel value of each small area.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机可读指令来指令相关的硬件来完成,所述的计算机可读指令可存储于一非易失性计算机可读取存储介质中,该计算机可读指令在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。非易失性存储器可包括只读存储器(ROM)、可编程ROM(PROM)、电可编程ROM(EPROM)、电可擦除可编程ROM(EEPROM)或闪存。易失性存储器可包括随机存取存储器(RAM)或者外部高速缓冲存储器。作为说明而非局限,RAM以多种形式可得,诸如静态RAM(SRAM)、动态RAM(DRAM)、同步DRAM(SDRAM)、双数据率SDRAM(DDRSDRAM)、增强型SDRAM(ESDRAM)、同步链路(Synchlink)DRAM(SLDRAM)、存储器总线(Rambus)直接RAM(RDRAM)、直接存储器总线动态RAM(DRDRAM)、以及存储器总线动态RAM(RDRAM)等。A person of ordinary skill in the art can understand that all or part of the processes in the above-mentioned embodiment methods can be implemented by instructing relevant hardware through computer-readable instructions, which can be stored in a non-volatile computer. In a readable storage medium, when the computer-readable instructions are executed, they may include the processes of the above-mentioned method embodiments. Wherein, any reference to memory, storage, database or other media used in the embodiments provided in this application may include non-volatile and/or volatile memory. Non-volatile memory may include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable programmable ROM (EEPROM), or flash memory. Volatile memory may include random access memory (RAM) or external cache memory. As an illustration and not a limitation, RAM is available in many forms, such as static RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDRSDRAM), enhanced SDRAM (ESDRAM), synchronous chain Channel (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM), etc.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。Those skilled in the art can clearly understand that for the convenience and conciseness of description, only the division of the above functional units and modules is used as an example. In actual applications, the above functions can be allocated to different functional units, Module completion means dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above.
以上所述实施例仅用以说明本申请的技术方案,而非对其限制;尽管参照前述实施例对本申请进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本申请各实施例技术方案的精神和范围,均应包含在本申请的保护范围之内。The above-mentioned embodiments are only used to illustrate the technical solutions of the present application, not to limit them; although the present application has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that they can still implement the foregoing The technical solutions recorded in the examples are modified, or some of the technical features are equivalently replaced; these modifications or replacements do not cause the essence of the corresponding technical solutions to deviate from the spirit and scope of the technical solutions of the embodiments of the application, and should be included in Within the scope of protection of this application.

Claims (20)

  1. 一种图像处理方法,其特征在于,包括:An image processing method, characterized by comprising:
    获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;Obtain a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, and the candidate area is based on A preset target detection algorithm is obtained by performing target area detection on the input image;
    根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;According to preset candidate area pooling parameters, the candidate area feature map is divided into NxM small areas of equal size, where N and M are positive integers and greater than or equal to 1, and the preset candidate area pooling parameters include Width and length parameters for pooling processing;
    根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;Divide each small area into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2;
    确定出所述候选区域特征图中与所述采样区域相交的相交像素点;Determining an intersection pixel point that intersects the sampling area in the candidate area feature map;
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;Determine the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area;
    根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;Correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to the small area;
    根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。A fixed-size feature map of the candidate area is obtained according to the pixel value of each small area.
  2. 如权利要求1所述的图像处理方法,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值之前,所述方法还包括:The image processing method according to claim 1, wherein the pixel value of the center point of the sampling area is determined according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area. , The method further includes:
    判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸;Judging whether the size of the sampling area is greater than or equal to the size of the pixel point of the feature map of the candidate area;
    若判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述采样区域的中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸;If it is determined that the size of the sampling area is greater than or equal to the size of the pixel point of the candidate area image, the auxiliary frame corresponding to the sampling area is generated with the center point position of the sampling area as the center, and the shape of the auxiliary frame Same as the pixels of the feature map of the candidate area, the size of the auxiliary frame is smaller than or equal to the size of the pixels of the feature map of the candidate area;
    所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值,包括:The determining the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area includes:
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值。The pixel value of the center point of the sampling area is determined according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area.
  3. 如权利要求2所述的图像处理方法,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,包括:3. The image processing method according to claim 2, wherein the sampling is determined based on the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area. The pixel value at the center point of the area, including:
    获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积;Acquiring the intersection area of the auxiliary frame corresponding to the sampling area and each pixel in the intersection pixel;
    根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值;Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel points, and the pixel value of each pixel point;
    将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。The first target pixel value corresponding to the sampling area is used as the pixel value at the center point of the sampling area.
  4. 如权利要求3所述的图像处理方法,其特征在于,所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值,包括:5. The image processing method according to claim 3, wherein said determining the said area according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel and the pixel value of each pixel The first target pixel value corresponding to the sampling area includes:
    分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积;Correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel;
    将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和;Adding the product between the pixel value of each pixel and the intersection area of each pixel to obtain a first product sum;
    计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和;Calculating the sum of the intersecting area of each pixel to obtain the sum of the area of the first intersecting pixel;
    计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  5. 如权利要求2所述的图像处理方法,其特征在于,所述判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸之后,所述方法还包括:3. The image processing method according to claim 2, wherein after determining whether the size of the sampling area is greater than or equal to the pixel size of the feature map of the candidate area, the method further comprises:
    若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,则根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值;If it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area and the sampling area Determine the second target pixel value corresponding to the sampling area by the intersection area with each of the intersecting pixel points;
    将所述采样区域对应的第二目标像素值作为所述采样区域的中心点位置的像素值。The second target pixel value corresponding to the sampling area is taken as the pixel value at the center point of the sampling area.
  6. 如权利要求5所述的图像处理方法,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值,包括:The image processing method of claim 5, wherein the pixel value of each intersecting pixel point that intersects the sampling area in the candidate area feature map, and the sampling area and each intersecting pixel value The intersection area of the pixel points to determine the second target pixel value corresponding to the sampling area includes:
    分别对应计算所述各个目标相交像素点的像素值,与所述各个目标相交像素点的相交面积之间的乘积;Correspondingly calculate the product of the pixel value of each target intersecting pixel and the product of the intersecting area of each target intersecting pixel;
    将所述各个相交像素点的像素值,与所述各个相交像素点的相交面积之间的乘积相加,以获得第二乘积和;Adding the pixel value of each intersecting pixel to the product of the intersecting area of each intersecting pixel to obtain a second product sum;
    计算所述各个相交像素点的相交面积之间的和,以获得第二相交像素点面积和;Calculating the sum of the intersecting areas of the intersecting pixels to obtain a second sum of the areas of the intersecting pixels;
    计算第二乘积和与所述第二相交像素点面积之间的商,以获得所述采样区域对应的第二目标像素值。The quotient between the second product and the area of the second intersecting pixel is calculated to obtain the second target pixel value corresponding to the sampling area.
  7. 一种图像处理装置,其特征在于,包括:An image processing device, characterized by comprising:
    第一获取模块,用于获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;The first acquisition module is used to acquire a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, and the feature map is obtained by feature extraction of the input image through a convolutional neural network , The candidate area is obtained by performing target area detection on the input image according to a preset target detection algorithm;
    第一划分模块,用于根据预设候选区域池化参数将所述第一获取模块获取的所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;The first division module is configured to divide the candidate region feature map acquired by the first acquisition module into NxM small regions of equal size according to preset candidate region pooling parameters, where N and M are positive integers and greater than Or equal to 1, the preset candidate region pooling parameters include a width parameter and a length parameter used for pooling processing;
    第二划分模块,用于根据预设采样点数P将每个所述第一划分模块划分得到的所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;The second dividing module is configured to divide the small areas divided by each of the first dividing modules into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2;
    第一确定模块,用于确定出所述第一获取模块获取的所述候选区域特征图中与所述第二划分模块划分得到的所述采样区域相交的相交像素点;A first determining module, configured to determine intersecting pixels in the candidate area feature map acquired by the first acquiring module and the sampling area divided by the second dividing module;
    第二确定模块,用于根据所述第一确定模块确定的所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;The second determining module is configured to determine the pixel value of the center point of the sampling area according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area determined by the first determining module;
    第三确定模块,用于根据所述第二确定模块确定的所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;A third determining module, configured to correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to the small area determined by the second determining module;
    第二获取模块,用于根据所述第三确定模块确定的每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。The second obtaining module is configured to obtain a fixed-size feature map of the candidate area according to the pixel value of each small area determined by the third determining module.
  8. 如权利要求7所述的图像处理装置,其特征在于,所述图像处理装置还包括第四确定模块、生成模块;8. The image processing device of claim 7, wherein the image processing device further comprises a fourth determining module and a generating module;
    所述第四确定模块,用于所述第二确定模块根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值之前,判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸;The fourth determining module is configured to determine before the second determining module determines the pixel value at the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area Whether the size of the sampling area is greater than or equal to the size of the pixel point of the feature map of the candidate area;
    所述生成模块,用于若所述第四确定模块判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述采样区域的中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸;The generating module is configured to, if the fourth determining module determines that the size of the sampling area is greater than or equal to the size of the pixel point of the candidate area image, generate the center point of the sampling area as the center. The auxiliary frame corresponding to the sampling area, the shape of the auxiliary frame is the same as the pixels of the feature map of the candidate area, and the size of the auxiliary frame is smaller than or equal to the size of the pixels of the feature map of the candidate area;
    所述第二确定模块,具体用于:根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值。The second determining module is specifically configured to: determine the position of the center point of the sampling area according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area Pixel values.
  9. 如权利要求8所述的图像处理装置,其特征在于,所述第二确定模块用于根据所述 候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,包括:The image processing device according to claim 8, wherein the second determining module is used to determine the pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary corresponding to the sampling area. Frame, determining the pixel value of the center point of the sampling area, including:
    所述第二确定模块用于:The second determining module is used for:
    获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积;Acquiring the intersection area of the auxiliary frame corresponding to the sampling area and each pixel in the intersection pixel;
    根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值;Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel points, and the pixel value of each pixel point;
    将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。The first target pixel value corresponding to the sampling area is used as the pixel value at the center point of the sampling area.
  10. 如权利要求9所述的图像处理装置,其特征在于,所述第二确定模块用于所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值,包括:9. The image processing device according to claim 9, wherein the second determining module is used for the intersection area between the auxiliary frame corresponding to the sampling area and the respective pixel, and the respective pixel To determine the first target pixel value corresponding to the sampling area, including:
    所述第二确定模块具体用于:The second determining module is specifically configured to:
    分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积;Correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel;
    将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和;Adding the product between the pixel value of each pixel and the intersection area of each pixel to obtain a first product sum;
    计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和;Calculating the sum of the intersecting area of each pixel to obtain the sum of the area of the first intersecting pixel;
    计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  11. 一种计算机设备,包括存储器、处理器以及存储在所述存储器中并可在所述处理器上运行的计算机可读指令,其特征在于,所述处理器执行所述计算机可读指令时实现如下步骤:A computer device includes a memory, a processor, and computer-readable instructions stored in the memory and capable of running on the processor, wherein the processor executes the computer-readable instructions as follows step:
    获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;Obtain a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, and the candidate area is based on A preset target detection algorithm is obtained by performing target area detection on the input image;
    根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;According to preset candidate area pooling parameters, the candidate area feature map is divided into NxM small areas of equal size, where N and M are positive integers and greater than or equal to 1, and the preset candidate area pooling parameters include Width and length parameters for pooling processing;
    根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;Divide each small area into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2;
    确定出所述候选区域特征图中与所述采样区域相交的相交像素点;Determining an intersection pixel point that intersects the sampling area in the candidate area feature map;
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;Determine the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area;
    根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;Correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to the small area;
    根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。A fixed-size feature map of the candidate area is obtained according to the pixel value of each small area.
  12. 如权利要求11所述的计算机设备,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值之前,所述处理器执行所述计算机可读指令时还如下步骤:The computer device according to claim 11, wherein before determining the pixel value of the center point position of the sampling area according to the intersection pixel point that intersects the sampling area in the feature map of the candidate area, When the processor executes the computer-readable instruction, the following steps are also performed:
    判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸;Judging whether the size of the sampling area is greater than or equal to the size of the pixel point of the feature map of the candidate area;
    若判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述采样区域的中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸;If it is determined that the size of the sampling area is greater than or equal to the size of the pixel point of the candidate area image, the auxiliary frame corresponding to the sampling area is generated with the center point position of the sampling area as the center, and the shape of the auxiliary frame Same as the pixels of the feature map of the candidate area, the size of the auxiliary frame is smaller than or equal to the size of the pixels of the feature map of the candidate area;
    所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值,包括:The determining the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area includes:
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对 应的辅助框,确定所述采样区域的中心点位置的像素值。The pixel value at the center point of the sampling area is determined according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area.
  13. 如权利要求12所述的计算机设备,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,包括:The computer device according to claim 12, wherein the sampling area is determined based on the intersection pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area The pixel value of the center point position, including:
    获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积;Acquiring the intersection area of the auxiliary frame corresponding to the sampling area and each pixel in the intersection pixel;
    根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值;Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel points, and the pixel value of each pixel point;
    将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。The first target pixel value corresponding to the sampling area is used as the pixel value at the center point of the sampling area.
  14. 如权利要求13所述的计算机设备,其特征在于,所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值,包括:The computer device according to claim 13, wherein the sampling area is determined according to the intersection area of the auxiliary frame corresponding to the sampling area and each pixel point, and the pixel value of each pixel point. The first target pixel value corresponding to the area includes:
    分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积;Correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel;
    将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和;Adding the product between the pixel value of each pixel and the intersection area of each pixel to obtain a first product sum;
    计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和;Calculating the sum of the intersecting area of each pixel to obtain the sum of the area of the first intersecting pixel;
    计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  15. 如权利要求12所述的计算机设备,其特征在于,所述判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸之后,还包括:The computer device according to claim 12, wherein after determining whether the size of the sampling area is greater than or equal to the size of the pixel point of the feature map of the candidate area, the method further comprises:
    若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,则根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值;If it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area and the sampling area Determine the second target pixel value corresponding to the sampling area by the intersection area with each of the intersecting pixel points;
    将所述采样区域对应的第二目标像素值作为所述采样区域的中心点位置的像素值。The second target pixel value corresponding to the sampling area is taken as the pixel value at the center point of the sampling area.
  16. 一个或多个存储有计算机可读指令的非易失性可读存储介质,其特征在于,所述计算机可读指令被一个或多个处理器执行时,使得所述一个或多个处理器执行如下步骤:One or more non-volatile readable storage media storing computer readable instructions, wherein when the computer readable instructions are executed by one or more processors, the one or more processors execute The following steps:
    获取候选区域特征图,所述候选区域特征图为将候选区域映射至特征图后所得到的,所述特征图为通过卷积神经网络对输入图像进行特征提取所得到,所述候选区域为根据预设目标检测算法对所述输入图像进行目标区域检测所得到;Obtain a feature map of the candidate area, the feature map of the candidate area is obtained by mapping the candidate area to the feature map, the feature map is obtained by feature extraction of the input image through a convolutional neural network, and the candidate area is based on A preset target detection algorithm is obtained by performing target area detection on the input image;
    根据预设候选区域池化参数将所述候选区域特征图划分为NxM个大小等同的小区域,所述N、M为正整数且大于或等于1,所述预设候选区域池化参数包括用于池化处理的宽度参数以及长度参数;According to preset candidate area pooling parameters, the candidate area feature map is divided into NxM small areas of equal size, where N and M are positive integers and greater than or equal to 1, and the preset candidate area pooling parameters include Width and length parameters for pooling processing;
    根据预设采样点数P将每个所述小区域平均划分为P个采样区域,所述P为正整数且大于或等于2;Divide each small area into P sampling areas on average according to the preset number of sampling points P, where P is a positive integer and greater than or equal to 2;
    确定出所述候选区域特征图中与所述采样区域相交的相交像素点;Determining an intersection pixel point that intersects the sampling area in the candidate area feature map;
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值;Determine the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area;
    根据所述小区域对应的每个所述采样区域的中心点位置的像素值,对应确定出每个所述小区域的像素值;Correspondingly determine the pixel value of each small area according to the pixel value of the center point position of each sampling area corresponding to the small area;
    根据每个所述小区域的像素值获得固定尺寸的所述候选区域特征图。A fixed-size feature map of the candidate area is obtained according to the pixel value of each small area.
  17. 如权利要求16所述的非易失性可读存储介质,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值之前,还包括:The non-volatile readable storage medium according to claim 16, wherein the center point of the sampling area is determined according to the intersection pixel points that intersect the sampling area in the candidate area feature map Before the pixel value of the position, it also includes:
    判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸;Judging whether the size of the sampling area is greater than or equal to the size of the pixel point of the feature map of the candidate area;
    若判断所述采样区域的尺寸大于或等于所述候选区域图像的像素点的尺寸,则以所述采样区域的中心点位置为中心生成所述采样区域对应的辅助框,所述辅助框的形状与所述 候选区域特征图的像素点相同,所述辅助框的尺寸则小于或等于所述候选区域特征图的像素点的尺寸;If it is determined that the size of the sampling area is greater than or equal to the size of the pixel point of the candidate area image, the auxiliary frame corresponding to the sampling area is generated with the center point position of the sampling area as the center, and the shape of the auxiliary frame Same as the pixels of the feature map of the candidate area, the size of the auxiliary frame is smaller than or equal to the size of the pixels of the feature map of the candidate area;
    所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,确定出所述采样区域的中心点位置的像素值,包括:The determining the pixel value of the center point position of the sampling area according to the intersection pixel points that intersect the sampling area in the feature map of the candidate area includes:
    根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值。The pixel value of the center point of the sampling area is determined according to the intersecting pixel points that intersect the sampling area in the feature map of the candidate area and the auxiliary frame corresponding to the sampling area.
  18. 如权利要求17所述的非易失性可读存储介质,其特征在于,所述根据所述候选区域特征图中与所述采样区域相交的相交像素点,以及所述采样区域对应的辅助框,确定所述采样区域的中心点位置的像素值,包括:The non-volatile readable storage medium according to claim 17, wherein the intersecting pixel points that intersect the sampling area in the candidate area feature map and the auxiliary frame corresponding to the sampling area , Determining the pixel value of the center point of the sampling area includes:
    获取所述采样区域对应的辅助框分别和所述相交像素点中各个像素点的相交面积;Acquiring the intersection area of the auxiliary frame corresponding to the sampling area and each pixel in the intersection pixel;
    根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值;Determine the first target pixel value corresponding to the sampling area according to the intersection area of the auxiliary frame corresponding to the sampling area and the respective pixel points, and the pixel value of each pixel point;
    将所述采样区域对应的第一目标像素值,作为所述采样区域的中心点位置的像素值。The first target pixel value corresponding to the sampling area is used as the pixel value at the center point of the sampling area.
  19. 如权利要求18所述的非易失性可读存储介质,其特征在于,所述根据所述采样区域对应的辅助框与所述各个像素点的相交面积,以及所述各个像素点的像素值,确定出所述采样区域对应的第一目标像素值,包括:The non-volatile readable storage medium according to claim 18, wherein the intersecting area of the auxiliary frame corresponding to the sampling area and the respective pixels, and the pixel value of the respective pixels , Determining the first target pixel value corresponding to the sampling area includes:
    分别对应计算所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积;Correspondingly calculate the product between the pixel value of each pixel and the intersection area of each pixel;
    将所述各个像素点的像素值与所述各个像素点的相交面积之间的乘积相加,以得到第一乘积和;Adding the product between the pixel value of each pixel and the intersection area of each pixel to obtain a first product sum;
    计算所述各个像素点的相交面积的和,以得到第一相交像素点面积和;Calculating the sum of the intersecting area of each pixel to obtain the sum of the area of the first intersecting pixel;
    计算第一乘积和与所述第一相交像素点面积之间的商,以获得所述采样区域对应的第一目标像素值。Calculate the quotient between the first product and the area of the first intersecting pixel to obtain the first target pixel value corresponding to the sampling area.
  20. 如权利要求17所述的非易失性可读存储介质,其特征在于,所述判断所述采样区域的尺寸是否大于或等于所述候选区域特征图的像素点的尺寸之后,还包括:17. The nonvolatile readable storage medium according to claim 17, wherein after determining whether the size of the sampling area is greater than or equal to the size of the pixel of the feature map of the candidate area, the method further comprises:
    若判断所述采样区域的尺寸小于所述候选区域特征图的像素点的尺寸,则根据所述候选区域特征图中与所述采样区域相交的各个相交像素点的像素值,以及所述采样区域与所述各个相交像素点的相交面积,确定出所述采样区域对应的第二目标像素值;If it is determined that the size of the sampling area is smaller than the pixel size of the feature map of the candidate area, the pixel value of each intersecting pixel that intersects the sampling area in the feature map of the candidate area and the sampling area Determine the second target pixel value corresponding to the sampling area by the intersection area with each of the intersecting pixel points;
    将所述采样区域对应的第二目标像素值作为所述采样区域的中心点位置的像素值。The second target pixel value corresponding to the sampling area is taken as the pixel value at the center point of the sampling area.
PCT/CN2019/089196 2019-01-24 2019-05-30 Image processing method and apparatus, and computer device and storage medium WO2020151153A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910067366.3 2019-01-24
CN201910067366.3A CN109886279B (en) 2019-01-24 2019-01-24 Image processing method, device, computer equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2020151153A1 true WO2020151153A1 (en) 2020-07-30

Family

ID=66926823

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/089196 WO2020151153A1 (en) 2019-01-24 2019-05-30 Image processing method and apparatus, and computer device and storage medium

Country Status (2)

Country Link
CN (1) CN109886279B (en)
WO (1) WO2020151153A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113553685A (en) * 2021-07-27 2021-10-26 久瓴(江苏)数字智能科技有限公司 Lighting device arrangement method and device and electronic equipment

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110415239B (en) * 2019-08-01 2022-12-16 腾讯科技(深圳)有限公司 Image processing method, image processing apparatus, medical electronic device, and medium
CN110646006A (en) * 2019-09-02 2020-01-03 平安科技(深圳)有限公司 Assembly path planning method and related device
CN111462094A (en) * 2020-04-03 2020-07-28 联觉(深圳)科技有限公司 PCBA component detection method and device and computer readable storage medium
CN112256906A (en) * 2020-10-23 2021-01-22 安徽启新明智科技有限公司 Method, device and storage medium for marking annotation on display screen
CN115393586B (en) * 2022-08-18 2023-06-27 北京爱科农科技有限公司 Farmland breeding area dividing method, device, computer equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170220904A1 (en) * 2015-04-02 2017-08-03 Tencent Technology (Shenzhen) Company Limited Training method and apparatus for convolutional neural network model
CN107145889A (en) * 2017-04-14 2017-09-08 中国人民解放军国防科学技术大学 Target identification method based on double CNN networks with RoI ponds
CN107301383A (en) * 2017-06-07 2017-10-27 华南理工大学 A kind of pavement marking recognition methods based on Fast R CNN
CN108133217A (en) * 2017-11-22 2018-06-08 北京达佳互联信息技术有限公司 Characteristics of image determines method, apparatus and terminal
CN109117876A (en) * 2018-07-26 2019-01-01 成都快眼科技有限公司 A kind of dense small target deteection model building method, model and detection method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2141928A1 (en) * 2008-06-30 2010-01-06 Thomson Licensing S.A. Device and method for analysing an encoded image
CN106599866B (en) * 2016-12-22 2020-06-02 上海百芝龙网络科技有限公司 Multi-dimensional user identity identification method
CN108876791B (en) * 2017-10-23 2021-04-09 北京旷视科技有限公司 Image processing method, device and system and storage medium
CN107808141A (en) * 2017-11-08 2018-03-16 国家电网公司 A kind of electric transmission line isolator explosion recognition methods based on deep learning
CN108764143B (en) * 2018-05-29 2020-11-24 北京字节跳动网络技术有限公司 Image processing method, image processing device, computer equipment and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170220904A1 (en) * 2015-04-02 2017-08-03 Tencent Technology (Shenzhen) Company Limited Training method and apparatus for convolutional neural network model
CN107145889A (en) * 2017-04-14 2017-09-08 中国人民解放军国防科学技术大学 Target identification method based on double CNN networks with RoI ponds
CN107301383A (en) * 2017-06-07 2017-10-27 华南理工大学 A kind of pavement marking recognition methods based on Fast R CNN
CN108133217A (en) * 2017-11-22 2018-06-08 北京达佳互联信息技术有限公司 Characteristics of image determines method, apparatus and terminal
CN109117876A (en) * 2018-07-26 2019-01-01 成都快眼科技有限公司 A kind of dense small target deteection model building method, model and detection method

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113553685A (en) * 2021-07-27 2021-10-26 久瓴(江苏)数字智能科技有限公司 Lighting device arrangement method and device and electronic equipment
CN113553685B (en) * 2021-07-27 2024-03-22 久瓴(江苏)数字智能科技有限公司 Lighting device arrangement method and device and electronic equipment

Also Published As

Publication number Publication date
CN109886279A (en) 2019-06-14
CN109886279B (en) 2023-09-29

Similar Documents

Publication Publication Date Title
WO2020151153A1 (en) Image processing method and apparatus, and computer device and storage medium
CN111079632A (en) Training method and device of text detection model, computer equipment and storage medium
WO2022160787A1 (en) Robot hand-eye calibration method and apparatus, readable storage medium, and robot
US20210342593A1 (en) Method and apparatus for detecting target in video, computing device, and storage medium
WO2022089079A1 (en) Image processing method, apparatus and system, and device and computer-readable storage medium
CN110473233B (en) Registration method, computer device, and storage medium
WO2020233083A1 (en) Image restoration method and apparatus, storage medium, and terminal device
CN111191653A (en) License plate recognition method and device, computer equipment and storage medium
CN109409318B (en) Statistical model training method, statistical device and storage medium
CN111899237A (en) Scale precision measuring method, scale precision measuring device, computer equipment and storage medium
CN115601774A (en) Table recognition method, apparatus, device, storage medium and program product
CN111339724B (en) Method, apparatus and storage medium for generating data processing model and layout
WO2021092797A1 (en) Image registration method, terminal, and computer storage medium
CN115439454A (en) Blister medicine quality detection method and device, computer equipment, medium and product
CN110322477B (en) Feature point observation window setting method, tracking method, device, equipment and medium
CN110211200B (en) Dental arch wire generating method and system based on neural network technology
CN111178126A (en) Target detection method, target detection device, computer equipment and storage medium
CN111192279A (en) Object segmentation method based on edge detection, electronic terminal and storage medium
CN117291790A (en) SAR image registration method, SAR image registration device, SAR image registration equipment and SAR image registration medium
CN116385415A (en) Edge defect detection method, device, equipment and storage medium
CN110956131A (en) Single-target tracking method, device and system
US11928808B2 (en) Wafer detection method, device, apparatus, and storage medium
CN113344961B (en) Image background segmentation method, device, computing equipment and storage medium
WO2022120669A1 (en) Gesture recognition method, computer device and storage medium
CN112308889B (en) Point cloud registration method and storage medium by utilizing rectangle and oblateness information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19911275

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS (EPO FORM 1205A DATED 14.09.2021)

122 Ep: pct application non-entry in european phase

Ref document number: 19911275

Country of ref document: EP

Kind code of ref document: A1