WO2023071121A1 - Multi-model fusion-based object detection method and apparatus, device and medium - Google Patents

Multi-model fusion-based object detection method and apparatus, device and medium Download PDF

Info

Publication number
WO2023071121A1
WO2023071121A1 PCT/CN2022/090233 CN2022090233W WO2023071121A1 WO 2023071121 A1 WO2023071121 A1 WO 2023071121A1 CN 2022090233 W CN2022090233 W CN 2022090233W WO 2023071121 A1 WO2023071121 A1 WO 2023071121A1
Authority
WO
WIPO (PCT)
Prior art keywords
target detection
initial
models
model
image
Prior art date
Application number
PCT/CN2022/090233
Other languages
French (fr)
Chinese (zh)
Inventor
金良
李仁刚
赵雅倩
郭振华
范宝余
徐聪
胡克坤
Original Assignee
苏州浪潮智能科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 苏州浪潮智能科技有限公司 filed Critical 苏州浪潮智能科技有限公司
Publication of WO2023071121A1 publication Critical patent/WO2023071121A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/25Fusion techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present application relates to the technical field of image processing, in particular to a method, device, equipment and medium for target detection based on multi-model fusion.
  • Object detection is an important branch of computer vision and the first step in visual perception. Object detection is not only to determine what is in the image, but also to determine where the object is in the image.
  • model integration algorithms are: multi-model direct averaging method, single-model multiple snapshot integration (Snapshots Ensemble) average fusion and AABBFI algorithm (Axis-Aligned Bounding Box Fuzzy Integral, axis-aligned bounding box fuzzy integral).
  • the multi-model direct averaging method directly averages the output results of multiple models without considering the differences between different models, which makes the detection accuracy limited; single-model multiple snapshot integration average fusion is For the integration of a single network model, because the differences between different models are not considered, the model diversity is insufficient; because the AABBFI algorithm only fuses the position of the detection frame (ie Bounding Box) in the detection result in a single model, it does not Considering other factors in the model and other models, therefore, leads to limited accuracy improvement of this fusion algorithm.
  • the purpose of the present application is to provide an object detection method, device, equipment and medium based on multi-model fusion, which can improve the accuracy of object detection.
  • the specific plan is as follows:
  • the present application discloses a target detection method based on multi-model fusion, including:
  • each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models
  • the trained multiple target detection models before acquiring the trained multiple target detection models, it also includes:
  • a plurality of target detection models to be trained are screened out by using screening conditions constructed based on model structure differences; wherein, the model structure differences between different target detection models to be trained all meet preset difference conditions;
  • the multiple target detection models to be trained are trained to obtain multiple trained target detection models.
  • the acquisition of the image set to be detected obtained after performing image enhancement processing on the original image to be detected includes:
  • corresponding image enhancement processing is performed on the original image to be detected, so as to obtain a set of images to be detected corresponding to each of the target detection models.
  • weighting all initial target detection results in any set of initial target detection results includes:
  • weighting is performed on all the initial target detection results in the group of the initial target detection results, so as to obtain an initial weighted target detection result corresponding to the target detection model.
  • the described target detection method based on multi-model fusion also includes:
  • the weight of each target detection model is determined based on the mean average precision evaluation index corresponding to each target detection model and the sum of the evaluation indexes.
  • the mean average precision evaluation index corresponding to any of the trained target detection models including:
  • an average average precision evaluation index of the target detection model is determined.
  • using the trained target detection model to predict each image in the verification set includes:
  • Weighting is performed on multiple initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
  • the present application discloses a target detection device based on multi-model fusion, including:
  • the obtaining module is used to obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected;
  • An image detection module configured to use each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
  • a single-model weighting module configured to weight all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
  • a multi-model weighting module configured to weight the multiple initial weighted target detection results corresponding to multiple target detection models based on the weight of each target detection model determined in advance using the verification set, to obtain the The final target detection result corresponding to the original image to be detected.
  • the present application discloses an electronic device, including a processor and a memory; wherein, when the processor executes the computer program stored in the memory, the aforementioned multi-model fusion-based target detection method is implemented.
  • the present application discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, the aforementioned multi-model fusion-based target detection method is implemented.
  • the present application first obtains a plurality of target detection models that have been trained, and obtains the image set to be detected after image enhancement processing is performed on the original image to be detected, and then uses each of the target detection models to perform the detection of the target detection model.
  • the images to be detected in the image set are detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then all initial target detection results in each set of initial target detection results are respectively weighted to obtain Obtaining the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multiple initial weighted target detection models corresponding to multiple target detection models The weighted target detection results are weighted to obtain a final target detection result corresponding to the original image to be detected.
  • this application reduces the difference in single-model detection results caused by image enhancement processing on the original image, and improves the robustness of the model.
  • the diversity of the models can be fully utilized and the accuracy of target detection can be improved.
  • Fig. 1 is a flow chart of a target detection method based on multi-model fusion disclosed in the present application
  • FIG. 2 is a flow chart of a specific multi-model-based weight calculation method disclosed in the present application
  • Fig. 3 is a schematic diagram of a target detection structure of a single model and a single image disclosed in the present application;
  • FIG. 4 is a flow chart of a specific multi-model fusion-based target detection method disclosed in the present application.
  • FIG. 5 is a schematic structural diagram of a target detection device based on multi-model fusion disclosed in the present application.
  • FIG. 6 is a structural diagram of an electronic device disclosed in the present application.
  • the embodiment of the present application discloses a target detection method based on multi-model fusion, as shown in Figure 1, the method includes:
  • Step S11 Obtain a plurality of trained target detection models, and obtain a set of images to be detected obtained after performing image enhancement processing on the original images to be detected.
  • the target detection model includes, but is not limited to, three types of models: One-stage, Two-stage, and Anchor-free;
  • the One-stage model includes, but is not limited to, RetinaNet , YOLOV2/V3 (YOLO, You Only Look Once), etc.;
  • Two-stage models include but not limited to Faster R-CNN, etc.;
  • Anchor-free models include but not limited to CornerNet, ExtremeNet, CenterNet, FCOS (Fully Convolutional One-Stage Object Detection, first-order full convolution target detection), etc.
  • the training conditions constructed based on model structure differences to screen out multiple target detection models to be trained;
  • the model structure differences between the target detection models to be trained all meet the preset difference conditions; using the training set obtained after image enhancement is performed on the historical original image set, a plurality of the target detection models to be trained are trained to obtain the trained Good multiple of the object detection models.
  • the rules based on the size of the model structure differences that meet the preset difference conditions can be paired.
  • Multiple initial target detection models are screened to obtain multiple target detection models to be trained. For example, five target detection models, YOLOV3, RetinaNet, Faster R-CNN, CenterNet and FCOS, are selected from the above three types of models, namely One-stage, Two-stage and Anchor-free.
  • image enhancement processing can be performed on historical original images to obtain a training set, and the above-mentioned training set can be used to filter
  • the above-mentioned multiple target detection models to be trained are trained to obtain multiple trained target detection models.
  • the acquiring the image set to be detected obtained after performing image enhancement processing on the original image to be detected may specifically include: according to the model category of each target detection model, determining the corresponding target detection model Image enhancement algorithm: using the image enhancement algorithm corresponding to each of the target detection models to perform corresponding image enhancement processing on the original image to be detected, so as to obtain a set of images to be detected corresponding to each of the target detection models.
  • image enhancement algorithms include but not limited to image enhancement algorithms based on geometric transformation and color transformation; wherein, image enhancement algorithms based on geometric transformation include but not limited to random cropping (ie Random Cropping), random expansion (ie Random Expansion), Random horizontal flip (that is, Random Horizontal Flip), random zoom (that is, Random Resize), etc.; image enhancement algorithms based on color transformation include but are not limited to color dithering, Fancy PCA (PCA, Principal Component Analysis, principal component analysis), etc.
  • PCA Principal Component Analysis, principal component analysis
  • the target image enhancement algorithm in the process of adding the target image enhancement algorithm to the above-mentioned multiple target detection models to be trained, it should be selectively added according to the functions of the target detection models to be trained, and the image with repeated functions will be enhanced Algorithm to remove.
  • the selected image enhancement algorithm Random Cropping is eliminated in the Two-stage class model, because the RPN (Region Proposal Network, Region Generation Network) network in the Two-stage has similar functions to Random Cropping.
  • each An image enhancement algorithm corresponding to each of the target detection models it can be determined that each An image enhancement algorithm corresponding to each of the target detection models, and using the image enhancement algorithm corresponding to each target detection model to perform corresponding image enhancement processing on the original image to be detected, to obtain an image to be detected corresponding to each of the target detection models set.
  • Step S12 Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the plurality of target detection models.
  • each of the target detection models is used to detect the above-mentioned target detection models. All the images to be detected in the image set are detected, and multiple sets of initial target detection results corresponding to multiple target detection models output by each target detection model are obtained. Wherein, the number of each group of initial target detection results is the same as the number of images included in the image set to be detected after image enhancement processing.
  • Step S13 weighting all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models.
  • each of the target detection models After using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the multiple target detection models, it is necessary to All the initial target detection results corresponding to each of the target detection models are weighted, that is, the internal detection results of a single model are weighted to obtain the initial weighted target detection results corresponding to each of the target detection models.
  • Step S14 Based on the weight of each target detection model determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the corresponding to the original image to be detected The final target detection result.
  • the pre-used verification set pairs are obtained. After each target detection model is trained, determine the weight corresponding to each target detection model, and use the above weight to perform weighting processing on the multiple initial weighted target detection results corresponding to multiple target detection models, That is, different weight values are assigned to the detection results corresponding to the multiple target detection models, and then the final target detection result corresponding to the original image to be detected is obtained.
  • the embodiment of the present application firstly obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected, and then use each of the target detection models to separately Detecting the images to be detected in the image set to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then weighting all the initial target detection results in each set of initial target detection results , to obtain the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multiple target detection models corresponding to multiple target detection models The initial weighted target detection result is weighted to obtain the final target detection result corresponding to the original image to be detected.
  • the embodiment of the present application discloses a specific target detection method based on multi-model fusion, as shown in Figure 2, the method includes:
  • Step S21 Obtain a plurality of trained target detection models, and obtain an image set to be detected obtained after performing image enhancement processing on the original image to be detected.
  • Step S22 Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the multiple target detection models.
  • Step S23 Clustering all the initial object detection results in any group of the initial object detection results to obtain a first clustering result corresponding to the group of the initial object detection results.
  • the aforementioned preset clustering algorithms include but are not limited to K-means algorithm (ie K-means Clustering Algorithm, k-means clustering algorithm) and the like.
  • Step S24 According to the first clustering result, and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, determine the weight of each of the initial target detection results.
  • the specific process of suppressing the non-maximum value in the above-mentioned first clustering result based on the non-maximum value suppression algorithm is as follows: obtain the index corresponding to the maximum value in the first clustering result, and compare it with the above-mentioned The clustering result of the index corresponding to the maximum value is removed from the first clustering result to obtain the target clustering result, and the clustering results of all the detection frames in the above target clustering result and the index corresponding to the above maximum value are calculated respectively. The intersection and union ratio of the detection frame, and determine the weight corresponding to each initial target detection result according to the intersection and union ratio.
  • Step S25 Based on the weight of each of the initial target detection results, weight all the initial target detection results in the group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to the target detection model .
  • each of the above-mentioned initial targets can be used
  • the weight value of the detection result performs corresponding weighting processing on all the above-mentioned initial target detection results in the group of the initial target detection results, and obtains corresponding primary weighted target detection results corresponding to the target detection model.
  • the above-mentioned process of obtaining the initial weighted target detection result corresponding to the target detection model is mainly based on the AABBFI algorithm.
  • the specific implementation process of the AABBFI algorithm is shown in Figure 3.
  • the input single original image to be detected is Perform data enhancement processing to obtain the image set to be detected, and then use the trained single target detection model to infer the images in the above image set to be detected based on the AABB (Axis-Aligned Bounding Box) algorithm, and obtain multiple Inference results, and based on the FI (Fuzzy Integral, fuzzy integral) algorithm, perform fuzzy operations on the above inference results to obtain target detection results.
  • AABB Aligned Bounding Box
  • FI Fuzzy Integral, fuzzy integral
  • Step S26 Based on the weight of each target detection model determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the target detection results corresponding to the original image to be detected The final target detection result.
  • weighting is performed on all the initial target detection results in the group of initial target detection results to obtain the initial weighted target corresponding to the target detection model
  • the target detection result is weighted to obtain the final target detection result corresponding to the original image to be detected.
  • the target detection method based on multi-model fusion may specifically include:
  • Step S31 Determine the mean value and average precision evaluation index corresponding to each of the trained target detection models based on the verification set;
  • Step S32 Summing the mean and average precision evaluation indicators corresponding to all the target detection models to obtain the sum of corresponding evaluation indicators
  • Step S33 Determine the weight of each target detection model based on the mean value average precision evaluation index corresponding to each target detection model and the sum of the evaluation indexes.
  • the verification set can be used to determine the index for measuring the accuracy of target detection and recognition corresponding to each of the above target detection models, that is, the mean average precision evaluation index ( mAP, mean Average Precision). Further, the sum of the mean average precision evaluation indicators corresponding to all the above target detection models is calculated to obtain the corresponding sum of evaluation indicators. Then, the weight corresponding to each target detection model can be determined based on the ratio of the mean average precision evaluation index corresponding to each target detection model to the sum of the above evaluation indexes.
  • the mean average precision evaluation index mAP, mean Average Precision
  • determining the mean average precision evaluation index corresponding to any of the trained target detection models based on the verification set may specifically include: using the trained target detection model to evaluate each target detection model in the verification set Prediction of each image in order to obtain the prediction result corresponding to each image in the verification set output by the target detection model; based on the difference between the prediction result and the corresponding real labeling result in the verification set, determine the The mean average precision evaluation index of the target detection model.
  • the trained target detection model is used to predict each image in the verification set, and the prediction result output by the target detection model corresponding to each image in the verification set can be obtained. Further, based on the above The degree of difference between the prediction result and the corresponding real labeling result (ie, Ground Truth) in the verification set can determine the mean average precision evaluation index corresponding to the above-mentioned target detection model.
  • using the trained target detection model to predict each image in the verification set may include: respectively performing image enhancement on each image in the verification set to obtain the corresponding A plurality of enhanced images; using the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial predictions corresponding to each image in the verification set Result: performing weighting processing on multiple initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
  • image enhancement processing can be performed on each image in the verification set respectively to obtain Multiple enhanced images corresponding to each image, and then use the trained target detection model to predict the multiple enhanced images corresponding to each image in the verification set, and then obtain multiple enhanced images corresponding to each image in the verification set
  • the multiple initial prediction results corresponding to each image in the above verification set are weighted, and the corresponding to each image in the above verification set is obtained. forecast result.
  • the first clustering result corresponding to the initial target detection results of the group is obtained, and then according to the first The overlapping degree of detection frames corresponding to different cluster centers in the clustering results determines the weight of each of the initial target detection results corresponding to each cluster center, and based on the weight of each of the initial target detection results, the group of All the initial target detection results in the initial target detection results are weighted to obtain an initial weighted target detection result corresponding to the target detection model.
  • the embodiment of the present application also discloses a target detection device based on multi-model fusion, as shown in Fig. 5, the device includes:
  • the acquiring module 11 is used to acquire a plurality of target detection models that have been trained, and acquire an image set to be detected obtained after performing image enhancement processing on the original image to be detected;
  • the image detection module 12 is configured to use each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
  • a single-model weighting module 13 configured to weight all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
  • the multi-model weighting module 14 is used to weight the multiple initial weighted target detection results corresponding to multiple target detection models based on the weight of each target detection model determined in advance using the verification set, to obtain the weighted target detection results corresponding to the target detection models. Describe the final target detection result corresponding to the original image to be detected.
  • the embodiment of the present application first obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected, and then use each of the target detection models to The images to be detected in the image set to be detected are detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then all initial target detection results in each set of initial target detection results are respectively weighting to obtain the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multi The initial weighted target detection results are weighted to obtain the final target detection result corresponding to the original image to be detected.
  • the acquisition module 11 before the acquisition module 11, it may also include:
  • the model screening unit is used to screen out a plurality of target detection models to be trained by using screening conditions constructed based on model structure differences; wherein, the model structure differences between different target detection models to be trained all meet the preset difference conditions;
  • the first training unit is configured to use a training set obtained after performing image enhancement on a historical original image set to train a plurality of target detection models to be trained, so as to obtain a plurality of trained target detection models.
  • the acquisition module 11 may specifically include:
  • An algorithm determining unit configured to determine an image enhancement algorithm corresponding to each of the target detection models according to the model category of each of the target detection models
  • the first image enhancement unit is configured to use the image enhancement algorithm corresponding to each of the target detection models to perform corresponding image enhancement processing on the original image to be detected, so as to obtain the target to be detected corresponding to each of the target detection models image set.
  • the weighting of all the initial target detection results in any group of the initial target detection results may specifically include:
  • the first clustering unit is configured to cluster all the initial target detection results in any group of the initial target detection results to obtain a first clustering result corresponding to the group of the initial target detection results;
  • the first weight determination unit is configured to determine the weight of each of the initial target detection results based on the first clustering result and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm;
  • the first weighting unit is configured to weight all the initial target detection results in the group of initial target detection results based on the weight of each of the initial target detection results, so as to obtain the initial target detection model corresponding to the corresponding target. Weighted object detection results.
  • the target detection device based on multi-model fusion may also include:
  • the first evaluation index determination unit is configured to determine the mean average precision evaluation index corresponding to each of the trained target detection models based on the verification set;
  • a summation unit configured to sum the mean average precision evaluation indicators corresponding to all the target detection models to obtain the corresponding evaluation index sum
  • the second weight determination unit is configured to determine the weight of each of the target detection models based on the mean average precision evaluation index corresponding to each of the target detection models and the sum of the evaluation indexes.
  • the first evaluation indicator determining unit may specifically include:
  • a first prediction unit configured to use the trained target detection model to predict each image in the verification set, so as to obtain a prediction result output by the target detection model corresponding to each image in the verification set;
  • the second evaluation index determination unit is configured to determine the mean average precision evaluation index of the target detection model based on the difference between the prediction result and the corresponding real labeling result in the verification set.
  • the first prediction unit may specifically include:
  • the second image enhancement unit is used to respectively perform image enhancement on each image in the verification set to obtain a plurality of enhanced images corresponding to each image in the verification set;
  • the second prediction unit is configured to use the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial images corresponding to each image in the verification set forecast result;
  • the second weighting unit is configured to respectively perform weighting processing on a plurality of initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
  • FIG. 6 is a structural diagram of an electronic device 20 according to an exemplary embodiment.
  • the content in the figure should not be regarded as any limitation on the application scope of the present application.
  • FIG. 6 is a schematic structural diagram of an electronic device 20 provided by an embodiment of the present application.
  • the electronic device 20 may specifically include: at least one processor 21 , at least one memory 22 , a power supply 23 , a communication interface 24 , an input/output interface 25 and a communication bus 26 .
  • the memory 22 is used to store a computer program, and the computer program is loaded and executed by the processor 21 to implement relevant steps in the multi-model fusion-based target detection method disclosed in any of the foregoing embodiments.
  • the electronic device 20 in this embodiment may specifically be an electronic computer.
  • the power supply 23 is used to provide working voltage for each hardware device on the electronic device 20;
  • the communication interface 24 can create a data transmission channel between the electronic device 20 and external devices, and the communication protocol it follows is applicable Any communication protocol in the technical solution of the present application is not specifically limited here;
  • the input and output interface 25 is used to obtain external input data or output data to the external, and its specific interface type can be selected according to specific application needs, here Not specifically limited.
  • the memory 22, as a resource storage carrier can be a read-only memory, random access memory, magnetic disk or optical disk, etc., and the resources stored thereon can include operating system 221, computer program 222, etc., and the storage method can be temporary storage or permanent storage. .
  • the operating system 221 is used to manage and control each hardware device on the electronic device 20 and the computer program 222, which may be Windows Server, Netware, Unix, Linux, etc.
  • the computer program 222 can also optionally include a computer program that can be used to complete other specific tasks. Computer program.
  • the present application also discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, the aforementioned multi-model fusion-based target detection method is implemented.
  • a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, the aforementioned multi-model fusion-based target detection method is implemented.
  • the specific steps of the method reference may be made to the corresponding content disclosed in the foregoing embodiments, and details are not repeated here.
  • each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same or similar parts of each embodiment can be referred to each other.
  • the description is relatively simple, and for the related information, please refer to the description of the method part.
  • RAM random access memory
  • ROM read-only memory
  • EEPROM electrically programmable ROM
  • EEPROM electrically erasable programmable ROM
  • registers hard disk, removable disk, CD-ROM, or any other Any other known storage medium.

Abstract

A multi-model fusion-based object detection method and apparatus, a device and a medium, the method comprising: acquiring a plurality of trained object detection models, and acquiring a set of images to be detected obtained after image enhancement processing is performed on original images to be detected (S11); respectively using the object detection models to detect images to be detected in said set of images, so as to obtain a plurality of groups of initial object detection results corresponding to the plurality of object detection models (S12); respectively weighting all the initial object detection results in each group of initial object detection results, so as to obtain primary weighted object detection results respectively corresponding to the object detection models (S13); and weighting, on the basis of the weight of each object detection model determined in advance by using a validation set, the plurality of primary weighted object detection results corresponding to the plurality of object detection models, so as to obtain final object detection results corresponding to said original images (S14). The diversity of the models is fully used, thereby improving the precision of object detection.

Description

一种基于多模型融合的目标检测方法、装置、设备及介质A target detection method, device, equipment and medium based on multi-model fusion
本申请要求在2021年10月26日提交中国专利局、申请号为202111244219.2、发明名称为“一种基于多模型融合的目标检测方法、装置、设备及介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application submitted to the China Patent Office on October 26, 2021, with the application number 202111244219.2, and the title of the invention is "a target detection method, device, equipment and medium based on multi-model fusion". The entire contents are incorporated by reference in this application.
技术领域technical field
本申请涉及图像处理技术领域,特别涉及一种基于多模型融合的目标检测方法、装置、设备及介质。The present application relates to the technical field of image processing, in particular to a method, device, equipment and medium for target detection based on multi-model fusion.
背景技术Background technique
目前,随着大数据和人工智能的蓬勃发展,基于深度学习的计算机视觉也在各个领域得到了广泛的应用,如自主车辆的视觉导航、医学图像分析和人脸识别等领域。目标检测是计算机视觉的一个重要分支,也是视觉感知的第一步。目标检测不仅要确定出图像中有什么,还需确定出目标在图像中的什么位置。At present, with the vigorous development of big data and artificial intelligence, computer vision based on deep learning has also been widely used in various fields, such as visual navigation of autonomous vehicles, medical image analysis, and face recognition. Object detection is an important branch of computer vision and the first step in visual perception. Object detection is not only to determine what is in the image, but also to determine where the object is in the image.
当前,关于目标检测的网络模型和算法层出不穷。通常情况下,仅使用一种算法无法获取到更精确的结果,因此需要将多种学习算法组合起来得到一个较为准确的结果。在目标检测领域中,通常采用的模型集成算法有:多模型直接平均法、单模型多个快照集成(即Snapshots Ensemble)平均融合及AABBFI算法(Axis-Aligned Bounding Box Fuzzy Integral,轴对齐包围盒模糊积分)。在上述三种模型集成算法中,多模型直接平均法是直接对多个模型的输出结果做平均,并未考虑不同模型之间的差异,使得检测精度有限;单模型多个快照集成平均融合是对单个网络模型的集成,由于也未考虑不同模型之间的差异,使得模型多样性不足;AABBFI算法由于仅在单模型中针对检测结果中的检测框(即Bounding Box)位置做融合,并未考虑模型中的其他因素及其他模型,因此,导致此融合算法精度提升有限。Currently, there are endless network models and algorithms for target detection. Usually, more accurate results cannot be obtained by using only one algorithm, so it is necessary to combine multiple learning algorithms to obtain a more accurate result. In the field of target detection, the commonly used model integration algorithms are: multi-model direct averaging method, single-model multiple snapshot integration (Snapshots Ensemble) average fusion and AABBFI algorithm (Axis-Aligned Bounding Box Fuzzy Integral, axis-aligned bounding box fuzzy integral). Among the above three model integration algorithms, the multi-model direct averaging method directly averages the output results of multiple models without considering the differences between different models, which makes the detection accuracy limited; single-model multiple snapshot integration average fusion is For the integration of a single network model, because the differences between different models are not considered, the model diversity is insufficient; because the AABBFI algorithm only fuses the position of the detection frame (ie Bounding Box) in the detection result in a single model, it does not Considering other factors in the model and other models, therefore, leads to limited accuracy improvement of this fusion algorithm.
综上所述,如何提高目标检测的精度是本领域有待解决的问题。To sum up, how to improve the accuracy of object detection is a problem to be solved in this field.
发明内容Contents of the invention
有鉴于此,本申请的目的在于提供一种基于多模型融合的目标检测方法、装置、设备及介质,能够提高目标检测的精度。其具体方案如下:In view of this, the purpose of the present application is to provide an object detection method, device, equipment and medium based on multi-model fusion, which can improve the accuracy of object detection. The specific plan is as follows:
第一方面,本申请公开了一种基于多模型融合的目标检测方法,包括:In the first aspect, the present application discloses a target detection method based on multi-model fusion, including:
获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集;Obtain multiple target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected;
分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果;Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果;Weighting all the initial target detection results in each group of the initial target detection results respectively, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型 对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。Based on the weight of each of the target detection models determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the final result corresponding to the original image to be detected. Target detection results.
可选的,所述获取已训练好的多个目标检测模型之前,还包括:Optionally, before acquiring the trained multiple target detection models, it also includes:
利用基于模型结构差异构建的筛选条件,筛选出多个待训练目标检测模型;其中,不同所述待训练目标检测模型之间的模型结构差异均满足预设差异条件;A plurality of target detection models to be trained are screened out by using screening conditions constructed based on model structure differences; wherein, the model structure differences between different target detection models to be trained all meet preset difference conditions;
利用对历史原始图像集进行图像增强后得到的训练集,对多个所述待训练目标检测模型进行训练,以得到已训练好的多个所述目标检测模型。Using the training set obtained after performing image enhancement on the historical original image set, the multiple target detection models to be trained are trained to obtain multiple trained target detection models.
可选的,所述获取对待检测原始图像进行图像增强处理后得到的待检测图像集,包括:Optionally, the acquisition of the image set to be detected obtained after performing image enhancement processing on the original image to be detected includes:
根据每个所述目标检测模型的模型类别,确定每个所述目标检测模型对应的图像增强算法;Determine an image enhancement algorithm corresponding to each of the target detection models according to the model category of each of the target detection models;
利用每个所述目标检测模型对应的图像增强算法,对所述待检测原始图像进行相应的图像增强处理,以得到与每个所述目标检测模型对应的待检测图像集。Using the image enhancement algorithm corresponding to each of the target detection models, corresponding image enhancement processing is performed on the original image to be detected, so as to obtain a set of images to be detected corresponding to each of the target detection models.
可选的,对任一组所述初始目标检测结果中的所有初始目标检测结果进行加权,包括:Optionally, weighting all initial target detection results in any set of initial target detection results includes:
对任一组所述初始目标检测结果中的所有初始目标检测结果进行聚类,以得到与该组所述初始目标检测结果对应的第一聚类结果;clustering all the initial target detection results in any group of the initial target detection results to obtain a first clustering result corresponding to the group of the initial target detection results;
根据所述第一聚类结果,并基于轴对齐包围盒模糊积分算法和非极大值抑制算法,确定各个所述初始目标检测结果的权重;According to the first clustering result, and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, determine the weight of each of the initial target detection results;
基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结果中的所有所述初始目标检测结果进行加权,以得到相应所述目标检测模型对应的初次加权目标检测结果。Based on the weight of each of the initial target detection results, weighting is performed on all the initial target detection results in the group of the initial target detection results, so as to obtain an initial weighted target detection result corresponding to the target detection model.
可选的,所述的基于多模型融合的目标检测方法,还包括:Optionally, the described target detection method based on multi-model fusion also includes:
基于验证集确定出已训练好的每个所述目标检测模型对应的均值平均精度评价指标;Based on the verification set, determine the mean value and average precision evaluation index corresponding to each of the target detection models that have been trained;
对所有所述目标检测模型对应的均值平均精度评价指标进行求和,以得到相应的评价指标总和;Summing the mean and average precision evaluation indicators corresponding to all the target detection models to obtain the corresponding evaluation index sum;
基于每个所述目标检测模型对应的均值平均精度评价指标以及所述评价指标总和,确定出每个所述目标检测模型的权重。The weight of each target detection model is determined based on the mean average precision evaluation index corresponding to each target detection model and the sum of the evaluation indexes.
可选的,基于所述验证集确定出已训练好的任一所述目标检测模型对应的均值平均精度评价指标,包括:Optionally, based on the verification set, determine the mean average precision evaluation index corresponding to any of the trained target detection models, including:
利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,以得到所述目标检测模型输出的与所述验证集中每张图像对应的预测结果;Using the trained target detection model to predict each image in the verification set, so as to obtain a prediction result output by the target detection model corresponding to each image in the verification set;
基于所述预测结果与所述验证集中对应的真实标注结果之间的差异,确定出所述目标检测模型的均值平均精度评价指标。Based on the difference between the prediction result and the corresponding real labeling result in the verification set, an average average precision evaluation index of the target detection model is determined.
可选的,所述利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,包括:Optionally, using the trained target detection model to predict each image in the verification set includes:
分别对验证集中的每张图像进行图像增强,得到所述验证集中每张图像对应的多个增强后图像;Carry out image enhancement to each image in the verification set respectively, and obtain a plurality of enhanced images corresponding to each image in the verification set;
利用已训练好的所述目标检测模型对所述验证集中每张图像对应的多个增强后图像进行预测,以得到与所述验证集中每张图像对应的多个初始预测结果;Using the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial prediction results corresponding to each image in the verification set;
分别对所述验证集中每张图像对应的多个初始预测结果进行加权处理,以得到与所述验证集中每张图像对应的预测结果。Weighting is performed on multiple initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
第二方面,本申请公开了一种基于多模型融合的目标检测装置,包括:In the second aspect, the present application discloses a target detection device based on multi-model fusion, including:
获取模块,用于获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集;The obtaining module is used to obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected;
图像检测模块,用于分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果;An image detection module, configured to use each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
单模型加权模块,用于分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果;A single-model weighting module, configured to weight all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
多模型加权模块,用于基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。A multi-model weighting module, configured to weight the multiple initial weighted target detection results corresponding to multiple target detection models based on the weight of each target detection model determined in advance using the verification set, to obtain the The final target detection result corresponding to the original image to be detected.
第三方面,本申请公开了一种电子设备,包括处理器和存储器;其中,所述处理器执行所述存储器中保存的计算机程序时实现前述的基于多模型融合的目标检测方法。In a third aspect, the present application discloses an electronic device, including a processor and a memory; wherein, when the processor executes the computer program stored in the memory, the aforementioned multi-model fusion-based target detection method is implemented.
第四方面,本申请公开了一种计算机可读存储介质,用于存储计算机程序;其中,所述计算机程序被处理器执行时实现前述的基于多模型融合的目标检测方法。In a fourth aspect, the present application discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, the aforementioned multi-model fusion-based target detection method is implemented.
可见,本申请先从获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集,然后分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果,再分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果,最后基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。可见,本申请通过对每组初始目标检测结果中的所有初始目标检测结果进行加权处理,降低了由于对原始图像进行图像增强处理而导致的单模型检测结果差异,提升了模型的鲁棒性,通过基于训练集将不同模型赋予不同的权重,能够充分发挥模型的多样性,提高目标检测的精确度。It can be seen that the present application first obtains a plurality of target detection models that have been trained, and obtains the image set to be detected after image enhancement processing is performed on the original image to be detected, and then uses each of the target detection models to perform the detection of the target detection model. The images to be detected in the image set are detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then all initial target detection results in each set of initial target detection results are respectively weighted to obtain Obtaining the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multiple initial weighted target detection models corresponding to multiple target detection models The weighted target detection results are weighted to obtain a final target detection result corresponding to the original image to be detected. It can be seen that, by weighting all the initial target detection results in each group of initial target detection results, this application reduces the difference in single-model detection results caused by image enhancement processing on the original image, and improves the robustness of the model. By assigning different weights to different models based on the training set, the diversity of the models can be fully utilized and the accuracy of target detection can be improved.
附图说明Description of drawings
为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据提供的附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only It is an embodiment of the present application, and those skilled in the art can also obtain other drawings according to the provided drawings without creative work.
图1为本申请公开的一种基于多模型融合的目标检测方法流程图;Fig. 1 is a flow chart of a target detection method based on multi-model fusion disclosed in the present application;
图2为本申请公开的一种具体的基于多模型权重计算方法流程图;FIG. 2 is a flow chart of a specific multi-model-based weight calculation method disclosed in the present application;
图3为本申请公开的一种单模型单张图像进行目标检测结构示意图;Fig. 3 is a schematic diagram of a target detection structure of a single model and a single image disclosed in the present application;
图4为本申请公开的一种具体的基于多模型融合的目标检测方法流程图;FIG. 4 is a flow chart of a specific multi-model fusion-based target detection method disclosed in the present application;
图5为本申请公开的一种基于多模型融合的目标检测装置结构示意图;FIG. 5 is a schematic structural diagram of a target detection device based on multi-model fusion disclosed in the present application;
图6为本申请公开的一种电子设备结构图。FIG. 6 is a structural diagram of an electronic device disclosed in the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实 施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.
本申请实施例公开了一种基于多模型融合的目标检测方法,参见图1所示,该方法包括:The embodiment of the present application discloses a target detection method based on multi-model fusion, as shown in Figure 1, the method includes:
步骤S11:获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集。Step S11: Obtain a plurality of trained target detection models, and obtain a set of images to be detected obtained after performing image enhancement processing on the original images to be detected.
本实施例中,首先需要获取预先对多个待训练目标检测模型进行训练后确定出的多个目标检测模型,然后利用与已训练好的上述多个目标检测模型对应的图像增强算法对待检测原始图像进行图像增强处理,得到待检测图像集。其中,所述目标检测模型包括但不限于单步(One-stage)、两步(Two-stage)及无锚(Anchor-free)3类模型;所述One-stage类模型包括但不限于RetinaNet、YOLOV2/V3(YOLO,即You Only Look Once)等;Two-stage类模型包括但不限于Faster R-CNN等;Anchor-free类模型包括但不限于CornerNet、ExtremeNet、CenterNet、FCOS(即Fully Convolutional One-Stage Object Detection,一阶全卷积目标检测)等。In this embodiment, it is first necessary to obtain multiple target detection models determined after pre-training multiple target detection models to be trained, and then use the image enhancement algorithm corresponding to the trained multiple target detection models The image is subjected to image enhancement processing to obtain the image set to be detected. Wherein, the target detection model includes, but is not limited to, three types of models: One-stage, Two-stage, and Anchor-free; the One-stage model includes, but is not limited to, RetinaNet , YOLOV2/V3 (YOLO, You Only Look Once), etc.; Two-stage models include but not limited to Faster R-CNN, etc.; Anchor-free models include but not limited to CornerNet, ExtremeNet, CenterNet, FCOS (Fully Convolutional One-Stage Object Detection, first-order full convolution target detection), etc.
本实施例中,在所述获取已训练好的多个目标检测模型之前,具体还可以包括:利用基于模型结构差异构建的筛选条件,筛选出多个待训练目标检测模型;其中,不同所述待训练目标检测模型之间的模型结构差异均满足预设差异条件;利用对历史原始图像集进行图像增强后得到的训练集,对多个所述待训练目标检测模型进行训练,以得到已训练好的多个所述目标检测模型。可以理解的是,为了提升多个目标检测模型融合之后的检测结果,保证模型的多样性,在对多个目标检测模型进行训练之前,可以基于模型结构差异的大小满足预设差异条件的规则对多个初始目标检测模型进行筛选,得到多个待训练目标检测模型。例如,从上述One-stage、Two-stage及Anchor-free这3类模型中挑选出模型之间结构差异较大的YOLOV3、RetinaNet、Faster R-CNN、CenterNet和FCOS这5个目标检测模型。In this embodiment, before the acquisition of multiple target detection models that have been trained, it may specifically include: using screening conditions constructed based on model structure differences to screen out multiple target detection models to be trained; The model structure differences between the target detection models to be trained all meet the preset difference conditions; using the training set obtained after image enhancement is performed on the historical original image set, a plurality of the target detection models to be trained are trained to obtain the trained Good multiple of the object detection models. It is understandable that, in order to improve the detection results after the fusion of multiple target detection models and ensure the diversity of the models, before training multiple target detection models, the rules based on the size of the model structure differences that meet the preset difference conditions can be paired. Multiple initial target detection models are screened to obtain multiple target detection models to be trained. For example, five target detection models, YOLOV3, RetinaNet, Faster R-CNN, CenterNet and FCOS, are selected from the above three types of models, namely One-stage, Two-stage and Anchor-free.
可选地,在筛选出多个上述待训练目标检测模型之后,为了提升模型的泛化能力以及样本的多样性,可以对历史原始图像进行图像增强处理得到训练集,并利用上述训练集对筛选出的上述多个待训练目标检测模型进行训练,得到已训练好的多个所述目标检测模型。Optionally, after selecting a plurality of the above-mentioned target detection models to be trained, in order to improve the generalization ability of the model and the diversity of samples, image enhancement processing can be performed on historical original images to obtain a training set, and the above-mentioned training set can be used to filter The above-mentioned multiple target detection models to be trained are trained to obtain multiple trained target detection models.
本实施例中,所述获取对待检测原始图像进行图像增强处理后得到的待检测图像集,具体可以包括:根据每个所述目标检测模型的模型类别,确定每个所述目标检测模型对应的图像增强算法;利用每个所述目标检测模型对应的图像增强算法,对所述待检测原始图像进行相应的图像增强处理,以得到与每个所述目标检测模型对应的待检测图像集。上述图像增强算法包括但不限于基于几何变换和颜色变换对应的图像增强算法;其中,基于几何变换的图像增强算法包括但不限于随机剪切(即Random Cropping)、随机扩展(即Random Expansion)、随机水平翻转(即Random Horizontal Flip)、随机缩放(即Random Resize)等;基于颜色变换的图像增强算法包括但不限于颜色抖动、Fancy PCA(PCA,即Principal Component Analysis,主成分分析)等。在对上述多个待训练目标检测模型进行训练之前,还需要从上述图像增强算法中筛选出目标图像增强算法,并将筛选出的上述目标图像增强算法添加至上述多个待训练目标检测模型中,作为数据的预处理阶段。In this embodiment, the acquiring the image set to be detected obtained after performing image enhancement processing on the original image to be detected may specifically include: according to the model category of each target detection model, determining the corresponding target detection model Image enhancement algorithm: using the image enhancement algorithm corresponding to each of the target detection models to perform corresponding image enhancement processing on the original image to be detected, so as to obtain a set of images to be detected corresponding to each of the target detection models. The above-mentioned image enhancement algorithms include but not limited to image enhancement algorithms based on geometric transformation and color transformation; wherein, image enhancement algorithms based on geometric transformation include but not limited to random cropping (ie Random Cropping), random expansion (ie Random Expansion), Random horizontal flip (that is, Random Horizontal Flip), random zoom (that is, Random Resize), etc.; image enhancement algorithms based on color transformation include but are not limited to color dithering, Fancy PCA (PCA, Principal Component Analysis, principal component analysis), etc. Before training the above-mentioned multiple target detection models to be trained, it is also necessary to screen out the target image enhancement algorithm from the above-mentioned image enhancement algorithms, and add the screened above-mentioned target image enhancement algorithm to the above-mentioned multiple target detection models to be trained , as a preprocessing stage of the data.
需要指出的是,在将所述目标图像增强算法添加至上述多个待训练目标检测模型的过程中,应根据所述待训练目标检测模型的功能进行选择性添加,将具有重复功能的图像增强算法进行剔除。例如在Two-stage类模型中剔除筛选出的图像增强算法Random Cropping,因为Two-stage中的RPN(即Region Proposal Network,区域生成网络)网络与Random  Cropping具有相似的功能。可以理解的是,由于目标检测模型在进行训练之前,已将筛选出的目标图像增强算法添加到了对应的目标检测模型中,因此,根据每个目标检测模型的模型类别,便可以确定出与每个所述目标检测模型对应的图像增强算法,并利用每个目标检测模型对应的图像增强算法,对待检测原始图像进行相应的图像增强处理,得到与每个所述目标检测模型对应的待检测图像集。It should be pointed out that in the process of adding the target image enhancement algorithm to the above-mentioned multiple target detection models to be trained, it should be selectively added according to the functions of the target detection models to be trained, and the image with repeated functions will be enhanced Algorithm to remove. For example, the selected image enhancement algorithm Random Cropping is eliminated in the Two-stage class model, because the RPN (Region Proposal Network, Region Generation Network) network in the Two-stage has similar functions to Random Cropping. It can be understood that, before the target detection model is trained, the selected target image enhancement algorithm has been added to the corresponding target detection model, therefore, according to the model category of each target detection model, it can be determined that each An image enhancement algorithm corresponding to each of the target detection models, and using the image enhancement algorithm corresponding to each target detection model to perform corresponding image enhancement processing on the original image to be detected, to obtain an image to be detected corresponding to each of the target detection models set.
步骤S12:分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果。Step S12: Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the plurality of target detection models.
本实施例中,在获取到已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集之后,分别利用每个所述目标检测模型对上述待检测图像集中的所有待检测图像进行检测,得到每个所述目标检测模型输出的与多个所述目标检测模型对应的多组初始目标检测结果。其中,每组初始目标检测结果的数目与经过图像增强处理后的所述待检测图像集中包含的图像数目相同。In this embodiment, after acquiring a plurality of target detection models that have been trained, and obtaining the image set to be detected after performing image enhancement processing on the original image to be detected, each of the target detection models is used to detect the above-mentioned target detection models. All the images to be detected in the image set are detected, and multiple sets of initial target detection results corresponding to multiple target detection models output by each target detection model are obtained. Wherein, the number of each group of initial target detection results is the same as the number of images included in the image set to be detected after image enhancement processing.
步骤S13:分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果。Step S13: weighting all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models.
本实施例中,在分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,得到与所述多个目标检测模型对应的多组初始目标检测结果之后,需要将每个所述目标检测模型对应的所有初始目标检测结果进行加权处理,即进行单模型内部检测结果的加权,得到每个所述目标检测模型各自对应的初次加权目标检测结果。In this embodiment, after using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the multiple target detection models, it is necessary to All the initial target detection results corresponding to each of the target detection models are weighted, that is, the internal detection results of a single model are weighted to obtain the initial weighted target detection results corresponding to each of the target detection models.
步骤S14:基于预先利用验证集确定的每个目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。Step S14: Based on the weight of each target detection model determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the corresponding to the original image to be detected The final target detection result.
本实施例中,在分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,得到每个所述目标检测模型各自对应的初次加权目标检测结果之后,获取预先利用验证集对每个所述目标检测模型进行训练之后确定的与每个所述目标检测模型对应的权重,并利用上述权重对多个上述目标检测模型对应的多个所述初次加权目标检测结果进行加权处理,即为多个目标检测模型对应的检测结果赋予不同的权重值,进而得到与所述待检测原始图像对应的最终目标检测结果。In this embodiment, after weighting all the initial target detection results in each group of the initial target detection results to obtain the initial weighted target detection results corresponding to each of the target detection models, the pre-used verification set pairs are obtained. After each target detection model is trained, determine the weight corresponding to each target detection model, and use the above weight to perform weighting processing on the multiple initial weighted target detection results corresponding to multiple target detection models, That is, different weight values are assigned to the detection results corresponding to the multiple target detection models, and then the final target detection result corresponding to the original image to be detected is obtained.
可见,本申请实施例先从获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集,然后分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果,再分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果,最后基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。可见,本申请实施例通过对每组初始目标检测结果中的所有初始目标检测结果进行加权处理,降低了由于对原始图像进行图像增强处理而导致的单模型检测结果差异,提升了模型的鲁棒性,通过基于训练集将不同模型赋予不同的权重,能够充分发挥模型的多样性,提高目标检测的精确度。It can be seen that, in the embodiment of the present application, firstly obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected, and then use each of the target detection models to separately Detecting the images to be detected in the image set to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then weighting all the initial target detection results in each set of initial target detection results , to obtain the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multiple target detection models corresponding to multiple target detection models The initial weighted target detection result is weighted to obtain the final target detection result corresponding to the original image to be detected. It can be seen that in the embodiment of the present application, by weighting all the initial target detection results in each group of initial target detection results, the difference in single-model detection results caused by image enhancement processing on the original image is reduced, and the robustness of the model is improved. By assigning different weights to different models based on the training set, the diversity of the models can be fully utilized and the accuracy of target detection can be improved.
本申请实施例公开了一种具体的基于多模型融合的目标检测方法,参见图2所示,该方法包括:The embodiment of the present application discloses a specific target detection method based on multi-model fusion, as shown in Figure 2, the method includes:
步骤S21:获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集。Step S21: Obtain a plurality of trained target detection models, and obtain an image set to be detected obtained after performing image enhancement processing on the original image to be detected.
步骤S22:分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果。Step S22: Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to the multiple target detection models.
步骤S23:对任一组所述初始目标检测结果中的所有初始目标检测结果进行聚类,以得到与该组所述初始目标检测结果对应的第一聚类结果。Step S23: Clustering all the initial object detection results in any group of the initial object detection results to obtain a first clustering result corresponding to the group of the initial object detection results.
本实施例中,在分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,得到与多个所述目标检测模型对应的多组初始目标检测结果之后,进一步的基于预设聚类算法对多个上述目标检测模型对应的多组初始目标检测结果中的任意一组所述初始目标检测结果中的所有初始目标检测结果进行聚类处理,得到与该组所述初始目标检测结果对应的第一聚类结果。其中,上述预设聚类算法包括但不限于K-means算法(即K-means Clustering Algorithm,k均值聚类算法)等。In this embodiment, after using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, further Based on the preset clustering algorithm, clustering is performed on all the initial target detection results in any one of the multiple groups of initial target detection results corresponding to the above-mentioned target detection models, and the results are obtained as described in the group. The first clustering result corresponding to the initial target detection result. Wherein, the aforementioned preset clustering algorithms include but are not limited to K-means algorithm (ie K-means Clustering Algorithm, k-means clustering algorithm) and the like.
步骤S24:根据所述第一聚类结果,并基于轴对齐包围盒模糊积分算法和非极大值抑制算法,确定各个所述初始目标检测结果的权重。Step S24: According to the first clustering result, and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, determine the weight of each of the initial target detection results.
本实施例中,在对任一组所述初始目标检测结果中的所有初始目标检测结果进行聚类,得到与该组所述初始目标检测结果对应的第一聚类结果之后,基于非极大值抑制(即NMS,Non Maximum Suppression)算法对上述第一聚类结果中的非极大值进行抑制,得到第一聚类抑制结果,并基于轴对齐包围盒模糊积分算法对上述第一聚类抑制结果进行运算,得到各个所述初始目标检测结果对应的权重。其中,上述基于非极大值抑制算法对上述第一聚类结果中的非极大值进行抑制的具体处理过程如下:获取所述第一聚类结果中最大值对应的索引,并将与上述最大值对应的索引的聚类结果从所述第一聚类结果中剔除,得到目标聚类结果,并分别计算上述目标聚类结果中所有检测框与上述最大值对应的索引的聚类结果中检测框的交并比,并根据所述交并比确定各个初始目标检测结果对应的权重。In this embodiment, after clustering all the initial target detection results in any group of the initial target detection results to obtain the first clustering result corresponding to the initial target detection results in the group, based on non-maximum The value suppression (NMS, Non Maximum Suppression) algorithm suppresses the non-maximum values in the above first clustering results, and obtains the first clustering suppression results, and based on the axis-aligned bounding box fuzzy integration algorithm, the above first clustering The suppression results are calculated to obtain the weights corresponding to each of the initial target detection results. Wherein, the specific process of suppressing the non-maximum value in the above-mentioned first clustering result based on the non-maximum value suppression algorithm is as follows: obtain the index corresponding to the maximum value in the first clustering result, and compare it with the above-mentioned The clustering result of the index corresponding to the maximum value is removed from the first clustering result to obtain the target clustering result, and the clustering results of all the detection frames in the above target clustering result and the index corresponding to the above maximum value are calculated respectively. The intersection and union ratio of the detection frame, and determine the weight corresponding to each initial target detection result according to the intersection and union ratio.
步骤S25:基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结果中的所有所述初始目标检测结果进行加权,以得到相应所述目标检测模型对应的初次加权目标检测结果。Step S25: Based on the weight of each of the initial target detection results, weight all the initial target detection results in the group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to the target detection model .
本实施例中,根据所述第一聚类结果,并基于轴对齐包围盒模糊积分算法和非极大值抑制算法,确定出各个所述初始目标检测结果的权重之后,可以利用各个上述初始目标检测结果的权重值,对该组所述初始目标检测结果中的所有上述初始目标检测结果进行相应的加权处理,得到相应的与所述目标检测模型对应的初次加权目标检测结果。具体的,上述获取与所述目标检测模型对应的初次加权目标检测结果的过程主要基于AABBFI算法实现的,其中AABBFI算法的具体实现过程参见图3所示,首先将输入的单张待检测原始图像进行数据增强处理,得到待检测图像集,然后利用已训练好的单个目标检测模型并基于AABB(Axis-Aligned Bounding Box,轴对齐包围盒)算法对上述待检测图像集中的图像进行推理,得到多个推理结果,并基于FI(即Fuzzy Integral,模糊积分)算法对上述推理结果进行模糊运算,得到目标检测结果。In this embodiment, after determining the weights of each of the initial target detection results based on the first clustering result and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, each of the above-mentioned initial targets can be used The weight value of the detection result performs corresponding weighting processing on all the above-mentioned initial target detection results in the group of the initial target detection results, and obtains corresponding primary weighted target detection results corresponding to the target detection model. Specifically, the above-mentioned process of obtaining the initial weighted target detection result corresponding to the target detection model is mainly based on the AABBFI algorithm. The specific implementation process of the AABBFI algorithm is shown in Figure 3. First, the input single original image to be detected is Perform data enhancement processing to obtain the image set to be detected, and then use the trained single target detection model to infer the images in the above image set to be detected based on the AABB (Axis-Aligned Bounding Box) algorithm, and obtain multiple Inference results, and based on the FI (Fuzzy Integral, fuzzy integral) algorithm, perform fuzzy operations on the above inference results to obtain target detection results.
步骤S26:基于预先利用验证集确定的每个目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。Step S26: Based on the weight of each target detection model determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the target detection results corresponding to the original image to be detected The final target detection result.
本实施例中,在基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结 果中的所有所述初始目标检测结果进行加权,得到相应所述目标检测模型对应的初次加权目标检测结果之后,获取基于预先利用验证集确定出的每个所述目标检测模型对应的权重值,并利用上述目标检测模型对应的权重值对多个上述目标检测模型对应的多个所述初次加权目标检测结果进行加权处理,得到与上述待检测原始图像对应的最终目标检测结果。In this embodiment, based on the weight of each initial target detection result, weighting is performed on all the initial target detection results in the group of initial target detection results to obtain the initial weighted target corresponding to the target detection model After the detection result, obtain the weight value corresponding to each of the target detection models determined based on the verification set in advance, and use the weight value corresponding to the above target detection model to weight the multiple initial weights corresponding to the multiple target detection models The target detection result is weighted to obtain the final target detection result corresponding to the original image to be detected.
本实施例中,参见图4所示,所述基于多模型融合的目标检测方法,具体还可以包括:In this embodiment, as shown in FIG. 4, the target detection method based on multi-model fusion may specifically include:
步骤S31:基于验证集确定出已训练好的每个所述目标检测模型对应的均值平均精度评价指标;Step S31: Determine the mean value and average precision evaluation index corresponding to each of the trained target detection models based on the verification set;
步骤S32:对所有所述目标检测模型对应的均值平均精度评价指标进行求和,以得到相应的评价指标总和;Step S32: Summing the mean and average precision evaluation indicators corresponding to all the target detection models to obtain the sum of corresponding evaluation indicators;
步骤S33:基于每个所述目标检测模型对应的均值平均精度评价指标以及所述评价指标总和,确定出每个所述目标检测模型的权重。Step S33: Determine the weight of each target detection model based on the mean value average precision evaluation index corresponding to each target detection model and the sum of the evaluation indexes.
本实施例中,在对获取已训练好的多个目标检测模型之前,可以利用验证集确定出每个上述目标检测模型对应的用于衡量目标检测识别精度的指标,即均值平均精度评价指标(mAP,mean Average Precision)。进一步的,计算所有上述目标检测模型对应的均值平均精度评价指标的总和,得到相应的评价指标总和。然后基于每个所述目标检测模型对应的均值平均精度评价指标与上述评价指标总和的比值,便可以确定出每个所述目标检测模型对应的权重。In this embodiment, before obtaining a plurality of trained target detection models, the verification set can be used to determine the index for measuring the accuracy of target detection and recognition corresponding to each of the above target detection models, that is, the mean average precision evaluation index ( mAP, mean Average Precision). Further, the sum of the mean average precision evaluation indicators corresponding to all the above target detection models is calculated to obtain the corresponding sum of evaluation indicators. Then, the weight corresponding to each target detection model can be determined based on the ratio of the mean average precision evaluation index corresponding to each target detection model to the sum of the above evaluation indexes.
本实施例中,基于所述验证集确定出已训练好的任一所述目标检测模型对应的均值平均精度评价指标,具体可以包括:利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,以得到所述目标检测模型输出的与所述验证集中每张图像对应的预测结果;基于所述预测结果与所述验证集中对应的真实标注结果之间的差异,确定出所述目标检测模型的均值平均精度评价指标。本实施例中,利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,可以得到上述目标检测模型输出的与上述验证集中每张图像对应的预测结果,进一步的,基于上述预测结果与所述验证集中对应的真实标注结果(即Ground Truth)之间的差异程度,可以确定出与上述目标检测模型对应的均值平均精度评价指标。In this embodiment, determining the mean average precision evaluation index corresponding to any of the trained target detection models based on the verification set may specifically include: using the trained target detection model to evaluate each target detection model in the verification set Prediction of each image in order to obtain the prediction result corresponding to each image in the verification set output by the target detection model; based on the difference between the prediction result and the corresponding real labeling result in the verification set, determine the The mean average precision evaluation index of the target detection model. In this embodiment, the trained target detection model is used to predict each image in the verification set, and the prediction result output by the target detection model corresponding to each image in the verification set can be obtained. Further, based on the above The degree of difference between the prediction result and the corresponding real labeling result (ie, Ground Truth) in the verification set can determine the mean average precision evaluation index corresponding to the above-mentioned target detection model.
具体的,所述利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,可以包括:分别对验证集中的每张图像进行图像增强,得到所述验证集中每张图像对应的多个增强后图像;利用已训练好的所述目标检测模型对所述验证集中每张图像对应的多个增强后图像进行预测,以得到与所述验证集中每张图像对应的多个初始预测结果;分别对所述验证集中每张图像对应的多个初始预测结果进行加权处理,以得到与所述验证集中每张图像对应的预测结果。本实施例中,在利用已训练好的所述目标检测模型对验证集中的每张图像进行预测的过程中,可以先分别对验证集中的每张图像进行图像增强处理,得到与上述验证集中每张图像对应的多个增强后图像,然后利用已训练好的上述目标检测模型对上述验证集中每张图像对应的多个增强后图像进行预测,进而得到与上述验证集中每张图像对应的多个初始预测结果,再基于轴对齐包围盒模糊积分算法和非极大值抑制算法分别对上述验证集中的每张图像对应的多个初始预测结果进行加权处理,得到与上述验证集中每张图像对应的预测结果。Specifically, using the trained target detection model to predict each image in the verification set may include: respectively performing image enhancement on each image in the verification set to obtain the corresponding A plurality of enhanced images; using the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial predictions corresponding to each image in the verification set Result: performing weighting processing on multiple initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set. In this embodiment, in the process of using the trained target detection model to predict each image in the verification set, image enhancement processing can be performed on each image in the verification set respectively to obtain Multiple enhanced images corresponding to each image, and then use the trained target detection model to predict the multiple enhanced images corresponding to each image in the verification set, and then obtain multiple enhanced images corresponding to each image in the verification set Based on the initial prediction results, based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, the multiple initial prediction results corresponding to each image in the above verification set are weighted, and the corresponding to each image in the above verification set is obtained. forecast result.
其中,关于上述步骤S21、S22更加具体的处理过程可以参考前述实施例中公开的相应内容,在此不再进行赘述。Wherein, for more specific processing procedures of the above-mentioned steps S21 and S22, reference may be made to the corresponding contents disclosed in the foregoing embodiments, and details are not repeated here.
可见,本申请实施例通过对任一组初始目标检测结果中的所有初始目标检测结果进行 聚类,得到与该组所述初始目标检测结果对应的第一聚类结果,然后根据所述第一聚类结果中不同聚类中心对应的检测框重叠程度,确定每个聚类中心对应的各个所述初始目标检测结果的权重,并基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结果中的所有所述初始目标检测结果进行加权,得到相应所述目标检测模型对应的初次加权目标检测结果。通过对每一组初始目标检测结果中的所有初始目标检测结果进行聚类及加权处理,能够降低由于对原始图像进行图像增强而导致的检测结果的差异,提升单个目标检测模型的鲁棒性。It can be seen that, in the embodiment of the present application, by clustering all the initial target detection results in any group of initial target detection results, the first clustering result corresponding to the initial target detection results of the group is obtained, and then according to the first The overlapping degree of detection frames corresponding to different cluster centers in the clustering results determines the weight of each of the initial target detection results corresponding to each cluster center, and based on the weight of each of the initial target detection results, the group of All the initial target detection results in the initial target detection results are weighted to obtain an initial weighted target detection result corresponding to the target detection model. By clustering and weighting all the initial target detection results in each group of initial target detection results, the difference in detection results caused by image enhancement on the original image can be reduced, and the robustness of a single target detection model can be improved.
相应的,本申请实施例还公开了一种基于多模型融合的目标检测装置,参见图5所示,该装置包括:Correspondingly, the embodiment of the present application also discloses a target detection device based on multi-model fusion, as shown in Fig. 5, the device includes:
获取模块11,用于获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集;The acquiring module 11 is used to acquire a plurality of target detection models that have been trained, and acquire an image set to be detected obtained after performing image enhancement processing on the original image to be detected;
图像检测模块12,用于分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果;The image detection module 12 is configured to use each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
单模型加权模块13,用于分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果;A single-model weighting module 13, configured to weight all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
多模型加权模块14,用于基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。The multi-model weighting module 14 is used to weight the multiple initial weighted target detection results corresponding to multiple target detection models based on the weight of each target detection model determined in advance using the verification set, to obtain the weighted target detection results corresponding to the target detection models. Describe the final target detection result corresponding to the original image to be detected.
其中,关于上述各个模块的具体工作流程可以参考前述实施例中公开的相应内容,在此不再进行赘述。For the specific work flow of each of the above modules, reference may be made to the corresponding content disclosed in the foregoing embodiments, which will not be repeated here.
可见,本申请实施例中,先从获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集,然后分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果,再分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果,最后基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。可见,本申请实施例通过对每组初始目标检测结果中的所有初始目标检测结果进行加权处理,降低了由于对原始图像进行图像增强处理而导致的单模型检测结果差异,提升了模型的鲁棒性,通过基于训练集将不同模型赋予不同的权重,能够充分发挥模型的多样性,提高目标检测的精确度。It can be seen that in the embodiment of the present application, first obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected, and then use each of the target detection models to The images to be detected in the image set to be detected are detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models, and then all initial target detection results in each set of initial target detection results are respectively weighting to obtain the initial weighted target detection results corresponding to each of the target detection models, and finally based on the weight of each of the target detection models determined in advance using the verification set, the multi The initial weighted target detection results are weighted to obtain the final target detection result corresponding to the original image to be detected. It can be seen that in the embodiment of the present application, by weighting all the initial target detection results in each group of initial target detection results, the difference in single-model detection results caused by image enhancement processing on the original image is reduced, and the robustness of the model is improved. By assigning different weights to different models based on the training set, the diversity of the models can be fully utilized and the accuracy of target detection can be improved.
在一些具体实施例中,所述获取模块11之前,还可以包括:In some specific embodiments, before the acquisition module 11, it may also include:
模型筛选单元,用于利用基于模型结构差异构建的筛选条件,筛选出多个待训练目标检测模型;其中,不同所述待训练目标检测模型之间的模型结构差异均满足预设差异条件;The model screening unit is used to screen out a plurality of target detection models to be trained by using screening conditions constructed based on model structure differences; wherein, the model structure differences between different target detection models to be trained all meet the preset difference conditions;
第一训练单元,用于利用对历史原始图像集进行图像增强后得到的训练集,对多个所述待训练目标检测模型进行训练,以得到已训练好的多个所述目标检测模型。The first training unit is configured to use a training set obtained after performing image enhancement on a historical original image set to train a plurality of target detection models to be trained, so as to obtain a plurality of trained target detection models.
在一些具体实施例中,所述获取模块11,具体可以包括:In some specific embodiments, the acquisition module 11 may specifically include:
算法确定单元,用于根据每个所述目标检测模型的模型类别,确定每个所述目标检测模型对应的图像增强算法;An algorithm determining unit, configured to determine an image enhancement algorithm corresponding to each of the target detection models according to the model category of each of the target detection models;
第一图像增强单元,用于利用每个所述目标检测模型对应的图像增强算法,对所述待 检测原始图像进行相应的图像增强处理,以得到与每个所述目标检测模型对应的待检测图像集。The first image enhancement unit is configured to use the image enhancement algorithm corresponding to each of the target detection models to perform corresponding image enhancement processing on the original image to be detected, so as to obtain the target to be detected corresponding to each of the target detection models image set.
在一些具体实施例中,所述对任一组所述初始目标检测结果中的所有初始目标检测结果进行加权,具体可以包括:In some specific embodiments, the weighting of all the initial target detection results in any group of the initial target detection results may specifically include:
第一聚类单元,用于对任一组所述初始目标检测结果中的所有初始目标检测结果进行聚类,以得到与该组所述初始目标检测结果对应的第一聚类结果;The first clustering unit is configured to cluster all the initial target detection results in any group of the initial target detection results to obtain a first clustering result corresponding to the group of the initial target detection results;
第一权重确定单元,用于根据所述第一聚类结果,并基于轴对齐包围盒模糊积分算法和非极大值抑制算法,确定各个所述初始目标检测结果的权重;The first weight determination unit is configured to determine the weight of each of the initial target detection results based on the first clustering result and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm;
第一加权单元,用于基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结果中的所有所述初始目标检测结果进行加权,以得到相应所述目标检测模型对应的初次加权目标检测结果。The first weighting unit is configured to weight all the initial target detection results in the group of initial target detection results based on the weight of each of the initial target detection results, so as to obtain the initial target detection model corresponding to the corresponding target. Weighted object detection results.
在一些具体实施例中,所述基于多模型融合的目标检测装置,还可以包括:In some specific embodiments, the target detection device based on multi-model fusion may also include:
第一评价指标确定单元,用于基于验证集确定出已训练好的每个所述目标检测模型对应的均值平均精度评价指标;The first evaluation index determination unit is configured to determine the mean average precision evaluation index corresponding to each of the trained target detection models based on the verification set;
求和单元,用于对所有所述目标检测模型对应的均值平均精度评价指标进行求和,以得到相应的评价指标总和;A summation unit, configured to sum the mean average precision evaluation indicators corresponding to all the target detection models to obtain the corresponding evaluation index sum;
第二权重确定单元,用于基于每个所述目标检测模型对应的均值平均精度评价指标以及所述评价指标总和,确定出每个所述目标检测模型的权重。The second weight determination unit is configured to determine the weight of each of the target detection models based on the mean average precision evaluation index corresponding to each of the target detection models and the sum of the evaluation indexes.
在一些具体实施例中,所述第一评价指标确定单元,具体可以包括:In some specific embodiments, the first evaluation indicator determining unit may specifically include:
第一预测单元,用于利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,以得到所述目标检测模型输出的与所述验证集中每张图像对应的预测结果;A first prediction unit, configured to use the trained target detection model to predict each image in the verification set, so as to obtain a prediction result output by the target detection model corresponding to each image in the verification set;
第二评价指标确定单元,用于基于所述预测结果与所述验证集中对应的真实标注结果之间的差异,确定出所述目标检测模型的均值平均精度评价指标。The second evaluation index determination unit is configured to determine the mean average precision evaluation index of the target detection model based on the difference between the prediction result and the corresponding real labeling result in the verification set.
在一些具体实施例中,所述第一预测单元,具体可以包括:In some specific embodiments, the first prediction unit may specifically include:
第二图像增强单元,用于分别对验证集中的每张图像进行图像增强,得到所述验证集中每张图像对应的多个增强后图像;The second image enhancement unit is used to respectively perform image enhancement on each image in the verification set to obtain a plurality of enhanced images corresponding to each image in the verification set;
第二预测单元,用于利用已训练好的所述目标检测模型对所述验证集中每张图像对应的多个增强后图像进行预测,以得到与所述验证集中每张图像对应的多个初始预测结果;The second prediction unit is configured to use the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial images corresponding to each image in the verification set forecast result;
第二加权单元,用于分别对所述验证集中每张图像对应的多个初始预测结果进行加权处理,以得到与所述验证集中每张图像对应的预测结果。The second weighting unit is configured to respectively perform weighting processing on a plurality of initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
可选地,本申请实施例还公开了一种电子设备,图6是根据一示例性实施例示出的电子设备20结构图,图中的内容不能认为是对本申请的使用范围的任何限制。Optionally, the embodiment of the present application also discloses an electronic device. FIG. 6 is a structural diagram of an electronic device 20 according to an exemplary embodiment. The content in the figure should not be regarded as any limitation on the application scope of the present application.
图6为本申请实施例提供的一种电子设备20的结构示意图。该电子设备20,具体可以包括:至少一个处理器21、至少一个存储器22、电源23、通信接口24、输入输出接口25和通信总线26。其中,所述存储器22用于存储计算机程序,所述计算机程序由所述处理器21加载并执行,以实现前述任一实施例公开的基于多模型融合的目标检测方法中的相关步骤。另外,本实施例中的电子设备20具体可以为电子计算机。FIG. 6 is a schematic structural diagram of an electronic device 20 provided by an embodiment of the present application. The electronic device 20 may specifically include: at least one processor 21 , at least one memory 22 , a power supply 23 , a communication interface 24 , an input/output interface 25 and a communication bus 26 . Wherein, the memory 22 is used to store a computer program, and the computer program is loaded and executed by the processor 21 to implement relevant steps in the multi-model fusion-based target detection method disclosed in any of the foregoing embodiments. In addition, the electronic device 20 in this embodiment may specifically be an electronic computer.
本实施例中,电源23用于为电子设备20上的各硬件设备提供工作电压;通信接口24能够为电子设备20创建与外界设备之间的数据传输通道,其所遵循的通信协议是能够适用于本申请技术方案的任意通信协议,在此不对其进行具体限定;输入输出接口25,用于获 取外界输入数据或向外界输出数据,其具体的接口类型可以根据具体应用需要进行选取,在此不进行具体限定。In this embodiment, the power supply 23 is used to provide working voltage for each hardware device on the electronic device 20; the communication interface 24 can create a data transmission channel between the electronic device 20 and external devices, and the communication protocol it follows is applicable Any communication protocol in the technical solution of the present application is not specifically limited here; the input and output interface 25 is used to obtain external input data or output data to the external, and its specific interface type can be selected according to specific application needs, here Not specifically limited.
另外,存储器22作为资源存储的载体,可以是只读存储器、随机存储器、磁盘或者光盘等,其上所存储的资源可以包括操作系统221、计算机程序222等,存储方式可以是短暂存储或者永久存储。In addition, the memory 22, as a resource storage carrier, can be a read-only memory, random access memory, magnetic disk or optical disk, etc., and the resources stored thereon can include operating system 221, computer program 222, etc., and the storage method can be temporary storage or permanent storage. .
其中,操作系统221用于管理与控制电子设备20上的各硬件设备以及计算机程序222,其可以是Windows Server、Netware、Unix、Linux等。计算机程序222除了包括能够用于完成前述任一实施例公开的由电子设备20执行的基于多模型融合的目标检测方法的计算机程序之外,还可以可选地包括能够用于完成其他特定工作的计算机程序。Wherein, the operating system 221 is used to manage and control each hardware device on the electronic device 20 and the computer program 222, which may be Windows Server, Netware, Unix, Linux, etc. In addition to the computer program 222 that can be used to complete the multi-model fusion-based target detection method performed by the electronic device 20 disclosed in any of the foregoing embodiments, it can also optionally include a computer program that can be used to complete other specific tasks. Computer program.
可选地,本申请还公开了一种计算机可读存储介质,用于存储计算机程序;其中,所述计算机程序被处理器执行时实现前述公开的基于多模型融合的目标检测方法。关于该方法的具体步骤可以参考前述实施例中公开的相应内容,在此不再进行赘述。Optionally, the present application also discloses a computer-readable storage medium for storing a computer program; wherein, when the computer program is executed by a processor, the aforementioned multi-model fusion-based target detection method is implemented. Regarding the specific steps of the method, reference may be made to the corresponding content disclosed in the foregoing embodiments, and details are not repeated here.
本说明书中各个实施例采用递进的方式描述,每个实施例重点说明的都是与其它实施例的不同之处,各个实施例之间相同或相似部分互相参见即可。对于实施例公开的装置而言,由于其与实施例公开的方法相对应,所以描述的比较简单,相关之处参见方法部分说明即可。Each embodiment in this specification is described in a progressive manner, each embodiment focuses on the difference from other embodiments, and the same or similar parts of each embodiment can be referred to each other. As for the device disclosed in the embodiment, since it corresponds to the method disclosed in the embodiment, the description is relatively simple, and for the related information, please refer to the description of the method part.
专业人员还可以进一步意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Professionals can further realize that the units and algorithm steps of the examples described in conjunction with the embodiments disclosed herein can be implemented by electronic hardware, computer software or a combination of the two. In order to clearly illustrate the possible For interchangeability, in the above description, the composition and steps of each example have been generally described according to their functions. Whether these functions are executed by hardware or software depends on the specific application and design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementation should not be regarded as exceeding the scope of the present application.
结合本文中所公开的实施例描述的方法或算法的步骤可以直接用硬件、处理器执行的软件模块,或者二者的结合来实施。软件模块可以置于随机存储器(RAM)、内存、只读存储器(ROM)、电可编程ROM、电可擦除可编程ROM、寄存器、硬盘、可移动磁盘、CD-ROM、或技术领域内所公知的任意其它形式的存储介质中。The steps of the methods or algorithms described in connection with the embodiments disclosed herein may be directly implemented by hardware, software modules executed by a processor, or a combination of both. Software modules can be placed in random access memory (RAM), internal memory, read-only memory (ROM), electrically programmable ROM, electrically erasable programmable ROM, registers, hard disk, removable disk, CD-ROM, or any other Any other known storage medium.
最后,还需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。Finally, it should also be noted that in this text, relational terms such as first and second etc. are only used to distinguish one entity or operation from another, and do not necessarily require or imply that these entities or operations, any such actual relationship or order exists. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.
以上对本申请所提供的一种基于多模型融合的目标检测方法、装置、设备及介质进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。A target detection method, device, equipment and medium based on multi-model fusion provided by this application has been introduced in detail above. In this paper, specific examples are used to illustrate the principle and implementation of this application. The description of the above embodiments It is only used to help understand the method of the present application and its core idea; at the same time, for those of ordinary skill in the art, according to the idea of the present application, there will be changes in the specific implementation and application scope. In summary, The contents of this specification should not be understood as limiting the application.

Claims (10)

  1. 一种基于多模型融合的目标检测方法,其特征在于,包括:A target detection method based on multi-model fusion, characterized in that, comprising:
    获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集;Obtain multiple target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected;
    分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果;Using each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
    分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果;Weighting all the initial target detection results in each group of the initial target detection results respectively, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
    基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。Based on the weight of each of the target detection models determined in advance using the verification set, weight the multiple initial weighted target detection results corresponding to the multiple target detection models, and obtain the final result corresponding to the original image to be detected. Target detection results.
  2. 根据权利要求1所述的基于多模型融合的目标检测方法,其特征在于,所述获取已训练好的多个目标检测模型之前,还包括:The target detection method based on multi-model fusion according to claim 1, wherein, before obtaining a plurality of trained target detection models, further comprising:
    利用基于模型结构差异构建的筛选条件,筛选出多个待训练目标检测模型;其中,不同所述待训练目标检测模型之间的模型结构差异均满足预设差异条件;A plurality of target detection models to be trained are screened out by using screening conditions constructed based on model structure differences; wherein, the model structure differences between different target detection models to be trained all meet preset difference conditions;
    利用对历史原始图像集进行图像增强后得到的训练集,对多个所述待训练目标检测模型进行训练,以得到已训练好的多个所述目标检测模型。Using the training set obtained after performing image enhancement on the historical original image set, the multiple target detection models to be trained are trained to obtain multiple trained target detection models.
  3. 根据权利要求1所述的基于多模型融合的目标检测方法,其特征在于,所述获取对待检测原始图像进行图像增强处理后得到的待检测图像集,包括:The target detection method based on multi-model fusion according to claim 1, wherein said acquisition of the image set to be detected after performing image enhancement processing on the original image to be detected comprises:
    根据每个所述目标检测模型的模型类别,确定每个所述目标检测模型对应的图像增强算法;Determine an image enhancement algorithm corresponding to each of the target detection models according to the model category of each of the target detection models;
    利用每个所述目标检测模型对应的图像增强算法,对所述待检测原始图像进行相应的图像增强处理,以得到与每个所述目标检测模型对应的待检测图像集。Using the image enhancement algorithm corresponding to each of the target detection models, corresponding image enhancement processing is performed on the original image to be detected, so as to obtain a set of images to be detected corresponding to each of the target detection models.
  4. 根据权利要求1所述的基于多模型融合的目标检测方法,其特征在于,对任一组所述初始目标检测结果中的所有初始目标检测结果进行加权,包括:The target detection method based on multi-model fusion according to claim 1, wherein weighting all initial target detection results in any set of initial target detection results includes:
    对任一组所述初始目标检测结果中的所有初始目标检测结果进行聚类,以得到与该组所述初始目标检测结果对应的第一聚类结果;clustering all the initial target detection results in any group of the initial target detection results to obtain a first clustering result corresponding to the group of the initial target detection results;
    根据所述第一聚类结果,并基于轴对齐包围盒模糊积分算法和非极大值抑制算法,确定各个所述初始目标检测结果的权重;According to the first clustering result, and based on the axis-aligned bounding box fuzzy integration algorithm and the non-maximum value suppression algorithm, determine the weight of each of the initial target detection results;
    基于各个所述初始目标检测结果的权重,对该组所述初始目标检测结果中的所有所述初始目标检测结果进行加权,以得到相应所述目标检测模型对应的初次加权目标检测结果。Based on the weight of each of the initial target detection results, weighting is performed on all the initial target detection results in the group of the initial target detection results, so as to obtain an initial weighted target detection result corresponding to the target detection model.
  5. 根据权利要求1至4任一项所述的基于多模型融合的目标检测方法,其特征在于,还包括:The target detection method based on multi-model fusion according to any one of claims 1 to 4, further comprising:
    基于验证集确定出已训练好的每个所述目标检测模型对应的均值平均精度评价指标;Based on the verification set, determine the mean value and average precision evaluation index corresponding to each of the target detection models that have been trained;
    对所有所述目标检测模型对应的均值平均精度评价指标进行求和,以得到相应的评价指标总和;Summing the mean and average precision evaluation indicators corresponding to all the target detection models to obtain the corresponding evaluation index sum;
    基于每个所述目标检测模型对应的均值平均精度评价指标以及所述评价指标总和,确定出每个所述目标检测模型的权重。The weight of each target detection model is determined based on the mean average precision evaluation index corresponding to each target detection model and the sum of the evaluation indexes.
  6. 根据权利要求5所述的基于多模型融合的目标检测方法,其特征在于,基于所述验证 集确定出已训练好的任一所述目标检测模型对应的均值平均精度评价指标,包括:The target detection method based on multi-model fusion according to claim 5, wherein, based on the verification set, determine the mean value average precision evaluation index corresponding to any of the trained target detection models, including:
    利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,以得到所述目标检测模型输出的与所述验证集中每张图像对应的预测结果;Using the trained target detection model to predict each image in the verification set, so as to obtain a prediction result output by the target detection model corresponding to each image in the verification set;
    基于所述预测结果与所述验证集中对应的真实标注结果之间的差异,确定出所述目标检测模型的均值平均精度评价指标。Based on the difference between the prediction result and the corresponding real labeling result in the verification set, an average average precision evaluation index of the target detection model is determined.
  7. 根据权利要求6所述的基于多模型融合的目标检测方法,其特征在于,所述利用已训练好的所述目标检测模型对验证集中的每张图像进行预测,包括:The target detection method based on multi-model fusion according to claim 6, wherein said utilizing the trained target detection model to predict each image in the verification set comprises:
    分别对验证集中的每张图像进行图像增强,得到所述验证集中每张图像对应的多个增强后图像;Carry out image enhancement to each image in the verification set respectively, and obtain a plurality of enhanced images corresponding to each image in the verification set;
    利用已训练好的所述目标检测模型对所述验证集中每张图像对应的多个增强后图像进行预测,以得到与所述验证集中每张图像对应的多个初始预测结果;Using the trained target detection model to predict a plurality of enhanced images corresponding to each image in the verification set, so as to obtain a plurality of initial prediction results corresponding to each image in the verification set;
    分别对所述验证集中每张图像对应的多个初始预测结果进行加权处理,以得到与所述验证集中每张图像对应的预测结果。Weighting is performed on multiple initial prediction results corresponding to each image in the verification set, so as to obtain a prediction result corresponding to each image in the verification set.
  8. 一种基于多模型融合的目标检测装置,其特征在于,包括:A target detection device based on multi-model fusion, characterized in that it comprises:
    获取模块,用于获取已训练好的多个目标检测模型,并获取对待检测原始图像进行图像增强处理后得到的待检测图像集;The obtaining module is used to obtain a plurality of target detection models that have been trained, and obtain the image set to be detected after performing image enhancement processing on the original image to be detected;
    图像检测模块,用于分别利用每个所述目标检测模型对所述待检测图像集中的待检测图像进行检测,以得到与多个所述目标检测模型对应的多组初始目标检测结果;An image detection module, configured to use each of the target detection models to detect the images to be detected in the set of images to be detected to obtain multiple sets of initial target detection results corresponding to multiple target detection models;
    单模型加权模块,用于分别对每组所述初始目标检测结果中的所有初始目标检测结果进行加权,以得到每个所述目标检测模型各自对应的初次加权目标检测结果;A single-model weighting module, configured to weight all the initial target detection results in each group of the initial target detection results, so as to obtain the initial weighted target detection results corresponding to each of the target detection models;
    多模型加权模块,用于基于预先利用验证集确定的每个所述目标检测模型的权重,对多个所述目标检测模型对应的多个所述初次加权目标检测结果进行加权,得到与所述待检测原始图像对应的最终目标检测结果。A multi-model weighting module, configured to weight the multiple initial weighted target detection results corresponding to multiple target detection models based on the weight of each target detection model determined in advance using the verification set, to obtain the The final target detection result corresponding to the original image to be detected.
  9. 一种电子设备,其特征在于,包括处理器和存储器;其中,所述处理器执行所述存储器中保存的计算机程序时实现如权利要求1至7任一项所述的基于多模型融合的目标检测方法。An electronic device, characterized in that it includes a processor and a memory; wherein, when the processor executes the computer program stored in the memory, the goal based on multi-model fusion according to any one of claims 1 to 7 is achieved Detection method.
  10. 一种计算机可读存储介质,其特征在于,用于存储计算机程序;其中,所述计算机程序被处理器执行时实现如权利要求1至7任一项所述的基于多模型融合的目标检测方法。A computer-readable storage medium, characterized in that it is used to store a computer program; wherein, when the computer program is executed by a processor, the multi-model fusion-based target detection method according to any one of claims 1 to 7 is realized .
PCT/CN2022/090233 2021-10-26 2022-04-29 Multi-model fusion-based object detection method and apparatus, device and medium WO2023071121A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111244219.2A CN113688957A (en) 2021-10-26 2021-10-26 Target detection method, device, equipment and medium based on multi-model fusion
CN202111244219.2 2021-10-26

Publications (1)

Publication Number Publication Date
WO2023071121A1 true WO2023071121A1 (en) 2023-05-04

Family

ID=78587953

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/090233 WO2023071121A1 (en) 2021-10-26 2022-04-29 Multi-model fusion-based object detection method and apparatus, device and medium

Country Status (2)

Country Link
CN (1) CN113688957A (en)
WO (1) WO2023071121A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116777913A (en) * 2023-08-22 2023-09-19 青岛市勘察测绘研究院 Method, device, equipment and medium for detecting registration precision of three-dimensional model
CN116935290A (en) * 2023-09-14 2023-10-24 南京邮电大学 Heterogeneous target detection method and system for high-resolution array camera in airport scene

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113688957A (en) * 2021-10-26 2021-11-23 苏州浪潮智能科技有限公司 Target detection method, device, equipment and medium based on multi-model fusion
CN114693612A (en) * 2022-03-16 2022-07-01 深圳大学 Knee joint bone tumor detection method based on deep learning and related device
CN117541770A (en) * 2022-07-29 2024-02-09 马上消费金融股份有限公司 Data enhancement method and device and electronic equipment
CN115223166A (en) * 2022-09-20 2022-10-21 整数智能信息技术(杭州)有限责任公司 Picture pre-labeling method, picture labeling method and device, and electronic equipment
CN115376054B (en) * 2022-10-26 2023-03-24 浪潮电子信息产业股份有限公司 Target detection method, device, equipment and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110569808A (en) * 2019-09-11 2019-12-13 腾讯科技(深圳)有限公司 Living body detection method and device and computer equipment
CN111178405A (en) * 2019-12-18 2020-05-19 浙江工业大学 Similar object identification method fusing multiple neural networks
CN111680675A (en) * 2020-08-14 2020-09-18 腾讯科技(深圳)有限公司 Face living body detection method, system, device, computer equipment and storage medium
CN112528908A (en) * 2020-12-18 2021-03-19 平安科技(深圳)有限公司 Living body detection method, living body detection device, electronic apparatus, and storage medium
CN112863194A (en) * 2021-01-20 2021-05-28 青岛以萨数据技术有限公司 Image processing method, device, terminal and medium
CN113255607A (en) * 2021-06-30 2021-08-13 北京道达天际科技有限公司 Remote sensing house detection method and device based on multi-model fusion
CN113313058A (en) * 2021-06-16 2021-08-27 北京航空航天大学 Unmanned aerial vehicle aerial image target detection method based on self-adaptive model integration
WO2021190046A1 (en) * 2020-03-24 2021-09-30 京东方科技集团股份有限公司 Training method for gesture recognition model, gesture recognition method, and apparatus
CN113688957A (en) * 2021-10-26 2021-11-23 苏州浪潮智能科技有限公司 Target detection method, device, equipment and medium based on multi-model fusion

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110189307B (en) * 2019-05-14 2021-11-23 慧影医疗科技(北京)有限公司 Pulmonary nodule detection method and system based on multi-model fusion
CN111461784B (en) * 2020-03-31 2022-04-22 华南理工大学 Multi-model fusion-based fraud detection method

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110569808A (en) * 2019-09-11 2019-12-13 腾讯科技(深圳)有限公司 Living body detection method and device and computer equipment
CN111178405A (en) * 2019-12-18 2020-05-19 浙江工业大学 Similar object identification method fusing multiple neural networks
WO2021190046A1 (en) * 2020-03-24 2021-09-30 京东方科技集团股份有限公司 Training method for gesture recognition model, gesture recognition method, and apparatus
CN111680675A (en) * 2020-08-14 2020-09-18 腾讯科技(深圳)有限公司 Face living body detection method, system, device, computer equipment and storage medium
CN112528908A (en) * 2020-12-18 2021-03-19 平安科技(深圳)有限公司 Living body detection method, living body detection device, electronic apparatus, and storage medium
CN112863194A (en) * 2021-01-20 2021-05-28 青岛以萨数据技术有限公司 Image processing method, device, terminal and medium
CN113313058A (en) * 2021-06-16 2021-08-27 北京航空航天大学 Unmanned aerial vehicle aerial image target detection method based on self-adaptive model integration
CN113255607A (en) * 2021-06-30 2021-08-13 北京道达天际科技有限公司 Remote sensing house detection method and device based on multi-model fusion
CN113688957A (en) * 2021-10-26 2021-11-23 苏州浪潮智能科技有限公司 Target detection method, device, equipment and medium based on multi-model fusion

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116777913A (en) * 2023-08-22 2023-09-19 青岛市勘察测绘研究院 Method, device, equipment and medium for detecting registration precision of three-dimensional model
CN116777913B (en) * 2023-08-22 2023-11-03 青岛市勘察测绘研究院 Method, device, equipment and medium for detecting registration precision of three-dimensional model
CN116935290A (en) * 2023-09-14 2023-10-24 南京邮电大学 Heterogeneous target detection method and system for high-resolution array camera in airport scene
CN116935290B (en) * 2023-09-14 2023-12-12 南京邮电大学 Heterogeneous target detection method and system for high-resolution array camera in airport scene

Also Published As

Publication number Publication date
CN113688957A (en) 2021-11-23

Similar Documents

Publication Publication Date Title
WO2023071121A1 (en) Multi-model fusion-based object detection method and apparatus, device and medium
US11961227B2 (en) Method and device for detecting and locating lesion in medical image, equipment and storage medium
CN106919957B (en) Method and device for processing data
CN110879982B (en) Crowd counting system and method
CN113011973B (en) Method and equipment for financial transaction supervision model based on intelligent contract data lake
CN107612938A (en) A kind of network user's anomaly detection method, device, equipment and storage medium
CN110956255A (en) Difficult sample mining method and device, electronic equipment and computer readable storage medium
WO2020108219A1 (en) Traffic safety risk based group division and difference analysis method and system
CN110969200A (en) Image target detection model training method and device based on consistency negative sample
US10885593B2 (en) Hybrid classification system
CN114330499A (en) Method, device, equipment, storage medium and program product for training classification model
CN114818828B (en) Training method of radar interference perception model and radar interference signal identification method
CN110096013A (en) A kind of intrusion detection method and device of industrial control system
CN114092793A (en) End-to-end biological target detection method suitable for complex underwater environment
CN111881706B (en) Living body detection, image classification and model training method, device, equipment and medium
CN110348516B (en) Data processing method, data processing device, storage medium and electronic equipment
CN114363212B (en) Equipment detection method, device, equipment and storage medium
CN116452810A (en) Multi-level semantic segmentation method and device, electronic equipment and storage medium
CN114529890A (en) State detection method and device, electronic equipment and storage medium
CN112001453A (en) Method and device for calculating accuracy of video event detection algorithm
CN116743637A (en) Abnormal flow detection method and device, electronic equipment and storage medium
CN115115680A (en) Image processing method, device, equipment and storage medium
CN115410250A (en) Array type human face beauty prediction method, equipment and storage medium
CN111652102A (en) Power transmission channel target object identification method and system
CN113869516B (en) Knowledge graph embedded model training method and device, electronic equipment and medium