CN109978029B - Invalid image sample screening method based on convolutional neural network - Google Patents

Invalid image sample screening method based on convolutional neural network

Publication number
CN109978029B
Authority
CN
China
Prior art keywords
sample
invalid
model
samples
images
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201910188287.8A
Other languages
Chinese (zh)
Other versions
CN109978029A (en)
Inventor
张永军
闫思宇
沈涛
文韩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Xinhang Century Information Technology Co ltd
Original Assignee
Beijing University of Posts and Telecommunications
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing University of Posts and Telecommunications
Priority to CN201910188287.8A
Publication of CN109978029A
Application granted
Publication of CN109978029B

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/10 Pre-processing; Data cleansing
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Abstract

The invention discloses a method for screening invalid image samples by building a convolutional neural network (CNN) sample-filtering model. The model screens out the large number of invalid (redundant) sample images, such as blurred, blank-shot, and damaged images, from the original samples. The remaining samples are of better quality and more representative, and can serve as a valid sample set for image classification. The end result is that invalid samples are screened out algorithmically, which reduces the working hours and labor cost otherwise spent on screening them by hand.

Description

Invalid image sample screening method based on convolutional neural network
Technical Field
The invention relates to the field of machine learning, and in particular to a method for screening invalid image samples by building a convolutional neural network sample-filtering model.
Background
When a convolutional neural network is used for image classification, a large number of classified image samples are required as a sample library for building the model. In actual industrial processes, the product image data collected during production-line operation often contains invalid images, such as blurred images, blank-shot images, and damaged images. These images are invalid data for model construction.
Disclosure of Invention
The main technical problem solved by the invention is to provide a method for screening invalid image samples by building a convolutional neural network sample-filtering model. The method screens out blurred images, blank-shot images, and the like from the collected samples with fewer working hours and lower labor cost, thereby cleaning the original samples.
To solve this technical problem, the invention adopts the following technical scheme: blur filtering is applied to the unlabeled sample images to screen out the extremely blurred images, which constitutes a first-pass blur cleaning.
The filtered samples are then manually classified into invalid images (blurred images, blank-shot images, and the like) and valid sample images.
The extremely blurred images screened out by the filter are also assigned to the invalid class, so that the images fall into two classes, which together form a sample library.
A CNN algorithm uses the sample library to build a binary classifier model, and the model is then used to clean samples by screening out the invalid samples from a large number of samples.
The beneficial effects of the invention are as follows: by using a convolutional neural network to clean samples, the method screens out the invalid samples from a large number of samples, reduces the working hours consumed by image screening, and lowers labor cost.
Drawings
FIG. 1 is a schematic flow diagram of the method for screening invalid image samples based on building a convolutional neural network sample-filtering model;
FIG. 2 is a schematic flow diagram of the process for constructing the sample-filtering model.
Detailed Description
The preferred embodiments of the invention are described in detail below with reference to the accompanying drawings, so that those skilled in the art can more easily understand the advantages and features of the invention and the scope of protection of the invention is more clearly defined.
Referring to FIG. 1 and FIG. 2, an embodiment of the invention includes:
A method for screening invalid image samples based on a convolutional neural network sample-filtering model, in which a convolutional neural network is used to build the model and the model is applied to the collected samples, so that invalid data such as blurred images and blank-shot images are screened out of the collected samples with fewer working hours and lower labor cost, and the original samples are cleaned.
Example one: blurred and refined samples
(1) Preparation of the original samples: image data of manufactured products is collected during industrial production, with 1000 images as the base quantity.
(2) Blur filter: the cv2.Laplacian() method of OpenCV can be used to filter out pictures with a high degree of blur, which performs a first cleaning pass on the samples.
How the blurred and refined samples are obtained: the prepared base samples are passed through the blur filter; the pictures with a higher degree of blur are filtered out as blurred samples, and the remaining samples are the refined samples.
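A blur filter of this kind is commonly realised as a variance-of-Laplacian sharpness score: a sharp image has strong edges, so its Laplacian response varies widely, while a blurred or featureless image gives a low variance. The sketch below is a dependency-free illustration of that idea and is not taken from the patent. The function names, the 4-neighbour kernel, and the threshold of 100.0 are assumptions chosen for illustration. With OpenCV, a comparable score is usually obtained from `cv2.Laplacian(gray, cv2.CV_64F).var()`.

```python
from statistics import pvariance


def laplacian_variance(gray):
    """Return the variance of the 4-neighbour Laplacian over interior pixels.

    `gray` is a 2-D list of grayscale intensities (rows of equal length).
    """
    height, width = len(gray), len(gray[0])
    responses = []
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            lap = (gray[y - 1][x] + gray[y + 1][x]
                   + gray[y][x - 1] + gray[y][x + 1]
                   - 4 * gray[y][x])
            responses.append(lap)
    return pvariance(responses) if responses else 0.0


def split_by_blur(images, threshold=100.0):
    """Split {name: image} into (blurred, refined) lists of names.

    The threshold is a hypothetical value; it would need tuning per dataset.
    """
    blurred, refined = [], []
    for name, gray in images.items():
        target = blurred if laplacian_variance(gray) < threshold else refined
        target.append(name)
    return blurred, refined


# A checkerboard has strong edges; a constant image has none.
sharp = [[255 * ((x + y) % 2) for x in range(8)] for y in range(8)]
flat = [[128] * 8 for _ in range(8)]
blurred_names, refined_names = split_by_blur({"sharp.png": sharp, "flat.png": flat})
```

Here the constant image lands in the blurred list and the checkerboard in the refined list. On real production images the threshold would have to be chosen empirically.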
Example two: sample collection
(1) Manual filtering: while no sample-filtering model exists yet, the refined samples must be manually classified into an invalid sample type and a valid sample type.
(2) Invalid sample type: sample images such as blurred images, blank-shot images, and damaged images form the invalid sample type.
(3) Valid sample type: the other clear, distinct images form the valid sample type.
(4) Sample set: the invalid samples and the valid samples together form the sample set.
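The assembly of the two-class sample set can be sketched as follows. This is only an illustration of the bookkeeping: the label encoding (`INVALID = 0`, `VALID = 1`), the function name, and the string tags used for the manual labels are hypothetical and do not come from the patent.

```python
INVALID, VALID = 0, 1  # hypothetical label encoding


def build_sample_set(blurred, manual_labels):
    """Combine blur-filter rejects and manual labels into one labeled set.

    blurred: names rejected by the blur filter (all treated as invalid).
    manual_labels: {name: tag}, where the tag "valid" marks a valid image and
        any other tag (e.g. "blurred", "blank", "damaged") marks an invalid one.
    """
    sample_set = {name: INVALID for name in blurred}
    for name, tag in manual_labels.items():
        sample_set[name] = VALID if tag == "valid" else INVALID
    return sample_set


sample_set = build_sample_set(
    ["a.png"],
    {"b.png": "valid", "c.png": "blank", "d.png": "damaged"},
)
```

Only `b.png` ends up labeled valid; the blur-filter reject and the manually rejected images share the invalid label.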
Example three: constructing the sample-filtering model
(1) Algorithm: sample cleaning is implemented with a deep convolutional neural network algorithm.
(2) Checking the number and proportion of samples: if the positive and negative samples are imbalanced, the following methods can be used:
Oversampling: increase the number of minority-class samples, either by copying minority-class samples or by adding random noise or perturbation data to them, so that new samples are generated according to certain rules.
Downsampling: reduce the number of majority-class samples by randomly removing them until the majority and minority classes are the same size.
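The two balancing strategies can be sketched as below. This is a minimal illustration using random duplication for oversampling and random removal for downsampling; the noise-adding variant of oversampling is not shown. The function names, the sample names, and the fixed seed are assumptions made for the example.

```python
import random


def oversample(minority, target_size, rng):
    """Grow the minority class to target_size by duplicating random members."""
    extra = [rng.choice(minority) for _ in range(target_size - len(minority))]
    return list(minority) + extra


def downsample(majority, target_size, rng):
    """Shrink the majority class to target_size by random removal."""
    return rng.sample(majority, target_size)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
invalid = [f"invalid_{i}" for i in range(3)]
valid = [f"valid_{i}" for i in range(10)]

balanced_up = oversample(invalid, len(valid), rng)      # 3 -> 10 samples
balanced_down = downsample(valid, len(invalid), rng)    # 10 -> 3 samples
```

Duplication adds no new information and can encourage overfitting, while downsampling discards data, so which one fits better depends on how many samples are available.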
(3) Model construction: once the number of positive and negative samples in the sample set is adequate and their proportion is balanced, model training begins, following the flow of FIG. 2.
Accuracy (ACC) is the proportion of all samples that are classified correctly:
$$\mathrm{ACC} = \frac{TP + TN}{TP + TN + FP + FN}$$
Recall is a measure of coverage, namely the proportion of actual positive examples that are classified as positive:
$$\mathrm{recall} = \frac{TP}{TP + FN}$$
where TP, TN, FP, and FN denote the numbers of true positives, true negatives, false positives, and false negatives.
When the model's accuracy (ACC) or recall is low, the trained model does not meet the requirements.
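The acceptance check can be sketched with the standard confusion-matrix definitions of accuracy and recall. The patent does not state numeric thresholds or which class counts as positive, so the 0.9 thresholds, the choice of label 1 as the positive class, and all function names below are assumptions for illustration.

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary problem."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    return tp, tn, fp, fn


def accuracy(tp, tn, fp, fn):
    """Fraction of all samples that were classified correctly."""
    total = tp + tn + fp + fn
    return (tp + tn) / total if total else 0.0


def recall(tp, fn):
    """Fraction of actual positives that were classified as positive."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def model_is_acceptable(y_true, y_pred, min_accuracy=0.9, min_recall=0.9):
    """Reject the model if either metric falls below its (assumed) threshold."""
    tp, tn, fp, fn = confusion_counts(y_true, y_pred)
    return accuracy(tp, tn, fp, fn) >= min_accuracy and recall(tp, fn) >= min_recall


y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 1, 0, 0, 0, 0, 1]
tp, tn, fp, fn = confusion_counts(y_true, y_pred)
acceptable = model_is_acceptable(y_true, y_pred)
```

With one missed positive and one false alarm out of eight samples, both metrics come out at 0.75, so this toy model would be rejected under the assumed thresholds.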
(4) The newly trained model is then put into the flow of FIG. 1 for processing.
Example four: a sample library that meets the requirements
(1) Judging whether the sample library meets the requirements: for example, whether the number of samples in the library is sufficient.
(2) Loop processing: a sufficiently large sample library generally cannot be obtained in one pass. When the sample library does not meet the requirements, a loop starts: the original samples are passed through the blur filter to separate out the blurred images, the remaining sample images are classified by the sample-filtering model to determine the class of each new image, and sample cleaning is thereby achieved.
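The loop can be sketched as a batch-wise accumulation that stops once the library is large enough. The sketch treats the blur filter and the trained model as opaque callables; `is_blurred`, `classify_valid`, `required_size`, and the toy sample tuples are hypothetical stand-ins, since the patent does not define a programming interface.

```python
def clean_in_batches(batches, is_blurred, classify_valid, required_size):
    """Accumulate valid samples batch by batch until the library is large enough.

    is_blurred: callable standing in for the blur filter.
    classify_valid: callable standing in for the trained sample-filtering model.
    """
    library, rejected = [], []
    for batch in batches:
        for sample in batch:
            if is_blurred(sample) or not classify_valid(sample):
                rejected.append(sample)
            else:
                library.append(sample)
        if len(library) >= required_size:
            break  # the sample library now meets the size requirement
    return library, rejected


# Toy stand-ins: each sample is a (name, sharpness, looks_valid) tuple.
batches = [
    [("a", 5, True), ("b", 500, True), ("c", 400, False)],
    [("d", 300, True), ("e", 2, True)],
    [("f", 900, True)],
]
library, rejected = clean_in_batches(
    batches,
    is_blurred=lambda s: s[1] < 100,
    classify_valid=lambda s: s[2],
    required_size=2,
)
```

In this run the loop stops after the second batch, so the third batch is never processed.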

Claims (5)

1. A method for screening invalid image samples based on building a convolutional neural network sample-filtering model, comprising the following steps:
S1: collecting image data of products in industrial production as a basis to form an original sample set;
S2: constructing a blur filter to filter out pictures with a high degree of blur;
S3: manually classifying the images remaining after the blur filter, namely the refined samples, into an invalid sample type and a valid sample type, the extremely blurred images filtered out by the blur filter also being assigned to the invalid sample type, wherein the invalid sample type comprises blurred images, blank-shot images, and damaged images;
S4: forming a sample set from the two types, invalid samples and valid samples;
S5: constructing a convolutional neural network sample-filtering model;
S6: classifying a large number of original sample images in a loop using the blur filter of S2 and the model of S5, thereby cleaning the samples, until a sample library that meets the requirements is built.
2. The method of claim 1, wherein: before step S3, the blur filter of step S2 is applied to filter out sample images with a high degree of blur in advance.
3. The method of claim 1, wherein: the refined samples are processed manually in step S3.
4. The method of claim 1, wherein: in step S5, the model is trained when the number of positive and negative samples in the sample set is adequate and their proportion is balanced, and the model is discarded when its recall and accuracy do not meet the requirements.
5. The method of claim 1, wherein: when the sample library does not meet the requirements, the loop of step S6 is started to classify a large number of original sample images and clean the samples.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910188287.8A CN109978029B (en) 2019-03-13 2019-03-13 Invalid image sample screening method based on convolutional neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910188287.8A CN109978029B (en) 2019-03-13 2019-03-13 Invalid image sample screening method based on convolutional neural network

Publications (2)

Publication Number Publication Date
CN109978029A (en) 2019-07-05
CN109978029B (en) 2021-02-09

Family

ID=67078702

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910188287.8A Active CN109978029B (en) 2019-03-13 2019-03-13 Invalid image sample screening method based on convolutional neural network

Country Status (1)

Country Link
CN (1) CN109978029B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110990917A (en) * 2019-11-19 2020-04-10 北京长空云海科技有限公司 BIM model display method, device and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5590218A (en) * 1993-10-18 1996-12-31 Bayer Corporation Unsupervised neural network classification with back propagation
CN106067020A (en) * 2016-06-02 2016-11-02 广东工业大学 The system and method for quick obtaining effective image under real-time scene
CN108960409A (en) * 2018-06-13 2018-12-07 南昌黑鲨科技有限公司 Labeled data generation method, equipment and computer readable storage medium
CN109379557A (en) * 2018-09-30 2019-02-22 田东县文设芒果专业合作社 Mango insect pest intelligent monitor system based on image recognition

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7991223B2 (en) * 2006-09-05 2011-08-02 Bruker Daltonik Gmbh Method for training of supervised prototype neural gas networks and their use in mass spectrometry
KR102592076B1 (en) * 2015-12-14 2023-10-19 삼성전자주식회사 Appartus and method for Object detection based on Deep leaning, apparatus for Learning thereof
CN107909566A (en) * 2017-10-28 2018-04-13 杭州电子科技大学 A kind of image-recognizing method of the cutaneum carcinoma melanoma based on deep learning
CN108171175B (en) * 2017-12-29 2020-06-23 苏州科达科技股份有限公司 Deep learning sample enhancement system and operation method thereof
CN108154134B (en) * 2018-01-11 2019-07-23 天格科技(杭州)有限公司 Pornographic image detection method is broadcast live in internet based on depth convolutional neural networks
CN108764372B (en) * 2018-06-08 2019-07-16 Oppo广东移动通信有限公司 Construction method and device, mobile terminal, the readable storage medium storing program for executing of data set
CN108986075A (en) * 2018-06-13 2018-12-11 浙江大华技术股份有限公司 A kind of judgment method and device of preferred image
CN109165671A (en) * 2018-07-13 2019-01-08 上海交通大学 Confrontation sample testing method based on sample to decision boundary distance
CN109117887A (en) * 2018-08-17 2019-01-01 哈尔滨工业大学 A kind of support vector machines accelerating method and device for low-dimensional data screening sample
CN109241903B (en) * 2018-08-30 2023-08-29 平安科技(深圳)有限公司 Sample data cleaning method, device, computer equipment and storage medium

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Consensus of Ambiguity: Theory and Application of Active Learning for Biomedical Image Analysis;Scott Doyle 等;《pattern recognition in bioinformatics-5th IAPR international conference》;20100930;314-324 *
Efficient Multiple Instance Convolutional Neural Networks for Gigapixel Resolution Image Classification;Le Hou 等;《arXiv:1504.07947v3》;20150519;1-9 *
基于卷积神经网络的小样本图像识别方法;段萌 等;《计算机工程与设计》;20180131;第39卷(第1期);224-229 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20240129

Address after: Room 401-55, No. 821 Lianting Road, Min'an Street, Xiang'an District, Xiamen City, Fujian Province, 361101

Patentee after: Xiamen Xinhang Century Information Technology Co.,Ltd.

Country or region after: China

Address before: 100876 Beijing city Haidian District Xitucheng Road No. 10

Patentee before: Beijing University of Posts and Telecommunications

Country or region before: China
