CN109919209A - A kind of domain-adaptive deep learning method and readable storage medium storing program for executing - Google Patents

A kind of domain-adaptive deep learning method and readable storage medium storing program for executing Download PDF

Info

Publication number
CN109919209A
CN109919209A CN201910139916.8A CN201910139916A CN109919209A CN 109919209 A CN109919209 A CN 109919209A CN 201910139916 A CN201910139916 A CN 201910139916A CN 109919209 A CN109919209 A CN 109919209A
Authority
CN
China
Prior art keywords
domain
image
task
learning
sample set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910139916.8A
Other languages
Chinese (zh)
Other versions
CN109919209B (en
Inventor
许娇龙
聂一鸣
肖良
朱琪
商尔科
戴斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
National Defense Technology Innovation Institute PLA Academy of Military Science
Original Assignee
National Defense Technology Innovation Institute PLA Academy of Military Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by National Defense Technology Innovation Institute PLA Academy of Military Science filed Critical National Defense Technology Innovation Institute PLA Academy of Military Science
Priority to CN201910139916.8A priority Critical patent/CN109919209B/en
Publication of CN109919209A publication Critical patent/CN109919209A/en
Application granted granted Critical
Publication of CN109919209B publication Critical patent/CN109919209B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a kind of domain-adaptive deep learning method, target area image is carried out rotation transformation, obtains self-supervisory learning training sample set by this method;By after conversion self-supervisory learning training sample set and source domain training sample set carry out joint training, obtain domain-adaptive deep learning model for the visual task on aiming field.This method can effectively learning objective characteristic of field indicate without being labeled to aiming field sample, promote the performance of Computer Vision Task on aiming field.Disclosed herein as well is a kind of domain-adaptive deep learning readable storage medium storing program for executing, equally have above-mentioned beneficial effect.

Description

A kind of domain-adaptive deep learning method and readable storage medium storing program for executing
Technical field
The present invention relates to domain-adaptive deep learning fields, the especially domain-adaptive of computer-oriented visual task Deep learning method and readable storage medium storing program for executing.
Background technique
The models such as image classification, image, semantic segmentation, target identification, target detection in Computer Vision Task, usually It is all to be obtained by supervised learning training.Supervised learning is based particularly on the supervised learning of deep neural network, it usually needs The training sample largely marked.The mark of these samples needs to expend a large amount of manpower and material resources, for example, image segmentation need into The semantic tagger of row pixel-by-pixel, mark difficulty is very big, and cost is very high.After model on labeled data complete by training, It applies it in test data.When test data and training data distribution having the same, supervised learning be it is a kind of very Effective method.However usually will appear test data in practical application and be distributed different situations from training data, so that The model performance decline learnt on the training data.
Domain-adaptive (domain adaptation) is that solution is above-mentioned since trained and test data distributional difference causes A kind of technical method of model performance decline problem.Training dataset is usually known as source domain, test data set is known as target Field.The data of source domain are with markup information, and the data of target domain are usually not no markup information.Field is certainly Adaptive technique is intended to move to the supervision message of source domain target domain, promotes the performance of task on target domain.
By learning cross-cutting constant character representation, i.e., domain-adaptive study based on deep neural network is usually Character representation with architecture, the performance of the task on Lai Tisheng target domain.Cross-cutting invariant features in order to obtain It indicates, the method for current main-stream is realized by field dual training.Field dual training since it is desired that optimize one simultaneously To the objective function confronted with each other, the non-dual training of the convergence ratio of training process is difficult, and the model that training obtains is frequently not most Excellent.
Summary of the invention
The technical problem to be solved in the present invention is to provide a kind of domain-adaptive deep learning methods, and this method is towards calculating Machine visual task provides a kind of domain-adaptive method of non-confrontation type, to improve the performance of task on target domain;The application A kind of domain-adaptive deep learning readable storage medium storing program for executing is provided simultaneously, equally solution above-mentioned technical problem.
The application provides a kind of domain-adaptive deep learning method, comprising:
Step 1: to each target area image, being rotated by the angle of setting, the image formed after rotation respectively corresponds not Same class label;The image scaling formed after rotation is cropped to identical size, then upsets all image sequences at random, The corresponding class label of each image remains unchanged, and forms self-supervisory learning training sample set;
Step 2: by multi-task learning deep neural network, to the vision on source domain training sample set building aiming field Task (T), while image classification task is constructed to the self-supervisory learning training sample set, then to source domain training sample set Joint training is carried out with self-supervisory learning training sample set;
Step 3: the visual task (T) the deep learning model that above-mentioned joint training obtains being used on aiming field.
Optionally, 0 °, 90 °, 180 ° of rotation are carried out to each target area image in step 1, forms three new figures Piece.
Optionally, it is 224 pixels that the image scaling formed after rotating in step 1, which is cropped to length and width,.
Optionally, carry out data enhancing after rotating in step 1 before the image scaling that is formed, the data enhancing include with Machine brightness or saturation degree are adjusted.
Optionally, the multi-task learning deep neural network includes encoder core network (F), Image Classifier network Branch (C) and visual task network branches (S);
Wherein encoder core network (F) and Image Classifier network branches (C) are to the self-supervisory learning training sample Collection building image classification task, encoder core network (F) and visual task network branches (S) are to source domain training sample set structure Build learning tasks (T).
Readable storage medium storing program for executing provided by the invention is stored with program on the readable storage medium storing program for executing, and described program is processed The step of device realizes the domain-adaptive deep learning method when executing.
Domain-adaptive deep learning method provided by the invention, by combining source domain supervised learning and target domain certainly Supervised learning task carrys out the expression of learning objective domain features, to realize domain-adaptive.This method can give full play to supervision Learn the high efficiency in deep neural network training, and does not depend on artificial mark to construct target domain self-supervisory learning training Collection.By the joint training of source domain sample and target domain sample, the general character of source domain and target domain is efficiently utilized, To establish the character representation for adapting to target domain task, to improve the performance of task on target domain.
Domain-adaptive deep learning readable storage medium storing program for executing provided by the invention equally has above-mentioned beneficial effect.
Detailed description of the invention
Fig. 1 is the flow diagram of the self-supervisory domain-adaptive deep learning training process of the embodiment of the present invention.
Fig. 2 is the flow diagram that the model that will be trained is used for target domain test.
Specific embodiment
Present invention is further described in detail with reference to the accompanying drawing.
Fig. 1 gives the domain-adaptive deep learning training flow diagram of the embodiment of the present invention.This method is mainly wrapped Include following steps:
Step 1: target area image being subjected to rotation transformation, obtains self-supervisory learning training sample set;
Step 2: by after conversion self-supervisory learning training sample set and source domain training sample carry out joint training, obtain Deep learning model.
Step 3: the visual task T model that above-mentioned joint training obtains being used on aiming field.
Target area image is subjected to rotation transformation in step 1, obtains self-supervisory learning training sample set.The process is right first Target area image carries out 0 °, 90 ° and 180 ° of rotation respectively, and three rotation angles respectively correspond class label 0,1 and 2.This Process does not need manually to be labeled each picture one by one, realizes in self-supervisory study and marks automatically generating for sample. Data enhancing pretreatment is carried out to postrotational image, including adjusts contrast, brightness at random, then scaling is cropped to unification Picture size, it is 224 pixels that the present embodiment image scaling, which is cut into length and width,.By treated, image sequence is upset at random, often The corresponding label of a image is constant.Process input is target area image, is exported as the target area image X after conversiontAnd its it is right The rotation class label Y answeredt
Step 2 by after conversion self-supervisory learning training sample set and source domain training sample carry out joint training, obtain Deep learning model.The process constructs a multi-task learning deep neural network first, which includes a feature extraction Encoder core network F, Image Classifier network branches C network point corresponding with the visual task T on aiming field Branch S.The supervised learning of encoder core network F and visual task network branches S for task T on source domain sample in the network Task;Encoder core network F and Image Classifier network branches C is used for transformed target domain self-supervisory learning sample Image classification task.
In figure, solid black lines arrow represents the forward-propagating of data in neural network, and dotted line indicates the backpropagation of gradient. For image classification task, by the transformed image X of aiming fieldtInput coding device core network F, the feature of output is as figure As the input of classifier network branches C, obtained output Yt *For the image rotation classification of prediction.Classifier loss function is according to mark Infuse classification YtWith prediction classification Yt *Classifier error is calculated, the error is by backpropagation to Image Classifier network branches C It is updated with the parameter of feature coding device F.For visual task T, by source domain image XsInput coding device core network F, Input of the feature of output as the network branches S of visual task T, obtained output Ys *For the predicted value of visual task T, vision Task T loss function is according to mark classification YsWith prediction classification Ys *The error of visual task T is calculated, the error is by reversely passing It broadcasts and the parameter of visual task network branches S and encoder core network F is updated.Due to the sample standard deviation of source domain and aiming field The parameter for influencing encoder core network F updates, therefore can be learnt by the encoder core network F that training obtains to across neck The character representation in domain, to realize to the adaptive of target domain.
Fig. 2 gives the model for obtaining field adaptive learning and illustrates applied to the process of target domain visual task T Figure.
As shown in Fig. 2, target domain image zooming-out feature of the feature extraction encoder core network F to input, then will Task T prediction result is calculated by propagated forward in input of the obtained feature as task T-network branch C.
Readable storage medium storing program for executing provided by the embodiments of the present application is introduced below, readable storage medium storing program for executing described below with Above-described domain-adaptive deep learning method can correspond to each other reference.
A kind of readable storage medium storing program for executing disclosed in the present application, is stored thereon with program, and neck is realized when program is executed by processor The step of adaptive deep learning method in domain.
It is apparent to those skilled in the art that for convenience and simplicity of description, foregoing description it is readable The process of program, can refer to corresponding processes in the foregoing method embodiment in storage medium, and details are not described herein.
Technical effect in order to better illustrate the present invention, by taking image, semantic segmentation task as an example, inventor is also carried out Following experiments:
Experiment 1: from SYNTHIA data set to the segmentation domain-adaptive study of the image, semantic of Cityscapes data set
The experiment carries out domain-adaptive study between SYNTHIA data set and Cityscapes data set.SYNTHIA It is a virtual scene data set, all data are by three-dimensional artificial software development, altogether comprising 9400 pictures and its correspondence Semantic segmentation mark.Cityscapes is a real world data collection, and training dataset includes 2975 pictures, verifies number It include 500 pictures according to collection.It includes 19 kinds of semantic labels that Cityscapes, which has altogether,.In this experiment, we use SYNTHIA number According to collection and 13 Cityscapes public class semantic labels.In the experiment, SYNTHIA data set as source domain data set, Cityscapes data set is as target domain data set.The validation data set of Cityscapes is used to assess to be mentioned in this experiment The performance of method out.Evaluation metrics are handed over and using average than mIoU (mean intersection over union).mIoU What is indicated is the coverage rate of the image, semantic segmentation result relative real value of prediction.Shown in test result is as follows:
Table 1:
It arranges from third to last one and arranges in table 1, each column respectively indicate some semantic classes, and secondary series mean accuracy is The average value of semantic segmentation precision of all categories.Table 1 compared three kinds of methods, the method including only utilizing source domain sample training (SRC), the method based on dual training (FCN-W) and method proposed by the present invention (RotDA).From table 1 it follows that Only with the method for source domain sample training, due to not doing domain-adaptive adjustment, performance is worst on aiming field.Using pair Field degree of aliasing of the method for anti-training due to improving character representation, available preferable adaptive learning effect.This hair It is bright to be learnt by self-supervisory, the character representation for more adapting to target domain is obtained, therefore achieve significant property in target domain It can be promoted.
Experiment 2: from GTA data set to the segmentation domain-adaptive study of the image, semantic of Cityscapes data set
The experiment carries out domain-adaptive study between GTA data set and Cityscapes data set.GTA data set comes Los Angeles City scenarios in 3 D video game, the data set include 24966 pictures and its corresponding semantic segmentation Mark.Mark is consistent with the definition of the semantic label of Cityscapes comprising 19 class semantic labels, therefore this experiment is in 19 classes semanteme It is assessed on label.In the experiment, GTA data set is led as source domain data set, Cityscapes data set as target Numeric field data collection.Shown in test result is as follows:
Table 2:
From the available conclusion similar with table 1 of table 2, i.e., domain-adaptive deep learning method can be obtained leads better than source The performance of domain model, and compare dual training better performance.Have benefited from the joint by source domain and target domain sample Training has obtained adapting to the character representation of target domain, and the present invention achieves excellent performance.
Although the present invention has been described by means of preferred embodiments, the present invention is not limited to as described herein Embodiment further includes made various changes and variation without departing from the present invention.

Claims (6)

1. a kind of domain-adaptive deep learning method, characterized in that it comprises the following steps:
S1: it to each target area image, is rotated by the angle of setting, the image formed after rotation respectively corresponds different classifications Label;The image scaling formed after rotation is cropped to identical size, then upsets all image sequences at random, each image Corresponding class label remains unchanged, and forms self-supervisory learning training sample set;
S2: by multi-task learning deep neural network, to the visual task on source domain training sample set building aiming field (T), image classification task while to the self-supervisory learning training sample set is constructed, then to source domain training sample set and certainly Supervised learning training sample set carries out joint training;
S3: the visual task (T) the deep learning model that above-mentioned joint training obtains being used on aiming field.
2. according to the method described in claim 1, it is characterized by: in step S1 each target area image carry out 0 °, 90 °, 180 ° of rotation form three new pictures.
3. according to the method described in claim 2, it is characterized by: the image scaling formed after rotating in step S1 is cropped to Length and width is 224 pixels.
4. according to the method described in claim 1, it is characterized by: being carried out before the image scaling formed after being rotated in step S1 Data enhancing, the data enhancing include that random brightness or saturation degree are adjusted.
5. according to the method described in claim 1, it is characterized by: the multi-task learning deep neural network includes encoder Core network (F), Image Classifier network branches (C) and visual task network branches (S);
Wherein encoder core network (F) and Image Classifier network branches (C) are to the self-supervisory learning training sample set structure It builds image classification task, encoder core network (F) and visual task network branches (S) and is constructed to source domain training sample set Habit task (T).
6. a kind of readable storage medium storing program for executing, which is characterized in that be stored with program on the readable storage medium storing program for executing, described program is processed Such as domain-adaptive deep learning method described in any one of claim 1 to 5 is realized when device executes.
CN201910139916.8A 2019-02-26 2019-02-26 Domain self-adaptive deep learning method and readable storage medium Active CN109919209B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910139916.8A CN109919209B (en) 2019-02-26 2019-02-26 Domain self-adaptive deep learning method and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910139916.8A CN109919209B (en) 2019-02-26 2019-02-26 Domain self-adaptive deep learning method and readable storage medium

Publications (2)

Publication Number Publication Date
CN109919209A true CN109919209A (en) 2019-06-21
CN109919209B CN109919209B (en) 2020-06-19

Family

ID=66962296

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910139916.8A Active CN109919209B (en) 2019-02-26 2019-02-26 Domain self-adaptive deep learning method and readable storage medium

Country Status (1)

Country Link
CN (1) CN109919209B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111093123A (en) * 2019-12-09 2020-05-01 华中科技大学 Flexible optical network time domain equalization method and system based on composite neural network
CN111144565A (en) * 2019-12-27 2020-05-12 中国人民解放军军事科学院国防科技创新研究院 Self-supervision field self-adaptive deep learning method based on consistency training
CN111160553A (en) * 2019-12-23 2020-05-15 中国人民解放军军事科学院国防科技创新研究院 Novel field self-adaptive learning method
CN111898696A (en) * 2020-08-10 2020-11-06 腾讯云计算(长沙)有限责任公司 Method, device, medium and equipment for generating pseudo label and label prediction model
CN112308149A (en) * 2020-11-02 2021-02-02 平安科技(深圳)有限公司 Optimization method and device for image information identification based on machine learning
CN112417940A (en) * 2019-08-20 2021-02-26 通用汽车环球科技运作有限责任公司 Domain adaptation for image analysis
CN112949583A (en) * 2021-03-30 2021-06-11 京科互联科技(山东)有限公司 Target detection method, system, equipment and storage medium for complex city scene
CN114283290A (en) * 2021-09-27 2022-04-05 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium
CN115701868A (en) * 2022-08-22 2023-02-14 中山大学中山眼科中心 Domain self-adaptive enhancement method suitable for various visual tasks
WO2023159819A1 (en) * 2022-02-25 2023-08-31 北京百度网讯科技有限公司 Visual processing and model training methods, device, storage medium and program product

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170161633A1 (en) * 2015-12-07 2017-06-08 Xerox Corporation Transductive adaptation of classifiers without source data
CN107273927A (en) * 2017-06-13 2017-10-20 西北工业大学 Sorting technique is adapted to based on the unsupervised field matched between class
CN107316061A (en) * 2017-06-22 2017-11-03 华南理工大学 A kind of uneven classification ensemble method of depth migration study
CN107909101A (en) * 2017-11-10 2018-04-13 清华大学 Semi-supervised transfer learning character identifying method and system based on convolutional neural networks
CN108009633A (en) * 2017-12-15 2018-05-08 清华大学 A kind of Multi net voting towards cross-cutting intellectual analysis resists learning method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170161633A1 (en) * 2015-12-07 2017-06-08 Xerox Corporation Transductive adaptation of classifiers without source data
CN107273927A (en) * 2017-06-13 2017-10-20 西北工业大学 Sorting technique is adapted to based on the unsupervised field matched between class
CN107316061A (en) * 2017-06-22 2017-11-03 华南理工大学 A kind of uneven classification ensemble method of depth migration study
CN107909101A (en) * 2017-11-10 2018-04-13 清华大学 Semi-supervised transfer learning character identifying method and system based on convolutional neural networks
CN108009633A (en) * 2017-12-15 2018-05-08 清华大学 A kind of Multi net voting towards cross-cutting intellectual analysis resists learning method and system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JIAOLONG XU等: "Domain Adaptation of Deformable Part-Based Models", 《IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112417940B (en) * 2019-08-20 2024-02-23 通用汽车环球科技运作有限责任公司 Domain adaptation for image analysis
CN112417940A (en) * 2019-08-20 2021-02-26 通用汽车环球科技运作有限责任公司 Domain adaptation for image analysis
CN111093123A (en) * 2019-12-09 2020-05-01 华中科技大学 Flexible optical network time domain equalization method and system based on composite neural network
CN111093123B (en) * 2019-12-09 2020-12-18 华中科技大学 Flexible optical network time domain equalization method and system based on composite neural network
CN111160553A (en) * 2019-12-23 2020-05-15 中国人民解放军军事科学院国防科技创新研究院 Novel field self-adaptive learning method
CN111144565A (en) * 2019-12-27 2020-05-12 中国人民解放军军事科学院国防科技创新研究院 Self-supervision field self-adaptive deep learning method based on consistency training
CN111144565B (en) * 2019-12-27 2020-10-27 中国人民解放军军事科学院国防科技创新研究院 Self-supervision field self-adaptive deep learning method based on consistency training
CN111898696A (en) * 2020-08-10 2020-11-06 腾讯云计算(长沙)有限责任公司 Method, device, medium and equipment for generating pseudo label and label prediction model
CN111898696B (en) * 2020-08-10 2023-10-27 腾讯云计算(长沙)有限责任公司 Pseudo tag and tag prediction model generation method, device, medium and equipment
CN112308149B (en) * 2020-11-02 2023-10-24 平安科技(深圳)有限公司 Optimization method and device for image information identification based on machine learning
CN112308149A (en) * 2020-11-02 2021-02-02 平安科技(深圳)有限公司 Optimization method and device for image information identification based on machine learning
CN112949583A (en) * 2021-03-30 2021-06-11 京科互联科技(山东)有限公司 Target detection method, system, equipment and storage medium for complex city scene
CN114283290A (en) * 2021-09-27 2022-04-05 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium
CN114283290B (en) * 2021-09-27 2024-05-03 腾讯科技(深圳)有限公司 Training of image processing model, image processing method, device, equipment and medium
WO2023159819A1 (en) * 2022-02-25 2023-08-31 北京百度网讯科技有限公司 Visual processing and model training methods, device, storage medium and program product
CN115701868A (en) * 2022-08-22 2023-02-14 中山大学中山眼科中心 Domain self-adaptive enhancement method suitable for various visual tasks
CN115701868B (en) * 2022-08-22 2024-02-06 中山大学中山眼科中心 Domain self-adaptive enhancement method applicable to various visual tasks

Also Published As

Publication number Publication date
CN109919209B (en) 2020-06-19

Similar Documents

Publication Publication Date Title
CN109919209A (en) A kind of domain-adaptive deep learning method and readable storage medium storing program for executing
CN109741332B (en) Man-machine cooperative image segmentation and annotation method
US20230177343A1 (en) Scene understanding and generation using neural networks
Andriluka et al. Fluid annotation: a human-machine collaboration interface for full image annotation
DeVries et al. Dataset augmentation in feature space
KR102318772B1 (en) Domain Separation Neural Networks
US11314989B2 (en) Training a generative model and a discriminative model
CA3069365A1 (en) Generation of point of interest copy
CN113408537B (en) Remote sensing image domain adaptive semantic segmentation method
CN110288007A (en) The method, apparatus and electronic equipment of data mark
CN111709966B (en) Fundus image segmentation model training method and device
CN110880176B (en) Semi-supervised industrial image defect segmentation method based on countermeasure generation network
CN114266945B (en) Training method of target detection model, target detection method and related device
Liao et al. A deep ordinal distortion estimation approach for distortion rectification
US10922852B2 (en) Oil painting stroke simulation using neural network
CN111666979B (en) Underwater scene target detection integration method and system for label generation
US20220076062A1 (en) Image processing device and operation method thereof
CN110599453A (en) Panel defect detection method and device based on image fusion and equipment terminal
KR20230073751A (en) System and method for generating images of the same style based on layout
CN111325212A (en) Model training method and device, electronic equipment and computer readable storage medium
CN114299573A (en) Video processing method and device, electronic equipment and storage medium
CN107729885B (en) Face enhancement method based on multiple residual error learning
CN116091784A (en) Target tracking method, device and storage medium
Khoroshiltseva et al. Relaxation labeling meets gans: solving jigsaw puzzles with missing borders
CN114897884A (en) No-reference screen content image quality evaluation method based on multi-scale edge feature fusion

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant