CN110473195A

CN110473195A - It is a kind of can automatic customization medicine lesion detection framework and method

Info

Publication number: CN110473195A
Application number: CN201910743751.5A
Authority: CN
Inventors: 梁小丹; 王绍菊; 林冰倩; 林倞
Original assignee: Sun Yat Sen University
Current assignee: Sun Yat Sen University
Priority date: 2019-08-13
Filing date: 2019-08-13
Publication date: 2019-11-19
Anticipated expiration: 2039-08-13
Also published as: CN110473195B

Abstract

The invention discloses it is a kind of can automatic customization medicine lesion detection framework and method, the detection framework include: candidate feature extraction module, feature extraction is carried out to medical image；Lesion detects network header automatic customization module, defines search space and merges the perception relationship between candidate region, and using can micro- NAS algorithm obtain best lesion detection network header；Lesion detects the optimal module of network header, utilize a convolutional layer, and new candidate feature is obtained by a standard cell and two contractive cells, binary classification is carried out to candidate feature by two articulamentums and prediction block returns, the weight M that candidate feature in binary classification is classified is exported as high-level semantic information to knowledge migration module；Knowledge migration module transmits relevant contextual information, the candidate feature enhanced in conjunction with semantic relation and in different regions, and the candidate feature of enhancing and former candidate feature is merged, and carries out multivariate classification and recurrence finally by full articulamentum.

Description

It is a kind of can automatic customization medicine lesion detection framework and method

Technical field

The present invention relates to the technical fields such as image recognition, target detection and deep learning, can be automatic more particularly to one kind The medicine lesion detection framework and method of customization.

Background technique

Object detection task is position and the size found out all interested targets in image, and determine them.Medicine Lesion Detection task is position and the size found out lesion all in medical image, and determine lesion；This is area of computer aided The important prerequisite of detection/diagnosis (CADe/CADx).Currently, with deep learning algorithm, especially convolutional neural networks (CNN) Fast development, to medicine lesion detection bring significant progress.However, most methods are all directly by natural figure at present The various CNN pre-training target detection models such as RetinaNet of picture, full convolutional network (R-FCN) based on region etc., for curing Learn the lesion detection of image；But since there is huge field difference, the detections of medicine lesion for medical image and natural image Can have that such as lesion and the high similitude of background, lesion classification is unequal and the field is specifically challenged based on small lesion, institute Directly to use the method limited performance of traditional natural image detection model, performance can be caused to drop because of above-mentioned challenge It is low.Therefore, the network architecture that customization detects dedicated for medicine lesion is very important.

Recently, neural framework search (Neural Architecture Search, NAS) is in image classification, semantic segmentation Very competitive performance is achieved in tasks with natural image processing etc., and the purpose of NAS is searched automatically according to task object The optimal neural network framework of rope, the limitation of researcher's hand-designed is broken through with this, to realize more preferably performance.Existing Target detection NAS work only moves to the network architecture searched in picture classification task in detection skeleton, Er Qiexu Consume a large amount of GPU video memory and time.

Traditional target detection frame mainly includes three parts: feature extractor, region candidate network (RPN) and be based on area The head CNN in domain.It is generally known that facilitating lesion inspection with the feature extractor and RPN of natural image such as ImageNet pre-training It surveys, is the important component of medicine lesion detection network.Currently, there are many researchs on the head CNN based on region, such as attached drawing 1 It is shown, it can be broadly divided into following three kinds: 1) receptive field head (RFH), consider that multiple size and shape of receptive field are leaned on protrusion The importance of nearly central area, and improve the insensitivity mobile to small space.2) head (FCH) is connected entirely, parameter redundancy, and Ignore spatial information.3) residual error bottleneck head (RBH) enhances candidate information using residual error bottleneck module, and skip floor connection fusion is not The feature of ad eundem and gradient disperse is avoided, but performance can be limited by single receptive field.

Summary of the invention

In order to overcome the deficiencies of the above existing technologies, purpose of the present invention is to provide it is a kind of can automatic customization medicine Lesion detects framework and method, to realize that one kind can share relevant information with a variety of lesion types, and it is real in a seamless fashion Now unified polymorphic type lesion detects network.

In order to achieve the above object, the present invention propose it is a kind of can automatic customization medicine lesion detection framework, comprising:

Candidate feature extraction module, for carrying out feature extraction to the medical image of input, the candidate for extracting image is special Sign；

Lesion detects network header automatic customization module, for according to medical image characteristic, focus characteristic and target detection Relevant knowledge, define new search space, described search space includes that largely there is flexible receptive field, skip floor the subnets such as to connect The advanced operation of network framework, and increase a non local operation, the perception relationship between candidate region is merged, according to time Select feature and using can micro- NAS algorithm search for suitable operation and connection type in the search space of design and makes its composition It is suitble to the best lesion of medical image to detect network header；

Lesion detects the optimal module of network header, detects the best of network header automatic customization Custom modules for the lesion Lesion detect network header, by the candidate feature extraction module export candidate feature, first pass around a convolution kernel be 3 × Then 3 convolutional layer obtains new candidate feature by a standard cell and two contractive cells, and passes through two articulamentums Binary classification is carried out to new candidate feature and prediction block returns, the weight M that candidate feature in binary classification is classified is as height Layer semantics information is exported to knowledge migration module；

Knowledge migration module, on the basis of the components graph that best lesion detection network header learns, in conjunction with language Adopted relationship, and relevant contextual information is transmitted in different regions, to obtain the candidate feature of an enhancing, and will enhancing Candidate feature and original candidate feature afterwards merges the relevant information for sharing a variety of lesion types, finally by complete Articulamentum carries out multivariate classification and recurrence.

Preferably, described search space includes following 9 kinds of operations: 1) connectionless；2) skip floor connects；3) 3 × 3 average pond Change；4) non local；5) 1 × 3 and 3 × 1 convolution；6) 3 × 3 depth separates convolution；7) 5 × 5 depth separates convolution； 8) 3 × 3 empty convolution that expansion rate is 3；9) 3 × 3 empty convolution that expansion rate is 5.

Preferably, it is described can be in micro- NAS algorithm, it is necessary first to it is then fixed according to the suitable search space of task design The module of justice search, including standard cell module and contractive cell module, wherein standard cell module step-length is 1, to keep defeated The resolution ratio same with input out, while port number is constant, contractive cell module step-length is 2, resolution ratio is reduced half, simultaneously Port number is double, and each module, that is, cell regards a directed acyclic graph as, defines its branch's number, one spy of each branching representation Sign is schemed, and the connection type between branch indicates operation；There are two input and an output from branch before, In for each branch After completing above-mentioned search definition, Initialize installation is carried out, while its discrete topology serialization is made by softmax function, it Gradient passback is carried out using gradient descent algorithm afterwards and updates its weight, finally after the search by certain time, first 9 Kind, which operates, retains the maximum operation of weight in connection, i.e., intensively connected by one and become partially connected, then select weight Maximum two are connected to the input of the branch, and their result are incorporated as exporting.

Preferably, 9 kinds of candidate operations set are defined as follows:

1) it connectionless operation: is not connected between branch；

2) skip floor attended operation: branch is directly connected to, and does not pass through any operation；

3) 3 × 3 average pondization operation: the average pond that Chi Huahe size is 3 × 3；

4) 1 × 3 and 3 × 1 convolution operation: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 1 × 3, one layer batch Normalize layer, one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 3 × 1, one layer batch of normalization layer；

5) the separable convolution operation of 3 × 3 depth: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 3 × 3, one The convolutional layer that layer convolution kernel is 1 × 1, one layer batch of normalization layer, one layer of ReLU active coating, the convolution that one layer of convolution kernel is 3 × 3 Layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

6) the separable convolution operation of 5 × 5 depth: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 5 × 5, one The convolutional layer that layer convolution kernel is 1 × 1, one layer batch of normalization layer, one layer of ReLU active coating, the convolution that one layer of convolution kernel is 5 × 5 Layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

7) 3 × 3 empty convolution operations that expansion rate is 3: one layer of ReLU active coating, one layer of convolution kernel are 3 × 3 and expansion rate For 3 convolutional layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

8) 3 × 3 empty convolution operations that expansion rate is 5: one layer of ReLU active coating, one layer of convolution kernel are 3 × 3 and expansion rate For 5 convolutional layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

9) non local operation: the purpose is to compile to the semantic information between region candidate relevant to target detection Code.

Preferably, in the non local operation, by interregional relationship be expressed as a region to region non-directed graph G:G=<N, E>, the corresponding region candidate of each node in N, each edge e_{I, j}∈ E encodes the pass between two nodes System, the input that the Non-local is operated areThe adjacency matrix of non-directed graph G can be by matrix multiplication E= softmax(φ(X)φ(X)^T) can be calculated, wherein φ () is the nonlinear transformation for having ReLU activation primitive；Later Each node in E is propagated with picture scroll lamination Y=σ (Ef (X) W), wherein f (), W are a nonlinear transformations, and σ is one Activation primitive, finally connecting full articulamentum is consistent its input and output size.

Preferably, the search strategy of the lesion detection network header automatic customization module is to be calculated using stochastic gradient descent Method goes to optimize by the parameter expression of discrete topology serialization, goes one group of continuous framework weight of studyThe output tensor of branchFor the weighted blend of candidate operations, it is expressed as Wherein weightFor Configuration parameters.

Preferably, training data is divided into two Uncrossed subsets, i.e. training set and verifying collection, optimization process is by following Two step iteration carry out: 1) passing throughUpdate network weight w；2) pass throughMore new architecture power Weight a.L_train(w, a) and L_val(w a) is respectively training set and the loss for verifying collection, wherein L_train(w, a) and L_val(w a) divides Not Wei training set and verifying collection loss.

Preferably, the knowledge migration module realizes that process is as follows:

A. the weight and deviation of original binary classification are collectedAs high-level semantic information, P is AutoRCNN Head exports the dimension of tensor, and uses soft mappingDemapping M, wherein s_ijThe region i obtained from original two classification layer is classified as the score of j.

B. the feature f that figure reasoning is enhanced is carried out by matrix multiplication_o, i.e. f_o=E Γ^sMW_o, wherein It is weight transfer matrix, o is the output dimension of knowledge migration module, and by Enhanced feature f_oIt is combined with primitive character f Improve position and the classification performance of the detection of polymorphic type lesion.

In order to achieve the above objectives, the present invention also provides it is a kind of can automatic customization medicine lesion detection method, including it is as follows Step:

Step S1 carries out feature extraction using medical image of the candidate feature extraction module to input, extracts image Candidate feature；

Step S2 is exported candidate feature extraction module using the lesion detection optimal module of network header of automatic customization Candidate feature first passes around the convolutional layer that a convolution kernel is 3 × 3, then passes through a standard cell and two contractive cells New candidate feature is obtained, and binary classification and prediction block recurrence are carried out to new candidate feature by two articulamentums, by two The weight M of candidate feature classification is exported as high-level semantic information to knowledge migration module in member classification；

Step S3 utilizes the basis for the components graph that knowledge migration module learns in best lesion detection network header On, in conjunction with semantic relation, and relevant contextual information is transmitted in different regions, it is special with the candidate for obtaining an enhancing Then enhanced candidate feature is merged the related letter for sharing a variety of lesion types by sign to original candidate feature Breath carries out multivariate classification and recurrence finally by full articulamentum.

Preferably, the method also includes:

It is empty to define new search according to the relevant knowledge of medical image characteristic, focus characteristic and target detection by step S0 Between, described search space includes the advanced operation largely with sub-networks frameworks such as flexible receptive field, skip floor connections, and increases by one A non local operation merges the perception relationship between candidate region, according to candidate feature and using can micro- NAS algorithm exist Suitable operation and connection type are searched in the search space of design makes it form the inspection of the best lesion of a suitable medical image Survey network header.

Compared with prior art, one kind of the present invention can automatic customization medicine lesion detection framework and method by being automatically Medicine lesion Detection task has customized the detection network architecture head of a suitable medical image, and by knowledge migration module with Seamless mode realizes a unified polymorphic type lesion detection network, not only increases the precision of lesion detection, also increases The function of polymorphic type lesion detection.The medicine lesion detection network architecture head and knowledge migration module that the present invention realizes can Based on the detection network skeleton that any natural image is general, and performance has different degrees of promotion, all realizes detection net The optimum performance of network skeleton.

Detailed description of the invention

Fig. 1 is the RCNN Head architecture diagram of the prior art；

Fig. 2 be one kind of the present invention can automatic customization medicine lesion detection framework configuration diagram；

Fig. 3 is the standard cell architecture diagram in the specific embodiment of the invention in AutoRCNN Head；

Fig. 4 is the contractive cell architecture diagram in the specific embodiment of the invention in AutoRCNN Head；

Fig. 5 be one kind of the present invention can automatic customization medicine lesion detection method step flow chart.

Specific embodiment

Below by way of specific specific example and embodiments of the present invention are described with reference to the drawings, those skilled in the art can Understand further advantage and effect of the invention easily by content disclosed in the present specification.The present invention can also pass through other differences Specific example implemented or applied, details in this specification can also be based on different perspectives and applications, without departing substantially from Various modifications and change are carried out under spirit of the invention.

Fig. 2 be one kind of the present invention can automatic customization medicine lesion detection framework configuration diagram.As shown in Fig. 2, this Invent it is a kind of can automatic customization medicine lesion detection framework, comprising:

Candidate feature extraction module 201 extracts the candidate of image for carrying out feature extraction to the medical image of input Feature.In the specific embodiment of the invention, candidate feature extraction module 201 using it is ready-made include feature extractor and RPN General detection network extracts the candidate feature of image, and since medical image is single channel grayscale image, candidate feature is extracted Module 201 can using the key frame of medical image and its before and after frames composition triple channel be used as input picture I, by feature extractor with Feature pyramid extracts the semantic feature of image different levels, extracts the candidate feature of image by RPN network later, passes through After RoI alignment, network header (AutoRCNN Head) optimal module 203 is detected into lesion, due to candidate feature of the present invention The image characteristics extraction of extraction module 201 is using the prior art, and it will not be described here

Lesion detect network header (AutoRCNN Head) automatic customization module 202, for according to medical image characteristic, The relevant knowledge of focus characteristic and target detection, defines new search space, and described search includes a large amount of with flexibly impression The advanced operation of the sub-networks frameworks such as wild, skip floor connection, and increase by one non local (Non-local) operation, it goes candidate regions Perception relationship between domain merges, according to candidate feature using can micro- NAS algorithm conjunction is searched in the search space of design Suitable operation and connection type make its composition one detection network head, i.e., using can micro- NAS algorithm go to search out one automatically The best lesion of a suitable medical image detects network header AutoRCNN Head.

In object detection task, the identification of character representation is can be enhanced in the relationship between the size and centrality of receptive field Property and robustness, therefore, the present invention be can micro- NAS algorithm devise a new search space, including following 9 kinds of operations: 1) It is connectionless；2) skip floor connects；3) 3 × 3 average pond；4) non local (Non-local)；5) 1 × 3 and 3 × 1 convolution；6)3 × 3 depth separates convolution；7) 5 × 5 depth separates convolution；8) 3 × 3 empty convolution that expansion rate is 3；9) expansion rate For 53 × 3 empty convolution.

Can be in micro- NAS algorithm, it is necessary first to according to the suitable search space of task design, 9 kinds of such as above-mentioned operations, so Need to define the module of search, i.e. standard cell module and contractive cell module afterwards, wherein standard cell module step-length is 1, with It keeps output and inputs same resolution ratio, while port number is constant, contractive cell module step-length is 2, and resolution ratio is reduced by one Half, at the same port number is double.Each module, that is, cell can regard a directed acyclic graph as, need to define its branch's number, often One characteristic pattern of a branching representation, the connection type between branch indicate operation, such as skip floor connection, 3 × 3 average pond；It is each There are two inputs and an output from branch before to carry out initialization after completing above-mentioned search definition and set for branch It sets, there is the input from previous all branches in each branch, while having all candidate OP to connect between the branch of every two connection It connects, such as candidate OP has nine kinds, then 9 kinds of connection relationships is initialized as between the branch for having connection, and their weight is arranged and is 1, while its discrete topology serialization is made by softmax function, gradient passback, which is carried out, using gradient descent algorithm later updates Its weight.Finally after the search by certain time, retain the maximum behaviour of weight in 9 kinds of operation connections first Make, i.e., partially connected is become by an intensive connection, then select weight maximum two inputs for being connected to the branch, together When their result is incorporated as exporting.

Assuming that each module, that is, cell is the directed acyclic graph comprising B branch, there are two is come from for each branch The input of branch before and an output；Branching representation characteristic tensor, line indicate the operation between characteristic tensor.5 yuan can be used Group (X₁,X₂,OP₁,OP₂,) indicate branch b, wherein X in cell c₁, To input tensor；OP₁,OP₂∈ OP points It Wei not X₁,X₂Respective operations, OP are 9 kinds of candidate operations set proposed by the present invention；Cell c last output Y_c=merge

Specifically, the present invention be can micro- NAS algorithm devise a new search space, keep it big using receptive field The identity and robustness that the small relationship between centrality goes Enhanced feature to indicate.9 kinds of candidate operations set, that is, OP are specific It is defined as follows:

1) it connectionless operation: is not connected between branch；

8) 3 × 3 empty convolution operations that expansion rate is 5: one layer of ReLU active coating, one layer of convolution kernel are 3 × 3 and expansion rate For 5 convolutional layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer.

9) non local (Non-local) operation: the purpose is to the semanteme between region candidate relevant to target detection Information is encoded.Specifically, by interregional relationship be expressed as a region to region non-directed graph G:G=<N, E>, in N In the corresponding region candidate of each node, each edge e_{I, j}∈ E encodes the relationship between two nodes.Of the invention specific In embodiment, the input of Non-local operation isThe adjacency matrix of non-directed graph G can be by matrix multiplication E= softmax(φ(X)φ(X)^T) can be calculated, wherein φ () is the nonlinear transformation for having ReLU activation primitive；Later Each node in E is propagated with picture scroll lamination Y=σ (Ef (X) W), wherein f (), W are a nonlinear transformations, and σ is one Activation primitive, finally connecting full articulamentum is consistent its input and output size.

The search strategy that lesion detects network header (AutoRCNN Head) automatic customization module 202 is to use stochastic gradient Descent algorithm goes to optimize by the parameter expression of discrete topology serialization, goes one group of continuous framework weight of studyBranch it is defeated Tensor outIt is the weighted blend of candidate operations, is represented by Wherein weigh WeightIt is configuration parameters.In lesion detection network header AutoRCNN Head of the invention, contractive cell is in 1/3 and arrives The position of 2/3 depth, other positions are standard cell.Their corresponding configuration parameters are respectively (a^{It shrinks}, a^Standard), wherein All standard cells in AutoRCNN Head share weight a^Standard, all contractive cells share weight a^{It shrinks}。

After by discrete topology serialization, the task of framework search is reduced to one group of continuous variable of studyIt can be with having The stochastic gradient descent algorithm of effect optimizes.Loss function include lesion classification intersect entropy function and lesion detection it is absolute Loss function updates network weight by iteration and framework weight carries out.The present invention by training data be divided into two it is Uncrossed Subset, i.e. training set and verifying collection, their corresponding losses are L_trainAnd L_val.Optimization process is by following two steps iteration It carries out:

1) pass throughUpdate network weight w；

2) pass throughMore new architecture weight a；

After search, pass throughRetain the operation OP of a maximum weight_ijIt goes to obtain A discrete framework is obtained, i.e., searches out the standard cell and contractive cell of a suitable medical image automatically, will finally be shunk Cell is placed on 1/3 to 2/3 depth of entire head, remaining position is stacked up for standard cell, and group becomes medical image The best lesion of customization detects network header AutoRCNN Head.The optimization criteria cell and contractive cell architecture diagram searched It shows in figs. 3 and 4.In cell architecture diagram, c_ { k } indicates to work as precellular output, and c_ { k-1 } and c_ { k-2 } are respectively Indicate the output of the first two cell.Wherein 0,/1/,2/3 the first/bis-/tri-/tetra- branches in cell are respectively indicated, between branch Connection be search after, candidate operations concentrate the operation of remain one maximum weight.I.e. each cell comes from The output of the first two cell has an output as input.In initialization, there are 9 kinds of operation connections between each branch, After the search for carrying out gradient updating, every connection path first only retains the operation of a maximum weight, finally in these behaviour Reselection retains input of the operation of two maximum weights from different inputs as the branch in work.Finally, all branches Output of the splicing of output as entire cell.Such as in the standard cell architecture diagram of attached drawing 3, the input of branch 0 is respectively 3 × 3 empty convolution operations that the expansion rate of skip floor attended operation and c_ { k-2 } from c_ { k-1 } is 5；The input of branch 1 point It is not the non local operation of skip floor attended operation and c_ { k-2 } from c_ { k-1 }；The input of branch 2 is from c_ { k- respectively 1 } 3 × 3 depth separates the expansion rate of convolution operation and branch 1 as 33 × 3 empty convolution operations；The input of branch 3 It is 3 × 3 empty convolution operations that the expansion rate of non local operation and branch 1 from branch 0 is 3 respectively；Finally the standard is thin The output of born of the same parents is that the output of branch 0/1/2/3 is spliced.In the contractive cell architecture diagram of attached drawing 4, the input of branch 0 comes respectively 5 × 5 depth of 1 × 3 and 3 × 1 convolution operation and c_ { k-2 } from c_ { k-1 } separates convolution operation；Branch 1 it is defeated Enter be respectively from c_ { k-1 } 1 × 3 and 3 × 1 convolution operation and branch 05 × 5 depth separate convolution operation；Point The input of branch 2 is 1 × 3 and 3 × 1 convolution operation of 3 × 3 average pondization operation and branch 1 from branch 0 respectively；Point The input of branch 3 is 1 × 3 and 3 × 1 convolution behaviour that 5 × 5 depth from branch 1 separates convolution operation and branch 2 respectively Make；The finally output of the standard cell is that the output of branch 0/1/2/3 is spliced.

Lesion detects network header (AutoRCNN Head) optimal module 203, detects network header for lesion The best lesion of (AutoRCNN Head) automatic customization module 202 customization detects network header AutoRCNN Head, will wait The candidate feature for selecting characteristic extracting module 201 to export first passes around the convolutional layer that a convolution kernel is 3 × 3, then passes through one Standard cell and two contractive cells obtain new candidate feature, and carry out binary to new candidate feature by two articulamentums Classification and prediction block return, and the weight M that candidate feature in binary classification is classified is exported as high-level semantic information to knowledge Transferring module 204, to map the candidate feature for moving to multivariate classification module and being enhanced by knowledge migration module 204, and will Enhanced feature and candidate feature are combined, and carry out polymorphic type lesion to the candidate feature after merging by most latter two articulamentum Detection, i.e. multivariate classification and prediction block return.In the specific embodiment of the invention, the standard cell of AutoRCNN Head module Step-length is 1, and to keep output and input same resolution ratio, while port number is constant；Contractive cell step-length is 2, by resolution ratio Half is reduced, while port number is double.

Knowledge migration module 204, in the components graph that best lesion detection network header AutoRCNN Head learns On the basis of, in conjunction with semantic relation, and relevant contextual information is transmitted in different regions, to obtain the time of an enhancing Feature is selected, enhanced candidate feature and original candidate feature are merged to the phase for sharing a variety of lesion types later Information is closed, carries out multivariate classification and recurrence finally by full articulamentum, so that the present invention be made to realize one in a seamless fashion Unified polymorphic type lesion detects network.

In the present invention, the not instead of not simply one polymorphic type classifier of fine tuning of knowledge migration module 204, further grinds Study carefully the ability of best lesion detection network header AutoRCNN Head.Since in multiple search experiment, Non-local can It appears in the network architecture finally selected, therefore by E=softmax (φ (X) φ (X)^T) graph structure acquired, by pushing away Binary classification is switched to polymorphic type classification by reason.

In the specific embodiment of the invention, knowledge migration module 204 has used figure reasoning algorithm to realize, specifically, Realization process is as follows:

A. the weight and deviation of original binary classification are collectedAs high-level semantic information, P is The dimension of AutoRCNN Head output tensor.Since figure G is extracted in operating from Non-local in AutoRCNN Head A region to the figure in region, then need to find the input node f from the high-level semantic information of classification to knowledge migration module_i The most suitable mapping of ∈ f.For the error for avoiding original binary classification from generating, the present invention uses soft mappingDemapping M, wherein s_ijThe region i obtained from original two classification layer is classified as point of j Number.

B. the feature f that figure reasoning is enhanced is carried out by matrix multiplication_o, i.e. f_o=E Γ^sMW_o, wherein It is weight transfer matrix, o is the output dimension of knowledge migration module.By Enhanced feature f_oIt is combined and mentions with primitive character f The position of high polymorphic type lesion detection and classification performance.

Fig. 5 be one kind of the present invention can automatic customization medicine lesion detection method step flow chart.As shown in figure 5, this Invent it is a kind of can automatic customization medicine lesion detection method, include the following steps:

Step S1 carries out feature extraction using medical image of the candidate feature extraction module to input, extracts image Candidate feature.In the specific embodiment of the invention, candidate feature extraction module using it is ready-made include feature extractor and RPN General detection network extracts the candidate feature of image, and since medical image is single channel grayscale image, candidate feature is extracted Module can be using the key frame of medical image and its before and after frames composition triple channel as input picture I, by feature extractor and spy Sign pyramid extracts the semantic feature of image different levels, extracts the candidate feature of image by RPN network later, passes through After RoI alignment, network header (AutoRCNN Head) module is detected into lesion, since candidate feature of the present invention extracts mould The image characteristics extraction of block is using the prior art, and it will not be described here.

Step S2, will be candidate special using lesion detection network header (AutoRCNN Head) the optimal module of automatic customization The candidate feature for levying extraction module output first passes around the convolutional layer that a convolution kernel is 3 × 3, then thin by a standard Born of the same parents and two contractive cells obtain new candidate feature, and by two articulamentums to new candidate feature carry out binary classification and Prediction block returns, and the weight M that candidate feature in binary classification is classified is exported as high-level semantic information to knowledge migration mould Block.

Step S3, using knowledge migration module in the area that best lesion detection network header AutoRCNN Head learns On the basis of the relational graph of domain, in conjunction with semantic relation, and relevant contextual information is transmitted in different regions, to obtain one Enhanced candidate feature and original candidate feature are merged share a variety of lesions later by the candidate feature of enhancing The relevant information of type carries out multivariate classification and recurrence finally by full articulamentum, to realize one in a seamless fashion Unified polymorphic type lesion detects network.

Preferably, one kind of the present invention can automatic customization medicine lesion detection method, further includes:

It is empty to define new search according to the relevant knowledge of medical image characteristic, focus characteristic and target detection by step S0 Between, described search space includes the advanced operation largely with sub-networks frameworks such as flexible receptive field, skip floor connections, and increases by one A non local (Non-local) operation, goes to merge the perception relationship between candidate region, and being utilized according to candidate feature can Micro- NAS algorithm searches for suitable operation and connection type in the search space of design makes the head of one detection network of its composition Portion, i.e., using can micro- NAS algorithm go to search out the best lesion of a suitable medical image automatically and detect network header AutoRCNN Head。

In object detection task, the identification of character representation is can be enhanced in the relationship between the size and centrality of receptive field Property and robustness, therefore, the present invention be can micro- NAS algorithm devise a new search space, including following 9 kinds of operations: 1) It is connectionless；2) skip floor connects；3) 3 × 3 average pond；4) non local (Non-local)；5) 1 × 3 and 3 × 1 convolution；6)3 × 3 depth separates convolution；7) 5 × 5 depth separates convolution；8) 3 × 3 empty convolution that expansion rate is 3；9) expansion rate For 53 × 3 empty convolution.9 kinds of candidate operations set, that is, OP are defined as follows:

1) it connectionless operation: is not connected between branch；

In step S0, the search strategy used is to go optimization by discrete topology serialization with stochastic gradient descent algorithm Parameter expression, go study one group of continuous framework weightThe output tensor of branchIt is the weighted blend of candidate operations, It is represented byWherein weightIt is configuration parameters.In disease of the invention Stove detects in network header AutoRCNN Head, and contractive cell is in the position of 1/3 to 2/3 depth, and other positions are that standard is thin Born of the same parents.Their corresponding configuration parameters are respectively (a^{It shrinks}, a^Standard), wherein all standard cells in AutoRCNN Head are shared Weight a^Standard, all contractive cells share weight a^{It shrinks}。

1) pass throughUpdate network weight w；

2) pass throughMore new architecture weight a；

After search, pass throughRetain the operation OP of a maximum weight_ijIt goes to obtain A discrete framework is obtained, i.e., searches out the standard cell and contractive cell of a suitable medical image automatically, will finally be shunk Cell is placed on 1/3 to 2/3 depth of entire head, remaining position is stacked up for standard cell, and group becomes medical image The best lesion of customization detects network header AutoRCNN Head.The optimization criteria cell and contractive cell architecture diagram searched It shows in figs. 3 and 4.In cell architecture diagram, c_ { k } indicates to work as precellular output, and c_ { k-1 } and c_ { k-2 } are respectively Indicate the output of the first two cell.Wherein 0,/1/,2/3 the first/bis-/tri-/tetra- branches in cell are respectively indicated, between branch Connection be search after, candidate operations concentrate the operation of remain one maximum weight.I.e. each cell comes from The output of the first two cell has an output as input.In initialization, there are 9 kinds of operation connections between each branch, After the search for carrying out gradient updating, every connection path first only retains the operation of a maximum weight, finally in these behaviour Reselection retains input of the operation of two maximum weights from different inputs as the branch in work.

Preferably, in step S3, the knowledge migration module has used figure reasoning algorithm to realize, specifically, in fact Existing process is as follows:

In conclusion one kind of the present invention can automatic customization medicine lesion detection framework and method by automatically for medicine disease Stove Detection task has customized the detection network architecture head of a suitable medical image, and by knowledge migration module with seamless Mode realizes a unified polymorphic type lesion detection network, not only increases the precision of lesion detection, also adds multiclass The function of type lesion detection.The medicine lesion detection network architecture head and knowledge migration module that the present invention realizes can be based on appointing The general detection network skeleton of meaning natural image, and performance has different degrees of promotion, all realizes detection network skeleton Optimum performance.

The above-described embodiments merely illustrate the principles and effects of the present invention, and is not intended to limit the present invention.Any Without departing from the spirit and scope of the present invention, modifications and changes are made to the above embodiments by field technical staff.Therefore, The scope of the present invention, should be as listed in the claims.

Claims

1. one kind can automatic customization medicine lesion detect framework, comprising:

Candidate feature extraction module extracts the candidate feature of image for carrying out feature extraction to the medical image of input；

Lesion detects network header automatic customization module, for the phase according to medical image characteristic, focus characteristic and target detection Knowledge is closed, new search space is defined, described search space includes largely having the sub-networks framves such as flexible receptive field, skip floor connection The advanced operation of structure, and increase a non local operation, the perception relationship between candidate region is merged, according to candidate special Levy and utilize can micro- NAS algorithm search for suitable operation and connection type in the search space of design and be suitble to its composition one The best lesion of medical image detects network header；

Lesion detects the optimal module of network header, and the best lesion of network header automatic customization Custom modules is detected for the lesion Network header is detected, the candidate feature that the candidate feature extraction module is exported, first passing around a convolution kernel is 3 × 3 Then convolutional layer obtains new candidate feature by a standard cell and two contractive cells, and passes through two articulamentums pair New candidate feature carries out binary classification and prediction block returns, and the weight M that candidate feature in binary classification is classified is as high level Secondary semantic information is exported to knowledge migration module；

Knowledge migration module is closed on the basis of the components graph that best lesion detection network header learns in conjunction with semanteme System, and relevant contextual information is transmitted in different regions, to obtain the candidate feature of an enhancing, and will be enhanced Candidate feature and original candidate feature merge the relevant information for sharing a variety of lesion types, finally by full connection Layer carries out multivariate classification and recurrence.

2. one kind as described in claim 1 can the medicine lesion of automatic customization detect framework, it is characterised in that: described search is empty Between include following 9 kinds operation: 1) it is connectionless；2) skip floor connects；3) 3 × 3 average pond；4) non local；5) 1 × 3 and 3 × 1 Convolution；6) 3 × 3 depth separates convolution；7) 5 × 5 depth separates convolution；8) 3 × 3 cavity volumes that expansion rate is 3 Product；9) 3 × 3 empty convolution that expansion rate is 5.

3. one kind as claimed in claim 2 can automatic customization medicine lesion detect framework, it is characterised in that: it is described can be micro- In NAS algorithm, it is necessary first to according to the suitable search space of task design, then define the module of search, including standard cell Module and contractive cell module, wherein standard cell module step-length is 1, to keep output and input same resolution ratio, simultaneously Port number is constant, and contractive cell module step-length is 2, resolution ratio is reduced half, while port number is double, each module, that is, thin Born of the same parents regard a directed acyclic graph as, define its branch's number, one characteristic pattern of each branching representation, and the connection type between branch indicates Operation；There are two inputs and an output from branch before to carry out after completing above-mentioned search definition for each branch Initialize installation, while its discrete topology serialization is made by softmax function, gradient is carried out using gradient descent algorithm later Passback updates its weight, and finally after the search by certain time, it is maximum to retain weight in 9 kinds of operation connections first One operation becomes partially connected by an intensive connection, then selects weight maximum two and be connected to the branch Input, and their result is incorporated as exporting.

4. one kind as claimed in claim 3 can automatic customization medicine lesion detect framework, it is characterised in that: it is described 9 kinds time Selection operation set is defined as follows:

1) it connectionless operation: is not connected between branch；

4) 1 × 3 and 3 × 1 convolution operation: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 1 × 3, one layer batch of normalizing Change layer, one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 3 × 1, one layer batch of normalization layer；

5) 3 × 3 depth separates convolution operation: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 3 × 3, one layer of volume The convolutional layer that product core is 1 × 1, one layer batch of normalization layer, one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 3 × 3, one The convolutional layer that layer convolution kernel is 1 × 1, one layer batch of normalization layer；

6) 5 × 5 depth separates convolution operation: one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 5 × 5, one layer of volume The convolutional layer that product core is 1 × 1, one layer batch of normalization layer, one layer of ReLU active coating, the convolutional layer that one layer of convolution kernel is 5 × 5, one The convolutional layer that layer convolution kernel is 1 × 1, one layer batch of normalization layer；

7) 3 × 3 empty convolution operations that expansion rate is 3: one layer of ReLU active coating, one layer of convolution kernel is 3 × 3 and expansion rate is 3 Convolutional layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

8) 3 × 3 empty convolution operations that expansion rate is 5: one layer of ReLU active coating, one layer of convolution kernel is 3 × 3 and expansion rate is 5 Convolutional layer, the convolutional layer that one layer of convolution kernel is 1 × 1, one layer batch of normalization layer；

9) non local operation: the purpose is to encode to the semantic information between region candidate relevant to target detection.

5. one kind as claimed in claim 4 can automatic customization medicine lesion detect framework, it is characterised in that: in the non-office In portion's operation, by interregional relationship be expressed as a region to region non-directed graph G:G=< N, E >, each section in N The corresponding region candidate of point, each edge e_{I, j}∈ E encodes the relationship between two nodes, the input of the Non-local operation ForThe adjacency matrix of non-directed graph G can be by matrix multiplication E=softmax (φ (X) φ (X)^T) can be calculated, wherein φ () is the nonlinear transformation for having ReLU activation primitive；It is propagated in E with picture scroll lamination Y=σ (Ef (X) W) later Each node, wherein f (), W are a nonlinear transformations, and σ is an activation primitive, and finally connecting full articulamentum makes its input Output size is consistent.

6. one kind as claimed in claim 5 can the medicine lesion of automatic customization detect framework, it is characterised in that: lesion inspection The search strategy for surveying network header automatic customization module is to go optimization by discrete topology serialization using stochastic gradient descent algorithm Parameter expression, go study one group of continuous framework weightThe output tensor of branchFor the weighted blend of candidate operations, It is expressed asWherein weightFor configuration parameters.

7. one kind as claimed in claim 6 can automatic customization medicine lesion detect framework, which is characterized in that by training data It is divided into two Uncrossed subsets, i.e. training set and verifying collection, optimization process is carried out by following two steps iteration: 1) being passed throughUpdate network weight w；2) pass throughMore new architecture weight a.L_train(w, a) and L_val(w, It a) is respectively training set and the loss for verifying collection, wherein L_train(w, a) and L_val(w a) is respectively training set and the damage for verifying collection It loses.

8. one kind as claimed in claim 7 can automatic customization medicine lesion detect framework, which is characterized in that the knowledge is moved Shifting formwork block realizes that process is as follows:

A. the weight and deviation of original binary classification are collectedAs high-level semantic information, P AutoRCNN Head exports the dimension of tensor, and uses soft mappingDemapping M, wherein s_ijFrom original two The region i that classification layer obtains is classified as the score of j.

B. the feature f that figure reasoning is enhanced is carried out by matrix multiplication_o, i.e. f_o=E Γ^sMW_o, wherein It is power Weight transformation matrix, o are the output dimension of knowledge migration module, and by Enhanced feature f_oIt is combined and improves with primitive character f The position of polymorphic type lesion detection and classification performance.

9. one kind can automatic customization medicine lesion detection method, include the following steps:

Step S1 carries out feature extraction using medical image of the candidate feature extraction module to input, extracts the candidate of image Feature；

Step S2, the candidate for being exported candidate feature extraction module using the lesion detection optimal module of network header of automatic customization Feature first passes around the convolutional layer that a convolution kernel is 3 × 3, then obtains by a standard cell and two contractive cells New candidate feature, and binary classification and prediction block recurrence are carried out to new candidate feature by two articulamentums, by binary point The weight M that candidate feature is classified in class is exported as high-level semantic information to knowledge migration module；

Step S3, using knowledge migration module on the basis of the components graph that best lesion detection network header learns, In conjunction with semantic relation, and relevant contextual information is transmitted in different regions, to obtain the candidate feature of an enhancing, so Enhanced candidate feature and original candidate feature are merged to the relevant information for sharing a variety of lesion types afterwards, most Multivariate classification and recurrence are carried out by full articulamentum afterwards.

10. one kind as claimed in claim 9 can automatic customization medicine lesion detection method, which is characterized in that the method Further include:

Step S0 defines new search space, institute according to the relevant knowledge of medical image characteristic, focus characteristic and target detection Stating search space includes the advanced operation largely with sub-networks frameworks such as flexible receptive field, skip floor connections, and increase by one is non- Partial operation merges the perception relationship between candidate region, according to candidate feature and using can micro- NAS algorithm designing Search space in the suitable operation of search and connection type so that it is formed the best lesion of suitable medical image detection net Headstall portion.