CN106407992A - Breast ultrasound image self-learning extraction method and system based on stacked noise reduction self-encoder - Google Patents

Breast ultrasound image self-learning extraction method and system based on stacked noise reduction self-encoder Download PDF

Info

Publication number
CN106407992A
CN106407992A CN201610834295.1A CN201610834295A CN106407992A CN 106407992 A CN106407992 A CN 106407992A CN 201610834295 A CN201610834295 A CN 201610834295A CN 106407992 A CN106407992 A CN 106407992A
Authority
CN
China
Prior art keywords
layer
noise reduction
batch
sample
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610834295.1A
Other languages
Chinese (zh)
Other versions
CN106407992B (en
Inventor
陈壮威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fujian Maternal And Child Care Service Centre
Original Assignee
Fujian Maternal And Child Care Service Centre
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fujian Maternal And Child Care Service Centre filed Critical Fujian Maternal And Child Care Service Centre
Priority to CN201610834295.1A priority Critical patent/CN106407992B/en
Publication of CN106407992A publication Critical patent/CN106407992A/en
Application granted granted Critical
Publication of CN106407992B publication Critical patent/CN106407992B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462Salient features, e.g. scale invariant feature transforms [SIFT]
    • G06F19/34
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Abstract

The invention discloses a breast ultrasound image self-learning extraction method and system based on a stacked noise reduction self-encoder. The method comprises the steps of extracting manual shallow layer features from each ultrasound breast lesion area image ROI as a training sample to form a training sample set set_unlabeled = {x(1), x(2), ..., x(n)}, the i-th sample x(i) belonging to [0, 1]<d>, i = 1, 2, ..., n; based on the training sample set, training a first noise reduction self-encoder DAE1; after training the first noise reduction self-encoder, re-entering the training sample set, using the self-encoder trained in the step S4 to extract feature expressions obtained through hidden layer learning of all the samples to form a new sample {y(1), y(2), ..., y(n)}, and using the new sample as an input of a second noise reduction self-encoder DAE2 to train the second noise reduction self-encoder. The invention achieves extraction of breast ultrasound image features, thereby provides valuable reference opinions for clinic diagnosis, and improves the accuracy and efficiency of breast cancer diagnosis.

Description

A kind of breast ultrasound characteristics of image self study extraction based on stacking noise reduction self-encoding encoder Method and system
Technical field
The present invention relates to Feature Engineering technical field, more particularly, to a kind of breast ultrasound based on stacking noise reduction self-encoding encoder Characteristics of image self study extracting method and system.
Background technology
Breast carcinoma is the most commonly seen a kind of malignant tumor of women all over the world, and about 400,000 people die from this disease every year.China It is one of fastest-rising country of breast cancer incidence, especially breast carcinoma has become as China women sickness rate ranking in recent years The malignant tumor of one.The therapeutic effect of breast carcinoma of early stage is good, can save the life of patient to a great extent, therefore improves breast The precision of the early diagnosiss of adenocarcinoma and accuracy become more and more meaningful.
At present, the image checks such as breast ultrasound, molybdenum target are mainly applied in breast carcinoma clinical diagnosises, and diagnosis person passes through lump, calcium The image features such as change, blood flow signal image is analyzed.Breast ultrasonography is widely used to the clinical position of China In, its have easy to operate, "dead", hurtless measure, to lump accurate positioning and economic and practical the advantages of.But ultrasonic inspection Look into and there are still many deficiencies, the image of such as breast carcinoma of early stage is not often true to type and is difficult to differentiate, especially because diagnosis person visual impression The difference known, visual fatigue, the use of different features and diagnostic criteria, lack the quantitative measurement of characteristics of image, result in not With diagnosis result difference so that the mistaken diagnosis and failing to pinpoint a disease in diagnosis of breast carcinoma of early stage still occurs often.
With the continuous development of Medical Imaging Technology and computer technology, carry out auxiliary diagnosis using computer and improve diagnosis Accuracy be possibly realized;Such as:Using digital image processing techniques, extract pathology in breast ultrasound image related spy Levy, according to these features, Classification and Identification etc. is carried out to Diagnosis of Breast tumors with machine learning methods such as SVM.
From the point of view of the application present situation of computer-aided diagnosises breast carcinoma, the accuracy of computer-aided diagnosises largely takes Certainly whether effective in extracting B ultrasonic image pathology correlated characteristic.The medical image features being presently used for computer-aided diagnosises carry Take and substantially position focus area-of-interest using manual, and some bases of method extraction being processed by primary image is normal Rule feature, such as:Grey level histogram feature, shape facility, gray level co-occurrence matrixes feature, wavelet character etc..But said method have with Under several aspects deficiency:The extraction time and effort consuming one by one of the firstth, above-mentioned basic general characteristics;Secondth, above-mentioned single basis is often Not field is related for rule feature itself, and the application-specific degree of association of breast carcinoma is little;3rd, design effectively can be used for calculating The basic general characteristics combination of machine auxiliary diagnosis breast carcinoma has serious uncertainty.There is above-mentioned circumscribed essential reason It is the low-level feature that feature after Feature Selection remains medical image, with medical image pathology semanteme high-level characteristic on the whole Between have no direct mapping relations, therefore, best settlement mechanism be provide one kind can according to conventional breast carcinoma B ultrasonic image from Dynamic learn with pathology about and can be used for auxiliary diagnosis characteristics of image method.
Content of the invention
For this reason, it may be necessary to provide a kind of breast ultrasound characteristics of image self study extraction side based on stacking noise reduction self-encoding encoder Case, solves how automatically to learn image spy that is relevant with pathology and can be used for auxiliary diagnosis according to conventional breast carcinoma B ultrasonic image The problem levied.
For achieving the above object, inventor provide a kind of breast ultrasound characteristics of image based on stacking noise reduction self-encoding encoder Self study extracting method, comprises the following steps:
Step S1:A given medium-scale above breast ultrasound focal area image set, described medium-scale expression This image set is at least containing breast ultrasound diagnostic images more than 200 width;
Step S2:The focal area image of each breast ultrasound diagnostic image in image set in manual extraction step S1 ROI;
Step S3:Extract manual shallow-layer feature to train as one from each breast ultrasound focal area image ROI Sample, composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈[0,1]d, i=1, 2,…,n;Wherein d represents the characteristic dimension of sample, and n represents training set number of samples;
Step S4:Based on training sample set, train first noise reduction self-encoding encoder DAE1;
Step S5:After having trained first noise reduction self-encoding encoder, re-enter training sample set, trained according to step S4 Encoder extract the hidden layer character representation that obtains of study of all samples, constitute new sample { y(1),y(2),…,y(n), will It trains second noise reduction self-encoding encoder DAE2 as the input of second noise reduction self-encoding encoder;
Step S6:Two noise reduction self-encoding encoders DAE1 completing to train and DAE2 stacking are obtained three layers of SDAE structure, Corresponding ground floor is input layer, and dimension is d;The second layer is the hidden layer in DAE1, and dimension is dh1;Third layer is corresponding in DAE2 Hidden layer, dimension is dh2;By this SDAE structure, the manual shallow-layer feature of given breast molybdenum target image, after feed-forward, obtain base Semantic feature in the higher level of abstraction of stacking noise reduction self-encoding encoder represents
Further, described step S3 is:Extract the GLCM of each ROI image, small echo, wavelet packet, MPEG-7 tetra- respectively Plant manual shallow-layer feature, the characteristic vector being cascaded as a d dimension is as a sample.
Further, in described step S4, first noise reduction self-encoding encoder is made up of three-layer network, corresponding input layer x, hidden Layer y, the neuron number of output layer z are respectively d, dh1, d, certain sample x that wherein input of input layer is concentrated for training sample(i)∈[0,1]d, i=1,2 ..., n;Input layer has been artificially induced noise;Parameter θ1={ W1,b1, θ2={ W2,b2, b1、b2 For being respectively the bias vector of hidden layer and output layer, size is respectively dh1With d dimension, W1、W2It is respectively input layer to the power of hidden layer It is worth connection matrix and hidden layer to the weights connection matrix of output layer, size is respectively d × dh1、dh1×d;Activation primitive all adopts Sigmoid function;
Step S4 comprises the following steps:
Step S41:Training sample set is split, concretely comprises the following steps:The training sample set of ultrasonoscopy is divided at random It is segmented into num batch, each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Step S42:Network parameter initializes;It is specifically configured to:
Learningrate=1;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Step 43:Setting maximum cycle NN;
Step 44:Recirculate outward t=1 to NN;
Inside recirculate s=1 to num;
Step 441:Corrosion data:By binary system masking noise mode, with certain probability by input feature value x certain A little values are randomly reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_ Size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Step 442:Feed-forward:
z(i)=sigmoid (W2 Ty(i)+b2), i=1,2 ..., batch_size;
Step 443:Reverse transfer:
Step 444:Undated parameter
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.
The present invention provides a kind of breast ultrasound characteristics of image self study extraction system based on stacking noise reduction self-encoding encoder, bag Include with lower module:
Image set gives module:For giving a medium-scale above breast ultrasound focal area image set, described Medium-scale this image set of expression is at least containing breast ultrasound diagnostic images more than 200 width;
Focal area extraction module:For each breast ultrasound diagnostic image in image set in manual extraction step S1 Focal area image ROI;
Sample training module:Make for extracting manual shallow-layer feature from each breast ultrasound focal area image ROI For a training sample, composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈ [0,1]d, i=1,2 ..., n;Wherein d represents the characteristic dimension of sample, and n represents training set number of samples;
First encoder training module:For based on training sample set, training first noise reduction self-encoding encoder DAE1;
Second encoder training module:For having trained after first noise reduction self-encoding encoder, re-enter training sample set, The character representation being obtained according to the hidden layer study that the encoder that step S4 trains extracts all samples, constitutes new sample { y(1),y(2),…,y(n), as the input of second noise reduction self-encoding encoder, train second noise reduction self-encoding encoder DAE2;
Semantic feature generation module:For two noise reduction self-encoding encoders DAE1 completing to train and DAE2 stacking are obtained three layers SDAE structure, corresponding ground floor is input layer, and dimension is d;The second layer is the hidden layer in DAE1, and dimension is dh1;Third layer is DAE2 In corresponding hidden layer, dimension be dh2;By this SDAE structure, the manual shallow-layer feature of given breast molybdenum target image, obtain after feed-forward Semantic feature to the higher level of abstraction based on stacking noise reduction self-encoding encoder represents
Further, described sample training module:It is additionally operable to extract respectively the GLCM of each ROI image, small echo, small echo The manual shallow-layer feature of bag, tetra- kinds of MPEG-7, the characteristic vector being cascaded as a d dimension is as a sample.
Further, in described first encoder training module, first noise reduction self-encoding encoder is made up of three-layer network, right The neuron number answering input layer x, hidden layer y, output layer z is respectively d, dh1, d, wherein the input of input layer be training sample set In certain sample x(i)∈[0,1]d, i=1,2 ..., n;Input layer has been artificially induced noise;Parameter θ1={ W1,b1, θ2= {W2,b2, b1、b2For being respectively the bias vector of hidden layer and output layer, size is respectively dh1With d dimension, W1、W2It is respectively input layer To the weights connection matrix of hidden layer and hidden layer to the weights connection matrix of output layer, size is respectively d × dh1、dh1×d;Activation Function is all using sigmoid function;
First encoder training module is included with lower unit:
Sample decomposition unit:For splitting to training sample set, concretely comprise the following steps:Training sample by ultrasonoscopy Integrate random division as num batch, each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Network parameter initialization unit:For network parameter initialization;It is specifically configured to:
Learningrate=1;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Cycle-index arranging unit:For arranging maximum cycle NN;
The inside and outside arranging unit that recirculates:For arranging the outer t=1 to NN that recirculates;
For the s=1 to num that recirculates in arranging;
Corrosion data unit:For corrosion data:By binary system masking noise mode, with certain probability by input feature vector In vector x, some values are randomly reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_ Size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Feed-forward unit:For feed-forward:
z(i)=sigmoid (W2 Ty(i)+b2), i=1,2 ..., batch_size;
Reverse transfer unit:For reverse transfer:
Undated parameter unit:For undated parameter
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.
It is different from prior art, technique scheme is based on medium scale gland molybdenum target focal area image, training obtains Two self-encoding encoders, and obtain SDAE structure according to two self-encoding encoders, and finally give semantic feature it is achieved that breast ultrasound The extraction of characteristics of image, thus for clinical diagnosises provide valuable " advisory opinion ", improve breast cancer diagnosis accuracy rate and Efficiency.
Brief description
Fig. 1 is that the single noise reduction self-encoding encoder of breast ultrasound characteristics of image deep learning in the embodiment of the present invention was trained Journey;
Fig. 2 is to stack noise reduction self-encoding encoder training process in the embodiment of the present invention.
Specific embodiment
By the technology contents of detailed description technical scheme, structural features, realized purpose and effect, below in conjunction with concrete reality Apply example and coordinate accompanying drawing to be explained in detail.
Refer to Fig. 1 to Fig. 2, it is super that the present embodiment present embodiments provides a kind of mammary gland based on stacking noise reduction self-encoding encoder Acoustic image feature self study extracting method, specific as follows:
Step S1:A given medium-scale above breast ultrasound focal area image set, described medium-scale expression This image set is at least containing breast molybdenum target diagnostic images more than 200 width;
Step S2:The breast molybdenum target focal zone of each breast ultrasound diagnostic image in image set in manual extraction step S1 Area image ROI (Region of interest, area-of-interest);Wherein said breast ultrasound focal area image ROI's is big Little is 150 × 150;
Step S3:Extract manual shallow-layer feature to train as one from each breast ultrasound focal area image ROI Sample, composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈[0,1]d, i=1, 2,…,n.Wherein d represents the characteristic dimension of sample, and n represents training set number of samples.
Step S4:Based on training sample set, train first noise reduction self-encoding encoder DAE1, wherein DAE is Denoising Autoencoder, noise reduction self-encoding encoder.
Step S5:After having trained first noise reduction self-encoding encoder, re-entering set_unlabeled sample is training sample Collection, the character representation being obtained according to the hidden layer study that the model DAE1 that step S4 trains extracts all samples, constitute new sample This { y(1),y(2),…,y(n), as the input of second noise reduction self-encoding encoder, train second noise reduction self-encoding encoder DAE2.
Step S6:Two that complete to train noise reduction self-encoding encoder (DAE1, DAE2) stackings are obtained three layers of SDAE (stacked Denoising Autoencoder) structure, as shown in Figure 2.Corresponding ground floor is input layer, and dimension is d;Second Layer is the hidden layer in DAE1, and dimension is dh1;Third layer is corresponding hidden layer in DAE2, and dimension is dh2.By this model, give The manual shallow-layer feature of breast ultrasound image, can obtain the higher level of abstraction based on stacking noise reduction self-encoding encoder after feed-forward Semantic feature representSo obtain semantic feature it is achieved that mammary gland surpasses The extraction of acoustic image feature, thus providing valuable " advisory opinion " for clinical diagnosises, improves the accuracy rate of breast cancer diagnosis And efficiency.
Further, described step S3 is:Extract GLCM ((gray level co-occurrence matrixes, the Gray- of each ROI image respectively Level co-occurrence matrix), small echo, wavelet packet, MPEG-7 (Moving Picture Experts Group, Dynamic image expert group) four kinds of manual shallow-layer features, the characteristic vector being cascaded as a d dimension is as a sample.In view of certain A little actual maximums of characteristic attribute and minima are unknown, and there is a possibility that outlier, take z-score's first Normalization method, normalizing is as follows:
Wherein x represents the observation of a certain dimensional characteristics, and mean is the average of this dimensional characteristics observation, and std is this dimension The standard deviation of degree feature observation, x' carries out the result after z-score specification for x.In view of nerve during training own coding Unit is that existed with Probability Forms, proceeds Min-Max and standardizes to [0,1] interval.Normalizing is as follows:
Wherein x' represents the observation of a certain dimensional characteristics, and min is the minima of this dimensional characteristics observation, and max is should The maximum of dimensional characteristics observation, x " carry out the result after Min-Max specification for x'.
In step s3, it focuses on cascade therein, in general, GLCM, small echo, wavelet packet, MPEG-7 this four Kind of shallow-layer feature is all only extracted the part physical feature of image, not comprehensively, in order to ensure subsequently can be from comprehensive physics Feature learning goes out more preferable high-level characteristic, and these four different shallow-layer features are concatenated together the study as follow-up work Basis, farthest to comprise the physical message of ROI image.
Further, described step S4 is:As shown in figure 1, whole noise reduction self-encoding encoder is made up of three-layer network, correspondence is defeated Enter neuron number respectively d, d of a layer x, hidden layer y, output layer zh1, d, wherein the input of input layer be training set set_ Certain sample x in unlabeled(i)∈[0,1]d, i=1,2 ..., n.Input layer has been artificially induced noise.Parameter θ1= {W1,b1, θ2={ W2,b2, b1、b2For being respectively the bias vector of hidden layer and output layer, size is respectively dh1With d dimension, W1、W2 It is respectively input layer and arrive the weights connection matrix of hidden layer and hidden layer to the weights connection matrix of output layer, size respectively d × dh1、 dh1×d.Activation primitive is all using sigmoid function.
Specifically include following steps:
Step S41:Training sample set is split, concretely comprises the following steps:Training sample set set_ by ultrasonoscopy Unlabeled random division is num batch (block), each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Step S42:Network parameter initializes;It is specifically configured to:
Learningrate=1;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Step 43:Setting maximum cycle NN;
Step 44:Recirculate outward t=1 to NN;
Inside recirculate s=1 to num;
Step 441:Corrosion data:By binary system masking noise mode, with certain probability by input feature value x certain A little values are randomly reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_ Size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Step 442:Feed-forward:
z(i)=sigmoid (W2 Ty(i)+b2), i=1,2 ..., batch_size;
Step 443:Reverse transfer:
Step 444:Undated parameter
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.This reality Apply example to be advantageous in that, traditional GLCM, small echo, wavelet packet, these shallow-layer features of MPEG-7 are all conventional image in fact Physical features, and the pathological characters that ultrasonoscopy to carry out needs when auxiliary diagnosis as medical image not directly do not close Connection, so had as the description sign on ultrasonoscopy pathology with GLCM, small echo, wavelet packet, these shallow-layer features of MPEG-7 Unreliability.And the aspect ratio physical features higher level by learner study gained, it is more nearly the semantic feature of image, Bigger with the pathology degree of association of ultrasonoscopy, it is more suitable for characterizing as the description on ultrasonoscopy pathology.
The present invention provides a kind of breast ultrasound characteristics of image self study extraction system based on stacking noise reduction self-encoding encoder, bag Include with lower module:
Image set gives module:For giving a medium-scale above breast ultrasound focal area image set, described Medium-scale this image set of expression is at least containing breast ultrasound diagnostic images more than 200 width;
Focal area extraction module:For each breast ultrasound diagnostic image in image set in manual extraction step S1 Focal area image ROI;
Sample training module:Make for extracting manual shallow-layer feature from each breast ultrasound focal area image ROI For a training sample, composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈ [0,1]d, i=1,2 ..., n;Wherein d represents the characteristic dimension of sample, and n represents training set number of samples;
First encoder training module:For based on training sample set, training first noise reduction self-encoding encoder DAE1;
Second encoder training module:For having trained after first noise reduction self-encoding encoder, re-enter training sample set, The character representation being obtained according to the hidden layer study that the encoder that step S4 trains extracts all samples, constitutes new sample { y(1),y(2),…,y(n), as the input of second noise reduction self-encoding encoder, train second noise reduction self-encoding encoder DAE2;
Semantic feature generation module:For two noise reduction self-encoding encoders DAE1 completing to train and DAE2 stacking are obtained three layers SDAE structure, corresponding ground floor is input layer, and dimension is d;The second layer is the hidden layer in DAE1, and dimension is dh1;Third layer is DAE2 In corresponding hidden layer, dimension be dh2;By this SDAE structure, the manual shallow-layer feature of given breast molybdenum target image, obtain after feed-forward The semantic feature of the higher level of abstraction based on stacking noise reduction self-encoding encoder represents So obtain semantic feature it is achieved that the extraction of breast ultrasound characteristics of image, thus providing valuable " reference for clinical diagnosises Suggestion ", improves accuracy rate and the efficiency of breast cancer diagnosis.
Further, described sample training module:It is additionally operable to extract respectively the GLCM of each ROI image, small echo, small echo The manual shallow-layer feature of bag, tetra- kinds of MPEG-7, the characteristic vector being cascaded as a d dimension is as a sample.Sample training module weight Point is cascade therein, and in general, these four shallow-layer features of GLCM, small echo, wavelet packet, MPEG-7 are all only extracted image Part physical feature, not comprehensively, preferably high-rise special in order to ensure subsequently can to go out from comprehensive physical features learning Levy, these four different shallow-layer features are concatenated together the learning foundation as follow-up work, farthest to comprise ROI The physical message of image.
Further, in described first encoder training module, first noise reduction self-encoding encoder is made up of three-layer network, right The neuron number answering input layer x, hidden layer y, output layer z is respectively d, dh1, d, wherein the input of input layer be training sample set In certain sample x(i)∈[0,1]d, i=1,2 ..., n;Input layer has been artificially induced noise;Parameter θ1={ W1,b1, θ2= {W2,b2, b1、b2For being respectively the bias vector of hidden layer and output layer, size is respectively dh1With d dimension, W1、W2It is respectively input layer To the weights connection matrix of hidden layer and hidden layer to the weights connection matrix of output layer, size is respectively d × dh1、dh1×d;Activation Function is all using sigmoid function;
First encoder training module is included with lower unit:
Sample decomposition unit:For splitting to training sample set, concretely comprise the following steps:Training sample by ultrasonoscopy Integrate random division as num batch, each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Network parameter initialization unit:For network parameter initialization;It is specifically configured to:
Learningrate=1;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Cycle-index arranging unit:For arranging maximum cycle NN;
The inside and outside arranging unit that recirculates:For arranging the outer t=1 to NN that recirculates;
For the s=1 to num that recirculates in arranging;
Corrosion data unit:For corrosion data:By binary system masking noise mode, with certain probability by input feature vector In vector x, some values are randomly reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_ Size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Feed-forward unit:For feed-forward:
z(i)=sigmoid (W2 Ty(i)+b2), i=1,2 ..., batch_size;
Reverse transfer unit:For reverse transfer:
Undated parameter unit:For undated parameter
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.This reality Apply example to be advantageous in that, traditional GLCM, small echo, wavelet packet, these shallow-layer features of MPEG-7 are all conventional image in fact Physical features, and the pathological characters that ultrasonoscopy to carry out needs when auxiliary diagnosis as medical image not directly do not close Connection, so had as the description sign on ultrasonoscopy pathology with GLCM, small echo, wavelet packet, these shallow-layer features of MPEG-7 Unreliability.And the aspect ratio physical features higher level by learner study gained, it is more nearly the semantic feature of image, Bigger with the pathology degree of association of ultrasonoscopy, it is more suitable for characterizing as the description on ultrasonoscopy pathology.
It should be noted that herein, such as first and second or the like relational terms are used merely to a reality Body or operation are made a distinction with another entity or operation, and not necessarily require or imply these entities or deposit between operating In any this actual relation or order.And, term " inclusion ", "comprising" or its any other variant are intended to The comprising of nonexcludability, so that include a series of process of key elements, method, article or terminal unit not only include those Key element, but also include other key elements being not expressly set out, or also include for this process, method, article or end The intrinsic key element of end equipment.In the absence of more restrictions, limited by sentence " including ... " or " comprising ... " It is not excluded that also there is other key element in process, method, article or the terminal unit including described key element in key element.This Outward, herein, " more than ", " less than ", " exceeding " etc. be interpreted as not including this number;" more than ", " below ", " within " etc. understand It is including this number.
Those skilled in the art are it should be appreciated that the various embodiments described above can be provided as method, device or computer program product Product.These embodiments can be using complete hardware embodiment, complete software embodiment or the embodiment combining software and hardware aspect Form.All or part of step in the method that the various embodiments described above are related to can be instructed by program correlation hardware Lai Complete, described program can be stored in the storage medium that computer equipment can read, for executing the various embodiments described above side All or part of step described in method.Described computer equipment, including but not limited to:Personal computer, server, general-purpose computations Machine, special-purpose computer, the network equipment, embedded device, programmable device, intelligent mobile terminal, intelligent home device, Wearable Smart machine, vehicle intelligent equipment etc.;Described storage medium, including but not limited to:RAM, ROM, magnetic disc, tape, CD, sudden strain of a muscle Deposit, USB flash disk, portable hard drive, storage card, memory stick, webserver storage, network cloud storage etc..
The various embodiments described above are with reference to the method according to embodiment, equipment (system) and computer program Flow chart and/or block diagram are describing.It should be understood that can be by every in computer program instructions flowchart and/or block diagram Flow process in one flow process and/or square frame and flow chart and/or block diagram and/or the combination of square frame.These computers can be provided Programmed instruction to computer equipment processor to produce a machine so that by the finger of the computing device of computer equipment Order produces and is used for what realization was specified in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame The device of function.
These computer program instructions may be alternatively stored in and the computer that computer equipment works in a specific way can be guided to set So that the instruction being stored in this computer equipment readable memory produces the manufacture including command device in standby readable memory Product, this command device is realized in one flow process of flow chart or multiple flow process and/or one square frame of block diagram or multiple square frame middle finger Fixed function.
These computer program instructions also can be loaded on computer equipment so that executing a series of on a computing device Operating procedure is to produce computer implemented process, thus the instruction executing on a computing device is provided for realizing in flow process The step of the function of specifying in one flow process of figure or multiple flow process and/or one square frame of block diagram or multiple square frame.
Although being described to the various embodiments described above, those skilled in the art once know basic wound The property made concept, then can make other change and modification to these embodiments, so the foregoing is only embodiments of the invention, Not thereby the equivalent structure that the scope of patent protection of the restriction present invention, every utilization description of the invention and accompanying drawing content are made Or equivalent flow conversion, or directly or indirectly it is used in other related technical fields, all include the patent in the present invention in the same manner Within protection domain.

Claims (6)

1. a kind of breast ultrasound characteristics of image self study extracting method based on stacking noise reduction self-encoding encoder is it is characterised in that wrap Include following steps:
Step S1:A given medium-scale above breast ultrasound focal area image set, described medium-scale this figure of expression Image set is at least containing breast ultrasound diagnostic images more than 200 width;
Step S2:The focal area image ROI of each breast ultrasound diagnostic image in image set in manual extraction step S1;
Step S3:Extract manual shallow-layer feature as a training sample from each breast ultrasound focal area image ROI, Composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈[0,1]d, i=1,2 ..., n;Wherein d represents the characteristic dimension of sample, and n represents training set number of samples;
Step S4:Based on training sample set, train first noise reduction self-encoding encoder DAE1;
Step S5:After having trained first noise reduction self-encoding encoder, re-enter training sample set, the volume training according to step S4 Code device extracts the character representation that the hidden layer study of all samples obtains, and constitutes new sample { y(1),y(2),…,y(n), made For the input of second noise reduction self-encoding encoder, train second noise reduction self-encoding encoder DAE2;
Step S6:Two noise reduction self-encoding encoders DAE1 completing to train and DAE2 stacking are obtained three layers of SDAE structure, corresponding Ground floor is input layer, and dimension is d;The second layer is the hidden layer in DAE1, and dimension is dh1;Third layer is corresponding hidden in DAE2 Layer, dimension is dh2;By this SDAE structure, the manual shallow-layer feature of given breast molybdenum target image, it is based on after feed-forward The semantic feature of the higher level of abstraction of stacking noise reduction self-encoding encoder represents
2. the breast ultrasound characteristics of image self study extraction side based on stacking noise reduction self-encoding encoder according to claim 1 Method it is characterised in that:
Described step S3 is:Extract the GLCM of each ROI image, small echo, wavelet packet, the manual shallow-layers of tetra- kinds of MPEG-7 respectively special Levy, the characteristic vector being cascaded as a d dimension is as a sample.
3. the breast ultrasound characteristics of image self study extraction side based on stacking noise reduction self-encoding encoder according to claim 1 Method it is characterised in that:
In described step S4, first noise reduction self-encoding encoder is made up of three-layer network, corresponding input layer x, hidden layer y, output layer z Neuron number is respectively d, dh1, d, certain sample x that wherein input of input layer is concentrated for training sample(i)∈[0,1]d,i =1,2 ..., n;Input layer has been artificially induced noise;Parameter θ1={ W1,b1, θ2={ W2,b2, b1、b2For being respectively hidden layer With the bias vector of output layer, size is respectively dh1With d dimension, W1、W2Be respectively input layer arrive hidden layer weights connection matrix with To the weights connection matrix of output layer, size is respectively d × d to hidden layerh1、dh1×d;Activation primitive is all using sigmoid function;
Step S4 comprises the following steps:
Step S41:Training sample set is split, concretely comprises the following steps:The training sample set random division of ultrasonoscopy is Num batch, each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Step S42:Network parameter initializes;It is specifically configured to:
Learningrate=1;
W 1 = ( r a n d ( d , d h 1 ) - 0.5 ) &times; 6 d + d h 1 ;
W 2 = ( r a n d ( d h 1 , d ) - 0.5 ) &times; 6 d h 1 + d ;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Step 43:Setting maximum cycle NN;
Step 44:Recirculate outward t=1to NN;
Inside recirculate s=1to num;
Step 441:Corrosion data:By binary system masking noise mode, with certain probability by values some in input feature value x Randomly it is reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Step 442:Feed-forward:
z ( i ) = s i g m o i d ( W 2 T y ( i ) + b 2 ) , i = 1 , 2 , ... , b a t c h _ s i z e ;
Step 443:Reverse transfer:
&delta; j ( i ) h = &Sigma; t = 1 d W 2 j t &delta; t ( i ) o y j ( i ) ( 1 - y j ( i ) ) , i = 1 , 2 , ... , b a t c h _ s i z e , 1 &le; j &le; d h 1 ;
Step 444:Undated parameter
W 2 = W 2 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e y ( i ) &delta; ( i ) o T ;
b 1 = b 1 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e &delta; j ( i ) h T ;
b 2 = b 2 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e &delta; j ( i ) o T ;
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.
4. a kind of breast ultrasound characteristics of image self study extraction system based on stacking noise reduction self-encoding encoder is it is characterised in that wrap Include with lower module:
Image set gives module:For giving a medium-scale above breast ultrasound focal area image set, described medium Scale represents this image set at least containing breast ultrasound diagnostic images more than 200 width;
Focal area extraction module:Focus for each breast ultrasound diagnostic image in image set in manual extraction step S1 Area image ROI;
Sample training module:For extracting manual shallow-layer feature from each breast ultrasound focal area image ROI as one Individual training sample, composing training sample set set_unlabeled={ x(1),x(2),…,x(n), i-th sample x(i)∈[0,1 ]d, i=1,2 ..., n;Wherein d represents the characteristic dimension of sample, and n represents training set number of samples;
First encoder training module:For based on training sample set, training first noise reduction self-encoding encoder DAE1;
Second encoder training module:For having trained after first noise reduction self-encoding encoder, re-enter training sample set, according to The encoder that step S4 trains extracts the character representation that the hidden layer study of all samples obtains, and constitutes new sample { y(1),y(2),…,y(n), as the input of second noise reduction self-encoding encoder, train second noise reduction self-encoding encoder DAE2;
Semantic feature generation module:For two noise reduction self-encoding encoders DAE1 completing to train and DAE2 stacking being obtained three layers of SDAE Structure, corresponding ground floor is input layer, and dimension is d;The second layer is the hidden layer in DAE1, and dimension is dh1;Third layer is corresponding in DAE2 Hidden layer, dimension be dh2;By this SDAE structure, the manual shallow-layer feature of given breast molybdenum target image, after feed-forward, obtain base Semantic feature in the higher level of abstraction of stacking noise reduction self-encoding encoder represents
5. according to claim 4 extraction based on the breast ultrasound characteristics of image self study of stacking noise reduction self-encoding encoder is System it is characterised in that:
Described sample training module:It is additionally operable to extract respectively the GLCM of each ROI image, small echo, wavelet packet, tetra- kinds of handss of MPEG-7 Work shallow-layer feature, the characteristic vector being cascaded as a d dimension is as a sample.
6. according to claim 4 extraction based on the breast ultrasound characteristics of image self study of stacking noise reduction self-encoding encoder is System it is characterised in that:
In described first encoder training module, first noise reduction self-encoding encoder is made up of three-layer network, corresponding input layer x, hidden Layer y, the neuron number of output layer z are respectively d, dh1, d, certain sample x that wherein input of input layer is concentrated for training sample(i)∈[0,1]d, i=1,2 ..., n;Input layer has been artificially induced noise;Parameter θ1={ W1,b1, θ2={ W2,b2, b1、b2 For being respectively the bias vector of hidden layer and output layer, size is respectively dh1With d dimension, W1、W2It is respectively input layer to the power of hidden layer It is worth connection matrix and hidden layer to the weights connection matrix of output layer, size is respectively d × dh1、dh1×d;Activation primitive all adopts Sigmoid function;
First encoder training module is included with lower unit:
Sample decomposition unit:For splitting to training sample set, concretely comprise the following steps:By the training sample set of ultrasonoscopy with Machine is divided into num batch, each batchi∈[0,1]batch_size×d, i=1,2 ..., num;
Network parameter initialization unit:For network parameter initialization;It is specifically configured to:
Learningrate=1;
W 1 = ( r a n d ( d , d h 1 ) - 0.5 ) &times; 6 d + d h 1 ;
W 2 = ( r a n d ( d h 1 , d ) - 0.5 ) &times; 6 d h 1 + d ;
b1=0, b2=0;
Wherein, leanningrate represents learning rate, and rand (m, n) function is the random m × n rank matrix generating [0,1];
Cycle-index arranging unit:For arranging maximum cycle NN;
The inside and outside arranging unit that recirculates:For arranging the outer t=1to NN that recirculates;
For the s=1to num that recirculates in arranging;
Corrosion data unit:For corrosion data:By binary system masking noise mode, with certain probability by input feature value In x, some values are randomly reset to 0;It is specially:
batchs=batchs× (rand (batch_size, d) > threashold), a batchsConstitute batch_size × d rank matrix, threashold is the threshold value setting, and is specifically set as 0.2;If the random matrix A=rand generating (batch_size, d) in elements AijLess than threashold, then matrix batchsThe element of middle correspondence position is reset to 0;Fixed Adopted batchsIn i-th sample be x(i), after corrosion it is
Feed-forward unit:For feed-forward:
z(i)=sigmoid (W2 Ty(i)+b2), i=1,2 ..., batch_size;
Reverse transfer unit:For reverse transfer:
&delta; j ( i ) o = ( z j ( i ) - x j ( i ) ) z j ( i ) ( 1 - z j ( i ) ) , i = 1 , 2 , ... , b a t c h _ s i z e , 1 &le; j &le; d ;
&delta; j ( i ) h = &Sigma; t = 1 d W 2 j t &delta; t ( i ) o y j ( i ) ( 1 - y j ( i ) ) , i = 1 , 2 , ... , b a t c h _ s i z e , 1 &le; j &le; d h 1 ;
Undated parameter unit:For undated parameter
W 2 = W 2 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e y ( i ) &delta; ( i ) o T ;
b 1 = b 1 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e &delta; j ( i ) h T ;
b 2 = b 2 - 1 b a t c h _ s i z e &Sigma; i = 1 b a t c h _ s i z e &delta; j ( i ) o T ;
Wherein,Represent the residual error of i-th sample correspondence output layer and j-th node of hidden layer respectively.
CN201610834295.1A 2016-09-20 2016-09-20 A kind of breast ultrasound characteristics of image self study extracting method and system based on stacking noise reduction self-encoding encoder Active CN106407992B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610834295.1A CN106407992B (en) 2016-09-20 2016-09-20 A kind of breast ultrasound characteristics of image self study extracting method and system based on stacking noise reduction self-encoding encoder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610834295.1A CN106407992B (en) 2016-09-20 2016-09-20 A kind of breast ultrasound characteristics of image self study extracting method and system based on stacking noise reduction self-encoding encoder

Publications (2)

Publication Number Publication Date
CN106407992A true CN106407992A (en) 2017-02-15
CN106407992B CN106407992B (en) 2019-04-02

Family

ID=57997905

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610834295.1A Active CN106407992B (en) 2016-09-20 2016-09-20 A kind of breast ultrasound characteristics of image self study extracting method and system based on stacking noise reduction self-encoding encoder

Country Status (1)

Country Link
CN (1) CN106407992B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103601A (en) * 2017-04-14 2017-08-29 成都知识视觉科技有限公司 A kind of cell mitogen detection method in breast cancer points-scoring system
CN109559358A (en) * 2018-10-22 2019-04-02 天津大学 It is a kind of that the method for sampling is risen from the image pattern of coding based on convolution
CN109858408A (en) * 2019-01-17 2019-06-07 西安交通大学 A kind of ultrasonic signal processing method based on self-encoding encoder
CN110503147A (en) * 2019-08-22 2019-11-26 山东大学 Multi-mode image categorizing system based on correlation study
CN111178196A (en) * 2019-12-19 2020-05-19 东软集团股份有限公司 Method, device and equipment for cell classification

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104933446A (en) * 2015-07-15 2015-09-23 福州大学 Method used for verifying feature validity of breast B-ultrasonography through computer-aided diagnosis
CN105023023A (en) * 2015-07-15 2015-11-04 福州大学 Mammary gland type-B ultrasonic image feature self-learning extraction method used for computer-aided diagnosis
US20160093048A1 (en) * 2014-09-25 2016-03-31 Siemens Healthcare Gmbh Deep similarity learning for multimodal medical images

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20160093048A1 (en) * 2014-09-25 2016-03-31 Siemens Healthcare Gmbh Deep similarity learning for multimodal medical images
CN104933446A (en) * 2015-07-15 2015-09-23 福州大学 Method used for verifying feature validity of breast B-ultrasonography through computer-aided diagnosis
CN105023023A (en) * 2015-07-15 2015-11-04 福州大学 Mammary gland type-B ultrasonic image feature self-learning extraction method used for computer-aided diagnosis

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
JIEZHI CHENG ET AL.: ""Computer-Aided Diagnosis with Deep Learning Architecture: Applications to Breast Lesions in US Images and Pulmonary Nodules in CT Scans"", 《SCIENTIFIC REPORTS》 *
PASCAL VINCENT ET AL.: ""Stacked Denoising Autoencoders: Learning Useful Representations in a Deep Network with a Local Denoising Criterion"", 《JOURNAL OF MACHINE LEARNING RESEARCH》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103601A (en) * 2017-04-14 2017-08-29 成都知识视觉科技有限公司 A kind of cell mitogen detection method in breast cancer points-scoring system
CN107103601B (en) * 2017-04-14 2020-04-24 成都知识视觉科技有限公司 Cell mitosis detection method in breast cancer scoring system
CN109559358A (en) * 2018-10-22 2019-04-02 天津大学 It is a kind of that the method for sampling is risen from the image pattern of coding based on convolution
CN109559358B (en) * 2018-10-22 2023-07-04 天津大学 Image sample up-sampling method based on convolution self-coding
CN109858408A (en) * 2019-01-17 2019-06-07 西安交通大学 A kind of ultrasonic signal processing method based on self-encoding encoder
CN110503147A (en) * 2019-08-22 2019-11-26 山东大学 Multi-mode image categorizing system based on correlation study
CN110503147B (en) * 2019-08-22 2022-04-08 山东大学 Multi-mode image classification system based on correlation learning
CN111178196A (en) * 2019-12-19 2020-05-19 东软集团股份有限公司 Method, device and equipment for cell classification
CN111178196B (en) * 2019-12-19 2024-01-23 东软集团股份有限公司 Cell classification method, device and equipment

Also Published As

Publication number Publication date
CN106407992B (en) 2019-04-02

Similar Documents

Publication Publication Date Title
Wei et al. Automatic skin cancer detection in dermoscopy images based on ensemble lightweight deep learning network
CN106407992A (en) Breast ultrasound image self-learning extraction method and system based on stacked noise reduction self-encoder
Zhang et al. Automatic skin lesion segmentation by coupling deep fully convolutional networks and shallow network with textons
CN105469100A (en) Deep learning-based skin biopsy image pathological characteristic recognition method
Hu et al. AS-Net: Attention Synergy Network for skin lesion segmentation
Han et al. Hybrid resampling and multi-feature fusion for automatic recognition of cavity imaging sign in lung CT
CN109508787A (en) Neural network model training method and system for ultrasound displacement estimation
Kumar et al. MobiHisNet: a lightweight CNN in mobile edge computing for histopathological image classification
CN105957092B (en) Breast molybdenum target characteristics of image self study extracting method for computer-aided diagnosis
Aldhyani et al. Multi-class skin lesion classification using a lightweight dynamic kernel deep-learning-based convolutional neural network
Yu et al. Fetal facial standard plane recognition via very deep convolutional networks
Li et al. 3D tumor detection in automated breast ultrasound using deep convolutional neural network
Lama et al. ChimeraNet: U-Net for hair detection in dermoscopic skin lesion images
CN105023023A (en) Mammary gland type-B ultrasonic image feature self-learning extraction method used for computer-aided diagnosis
CN114663445A (en) Three-dimensional heart image segmentation method based on multi-scale edge perception
CN117078692B (en) Medical ultrasonic image segmentation method and system based on self-adaptive feature fusion
Zhang et al. ACCPG-Net: A skin lesion segmentation network with Adaptive Channel-Context-Aware Pyramid Attention and Global Feature Fusion
US20230360256A1 (en) Contextual instance decoupling-based multi-person pose estimation method and apparatus
Layode et al. Deep learning based integrated classification and image retrieval system for early skin cancer detection
CN115564756A (en) Medical image focus positioning display method and system
Sarwinda et al. Automatic multi-class classification of indonesian traditional food using convolutional neural networks
Wang et al. FBUNet: Full convolutional network based on fusion block architecture for biomedical image segmentation
Ojha et al. Skin Disease Detection and Classification
CN113673303A (en) Human face action unit intensity regression method, device and medium
Liu et al. Dermatoscopic image melanoma recognition based on CFLDnet fusion network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB03 Change of inventor or designer information
CB03 Change of inventor or designer information

Inventor after: Chen Zhuangwei

Inventor after: Huang Xiaoxi

Inventor after: Liu Shu

Inventor before: Chen Zhuangwei

GR01 Patent grant
GR01 Patent grant