CN110263804B - Medical image segmentation method based on safe semi-supervised clustering - Google Patents
Medical image segmentation method based on safe semi-supervised clustering Download PDFInfo
- Publication number
- CN110263804B CN110263804B CN201910371366.2A CN201910371366A CN110263804B CN 110263804 B CN110263804 B CN 110263804B CN 201910371366 A CN201910371366 A CN 201910371366A CN 110263804 B CN110263804 B CN 110263804B
- Authority
- CN
- China
- Prior art keywords
- sample
- clustering
- labeled
- unlabeled
- density
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 44
- 238000003709 image segmentation Methods 0.000 title claims abstract description 19
- 239000011159 matrix material Substances 0.000 claims abstract description 6
- 238000005457 optimization Methods 0.000 claims description 8
- 238000004364 calculation method Methods 0.000 claims description 6
- 238000010606 normalization Methods 0.000 claims description 4
- 230000011218 segmentation Effects 0.000 claims description 3
- 238000013507 mapping Methods 0.000 claims description 2
- 230000006870 function Effects 0.000 description 5
- 230000008569 process Effects 0.000 description 3
- 230000015556 catabolic process Effects 0.000 description 2
- 238000006731 degradation reaction Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 230000007786 learning performance Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000003759 clinical diagnosis Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000001627 detrimental effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000007246 mechanism Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012800 visualization Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T7/00—Image analysis
- G06T7/10—Segmentation; Edge detection
- G06T7/136—Segmentation; Edge detection involving thresholding
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- Image Analysis (AREA)
- Medical Treatment And Welfare Office Work (AREA)
Abstract
The invention discloses a medical image segmentation method based on safe semi-supervised clustering, and relates to a semi-supervised FCM clustering and density peak value clustering method. Firstly, a local graph is constructed by adopting a k-nearest neighbor method to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.
Description
Technical Field
The invention relates to a medical image segmentation method based on semi-supervised clustering, in particular to a medical image segmentation method based on safe semi-supervised clustering, and belongs to the field of data mining based on medical images.
Background
With the continuous development of visualization technology, modern medicine has become more and more unable to process information of medical images, and medical images play an important role in clinical diagnosis, teaching and scientific research, and the like. The medical image segmentation method based on semi-supervised clustering integrates limited manual supervision information, namely, a plurality of limited points are clicked on an image to identify the relation between corresponding regions, the points are used as sample data with label information in the medical image segmentation method based on semi-supervised clustering, and the sample data is used for guiding clustering, so that the algorithm performance is improved, and the image segmentation is more accurate. The marking in the medical image is generally finished by experts, but wrong marking may occur due to various conditions in the marking process, and the medical image often carries noise points and outliers, and the traditional medical image segmentation method based on semi-supervised clustering does not consider the two aspects in the clustering process.
In this case, the performance of the conventional semi-supervised clustering method may be worse than that of the corresponding unsupervised learning method, which limits the application of the semi-supervised clustering in the medical image segmentation to a certain extent. In other words, the marked data may be detrimental to performance, while noise and outliers in the unmarked data also have a large impact on performance. The traditional semi-supervised clustering generally considers that the prior knowledge is beneficial to learning effect, but the collected prior knowledge (such as error labeled samples and noise) can possibly cause the degradation of learning performance. Xuesong Yin indicates that wrong a priori knowledge can lead to a degradation of learning performance. Based on the two aspects, it makes sense to design a safe semi-supervised learning method. Therefore, the invention tries to develop a mechanism that different samples have different safety degrees, so as to realize that the clustering performance is not lower than that of the original unsupervised clustering and semi-supervised clustering methods.
Disclosure of Invention
The invention provides a medical image segmentation method based on safe semi-supervised clustering, aiming at the defect that the risk of a marked sample and an unmarked sample is not considered simultaneously in the traditional medical image segmentation method based on semi-supervised clustering, which can cause the final segmentation effect to be reduced.
Firstly, the invention adopts a k-nearest neighbor method to construct a local graph to obtain a graph regular term. Second, the FCM clustering and density clustering methods are used to estimate the confidence of the labeled and unlabeled samples. Then, confidence weighting of the samples and a regular term based on a local graph are introduced into the target function of the original semi-supervised FCM clustering method to obtain the target function of the safe semi-supervised clustering method. And finally, obtaining a clustering result by iteratively optimizing the membership matrix and the clustering center. The technical scheme is as follows: a medical image segmentation method based on safe semi-supervised clustering comprises the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
step four: constructing a local graph with the aim of limiting the output of the labeled samples with low confidence to the output of the adjacent samples;
step five: integrating information to construct a target function;
step six: solving the optimization problem by adopting an iterative optimization method;
step seven: and judging the category of the unlabeled sample to realize medical image segmentation.
Compared with the traditional semi-supervised clustering method, the method measures the confidence coefficient of the samples by using the density and the distance between the samples, and limits the marked samples with low confidence coefficient to the output of the adjacent samples by constructing the local graph, so that each sample can be safely and reasonably used, and the clustering is more accurate and robust. The invention solves the safe use problem of the marked sample, simultaneously solves the safe use problem of the unmarked sample, and improves the accuracy and the robustness of the medical image segmentation.
Drawings
FIG. 1 is a flow chart of an embodiment of the present invention.
Detailed Description
While the invention has been described in connection with what is presently considered to be the most practical and preferred embodiment, it is to be understood that the invention is not to be limited to the disclosed embodiment, but on the contrary, is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
To better illustrate the objects and advantages of the present invention, an embodiment of the method of the present invention is described in further detail below with reference to fig. 1 and examples.
The method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn]。
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
tagging predictions using the Kuhn-Munkres algorithmMapping to equivalent labelsWith a given labelAre consistent.
Step three: obtaining the confidence coefficient of the unmarked sample by using a density peak value clustering method and according to the local density of the unmarked sample and the minimum distance between the unmarked sample and the point with higher density, obtaining the confidence coefficient of the marked sample according to the local density of the marked sample in the same marked sample cluster and the minimum distance between the marked sample and the point with higher density, and normalizing the confidence coefficient;
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs the truncation distance.
unlabeled sample confidence: gamma rayk=ρk/δk (4)
local density of labeled samples in the same labeled sample cluster:
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of samples with the same label.
Minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing a local neighborhood graph of the marked sample, and then weighting W ═ W of the local graph edgek′r]n×nThe calculation is as follows:
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrσ represents the width parameter of the gaussian kernel for the neighboring sample points.
Step five: and integrating the information to construct an objective function.
The objective function is as follows:
the limiting conditions are as follows:
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained. To simplify the calculation, the value of m is set to 2. The method solves the sample membership degree and the clustering center by adopting a Lagrange multiplier method.
Membership u of unlabeled samplesik:
membership u of labeled sampleik′:
cluster center vi:
And obtaining a final membership matrix U and a clustering center V through iterative calculation. When in useOr when the maximum iteration number is reached, the iteration is terminated, wherein t is the current iteration number, and eta is a set threshold value.
Step seven: and judging the category of the unlabeled sample to realize the segmentation of the medical image.
And after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.
Claims (1)
1. A medical image segmentation method based on safe semi-supervised clustering is characterized by comprising the following steps:
the method comprises the following steps: inputting labeled and unlabeled medical image datasets;
a subset of labeled samples of the input medical image dataset: xl=[x1,...,xl]The corresponding label is ykE { 1.. c }, unlabeled sample subset: xu=[xl+1,...,xn];
Step two: FCM clustering is carried out on the data set to obtain a prediction label of the data set;
tagging predictions using the Kuhn-Munkres algorithmIs mapped asMapping tagsWith a given label ykKeeping consistent on categories;
step three: obtaining the confidence coefficient of the unlabeled sample by using a density peak value clustering method and through the local density of the unlabeled sample and the minimum distance between the unlabeled sample and a point with higher density, obtaining the confidence coefficient of the labeled sample through the local density of the labeled sample in the same labeled sample cluster and the minimum distance between the labeled sample and the point with higher density, and normalizing the confidence coefficient;
wherein j ═ 1, 2.. times, n],k=[l+1,...,n]Dist (k, j) is a point xkAnd xjEuclidean distance of dcIs a truncation distance;
unlabeled sample confidence: gamma rayk=ρk/δk (4)
local density of labeled samples in the same labeled sample cluster:
wherein j isy=[1,2,...q],k′=[1,2,...,l],jyRepresenting sample set and labeled sample point xk′A set of identically labeled samples;
minimum distance of the marked sample from the point with higher density in the same marked sample cluster:
and for the data point with the greatest density:
labeled sample confidence:
step four: constructing a k-nearest neighbor local graph with the aim of limiting labeled sample outputs with low confidence to those of neighboring samples;
constructing local neighborhoods of labeled samplesIf the graph has a partial graph edge weight W [ < W >k′r]n×nThe calculation is as follows:
wherein N isp(xk′) Finger xk′P data of nearest neighbor, xk′To mark sample points, xrIs a neighboring sample point, and sigma represents a width parameter of the Gaussian kernel function;
step five: integrating information to construct a target function;
the objective function is as follows:
the limiting conditions are as follows:
step six: solving the optimization problem by adopting an iterative optimization method;
by minimizing the above optimization problem, an optimal solution can be obtained; to simplify the calculation, the value of m is set to 2; the method adopts a Lagrange multiplier method to solve the sample membership degree and the clustering center;
membership u of unlabeled samplesik:
membership u of labeled sampleik′:
cluster center vi:
Obtaining a final membership matrix U and a clustering center V through iterative calculation; when in useOr when the maximum iteration times is reached, the iteration is terminated, wherein t is the current iteration times, and eta is a set threshold;
step seven: judging the category of the unmarked sample to realize the segmentation of the medical image;
and after obtaining the membership matrix U, defuzzifying according to the maximum membership principle to obtain the category of the unlabeled sample, and finally, carrying out image segmentation to obtain a result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910371366.2A CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910371366.2A CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110263804A CN110263804A (en) | 2019-09-20 |
CN110263804B true CN110263804B (en) | 2021-08-03 |
Family
ID=67914306
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910371366.2A Active CN110263804B (en) | 2019-05-06 | 2019-05-06 | Medical image segmentation method based on safe semi-supervised clustering |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110263804B (en) |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111612735A (en) * | 2020-04-08 | 2020-09-01 | 杭州电子科技大学 | Lung nodule image classification method based on information fusion safety semi-supervised clustering |
CN111898704B (en) * | 2020-08-17 | 2024-05-10 | 腾讯科技(深圳)有限公司 | Method and device for clustering content samples |
CN113780750B (en) * | 2021-08-18 | 2024-03-01 | 同济大学 | Medical risk assessment method and device based on medical image segmentation |
CN115131610B (en) * | 2022-06-13 | 2024-02-27 | 西北工业大学 | Robust semi-supervised image classification method based on data mining |
CN118397389A (en) * | 2024-04-16 | 2024-07-26 | 常熟市第一人民医院 | Semi-supervised clustering algorithm model for cerebral obstruction focus image |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106447676A (en) * | 2016-10-12 | 2017-02-22 | 浙江工业大学 | Image segmentation method based on rapid density clustering algorithm |
CN107341812A (en) * | 2017-07-04 | 2017-11-10 | 太原理工大学 | A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8170306B2 (en) * | 2007-04-25 | 2012-05-01 | Siemens Aktiengesellschaft | Automatic partitioning and recognition of human body regions from an arbitrary scan coverage image |
CN104156438A (en) * | 2014-08-12 | 2014-11-19 | 德州学院 | Unlabeled sample selection method based on confidence coefficients and clustering |
CN104881687A (en) * | 2015-06-02 | 2015-09-02 | 四川理工学院 | Magnetic resonance image classification method based on semi-supervised Gaussian mixed model |
CN105825226A (en) * | 2016-03-11 | 2016-08-03 | 江苏畅远信息科技有限公司 | Association-rule-based distributed multi-label image identification method |
CN106611418A (en) * | 2016-03-29 | 2017-05-03 | 四川用联信息技术有限公司 | Image segmentation algorithm |
CN108629783B (en) * | 2018-05-02 | 2021-05-04 | 山东师范大学 | Image segmentation method, system and medium based on image feature density peak search |
CN109409400A (en) * | 2018-08-28 | 2019-03-01 | 西安电子科技大学 | Merge density peaks clustering method, image segmentation system based on k nearest neighbor and multiclass |
-
2019
- 2019-05-06 CN CN201910371366.2A patent/CN110263804B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106447676A (en) * | 2016-10-12 | 2017-02-22 | 浙江工业大学 | Image segmentation method based on rapid density clustering algorithm |
CN107341812A (en) * | 2017-07-04 | 2017-11-10 | 太原理工大学 | A kind of sequence Lung neoplasm image partition method based on super-pixel and Density Clustering |
Also Published As
Publication number | Publication date |
---|---|
CN110263804A (en) | 2019-09-20 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110263804B (en) | Medical image segmentation method based on safe semi-supervised clustering | |
Wu et al. | Handwritten mathematical expression recognition via paired adversarial learning | |
CN112115995B (en) | Image multi-label classification method based on semi-supervised learning | |
CN113408605B (en) | Hyperspectral image semi-supervised classification method based on small sample learning | |
Azadi et al. | Auxiliary image regularization for deep cnns with noisy labels | |
CN110188827B (en) | Scene recognition method based on convolutional neural network and recursive automatic encoder model | |
Cao et al. | A multi-kernel based framework for heterogeneous feature selection and over-sampling for computer-aided detection of pulmonary nodules | |
CN112614131A (en) | Pathological image analysis method based on deformation representation learning | |
CN113326731A (en) | Cross-domain pedestrian re-identification algorithm based on momentum network guidance | |
CN111581466B (en) | Partial multi-mark learning method for characteristic information noise | |
CN113469270B (en) | Semi-supervised intuitive clustering method based on decomposition multi-target differential evolution superpixel | |
CN110458022B (en) | Autonomous learning target detection method based on domain adaptation | |
Cho et al. | Effective pseudo-labeling based on heatmap for unsupervised domain adaptation in cell detection | |
CN113222072A (en) | Lung X-ray image classification method based on K-means clustering and GAN | |
CN113535947A (en) | Multi-label classification method and device for incomplete data with missing labels | |
CN117393098A (en) | Medical image report generation method based on visual priori and cross-modal alignment network | |
CN114399687A (en) | Semi-supervised self-training hyperspectral remote sensing image classification method based on spatial correction | |
CN117972122A (en) | Cross-modal image-text retrieval method based on correlation matching | |
Bai et al. | A unified deep learning model for protein structure prediction | |
CN116363460A (en) | High-resolution remote sensing sample labeling method based on topic model | |
CN113592045B (en) | Model adaptive text recognition method and system from printed form to handwritten form | |
CN114692746A (en) | Information entropy based classification method of fuzzy semi-supervised support vector machine | |
CN108805162A (en) | A kind of saccharomycete multiple labeling feature selection approach and device based on particle group optimizing | |
Chong et al. | Category-wise Fine-Tuning: Resisting Incorrect Pseudo-Labels in Multi-Label Image Classification with Partial Labels | |
Cai et al. | HiCervix: An Extensive Hierarchical Dataset and Benchmark for Cervical Cytology Classification |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |