CN111563549B - Medical image clustering method based on multitasking evolutionary algorithm - Google Patents

Medical image clustering method based on multitasking evolutionary algorithm Download PDF

Info

Publication number
CN111563549B
CN111563549B CN202010364563.4A CN202010364563A CN111563549B CN 111563549 B CN111563549 B CN 111563549B CN 202010364563 A CN202010364563 A CN 202010364563A CN 111563549 B CN111563549 B CN 111563549B
Authority
CN
China
Prior art keywords
roi
medical image
clustering
image
area
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010364563.4A
Other languages
Chinese (zh)
Other versions
CN111563549A (en
Inventor
胡晓敏
颜志鹏
陈伟能
李敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN202010364563.4A priority Critical patent/CN111563549B/en
Publication of CN111563549A publication Critical patent/CN111563549A/en
Application granted granted Critical
Publication of CN111563549B publication Critical patent/CN111563549B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02TCLIMATE CHANGE MITIGATION TECHNOLOGIES RELATED TO TRANSPORTATION
    • Y02T10/00Road transport of goods or passengers
    • Y02T10/10Internal combustion engine [ICE] based vehicles
    • Y02T10/40Engine management systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a medical image clustering method based on a multitasking evolution algorithm, which comprises the following steps: s1, extracting ROI feature description data of a medical image; s2, reading in ROI feature description data of the extracted medical image, and obtaining a plurality of clustering results under a multi-task framework by optimizing a plurality of clustering internal indexes by using NMP clustering rules; s3, selecting an optimal result from the results by utilizing expert knowledge of doctors. The invention can fully express the connotation of the medical image, optimize one population to obtain a plurality of clustering results, and converge to global optimum more easily through cross-domain communication, so that the clustering effect is more obvious.

Description

Medical image clustering method based on multitasking evolutionary algorithm
Technical Field
The invention relates to the two fields of medical technology and intelligent computation, in particular to a medical image clustering method based on a multitasking evolution algorithm.
Background
In recent 20 years, medical imaging technology has become one of the areas of rapid development in medical technology, and medical images are becoming easier to acquire and store. Among the image data, medical image data occupies a large proportion. In medical systems, medical images play an important role. By observing the medical image, the lesion part is judged more directly and more clearly, so that a more accurate diagnosis result is obtained. The medical image clustering is accurately carried out, so that scientific reference can be better provided when medical staff judges and diagnoses the etiology of the illness, thereby greatly reducing the misdiagnosis rate caused by insufficient eyesight resolution of human beings or subjectively insufficient clinical experience of medical staff, and further improving the utilization rate of medical images.
At present, for medical image clustering, although traditional clustering algorithms are transplanted to medical images at home and abroad, such as a K-means-based method and a plurality of varieties thereof, the methods have the defects of sensitivity to an initial center, sensitivity to K values, sensitivity to density-based methods, sensitivity to field radius and MinPtr values, larger influence of noise and lack of accuracy of grid-based methods.
In addition, an evolution algorithm is used for clustering, and genetic algorithm or differential algorithm based on the like are commonly used, but the clustering is based on a single-task framework, and only a single target can be optimized in one operation, so that a single optimization result is obtained.
Also, not every pixel in a medical image is worth noting by medical personnel, who may be more interested in distinctive pixel areas, referred to as ROI (region of interest). In previous image studies, researchers have extracted ROIs based on features of conventional images such as color, texture, and shape. But this is not applicable to medical images. Medical images have many characteristics compared to common image data.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provide a medical image clustering method based on a multi-task evolutionary algorithm, which can more fully express the connotation of medical images, simultaneously optimize a population to obtain a plurality of clustering results, more easily converge to global optimum through cross-domain communication and has more obvious clustering effect,
in order to achieve the above purpose, the technical scheme provided by the invention is as follows:
the medical image clustering method based on the multitasking evolution algorithm comprises the following steps:
s1, extracting ROI feature description data of a medical image;
s2, reading in ROI feature description data of the extracted medical image, and obtaining a plurality of clustering results under a multi-task framework by optimizing a plurality of clustering internal indexes by using NMP clustering rules;
s3, selecting an optimal result from the results by utilizing expert knowledge of doctors.
Further, the ROI characterization data of the medical image extracted in the step S1 includes a relative gray scale S1, a relative area S2, a relative centroid coordinate S3, a circularity S4, an angle S5, and a symmetry S6.
Further, the specific process of extracting ROI feature description data of the medical image in step S1 is as follows:
s1-1, reading in a medical image;
s1-2, scanning a medical image to obtain the maximum value of the pixel number, the length, the width and the gray level of the image;
s1-3, detecting symmetry S6 of the medical image;
s1-4, extracting an ROI region of the medical image according to the gray scale range;
s1-5, extracting gray average values, the number of pixels, the longest axis, the shortest axis, centroid coordinates and angles from the ROI area obtained in the step S1-4;
s1-6, calculating feature description data of the ROI area.
Further, the step S1-3 is to detect the symmetry S6 of the medical image:
folding the medical image to a difference, performing binarization processing by using a gray threshold value, and judging that the medical image is symmetrical if the residual pixel points are less than a set value, otherwise, judging that the medical image is asymmetrical;
the specific process of calculating the feature description data of the ROI area in the step S1-6 is as follows:
relative gray s1:wherein roi.gray is the average gray of the ROI area, image.gray is the average gray of the whole image;
relative area s2: s2=roi.area/IMAGE.area, ROI.area is the number of pixels of the ROI area, and image.area is the number of pixels of the whole image;
relative centroid coordinates s3: s3= (ROI. X/IMAGE.length, ROI.y/image. Height), ROI. X is the abscissa of the ROI centroid, image. Length is the length of the original image, ROI. Y is the ordinate of the ROI centroid, image. Height is the height of the original image;
rounding s4: s4=4×pi×roi area/roi. Peripheral 2 Roi.area is the number of pixels of the ROI area, and roi.perimeter is the number of pixels around the ROI area;
angle s5: s5= (orientation+90)/180, orientation is the angle from the long axis to the X axis of the ROI.
Further, the specific process of step S2 is as follows:
s2-1, reading in ROI feature description data of an extracted medical image;
s2-2, initializing a population and the maximum iteration number n, wherein k=0;
s2-3, clustering is executed by combining NMP clustering rules, and the clustering index is used for evaluation;
s2-4, calculating the skill factor tau of each individual;
s2-5, generating offspring; randomly selecting two individuals a and b from the population as father, generating a random number rand which is more than 0 and less than 1, if rand is less than an algorithm parameter rmp or the skill factors tau of the two individuals are equal, executing simulated binary crossover, otherwise, executing mutation on the two individuals respectively; repeating the steps of hybridization or mutation until the number of the obtained filial generation is equal to the number of the population individuals, and entering the step S2-6;
s2-6, calculating the adaptive value of the generated offspring under each clustering index optimization task;
s2-7, combining the father and the filial generation to form a new population, and recalculating and updating skill factors tau and scalar adaptation values phi of all individuals according to the adaptation values after clustering;
s2-8, sequencing individuals in the population according to a scalar adaptive value phi, and then sequentially selecting the individuals from top to bottom to enter the next generation population, wherein individuals with larger phi are preferentially selected into the next generation population; k=k+1;
s2-9, if k is less than n, returning to the step S2-3, otherwise stopping iteration;
and S2-10, finding out an individual with the skill factor tau equal to 1, and recording the individual as the optimal individual of each task at present.
Further, in the step S2-2, the population is initialized by the following codes:
setting the maximum value K of the clustering class number max Individuals of the population are encoded as K max +K max * Vector of d dimensiond is the number of data dimensions, m ij Is the coordinate vector of the cluster center, T ij (j=1,...,K max ) To obtain an activation threshold of the class centroid point;
the definition of the centroid point obtained by activation is specifically as follows:
if T ij Greater than 0.5, the centroid point m of the corresponding cluster ij Activated, otherwise not activated; if T ij And if the centroid number is larger than 1 or is a negative number, resetting to 1 or 0, and if the obtained centroid number is smaller than the set minimum category number, randomly selecting a plurality of activation thresholds to activate so as to meet the requirement of the minimum category number.
Further, in step S2-3, the NMP clustering rule is specifically as follows:
given a data set of number N, the data is represented as x= (X) 1 ,...,x N ) The centroid of K clusters is c= { C 1 ,...,C K And D represents distance, then sample point x in NMP rule i = (i=0,., N) and a certain cluster category C h (h=1,., K) is defined as follows:
D(x i ,C h )=min{D(x i ,x j ),D(x i ,m h )|x j ∈C h }
i.e., the distance from the sample to the cluster category is the distance from the sample to the smallest distance among the points of the cluster;
each sample is assigned to the nearest cluster, all samples assigned to the same cluster forming a candidate sample set, this cluster being referred to as an undetermined cluster of the samples; then, for each cluster, selecting a nearest sample from the candidate sample set, such nearest sample points of all clusters being combined to be referred to as a nearest sample set; finally, finding a sample in the nearest sample set, wherein the distance between the sample and the undetermined cluster where the sample is located is the smallest in the nearest sample set, and then distributing the sample into the undetermined clusters; the above steps are repeated until all samples have been dispensed.
Further, in step S2-3, the cluster indexes include CH indexes, dunn indexes, and SIL indexes, which respectively correspond to one optimization task; the indexes are specifically as follows:
CH index:
in the above-mentioned method, the step of,a trace representing an inter-class dispersion matrix, m representing an average vector of the entire dataset;
dunn index:
in the above, D (C i ,C j ) Representing the distance between the different categories as the distance between the two closest data points, the formula is as follows:δ(C i ) Distance between two furthest points for this category:
SIL index:
in the above, s j =(b j -a j )/max(a j ,b j ) Representing data point x j Is a contour width of (2); data point x j Average distance a to other data points of the class to which it belongs j And minimum distance b to other classes of data points j The calculation formula of (2) is as follows:
further, in step S2-5, the operation of the analog binary crossover is as follows:
is provided with two father generations x a =[x a (1),...,x a (d)]And x b =[x b (1),...,x b (d)]D is the dimension of the data, and the distribution factor c (j) is calculated first:
wherein, beta is a system parameter larger than 0, and r is a random number larger than 0 and smaller than 1 in each dimension;
the obtained sub-generations are:
x e (j)=[(1+c(j))x a (j)+(1-c(j))x b (j)]/2
x f (j)=[(1+c(j))x b (j)+(1-c(j))x a (j)]/2。
compared with the prior art, the scheme has the following principle and advantages:
1. through a multi-factor evolution algorithm, a population is optimized simultaneously aiming at internal indexes of clusters such as CH indexes, dunn indexes and SIL indexes to obtain a plurality of clustering results, and cross-domain communication is easier to converge to global optimum.
2. The traditional image feature mining method based on color, texture and shape often ignores information borne by the unique attribute of the medical image ROI, the ROI attribute used by the method is relative gray scale, relative area, relative centroid coordinates, circularity, angle and symmetry, and the characteristic feature of the medical image is considered when the traditional image feature is considered.
3. Based on the clustering rules of NMP (nearest multiple prototypes): when the traditional particle swarm optimization mode is applied to cluster optimization, the effect is poor for the problem of non-circular cluster of the cluster shape. Also, the distance-based clustering rule, NMP rule is a dynamic process, more flexible and generally better clustering effect.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions of the prior art, the services required in the embodiments or the description of the prior art will be briefly described below, and it is obvious that the figures in the following description are only some embodiments of the present invention, and that other figures can be obtained according to these figures without inventive effort to a person skilled in the art.
FIG. 1 is an overall flow chart of a medical image clustering method based on a multitasking evolutionary algorithm of the present invention;
FIG. 2 is a flow chart of ROI data extraction from a medical image;
FIG. 3 is a schematic diagram of a multitasking cluster evolution algorithm code.
Detailed Description
The invention is further illustrated by the following examples:
as shown in fig. 1, the medical image clustering method based on the multitasking evolution algorithm according to the embodiment includes the following steps:
s1, extracting ROI characteristic description data of a medical image, wherein the ROI characteristic description data comprise relative gray scale S1, relative area S2, relative centroid coordinates S3, circularity S4, angle S5 and symmetry S6.
As shown in fig. 2, the specific process of extraction is as follows:
s1-1, reading in a medical image;
s1-2, scanning a medical image to obtain the maximum value of the pixel number, the length, the width and the gray level of the image;
s1-3, detecting symmetry S6 of the medical image:
folding the medical image to a difference, performing binarization processing by using a gray threshold value, and judging that the medical image is symmetrical if the residual pixel points are less than a set value, otherwise, judging that the medical image is asymmetrical;
s1-4, extracting an ROI region of the medical image according to the gray scale range;
s1-5, extracting gray average values, the number of pixels, the longest axis, the shortest axis, centroid coordinates and angles from the ROI area obtained in the step S1-4;
s1-6, calculating feature description data of the ROI area:
relative gray s1:wherein roi.gray is the average gray of the ROI area, image.gray is the average gray of the whole image;
relative area s2: s2=roi.area/IMAGE.area, ROI.area is the number of pixels of the ROI area, and image.area is the number of pixels of the whole image;
relative centroid coordinates s3: s3= (ROI. X/IMAGE.length, ROI.y/image. Height), ROI. X is the abscissa of the ROI centroid, image. Length is the length of the original image, ROI. Y is the ordinate of the ROI centroid, image. Height is the height of the original image;
rounding s4: s4=4×pi×roi area/roi. Peripheral 2 Roi.area is the number of pixels of the ROI area, and roi.perimeter is the number of pixels around the ROI area;
angle s5: s5= (orientation+90)/180, orientation is the angle from the long axis to the X axis of the ROI.
S2, after ROI feature description data of the medical image are extracted, data are read in, and a plurality of clustering results are obtained under a multi-task frame by optimizing a plurality of clustering internal indexes through NMP clustering rules; the specific process is as follows:
s2-1, reading in ROI feature description data of an extracted medical image;
s2-2, initializing a population and the maximum iteration number n, wherein k=0; wherein, this step completes initialization by the following code:
setting the maximum value K of the clustering class number max Individuals of the population are encoded as K max +K max * Vector of d dimensiond is the number of data dimensions, m ij Is the coordinate vector of the cluster center, T ij (j=1,...,K max ) To obtain an activation threshold of the class centroid point;
the definition of the centroid point obtained by activation is specifically as follows:
if T ij Greater than 0.5, the centroid point m of the corresponding cluster ij Activated, otherwise not activated; if T ij And if the centroid number is larger than 1 or is a negative number, resetting to 1 or 0, and if the obtained centroid number is smaller than the set minimum category number, randomly selecting a plurality of activation thresholds to activate so as to meet the requirement of the minimum category number. As shown in the accompanying drawingsIn fig. 3, one dimension d is 2, the maximum clustering category number Kmax is 4, the front is the activation threshold, the second position 0.4 is less than 0.5, so that the corresponding second centroid is in an inactive state, and the threshold of other positions is greater than 0.5, so that the individual is activated. Thus, the number of clusters for this individual is 3. In practice, d is set to 6.
S2-3, clustering is executed by combining NMP clustering rules, and the clustering index is used for evaluation;
the NMP clustering rule in the step is specifically as follows:
given a data set of number N, the data is represented as x= (X) 1 ,...,x N ) The centroid of K clusters is c= { C 1 ,...,C K And D represents distance, then sample point x in NMP rule i = (i=0,., N) and a certain cluster category C h (h=1,., K) is defined as follows:
D(x i ,C h )=min{D(x i ,x j ),D(x i ,m h )|x j ∈C h }
i.e., the distance from the sample to the cluster category is the distance from the sample to the smallest distance among the points of the cluster;
each sample is assigned to the nearest cluster, all samples assigned to the same cluster forming a candidate sample set, this cluster being referred to as an undetermined cluster of the samples; then, for each cluster, selecting a nearest sample from the candidate sample set, such nearest sample points of all clusters being combined to be referred to as a nearest sample set; finally, finding a sample in the nearest sample set, wherein the distance between the sample and the undetermined cluster where the sample is located is the smallest in the nearest sample set, and then distributing the sample into the undetermined clusters; the above steps are repeated until all samples have been dispensed.
The clustering indexes comprise CH indexes, dunn indexes and SIL indexes, and correspond to one optimization task respectively; the indexes are specifically as follows:
CH index:
in the above-mentioned method, the step of,a trace representing an inter-class dispersion matrix, m representing an average vector of the entire dataset;
dunn index:
in the above, D (C i ,C j ) Representing the distance between the different categories as the distance between the two closest data points, the formula is as follows:δ(C i ) Distance between two furthest points for this category:
SIL index:
in the above, s j =(b j -a j )/max(a j ,b j ) Representing data point x j Is a contour width of (2); data point x j Average distance a to other data points of the class to which it belongs j And minimum distance b to other classes of data points j The calculation formula of (2) is as follows:
s2-4, calculating the skill factor tau of each individual, and recording the optimal task which is best represented by the single individual i in all tasks. If individual i is the forefront in the individual ranking of the jth task, τ i =j. When the skill factor is calculated, sequencing all individuals in the population according to the size of the adaptive value under a certain task, and then obtaining a corresponding serial number in the sequence as a certain individual i; at the factorial level r corresponding to task j i j
S2-5, generating offspring; randomly selecting two individuals a and b from the population as father, generating a random number rand which is more than 0 and less than 1, if rand is less than an algorithm parameter rmp (random mating probability) or the skill factors tau of the two individuals are equal, executing simulated binary crossover, otherwise, executing mutation on the two individuals respectively; repeating the steps of hybridization or mutation until the number of the obtained filial generation is equal to the number of the population individuals, and entering the step S2-6;
wherein the operation of simulating binary interleaving is as follows:
is provided with two father generations x a =[x a (1),...,x a (d)]And x b =[x b (1),...,x b (d)]D is the dimension of the data, and a distribution factor (scatter factor) c (j) is calculated first:
wherein, beta is a system parameter larger than 0, and r is a random number larger than 0 and smaller than 1 in each dimension;
the obtained sub-generations are:
x e (j)=[(1+c(j))x a (j)+(1-c(j))x b (j)]/2
x f (j)=[(1+c(j))x b (j)+(1-c(j))x a (j)]/2。
s2-6, calculating the adaptive value of the generated offspring under each clustering index optimization task;
s2-7, combining the father and the filial generation to form a new population, and recalculating and updating skill factors tau and scalar adaptation values phi of all individuals according to the adaptation values after clustering; if the order of the ith individual in all K tasks is respectivelyThen the scalar fitness value of the individual is +.>I.e. the individual scalar adaptation value is determined by its factorial level in the task that performs best;
s2-8, sequencing individuals in the population according to a scalar adaptive value phi, and then sequentially selecting the individuals from top to bottom to enter the next generation population, wherein individuals with larger phi are preferentially selected into the next generation population; k=k+1;
s2-9, if k is less than n, returning to the step S2-3, otherwise stopping iteration;
and S2-10, finding out an individual with the skill factor tau equal to 1, and recording the individual as the optimal individual of each task at present.
S3, obtaining 3 clustering results after the step S2 is finished, and selecting an optimal result from the clustering results by using expert knowledge of doctors.
In the embodiment, the optimization of different indexes can be regarded as different tasks through multi-task clustering, the optimal individuals under each task are respectively found, and expert knowledge is used for finding out the clustering index which is most suitable for the medical images. The clustering index can select an index suitable for actual conditions so as to achieve a better effect. The method utilizes the migration process of multi-task learning, the communication among different learning tasks is favorable for breaking through the local optimum to be closer to the global optimum, and the optimization speed is more efficient than the optimization speed of the same method for running different optimization targets successively for a plurality of times.
The above embodiments are only preferred embodiments of the present invention, and are not intended to limit the scope of the present invention, so variations in shape and principles of the present invention should be covered.

Claims (7)

1. The medical image clustering method based on the multitasking evolution algorithm is characterized by comprising the following steps of:
s1, extracting ROI feature description data of a medical image;
s2, reading in ROI feature description data of the extracted medical image, and obtaining a plurality of clustering results under a multi-task framework by optimizing a plurality of clustering internal indexes by using NMP clustering rules;
s3, selecting an optimal result from the expert knowledge of a doctor;
the specific process of the step S2 is as follows:
s2-1, reading in ROI feature description data of an extracted medical image;
s2-2, initializing a population and the maximum iteration number n, wherein the current iteration number k=0;
s2-3, clustering is executed by combining NMP clustering rules, and the clustering index is used for evaluation;
s2-4, calculating the skill factor tau of each individual;
s2-5, generating offspring; randomly selecting two individuals a and b from the population as father, generating a random number rand which is more than 0 and less than 1, if rand is less than an algorithm parameter rmp or the skill factors tau of the two individuals are equal, executing simulated binary crossover, otherwise, executing mutation on the two individuals respectively; repeating the steps of hybridization or mutation until the number of the obtained filial generation is equal to the number of the population individuals, and entering the step S2-6;
s2-6, calculating the adaptive value of the generated offspring under each clustering index optimization task;
s2-7, combining the father and the filial generation to form a new population, and recalculating and updating skill factors tau and scalar adaptation values phi of all individuals according to the adaptation values after clustering;
s2-8, sequencing individuals in the population according to a scalar adaptive value phi, and then sequentially selecting the individuals from top to bottom to enter the next generation population, wherein individuals with larger phi are preferentially selected into the next generation population; k=k+1;
s2-9, if k is less than n, returning to the step S2-3, otherwise stopping iteration;
s2-10, finding out an individual with a skill factor tau equal to 1, and recording the individual as the optimal individual of each task at present;
in the step S2-3, the clustering indexes comprise CH indexes, dunn indexes and SIL indexes, and respectively correspond to one optimization task; the indexes are specifically as follows:
CH index:
in the above-mentioned method, the step of,representing an inter-class dispersion matrix S B M represents the average value vector of the whole data set, and K is the number of clustering categories;
dunn index:
in the above, D (C i ,C j ) Representing the distance between the different categories as two closest data points C i And C j The distance between them is expressed as follows:δ(C i ) Distance between two furthest points for this category:
SIL index:
in the above, s j =(b j -a j )/max(a j ,b j ) Representing data point x j Is a contour width of (2); data point x j Average distance a to other data points of the class to which it belongs j And minimum distance b to other classes of data points j The calculation formula of (2) is as follows:
2. the medical image clustering method based on the multitasking evolution algorithm according to claim 1, wherein the ROI feature description data of the medical image extracted in the step S1 includes a relative gray scale S1, a relative area S2, a relative centroid coordinate S3, a circularity S4, an angle S5 and a symmetry S6.
3. The medical image clustering method based on the multitasking evolution algorithm according to claim 2, wherein the specific process of extracting ROI feature description data of the medical image in step S1 is as follows:
s1-1, reading in a medical image;
s1-2, scanning a medical image to obtain the maximum value of the pixel number, the length, the width and the gray level of the image;
s1-3, detecting symmetry S6 of the medical image;
s1-4, extracting an ROI region of the medical image according to the gray scale range;
s1-5, extracting gray average values, the number of pixels, the longest axis, the shortest axis, centroid coordinates and angles from the ROI area obtained in the step S1-4;
s1-6, calculating feature description data of the ROI area.
4. The medical image clustering method based on the multitasking evolution algorithm according to claim 3, wherein the step S1-3 of detecting the medical image symmetry S6 is:
folding the medical image to a difference, performing binarization processing by using a gray threshold value, and judging that the medical image is symmetrical if the residual pixel points are less than a set value, otherwise, judging that the medical image is asymmetrical;
the specific process of calculating the feature description data of the ROI area in the step S1-6 is as follows:
relative gray s1:wherein roi.gray is the average gray of the ROI area, image.gray is the average gray of the whole image;
relative area s2: s2=roi.area/IMAGE.area, ROI.area is the number of pixels of the ROI area, and image.area is the number of pixels of the whole image;
relative centroid coordinates s3: s3= (ROI. X/IMAGE.length, ROI.y/image. Height), ROI. X is the abscissa of the ROI centroid, image. Length is the length of the original image, ROI. Y is the ordinate of the ROI centroid, image. Height is the height of the original image;
rounding s4: s4=4×pi×roi area/roi. Peripheral 2 Roi.area is the number of pixels of the ROI area, and roi.perimeter is the number of pixels around the ROI area;
angle s5: s5= (orientation+90)/180, orientation is the angle from the long axis to the X axis of the ROI.
5. The medical image clustering method based on the multitasking evolution algorithm according to claim 1, wherein in the step S2-2, the population is initialized by the following codes:
setting the maximum value K of the clustering class number max Individuals of the population are encoded as K max +K max * Vector of d dimensiond is the number of data dimensions, m ij Is the coordinate vector of the cluster center, T ij To get an activation threshold for a class centroid point, j=1 max
The definition of the centroid point obtained by activation is specifically as follows:
if T ij Greater than 0.5, the centroid point m of the corresponding cluster ij Activated, otherwise not activated; if T ij And if the centroid number is larger than 1 or is a negative number, resetting to 1 or 0, and if the obtained centroid number is smaller than the set minimum category number, randomly selecting a plurality of activation thresholds to activate so as to meet the requirement of the minimum category number.
6. The medical image clustering method based on the multitasking evolution algorithm according to claim 1, wherein in step S2-3, the NMP clustering rule is specifically as follows:
given a data set of number N, the data is represented as x= (X) 1 ,...,x N ) The centroid of K clusters is c= { C 1 ,...,C K And D represents distance, then sample point x in NMP rule i = (i=0,., N) and a certain cluster category C h (h=1,., K) is defined as follows:
D(x i ,C h )=min{D(x i ,x j ),D(x i ,m h )|x j ∈C h }
i.e., the distance from the sample to the cluster category is the distance from the sample to the smallest distance among the points of the cluster;
each sample is assigned to the nearest cluster, all samples assigned to the same cluster forming a candidate sample set, this cluster being referred to as an undetermined cluster of the samples; then, for each cluster, selecting a nearest sample from the candidate sample set, such nearest sample points of all clusters being combined to be referred to as a nearest sample set; finally, finding a sample in the nearest sample set, wherein the distance between the sample and the undetermined cluster where the sample is located is the smallest in the nearest sample set, and then distributing the sample into the undetermined clusters; the above steps are repeated until all samples have been dispensed.
7. The medical image clustering method based on a multitasking evolution algorithm according to claim 1, characterized in that in step S2-5 the operation of simulating a binary intersection is as follows:
is provided with two father generations x a =[x a (1),...,x a (d)]And x b =[x b (1),...,x b (d)]D is the dimension of the data, and the distribution factor c (j) is calculated first:
wherein, beta is a system parameter larger than 0, and r is a random number larger than 0 and smaller than 1 in each dimension;
the obtained sub-generations are:
x e (j)=[(1+c(j))x a (j)+(1-c(j))x b (j)]/2
x f (j)=[(1+c(j))x b (j)+(1-c(j))x a (j)]/2。
CN202010364563.4A 2020-04-30 2020-04-30 Medical image clustering method based on multitasking evolutionary algorithm Active CN111563549B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010364563.4A CN111563549B (en) 2020-04-30 2020-04-30 Medical image clustering method based on multitasking evolutionary algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010364563.4A CN111563549B (en) 2020-04-30 2020-04-30 Medical image clustering method based on multitasking evolutionary algorithm

Publications (2)

Publication Number Publication Date
CN111563549A CN111563549A (en) 2020-08-21
CN111563549B true CN111563549B (en) 2023-07-28

Family

ID=72070695

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010364563.4A Active CN111563549B (en) 2020-04-30 2020-04-30 Medical image clustering method based on multitasking evolutionary algorithm

Country Status (1)

Country Link
CN (1) CN111563549B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115222007B (en) * 2022-05-31 2023-06-20 复旦大学 Improved particle swarm parameter optimization method for colloid rumen multitasking integrated network
CN115204323B (en) * 2022-09-16 2022-12-02 华智生物技术有限公司 Seed multi-feature based clustering and synthesis method, system, device and medium
CN115346665B (en) * 2022-10-19 2023-03-10 南昌大学第二附属医院 Method, system and equipment for constructing retinopathy incidence risk prediction model

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110458859A (en) * 2019-07-01 2019-11-15 南开大学 A kind of segmenting system of the myelomatosis multiplex stove based on multisequencing MRI
CN110991518A (en) * 2019-11-28 2020-04-10 山东大学 Two-stage feature selection method and system based on evolution multitask

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101499136B (en) * 2009-03-05 2011-06-01 西安电子科技大学 Image over-segmenting optimization method based on multi-target evolution clustering and spatial information
CN102567963A (en) * 2011-11-10 2012-07-11 西安电子科技大学 Quantum multi-target clustering-based remote sensing image segmentation method
CN104156945B (en) * 2014-07-16 2017-08-25 西安电子科技大学 Gray-scale image segmentation method based on multi-objective particle swarm algorithm
EP3273387B1 (en) * 2016-07-19 2024-05-15 Siemens Healthineers AG Medical image segmentation with a multi-task neural network system
CN106570873B (en) * 2016-11-08 2019-10-25 江苏大学 A kind of medical image cutting method
CN106886467B (en) * 2017-02-24 2019-02-01 电子科技大学 Preferred method is tested based on the multitask of grouping-synthesis multi-target evolution
US10489690B2 (en) * 2017-10-24 2019-11-26 International Business Machines Corporation Emotion classification based on expression variations associated with same or similar emotions
US11442978B2 (en) * 2018-03-01 2022-09-13 King Fahd University Of Petroleum And Minerals Heuristic for the data clustering problem
CN110136828A (en) * 2019-05-16 2019-08-16 杭州健培科技有限公司 A method of medical image multitask auxiliary diagnosis is realized based on deep learning

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110458859A (en) * 2019-07-01 2019-11-15 南开大学 A kind of segmenting system of the myelomatosis multiplex stove based on multisequencing MRI
CN110991518A (en) * 2019-11-28 2020-04-10 山东大学 Two-stage feature selection method and system based on evolution multitask

Also Published As

Publication number Publication date
CN111563549A (en) 2020-08-21

Similar Documents

Publication Publication Date Title
CN111563549B (en) Medical image clustering method based on multitasking evolutionary algorithm
CN108171209B (en) Face age estimation method for metric learning based on convolutional neural network
CN112288706B (en) Automatic chromosome karyotype analysis and abnormality detection method
CN109994200B (en) Multi-group cancer data integration analysis method based on similarity fusion
Geng et al. Automatic tracking, feature extraction and classification of C. elegans phenotypes
CN106228185B (en) A kind of general image classifying and identifying system neural network based and method
CN108122236B (en) Iterative fundus image blood vessel segmentation method based on distance modulation loss
CN106295124B (en) The method of a variety of image detecting technique comprehensive analysis gene subgraph likelihood probability amounts
WO2021062904A1 (en) Tmb classification method and system based on pathological image, and tmb analysis device based on pathological image
CN106778687B (en) Fixation point detection method based on local evaluation and global optimization
CN108389211B (en) Image segmentation method based on improved whale optimized fuzzy clustering
CN109871875B (en) Building change detection method based on deep learning
CN106650314A (en) Method and system for predicting amino acid mutation
WO2015173435A1 (en) Method for predicting a phenotype from a genotype
CN111009005A (en) Scene classification point cloud rough registration method combining geometric information and photometric information
CN112802054A (en) Mixed Gaussian model foreground detection method fusing image segmentation
CN112215847A (en) Method for automatically segmenting overlapped chromosomes based on counterstudy multi-scale features
CN112084877A (en) NSGA-NET-based remote sensing image identification method
CN113362277A (en) Workpiece surface defect detection and segmentation method based on deep learning
Vigdor et al. Accurate and fast off and online fuzzy ARTMAP-based image classification with application to genetic abnormality diagnosis
CN113764034B (en) Method, device, equipment and medium for predicting potential BGC in genome sequence
CN113408573B (en) Method and device for automatically classifying and classifying tile color numbers based on machine learning
CN114187288A (en) COVID-19 detection method based on cooperative deep learning and lung CT image
KR102572437B1 (en) Apparatus and method for determining optimized learning model based on genetic algorithm
CN117708628A (en) Spatial domain identification method in spatial transcriptome based on map deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant