WO2021082444A1

WO2021082444A1 - Multi-granulation spark-based super-trust fuzzy method for large-scale brain medical record segmentation

Info

Publication number: WO2021082444A1
Application number: PCT/CN2020/094104
Authority: WO
Inventors: 丁卫平; 丁嘉陆; 王杰华; 胡彬; 陈森博; 万杰; 赵理莉; 孙颖; 冯志豪; 李铭; 任龙杰; 丁帅荣
Original assignee: 南通大学
Priority date: 2019-10-28
Filing date: 2020-06-03
Publication date: 2021-05-06
Also published as: CN110867224A; AU2020286320B2; CN110867224B; AU2020286320A1

Abstract

A multi-granularity Spark-based super-trust fuzzy method for large-scale brain medical record segmentation, comprising: first, dividing the attribute set of large-scale brain medical record data into different multi-granularity evolutionary sub-populations Granu-population_i on a Spark cloud platform; designing a multi-granularity Spark-based super-trust model to construct trust between different super elites in the multi-granularity populations; adjusting a multi-granularity center threshold and dynamically updating the super elites using a multi-granularity sub-population balance adjustment strategy; performing global search segmentation and local refinement segmentation on the large-scale brain medical records, wherein the super elites collaboratively extract knowledge reduction subsets in their respective regions; and finally, obtaining the optimal segmentation feature set of the large-scale brain medical records and storing it on the Spark cloud platform. The method enables stable segmentation of large-scale brain medical record knowledge reduction sets and provides an important diagnostic basis for intelligent diagnosis and auxiliary treatment of brain diseases.

Description

Multi-granular Spark super-trust fuzzy method for large-scale brain medical record segmentation

Technical field:

The present invention relates to the field of medical information, in particular to a multi-granularity Spark super-trust fuzzy method for large-scale brain medical record segmentation.

Background art:

The medical and health service big data project requires not only the construction of electronic health record and electronic medical record databases, but also a big data application system for medical and health management and services that covers public health, medical services, medical security, drug supply, family planning, and integrated management. With existing medical resources, achieving the goals of this project requires making full use of information technologies such as big data, cloud computing, and the mobile Internet to promote effective interoperability and constructive interaction between electronic medical record databases and electronic health record databases.

With the advent of the cloud computing and big data era, the intelligent processing of large-scale electronic medical records has become extremely complicated throughout the generation and use of medical big data. The medical data stored in electronic medical record systems is large in volume, scattered in source, diverse in format, fast to access, and high in application value. Using artificial intelligence and data mining techniques to discover and extract important medical diagnosis rules and knowledge from large-scale electronic medical records is the key to building a clinical decision support system. However, because the electronic medical record system is a special kind of medical information system, the data it stores is massive, diverse, incomplete, and time-sensitive, which makes feature selection, collaborative services, knowledge discovery, and clinical decision support very difficult. Effectively processing complex, large-scale electronic medical records is therefore the key to designing future-oriented medical and health service big data projects and clinical intelligent decision analysis systems. Adopting efficient models and methods for knowledge reduction of complex medical records, matched to the characteristics of large-scale electronic medical record systems, is the direction of future development.

Using artificial intelligence and big data processing methods to automatically segment brain attributes from large-scale brain medical records and to discover latent medical patterns plays an important role in the prevention, control, and treatment of brain diseases. The large-scale brain medical record segmentation problem arises widely in research on brain medical record feature selection, rule mining, and clinical decision support systems, and is a core technology for the intelligent application of brain medical records in the context of medical big data. There is therefore an urgent need for effective methods, in a cloud computing environment, that solve the large-scale brain medical record segmentation problem and further improve the intelligent processing and service model for massive brain medical records. This is a key problem that urgently needs to be solved in current research on intelligent auxiliary diagnosis and treatment and on clinical decision support systems for brain medical records, and it is also a challenging research topic in the field. However, because large-scale brain medical records are highly incomplete and ambiguous, the unreliability of their data is more pronounced and its uncertainty more evident, which greatly limits the applicability of traditional attribute segmentation methods. Proposing an effective segmentation method tailored to the characteristics of large-scale brain medical records in the medical big data environment, one that achieves an optimal and consistent balance between global search reduction and locally refined collaborative knowledge reduction, is therefore of great significance and value for supporting decision analysis on large-scale brain medical records.

The present invention discloses a multi-granularity Spark super-trust fuzzy method for large-scale brain medical record segmentation. First, the attribute set of the large-scale brain medical record data is divided into different multi-granularity evolutionary sub-populations Granu-population_i on the Spark cloud platform. A super-trust model based on multi-granularity Spark is then designed to build trust among the different super elites in the multi-granularity populations. The multi-granularity center threshold is adjusted, and the super elites are dynamically updated using a multi-granularity sub-population balance adjustment strategy. Global search segmentation and local refinement segmentation are performed on the large-scale brain medical records, with the super elites collaboratively extracting knowledge reduction subsets in their respective regions. Finally, the optimal segmentation feature set of the large-scale brain medical records is obtained and stored on the Spark cloud platform. The invention can stably segment large-scale brain medical record knowledge reduction sets and provides an important diagnostic basis for the intelligent diagnosis and auxiliary treatment of brain diseases.
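As a rough illustration of the first stage, the sketch below splits an attribute set into n groups (one per sub-population) and computes, for each group, the equivalence classes that step A of the detailed description refers to. It is a minimal sketch under stated assumptions, not the patented implementation: the round-robin split, the function names `split_attributes` and `equivalence_classes`, and the toy records are all hypothetical, and plain Python stands in for the Spark jobs that would process each group in parallel.

```python
from collections import defaultdict

def split_attributes(attributes, n):
    """Round-robin split of attribute names into n groups (an assumed scheme;
    the source does not say how attributes are assigned to sub-populations)."""
    return [attributes[i::n] for i in range(n)]

def equivalence_classes(records, attrs):
    """Group record indices that agree on every attribute in attrs.

    Two records fall in the same class when they are indiscernible
    with respect to the chosen attribute subset.
    """
    classes = defaultdict(list)
    for idx, rec in enumerate(records):
        key = tuple(rec[a] for a in attrs)
        classes[key].append(idx)
    return sorted(classes.values())

# Hypothetical toy records; real brain medical records would be far larger.
records = [
    {"headache": "yes", "nausea": "no", "age_band": "40-60", "lesion": "left"},
    {"headache": "yes", "nausea": "no", "age_band": "20-40", "lesion": "left"},
    {"headache": "no", "nausea": "yes", "age_band": "40-60", "lesion": "none"},
    {"headache": "yes", "nausea": "no", "age_band": "40-60", "lesion": "left"},
]
attributes = ["headache", "nausea", "age_band", "lesion"]

groups = split_attributes(attributes, 2)
for group in groups:
    # Each group would be one parallel job task on the cluster.
    print(group, equivalence_classes(records, group))
```

Each attribute group is independent of the others, which is what makes the per-group equivalence-class computation a natural unit of parallel work.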

The further improvement of the present invention lies in: the specific steps of step B are as follows:

a. Set the number of multi-granularity populations to n, with n ≥ 2, and initialize the multi-granularity populations as GP_h, h ∈ {1,...,n};

b. Initialize the center of the first granularity population as

Then initialize the center of the second granularity population as

Make it a priority for the super elite

c. For the 3rd and above multi-granularity population centers

Calculate current elite priority

The minimum distance from the centers of all current granularity populations is calculated as follows:

Assign the minimum distance to the u-th multi-granularity population center

Repeat this process until all n multi-granularity evolutionary populations are initialized;

d. The trust degree of the i-th super elite in the same granular subpopulation is defined as follows:

where n is the total number of elites, SP_i is the i-th super elite, and P_ij is the j-th ordinary elite in the i-th multi-granularity population;

e. For the i-th super elite SP_i at the h-th multi-granularity population center

the trust degree R_i is calculated iteratively as follows:

Where i∈{2,...,N},

f. For the multi-granularity population center

let the number of similarity iterations in the current cycle be t, t ∈ {2,...,n-1}; the trust degree of each multi-granularity population center

is calculated from iteration t-1 of the previous round, so that the scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces;

g. Calculate the trust deviation Diff_ij between the trust degrees of different super elites SP_i and SP_j in the multi-granularity population; the calculation formula is

where Re_ij is the credibility assigned by the i-th super elite to the j-th super elite, R_mj is the partial trust in the j-th super elite recommended by the m-th ordinary elite in the population, I(j) is the set of all elites in the j-th multi-granularity population GP_j, and |I(j)| is the cardinality of that set;

h. The population trust between the h-th multi-granularity population and the u-th multi-granularity population center is

calculated as follows:

where m is the number of iterations and

is the variation range of the two multi-granularity populations at the t-th iteration, calculated as

i. For the h-th multi-granularity population

if the condition

is satisfied, where ε ∈ [0,1] is the similarity threshold, then the multi-granularity population conforms to the sub-population trust relationship in the different granularity spaces;

j. Construct the trust relationship between different super elites in the multi-granularity population, defined as

where λ is the confidence factor of the direct trust between super elites. The value of λ depends on the number of interactions between the super elites: the more interactions, the larger λ, with 0 ≤ λ ≤ 1. We take λ = h/H_Lmt, where h is the number of interactions between super elite i and super elite j and H_Lmt is the preset threshold on the number of interactions. The scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces.
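The sub-steps above can be illustrated with a small sketch. The formulas themselves are not reproduced in this text, so everything beyond λ = h/H_Lmt is an assumption: `farthest_first_centers` reads sub-steps b and c as a max-min (farthest-first) initialization under Euclidean distance, `confidence_factor` clamps λ to [0, 1], and `combined_trust` assumes the trust relationship is a λ-weighted blend of a direct trust value and a recommended trust value. All three function names are hypothetical.

```python
import math

def farthest_first_centers(points, n):
    """Assumed reading of sub-steps b and c: start from the first point, then
    repeatedly add the point whose minimum distance to the chosen centers
    is largest, until n centers exist."""
    centers = [points[0]]
    while len(centers) < n:
        best = max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        centers.append(best)
    return centers

def confidence_factor(h, h_lmt):
    """lambda = h / H_Lmt as stated in the text; clamping to [0, 1] is an
    assumption made so that 0 <= lambda <= 1 also holds when h > H_Lmt."""
    if h_lmt <= 0:
        raise ValueError("H_Lmt must be positive")
    return max(0.0, min(1.0, h / h_lmt))

def combined_trust(direct, recommended, h, h_lmt):
    """Assumed form: lambda * direct trust + (1 - lambda) * recommended trust."""
    lam = confidence_factor(h, h_lmt)
    return lam * direct + (1.0 - lam) * recommended

# Toy example: three centers from four one-dimensional candidates.
print(farthest_first_centers([(0, 0), (1, 0), (10, 0), (5, 0)], 3))
# With few interactions, recommended trust carries most of the weight.
print(combined_trust(direct=0.9, recommended=0.5, h=3, h_lmt=10))
```

Under this assumed blend, a pair of super elites that has interacted often relies mainly on its own direct experience, while a pair with little history leans on recommendations from ordinary elites.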

A further improvement of the present invention lies in: the specific steps of step C are as follows:

a. Use the traditional clustering method k-means to initialize the multi-granularity center as

b. Initialize the multi-granularity sub-population cluster set and the center set as empty, V = ∅ and C = ∅, and set the iteration counter t = 1. Calculate the distance between each multi-granularity sub-population and each multi-granularity center, and assign the large-scale brain medical record attribute set to the corresponding multi-granularity centers by the minimum-distance principle, forming k

And record the number of super elites in each center

Set the initial adjustment label

c. Recalculate each multi-granularity center

and the initial displacement d(c_1i, c_0i) of each granularity center, where |V_i| denotes the number of members of the multi-granularity population V_i;

d. After the first iteration of the granularity sub-population, the distance between the granularity center c_1 and the initial granularity center c_0 is d(c_1, c_0); after the i-th iteration, the distance between the new granularity center c′ and the previous granularity center c is d(c, c′); if

where ε ∈ [0,1] is the similarity threshold, then the granularity center c′ no longer participates in the next round of iterative adjustment; otherwise iterative adjustment continues;

e. Calculate the distance between each super elite in the multi-granularity populations labeled f_tj = 1 and the multi-granularity population centers participating in the adjustment, assign the brain medical record attributes to the corresponding multi-granularity populations by the minimum-distance principle to form k new multi-granularity populations {V_tj}, record the number of super elites {N_tj} in each multi-granularity population, and compute the adjusted number of super elites ΔN_tj for the segmentation of large-scale brain medical records;

f. Recalculate and adjust multi-granularity centers

and the displacement d(c_tj, c_tj) of the movement of each multi-granularity center;

g. Set the granularity center migration threshold ε and the multi-granularity sub-population count adjustment threshold θ. If the center c_tj of the multi-granularity population V_tj satisfies

and

then the adjustment label of the multi-granularity population V_tj is set to 0, that is, f_tj = 0, and V_tj and c_tj are added to the final multi-granularity population and center sets, that is, V = V ∪ {V_tj} and C = C ∪ {c_tj}. Once a set containing k multi-granularity centers has been formed, that is, |V| = k, the iteration terminates.
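The threshold-gated adjustment in sub-steps a to g can be sketched as a k-means-style loop in which converged centers are frozen and drop out of later rounds. This is a minimal sketch under assumptions, since the two conditions in sub-step g are not reproduced in this text: here a center is frozen (label f set to 0) when its displacement is at most ε and its member count changes by at most θ relative to the previous round. Euclidean distance, the iteration cap `max_iter`, and the names `mean` and `adjust_centers` are likewise hypothetical.

```python
import math

def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    dim = len(points[0])
    return tuple(sum(p[d] for p in points) / len(points) for d in range(dim))

def adjust_centers(points, centers, eps, theta, max_iter=100):
    """Iteratively adjust centers, freezing each one once it is stable."""
    centers = [tuple(c) for c in centers]
    active = list(range(len(centers)))   # centers whose label f is still 1
    free = [tuple(p) for p in points]    # points not locked into a frozen population
    prev_count = {}                      # member count of each population last round
    groups = {}
    final = {}                           # index -> (center, members): the sets C and V
    for _ in range(max_iter):
        if not active:
            break
        groups = {j: [] for j in active}
        for p in free:                   # minimum-distance assignment among active centers
            nearest = min(active, key=lambda j: math.dist(p, centers[j]))
            groups[nearest].append(p)
        still_active = []
        for j in active:
            members = groups[j]
            new_center = mean(members) if members else centers[j]
            shift = math.dist(new_center, centers[j])
            stable_count = j in prev_count and abs(len(members) - prev_count[j]) <= theta
            centers[j] = new_center
            prev_count[j] = len(members)
            if shift <= eps and stable_count:
                final[j] = (new_center, members)   # label f set to 0
            else:
                still_active.append(j)
        active = still_active
        free = [p for j in active for p in groups[j]]
    for j in active:                     # iteration cap reached: keep the current state
        final[j] = (centers[j], groups.get(j, []))
    return [final[j] for j in sorted(final)]

# Toy data: two well-separated groups of three points each.
toy_points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
for center, members in adjust_centers(toy_points, [(0, 0), (10, 10)], eps=1e-6, theta=0):
    print(center, len(members))
```

Freezing stable centers means later rounds only recompute distances for the populations that are still moving, which is the cost saving this step is aiming at.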

The further improvement of the present invention lies in: the specific steps of the step E are as follows:

a. Suppose two adjacent super-elite clusters are

and

and their elite membership degrees are respectively

and

b. If

the super elites evolve into the combination of the elite clusters

otherwise they evolve into the combination of the elite clusters


c. Perform the hybrid competitive-cooperative collaboration among the multi-granularity sub-populations for large-scale brain medical record segmentation. Let S_i be the i-th super elite; for i = 1 to |S_i|, perform the following operations:

(1) insert the representative super elite S_{i,rep} of S_i into P_i^t;

(2) if n_x > |S_i|, select a subset of super elites P_i^t from the multi-granularity sub-population Granu-subpopulation_i;

(3) combine all solutions of S_{i,j} with those of the other multi-granularity sub-populations Granu-subpopulation_i, sort them, and calculate the niche count of S_{i,j};

(4) update the super-elite representative of S_i, obtain the non-dominated solutions in the Pareto dominance region, determine the winning multi-granularity sub-population, and update S_i = S_k;

d. The fuzzy membership degree u_{C_h}(P_i) of a super elite is calculated in a similar manner, where the distance between the reference value P_i and the super-elite center C_h is defined as d(P_i, C_h);

e. Calculate the equilibrium degree CI of each super elite in the multi-granularity sub-populations as

and the consensus probability CR as

where t ∈ {1,2,...,s};

f. For any inconsistent equilibrium degree

obtain the optimal consistent equilibrium degree of the super elites of the t-th multi-granularity sub-population as

where

g. Obtain the global optimal consensus probability of all super elites as

for t ∈ {1,2,...,s}, and construct the optimal consistent equilibrium degree and probability pair for large-scale brain medical record attribute segmentation as

for t ∈ {1,2,...,s};

h. Based on the optimal consistent equilibrium degree and probability pair

the super elites segment the feature sets of the different attribute regions of the brain medical records as F_1, F_2, ..., F_n and obtain the optimal feature set of the large-scale brain medical records
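Two building blocks named in these sub-steps, the distance-based fuzzy membership of a super elite and the selection of non-dominated solutions, can be sketched as follows. Both are stand-ins chosen for illustration, because the source formulas are not reproduced in this text: `fuzzy_memberships` uses the standard fuzzy c-means membership formula with fuzzifier m and Euclidean distance, and `pareto_front` assumes every objective is minimized. The function names and the toy objective pairs are hypothetical.

```python
import math

def fuzzy_memberships(point, centers, m=2.0):
    """Fuzzy c-means style membership of `point` in each center (assumed form).

    u_h = 1 / sum_j (d_h / d_j) ** (2 / (m - 1)); memberships sum to 1.
    """
    dists = [math.dist(point, c) for c in centers]
    if any(d == 0.0 for d in dists):
        # The point coincides with one or more centers: share membership among them.
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        total = sum(hits)
        return [h / total for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_h / d_j) ** power for d_j in dists) for d_h in dists]

def dominates(a, b):
    """True when a is no worse than b in every objective and better in one
    (all objectives minimized, an assumption of this sketch)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def pareto_front(solutions):
    """Keep the solutions that no other solution dominates."""
    return [s for s in solutions if not any(dominates(o, s) for o in solutions)]

# A super elite at (1, 0) is closer to the center at (0, 0) than to (3, 0).
print(fuzzy_memberships((1, 0), [(0, 0), (3, 0)]))
# Hypothetical objective pairs, for example (reduct size, error).
print(pareto_front([(1, 5), (2, 2), (3, 3), (5, 1)]))
```

In the toy run, the candidate (3, 3) is discarded because (2, 2) is better in both objectives, while the other three represent different trade-offs and all remain on the front.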

Compared with the prior art, the present invention has the following advantages:

1) The present invention uses a multi-granularity Spark super-trust model to build trust between different super elites in the multi-granularity populations and dynamically updates the super elites using multi-granularity sub-population balance adjustment strategies. It performs global search segmentation and local refinement segmentation on large-scale brain medical records, with the super elites collaboratively extracting knowledge reduction subsets in their respective regions, which greatly reduces execution time and improves the accuracy of large-scale brain medical record segmentation.

2) The present invention constructs a dynamic collaborative operation mechanism for the super elites of the multi-granularity populations on the Spark cloud platform, based on the dynamic elite dominance region. It achieves an optimal and consistent balance in large-scale brain medical record segmentation and reduces the complexity cost of feature segmentation. It further improves the granularity and robustness of large-scale parallel feature extraction from brain medical records on the Spark cloud platform, and lays a good foundation for intelligent services such as brain medical record feature selection, rule mining, and clinical decision support.

Description of the drawings:

Figure 1 is the overall flow chart of the system;

Figure 2 is a diagram of the dynamic execution process of the multi-granularity super-trust Spark model;

Figures 3 to 5 are diagrams of the dynamic fuzzy collaborative operation process of the multi-granularity population super elites;

Detailed description of the embodiments:

In order to deepen the understanding of the present invention, the present invention will be described in further detail below in conjunction with examples. The examples are only used to explain the present invention and do not constitute a limitation on the protection scope of the present invention.

The specific implementation of the multi-granular Spark super-trust fuzzy method for large-scale brain medical record segmentation is shown in Figure 1 to Figure 5. The specific steps are as follows:

A. On the big-data Spark cloud platform, divide the large-scale brain medical record attribute set into different multi-granularity evolutionary populations Granu-population_i, i = 1, 2, ..., n; decompose the brain medical record attribute segmentation task into multiple parallel job tasks; and then calculate, within each of these job tasks, the equivalence classes of the different candidate attribute sets of the brain medical records;

B. Design a multi-granularity super-trust model in which the i-th multi-granularity evolutionary population Granu-population_i performs the reduction and segmentation of the i-th attribute set of the brain medical records. Build trust between the different super elites in the multi-granularity populations and calculate the trust deviation of the multi-granularity populations, so that the scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces. The specific sub-steps are those listed above for step B;

C. Set the multi-granularity Spark super-trust center adjustment threshold for large-scale brain medical record segmentation to λ. After the i-th iteration is completed, each multi-granularity sub-population Granu-population_i whose granularity center adjustment exceeds the threshold λ undergoes the next iterative adjustment. Set the granularity center migration threshold ε and the multi-granularity sub-population count adjustment threshold θ; each center c_tj and multi-granularity population V_tj that meets them is added to the final multi-granularity population center set, forming a set of k multi-granularity centers. The specific sub-steps are those listed above for step C;

D. Use the equilibrium adjustment strategy to dynamically update the super elites in the multi-granularity sub-populations: place the super elites of the multi-granularity sub-populations within an isosceles right triangle and calculate their respective granularity values.

If two super elites have the same lower granularity

then their approximation attribute values converge to the equilibrium pair

If two super elites have the same higher granularity

then their approximation attribute values converge to the equilibrium pair

This equilibrium adjustment strategy helps increase the optimal consistent equilibrium degree of the multi-granularity sub-populations.

E. Construct a dynamic fuzzy collaborative segmentation strategy for the super elites of the multi-granularity sub-populations. Within the dynamic elite dominance region, perform global search segmentation and local refinement segmentation on the large-scale brain medical record attributes, carry out the hybrid competitive-cooperative collaboration among the multi-granularity sub-populations, and construct the optimal consistent equilibrium degree and probability pair for large-scale brain medical record attribute segmentation, so that the super elites collaboratively extract knowledge reduction subsets in their corresponding Pareto dominance regions and stably segment the optimal feature set of the large-scale brain medical records across the different attribute regions. The specific sub-steps are those listed above for step E;

F. Compare the segmentation accuracy RC of the large-scale brain medical records obtained above with the preset accuracy value η. If RC ≥ η, output the optimal segmentation knowledge set of the large-scale brain medical records; otherwise, repeat steps C, D, and E until the segmentation accuracy satisfies RC ≥ η;

G. Store the optimal segmentation feature set of the large-scale brain medical records on the Spark cloud platform, providing an important knowledge basis for intelligent auxiliary diagnosis in the clinical diagnosis and treatment of diseases related to large-scale brain medical records.
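The control flow of steps C to F, repeating the adjustment steps until the segmentation accuracy RC reaches the preset value η, can be sketched as a simple driver loop. This is a minimal sketch under assumptions: the step functions below are toy stand-ins that only increment a score, `run_until_accurate` and its `max_rounds` safety cap are hypothetical additions, and the real steps would operate on the sub-populations held on the Spark platform.

```python
def run_until_accurate(state, steps, accuracy, eta, max_rounds=50):
    """Apply the given steps in order, round after round, until accuracy(state) >= eta.

    Returns the final state, the accuracy reached, and the number of rounds used.
    The max_rounds cap is an assumption; the source describes no upper bound.
    """
    for round_no in range(1, max_rounds + 1):
        for step in steps:
            state = step(state)
        rc = accuracy(state)
        if rc >= eta:
            return state, rc, round_no
    raise RuntimeError("accuracy target not reached within max_rounds")

# Toy stand-ins for steps C, D and E: each one nudges an integer score upward.
def step_c(score):
    return score + 5

def step_d(score):
    return score + 5

def step_e(score):
    return score + 10

def toy_accuracy(score):
    return score / 100

final_state, rc, rounds = run_until_accurate(
    50, [step_c, step_d, step_e], toy_accuracy, eta=0.9
)
print(final_state, rc, rounds)
```

Passing the steps in as functions keeps the stopping rule of step F separate from the details of steps C, D, and E.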

The invention uses a multi-granularity Spark super-trust model to build trust between different super elites in the multi-granularity populations and dynamically updates the super elites using multi-granularity sub-population balance adjustment strategies. It performs global search segmentation and local refinement segmentation on large-scale brain medical records, with the super elites collaboratively extracting knowledge reduction subsets in their respective regions, which greatly reduces execution time and improves the accuracy of large-scale brain medical record segmentation.

The present invention constructs a dynamic collaborative operation mechanism for the super elites of the multi-granularity populations on the Spark cloud platform, based on the dynamic elite dominance region. It achieves an optimal and consistent balance in large-scale brain medical record segmentation, reduces the complexity cost of feature segmentation, and further improves the granularity and robustness of large-scale parallel feature extraction from brain medical records on the Spark cloud platform, laying a good foundation for intelligent services such as brain medical record feature selection, rule mining, and clinical decision support.

The foregoing description of the disclosed embodiments enables those skilled in the art to implement or use the present invention. Various modifications to these embodiments will be obvious to those skilled in the art, and the general principles defined herein can be implemented in other embodiments without departing from the spirit or scope of the present invention.

Therefore, the present invention will not be limited to the embodiments shown in this document, but should conform to the widest scope consistent with the principles and novel features disclosed in this document.

Claims

1. A multi-granularity Spark super-trust fuzzy method for large-scale brain medical record segmentation, characterized by the following specific steps:

A. On the big-data Spark cloud platform, divide the large-scale brain medical record attribute set into different multi-granularity evolutionary populations Granu-population_i, i = 1, 2, ..., n; decompose the brain medical record attribute segmentation task into multiple parallel job tasks; and then calculate, within each of these job tasks, the equivalence classes of the different candidate attribute sets of the brain medical records;

B. Design a multi-granularity super-trust model in which the i-th multi-granularity evolutionary population Granu-population_i performs the reduction and segmentation of the i-th attribute set of the brain medical records; build trust between the different super elites in the multi-granularity populations and calculate the trust deviation of the multi-granularity populations, so that the scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces;

C. Set the multi-granularity Spark super-trust center adjustment threshold for large-scale brain medical record segmentation to λ; after the i-th iteration is completed, each multi-granularity sub-population Granu-population_i whose granularity center adjustment exceeds the threshold λ undergoes the next iterative adjustment; set the granularity center migration threshold ε and the multi-granularity sub-population count adjustment threshold θ, and add each center c_tj and multi-granularity population V_tj that meets them to the final multi-granularity population center set, forming a set of k multi-granularity centers;

D. Use the equilibrium adjustment strategy to dynamically update the super elites in the multi-granularity sub-populations: place the super elites of the multi-granularity sub-populations within an isosceles right triangle and calculate their respective granularity values
If two super elites have the same lower granularity
then their approximation attribute values converge to the equilibrium pair
If two super elites have the same higher granularity
then their approximation attribute values converge to the equilibrium pair
This equilibrium adjustment strategy helps increase the optimal consistent equilibrium degree of the multi-granularity sub-populations.

E. Construct a dynamic fuzzy collaborative segmentation strategy for the super elites of the multi-granularity sub-populations; within the dynamic elite dominance region, perform global search segmentation and local refinement segmentation on the large-scale brain medical record attributes, carry out the hybrid competitive-cooperative collaboration among the multi-granularity sub-populations, and construct the optimal consistent equilibrium degree and probability pair for large-scale brain medical record attribute segmentation, so that the super elites collaboratively extract knowledge reduction subsets in their corresponding Pareto dominance regions and stably segment the optimal feature set of the large-scale brain medical records across the different attribute regions;

F. Compare the segmentation accuracy RC of the large-scale brain medical records obtained above with the preset accuracy value η; if RC ≥ η, output the optimal segmentation knowledge set of the large-scale brain medical records; otherwise, repeat steps C, D, and E until the segmentation accuracy satisfies RC ≥ η;

G. Store the optimal segmentation feature set of the large-scale brain medical records on the Spark cloud platform, providing an important knowledge basis for intelligent auxiliary diagnosis in the clinical diagnosis and treatment of diseases related to large-scale brain medical records.
2. The multi-granularity Spark super-trust fuzzy method for large-scale brain medical record segmentation according to claim 1, wherein the specific steps of step B are as follows:

a. Set the number of multi-granularity populations to n, with n ≥ 2, and initialize the multi-granularity populations as GP_h, h ∈ {1,...,n};

b. Initialize the center of the first granularity population as
Then initialize the center of the second granularity population as
Make it a priority for the super elite

c. For the 3rd and above multi-granularity population centers
Calculate current elite priority
The minimum distance from the centers of all current granularity populations is calculated as follows:

Assign the minimum distance to the u-th multi-granularity population center
Repeat this process until all n multi-granularity evolutionary populations are initialized;

d. The trust degree of the i-th super elite in the same granular subpopulation is defined as follows:

where n is the total number of elites, SP_i is the i-th super elite, and P_ij is the j-th ordinary elite in the i-th multi-granularity population;

e. For the i-th super elite SP_i at the h-th multi-granularity population center
the trust degree R_i is calculated iteratively as follows:

Where i∈{2,...,N},

f. For the multi-granularity population center
let the number of similarity iterations in the current cycle be t, t ∈ {2,...,n-1}; the trust degree of each multi-granularity population center
is calculated from iteration t-1 of the previous round, so that the scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces;

g. Calculate the trust deviation Diff_ij between the trust degrees of different super elites SP_i and SP_j in the multi-granularity population; the calculation formula is

where Re_ij is the credibility assigned by the i-th super elite to the j-th super elite, R_mj is the partial trust in the j-th super elite recommended by the m-th ordinary elite in the population, I(j) is the set of all elites in the j-th multi-granularity population GP_j, and |I(j)| is the cardinality of that set;

h. The population trust between the h-th multi-granularity population and the u-th multi-granularity population center is
calculated as follows:

where m is the number of iterations and
is the variation range of the two multi-granularity populations at the t-th iteration, calculated as

i. For the h-th multi-granularity population
if the condition
is satisfied, where ε ∈ [0,1] is the similarity threshold, then the multi-granularity population conforms to the sub-population trust relationship in the different granularity spaces;

j. Construct the trust relationship between different super elites in the multi-granularity population, defined as

where λ is the confidence factor of the direct trust between super elites. The value of λ depends on the number of interactions between the super elites: the more interactions, the larger λ, with 0 ≤ λ ≤ 1. We take λ = h/H_Lmt, where h is the number of interactions between super elite i and super elite j and H_Lmt is the preset threshold on the number of interactions. The scale of the large-scale brain medical record attribute set is dynamically and iteratively updated through the sub-population trust relationships in the different granularity spaces.
The multi-granular Spark super-trust fuzzy method for large-scale brain medical record segmentation according to claim 1, wherein the specific steps of step C are as follows:

a. Use the traditional k-means clustering method to initialize the multi-granularity centers as

b. Initialize the set of multi-granularity subpopulation clusters and the set of centers as empty, V=Φ and C=Φ, and set the iteration count t=1. Calculate the distance between each multi-granularity subpopulation and each multi-granularity center, and assign the large-scale brain disease record attribute set to the corresponding multi-granularity centers according to the minimum-distance principle, forming k multi-granularity populations
record the number of super elites in each center
and set the initial adjustment label

c. Recalculate each multi-granularity center
and the initial displacement d(c_1i, c_0i) of each granularity center, where |V_i| denotes the number of members of the multi-granularity population V_i;

d. Let d(c_1, c_0) be the distance between the granularity center c_1 after the first iteration of the granularity subpopulation and the initial granularity center c_0, and let d(c, c′) be the distance between the new granularity center c′ after the i-th iteration and the original granularity center c. If
where ε is the similarity threshold with range ε∈[0,1], then the granularity center represented by c′ no longer participates in the next round of iterative adjustment; otherwise iterative adjustment continues;

e. Calculate the distance between each super elite in the multi-granularity populations labeled f_tj=1 and the centers of the multi-granularity populations participating in the adjustment, assign the brain disease record attributes to the corresponding multi-granularity populations according to the minimum-distance principle to form k new multi-granularity populations {V_tj}, record the number of super elites {N_tj} in each multi-granularity population, and compute the adjusted number of super elites ΔN_tj for segmentation of large-scale brain disease records;

f. Recalculate the adjusted multi-granularity centers
and the displacement d(c_tj, c_(t-1)j) of each multi-granularity center;

g. Set the granularity-center migration threshold ε and the multi-granularity subpopulation count adjustment threshold θ. If the center c_tj of the multi-granularity population V_tj satisfies
and
then the adjustment label of V_tj is set to 0, that is, f_tj=0, and V_tj and c_tj are added to the final sets of multi-granularity populations and centers, that is, V=V∪{V_tj} and C=C∪{c_tj}. Once a set containing k multi-granularity centers has been formed, that is, |V|=k, the iteration terminates.
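By way of non-limiting illustration, steps b to g can be sketched as a center-adjustment loop in which a center is frozen (adjustment label set to 0) once it stops moving. The two freezing conditions of step g are not reproduced in the text, so the sketch assumes they are "center displacement ≤ ε" and "|ΔN_tj| ≤ θ". The function name `adjust_centers`, the Euclidean distance, and the `max_iter` guard are hypothetical, and the initial centers are taken as given rather than produced by k-means.

```python
import math


def mean_point(pts):
    """Coordinate-wise mean of a non-empty list of points."""
    dim = len(pts[0])
    return tuple(sum(p[d] for p in pts) / len(pts) for d in range(dim))


def nearest(p, centers, allowed):
    """Index (among `allowed`) of the center closest to point p."""
    return min(allowed, key=lambda j: math.dist(p, centers[j]))


def adjust_centers(points, init_centers, eps, theta, max_iter=100):
    k = len(init_centers)
    centers = [tuple(c) for c in init_centers]
    active = set(range(k))              # indices whose label f_tj is 1
    clusters = [[] for _ in range(k)]
    for p in points:                    # step b: minimum-distance assignment
        clusters[nearest(p, centers, range(k))].append(tuple(p))
    prev_counts = [len(c) for c in clusters]

    for _ in range(max_iter):
        if not active:                  # all k centers frozen: terminate
            break
        for j in sorted(active):        # steps c and f: recompute centers
            new_c = mean_point(clusters[j]) if clusters[j] else centers[j]
            shift = math.dist(new_c, centers[j])
            delta_n = abs(len(clusters[j]) - prev_counts[j])
            centers[j] = new_c
            prev_counts[j] = len(clusters[j])
            # step g (ASSUMED conditions): freeze a stable center
            if shift <= eps and delta_n <= theta:
                active.discard(j)
        # step e: reassign only members of populations still being adjusted
        pool = [p for j in active for p in clusters[j]]
        for j in active:
            clusters[j] = []
        for p in pool:
            clusters[nearest(p, centers, active)].append(p)
    return clusters, centers


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (10, 10), (10, 11)]
    print(adjust_centers(pts, [(0, 0), (10, 10)], eps=0.01, theta=0))
```

Freezing stable centers means later iterations only recompute distances for the populations that are still moving, which is the apparent purpose of the adjustment label.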
The multi-granular Spark super-trust fuzzy method for large-scale brain medical record segmentation according to claim 1, wherein the specific steps of step E are as follows:

a. Suppose two adjacent super elite clusters are
and
and their elite membership degrees are, respectively,
and

b. If
then the super elites evolve into the combination of the elite cluster
otherwise, they evolve into the combination of the elite cluster

c. Perform mixed competitive and cooperative co-evolution of the multi-granularity subpopulations for large-scale brain medical record segmentation. Let S_i be the i-th super elite; for i = 1 to |S_i|, perform the following operations:

(1) Insert the super elite representative S_i,rep of S_i into P_i^t;

(2) If n_x > |S_i|, select a subset T of the super elites P_i from the multi-granularity subpopulation Granu-subpopulation_i;

(3) Combine all solutions of S_i,j with those of the other multi-granularity subpopulations Granu-subpopulation_i, sort them, and calculate the niche count of S_i,j;

(4) Update the super elite representative of S_i by obtaining the non-dominated solutions in the Pareto dominance region, determine the winning multi-granularity subpopulation, and update S_i=S_k;

d. The fuzzy membership degree u_Ch(P_i) of a super elite is calculated by a membership-similarity approach, where the distance between the reference value P_i and the super elite center C_h is defined as d(P_i, C_h);

e. Calculate the equilibrium degree CI of the super elites in each multi-granularity subpopulation as
and the consensus probability CR as
where t∈{1,2,...,s};

f. For any inconsistent equilibrium degree
obtain the optimal consistent equilibrium degree of the super elites of the t-th multi-granularity subpopulation as
where

g. Obtain the global optimal consensus probability of all super elites as
and construct the optimal consistent equilibrium degree and probability pair for large-scale brain disease record attribute segmentation
where t∈{1,2,...,s};

h. Based on the optimal consistent equilibrium degree and probability pair
the super elites segment the feature sets of the different attribute regions of the brain medical records into F_1, F_2, ..., F_n, and obtain the optimal feature set of the large-scale brain medical records
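By way of non-limiting illustration, two of the operations above can be sketched in Python: the selection of non-dominated solutions in step c(4) and the distance-based fuzzy membership of step d. The text does not give the membership formula, so the sketch substitutes the standard fuzzy c-means membership with fuzzifier m as an assumption. It also assumes that all objectives are minimized and that d(P_i, C_h) is the Euclidean distance. The function names `dominates`, `non_dominated` and `fuzzy_memberships` are hypothetical.

```python
import math


def dominates(a, b):
    """True if a is no worse than b in every objective and strictly
    better in at least one (ASSUMPTION: all objectives are minimized)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))


def non_dominated(solutions):
    """Keep the solutions that no other solution dominates."""
    return [s for s in solutions
            if not any(dominates(o, s) for o in solutions)]


def fuzzy_memberships(p, centers, m=2.0):
    """Membership of point p in each center C_h.

    ASSUMPTION: fuzzy c-means form
        u_h = 1 / sum_k (d(p, C_h) / d(p, C_k)) ** (2 / (m - 1)),
    with fuzzifier m > 1.
    """
    dists = [math.dist(p, c) for c in centers]
    if any(d == 0.0 for d in dists):
        # p coincides with one or more centers: share full membership
        hits = [1.0 if d == 0.0 else 0.0 for d in dists]
        return [h / sum(hits) for h in hits]
    power = 2.0 / (m - 1.0)
    return [1.0 / sum((d_h / d_k) ** power for d_k in dists)
            for d_h in dists]


if __name__ == "__main__":
    print(non_dominated([(1, 2), (2, 1), (2, 2), (3, 3)]))
    print(fuzzy_memberships((0, 0), [(1, 0), (3, 0)]))
```

Under these assumptions the memberships of a point sum to 1 across all centers, and a point closer to a center receives a larger membership in it.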