CN110968651A - Data processing method and system based on grey fuzzy clustering - Google Patents

Data processing method and system based on grey fuzzy clustering Download PDF

Info

Publication number
CN110968651A
CN110968651A CN201911129127.2A CN201911129127A CN110968651A CN 110968651 A CN110968651 A CN 110968651A CN 201911129127 A CN201911129127 A CN 201911129127A CN 110968651 A CN110968651 A CN 110968651A
Authority
CN
China
Prior art keywords
data
evaluation
fuzzy clustering
factors
processing method
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911129127.2A
Other languages
Chinese (zh)
Inventor
肖炯恩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
GUANGZHOU SAIBAO LIANRUI INFORMATION TECHNOLOGY CO LTD
Guangdong University of Business Studies
Original Assignee
GUANGZHOU SAIBAO LIANRUI INFORMATION TECHNOLOGY CO LTD
Guangdong University of Business Studies
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGZHOU SAIBAO LIANRUI INFORMATION TECHNOLOGY CO LTD, Guangdong University of Business Studies filed Critical GUANGZHOU SAIBAO LIANRUI INFORMATION TECHNOLOGY CO LTD
Priority to CN201911129127.2A priority Critical patent/CN110968651A/en
Publication of CN110968651A publication Critical patent/CN110968651A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/103Workflow collaboration or project management
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Abstract

The invention discloses a data processing method based on gray fuzzy clustering, and government affair data sharing is an important and complex project and relates to a large number of influencing factors. In order to identify key factors, the method adopts a model combining gray correlation analysis and fuzzy clustering analysis to obtain 43 influencing factors, and performs empirical research to identify the important influencing factors, namely the common important influencing factors. And according to the grey fuzzy clustering result, proposing a countermeasure or suggestion for promoting government affair data sharing by combining with the attitude curve of the evaluator. The data sharing method provides a reference for data sharing work of provincial and urban governments and can implement an overall solution idea.

Description

Data processing method and system based on grey fuzzy clustering
Technical Field
The invention relates to the field of data processing, in particular to a data processing method and system based on gray fuzzy clustering.
Background
In recent years, with the progress of technology and the guidance in policy, the reform of digital governments proposes to break through government ' data islands ', the optimization of operator environments requires that each government unit carries out data butt joint, increasingly severe safety problems require that the data of each unit can be effectively shared, the optimization and upgrade of convenient service requires that each government department cooperates to realize zero running legs ' of the public, and all the steps of governments are promoted to actively promote the cross-department government affair data sharing. The cross-department government affair data sharing is a complex project, is limited by a plurality of influence factors, and needs to analyze the core influence factors by deeply and comprehensively analyzing the influence factors. However, the traditional analysis method has certain disadvantages or short boards, so that the analysis of the influence factors of the cross-department government affair data sharing by adopting a new angle or method becomes the focus of research of a plurality of scholars.
Many scholars have made beneficial exploration in government cross-department data sharing from the perspective of theoretical and empirical research. In 1996 daves first conducted systematic research on government cross-department data sharing, conducted field investigations in new york state, and analyzed 173 government personnel' opinions on the interests and barriers to information sharing to present a theoretical framework for government cross-department data sharing based on organizational and policy aspects, whereas investigations conducted in the 90 s of the 20 th century failed to consider technical factors that facilitate electronic sharing of cross-department data. Landsberge and Wolken investigated five state federal and state government officers on the basis of the theoretical model proposed by Dawes, acquired the data of two cases (environmental reports and geographic information positioning service), proposed an extended government cross-department data sharing model, and intensively studied the influence of factors such as technical infrastructure, laws, management and policies. The previous research mainly focuses on starting from a conceptual model of the influencing factors, mainly searching the influencing factors, taking the factors revealing the positive influencing factors or hindering data sharing as research focuses, adopting a qualitative analysis method and proposing corresponding countermeasures. And the students mainly adopt a questionnaire survey mode to analyze the government affair data sharing influence factors, but questionnaire survey evaluation results have ambiguity, and the traditional research method cannot reflect the ambiguity relation among the factors, for example, a certain influence factor has a part of scorers giving high scores and another part of scorers giving low scores, so that the calculated average score or total score cannot well reflect the actual attitude of the scorers, and the strength relation among the factors cannot be well reflected.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a data processing method and a data processing system based on gray fuzzy clustering, namely, correlation among identification factors is analyzed through gray correlation, and a fuzzy clustering method is adopted to realize clustering of key factors. And according to the clustering result, establishing an attitude curve, analyzing different factors of the attitude curve, and further formulating a strategy capable of effectively promoting government data sharing.
The purpose of the invention is realized by adopting the following technical scheme:
a data processing method based on gray fuzzy clustering comprises the following steps: collecting data from each government department, wherein the collected data comprises evaluation data and fact data; data preprocessing: synthesizing the evaluation data to obtain comprehensive evaluation data; converting the fact data into fact evaluation data; the comprehensive evaluation data and the factual evaluation data constitute n evaluation factors and generate an evaluation vector Xk=(xk1,xk2,…,xkj,…xkm) Wherein x iskmThe n evaluation vectors constitute an object matrix X as the influence degree of the factorsn×mNormalizing the matrix and calculating a correlation coefficient and a correlation degree; fuzzy clustering: obtaining a fuzzy similarity matrix R according to the relevance; obtaining a transfer packet t (R) according to a transfer closed packet method, and ordering elements in the t (R) from large to small, wherein the lambda belongs to [0,1]]And sequentially taking values from large to small according to the sequence of t (R) to obtain the truncations of different lambda levels.
Further, the data preprocessing step adopts
Figure BDA0002277777600000031
Normalizing said matrix, wherein xijIs a matrix Xn×mRow i and column j.
Further, the data preprocessing step is carried out by
Figure BDA0002277777600000032
Figure BDA0002277777600000033
Obtaining the correlation coefficient ξijWherein x is0=(x01,x02,…,x0j,…,x0m) And selecting rho as the resolution according to a specific practical problem.
Further, the value of the resolution ρ is 0.5.
Further, the degree of association
Figure BDA0002277777600000034
i=1,2,…,n,pjThe weight of the jth influence level in the evaluation target.
Further, the fuzzy similarity matrix R ═ (R)ij)n×n,rij=1-|ri-rj|,(i,j=1,2,…n)。
Further, the fuzzy similarity matrix R ═ (R)ij)n×n,rij=1-|ri-rj|,(i,j=1,2,…n)。
Further, the transfer packet t (R) ═ RkWherein R iskοRk=Rk
Further, the influence degree of the factors is divided into f grades of 1,2 … … f, the degree from 1 to f is changed from weak to strong, and the value of the influence degree is obtained according to the statistical frequency.
The invention also provides a data processing system based on gray fuzzy clustering, which comprises a data acquisition module, a data preprocessing module, a fuzzy clustering module and a result output module; the data acquisition module acquires data from each government department, and the acquired data comprises evaluation data and fact data; the data preprocessing module receives the evaluation data and the fact data output by the data acquisition module, and synthesizes the evaluation data to obtain comprehensive evaluation data; converting the fact data into fact evaluation data; the comprehensive evaluation data and the factual evaluation data constitute n evaluation factorsElement and generate an evaluation vector Xk=(xk1,xk2,…,xkj,…xkm) Wherein x iskmThe n evaluation vectors constitute an object matrix X as the influence degree of the factorsn×mNormalizing the matrix and calculating a correlation coefficient and a correlation degree; the fuzzy clustering module obtains a fuzzy similarity matrix R according to the relevance; obtaining a transfer packet t (R) according to a transfer closed packet method, and ordering elements in the t (R) from large to small, wherein the lambda belongs to [0,1]]Sequentially taking values from large to small according to t (R) sequence to obtain truncation sets with different lambda levels; and the result output module outputs clusters according to the cut sets of different lambda levels.
The method decomposes a plurality of influence factors from each department, accurately identifies key factors from the plurality of influence factors, clusters each influence factor according to the standard to achieve the purpose of qualitative analysis, and identifies the influence degrees of different clusters through analyzing the influence degrees of the influence factors so as to achieve the purpose of quantitative analysis.
Drawings
FIG. 1 is a system for evaluating cross-department data influencing factors according to the present invention.
FIG. 2 is a comprehensive evaluation chart of grey correlation of government affair data sharing influence factors according to the invention;
FIG. 3 is a dynamic clustering chart of government data sharing influencing factors of the present invention;
FIG. 4 is an attitude curve highlighting the emphasis classes of the government data sharing influencing factors of the present invention;
FIG. 5 is an attitude curve for a general emphasis class of government data sharing influencing factors of the present invention.
Detailed Description
The invention will be further described with reference to the accompanying drawings and the detailed description below:
a data processing method based on gray fuzzy clustering comprises the processes of data acquisition, data preprocessing and fuzzy clustering.
The data acquisition process refers to the data acquisition from each government department, and the acquired data comprises evaluation data and fact data; the evaluation data is an evaluation of the degree of influence of each department on a certain matter, and these evaluations are often not obtained from objective factual data, and need to be obtained by summarizing the work of each department or performing subjective evaluation according to actual needs, for example: the national legislation or the promotion of a specific item, the decision process of the project with democratics, the influence factors of the project cooperation participants and other member relations, and the like.
The fact data refers to business data existing in database of each department and government database, and the evaluation process can be obtained through calculation. For example, the factor "having sufficient financial investment" may be evaluated according to the effect of the factor "having sufficient financial investment" on the capital support requested by the department, the actual capital support, and the final completion of the business. For example, the degree of influence of quantitative indicators such as the academic degree, the level and quantity of the thesis, the final implementation effect and the like on the key project participants with leading technical ability can be evaluated.
Data preprocessing (Gray correlation analysis)
In the data preprocessing stage, the evaluation data can be integrated to obtain integrated evaluation data; the fact data is converted into fact evaluation data. The comprehensive evaluation data and the factual evaluation data constitute n evaluation factors.
The matrix of study objects may be established based on the evaluation factors. The number of the government affair data sharing influence factors is n, and each factor is evaluated by m grades, so that the influence factor set to be classified is U-U1,u2,…,uk,…un}. Wherein each ukIs a set of factor evaluation vectors, denoted Xk=(xk1,xk2,…,xkj,…xkm) And representing the influence degree grade set of the kth factor, wherein the grade set is obtained by processing the evaluation data or the fact data according to the invention in a calculation mode, namely, the grade influence is obtained by statistics or is obtained by an objective data calculation mode. Further, a study object matrix can be obtained:
Figure BDA0002277777600000061
and (5) a data standardization process. In order to reduce the problem of 'large number swallowing small number' caused by comparing data with different orders of magnitude, the data normalization is required to be carried out on the original data, and the data is normalized to the [0,1] interval. Adopting a normalization process on the jth column of the matrix X:
Figure BDA0002277777600000062
and calculating the correlation coefficient and the correlation degree. And (3) after the reference sequence and the evaluation object are subjected to data standardization, evaluating the correlation coefficient of the corresponding index between the index and the reference sequence according to the j level of the ith factor in the step (2).
Figure BDA0002277777600000071
Wherein the optimal reference sequence of the evaluation target index is x0=(x01,x02,…,x0j,…,x0m) Generally, it is selected according to the practical problem. ρ is resolution, and the value range is (0,1), generally 0.5.
When the relevance degree is calculated, the influence degrees of different indexes in a research object need to be considered comprehensively, so index weight, namely the weight of the influence level of each influence factor, is introduced, the relevance coefficients of all indexes of each factor can be integrated into one relevance degree, the relevance degree after the weight is considered can be obtained, and the relevance degree calculation formula is as shown in formula (3).
Figure BDA0002277777600000072
Wherein p isjThe weight of the jth influence level in the evaluation object; r isiAnd the weighted association degree of the ith influencing factor and the reference sequence is shown, and the larger the value, the more similar the sequence and the reference sequence is, namely the stronger the influence degree of the influencing factor is.
(II) fuzzy clustering
The clustering analysis is a mathematical method for objectively classifying objects according to the relationships among the objects, such as different characteristics, degree of affinity and sparseness, similarity and the like, and the mathematical basis of the method is multivariate analysis in mathematical statistics. Generally, the boundary between objects to be clustered is fuzzy, and the fuzzy clustering method is very suitable for being applied. The research result of the influence factors of the government affair data sharing also has the characteristic of ambiguity. Therefore, clustering analysis is to measure the degree of affinity and sparseness among the factors by using the similarity, so as to perform clustering. Firstly, a similar matrix needs to be established, and the essence of the similar matrix is to establish the similar relation between every two objects in the factor set to be evaluated. The relevance calculation result is adopted to establish a similarity matrix among the influencing factors.
And establishing a fuzzy similarity matrix. Establishing a similarity matrix among the influence factors according to the relevance calculated by the formula (3), expressing the difference among the influence factors by using Euclidean distance, and showing the similarity coefficient of the influence factors by the formula (4), thereby obtaining a fuzzy similarity matrix as follows: r ═ R (R)ij)n×n
rij=1-|ri-rj|,(i,j=1,2,…n) (4)
Wherein r is more than or equal to 0ij≤1,rijCloser to 1 indicates a higher similarity of the two factors.
And establishing a fuzzy equivalent matrix. The fuzzy similar matrix obtained by the formula (4) generally only has reflexibility and symmetry but not transitivity, and the fuzzy similar matrix needs to be transformed into a fuzzy equivalent matrix by adopting a transmission closed-packet method so that the matrix has transitivity. The transmission closed-packet method is to calculate R in turn2,R4,R8…, find k, let RkοRk=RkThe transitive closure of R is t (R) ═ Rk
And (5) dynamic fuzzy clustering. And (3) sequencing elements in the t (R) from large to small, and sequentially taking values of lambda belonging to [0,1] according to the sequence of the t (R) from large to small to obtain intercepts with different lambda levels, thereby realizing dynamic clustering.
According to the data processing method of gray fuzzy clustering, the invention constructs a data processing system based on gray fuzzy clustering, which comprises a data acquisition module, a data preprocessing module, a fuzzy clustering module and a result output module; what is needed isThe data acquisition module acquires data from each government department, and the acquired data comprises evaluation data and fact data; the data preprocessing module receives the evaluation data and the fact data output by the data acquisition module, and synthesizes the evaluation data to obtain comprehensive evaluation data; converting the fact data into fact evaluation data; the comprehensive evaluation data and the factual evaluation data constitute n evaluation factors and generate an evaluation vector Xk=(xk1,xk2,…,xkj,…xkm) Wherein x iskmThe n evaluation vectors constitute an object matrix X as the influence degree of the factorsn×mNormalizing the matrix and calculating a correlation coefficient and a correlation degree; the fuzzy clustering module obtains a fuzzy similarity matrix R according to the relevance; obtaining a transfer packet t (R) according to a transfer closed packet method, and ordering elements in the t (R) from large to small, wherein the lambda belongs to [0,1]]Sequentially taking values from large to small according to t (R) sequence to obtain truncation sets with different lambda levels; and the result output module outputs clusters according to the cut sets of different lambda levels. The system of the invention is capable of fully applying all of the methods of the invention described above.
Of course, the data processing method of gray fuzzy clustering according to the present invention can also be placed in a system or a memory, that is, the system or the memory contains an execution code, and the execution code can execute the data processing method of gray fuzzy clustering according to the present invention.
The invention also provides a specific embodiment of a data processing method and a system for gray fuzzy clustering.
The invention adopts the business data of each department of the government of a certain city and collects the evaluation data fed back by each department, and arranges six categories of 43 factors in total, wherein the influence degree of each factor is divided into seven grades of 1,2,3,4,5,6 and 7, and the degree is changed from 1 to 7. The evaluation data can directly specify and adopt 7 influence factor grades for evaluation in the evaluation process, and the fact data divides evaluation grades to which a certain data influence value range belongs according to the calculated data influence value and an expected range.
Influence factor set is U ═ U1,u2,…,uk,…u43}. Calculating each influence factor u in the survey statistical tablekEvaluation vector X ofk=(xk1,xk2,xk3,xk4,xk5,xk6,xk7). The influence factor evaluation matrix is X ═ Xij]43×7. The evaluation matrix X obtained by calculation is subjected to data normalization processing to obtain a normalization matrix of the influencing factors, as shown in table 2. It is desirable that the more influential grades 5,6 and 7 are the better, the less influential grades 1,2 and 3 are the better, and grade 4 belongs to the transition phase, conveniently taken as the median value. Reference sequence x was constructed in conjunction with the normalized data of Table 10The correlation between the factor to be evaluated and the reference sequence is compared, and the correlation is calculated by a formula (2); the evaluation grade is determined by expert consultation and by an analytic hierarchy process to be p ═ 0.01,0.02,0.03,0.04,0.25,0.30 and 0.35.
TABLE 1 influence factor normalized data for government affairs data sharing
Figure BDA0002277777600000101
Figure BDA0002277777600000111
The correlation was calculated from equation (3), and some of the results are shown in Table 2. The relevance is sorted according to the size, and the larger the value is, the higher the influence degree is, and the relevance can be regarded as a key factor. As shown in fig. 2.
TABLE 2 correlation coefficient and degree of correlation (partial data only)
Figure BDA0002277777600000112
Figure BDA0002277777600000121
And (3) calculating the similarity relation between every two influence factors according to the obtained relevance value formula (4) to construct a fuzzy similarity matrix R shown in table 3. And transforming R into a fuzzy equivalent matrix t (R) by adopting a transfer closure method, and the table 4 shows.
TABLE 3 fuzzy similarity matrix
Figure BDA0002277777600000122
TABLE 4 fuzzy equivalence matrix
Figure BDA0002277777600000123
Dynamic clustering of the influencing factors is realized by determining different levels of lambda values, and the dynamic clustering result is shown in fig. 3. As can be seen from the figure, different lambda values are selected, and clustering results with different classification and refinement degrees are formed. The larger the lambda value, the finer the classification. Combining the relevance calculation and the graph of fig. 2, when λ is 1, 43 factors are respectively in one class, the influence strength of each factor is ranked as shown in fig. 2, the influence degree of the factor 3 is strongest, and the influence degree of the factor 18 is weakest; when λ is 0.9849, the influencing factors can be divided into three categories: the influencing factors are as follows from strong to weak: the first type is {3, 11, 14, 2, 12}, the corresponding influence factors are { D3, O2, O5, D2, O3}, the second type is {32, 1, 13}, the corresponding influence factors are { OM3, D1, O4}, and the third type is: others; when λ is 0.9758, if the response factors are classified into two categories: the key factors are as follows: {3, 11, 14, 2, 12}, others are non-critical factors. Meanwhile, the dynamic clustering result is consistent with the correlation analysis result, the influence degree of each factor can be better reflected, and the method has great value for guiding practice.
In the construction process of government affair data sharing, all factors play a certain role in influence. The influence factors are divided into three categories to better make measures and policies so as to promote the construction of government affair data sharing. According to the result of gray fuzzy clustering, the influence factors can be divided into three categories: the influencing factors are as follows from strong to weak: emphasis classes D3, O2, O5, D2, O3, general emphasis classes OM3, D1, O4, and others. And constructing an attitude curve according to the scores corresponding to the influence factors in the classes, and proposing a corresponding strategy suggestion according to the attitude curve. As shown in fig. 4-5 below.
Various other modifications and changes may be made by those skilled in the art based on the above-described technical solutions and concepts, and all such modifications and changes should fall within the scope of the claims of the present invention.

Claims (10)

1. A data processing method based on gray fuzzy clustering is characterized by comprising the following steps:
data acquisition: collecting data from each government department, wherein the collected data comprises evaluation data and fact data;
data preprocessing: synthesizing the evaluation data to obtain comprehensive evaluation data; converting the fact data into fact evaluation data; the comprehensive evaluation data and the factual evaluation data constitute n evaluation factors and generate an evaluation vector Xk=(xk1,xk2,…,xkj,…xkm) Wherein x iskmThe n evaluation vectors constitute an object matrix X as the influence degree of the factorsn×mNormalizing the matrix and calculating a correlation coefficient and a correlation degree;
fuzzy clustering: obtaining a fuzzy similarity matrix R according to the relevance; and obtaining a transfer packet t (R) according to a transfer closed packet method, sequencing elements in the t (R) from large to small, and sequentially taking values from large to small according to the sequence of the t (R) by the lambda epsilon [0,1] to obtain the cut sets with different lambda levels.
2. The data processing method based on gray fuzzy clustering of claim 1, characterized in that: the data preprocessing step adopts
Figure FDA0002277777590000011
Normalizing said matrix, wherein xijIs a matrix Xn×mRow i and column j.
3. The data processing method based on gray fuzzy clustering of claim 1, characterized in that: the dataIn the pretreatment step, by
Figure FDA0002277777590000012
Figure FDA0002277777590000013
Obtaining the correlation coefficient ξijWherein x is0=(x01,x02,…,x0j,…,x0m) And selecting rho as the resolution according to a specific practical problem.
4. A data processing method based on gray fuzzy clustering as claimed in claim 3, characterized in that: the resolution p is 0.5.
5. The data processing method based on gray fuzzy clustering of claim 4, characterized in that: the degree of association
Figure FDA0002277777590000021
i=1,2,…,n,pjThe weight of the jth influence level in the evaluation target.
6. The data processing method based on gray fuzzy clustering of claim 5, characterized in that: the fuzzy similarity matrix R ═ (R)ij)n×n,rij=1-|ri-rj|,(i,j=1,2,…n)。
7. The data processing method based on gray fuzzy clustering of claim 6, characterized in that: the fuzzy similarity matrix R ═ (R)ij)n×n,rij=1-|ri-rj|,(i,j=1,2,…n)。
8. The data processing method based on gray fuzzy clustering of claim 7, wherein: the transfer packet t (R) ═ RkWherein
Figure FDA0002277777590000022
9. A data processing method based on gray fuzzy clustering according to any of the claims 1-8, characterized in that: the influence degree of the factors is divided into f grades of 1,2 … … f, the degree from 1 to f is changed from weak to strong, and the value of the influence degree is obtained according to the statistical frequency.
10. A data processing system based on gray fuzzy clustering comprises a data acquisition module, a data preprocessing module, a fuzzy clustering module and a result output module; the method is characterized in that:
the data acquisition module acquires data from each government department, and the acquired data comprises evaluation data and fact data;
the data preprocessing module receives the evaluation data and the fact data output by the data acquisition module, and synthesizes the evaluation data to obtain comprehensive evaluation data; converting the fact data into fact evaluation data; the comprehensive evaluation data and the factual evaluation data constitute n evaluation factors and generate an evaluation vector Xk=(xk1,xk2,…,xkj,…xkm) Wherein x iskmThe n evaluation vectors constitute an object matrix X as the influence degree of the factorsn×mNormalizing the matrix and calculating a correlation coefficient and a correlation degree;
the fuzzy clustering module obtains a fuzzy similarity matrix R according to the relevance; obtaining a transfer packet t (R) according to a transfer closed packet method, sequencing elements in the t (R) from large to small, and sequentially taking values from large to small according to the sequence of the t (R) by the lambda epsilon [0,1] to obtain cut sets with different lambda levels;
and the result output module outputs clusters according to the cut sets of different lambda levels.
CN201911129127.2A 2019-11-18 2019-11-18 Data processing method and system based on grey fuzzy clustering Pending CN110968651A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911129127.2A CN110968651A (en) 2019-11-18 2019-11-18 Data processing method and system based on grey fuzzy clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911129127.2A CN110968651A (en) 2019-11-18 2019-11-18 Data processing method and system based on grey fuzzy clustering

Publications (1)

Publication Number Publication Date
CN110968651A true CN110968651A (en) 2020-04-07

Family

ID=70031065

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911129127.2A Pending CN110968651A (en) 2019-11-18 2019-11-18 Data processing method and system based on grey fuzzy clustering

Country Status (1)

Country Link
CN (1) CN110968651A (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111768842A (en) * 2020-07-06 2020-10-13 宁波方太厨具有限公司 Method and system for identifying traditional Chinese medicine syndrome element, electronic equipment and readable storage medium
CN112380687A (en) * 2020-11-10 2021-02-19 西安工程大学 Rapid lining quantification method for business suit based on fuzzy theory
CN113379564A (en) * 2021-04-08 2021-09-10 国网河北省电力有限公司营销服务中心 Power grid load prediction method and device and terminal equipment
CN114332515A (en) * 2021-11-01 2022-04-12 江苏方天电力技术有限公司 Online evaluation method for operation safety of coal mill based on intuition fuzzy clustering

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923615A (en) * 2010-06-11 2010-12-22 北京工业大学 Grey fuzzy comprehensive evaluation-based trust quantization method
CN102629296A (en) * 2012-02-29 2012-08-08 浙江工商大学 Enterprise credit evaluation method based on gray fuzzy
CN104636587A (en) * 2013-11-11 2015-05-20 邢蓉 Drug utilization and evaluation method based on weighted TOPSIS method
CN107092653A (en) * 2017-03-15 2017-08-25 西安工程大学 A kind of landslide Critical Rainfall Threshold based on method of fuzzy cluster analysis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101923615A (en) * 2010-06-11 2010-12-22 北京工业大学 Grey fuzzy comprehensive evaluation-based trust quantization method
CN102629296A (en) * 2012-02-29 2012-08-08 浙江工商大学 Enterprise credit evaluation method based on gray fuzzy
CN104636587A (en) * 2013-11-11 2015-05-20 邢蓉 Drug utilization and evaluation method based on weighted TOPSIS method
CN107092653A (en) * 2017-03-15 2017-08-25 西安工程大学 A kind of landslide Critical Rainfall Threshold based on method of fuzzy cluster analysis

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111768842A (en) * 2020-07-06 2020-10-13 宁波方太厨具有限公司 Method and system for identifying traditional Chinese medicine syndrome element, electronic equipment and readable storage medium
CN111768842B (en) * 2020-07-06 2023-08-11 宁波方太厨具有限公司 Identification method and system of traditional Chinese medicine certification, electronic equipment and readable storage medium
CN112380687A (en) * 2020-11-10 2021-02-19 西安工程大学 Rapid lining quantification method for business suit based on fuzzy theory
CN112380687B (en) * 2020-11-10 2024-02-13 西安工程大学 Fuzzy theory-based western-style clothes quick lining quantification method
CN113379564A (en) * 2021-04-08 2021-09-10 国网河北省电力有限公司营销服务中心 Power grid load prediction method and device and terminal equipment
CN114332515A (en) * 2021-11-01 2022-04-12 江苏方天电力技术有限公司 Online evaluation method for operation safety of coal mill based on intuition fuzzy clustering

Similar Documents

Publication Publication Date Title
CN107633265B (en) Data processing method and device for optimizing credit evaluation model
CN110968651A (en) Data processing method and system based on grey fuzzy clustering
Hájek et al. Forecasting corporate financial performance using sentiment in annual reports for stakeholders’ decision-making
CN110956273A (en) Credit scoring method and system integrating multiple machine learning models
Menekse et al. Internal audit planning using spherical fuzzy ELECTRE
Liu Using big data database to construct new GFuzzy text mining and decision algorithm for targeting and classifying customers
CN114880486A (en) Industry chain identification method and system based on NLP and knowledge graph
WO2023130774A1 (en) Data acquisition system for scientific research capability assessment based on subject development
CN113642922A (en) Small and medium-sized micro enterprise credit evaluation method and device
CN112989621A (en) Model performance evaluation method, device, equipment and storage medium
Wang et al. Design of the Sports Training Decision Support System Based on the Improved Association Rule, the Apriori Algorithm.
CN108509588B (en) Lawyer evaluation method and recommendation method based on big data
CN113591947A (en) Power data clustering method and device based on power consumption behaviors and storage medium
Li et al. A prediction model of human resources recruitment demand based on convolutional collaborative BP neural network
Işık et al. Detection of fraudulent transactions using artificial neural networks and decision tree methods
Yang et al. An evidential reasoning rule-based ensemble learning approach for evaluating credit risks with customer heterogeneity
CN114493224A (en) Method and device for evaluating sustainable development degree of enterprise
WO1992017853A2 (en) Direct data base analysis, forecasting and diagnosis method
Li et al. Research on listed companies’ credit ratings, considering classification performance and interpretability
Franke et al. The six identities of marketing: A vector quantization of research approaches
Zeng A comparison study on the era of internet finance China construction of credit scoring system model
CN112926442B (en) Construction method for image target data set balance completion
Zhang News Communication Value Evaluation Algorithm Based on Deep Neural Network
Arinichev et al. METRIC METHODS OF DATA MINING IN THE ASSESSMENT OF ENVIRONMENTAL AND ECONOMIC RISKS OF BUSINESSES
ARINICHEV et al. Organization bankruptcy risk forecasting methodology based on analytic hierarchy process

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200407