CN113159235B

CN113159235B - Knowledge collaborative clustering method for product green design

Info

Publication number: CN113159235B
Application number: CN202110565342.8A
Authority: CN
Inventors: 张雷; 张光立; 王青亚; 陈二蒙
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2021-05-24
Filing date: 2021-05-24
Publication date: 2022-09-30
Anticipated expiration: 2041-05-24
Also published as: CN113159235A

Abstract

The invention relates to the technical field of knowledge clustering, and discloses a knowledge collaborative clustering method for product green design, which comprises the steps of restraining knowledge required in the product green design process in two aspects of functions and attributes, establishing a knowledge model of characteristic restraint, limiting the research range of the knowledge, distinguishing data types of different knowledge, respectively selecting corresponding distance calculation formulas based on similarity calculation consideration, and carrying out one-layer retrieval clustering on different functional characteristics by combining a semantic tree according to new design requirements. The knowledge collaborative clustering method for the green design of the product can effectively manage the green design knowledge, achieve the purposes of reducing research and development cost and improving the quality of product design, and can also process fragmented and isomerized knowledge, so that complicated and complicated data can be rapidly and accurately clustered in a collaborative mode.

Description

Knowledge collaborative clustering method for product green design

Technical Field

The invention relates to the technical field of knowledge clustering, in particular to a knowledge collaborative clustering method for product green design.

Background

Under the background of the current big data age, knowledge clustering plays an important and active role in the development process of knowledge engineering, belongs to a product with rapidly rising science and technology, acquires new knowledge with higher value or usability by organizing and managing a dispersive data source and a knowledge source and processing, converting and integrating knowledge elements according to the requirements of customers, provides subsequent services based on knowledge by optimally integrating knowledge objects in aspects of functions, attributes, structures and the like and needs a large amount of design knowledge at each stage of design activity like product green design of the conceptual design, but in the design process, engineers usually spend a large amount of time processing the dispersive data source to acquire required data.

Although the concept of green design knowledge is widely known at present, a systematic method is still lacked for researching the green design knowledge of the product, the green design knowledge cannot be effectively managed, the purposes of reducing research and development cost and improving the quality of product design are achieved, and in addition, the fragmentized and isomerized knowledge lacks connection, is not easy to process, and cannot perform rapid and accurate clustering processing on complex and complicated data.

Disclosure of Invention

Technical problem to be solved

Aiming at the defects of the prior art, the invention provides a knowledge collaborative clustering method for product green design, which can effectively manage green design knowledge, achieve the purposes of reducing research and development cost and improving the quality of product design, can also process fragmented and isomerized knowledge, can realize the advantages of fast and accurate collaborative clustering of complex and frequent data, and the like, and solves the problems that the green design knowledge cannot be effectively managed, the research and development cost is reduced, and the quality of product design is improved.

(II) technical scheme

In order to achieve the purpose, the invention provides the following technical scheme: a knowledge collaborative clustering method for product green design comprises the following steps;

step one, constraining knowledge required in the green design process of a product in two aspects of functions and attributes, establishing a knowledge model of characteristic constraint, and limiting the research range of the knowledge;

step two, distinguishing data types of different knowledge, calculating and considering based on similarity, and respectively selecting corresponding distance calculation formulas;

step three, aiming at new design requirements, firstly, combining a semantic tree to perform one-layer retrieval clustering on different functional characteristics to obtain one-layer functional similarity (FR), and retrieving a certain amount of samples from a sample library according to the FR;

step four, using the deeper second-layer attribute to screen and cluster to make a decision of a feasible clustering scheme;

and step five, calculating the distances among the attributes with different data types, introducing a cooperative operator, and calculating cooperative similarity (CR) by combining different distance formulas and an improved classical Fuzzy C-means (FCM) algorithm to realize the cooperative clustering effect of hierarchical iteration and the centralized processing of discrete knowledge.

Preferably, in the step one, a knowledge model of characteristic constraints is established, a research range of knowledge is limited, knowledge required in a product green design process is constrained in two aspects of functions and attributes, and a knowledge model based on the characteristic constraints is established.

Preferably, the calculation of the similarity in the second step is the core of the clustering calculation, and the calculation of the similarity is highly dependent on the distance between each sample, the similarity and the sample distance are in an inverse relationship in terms of quantity, the greater the distance, the lower the sample similarity is, and the smaller the distance, the higher the sample similarity is, therefore, the calculation of the similarity can be obtained by the reciprocal of the distance.

Preferably, the Semantic Tree (ST) in the three steps analyzes and identifies a text, selects feature words and feature words with synonyms, peers or membership of the feature words to be represented in a tree form, measures attributes of the Semantic tree mainly including Semantic tree height, Semantic element path, Semantic element speed and Semantic element depth, when functional knowledge is retrieved from a sample library in input design requirements, FR is a key factor, according to the analysis of the Semantic tree, when the difference of the depth d(s) between two Semantic elements is small, the degree of association is high, when some functional features are in the same layer or adjacent layers, whether the functional features are equally needed can be determined through the similarity between the Semantic elements, where the inverse of the depth difference is used to measure the correlation between the two Semantic elements, as shown in the following formula:

preferably, when the deeper second-layer attribute screening clustering is used in the fourth step to make a decision on a feasible clustering scheme, the feature constraint is characterized in that the data types of constraint variables are not uniform, the data types are classified into four types, namely a semantic type, a numerical type, a fuzzy interval type and a tuple type, and the types appearing in the feature constraint include several types to all types.

Preferably, the classic fuzzy C-means clustering algorithm in the fifth step is calculated by adopting the following formula;

as described above, equation (1) defines the equation for the FCM objective function, where | x | (R |) _j -c _i I represents a sample x _j To class center c _i A distance of c _i As a cluster center, x _j As data samples, u _ij Is degree of membership, m is degree of membership factor, J is objective function, formula (2) is constraint formula, and formula (3) is degree of membership u _ij Calculating a formula, wherein the formula (4) is a clustering center iteration formula;

when the algorithm is started to run, giving u randomly _ij Or c _i One of the values is assigned, as long as the value is assigned under a full condition, then through repeated iteration, the objective function J gradually tends to a stable value, and finally a solution meeting the requirement is obtained, wherein different functional characteristics comprise N attribute characteristics, the N attribute characteristics have different data types, and each type of attribute characteristics are connected in series to form the functional characteristics in a knowledge model of characteristic constraint, wherein the work to be done by collaborative clustering is to generate clustering subsamples of each group of functional characteristics on the premise of dividing the functional characteristics into N fuzzy classes, and then through collaborative cooperation among the subsamples, a collaborative operator is introduced to perform appropriate strengthening or weakening on the different characteristics to generate an integral clustering sample of the characteristic constraint;

let Ui be a membership matrix, U _i ＝{u _ij ∈[0,1]}，p _i Is a layer of samples after functional clustering, CO [ i, j]Is a co-operator, d _ij (i 1,2, …, n, j 1,2, p) is a distance function, an objective function form in a classic FCM clustering function is applied and improved, and after a synergistic operator is added, the following synergistic function is obtainedImproved objective function J with same effect _i ；

Referring to an iteration mode in a classical FCM algorithm, introducing a Lagrange multiplier lambda, converting a cooperation problem into a non-constrained optimization problem, and obtaining:

minimizing the requirements according to the objective function:

obtaining a membership matrix element iterative formula:

and obtaining the collaborative clustering algorithm after determining the main parameter iteration mode.

Preferably, the collaborative clustering algorithm is specifically as follows;

inputting: input sample set X ₁ ,X ₂ ,..,X _n ；

Selecting: selecting corresponding distance function, cluster number c and cooperative operator CO [ i, j ] according to different data types, wherein the algorithm termination criterion satisfies the formula:

D _i+1 -D _i ≤δ

initialization: initializing a membership matrix and an algorithm termination threshold delta, and carrying out independent clustering by adopting different distance formulas according to specific data types of attribute characteristics contained in the membership matrix and the algorithm termination threshold delta to obtain an initial clustering sample related to design requirements;

and (3) calculating: and continuously iterating the membership degree matrix until a termination criterion is met.

(III) advantageous effects

Compared with the prior art, the invention provides a knowledge collaborative clustering method for product green design, which has the following beneficial effects:

when the invention is used, the knowledge required in the green design process of the product is restricted in two aspects of function and attribute, a knowledge model of characteristic restriction is established, the knowledge research is limited to be in a certain specific condition set according to characteristic parameters, the data types of different knowledge are distinguished, then the similarity of different knowledge is obtained through the reciprocal distance obtained by a distance calculation formula, different functional characteristics are subjected to one-layer searching and clustering by combining a semantic tree to obtain one-layer functional similarity, a certain amount of samples are searched from a sample library, a text is analyzed and identified through the semantic tree, the selected characteristic words and the characteristic words with synonyms, peers or subordination relations are expressed in a tree form, when certain functional characteristics are in the same layer or adjacent layers, whether the functional characteristics are equally required can be judged through the similarity between semantic elements, at the moment, the correlation of two semantic elements is measured by utilizing the reciprocal of the depth difference, the knowledge model characteristic attributes identified by semantic tree analysis are calculated by utilizing a formula, and then the cooperative similarity can be obtained, finally, the distance between the attributes with different data types is calculated by introducing a cooperative operator, and the cooperative similarity is calculated by combining different distance formulas and an improved FCM algorithm, so that the cooperative clustering effect of hierarchical iteration is realized, the centralized processing of discrete knowledge is realized, and the rapid and accurate clustering of complicated data is realized.

Drawings

FIG. 1 is a knowledge model diagram of feature constraints in a knowledge collaborative clustering method for product green design according to the present invention;

FIG. 2 is a logic expression diagram of a semantic tree in the knowledge collaborative clustering method for product green design according to the present invention;

FIG. 3 is a flow chart of collaborative clustering calculation in the knowledge collaborative clustering method for green product design according to the present invention;

FIG. 4 is a collaborative clustering meaning diagram in the knowledge collaborative clustering method for product green design according to the present invention;

FIG. 5 is a flow chart of a collaborative clustering algorithm in the knowledge collaborative clustering method for product green design according to the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Example (b):

referring to fig. 1-5, a knowledge collaborative clustering method for product green design includes the following steps;

step one, constraining knowledge required in the green design process of a product in two aspects of functions and attributes, establishing a knowledge model of characteristic constraint, limiting the research range of the knowledge, establishing the knowledge model of the characteristic constraint in step one, limiting the research range of the knowledge, constraining the knowledge required in the green design process of the product in two aspects of functions and attributes, and establishing a knowledge model based on the characteristic constraint, wherein in the knowledge model of the characteristic constraint, the characteristic constraint is composed of characteristic parameters of functional characteristics, meanwhile, different functional characteristics comprise different attributes and corresponding values thereof, and the characteristic parameters limit the research of the knowledge in a certain specific condition set;

and secondly, distinguishing data types of different knowledge, and selecting corresponding distance calculation formulas based on similarity calculation consideration, wherein the data types of various attributes contained in the characteristic parameters are different, the data types of some attributes are semantic types, and the data types of some attributes are numerical types. Therefore, when sample clustering is performed, two or three or more functional features and attribute features doubled after decomposition may be faced, and the features and the attributes also have different data types, the calculation of the similarity is the core of the clustering calculation, and the calculation of the similarity is highly dependent on the distance between each sample, the similarity and the sample distance are in an inverse relation in number, the greater the distance is, the lower the sample similarity is, the smaller the distance is, the higher the sample similarity is, and therefore, the calculation of the similarity can be obtained by the reciprocal of the distance;

let the feature constraint of knowledge be FC _i Where i is 1, 2.. and n, different feature constraints correspond to different data types, and there are corresponding distance calculation formulas, as shown in table 1,

TABLE 1

Step three, aiming at new design requirements, firstly, combining a Semantic tree to perform one-layer retrieval clustering on different functional characteristics to obtain one-layer functional similarity (FR), retrieving a certain amount of samples from a sample library according to FR, wherein the Semantic Tree (ST) is used for analyzing and identifying texts, selecting characteristic words and characteristic words with synonymy, same level or membership of the characteristic words and expressing the characteristic words in a tree form, and measuring the attributes of the Semantic tree mainly comprising Semantic tree height, Semantic element path, Semantic element high speed and Semantic element depth, when the design requirements are input to retrieve functional knowledge from the sample library, FR is a key factor, according to the analysis of the Semantic tree, when the difference of the depth d(s) between two Semantic elements is small, the association degree is high, and when some functional characteristics are in the same layer or an adjacent layer, whether the functional characteristics are equally needed or not can be judged through the similarity between the Semantic elements, the inverse of the depth difference is used here to measure the correlation between two semantic elements, as shown in the following equation:

in the above, the semantic tree height, i.e. the total number of layers in the semantic tree, is represented by h (ST),

semantic Meta Path, i.e. the shortest path from semantic Meta i to semantic Meta j, using p(s) _i ,s _j ) It is shown that the process of the present invention,

the semantic element height, i.e. the length of the semantic element path, is called the height of the semantic element, denoted h(s),

the semantic element depth is the length of the longest path from the root of the semantic tree to a specific semantic element, namely the semantic element depth and is represented by d(s);

step four, using the deeper second-layer attribute screening cluster to make a decision of a feasible clustering scheme, when using the deeper second-layer attribute screening cluster to make a decision of the feasible clustering scheme, the characteristic constraint is characterized in that the data types of the constraint variables are not uniform, the data types are divided into four types of semantic type, numerical type, fuzzy interval type and group type, and the types appearing in the characteristic constraint include several types to all types, therefore, when calculating the sample similarity, the distance relation between the characteristics needs to be considered firstly, the influence of different data types needs to be faced, and the attribute characteristic constraint in the knowledge model of the characteristic constraint is set as AC _i (i-1, 2, …, n), after determining the data type of the corresponding feature, selecting the corresponding distance formula in table 1 to calculate the distance, and setting the attribute feature constraint AC _i (i is 1,2, …, n) is d _i (i ═ 1,2, …, n) and a cooperative distance D, acting on D _i The CO-operator (Cooperator) above is CO _i (i＝1,2,…,n),COi∈[0,1]The following formula is used to express the cooperative distance:

the reciprocal of the cooperative distance is the cooperative similarity:

calculating the distances among the attributes with different data types, introducing a cooperative operator, and calculating cooperative similarity (CR) by combining different distance formulas and an improved classical Fuzzy C-means (FCM) algorithm to realize the cooperative clustering effect of hierarchical iteration and the centralized processing of discrete knowledge, wherein the classical Fuzzy C-means clustering algorithm adopts the following formula for calculation;

as described above, equation (1) defines the equation for the FCM objective function, where | x | (R |) _j -c _i I represents a sample x _j To class center c _i A distance of c _i As a cluster center, x _j As data samples, u _ij Is the degree of membership, m is the degree of membership factor, J is the objective function, equation (2) is the constraint equation, and equation (3) is the degree of membership u _ij Calculating a formula, wherein the formula (4) is a clustering center iterative formula;

when the algorithm is started to run, giving u randomly _ij Or c _i One assignment is only required to be assigned under a condition, then the objective function J gradually tends to a stable value through repeated iteration, and finally a solution meeting the requirement is obtained, wherein different functional characteristics comprise N attribute characteristics, and the N attribute characteristics have different numbersAccording to types, each type of attribute features are connected in series to form functional features in a feature constrained knowledge model, wherein the collaborative clustering is to do work, namely, on the premise of dividing the functional features into n fuzzy classes, clustering subsamples of each group of functional features are generated, then, through collaborative cooperation among the subsamples, a collaborative operator is introduced to carry out appropriate amount of strengthening or weakening on different features, and an integral clustering sample of feature constraint is generated;

let Ui be a membership matrix, U _i ＝{u _ij ∈[0,1]}，p _i Is a layer of samples after functional clustering, CO [ i, j ]]Is a co-operator, d _ij The method is characterized in that (i is 1,2, …, n, J is 1,2, p) is a distance function, an objective function form in a classic FCM clustering function is applied and improved, and after a synergistic operator is added, an improved objective function J with a synergistic effect is obtained _i ；

minimizing the requirements according to the objective function:

obtaining a membership matrix element iterative formula:

after the main parameter iteration mode is determined, the obtained collaborative clustering algorithm is concretely as follows;

inputting: input sample set X ₁ ,X ₂ ,..,X _n ；

D _i+1 -D _i ≤δ

the difference value before and after the cooperative distance is smaller than a threshold value delta, the termination criterion depends on the change of a membership matrix in the continuous iteration process of the clustering method, when different cooperative operators are taken, the distance formula can be used according to different distance formulas, for example, the Mahalanobis distance can be used as the measurement standard of the final cooperative distance, and along with continuous iteration, when the error between the current measured value and the measured value of the last two times is smaller than a given threshold value, the iteration is terminated;

When the invention is used, the knowledge required in the green design process of the product is restricted in two aspects of function and attribute, a knowledge model of characteristic restriction is established, the knowledge research is limited in a certain specific condition set according to characteristic parameters, the data types of different knowledge are distinguished, then the similarity of different knowledge is obtained through the reciprocal distance obtained by a distance calculation formula, different functional characteristics are subjected to one-layer searching clustering by combining a semantic tree to obtain one-layer functional similarity, a certain amount of samples are searched from a sample library, a text is analyzed and identified through the semantic tree, the selected characteristic words and the characteristic words with synonymy, sibling or subordination relation are expressed in a tree form, and when certain functional characteristics are in the same layer or adjacent layers, whether the functional characteristics are equally required can be judged through the similarity between semantic elements, at the moment, the correlation of two semantic elements is measured by utilizing the reciprocal of the depth difference, the knowledge model characteristic attributes identified by semantic tree analysis are calculated by utilizing a formula, and then the cooperative similarity can be obtained, finally, the distance between the attributes with different data types is calculated by introducing a cooperative operator, and the cooperative similarity is calculated by combining different distance formulas and an improved FCM algorithm, so that the cooperative clustering effect of hierarchical iteration is realized, the centralized processing of discrete knowledge is realized, and the rapid and accurate clustering of complicated data is realized.

It is to be noted that the term "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion, such that a process or method including a list of elements does not include only those elements but also other elements not expressly listed or inherent to such process or method. Without further limitation, an element defined by the phrase "comprising … …" does not exclude the presence of additional like elements in a process or method that includes the element.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. A knowledge collaborative clustering method for product green design is characterized in that: comprises the following steps;

step three, aiming at new design requirements, firstly, combining a semantic tree to perform one-layer retrieval clustering on different functional features to obtain one-layer functional similarity (FR), and retrieving a certain amount of samples from a sample library according to the FR;

the Semantic Tree (ST) analyzes and identifies a text, selects feature words and feature words with synonyms, peers or membership of the feature words and represents the feature words in a tree form, measures attributes of the Semantic tree mainly including Semantic tree height, Semantic element path, Semantic element high speed and Semantic element depth, when functional knowledge is retrieved from a sample library in input design requirements, FR is a key factor, according to the analysis of the Semantic tree, when the difference of the depth d(s) between two Semantic elements is small, the degree of association is high, when some functional features are in the same layer or adjacent layer, whether the functional features are required to be equal can be determined through the similarity between the Semantic elements, the reciprocal of the depth difference is used to measure the correlation between the two Semantic elements, as shown in the following formula:

in the formula, R _f Being the degree of association between two semantic elements, d _i (s) and d _j (s) are two semantic elements;

step four, using the deeper second-layer attribute to screen the clustering to make a decision on the feasible clustering scheme, when using the deeper second-layer attribute to screen the clustering to make a decision on the feasible clustering scheme, the characteristic constraint is characterized in that the data types of the constraint variables are not uniform, the data types are divided into four types, namely a semantic type, a numerical type, a fuzzy interval type and a tuple type, and the types appearing in the characteristic constraint include several types to all types, so when calculating the sample similarity, the distance relation among the characteristics needs to be considered at first, and then the influence of different data types needs to be faced;

calculating distances among attributes with different data types, introducing a cooperative operator, and calculating cooperative similarity (CR) by combining different distance formulas and an improved classical Fuzzy C-means (FCM) algorithm to realize a cooperative clustering effect of hierarchical iteration and centralized processing of discrete knowledge;

calculating by using the following formula in a classical fuzzy C-means clustering algorithm in the step five;

above, equation (1) defines the equation for the FCM target function, where | x _j -c _i II denotes sample x _j To class center c _i A distance of c _i As a cluster center, x _j As data samples, u _ij Is degree of membership, m is degree of membership factor, J is objective function, formula (2) is constraint formula, and formula (3) is degree of membership u _ij Calculating a formula, wherein the formula (4) is a clustering center iterative formula;

let Ui be a membership matrix, U _i ＝{u _ij ∈[0,1]}，p _i Is a layer of samples after functional clustering, CO [ i, j]Is a co-operator, d _ij (i 1,2, …, n; J1, 2,., p) is a distance function, an objective function form in a classic FCM clustering function is applied and improved, and after a synergistic operator is added, an improved objective function J with a synergistic effect is obtained _i ；

minimizing the requirements according to the objective function:

obtaining a membership matrix element iterative formula:

2. The knowledge collaborative clustering method for green design of products according to claim 1, characterized in that: in the first step, a knowledge model of characteristic constraint is established, the research range of knowledge is limited, the knowledge required in the green design process of products is constrained in two aspects of functions and attributes, and the knowledge model based on the characteristic constraint is established.

3. The knowledge collaborative clustering method for green design of products according to claim 1, characterized in that: the calculation of the similarity in the second step is the core of the clustering calculation, the calculation of the similarity is highly dependent on the distance between each sample, the similarity and the sample distance are in an inverse relationship in quantity, the greater the distance is, the lower the sample similarity is, the smaller the distance is, the higher the sample similarity is, and therefore, the calculation of the similarity can be obtained through the reciprocal of the distance.

4. The product green design-oriented knowledge collaborative clustering method according to claim 1, characterized in that: the collaborative clustering algorithm is concretely as follows;

inputting: input sample set X ₁ ,X ₂ ,..,X _n ；

D _i+1 -D _i ≤δ