CN115098672A

CN115098672A - User demand discovery method and system based on multi-view deep clustering

Info

Publication number: CN115098672A
Application number: CN202210510779.6A
Authority: CN
Inventors: 杨颖�; 蒋文文; 王刚
Original assignee: Hefei University of Technology
Current assignee: Hefei University of Technology
Priority date: 2022-05-11
Filing date: 2022-05-11
Publication date: 2022-09-23

Abstract

The invention provides a user demand discovery method, system, storage medium, and electronic device based on multi-view deep clustering, and relates to the technical field of data mining. First, a plurality of texts, each containing a single user requirement description, are acquired and vectorized. Next, multi-view text representation features are obtained from the vectorized texts. Finally, the text representation features of each view are input into the deep clustering network provided by the invention, and a user demand clustering result is obtained by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate. Establishing a multi-view collaborative learning mechanism retains view diversity information while mining consistency information, makes full use of the information complementarity and the underlying information consistency among the multi-view data, and improves the accuracy of the user demand clustering result, so that both the representative viewpoints of majority classes and the novel viewpoints of minority classes concerning user demands in user-generated content are effectively mined.

Description

User demand discovery method and system based on multi-view deep clustering

Technical Field

The invention relates to the technical field of data mining, in particular to a user demand discovery method and system based on multi-view deep clustering, a storage medium and electronic equipment.

Background

With the development of society, enterprise marketing has gradually shifted from being production-driven to being driven by user demand. Mining user demands, and designing or updating products and services to meet them, is therefore an important link in the sustainable development of an enterprise.

The first step in mining user demands is demand collection. The development of big data and Internet technology has broadened the channels for collecting user demand data. User-generated content such as online reviews, social media posts, and blogs is the main form in which users express their personal experience of products or services, and it provides rich text data for mining user demands. It is a promising data source: it has high analytical value and good timeliness, and original user content can be obtained quickly and at low cost. However, the large volume of unlabeled user-generated content is repetitive and often uninformative, which makes building an effective demand mining model a problem that urgently needs to be solved.

Disclosure of Invention

(I) Technical problem to be solved

Aiming at the defects of the prior art, the invention provides a user demand discovery method, a system, a storage medium and electronic equipment based on multi-view deep clustering, which solve the technical problem that demand mining cannot be effectively carried out on user generated content.

(II) Technical scheme

In order to realize the purpose, the invention is realized by the following technical scheme:

A user demand discovery method based on multi-view deep clustering comprises the following steps:

S1, acquiring a plurality of texts, each containing a single user requirement description, and vectorizing the texts;

S2, obtaining multi-view text representation features from the vectorized texts;

S3, inputting the text representation features of each view into the deep clustering network provided by the invention, and obtaining a user demand clustering result by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate.

Preferably, the obtaining of the multi-view text representation feature in S2 includes:

respectively constructing a text convolutional neural network and a bidirectional long short-term memory (BiLSTM) network based on a max pooling strategy and an average pooling strategy, and obtaining three-view text representation features that take both local features and contextual features into account.
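By way of illustration only, the two pooling strategies named above can be sketched in plain Python. The sketch assumes that a sequence model has already produced one hidden vector per token; the names `max_pool`, `avg_pool`, and `hidden_states` are hypothetical, and the text does not specify which network output feeds which pooling strategy.

```python
from typing import List

Vector = List[float]


def max_pool(hidden_states: List[Vector]) -> Vector:
    """Element-wise maximum over all time steps."""
    return [max(column) for column in zip(*hidden_states)]


def avg_pool(hidden_states: List[Vector]) -> Vector:
    """Element-wise mean over all time steps."""
    return [sum(column) / len(column) for column in zip(*hidden_states)]


# Hypothetical per-token outputs of a sequence model: 3 time steps, 2 dimensions.
hidden_states = [[1.0, 4.0], [3.0, 2.0], [2.0, 0.0]]
max_view = max_pool(hidden_states)  # keeps the strongest activation per dimension
avg_view = avg_pool(hidden_states)  # summarizes the whole sequence per dimension
```

Max pooling emphasizes the most salient local activation, whereas average pooling summarizes the whole sequence, which is one way two differently pooled representations of the same text can carry complementary information.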

Preferably, the deep clustering network comprises an autoencoder, composed of an encoding layer and a decoding layer, and a clustering layer, wherein the encoding layer is composed of a plurality of diversity encoders and one consistency encoder;

the S3 includes:

S31, inputting the text representation features of each view into the corresponding diversity encoder for convolution transformation to obtain the diversity depth coding features of the single view; inputting the text representation features of all views into the consistency encoder for convolution transformation to obtain consistency depth coding features containing the information of all views;

S32, inputting the diversity depth coding features and the consistency depth coding features respectively into a KL-divergence-based clustering layer to obtain the user demand clustering result.

Preferably, in the training phase, the loss function of the autoencoder is as follows:

where $L_{loss}$ denotes the sample reconstruction loss function;

n is the number of views, and M is the number of samples;

a text vector representing the jth sample in the ith view;

represents the encoding function of the ith view, and Θ denotes the encoder parameters;

represents the decoding function for the ith view and Ω represents the decoder parameters.
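A minimal sketch of a per-view reconstruction loss is given below for orientation. The formula itself is not reproduced in this text, so the squared-error form, the unnormalized sum over views and samples, and all names (`reconstruction_loss`, `keep_first`, `pad_zero`) are assumptions rather than the method's actual definition.

```python
from typing import Callable, List, Sequence

Vector = List[float]
Codec = Callable[[Vector], Vector]


def reconstruction_loss(
    views: Sequence[Sequence[Vector]],
    encoders: Sequence[Codec],
    decoders: Sequence[Codec],
) -> float:
    """Sum of squared reconstruction errors over N views and M samples (assumed form)."""
    total = 0.0
    for samples, encode, decode in zip(views, encoders, decoders):
        for x in samples:
            x_hat = decode(encode(x))  # encode, then decode the same sample
            total += sum((a - b) ** 2 for a, b in zip(x, x_hat))
    return total


# Toy stand-ins for a trained encoder/decoder pair (hypothetical).
def keep_first(x: Vector) -> Vector:
    return x[:1]


def pad_zero(z: Vector) -> Vector:
    return z + [0.0]


views = [[[1.0, 2.0], [3.0, 4.0]]]  # one view, two samples
loss = reconstruction_loss(views, [keep_first], [pad_zero])
```

In this toy case the second component of every sample is lost by the encoder, so the loss equals the sum of the squared second components.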

Preferably, in the training phase, the loss function of the clustering layer is as follows:

where $L_C$ denotes the clustering loss function;

$\lambda_1$ denotes the clustering loss adjustment coefficient;

$Q_i$ and $P_i$ denote, respectively, the clustering soft label distribution and the target distribution corresponding to the diversity depth coding features of the ith view; $Q$ and $P$ denote, respectively, the clustering soft label distribution and the target distribution shared by all views.
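The following sketch shows how a KL-divergence-based clustering loss over the shared and per-view distributions might be computed. The exact formula is not reproduced in this text; in particular, the way $\lambda_1$ weights the per-view terms against the shared term is an assumption, as are the function names.

```python
import math
from typing import List, Sequence

Matrix = List[List[float]]


def kl_divergence(P: Matrix, Q: Matrix, eps: float = 1e-12) -> float:
    """KL(P || Q) summed over all samples (rows) and clusters (columns)."""
    total = 0.0
    for p_row, q_row in zip(P, Q):
        for p, q in zip(p_row, q_row):
            if p > 0.0:  # terms with p = 0 contribute nothing
                total += p * math.log(p / max(q, eps))
    return total


def clustering_loss(
    P: Matrix,
    Q: Matrix,
    P_views: Sequence[Matrix],
    Q_views: Sequence[Matrix],
    lam1: float,
) -> float:
    """Assumed form: KL(P || Q) + lam1 * sum_i KL(P_i || Q_i)."""
    shared_term = kl_divergence(P, Q)
    view_term = sum(kl_divergence(Pi, Qi) for Pi, Qi in zip(P_views, Q_views))
    return shared_term + lam1 * view_term


P = [[0.9, 0.1]]
Q = [[0.5, 0.5]]
kl = kl_divergence(P, Q)
loss = clustering_loss(P, Q, [P, P], [Q, Q], lam1=0.5)
```

The divergence is zero when a soft label distribution already matches its target, so minimizing it pulls each $Q$ toward the corresponding $P$.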

Preferably, the diversity depth coding features and the consistency depth coding features are denoted by $Z_i$ ($i = 1, 2, \ldots, N$) and $Z$, respectively;

the number of clusters $K$ is specified, each diversity depth coding feature $Z_i$ ($i = 1, 2, \ldots, N$) is input into the K-means clustering algorithm to generate the centroids of $K$ initial clusters, and the soft label distribution $Q_i$ and the target distribution $P_i$ of each view are calculated;

where

representing encoded feature information of a jth sample in an ith view;

represents the centroid of the c-th cluster obtained by k-means clustering of the M sample features of the ith view;

representing the probability that the jth sample in the ith view belongs to the c cluster;

representing a reference probability that a jth sample in the ith view belongs to a c-th cluster;

Preferably, the consistency depth coding features $Z$ are input into the K-means clustering algorithm to generate the centroids of $K$ initial clusters, and the soft label distribution $Q$ and the target distribution $P$ shared by all views are calculated:

$$Q = [q_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{7}$$

$$P = [p_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{8}$$

where

$z_j$ denotes the feature information of the jth sample in the consistency depth coding features;

$\mu_c$ denotes the centroid of the c-th cluster obtained by k-means clustering of the consistency depth coding features;

$q_{jc}$ denotes the probability that the jth sample belongs to the c-th cluster;

$p_{jc}$ denotes the reference probability that the jth sample belongs to the c-th cluster.
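The formulas for $q_{jc}$ and $p_{jc}$ are not reproduced in this text. As an illustrative sketch only, the code below assumes the Student's t kernel and the squared, frequency-normalized target distribution commonly used in deep embedded clustering; this is a swapped-in assumption, not a statement of the formulas actually used here. Centroids are taken as given (for example from K-means), and all function names are hypothetical.

```python
from typing import List

Vector = List[float]
Matrix = List[List[float]]


def soft_assignments(Z: List[Vector], centroids: List[Vector]) -> Matrix:
    """q_jc: similarity of sample j to centroid c, normalized over clusters (assumed kernel)."""
    Q = []
    for z in Z:
        sims = [
            1.0 / (1.0 + sum((a - b) ** 2 for a, b in zip(z, mu)))
            for mu in centroids
        ]
        total = sum(sims)
        Q.append([s / total for s in sims])
    return Q


def target_distribution(Q: Matrix) -> Matrix:
    """p_jc: sharpened reference distribution derived from Q (assumed form)."""
    cluster_freq = [sum(column) for column in zip(*Q)]  # soft cluster sizes
    P = []
    for row in Q:
        weights = [q * q / f for q, f in zip(row, cluster_freq)]
        total = sum(weights)
        P.append([w / total for w in weights])
    return P


Z = [[0.0, 0.0], [4.0, 0.0]]          # two encoded samples
centroids = [[0.0, 0.0], [4.0, 0.0]]  # K = 2 centroids, e.g. from K-means
Q = soft_assignments(Z, centroids)
P = target_distribution(Q)
```

Under these assumptions each row of $Q$ and $P$ sums to one, and $P$ is more confident than $Q$, which is what allows it to serve as a reference distribution during training.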

Preferably, in the training phase, the total loss function of the deep clustering network is as follows:

$$L = L_{loss} + L_C + L_R \tag{11}$$

where $L$ denotes the total loss function of the deep clustering network, and $L_R$ denotes the parameter regularization term;

$\lambda_2$ denotes the consistency regularization term adjustment coefficient, and $\lambda_3$ denotes the diversity regularization term adjustment coefficient;

$Q_{i_1}$ denotes the soft label distribution of the $i_1$-th view ($i_1 = 1, \ldots, N$),

$Q_{i_2}$ denotes the soft label distribution of the $i_2$-th view ($i_2 = 1, \ldots, N$).

A user demand discovery system based on multi-view deep clustering comprises:

a text acquisition module, configured to execute S1: acquiring a plurality of texts, each containing a single user requirement description, and vectorizing the texts;

a feature acquisition module, configured to execute S2: obtaining multi-view text representation features from the vectorized texts;

a result clustering module, configured to execute S3: inputting the text representation features of each view into the deep clustering network provided by the invention, and obtaining a user demand clustering result by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate.

A storage medium storing a computer program for user demand discovery based on multi-view deep clustering, wherein the computer program causes a computer to execute the user demand discovery method described above.

An electronic device, comprising:

one or more processors;

a memory; and

one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the user demand discovery method described above.

(III) Advantageous effects

The invention provides a user demand discovery method, a system, a storage medium and electronic equipment based on multi-view deep clustering. Compared with the prior art, the method has the following beneficial effects:

First, a plurality of texts, each containing a single user requirement description, are acquired and vectorized. Next, multi-view text representation features are obtained from the vectorized texts. Finally, the text representation features of each view are input into the deep clustering network provided by the invention, and a user demand clustering result is obtained by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate. Establishing a multi-view collaborative learning mechanism retains view diversity information while mining consistency information, makes full use of the information complementarity and the underlying information consistency among the multi-view data, and improves the accuracy of the user demand clustering result, so that both the representative viewpoints of majority classes and the novel viewpoints of minority classes concerning user demands in user-generated content are effectively mined.

Drawings

To illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the drawings used in the description of the embodiments or the prior art are briefly introduced below. The drawings described below show only some embodiments of the present invention; those skilled in the art can obtain other drawings from them without creative effort.

Fig. 1 is a schematic flowchart of a user requirement discovery method based on multi-view deep clustering according to an embodiment of the present invention;

Fig. 2 is a structural block diagram of a deep clustering network according to an embodiment of the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention are clearly and completely described, and it is obvious that the described embodiments are a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without inventive step based on the embodiments of the present invention, are within the scope of protection of the present invention.

The embodiment of the application provides a user demand discovery method, a system, a storage medium and electronic equipment based on multi-view deep clustering, and solves the technical problem that demand mining cannot be effectively carried out on user generated content.

In order to solve the technical problems, the general idea of the embodiment of the application is as follows:

In the embodiment of the invention, a plurality of texts, each containing a single user requirement description, are first acquired and vectorized. Next, multi-view text representation features are obtained from the vectorized texts. Finally, the text representation features of each view are input into the deep clustering network provided by the invention, and a user demand clustering result is obtained by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate. Establishing a multi-view collaborative learning mechanism retains view diversity information while mining consistency information, makes full use of the information complementarity and the underlying information consistency among the multi-view data, and improves the accuracy of the user demand clustering result, so that both the representative viewpoints of majority classes and the novel viewpoints of minority classes concerning user demands in user-generated content are effectively mined.

In order to better understand the technical scheme, the technical scheme is described in detail in the following with reference to the attached drawings of the specification and specific embodiments.

Embodiment:

As shown in Fig. 1, an embodiment of the present invention provides a user demand discovery method based on multi-view deep clustering, including:

S1, acquiring a plurality of texts, each containing a single user requirement description, and vectorizing the texts;

S2, obtaining multi-view text representation features from the vectorized texts;

S3, inputting the text representation features of each view into a pre-trained deep clustering network, and obtaining a user demand clustering result by means of a deep clustering algorithm in which multi-view consistency and diversity cooperate.

By establishing a multi-view collaborative learning mechanism, the embodiment of the invention retains view diversity information while mining consistency information, makes full use of the information complementarity and the underlying information consistency among the multi-view data, and improves the accuracy of the user demand clustering result, so that both the representative viewpoints of majority classes and the novel viewpoints of minority classes concerning user demands in user-generated content are effectively mined.

The following will describe each step of the above technical solution in detail with reference to the specific content:

in step S1, several texts containing a single user requirement description are acquired and the texts are vectorized.

User-generated content is collected from online reviews, social media, blogs, and the like. Let $x_1, x_2, \ldots, x_M$ denote M texts, each containing a single user requirement description; the texts are vectorized using the word2vec technique.
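As a rough illustration of the vectorization step, the sketch below maps a text to a fixed-length sequence of word vectors. A small hand-written lookup table (`toy_embeddings`) stands in for a trained word2vec model, and whitespace tokenization, zero vectors for unknown words, and zero padding to `max_len` are all assumptions introduced here for the example.

```python
from typing import Dict, List

Vector = List[float]


def vectorize(text: str, embeddings: Dict[str, Vector], max_len: int) -> List[Vector]:
    """Map a text to max_len word vectors, truncating or zero-padding as needed."""
    dim = len(next(iter(embeddings.values())))
    zero = [0.0] * dim
    tokens = text.lower().split()[:max_len]  # naive whitespace tokenization
    rows = [list(embeddings.get(token, zero)) for token in tokens]
    rows += [list(zero) for _ in range(max_len - len(rows))]
    return rows


# Hypothetical embedding table standing in for a trained word2vec model.
toy_embeddings = {"battery": [0.1, 0.9], "life": [0.4, 0.2]}
matrix = vectorize("Battery life short", toy_embeddings, max_len=4)
```

The resulting matrix has one row per token position, which is the shape that convolutional and recurrent text encoders typically consume.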

In step S2, a multi-view text representation feature is acquired from the vectorized text.

To improve the utilization of the information in the user requirement texts, this step respectively constructs a text convolutional neural network and a bidirectional long short-term memory (BiLSTM) network based on a max pooling strategy and an average pooling strategy, and obtains three-view text representation features that take both local features and contextual features into account.

In step S3, the text representation characteristics of each view are input into the deep clustering network provided by the present invention, and a user demand clustering result is obtained by using a deep clustering algorithm with multi-view consistency and diversity cooperation.

It should be noted that, as shown in Fig. 2, the deep clustering network provided by the present invention includes an autoencoder, composed of an encoding layer and a decoding layer, and a clustering layer. The encoding layer consists of a plurality of diversity encoders and one consistency encoder. The decoding layer is fully symmetric to the encoding layer and comprises a plurality of diversity decoders and one consistency decoder. The clustering layer takes the outputs of all encoders as its input; that is, clustering operates on the depth features produced by the encoding layer.

The embodiment of the invention uses a deep neural network to extract deep semantic features from user-generated text, which avoids the time cost, labor cost, and strong subjectivity of manual feature extraction, allows user-generated content to be clustered quickly, reveals the representative demands of the majority and the novel demands of the minority in a timely manner, and helps enterprises update their products and services. In addition, the end-to-end deep clustering network adapts the extracted deep semantic features to the clustering task. This avoids the disconnect between upstream and downstream tasks that arises when traditional feature engineering and the clustering algorithm are independent of each other, allows a globally optimized feature representation to be constructed for the specific clustering algorithm, and improves the clustering effect.

In addition, in order to fuse multi-view diversity and consistency information and reversely update the deep learning network parameters by using the clustering result, the following loss function and parameter training strategy are designed.

The parameter training process comprises the following two steps:

(1) Pre-train the parameters of the deep autoencoder with the goal of minimizing the network reconstruction loss. The loss function $L_{loss}$ is:

where $L_{loss}$ denotes the sample reconstruction loss function;

n is the number of views, and M is the number of samples;

a text vector representing the jth sample in the ith view;

representing the encoding function of the ith view, and theta represents the encoder parameters;

(2) After the pre-trained encoding and decoding layer parameters are obtained, the multi-view data are input into the encoders to obtain the diversity depth coding features of each single view and the consistency depth coding features containing the information of all views, denoted by $Z_i$ ($i = 1, 2, \ldots, N$) and $Z$, respectively. The clustering result is then obtained through the following steps.

(2-1) Calculate the cluster distribution of the diversity depth features.

where

representing encoded feature information of a jth sample in an ith view;

representing a reference probability that a jth sample in the ith view belongs to the c cluster;

(2-2) Calculate the cluster distribution of the consistency depth features.

The consistency depth coding features $Z$ are input into the K-means clustering algorithm to generate the centroids of $K$ initial clusters, and the soft label distribution $Q$ and the target distribution $P$ shared by all views are calculated:

$$Q = [q_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{6}$$

$$P = [p_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{7}$$

where

$\mu_c$ denotes the centroid of the c-th cluster obtained by k-means clustering of the consistency depth coding features;

In addition, in the implementation of the present invention, the loss function of the clustering layer is designed as follows:

where $L_C$ denotes the clustering loss function;

$\lambda_1$ denotes the clustering loss adjustment coefficient;

A total loss function that simultaneously accounts for the sample reconstruction loss $L_{loss}$, the clustering loss $L_C$, and the parameter regularization term $L_R$ is constructed, as shown in formula (11). A stochastic gradient descent algorithm is used to optimize the total loss function $L$, yielding the optimal parameters together with the soft label distribution $Q$ and the target distribution $P$ shared by all views.

The total loss function of the deep clustering network is as follows:

$$L = L_{loss} + L_C + L_R \tag{11}$$

$Q_{i_1}$ denotes the soft label distribution of the $i_1$-th view ($i_1 = 1, \ldots, N$),

$Q_{i_2}$ denotes the soft label distribution of the $i_2$-th view ($i_2 = 1, \ldots, N$).

In the embodiment of the invention, when the loss function is designed, the KL divergence is adopted to measure the difference between the two distributions, so that the loss function is greatly simplified, and the effect of aligning the clustering distribution parameters of each view with the consistent clustering distribution parameters is achieved while the diversity view information is kept.

After the algorithm converges, the final clustering distribution result is generated according to a formula (13) by adopting the globally optimized shared clustering soft label distribution Q:

where $s_j$ denotes the clustering result of the jth sample, $j = 1, 2, \ldots, M$; $c = 1, 2, \ldots, K$; and $q_{jc}$, the entry in the jth row and c-th column of the shared clustering soft label distribution $Q$, denotes the probability that the jth sample belongs to the c-th cluster.
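Formula (13) is not reproduced in this text. Given that each $q_{jc}$ is the probability that sample j belongs to cluster c, a natural reading is that every sample is assigned to its most probable cluster; the sketch below implements that reading as an assumption. The function name `assign_clusters` is hypothetical, and the returned indices are 0-based, whereas the text numbers clusters from 1 to K.

```python
from typing import List


def assign_clusters(Q: List[List[float]]) -> List[int]:
    """Return, for each sample (row of Q), the 0-based index of its most probable cluster."""
    return [max(range(len(row)), key=row.__getitem__) for row in Q]


# Hypothetical shared soft label distribution for M = 2 samples and K = 3 clusters.
Q = [[0.1, 0.7, 0.2], [0.6, 0.3, 0.1]]
labels = assign_clusters(Q)
```

If two clusters tie for a sample, this sketch keeps the lower index; the text does not say how ties are resolved.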

Correspondingly, the S3 specifically includes:

S31, inputting the text representation features of each view into the corresponding diversity encoder for convolution transformation to obtain the diversity depth coding features of the single view; inputting the text representation features of all views into the consistency encoder for convolution transformation to obtain consistency depth coding features containing the information of all views;

The embodiment of the invention provides a user demand discovery system based on multi-view deep clustering, comprising:

a feature acquisition module, configured to execute S2: obtaining multi-view text representation features from the vectorized texts;

An embodiment of the present invention provides a storage medium storing a computer program for user requirement discovery based on multi-view deep clustering, wherein the computer program enables a computer to execute the user requirement discovery method as described above.

An embodiment of the present invention further provides an electronic device, including:

one or more processors;

a memory; and

It can be understood that the system, the storage medium, and the electronic device for discovering the user requirement based on the multi-view deep clustering provided in the embodiment of the present invention correspond to the method for discovering the user requirement based on the multi-view deep clustering provided in the embodiment of the present invention, and the corresponding parts in the method for discovering the user requirement may be referred to for explanation, example, and beneficial effects of the relevant contents, and are not described herein again.

In summary, compared with the prior art, the method has the following beneficial effects:

1. By establishing a multi-view collaborative learning mechanism, the embodiment of the invention retains view diversity information while mining consistency information, makes full use of the information complementarity and the underlying information consistency among the multi-view data, and improves the accuracy of the user demand clustering result, so that both the representative viewpoints of majority classes and the novel viewpoints of minority classes concerning user demands in user-generated content are effectively mined.

2. The embodiment of the invention uses a deep neural network to extract deep semantic features from user-generated text, which avoids the time cost, labor cost, and strong subjectivity of manual feature extraction, allows user-generated content to be clustered quickly, reveals the representative demands of the majority and the novel demands of the minority in a timely manner, and helps enterprises update their products and services. In addition, the end-to-end deep clustering network adapts the extracted deep semantic features to the clustering task. This avoids the disconnect between upstream and downstream tasks that arises when traditional feature engineering and the clustering algorithm are independent of each other, allows a globally optimized feature representation to be constructed for the specific clustering algorithm, and improves the clustering effect.

3. In the embodiment of the invention, when the loss function is designed, the KL divergence is adopted to measure the difference between the two distributions, so that the loss function is greatly simplified, and the effect of aligning the clustering distribution parameters of each view with the consistent clustering distribution parameters is achieved while the diversity view information is kept.

It should be noted that, in this document, relational terms such as "first" and "second" are used solely to distinguish one entity or action from another, without necessarily requiring or implying any actual relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements includes not only those elements but may also include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element introduced by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article, or apparatus that comprises the element.

The above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims

1. A user demand discovery method based on multi-view deep clustering, characterized by comprising the following steps:

2. The method for discovering user requirements according to claim 1, wherein the obtaining of the multi-view text representation features in S2 includes:

respectively constructing a text convolutional neural network and a bidirectional long short-term memory (BiLSTM) network based on a max pooling strategy and an average pooling strategy, and obtaining three-view text representation features that take both local features and contextual features into account.

3. The user demand discovery method according to claim 1 or 2, wherein the deep clustering network comprises an auto-encoder composed of an encoding layer and a decoding layer, and a clustering layer, wherein the encoding layer is composed of a plurality of diversity encoders and a consistency encoder;

the S3 includes:

4. A user demand discovery method according to claim 3, wherein during a training phase, the loss function of said self-encoder is as follows:

where $L_{loss}$ denotes the sample reconstruction loss function;

n is the number of views, and M is the number of samples;

a text vector representing the jth sample in the ith view;

5. The method of claim 4, wherein in the training phase, the loss function of the clustering layer is as follows:

where $L_C$ denotes the clustering loss function;

$\lambda_1$ denotes the clustering loss adjustment coefficient;

6. The user demand discovery method according to claim 5,

the diversity depth coding features and the consistency depth coding features are denoted by $Z_i$ ($i = 1, 2, \ldots, N$) and $Z$, respectively;

where

represents the encoded feature information of the jth sample in the ith view;

represents the centroid of the c-th cluster obtained by k-means clustering of the M sample features of the ith view;

and/or inputting the consistency depth coding features $Z$ into the K-means clustering algorithm to generate the centroids of $K$ initial clusters, and calculating the soft label distribution $Q$ and the target distribution $P$ shared by all views:

$$Q = [q_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{7}$$

$$P = [p_{jc}]_{M \times K}, \quad j = 1, 2, \ldots, M; \; c = 1, 2, \ldots, K \tag{8}$$

$q_{jc}$ denotes the probability that the jth sample belongs to the c-th cluster;

7. The method of claim 5, wherein in the training phase, the total loss function of the deep clustering network is as follows:

$$L = L_{loss} + L_C + L_R \tag{11}$$

where $L$ denotes the total loss function of the deep clustering network, and $L_R$ denotes the parameter regularization term; $\lambda_2$ denotes the consistency regularization term adjustment coefficient, and $\lambda_3$ denotes the diversity regularization term adjustment coefficient;

$Q_{i_1}$ denotes the soft label distribution of the $i_1$-th view,

$Q_{i_2}$ denotes the soft label distribution of the $i_2$-th view.

8. A user demand discovery system based on multi-view deep clustering, comprising:

9. A storage medium storing a computer program for user demand discovery based on multi-view deep clustering, wherein the computer program causes a computer to execute the user demand discovery method according to any one of claims 1 to 7.

10. An electronic device, comprising:

one or more processors;

a memory; and

one or more programs, wherein the one or more programs are stored in the memory and configured to be executed by the one or more processors, the programs comprising instructions for performing the user demand discovery method of any of claims 1-7.