CN112836520A - Method and device for generating user description text based on user characteristics - Google Patents

Method and device for generating user description text based on user characteristics

Info

Publication number
CN112836520A
Authority
CN
China
Prior art keywords
feature
vector
word
model
sublayer
Prior art date
Legal status
Pending
Application number
CN202110189542.8A
Other languages
Chinese (zh)
Inventor
李怀松
黄涛
王睿祺
金先明
张天翼
Current Assignee
Alipay Hangzhou Information Technology Co Ltd
Original Assignee
Alipay Hangzhou Information Technology Co Ltd
Priority date
Filing date
Publication date
Application filed by Alipay Hangzhou Information Technology Co Ltd filed Critical Alipay Hangzhou Information Technology Co Ltd
Priority to CN202110189542.8A priority Critical patent/CN112836520A/en
Publication of CN112836520A publication Critical patent/CN112836520A/en
Pending legal-status Critical Current


Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2415Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on parametric or probabilistic models, e.g. based on likelihood ratio or false acceptance rate versus a false rejection rate
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/049Temporal neural networks, e.g. delay elements, oscillating neurons or pulsed inputs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • G06N3/084Backpropagation, e.g. using gradient descent

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Mathematical Physics (AREA)
  • Molecular Biology (AREA)
  • Software Systems (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computing Systems (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Probability & Statistics with Applications (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

Embodiments of the present specification provide a method and a device for generating a user description text based on user characteristics. The method comprises the following steps: inputting the feature names of various features of a target user and the feature values corresponding to the feature names into a first encoder to obtain initial user feature vectors corresponding to the features; inputting the initial user feature vectors into a retrieval model and performing K iterations to obtain K sentences, where each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector; inputting the K sentences into a second encoder, which encodes them based on an attention mechanism to obtain a semantic representation vector; and inputting the initial user feature vectors and the semantic representation vector into a generation model to generate the user description text of the target user. Both efficiency and text quality can thus be achieved.

Description

Method and device for generating user description text based on user characteristics
Technical Field
One or more embodiments of the present specification relate to the field of computers, and more particularly, to a method and apparatus for generating user description text based on user characteristics.
Background
Since a user's characteristics are associated with the user's category, users can be classified based on their user characteristics. The user characteristics may be data such as the user's age, education level, and income, and the user categories may include a plurality of predetermined categories, for example, whether there is a payment risk, whether there is a money-laundering risk, and so on. Generally, giving only the user characteristics and the user category is not convincing, so after the user characteristics are obtained, a user description text needs to be generated based on them. The user description text comprises a plurality of sentences and embodies the association between the user characteristics and the user category. It is required to be a well-structured report with tight logic, sufficient argumentation, and concise, easy-to-understand wording.
In the prior art, there are two ways to generate a user description text based on user characteristics: one is to compose the user description text manually, which is inefficient; the other is to generate it by machine, which yields poor text quality.
Accordingly, improved solutions are desired that allow for both efficiency and text quality.
Disclosure of Invention
One or more embodiments of the present specification describe a method and apparatus for generating a user description text based on user characteristics, which can take both efficiency and text quality into account.
In a first aspect, a method for generating a user description text based on user characteristics is provided, and the method includes:
inputting feature names of various features of a target user and feature values corresponding to the feature names into a first encoder, and obtaining initial user feature vectors corresponding to the various features through the first encoder;
inputting the initial user feature vectors into a retrieval model, and performing K iterations through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector;
inputting the K sentences into a second encoder, and encoding the K sentences through the second encoder based on an attention mechanism to obtain semantic representation vectors corresponding to the K sentences;
and inputting the initial user feature vectors and the semantic representation vectors into a generation model, and generating a user description text of the target user through the generation model.
In a possible implementation, the types of the features include:
numeric or text.
Further, before obtaining each initial user feature vector corresponding to each feature through the first encoder, the method further includes:
performing word segmentation processing on the original feature value of a feature whose type is text, to obtain a plurality of word segmentation results;
the inputting of the feature names of the features of the target user and the feature values corresponding to the feature names into the first encoder includes:
and inputting the feature name of a feature of the target user whose type is text, together with the plurality of word segmentation results corresponding to the feature name, into the first encoder.
In a possible embodiment, the first encoder comprises a first embedding matrix, a second embedding matrix and a first coding model; obtaining, by the first encoder, initial user feature vectors corresponding to the features respectively includes:
taking any one of the features as a target item feature, and converting the feature name of the target item feature into a first embedded vector through a first embedded matrix;
converting the eigenvalue corresponding to the target item characteristic into a second embedded vector through a second embedded matrix;
and inputting the first embedded vectors and the second embedded vectors corresponding to the respective features into the first coding model, which encodes them based on an attention mechanism to obtain the initial user feature vectors corresponding to the respective features.
In a possible implementation manner, the determining attention coefficients of the current iteration corresponding to the features respectively includes:
and determining attention coefficients corresponding to each feature of the iteration respectively according to the initial user feature vectors and the comprehensive characterization vector obtained by the last iteration.
In a possible embodiment, the retrieving a statement from an artificial knowledge base according to the comprehensive characterization vector includes:
passing the comprehensive characterization vector through a full connection layer to obtain an output vector with dimensions of the number of sentences contained in the artificial knowledge base;
normalizing the output vector to obtain a normalized vector;
and selecting the dimension corresponding to the maximum numerical value in the normalized vector, and taking the statement corresponding to the dimension as the statement retrieved from the artificial knowledge base in the iteration.
In one possible embodiment, the second encoder comprises a second coding model and a self-attention layer; said encoding, by the second encoder, the K statements based on an attention mechanism, comprising:
inputting the word embedding vector of each word included in the K sentences into a second coding model, and determining the word coding vector of each word through the second coding model;
inputting the word coding vector of each word into a self-attention layer, determining an attention coefficient corresponding to each word through the self-attention layer, and performing weighted summation on the word coding vector of each word according to the attention coefficient corresponding to each word to obtain semantic representation vectors corresponding to the K sentences.
In one possible embodiment, the generative model comprises a first sublayer, a second sublayer and an intermediate layer, wherein the first sublayer and the second sublayer are time-sequence-based neural network layers;
the generating of the user description text of the target user through the generative model includes:
the first sublayer takes a word generated at the last moment and the hidden state of the second sublayer at the last moment as the current moment input of the first sublayer to generate the hidden state of the first sublayer at the current moment; wherein the semantic representation vector is used as a hidden state of the second sublayer at the initial moment;
the intermediate layer determines each weight coefficient corresponding to each feature according to the hidden state of the first sublayer at the current moment and each initial user feature vector, and performs weighted summation on each initial user feature vector according to each weight coefficient to obtain an intermediate characterization vector;
the second sublayer takes the intermediate characterization vector and the hidden state of the first sublayer at the current moment as the current moment input of the second sublayer, and generates the hidden state of the second sublayer at the current moment; the hidden state of the second sublayer at the current time is used to determine the word generated at the current time.
In one possible embodiment, the method further comprises:
adjusting parameters of at least one of the first encoder, the retrieval model, the second encoder and the generation model by using a preset total loss function; the total loss function is determined by a first loss function and a second loss function, wherein the function value of the first loss function depends on the generation probability of each word in the artificial description text of the target user in the generation model, and the function value of the second loss function depends on whether each word in the artificial description text of the target user exists in a preset label text or not and the generation probability of each word in the label text in the generation model.
Further, the generation model is a time sequence-based model which sequentially generates words in the user description text at a plurality of moments; the generation probability of each word in the generative model comprises the generation probability of each word obtained by the generative model at each moment.
In a second aspect, an apparatus for generating a user description text based on user characteristics is provided, the apparatus comprising:
the first coding unit is used for inputting the feature names of various features of a target user and the feature values corresponding to the feature names into a first coder, and obtaining various initial user feature vectors corresponding to the various features through the first coder;
the retrieval unit is used for inputting each initial user feature vector obtained by the first coding unit into a retrieval model, and performing K iterations through the retrieval model to obtain K sentences through the K iterations; each iteration comprises the steps of determining each attention coefficient of the current iteration corresponding to each feature, carrying out weighted summation on each initial user feature vector according to each attention coefficient to obtain a comprehensive characterization vector, and retrieving a statement from an artificial knowledge base according to the comprehensive characterization vector;
the second coding unit is used for inputting the K sentences obtained by the retrieval unit into a second coder, and coding the K sentences through the second coder based on an attention mechanism to obtain semantic representation vectors corresponding to the K sentences;
and the generating unit is used for inputting each initial user feature vector obtained by the first encoding unit and the semantic representation vector obtained by the second encoding unit into a generating model, and generating the user description text of the target user through the generating model.
In a third aspect, there is provided a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of the first aspect.
In a fourth aspect, there is provided a computing device comprising a memory having stored therein executable code and a processor that, when executing the executable code, implements the method of the first aspect.
According to the method and the device provided by the embodiments of the present specification, first, the feature names of the various features of a target user and the feature values corresponding to the feature names are input into a first encoder, and the initial user feature vectors corresponding to the features are obtained through the first encoder; then the initial user feature vectors are input into a retrieval model, and K iterations are performed through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector; the K sentences are input into a second encoder and encoded based on an attention mechanism to obtain the semantic representation vector corresponding to the K sentences; finally, the initial user feature vectors and the semantic representation vector are input into a generation model, and the user description text of the target user is generated through the generation model. The user description text is thus generated automatically by machine, which is efficient. In this process, both the initial user feature vectors corresponding to the features of the target user and the semantic representation vector corresponding to the K retrieved sentences are used, and the K sentences come from the artificial knowledge base, so the manual experience most relevant to the target user can be exploited effectively, problems such as repeated words and wrong words are well handled, the applicability is strong, the text quality is good, and both efficiency and text quality can be achieved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on these drawings without creative efforts.
FIG. 1 is a schematic diagram illustrating an implementation scenario of an embodiment disclosed herein;
FIG. 2 illustrates a flow diagram of a method for generating user description text based on user characteristics, according to one embodiment;
FIG. 3 illustrates a retrieval system architecture diagram according to one embodiment;
FIG. 4 shows a schematic block diagram of a second encoder according to an embodiment;
FIG. 5 illustrates a structural diagram of a generative model according to one embodiment;
FIG. 6 shows a schematic block diagram of an apparatus for generating user description text based on user characteristics, according to one embodiment.
Detailed Description
The scheme provided by the specification is described below with reference to the accompanying drawings.
Fig. 1 is a schematic view of an implementation scenario of an embodiment disclosed in this specification. The scenario involves generating a user description text based on user features. A user may be classified based on the user's characteristics. The user characteristics may be data such as the user's age, education level, and income, and the user categories may include a plurality of predetermined categories, for example, whether there is a payment risk, whether there is a money-laundering risk, and so on. The user description text comprises a plurality of sentences and embodies the association between the user characteristics and the user category. It is required to be a well-structured report with tight logic, sufficient argumentation, and concise, easy-to-understand wording.
Referring to fig. 1, the table lists the feature names and the corresponding feature values of the various features of user A. With user A as the target user, for example, the feature name "age" has the feature value "50", the feature name "education level" has the feature value "high school", … …, and the feature name "annual income" has the feature value "30,000 yuan". Based on the feature names and feature values of user A, the generated user description text may be "User A is older, has a lower education level, has a lower annual income, … …, and therefore has a repayment risk." In the embodiments of the present specification, the user characteristics may include, but are not limited to, user attribute features such as the age, education level and annual income listed above, and may further include historical behavior features of the user in a specific application, for example, the historical loan amount, whether payments have been overdue, and so on. The specific content and manner of generation of the user description text are generally not fixed. In the embodiments of the present specification, the user description text is generated by combining expert experience with machine learning. The user description text is generated automatically by machine with the help of expert experience, i.e. manual experience, so the efficiency is high; in this process, not only the various features of the target user but also sentences retrieved from the artificial knowledge base are used, so the manual experience most relevant to the target user can be exploited effectively, problems such as repeated words and wrong words are well handled, the applicability is strong, the text quality is good, and both efficiency and text quality can be achieved.
Fig. 2 shows a flowchart of a method for generating a user description text based on user characteristics according to an embodiment, which may be based on the implementation scenario shown in fig. 1. As shown in fig. 2, the method in this embodiment includes the following steps: step 21, inputting the feature names of the various features of a target user and the feature values corresponding to the feature names into a first encoder, and obtaining the initial user feature vectors corresponding to the features through the first encoder; step 22, inputting the initial user feature vectors into a retrieval model, and performing K iterations through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector; step 23, inputting the K sentences into a second encoder, and encoding them through the second encoder based on an attention mechanism to obtain the semantic representation vector corresponding to the K sentences; and step 24, inputting the initial user feature vectors and the semantic representation vector into a generation model, and generating the user description text of the target user through the generation model. Specific execution modes of these steps are described below.
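To make the overall data flow of steps 21 to 24 concrete, the following minimal Python sketch (using PyTorch tensors) strings the four components together; the callable names (first_encoder, retrieval_model, and so on), the zero initialization of the comprehensive characterization vector and the value of K are illustrative assumptions, not the reference implementation of this specification.

```python
import torch

def generate_description(feature_names, feature_values, first_encoder,
                         retrieval_model, knowledge_base, second_encoder,
                         generation_model, K=3):
    # Step 21: encode (feature name, feature value) pairs into initial user feature vectors, shape [N, dim]
    X = first_encoder(feature_names, feature_values)

    # Step 22: K retrieval iterations, each producing one sentence from the artificial knowledge base
    sentences = []
    C = torch.zeros(X.size(1))              # comprehensive characterization vector before the first iteration
    for _ in range(K):
        C, idx = retrieval_model(X, C)      # attention over the features conditioned on the previous C
        sentences.append(knowledge_base[idx])

    # Step 23: encode the K retrieved sentences into one semantic representation vector H
    H = second_encoder(sentences)

    # Step 24: decode the user description text from the feature vectors X and the semantic vector H
    return generation_model(X, H)
```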
First, in step 21, the feature names of the various features of the target user and the feature values corresponding to the feature names are input into the first encoder, and the initial user feature vectors corresponding to the features are obtained through the first encoder. It is understood that the first encoder may be based on various model structures, such as a Transformer, a long short-term memory network (LSTM), a gated recurrent unit (GRU), and so on.
In one example, the types of the various features include:
numeric or text.
For example, the age of user A is 50, which is a numerical feature: the feature name is age and the corresponding feature value is 50. The place of user A is Beijing and Shanghai, which is a text-type feature: the feature name is place and the corresponding feature values are Beijing and Shanghai.
It will be appreciated that the type of feature is also the type of its corresponding feature value.
In the embodiment of the present specification, for a feature whose type is a numerical type, a feature name of the feature and an original feature value corresponding to the feature name may be input to a first encoder; for the feature with text type, the corresponding original feature value can be firstly subjected to word segmentation to obtain a plurality of word segmentation results, and then the feature name of the feature and the plurality of word segmentation results corresponding to the feature name are input into the first encoder.
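As a small illustration of this pre-processing for text-type features, the sketch below uses the open-source jieba tokenizer; the feature name and value shown are assumed examples, and any Chinese word segmentation tool could be used instead.

```python
import jieba

feature_name = "所在地"                    # an assumed text-type feature (place)
raw_value = "北京上海"                     # its original feature value

segments = jieba.lcut(raw_value)           # word segmentation results, e.g. ["北京", "上海"]
encoder_input = (feature_name, segments)   # fed to the first encoder together with the feature name
```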
In one example, the first encoder includes a first embedding matrix, a second embedding matrix, and a first coding model; obtaining, by the first encoder, initial user feature vectors corresponding to the features respectively includes:
taking any one of the features as a target item feature, and converting the feature name of the target item feature into a first embedded vector through a first embedded matrix;
converting the eigenvalue corresponding to the target item characteristic into a second embedded vector through a second embedded matrix;
and inputting the first embedded vectors and the second embedded vectors corresponding to the respective features into the first coding model, which encodes them based on an attention mechanism to obtain the initial user feature vectors corresponding to the respective features.
For example, the first embedded vector corresponding to the i-th feature is denoted x_i_feature, and the second embedded vector corresponding to the i-th feature is denoted x_i_value. The first coding model has a Transformer structure, and x_i = [x_i_feature, x_i_value] is encoded with the Transformer to obtain the initial user feature vector corresponding to the i-th feature. The Transformer structure mainly comprises an attention layer, a residual layer, a normalization layer, a feedforward layer, and the like.
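A minimal PyTorch sketch of such a first encoder is given below; the vocabulary handling, embedding sizes and the use of nn.TransformerEncoder as the first coding model are illustrative assumptions rather than the exact configuration of this embodiment.

```python
import torch
import torch.nn as nn

class FirstEncoder(nn.Module):
    def __init__(self, name_vocab, value_vocab, dim=64):
        super().__init__()
        self.name_emb = nn.Embedding(name_vocab, dim)    # first embedding matrix (feature names)
        self.value_emb = nn.Embedding(value_vocab, dim)  # second embedding matrix (feature values)
        layer = nn.TransformerEncoderLayer(d_model=2 * dim, nhead=4, batch_first=True)
        self.coder = nn.TransformerEncoder(layer, num_layers=2)  # first coding model

    def forward(self, name_ids, value_ids):
        # name_ids, value_ids: [N] token ids of the feature names / feature values of one user;
        # for simplicity one value id per feature (a text feature with several segments could be pooled first)
        x_feature = self.name_emb(name_ids)        # first embedded vectors,  [N, dim]
        x_value = self.value_emb(value_ids)        # second embedded vectors, [N, dim]
        x = torch.cat([x_feature, x_value], dim=-1)        # x_i = [x_i_feature, x_i_value]
        X = self.coder(x.unsqueeze(0)).squeeze(0)          # attention across the N features
        return X                                           # initial user feature vectors, [N, 2*dim]
```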
Then, in step 22, the initial user feature vectors are input into the retrieval model, and K iterations are performed through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from the artificial knowledge base according to the comprehensive characterization vector. It can be understood that the sentences in the artificial knowledge base embody manual experience, and the manual experience relevant to the target user can be obtained through retrieval.
In an example, the determining attention coefficients of the current iteration respectively corresponding to the features includes:
and determining attention coefficients corresponding to each feature of the iteration respectively according to the initial user feature vectors and the comprehensive characterization vector obtained by the last iteration.
For example, let X_i denote each initial user feature vector, with i from 1 to N, i.e. there are N features in total, and let X denote the vector obtained by combining the X_i:

X = [X_1, X_2, …, X_N]

W_X and W_C are parameters to be learned, C_{t-1} is the comprehensive characterization vector obtained in the last iteration (the comprehensive characterization vector of the last moment), and C_t is the comprehensive characterization vector obtained in the current iteration (the comprehensive characterization vector of the current moment).

First, z_t can be determined by the following formula:

z_t = tanh(W_X · X + W_C · C_{t-1})

where tanh denotes the activation function.

Then z_t is normalized by the following formula to obtain the attention coefficients:

α_t = softmax(z_t), where α_t denotes the vector formed by the attention coefficients α_{t,i}.

Finally, the attention coefficients are used to perform a weighted summation over the X_i, giving the comprehensive characterization vector of this iteration:

C_t = Σ_{i=1}^{N} α_{t,i} · X_i
in one example, the retrieving a statement from an artificial knowledge base based on the comprehensive characterization vector includes:
passing the comprehensive characterization vector through a full connection layer to obtain an output vector with dimensions of the number of sentences contained in the artificial knowledge base;
normalizing the output vector to obtain a normalized vector;
and selecting the dimension corresponding to the maximum numerical value in the normalized vector, and taking the statement corresponding to the dimension as the statement retrieved from the artificial knowledge base in the iteration.
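The following sketch shows one retrieval iteration as described above, with the attention step following the formulas for z_t, α_t and C_t, and the sentence selection implemented as a fully connected layer plus softmax over the knowledge-base sentences; the module layout and dimension choices are assumptions for illustration.

```python
import torch
import torch.nn as nn

class RetrievalStep(nn.Module):
    def __init__(self, n_features, dim, kb_size):
        super().__init__()
        self.W_X = nn.Linear(n_features * dim, n_features, bias=False)  # acts on the combined vector X
        self.W_C = nn.Linear(dim, n_features, bias=False)               # acts on the previous C_{t-1}
        self.fc = nn.Linear(dim, kb_size)                               # fully connected layer over KB sentences

    def forward(self, X, C_prev):
        # X: [N, dim] initial user feature vectors; C_prev: [dim] comprehensive vector of the last iteration
        z = torch.tanh(self.W_X(X.reshape(-1)) + self.W_C(C_prev))      # z_t, one score per feature, [N]
        alpha = torch.softmax(z, dim=0)                                  # attention coefficients alpha_t
        C = (alpha.unsqueeze(-1) * X).sum(dim=0)                         # C_t = sum_i alpha_{t,i} X_i
        probs = torch.softmax(self.fc(C), dim=0)                         # normalized vector over KB sentences
        idx = int(torch.argmax(probs))                                   # dimension with the largest value
        return C, idx
```

Running this step K times, with C initialized, for example, to a zero vector, yields the K sentence indices used in step 22.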
FIG. 3 shows an architecture diagram of a retrieval system according to one embodiment. Referring to fig. 3, the retrieval system includes the first encoder and the retrieval model. The features of the target user are input into the first encoder, the initial user feature vectors corresponding to the features are obtained through the first encoder, X denotes the vector obtained by combining the initial user feature vectors, X is input into the retrieval model, and K sentences are retrieved from the artificial knowledge base through the retrieval model. Here N is the total number of sentences contained in the artificial knowledge base; N is usually large, for example hundreds or thousands, and K is a predetermined number, for example 2, 3 or 5. The retrieval system performs the actions of steps 21 and 22 above to obtain the K sentences.
Then, in step 23, the K sentences are input into the second encoder and encoded through the second encoder based on an attention mechanism to obtain the semantic representation vector corresponding to the K sentences. It will be appreciated that the second encoder may be based on various model structures, for example a Transformer, an LSTM, a GRU, and so on.
In one example, the second encoder includes a second coding model and a self attention layer; said encoding, by the second encoder, the K statements based on an attention mechanism, comprising:
inputting the word embedding vector of each word included in the K sentences into a second coding model, and determining the word coding vector of each word through the second coding model;
inputting the word coding vector of each word into a self-attention layer, determining an attention coefficient corresponding to each word through the self-attention layer, and performing weighted summation on the word coding vector of each word according to the attention coefficient corresponding to each word to obtain semantic representation vectors corresponding to the K sentences.
For example, the second coding model has a Transformer structure, the K sentences contain l words in total, w_i is the word embedding vector of the i-th word in the K sentences, s_i is the word encoding vector of the i-th word determined by the Transformer, α_i is the attention coefficient of the i-th word, H is the semantic representation vector, and W_s is a parameter to be learned.

First, s_i can be determined by the following formula:

s_i = Transformer(w_i)

Then, α_i is determined by the following formula:

α_i = softmax(W_s · tanh(s_i))

where tanh denotes the activation function and the softmax is taken over the l words.

Finally, H is determined by the following formula:

H = Σ_{i=1}^{l} α_i · s_i
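A corresponding sketch of the second encoder (Transformer word encoding followed by self-attention pooling) is shown below; the hyper-parameters and the exact scoring form are assumptions consistent with the formulas above.

```python
import torch
import torch.nn as nn

class SecondEncoder(nn.Module):
    def __init__(self, dim=128, nhead=4, layers=2):
        super().__init__()
        layer = nn.TransformerEncoderLayer(d_model=dim, nhead=nhead, batch_first=True)
        self.coder = nn.TransformerEncoder(layer, num_layers=layers)  # second coding model
        self.W_s = nn.Linear(dim, 1, bias=False)                      # self-attention scoring (W_s)

    def forward(self, word_embeddings):
        # word_embeddings: [l, dim] embedding vectors of the l words in the K retrieved sentences
        s = self.coder(word_embeddings.unsqueeze(0)).squeeze(0)             # word encoding vectors s_i, [l, dim]
        alpha = torch.softmax(self.W_s(torch.tanh(s)).squeeze(-1), dim=0)   # attention coefficient per word, [l]
        H = (alpha.unsqueeze(-1) * s).sum(dim=0)                            # semantic representation vector H
        return H
```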
Fig. 4 shows a schematic structural diagram of the second encoder according to an embodiment. Referring to fig. 4, the second encoder includes the second coding model and a self-attention layer. The word embedding vector w_i of each word included in the K sentences is input into the second coding model, which determines the word encoding vector s_i of each word; the word encoding vectors s_i are then input into the self-attention layer, which outputs the semantic representation vector H corresponding to the K sentences.
Finally, in step 24, the initial user feature vectors and the semantic representation vector are input into the generation model, and the user description text of the target user is generated through the generation model. It can be understood that the generation model functions as a decoder. The generated user description text can be used directly as the final user description text, or it can be further edited and processed manually to form the final user description text, which helps to improve the efficiency of producing the text manually.
In one example, the generative model comprises a first sublayer, a second sublayer and an intermediate layer, wherein the first sublayer and the second sublayer are time-sequence-based neural network layers;
the generating of the user description text of the target user through the generative model includes:
the first sublayer takes a word generated at the last moment and the hidden state of the second sublayer at the last moment as the current moment input of the first sublayer to generate the hidden state of the first sublayer at the current moment; wherein the semantic representation vector is used as a hidden state of the second sublayer at the initial moment;
the intermediate layer determines each weight coefficient corresponding to each feature according to the hidden state of the first sublayer at the current moment and each initial user feature vector, and performs weighted summation on each initial user feature vector according to each weight coefficient to obtain an intermediate characterization vector;
the second sublayer takes the intermediate characterization vector and the hidden state of the first sublayer at the current moment as the current moment input of the second sublayer, and generates the hidden state of the second sublayer at the current moment; the hidden state of the second sublayer at the current time is used to determine the word generated at the current time.
For example, let h2_{t-1} denote the hidden state of the second sublayer at time t-1, w_{t-1} the word generated at time t-1, h1_t the hidden state of the first sublayer at time t, and f_j the initial user feature vector of the j-th feature (f_j and the aforementioned X_i are not essentially different; both denote initial user feature vectors); w^T, W_fb and W_hb are parameters to be learned.

First, h1_t is determined through the first sublayer, which can be expressed by the following formula:

h1_t = Sublayer_1(w_{t-1}, h2_{t-1})

Then the intermediate characterization vector c_t is determined through the intermediate layer, which involves the following formulas:

b_{j,t} = w^T · tanh(W_fb · f_j + W_hb · h1_t)

where tanh denotes the activation function.

β_t = softmax(b_t), where b_t denotes the vector formed by the b_{j,t} corresponding to the features, and β_t denotes the vector formed by the weight coefficients corresponding to the features.

c_t = Σ_{j=1}^{M} β_{t,j} · f_j

where M denotes how many user features there are in total.

Finally, the hidden state h2_t of the second sublayer at time t is determined through the second sublayer, which can be expressed by the following formula:

h2_t = Sublayer_2(c_t, h1_t)
FIG. 5 illustrates a structural schematic of the generative model according to one embodiment. Referring to fig. 5, the generative model includes a first sublayer, a second sublayer and an intermediate layer; the first sublayer and the second sublayer are time-sequence-based neural network layers, which can be, but are not limited to, Transformer, LSTM or GRU layers. The first sublayer takes the word w_{t-1} generated at the previous moment and the hidden state h2_{t-1} of the second sublayer at the previous moment as its current-moment input and generates the hidden state h1_t of the first sublayer at the current moment, where the semantic representation vector H is used as the hidden state h2_0 of the second sublayer at the initial moment. The intermediate layer determines the weight coefficients β_t corresponding to the features according to the hidden state h1_t of the first sublayer at the current moment and the initial user feature vectors [f_1, …, f_M], and performs weighted summation on the initial user feature vectors according to the weight coefficients to obtain the intermediate characterization vector c_t. The second sublayer takes the intermediate characterization vector c_t and the hidden state h1_t of the first sublayer at the current moment as its current-moment input and generates the hidden state h2_t of the second sublayer at the current moment; the hidden state h2_t is used to determine the word w_t generated at the current moment.
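The per-moment decoding step of FIG. 5 can be sketched as follows; LSTM cells stand in for the time-sequence-based sublayers, and the embedding and output projections are simplified assumptions.

```python
import torch
import torch.nn as nn

class GenerationModel(nn.Module):
    def __init__(self, vocab_size, dim):
        super().__init__()
        self.word_emb = nn.Embedding(vocab_size, dim)
        self.sub1 = nn.LSTMCell(2 * dim, dim)        # first sublayer (time-sequence based)
        self.sub2 = nn.LSTMCell(2 * dim, dim)        # second sublayer
        self.w = nn.Linear(dim, 1, bias=False)       # w^T
        self.W_fb = nn.Linear(dim, dim, bias=False)  # applied to the feature vectors f_j
        self.W_hb = nn.Linear(dim, dim, bias=False)  # applied to the first-sublayer hidden state
        self.out = nn.Linear(dim, vocab_size)        # maps the second-sublayer state to a word distribution

    def step(self, prev_word, state1, state2, F):
        # prev_word: LongTensor [1] holding w_{t-1}; F: [M, dim] initial user feature vectors f_j
        # state1/state2: (h, c) tuples of the two LSTM sublayers, each tensor of shape [1, dim]
        h2_prev = state2[0]                                            # hidden state of sublayer 2 at t-1
        x1 = torch.cat([self.word_emb(prev_word), h2_prev], dim=-1)    # input (w_{t-1}, h2_{t-1}), [1, 2*dim]
        state1 = self.sub1(x1, state1)
        h1 = state1[0]                                                 # hidden state of sublayer 1 at t

        b = self.w(torch.tanh(self.W_fb(F) + self.W_hb(h1))).squeeze(-1)   # b_{j,t}, [M]
        beta = torch.softmax(b, dim=0)                                      # weight coefficients beta_t
        c = (beta.unsqueeze(-1) * F).sum(dim=0, keepdim=True)               # intermediate vector c_t, [1, dim]

        x2 = torch.cat([c, h1], dim=-1)                                     # input (c_t, h1_t), [1, 2*dim]
        state2 = self.sub2(x2, state2)
        logits = self.out(state2[0])                                        # distribution used to pick w_t
        return logits, state1, state2
```

At decoding time, the semantic representation vector H would serve as the initial hidden state of the second sublayer, e.g. state2 = (H.view(1, -1), torch.zeros(1, dim)).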
According to the embodiments of the present specification, the generative model can make fuller use of the original features and of the hidden state at each moment, so the quality of the generated user description text is better guaranteed.
In one example, the method further comprises:
adjusting parameters of at least one of the first encoder, the retrieval model, the second encoder and the generation model by using a preset total loss function; the total loss function is determined by a first loss function and a second loss function, wherein the function value of the first loss function depends on the generation probability of each word in the artificial description text of the target user in the generation model, and the function value of the second loss function depends on whether each word in the artificial description text of the target user exists in a preset label text or not and the generation probability of each word in the label text in the generation model.
Further, the generation model is a time sequence-based model which sequentially generates words in the user description text at a plurality of moments; the generation probability of each word in the generative model comprises the generation probability of each word obtained by the generative model at each moment.
The total loss function described above can be formulated, for example, as a combination of the two parts:

L_total = L_1 + α · L_2

where α may be a predetermined constant, L_1 is the first loss function and L_2 is the second loss function.

The first loss function corresponds to the expert-experience part: manual experience, i.e. expert rules, forms a description text for each client, and the probability of each word in that text is taken from the generation model. With m denoting the number of words in the text, the maximum generation probability of each word over the decoding moments is taken and its logarithm is computed to form the loss function of the expert-experience part, which is used to correct the model parameters: when the machine generates a correct word, the probability of that correct word is reinforced; when the machine generates a wrong word, the probability of the wrong word is reduced and the probability of the correct word is increased, which also accelerates the convergence of the model. The first loss function may also take other forms, for example replacing the maximum of the generation probabilities with their average or minimum.

The second loss function corresponds to the machine-learning part: the cross entropy of an ordinary generative model can be used as this loss function.
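As a sketch of how such a combined loss could be computed, assume the expert-experience term uses, for each word of the manual description text, the maximum generation probability across the decoding moments, and the machine-learning term is the ordinary cross entropy against the label text; the weighting α and the normalization chosen here are assumptions.

```python
import torch
import torch.nn.functional as F

def total_loss(step_probs, expert_word_ids, label_word_ids, alpha=0.5):
    # step_probs: [T, vocab] generation probabilities of the model at each of the T decoding moments
    # expert_word_ids: LongTensor of ids of the words in the manually written (expert) description text
    # label_word_ids: LongTensor of ids of the words in the preset label text

    # first loss (expert-experience part): for each expert word, take the maximum probability
    # the model ever assigns to it across the T moments, then the negative log
    expert_probs = step_probs[:, expert_word_ids].max(dim=0).values      # [m]
    loss_expert = -(torch.log(expert_probs + 1e-9)).mean()

    # second loss (machine-learning part): ordinary cross entropy against the label text,
    # assuming the first len(label_word_ids) moments are aligned with the label words
    T = len(label_word_ids)
    loss_ce = F.nll_loss(torch.log(step_probs[:T] + 1e-9), label_word_ids)

    return loss_expert + alpha * loss_ce
```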
According to the embodiment of the specification, the expert experience part is added into the total loss function, so that the accuracy and the coverage rate of the generated user description text can be improved, and a good effect can be achieved under the condition that the number of the label texts is small.
In the embodiments of the present specification, the evaluation index is also optimized. Evaluation indexes are generally used to judge how good a generated text is, and the quality of generation is mostly evaluated through text similarity. In a specific field such as anti-money laundering, however, it is more important that the risk points are described accurately and covered as completely as possible, so the following evaluation indexes are proposed:
Recall: the original text and the generated text are split into sentences; the number of sentences appearing in both texts is the numerator and the number of sentences in the original text is the denominator; the resulting ratio is the recall.
Precision: the numerator is the same as that of Recall, the denominator is the number of sentences in the text generated by the model, and the resulting ratio is the precision.
Human Evaluation: manual sampling inspection. For example, given 100 reports generated by the model, the number of qualified reports is judged manually; if 90 are qualified, the quality of the text generated by the model is 90%.
According to the method provided by the embodiments of the present specification, first, the feature names of the various features of a target user and the feature values corresponding to the feature names are input into a first encoder, and the initial user feature vectors corresponding to the features are obtained through the first encoder; then the initial user feature vectors are input into a retrieval model, and K iterations are performed through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector; the K sentences are input into a second encoder and encoded based on an attention mechanism to obtain the semantic representation vector corresponding to the K sentences; finally, the initial user feature vectors and the semantic representation vector are input into a generation model, and the user description text of the target user is generated through the generation model. The user description text is thus generated automatically by machine, which is efficient. In this process, both the initial user feature vectors corresponding to the features of the target user and the semantic representation vector corresponding to the K retrieved sentences are used, and the K sentences come from the artificial knowledge base, so the manual experience most relevant to the target user can be exploited effectively, problems such as repeated words and wrong words are well handled, the applicability is strong, the text quality is good, and both efficiency and text quality can be achieved.
According to an embodiment of another aspect, an apparatus for generating a user description text based on a user characteristic is further provided, and the apparatus is configured to perform the method for generating a user description text based on a user characteristic provided in the embodiments of the present specification. FIG. 6 shows a schematic block diagram of an apparatus for generating user description text based on user characteristics, according to one embodiment. As shown in fig. 6, the apparatus 600 includes:
a first encoding unit 61, configured to input feature names of various features of a target user and feature values corresponding to the feature names into a first encoder, and obtain, by using the first encoder, initial user feature vectors corresponding to the various features respectively;
a retrieval unit 62, configured to input each initial user feature vector obtained by the first encoding unit 61 into a retrieval model, and perform K iterations through the retrieval model to obtain K statements through the K iterations; each iteration comprises the steps of determining each attention coefficient of the current iteration corresponding to each feature, carrying out weighted summation on each initial user feature vector according to each attention coefficient to obtain a comprehensive characterization vector, and retrieving a statement from an artificial knowledge base according to the comprehensive characterization vector;
a second encoding unit 63, configured to input the K statements obtained by the retrieval unit 62 into a second encoder, and encode the K statements by using the second encoder based on an attention mechanism to obtain semantic representation vectors corresponding to the K statements;
and a generating unit 64, configured to input each initial user feature vector obtained by the first encoding unit 61 and the semantic representation vector obtained by the second encoding unit 63 into a generation model, and generate a user description text of the target user through the generation model.
Optionally, as an embodiment, the types of the features include:
numeric or text.
Further, the apparatus further comprises:
a word segmentation unit, configured to perform word segmentation processing on an original feature value of a feature of a text type before the first encoding unit 61 obtains, through the first encoder, each initial user feature vector corresponding to each feature, so as to obtain a plurality of word segmentation results;
the first encoding unit 61 is specifically configured to input the feature name of a feature of the target user whose type is text, together with the plurality of word segmentation results corresponding to the feature name, into the first encoder.
Optionally, as an embodiment, the first encoder includes a first embedding matrix, a second embedding matrix, and a first coding model; the first encoding unit 61 includes:
the first embedding subunit is used for taking any one of the features as a target item feature and converting the feature name of the target item feature into a first embedding vector through a first embedding matrix;
the second embedding subunit is used for converting the eigenvalue corresponding to the target item characteristic into a second embedding vector through a second embedding matrix;
and the coding subunit is configured to input, for each feature, the first embedded vector obtained by the first embedding subunit and the second embedded vector obtained by the second embedding subunit into the first coding model, which encodes them based on an attention mechanism to obtain the initial user feature vectors corresponding to the respective features.
Optionally, as an embodiment, the retrieving unit 62 is specifically configured to determine, according to the initial user feature vectors and the comprehensive characterization vector obtained in the last iteration, attention coefficients corresponding to each feature of the current iteration respectively.
Optionally, as an embodiment, the retrieving unit 62 includes:
the full-connection subunit is used for enabling the comprehensive characterization vector to pass through a full-connection layer to obtain an output vector with the dimensionality being the number of the sentences contained in the artificial knowledge base;
the normalization subunit is used for normalizing the output vector obtained by the full-connection subunit to obtain a normalized vector;
and the determining subunit is used for selecting the dimension corresponding to the maximum numerical value in the normalized vector obtained by the normalizing subunit, and taking the statement corresponding to the dimension as the statement retrieved from the artificial knowledge base in the iteration.
Optionally, as an embodiment, the second encoder includes a second coding model and a self attention layer; the second encoding unit 63 includes:
the coding subunit is used for inputting the word embedding vector of each word included in the K sentences into a second coding model, and determining the word coding vector of each word through the second coding model;
and the self-attention subunit is used for inputting the word coding vector of each word obtained by the coding subunit into a self-attention layer, determining the attention coefficient corresponding to each word through the self-attention layer, and performing weighted summation on the word coding vector of each word according to the attention coefficient corresponding to each word to obtain the semantic representation vector corresponding to the K sentences.
Optionally, as an embodiment, the generative model comprises a first sublayer, a second sublayer and an intermediate layer, wherein the first sublayer and the second sublayer are time-sequence-based neural network layers;
the generating unit 64 includes:
the first processing subunit is configured to input, as the current time of the first sublayer, a word generated at a previous time and a hidden state of the second sublayer at the previous time as input of the first sublayer, and generate a hidden state of the first sublayer at the current time; wherein the semantic representation vector is used as a hidden state of the second sublayer at the initial moment;
the intermediate processing subunit is configured to determine, through the intermediate layer, each weight coefficient corresponding to each feature according to the hidden state of the first sublayer at the current time and each initial user feature vector generated by the first processing subunit, and perform weighted summation on each initial user feature vector according to each weight coefficient to obtain an intermediate characterization vector;
the second processing subunit is configured to use, through the second sublayer, the intermediate characterization vector obtained by the intermediate processing subunit and the hidden state of the first sublayer at the current time, generated by the first processing subunit, as its current time input, and generate the hidden state of the second sublayer at the current time; the hidden state of the second sublayer at the current time is used to determine the word generated at the current time.
Optionally, as an embodiment, the apparatus further includes:
a parameter adjusting unit, configured to adjust a parameter of at least one of the first encoder, the search model, the second encoder, and the generation model using a preset total loss function; the total loss function is determined by a first loss function and a second loss function, wherein the function value of the first loss function depends on the generation probability of each word in the artificial description text of the target user in the generation model, and the function value of the second loss function depends on whether each word in the artificial description text of the target user exists in a preset label text or not and the generation probability of each word in the label text in the generation model.
Further, the generation model is a time sequence-based model which sequentially generates words in the user description text at a plurality of moments; the generation probability of each word in the generative model comprises the generation probability of each word obtained by the generative model at each moment.
With the device provided by the embodiments of the present specification, first, the first encoding unit 61 inputs the feature names of the various features of a target user and the feature values corresponding to the feature names into the first encoder, and obtains the initial user feature vectors corresponding to the features through the first encoder; then the retrieval unit 62 inputs the initial user feature vectors into the retrieval model and performs K iterations through the retrieval model to obtain K sentences; each iteration comprises determining the attention coefficients of the current iteration corresponding to the features, performing weighted summation on the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from the artificial knowledge base according to the comprehensive characterization vector; next, the second encoding unit 63 inputs the K sentences into the second encoder and encodes them based on an attention mechanism to obtain the semantic representation vector corresponding to the K sentences; finally, the generating unit 64 inputs the initial user feature vectors and the semantic representation vector into the generation model and generates the user description text of the target user through the generation model. The user description text is thus generated automatically by machine, which is efficient. In this process, both the initial user feature vectors corresponding to the features of the target user and the semantic representation vector corresponding to the K retrieved sentences are used, and the K sentences come from the artificial knowledge base, so the manual experience most relevant to the target user can be exploited effectively, problems such as repeated words and wrong words are well handled, the applicability is strong, the text quality is good, and both efficiency and text quality can be achieved.
According to an embodiment of another aspect, there is also provided a computer-readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the method described in connection with fig. 2.
According to an embodiment of yet another aspect, there is also provided a computing device comprising a memory having stored therein executable code, and a processor that, when executing the executable code, implements the method described in connection with fig. 2.
Those skilled in the art will recognize that, in one or more of the examples described above, the functions described in this invention may be implemented in hardware, software, firmware, or any combination thereof. When implemented in software, the functions may be stored on or transmitted over as one or more instructions or code on a computer-readable medium.
The above embodiments further describe the objects, technical solutions and advantages of the present invention in detail. It should be understood that they are only exemplary embodiments of the present invention and are not intended to limit its scope; any modifications, equivalent substitutions, improvements and the like made on the basis of the technical solutions of the present invention shall fall within the scope of the present invention.

Claims (22)

1. A method of generating user description text based on user characteristics, the method comprising:
inputting feature names of various features of a target user and feature values corresponding to the feature names into a first encoder, and obtaining initial user feature vectors corresponding to the various features through the first encoder;
inputting the initial user feature vectors into a retrieval model, and performing K iterations through the retrieval model to obtain K sentences; each iteration comprises determining attention coefficients of the current iteration respectively corresponding to the features, performing a weighted summation of the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector;
inputting the K sentences into a second encoder, and encoding the K sentences through the second encoder based on an attention mechanism to obtain semantic representation vectors corresponding to the K sentences;
and inputting the initial user feature vectors and the semantic representation vectors into a generation model, and generating a user description text of the target user through the generation model.
2. The method of claim 1, wherein the types of features comprise:
numeric or text.
3. The method of claim 2, wherein before obtaining, by the first encoder, initial user feature vectors corresponding to respective features, the method further comprises:
performing word segmentation on the original feature value of a feature whose type is text, to obtain a plurality of word segmentation results;
the inputting of the feature names of the features of the target user and the feature values corresponding to the feature names into the first encoder includes:
and inputting, into the first encoder, the feature name of a feature of the target user whose type is text together with the plurality of word segmentation results corresponding to that feature name.
4. The method of claim 1, wherein the first encoder comprises a first embedding matrix, a second embedding matrix, and a first coding model; obtaining, by the first encoder, initial user feature vectors corresponding to the features respectively includes:
taking any one of the features as a target item feature, and converting the feature name of the target item feature into a first embedded vector through the first embedding matrix;
converting the feature value corresponding to the target item feature into a second embedded vector through the second embedding matrix;
and inputting the first embedded vectors and the second embedded vectors respectively corresponding to the features into the first coding model, and encoding them through the first coding model based on an attention mechanism to obtain the initial user feature vectors respectively corresponding to the features.
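Purely as an illustration of claim 4, a minimal PyTorch sketch of such a first encoder is given below; the vocabulary sizes, dimensions, the additive combination of the two embeddings and the use of multi-head attention are assumptions of this example rather than requirements of the claim.

```python
# Illustrative first encoder: two embedding matrices plus an attention-based
# coding model; all hyperparameters and the attention variant are assumptions.
import torch
import torch.nn as nn

class FirstEncoder(nn.Module):
    def __init__(self, name_vocab, value_vocab, dim, n_heads=4):
        super().__init__()
        self.name_emb = nn.Embedding(name_vocab, dim)    # first embedding matrix (feature names)
        self.value_emb = nn.Embedding(value_vocab, dim)  # second embedding matrix (feature values)
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)

    def forward(self, name_ids, value_ids):
        # name_ids, value_ids: (batch, n_features)
        x = self.name_emb(name_ids) + self.value_emb(value_ids)  # combine the two embeddings
        out, _ = self.attn(x, x, x)                               # attention over all features
        return out                                                # one initial vector per feature
```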
5. The method of claim 1, wherein the determining attention coefficients of the current iteration corresponding to the features respectively comprises:
and determining the attention coefficients of the current iteration respectively corresponding to the features according to the initial user feature vectors and the comprehensive characterization vector obtained in the previous iteration.
6. The method of claim 1, wherein the retrieving of a sentence from the artificial knowledge base according to the comprehensive characterization vector comprises:
passing the comprehensive characterization vector through a fully connected layer to obtain an output vector whose dimension equals the number of sentences contained in the artificial knowledge base;
normalizing the output vector to obtain a normalized vector;
and selecting the dimension corresponding to the maximum value in the normalized vector, and taking the sentence corresponding to that dimension as the sentence retrieved from the artificial knowledge base in the current iteration.
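By way of illustration of claims 5 and 6, one retrieval iteration could be sketched as follows; the scoring function, the shapes and the module names are assumptions made for this example only.

```python
# Hedged sketch of one retrieval iteration: attention over the feature vectors
# conditioned on the previous comprehensive vector, then a fully connected layer
# over the knowledge-base sentences, softmax normalization and argmax selection.
import torch
import torch.nn as nn

class RetrievalStep(nn.Module):
    def __init__(self, feat_dim, n_sentences):
        super().__init__()
        self.score = nn.Linear(2 * feat_dim, 1)               # attention scorer (assumed form)
        self.to_sentences = nn.Linear(feat_dim, n_sentences)  # fully connected layer

    def forward(self, feature_vectors, prev_comprehensive):
        # feature_vectors: (n_features, feat_dim); prev_comprehensive: (feat_dim,)
        n = feature_vectors.size(0)
        pairs = torch.cat([feature_vectors,
                           prev_comprehensive.unsqueeze(0).expand(n, -1)], dim=-1)
        coeffs = torch.softmax(self.score(pairs).squeeze(-1), dim=0)      # attention coefficients
        comprehensive = coeffs @ feature_vectors                          # weighted summation
        probs = torch.softmax(self.to_sentences(comprehensive), dim=-1)   # normalized vector
        idx = int(torch.argmax(probs))                                    # index of retrieved sentence
        return comprehensive, idx
```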
7. The method of claim 1, wherein the second encoder comprises a second coding model and a self-attention layer; the encoding, by the second encoder, of the K sentences based on an attention mechanism comprises:
inputting the word embedding vector of each word included in the K sentences into a second coding model, and determining the word coding vector of each word through the second coding model;
inputting the word coding vector of each word into a self-attention layer, determining an attention coefficient corresponding to each word through the self-attention layer, and performing weighted summation on the word coding vector of each word according to the attention coefficient corresponding to each word to obtain semantic representation vectors corresponding to the K sentences.
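As an illustrative sketch of claim 7, the second encoder could, for example, be realized as below; the use of a GRU as the second coding model and the single-score self-attention pooling are assumptions of this example, not requirements of the claim.

```python
# Hedged sketch of the second encoder: a sequence encoder for the words of the K
# retrieved sentences followed by a self-attention pooling layer.
import torch
import torch.nn as nn

class SecondEncoder(nn.Module):
    def __init__(self, emb_dim, hidden_dim):
        super().__init__()
        self.coding_model = nn.GRU(emb_dim, hidden_dim, batch_first=True)  # second coding model (assumed GRU)
        self.attn = nn.Linear(hidden_dim, 1)                               # self-attention scorer

    def forward(self, word_embeddings):
        # word_embeddings: (1, total_words, emb_dim) for the concatenated K sentences
        encodings, _ = self.coding_model(word_embeddings)     # word coding vectors
        coeffs = torch.softmax(self.attn(encodings), dim=1)   # attention coefficient per word
        semantic = (coeffs * encodings).sum(dim=1)            # weighted sum -> semantic representation
        return semantic
```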
8. The method of claim 1, wherein the generation model comprises a first sublayer, a second sublayer and an intermediate layer, wherein the first sublayer and the second sublayer are time-sequence-based neural network layers;
the generating of the user description text of the target user through the generation model comprises:
the first sublayer takes the word generated at the previous moment and the hidden state of the second sublayer at the previous moment as its input at the current moment, and generates the hidden state of the first sublayer at the current moment; wherein the semantic representation vector serves as the hidden state of the second sublayer at the initial moment;
the intermediate layer determines each weight coefficient corresponding to each feature according to the hidden state of the first sublayer at the current moment and each initial user feature vector, and performs weighted summation on each initial user feature vector according to each weight coefficient to obtain an intermediate characterization vector;
the second sublayer takes the intermediate characterization vector and the hidden state of the first sublayer at the current moment as its input at the current moment, and generates the hidden state of the second sublayer at the current moment; the hidden state of the second sublayer at the current moment is used to determine the word generated at the current moment.
9. The method of claim 1, wherein the method further comprises:
adjusting parameters of at least one of the first encoder, the retrieval model, the second encoder and the generation model by using a preset total loss function; the total loss function is determined by a first loss function and a second loss function, wherein the function value of the first loss function depends on the generation probability of each word in the artificial description text of the target user in the generation model, and the function value of the second loss function depends on whether each word in the artificial description text of the target user exists in a preset label text or not and the generation probability of each word in the label text in the generation model.
10. The method of claim 9, wherein the generation model is a time-sequence-based model that sequentially generates the words of the user description text at a plurality of moments; the generation probability of each word in the generation model comprises the generation probability of that word obtained by the generation model at each moment.
11. An apparatus for generating user description text based on user characteristics, the apparatus comprising:
the first encoding unit is used for inputting the feature names of various features of a target user and the feature values corresponding to the feature names into a first encoder, and obtaining, through the first encoder, the initial user feature vectors respectively corresponding to the features;
the retrieval unit is used for inputting each initial user feature vector obtained by the first encoding unit into a retrieval model, and performing K iterations through the retrieval model to obtain K sentences; each iteration comprises determining attention coefficients of the current iteration respectively corresponding to the features, performing a weighted summation of the initial user feature vectors according to the attention coefficients to obtain a comprehensive characterization vector, and retrieving a sentence from an artificial knowledge base according to the comprehensive characterization vector;
the second encoding unit is used for inputting the K sentences obtained by the retrieval unit into a second encoder, and encoding the K sentences through the second encoder based on an attention mechanism to obtain semantic representation vectors corresponding to the K sentences;
and the generating unit is used for inputting each initial user feature vector obtained by the first encoding unit and the semantic representation vector obtained by the second encoding unit into a generating model, and generating the user description text of the target user through the generating model.
12. The apparatus of claim 11, wherein the types of features comprise:
numeric or text.
13. The apparatus of claim 12, wherein the apparatus further comprises:
the word segmentation unit is used for performing word segmentation on the original feature value of a feature whose type is text to obtain a plurality of word segmentation results, before the first encoding unit obtains, through the first encoder, the initial user feature vectors respectively corresponding to the features;
the first encoding unit is specifically configured to input, into the first encoder, the feature name of a feature of the target user whose type is text together with the plurality of word segmentation results corresponding to that feature name.
14. The apparatus of claim 11, wherein the first encoder comprises a first embedding matrix, a second embedding matrix, and a first coding model; the first encoding unit includes:
the first embedding subunit is used for taking any one of the features as a target item feature and converting the feature name of the target item feature into a first embedded vector through the first embedding matrix;
the second embedding subunit is used for converting the feature value corresponding to the target item feature into a second embedded vector through the second embedding matrix;
and the coding subunit is configured to input the first embedded vectors obtained by the first embedding subunit and the second embedded vectors obtained by the second embedding subunit, which respectively correspond to the features, into the first coding model, and to encode them through the first coding model based on an attention mechanism so as to obtain the initial user feature vectors respectively corresponding to the features.
15. The apparatus according to claim 11, wherein the retrieving unit is specifically configured to determine, according to the initial user feature vectors and the comprehensive characterization vector obtained in the previous iteration, attention coefficients corresponding to the features of the current iteration, respectively.
16. The apparatus of claim 11, wherein the retrieving unit comprises:
the full-connection subunit is used for passing the comprehensive characterization vector through a fully connected layer to obtain an output vector whose dimension equals the number of sentences contained in the artificial knowledge base;
the normalization subunit is used for normalizing the output vector obtained by the full-connection subunit to obtain a normalized vector;
and the determining subunit is used for selecting the dimension corresponding to the maximum value in the normalized vector obtained by the normalization subunit, and taking the sentence corresponding to that dimension as the sentence retrieved from the artificial knowledge base in the current iteration.
17. The apparatus of claim 11, wherein the second encoder comprises a second coding model and a self-attention layer; the second encoding unit includes:
the coding subunit is used for inputting the word embedding vector of each word included in the K sentences into a second coding model, and determining the word coding vector of each word through the second coding model;
and the self-attention subunit is used for inputting the word coding vector of each word obtained by the coding subunit into a self-attention layer, determining the attention coefficient corresponding to each word through the self-attention layer, and performing weighted summation on the word coding vector of each word according to the attention coefficient corresponding to each word to obtain the semantic representation vector corresponding to the K sentences.
18. The apparatus of claim 11, wherein the generation model comprises a first sublayer, a second sublayer and an intermediate layer, wherein the first sublayer and the second sublayer are time-sequence-based neural network layers;
the generation unit includes:
the first processing subunit is configured to take, through the first sublayer, the word generated at the previous time and the hidden state of the second sublayer at the previous time as the current-time input of the first sublayer, and to generate the hidden state of the first sublayer at the current time; wherein the semantic representation vector serves as the hidden state of the second sublayer at the initial time;
the intermediate processing subunit is configured to determine, through the intermediate layer, each weight coefficient corresponding to each feature according to the hidden state of the first sublayer at the current time and each initial user feature vector generated by the first processing subunit, and perform weighted summation on each initial user feature vector according to each weight coefficient to obtain an intermediate characterization vector;
the second processing subunit is configured to take, through the second sublayer, the intermediate characterization vector obtained by the intermediate processing subunit and the hidden state of the first sublayer at the current time generated by the first processing subunit as the current-time input of the second sublayer, and to generate the hidden state of the second sublayer at the current time; the hidden state of the second sublayer at the current time is used to determine the word generated at the current time.
19. The apparatus of claim 11, wherein the apparatus further comprises:
a parameter adjusting unit, configured to adjust parameters of at least one of the first encoder, the retrieval model, the second encoder and the generation model by using a preset total loss function; the total loss function is determined by a first loss function and a second loss function, where the value of the first loss function depends on the generation probability, in the generation model, of each word in the artificial description text of the target user, and the value of the second loss function depends on whether each word of the artificial description text exists in a preset label text and on the generation probability, in the generation model, of each word in the label text.
20. The apparatus of claim 19, wherein the generation model is a time-sequence-based model that sequentially generates the words of the user description text at a plurality of moments; the generation probability of each word in the generation model comprises the generation probability of that word obtained by the generation model at each moment.
21. A computer-readable storage medium, on which a computer program is stored which, when executed in a computer, causes the computer to carry out the method of any one of claims 1-10.
22. A computing device comprising a memory having stored therein executable code and a processor that, when executing the executable code, implements the method of any of claims 1-10.
CN202110189542.8A 2021-02-19 2021-02-19 Method and device for generating user description text based on user characteristics Pending CN112836520A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110189542.8A CN112836520A (en) 2021-02-19 2021-02-19 Method and device for generating user description text based on user characteristics

Publications (1)

Publication Number Publication Date
CN112836520A true CN112836520A (en) 2021-05-25

Family

ID=75933865

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110189542.8A Pending CN112836520A (en) 2021-02-19 2021-02-19 Method and device for generating user description text based on user characteristics

Country Status (1)

Country Link
CN (1) CN112836520A (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106383815A (en) * 2016-09-20 2017-02-08 清华大学 Neural network sentiment analysis method in combination with user and product information
CN109472031A (en) * 2018-11-09 2019-03-15 电子科技大学 A kind of aspect rank sentiment classification model and method based on double memory attentions
US20190341025A1 (en) * 2018-04-18 2019-11-07 Sony Interactive Entertainment Inc. Integrated understanding of user characteristics by multimodal processing
WO2020107878A1 (en) * 2018-11-30 2020-06-04 平安科技(深圳)有限公司 Method and apparatus for generating text summary, computer device and storage medium
CN112131469A (en) * 2020-09-22 2020-12-25 安徽农业大学 Deep learning recommendation method based on comment text
CN112214652A (en) * 2020-10-19 2021-01-12 支付宝(杭州)信息技术有限公司 Message generation method, device and equipment
WO2021012645A1 (en) * 2019-07-22 2021-01-28 创新先进技术有限公司 Method and device for generating pushing information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Zhang Lanxia; Hu Wenxin: "Research on Person Relation Extraction from Chinese Text Based on Bidirectional GRU Neural Network and Two-Layer Attention Mechanism", Computer Applications and Software, no. 11 *
Zhao Xiaohu; Yin Liangfei; Zhao Chenglong: "Image Semantic Description Algorithm Based on Global-Local Features and Adaptive Attention Mechanism", Journal of Zhejiang University (Engineering Science), no. 01 *

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination