CN111428448B - Text generation method, device, computer equipment and readable storage medium - Google Patents

Text generation method, device, computer equipment and readable storage medium

Info

Publication number
CN111428448B
Authority
CN
China
Prior art keywords
text
sample
data
generator
target
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010136551.6A
Other languages
Chinese (zh)
Other versions
CN111428448A (en)
Inventor
邓悦
金戈
徐亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ping An Technology Shenzhen Co Ltd
Original Assignee
Ping An Technology Shenzhen Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ping An Technology Shenzhen Co Ltd filed Critical Ping An Technology Shenzhen Co Ltd
Priority to CN202010136551.6A priority Critical patent/CN111428448B/en
Publication of CN111428448A publication Critical patent/CN111428448A/en
Priority to PCT/CN2020/118456 priority patent/WO2021174827A1/en
Application granted granted Critical
Publication of CN111428448B publication Critical patent/CN111428448B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/04 Architecture, e.g. interconnection topology
    • G06N3/045 Combinations of networks
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06N COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 Computing arrangements based on biological models
    • G06N3/02 Neural networks
    • G06N3/08 Learning methods
    • Y GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02 TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00 Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a text generation method, device, computer equipment and readable storage medium, belonging to the field of text processing. The method, device, computer equipment and readable storage medium generate target text data from target guidance data through a pre-trained text-generation adversarial network model, which solves the problem that discrete outputs cannot be used to update the network and achieves the purpose that the adversarial network model can generate text sentences from sentence-start data.

Description

Text generation method, device, computer equipment and readable storage medium
Technical Field
The present invention relates to the field of text processing, and in particular, to a text generation method, apparatus, computer device, and readable storage medium.
Background
In an intelligent interview scenario, an artificial intelligence (AI) system needs to question a candidate according to preset questions, and also needs to pose open questions based on the actual situation in order to test the candidate's on-the-spot response ability. Open questions require the AI to generate the question text with a generative model.
The current mainstream generative model is the generative adversarial network (GAN). Because a GAN must update its parameters based on continuous output data, it is mainly applied to image processing; image generation tasks include unsupervised generation, labeled generation, super-resolution restoration, automatic colorization, street-view generation and so on, and the generated pictures are lifelike enough that the human eye can hardly tell real from fake.
When a GAN is applied to a text generation task, the generator must, at each step of the generation process, output a probability distribution over the next word in the vocabulary based on the text sequence generated so far and then select a word. The output is therefore discrete data, and discrete data cannot be used to train and update the network. Hence current GANs cannot be applied to text generation tasks.
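To make the differentiability problem concrete, the following is a minimal sketch showing that the sampled word index carries no gradient. PyTorch is an illustrative choice (the patent names no framework), and all sizes and names are assumptions.

```python
import torch

vocab_size, hidden = 1000, 64                    # illustrative sizes
to_logits = torch.nn.Linear(hidden, vocab_size)

h = torch.randn(1, hidden, requires_grad=True)   # generator hidden state
probs = torch.softmax(to_logits(h), dim=-1)      # differentiable w.r.t. h

word_id = torch.multinomial(probs, num_samples=1)  # discrete word choice
print(word_id.requires_grad)   # False: backpropagation stops here, so the
                               # discriminator's error cannot reach the
                               # generator weights through the chosen word.
```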
Disclosure of Invention
Aimed at the problem that existing generative adversarial networks support only continuous outputs, a text generation method, apparatus, computer device and readable storage medium are provided, based on a text-generation adversarial network that can be updated from discrete data.
In order to achieve the above object, the present invention provides a text generation method, including the steps of:
collecting answer data generated by a business object in a question-answer scenario;
extracting from the answer data to obtain target guidance data;
generating target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
wherein the target guidance data is the sentence-start data of the target text data.
In one embodiment, before the step of generating target text data from the target guidance data through the pre-trained text-generation adversarial network model, the method includes:
obtaining a sample guide set and a sample text set, wherein the sample guide set includes at least one sample guidance data, the sample text set includes at least one sample text data, and the sample guidance data is the sentence-start data of the sample text data;
training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model.
In one embodiment, the initial adversarial network model includes a generator and a discriminator, and the step of training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model includes:
generating at least one sample text data from at least one sample guidance data in the sample guide set through the generator;
simulating the at least one sample text data using Monte Carlo simulation to obtain a plurality of sample simulated text data;
identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and updating the parameter values of the generator according to the identification result;
updating the discriminator according to a loss function based on the updated generator;
and cyclically updating the generator and the discriminator until the initial adversarial network model meets a preset convergence condition, to obtain a text-generation adversarial network model formed by the updated generator.
In one embodiment, the step of generating at least one sample text data from at least one sample guidance data in the sample guide set through the generator includes:
computing on the sample guidance data through the generator, obtaining the first sample word with the highest probability in the vocabulary, and appending the first sample word to the end of the sample guidance data;
and computing on the first sample word through the generator, obtaining the second sample word with the highest probability in the vocabulary, appending the second sample word to the end of the first sample word, and executing these steps cyclically until sample text data of a preset length is obtained.
In one embodiment, the step of simulating the at least one sample text data using Monte Carlo simulation to obtain a plurality of sample simulated text data includes:
simulating the words in each sample text data one by one using Monte Carlo simulation, to generate a plurality of sample simulated text data corresponding to the sample text data.
In one embodiment, the step of identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and updating the parameter values of the generator according to the identification result, includes:
identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and obtaining a state-value function according to the identification result;
and calculating an objective function according to the state-value function, and updating the parameter values of the generator according to the objective function.
In one embodiment, the step of generating target text data from the target guidance data through the pre-trained text-generation adversarial network model includes:
computing on the target guidance data with the generator of the text-generation adversarial network model, obtaining the first sample word with the highest probability in the vocabulary, and appending the first sample word to the end of the target guidance data;
and computing on the first sample word with the generator, obtaining the second sample word with the highest probability in the vocabulary, appending the second sample word to the end of the first sample word, and executing these steps cyclically until target text data of a preset length is obtained.
In order to achieve the above object, the present invention further provides a text generation apparatus, including:
a collection unit, configured to collect answer data generated by a business object in a question-answer scenario;
an acquisition unit, configured to extract from the answer data to obtain target guidance data;
a generation unit, configured to generate target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
wherein the target guidance data is the sentence-start data of the target text data.
To achieve the above object, the present invention also provides a computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the above method when executing the computer program.
To achieve the above object, the present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, implements the steps of the above method.
The beneficial effects of the technical scheme are as follows:
According to the technical scheme, the text generation method, apparatus, computer device and readable storage medium generate target text data from target guidance data (such as sentence-start data) through a pre-trained text-generation adversarial network model, solving the problem that discrete outputs cannot be used to update the network and achieving the purpose that the text-generation adversarial network model can generate text sentences (such as text questions) from sentence-start data.
Drawings
FIG. 1 is a method flow diagram of one embodiment of a text generation method of the present invention;
FIG. 2 is a method flow chart of one embodiment of obtaining a text-generation adversarial network model;
FIG. 3 is a method flow chart of one embodiment of training an initial adversarial network model based on a sample guide set and a sample text set to obtain a text-generation adversarial network model;
FIG. 4 is a block diagram of one embodiment of a text generating device according to the present invention;
fig. 5 is a schematic diagram of a hardware architecture of an embodiment of a computer device according to the present invention.
Detailed Description
The present invention will be described in further detail with reference to the drawings and examples, in order to make the objects, technical solutions and advantages of the present invention more apparent. It should be understood that the specific embodiments described herein are for purposes of illustration only and are not intended to limit the scope of the invention. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
The text generation method, apparatus, computer device and readable storage medium of the present invention are suitable for business fields such as insurance and finance, providing loan, insurance and financial systems with a text generation method capable of automatically generating open text questions, so as to facilitate testing a candidate's thinking ability. According to the invention, target text data is generated from target guidance data (such as sentence-start data) through a pre-trained text-generation adversarial network model, which solves the problem that discrete outputs cannot be used to update the network and achieves the purpose that the text-generation adversarial network model can generate text sentences (such as text questions) from sentence-start data.
Embodiment One
Referring to fig. 1, a text generating method of the present embodiment includes the following steps:
S1, collecting answer data generated by a business object in a question-answer scenario;
In this step, the business object may be a consulting user who consults about the business, a buyer on an online trading platform, or an interviewee in an interview process. The answer data may be collected by a collection device, such as an audio receiving device, a microphone, or a mobile terminal with a recording function.
The text generation method in this embodiment is mainly applied to dialogue scenarios (with at least two users): a question text is generated based on the answer information of a target object so that the target object can answer it. For example, when the text generation method is applied to an interview scenario, an open text question is generated from the keywords provided by the interviewee.
S2, extracting from the answer data to obtain target guidance data;
In step S2, semantic analysis may be performed on the answer data to extract its keywords, and the keywords are used as the target guidance data; alternatively, the answer data may be parsed to extract its nouns, and the nouns are used as the target guidance data.
It should be noted that the target guidance data may be keywords, or the words at the beginning of a sentence.
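As a concrete illustration of step S2, the following sketch extracts nouns from Chinese answer text. The jieba part-of-speech tagger is an assumed tool choice, not one named by the patent, and the example sentence is hypothetical.

```python
import jieba.posseg as pseg  # assumed library; the patent names no tool

def extract_guidance(answer_text):
    """Return the nouns in the answer to serve as target guidance data."""
    # jieba's 'n*' part-of-speech flags mark noun categories
    return [w.word for w in pseg.cut(answer_text) if w.flag.startswith('n')]

# Illustrative usage: nouns from the candidate's answer become guidance data.
print(extract_guidance("我在上一家公司负责风控模型的开发"))
```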
S3, generating target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
It should be noted that the target guidance data is the sentence-start data of the target text data. For example, if the target guidance data is "today", the target text data may be "how is the weather today?". The target guidance data may also be two or three words; no limitation is imposed here.
Referring to fig. 2, before step S3 is performed, the step of obtaining the text-generation adversarial network model may include:
S31, obtaining a sample guide set and a sample text set, wherein the sample guide set includes at least one sample guidance data, the sample text set includes at least one sample text data, and the sample guidance data is the sentence-start data of the sample text data;
In this embodiment, the sample guide set is a sequence composed of sample guidance data (sentence-start data); the sample text set is a sequence of real text data composed of sample text data (complete sentences). The sample guidance data is the sentence-start data of the real text data.
S32, training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model.
At present, when a generative adversarial network processes images, the pixel value at each point of the generated image is a continuous value, so the computation graph of the whole network, from the generator's weights to the generator's output, and on through the discriminator's weights to the output classification, is differentiable; errors can be back-propagated normally, and gradients and weights can be updated normally. In the text generation process, however, the generator actually outputs a sequence: in each round it outputs a probability distribution over the next word in the vocabulary based on the text sequence generated so far, and then selects a word. This selection process is non-differentiable: the generator outputs discrete tokens, and during training the error back-propagates only up to this point; gradient updates cannot be applied to each token the way they are applied to pixel values in image generation tasks, so the generator's weights cannot be updated. On the other hand, the discriminator can directly take a complete text sequence as input and output whether the sentence is real or fake, but it cannot judge a half-finished sentence that the generator is still producing, so the discriminator cannot supervise the generator's training on each word of the generated text sequence.
Therefore, in the training process of the adversarial network model in this embodiment, in order to solve the non-differentiability caused by the generator's discrete output, the generation of a text sequence is regarded as a sequential decision process and a policy gradient method from reinforcement learning is adopted: the discriminator's judgment result is taken as the reward, the partial text generated so far is taken as the state, the generator is taken as the agent, predicting the next word is the action, and the generator itself is the policy to be updated, thereby solving the non-differentiability of the loss function under discrete outputs. For judging unfinished sequences, this embodiment adopts Monte Carlo search: starting from the sequence generated so far, the generator continues generating until the sequence is complete, the discriminator judges the completed sequence, the simulation is repeated several times, and the mean of the final rewards is used as the reward estimate for the current unfinished sequence.
It should be noted that the initial adversarial network model includes a generator and a discriminator. Referring to fig. 3, in step S32, training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model includes the following steps.
By way of example and not limitation, the generator may adopt a sequence-output long short-term memory network (LSTM) for generating a text sequence from a given initial state; the discriminator may adopt a binary-classification long short-term memory network for receiving the generator's output text and the real text and judging whether the output text is real or fake.
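A minimal sketch of these two networks follows. PyTorch and all layer sizes are illustrative assumptions; the patent specifies only that both networks are LSTMs.

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    """LSTM that outputs next-word logits over the vocabulary."""
    def __init__(self, vocab_size, emb_dim=64, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, vocab_size)

    def forward(self, tokens, state=None):
        h, state = self.lstm(self.embed(tokens), state)
        return self.out(h), state          # logits for each position

class Discriminator(nn.Module):
    """Binary-classification LSTM: real (1) vs. generated (0) text."""
    def __init__(self, vocab_size, emb_dim=64, hidden=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim)
        self.lstm = nn.LSTM(emb_dim, hidden, batch_first=True)
        self.out = nn.Linear(hidden, 1)

    def forward(self, tokens):
        h, _ = self.lstm(self.embed(tokens))
        return torch.sigmoid(self.out(h[:, -1]))  # prob. sequence is real
```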
S321, generating at least one sample text data from at least one sample guidance data in the sample guide set through the generator;
Further, step S321 may include:
computing on the sample guidance data through the generator, obtaining the first sample word with the highest probability in the vocabulary, and appending the first sample word to the end of the sample guidance data;
and computing on the first sample word through the generator, obtaining the second sample word with the highest probability in the vocabulary, appending the second sample word to the end of the first sample word, and so on, until sample text data of a preset length is obtained.
In this step, the generator G_θ and the discriminator D_φ are initialized. The sample text set is a real text set S = {X_{1~T}}; the sentence length of each real text in the set is T, and sentences shorter than T are zero-padded at the tail. The sample guide set is the sentence-start word set {Y_1}.
The word set {Y_1} is input to the generator G_θ. The input layer of G_θ maps each input word to the token corresponding to that word in the vocabulary and represents the token as an embedding. In practical application the input is (y_1, y_2, …, y_{t-1}); given this input, the generator G_θ outputs the probability of each vocabulary word being the next word, takes the word with the highest probability as y_t, and so on, cycling until the end-of-sentence word y_T, so as to obtain a generated sample text set {Y_{1~T}} of length T (shorter sequences zero-padded).
Here (y_1, y_2, …, y_{t-1}) denotes an incomplete sentence composed of t-1 words: y_1 denotes the 1st word in a sentence, y_2 the 2nd word, y_{t-1} the (t-1)-th word, and y_T the T-th word (the end of the sentence).
In this step only the generator G_θ is used: a word y_1 is input, G_θ embeds it and passes it to the LSTM, and the generated token sequence together with the corresponding vocabulary words is output, yielding the generated text sequence (y_1, y_2, …, y_T).
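A sketch of this greedy decoding loop, reusing the Generator class assumed above (function and variable names are illustrative):

```python
import torch

@torch.no_grad()
def generate_greedy(generator, y1, T):
    """From sentence-start word id y1, append the highest-probability
    next word until the sequence reaches the preset length T."""
    seq, state = [y1], None
    token = torch.tensor([[y1]])
    for _ in range(T - 1):
        logits, state = generator(token, state)
        token = logits[:, -1].argmax(dim=-1, keepdim=True)  # pick y_t
        seq.append(token.item())
    return seq  # the generated sample text (y_1, ..., y_T)
```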
S322, simulating the at least one sample text data using Monte Carlo simulation to obtain a plurality of sample simulated text data;
Further, step S322 includes:
simulating the words in each sample text data one by one using Monte Carlo simulation, to generate a plurality of sample simulated text data corresponding to the sample text data.
In this implementation, for each sequence in the sample text set {Y_{1~T}}, taking the sequence (y_1, y_2, …, y_T) as an example, each word y_t in the sequence is traversed and N Monte Carlo simulations are performed. Unlike before, where the word with the highest probability was selected as y_t, here each step samples from the generator G_θ's multinomial distribution over output words, repeating until the end-of-sentence word y_T is reached, so as to obtain N different complete sample simulated text sets {Y_{1~T}^1, Y_{1~T}^2, …, Y_{1~T}^N}.
The number of simulations for words at different positions in a sentence may be the same or different.
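A sketch of the N-rollout simulation, again assuming the Generator above (names are illustrative):

```python
import torch

@torch.no_grad()
def rollout(generator, prefix, T, N):
    """Hold the prefix fixed and sample the remaining words to length T,
    N times, returning N complete simulated sentences."""
    completions = []
    for _ in range(N):
        seq, state = list(prefix), None
        token = torch.tensor([seq])              # feed the fixed prefix first
        while len(seq) < T:
            logits, state = generator(token, state)
            probs = torch.softmax(logits[:, -1], dim=-1)
            token = torch.multinomial(probs, num_samples=1)  # sample, not argmax
            seq.append(token.item())
        completions.append(seq)
    return completions
```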
S323, identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and updating the parameter values of the generator according to the identification result;
Further, step S323 may include:
identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and obtaining a state-value function according to the identification result;
and calculating an objective function according to the state-value function, and updating the parameter values of the generator according to the objective function.
In an embodiment, the obtained sample simulated text set {Y_{1~T}^1, Y_{1~T}^2, …, Y_{1~T}^N} is input into the discriminator D_φ for binary classification, and each sample simulated text is compared with the corresponding real text: if they are consistent, the sample simulated text generated by the generator is real (label 1); if not, it is fake (label 0). For a complete sentence, the output of the discriminator D_φ is taken directly as the state value; for an incomplete sentence, the discrimination results of the N complete sentences obtained by Monte Carlo simulation are averaged. In summary, the state-value function can be expressed as:

Q(Y_{1~t-1}, y_t) = (1/N) Σ_{i=1}^N D_φ(Y_{1~T}^i),  for t < T
Q(Y_{1~t-1}, y_t) = D_φ(Y_{1~T}),                    for t = T

where i denotes the index of a Monte Carlo simulation.
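A direct transcription of this state-value function into code, using the rollout() and Discriminator sketches above:

```python
import torch

@torch.no_grad()
def state_value(discriminator, generator, prefix, T, N):
    """Q for the prefix: D's score directly if the sentence is complete,
    otherwise the mean score of N Monte Carlo completions."""
    if len(prefix) == T:
        return discriminator(torch.tensor([prefix])).item()
    sims = rollout(generator, prefix, T, N)
    return sum(discriminator(torch.tensor([s])).item() for s in sims) / N
```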
The parameter θ of the generator G_θ is updated according to the state-value function. The generator's objective is to generate samples realistic enough to fool the discriminator, i.e. to maximize the reward it obtains under the policy G_θ:

J(θ) = Σ_{t=1}^T Σ_{y_t} G_θ(y_t | Y_{1~t-1}) · Q(Y_{1~t-1}, y_t)

where G_θ(y_t | Y_{1~t-1}) denotes the policy output, which can essentially be regarded as a probability: the probability value of y_t over the vocabulary; Y_{1~t-1} is the sequence of all words that have appeared before y_t. The parameter θ is the weight parameter of the generator G_θ, and it is updated on J(θ); in other words, the policy gradient comes from J(θ):

θ ← θ + α_θ ∇_θ J(θ)

where α_θ is the learning rate.
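A REINFORCE-style sketch of this generator update; translating the expectation into a sampled, per-step loss is an assumption consistent with the policy-gradient formulation above:

```python
import torch

def update_generator(generator, optimizer, seq, q_values):
    """seq: generated word ids y_1..y_T; q_values: Q(Y_{1~t-1}, y_t) for
    each predicted word y_2..y_T (length T-1)."""
    inp = torch.tensor([seq[:-1]])                  # y_1 .. y_{T-1}
    tgt = torch.tensor(seq[1:])                     # y_2 .. y_T
    logits, _ = generator(inp)
    logp = torch.log_softmax(logits[0], dim=-1)
    chosen = logp[torch.arange(len(tgt)), tgt]      # log G_θ(y_t | Y_{1~t-1})
    loss = -(chosen * torch.tensor(q_values)).sum() # minimizing -J(θ)
    optimizer.zero_grad(); loss.backward(); optimizer.step()
```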
S324, updating the discriminator according to a loss function based on the updated generator;
In this step, the updated generator G_θ is used to generate a set of text sequences {Y_{1~T}}; at the same time, the same number of text sequences {X_{1~T}} are selected from the real text set S = {X_{1~T}} and input into the discriminator D_φ for binary classification, where the loss function is the binary logarithmic loss:

J(φ) = -E_{Y∈S}[log D_φ(Y)] - E_{Y~G_θ}[log(1 - D_φ(Y))]

The parameter φ of D_φ is updated on J(φ):

φ ← φ - α_φ ∇_φ J(φ)

where α_φ is the learning rate.
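A sketch of this discriminator step with PyTorch's built-in binary cross-entropy (the batch construction is assumed):

```python
import torch
import torch.nn.functional as F

def update_discriminator(discriminator, optimizer, real_batch, fake_batch):
    """real_batch / fake_batch: equal-sized LongTensors of word ids."""
    real_p = discriminator(real_batch)   # should be pushed toward 1
    fake_p = discriminator(fake_batch)   # should be pushed toward 0
    loss = F.binary_cross_entropy(real_p, torch.ones_like(real_p)) \
         + F.binary_cross_entropy(fake_p, torch.zeros_like(fake_p))
    optimizer.zero_grad(); loss.backward(); optimizer.step()
```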
S325, cyclically updating the generator and the discriminator until the initial adversarial network model meets the preset convergence condition, to obtain a text-generation adversarial network model formed by the updated generator.
In this step, in each training round the generator is trained n_G times and the discriminator is trained n_D times, repeatedly, until the model meets the preset convergence condition, for example n_D > n_G, so as to ensure that the discriminator can correctly guide the generator's updates.
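Putting the pieces together, an alternating training loop might look as follows. This is a sketch built from the functions assumed above, with the stopping rule simplified to a fixed number of rounds:

```python
import torch

def train(generator, discriminator, g_opt, d_opt,
          rounds, n_g, n_d, start_word, T, N, sample_real_batch):
    for _ in range(rounds):                 # stand-in for the convergence test
        for _ in range(n_g):                # generator phase
            seq = generate_greedy(generator, start_word, T)
            q = [state_value(discriminator, generator, seq[:t + 1], T, N)
                 for t in range(1, T)]      # one Q per predicted word
            update_generator(generator, g_opt, seq, q)
        for _ in range(n_d):                # discriminator phase (n_d > n_g)
            fake = torch.tensor([generate_greedy(generator, start_word, T)])
            update_discriminator(discriminator, d_opt,
                                 sample_real_batch(), fake)
```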
In step S3, the step of generating target text data from the target guidance data through the pre-trained text-generation adversarial network model includes:
computing on the target guidance data with the generator of the text-generation adversarial network model, obtaining the first sample word with the highest probability in the vocabulary, and appending the first sample word to the end of the target guidance data;
and computing on the first sample word with the generator, obtaining the second sample word with the highest probability in the vocabulary, appending the second sample word to the end of the first sample word, and so on, until target text data of a preset length is obtained. In this way, target text data for questioning is generated from the answer data, achieving open questioning and answering based on the business object's answers and making it convenient to test the business object's ability to respond to open questions on the spot.
In this embodiment, the text generation method is based on an adversarial long short-term memory network and policy gradients. Using an LSTM-based discriminator-generator structure, the tasks of generating a text sequence and judging the authenticity of text can be carried out accurately. Through adversarial training, the discriminator dynamically updates its parameters and continuously improves its recognition ability, providing the generator with appropriate guidance, which has more potential than evaluating generated text quality purely against static references. Borrowing the idea of reinforcement learning, the sequence generation process is converted into a sequential decision process, which solves the non-differentiability of the loss function caused by discrete outputs and makes training of the adversarial network possible. Monte Carlo search is used to obtain, by policy simulation, a complete sequence at each step together with its score from the discriminator, and the mean is taken as the reward value of the current time step, solving the problem that the reward of an unfinished sequence cannot be obtained directly. In addition, only the generator part needs to be retained after the training stage; compared with other approaches to differentiable discretization such as Gumbel-softmax, no additional parameters need to be trained and the model occupies less memory.
Embodiment Two
As shown in fig. 4, the present invention further provides a text generation apparatus 1, including a collection unit 11, an acquisition unit 12 and a generation unit 13, wherein:
the collection unit 11 is configured to collect answer data generated by a business object in a question-answer scenario;
The business object may be a consulting user who consults about the business, a buyer on an online trading platform, or an interviewee in an interview process. The answer data may be collected by a collection device, such as an audio receiving device, a microphone, or a mobile terminal with a recording function.
The text generation apparatus 1 in this embodiment is mainly applied to dialogue scenarios (with at least two users): a question text is generated based on the answer information of a target object so that the target object can answer it. For example, when the text generation apparatus 1 is applied to an interview scenario, an open text question is generated from the keywords provided by the interviewee.
the acquisition unit 12 is configured to extract from the answer data to obtain target guidance data;
The acquisition unit 12 performs semantic analysis on the answer data to extract its keywords, which are used as the target guidance data; alternatively, it parses the answer data to extract its nouns, which are used as the target guidance data.
the generation unit 13 is configured to generate target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
wherein the target guidance data is the sentence-start data of the target text data.
Specifically, the generation unit 13 computes on the target guidance data with the generator of the text-generation adversarial network model, obtains the first sample word with the highest probability in the vocabulary, and appends the first sample word to the end of the target guidance data;
the generator then computes on the first sample word, obtains the second sample word with the highest probability in the vocabulary, appends the second sample word to the end of the first sample word, and so on, until target text data of a preset length is obtained.
In this embodiment, the text generation apparatus 1 is based on an adversarial long short-term memory network and policy gradients. Using an LSTM-based discriminator-generator structure, the tasks of generating a text sequence and judging the authenticity of text can be carried out accurately. Through adversarial training, the discriminator dynamically updates its parameters and continuously improves its recognition ability, providing the generator with appropriate guidance, which has more potential than evaluating generated text quality purely against static references. Borrowing the idea of reinforcement learning, the sequence generation process is converted into a sequential decision process, which solves the non-differentiability of the loss function caused by discrete outputs and makes training of the adversarial network possible. Monte Carlo search is used to obtain, by policy simulation, a complete sequence at each step together with its score from the discriminator, and the mean is taken as the reward value of the current time step, solving the problem that the reward of an unfinished sequence cannot be obtained directly. In addition, only the generator part needs to be retained after the training stage; compared with other approaches to differentiable discretization such as Gumbel-softmax, no additional parameters need to be trained and the model occupies less memory.
Embodiment Three
In order to achieve the above object, the present invention further provides a computer device 2. The computer device 2 may comprise a plurality of computer devices 2, and the components of the text generation apparatus 1 of Embodiment Two may be distributed among different computer devices 2. The computer device 2 may be a smartphone, a tablet computer, a notebook computer, a desktop computer, a rack server, a blade server, a tower server or a cabinet server (including a standalone server, or a server cluster composed of multiple servers) that executes a program, or the like. The computer device 2 of this embodiment includes at least, but is not limited to: a memory 21, a processor 23, a network interface 22 and the text generation apparatus 1, which can be communicatively connected to each other through a system bus (refer to fig. 5). It should be noted that fig. 5 only shows the computer device 2 with these components, but it should be understood that not all of the illustrated components are required to be implemented, and that more or fewer components may be implemented instead.
In this embodiment, the memory 21 includes at least one type of computer-readable storage medium, including a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a random access memory (RAM), a static random access memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, etc. In some embodiments, the memory 21 may be an internal storage unit of the computer device 2, such as a hard disk or memory of the computer device 2. In other embodiments, the memory 21 may also be an external storage device of the computer device 2, such as a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card or a flash card provided on the computer device 2. Of course, the memory 21 may also include both an internal storage unit of the computer device 2 and an external storage device. In this embodiment, the memory 21 is typically used to store the operating system and the various types of application software installed on the computer device 2, such as the program code of the text generation method of Embodiment One. Further, the memory 21 may be used to temporarily store various types of data that have been output or are to be output.
The processor 23 may be, in some embodiments, a central processing unit (CPU), a controller, a microcontroller, a microprocessor or another data processing chip. The processor 23 is typically used to control the overall operation of the computer device 2, for example to perform control and processing related to data interaction or communication with the computer device 2. In this embodiment, the processor 23 is configured to run the program code stored in the memory 21 or to process data, for example to run the text generation apparatus 1.
The network interface 22 may comprise a wireless network interface or a wired network interface, which network interface 22 is typically used to establish a communication connection between the computer device 2 and other computer devices 2. For example, the network interface 22 is used to connect the computer device 2 to an external terminal through a network, establish a data transmission channel and a communication connection between the computer device 2 and the external terminal, and the like. The network may be an Intranet (Intranet), the Internet (Internet), a global system for mobile communications (Global System of Mobile communication, GSM), wideband code division multiple access (Wideband Code Division Multiple Access, WCDMA), a 4G network, a 5G network, bluetooth (Bluetooth), wi-Fi, or other wireless or wired network.
It is noted that fig. 5 only shows a computer device 2 having components 21-23, but it is understood that not all of the illustrated components are required to be implemented, and that more or fewer components may alternatively be implemented.
In this embodiment, the text generation apparatus 1 stored in the memory 21 may be further divided into one or more program modules, which are stored in the memory 21 and executed by one or more processors (the processor 23 in this embodiment) to complete the present invention.
Embodiment Four
To achieve the above object, the present invention also provides a computer-readable storage medium, which includes a plurality of storage media such as a flash memory, a hard disk, a multimedia card, a card-type memory (e.g., SD or DX memory), a random access memory (RAM), a static random access memory (SRAM), a read-only memory (ROM), an electrically erasable programmable read-only memory (EEPROM), a programmable read-only memory (PROM), a magnetic memory, a magnetic disk, an optical disk, a server, an App application store, etc., on which a computer program is stored that, when executed by the processor 23, performs the corresponding functions. The computer-readable storage medium of this embodiment is used to store the text generation apparatus 1, and when executed by the processor 23 it implements the text generation method of Embodiment One.
The foregoing embodiment numbers of the present invention are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
From the above description of the embodiments, it will be clear to those skilled in the art that the above-described embodiment method may be implemented by means of software plus a necessary general hardware platform, but of course may also be implemented by means of hardware, but in many cases the former is a preferred embodiment.
The foregoing description is only of the preferred embodiments of the present invention, and is not intended to limit the scope of the invention, but rather is intended to cover any equivalents of the structures or equivalent processes disclosed herein or in the alternative, which may be employed directly or indirectly in other related arts.

Claims (8)

1. A text generation method, characterized by being based on a question-answer scenario, the method comprising the steps of:
collecting answer data generated by a business object in the question-answer scenario;
extracting from the answer data to obtain target guidance data;
generating target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
wherein the target guidance data is the sentence-start data of the target text data;
the step of generating target text data from the target guidance data through the pre-trained text-generation adversarial network model comprises:
computing on the target guidance data with a generator of the text-generation adversarial network model, obtaining a first sample word with the highest probability in a vocabulary, and appending the first sample word to the end of the target guidance data;
computing on the first sample word with the generator, obtaining a second sample word with the highest probability in the vocabulary, and appending the second sample word to the end of the first sample word;
executing the above steps cyclically until target text data of a preset length is obtained;
the generator adopts a sequence-output long short-term memory network (LSTM) for generating a text sequence from a given initial state; the discriminator adopts a binary-classification long short-term memory network; the LSTM-based discriminator-generator structure is used to generate text sequences and judge the authenticity of text;
before the step of generating target text data from the target guidance data through the pre-trained text-generation adversarial network model, the method comprises:
obtaining a sample guide set and a sample text set, wherein the sample guide set comprises at least one sample guidance data, the sample text set comprises at least one sample text data, and the sample guidance data is the sentence-start data of the sample text data;
training an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model;
wherein the generation process of the text sequence is regarded as a sequential decision process, a policy gradient method in reinforcement learning is adopted, the judgment result of the discriminator is taken as the reward, the partial text generated by the generator is taken as the state, the generator is taken as the agent, predicting the next word is taken as the action, and the generator is the policy to be updated.
2. The text generation method of claim 1, wherein the initial adversarial network model comprises a generator and a discriminator, and the step of training the initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model comprises:
generating at least one sample text data from at least one sample guidance data in the sample guide set through the generator;
simulating the at least one sample text data using Monte Carlo simulation to obtain a plurality of sample simulated text data;
identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and updating the parameter values of the generator according to the identification result;
updating the discriminator according to a loss function based on the updated generator;
and cyclically updating the generator and the discriminator until the initial adversarial network model meets a preset convergence condition, to obtain a text-generation adversarial network model formed by the updated generator.
3. The text generation method of claim 2, wherein the step of generating at least one sample text data from at least one sample guidance data in the sample guide set through the generator comprises:
computing on the sample guidance data through the generator, obtaining a first sample word with the highest probability in a vocabulary, and appending the first sample word to the end of the sample guidance data;
computing on the first sample word through the generator, obtaining a second sample word with the highest probability in the vocabulary, and appending the second sample word to the end of the first sample word;
and executing the above steps cyclically until sample text data of a preset length is obtained.
4. The text generation method according to claim 2, wherein the step of simulating the at least one sample text data using Monte Carlo simulation to obtain a plurality of sample simulated text data comprises:
simulating the words in each sample text data one by one using Monte Carlo simulation, to generate a plurality of sample simulated text data corresponding to the sample text data.
5. The text generation method according to claim 2, wherein the step of identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and updating the parameter values of the generator according to the identification result, comprises:
identifying the plurality of sample simulated text data through the discriminator according to target text data in the sample text set, and obtaining a state-value function according to the identification result;
and calculating an objective function according to the state-value function, and updating the parameter values of the generator according to the objective function.
6. A text generation apparatus, characterized by being based on a question-answer scenario, comprising:
a collection unit, configured to collect answer data generated by a business object in the question-answer scenario;
an acquisition unit, configured to extract from the answer data to obtain target guidance data;
a generation unit, configured to generate target text data from the target guidance data through a text-generation adversarial network model obtained by pre-training;
wherein the target guidance data is the sentence-start data of the target text data;
generating target text data from the target guidance data through the pre-trained text-generation adversarial network model comprises:
computing on the target guidance data with a generator of the text-generation adversarial network model, obtaining a first sample word with the highest probability in a vocabulary, and appending the first sample word to the end of the target guidance data;
computing on the first sample word with the generator, obtaining a second sample word with the highest probability in the vocabulary, and appending the second sample word to the end of the first sample word;
executing the above steps cyclically until target text data of a preset length is obtained;
the generator adopts a sequence-output long short-term memory network (LSTM) for generating a text sequence from a given initial state; the discriminator adopts a binary-classification long short-term memory network; the LSTM-based discriminator-generator structure is used to generate text sequences and judge the authenticity of text;
the text generation apparatus is further configured to:
obtain a sample guide set and a sample text set, wherein the sample guide set comprises at least one sample guidance data, the sample text set comprises at least one sample text data, and the sample guidance data is the sentence-start data of the sample text data;
train an initial adversarial network model according to the sample guide set and the sample text set to obtain the text-generation adversarial network model;
wherein the generation process of the text sequence is regarded as a sequential decision process, a policy gradient method in reinforcement learning is adopted, the judgment result of the discriminator is taken as the reward, the partial text generated by the generator is taken as the state, the generator is taken as the agent, predicting the next word is taken as the action, and the generator is the policy to be updated.
7. A computer device comprising a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized by: the processor, when executing the computer program, implements the steps of the method of any one of claims 1 to 5.
8. A computer-readable storage medium having stored thereon a computer program, characterized by: the computer program implementing the steps of the method of any one of claims 1 to 5 when executed by a processor.
CN202010136551.6A 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium Active CN111428448B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202010136551.6A CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium
PCT/CN2020/118456 WO2021174827A1 (en) 2020-03-02 2020-09-28 Text generation method and appartus, computer device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010136551.6A CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium

Publications (2)

Publication Number Publication Date
CN111428448A CN111428448A (en) 2020-07-17
CN111428448B true CN111428448B (en) 2024-05-07

Family

ID=71553527

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010136551.6A Active CN111428448B (en) 2020-03-02 2020-03-02 Text generation method, device, computer equipment and readable storage medium

Country Status (2)

Country Link
CN (1) CN111428448B (en)
WO (1) WO2021174827A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111428448B (en) * 2020-03-02 2024-05-07 平安科技(深圳)有限公司 Text generation method, device, computer equipment and readable storage medium
CN112036544A (en) * 2020-07-31 2020-12-04 五八有限公司 Image generation method and device
CN112861179B (en) * 2021-02-22 2023-04-07 中山大学 Method for desensitizing personal digital spatial data based on text-generated countermeasure network
CN115481630A (en) * 2022-09-27 2022-12-16 深圳先进技术研究院 Electronic insurance letter automatic generation method and device based on sequence countermeasure and prior reasoning
CN116010609B (en) * 2023-03-23 2023-06-09 山东中翰软件有限公司 Material data classifying method and device, electronic equipment and storage medium
CN117933268A (en) * 2024-03-21 2024-04-26 山东大学 End-to-end unsupervised resistance text rewriting method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106663125A (en) * 2014-08-21 2017-05-10 国立研究开发法人情报通信研究机构 Question sentence generation device and computer program
CN109522411A (en) * 2018-11-12 2019-03-26 南京德磐信息科技有限公司 A kind of writing householder method neural network based
CN110162595A (en) * 2019-03-29 2019-08-23 深圳市腾讯计算机系统有限公司 For generating the method, apparatus, equipment and readable storage medium storing program for executing of text snippet

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105654945B (en) * 2015-10-29 2020-03-06 乐融致新电子科技(天津)有限公司 Language model training method, device and equipment
CN110019732B (en) * 2017-12-27 2021-10-15 华为技术有限公司 Intelligent question answering method and related device
CN109062937B (en) * 2018-06-15 2019-11-26 北京百度网讯科技有限公司 The method of training description text generation model, the method and device for generating description text
CN110619118B (en) * 2019-03-28 2022-10-28 中国人民解放军战略支援部队信息工程大学 Automatic text generation method
CN110196899B (en) * 2019-06-11 2020-07-21 中央民族大学 Low-resource language question-answer corpus generating method
CN111428448B (en) * 2020-03-02 2024-05-07 平安科技(深圳)有限公司 Text generation method, device, computer equipment and readable storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106663125A (en) * 2014-08-21 2017-05-10 国立研究开发法人情报通信研究机构 Question sentence generation device and computer program
CN109522411A (en) * 2018-11-12 2019-03-26 南京德磐信息科技有限公司 A kind of writing householder method neural network based
CN110162595A (en) * 2019-03-29 2019-08-23 深圳市腾讯计算机系统有限公司 For generating the method, apparatus, equipment and readable storage medium storing program for executing of text snippet

Also Published As

Publication number Publication date
CN111428448A (en) 2020-07-17
WO2021174827A1 (en) 2021-09-10

Similar Documents

Publication Publication Date Title
CN111428448B (en) Text generation method, device, computer equipment and readable storage medium
CN109034069B (en) Method and apparatus for generating information
CN109376267B (en) Method and apparatus for generating a model
CN112487139B (en) Text-based automatic question setting method and device and computer equipment
CN111767366A (en) Question and answer resource mining method and device, computer equipment and storage medium
CN114241505B (en) Method and device for extracting chemical structure image, storage medium and electronic equipment
CN116824278B (en) Image content analysis method, device, equipment and medium
CN112818995B (en) Image classification method, device, electronic equipment and storage medium
CN112182269B (en) Training of image classification model, image classification method, device, equipment and medium
CN111915086A (en) Abnormal user prediction method and equipment
CN109978058B (en) Method, device, terminal and storage medium for determining image classification
CN110852071A (en) Knowledge point detection method, device, equipment and readable storage medium
CN109657710B (en) Data screening method and device, server and storage medium
CN112529116B (en) Scene element fusion processing method, device and equipment and computer storage medium
CN115439734A (en) Quality evaluation model training method and device, electronic equipment and storage medium
CN115145928A (en) Model training method and device and structured abstract acquisition method and device
CN111310847B (en) Method and device for training element classification model
CN111652767B (en) User portrait construction method and device, computer equipment and storage medium
CN113610080A (en) Cross-modal perception-based sensitive image identification method, device, equipment and medium
CN113516182A (en) Visual question-answering model training method and device, and visual question-answering method and device
CN111723188A (en) Sentence display method and electronic equipment based on artificial intelligence for question-answering system
CN116612466B (en) Content identification method, device, equipment and medium based on artificial intelligence
CN114743043B (en) Image classification method, electronic device, storage medium and program product
CN116824459B (en) Intelligent monitoring and evaluating method, system and storage medium for real-time examination
KR102467047B1 (en) Method and Apparatus for Estimating Annotation

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant