CN114610871A - Information system modeling analysis method based on artificial intelligence algorithm - Google Patents
- Publication number: CN114610871A
- Application number: CN202210511565.0A
- Authority: CN (China)
- Prior art keywords
- encoder
- hidden state
- decoder
- moment
- vector
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G06F16/345—Information retrieval of unstructured textual data; Summarisation for human users
- G06N3/044—Neural networks; Recurrent networks, e.g. Hopfield networks
- G06N3/047—Neural networks; Probabilistic or stochastic networks
- G06N3/048—Neural networks; Activation functions
- G06N3/08—Neural networks; Learning methods
- Y02A10/40—Technologies for adaptation to climate change at coastal zones and river basins; Controlling or monitoring, e.g. of flood or hurricane; Forecasting, e.g. risk assessment or mapping
Abstract
The invention relates to an information system modeling analysis method based on an artificial intelligence algorithm, which comprises the following steps: parsing the model document with a neural language model to obtain a set of word vectors; decomposing the model document into its full text and n chapters, each consisting of word vectors; processing the model full text through an LSTM recurrent neural network to generate a full-text thought vector c; processing the n chapters, conditioned on the full-text thought vector c, through an LSTM recurrent neural network to generate chapter thought vectors cp; and processing the chapter thought vector cp through an LSTM recurrent neural network to generate a model abstract and so realize intention analysis. Because the LSTM recurrent neural network used in this scheme resolves intention correctly and generates the model abstract automatically, the method both improves the accuracy of intelligence analysis results and reduces the time and effort cost borne by intelligence analysts.
Description
Technical Field
The invention relates to the technical field of information analysis, in particular to an information system modeling analysis method based on an artificial intelligence algorithm.
Background
With the development of information technology, many new techniques have been applied to the field of intelligence analysis, greatly improving intelligence-gathering capacity. Intelligence is a prerequisite and basis for decision making, but more is not always better: beyond a certain point the effect reverses, especially in today's flood of information. The internet is awash with information of every kind, true and false are hard to distinguish, and raw information cannot be used directly; only analysis yields valuable intelligence.
Intelligence analysis is a process of selecting and synthesizing information. In the traditional mode, an analyst gathers data through ordinary search engines and manages it by manually organizing documents, which consumes a great deal of time and effort; moreover, the analysis models accumulated in this process cannot be reused directly in later collection and analysis, so the analysis results are often unsatisfactory.
Models are widely built and applied in the field of intelligence analysis, and these models are becoming increasingly computerized, intelligent, full-source, and formalized. Actually applying an intelligence analysis model in a working intelligence system, so that collection and analysis become automated and intelligent, is difficult: business modeling of intelligence follows no fixed rules and cannot be fully structured, so analysis models are generally expressed in natural language. The intelligence analysis models in current use therefore face the following problems during intelligence gathering and analysis:
First, how can a computer read an intelligence analysis model and accurately comprehend the modeler's intention? To realize automated collection and analysis, the model's intention must be resolved, and the accuracy of that resolution determines the accuracy of the final intelligence analysis result.
Second, how should retrieved information be analyzed so that it matches the requirements of the original model as closely as possible, so that an analysis report can be generated? Information technology greatly assists collection and analysis — for example, keywords enable fast search — but existing search systems ingest information data indiscriminately, imposing enormous time and effort costs on intelligence analysts.
Therefore, it is necessary both to improve the accuracy of intelligence analysis results and to reduce the time and effort cost borne by intelligence analysts.
Disclosure of Invention
The invention aims to improve the accuracy of intelligence analysis results and to reduce the time and effort cost to intelligence analysts, and to this end provides an intelligence system modeling analysis method based on an artificial intelligence algorithm.
In order to achieve the above object, the embodiments of the present invention provide the following technical solutions:
An intelligence system modeling analysis method based on an artificial intelligence algorithm comprises the following steps:
step S1: collecting and analyzing intelligence by using an intelligence analysis model so as to generate a model document;
step S2: analyzing the model document by using a neural language model trained based on a domain corpus, thereby obtaining a plurality of Word vectors, wherein the neural language model is a Word2Vec model; performing chapter decomposition on the model document to obtain a model full text and n chapters corresponding to the model document, wherein the model full text and the n chapters respectively comprise a plurality of word vectors;
step S3: processing the model full text through an LSTM recurrent neural network to generate a full text thought vector c; processing the n chapters through an LSTM recurrent neural network based on the full-text thought vector c to generate chapter thought vectors cp;
step S4: and processing the chapter thought vector cp through an LSTM recurrent neural network to generate a model abstract and realize intention analysis.
In this scheme, an existing intelligence analysis model is used to gather intelligence, but what is gathered is disordered and of mixed quality. The scheme therefore processes the gathered material (the model document): it is first decomposed into a set of word vectors, which an LSTM recurrent neural network then processes automatically to generate a model abstract, from which the key information is obtained. Because the LSTM recurrent neural network resolves intention correctly and generates the model abstract automatically, the method improves the accuracy of intelligence analysis results while reducing analysts' time and effort costs.
The step S1 specifically includes the following steps: using the intelligence analysis model, task nodes are established according to the user's analysis requirements, and work content — for example search content, search scope, and analysis method — is set for each node, thereby forming an analysis link; this analysis link is the model document.
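As a minimal sketch of step S1, the task nodes and the analysis link they form can be modeled as plain data structures. The field names (`search_content`, `search_scope`, `analysis_method`) and the `##` chapter delimiter are hypothetical, since the patent does not fix a concrete schema:

```python
from dataclasses import dataclass, field


@dataclass
class TaskNode:
    """One task node of the analysis link, with its work content."""
    name: str
    search_content: str = ""
    search_scope: str = ""
    analysis_method: str = ""


@dataclass
class AnalysisLink:
    """A chain of task nodes; serializing it yields the model document."""
    nodes: list = field(default_factory=list)

    def add(self, node):
        self.nodes.append(node)
        return self

    def to_model_document(self):
        # One '##'-delimited chapter per node (hypothetical format).
        parts = []
        for n in self.nodes:
            parts.append(f"## {n.name}\ncontent: {n.search_content}\n"
                         f"scope: {n.search_scope}\nmethod: {n.analysis_method}")
        return "\n".join(parts)


link = AnalysisLink().add(
    TaskNode("collect", "cyber incidents", "news", "keyword"))
doc = link.to_model_document()
```

Serializing the link node by node gives a natural chapter structure for the decomposition performed in step S2.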
The step S3 is preceded by the step of training the LSTM recurrent neural network:
the LSTM recurrent neural network comprises an encoder and a decoder;
a word-vector training set X = {x_t}, t ∈ [1, N] (N an integer greater than or equal to 1), is input to the encoder of the LSTM recurrent neural network in sequence, where x_t denotes the word vector input to the encoder at time t. When t = 1, a first initial hidden state h_0 and x_1 are input to the encoder, which outputs the hidden state h_1 of this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of this moment are input to the encoder, which outputs the hidden state h_t; after all word vectors x_t have been input, the encoder finally outputs the hidden state h_N of the Nth moment.
A predicted-value training set Y = {y_t'}, t' ∈ [1, M] (M an integer greater than or equal to 1), is input to the decoder of the LSTM recurrent neural network in sequence, where y_t' denotes the predicted value input to the decoder at time t'. When t' = 1, a second initial hidden state h'_0 and the initial predicted value y_1 are input to the decoder, where h'_0 is the hidden state h_N output by the encoder at the Nth moment and y_1 is the custom token <bos>; the decoder outputs the hidden state h'_1 of this moment and the predicted value y_2 of the next moment. When t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_t' of this moment are input to the decoder, which outputs the hidden state h'_t' and the predicted value y_{t'+1} of the next moment; this continues until all predicted values y_t' have been input to the decoder, or until the decoder outputs the custom token y_{t'+1} = <eos>.
A trained encoder and a trained decoder are thereby obtained.
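The unrolling described above can be sketched as follows. A single tanh recurrence stands in for a full LSTM cell, the weights are random rather than trained, and all sizes are illustrative; the sketch shows only the data flow (hidden state threaded through the encoder, h'_0 = h_N, a `<bos>` stand-in starting the decoder, fixed-length teacher forcing), not the learning procedure:

```python
import numpy as np

rng = np.random.default_rng(0)
D = 4                                 # hidden / embedding size (illustrative)
Wx = rng.normal(size=(D, D)) * 0.1
Wh = rng.normal(size=(D, D)) * 0.1


def step(h, x):
    """Simplified recurrent cell standing in for an LSTM step."""
    return np.tanh(Wx @ x + Wh @ h)


def encode(xs):
    h = np.zeros(D)                   # first initial hidden state h_0
    states = []
    for x in xs:                      # t = 1 .. N
        h = step(h, x)
        states.append(h)
    return states                     # [h_1, ..., h_N]


def decode(h0, ys):
    h = h0                            # second initial hidden state h'_0 = h_N
    states = []
    for y in ys:                      # y_1 is the <bos> stand-in, then targets
        h = step(h, y)
        states.append(h)
    return states                     # [h'_1, ..., h'_M]


xs = [rng.normal(size=D) for _ in range(5)]                  # N = 5 word vectors
ys = [np.zeros(D)] + [rng.normal(size=D) for _ in range(2)]  # M = 3; zeros ~ <bos>
enc_states = encode(xs)
dec_states = decode(enc_states[-1], ys)
```

In real training, a loss on the decoder's predicted words would drive gradient updates of the cell weights; here the point is only how the states chain together.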
In the above scheme, a large number of word vectors are collected as a training set to train the encoder and decoder of the LSTM recurrent neural network, so that the network learns to resolve intention correctly.
The step of processing the model full text through the LSTM recurrent neural network to generate the full-text thought vector c comprises the following steps:
the LSTM recurrent neural network comprises the trained encoder and the trained decoder;
the model full text contains T word vectors, X = {x_t}, t ∈ [1, T] (T an integer greater than or equal to 1), which are input to the encoder in sequence, where x_t denotes the word vector input to the encoder at time t. When t = 1, the first initial hidden state h_0 and the word vector x_1 are input to the trained encoder, which outputs the hidden state h_1 of this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of this moment are input, and the encoder outputs the hidden state h_t; after all word vectors have been input, the hidden state h_T of the Tth moment is obtained.
Then the predicted words Y = {y_t'}, t' ∈ [1, T'] (T' an integer greater than or equal to 1), are input to the decoder in sequence, where y_t' denotes the predicted value input to the decoder at time t'. When t' = 1, the second initial hidden state h'_0 (the hidden state h_T output by the encoder) and the initial predicted value y_1 = <bos> are input to the trained decoder, which outputs the hidden state h'_1 of this moment and the predicted value y_2 of the next moment. When t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_t' of this moment are input, and the decoder outputs the hidden state h'_t' and the predicted value y_{t'+1}; this continues until time T' is reached, or until the decoder outputs the custom token y_{t'+1} = <eos>.
The full-text thought vector c is then generated from the hidden states output by the encoder and the decoder.
In the scheme, the trained LSTM recurrent neural network is used for processing the full text of the model to generate a full text thought vector c.
The step of generating the full-text thought vector c from the hidden states output by the encoder and decoder comprises: at the t'-th moment of the decoder, the full-text thought vector c_t' can be generated as

c_t' = Σ_{i=1..T} α_{t',i} · h_i

where c_t' denotes the full-text thought vector at time t', i indexes the moments of the encoder, and h_i is the hidden state output by the encoder at its i-th moment.

From the hidden state h'_t' output by the decoder at the t'-th moment and the hidden state h_i output by the encoder at each moment, a score e_{t',i} is calculated; after softmax processing, e_{t',i} is converted into the weight α_{t',i}. This gives the full-text thought vector c = {c_t'}, t' ∈ [1, T'].

The score e_{t',i} calculated from the hidden state h'_t' output by the decoder at the t'-th moment and the hidden state h_i output by the encoder at each moment is

e_{t',i} = v^T · tanh(W_1 · h'_t' + W_2 · h_i)

where v, W_1 and W_2 are learnable parameters; h'_t' and h_i are combined and input into a multi-layer perceptron to obtain the score, with tanh as the activation function.
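A numeric sketch of this scoring-and-weighting step, assuming the additive (multi-layer perceptron with tanh) form described above; `v`, `W1`, and `W2` are random stand-ins for the learnable parameters, and all sizes are illustrative:

```python
import numpy as np

rng = np.random.default_rng(1)
D = 4                                     # hidden size (illustrative)
W1 = rng.normal(size=(D, D))              # learnable, here random stand-ins
W2 = rng.normal(size=(D, D))
v = rng.normal(size=D)


def thought_vector(h_dec, h_enc_states):
    """Additive attention: score each encoder state against the decoder state,
    softmax the scores into weights, and return the weighted sum (the thought
    vector c_t') together with the weights."""
    scores = np.array([v @ np.tanh(W1 @ h_dec + W2 @ h_i)
                       for h_i in h_enc_states])
    weights = np.exp(scores - scores.max())
    weights = weights / weights.sum()     # softmax over encoder moments
    context = sum(w * h_i for w, h_i in zip(weights, h_enc_states))
    return context, weights


h_dec = rng.normal(size=D)                # decoder hidden state h'_t'
h_enc = [rng.normal(size=D) for _ in range(6)]   # encoder states h_1..h_6
c_t, alpha = thought_vector(h_dec, h_enc)
```

The softmax guarantees the weights over the encoder moments sum to 1, so c_t' is a convex combination of the encoder hidden states.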
The step of processing the n chapters, based on the full-text thought vector c, through the LSTM recurrent neural network to generate the chapter thought vector cp comprises the following steps:
the LSTM recurrent neural network comprises the trained encoder and the trained decoder;
the n chapters are analyzed based on the full-text thought vector c; they contain K word vectors, X = {x_k}, k ∈ [1, K] (K an integer greater than or equal to 1), where x_k denotes the word vector input to the encoder at time k. When k = 1, a third initial hidden state g_0 and the word vector x_1 are input to the trained encoder, which outputs the hidden state g_1 of this moment; when k > 1, the hidden state g_{k-1} of the previous moment and the word vector x_k of this moment are input, and the encoder outputs the hidden state g_k; after all word vectors have been input, the hidden state g_K of the Kth moment is obtained.
Then the predicted words Y = {y_k'}, k' ∈ [1, K'] (K' an integer greater than or equal to 1), are input to the decoder in sequence, where y_k' denotes the predicted value input to the decoder at time k'. When k' = 1, a fourth initial hidden state g'_0 (the hidden state g_K output by the encoder) and the initial predicted value y_1 = <bos> are input to the trained decoder, which outputs the hidden state g'_1 of this moment and the predicted value y_2 of the next moment. When k' > 1, the hidden state g'_{k'-1} of the previous moment and the predicted value y_k' of this moment are input, and the decoder outputs the hidden state g'_k' and the predicted value y_{k'+1} of the next moment; this continues until time K' is reached, or until the decoder outputs the custom token y_{k'+1} = <eos>.
The chapter thought vector cp is generated from the hidden states output by the encoder and decoder.
In the scheme, the word vectors in n chapters are analyzed by using the full-text thought vector c, and then the word vectors in n chapters are processed by using the trained LSTM recurrent neural network to generate the chapter thought vector cp.
The step of generating the chapter thought vector cp from the hidden states output by the encoder and decoder comprises: at the k'-th moment of the decoder, the chapter thought vector cp_k' can be generated as

cp_k' = Σ_{j=1..K} β_{k',j} · g_j

where cp_k' denotes the chapter thought vector at time k', j indexes the moments of the encoder, and g_j is the hidden state output by the encoder at its j-th moment.

From the hidden state g'_k' output by the decoder at the k'-th moment and the hidden state g_j output by the encoder at each moment, a score e_{k',j} is calculated; after softmax processing, e_{k',j} is converted into the weight β_{k',j}. This gives the chapter thought vector cp = {cp_k'}, k' ∈ [1, K'].

The score e_{k',j} calculated from the hidden state g'_k' output at the k'-th moment of the decoder and the hidden state g_j output by the encoder at each moment is

e_{k',j} = v^T · tanh(W_1 · g'_k' + W_2 · g_j)

where v, W_1 and W_2 are learnable parameters; g'_k' and g_j are combined and input into a multi-layer perceptron to obtain the score, with tanh as the activation function.
The step of processing the chapter thought vector cp through the LSTM recurrent neural network to generate the model abstract and realize intention analysis comprises the following steps:
at the k'-th moment of the decoder, the chapter thought vector cp_k' and the hidden state g'_k' output by the decoder are concatenated and input to a fully connected layer, and the abstract is obtained through softmax:

Pvocab = softmax(W · [cp_k' ; g'_k'] + b)

where W and b are the parameters of the fully connected layer and Pvocab gives the probability of each candidate word at moment k'; the word with the maximum probability value at moment k' is selected as the abstract word p_k' of that moment;
thereby generating the model abstract P = {p_k'}, k' ∈ [1, K'].
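A sketch of this final step under assumed shapes: the five-word vocabulary, the fully connected weights `W` and `b`, and the input vectors are all illustrative stand-ins; only the concatenate → linear → softmax → argmax flow follows the text:

```python
import numpy as np

rng = np.random.default_rng(2)
D, V = 4, 5                               # hidden size, vocab size (illustrative)
W = rng.normal(size=(V, 2 * D))           # fully connected layer (stand-in)
b = rng.normal(size=V)
vocab = ["<eos>", "search", "scope", "method", "report"]  # hypothetical


def summary_token(cp, g):
    """Concatenate cp_k' and g'_k', apply the fully connected layer and
    softmax to get Pvocab, and return the maximum-probability word."""
    logits = W @ np.concatenate([cp, g]) + b
    e = np.exp(logits - logits.max())
    p_vocab = e / e.sum()                 # Pvocab over the vocabulary
    return vocab[int(np.argmax(p_vocab))], p_vocab


tok, p_vocab = summary_token(rng.normal(size=D), rng.normal(size=D))
```

Repeating this at each decoder moment k' until `<eos>` yields the model abstract P = {p_k'}.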
In the scheme, a final model abstract is obtained according to the generated chapter idea vector cp.
Compared with the prior art, the invention has the beneficial effects that:
The LSTM recurrent neural network used in this scheme resolves intention correctly and generates the model abstract automatically, which both improves the accuracy of intelligence analysis results and reduces the time and effort cost borne by intelligence analysts.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
FIG. 1 is a flow chart of the method of the present invention;
FIG. 2 illustrates the process of training the LSTM recurrent neural network according to an embodiment of the present invention;
FIG. 3 illustrates processing the model full text with the LSTM recurrent neural network according to an embodiment of the present invention;
FIG. 4 illustrates processing the n chapters with the LSTM recurrent neural network according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments of the present invention without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures. Also, in the description of the present invention, the terms "first", "second", and the like are used for distinguishing between descriptions and not necessarily for describing a relative importance or implying any actual relationship or order between such entities or operations.
Example 1:
the invention is realized by the following technical scheme, and the intelligence system modeling analysis method based on the artificial intelligence algorithm comprises the following steps:
step S1: intelligence gathering and analysis is performed using an intelligence analysis model, thereby generating a model document.
Using the existing intelligence analysis model, task nodes are established according to the user's analysis requirements, and work content — such as search content, search scope, and analysis method — is set for each node, thereby forming an analysis link; this analysis link is the model document.
Step S2: analyzing the model document by using a neural language model trained based on a domain corpus so as to obtain a plurality of Word vectors, wherein the neural language model is a Word2Vec model; and performing chapter decomposition on the model document to obtain a model full text and n chapters corresponding to the model document, wherein the model full text and the n chapters respectively comprise a plurality of word vectors.
This step parses the model document in order to decompose it into the model full text and n chapters, where the full text consists of T word vectors and the n chapters of K' word vectors, so the model document comprises T + K' word vectors in all. In step S3, however, the n chapters are revised according to the generated full-text thought vector c — for example, the K' word vectors they contain become K word vectors — so that the thought intent of the model full text is incorporated.
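The decomposition can be sketched as follows; the `##` chapter delimiter and the hash-based toy word vectors are hypothetical stand-ins for real chapter boundaries and a Word2Vec model trained on a domain corpus:

```python
import hashlib


def word_vector(token, dim=4):
    """Toy deterministic stand-in for a Word2Vec lookup."""
    digest = hashlib.md5(token.encode("utf-8")).digest()
    return [byte / 255.0 for byte in digest[:dim]]


def decompose(document):
    """Split a model document into (full_text_vectors, chapter_vectors)."""
    # Split on the hypothetical '##' chapter marker, then tokenize each chapter.
    chapters = [c.strip() for c in document.split("##") if c.strip()]
    chapter_vecs = [[word_vector(w) for w in ch.split()] for ch in chapters]
    # The full text is the concatenation of all chapters' word vectors.
    full_text_vecs = [v for ch in chapter_vecs for v in ch]
    return full_text_vecs, chapter_vecs


doc = "## search scope global news ## analysis method keyword match"
full_vecs, chap_vecs = decompose(doc)
```

Both decompositions feed the same LSTM encoder in step S3 — the full text once, then each chapter conditioned on the resulting full-text thought vector c.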
Step S3: processing the model full text through an LSTM recurrent neural network to generate a full text thought vector c; and processing the n chapters on the basis of the full-text thought vector c through an LSTM recurrent neural network to generate a chapter thought vector cp.
Before the model full text is processed using the LSTM recurrent neural network, which includes an encoder and a decoder, the LSTM recurrent neural network needs to be trained. During training, a large number of word vectors are collected, and the word vectors can be words commonly used in the industry, words commonly used in life, professional terms in the field and the like, and are not limited here.
Referring to fig. 2, a large number of word vectors form the training set X = {x_t}, t ∈ [1, N], where N (an integer greater than or equal to 1) is the number of word vectors. The N word vectors in X are input to the encoder of the LSTM recurrent neural network in sequence: the first word vector input is x_1 (t = 1) and the last is x_N (t = N); that is, x_t denotes the word vector input to the encoder at time t.
When t = 1, the first initial hidden state h_0 and x_1 are input to the encoder, which outputs the hidden state h_1 of this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of this moment are input, and the encoder outputs the hidden state h_t; after all word vectors x_t have been input, the encoder finally outputs the hidden state h_N of the Nth moment. It should be noted that the first initial hidden state h_0 is user-defined and can be set according to the actual situation.
Then the predicted-value training set Y = {y_t'}, t' ∈ [1, M] (M an integer greater than or equal to 1), is input to the decoder of the LSTM recurrent neural network in sequence, where y_t' denotes the predicted value input to the decoder at time t'; each predicted value y_t' is obtained in turn from the decoder's output. When t' = 1, the second initial hidden state h'_0 and the initial predicted value y_1 are input to the decoder, where h'_0 is the hidden state h_N output by the encoder at the Nth moment and y_1 is the user-defined token <bos>; the decoder outputs the hidden state h'_1 of this moment and the predicted value y_2 of the next moment. When t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_t' of this moment are input, and the decoder outputs the hidden state h'_t' and the predicted value y_{t'+1}; this continues until all predicted values have been input and the decoder outputs the hidden state h'_M of the Mth moment and the predicted value y_{M+1}, or until the decoder outputs the user-defined token y_{t'+1} = <eos>.
Thereby obtaining a trained encoder and a trained decoder.
Referring to fig. 3, assume the model full text consists of T word vectors: X = {x_t}, t ∈ [1, T] (T an integer greater than or equal to 1), input to the encoder in sequence, where x_t denotes the word vector input to the encoder at time t. When t = 1, the first initial hidden state h_0 and the word vector x_1 are input to the trained encoder, which outputs the hidden state h_1 of this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of this moment are input, and the encoder outputs the hidden state h_t; after all word vectors have been input, the hidden state h_T of the Tth moment is obtained.
Then the predicted words Y = {y_t'}, t' ∈ [1, T'] (T' an integer greater than or equal to 1), are input to the decoder in sequence, where y_t' denotes the predicted value input to the decoder at time t'. When t' = 1, the second initial hidden state h'_0 (the hidden state h_T output by the encoder) and the initial predicted value y_1 = <bos> are input to the trained decoder, which outputs the hidden state h'_1 of this moment and the predicted value y_2 of the next moment; when t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_t' of this moment are input, and the decoder outputs the hidden state h'_t' and the predicted value y_{t'+1}; this continues until time T' is reached and the decoder outputs the hidden state h'_T' and the predicted value y_{T'+1}, or until the decoder outputs the custom token y_{t'+1} = <eos>.
The full-text thought vector c is generated from the hidden states output by the encoder and the decoder; at the t'-th moment of the decoder, the full-text thought vector c_t' is generated as

c_t' = Σ_{i=1..T} α_{t',i} · h_i

where c_t' denotes the full-text thought vector at time t', i indexes the moments of the encoder, and h_i is the hidden state output by the encoder at its i-th moment.

From the hidden state h'_t' output by the decoder at the t'-th moment and the hidden state h_i output by the encoder at each moment, a score e_{t',i} is calculated:

e_{t',i} = v^T · tanh(W_1 · h'_t' + W_2 · h_i)

where v, W_1 and W_2 are learnable parameters; h'_t' and h_i are combined and input into a multi-layer perceptron to obtain the score, with tanh as the activation function.
After softmax processing, e_{t',i} is converted into the weight α_{t',i}, giving the full-text thought vector c = {c_t'}, t' ∈ [1, T'].
Referring to fig. 4, the n chapters are analyzed using the full-text thought vector c. Assume the n chapters consist of K word vectors: X = {x_k}, k ∈ [1, K] (K an integer greater than or equal to 1), processed in the same way as the model full text, where x_k denotes the word vector input to the encoder at time k.
When k = 1, the third initial hidden state g_0 and the word vector x_1 are input to the trained encoder, which outputs the hidden state g_1 of this moment; when k > 1, the hidden state g_{k-1} of the previous moment and the word vector x_k of this moment are input, and the encoder outputs the hidden state g_k; after all word vectors x_k have been input, the hidden state g_K of the Kth moment is obtained. It should be noted that the third initial hidden state g_0 is user-defined and can be set according to the actual situation.
Then the predicted words Y = {y_k'}, k' ∈ [1, K'] (K' an integer greater than or equal to 1), are input to the decoder in sequence, where y_k' denotes the predicted value input to the decoder at time k'. When k' = 1, the fourth initial hidden state g'_0 (the hidden state g_K output by the encoder) and the initial predicted value y_1 = <bos> are input to the trained decoder, which outputs the hidden state g'_1 of this moment and the predicted value y_2 of the next moment; when k' > 1, the hidden state g'_{k'-1} of the previous moment and the predicted value y_k' of this moment are input, and the decoder outputs the hidden state g'_k' and the predicted value y_{k'+1} of the next moment; this continues until time K' is reached and the decoder outputs the hidden state g'_K' and the predicted value y_{K'+1}, or until the decoder outputs the custom token y_{k'+1} = <eos>.
The chapter thought vector cp is generated according to the hidden states output by the encoder and the decoder. At time k' of the decoder, the chapter thought vector cp_{k'} can be generated as:

cp_{k'} = Σ_{j=1}^{K} a_{k',j} · g_j

wherein cp_{k'} represents the chapter thought vector at time k', a_{k',j} is the attention weight obtained from the softmax-normalized scores, j is the jth moment of the encoder, and g_j is the hidden state output at the jth moment of the encoder;
the hidden state g'_{k'} output by the decoder at time k' and the hidden state g_j output by the encoder at each moment are used to calculate a score:

e_{k',j} = v^T · tanh(W_1 · g'_{k'} + W_2 · g_j)

wherein v, W_1 and W_2 represent learnable parameters; g'_{k'} and g_j are combined and input into a multilayer perceptron to obtain the score, and tanh is the activation function.
After the scores are processed by softmax, each score e_{k',j} is converted into a weight a_{k',j}:

a_{k',j} = exp(e_{k',j}) / Σ_{j=1}^{K} exp(e_{k',j})

The chapter thought vectors cp = {cp_{k'}}, k' ∈ K', are thus obtained.
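One attention step under the formulas above — score, softmax weight, weighted sum — can be sketched in plain Python. Scalar weights W1 and W2 stand in for the learnable matrices, so this is an illustrative sketch under that simplification rather than the patented implementation.

```python
import math

def attention(dec_hidden, enc_states, v, W1=1.0, W2=1.0):
    """e_{k',j} = v . tanh(W1*g'_{k'} + W2*g_j); softmax over j
    gives a_{k',j}; cp_{k'} = sum_j a_{k',j} * g_j."""
    scores = []
    for g_j in enc_states:
        combined = [math.tanh(W1 * d + W2 * gj)
                    for d, gj in zip(dec_hidden, g_j)]
        scores.append(sum(vi * ci for vi, ci in zip(v, combined)))
    m = max(scores)                      # subtract max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]  # softmax weights a_{k',j}
    dim = len(enc_states[0])
    cp = [sum(weights[j] * enc_states[j][i] for j in range(len(enc_states)))
          for i in range(dim)]
    return weights, cp

w, cp = attention([0.3, -0.1], [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]], v=[1.0, 1.0])
```

Because the weights are a softmax output, they are positive and sum to 1, so cp is a convex combination of the encoder hidden states.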
Step S4: processing the chapter thought vectors cp through the LSTM recurrent neural network to generate a model abstract and realize intention analysis.
At time k' of the decoder, the chapter thought vector cp_{k'} and the hidden state g'_{k'} output by the decoder are spliced together and input into a fully connected layer, and the abstract is obtained through softmax:

Pvocab = softmax(W · [g'_{k'}; cp_{k'}] + b)

wherein Pvocab represents the probability value corresponding to each candidate word, and W and b are the parameters of the fully connected layer; the word with the maximum probability value at time k' is selected as the abstract word p of that moment;
thereby generating the model abstract P = {p_{k'}}, k' ∈ K'.
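The final generation step — splice g'_{k'} with cp_{k'}, pass the result through a fully connected layer, apply softmax over the vocabulary, and take the highest-probability word — can be sketched as follows. The weight matrix W and the tiny three-word vocabulary are hypothetical illustrations, not values from the patent.

```python
import math

def generate_word(dec_hidden, cp, W_out, vocab):
    """Pvocab = softmax(W_out . [g'_{k'}; cp_{k'}]);
    return the argmax word and the full distribution."""
    z = dec_hidden + cp                       # list concatenation = splice
    logits = [sum(w * zi for w, zi in zip(row, z)) for row in W_out]
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    pvocab = [e / sum(exps) for e in exps]    # probability per vocabulary word
    best = max(range(len(vocab)), key=lambda i: pvocab[i])
    return vocab[best], pvocab

# hypothetical fully connected layer: |V| = 3 rows, 2+2 spliced inputs
W = [[1.0, 0.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 1.0, 1.0]]
word, p = generate_word([2.0, 0.5], [0.1, 0.1], W, ["report", "node", "link"])
```

Running this step once per decoder moment k' and collecting the argmax words yields the abstract P = {p_{k'}}.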
The above description covers only specific embodiments of the present invention, but the scope of the present invention is not limited thereto; any change or substitution that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.
Claims (9)
1. An information system modeling analysis method based on an artificial intelligence algorithm, characterized in that the method comprises the following steps:
step S1: collecting and analyzing intelligence by using an intelligence analysis model so as to generate a model document;
step S2: analyzing the model document by using a neural language model trained on a domain corpus so as to obtain a plurality of word vectors, wherein the neural language model is a Word2Vec model; performing chapter decomposition on the model document to obtain a model full text and n chapters corresponding to the model document, wherein the model full text and the n chapters each comprise a plurality of word vectors;
Step S3: processing the model full text through an LSTM recurrent neural network to generate a full text thought vector c; processing the n chapters through an LSTM recurrent neural network based on the full-text thought vector c to generate chapter thought vectors cp;
step S4: processing the chapter thought vectors cp through the LSTM recurrent neural network to generate a model abstract and realize intention analysis.
2. The information system modeling analysis method based on artificial intelligence algorithm of claim 1, characterized in that: the step S1 specifically comprises the following steps: task nodes are established according to the analysis requirements of users by using the intelligence analysis model, and working contents are set for the nodes established by each intelligence analysis model; the working contents can be search contents, a search range, an analysis method, and the like, so that an analysis link is formed, and the analysis link is the model document.
3. The information system modeling analysis method based on artificial intelligence algorithm of claim 1, characterized in that: before the step S3, the method further comprises the step of training the LSTM recurrent neural network:
the LSTM recurrent neural network comprises an encoder and a decoder;
a word vector training set X = {x_t} is input in sequence to the encoder of the LSTM recurrent neural network, where t ∈ N, N is an integer greater than or equal to 1, and x_t represents the word vector input to the encoder at time t; when t = 1, the first initial hidden state h_0 and x_1 are input to the encoder, and the encoder outputs the hidden state h_1 at this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of the current moment are input to the encoder, and the encoder outputs the hidden state h_t at this moment; this continues until all word vectors x_t have been input to the encoder, and the encoder finally outputs the hidden state h_N of the Nth moment;
a predicted value training set Y = {y_{t'}} is input in sequence to the decoder of the LSTM recurrent neural network, where t' ∈ M, t' is an integer greater than or equal to 1, M is an integer greater than or equal to 1, and y_{t'} represents the predicted value input to the decoder at time t'; when t' = 1, the second initial hidden state h'_0 and the initial predicted value y_1 are input to the decoder; the second initial hidden state h'_0 is the hidden state h_N of the Nth moment output by the encoder, and the initial predicted value y_1 is the custom value <bos>; the decoder outputs the hidden state h'_1 at this moment and the predicted value y_2 for the next moment; when t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_{t'} of the current moment are input to the decoder, and the decoder outputs the hidden state h'_{t'} at this moment and the predicted value y_{t'+1} for the next moment; this continues until all predicted values y_{t'} have been input to the decoder, or until the decoder outputs the custom predicted value y_{t'+1} = <eos>;
Thereby obtaining a trained encoder and a trained decoder.
4. The information system modeling analysis method based on artificial intelligence algorithm of claim 1, characterized in that: the step of processing the model full text through the LSTM recurrent neural network to generate the full-text thought vector c comprises the following steps:
the LSTM recurrent neural network comprises a trained encoder and a trained decoder;
the model full text contains T word vectors X, and the word vectors X = {x_t} are input to the encoder in sequence, where t ∈ T, t is an integer greater than or equal to 1, and x_t represents the word vector input to the encoder at time t; when t = 1, the first initial hidden state h_0 and the word vector x_1 are input to the trained encoder, and the encoder outputs the hidden state h_1 at this moment; when t > 1, the hidden state h_{t-1} of the previous moment and the word vector x_t of the current moment are input to the encoder, and the encoder outputs the hidden state h_t at this moment; this continues until all word vectors x_t have been input to the encoder, yielding the hidden state h_T at the Tth moment;
next, the predicted words Y = {y_{t'}} are input to the decoder in sequence, where t' is an integer greater than or equal to 1 and y_{t'} represents the predicted value input to the decoder at time t'; when t' = 1, the second initial hidden state h'_0 and the initial predicted value y_1 are input to the trained decoder; the second initial hidden state h'_0 is the hidden state h_T output by the encoder, and the initial predicted value y_1 is the custom value <bos>; the decoder outputs the hidden state h'_1 at this moment and the predicted value y_2 for the next moment; when t' > 1, the hidden state h'_{t'-1} of the previous moment and the predicted value y_{t'} of the current moment are input to the decoder, and the decoder outputs the hidden state h'_{t'} at this moment and the predicted value y_{t'+1} for the next moment; this continues until time T' is reached, or until the decoder outputs the custom predicted value y_{t'+1} = <eos>;
and generating the full-text thought vector c according to the hidden states output by the encoder and the decoder.
5. The information system modeling analysis method based on artificial intelligence algorithm of claim 4, characterized in that: the step of generating the full-text thought vector c according to the hidden states output by the encoder and the decoder comprises the following steps: at time t' of the decoder, a full-text thought vector c_{t'} can be generated:

c_{t'} = Σ_{i=1}^{T} a_{t',i} · h_i

wherein c_{t'} represents the full-text thought vector at time t', a_{t',i} is the attention weight, i is the ith moment of the encoder, and h_i is the hidden state output at the ith moment of the encoder;

the hidden state h'_{t'} output by the decoder at time t' and the hidden state h_i output by the encoder at each moment are used to calculate a score e_{t',i}; after the scores are processed by softmax, each score e_{t',i} is converted into a weight a_{t',i}:

a_{t',i} = exp(e_{t',i}) / Σ_{i=1}^{T} exp(e_{t',i})

thereby obtaining the full-text thought vector c = {c_{t'}}, t' ∈ T'.
6. The intelligence system modeling analysis method based on artificial intelligence algorithm of claim 5, characterized in that: the hidden state h 'output by the decoder at the t' momentt`And the hidden state h output by the encoder at each momentiCalculating a scoreComprises the following steps:
7. The information system modeling analysis method based on artificial intelligence algorithm of claim 1, characterized in that: the step of processing the n chapters through the LSTM recurrent neural network based on the full-text thought vector c to generate the chapter thought vectors cp comprises the following steps:
the LSTM recurrent neural network comprises a trained encoder and a trained decoder;
the n chapters are analyzed based on the full-text thought vector c; the n chapters contain K word vectors X, where X = {x_k}, k ∈ K, k is an integer greater than or equal to 1, and x_k represents the word vector input to the encoder at time k; when k = 1, the third initial hidden state g_0 and the word vector x_1 are input to the trained encoder, and the encoder outputs the hidden state g_1 at this moment; when k > 1, the hidden state g_{k-1} of the previous moment and the word vector x_k of the current moment are input to the encoder, and the encoder outputs the hidden state g_k at this moment; this continues until all word vectors x_k have been input to the encoder, yielding the hidden state g_K at the Kth moment;
next, the predicted words Y = {y_{k'}} are input to the decoder in sequence, where k' is an integer greater than or equal to 1 and y_{k'} represents the predicted value input to the decoder at time k'; when k' = 1, the fourth initial hidden state g'_0 and the initial predicted value y_1 are input to the trained decoder; the fourth initial hidden state g'_0 is the hidden state g_K output by the encoder, and the initial predicted value y_1 is the custom value <bos>; the decoder outputs the hidden state g'_1 at this moment and the predicted value y_2 for the next moment; when k' > 1, the hidden state g'_{k'-1} of the previous moment and the predicted value y_{k'} of the current moment are input to the decoder, and the decoder outputs the hidden state g'_{k'} at this moment and the predicted value y_{k'+1} for the next moment; this continues until time K' is reached, or until the decoder outputs the custom predicted value y_{k'+1} = <eos>;
The chapter idea vector cp is generated from the hidden states output by the encoder and decoder.
8. The information system modeling analysis method based on artificial intelligence algorithm of claim 7, characterized in that: the step of generating the chapter thought vector cp according to the hidden states output by the encoder and the decoder comprises the following steps: at time k' of the decoder, a chapter thought vector cp_{k'} can be generated:

cp_{k'} = Σ_{j=1}^{K} a_{k',j} · g_j

wherein cp_{k'} represents the chapter thought vector at time k', a_{k',j} is the attention weight, j is the jth moment of the encoder, and g_j is the hidden state output at the jth moment of the encoder;

the hidden state g'_{k'} output by the decoder at time k' and the hidden state g_j output by the encoder at each moment are used to calculate a score e_{k',j}; after the scores are processed by softmax, each score e_{k',j} is converted into a weight a_{k',j}:

a_{k',j} = exp(e_{k',j}) / Σ_{j=1}^{K} exp(e_{k',j})

thereby obtaining the chapter thought vector cp = {cp_{k'}}, k' ∈ K'.
9. The information system modeling analysis method based on artificial intelligence algorithm of claim 8, characterized in that: the step of calculating the score e_{k',j} from the hidden state g'_{k'} output by the decoder at time k' and the hidden state g_j output by the encoder at each moment comprises:

e_{k',j} = v^T · tanh(W_1 · g'_{k'} + W_2 · g_j)

wherein v, W_1 and W_2 are learnable parameters and tanh is the activation function.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210511565.0A CN114610871B (en) | 2022-05-12 | 2022-05-12 | Information system modeling analysis method based on artificial intelligence algorithm |
Publications (2)
Publication Number | Publication Date |
---|---|
CN114610871A true CN114610871A (en) | 2022-06-10 |
CN114610871B CN114610871B (en) | 2022-07-08 |
Family
ID=81870419
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN114970553A (en) * | 2022-07-29 | 2022-08-30 | 北京道达天际科技股份有限公司 | Information analysis method and device based on large-scale unmarked corpus and electronic equipment |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109726281A (en) * | 2018-12-12 | 2019-05-07 | Tcl集团股份有限公司 | A kind of text snippet generation method, intelligent terminal and storage medium |
CN110134782A (en) * | 2019-05-14 | 2019-08-16 | 南京大学 | A kind of text snippet model and Method for Automatic Text Summarization based on improved selection mechanism and LSTM variant |
CN111651589A (en) * | 2020-08-10 | 2020-09-11 | 中南民族大学 | Two-stage text abstract generation method for long document |
WO2020261234A1 (en) * | 2019-06-28 | 2020-12-30 | Tata Consultancy Services Limited | System and method for sequence labeling using hierarchical capsule based neural network |
CN113609284A (en) * | 2021-08-02 | 2021-11-05 | 河南大学 | Method and device for automatically generating text abstract fused with multivariate semantics |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: 100085 room 703, 7 / F, block C, 8 malianwa North Road, Haidian District, Beijing Patentee after: Beijing daoda Tianji Technology Co.,Ltd. Address before: 100085 room 703, 7 / F, block C, 8 malianwa North Road, Haidian District, Beijing Patentee before: Beijing daoda Tianji Technology Co.,Ltd. |