WO2024134768A1 - Natural language processing device, natural language processing method, and computer program - Google Patents

Natural language processing device, natural language processing method, and computer program

Info

Publication number
WO2024134768A1
Authority
WO
WIPO (PCT)
Prior art keywords
learning
text
document structure
data
natural language
Prior art date
Application number
PCT/JP2022/046889
Other languages
French (fr)
Japanese (ja)
Inventor
弘毅 中西
公雄 土川
晴夫 大石
Original Assignee
Nippon Telegraph and Telephone Corporation (日本電信電話株式会社)
Filing date
2022-12-20
Publication date
2024-06-27
Application filed by Nippon Telegraph and Telephone Corporation
Publication of WO2024134768A1 publication Critical patent/WO2024134768A1/en

Definitions

  • the present invention relates to a natural language processing device, a natural language processing method, and a computer program.
  • Non-Patent Document 1 describes a domain-specific natural language processing and construction framework developed based on a pre-training method for natural language processing and a general-purpose language model.
  • the present invention was made in consideration of the above circumstances, and aims to provide a natural language processing device, a natural language processing method, and a computer program that accurately grasps context.
  • the natural language processing device includes a document structure data generation unit that acquires table of contents data of a learning procedure manual and generates document structure data showing a tree structure corresponding to the table of contents data, a learning data generation unit that references nodes connected by edges in order from the root node side of the document structure data and, when the node is acquired, combines the text of the learning procedure manual pointed to by the node with learning text, a pre-learning processing unit that constructs a trained evaluation model by deep learning using the learning text, an evaluation target text input unit that inputs evaluation target text, and an evaluation target text evaluation unit that evaluates the input evaluation target text using the trained evaluation model.
  • the natural language processing method includes: acquiring table of contents data of a learning procedure manual, generating document structure data showing a tree structure corresponding to the table of contents data, referencing nodes connected by edges in order from the root node side of the document structure data and, when the node is acquired, combining the text of the learning procedure manual pointed to by the node with a learning text, constructing a trained evaluation model by deep learning using the learning text, inputting a text to be evaluated, and evaluating the input text to be evaluated using the trained evaluation model.
  • a computer program according to a third aspect of the present invention causes a computer to execute the natural language processing method according to the second aspect.
  • the present invention provides a natural language processing device, a natural language processing method, and a computer program that can accurately grasp context.
  • FIG. 1 is a block diagram illustrating an example of a configuration of a natural language processing apparatus according to an embodiment.
  • FIG. 2 is a diagram showing an example of document structure data generated by a document structure data generating unit of the natural language processing apparatus according to an embodiment.
  • FIG. 3 is a flowchart illustrating an example of a learning data generation process of the procedure manual analysis unit of the natural language processing apparatus according to an embodiment.
  • FIG. 4 is a diagram illustrating an example of a natural language processing method of the natural language processing apparatus according to an embodiment.
  • FIG. 1 is a block diagram illustrating an example of the configuration of a natural language processing apparatus 1 according to an embodiment.
  • the natural language processing device 1 of this embodiment is a device that learns using document structure data generated from table of contents data of a learning procedure manual and learning text generated for each of the document structure data, and includes at least one processor and a memory in which a program executed by the processor is recorded, and can realize various functions described below by software or a combination of software and hardware.
  • the natural language processing device 1 includes a learning procedure manual data storage unit 2, a learning procedure manual reading unit 3, a procedure manual analysis unit 4, a document structure data generation unit 5, a learning data generation unit 6, a deep learning type natural language processing unit 7, a pre-learning processing unit 8, an evaluation target text evaluation unit 9, an evaluation result output unit 10, and an evaluation target text input unit 11.
  • the learning procedure manual data storage unit 2 stores data on the learning procedure manual to be learned in advance.
  • the learning procedure manual data storage unit 2 is, for example, a memory, and acquires and stores the learning procedure manual from the outside.
  • the learning procedure manual data storage unit 2 outputs the stored learning procedure manual to the learning procedure manual reading unit 3 as necessary.
  • the learning procedure manual is, for example, an operation manual, and has a table of contents including, for example, the titles and summaries of chapters, sections, and paragraphs.
  • the learning procedure manual reading unit 3 acquires learning procedure manual data from the learning procedure manual data storage unit 2.
  • the learning procedure manual reading unit 3 transmits the acquired learning procedure manual data in response to a request from the procedure manual analysis unit 4.
  • the learning procedure manual reading unit 3 may also periodically acquire learning procedure manual data from the learning procedure manual data storage unit 2 and transmit it to the procedure manual analysis unit 4.
  • the learning procedure manual data storage unit 2 and the learning procedure manual reading unit 3 do not necessarily need to be included in the natural language processing device 1, and may be configured to be connected to the natural language processing device 1 from the outside.
  • FIG. 2 is a diagram showing an example of document structure data generated by the document structure data generating unit 5 of the natural language processing apparatus 1 according to an embodiment.
  • the procedure manual analysis unit 4 includes a document structure data generation unit 5 and a learning data generation unit 6 .
  • the document structure data generation unit 5 generates document structure data showing a tree structure corresponding to the table of contents data included in the learning procedure manual data acquired from the learning procedure manual reading unit 3.
  • in the document structure data, the root node is “table of contents,” and the nodes “Chapter 1,” “Chapter 2,” and “Chapter 3” branch off from the root node by edges (branches).
  • the table of contents data includes, for example, the text described in the chapters, sections, and paragraphs.
  • the document structure data generation unit 5 divides the learning procedure manual according to the number of chapters. For example, if there are three chapters, the document structure data generation unit 5 divides the learning procedure manual into three and generates document structure data.
  • the method of dividing the document structure data is not limited to the above; it may be divided, for example, by the number of sections or the number of paragraphs.
  • the document structure data generation unit 5 obtains the titles of the chapters, sections, and paragraphs, and the summaries described in the chapters, sections, and paragraphs from the data of the learning procedure manual.
  • the document structure data generation unit 5 links the titles of the chapters, sections, and paragraphs of the document structure data with the corresponding summaries.
  • for example, the summary corresponding to chapter n describes that if the OS is Windows (registered trademark), the processing described in chapter n, section 1 must be performed, and if the OS is Mac, the processing described in chapter n, section 2 must be performed.
  • the document structure data generation unit 5 supplies the document structure data and the summary data corresponding to the chapters, sections, and paragraphs of the document structure data to the learning data generation unit 6.
  • the learning data generation unit 6 combines the text of the learning procedure manual to generate learning text.
  • the learning data generation unit 6 acquires the document structure data generated by the document structure data generation unit 5 and summaries corresponding to the chapters, sections, and paragraphs of the document structure data.
  • the learning data generation unit 6 sequentially references the nodes connected by edges (branches) from the root node side of the acquired document structure data.
  • when the learning data generation unit 6 acquires a node of a specific chapter, section, or paragraph of the document structure data (S), it combines the text of the learning procedure manual indicated by the node with the learning text (T). For example, the learning data generation unit 6 acquires a first node included in the nth chapter of the document structure data, and combines the text of the learning procedure manual pointed to by the first node with the learning text. Next, the learning data generation unit 6 acquires a second node connected to an edge branched off from the first node, and combines the text of the learning procedure manual pointed to by the second node with the learning text.
  • continuing this process, the learning data generation unit 6 acquires the nth node connected to the edge branching off from the (n-1)th node, and merges the text of the learning procedure manual pointed to by the nth node into the learning text.
  • the learning data generation unit 6 then sequentially acquires the 1st node to the mth node of the (n+1)th chapter of the document structure data, and sequentially merges the text of the learning procedure manual pointed to by the 1st node to the mth node of the (n+1)th chapter into the learning text.
  • the learning data generation unit 6 also acquires nodes from the document structure data according to the context conditions described in the summary corresponding to the chapter, section, and paragraph of the document structure data, and combines the text of the learning procedure manual indicated by the node with the learning text.
  • the learning data generation unit 6 acquires the context conditions described in the summary corresponding to the nth chapter of the document structure data (n).
  • the context conditions may include, for example, a first condition (e.g., the OS is iOS (registered trademark)) and a second condition (e.g., the OS is Android (registered trademark)).
  • when the first condition described in the context conditions is satisfied, the learning data generation unit 6 acquires a first condition node included in the nth chapter of the document structure data (n) and combines the text of the learning procedure manual indicated by the first condition node (e.g., an operation manual for iOS (registered trademark)) with the learning text (n).
  • when the second condition described in the context conditions is satisfied, the learning data generation unit 6 acquires a second condition node included in the nth chapter of the document structure data (n) and combines the text of the learning procedure manual indicated by the second condition node (e.g., an operation manual for Android (registered trademark)) with the learning text (n).
  • the learning data generation unit 6 references the nodes connected by edges in order from the root node of the document structure data, and if it is unable to obtain a node for a specific chapter, section, or paragraph, it determines that all of the text in the learning procedure manual has been combined into the learning text (T) and outputs the learning text (T) to the deep learning natural language processing unit 7 (pre-learning processing unit 8).
  • the deep learning type natural language processing unit 7 performs deep learning in advance using the learning text and constructs a trained model.
  • the deep learning type natural language processing unit 7 uses the constructed trained model to evaluate the context of the text to be evaluated and outputs the evaluation result.
  • the deep learning type natural language processing unit 7 includes a pre-learning processing unit 8, an evaluation target text evaluation unit 9, and an evaluation result output unit 10.
  • the pre-learning processing unit 8 acquires the learning text output by the learning data generation unit 6 and, by deep learning using the learning text, constructs in advance a trained model (trained evaluation model) that converts an input evaluation target text into text organized in the same structure as the learning text and evaluates the context of the text in order along the document structure of the converted text.
  • the pre-learning processing unit has a trained model that evaluates the context of the input text data based on a known algorithm that evaluates the context of natural language, and may update the trained model by adding a trained model that converts the structure of the text data to be input to the trained model.
  • the pre-learning processing unit 8 may update the trained model every time the training data generation unit 6 outputs training text, or may periodically update the trained model using multiple training texts.
  • the deep learning method in the pre-learning processing unit 8 is not limited, and various deep learning methods such as neural networks can be adopted.
  • the pre-learning processing unit 8 supplies the trained model to the evaluation target text evaluation unit 9.
  • the evaluation target text input unit 11 inputs the evaluation target text acquired from outside to the evaluation target text evaluation unit 9.
  • the evaluation target text is, for example, text such as an operation manual.
  • the evaluation target text evaluation unit 9 evaluates the evaluation target text input from the evaluation target text input unit 11 based on the trained model supplied from the pre-learning processing unit 8.
  • the evaluation result output unit 10 may include, for example, an image output device such as a monitor, and may be configured to output the evaluation results to an image output device.
  • the evaluation result output unit 10 acquires the evaluation results output from the evaluation target text evaluation unit 9, and outputs them to a monitor or the like. Note that the evaluation result output unit 10 may be configured to output the evaluation results not only as images, but also as audio, etc.
  • FIG. 3 is a flowchart illustrating an example of a learning data generation process of the procedure manual analysis unit of the natural language processing apparatus according to an embodiment.
  • FIG. 4 is a diagram for explaining an example of a natural language processing method of the natural language processing apparatus according to an embodiment.
  • the document structure data generation unit 5 of the procedure manual analysis unit 4 acquires the text of the learning procedure manual from the learning procedure manual reading unit 3 (step 21).
  • the document structure data generation unit 5 acquires the table of contents data contained in the text of the learning procedure manual (step 22).
  • the document structure data generation unit 5 generates document structure data (S) showing a tree structure corresponding to the table of contents data (step 23).
  • the learning data generation unit 6 of the procedure manual analysis unit 4 acquires the document structure data (S) and summaries corresponding to the chapters, sections, and paragraphs of the document structure data (S), and generates learning text (T) for studying the text of the learning procedure manual (step 24).
  • the learning data generation unit 6 sequentially references the nodes connected by edges starting from the root node of the document structure data (S) and determines whether a node for a specific chapter, section, or paragraph of the document structure data (S) has been obtained (step 25).
  • the learning data generation unit 6 determines that it has acquired a node for a specific chapter, section, or paragraph of the document structure data (S) (step 25, node present), it acquires the text of the learning procedure manual pointed to by that node (step 26).
  • the learning data generation unit 6 combines the text of the learning procedure manual acquired in step 26 into the learning text (T) (step 27).
  • the learning data generation unit 6 acquires all nodes for specific chapters, sections, and paragraphs in the document structure data (S), and if it determines that there are no nodes (step 25, no nodes), it outputs the combined learning text to the deep learning natural language processing unit 7 (step 28) and ends the process.
  • the deep learning type natural language processing unit 7 includes a pre-learning processing module (pre-learning processing unit 8) and an inference module (evaluation target text evaluation unit 9).
  • the pre-learning processing module performs deep learning using the learning text and generates a trained evaluation model for evaluating the text.
  • the inference module uses the trained evaluation model generated by the pre-learning processing module to evaluate the evaluation target text.
  • document structure data is generated from the table of contents data of the learning procedure manual, pre-learning is performed using learning text generated from the document structure data and the context conditions described in the summaries linked to the chapters, sections, and paragraphs included in the table of contents data, a trained evaluation model is generated, and the text of the learning procedure manual is evaluated; by learning sentences whose contexts are in a parallel relationship as parallel rather than as continuing (serial) text, the context can be grasped accurately.
  • the program according to this embodiment may be transferred in a state where it is stored in an electronic device, or in a state where it is not stored in an electronic device. In the latter case, the program may be transferred via a network, or in a state where it is stored in a storage medium.
  • the storage medium is a non-transitory tangible medium.
  • the storage medium is a computer-readable medium.
  • the storage medium may be in any form, such as a CD-ROM or memory card, as long as it is capable of storing a program and is computer-readable.
  • the present invention is not limited to the above-described embodiments, and can be modified in various ways during implementation without departing from the gist of the invention.
  • the embodiments may also be implemented in appropriate combination, in which case the combined effects can be obtained.
  • the above-described embodiments include various inventions, and various inventions can be extracted by combinations selected from the multiple constituent elements disclosed. For example, if the problem can be solved and an effect can be obtained even if some constituent elements are deleted from all the constituent elements shown in the embodiments, the configuration from which these constituent elements are deleted can be extracted as an invention.
  • Reference Signs List: 1: Natural language processing device; 2: Learning procedure manual data storage unit; 3: Learning procedure manual reading unit; 4: Procedure manual analysis unit; 5: Document structure data generation unit; 6: Learning data generation unit; 7: Deep learning type natural language processing unit; 8: Pre-learning processing unit; 9: Evaluation target text evaluation unit; 10: Evaluation result output unit; 11: Evaluation target text input unit

Abstract

A natural language processing device according to an embodiment comprises: a document structure data generation unit (5) which generates document structure data that represents a tree structure corresponding to table-of-contents data by acquiring the table-of-contents data about a training procedure document; a training data generation unit (6) which refers to nodes connected sequentially from a root node side of the document structure data through edges, and combines, to a training text, a text of the training procedure document indicated by the nodes when the nodes have been acquired; a pre-training processing unit (8) which constructs a trained evaluation model through deep learning that has used the training text; a text-to-be-evaluated input unit (11) which inputs text to be evaluated; and a text-to-be-evaluated evaluation unit (9) which uses the trained evaluation model to evaluate the input text to be evaluated.

Description

Natural language processing device, natural language processing method, and computer program
The present invention relates to a natural language processing device, a natural language processing method, and a computer program.
Manuals and procedure documents that summarize equipment operation, work flows, and how tasks proceed often contain not only passages written in series but also passages written in parallel. When users refer to such manuals or procedure documents, they have had to keep the document structure (table of contents) in mind as they work.
For example, if such manuals could be machine-learned and their context correctly evaluated according to the document structure, users would be spared the trouble of reading the manuals closely, and their equipment operation and work could be supported.
In conventional natural language processing learning methods, when text such as an operation manual is learned, the context is grasped by learning the relationships between adjacent sentences from the beginning toward the end.
For example, Non-Patent Document 1 describes a domain-specific natural language processing and construction framework developed based on a pre-training method for natural language processing and a general-purpose language model.
However, even when the context of text such as an operation manual is a parallel relationship, the context has been learned as a serial relationship, and the context of sentences in a parallel relationship has been grasped incorrectly.
The present invention was made in consideration of the above circumstances, and aims to provide a natural language processing device, a natural language processing method, and a computer program that accurately grasp context.
The natural language processing device according to the first aspect of the present invention includes a document structure data generation unit that acquires table of contents data of a learning procedure manual and generates document structure data showing a tree structure corresponding to the table of contents data, a learning data generation unit that references nodes connected by edges in order from the root node side of the document structure data and, when the node is acquired, combines the text of the learning procedure manual pointed to by the node with learning text, a pre-learning processing unit that constructs a trained evaluation model by deep learning using the learning text, an evaluation target text input unit that inputs evaluation target text, and an evaluation target text evaluation unit that evaluates the input evaluation target text using the trained evaluation model.
The natural language processing method according to the second aspect of the present invention includes: acquiring table of contents data of a learning procedure manual and generating document structure data showing a tree structure corresponding to the table of contents data; referencing nodes connected by edges in order from the root node side of the document structure data and, when the node is acquired, combining the text of the learning procedure manual pointed to by the node with a learning text; constructing a trained evaluation model by deep learning using the learning text; inputting a text to be evaluated; and evaluating the input text to be evaluated using the trained evaluation model.
A computer program according to a third aspect of the present invention causes a computer to execute the natural language processing method according to the second aspect.
According to the present invention, it is possible to provide a natural language processing device, a natural language processing method, and a computer program that accurately grasp context.
FIG. 1 is a block diagram illustrating an example of a configuration of a natural language processing apparatus according to an embodiment.
FIG. 2 is a diagram showing an example of document structure data generated by a document structure data generating unit of the natural language processing apparatus according to an embodiment.
FIG. 3 is a flowchart illustrating an example of a learning data generation process of the procedure manual analysis unit of the natural language processing apparatus according to an embodiment.
FIG. 4 is a diagram illustrating an example of a natural language processing method of the natural language processing apparatus according to an embodiment.
Below, a natural language processing device according to an embodiment of the present invention will be described with reference to the drawings. Note that in the following embodiments, parts with the same reference numbers perform similar operations, and redundant explanations will be omitted.
FIG. 1 is a block diagram illustrating an example of the configuration of a natural language processing apparatus 1 according to an embodiment.
The natural language processing device 1 of this embodiment is a device that learns using document structure data generated from table of contents data of a learning procedure manual and learning text generated for each of the document structure data, and includes at least one processor and a memory in which a program executed by the processor is recorded, and can realize various functions described below by software or a combination of software and hardware.
The natural language processing device 1 includes a learning procedure manual data storage unit 2, a learning procedure manual reading unit 3, a procedure manual analysis unit 4, a document structure data generation unit 5, a learning data generation unit 6, a deep learning type natural language processing unit 7, a pre-learning processing unit 8, an evaluation target text evaluation unit 9, an evaluation result output unit 10, and an evaluation target text input unit 11.
The learning procedure manual data storage unit 2 stores data on the learning procedure manual to be learned in advance. The learning procedure manual data storage unit 2 is, for example, a memory, and acquires and stores the learning procedure manual from the outside. The learning procedure manual data storage unit 2 outputs the stored learning procedure manual to the learning procedure manual reading unit 3 as necessary. The learning procedure manual is, for example, an operation manual, and has a table of contents including, for example, the titles and summaries of chapters, sections, and paragraphs.
The learning procedure manual reading unit 3 acquires learning procedure manual data from the learning procedure manual data storage unit 2. The learning procedure manual reading unit 3 transmits the acquired learning procedure manual data in response to a request from the procedure manual analysis unit 4. The learning procedure manual reading unit 3 may also periodically acquire learning procedure manual data from the learning procedure manual data storage unit 2 and transmit it to the procedure manual analysis unit 4.
Note that the learning procedure manual data storage unit 2 and the learning procedure manual reading unit 3 do not necessarily need to be included in the natural language processing device 1, and may be configured to be connected to the natural language processing device 1 from the outside.
FIG. 2 is a diagram showing an example of document structure data generated by the document structure data generating unit 5 of the natural language processing apparatus 1 according to an embodiment.
The procedure manual analysis unit 4 includes a document structure data generation unit 5 and a learning data generation unit 6 .
The document structure data generation unit 5 generates document structure data showing a tree structure corresponding to the table of contents data included in the learning procedure manual data acquired from the learning procedure manual reading unit 3. For example, in the document structure data shown in FIG. 2, the root node is "table of contents," and the nodes "Chapter 1," "Chapter 2," and "Chapter 3" branch off from the root node by edges (branches). From each of the nodes "Chapter 1," "Chapter 2," and "Chapter 3," nodes for the sections included in each chapter branch off by edges (branches). The table of contents data includes, for example, the text described in the chapters, sections, and paragraphs.
For example, when generating document structure data, the document structure data generation unit 5 divides the learning procedure manual according to the number of chapters. As an example, if there are three chapters, the document structure data generation unit 5 divides the learning procedure manual into three and generates the document structure data. The method of dividing the document structure data is not limited to the above; it may be divided, for example, by the number of sections or the number of paragraphs.
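As a purely illustrative sketch of this step (not the patent's implementation), the following Python code builds such tree-structured document structure data from a flat list of table-of-contents entries. The `Node` class, the `(level, title, summary, body)` tuple format, and the level convention (1 = chapter, 2 = section, 3 = paragraph) are assumptions introduced here for the example.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """One node of the document structure data (a chapter, section, or paragraph)."""
    title: str
    summary: str = ""          # summary text linked to this heading, if any
    body: str = ""             # text of the learning procedure manual this node points to
    level: int = 0             # 0 = root ("table of contents"), 1 = chapter, 2 = section, 3 = paragraph
    children: list["Node"] = field(default_factory=list)

def build_document_structure(toc_entries):
    """Build tree-structured document structure data from table-of-contents entries.

    `toc_entries` is assumed to be a list of (level, title, summary, body) tuples
    in reading order, e.g. (1, "Chapter 1", "...", "...").
    """
    root = Node(title="table of contents")
    stack = [root]                       # stack[-1] is the most recently seen node at each depth
    for level, title, summary, body in toc_entries:
        node = Node(title=title, summary=summary, body=body, level=level)
        while stack[-1].level >= level:  # pop until the parent of this node is on top
            stack.pop()
        stack[-1].children.append(node)  # attach the node as a branch (edge) of its parent
        stack.append(node)
    return root
```

For the three-chapter manual of FIG. 2, this yields a root node "table of contents" with three chapter nodes, each of which branches into the nodes of the sections it contains.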
The document structure data generation unit 5 obtains the titles of the chapters, sections, and paragraphs, and the summaries described in the chapters, sections, and paragraphs, from the learning procedure manual data. The document structure data generation unit 5 links the titles of the chapters, sections, and paragraphs of the document structure data with the corresponding summaries. Note that this embodiment assumes that context conditions are described in the summaries corresponding to the chapters, sections, and paragraphs of the learning procedure manual. For example, the summary corresponding to chapter n describes that if the OS is Windows (registered trademark), the processing described in chapter n, section 1 must be performed, and if the OS is Mac, the processing described in chapter n, section 2 must be performed. The document structure data generation unit 5 supplies the document structure data and the summary data corresponding to the chapters, sections, and paragraphs of the document structure data to the learning data generation unit 6.
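The embodiment only assumes that such a context condition is written in the summary; it does not specify how the condition is recognized. The snippet below is one deliberately simplified, hypothetical way to extract that information, keyed to the English phrasing of the Windows/Mac example above; the regular expression and the returned mapping are illustrative assumptions, not part of the disclosure.

```python
import re

# Hypothetical pattern for summaries phrased like the example above:
# "... if the OS is Windows ... section 1 ... if the OS is Mac ... section 2 ...".
CONDITION_PATTERN = re.compile(
    r"if the OS is (?P<condition>\w+).*?section (?P<section>\d+)",
    re.IGNORECASE | re.DOTALL,
)

def extract_context_conditions(summary: str) -> dict:
    """Map each condition named in a chapter summary to the section that applies under it."""
    return {m.group("condition"): int(m.group("section"))
            for m in CONDITION_PATTERN.finditer(summary)}

# extract_context_conditions("If the OS is Windows, perform chapter n, section 1; "
#                            "if the OS is Mac, perform chapter n, section 2.")
# -> {"Windows": 1, "Mac": 2}
```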
The learning data generation unit 6 combines the text of the learning procedure manual to generate learning text. The learning data generation unit 6 acquires the document structure data generated by the document structure data generation unit 5 and the summaries corresponding to the chapters, sections, and paragraphs of the document structure data. The learning data generation unit 6 sequentially references the nodes connected by edges (branches) from the root node side of the acquired document structure data.
When the learning data generating unit 6 acquires a node of a specific chapter, section, or paragraph of the document structure data (S), it combines the text of the learning procedure manual indicated by the node with the learning text (T).
For example, the learning data generation unit 6 acquires a first node included in the nth chapter of the document structure data, and combines the text of the learning procedure manual pointed to by the first node with the learning text. Next, the learning data generation unit 6 acquires a second node connected to an edge branched off from the first node, and combines the text of the learning procedure manual pointed to by the second node with the learning text.
Continuing this process, the learning data generation unit 6 acquires the nth node connected to the edge branching off from the (n-1)th node, and merges the text of the learning procedure manual pointed to by the nth node into the learning text. The learning data generation unit 6 then sequentially acquires the 1st node to the mth node of the (n+1)th chapter of the document structure data, and sequentially merges the text of the learning procedure manual pointed to by the 1st node to the mth node of the (n+1)th chapter into the learning text.
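In code, this reference order corresponds to a pre-order (depth-first) walk of the document structure data. The sketch below reuses the hypothetical `Node` class from the earlier snippet and shows one possible realization of the combining step; it is illustrative only.

```python
def build_learning_text(root):
    """Combine the manual text pointed to by each node into one learning text,
    referring to nodes edge by edge from the root node side (pre-order)."""
    parts = []
    stack = list(reversed(root.children))      # nodes branching directly off the root
    while stack:
        node = stack.pop()
        if node.body:                          # the node points at manual text
            parts.append(node.body)            # combine it into the learning text
        stack.extend(reversed(node.children))  # then follow the edges below this node
    return "\n".join(parts)
```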
The learning data generation unit 6 also acquires a node from the document structure data according to the context conditions described in the summaries corresponding to the chapters, sections, and paragraphs of the document structure data, and combines the text of the learning procedure manual indicated by that node with the learning text.
The learning data generation unit 6 acquires the context conditions described in the summary corresponding to the nth chapter of the document structure data (n). The context conditions are assumed to describe, for example, a first condition (e.g., the OS is iOS (registered trademark)) and a second condition (e.g., the OS is Android (registered trademark)).
When the first condition described in the context conditions is satisfied, the learning data generation unit 6 acquires a first condition node included in the nth chapter of the document structure data (n) and combines the text of the learning procedure manual indicated by the first condition node (e.g., an operation manual for iOS (registered trademark)) with the learning text (n). When the second condition described in the context conditions is satisfied, the learning data generation unit 6 acquires a second condition node included in the nth chapter of the document structure data (n) and combines the text of the learning procedure manual indicated by the second condition node (e.g., an operation manual for Android (registered trademark)) with the learning text (n).
This prevents texts of the learning procedure manual that describe processing for satisfying different context conditions from being combined in series.
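A hedged sketch of that branch selection follows, reusing the hypothetical `Node` and `extract_context_conditions` helpers above. The key point of the paragraph is that one learning text is built per context condition instead of joining every branch in series; the convention that the section number indexes the chapter's children is an assumption made only for this example.

```python
def build_learning_text_from(node):
    """Pre-order combination starting at an arbitrary node (including its own text)."""
    parts = [node.body] if node.body else []
    for child in node.children:
        child_text = build_learning_text_from(child)
        if child_text:
            parts.append(child_text)
    return "\n".join(parts)

def build_conditional_learning_texts(chapter):
    """Return one learning text per context condition found in the chapter's summary,
    so branches valid under different conditions are never concatenated in series."""
    conditions = extract_context_conditions(chapter.summary)
    if not conditions:
        return {"default": build_learning_text_from(chapter)}
    texts = {}
    for condition, section_no in conditions.items():
        branch = chapter.children[section_no - 1]   # assumed: nth listed section = nth child
        pieces = [t for t in (chapter.body, build_learning_text_from(branch)) if t]
        texts[condition] = "\n".join(pieces)
    return texts
```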
The learning data generation unit 6 references the nodes connected by edges in order from the root node side of the document structure data, and if it cannot acquire a node for a specific chapter, section, or paragraph, it determines that all of the text of the learning procedure manual has been combined into the learning text (T) and outputs the learning text (T) to the deep learning type natural language processing unit 7 (pre-learning processing unit 8).
The deep learning type natural language processing unit 7 performs deep learning in advance using the learning text and constructs a trained model. The deep learning type natural language processing unit 7 uses the constructed trained model to evaluate the context of the evaluation target text and outputs the evaluation result.
The deep learning type natural language processing unit 7 includes a pre-learning processing unit 8, an evaluation target text evaluation unit 9, and an evaluation result output unit 10.
The pre-learning processing unit 8 acquires the learning text output by the learning data generation unit 6 and, by deep learning using the learning text, constructs in advance a trained model (trained evaluation model) that converts an input evaluation target text into text organized in the same structure as the learning text and evaluates the context of the text in order along the document structure of the converted text. Here, the pre-learning processing unit may have a trained model that evaluates the context of input text data based on a known algorithm for evaluating the context of natural language, and may update that trained model by adding a trained model that converts the structure of the text data input to it. The pre-learning processing unit 8 may update the trained model every time the learning data generation unit 6 outputs a learning text, or may periodically update the trained model using multiple learning texts. Note that the deep learning method used in the pre-learning processing unit 8 is not limited, and various deep learning methods such as neural networks can be adopted. The pre-learning processing unit 8 supplies the trained model to the evaluation target text evaluation unit 9.
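The disclosure does not fix a particular deep learning method or library. Purely as one plausible instantiation, the sketch below continues pre-training a BERT-style masked language model on the combined learning texts using the Hugging Face transformers and datasets libraries; the base model, hyperparameters, and masked-language-model objective are assumptions made for this example, not part of the patent.

```python
from datasets import Dataset
from transformers import (AutoModelForMaskedLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer, TrainingArguments)

def pretrain_evaluation_model(learning_texts, base_model="bert-base-multilingual-cased"):
    """Continue pre-training a masked language model on the learning texts."""
    tokenizer = AutoTokenizer.from_pretrained(base_model)
    model = AutoModelForMaskedLM.from_pretrained(base_model)

    dataset = Dataset.from_dict({"text": learning_texts})
    dataset = dataset.map(
        lambda example: tokenizer(example["text"], truncation=True, max_length=512),
        remove_columns=["text"],
    )

    trainer = Trainer(
        model=model,
        args=TrainingArguments(output_dir="trained-evaluation-model",
                               num_train_epochs=1,
                               per_device_train_batch_size=8),
        train_dataset=dataset,
        data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer,
                                                      mlm_probability=0.15),
    )
    trainer.train()
    return tokenizer, model
```

Whether this function is called every time a new learning text arrives or periodically over a batch of learning texts only changes the update schedule described in the paragraph above.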
The evaluation target text input unit 11 inputs evaluation target text acquired from outside to the evaluation target text evaluation unit 9. The evaluation target text is, for example, text such as an operation manual.
The evaluation target text evaluation unit 9 evaluates the evaluation target text input from the evaluation target text input unit 11 based on the trained model supplied from the pre-learning processing unit 8.
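As one hedged example of what this evaluation could compute with the model sketched above, the snippet below scores an evaluation target text by its masked-token loss under the trained model (a lower loss meaning the text is more consistent with the learned manuals). The scoring criterion is an illustrative assumption; the patent leaves the concrete form of the evaluation result open.

```python
import torch
from transformers import DataCollatorForLanguageModeling

def evaluate_target_text(tokenizer, model, target_text):
    """Return a masked-token loss for the evaluation target text (lower is better).
    The score is stochastic because tokens are masked at random."""
    collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)
    encoding = tokenizer(target_text, truncation=True, max_length=512)
    batch = collator([encoding])      # randomly masks tokens and builds the labels
    with torch.no_grad():
        outputs = model(**batch)
    return outputs.loss.item()
```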
The evaluation result output unit 10 may include, for example, an image output device such as a monitor, and may be configured to output the evaluation results to the image output device. The evaluation result output unit 10 acquires the evaluation results output from the evaluation target text evaluation unit 9 and outputs them to a monitor or the like. Note that the evaluation result output unit 10 may be configured to output the evaluation results not only as images but also as audio or the like.
Below, an example of a procedure in which the learning data generation unit 6 of the natural language processing device 1 according to this embodiment combines the text of the learning procedure manual into the learning text using the document structure data generated from the table of contents data of the learning procedure manual will be described. Note that the contents of the processing in the following description of the operation are merely an example, and various processes that can achieve the same effect can be used as appropriate.
FIG. 3 is a flowchart illustrating an example of a learning data generation process of the procedure manual analysis unit of the natural language processing apparatus according to an embodiment.
FIG. 4 is a diagram for explaining an example of a natural language processing method of the natural language processing apparatus according to an embodiment.
The document structure data generation unit 5 of the procedure manual analysis unit 4 acquires the text of the learning procedure manual from the learning procedure manual reading unit 3 (step 21). The document structure data generation unit 5 acquires the table of contents data contained in the text of the learning procedure manual (step 22). The document structure data generation unit 5 generates document structure data (S) showing a tree structure corresponding to the table of contents data (step 23).
The learning data generation unit 6 of the procedure manual analysis unit 4 acquires the document structure data (S) and the summaries corresponding to the chapters, sections, and paragraphs of the document structure data (S), and generates learning text (T) for learning the text of the learning procedure manual (step 24).
The learning data generation unit 6 sequentially references the nodes connected by edges in order from the root node side of the document structure data (S) and determines whether a node for a specific chapter, section, or paragraph of the document structure data (S) has been acquired (step 25).
When the learning data generation unit 6 determines that it has acquired a node for a specific chapter, section, or paragraph of the document structure data (S) (step 25, node present), it acquires the text of the learning procedure manual pointed to by that node (step 26).
The learning data generation unit 6 combines the text of the learning procedure manual acquired in step 26 into the learning text (T) (step 27).
When the learning data generation unit 6 has acquired all the nodes for the specific chapters, sections, and paragraphs of the document structure data (S) and determines that there are no more nodes (step 25, no node), it outputs the combined learning text to the deep learning type natural language processing unit 7 (step 28) and ends the process.
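Putting the earlier sketches together, the loop of steps 21 to 28 can be expressed roughly as follows; every function name comes from the hypothetical snippets above, not from the patent.

```python
def generate_learning_texts(toc_entries):
    """Steps 21-28: from the learning procedure manual's table of contents to learning texts."""
    root = build_document_structure(toc_entries)   # steps 21-23: document structure data (S)
    learning_texts = []                            # step 24: learning text, one per branch
    for chapter in root.children:                  # step 25: refer to nodes from the root side
        # steps 26-27: combine the manual text each node points to, per context condition
        learning_texts.extend(build_conditional_learning_texts(chapter).values())
    return learning_texts                          # step 28: hand the result over for pre-training

# Usage sketch:
# texts = generate_learning_texts(toc_entries)
# tokenizer, model = pretrain_evaluation_model(texts)
# score = evaluate_target_text(tokenizer, model, "text of the manual to be evaluated ...")
```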
The deep learning type natural language processing unit 7 includes a pre-learning processing module (pre-learning processing unit 8) and an inference module (evaluation target text evaluation unit 9). The pre-learning processing module performs deep learning using the learning text and generates a trained evaluation model for evaluating text. The inference module uses the trained evaluation model generated by the pre-learning processing module to evaluate the evaluation target text.
According to the natural language processing device 1 of this embodiment, document structure data is generated from the table of contents data of the learning procedure manual, pre-learning is performed using learning text generated using the document structure data and the context conditions described in the summaries linked to the chapters, sections, and paragraphs included in the table of contents data, a trained evaluation model is generated, and the text of the learning procedure manual is evaluated. By learning sentences whose contexts are in a parallel relationship as parallel rather than as continuing (serial) text, the context can be grasped accurately.
The program according to this embodiment may be transferred in a state where it is stored in an electronic device, or in a state where it is not stored in an electronic device. In the latter case, the program may be transferred via a network, or in a state where it is stored in a storage medium. The storage medium is a non-transitory tangible medium. The storage medium is a computer-readable medium. The storage medium may be in any form, such as a CD-ROM or a memory card, as long as it is capable of storing a program and is computer-readable.
The present invention is not limited to the above-described embodiments, and can be modified in various ways during implementation without departing from the gist of the invention. The embodiments may also be implemented in appropriate combination, in which case the combined effects can be obtained. Furthermore, the above-described embodiments include various inventions, and various inventions can be extracted by combinations selected from the multiple constituent elements disclosed. For example, if the problem can be solved and an effect can be obtained even if some constituent elements are deleted from all the constituent elements shown in the embodiments, the configuration from which these constituent elements are deleted can be extracted as an invention.
Reference Signs List
1: Natural language processing device
2: Learning procedure manual data storage unit
3: Learning procedure manual reading unit
4: Procedure manual analysis unit
5: Document structure data generation unit
6: Learning data generation unit
7: Deep learning type natural language processing unit
8: Pre-learning processing unit
9: Evaluation target text evaluation unit
10: Evaluation result output unit
11: Evaluation target text input unit

Claims (5)

  1.  A natural language processing device comprising:
     a document structure data generation unit that acquires a learning procedure manual including table of contents data and generates document structure data showing a tree structure corresponding to the table of contents data;
     a learning data generation unit that references nodes connected by edges in order from a root node side of the document structure data and, when the node is acquired, combines the text of the learning procedure manual pointed to by the node with a learning text;
     a pre-learning processing unit that constructs a trained evaluation model by deep learning using the learning text;
     an evaluation target text input unit that inputs evaluation target text; and
     an evaluation target text evaluation unit that evaluates the input evaluation target text using the trained evaluation model.
  2.  The natural language processing device according to claim 1, wherein
     the learning data generation unit references the nodes connected by edges in order from the root node of the document structure data and, if the node is not acquired, outputs the learning text.
  3.  The natural language processing device according to claim 1, wherein
     the learning data generation unit acquires the node from the document structure data in accordance with a context condition described in a summary included in the document structure data.
  4.  A natural language processing method comprising:
     acquiring a learning procedure manual including table of contents data and generating document structure data showing a tree structure corresponding to the table of contents data;
     referencing nodes connected by edges in order from a root node side of the document structure data and, when the node is acquired, combining the text of the learning procedure manual pointed to by the node with a learning text;
     constructing a trained evaluation model by deep learning using the learning text;
     inputting a text to be evaluated; and
     evaluating the input text to be evaluated using the trained evaluation model.
  5.  A computer program causing a computer to execute the method according to claim 4.
PCT/JP2022/046889 2022-12-20 Natural language processing device, natural language processing method, and computer program WO2024134768A1 (en)

Publications (1)

Publication Number: WO2024134768A1 (en)
Publication Date: 2024-06-27

