WO2022113314A1 - Learning method, learning program, and learning device - Google Patents

Learning method, learning program, and learning device

Info

Publication number
WO2022113314A1
Authority
WO
WIPO (PCT)
Prior art keywords
sentence
input
utterance
evaluation
learning
Prior art date
Application number
PCT/JP2020/044396
Other languages
French (fr)
Japanese (ja)
Inventor
航 光田
竜一郎 東中
哲也 杵渕
Original Assignee
日本電信電話株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電信電話株式会社 filed Critical 日本電信電話株式会社
Priority to PCT/JP2020/044396 priority Critical patent/WO2022113314A1/en
Publication of WO2022113314A1 publication Critical patent/WO2022113314A1/en

Links

Images

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F 40/00 Handling natural language data
    • G06F 40/40 Processing or translation of natural language
    • G06F 40/55 Rule-based translation
    • G06F 40/56 Natural language generation

Definitions

  • the present invention relates to a learning method, a learning program and a learning device.
  • This method targets the problem of generating a supportive utterance for an input utterance (an utterance that states a specific reason for supporting the content of the input utterance).
  • Information on keywords and categories that frequently appear in input utterances and supportive utterances is used to acquire, as prior knowledge, similar information from external knowledge sources such as social networking services and online encyclopedias.
  • However, the above-described method of applying a deep-learning-based utterance generation model is a technique for generating supportive utterances, and it is difficult to apply it to the generation of counter-argument utterances.
  • For a supportive utterance, it is sufficient to generate a sentence whose content, in other words whose meaning, is similar to that of the input utterance; expressions such as keywords and categories that frequently appear in the input utterance and the supportive utterance (for example, information similar to "... is fun") may be acquired as prior knowledge. Such semantic similarity can be handled by a general deep-learning-based utterance generation model like the one used in the method described above.
  • Counter-argument utterances, by contrast, require generating sentences with a different meaning from the input utterance, more specifically sentences taking the opposite position, rather than sentences with similar content. Therefore, in order to handle counter-argument utterances, it is necessary to explicitly model whether the positions of the input and the output are reversed.
  • the present invention has been made in view of the above, and an object of the present invention is to improve the processing performance of discussion dialogue in a dialogue system.
  • The evaluation expression extraction step collects a plurality of sentences and extracts evaluation sentences that have the specified subject of interest and include an evaluation expression.
  • The pair creation step generates sentence pairs, each consisting of an input sentence and an output sentence, by combining the evaluation sentences based on the evaluation polarity of each evaluation sentence extracted in the evaluation expression extraction step.
  • In the learning step, the generation model is trained based on the sentence pairs generated in the pair creation step and the evaluation polarities of the input sentences and output sentences.
  • The utterance generation step acquires the input data generated in the input step and uses the generation model produced in the learning step to generate and output a response sentence, based on the utterance input sentence and the polarities of the utterance input sentence and the response sentence.
  • FIG. 1 is a block diagram of an utterance generator.
  • FIG. 2 is a diagram showing an outline of the flow of utterance generation.
  • FIG. 3 is a diagram showing a specific example of a sentence pair created by the pair creation unit.
  • FIG. 4 is a diagram showing an example of learning data.
  • FIG. 5 is a diagram showing an example of a counter-argument utterance generated by the utterance generator.
  • FIG. 6 is a flowchart of the learning process by the utterance generator.
  • FIG. 7 is a flowchart of the utterance generation process by the utterance generation device.
  • FIG. 8 is a diagram showing an example of a computer that executes a learning program.
  • FIG. 1 is a block diagram of an utterance generator. Further, FIG. 2 is a diagram showing an outline of the flow of utterance generation. The configuration of the utterance generation device 1 will be described with reference to FIGS. 1 and 2.
  • the utterance generation device 1 is a learning device such as a server.
  • The utterance generation device 1 is a device that generates and outputs a counter-argument utterance with respect to an input utterance. As shown in FIG. 1, the utterance generation device 1 includes an evaluation expression extraction unit 11, a pair creation unit 12, a learning unit 13, an input unit 14, an utterance generation unit 15, and an output unit 16.
  • The evaluation expression extraction unit 11 collects Web texts existing on the Web, as shown in step S1 of FIG. 2. Further, the evaluation expression extraction unit 11 receives the input of the subject of interest, which is the subject of the sentences to be evaluated.
  • the evaluation expression extraction unit 11 performs morphological analysis on the collected Web text.
  • The evaluation expression extraction unit 11 performs focus-word extraction, which extracts keywords representing the topic of the Web text, on the morphologically analyzed sentences. It also extracts proper nouns from the morphologically analyzed sentences, performs evaluation expression extraction, which extracts evaluation information such as likes, dislikes, and convenience, and performs modality extraction, which detects the presence or absence of negative expressions and the like.
  • The evaluation expression extraction unit 11 also uses the morphologically analyzed sentences to perform dialogue act estimation, which estimates, for example, whether each sentence is a question or a statement. Since the above evaluation expression extraction consists of general language processing tasks, there is no particular limitation on the language analyzer used to realize it.
  • The evaluation expression extraction unit 11 extracts sentences that have the subject of interest and include an evaluation expression after performing each of the above analyses. For example, when the subject of interest is "X", the evaluation expression extraction unit 11 extracts evaluation sentences that start with "X is" and include an evaluation of X.
  • The evaluation expression extraction unit 11 uses the evaluation information, the result of modality extraction, and the like to divide the extracted evaluation sentences into sentences with positive evaluation polarity and sentences with negative evaluation polarity with respect to the subject of interest.
  • The polarity of a sentence's evaluation is information indicating whether the sentence takes a supportive or an opposing position toward its subject; the former is called positive polarity and the latter negative polarity. In the following, the polarity of a sentence's evaluation is simply referred to as polarity.
  • The evaluation expression extraction unit 11 classifies the extracted evaluation sentences including the subject of interest into sentences with positive polarity and sentences with negative polarity toward the subject of interest, registers them, and generates an evaluation classification list.
  • the evaluation expression extraction unit 11 classifies sentences according to the polarity with respect to the subject of interest as described above.
  • The evaluation expression extraction unit 11 classifies the sentences into sentences with positive polarity starting with "X is" and sentences with negative polarity starting with "X is".
  • Sent(X, +)n denotes the n-th sentence starting with "X is", where + indicates that the sentence has positive polarity and n is a serial identification number assigned to each sentence.
  • Sent(X, -)n denotes a sentence with negative polarity; the other symbols have the same meaning as in Sent(X, +)n.
  • the evaluation expression extraction unit 11 can appropriately extract a sentence whose subject of discussion is the subject of interest by acquiring an evaluation sentence starting with the subject of interest as a language pattern.
  • the evaluation expression extraction unit 11 outputs the generated evaluation classification list to the pair creation unit 12.
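The extraction and classification described above can be sketched as follows. This is a minimal illustrative example, not the patented implementation: the tiny cue lexicons and the simple "X is" prefix match are assumptions standing in for the full morphological analysis, evaluation expression extraction, and modality analysis performed by the evaluation expression extraction unit 11.

```python
# Illustrative sketch of evaluation-sentence extraction and polarity
# classification. The cue lexicons below are assumptions for the demo.

POSITIVE_CUES = {"fun", "delicious", "good"}      # assumed evaluation expressions
NEGATIVE_CUES = {"boring", "unhealthy", "difficult"}

def classify_evaluation_sentences(sentences, topic):
    """Keep sentences starting with '<topic> is' that contain an evaluation
    expression, and split them by polarity into an evaluation classification list."""
    prefix = f"{topic} is"
    classified = {"+": [], "-": []}
    for sentence in sentences:
        if not sentence.startswith(prefix):
            continue  # the language pattern keeps only on-topic sentences
        words = set(sentence.lower().rstrip(".").split())
        if words & POSITIVE_CUES:
            classified["+"].append(sentence)
        elif words & NEGATIVE_CUES:
            classified["-"].append(sentence)
    return classified

result = classify_evaluation_sentences(
    ["Ramen is delicious.", "Ramen is unhealthy.", "I ate lunch."], "Ramen")
```

A real system would replace the lexicon lookup with the evaluation polarity determination described above.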
  • The pair creation unit 12 receives the evaluation classification list from the evaluation expression extraction unit 11. Using the acquired list, it combines evaluation sentences of the same polarity into supportive sentence pairs, with one sentence as the input-side sentence and the other as the output-side sentence. That is, the pair creation unit 12 pairs a positive-polarity evaluation sentence with another positive-polarity evaluation sentence, and likewise pairs a negative-polarity evaluation sentence with another negative-polarity evaluation sentence.
  • This sentence pair is a combination of mutually supportive sentences.
  • the statement on the input side may be simply referred to as the input side
  • the statement on the output side may be simply referred to as the output side.
  • For example, the pair creation unit 12 takes Sent(X, +)1 as the input side and Sent(X, +)2 as the output side to form a supportive sentence pair. Similarly, it takes Sent(X, -)1 as the input side and Sent(X, -)2 as the output side to form another supportive sentence pair.
  • The pair creation unit 12 also uses the acquired evaluation classification list to combine evaluation sentences of different polarities into counter-argument sentence pairs, with one sentence as the input side and the other as the output side. That is, the pair creation unit 12 combines a positive-polarity evaluation sentence on the input side with a negative-polarity evaluation sentence on the output side, and conversely combines a negative-polarity sentence on the input side with a positive-polarity sentence on the output side. These sentence pairs are combinations of mutually counter-arguing sentences.
  • For example, the pair creation unit 12 takes Sent(X, +)1 as the input side and Sent(X, -)1 as the output side to form a counter-argument sentence pair. Similarly, it takes Sent(X, +)2 as the input side and Sent(X, -)2 as the output side to form another counter-argument sentence pair.
  • FIG. 3 is a diagram showing a specific example of a sentence pair created by the pair creation unit.
  • FIG. 3 shows examples of counter-argument sentence pairs when the subject of interest is "ramen".
  • In each pair, the polarity of the input-side data is positive, and the polarity of the output side is negative.
  • the pair creation unit 12 can generate each sentence pair shown in FIG. 3 when ramen is the subject of interest. As shown in FIG. 3, by using the polarity and the language pattern, the pair creation unit 12 can appropriately acquire a sentence pair that seems to be a counterargument.
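The pair creation logic can be sketched as follows; this is a minimal illustration under the assumption that the evaluation classification list is a dict keyed by polarity (the patent does not prescribe a data structure).

```python
# Sketch of the pair creation step: same-polarity sentences form supportive
# pairs, opposite-polarity sentences form counter-argument pairs.

from itertools import permutations

def create_sentence_pairs(classified):
    """Return (support_pairs, counter_pairs); each pair carries the input
    sentence, input-side polarity, output sentence, and output-side polarity."""
    support_pairs, counter_pairs = [], []
    for polarity in ("+", "-"):
        for a, b in permutations(classified[polarity], 2):
            support_pairs.append((a, polarity, b, polarity))
    for a in classified["+"]:
        for b in classified["-"]:
            counter_pairs.append((a, "+", b, "-"))   # positive in, negative out
            counter_pairs.append((b, "-", a, "+"))   # negative in, positive out
    return support_pairs, counter_pairs

classified = {"+": ["Ramen is delicious.", "Ramen is fun to eat."],
              "-": ["Ramen is unhealthy."]}
support_pairs, counter_pairs = create_sentence_pairs(classified)
```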
  • The learning unit 13 acquires the sentence pairs generated by the pair creation unit 12 and trains the utterance generation model using them. The learning unit 13 inserts the input-side polarity and the output-side polarity as tokens at the end of the input sentence, as a polarity guide indicating whether each sentence has positive or negative polarity, and then performs input/output learning using the input sentences into which these tokens have been inserted. FIG. 4 shows an example of the learning data.
  • FIG. 4 shows a pair of counter-argument sentences in which the subject of interest is ramen, in which the sentence on the input side has a positive polarity and the sentence on the output side has a negative polarity.
  • the arrow in FIG. 4 represents the conversion from the input sentence to the output sentence.
  • the positive and negative signs before the arrow indicate the polarity on the input side and the polarity on the output side.
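The polarity-guide formatting of the learning data can be sketched as follows. The token spelling (`<+>`, `<->`) and its placement at the end of the input sentence are illustrative assumptions consistent with the description above; the patent does not fix a token format.

```python
# Sketch of the polarity guide in the learning data: the input-side and
# output-side polarities are appended as tokens to the input sentence.

def to_training_example(input_sentence, input_polarity,
                        output_sentence, output_polarity):
    """Return a (source, target) pair for sequence-to-sequence training."""
    source = f"{input_sentence} <{input_polarity}> <{output_polarity}>"
    return source, output_sentence

# A counter-argument pair in the spirit of FIG. 4 (the sentences are made up):
src, tgt = to_training_example(
    "Ramen is delicious.", "+", "Ramen is unhealthy.", "-")
```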
  • The learning unit 13 trains the generation model using, for example, a method that uses BERT (Bidirectional Encoder Representations from Transformers), a pre-training method, for encoding and decoding. As a result, as shown in step S4 of FIG. 2, the learning unit 13 produces a trained generation model.
  • the method used for learning is not limited to BERT, and the learning unit 13 may use another utterance generation model learning algorithm.
  • In addition to the BERT-based training, the learning unit 13 performs fine-tuning using manually collected supportive utterance data and counter-argument utterance data to improve the accuracy of the generation model.
  • the learning unit 13 does not have to perform fine tuning, but it is possible to generate more accurate supportive utterances and counter-argument utterances by performing fine tuning.
  • the learning unit 13 outputs the trained generation model to the utterance generation unit 15.
  • the input unit 14 receives the input of the spoken sentence to be the target of the dialogue.
  • The input unit 14 may acquire a sentence generated by speech recognition of a person's spoken words, or the user may manually input the sentence corresponding to the utterance.
  • this input sentence is referred to as an utterance input sentence.
  • The input unit 14 receives an instruction specifying the stance of the response, that is, whether a supportive sentence or a counter-argument sentence is requested as the response to the utterance input sentence.
  • The input unit 14 may acquire this response-stance instruction in advance, or may receive it together with the utterance input sentence.
  • the input unit 14 determines the polarity of the utterance input sentence with respect to the subject. For example, the input unit 14 estimates the polarity of the utterance input sentence with respect to the subject by the same evaluation polarity determination process as the evaluation expression extraction unit 11. Further, the input unit 14 determines the polarity on the output side as the same polarity as the utterance input sentence if the response position is designated as support, and as the opposite polarity if the response position is specified as counterargument. Then, as shown in step S5 of FIG. 2, the input unit 14 inserts the polarity on the input side and the polarity on the output side as a token at the end of the utterance input sentence as a polarity guide to generate input data. Then, the input unit 14 outputs the input data in which the token is inserted at the end of the input sentence to the utterance generation unit 15 having the trained generation model.
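The stance-to-polarity logic of step S5 can be sketched as follows; the token spelling is an assumption, as before, and the stance labels are illustrative names.

```python
# Sketch of the input unit's step S5: decide the output-side polarity from the
# requested response stance, then append both polarities as tokens at the end
# of the utterance input sentence.

def build_input_data(utterance, input_polarity, stance):
    """'support' keeps the input-side polarity; 'counter' flips it."""
    if stance == "support":
        output_polarity = input_polarity
    else:  # counter-argument: take the opposite position
        output_polarity = "-" if input_polarity == "+" else "+"
    return f"{utterance} <{input_polarity}> <{output_polarity}>"
```

For example, a positive input with a counter-argument stance yields a negative output-side token.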
  • The utterance generation unit 15 acquires the trained generation model from the learning unit 13. After that, the utterance generation unit 15 receives from the input unit 14 the input data in which tokens are inserted at the end of the utterance input sentence. Then, as shown in step S6 of FIG. 2, the utterance generation unit 15 uses the trained generation model, guided by the tokens, to generate a supportive utterance or a counter-argument utterance for the utterance input sentence. If the response stance is support, the utterance generation unit 15 generates a supportive utterance for the utterance input sentence.
  • More specifically, the utterance generation unit 15 generates a sentence positive toward the subject if the utterance input sentence is positive toward the subject, and a sentence negative toward the subject if the utterance input sentence is negative toward the subject. If the response stance is counter-argument, the utterance generation unit 15 generates a counter-argument utterance for the utterance input sentence: a sentence negative toward the subject if the utterance input sentence is positive toward the subject, and a sentence positive toward the subject if the utterance input sentence is negative toward the subject. After that, the utterance generation unit 15 outputs the generated response sentence to the output unit 16.
  • The output unit 16 acquires a response sentence, which is a supportive utterance or a counter-argument utterance to the utterance input sentence, from the utterance generation unit 15. Then, the output unit 16 outputs the acquired response sentence, as shown in step S7 of FIG. 2.
  • FIG. 5 is a diagram showing an example of a counter-argument utterance generated by the utterance generator.
  • When a sentence stating that baseball is fun is input, the utterance generation device 1 generates and outputs a counter-argument stating that the rules are difficult. In this case, the utterance generation device 1 outputs a sentence negative toward baseball as a counter-utterance to a sentence positive toward baseball. Further, when a sentence stating that stocks are not profitable is input, the utterance generation device 1 generates and outputs a counter-argument stating that the economy is good. In this case, the utterance generation device 1 outputs a sentence positive toward stocks as a counter-utterance to a sentence negative toward stocks.
  • Likewise, when a sentence stating that golf is fun is input, the utterance generation device 1 generates and outputs a counter-argument stating that practice is required.
  • In this case, the utterance generation device 1 outputs a sentence negative toward golf as a counter-utterance to a sentence positive toward golf. Each of these counter-argument utterances can be said to be an appropriate counter-argument to its input sentence.
  • FIG. 6 is a flowchart of the learning process by the utterance generator.
  • the evaluation expression extraction unit 11 collects Web texts existing on the Web (step S11). Further, the evaluation expression extraction unit 11 receives the input of the subject of interest.
  • the evaluation expression extraction unit 11 extracts a sentence having a designated subject of interest and including an evaluation expression from the collected Web text (step S12).
  • the evaluation expression extraction unit 11 classifies the extracted sentences into positive sentences and negative sentences with respect to the subject of interest, and creates an evaluation classification list (step S13). After that, the evaluation expression extraction unit 11 outputs the generated evaluation classification list to the pair creation unit 12.
  • The pair creation unit 12 receives the evaluation classification list from the evaluation expression extraction unit 11. Next, the pair creation unit 12 creates supportive sentence pairs and counter-argument sentence pairs from the sentences registered in the evaluation classification list (step S14). After that, the pair creation unit 12 outputs the generated sentence pairs to the learning unit 13.
  • the learning unit 13 receives the input of the sentence pair from the pair creation unit 12. Next, the learning unit 13 inserts tokens representing the polarities of the input side and the output side of the sentence pair at the end of the input sentence (step S15).
  • The learning unit 13 trains the generation model using BERT on the sentence pairs in which tokens indicating the input-side and output-side polarities have been inserted at the end of the input sentences. Further, the learning unit 13 fine-tunes the trained generation model using manually collected supportive utterance data and counter-argument utterance data (step S16). After that, the learning unit 13 outputs the trained generation model to the utterance generation unit 15, and the utterance generation unit 15 acquires and holds it. This completes the learning process.
  • FIG. 7 is a flowchart of the utterance generation process by the utterance generation device.
  • the input unit 14 receives the input of the utterance input sentence to be the target of the dialogue (step S21). Further, the input unit 14 acquires an instruction of the position of the response to the utterance input sentence.
  • the input unit 14 performs an evaluation process for the subject of the utterance input sentence, and specifies the polarity on the input side. Further, the input unit 14 specifies the polarity on the output side according to the instruction from the position of the response (step S22).
  • the input unit 14 inserts tokens representing the polarities of the specified input side and output side at the end of the utterance input sentence to generate input data (step S23). Then, the input unit 14 outputs the input data in which the tokens representing the polarities of the input side and the output side are inserted at the end of the utterance input sentence to the utterance generation unit 15.
  • The utterance generation unit 15 receives the input data from the input unit 14. Then, using the trained generation model, the utterance generation unit 15 generates a response sentence, which is a supportive utterance or a counter-argument utterance to the utterance input sentence, according to the polarity of the output side (step S24). After that, the utterance generation unit 15 outputs the generated response sentence to the output unit 16.
  • the output unit 16 receives an input of a response sentence which is a supportive utterance or a counter-argument utterance to the utterance input sentence from the utterance generation unit 15. Then, the output unit 16 outputs the acquired response statement (step S25).
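The utterance generation flow of steps S21 to S25 can be sketched end to end as follows. The polarity detector and the generation model here are stubs (assumptions), standing in for the evaluation polarity determination and the trained BERT-based generation model described above.

```python
# Toy end-to-end sketch of the utterance generation process (steps S21 to S25).

def detect_polarity(sentence):
    # Stub for the input unit's evaluation polarity determination (assumption).
    return "+" if "fun" in sentence else "-"

def generate(input_data):
    # Stub standing in for the trained generation model (assumption).
    canned = {"Baseball is fun. <+> <->": "The rules are difficult."}
    return canned.get(input_data, "I see.")

utterance = "Baseball is fun."          # step S21: utterance input sentence
stance = "counter"                      # requested response stance
input_polarity = detect_polarity(utterance)                         # step S22
output_polarity = input_polarity if stance == "support" else (
    "-" if input_polarity == "+" else "+")
input_data = f"{utterance} <{input_polarity}> <{output_polarity}>"  # step S23
response = generate(input_data)                                     # step S24
```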
  • As described above, the utterance generation device 1 extracts sentences that have the subject of interest and include an evaluation expression from the Web, and generates supportive sentence pairs and counter-argument sentence pairs depending on whether the evaluation polarities are the same. The utterance generation device 1 then trains the generation model using the polarity-labeled sentence pairs. Further, upon receiving an utterance input sentence to be the target of dialogue, the utterance generation device 1 determines the polarity of the utterance input sentence as well as the polarity of the output side.
  • After that, the utterance generation device 1 generates and outputs a supportive utterance or a counter-argument utterance as the response sentence to the utterance input sentence, using the input-side and output-side polarities together with the utterance input sentence.
  • In this way, by training with polarity-labeled supportive and counter-argument sentence pairs, the utterance generation device 1 can build a generation model capable of robustly generating supportive utterances and counter-argument utterances for any proposition. In addition, by giving the input-side and output-side stances as input, the device can prevent supportive and counter-argument responses from being swapped in the output, enabling an appropriate dialogue. The processing performance of discussion dialogue in a dialogue system can thus be improved, and a smooth discussion dialogue can be realized.
  • Each component of each illustrated device is a functional concept and does not necessarily have to be physically configured as shown in the figures. That is, the specific forms of distribution and integration of the devices are not limited to those illustrated; all or part of them can be functionally or physically distributed or integrated in arbitrary units according to various loads and usage conditions. Further, each processing function performed by each device can be realized, in whole or in part, by a CPU (Central Processing Unit) and a program analyzed and executed by the CPU, or as hardware by wired logic.
  • the utterance generator 1 can be implemented by installing a learning program that executes the above information processing as package software or online software on a desired computer.
  • the information processing device can function as the utterance generation device 1.
  • the information processing device referred to here includes a desktop type or notebook type personal computer.
  • Information processing devices also include mobile communication terminals such as smartphones, mobile phones, and PHS (Personal Handy-phone System) devices, as well as slate terminals such as PDAs (Personal Digital Assistants).
  • the utterance generation device 1 can be implemented as a management server device in which the terminal device used by the user is a client and the service related to the above management process is provided to the client.
  • the management server device is implemented as a server device that receives a config input request as an input and provides a management service for inputting a config.
  • the management server device may be implemented as a Web server, or may be implemented as a cloud that provides services related to the above management processing by outsourcing.
  • FIG. 8 is a diagram showing an example of a computer that executes a learning program.
  • the computer 1000 has, for example, a memory 1010 and a CPU 1020.
  • the computer 1000 also has a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. Each of these parts is connected by a bus 1080.
  • the memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM (Random Access Memory) 1012.
  • The ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input/Output System).
  • the hard disk drive interface 1030 is connected to the hard disk drive 1090.
  • the disk drive interface 1040 is connected to the disk drive 1100.
  • a removable storage medium such as a magnetic disk or an optical disk is inserted into the disk drive 1100.
  • the serial port interface 1050 is connected to, for example, a mouse 1110 and a keyboard 1120.
  • the video adapter 1060 is connected to, for example, the display 1130.
  • The hard disk drive 1090 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. That is, the learning program that defines each process of the utterance generation device 1 is implemented as a program module 1093 in which computer-executable code is described.
  • the program module 1093 is stored in, for example, the hard disk drive 1090.
  • the program module 1093 for executing the same processing as the functional configuration in the utterance generation device 1 is stored in the hard disk drive 1090.
  • the hard disk drive 1090 may be replaced by an SSD (Solid State Drive).
  • the setting data used in the processing of the above-described embodiment is stored as program data 1094 in, for example, a memory 1010 or a hard disk drive 1090. Then, the CPU 1020 reads the program module 1093 and the program data 1094 stored in the memory 1010 and the hard disk drive 1090 into the RAM 1012 as needed, and executes the process of the above-described embodiment.
  • the program module 1093 and the program data 1094 are not limited to those stored in the hard disk drive 1090, but may be stored in, for example, a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN (Local Area Network), WAN (Wide Area Network), etc.). Then, the program module 1093 and the program data 1094 may be read from another computer by the CPU 1020 via the network interface 1070.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Machine Translation (AREA)

Abstract

In the present invention, in an evaluation expression extraction step, a plurality of sentences are collected and evaluation sentences which have a specified subject of interest and include an evaluation expression are extracted. In a pair creation step, evaluation sentences are combined to generate sentence pairs, each comprising an input sentence and an output sentence, on the basis of the evaluation polarity of each of the evaluation sentences extracted in the evaluation expression extraction step. In a learning step, a generation model is trained on the basis of the sentence pairs generated in the pair creation step and the evaluation polarities of the input and output sentences. In an input step, an utterance input sentence representing an utterance composing a dialogue and information about the stance of the response sentence with respect to it are acquired, the evaluation polarities of the utterance input sentence and the response sentence are identified, and input data is generated. In an utterance generation step, the input data generated in the input step is acquired, and the generation model produced in the learning step is used to generate and output a response sentence on the basis of the utterance input sentence and the polarities of the utterance input sentence and the response sentence.

Description

学習方法、学習プログラム及び学習装置Learning methods, learning programs and learning devices
 本発明は、学習方法、学習プログラム及び学習装置に関する。 The present invention relates to a learning method, a learning program and a learning device.
In a dialogue system, humans interact with a computer to obtain various kinds of information and to have their requests fulfilled. There are also dialogue systems that engage not only in task-oriented exchanges but also in everyday conversation; through such systems, humans can gain emotional stability, satisfy their desire for approval, and build relationships of trust.
Meanwhile, research is also underway on realizing argumentation, rather than task completion or everyday conversation, by computer. Argumentation can change human value judgments and help organize thinking, and thus plays an important role for humans. For example, a technique has been proposed in which user utterances are mapped onto nodes of graph data whose nodes represent opinions, and a node connected to the mapped node is returned to the user as a system utterance, thereby conducting an argument. The graph data is created manually on the basis of a preset discussion theme such as "If you were to settle down permanently, the city is better than the countryside." Using manually created argument data makes it possible to argue about a specific topic.
In a dialogue system of this kind that argues using graph data, deep discussion is possible on a specific topic (a closed domain), but it is difficult to respond appropriately to user utterances that depart from the preset discussion theme. To solve this problem, one could consider preparing graph data for arguing about arbitrary topics in advance; however, since there are innumerable possible discussion themes, coverage would be low, and this approach is not realistic.
To address this problem, a method has been proposed that applies an utterance generation model using deep learning. This method targets the problem of generating a supporting utterance for an input utterance (an utterance that states a concrete reason for supporting the content of the input utterance). Concretely, using information on keywords and categories that frequently appear in input utterances and supporting utterances, information similar to those keywords and categories is acquired as knowledge for learning from external knowledge sources such as SNS posts and Internet encyclopedias.
However, the above method applying a deep-learning-based utterance generation model is a technique aimed at generating supporting utterances and is difficult to apply to the generation of counter-argument utterances. For a supporting utterance, it suffices to generate a sentence whose content is similar to the input utterance, in other words a sentence whose meaning is close to that of the input utterance; to do so, it suffices to acquire as prior knowledge information similar to expressions such as keywords and categories that frequently appear in input and supporting utterances (for example, "... is fun"). Semantic similarity can be handled by a general deep-learning-based utterance generation model such as the one used in the above method. A counter-argument utterance, however, must not be similar in content to the input utterance; it must be a sentence with a different meaning, more specifically a sentence taking the opposite stance. Therefore, in order to handle counter-argument utterances, it is necessary to deal explicitly with whether the stances of the input and the output are opposed.
In an experiment using manually collected data, a counter-argument utterance generator built on a general utterance generation model sometimes erroneously generated supporting utterances. For example, as a counterargument to "the sea is fun", the output "the fish is delicious" was produced. Examining the manually collected data in light of such errors revealed counterargument patterns that, in response to "X is good", refer to a part A of X and state "A is bad", or that compare X with another entity Y and state "Y is better".
In a general utterance generation model, expressions taking opposite stances are treated internally as similar information, so the opposite stance cannot always be output appropriately. If an unlimited amount of training data could be collected, all such correspondences might be learned, but the topics dealt with in arguments are so diverse that this is not realistic. Therefore, in order to handle counter-argument utterances, it is preferable to use a method that identifies sentences with opposed stances accurately enough for use in training, and that feeds information capturing those oppositions into the model during learning.
However, because sentences contained in Web text include much noise, it is difficult to estimate support and counterargument relations automatically. One possibility is to use a keyword-based estimation method typical of conventional knowledge acquisition techniques. In this method, for example, if a sentence begins with "therefore", the sentence and the immediately preceding sentence are presumed to be in a support relation, and if it begins with "but", they are presumed to be in a counterargument relation. With this method, however, it is not clear whether the number of sentence pairs that can be acquired is sufficient. Consequently, simply using a typical keyword-based estimation method makes it difficult to build a dialogue system that handles counter-argument utterances appropriately.
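The keyword-based estimation heuristic described above can be sketched as follows. This is an illustrative sketch only: the discourse-marker lists are assumptions, and the document does not prescribe any particular implementation.

```python
# Sketch of the conventional keyword-based relation estimation:
# guess the relation of a sentence to the immediately preceding
# sentence from its sentence-initial discourse marker.
# The marker lists below are illustrative, not from the source.
SUPPORT_MARKERS = ("したがって", "つまり", "なぜなら")   # "therefore", etc.
REBUTTAL_MARKERS = ("しかし", "だが", "とはいえ")        # "but", etc.

def estimate_relation(sentence: str) -> str:
    """Return 'support', 'rebuttal', or 'unknown' for the relation
    between `sentence` and the sentence immediately before it."""
    if sentence.startswith(SUPPORT_MARKERS):
        return "support"
    if sentence.startswith(REBUTTAL_MARKERS):
        return "rebuttal"
    return "unknown"
```

As the text notes, most sentences carry no such marker, so a pipeline built only on this heuristic yields few pairs.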
The present invention has been made in view of the above, and an object thereof is to improve the processing performance of argumentative dialogue in a dialogue system.
To solve the above problems and achieve the object, an evaluation expression extraction step collects a plurality of sentences and extracts evaluation sentences that have a specified subject of interest and include an evaluation expression. A pair creation step combines the evaluation sentences into sentence pairs, each consisting of an input sentence and an output sentence, on the basis of the evaluation polarity of each evaluation sentence extracted in the evaluation expression extraction step. A learning step trains a generation model on the basis of the sentence pairs generated in the pair creation step and the evaluation polarities of the input and output sentences. An input step acquires an utterance input sentence representing an utterance in a dialogue and information on the stance of the response sentence with respect to the utterance input sentence, identifies the evaluation polarities of the utterance input sentence and the response sentence, and generates input data. An utterance generation step acquires the input data generated in the input step and, using the generation model trained in the learning step, generates and outputs the response sentence on the basis of the utterance input sentence and the polarities of the utterance input sentence and the response sentence.
According to the present invention, the processing performance of argumentative dialogue in a dialogue system can be improved.
FIG. 1 is a block diagram of an utterance generation device.
FIG. 2 is a diagram showing an outline of the flow of utterance generation.
FIG. 3 is a diagram showing specific examples of sentence pairs created by the pair creation unit.
FIG. 4 is a diagram showing an example of training data.
FIG. 5 is a diagram showing examples of counter-argument utterances generated by the utterance generation device.
FIG. 6 is a flowchart of the learning process performed by the utterance generation device.
FIG. 7 is a flowchart of the utterance generation process performed by the utterance generation device.
FIG. 8 is a diagram showing an example of a computer that executes a learning program.
Hereinafter, an embodiment of the learning method, learning program, and learning device disclosed in the present application will be described in detail with reference to the drawings. Note that the learning method, learning program, and learning device disclosed in the present application are not limited by the following embodiment.
[Configuration of the utterance generation device]
FIG. 1 is a block diagram of the utterance generation device, and FIG. 2 is a diagram showing an outline of the flow of utterance generation. The configuration of the utterance generation device 1 will be described with reference to FIGS. 1 and 2.
The utterance generation device 1 is a learning device such as a server, and is a device that generates and outputs a counter-argument utterance in response to an input sentence. As shown in FIG. 1, the utterance generation device 1 includes an evaluation expression extraction unit 11, a pair creation unit 12, a learning unit 13, an input unit 14, an utterance generation unit 15, and an output unit 16.
As shown in step S1 of FIG. 2, the evaluation expression extraction unit 11 collects Web text existing on the Web. The evaluation expression extraction unit 11 also receives, as input, a subject of interest that is to serve as the subject of the sentences to be evaluated.
Next, the evaluation expression extraction unit 11 performs morphological analysis on the collected Web text. Using the morphologically analyzed sentences, it performs focus word extraction, which extracts keywords representing the topic of the Web text, and proper noun extraction. It also performs evaluation expression extraction, which extracts evaluative information such as "like", "dislike", and "convenient"; modality extraction, which detects, for example, the presence or absence of negation; and dialogue act estimation, which estimates what kind of act each sentence performs in the dialogue, such as whether it is a question or an elaboration. Since the above evaluation expression extraction is a general language processing task, there is no particular restriction on the language analyzers used to realize it.
After performing each of the above analyses, the evaluation expression extraction unit 11 extracts sentences that have the subject of interest and include an evaluation expression. For example, when the subject of interest is "X", the evaluation expression extraction unit 11 extracts evaluation sentences that begin with "X is" and contain an evaluation of X.
Next, using the evaluation information, the results of modality extraction, and the like, the evaluation expression extraction unit 11 divides the extracted evaluation sentences into sentences whose evaluation polarity with respect to the subject of interest is positive and sentences whose polarity is negative. The evaluation polarity of a sentence is information indicating whether the sentence takes a supporting stance or an opposing stance toward its subject; a supporting stance is called positive polarity and an opposing stance is called negative polarity. Hereinafter, the evaluation polarity of a sentence is simply called its polarity. The evaluation expression extraction unit 11 then generates an evaluation classification list in which the extracted evaluation sentences containing the subject of interest are registered, classified into sentences with positive polarity and sentences with negative polarity toward the subject of interest.
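The extraction and polarity classification above can be sketched minimally as follows. The tiny polarity lexicons stand in for the evaluation expression and modality analyzers described in the text and are purely illustrative assumptions.

```python
# Minimal sketch of steps S1-S2: keep only sentences that begin with
# the subject of interest ("X is" / "Xは") and contain an evaluative
# expression, then split them into positive and negative polarity lists.
# The toy lexicons below are assumptions; a real system would use the
# evaluation expression extraction and modality extraction described above.
POSITIVE = ("楽しい", "美味しい", "便利")   # "fun", "delicious", "convenient"
NEGATIVE = ("難しい", "高い", "不便")       # "difficult", "expensive", "inconvenient"

def build_polarity_lists(sentences, subject):
    """Return (positive_sentences, negative_sentences) for `subject`."""
    prefix = subject + "は"          # language pattern: sentence starts with "X wa"
    pos, neg = [], []
    for s in sentences:
        if not s.startswith(prefix):
            continue                  # topic must be the subject of interest
        if any(w in s for w in POSITIVE):
            pos.append(s)
        elif any(w in s for w in NEGATIVE):
            neg.append(s)
    return pos, neg
```

The two returned lists correspond to the Sent(X,+) and Sent(X,-) groups of the evaluation classification list.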
Here, in order to identify sentences with opposed stances accurately enough for use in training and to feed information capturing those oppositions into the generation model, it is important to capture whether the output opinion is positive or negative toward the subject of interest. The evaluation expression extraction unit 11 of this embodiment therefore classifies sentences according to their polarity toward the subject of interest, as described above.
For example, as shown in step S2 of FIG. 2, the evaluation expression extraction unit 11 classifies the sentences into positive-polarity sentences beginning with "X is" and negative-polarity sentences beginning with "X is". In FIG. 2, Sent(X,+)_n denotes a sentence that begins with "X is", where + indicates that the sentence is positive and n is a serial identification number assigned to each sentence. In Sent(X,-)_n, the - indicates that the sentence is negative, and the other symbols are the same as for Sent(X,+)_n.
By acquiring, as a language pattern, evaluation sentences that begin with the subject of interest, the evaluation expression extraction unit 11 can appropriately extract sentences whose topic of discussion is the subject of interest. The evaluation expression extraction unit 11 outputs the generated evaluation classification list to the pair creation unit 12.
The pair creation unit 12 receives the evaluation classification list from the evaluation expression extraction unit 11. Using the acquired list, the pair creation unit 12 combines evaluation sentences of the same polarity into support sentence pairs, each pair consisting of an input-side sentence and an output-side sentence. That is, the pair creation unit 12 pairs a positive-polarity evaluation sentence with another positive-polarity evaluation sentence, and a negative-polarity evaluation sentence with another negative-polarity evaluation sentence. Such a pair is a combination of mutually supporting sentences. Hereinafter, the input-side sentence may be referred to simply as the input side, and the output-side sentence simply as the output side.
For example, as shown in step S3 of FIG. 2, the pair creation unit 12 makes a support sentence pair with Sent(X,+)_1 as the input side and Sent(X,+)_2 as the output side, and another support sentence pair with Sent(X,-)_1 as the input side and Sent(X,-)_2 as the output side.
The pair creation unit 12 also uses the acquired evaluation classification list to combine evaluation sentences of different polarities into counterargument sentence pairs, each consisting of an input side and an output side. That is, the pair creation unit 12 pairs a positive-polarity evaluation sentence on the input side with a negative-polarity evaluation sentence on the output side, and a negative-polarity evaluation sentence on the input side with a positive-polarity evaluation sentence on the output side. Such a pair is a combination of mutually counter-arguing sentences.
For example, as shown in step S3 of FIG. 2, the pair creation unit 12 makes a counterargument sentence pair with Sent(X,+)_1 as the input side and Sent(X,-)_1 as the output side, and another counterargument sentence pair with Sent(X,+)_2 as the input side and Sent(X,-)_2 as the output side.
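The pair creation step above can be sketched as follows. The (input, output, label) tuple format is an illustrative assumption; the point is simply that same-polarity combinations yield support pairs and opposite-polarity combinations yield counterargument pairs.

```python
from itertools import product

def make_pairs(pos, neg):
    """Combine polarity lists into (input, output, label) pairs:
    same polarity -> support pair, opposite polarity -> rebuttal pair."""
    support = [(a, b, "support") for a, b in product(pos, pos) if a != b]
    support += [(a, b, "support") for a, b in product(neg, neg) if a != b]
    rebuttal = [(a, b, "rebuttal") for a, b in product(pos, neg)]
    rebuttal += [(a, b, "rebuttal") for a, b in product(neg, pos)]
    return support + rebuttal
```

With the "ramen" lists from the example above, each positive sentence paired with each negative sentence produces a rebuttal training pair in both directions.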
FIG. 3 shows specific examples of sentence pairs created by the pair creation unit, namely counterargument sentence pairs for the subject of interest "ramen". In this case, the polarity of the input-side data is positive and the polarity of the output side is negative. With "ramen" as the subject of interest, the pair creation unit 12 can generate each of the sentence pairs shown in FIG. 3. As FIG. 3 shows, by exploiting polarity and language patterns, the pair creation unit 12 can appropriately acquire plausible counterargument sentence pairs.
The learning unit 13 acquires the sentence pairs generated by the pair creation unit 12 and uses them to train the utterance generation model. As a polarity guide indicating whether each sentence has positive or negative polarity, the learning unit 13 inserts the input-side polarity and the output-side polarity as tokens at the end of the input sentence, and performs input-output training using the input sentences with these polarity tokens inserted. FIG. 4 shows an example of the training data: a counterargument sentence pair with "ramen" as the subject of interest, in which the input-side sentence has positive polarity and the output-side sentence has negative polarity. The arrow in FIG. 4 represents the conversion from the input sentence to the output sentence, and the plus and minus signs before the arrow represent the input-side and output-side polarities.
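The polarity-guide encoding above can be sketched minimally as follows. The token spellings ("<pos>", "<neg>") are assumptions; the text only states that the two polarities are inserted as tokens at the end of the input sentence.

```python
# Sketch of the polarity-guide encoding used to build training inputs:
# append the input-side and output-side polarities to the input sentence
# as tokens. The "<pos>"/"<neg>" spellings are illustrative assumptions.
TOKEN = {"+": "<pos>", "-": "<neg>"}

def encode_input(sentence: str, in_pol: str, out_pol: str) -> str:
    """Append the input-side and output-side polarity tokens."""
    return f"{sentence} {TOKEN[in_pol]} {TOKEN[out_pol]}"
```

For a counterargument pair, a positive input sentence would thus be encoded with the token sequence positive-then-negative before being fed to the generation model.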
The learning unit 13 performs training using, for example, a generation model based on a method that uses BERT (Bidirectional Encoder Representations from Transformers), a pre-training technique, for encoding and decoding. As shown in step S4 of FIG. 2, the learning unit 13 thereby produces a trained generation model. However, the training method is not limited to BERT, and the learning unit 13 may use another utterance generation model learning algorithm.
Furthermore, in addition to training with BERT, the learning unit 13 according to this embodiment performs fine-tuning using manually collected data of supporting utterances and counter-argument utterances to raise the accuracy of the generation model. The learning unit 13 need not perform fine-tuning, but performing it enables more accurate generation of supporting and counter-argument utterances. The learning unit 13 outputs the trained generation model to the utterance generation unit 15.
The input unit 14 receives, as input, an uttered sentence to be the subject of the dialogue. This sentence may be obtained by speech recognition of words spoken by a person, or the user may enter the sentence corresponding to the utterance manually. Hereinafter, this input sentence is referred to as the utterance input sentence. The input unit 14 also receives an instruction on the stance of the response, that is, whether to output a supporting sentence or a counterargument sentence as the response to the utterance input sentence. The input unit 14 may acquire this stance instruction in advance, or may receive it together with the utterance input sentence.
The input unit 14 then determines the polarity of the utterance input sentence with respect to its subject, for example by the same evaluation polarity determination process as the evaluation expression extraction unit 11. Furthermore, the input unit 14 sets the output-side polarity to the same polarity as the utterance input sentence if the response stance is specified as support, and to the opposite polarity if it is specified as counterargument. As shown in step S5 of FIG. 2, the input unit 14 then inserts the input-side polarity and the output-side polarity as tokens at the end of the utterance input sentence as a polarity guide, thereby generating input data. The input unit 14 outputs the input data, with the tokens inserted at the end of the utterance input sentence, to the utterance generation unit 15, which has the trained generation model.
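The stance-to-polarity determination and input-data construction above can be sketched as follows, mirroring the training-time encoding. The token spellings are again illustrative assumptions.

```python
# Sketch of step S5: derive the output-side polarity from the requested
# stance (same polarity for "support", flipped for "rebuttal"), then
# append both polarities as guide tokens, as in the training encoding.
# Token spellings are assumptions, not specified by the source.
def build_inference_input(utterance: str, utterance_pol: str, stance: str) -> str:
    flip = {"+": "-", "-": "+"}
    out_pol = utterance_pol if stance == "support" else flip[utterance_pol]
    token = {"+": "<pos>", "-": "<neg>"}
    return f"{utterance} {token[utterance_pol]} {token[out_pol]}"
```

For example, a positive utterance with the stance "rebuttal" yields an input whose guide tokens request a negative output, which the generation model then realizes as a counter-argument utterance.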
The utterance generation unit 15 acquires the trained generation model from the learning unit 13. Thereafter, the utterance generation unit 15 receives from the input unit 14 the input data in which the tokens are inserted at the end of the utterance input sentence. As shown in step S6 of FIG. 2, the utterance generation unit 15 then uses the acquired model, in accordance with the tokens, to generate a supporting utterance or a counter-argument utterance for the input utterance input sentence. If the response stance is support, the utterance generation unit 15 generates a supporting utterance for the utterance input sentence; more specifically, if the utterance input sentence is positive toward its subject, it generates a sentence positive toward the subject, and if negative, a sentence negative toward the subject. If the response stance is counterargument, the utterance generation unit 15 generates a counter-argument utterance for the utterance input sentence; more specifically, if the utterance input sentence is positive toward its subject, it generates a sentence negative toward the subject, and if negative, a sentence positive toward the subject. The utterance generation unit 15 then outputs the generated response sentence to the utterance input sentence to the output unit 16.
The output unit 16 acquires from the utterance generation unit 15 the response sentence, which is a supporting utterance or a counter-argument utterance for the utterance input sentence, and outputs the acquired response sentence as shown in step S7 of FIG. 2.
FIG. 5 shows examples of counter-argument utterances generated by the utterance generation device. For example, when the sentence "baseball is fun" is input, the utterance generation device 1 generates and outputs the counterargument "the rules are difficult"; that is, it outputs a sentence negative toward baseball in response to a sentence positive toward baseball. When the sentence "stocks are not profitable" is input, the device generates and outputs the counterargument "the economy is good", a sentence positive toward stocks in response to a negative one. When the sentence "golf is fun" is input, the device generates and outputs the counterargument "it requires practice", a sentence negative toward golf in response to a positive one. Each of these counter-argument utterances can be regarded as an appropriate counterargument to the input sentence.
[Generation model learning process and utterance generation process]
Next, the flow of the learning process performed by the utterance generation device 1 will be described with reference to FIG. 6, which is a flowchart of the learning process.
The evaluation expression extraction unit 11 collects Web text existing on the Web (step S11). The evaluation expression extraction unit 11 also receives the input of the subject of interest.
Next, the evaluation expression extraction unit 11 extracts, from the collected Web text, sentences that have the specified subject of interest and include an evaluation expression (step S12).
Next, the evaluation expression extraction unit 11 classifies the extracted sentences into sentences positive toward the subject of interest and sentences negative toward it, and creates the evaluation classification list (step S13). The evaluation expression extraction unit 11 then outputs the generated evaluation classification list to the pair creation unit 12.
 The pair creation unit 12 receives the evaluation classification list from the evaluation expression extraction unit 11. Next, the pair creation unit 12 creates support sentence pairs and counter-argument sentence pairs from the sentences registered in the evaluation classification list (step S14). The pair creation unit 12 then outputs the generated sentence pairs to the learning unit 13.
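Step S14 can be sketched as follows. The pairing rule, in which sentences of the same polarity form a support pair and sentences of opposite polarity form a counter-argument pair, follows the description above; the function name and data layout are illustrative assumptions.

```python
# Minimal sketch of step S14: combine sentences from the evaluation
# classification list into support pairs (same polarity) and
# counter-argument pairs (opposite polarity).
from itertools import product

def make_pairs(evaluation_list):
    pos = evaluation_list["positive"]
    neg = evaluation_list["negative"]
    # Support pairs: input and output sides share the same polarity.
    support = [(a, b) for a, b in product(pos, pos) if a != b]
    support += [(a, b) for a, b in product(neg, neg) if a != b]
    # Counter-argument pairs: input and output sides have opposite polarities.
    counter = list(product(pos, neg)) + list(product(neg, pos))
    return support, counter

ev = {"positive": ["baseball is fun"], "negative": ["the rules are difficult"]}
support, counter = make_pairs(ev)
print(counter)  # pairs such as ('baseball is fun', 'the rules are difficult')
```

Because every cross-polarity combination is enumerated, each extracted sentence can serve as both an input sentence and an output sentence.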
 The learning unit 13 receives the sentence pairs from the pair creation unit 12. Next, the learning unit 13 inserts tokens representing the polarities of the input side and the output side of each sentence pair at the end of the input sentence (step S15).
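Step S15 can be sketched as below. The token spellings `<POS>` and `<NEG>` are assumptions made for this illustration; the embodiment only requires that the appended tokens encode the input-side and output-side polarities.

```python
# Minimal sketch of step S15: append tokens encoding the input-side and
# output-side polarities to the end of the input sentence of a pair.
def add_polarity_tokens(input_sentence, input_polarity, output_polarity):
    token = {"positive": "<POS>", "negative": "<NEG>"}
    return f"{input_sentence} {token[input_polarity]} {token[output_polarity]}"

# A counter-argument training example: positive input, negative output.
example = add_polarity_tokens("baseball is fun", "positive", "negative")
print(example)  # "baseball is fun <POS> <NEG>"
```

The generation model thus sees the desired stance of the output as part of its input, which is what later allows the stance to be controlled at generation time.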
 Next, the learning unit 13 trains the generation model with BERT on the sentence pairs whose input sentences have the input-side and output-side polarity tokens appended. Further, the learning unit 13 performs fine-tuning on the trained generation model using manually collected support-utterance and counter-argument-utterance data for more detailed learning (step S16). The learning unit 13 then outputs the trained generation model to the utterance generation unit 15. The utterance generation unit 15 acquires the trained generation model from the learning unit 13 and holds it. This completes the learning process.
 Next, the flow of the utterance generation process by the utterance generation device 1 will be described with reference to FIG. 7. FIG. 7 is a flowchart of the utterance generation process by the utterance generation device.
 The input unit 14 receives an utterance input sentence to be the target of the dialogue (step S21). The input unit 14 further acquires an instruction specifying the stance of the response to the utterance input sentence.
 Next, the input unit 14 estimates the evaluation of the subject in the utterance input sentence and identifies the input-side polarity. The input unit 14 further identifies the output-side polarity according to the instructed stance of the response (step S22).
 Next, the input unit 14 inserts tokens representing the identified input-side and output-side polarities at the end of the utterance input sentence to generate input data (step S23). The input unit 14 then outputs, to the utterance generation unit 15, the input data in which the tokens representing the input-side and output-side polarities have been inserted at the end of the utterance input sentence.
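Steps S22 and S23 can be sketched as follows. The polarity estimator here is a toy keyword check, the stance encoding (`"support"` or `"counter"`) and the token spellings are assumptions, and a real implementation would use a proper sentiment classifier.

```python
# Minimal sketch of steps S22-S23: estimate the input-side polarity of the
# utterance input sentence, derive the output-side polarity from the
# requested stance, and append both polarity tokens to build the input data.
NEGATIVE_WORDS = {"difficult", "boring", "not"}  # toy lexicon (assumption)

def build_input_data(utterance, stance):
    """stance is 'support' or 'counter' (an assumed encoding)."""
    words = set(utterance.lower().split())
    input_pol = "negative" if words & NEGATIVE_WORDS else "positive"
    if stance == "support":
        output_pol = input_pol          # a support utterance keeps the polarity
    else:
        output_pol = "negative" if input_pol == "positive" else "positive"
    token = {"positive": "<POS>", "negative": "<NEG>"}
    return f"{utterance} {token[input_pol]} {token[output_pol]}"

print(build_input_data("golf is fun", "counter"))  # "golf is fun <POS> <NEG>"
```

The resulting string has exactly the same shape as the training examples of step S15, so the generation model can consume it directly.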
 The utterance generation unit 15 receives the input data from the input unit 14. The utterance generation unit 15 then uses the trained generation model to generate a response sentence, that is, a support utterance or a counter-argument utterance to the utterance input sentence according to the output-side polarity (step S24). The utterance generation unit 15 then outputs the generated response sentence to the output unit 16.
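Step S24 can be illustrated with a stand-in for the trained generation model. `GenerationModel` below is a hypothetical lookup table, not the BERT-based model of the embodiment; it serves only to show the interface between the polarity-annotated input data and the generated response sentence.

```python
# Minimal sketch of step S24: feed the polarity-annotated input data to a
# generation model and receive a response sentence.  The class below is a
# toy stand-in for the trained model (an assumption for illustration).
class GenerationModel:
    def __init__(self, responses):
        self.responses = responses  # toy lookup standing in for learned weights

    def generate(self, input_data):
        # A real model would decode a new sentence conditioned on input_data.
        return self.responses.get(input_data, "")

model = GenerationModel({"golf is fun <POS> <NEG>": "practice is required"})
print(model.generate("golf is fun <POS> <NEG>"))  # "practice is required"
```

Because the stance tokens are part of the input, the same model can produce either a support utterance or a counter-argument utterance for the same utterance input sentence.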
 The output unit 16 receives from the utterance generation unit 15 the response sentence, that is, the support utterance or the counter-argument utterance to the utterance input sentence. The output unit 16 then outputs the acquired response sentence (step S25).
 [Effects of the Generative Model Learning Process and the Utterance Generation Process]
 As described above, the utterance generation device 1 extracts, from the Web, sentences that have the subject of interest and contain an evaluation expression, and generates support sentence pairs and counter-argument sentence pairs depending on whether the polarities of the evaluations are the same. The utterance generation device 1 then trains the generation model using the polarity-annotated sentence pairs. Further, upon receiving an utterance input sentence to be the target of the dialogue, the utterance generation device 1 determines the polarity of the utterance input sentence and also determines the output-side polarity. The utterance generation device 1 then uses the input-side and output-side polarities together with the utterance input sentence to generate and output a support utterance or a counter-argument utterance as a response sentence to the utterance input sentence.
 In this way, by learning with polarity-annotated support and counter-argument sentence pairs, the utterance generation device 1 can build a generation model that robustly generates support utterances and counter-argument utterances for an arbitrary proposition. Moreover, not only during learning but also at generation time, giving the stance of the input utterance and the stance of the output utterance as input suppresses confusion between support and counter-argument in the output, enabling an appropriate dialogue. The processing performance of argumentative dialogue in a dialogue system can thus be improved, and a smooth argumentative dialogue can be constructed.
 [System Configuration, etc.]
 Each component of each illustrated device is functionally conceptual and does not necessarily have to be physically configured as illustrated. That is, the specific form of distribution and integration of each device is not limited to that illustrated; all or part of each device can be functionally or physically distributed or integrated in arbitrary units according to various loads, usage conditions, and the like. Further, all or an arbitrary part of each processing function performed in each device may be realized by a CPU (Central Processing Unit) and a program analyzed and executed by the CPU, or may be realized as hardware by wired logic.
 Further, among the processes described in the present embodiment, all or part of the processes described as being performed automatically can also be performed manually, and all or part of the processes described as being performed manually can also be performed automatically by known methods. In addition, the processing procedures, control procedures, specific names, and information including various data and parameters shown in the above document and drawings can be changed arbitrarily unless otherwise specified.
 [Program]
 As one embodiment, the utterance generation device 1 can be implemented by installing, on a desired computer, a learning program that executes the above information processing as packaged software or online software. For example, by causing an information processing device to execute the above learning program, the information processing device can be made to function as the utterance generation device 1. The information processing device here includes desktop and notebook personal computers. In addition, mobile communication terminals such as smartphones, mobile phones, and PHS (Personal Handy-phone System) devices, as well as slate terminals such as PDAs (Personal Digital Assistants), also fall into this category.
 Further, the utterance generation device 1 can also be implemented as a management server device that provides services related to the above management processing to a client, the client being the terminal device used by the user. For example, the management server device is implemented as a server device that receives a configuration input request as input and provides a management service for performing the configuration input. In this case, the management server device may be implemented as a Web server, or may be implemented as a cloud that provides services related to the above management processing by outsourcing.
 FIG. 8 is a diagram showing an example of a computer that executes the learning program. The computer 1000 has, for example, a memory 1010 and a CPU 1020. The computer 1000 also has a hard disk drive interface 1030, a disk drive interface 1040, a serial port interface 1050, a video adapter 1060, and a network interface 1070. These parts are connected by a bus 1080.
 The memory 1010 includes a ROM (Read Only Memory) 1011 and a RAM (Random Access Memory) 1012. The ROM 1011 stores, for example, a boot program such as a BIOS (Basic Input Output System). The hard disk drive interface 1030 is connected to a hard disk drive 1090. The disk drive interface 1040 is connected to a disk drive 1100. A removable storage medium such as a magnetic disk or an optical disk, for example, is inserted into the disk drive 1100. The serial port interface 1050 is connected to, for example, a mouse 1110 and a keyboard 1120. The video adapter 1060 is connected to, for example, a display 1130.
 The hard disk drive 1090 stores, for example, an OS 1091, an application program 1092, a program module 1093, and program data 1094. That is, the learning program that defines each process of the utterance generation device 1 is implemented as the program module 1093, in which computer-executable code is described. The program module 1093 is stored in, for example, the hard disk drive 1090. For example, the program module 1093 for executing processing similar to the functional configuration of the utterance generation device 1 is stored in the hard disk drive 1090. The hard disk drive 1090 may be replaced by an SSD (Solid State Drive).
 The setting data used in the processing of the above-described embodiment is stored as the program data 1094 in, for example, the memory 1010 or the hard disk drive 1090. The CPU 1020 then reads the program module 1093 and the program data 1094 stored in the memory 1010 or the hard disk drive 1090 into the RAM 1012 as needed and executes the processing of the above-described embodiment.
 The program module 1093 and the program data 1094 are not limited to being stored in the hard disk drive 1090; for example, they may be stored in a removable storage medium and read by the CPU 1020 via the disk drive 1100 or the like. Alternatively, the program module 1093 and the program data 1094 may be stored in another computer connected via a network (LAN (Local Area Network), WAN (Wide Area Network), etc.) and read from the other computer by the CPU 1020 via the network interface 1070.
 1 Utterance generation device
 11 Evaluation expression extraction unit
 12 Pair creation unit
 13 Learning unit
 14 Input unit
 15 Utterance generation unit
 16 Output unit

Claims (7)

  1.  A learning method comprising:
     an evaluation expression extraction step of collecting a plurality of sentences and extracting evaluation sentences that have a designated subject of interest and contain an evaluation expression;
     a pair creation step of generating sentence pairs each consisting of an input sentence and an output sentence by combining the evaluation sentences on the basis of the polarity of the evaluation of each of the evaluation sentences extracted in the evaluation expression extraction step;
     a learning step of training a generation model on the basis of the sentence pairs generated in the pair creation step and the polarities of the evaluations of the input sentences and the output sentences;
     an input step of acquiring an utterance input sentence representing an utterance to be a target of a dialogue and information on a stance of a response sentence to the utterance input sentence, identifying the polarities of the evaluations of the utterance input sentence and the response sentence, and generating input data; and
     an utterance generation step of acquiring the input data generated in the input step and generating and outputting the response sentence using the generation model generated in the learning step, on the basis of the utterance input sentence and the polarities of the utterance input sentence and the response sentence.
  2.  The learning method according to claim 1, wherein the evaluation expression extraction step estimates the polarity of the evaluation of each of the evaluation sentences.
  3.  The learning method according to claim 1 or 2, wherein the pair creation step generates support sentence pairs combining the evaluation sentences whose evaluation polarities are the same and counter-argument sentence pairs combining the evaluation sentences whose evaluation polarities are opposite.
  4.  The learning method according to any one of claims 1 to 3, wherein the learning step performs learning by adding information representing the polarities of the evaluations of the input sentence and the output sentence to the input sentence.
  5.  The learning method according to any one of claims 1 to 3, wherein
     the input step generates the input data by adding information representing the polarities of the evaluations of the utterance input sentence and the response sentence to the utterance input sentence, and
     the utterance generation step generates the response sentence by using the generation model on the utterance input sentence to which the information representing the polarities of the evaluations of the utterance input sentence and the response sentence has been added.
  6.  A learning program for causing a computer to execute the method according to any one of claims 1 to 5.
  7.  A learning device comprising:
     an evaluation expression extraction unit that collects a plurality of sentences and extracts evaluation sentences that have a designated subject of interest and contain an evaluation expression;
     a pair creation unit that generates sentence pairs each consisting of an input sentence and an output sentence by combining the evaluation sentences on the basis of the polarity of the evaluation of each of the evaluation sentences extracted by the evaluation expression extraction unit;
     a learning unit that trains a generation model on the basis of the sentence pairs generated by the pair creation unit and the polarities of the evaluations of the input sentences and the output sentences;
     an input unit that acquires an utterance input sentence representing an utterance to be a target of a dialogue and information on a stance of a response sentence to the utterance input sentence, identifies the polarities of the evaluations of the utterance input sentence and the response sentence, and generates input data; and
     an utterance generation unit that receives the input data from the input unit and generates and outputs the response sentence using the generation model generated by the learning unit, on the basis of the utterance input sentence and the polarities of the utterance input sentence and the response sentence.
PCT/JP2020/044396 2020-11-27 2020-11-27 Learning method, learning program, and learning device WO2022113314A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/044396 WO2022113314A1 (en) 2020-11-27 2020-11-27 Learning method, learning program, and learning device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2020/044396 WO2022113314A1 (en) 2020-11-27 2020-11-27 Learning method, learning program, and learning device

Publications (1)

Publication Number Publication Date
WO2022113314A1 true WO2022113314A1 (en) 2022-06-02

Family

ID=81755450

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/044396 WO2022113314A1 (en) 2020-11-27 2020-11-27 Learning method, learning program, and learning device

Country Status (1)

Country Link
WO (1) WO2022113314A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017119060A1 (en) * 2016-01-05 2017-07-13 株式会社日立製作所 Information presentation system
JP2020106905A (en) * 2018-12-26 2020-07-09 日本電信電話株式会社 Speech sentence generation model learning device, speech sentence collection device, speech sentence generation model learning method, speech sentence collection method, and program


Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
FURUMAI, Kazuaki et al.: "Examination of an automatic generation method of supporting and opposing opinions using a language model in a debate system", Proceedings of the 2019 Spring Research Conference of the Acoustical Society of Japan (CD-ROM), 19 February 2019, pages 957-960 *

Similar Documents

Publication Publication Date Title
Yu et al. Strategy and policy learning for non-task-oriented conversational systems
US10936664B2 (en) Dialogue system and computer program therefor
CN110457432B (en) Interview scoring method, interview scoring device, interview scoring equipment and interview scoring storage medium
Qaffas Improvement of Chatbots semantics using wit. ai and word sequence kernel: Education Chatbot as a case study
CN111694940B (en) User report generation method and terminal equipment
US11636272B2 (en) Hybrid natural language understanding
CN110335608B (en) Voiceprint verification method, voiceprint verification device, voiceprint verification equipment and storage medium
Su et al. Deep learning for conversational AI
CN110019691A (en) Conversation message treating method and apparatus
CN112667791A (en) Latent event prediction method, device, equipment and storage medium
JP2022183122A (en) Computer-implemented method, computer system and computer program (cognitive learning for generating scripts that simulate live-agent actions in synchronous conferencing)
Guo et al. Fast sequence generation with multi-agent reinforcement learning
WO2022113314A1 (en) Learning method, learning program, and learning device
JP2016045769A (en) Dialog system evaluation method, dialog system evaluation device, and program
Wang et al. Emily: Developing An Emotion-affective Open-Domain Chatbot with Knowledge Graph-based Persona
WO2023129352A1 (en) Using token level context to generate ssml tags
JP7111017B2 (en) Paralinguistic information estimation model learning device, paralinguistic information estimation device, and program
Ohta et al. Response type selection for chat-like spoken dialog systems based on LSTM and multi-task learning
Dharmatti et al. Interview Practice-Voice-Based Chatbot
Takahashi et al. Natural Language Dialog System Considering Speaker’s Emotion Calculated from Acoustic Features
Abdrabuh AI-Synthesized Speech: Generation and Detection
WO2023119675A1 (en) Estimation method, estimation device, and estimation program
Mannekote et al. Agreement Tracking for Multi-Issue Negotiation Dialogues
Rahimi Linguistic Entrainment in Multi-Party Spoken Dialogues
WO2023234128A1 (en) Conversation management device, conversation management method, and conversation management system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20963578

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20963578

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP