WO2021161908A1 - Information processing device and information processing method - Google Patents

Information processing device and information processing method Download PDF

Info

Publication number
WO2021161908A1
WO2021161908A1 PCT/JP2021/004278 JP2021004278W WO2021161908A1 WO 2021161908 A1 WO2021161908 A1 WO 2021161908A1 JP 2021004278 W JP2021004278 W JP 2021004278W WO 2021161908 A1 WO2021161908 A1 WO 2021161908A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
language
information processing
semantic analysis
processing system
Prior art date
Application number
PCT/JP2021/004278
Other languages
French (fr)
Japanese (ja)
Inventor
淳也 小野
Original Assignee
ソニーグループ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by ソニーグループ株式会社 filed Critical ソニーグループ株式会社
Publication of WO2021161908A1 publication Critical patent/WO2021161908A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Definitions

  • This disclosure relates to an information processing device and an information processing method.
  • the translation result sentence is selected by comparing the result of the semantic analysis of the translation source language sentence with the result of the semantic analysis of the translated translation result sentence.
  • the translation result sentence is selected based on the similarity between the result of the translation source language and the result of the translation destination language.
  • the translation result sentence is selected based on the result of the translation source language with low accuracy, and it is difficult to perform appropriate processing.
  • there are results of semantic analysis in a plurality of languages there is also a problem that it is difficult to determine which of the results of the semantic analysis is suitable as the result of the semantic analysis. Therefore, it is desired to enable comparison of semantic analysis accuracy between languages.
  • the information processing apparatus of one form according to the present disclosure relates to one or more character information corresponding to each of one or more languages including a target language which is a language corresponding to a user's utterance. Based on the execution unit that executes the semantic analysis process and the result of the semantic analysis process corresponding to each of the one or more character information, the accuracy of the semantic analysis process is applied to each of the one or more character information. It is provided with a calculation unit for calculating an accuracy index value that enables comparison between a plurality of languages.
  • Embodiment 1-1 Outline of information processing according to the embodiment of the present disclosure 1-1-1. Outline, background, effects, etc. 1-1-2. Score function 1-1-2-1. Function generation example 1-1-3. Main flow of processing 1-2. Configuration of Information Processing System According to Embodiment 1-3. Configuration of Information Processing Device According to Embodiment 1-4. Configuration of the terminal device according to the embodiment 1-5. Response example 1-6. Information processing procedure according to the embodiment 1-6-1. Procedure for processing related to information processing equipment 1-6-2. Procedure for processing related to information processing system 1-6-3. Other Procedures for Processing Related to Information Processing Systems Part 1 1-6-4. Other procedure of processing related to information processing system Part 2 1-7.
  • FIG. 1 is a diagram showing an example of information processing according to the embodiment of the present disclosure.
  • the information processing according to the embodiment of the present disclosure is realized by the information processing system 1 (see FIG. 2) including the information processing device 100 (see FIG. 3) and the terminal device 10 (see FIG. 8).
  • FIG. 1 describes an outline of information processing realized by the information processing system 1.
  • FIG. 1 is a diagram showing an example of information processing according to the embodiment of the present disclosure.
  • the language (input language) corresponding to the user's utterance is described as "target language”.
  • the target language corresponds to a language accepted as input by the information processing system 1.
  • the language to be translated into the target language is described as "translation destination language”. That is, one language may be the target language, or the translation destination language for another language may be. For example, if one language is a language corresponding to the user's utterance, it is the target language, and if that one language is the translation destination language of another language, it is the translation destination language.
  • the target language and the translation destination language referred to here are names for distinguishing and expressing languages based on the relationship with other languages of each language in the processing described later.
  • the character information corresponding to the target language may be described as "input language character information”
  • the character information corresponding to the translation destination language may be described as "translated sentence” or "translated character information”.
  • a language capable of semantic analysis processing is described as a "specific language”.
  • a language other than a specific language that is, a language that cannot perform semantic analysis processing may be described as a "non-specific language”.
  • the information processing system 1 is capable of voice recognition (language identification) and translation processing for many languages, and can accept many languages as input languages (target languages). For example, as shown in FIG. 1, the information processing system 1 can speak many languages such as English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, and Korean. It can be accepted as an input language (target language). Note that these languages are only examples, and the information processing system 1 can accept not only the above languages but also a large number of languages as input languages (target languages).
  • each process shown in FIG. 1 may be performed by either the information processing device 100 or the terminal device 10 of the information processing system 1. Any device included in the information processing system 1 may perform the processing in which the information processing system 1 is described as the main body of the processing.
  • the information processing device 100 executes processing such as voice recognition, translation, and semantic analysis in response to the user's utterance detected by the terminal device 10 will be described as an example.
  • processing information processing
  • FIG. 1 a case where the information processing device 100 performs processing (information processing) such as voice recognition, translation, and semantic analysis will be described as an example, but even if the terminal device 10 performs these processing (information processing). good. This point will be described later.
  • FIG. 1 will be specifically described.
  • the user speaks In the example of FIG. 1, a case where the user speaks in Korean is shown. For example, the user utters "Tell me the weather in Tokyo tomorrow (Korean)" in Korean. In this way, when “... (Korean)" is described, it is the language in which the specification is described (for example, Japanese), but in reality, it is assumed that the utterance is in Korean or Korean characters. ..
  • the information processing system 1 accepts the user's utterance in Korean. For example, the information processing system 1 acquires voice information of utterances in Korean, which is an input language (target language).
  • the information processing system 1 executes the processing related to voice recognition as shown in the processing phase FS1.
  • the information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition. In the example of FIG. 1, the information processing system 1 determines that the user's utterance is Korean by the language identification process. The information processing system 1 uses character information (speech information) in Korean, which is the target language, as input language character information.
  • the information processing system 1 develops the utterance sentence (step S1).
  • the information processing system 1 develops an utterance sentence that generates character information (also referred to as "paraphrase") in which the input language character information of the input language "Korean” is paraphrased into another expression while keeping the input language "Korean”.
  • a paraphrase is character information in which a certain word (character string) in a certain character information is replaced with another word (character string), character information in which a new word (character string) is added to a certain character information, or a certain character. It may be character information generated by various conversion modes such as character information in which information is paraphrased into another expression.
  • the paraphrase may be any character information as long as it is character information converted into another expression while including the meaning of one character information.
  • the information processing system 1 generates paraphrases by appropriately using various techniques related to paraphrases.
  • the information processing system 1 may generate a paraphrase by using information indicating a paraphrase generation rule (paraphrase generation rule information).
  • the information processing device 100 may generate a paraphrase by using the paraphrase generation rule information stored in the storage unit 120 (see FIG. 3).
  • the paraphrase generation rule information may be list information such as a rule for converting a character string such as a flexion of a word ending or a rule for adding a specific character string (a word ending, an object, etc.) to a sentence.
  • the information processing system 1 may transmit the input language character information to the service providing device that provides the paraphrase of a certain character information, and acquire the paraphrase of the input language character information from the service providing device.
  • the information processing system 1 uses a plurality of paraphrases that paraphrase the input language character information (referred to as "first utterance development sentence") "Tell me the weather in Tokyo tomorrow (Korean)" by expanding the utterance sentence. Generate. For example, the information processing system 1 generates "Tell me the weather in Tokyo tomorrow (Korean)" as a paraphrase of input language character information (referred to as “second utterance development sentence”) by utterance sentence expansion. .. For example, the information processing system 1 generates "What is the weather in Tokyo tomorrow? (Korean)" as a paraphrase of input language character information (referred to as "third utterance development sentence”) by utterance sentence expansion. do.
  • first utterance development sentence paraphrase the input language character information
  • second utterance development sentence a paraphrase of input language character information
  • third utterance development sentence a paraphrase of input language character information
  • the information processing system 1 generates "I want to know the weather in Tokyo tomorrow (Korean)" as a paraphrase of input language character information (referred to as "fourth utterance development sentence") by expanding the utterance sentence. ..
  • the above is shown as an example of a paraphrase with a large change for explanation, and the information processing system 1 generates a paraphrase of input language character information by appropriately using various techniques.
  • the information processing system 1 utterably expands the input language character information "Tell me the weather in Tokyo tomorrow (Korean)" into N sentences by the utterance sentence expansion as described above. .. That is, the information processing system 1 generates N sentences including "Tell me the weather in Tokyo tomorrow (Korean)" which is input language character information by expanding the spoken sentences.
  • the information processing system 1 generates a pseudo utterance text list (hereinafter, also referred to as “utterance sentence list”) including input language character information and a paraphrase of the input language character information.
  • the information processing system 1 generates an utterance sentence list including N sentences.
  • the information processing system 1 generates an utterance sentence list including N sentences such as a first utterance development sentence, a second utterance development sentence, a third utterance development sentence, and a fourth utterance development sentence.
  • the information processing system 1 translates each sentence in the utterance sentence list (step S2).
  • the information processing system 1 converts each sentence in the utterance sentence list into a specific language.
  • the information processing system 1 converts each of the N Korean sentences in the utterance sentence list into a specific language.
  • M languages such as English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, and Korean are specific languages. Show the case. Note that these languages are only examples, and the information processing system 1 may use a language other than the above languages as a specific language as long as it can perform semantic analysis processing.
  • the information processing system 1 is a language (specific language) capable of performing semantic analysis processing for each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence.
  • the information processing system 1 uses English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, and Germany for each of the N sentences such as the first utterance development sentence to the fourth utterance development sentence.
  • Translate into M languages (specific languages) such as Chinese, Japanese, and Korean.
  • generating a sentence of a specific language from a sentence of an input language is also referred to as "language expansion".
  • language expansion In FIG.
  • the information processing system 1 since Korean, which is an input language (target language), is also included in the specific language, the information processing system 1 has the first utterance development sentence, the second utterance development sentence, and the third utterance development sentence for Korean. N sentences such as the expanded sentence and the fourth utterance expanded sentence are used as they are. Further, when the input language (target language) is not included in the specific language, the information processing system 1 does not use the sentence of the input language (speech expansion sentence) for the processing of semantic analysis or the like.
  • the information processing system 1 has M-1 languages other than Korean among M specific languages, such as a first utterance development sentence, a second utterance development sentence, a third utterance development sentence, and a fourth utterance development sentence.
  • M-1 languages other than Korean among M specific languages such as a first utterance development sentence, a second utterance development sentence, a third utterance development sentence, and a fourth utterance development sentence.
  • Translate each of the N sentences For example, the information processing system 1 translates each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence into English.
  • the information processing system 1 converts the first utterance development sentence "Tell me the weather in Tokyo tomorrow (Korean)" into a translated sentence "Tell me the weather in Tokyo tomorrow” corresponding to English.
  • the information processing system 1 converts the second utterance development sentence “Tell me the weather in Tokyo tomorrow (Korean)” into the translated sentence "Please tell me the weather in Tokyo tomorrow” corresponding to English.
  • the information processing system 1 converts the third utterance development sentence "What is the weather in Tokyo tomorrow? (Korean)" into a translation sentence "What is the weather in Tokyo tomorrow?" Corresponding to English. ..
  • the information information system 1 converts the fourth utterance development sentence "I want to know the weather in Tokyo tomorrow (Korean)" into a translation sentence "I want to know the weather of Tokyo tomorrow" corresponding to English. ..
  • the information processing system 1 translates N English sentences corresponding to each of the N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. Generate a statement.
  • the information processing system 1 translates N Chinese characters corresponding to each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. Generate a statement. Further, the information processing system 1 has N translations of N Vietnamese corresponding to each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. To generate. In addition, the information processing system 1 also applies to Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, etc. in the same manner as the first utterance development sentence, the second utterance development sentence, and the third utterance development sentence. , N translation sentences corresponding to each of N sentences such as the fourth utterance expansion sentence are generated for each of M languages.
  • the information processing system 1 By the above-mentioned processing, the information processing system 1 generates N ⁇ M sentences including N sentences in Korean and N translated sentences in each of M-1 languages.
  • the information processing system 1 generates a list (also referred to as a "translation list") including N sentences in Korean and N translations of each of M-1 languages.
  • the information processing system 1 generates a translation list containing N ⁇ M sentences, which are N sentences in each of the M languages. In this way, the information processing system 1 executes the processing of the utterance sentence expansion and the language expansion in the processing phase FS1.
  • the processing phase FS1 is a processing that depends on the input language.
  • the information processing system 1 executes the processing related to the semantic analysis as shown in the processing phase FS2.
  • the information processing system 1 performs an utterance semantic analysis process (also referred to as “semantic analysis process”) (step S3).
  • the information processing system 1 performs a semantic analysis process using each sentence included in the translation list (also referred to as a "semantic analysis target sentence").
  • the information processing system 1 generates information on the semantic frame as an analysis result by the semantic analysis process.
  • the information processing system 1 generates information of N ⁇ M semantic frames corresponding to each of N ⁇ M sentences.
  • the information processing system 1 may generate more than N ⁇ M semantic frames.
  • the information processing system 1 may generate a plurality of meaning frames for one sentence.
  • the information processing system 1 generates information on a semantic frame including a score (also referred to as a “semantic analysis score”) indicating the accuracy of the semantic analysis by the semantic analysis process.
  • the information processing system 1 uses a semantic analyzer corresponding to the language of the sentence to generate information on a semantic frame including a semantic analysis score indicating the accuracy of the semantic analysis of the sentence.
  • the information processing system 1 uses an English semantic analyzer to perform semantic analysis processing on N English sentences to indicate the accuracy of the semantic analysis of each of the N English sentences. Generates information on N semantic frames including analysis scores. For example, the information processing system 1 inputs an English translation "Tell me the weather in Tokyo tomorrow" (also referred to as a "first semantic analysis target sentence") corresponding to the first utterance development sentence into an English semantic analyzer. By doing so, the information of the semantic frame including the semantic analysis score is generated. The information processing system 1 identifies the Domain-Goal of the first semantic analysis target sentence as "Weather-Check".
  • the slot value (also referred to as "Value") of the Attribute "Date” corresponding to the Domain-Goal "Weather-Check” is "tomorrow", and the slot value of the Attribute “Place” is "Tokyo".
  • the domain-Goal is "Weather-Check”
  • the slot value of the Attribute “Date” is “tomorrow”
  • the slot value of the Attribute “Place” is "Tokyo”.
  • the information processing system 1 obtains a score "0.9” indicating the certainty of the semantic analysis output by the English semantic analyzer into which the first semantic analysis target sentence is input, and performs a semantic analysis of the first semantic analysis target sentence. Used as a score.
  • the information processing system 1 also uses an English semantic analyzer for the remaining N-1 translated sentences, and N-1 semantic frames including the semantic analysis scores of each of the N-1 sentences. Generate information about. As a result, the information processing system 1 generates information of N semantic frames including the semantic analysis score of each of the N sentences in English.
  • the information processing system 1 also uses the semantic analyzer of each language for each of the M-1 languages other than English to generate information of the semantic frame including the semantic analysis score.
  • the information processing system 1 uses a Korean semantic analyzer to generate information on N semantic frames including the semantic analysis score of each of the N sentences in Korean.
  • the information processing system 1 uses a Japanese semantic analyzer to generate information on N semantic frames including the semantic analysis score of each of the N sentences in Japanese.
  • the information processing system 1 also uses a semantic analyzer of each language for Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, etc., and N pieces of each language. Generates information about the semantic frame containing the semantic analysis score for each of the sentences in.
  • the information processing system 1 generates information of N ⁇ M semantic frames including the semantic analysis score of N ⁇ M semantic analysis target sentences.
  • the information processing system 1 does not have to generate the information of the semantic frame for all the N ⁇ M semantic analysis target sentences.
  • the information processing system 1 may perform the semantic analysis process except for the sentences having low translation quality among the N ⁇ M semantic analysis target sentences, and the details of this point will be described later.
  • the semantic analyzer function of semantic analysis processing of each language is created for each language, and the relationship between the languages is not considered. Therefore, the semantic analysis score generated by the semantic analyzer (function of semantic analysis processing) of each language indicates the certainty (accuracy) within that language, and the score considering the relationship with other languages No. Therefore, although the accuracy of such a semantic analysis score tends to be higher as the value is larger, when it is used as it is for the comparison of the accuracy of the semantic analysis between languages, it may not be an appropriate comparison. For example, the Japanese semantic analysis score "0.9" and the English semantic analysis score "0.9" do not always show the same accuracy.
  • the information processing system 1 performs a process of converting the semantic analysis score into an accuracy index value that enables comparison between a plurality of languages.
  • the information processing system 1 performs a process of converting the semantic analysis score into the semantic analysis accuracy (%), which is an accuracy index value that enables comparison between a plurality of languages (step S4).
  • the information processing system 1 calculates the semantic analysis accuracy (%) using the semantic analysis score.
  • the accuracy index value “semantic analysis accuracy (%)” is used is shown as an example, but the accuracy index value is "semantic analysis accuracy (semantic analysis accuracy (%)” if the semantic analysis accuracy between languages can be compared. %) ”, Any information may be used.
  • the unit is not limited to "%”, and may be various units or may have no unit.
  • the information processing system 1 calculates the semantic analysis accuracy by using a function as shown in the following equation (1) (hereinafter, also referred to as a “score function”). For example, the information processing apparatus 100 calculates the semantic analysis accuracy by using the score function stored in the storage unit 120. The details of the generation of the score function that outputs the accuracy index value that enables the accuracy of the semantic analysis process to be compared between a plurality of languages will be described later.
  • Acc indicates the accuracy of semantic analysis.
  • “score” in the formula (1) indicates a semantic analysis score. In “score”, the semantic analysis score to be converted into the semantic analysis accuracy is input. "Lang” in the formula (1) indicates a language. In “lang”, information indicating the language that is the target of the semantic analysis score is input. “F ()” in the equation (1) indicates a score function that outputs the semantic analysis accuracy by inputting the language specification and the semantic analysis score.
  • the information processing system 1 uses the equation (1) to convert the semantic analysis score so that it can be compared between a plurality of languages according to the semantic analysis score and the language designation (meaning). Analysis accuracy) is calculated.
  • the equation (1) is an example, and the information processing system 1 may perform a process of calculating the semantic analysis accuracy by using a function other than the equation (1).
  • the information processing system 1 calculates the semantic analysis accuracy by using a function for each language (score function for each language). In this case, the information processing system 1 calculates the semantic analysis accuracy by using the language-specific score function that outputs the semantic analysis accuracy in response to the input of the semantic analysis score. For example, the information processing apparatus 100 calculates the semantic analysis accuracy by using the language-specific score function stored in the storage unit 120. The information processing system 1 selects a language-specific score function corresponding to the language of the semantic analysis score from a plurality of language-specific score functions, and calculates the semantic analysis accuracy using the selected language-specific score function.
  • the information processing system 1 calculates the semantic analysis accuracy by using the English score function, which is a language-specific score function for English.
  • the semantic analysis accuracy is calculated by using the Japanese score function, which is a language-specific score function for Japanese.
  • the function of Eq. (1) may be a program including conditional branching by language.
  • the function of the equation (1) may include a conditional branch according to the comparison between the variable "lang" and each language.
  • the function of the equation (1) may be a function (program) that calculates the semantic analysis accuracy using the language-specific score function of the language in which the variable “lang” matches and outputs the result.
  • the function of the equation (1) may be a function (program) including a plurality of language-specific score functions corresponding to each language and a conditional branch of which language-specific score function is used.
  • the information processing system 1 uses the equation (1) to convert each of the semantic analysis scores of N ⁇ M semantic analysis target sentences into the semantic analysis accuracy.
  • the information processing system 1 calculates the semantic analysis accuracy of N ⁇ M semantic analysis target sentences by using the equation (1). For example, the information processing system 1 uses the semantic analysis score "0.9" of the first semantic analysis target sentence and the information indicating the language "English” of the first semantic analysis target sentence, and uses the information indicating the language "English” of the first semantic analysis target sentence. Calculate the semantic analysis accuracy of.
  • the information processing system 1 inputs the semantic analysis score "0.9" of the first semantic analysis target sentence and the information indicating the language "English” of the first semantic analysis target sentence into the equation (1).
  • 1 Semantic analysis Calculate the semantic analysis accuracy of the target sentence. In the example of FIG.
  • the information processing system 1 calculates the semantic analysis accuracy of the first semantic analysis target sentence as "96.6 (%)". Similarly, the information processing system 1 calculates the semantic analysis accuracy for N ⁇ M-1 sentences (semantic analysis target sentences) other than the first semantic analysis target sentence. As a result, the information processing system 1 calculates the semantic analysis accuracy of all N ⁇ M sentences (semantic analysis target sentences) in the translation list. In this way, the information processing system 1 executes the semantic analysis process on the sentences (N ⁇ M sentences) in the translation list in the processing phase FS2, and calculates the semantic analysis accuracy of the N ⁇ M sentences. .. As described above, the processing phase FS2 is an input language-independent process.
  • the information processing system 1 executes the processing related to the response generation as shown in the processing phase FS3.
  • the information processing system 1 selects a semantic frame (semantic analysis target sentence) to be used for response generation or the like prior to response generation.
  • the information processing system 1 selects a sentence (also referred to as “character information to be processed”) used for response generation or the like from the sentences in the translation list.
  • the information processing system 1 selects a meaning frame used for response generation or the like from N ⁇ M meaning frames.
  • the information processing system 1 selects a semantic frame (semantic analysis target sentence) having the maximum semantic analysis accuracy from N ⁇ M semantic frames.
  • a semantic frame semantic analysis target sentence
  • the information processing system 1 selects the first semantic analysis target sentence having a semantic analysis accuracy of "96.6 (%)" as the semantic frame (semantic analysis target sentence) used for response generation or the like.
  • the information processing system 1 has a semantic frame included in the semantic analysis result of the English translation "Tell me the weather in Tokyo tomorrow" whose semantic analysis accuracy is “96.6 (%)” (also referred to as “processed semantic frame”). Is selected as the information used for response generation, etc.
  • the information processing system 1 performs slot inverse transformation (step S5).
  • the information processing system 1 converts the slot value in the processing target meaning frame into the slot value of the input language (target language).
  • the information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language).
  • the information processing system 1 converts the slot value of English, which is the translation destination language, into the slot value of Korean, which is the input language.
  • the information processing system 1 converts the slot value of the Attribute “Date” from “tomorrow” to “tomorrow (Korean)” and converts the slot value from English to Korean.
  • the information processing system 1 converts the slot value of the Attribute “Place” from “Tokyo” to “Tokyo (Korean)” and converts the slot value from English to Korean.
  • the information processing system 1 determines the service to be started (step S6). For example, the information processing system 1 determines a service to be started from various services such as a calendar service SV1, a weather service SV2, an alarm service SV3, and a music service SV4. In FIG. 1, since the user is asking for the weather, the information processing system 1 decides to start the weather service SV2. Then, the information processing system 1 generates a response. For example, the information processing system 1 outputs information indicating the weather in Tokyo tomorrow in Korean. For example, the information processing system 1 outputs a response such as "Tomorrow's weather in Tokyo is sunny (Korean)" by voice in Korean or displays it in Korean. As described above, the processing phase FS3 is a processing that depends on the input language.
  • the information processing system 1 performs semantic analysis processing for each of a plurality of sentences in a plurality of languages generated by the processing of utterance sentence expansion and language expansion. Then, the information processing system 1 converts the semantic analysis score generated by the processing of the semantic analysis into an accuracy index value (semantic analysis accuracy) that enables comparison between a plurality of languages. As a result, the information processing system 1 can compare the accuracy of semantic analysis between languages.
  • the information processing system 1 accepts a user's utterance to a device such as a smart speaker, generates a table structure of meaning frames from the utterance content, classifies the domain goals of the utterance, and takes out slots. As a result, the information processing system 1 provides a mechanism for improving the accuracy of the semantic analysis process for extracting information necessary for executing an application or service.
  • the method executed by the information processing system 1 generates an utterance sentence list in which phrases and phrases are increased in variations by different expressions by expanding the utterance sentences for the utterance sentences after speech recognition. do. Further, the method executed by the information processing system 1 performs language conversion by translating the utterance sentence list into all specific languages that can be processed by the utterance meaning analyzer. As a result, the information processing system 1 can make a judgment from various expressions and languages, although normally only the semantic analysis result of only the input language can be obtained.
  • the information processing system 1 has sufficient analysis accuracy by translating into all specific languages that can be processed by the speech semantic analyzer. It enables analysis in language.
  • the information processing system 1 since the translation accuracy of the translator in language conversion is greatly affected, the information processing system 1 maintains the same meaning by expanding the utterance sentence, and does not affect the result of the meaning frame, so that the character string on the surface layer is not affected. Absorb by increasing the expression.
  • the information processing system 1 prepares a module for estimating the quality of translation accuracy because the semantic analysis results increase due to utterance sentence expansion and language expansion, and utterances with accuracy below a certain level may be rejected. Will be described later.
  • the information processing system 1 prepares in advance a corresponding function (score function) between the score value of the semantic analysis process and the percentage of the analysis accuracy for each language, so that the normal score value does not make sense for comparison between languages. It is possible to compare the semantic analysis results in the above from the viewpoint of analysis accuracy.
  • semantic analysis process In order to create a semantic analysis process (semantic analyzer, etc.), it is necessary to collect utterance sentences according to the domain goal, and from there, it is necessary to perform labeling to cut out as a slot. Therefore, in terms of understanding the target language, designing the standard for domain goals, and collecting and labeling the corpus, localizing it into one language is compared to machine translation, where bilingual collection is the main cost. The load is high, and multilingual development of semantic analysis generally requires time and labor costs. Localization is an unavoidable issue when expanding our business globally. In addition, if the utterance sentence collection is insufficient or if there is a language in which the semantic frame design is not sufficient, it is difficult to obtain sufficient semantic analysis results for that language.
  • the information processing system 1 has the following technical features.
  • the information processing system 1 improves or supports the accuracy of semantic analysis by using translation technology. Further, the information processing system 1 performs utterance development that converts the input utterance sentence into an expression having the same meaning, for example, an expression that is not affected by the output result of the meaning frame. Further, the information processing system 1 translates the increased utterance sentences due to the utterance development into a language that can be processed by the semantic analyzer.
  • the information processing system 1 passes the accuracy function from the score of the semantic analysis for each combination of language and expansion, and obtains the one with the maximum accuracy as the semantic analysis result. Further, in order to return a response in the input language, the information processing system 1 reversely converts the slot value of the semantic frame from the language (specific language) for which the semantic analysis corresponding to the semantic frame is performed to the input language.
  • the information processing system 1 outputs in a format that can be understood which language it is in three phases of input, analysis, and output when responding.
  • the information processing system 1 outputs in the form of an image (icon or the like), a voice (sound effect or the like), a text (language name / language code) or the like. The details of this point will be described later.
  • the information processing system 1 interrupts the processing when the translation accuracy and the semantic analysis accuracy are below a certain level, and presents the reason for the interruption. Executing an application or service with low accuracy often does not produce the results that the user expects. Therefore, the information processing system 1 tells the user whether the translation process was not successful or the translation was successful but the semantic analysis process was not successful, so that the user can adjust (control) the next time the input is performed. Allows you to.
  • FIG. 19 is a diagram showing an example of translation via another language.
  • FIG. 19 shows an example of a method via a specific language in multilingualization.
  • the minor language referred to here means a language in which it is difficult to collect a sufficient amount of data, for example, in the country to which the provider of the information processing system 1 belongs.
  • the major language means a language in which it is relatively easy to collect a sufficient amount of data, for example, in the country to which the provider of the information processing system 1 belongs.
  • the major language includes a language used in the country to which the provider of the information processing system 1 belongs.
  • a minor language may be a language in which the number of people (speakers) who use the language is relatively small, and a major language is a language in which the number of people (speakers) who use the language is relatively large. It may be.
  • Fig. 19 there is a method to convert to a major language once.
  • the major language is translated into the minor language B twice.
  • the first is that the cost from minor language A to major language and from major language to minor language B is lower than the cost required to collect and create bilingual sentences from minor language A to minor language B.
  • the major language is a major language, there is a need for multilingual support from minor language A to major language, and from major language to minor language B, and in many cases it has already been supported and can be translated. This is because there is a high possibility that the existing translator (translation processing) can be used as it is (it can be diverted).
  • the object in the example shown in FIG. 1 is that a multilingual translator is used to improve the accuracy of the semantic analyzer.
  • the translator and the semantic analyzer have different properties and purposes, and may not function well with a single bond.
  • translators tend to have a literary tone of written words, which aims to be widely used by translators. This is due to the fact that it is often learned based on a corpus that is not ambiguous and has a high degree of perfection as a sentence.
  • a style in which a person asks a machine such as a smart speaker or an AI (Artificial Intelligence) chatbot, or a style in which a person communicates with a machine
  • the expression of words such as a person-to-person conversation is naturally used. Has been done.
  • the first point is that, for example, the input information to be input is not the written language but the spoken language.
  • the second point is that, for example, an appropriate translation cannot be obtained for a language-specific expression.
  • recent speech translators may also incorporate (correspond to) spoken language (colloquial tone), and it is important to address the second point below.
  • the information processing system 1 expands the user's utterance into a plurality of expressions (plural utterance expansion sentences) by expanding the utterance sentence. Then, the information processing system 1 uses a translator of a plurality of languages to develop each of the plurality of utterance development sentences into a language. Then, the information processing system 1 calculates an accuracy index value for making it possible to compare the semantic analysis accuracy between a plurality of languages, and using the calculated accuracy index value, which language has the better semantic analysis accuracy. Can be compared. The information processing system 1 performs processing such as response generation using a language that can be processed accurately among each language.
  • the information processing system 1 selects a sentence that is expected to have good accuracy from a large number of candidates even if there is a gap between the accuracy of the translator and the accuracy of the semantic analyzer. By processing, response processing and the like can be performed based on accurate semantic analysis.
  • the score function of the equation (1) is a function that converts the analysis accuracy into a percentage from the score of the semantic analysis result (semantic analysis score) and the analysis language.
  • semantic analysis is learned and modeled based on the corpus collected for each language. That is, the learning is not performed assuming the inter-language at the time of learning. Therefore, the score value indicating the statistical superiority at the time of inference can be relatively used in the language of the learning model, but the score value of the language A and the language B, etc. beyond the language (semantic analysis). It doesn't make sense to compare scores). For example, regarding the score (semantic analysis score), when English is "0.4" and German is "0.5", there is no reason to select German with a high numerical value.
  • the information processing system 1 calculates an index value (semantic analysis accuracy) that enables accuracy comparison between languages using a score function as in Eq. (1).
  • FIGS. 16 to 18 are diagrams showing an example of analysis accuracy.
  • FIG. 16 is a diagram showing an example of Japanese analysis accuracy.
  • FIG. 17 is a diagram showing an example of English analysis accuracy.
  • FIG. 18 is a diagram showing an example of the relationship between the analysis accuracy and the score.
  • the information processing system 1 calculates the probability of correct answer (percentage of analysis accuracy) when collecting the results of a value equal to or higher than the score of a certain value from the evaluation data in the language, changes a certain value, and evaluates again. Calculate the probability of correct answer.
  • the information processing system 1 can create a score table for each language as shown in FIGS. 16 and 17.
  • the information processing system 1 calculates the probability of a correct answer (percentage of analysis accuracy) when collecting results of a value equal to or higher than a certain value for Japanese, changes a certain value, evaluates it again, and corrects the answer.
  • a score table as shown in FIG. 16 is generated.
  • the information processing system 1 calculates the probability of a correct answer (percentage of analysis accuracy) when collecting results of a value equal to or higher than a certain value for English, changes a certain value, evaluates it again, and determines the probability of the correct answer.
  • a score table as shown in FIG. 17 is generated.
  • the information processing system 1 generates a score table for each language (specific language) by performing the same processing for each language (specific language) such as Chinese and Spanish.
  • the information processing system 1 has evaluation data in which the text (character information) after voice recognition in Japanese and the result of semantic analysis of the character information are associated with each other.
  • the information processing system 1 provides evaluation data in which the text (character information) after voice recognition in Japanese and the result of its semantic analysis are associated with information (label) indicating whether or not the result is correct.
  • the information processing system 1 has a semantic analysis score indicating the certainty of semantic analysis for each text (character information) and a label indicating whether the specified domain goal or the like for each text (character information) is correct.
  • a Japanese score table is generated using the associated evaluation data.
  • the label corresponding to the result of the semantic analysis of each text (character information) may be set by the administrator of the information processing system 1 or the like, or may be set automatically. Further, the label may be assigned to the result of the semantic analysis of the corresponding text (character information) by the administrator of the information processing system 1, or may be automatically assigned.
  • the information processing system 1 analyzes the meaning of a text (character information) when the domain goal or slot, which is the result of the semantic analysis with a semantic analysis score of "0.4", is incorrect. Evaluation data in which the score "0.4" is associated with the label "0" indicating an incorrect answer is used. Further, in the information processing system 1, when the domain goal and the slot, which are the results of the semantic analysis in which the semantic analysis score of a certain text (character information) is "0.9", are correct, the meaning of the text (character information). Evaluation data in which the analysis score "0.9” is associated with the label "1” indicating the correct answer is used. The information processing system 1 generates a Japanese score table using the evaluation data in which the semantic analysis score of each text (character information) and the label are associated with each other.
  • the information processing system 1 generates the graph GR1 as shown in FIG. 18 based on the information showing the relationship between the semantic analysis score of each language and the correct answer rate (semantic analysis accuracy) as shown in FIGS. 16 and 17. ..
  • the information processing system 1 generates a score function (score function for each language) of each language based on the graph GR1 as shown in FIG.
  • black circles connected by solid lines show the results in Japanese
  • white circles connected by dotted lines show the results in English.
  • the horizontal axis indicates the “score threshold”, that is, the threshold of the semantic analysis score
  • the vertical axis indicates the “probability of correct answer (unit: percentage)”, that is, the semantic analysis accuracy.
  • the graph GR1 of FIG. 18 shows the ratio of the analysis results in which the estimated information (domain goal, etc.) is correct in the analysis result group whose semantic analysis score is equal to or higher than the score threshold value for each language. For example, for Japanese, it is shown that 60% of the analysis result group whose semantic analysis score is "0.6" or more is the analysis result in which the estimated information is correct.
  • the number of analysis results having a semantic analysis score of "0.6" or more is "10000", and the number of analysis results whose estimated information is correct is "6000".
  • the probability of correct answer (unit: percentage) that is, the semantic analysis accuracy is calculated as "60%”.
  • the horizontal axis has a low “score threshold”
  • the horizontal axis has a high “score threshold”
  • the value is stable and high.
  • the horizontal axis is where the "score threshold” exceeds "0.5" the increase is monotonous.
  • the information processing system 1 derives (generates) a fitting function in which the horizontal axis is the "score threshold" and the vertical axis is the "probability of correct answer (unit: percentage)". Then, the information processing system 1 uses the fitting function as the score function. The information processing system 1 generates a score function for each language by calculating a fitting function for each language.
  • the information processing system 1 generates the equation (1) by generating a function (program) using the score function for each language.
  • the information processing system 1 generates the score function (program) shown in the equation (1) for calculating the semantic analysis accuracy (acc) from the analysis language (lang) and the semantic analysis score (score) of the language.
  • the information processing system 1 performs the following processing for utterance input.
  • the information processing system 1 converts utterances into texts by voice recognition.
  • the input language is predetermined as a language that can be supported by the information processing system 1, such as Spanish.
  • the information processing system 1 performs the following processing for the automatic expansion of utterance sentences.
  • the translator and the semantic analyzer have different purposes.
  • the corpus of bilingual sentences and the corpus of each domain goal are collected and created separately, and are learned and modeled separately. Therefore, there is a gap between the translated sentence output from the translator model and the analysis sentence assumed by the semantic analyzer model.
  • the information processing system 1 automatically changes the wording and the phrase of the utterance sentence under the condition that the output of the meaning frame does not change, and generates a pseudo utterance sentence list. For example, the information processing system 1 responds to the input utterance "play music” by “play music”, “play music”, “play music”, “music play”, and the like. Generate paraphrases.
  • the information processing system 1 performs the following processing from the input language to the language development by translation.
  • the information processing system 1 uses the pseudo-utterance sentence list as input to the translator and translates each pseudo-utterance sentence into all languages that can be processed by the semantic analyzer.
  • the information processing system 1 generates N pseudo utterance sentences from one utterance sentence and translates them into M language. In this case, the number of elements of the translated sentence list output from the translator is N ⁇ M.
  • the translation sentence list may include sentences in the input language.
  • the information processing system 1 generates N pseudo utterance sentences including the one utterance sentence from one utterance sentence. Then, when the language (input language) of the utterance sentence is a specific language, the information processing system 1 translates it into the M-1 language and generates N sentences for each of the M languages including the input language. In this case, the number of elements in the translated sentence list is N ⁇ M.
  • the information processing system 1 performs the following processing on the semantic analysis processing (speech semantic analyzer).
  • the information processing system 1 performs utterance semantic analysis processing for each language and utterance sentence development.
  • the information processing system 1 generates a semantic frame from a text (character string) by a semantic analysis process.
  • the information processing system 1 is used to determine which application or service to execute by the function of the semantic analysis process (speech semantic analyzer) and to specifically execute the application or service. Generates semantic frame information in tabular form including slot information to be created.
  • the information processing system 1 executes a semantic analysis process on an element of a translated sentence list (in each language for utterance development in FIG. 6), and obtains a semantic frame and an analysis score value.
  • the information processing system 1 executes a semantic analysis process on 40 sentences in the translated sentence list, and obtains a semantic frame and an analysis score value. That is, in the example of FIG. 6, the information processing system 1 executes a semantic analysis process on 40 sentences obtained by multiplying the number of utterance development sentences "4" by the number of languages "10", and executes a semantic frame. And get the analysis score value.
  • the information processing system 1 performs the following processing for conversion from the score value (semantic analysis score) to the analysis accuracy (semantic analysis accuracy).
  • the score value (semantic analysis score) expresses the estimated probability, and the higher the value, the more statistically superior it becomes. In other words, the higher the score value (semantic analysis score), the higher the estimation accuracy.
  • the score value (semantic analysis score)
  • the higher the score value (semantic analysis score) the higher the estimation accuracy.
  • it is judged from the same model from the pseudo utterance sentence list it is possible to select the utterance sentence and the meaning frame having the highest score value.
  • the information processing system 1 calculates the analysis accuracy (semantic analysis accuracy) as shown in FIG. 6 from the score function by inputting the analysis language and the score value.
  • the information processing system 1 performs the following processing for analysis language and semantic frame selection.
  • the information processing system 1 selects the result of the analysis language, utterance sentence, and semantic frame having the highest analysis accuracy (semantic analysis accuracy) from the analysis accuracy (semantic analysis accuracy).
  • the information processing system 1 performs the following processing for the slot inverse transformation of the semantic frame.
  • the semantic frame of the maximum analysis accuracy is the analysis result in a specific language, and the domain goal is a language-independent abstract expression, so it is language-independent.
  • the value of the slot information is expressed in a specific language, and the application or service generally needs to return a response in the input language spoken by the user. Therefore, the information processing system 1 may reversely convert the slot information from the specific language to the input language in order to accurately execute the application or service.
  • the information processing system 1 performs inverse transformation by the following method.
  • the first method is to convert words / phrases from a knowledge database (DB) using a multilingual conversion dictionary.
  • the second method is to reverse-translate into a specific language with a translator.
  • the information processing system 1 is not limited to the method described above, and may perform inverse transformation by various methods.
  • the slot information of the meaning frame is basically an artist name, song name, place name, device name, etc., and is a word or phrase rather than a sentence. Can be done. Therefore, the slot information can be accurately converted by the dictionary.
  • the information processing system 1 may perform reverse translation from a specific language to an input language by reverse translation of the second method, and perform two-step conversion.
  • the information processing system 1 performs the following processing for response generation.
  • the semantic analysis process speech semantic analyzer
  • the information processing system 1 outputs in the form of an image (icon or the like), a voice (sound effect or the like), a text (language name / language code) or the like.
  • FIG. 2 is a diagram showing a configuration example of an information processing system according to an embodiment.
  • the information processing system 1 shown in FIG. 2 may include a plurality of terminal devices 10 and a plurality of information processing devices 100.
  • the information processing system 1 realizes the above-mentioned dialogue system.
  • the information processing device 100 is a computer that converts input language character information corresponding to a user's utterance in the target language into a translated sentence of the translation destination language and executes a semantic analysis process.
  • the information processing device 100 performs an inverse transformation process of converting the result of the semantic analysis process corresponding to the translation destination language into the target language.
  • the information processing device 100 is a computer that transmits various information to the terminal device 10.
  • the information processing device 100 is a server device used to provide services related to various functions. For example, the information processing device 100 is used to provide the user with a service related to the dialogue system.
  • the information processing device 100 performs various information processing related to the dialogue system to the user.
  • the information processing device 100 may have software modules such as voice signal processing, voice recognition, utterance semantic analysis, and dialogue control.
  • the information processing apparatus 100 may have functions of natural language understanding (NLU: Natural Language Understanding) and automatic speech recognition (ASR: Automatic Speech Recognition).
  • NLU Natural Language Understanding
  • ASR Automatic Speech Recognition
  • the information processing device 100 may estimate information about a user's intent (intention) or entity (target) from input information uttered by the user.
  • the information processing device 100 functions as a voice recognition server having functions of natural language understanding and automatic voice recognition.
  • the terminal device 10 is a computer that detects the user's utterance and transmits the voice of the user's utterance to the information processing device 100 or the like. Further, the terminal device 10 may have functions such as natural language understanding and automatic voice recognition. For example, the terminal device 10 may estimate information about a user's intent (intention) or entity (target) from input information uttered by the user.
  • the terminal device 10 is a device device used by a user. The terminal device 10 accepts input by the user. The terminal device 10 accepts voice input by the user's utterance and input by the user's operation. The terminal device 10 displays information according to the input of the user.
  • the terminal device 10 is an information processing device used by the user.
  • the terminal device 10 is used to provide a dialogue service that responds to a user's utterance.
  • the terminal device 10 has a sound sensor that detects the sound of a microphone or the like.
  • the terminal device 10 uses a sound sensor to detect a user's utterance around the terminal device 10.
  • the terminal device 10 may be a device (voice assist terminal) that detects ambient sounds and performs various processes according to the detected sounds.
  • the terminal device 10 is a computer that processes a user's utterance.
  • the terminal device 10 may be any device as long as the processing in the embodiment can be realized.
  • the terminal device 10 may be any device as long as it has a function of detecting the user's utterance and transmitting it to the information processing device 100.
  • the terminal device 10 may be, for example, a device such as a smartphone, a tablet terminal, a notebook PC (Personal Computer), a desktop PC, a mobile phone, or a PDA (Personal Digital Assistant).
  • the terminal device 10 may be a wearable terminal (Wearable Device) or the like that the user can wear.
  • the terminal device 10 may be a wristwatch-type terminal, a glasses-type terminal, or the like.
  • the terminal device 10 may be a so-called home electric appliance such as a television or a refrigerator.
  • the terminal device 10 may be a robot that interacts with a human (user), such as a smart speaker, an entertainment robot, or a domestic robot.
  • the terminal device 10 may be a device arranged at a predetermined position such as digital signage.
  • FIG. 3 is a diagram showing a configuration example of the information processing device 100 according to the embodiment of the present disclosure.
  • the information processing device 100 includes a communication unit 110, a storage unit 120, and a control unit 130.
  • the information processing device 100 includes an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from the administrator of the information processing device 100, and a display unit (for example, a liquid crystal display, etc.) for displaying various information. You may have.
  • the communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. Then, the communication unit 110 is connected to the network N (see FIG. 2) by wire or wirelessly, and transmits / receives information to / from another information processing device such as the terminal device 10. Further, the communication unit 110 may send and receive information to and from the terminal device 10.
  • a NIC Network Interface Card
  • the storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory (Flash Memory), or a storage device such as a hard disk or an optical disk. As shown in FIG. 3, the storage unit 120 according to the embodiment includes a language information storage unit 121, a semantic frame information storage unit 122, an analysis accuracy information storage unit 123, a threshold information storage unit 124, and a knowledge information storage unit. It has 125 and.
  • the storage unit 120 stores various information, not limited to the above.
  • the storage unit 120 may store information indicating a specific language corresponding to each language.
  • the storage unit 120 is a semantic analyzer that outputs information on a semantic frame such as a specified domain goal and a score (also referred to as a “semantic analysis score”) indicating its accuracy (confidence) in response to input of character information.
  • a semantic analyzer that outputs information on a semantic frame such as a specified domain goal and a score (also referred to as a “semantic analysis score”) indicating its accuracy (confidence) in response to input of character information.
  • the storage unit 120 stores the information of the semantic analyzer for each specific language capable of semantic analysis.
  • the storage unit 120 stores information of a semantic analyzer for each specific language, such as an English semantic analyzer or a Japanese semantic analyzer, which is a specific language.
  • the language information storage unit 121 stores various information related to the language. For example, the language information storage unit 121 stores various information in a language in which the information processing system 1 can identify the language (speech recognition). The language information storage unit 121 stores information indicating whether each language is a language capable of semantic analysis (specific language) and information indicating a language capable of translating each language (translation destination language).
  • FIG. 4 is a diagram showing an example of the language information storage unit according to the embodiment.
  • the language information storage unit 121 shown in FIG. 4 includes items such as "language” and "translation destination language”. Further, the "translation destination language” includes items such as "# 1" and "# 2". Although only “# 1" and "# 2" are shown in FIG. 4, the "translation destination language” includes a number of items corresponding to the translation destination language such as "# 3" and "# 4". It may be.
  • “Language” indicates the language.
  • “language” indicates a language in which the information processing system 1 can identify the language (speech recognition).
  • identification information (language ID) for identifying a language may be stored in association with each language.
  • a language code that identifies each language may be stored.
  • “Translation destination language” indicates a language (translation destination language) capable of translating the language.
  • “translation destination language” indicates a translation destination language (translation destination language) in which the language can be translated.
  • the language information storage unit 121 is not limited to the above, and may store various information depending on the purpose.
  • the language information storage unit 121 may store information indicating whether each language is a language (specific language) capable of semantic analysis.
  • the language information storage unit 121 may store a flag indicating whether each language is a language (specific language) capable of semantic analysis.
  • the language information storage unit 121 may store the case where the language is a specific language in association with the flag "1" and the case where the language is not a specific language in association with the flag "0".
  • the information processing apparatus 100 may extract a specific language from the translation destination language associated with each language. For example, the information processing apparatus 100 may use a language whose flag is "1" among the translation destination languages associated with each language as a specific language.
  • the semantic frame information storage unit 122 stores various information related to the semantic frame.
  • the semantic frame information storage unit 122 stores various information related to the semantic frame for each language.
  • the semantic frame information storage unit 122 stores information about the semantic frame corresponding to each language.
  • the semantic frame information storage unit 122 stores information about the semantic frame corresponding to each specific language.
  • the semantic frame information storage unit 122 stores information (table) for each specific language, such as frame information FM1 and frame information FM2.
  • the frame information FM1 indicates information about a semantic frame of the language "English”.
  • the frame information FM2 indicates information regarding a semantic frame of the language "China”.
  • the frame information FM1 and the frame information FM2 shown in FIG. 5 include items such as "language”, “Domain-Goal”, and “Slot”. Further, “Slot” includes items such as "Attribute” and "Value”.
  • “Language” indicates the language.
  • “language” indicates a language in which the information processing system 1 can identify the language (speech recognition).
  • identification information (language ID) for identifying a language may be stored in association with each language.
  • a language code that identifies each language may be stored.
  • Domain-Goal indicates the domain goal of the semantic frame. For example, “Domain-Goal” indicates the purpose (intention) of the utterance.
  • “Slot” stores various information about the corresponding Domain-Goal slot (component). For example, in “Slot”, an attribute (slot name) included in the corresponding domain goal and its value (slot value) are stored. "Attribute” indicates an attribute (slot name) of a slot (component). "Value” indicates the slot value of the corresponding attribute (slot name). The “ ⁇ (hyphen)” indicated by “Value” in the semantic frame information storage unit 122 indicates that the value is not stored in “Value”. When used in the processing of the user's semantic analysis, the "Value” stores a specific value (information) corresponding to the user's utterance.
  • the language “English” includes a meaning frame in which Domain-Goal is “Weather-Check” or “Music-Play”. Further, it is shown that the Domain-Goal “Weather-Check” includes a slot whose "Attribute” is “Date” or “Place”. That is, the Domain-Goal "Weather-Check” for checking the weather includes slots related to the date and time and place.
  • the semantic frame information storage unit 122 is not limited to the above, and various information may be stored depending on the purpose.
  • the semantic frame information storage unit 122 may store the format of the value stored in each “Value”. For example, information indicating whether the value stored in "Value” is a numerical value or other information (character string or the like) may be stored. For example, information indicating whether the value stored in "Value” is information that can be commonly used in the language may be stored. For example, when the value stored in "Value” is a numerical value, information indicating that the information can be commonly used in the language may be stored.
  • the analysis accuracy information storage unit 123 stores various information related to the analysis accuracy.
  • the analysis accuracy information storage unit 123 stores various information related to the score, analysis accuracy, and the like in association with each character information.
  • the analysis accuracy information storage unit 123 stores various information related to the score, analysis accuracy, and the like in association with each character information corresponding to the input language and the translation destination language.
  • FIG. 6 is a diagram showing an example of the analysis accuracy information storage unit according to the embodiment. In the analysis accuracy information storage unit 123 shown in FIG. 6, "input language”, “utterance development”, “translation destination specific language”, “translation destination translation sentence”, “semantic analysis score”, “semantic analysis accuracy (%)) ] Is included.
  • “Input language” indicates a language that can be input.
  • “input language” indicates a language in which the information processing system 1 can identify a language (speech recognition).
  • identification information (language ID) for identifying the input language may be stored in association with each input language.
  • a language code that identifies each input language may be stored.
  • “Utterance development” indicates character information corresponding to the user's utterance and a paraphrase in which the character information is paraphrased.
  • “A” which is “play the Beatles” indicates the character information corresponding to the user's utterance
  • the remaining three “B” to “D” are " Indicates a paraphrase that paraphrases "A”.
  • utterance development information A indicates the character information of the user's utterance text as it is.
  • “B” which is "playing the Beatles song” in FIG. 6 may be described as “utterance development information B”.
  • the utterance development information B clarifies the target as "song” in the utterance development information A, and indicates character information paraphrased in detail.
  • utterance development information C which is "Please play the Beatles song” in Fig. 6, may be described as “Utterance development information C”.
  • D which is "Please play the Beatles song” in FIG. 6, may be described as "utterance development information D”.
  • the utterance development information C and the utterance development information D indicate character information obtained by paraphrasing the utterance development information A into expressions used in more daily life.
  • FIG. 6 shows a case where the utterance development information A is paraphrased into three paraphrases of the utterance development information B, the utterance development information C, and the utterance development information D, but the number of paraphrases is four or more. It may be one or two.
  • Translation destination specific language indicates the translation destination language.
  • “translation destination specific language” is a language to be translated and indicates a specific language.
  • a language other than the specific language (non-specific language) is stored in the "translation destination specific language”.
  • identification information (language ID) for identifying the translation destination specific language may be stored in association with each translation destination specific language.
  • a language code that identifies each translation destination specific language may be stored.
  • the input language when the input language is also a specific language, the input language is also shown in a list together with the translation destination language in the "translation destination specific language". The same language as "indicates the target language, not the target language.
  • Translation destination translation indicates character information corresponding to the translation destination specific language.
  • the "translation destination translation sentence” indicates the character information in which the character information shown in the utterance development is translated into the corresponding translation destination language.
  • the line corresponding to "utterance development” is utterance development information A and "translation destination specific language” corresponds to "English” is utterance development information A which is Japanese character information. Indicates the textual information translated into English.
  • the input language when the input language is also a specific language, the input language is also shown in a list together with the translation destination language in the "translation destination translation sentence".
  • the character information corresponding to the same language as "" is not the translated sentence, but the same character information as the character-order information shown in "Utterance development”.
  • Semantic analysis score indicates the degree of certainty (accuracy) of semantic analysis.
  • the “semantic analysis score” indicates the certainty (accuracy) of the domain goal specified in the semantic analysis process.
  • “Semantic analysis accuracy (%)” indicates an accuracy index value that makes it possible to compare the accuracy of semantic analysis processing between multiple languages.
  • the “semantic analysis accuracy (%)” indicates an index value (score) obtained by converting the value of the “semantic analysis score” by a predetermined function.
  • the utterance development information A in which the input language "Japanese” is "playing the Beatles” corresponding to the user's utterance, and the utterance development information B to D in which the utterance development information A is paraphrased.
  • the case where the utterance is expanded into three paraphrases is shown.
  • the information processing device 100 performs utterance development that converts one character information corresponding to the user's utterance into a plurality of paraphrases.
  • the translation destination specific language of "Japanese”, which is the input language (target language), is English, Spanish, French, German, Italian, Chinese (simplified), Korean, Hindi, Arabic, etc. Indicates that there is.
  • the information processing apparatus 100 uses the character information of the target language "Japanese” in English, Spanish, French, German, Italian, Chinese (simplified), Korean, Hindi, Arabic, and the like. Convert to textual information.
  • the character information "Play the Beatles”, which is the Japanese utterance development information A indicates that the semantic analysis score is "0.83" and the semantic analysis accuracy is "82.9".
  • the character information "Play the Beatles”, which is the Japanese utterance development information A "Play the Beatles” translated into English has a semantic analysis score of "0.51” and a semantic analysis accuracy of "0.51". It shows that it is 49.8 ”.
  • the analysis accuracy information storage unit 123 is not limited to the above, and may store various information depending on the purpose.
  • the threshold information storage unit 124 stores various information related to the threshold value.
  • the threshold information storage unit 124 stores various information regarding the threshold value used for comparison with the score.
  • FIG. 7 is a diagram showing an example of the threshold information storage unit according to the embodiment.
  • the threshold information storage unit 124 shown in FIG. 7 includes items such as “threshold ID” and “threshold”.
  • Threshold ID indicates identification information for identifying the threshold value. Further, the “threshold value” indicates a specific value of the threshold value identified by the corresponding threshold ID. In addition, information indicating its use is stored in association with each threshold value.
  • the value of the threshold value TH1 identified by the threshold value ID “TH1” is “0.75”. Further, the threshold value TH1 is stored in association with information indicating that its use is quality estimation (for example, translation).
  • the threshold information storage unit 124 is not limited to the above, and may store various information depending on the purpose.
  • the threshold information storage unit 124 may store the use of the threshold value in association with the threshold value ID.
  • the threshold information storage unit 124 may store the threshold ID “TH1” in association with the use “quality estimation”.
  • the knowledge information storage unit 125 stores various information related to knowledge.
  • the knowledge information storage unit 125 functions as a knowledge database (knowledge DB).
  • the knowledge information storage unit 125 stores information in a multilingual dictionary. For example, the knowledge information storage unit 125 stores information indicating the correspondence between character strings indicating each object between languages.
  • the information processing device 100 acquires knowledge from the outside, the information processing device 100 does not have to have the knowledge information storage unit 125.
  • control unit 130 for example, a program stored inside the information processing apparatus 100 by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like (for example, a determination program such as an information processing program according to the present disclosure) is stored in a RAM. It is realized by executing (Random Access Memory) etc. as a work area. Further, the control unit 130 is a controller, and is realized by, for example, an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).
  • ASIC Application Specific Integrated Circuit
  • FPGA Field Programmable Gate Array
  • the control unit 130 includes an acquisition unit 131, a conversion unit 132, an execution unit 133, a calculation unit 134, a selection unit 135, an inverse conversion unit 136, a generation unit 137, and a transmission unit. It has 138 and realizes or executes the functions and actions of information processing described below.
  • the internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 3, and may be another configuration as long as it is a configuration for performing information processing described later.
  • the connection relationship of each processing unit included in the control unit 130 is not limited to the connection relationship shown in FIG. 3, and may be another connection relationship.
  • the acquisition unit 131 acquires various information.
  • the acquisition unit 131 acquires various information from an external information processing device.
  • the acquisition unit 131 acquires various information from the terminal device 10.
  • the acquisition unit 131 acquires various information from the storage unit 120.
  • the acquisition unit 131 acquires various information from the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125.
  • the acquisition unit 131 acquires various information converted by the conversion unit 132.
  • the acquisition unit 131 acquires various information executed by the execution unit 133.
  • the acquisition unit 131 acquires various information calculated by the calculation unit 134.
  • the acquisition unit 131 acquires various information selected by the selection unit 135.
  • the acquisition unit 131 acquires various information converted by the inverse conversion unit 136.
  • the acquisition unit 131 acquires various information generated by the generation unit 137.
  • the acquisition unit 131 may acquire a model (function).
  • the acquisition unit 131 acquires a model for estimating the quality of translation (quality estimation model).
  • the acquisition unit 131 acquires a model (quality estimation model) for estimating the translation quality from an external information processing device or a storage unit 120 that provides various models (functions).
  • the acquisition unit 131 acquires the user's utterance information in the target language.
  • the acquisition unit 131 acquires character information corresponding to the user's utterance in the target language.
  • the conversion unit 132 converts various information.
  • the conversion unit 132 converts various information based on the information from the external information processing device and the information stored in the storage unit 120.
  • the conversion unit 132 converts various information from the storage unit 120.
  • the conversion unit 132 converts various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the conversion unit 132 converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language.
  • the conversion unit 132 converts the character information corresponding to the target language into the character information corresponding to the language whose meaning can be interpreted.
  • the conversion unit 132 determines various information. The conversion unit 132 determines various information. The conversion unit 132 determines various information. The conversion unit 132 determines whether the language is capable of language identification (speech recognition). The conversion unit 132 determines whether the input language is a non-target language. The conversion unit 132 determines whether the target language is a non-target language. The conversion unit 132 determines that a language that cannot be language-identified (speech recognition) is a non-target language. The conversion unit 132 determines whether or not the input language is a specific language. The conversion unit 132 determines whether or not the target language is a specific language.
  • Execution unit 133 executes various processes.
  • the execution unit 133 executes various processes based on information from an external information processing device.
  • the execution unit 133 executes various processes based on the information stored in the storage unit 120.
  • the execution unit 133 executes various processes based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the execution unit 133 generates various information by executing the process.
  • the execution unit 133 executes various processes based on various information acquired by the acquisition unit 131.
  • the execution unit 133 executes various processes based on various information converted by the conversion unit 132.
  • the execution unit 133 executes various processes based on various information calculated by the calculation unit 134.
  • the execution unit 133 executes various processes based on various information selected by the selection unit 135.
  • the execution unit 133 executes various processes based on various information converted by the inverse conversion unit 136.
  • Execution unit 133 determines various information.
  • the execution unit 133 determines various information.
  • the execution unit 133 determines the execution of various processes.
  • the execution unit 133 determines the execution of various processes.
  • the execution unit 133 specifies various types of information.
  • the execution unit 133 estimates various information.
  • the execution unit 133 extracts various information.
  • the execution unit 133 selects various information.
  • the execution unit 133 extracts various information based on the information from the external information processing device and the information stored in the storage unit 120.
  • the execution unit 133 extracts various information from the storage unit 120.
  • the execution unit 133 extracts various information from the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125.
  • the execution unit 133 extracts various information based on the various information acquired by the acquisition unit 131.
  • the execution unit 133 extracts various information based on the various information converted by the conversion unit 132.
  • the execution unit 133 extracts various information based on the various information calculated by the calculation unit 134.
  • the execution unit 133 extracts various information based on the various information selected by the selection unit 135. Further, the execution unit 133 extracts various information based on the various information converted by the inverse conversion unit 136.
  • the execution unit 133 extracts various information based on the information generated by the generation unit 137.
  • Execution unit 133 executes analysis of character information corresponding to the user's utterance by appropriately using natural language processing technology such as morphological analysis.
  • the execution unit 133 estimates (identifies) the content of the user's utterance by semantic analysis using the character information corresponding to the user's utterance.
  • the execution unit 133 may estimate (specify) the user's situation by dialogue state estimation (Dialog State Tracking) using the character information corresponding to the user's utterance.
  • the execution unit 133 estimates (identifies) the content of the character information by analyzing the character information converted by the conversion unit 132 by appropriately using semantic analysis and dialogue state estimation.
  • the execution unit 133 analyzes the character information converted from the target language to the translation destination language by the conversion unit 132 by appropriately using a natural language processing technique such as morphological analysis. For example, the execution unit 133 estimates the content of the user's utterance corresponding to the character information by appropriately analyzing the character information by using various conventional techniques such as parsing.
  • the execution unit 133 estimates the content such as the intention of the user's utterance by analyzing the user's utterance.
  • the execution unit 133 estimates the content such as the intention of the user's utterance by appropriately using various conventional techniques.
  • the execution unit 133 estimates the content of the user's utterance by analyzing the user's utterance by appropriately using various conventional techniques.
  • the execution unit 133 extracts important keywords from the character information of the user's utterance, and estimates the content of the user's utterance based on the extracted keywords.
  • the execution unit 133 identifies the Domain-Goal (domain goal) corresponding to the user's utterance by analyzing the character information corresponding to the utterance.
  • the execution unit 133 estimates the attribute information such as the slot value corresponding to the specified Domain-Goal (domain goal) by analyzing the character information corresponding to the utterance.
  • the execution unit 133 performs translation quality estimation (also simply referred to as “quality estimation”).
  • quality estimation also simply referred to as “quality estimation”.
  • Execution unit 133 estimates the quality of the translated character information.
  • the execution unit 133 calculates the quality estimation translation accuracy (quality score) of the character information (translation text) by an appropriate method.
  • the execution unit 133 uses a model (quality estimation model) that outputs a score (quality score) indicating the quality estimation translation accuracy in response to the input of the character information before translation and the character information after translation.
  • Quality estimation translation accuracy quality score
  • the execution unit 133 uses a quality estimation model learned by using a combination of character information before translation, character information after translation, and its score (correct answer score) as learning data to determine the quality estimation translation accuracy (quality score). It may be calculated.
  • the execution unit 133 uses a quality estimation model learned as learning data including a score (correct answer score) set by the administrator of the information processing system 1 based on the character information before translation and the character information after translation. It may be used to calculate the quality estimation translation accuracy (quality score).
  • Execution unit 133 compares the quality score with the threshold value (for example, 0.75 etc.). If the quality score of the character information is equal to or higher than the threshold value, the execution unit 133 determines that the quality of the character information is high (high score), and if the quality estimation translation accuracy (quality score) of the translated text is less than the threshold value. If so, it is determined that the quality of the translated text is low (low score).
  • the threshold value for example, 0.75 etc.
  • Execution unit 133 executes semantic analysis processing for one or more character information corresponding to each of one or more languages including the target language which is the language corresponding to the user's utterance.
  • the execution unit 133 executes a semantic analysis process on one or more character information including one character information corresponding to the user's utterance in the target language.
  • the execution unit 133 executes a semantic analysis process on one or more character information including the translated character information in which one character information is converted into the translation destination language to be the translation destination of the target language.
  • Execution unit 133 executes a semantic analysis process on one or more character information including a paraphrase in which one character information of the target language is paraphrased into another expression of the target language.
  • the execution unit 133 executes a semantic analysis process on one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to which the target language is translated.
  • the execution unit 133 executes a semantic analysis process on the character information converted by the conversion unit 132.
  • the execution unit 133 does not execute the semantic analysis process of a language other than the target language in order to reduce the processing cost.
  • the execution unit 133 provides one or more character information corresponding to each of the one or more languages including the target language.
  • the semantic analysis process is executed.
  • the execution unit 133 does not execute the semantic analysis process using the character information.
  • the execution unit 133 executes the semantic analysis process using the character information generated by the post-editing.
  • the execution unit 133 executes the process related to the process interruption.
  • the execution unit 133 executes a process of notifying that the process is interrupted.
  • the execution unit 133 executes the semantic analysis process on the character information of the target language spoken by the user.
  • Execution unit 133 executes the semantic analysis process for each language using the semantic analyzer of each language.
  • the execution unit 133 uses a semantic analyzer that outputs information on a semantic frame such as a specified domain goal and a score (semantic analysis score) indicating its accuracy (confidence) in response to input of character information. Executes semantic analysis processing.
  • the execution unit 133 calculates a score (semantic analysis score) indicating the accuracy of the semantic analysis.
  • the execution unit 133 calculates a semantic analysis score indicating the certainty (accuracy) of the domain goal specified in the semantic analysis process.
  • the execution unit 133 may use the score output by the semantic analyzer used for the semantic analysis process as the semantic analysis score.
  • the execution unit 133 uses the score output by the semantic analyzer for each language used for the semantic analysis processing of each language as the semantic analysis score of each language.
  • the execution unit 133 may calculate the semantic analysis score by appropriately using various techniques.
  • the calculation unit 134 calculates various information.
  • the calculation unit 134 calculates various values.
  • the calculation unit 134 calculates various scores. For example, the calculation unit 134 calculates various types of information based on information from an external information processing device and information stored in the storage unit 120.
  • the calculation unit 134 calculates various types of information based on information from other information processing devices such as the terminal device 10.
  • the calculation unit 134 calculates various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the calculation unit 134 calculates various information based on various information acquired by the acquisition unit 131.
  • the calculation unit 134 calculates various information based on the various information converted by the conversion unit 132.
  • the calculation unit 134 calculates various information based on various processes executed by the execution unit 133.
  • the calculation unit 134 calculates various information based on the result of the semantic analysis executed by the execution unit 133.
  • the calculation unit 134 calculates various information based on the various information selected by the selection unit 135.
  • the calculation unit 134 calculates various information based on the various information converted by the inverse conversion unit 136.
  • the calculation unit 134 calculates various information based on various information generated by the generation unit 137.
  • the calculation unit 134 makes it possible to compare the accuracy of the semantic analysis processing for each of the one or more character information between a plurality of languages based on the result of the semantic analysis processing corresponding to each of the one or more character information. Calculate the index value.
  • the calculation unit 134 calculates each accuracy index value of one or more character information by using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process.
  • the calculation unit 134 calculates each accuracy index value of one or more character information by using a function that outputs an accuracy index value by inputting a score included in the result of the semantic analysis process.
  • the calculation unit 134 uses a function that inputs a score (semantic analysis score) included in the result of the semantic analysis process and outputs an accuracy index value (semantic analysis accuracy) to obtain each accuracy index value of one or more character information. calculate.
  • the calculation unit 134 uses a function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating one language, and the accuracy index value of the character information of one language. Is calculated.
  • the calculation unit 134 calculates the accuracy index value corresponding to the character information of the target language.
  • the selection unit 135 selects various information.
  • the selection unit 135 extracts various information.
  • the selection unit 135 specifies various types of information.
  • the selection unit 135 selects various information based on the information from the external information processing device and the information stored in the storage unit 120.
  • the selection unit 135 selects various types of information based on information from other information processing devices such as the terminal device 10.
  • the selection unit 135 selects various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the selection unit 135 selects various information based on various information acquired by the acquisition unit 131.
  • the selection unit 135 selects various information based on the various information converted by the conversion unit 132.
  • the selection unit 135 selects various information based on various processes executed by the execution unit 133.
  • the selection unit 135 selects various information based on the result of the semantic analysis executed by the execution unit 133.
  • the selection unit 135 selects various information based on the various information calculated by the calculation unit 134.
  • the selection unit 135 selects various information based on the various information converted by the inverse conversion unit 136.
  • the selection unit 135 selects various information based on the various information generated by the generation unit 137.
  • the selection unit 135 selects the processing target character information, which is the character information used for processing, from the one or more character information based on each accuracy index value of the one or more character information calculated by the calculation unit 134.
  • the selection unit 135 selects the character information having the maximum accuracy index value as the character information to be processed.
  • the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the selection unit 135 selects the character information of the target language as the character information to be processed.
  • the inverse conversion unit 136 converts various information. For example, the inverse conversion unit 136 converts various information based on the information from the external information processing device and the information stored in the storage unit 120. The inverse conversion unit 136 converts various information based on the information from other information processing devices such as the terminal device 10. The inverse conversion unit 136 converts various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. do.
  • the inverse conversion unit 136 converts various information based on various information acquired by the acquisition unit 131.
  • the inverse conversion unit 136 converts various information based on the various information converted by the conversion unit 132.
  • the inverse conversion unit 136 converts various information based on the various information calculated by the execution unit 133.
  • the inverse conversion unit 136 converts various information based on the various information calculated by the calculation unit 134.
  • the inverse conversion unit 136 converts various information based on the various information selected by the selection unit 135.
  • the inverse conversion unit 136 converts various information based on the various information generated by the generation unit 137.
  • the inverse conversion unit 136 changes various information based on the conversion.
  • Various information is updated based on the information acquired by the acquisition unit 131.
  • the inverse transformation unit 136 converts the result of the semantic analysis processing corresponding to the language of the processing target character information selected by the selection unit 135 into the target language.
  • the inverse transformation unit 136 converts the result of the semantic analysis processing corresponding to the language of the processing target character information into the target language.
  • the inverse transformation unit 136 converts the result of the semantic analysis process into the target language.
  • the inverse transformation unit 136 converts a part of the result of the semantic analysis process into the target language.
  • the inverse conversion unit 136 converts the slot value of the result of the semantic analysis process into the target language.
  • the inverse conversion unit 136 determines various information.
  • the inverse conversion unit 136 determines various information.
  • the inverse conversion unit 136 determines the execution of various processes.
  • the inverse transformation unit 136 determines whether or not the inverse transformation needs to be executed.
  • the inverse transformation unit 136 does not have to convert the information that does not require inverse transformation into the target language.
  • the inverse transformation unit 136 does not have to convert the information that does not require inverse transformation among the results of the semantic analysis process into the target language.
  • the inverse transformation unit 136 does not have to convert the information common to the languages among the results of the semantic analysis processing into the target language.
  • the inverse conversion unit 136 does not have to convert the information common to the languages such as numerical values among the slot values into the target language.
  • Generation unit 137 generates various information.
  • the generation unit 137 generates various information based on the information from the external information processing device and the information stored in the storage unit 120.
  • the generation unit 137 generates various information based on the information from other information processing devices such as the terminal device 10.
  • the generation unit 137 generates various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the generation unit 137 generates various information based on the various information acquired by the acquisition unit 131.
  • the generation unit 137 generates various information based on the various information converted by the conversion unit 132.
  • the generation unit 137 generates various information based on various information generated by the processing execution of the execution unit 133.
  • the generation unit 137 generates various information based on the various information calculated by the calculation unit 134.
  • the generation unit 137 generates various information based on the various information selected by the selection unit 135.
  • the generation unit 137 generates various information based on the various information converted by the inverse conversion unit 136.
  • the generation unit 137 appropriately uses various techniques to generate various information such as a screen (image information) to be provided to an external information processing device.
  • the generation unit 137 generates a screen (image information) or the like to be provided to the terminal device 10.
  • the generation unit 137 generates a screen (image information) or the like to be provided to the terminal device 10 based on the information stored in the storage unit 120.
  • the generation unit 137 may generate the screen (image information) or the like by any process as long as the screen (image information) or the like to be provided to the external information processing device can be generated.
  • the generation unit 137 generates a screen (image information) to be provided to the terminal device 10 by appropriately using various techniques related to image generation, image processing, and the like.
  • the generation unit 137 appropriately uses various techniques such as Java (registered trademark) to generate a screen (image information) to be provided to the terminal device 10.
  • the generation unit 137 may generate a screen (image information) to be provided to the terminal device 10 based on the format of CSS, Javascript (registered trademark), or HTML.
  • the generation unit 137 may generate a screen (image information) in various formats such as JPEG (Joint Photographic Experts Group), GIF (Graphics Interchange Format), and PNG (Portable Network Graphics).
  • the generation unit 137 generates a quality estimation model using learning data including a combination of character information before translation, character information after translation, and a score (correct answer score) thereof.
  • the generation unit 137 uses learning data including a score (correct answer score) set by the administrator of the information processing system 1 based on the character information before translation and the character information after translation to generate a quality estimation model. Generate.
  • the transmission unit 138 transmits various information.
  • the transmission unit 138 transmits various information to an external information processing device.
  • the transmission unit 138 provides various information to an external information processing device.
  • the transmission unit 138 transmits various information to another information processing device such as the terminal device 10.
  • the transmission unit 138 provides the information stored in the storage unit 120.
  • the transmission unit 138 transmits the information stored in the storage unit 120.
  • the transmission unit 138 provides various types of information based on information from other information processing devices such as the terminal device 10.
  • the transmission unit 138 provides various information based on the information stored in the storage unit 120.
  • the transmission unit 138 provides various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
  • the transmission unit 138 transmits information indicating a function to be executed by the terminal device 10 to the terminal device 10.
  • the transmission unit 138 transmits information indicating the function (service) selected by the selection unit 135 to the terminal device 10.
  • the transmission unit 138 transmits various information to the terminal device 10 in response to an instruction from the execution unit 133.
  • the transmission unit 138 transmits information instructing the terminal device 10 to execute a function (service).
  • the transmission unit 138 transmits the image information generated by the generation unit 137.
  • FIG. 8 is a diagram showing a configuration example of the terminal device according to the embodiment of the present disclosure.
  • the terminal device 10 includes a communication unit 11, an input unit 12, an output unit 13, a storage unit 14, a control unit 15, a sensor unit 16, and a display unit 17.
  • the communication unit 11 is realized by, for example, a NIC or a communication circuit.
  • the communication unit 11 is connected to the network N (Internet or the like) by wire or wirelessly, and transmits / receives information to / from other devices such as the information processing device 100 via the network N.
  • the input unit 12 accepts various inputs.
  • the input unit 12 receives the detection by the sensor unit 16 as an input.
  • the input unit 12 accepts the input of the user's utterance information.
  • the input unit 12 accepts input by the user's physical movement.
  • the input unit 12 accepts the user's gesture and line of sight as input.
  • the input unit 12 receives sound as input by the sensor unit 16 having a function of detecting voice.
  • the input unit 12 receives the voice information detected by the microphone (sound sensor) that detects the voice as the input information.
  • the input unit 12 receives the voice spoken by the user as input information.
  • the input unit 12 may accept an operation (user operation) on the terminal device 10 used by the user as an operation input by the user.
  • the input unit 12 may receive information regarding the operation of the user using the remote controller (remote controller) via the communication unit 11.
  • the input unit 12 may have a button provided on the terminal device 10 or a keyboard or mouse connected to the terminal device 10.
  • the input unit 12 may have a touch panel capable of realizing functions equivalent to those of a remote controller, a keyboard, and a mouse.
  • various information is input to the input unit 12 via the display unit 17.
  • the input unit 12 receives various operations from the user via the display screen by the function of the touch panel realized by various sensors. That is, the input unit 12 receives various operations from the user via the display unit 17 of the terminal device 10.
  • the input unit 12 receives an operation such as a user's designated operation via the display unit 17 of the terminal device 10.
  • the input unit 12 functions as a reception unit that receives a user's operation by the function of the touch panel.
  • the input unit 12 and the reception unit 153 may be integrated.
  • the capacitance method is mainly adopted in the tablet terminal, but other detection methods such as the resistance film method, the surface acoustic wave method, the infrared method, and the electromagnetic induction method are used. Any method may be adopted as long as the user's operation can be detected and the touch panel function can be realized.
  • the input unit 12 accepts a user's utterance as an input.
  • the input unit 12 receives the user's utterance detected by the sensor unit 16 as input.
  • the input unit 12 receives the user's utterance detected by the sound sensor of the sensor unit 16 as an input.
  • the output unit 13 outputs various information.
  • the output unit 13 has a function of outputting audio.
  • the output unit 13 has a speaker that outputs sound.
  • the output unit 13 outputs various information by voice according to the control by the execution unit 152.
  • the output unit 13 outputs information by voice to the user.
  • the output unit 13 outputs the information displayed on the display unit 17 by voice.
  • the storage unit 14 is realized by, for example, a semiconductor memory element such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk.
  • the storage unit 14 stores various information used for displaying the information.
  • the control unit 15 is realized by, for example, a CPU, an MPU, or the like executing a program stored inside the terminal device 10 (for example, a display program such as an information processing program according to the present disclosure) with a RAM or the like as a work area. Will be done. Further, the control unit 15 is a controller, and may be realized by an integrated circuit such as an ASIC or FPGA.
  • control unit 15 includes a reception unit 151, an execution unit 152, a reception unit 153, and a transmission unit 154, and realizes or executes the information processing functions and operations described below. ..
  • the internal configuration of the control unit 15 is not limited to the configuration shown in FIG. 8, and may be another configuration as long as it is a configuration for performing information processing described later.
  • the receiving unit 151 receives various information.
  • the receiving unit 151 receives various information from an external information processing device.
  • the receiving unit 151 receives various information from other information processing devices such as the information processing device 100.
  • the receiving unit 151 receives information instructing the execution of a function (service) from the information processing device 100.
  • the receiving unit 151 receives execution instructions of various functions (services) from the information processing device 100. For example, the receiving unit 151 receives information specifying a function (service) from the information processing device 100 as a function execution instruction.
  • the receiving unit 151 receives the content.
  • the receiving unit 151 receives the content to be displayed from the information processing device 100.
  • Execution unit 152 executes various processes.
  • the execution unit 152 determines the execution of various processes.
  • the execution unit 152 executes various processes based on information from an external information processing device.
  • the execution unit 152 executes various processes based on the information from the information processing device 100.
  • the execution unit 152 executes various processes in response to an instruction from the information processing device 100.
  • the execution unit 152 executes various processes based on the information stored in the storage unit 14.
  • the execution unit 152 executes a function (service).
  • the execution unit 152 controls various outputs.
  • the execution unit 152 controls the audio output by the output unit 13.
  • the execution unit 152 controls various displays.
  • the execution unit 152 controls the display of the display unit 17.
  • the execution unit 152 controls the display of the display unit 17 in response to the reception by the reception unit 151.
  • the execution unit 152 controls the display of the display unit 17 based on the information received by the reception unit 151.
  • the execution unit 152 controls the display of the display unit 17 based on the information received by the reception unit 153.
  • the execution unit 152 controls the display of the display unit 17 in response to the reception by the reception unit 153.
  • Reception department 153 receives various information.
  • the reception unit 153 receives input by the user via the input unit 12.
  • the reception unit 153 accepts the utterance by the user as an input.
  • the reception unit 153 accepts operations by the user.
  • the reception unit 153 accepts the user's operation on the information displayed by the display unit 17.
  • the reception unit 153 accepts character input by the user.
  • the transmission unit 154 transmits various information to an external information processing device. For example, the transmission unit 154 transmits various information to another information processing device such as the information processing device 100. The transmission unit 154 transmits the information stored in the storage unit 14.
  • the transmission unit 154 transmits various types of information based on information from other information processing devices such as the information processing device 100.
  • the transmission unit 154 transmits various types of information based on the information stored in the storage unit 14.
  • the transmission unit 154 transmits the sensor information detected by the sensor unit 16 to the information processing device 100.
  • the transmission unit 154 transmits the user's utterance information detected by the sound sensor of the sensor unit 16 to the information processing device 100.
  • the transmission unit 154 transmits the input information input by the user to the information processing device 100.
  • the transmission unit 154 transmits the input information voice-input by the user to the information processing device 100.
  • the transmission unit 154 transmits the input information input by the user's operation to the information processing device 100.
  • the transmission unit 154 transmits the user's utterance information in the target language to the information processing device 100.
  • the transmission unit 154 transmits the character information corresponding to the user's utterance in the target language to the information processing device 100.
  • the sensor unit 16 detects various sensor information.
  • the sensor unit 16 has a sound sensor (speaker) that detects sound.
  • the sensor unit 16 has a function as an imaging unit for capturing an image.
  • the sensor unit 16 has an image sensor function and detects image information.
  • the sensor unit 16 functions as an image input unit that receives an image as an input.
  • the sensor unit 16 is not limited to the above, and may have various sensors.
  • the sensor unit 16 is a position sensor, an acceleration sensor, a gyro sensor, a temperature sensor, a humidity sensor, an illuminance sensor, a pressure sensor, a proximity sensor, a sensor for receiving biological information such as odor, sweat, heartbeat, pulse, and brain wave. It may have various sensors. Further, the sensors that detect the above-mentioned various information in the sensor unit 16 may be common sensors, or may be realized by different sensors.
  • the display unit 17 is provided on the terminal device 10 and displays various information.
  • the display unit 17 is realized by, for example, a liquid crystal display, an organic EL (Electro-Luminescence) display, or the like.
  • the display unit 17 may be realized by any means as long as the information provided by the information processing device 100 can be displayed.
  • the display unit 17 displays various information according to the control by the execution unit 152.
  • the display unit 17 displays various information received by the reception unit 151.
  • the display unit 17 displays the response received from the information processing device 100.
  • the display unit 17 displays information related to language conversion.
  • FIG. 9 is a diagram showing an example of the response according to the embodiment of the present disclosure.
  • FIG. 9 shows an example of a response in a language-recognizable format.
  • the information processing system 1 may output information for the user to recognize when the input language and the language for which the semantic analysis processing is performed are different. For example, the information processing system 1 translates the input language and performs semantic analysis processing using the translated information in three phases of input, analysis, and output at the time of response. Output in a format that shows which language was processed.
  • the terminal device 10 displays a response in a language-recognizable format on the display unit 17.
  • the input language target language
  • the specific language translation destination language
  • the output language target language
  • the information processing system 1 outputs a response in a format that indicates in which language the processing was performed in three phases of input, analysis, and output.
  • FIG. 10 is a flowchart showing processing of the information processing apparatus according to the embodiment of the present disclosure. Specifically, FIG. 10 is a flowchart showing a procedure of information processing by the information processing apparatus 100.
  • the information processing apparatus 100 executes a semantic analysis process on the character information corresponding to each of the languages including the target language corresponding to the user's utterance (step S101). Then, the information processing apparatus 100 calculates an accuracy index value for each of the character information based on the result of the semantic analysis process (step S102). For example, the information processing apparatus 100 calculates an accuracy index value (semantic analysis accuracy) that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the character information.
  • an accuracy index value semantic analysis accuracy
  • FIG. 11 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure.
  • the processing shown in FIG. 11 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good.
  • the information processing system 1 acquires the voice information of the utterance by the user (step S201). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
  • the information processing system 1 performs voice recognition processing (step S202).
  • the information processing system 1 performs voice recognition processing for voice information spoken by the user.
  • the information processing system 1 acquires the utterance text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition.
  • the information processing system 1 determines whether the language is capable of language identification (speech recognition), and if possible, performs voice recognition processing. If the information processing system 1 is not a language capable of language identification (speech recognition), the processing may be terminated. In this case, the information processing system 1 may notify the user that the language is not compatible.
  • the information processing system 1 uses the utterance information as input language character information. When it is necessary to normalize the utterance information, the information processing system 1 may normalize the utterance information to the input language character information.
  • the information processing system 1 develops the utterance sentence (step S203).
  • the information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased.
  • the information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
  • the information processing system 1 has three (plurality) parameters of utterance development information B, utterance development information C, and utterance development information D, which are paraphrases of utterance development information A corresponding to the user's utterance. Generate a phrase.
  • the information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
  • the information processing system 1 develops the language (step S204).
  • the information processing system 1 translates (converts) the input language character information and its paraphrase into another language.
  • the information processing system 1 translates the input language character information of the target language into another language.
  • the information processing system 1 converts the input language character information into the character information of the translation destination language (translation destination character information).
  • translation destination character information For example, as shown in FIG. 6, the information processing system 1 translates the utterance development information A, which is Japanese input language character information, into English, Spanish, French, and the like.
  • the information processing system 1 translates each paraphrase of the target language into another language.
  • the information processing system 1 converts each paraphrase into character information (translation destination character information) of the translation destination language.
  • the information processing system 1 translates the utterance development information B, which is a Japanese paraphrase, into English, Spanish, French, and the like as shown in FIG.
  • the information processing system 1 translates the utterance development information C and the utterance development information D, which are Japanese paraphrases, into English, Spanish, French, and the like.
  • the information processing system 1 provides a list of translated sentences including the input language character information, its paraphrase, the translated sentence obtained by translating the input language character information into each language, and the translated sentence (translated paraphrase) obtained by translating the paraphrase. Generate.
  • the information processing system 1 performs loop processing of the translated sentence list (step S205).
  • the information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (also referred to as "selected sentence").
  • the information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S206).
  • the information processing system 1 generates the result of the semantic analysis process for the selected sentence.
  • the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
  • the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S207). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1).
  • the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed.
  • the information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
  • the information processing system 1 performs the processes of steps S206 to S207 with each sentence in the translated sentence list as a selected sentence by the loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
  • step S208: No the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S206 to S207.
  • the information processing system 1 ends the loop processing of the translated sentence list when there is no unprocessed sentence in the translated sentence list (step S208: Yes).
  • the information processing system 1 selects a semantic frame (step S209).
  • the information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
  • the information processing system 1 performs slot inverse transformation (step S210).
  • the information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy).
  • the information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language).
  • the information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language).
  • the information processing system 1 does not need to perform step S210 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
  • the information processing system 1 generates a response (step S211).
  • the information processing system 1 generates information such as images, sounds, and texts according to the output mode.
  • the information processing system 1 outputs the generated information (step S212).
  • the information processing system 1 displays images and texts and outputs sounds.
  • FIG. 12 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure.
  • the processing shown in FIG. 12 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good. The same points as in FIG. 11 will be omitted as appropriate.
  • the information processing system 1 acquires the voice information of the utterance by the user (step S301). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
  • the information processing system 1 performs voice recognition processing (step S302).
  • the information processing system 1 performs voice recognition processing for voice information spoken by the user.
  • the information processing system 1 acquires the utterance text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition.
  • the information processing system 1 determines whether the language is capable of language identification (speech recognition), and if possible, performs voice recognition processing. If the information processing system 1 is not a language capable of language identification (speech recognition), the processing may be terminated. In this case, the information processing system 1 may notify the user that the language is not compatible.
  • the information processing system 1 uses the utterance information as input language character information. When it is necessary to normalize the utterance information, the information processing system 1 may normalize the utterance information to the input language character information.
  • the information processing system 1 performs an utterance semantic analysis process on the input language character information for the utterance information (step S303).
  • the information processing system 1 generates the result of the semantic analysis process for the input language character information.
  • the information processing system 1 estimates information such as a domain goal of input language character information by utterance meaning analysis processing, and provides information on a meaning frame including information such as a meaning analysis score indicating the certainty of the estimated information. Generate.
  • the information processing system 1 develops the utterance sentence (step S305). For example, the information processing system 1 determines whether or not the accuracy of the semantic analysis process is low based on the comparison between the semantic analysis score of the semantic analysis process using the input language character information and a predetermined threshold value. When the semantic analysis score of the semantic analysis process using the input language character information is equal to or less than a predetermined threshold value, the information processing system 1 determines that the accuracy of the semantic analysis process is low. The information processing system 1 may determine whether or not the accuracy of the semantic analysis process is low by using the accuracy index value (semantic analysis accuracy) obtained by converting the semantic analysis score. In this case, the information processing system 1 determines that the accuracy of the semantic analysis process is low when the accuracy index value (semantic analysis accuracy) of the input language character information calculated by the equation (1) is equal to or less than a predetermined threshold value.
  • step S305 the information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased. Since this point is the same as the utterance sentence development in FIG. 11, the description thereof will be omitted.
  • the information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
  • the information processing system 1 develops the language (step S306).
  • the information processing system 1 translates (converts) the input language character information and its paraphrase into another language.
  • the information processing system 1 translates (converts) the input language character information and its paraphrase into another language by using the information of the specific language.
  • the information processing system 1 uses information in a specific language to translate (convert) input language character information and its paraphrases into another language capable of semantic analysis processing. Since this point is the same as the language development in FIG. 11, the description thereof will be omitted.
  • the information processing system 1 generates a translation sentence list including input language character information, its paraphrase, a translation sentence obtained by translating the input language character information into each language, and a translation sentence (translation paraphrase) obtained by translating the paraphrase.
  • the information processing system 1 performs loop processing of the translated sentence list (step S307).
  • the information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (selected sentence).
  • the information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S308).
  • the information processing system 1 generates the result of the semantic analysis process for the selected sentence. For example, the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
  • the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S309). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1).
  • the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed.
  • the information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
  • the information processing system 1 performs the processes of steps S308 to S309 with each sentence in the translated sentence list as a selected sentence by the loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
  • step S310: No When there is an unprocessed sentence in the translated sentence list (step S310: No), the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S308 to S309.
  • step S310: Yes the information processing system 1 ends the loop processing of the translated sentence list.
  • the information processing system 1 selects a semantic frame (step S311).
  • the information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
  • the information processing system 1 performs slot inverse transformation (step S312).
  • the information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy).
  • the information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language).
  • the information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language). Note that the information processing system 1 does not have to perform step S312 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
  • the information processing system 1 generates a response (step S313).
  • the information processing system 1 generates information such as images, sounds, and texts according to the output mode.
  • the information processing system 1 generates a response using the result of the semantic analysis process of the sentence having the maximum accuracy index value (semantic analysis accuracy) of each sentence.
  • step S304 the information processing system 1 generates a response using the result of the semantic analysis process of the input language character information. For example, when the accuracy of the semantic analysis process using the input language character information is high, the information processing system 1 responds by using the result of the semantic analysis process of the input language character information without performing the processes of steps S305 to S312. Generate.
  • the information processing system 1 outputs the generated information (step S314).
  • the information processing system 1 displays images and texts and outputs sounds.
  • FIG. 13 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure.
  • the processing shown in FIG. 13 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good.
  • the same points as those in FIGS. 11 and 12 will be omitted as appropriate.
  • the information processing system 1 acquires the voice information of the utterance by the user (step S401). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
  • the information processing system 1 performs voice recognition processing (step S402).
  • the information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the text (utterance information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition.
  • the information processing system 1 determines whether the language is not the target language (step S403). For example, the information processing system 1 determines that a language that cannot be language-identified (speech recognition) is a non-target language.
  • the information processing system 1 determines that the language is not a non-target language (step S403: Yes)
  • the information processing system 1 expands the utterance sentence (step S404).
  • the information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased. Since this point is the same as the utterance sentence development in FIG. 11, the description thereof will be omitted.
  • the information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
  • the information processing system 1 develops the language (step S405).
  • the information processing system 1 translates (converts) the input language character information and its paraphrase into another language.
  • the information processing system 1 translates (converts) the input language character information and its paraphrase into another language by using the information of the specific language.
  • the information processing system 1 uses information in a specific language to translate (convert) input language character information and its paraphrases into another language capable of semantic analysis processing. Since this point is the same as the language development in FIG. 11, the description thereof will be omitted.
  • the information processing system 1 generates a translation sentence list including input language character information, its paraphrase, a translation sentence obtained by translating the input language character information into each language, and a translation sentence (translation paraphrase) obtained by translating the paraphrase.
  • the information processing system 1 performs loop processing of the translated sentence list (step S406).
  • the information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (selected sentence).
  • the information processing system 1 estimates the quality of the selected sentence (step S407). For example, the information processing system 1 estimates the quality of the selected sentence (translated text) when the selected sentence is a translated sentence. For example, the information processing system 1 calculates the quality estimated translation accuracy (quality score) of the translated text (translated text) by an appropriate method, and sets the quality estimated translation accuracy (quality score) and the threshold value (for example, 0.75, etc.). compare. Then, if the quality estimated translation accuracy (quality score) of the translated sentence is equal to or higher than the threshold value, the information processing system 1 determines that the quality of the selected sentence is high (high score), and determines that the quality estimated translation accuracy of the selected sentence is high. If (quality score) is less than the threshold value, it is determined that the quality of the selected sentence is low (low score).
  • quality score quality estimated translation accuracy
  • step S407 determines whether manual editing of the selected sentence is completed within a predetermined time.
  • step S408 LONG TIME
  • the information processing system 1 rejects the processing of the selected sentence (step S415). For example, when the information processing system 1 determines that the manual editing of the selected sentence is not completed within a predetermined time, the processing of the selected sentence is interrupted. Then, the information processing system 1 generates a response in step S415 by using the reason for interruption for the selected sentence. The information processing system 1 continues the loop processing of the translated sentence list in step S406 for the remaining sentences (unprocessed sentences) even when the processing of a certain selected sentence is rejected.
  • step S408 SHORT TIME
  • the selected sentence is used in the manually edited character information in step S409.
  • ⁇ S410 is processed.
  • the information processing system 1 determines that the manual editing of the selected sentence is completed within a predetermined time
  • the information processing system 1 replaces the selected sentence with the manually edited sentence (edited sentence).
  • step S407 HIGH
  • the information processing system 1 performs the processes of steps S409 to S410 using the selection statement.
  • the information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S409).
  • the information processing system 1 generates the result of the semantic analysis process for the selected sentence. For example, the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
  • the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S410). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1).
  • the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed.
  • the information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
  • the information processing system 1 performs the processes of steps S409 to S410 by selecting each sentence in the translated sentence list by loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
  • step S411 When there is an unprocessed sentence in the translated sentence list (step S411: No), the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S409 to S410.
  • the information processing system 1 ends the loop processing of the translated sentence list when there is no unprocessed sentence in the translated sentence list (step S411: Yes).
  • the information processing system 1 selects a semantic frame (step S412).
  • the information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing.
  • the information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
  • the information processing system 1 performs slot inverse transformation (step S413).
  • the information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy).
  • the information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language).
  • the information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language). Note that the information processing system 1 does not have to perform step S413 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
  • the information processing system 1 generates a response (step S414).
  • the information processing system 1 generates information such as images, sounds, and texts according to the output mode.
  • the information processing system 1 generates a response using the result of the semantic analysis process of the sentence having the maximum accuracy index value (semantic analysis accuracy) of each sentence.
  • step S403 determines that the language is not the target language (step S403: No)
  • the information processing system 1 rejects the process (step S415).
  • the information processing system 1 interrupts the process.
  • the information processing system 1 generates a response in step S415 using the reason for interruption.
  • the information processing system 1 uses the reason for interruption to generate a response such as "processing is interrupted because of a language other than the target".
  • the information processing system 1 outputs the generated information (step S416).
  • the information processing system 1 displays images and texts and outputs sounds.
  • FIG. 14 is a conceptual diagram showing an example of processing by the information processing system.
  • the system process PS1 shown in FIG. 14 shows an example of the process realized by the information processing system 1.
  • FIG. 14 shows a schematic configuration diagram of the information processing system 1.
  • Speech expansion, accuracy conversion, score function, semantic frame language selection, language expansion translator, slot inverse conversion, and response generation in FIG. 14 are important points for improving the accuracy and support of semantic analysis, for example.
  • Speech expansion, precision conversion, score function, and semantic frame language selection are very important parts of the process.
  • the system process PS1 shown in FIG. 14 is a diagram conceptually showing each process from the input of the user's utterance to the output of the response, and the functions and hardware configurations for realizing each process.
  • each process shown in the system process PS1 is executed by the information processing apparatus 100.
  • the language expansion translator in the system processing PS1 is realized by the function of the conversion unit 132 of the information processing device 100.
  • the utterance meaning analyzer in the system processing PS1 is realized by the function of the execution unit 133 of the information processing device 100.
  • the information processing system 1 performs language identification processing for utterances by the user. Then, when the information processing system 1 can identify the language of the input utterance language (input language), the information processing system 1 develops the utterance sentence. For example, the information processing system 1 generates a utterance sentence list including character information corresponding to the user's utterance and a paraphrase of the character information by expanding the utterance sentence. For example, the information processing system 1 generates an utterance sentence list including the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 by the utterance sentence expansion.
  • the information processing system 1 includes the utterance development information A in which the user's utterance is converted into a character string, the utterance development information B which is a paraphrase of the utterance development information A, the utterance development information C, the utterance development information D, and the like. Generate an utterance list.
  • the language expansion translator may be a plurality of translators that translate (convert) an input language into each of a plurality of languages.
  • the language expansion translator is a first translator that translates Japanese into English, a second translator that translates Japanese into Spanish, a third translator that translates Japanese into English-French, and the like. It may be composed of a plurality of translators of.
  • the information processing system 1 generates character information in which the utterance development sentence is converted into each language by inputting each sentence (utterance development sentence) in the utterance sentence list into the translator of each language.
  • the information processing system 1 converts each sentence (speech expansion sentence) included in the utterance sentence list into a sentence of each translation destination language, thereby converting a sentence (translation sentence) of each translation destination language corresponding to each utterance development sentence. ) Is generated.
  • the information processing system 1 generates a translation list including sentences (translated sentences) of each translation destination language corresponding to each utterance development sentence.
  • the information processing system 1 uses a translator of each language for each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. A translation of each language corresponding to each of the development information B, the utterance development information C, and the utterance development information D is generated.
  • the information processing system 1 lists each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 in the translated text in FIG. Generate translations for each language. Taking English as an example, the information processing system 1 translates each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 into English (first translator). ) Is used to generate four English sentences corresponding to each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D.
  • the information processing system 1 adds each sentence of the utterance sentence list to the translation list to generate a translation list.
  • the information processing system 1 generates a translation list including the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D because Japanese is also a specific language in the example of FIG.
  • the information processing system 1 estimates the quality of the translation result. For example, the information processing system 1 estimates the quality of the translated text in the translation list.
  • the information processing system 1 determines that at least one translated sentence has a high score
  • the information processing system 1 performs a semantic analysis process by the utterance semantic analyzer using the translated sentence having a high score.
  • the information processing system 1 determines that at least one of the translated sentences in the translation list is of high quality
  • the information processing system 1 performs a semantic analysis process by the utterance semantic analyzer using the translated sentence of high quality.
  • the information processing system 1 generates a response indicating that the processing is interrupted for that language.
  • the information processing system 1 generates a response indicating that the processing is interrupted when it is determined that the quality of all the translated sentences translated into a specific language is low.
  • the information processing system 1 generates a response such as "The translation accuracy from the XX language to the YY language is low, so the process is interrupted.”
  • the information processing system 1 generates a response such as "The input language is interrupted because the translation accuracy from Korean to English for semantic analysis is low.”
  • the information processing system 1 When the input language is a specific language, the information processing system 1 performs semantic analysis processing by the utterance meaning analyzer using the utterance development sentence of the input language and the translated sentence having a high score. Further, when the input language is a specific language, the information processing system 1 may perform semantic analysis processing by the utterance meaning analyzer using the utterance expansion sentence even when there is no translated sentence having a high score. ..
  • the information processing system 1 performs processing related to post-editing when it is determined that all languages (specific languages) have low scores. For example, when the information processing system 1 determines that the quality of all the translated sentences is low, the information processing system 1 performs a process related to post-editing. When the input language is a specific language, the information processing system 1 performs semantic analysis processing by the utterance semantic analyzer using the utterance expansion sentence even when it is determined that the quality of all the translated sentences is low. , It is not necessary to perform the processing related to post-editing.
  • the information processing system 1 determines that the input language is not a specific language and the translated sentences of all the languages have a low score, and the processing for the user's utterance does not require immediacy, the translation result is obtained.
  • the information processing system 1 causes a cloud worker to perform manual editing.
  • the information processing system 1 causes the cloud worker to manually edit the translation result by transmitting the utterance development sentence and the translated sentence to a device (terminal device 10 or the like) used by the cloud worker.
  • the information processing system 1 rejects the processing when the post-editing processing time is long (long time). For example, the information processing system 1 rejects the process if the manual editing by a cloud worker or the like is not completed within a predetermined time. In this case, the information processing system 1 generates a response such as "The translation time from the XX language to the YY language is long, so the process is interrupted.” For example, the information processing system 1 generates a response such as "The translation time from the input language Dutch to Japanese for semantic analysis is long, so the process is interrupted.”
  • the information processing system 1 performs the semantic analysis processing by the utterance semantic analyzer using the translated sentence edited by hand.
  • the information processing system 1 outputs each sentence (semantic analysis target sentence) by performing semantic analysis processing on the utterance development sentence and the translated sentence having a high score in the translation list by the utterance semantic analyzer.
  • the information processing system 1 obtains the result of the semantic analysis for each sentence to be analyzed.
  • the information processing system 1 generates information of a semantic frame corresponding to each semantic analysis target sentence by a semantic analysis process.
  • the information processing system 1 specifies the Domain-Goal (domain goal) corresponding to each semantic analysis target sentence by the semantic analysis process, and generates information of the semantic frame in which the slot value is set.
  • the information processing system 1 generates information of a semantic frame including a Domain-Goal (domain goal) corresponding to each semantic analysis target sentence and a score (semantic analysis score) indicating the certainty of the slot value.
  • the information processing system 1 converts the semantic analysis score (analysis score) of each semantic analysis target sentence into the semantic analysis accuracy (%) which is an accuracy index value.
  • the information processing system 1 calculates the semantic analysis accuracy (%) by using the semantic analysis score (analysis score) and the score function.
  • the information processing system 1 calculates the semantic analysis accuracy by using the information indicating the language and the score function that outputs the semantic analysis accuracy by inputting the semantic analysis score (analysis score).
  • the information processing system 1 selects the language of the semantic frame. For example, in the information processing system 1, when there are a plurality of semantic analysis target sentences for which utterance semantic analysis processing has been performed, the language of the semantic frame of the semantic analysis target sentence is selected by selecting the semantic analysis target sentence based on the semantic analysis accuracy. Is selected as the language to be used for the subsequent processing. The information processing system 1 selects the language of the semantic frame of the semantic analysis target sentence as the language to be used for the subsequent processing by selecting the semantic analysis target sentence having the maximum semantic analysis accuracy. When the information processing system 1 has one semantic analysis target sentence for which the utterance semantic analysis process has been performed, the information processing system 1 selects the language of the semantic frame of the semantic analysis target sentence as the language to be used for the subsequent processing.
  • the information processing system 1 performs slot inverse transformation using the knowledge DB.
  • the information processing system 1 performs reverse slot conversion using reverse translation or a multilingual dictionary.
  • the information processing system 1 performs slot inverse conversion using a knowledge database such as an external knowledge information providing server or a knowledge information storage unit 125.
  • the information processing system 1 reversely converts the slot value of the translation destination language in the semantic frame into the slot value of the input language (target language).
  • the information processing system 1 does not have to perform the slot inverse transformation process when the inverse transformation is unnecessary, such as when the language of the semantic frame is the input language (target language).
  • the information processing system 1 generates a response.
  • the information processing system 1 generates information corresponding to the result of the semantic analysis process.
  • the information processing system 1 generates response information based on the domain-goal (domain goal) of the semantic frame and the slot value information.
  • FIG. 15 is a diagram showing an example of specific processing by the information processing system.
  • the processing example shown in FIG. 15 corresponds to the flow shown in FIG.
  • the process shown in FIG. 15 may be performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1.
  • the same points as those in FIGS. 11 to 14 will be omitted as appropriate.
  • the user speaks in Korean, saying "Play a song of XXX (Korean)".
  • the information processing system 1 acquires the input IN1 of the user in Korean. It is assumed that "XXX" is a predetermined artist name (singer name).
  • the information processing system 1 performs semantic analysis processing in Korean.
  • the information processing system 1 acquires the result of the semantic analysis in Korean as shown in the result RS1.
  • the information processing system 1 acquires a result indicating that the domain is "cooking” and the score value is "low” in the semantic analysis in Korean.
  • the information processing system 1 uses a Korean semantic analyzer to generate a semantic analysis result in which the domain is specified as "cooking” and the semantic analysis score is a low numerical value such as "0.2".
  • the information processing system 1 Since the score of the semantic analysis in Korean is low and the accuracy is low, the information processing system 1 performs the semantic analysis process using the English translation of the Korean word "Korean song (Korean)". conduct. The information processing system 1 performs a semantic analysis process using the English translation "Play XXX song", and as shown in the result RS2, the domain (domain goal) is "music playback" and its score. Get the result indicating that the value is "high”. The information processing system 1 uses an English semantic analyzer to generate a semantic analysis result in which the domain is specified as "music reproduction” and the semantic analysis score is a high numerical value such as "0.9".
  • the information processing system 1 determines the response to the user using the English semantic analysis result.
  • the information processing system 1 determines to play the song of the artist "XXX” using the English semantic analysis result. It is assumed that the song title is “AAA & BBB” artist “XXX”.
  • the information processing system 1 outputs a response in Korean saying "AAA & BBB is reproduced (Korean)". Then, the information processing system 1 executes a process of playing the song "AAA & BBB" of the artist "XXX”.
  • the information processing system 1 may notify the user of the input language (Korean) and the analysis language (English). In addition, the information processing system 1 provides feedback to the user. As shown in the feedback FB1, the information processing system 1 notifies the user in Korean that "the accuracy of Korean is low, so the analysis was performed in English (Korean)".
  • the notification mode may be various modes such as voice output and display on the screen. Further, the information processing system 1 may turn on a light of a color (for example, green or the like) indicating that the analysis is in English, or may output a sound effect (for example, pip or the like) meaning translation. ..
  • Language identification is a known technique that assumes a plurality of input languages and automatically identifies the language of the speaker, and is often used at the same time as voice recognition processing in recent years.
  • language identification is roughly divided into a case of judging from an acoustic model of speech recognition and a case of judging from a language model.
  • the former has the advantage that the time (delay) required for language identification can be reduced, and the latter enables more accurate language identification, which is a trade-off relationship between delay and accuracy.
  • the information processing system 1 may perform language identification by either a process of determining from an acoustic model of voice recognition or a process of determining from a language model.
  • the processing is interrupted and the reason for the interruption is included in the response.
  • the information processing system 1 outputs such as "ZZ language does not support translation and semantic analysis".
  • the information processing system 1 may first perform semantic analysis processing of only input utterance and input language when there is a viewpoint of time constraints, for example.
  • the information processing system 1 makes it possible to execute either the utterance sentence expansion process, the language expansion process, or both when the result of the semantic analysis process of only the input utterance and the input language is low accuracy.
  • the score value semantic analysis score
  • the semantic analysis accuracy of the semantic analysis processing of only the input utterance and the input language is less than a certain value (threshold)
  • the information processing system 1 performs the processing of the utterance sentence expansion and the language expansion. Make either or both feasible.
  • the value of this constant value (threshold value) can be set as appropriate. ..
  • the value of this constant value (threshold value) may be appropriately set according to the purpose of processing and the like. For example, the higher the accuracy required for processing, the larger the constant value (threshold value) may be set.
  • the input language is Korean and the semantic analysis process becomes the cooking domain due to low accuracy, and then the utterance sentence converted by the English translation makes the semantic analysis process highly accurate again and is changed to the music domain. ..
  • the information processing system 1 can include a response sentence such as "translated and analyzed in English because of low accuracy" in the response.
  • Quality estimation is a process (module) for estimating the accuracy of the output of a translator. If the accuracy is low, it is difficult to obtain the desired result even with the semantic analysis process, and in order to avoid the disadvantage to the user due to the semantic analysis estimating another domain goal, at the stage of the translation result. This is to stop the processing. That is, the information processing system 1 performs a normal semantic analysis process when the score value (quality score) of the quality estimation is higher than a certain level, interrupts the process when the score value is lower than a certain level, and includes the reason for the interruption in the response.
  • the information processing system 1 when only the quality scores of English, French and German have a threshold value of "0.75", the information processing system 1 has three languages of English, French and German, which are languages having a threshold value of "0.75" or higher. Performs normal semantic analysis processing, other languages interrupt the processing and include the reason for the interruption in the response.
  • the information processing system 1 outputs such as "The processing is interrupted because the translation accuracy from Dutch to Japanese is low". This allows the user to know which language was unable to successfully translate.
  • the information processing system 1 may output (notify) in various expression formats as long as it can be recognized by the user, such as displaying an error sound or an identifiable color on the device.
  • the information processing system 1 calculates a quality estimation score value (quality score) for each sentence of all the translated sentence lists generated by the utterance sentence expansion and the language expansion, and any one of them is above a certain level. If it is a value, the semantic analysis process is executed for all specific languages above a certain level. On the other hand, when all the specific languages have values below a certain level, the information processing system 1 interrupts the processing and includes the reason for the interruption in the response. For example, the information processing system 1 outputs such as "Translation to a processable language has low accuracy, so processing is interrupted.”
  • the information processing system 1 causes a cloud worker (translation editor) to manually correct the translation result (post-editing) when the translation result of all specific languages is below a certain level. It is also possible. However, since it is manually edited and it takes a long time to process, this process is expected to be used for which immediacy is not required. Therefore, if the processing is not completed even after a certain period of time has passed, the information processing system 1 interrupts the processing and includes the reason for the interruption in the response. For example, the information processing system 1 outputs such as "It will be interrupted because the manual translation from Hindi to Japanese takes a long time.” Further, the information processing system 1 may adjust a certain time or more according to the domain goal.
  • the information processing system 1 may change various values such as a threshold value.
  • a threshold value For example, in the information processing system 1, the user may freely set the translation accuracy and the waiting time above a certain level in addition to the default values of the system. Depending on the user, if you want to obtain only high quality products, set a high accuracy threshold, and if you want results even if it takes time, you can freely customize by setting a long waiting time. , It is possible to optimize for each user.
  • the terminal device 10 performs conversion processing, semantic analysis processing, inverse transformation theory, etc. May be done. That is, the terminal device 10 which is a device on the client side may be an information processing device that performs the above-mentioned conversion processing, semantic analysis processing, inverse conversion theory, and the like.
  • the system configuration of the information processing system 1 is not limited to the configuration in which the information processing device 100, which is a device on the server side, performs conversion processing, semantic analysis processing, inverse transformation, and the like, and is a terminal device which is a device on the client side. 10 may be configured to perform the above-mentioned conversion processing, semantic analysis processing, inverse transformation theory, and the like.
  • the information processing system 1 performs translation, semantic analysis, and inverse conversion on the client side (terminal device 10). Then, the server side (information processing device 100) acquires the information of the semantic analysis result and the inverse conversion result from the terminal device 10 and performs various processes.
  • the execution unit 152 of the terminal device 10 may have the same function as the execution unit 133 of the information processing device 100. Further, the terminal device 10 may have a conversion unit that realizes the same function as the conversion unit 132 described above, and an inverse conversion unit that realizes the same function as the inverse conversion unit 136. Further, in this case, the information processing apparatus 100 does not have to have the conversion unit 132 and the inverse conversion unit 136.
  • the information processing system 1 may have a system configuration in which the client side (terminal device 10) analyzes the meaning of the utterance and the server side (information processing device 100) performs inverse conversion.
  • the terminal device 10 which is a device on the client side is an information processing device which performs the above-mentioned conversion processing and the semantic analysis processing
  • the information processing device 100 which is a device on the server side performs the above-mentioned inverse conversion processing. It may be.
  • the conversion unit and the execution unit 152 of the terminal device 10 perform conversion processing and semantic analysis processing
  • the inverse conversion unit 136 of the information processing device 100 performs reverse conversion processing.
  • the information processing system 1 may have a system configuration in which either the client-side device (terminal device 10) or the server-side device (information processing device 100) performs each process.
  • each component of each device shown in the figure is a functional concept, and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of the device is functionally or physically dispersed / physically distributed in any unit according to various loads and usage conditions. Can be integrated and configured.
  • the information processing device includes an execution unit (execution unit 133 in the embodiment) and a calculation unit (calculation unit 134 in the embodiment).
  • the execution unit executes a semantic analysis process on one or more character information corresponding to each of one or more languages including a target language which is a language corresponding to the user's utterance.
  • the calculation unit is an accuracy index that makes it possible to compare the accuracy of the semantic analysis processing for each of the one or more character information among a plurality of languages based on the result of the semantic analysis processing corresponding to each of the one or more character information. Calculate the value.
  • the information processing apparatus determines the accuracy of the semantic analysis processing for each of the one or more character information between a plurality of languages based on the result of the semantic analysis processing of each of the one or more character information. By calculating the accuracy index value that can be compared with, it is possible to compare the semantic analysis accuracy between languages.
  • the calculation unit calculates each accuracy index value of one or more character information by using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process.
  • the information processing apparatus can appropriately calculate the accuracy index value by using the function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the calculation unit calculates the accuracy index value of each of one or more character information by using a function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process.
  • the information processing apparatus can appropriately convert the score into the accuracy index value by using the function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the calculation unit uses a function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating one language, and the accuracy index of the character information of one language. Calculate the value.
  • the information processing apparatus can appropriately convert the score into the accuracy index value by using the function that outputs the accuracy index value by inputting the score and the language information. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the execution unit executes a semantic analysis process on one or more character information including one character information corresponding to the user's utterance in the target language.
  • the information processing device can compare the semantic analysis accuracy between languages including the target language by calculating the accuracy index value for each of one or more character information including the character information of the target language. Can be.
  • the execution unit executes a semantic analysis process on one or more character information including the translated character information in which one character information is converted into the translation destination language which is the translation destination of the target language.
  • the information processing device calculates the accuracy index value for each of one or more character information including the character information converted into the translation destination language, thereby performing semantic analysis between the languages including the translation destination language. It is possible to compare the accuracy.
  • the execution unit executes a semantic analysis process on one or more character information including a paraphrase in which one character information of the target language is paraphrased into another expression of the target language.
  • the information processing device calculates the accuracy index value for each of one or more character information including the paraphrase paraphrased into another expression of the character information of the target language, so that the user can make a plurality of utterances. After expanding to the expression of, it is possible to compare the accuracy of each of the expressions. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the execution unit executes a semantic analysis process on one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to which the target language is translated.
  • the information processing device sets the accuracy index value for each of one or more character information including the character information in which the paraphrase paraphrased into another expression of the character information of the target language is converted into the translation destination language.
  • the information processing device includes a conversion unit (conversion unit 132 in the embodiment).
  • the conversion unit converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language.
  • the execution unit executes a semantic analysis process on the character information converted by the conversion unit.
  • the information processing device converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language, and performs semantic analysis processing to perform the semantic analysis of each translation destination language.
  • the accuracy index value corresponding to can be calculated. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the conversion unit converts the character information corresponding to the target language into the character information corresponding to the language whose meaning can be interpreted.
  • the information processing device converts the character information corresponding to the target language into the character information corresponding to the language that can interpret the meaning (specific language), and performs the semantic analysis process to convert the character information into each translation destination language.
  • the corresponding accuracy index value can be calculated. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
  • the information processing apparatus includes a selection unit (selection unit 135 in the embodiment).
  • the selection unit selects the processing target character information, which is the character information used for processing, from the one or more character information based on each accuracy index value of the one or more character information calculated by the calculation unit.
  • the information processing apparatus appropriately selects the character information used for processing by selecting the processing target character information which is the character information used for processing from among one or more character information based on the accuracy index value. can do. Therefore, the information processing apparatus can compare the accuracy of semantic analysis between languages and appropriately perform processing based on the comparison result.
  • the selection unit selects the character information having the maximum accuracy index value as the character information to be processed.
  • the information processing apparatus can perform processing using the character information having the maximum accuracy by selecting the character information having the maximum accuracy index value as the character information to be processed. Therefore, the information processing apparatus can compare the accuracy of semantic analysis between languages and appropriately perform processing based on the comparison result.
  • the calculation unit calculates the accuracy index value corresponding to the character information of the target language.
  • the selection unit selects the character information of the target language as the processing target character information.
  • the information processing apparatus selects the character information of the target language as the processing target character information, so that the accuracy of the target language is high.
  • the language can be used for processing. Therefore, the information processing apparatus can appropriately perform processing using the target language.
  • the execution unit does not execute the semantic analysis process of a language other than the target language.
  • the information processing apparatus can suppress an increase in the processing load by not executing the semantic analysis process of a language other than the target language. .. Therefore, the information processing apparatus can appropriately perform processing while suppressing an increase in processing load.
  • the information processing apparatus includes an inverse conversion unit (in the embodiment, an inverse conversion unit 136).
  • the inverse transformation unit converts the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language.
  • the information processing apparatus can obtain the information of the semantic analysis corresponding to the target language by converting the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
  • the inverse conversion unit converts the result of the semantic analysis process into the target language when the language of the processing target character information is other than the target language.
  • the information processing device converts the result of the semantic analysis process into the target language, and the result of the semantic analysis other than the target language is converted into the target language. You can get the information available in. Therefore, the information processing apparatus can appropriately perform processing using the target language.
  • the inverse transformation unit converts a part of the result of the semantic analysis process into the target language.
  • the information processing device converts a part of the result of the semantic analysis process into the target language, and by converting only the necessary information into the target language, the result of the semantic analysis other than the target language is obtained. , You can get the information available in the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
  • the inverse conversion unit converts the slot value of the result of the semantic analysis process into the target language.
  • the information processing device converts the slot value of the result of the semantic analysis process into the target language, thereby converting only the information (slot value) necessary for executing the service, for example, into the target language. It is possible to obtain information that makes the results of semantic analysis in a language other than the target language available in the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
  • the execution unit executes a semantic analysis process on one or more character information corresponding to each of the one or more languages including the target language.
  • the information processing apparatus can appropriately perform processing on the language-identifiable language by executing the semantic analysis process.
  • the information processing device can perform processing on a language that is not language-identifiable, and can suppress improper processing or output of inappropriate results.
  • FIG. 20 is a hardware configuration diagram showing an example of a computer 1000 that realizes the functions of information processing devices such as the information processing device 100 and the terminal device 10.
  • the computer 1000 includes a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, an HDD (Hard Disk Drive) 1400, a communication interface 1500, and an input / output interface 1600.
  • Each part of the computer 1000 is connected by a bus 1050.
  • the CPU 1100 operates based on the program stored in the ROM 1300 or the HDD 1400, and controls each part. For example, the CPU 1100 expands the program stored in the ROM 1300 or the HDD 1400 into the RAM 1200 and executes processing corresponding to various programs.
  • the ROM 1300 stores a boot program such as a BIOS (Basic Input Output System) executed by the CPU 1100 when the computer 1000 is started, a program that depends on the hardware of the computer 1000, and the like.
  • BIOS Basic Input Output System
  • the HDD 1400 is a computer-readable recording medium that non-temporarily records a program executed by the CPU 1100 and data used by the program.
  • the HDD 1400 is a recording medium for recording an information processing program according to the present disclosure, which is an example of program data 1450.
  • the communication interface 1500 is an interface for the computer 1000 to connect to an external network 1550 (for example, the Internet).
  • the CPU 1100 receives data from another device or transmits data generated by the CPU 1100 to another device via the communication interface 1500.
  • the input / output interface 1600 is an interface for connecting the input / output device 1650 and the computer 1000.
  • the CPU 1100 receives data from an input device such as a keyboard or mouse via the input / output interface 1600. Further, the CPU 1100 transmits data to an output device such as a display, a speaker, or a printer via the input / output interface 1600. Further, the input / output interface 1600 may function as a media interface for reading a program or the like recorded on a predetermined recording medium (media).
  • the media is, for example, an optical recording medium such as DVD (Digital Versatile Disc) or PD (Phase change rewritable Disk), a magneto-optical recording medium such as MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory.
  • an optical recording medium such as DVD (Digital Versatile Disc) or PD (Phase change rewritable Disk)
  • MO Magnetic-Optical disk
  • tape medium a magnetic recording medium
  • magnetic recording medium or a semiconductor memory.
  • semiconductor memory for example, when the computer 1000 functions as the information processing device 100 according to the embodiment, the CPU 1100 of the computer 1000 realizes the functions of the control unit 130 and the like by executing the information processing program loaded on the RAM 1200. Further, the information processing program according to the present disclosure and the data in the storage unit 120 are stored in the HDD 1400. The CPU 1100 reads the program data 1450 from the HDD 1400 and executes the program, but as another example, these programs
  • the present technology can also have the following configurations.
  • An execution unit that executes semantic analysis processing for one or more character information corresponding to each of one or more languages including a target language that is a language corresponding to a user's utterance.
  • An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information.
  • the calculation unit that calculates the value and Information processing device equipped with.
  • (2) The calculation unit Using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
  • the information processing device according to (1).
  • the calculation unit Using the function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
  • the calculation unit Using the function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating the one language, the accuracy of the character information of the one language. Calculate the index value, The information processing device according to (3).
  • the execution unit The semantic analysis process is executed on the one or more character information including one character information corresponding to the utterance of the user in the target language.
  • the information processing device according to any one of (1) to (4).
  • the execution unit The semantic analysis process is executed on the one or more character information including the translated character information in which the one character information is converted into the translation destination language to be the translation destination of the target language.
  • the information processing device according to (5).
  • the execution unit The semantic analysis process is executed on the one or more character information including the paraphrase in which the one character information of the target language is paraphrased into another expression of the target language.
  • the information processing device according to (5) or (6).
  • the execution unit The semantic analysis process is executed on the one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to be the translation destination of the target language.
  • the information processing device according to (7).
  • a conversion unit that converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language. With more The execution unit The semantic analysis process is executed on the character information converted by the conversion unit.
  • the information processing device according to any one of (6) to (8).
  • the conversion unit The character information corresponding to the target language is converted into the character information corresponding to the language that can interpret the meaning.
  • the information processing device according to (9).
  • a selection unit that selects processing target character information, which is character information used for processing, from the one or more character information based on the accuracy index value of each of the one or more character information calculated by the calculation unit.
  • the information processing apparatus according to any one of (1) to (10).
  • the selection unit The character information having the maximum accuracy index value is selected as the processing target character information.
  • the information processing device (11).
  • the calculation unit The accuracy index value corresponding to the character information of the target language is calculated, and the accuracy index value is calculated.
  • the selection unit When the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the character information of the target language is selected as the processing target character information.
  • the execution unit When the accuracy index value corresponding to the target language is equal to or higher than a predetermined value, the semantic analysis process of a language other than the target language is not executed.
  • the information processing device 13).
  • the information processing device according to any one of (1) to (18).
  • Semantic analysis processing is executed for one or more character information corresponding to each of one or more languages including the target language which is the language corresponding to the user's utterance.
  • An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information. Calculate the value, An information processing method that executes processing.
  • Information processing system 100 Information processing device 110 Communication unit 120 Storage unit 121 Language information storage unit 122 Semantic frame information storage unit 123 Analysis accuracy information storage unit 124 Threshold information storage unit 125 Knowledge information storage unit 130 Control unit 131 Acquisition unit 132 Conversion unit 133 Execution unit 134 Calculation unit 135 Selection unit 136 Inverse conversion unit 137 Generation unit 138 Transmission unit 10 Terminal device 11 Communication unit 12 Input unit 13 Output unit 14 Storage unit 15 Control unit 151 Reception unit 152 Execution unit 153 Reception unit 154 Transmission unit 16 Sensor unit 17 Display unit

Abstract

An information processing device according to the present invention is provided with: an execution unit that executes a meaning analysis process for one or more pieces of character information corresponding to one or more languages including a target language which corresponds to utterance by a user; and a calculation unit that calculates an accuracy index value enabling the accuracy of the meaning analysis process to be compared among the languages for each of the one or more pieces of character information on the basis of the result of the meaning analysis process corresponding to each of the one or more pieces of character information.

Description

情報処理装置及び情報処理方法Information processing device and information processing method
 本開示は、情報処理装置及び情報処理方法に関する。 This disclosure relates to an information processing device and an information processing method.
 近年、意味解析や翻訳等の自然言語処理に関する技術が知られている。例えば、意味解析技術を用いて、翻訳元言語文を翻訳先言語に変換する機械翻訳システムが提供されている。 In recent years, technologies related to natural language processing such as semantic analysis and translation have been known. For example, a machine translation system that converts a translation source language sentence into a translation destination language by using a semantic analysis technique is provided.
特開2004-318344号公報Japanese Unexamined Patent Publication No. 2004-318344
 従来技術によれば、翻訳元言語文を意味解析した結果と、翻訳した翻訳結果文を意味解析した結果とを比較し翻訳結果文を選択する処理を行う。 According to the prior art, the translation result sentence is selected by comparing the result of the semantic analysis of the translation source language sentence with the result of the semantic analysis of the translated translation result sentence.
 しかしながら、従来技術は、翻訳元言語の結果と翻訳先言語の結果との類似度に基づいて、翻訳結果文を選択している。この場合、例えば翻訳元言語の意味解析の精度が低い場合、その精度が低い翻訳元言語の結果に基づいて翻訳結果文の選択を行うこととなり、適切な処理を行うことが難しい。また、複数の言語での意味解析の結果が有る場合に、そのいずれの意味解析の結果が意味解析の結果として好適であるかの判別が難しいといった課題もある。そのため、言語間における意味解析精度の比較を可能にすることが望まれている。 However, in the prior art, the translation result sentence is selected based on the similarity between the result of the translation source language and the result of the translation destination language. In this case, for example, when the accuracy of the semantic analysis of the translation source language is low, the translation result sentence is selected based on the result of the translation source language with low accuracy, and it is difficult to perform appropriate processing. Further, when there are results of semantic analysis in a plurality of languages, there is also a problem that it is difficult to determine which of the results of the semantic analysis is suitable as the result of the semantic analysis. Therefore, it is desired to enable comparison of semantic analysis accuracy between languages.
 そこで、本開示では、言語間における意味解析精度の比較を可能にすることができる情報処理装置及び情報処理方法を提案する。 Therefore, in this disclosure, we propose an information processing device and an information processing method that can compare the accuracy of semantic analysis between languages.
 上記の課題を解決するために、本開示に係る一形態の情報処理装置は、ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する実行部と、前記1以上の文字情報の各々に対応する前記意味解析処理の結果に基づいて、前記1以上の文字情報の各々に対して、前記意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する算出部と、を備える。 In order to solve the above problems, the information processing apparatus of one form according to the present disclosure relates to one or more character information corresponding to each of one or more languages including a target language which is a language corresponding to a user's utterance. Based on the execution unit that executes the semantic analysis process and the result of the semantic analysis process corresponding to each of the one or more character information, the accuracy of the semantic analysis process is applied to each of the one or more character information. It is provided with a calculation unit for calculating an accuracy index value that enables comparison between a plurality of languages.
本開示の実施形態に係る情報処理の一例を示す図である。It is a figure which shows an example of information processing which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理システムの構成例を示す図である。It is a figure which shows the structural example of the information processing system which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理装置の構成例を示す図である。It is a figure which shows the structural example of the information processing apparatus which concerns on embodiment of this disclosure. 本開示の実施形態に係る言語情報記憶部の一例を示す図である。It is a figure which shows an example of the language information storage part which concerns on embodiment of this disclosure. 本開示の実施形態に係る意味フレーム情報記憶部の一例を示す図である。It is a figure which shows an example of the semantic frame information storage part which concerns on embodiment of this disclosure. 本開示の実施形態に係る解析精度情報記憶部の一例を示す図である。It is a figure which shows an example of the analysis accuracy information storage part which concerns on embodiment of this disclosure. 本開示の実施形態に係る閾値情報記憶部の一例を示す図である。It is a figure which shows an example of the threshold information storage part which concerns on embodiment of this disclosure. 本開示の実施形態に係る端末装置の構成例を示す図である。It is a figure which shows the structural example of the terminal apparatus which concerns on embodiment of this disclosure. 本開示の実施形態に係る応答の一例を示す図である。It is a figure which shows an example of the response which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理装置の処理を示すフローチャートである。It is a flowchart which shows the process of the information processing apparatus which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。It is a flowchart which shows the process of the information processing system which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。It is a flowchart which shows the process of the information processing system which concerns on embodiment of this disclosure. 本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。It is a flowchart which shows the process of the information processing system which concerns on embodiment of this disclosure. 情報処理システムによる処理の一例を示す概念図である。It is a conceptual diagram which shows an example of processing by an information processing system. 情報処理システムによる具体的な処理の一例を示す図である。It is a figure which shows an example of the specific processing by an information processing system. 解析精度の一例を示す図である。It is a figure which shows an example of analysis accuracy. 解析精度の一例を示す図である。It is a figure which shows an example of analysis accuracy. 解析精度とスコアとの関係の一例を示す図である。It is a figure which shows an example of the relationship between analysis accuracy and score. 他の言語を介した翻訳の一例を示す図である。It is a figure which shows an example of translation through other languages. 情報処理装置の機能を実現するコンピュータの一例を示すハードウェア構成図である。It is a hardware block diagram which shows an example of the computer which realizes the function of an information processing apparatus.
 以下に、本開示の実施形態について図面に基づいて詳細に説明する。なお、この実施形態により本願にかかる情報処理装置及び情報処理方法が限定されるものではない。また、以下の各実施形態において、同一の部位には同一の符号を付することにより重複する説明を省略する。 Hereinafter, embodiments of the present disclosure will be described in detail with reference to the drawings. The information processing apparatus and information processing method according to the present application are not limited by this embodiment. Further, in each of the following embodiments, duplicate description will be omitted by assigning the same reference numerals to the same parts.
 以下に示す項目順序に従って本開示を説明する。
  1.実施形態
   1-1.本開示の実施形態に係る情報処理の概要
    1-1-1.概要、背景及び効果等
    1-1-2.スコア関数
     1-1-2-1.関数生成例
    1-1-3.処理の主な流れ
   1-2.実施形態に係る情報処理システムの構成
   1-3.実施形態に係る情報処理装置の構成
   1-4.実施形態に係る端末装置の構成
   1-5.応答例
   1-6.実施形態に係る情報処理の手順
    1-6-1.情報処理装置に係る処理の手順
    1-6-2.情報処理システムに係る処理の手順
    1-6-3.情報処理システムに係る処理の他の手順その1
    1-6-4.情報処理システムに係る処理の他の手順その2
   1-7.情報処理システムによる処理の概念図
    1-7-1.情報処理システムによる処理の具体例
   1-8.処理の詳細等
    1-8-1.言語識別
    1-8-2.低精度の意味解析器の改善
    1-8-3.品質推定
    1-8-4.クラウドワーカ
    1-8-5.翻訳精度・時間の調整
  2.その他の実施形態
   2-1.クライアント側で意味解析処理等を行う構成例
   2-2.その他の構成例
   2-3.その他
  3.本開示に係る効果
  4.ハードウェア構成
The present disclosure will be described according to the order of items shown below.
1. 1. Embodiment 1-1. Outline of information processing according to the embodiment of the present disclosure 1-1-1. Outline, background, effects, etc. 1-1-2. Score function 1-1-2-1. Function generation example 1-1-3. Main flow of processing 1-2. Configuration of Information Processing System According to Embodiment 1-3. Configuration of Information Processing Device According to Embodiment 1-4. Configuration of the terminal device according to the embodiment 1-5. Response example 1-6. Information processing procedure according to the embodiment 1-6-1. Procedure for processing related to information processing equipment 1-6-2. Procedure for processing related to information processing system 1-6-3. Other Procedures for Processing Related to Information Processing Systems Part 1
1-6-4. Other procedure of processing related to information processing system Part 2
1-7. Conceptual diagram of processing by information processing system 1-7-1. Specific examples of processing by an information processing system 1-8. Details of processing, etc. 1-8-1. Language identification 1-8-2. Improvement of low-precision semantic analyzer 1-8-3. Quality estimation 1-8-4. Cloud worker 1-8-5. Translation accuracy and time adjustment 2. Other Embodiments 2-1. Configuration example of performing semantic analysis processing on the client side 2-2. Other configuration examples 2-3. Others 3. Effect of this disclosure 4. Hardware configuration
[1.実施形態]
[1-1.本開示の実施形態に係る情報処理の概要]
 図1は、本開示の実施形態に係る情報処理の一例を示す図である。本開示の実施形態に係る情報処理は、情報処理装置100(図3参照)や端末装置10(図8参照)を含む情報処理システム1(図2参照)によって実現される。図1では、情報処理システム1によって実現される情報処理の概要を説明する。図1は、本開示の実施形態に係る情報処理の一例を示す図である。
[1. Embodiment]
[1-1. Outline of information processing according to the embodiment of the present disclosure]
FIG. 1 is a diagram showing an example of information processing according to the embodiment of the present disclosure. The information processing according to the embodiment of the present disclosure is realized by the information processing system 1 (see FIG. 2) including the information processing device 100 (see FIG. 3) and the terminal device 10 (see FIG. 8). FIG. 1 describes an outline of information processing realized by the information processing system 1. FIG. 1 is a diagram showing an example of information processing according to the embodiment of the present disclosure.
 まず、図1の説明に先だって説明に用いる用語について記載する。以下では、ユーザの発話に対応する言語(入力言語)を「対象言語」と記載する。例えば、対象言語は、情報処理システム1が入力として受け付ける言語に対応する。また、対象言語の翻訳先となる言語を「翻訳先言語」と記載する。すなわち、一の言語が対象言語となったり、他の言語に対しての翻訳先言語となったりする。例えば、一の言語がユーザの発話に対応する言語である場合は対象言語となり、その一の言語が他の言語の翻訳先の言語となる場合は翻訳先言語となる。すなわち、ここでいう対象言語や翻訳先言語とは、後述する処理において、各言語の他の言語との関係を基に言語を区別して表現可能にするための名称である。また、以下では、対象言語に対応する文字情報を「入力言語文字情報」と記載し、翻訳先言語に対応する文字情報を「翻訳文」や「翻訳文字情報」と記載する場合がある。また、言語のうち、意味解析処理が可能な言語を「特定言語」と記載する。また、言語のうち、特定言語以外の言語、すなわち意味解析処理ができない言語を「非特定言語」と記載する場合がある。 First, the terms used in the explanation will be described prior to the explanation in FIG. In the following, the language (input language) corresponding to the user's utterance is described as "target language". For example, the target language corresponds to a language accepted as input by the information processing system 1. In addition, the language to be translated into the target language is described as "translation destination language". That is, one language may be the target language, or the translation destination language for another language may be. For example, if one language is a language corresponding to the user's utterance, it is the target language, and if that one language is the translation destination language of another language, it is the translation destination language. That is, the target language and the translation destination language referred to here are names for distinguishing and expressing languages based on the relationship with other languages of each language in the processing described later. Further, in the following, the character information corresponding to the target language may be described as "input language character information", and the character information corresponding to the translation destination language may be described as "translated sentence" or "translated character information". In addition, among the languages, a language capable of semantic analysis processing is described as a "specific language". In addition, among the languages, a language other than a specific language, that is, a language that cannot perform semantic analysis processing may be described as a "non-specific language".
 以下、情報処理システム1により処理の概要を説明するが、その処理の前提となる事項を簡単に説明する。情報処理システム1は、多くの言語について音声認識(言語識別)や翻訳処理が可能であり、多くの言語を入力言語(対象言語)として受け付けることができる。例えば、情報処理システム1は、図1に示すように英語、中国語、ヒンディー語、スペイン語、フランス語、アラビア語、ポルトガル語、ベンガル語、ドイツ語、日本語、韓国語等の数多くの言語を入力言語(対象言語)として受け付けることができる。なお、これらの言語は一例に過ぎず、情報処理システム1は、上記の言語に限らず、多数の言語を入力言語(対象言語)として受け付けることができる。 The outline of the processing will be explained below by the information processing system 1, but the preconditions for the processing will be briefly explained. The information processing system 1 is capable of voice recognition (language identification) and translation processing for many languages, and can accept many languages as input languages (target languages). For example, as shown in FIG. 1, the information processing system 1 can speak many languages such as English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, and Korean. It can be accepted as an input language (target language). Note that these languages are only examples, and the information processing system 1 can accept not only the above languages but also a large number of languages as input languages (target languages).
 また、図1に示す各処理は、情報処理システム1の情報処理装置100及び端末装置10のいずれの装置が行ってもよい。情報処理システム1が処理の主体として記載されている処理については、情報処理システム1に含まれるいずれの装置が行ってもよい。以下では、端末装置10により検知されたユーザの発話に対して、情報処理装置100が音声認識や翻訳や意味解析等の処理を実行する場合を一例として説明する。なお、図1では、情報処理装置100が音声認識や翻訳や意味解析等の処理(情報処理)を行う場合を一例として説明するが、端末装置10がこれらの処理(情報処理)を行ってもよい。この点については後述する。 Further, each process shown in FIG. 1 may be performed by either the information processing device 100 or the terminal device 10 of the information processing system 1. Any device included in the information processing system 1 may perform the processing in which the information processing system 1 is described as the main body of the processing. In the following, a case where the information processing device 100 executes processing such as voice recognition, translation, and semantic analysis in response to the user's utterance detected by the terminal device 10 will be described as an example. In FIG. 1, a case where the information processing device 100 performs processing (information processing) such as voice recognition, translation, and semantic analysis will be described as an example, but even if the terminal device 10 performs these processing (information processing). good. This point will be described later.
 ここから、図1について具体的に説明する。まず、ユーザが発話を行う。図1の例では、ユーザが韓国語で発話を行った場合を示す。例えば、ユーザは、「明日の東京の天気を教えて(韓国語)」と韓国語で発話する。このように、「・・・(韓国語)」と記載した場合、記載上は明細書を記載する言語(例えば日本語)であるが、実際は韓国語での発話やハングル文字であるものとする。これにより、情報処理システム1は、ユーザによる韓国語での発話を受け付ける。例えば、情報処理システム1は、入力言語(対象言語)である韓国語の発話の音声情報を取得する。 From here, FIG. 1 will be specifically described. First, the user speaks. In the example of FIG. 1, a case where the user speaks in Korean is shown. For example, the user utters "Tell me the weather in Tokyo tomorrow (Korean)" in Korean. In this way, when "... (Korean)" is described, it is the language in which the specification is described (for example, Japanese), but in reality, it is assumed that the utterance is in Korean or Korean characters. .. As a result, the information processing system 1 accepts the user's utterance in Korean. For example, the information processing system 1 acquires voice information of utterances in Korean, which is an input language (target language).
 そして、情報処理システム1は、処理フェーズFS1に示すように、音声認識に関する処理を実行する。情報処理システム1は、ユーザによる発話の音声情報に対する音声認識の処理を行う。例えば、情報処理システム1は、音声認識により入力言語(対象言語)でのユーザによる発話のテキスト(文字情報)を発話情報として取得する。図1の例では、情報処理システム1は、言語識別処理によりユーザの発話が韓国語であると判定する。情報処理システム1は、対象言語である韓国語の文字情報(発話情報)を入力言語文字情報として用いる。 Then, the information processing system 1 executes the processing related to voice recognition as shown in the processing phase FS1. The information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition. In the example of FIG. 1, the information processing system 1 determines that the user's utterance is Korean by the language identification process. The information processing system 1 uses character information (speech information) in Korean, which is the target language, as input language character information.
 そして、情報処理システム1は、発話文展開を行う(ステップS1)。情報処理システム1は、入力言語「韓国語」の入力言語文字情報を入力言語「韓国語」のまま別の表現に言い換えた文字情報(「パラフレーズ」ともいう)を生成する発話文展開を行う。例えば、パラフレーズは、ある文字情報中のある語句(文字列)を他の語句(文字列)に置き換えた文字情報、ある文字情報に新たな語句(文字列)を追加した文字情報、ある文字情報を別の表現に言い換えた文字情報等、種々の変換態様により生成される文字情報であってもよい。例えば、パラフレーズは、ある文字情報の意味を含んだまま別の表現に変換された文字情報であればどのような文字情報であってもよい。なお、情報処理システム1は、パラフレーズに関する種々の技術を適宜用いてパラフレーズを生成する。例えば、情報処理システム1は、パラフレーズ生成のルールを示す情報(パラフレーズ生成ルール情報)を用いて、パラフレーズを生成してもよい。例えば、情報処理装置100は、記憶部120(図3参照)に記憶されたパラフレーズ生成ルール情報を用いて、パラフレーズを生成してもよい。例えば、パラフレーズ生成ルール情報には、語尾の変換等の文字列の変換の規則や、文へ特定の文字列(語尾や目的語等)を追加する規則等の一覧情報であってもよい。また、情報処理システム1は、ある文字情報のパラフレーズを提供するサービス提供装置に入力言語文字情報を送信して、サービス提供装置から入力言語文字情報のパラフレーズを取得してもよい。 Then, the information processing system 1 develops the utterance sentence (step S1). The information processing system 1 develops an utterance sentence that generates character information (also referred to as "paraphrase") in which the input language character information of the input language "Korean" is paraphrased into another expression while keeping the input language "Korean". .. For example, a paraphrase is character information in which a certain word (character string) in a certain character information is replaced with another word (character string), character information in which a new word (character string) is added to a certain character information, or a certain character. It may be character information generated by various conversion modes such as character information in which information is paraphrased into another expression. For example, the paraphrase may be any character information as long as it is character information converted into another expression while including the meaning of one character information. The information processing system 1 generates paraphrases by appropriately using various techniques related to paraphrases. For example, the information processing system 1 may generate a paraphrase by using information indicating a paraphrase generation rule (paraphrase generation rule information). For example, the information processing device 100 may generate a paraphrase by using the paraphrase generation rule information stored in the storage unit 120 (see FIG. 3). For example, the paraphrase generation rule information may be list information such as a rule for converting a character string such as a flexion of a word ending or a rule for adding a specific character string (a word ending, an object, etc.) to a sentence. Further, the information processing system 1 may transmit the input language character information to the service providing device that provides the paraphrase of a certain character information, and acquire the paraphrase of the input language character information from the service providing device.
 情報処理システム1は、発話文展開により、入力言語文字情報(「第1発話展開文」とする)である「明日の東京の天気を教えて(韓国語)」を言い換えた複数のパラフレーズを生成する。例えば、情報処理システム1は、発話文展開により、「明日の東京の天気を教えてください(韓国語)」を入力言語文字情報のパラフレーズ(「第2発話展開文」とする)として生成する。例えば、情報処理システム1は、発話文展開により、「明日の東京の天気は何ですか?(韓国語)」を入力言語文字情報のパラフレーズ(「第3発話展開文」とする)として生成する。例えば、情報処理システム1は、発話文展開により、「明日の東京の天気が知りたいです(韓国語)」を入力言語文字情報のパラフレーズ(「第4発話展開文」とする)として生成する。なお、上記は説明のために変化の大きいパラフレーズを一例として示しており、情報処理システム1は、種々の技術を適宜用いて入力言語文字情報のパラフレーズを生成する。 The information processing system 1 uses a plurality of paraphrases that paraphrase the input language character information (referred to as "first utterance development sentence") "Tell me the weather in Tokyo tomorrow (Korean)" by expanding the utterance sentence. Generate. For example, the information processing system 1 generates "Tell me the weather in Tokyo tomorrow (Korean)" as a paraphrase of input language character information (referred to as "second utterance development sentence") by utterance sentence expansion. .. For example, the information processing system 1 generates "What is the weather in Tokyo tomorrow? (Korean)" as a paraphrase of input language character information (referred to as "third utterance development sentence") by utterance sentence expansion. do. For example, the information processing system 1 generates "I want to know the weather in Tokyo tomorrow (Korean)" as a paraphrase of input language character information (referred to as "fourth utterance development sentence") by expanding the utterance sentence. .. The above is shown as an example of a paraphrase with a large change for explanation, and the information processing system 1 generates a paraphrase of input language character information by appropriately using various techniques.
 図1の例では、情報処理システム1は、上記のような発話文展開により、入力言語文字情報である「明日の東京の天気を教えて(韓国語)」をN個の文に発話展開する。すなわち、情報処理システム1は、発話文展開により、入力言語文字情報である「明日の東京の天気を教えて(韓国語)」を含むN個の文を生成する。例えば、情報処理システム1は、入力言語文字情報と、その入力言語文字情報のパラフレーズを含む疑似発話テキストリスト(以下「発話文リスト」ともいう)を生成する。情報処理システム1は、N個の文を含む発話文リストを生成する。図1の例では、情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文を含む発話文リストを生成する。 In the example of FIG. 1, the information processing system 1 utterably expands the input language character information "Tell me the weather in Tokyo tomorrow (Korean)" into N sentences by the utterance sentence expansion as described above. .. That is, the information processing system 1 generates N sentences including "Tell me the weather in Tokyo tomorrow (Korean)" which is input language character information by expanding the spoken sentences. For example, the information processing system 1 generates a pseudo utterance text list (hereinafter, also referred to as “utterance sentence list”) including input language character information and a paraphrase of the input language character information. The information processing system 1 generates an utterance sentence list including N sentences. In the example of FIG. 1, the information processing system 1 generates an utterance sentence list including N sentences such as a first utterance development sentence, a second utterance development sentence, a third utterance development sentence, and a fourth utterance development sentence.
 そして、情報処理システム1は、発話文リスト中の各文を翻訳する(ステップS2)。情報処理システム1は、発話文リスト中の各文を特定言語へ変換する。情報処理システム1は、発話文リスト中の韓国語のN個の文の各々を特定言語へ変換する。ここで、図1の例では、英語、中国語、ヒンディー語、スペイン語、フランス語、アラビア語、ポルトガル語、ベンガル語、ドイツ語、日本語、韓国語等のM個の言語が特定言語である場合を示す。なお、これらの言語は一例に過ぎず、情報処理システム1は、意味解析処理が可能であれば、上記の言語以外の言語を特定言語としてもよい。 Then, the information processing system 1 translates each sentence in the utterance sentence list (step S2). The information processing system 1 converts each sentence in the utterance sentence list into a specific language. The information processing system 1 converts each of the N Korean sentences in the utterance sentence list into a specific language. Here, in the example of FIG. 1, M languages such as English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, and Korean are specific languages. Show the case. Note that these languages are only examples, and the information processing system 1 may use a language other than the above languages as a specific language as long as it can perform semantic analysis processing.
 情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々を、意味解析処理が可能な言語(特定言語)に翻訳する。情報処理システム1は、第1発話展開文~第4発話展開文等のN個の文の各々を、英語、中国語、ヒンディー語、スペイン語、フランス語、アラビア語、ポルトガル語、ベンガル語、ドイツ語、日本語、韓国語等のM個の言語(特定言語)に翻訳する。以下、入力言語の文からを特定言語の文に生成することを「言語展開」ともいう。なお、図1では、入力言語(対象言語)である韓国語も特定言語に含まれるため、情報処理システム1は、韓国語については、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文をそのまま用いる。また、入力言語(対象言語)が特定言語に含まれない場合、情報処理システム1は、入力言語の文(発話展開文)については意味解析の処理等に用いない。 The information processing system 1 is a language (specific language) capable of performing semantic analysis processing for each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. Translate to. The information processing system 1 uses English, Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, and Germany for each of the N sentences such as the first utterance development sentence to the fourth utterance development sentence. Translate into M languages (specific languages) such as Chinese, Japanese, and Korean. Hereinafter, generating a sentence of a specific language from a sentence of an input language is also referred to as "language expansion". In FIG. 1, since Korean, which is an input language (target language), is also included in the specific language, the information processing system 1 has the first utterance development sentence, the second utterance development sentence, and the third utterance development sentence for Korean. N sentences such as the expanded sentence and the fourth utterance expanded sentence are used as they are. Further, when the input language (target language) is not included in the specific language, the information processing system 1 does not use the sentence of the input language (speech expansion sentence) for the processing of semantic analysis or the like.
 情報処理システム1は、M個の特定言語のうち韓国語以外のM-1個の言語に、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々を翻訳する。例えば、情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々を、英語に翻訳する。 The information processing system 1 has M-1 languages other than Korean among M specific languages, such as a first utterance development sentence, a second utterance development sentence, a third utterance development sentence, and a fourth utterance development sentence. Translate each of the N sentences. For example, the information processing system 1 translates each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence into English.
 例えば、情報処理システム1は、第1発話展開文「明日の東京の天気を教えて(韓国語)」を、英語に対応する翻訳文「Tell me the weather in Tokyo tomorrow」に変換する。例えば、情報処理システム1は、第2発話展開文「明日の東京の天気を教えてください(韓国語)」を、英語に対応する翻訳文「Please tell me the weather in Tokyo tomorrow」に変換する。例えば、情報処理システム1は、第3発話展開文「明日の東京の天気は何ですか?(韓国語)」を、英語に対応する翻訳文「What is the weather in Tokyo tomorrow?」に変換する。例えば、情報処理システム1は、第4発話展開文「明日の東京の天気が知りたいです(韓国語)」を、英語に対応する翻訳文「I want to know the weather of Tokyo tomorrow」に変換する。このように、情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々に対応するN個の英語の翻訳文を生成する。 For example, the information processing system 1 converts the first utterance development sentence "Tell me the weather in Tokyo tomorrow (Korean)" into a translated sentence "Tell me the weather in Tokyo tomorrow" corresponding to English. For example, the information processing system 1 converts the second utterance development sentence "Tell me the weather in Tokyo tomorrow (Korean)" into the translated sentence "Please tell me the weather in Tokyo tomorrow" corresponding to English. For example, the information processing system 1 converts the third utterance development sentence "What is the weather in Tokyo tomorrow? (Korean)" into a translation sentence "What is the weather in Tokyo tomorrow?" Corresponding to English. .. For example, the information information system 1 converts the fourth utterance development sentence "I want to know the weather in Tokyo tomorrow (Korean)" into a translation sentence "I want to know the weather of Tokyo tomorrow" corresponding to English. .. In this way, the information processing system 1 translates N English sentences corresponding to each of the N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. Generate a statement.
 同様に、情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々に対応するN個の中国語の翻訳文を生成する。また、情報処理システム1は、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々に対応するN個のヒンディー語の翻訳文を生成する。また、情報処理システム1は、スペイン語、フランス語、アラビア語、ポルトガル語、ベンガル語、ドイツ語、日本語等についても同様に、第1発話展開文、第2発話展開文、第3発話展開文、第4発話展開文等のN個の文の各々に対応するN個の翻訳文をM個の言語ごとに生成する。 Similarly, the information processing system 1 translates N Chinese characters corresponding to each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. Generate a statement. Further, the information processing system 1 has N translations of N Hindi corresponding to each of N sentences such as the first utterance development sentence, the second utterance development sentence, the third utterance development sentence, and the fourth utterance development sentence. To generate. In addition, the information processing system 1 also applies to Spanish, French, Arabic, Portuguese, Bengali, German, Japanese, etc. in the same manner as the first utterance development sentence, the second utterance development sentence, and the third utterance development sentence. , N translation sentences corresponding to each of N sentences such as the fourth utterance expansion sentence are generated for each of M languages.
 上述した処理により、情報処理システム1は、韓国語のN個の文と、M-1個の言語の各々のN個の翻訳文とを含むN×M個の文を生成する。情報処理システム1は、韓国語のN個の文と、M-1個の言語の各々のN個の翻訳文とを含むリスト(「翻訳リスト」ともいう)を生成する。情報処理システム1は、M個の言語の各々のN個の文であるN×M個の文を含む翻訳リストを生成する。このように、情報処理システム1は、処理フェーズFS1で発話文展開及び言語展開の処理を実行する。このように、処理フェーズFS1は、入力言語に依存する処理である。 By the above-mentioned processing, the information processing system 1 generates N × M sentences including N sentences in Korean and N translated sentences in each of M-1 languages. The information processing system 1 generates a list (also referred to as a "translation list") including N sentences in Korean and N translations of each of M-1 languages. The information processing system 1 generates a translation list containing N × M sentences, which are N sentences in each of the M languages. In this way, the information processing system 1 executes the processing of the utterance sentence expansion and the language expansion in the processing phase FS1. As described above, the processing phase FS1 is a processing that depends on the input language.
 そして、情報処理システム1は、処理フェーズFS2に示すように、意味解析に関する処理を実行する。情報処理システム1は、発話意味解析処理(「意味解析処理」ともいう)を行う(ステップS3)。情報処理システム1は、翻訳リストに含まれる各文(「意味解析対象文」ともいう)を用いて、意味解析処理を行う。情報処理システム1は、意味解析処理による解析結果として、意味フレームの情報を生成する。図1の例では、情報処理システム1は、N×M個の文の各々に対応するN×M個の意味フレームの情報を生成する。なお、情報処理システム1は、N×M個よりも多い個数の意味フレームを生成してもよい。図1では説明のために1つの文に対して1つの意味フレームを生成する場合を示すが、情報処理システム1は、1つの文に対して、複数の意味フレームを生成してもよい。情報処理システム1は、意味解析処理により、意味解析の精度を示すスコア(「意味解析スコア」ともいう)を含む意味フレームの情報を生成する。情報処理システム1は、文の言語に対応する意味解析器を用いて、その文の意味解析の精度を示す意味解析スコアを含む意味フレームの情報を生成する。 Then, the information processing system 1 executes the processing related to the semantic analysis as shown in the processing phase FS2. The information processing system 1 performs an utterance semantic analysis process (also referred to as “semantic analysis process”) (step S3). The information processing system 1 performs a semantic analysis process using each sentence included in the translation list (also referred to as a "semantic analysis target sentence"). The information processing system 1 generates information on the semantic frame as an analysis result by the semantic analysis process. In the example of FIG. 1, the information processing system 1 generates information of N × M semantic frames corresponding to each of N × M sentences. The information processing system 1 may generate more than N × M semantic frames. Although FIG. 1 shows a case where one meaning frame is generated for one sentence for explanation, the information processing system 1 may generate a plurality of meaning frames for one sentence. The information processing system 1 generates information on a semantic frame including a score (also referred to as a “semantic analysis score”) indicating the accuracy of the semantic analysis by the semantic analysis process. The information processing system 1 uses a semantic analyzer corresponding to the language of the sentence to generate information on a semantic frame including a semantic analysis score indicating the accuracy of the semantic analysis of the sentence.
 例えば、情報処理システム1は、英語の意味解析器を用いて、英語のN個の文に対して意味解析処理を行うことにより、英語のN個の文の各々の意味解析の精度を示す意味解析スコアを含むN個の意味フレームの情報を生成する。例えば、情報処理システム1は、第1発話展開文に対応する英語の翻訳文「Tell me the weather in Tokyo tomorrow」(「第1意味解析対象文」ともいう)を英語の意味解析器に入力することにより、意味解析スコアを含む意味フレームの情報を生成する。情報処理システム1は、第1意味解析対象文のDomain-Goalを「Weather-Check」であると特定する。また、情報処理システム1は、Domain-Goal「Weather-Check」に対応するAttribute「Date」のスロット値(「Value」ともいう)が「tomorrow」であり、Attribute「Place」のスロット値が「Tokyo」であると推定する。これにより、情報処理システム1は、Domain-Goalが「Weather-Check」であり、Attribute「Date」のスロット値が「tomorrow」であり、Attribute「Place」のスロット値が「Tokyo」であることを示す意味フレームの情報を生成する。また、情報処理システム1は、第1意味解析対象文が入力された英語の意味解析器が出力した意味解析の確信度を示すスコア「0.9」を、第1意味解析対象文の意味解析スコアとして用いる。 For example, the information processing system 1 uses an English semantic analyzer to perform semantic analysis processing on N English sentences to indicate the accuracy of the semantic analysis of each of the N English sentences. Generates information on N semantic frames including analysis scores. For example, the information processing system 1 inputs an English translation "Tell me the weather in Tokyo tomorrow" (also referred to as a "first semantic analysis target sentence") corresponding to the first utterance development sentence into an English semantic analyzer. By doing so, the information of the semantic frame including the semantic analysis score is generated. The information processing system 1 identifies the Domain-Goal of the first semantic analysis target sentence as "Weather-Check". Further, in the information processing system 1, the slot value (also referred to as "Value") of the Attribute "Date" corresponding to the Domain-Goal "Weather-Check" is "tomorrow", and the slot value of the Attribute "Place" is "Tokyo". Is presumed to be. As a result, in the information processing system 1, the domain-Goal is "Weather-Check", the slot value of the Attribute "Date" is "tomorrow", and the slot value of the Attribute "Place" is "Tokyo". Generates the information of the meaning frame to be shown. Further, the information processing system 1 obtains a score "0.9" indicating the certainty of the semantic analysis output by the English semantic analyzer into which the first semantic analysis target sentence is input, and performs a semantic analysis of the first semantic analysis target sentence. Used as a score.
 また、情報処理システム1は、残りのN-1個の翻訳文についても、英語の意味解析器を用いて、N-1個の文の各々の意味解析スコアを含むN-1個の意味フレームの情報を生成する。これにより、情報処理システム1は、英語のN個の文の各々の意味解析スコアを含むN個の意味フレームの情報を生成する。 In addition, the information processing system 1 also uses an English semantic analyzer for the remaining N-1 translated sentences, and N-1 semantic frames including the semantic analysis scores of each of the N-1 sentences. Generate information about. As a result, the information processing system 1 generates information of N semantic frames including the semantic analysis score of each of the N sentences in English.
 また、情報処理システム1は、英語以外のM-1個の言語の各々についても各言語の意味解析器を用いて、意味解析スコアを含む意味フレームの情報を生成する。例えば、情報処理システム1は、韓国語の意味解析器を用いて、韓国語のN個の文の各々の意味解析スコアを含むN個の意味フレームの情報を生成する。情報処理システム1は、日本語の意味解析器を用いて、日本語のN個の文の各々の意味解析スコアを含むN個の意味フレームの情報を生成する。同様に、情報処理システム1は、中国語、ヒンディー語、スペイン語、フランス語、アラビア語、ポルトガル語、ベンガル語、ドイツ語等についても、各言語の意味解析器を用いて、各言語のN個の文の各々の意味解析スコアを含む意味フレームの情報を生成する。 Further, the information processing system 1 also uses the semantic analyzer of each language for each of the M-1 languages other than English to generate information of the semantic frame including the semantic analysis score. For example, the information processing system 1 uses a Korean semantic analyzer to generate information on N semantic frames including the semantic analysis score of each of the N sentences in Korean. The information processing system 1 uses a Japanese semantic analyzer to generate information on N semantic frames including the semantic analysis score of each of the N sentences in Japanese. Similarly, the information processing system 1 also uses a semantic analyzer of each language for Chinese, Hindi, Spanish, French, Arabic, Portuguese, Bengali, German, etc., and N pieces of each language. Generates information about the semantic frame containing the semantic analysis score for each of the sentences in.
 これにより、情報処理システム1は、N×M個の意味解析対象文の意味解析スコアを含むN×M個の意味フレームの情報を生成する。なお、情報処理システム1は、N×M個の意味解析対象文の全てを対象に意味フレームの情報を生成しなくてもよい。例えば、情報処理システム1は、N×M個の意味解析対象文のうち、翻訳の品質が低い文を除いて、意味解析処理を行ってもよいが、この点についての詳細は後述する。 As a result, the information processing system 1 generates information of N × M semantic frames including the semantic analysis score of N × M semantic analysis target sentences. The information processing system 1 does not have to generate the information of the semantic frame for all the N × M semantic analysis target sentences. For example, the information processing system 1 may perform the semantic analysis process except for the sentences having low translation quality among the N × M semantic analysis target sentences, and the details of this point will be described later.
 ここで、詳細は後述するが、各言語の意味解析器(意味解析処理の機能)は、言語ごとに作成されており、言語間での関係性は考慮されていない。そのため、各言語の意味解析器(意味解析処理の機能)が生成する意味解析スコアは、その言語内での確信度(精度)を示すものであり、他の言語との関係を考慮したスコアではない。したがって、このような意味解析スコアは、値が大きい程精度が高いという傾向はあるものの、そのまま言語間での意味解析の精度の比較に用いた場合、適切な比較となっていない場合がある。例えば、日本語の意味解析スコアの「0.9」と、英語の意味解析スコアの「0.9」とが同じ精度を示すとは限らない。 Here, the details will be described later, but the semantic analyzer (function of semantic analysis processing) of each language is created for each language, and the relationship between the languages is not considered. Therefore, the semantic analysis score generated by the semantic analyzer (function of semantic analysis processing) of each language indicates the certainty (accuracy) within that language, and the score considering the relationship with other languages No. Therefore, although the accuracy of such a semantic analysis score tends to be higher as the value is larger, when it is used as it is for the comparison of the accuracy of the semantic analysis between languages, it may not be an appropriate comparison. For example, the Japanese semantic analysis score "0.9" and the English semantic analysis score "0.9" do not always show the same accuracy.
 そこで、情報処理システム1は、意味解析スコアを複数言語間で比較可能にする精度指標値に変換する処理を行う。情報処理システム1は、意味解析スコアを複数言語間で比較可能にする精度指標値である意味解析精度(%)に変換する処理を行う(ステップS4)。情報処理システム1は、意味解析スコアを用いて、意味解析精度(%)を算出する。なお、図1では、精度指標値の「意味解析精度(%)」を用いる場合を一例として示すが、精度指標値は、言語間での意味解析精度が比較可能であれば「意味解析精度(%)」に限らず、どのような情報であってもよい。また、単位は「%」に限らず、種々の単位であってもよいし、単位が無くてもよい。 Therefore, the information processing system 1 performs a process of converting the semantic analysis score into an accuracy index value that enables comparison between a plurality of languages. The information processing system 1 performs a process of converting the semantic analysis score into the semantic analysis accuracy (%), which is an accuracy index value that enables comparison between a plurality of languages (step S4). The information processing system 1 calculates the semantic analysis accuracy (%) using the semantic analysis score. In FIG. 1, the case where the accuracy index value "semantic analysis accuracy (%)" is used is shown as an example, but the accuracy index value is "semantic analysis accuracy (semantic analysis accuracy (%)" if the semantic analysis accuracy between languages can be compared. %) ”, Any information may be used. Further, the unit is not limited to "%", and may be various units or may have no unit.
 情報処理システム1は、以下の式(1)に示すような関数(以下「スコア関数」ともいう)を用いて、意味解析精度を算出する。例えば、情報処理装置100は、記憶部120に記憶されたスコア関数を用いて、意味解析精度を算出する。なお、意味解析処理の精度を複数言語間で比較可能にする精度指標値を出力するスコア関数の生成についての詳細は後述する。 The information processing system 1 calculates the semantic analysis accuracy by using a function as shown in the following equation (1) (hereinafter, also referred to as a “score function”). For example, the information processing apparatus 100 calculates the semantic analysis accuracy by using the score function stored in the storage unit 120. The details of the generation of the score function that outputs the accuracy index value that enables the accuracy of the semantic analysis process to be compared between a plurality of languages will be described later.
 acc = f (lang、 score) Acc = f (lang, score)
 式(1)中の「acc」は、意味解析精度を示す。また、式(1)中の「score」は、意味解析スコアを示す。「score」には、意味解析精度への変換対象となる意味解析スコアが入力される。式(1)中の「lang」は、言語を示す。「lang」には、意味解析スコアの対象となっている言語を示す情報が入力される。式(1)中の「f()」は、言語の指定と、意味解析スコアとを入力として、意味解析精度を出力するスコア関数を示す。 "Acc" in equation (1) indicates the accuracy of semantic analysis. Further, "score" in the formula (1) indicates a semantic analysis score. In "score", the semantic analysis score to be converted into the semantic analysis accuracy is input. "Lang" in the formula (1) indicates a language. In "lang", information indicating the language that is the target of the semantic analysis score is input. “F ()” in the equation (1) indicates a score function that outputs the semantic analysis accuracy by inputting the language specification and the semantic analysis score.
 このように、情報処理システム1は、式(1)を用いることにより、意味解析スコア及び言語の指定に応じて、その意味解析スコアを複数言語間で比較可能に変換された精度指標値(意味解析精度)を算出する。なお、式(1)は一例であり、情報処理システム1は、式(1)以外の関数を用いて、意味解析精度を算出する処理を行ってもよい。 In this way, the information processing system 1 uses the equation (1) to convert the semantic analysis score so that it can be compared between a plurality of languages according to the semantic analysis score and the language designation (meaning). Analysis accuracy) is calculated. The equation (1) is an example, and the information processing system 1 may perform a process of calculating the semantic analysis accuracy by using a function other than the equation (1).
 例えば、情報処理システム1は、言語ごとの関数(言語別スコア関数)を用いて、意味解析精度を算出する。この場合、情報処理システム1は、意味解析スコアの入力に応じて、意味解析精度を出力する言語別スコア関数を用いて、意味解析精度を算出する。例えば、情報処理装置100は、記憶部120に記憶された言語別スコア関数を用いて、意味解析精度を算出する。情報処理システム1は、複数の言語別スコア関数のうち、意味解析スコアの言語に対応する言語別スコア関数を選択して、選択した言語別スコア関数を用いて、意味解析精度を算出する。 For example, the information processing system 1 calculates the semantic analysis accuracy by using a function for each language (score function for each language). In this case, the information processing system 1 calculates the semantic analysis accuracy by using the language-specific score function that outputs the semantic analysis accuracy in response to the input of the semantic analysis score. For example, the information processing apparatus 100 calculates the semantic analysis accuracy by using the language-specific score function stored in the storage unit 120. The information processing system 1 selects a language-specific score function corresponding to the language of the semantic analysis score from a plurality of language-specific score functions, and calculates the semantic analysis accuracy using the selected language-specific score function.
 例えば、情報処理システム1は、英語を対象として、意味解析精度を算出する場合、英語用の言語別スコア関数である英語用スコア関数を用いて、意味解析精度を算出する。例えば、情報処理システム1は、日本語を対象として、意味解析精度を算出する場合、日本語用の言語別スコア関数である日本語用スコア関数を用いて、意味解析精度を算出する。 For example, when calculating the semantic analysis accuracy for English, the information processing system 1 calculates the semantic analysis accuracy by using the English score function, which is a language-specific score function for English. For example, when the information processing system 1 calculates the semantic analysis accuracy for Japanese, the semantic analysis accuracy is calculated by using the Japanese score function, which is a language-specific score function for Japanese.
 また、式(1)の関数は、言語による条件分岐が含まれるプログラムであってもよい。この場合、式(1)の関数は、変数「lang」と各言語との比較に応じた条件分岐が含まれてもよい。例えば、式(1)の関数は、変数「lang」が一致した言語の言語別スコア関数を用いて、意味解析精度を算出し、その結果を出力する関数(プログラム)であってもよい。このように、式(1)の関数は、各言語に対応する複数の言語別スコア関数と、どの言語別スコア関数を用いるかの条件分岐を含む関数(プログラム)であってもよい。 Further, the function of Eq. (1) may be a program including conditional branching by language. In this case, the function of the equation (1) may include a conditional branch according to the comparison between the variable "lang" and each language. For example, the function of the equation (1) may be a function (program) that calculates the semantic analysis accuracy using the language-specific score function of the language in which the variable “lang” matches and outputs the result. As described above, the function of the equation (1) may be a function (program) including a plurality of language-specific score functions corresponding to each language and a conditional branch of which language-specific score function is used.
 情報処理システム1は、式(1)を用いて、N×M個の意味解析対象文の意味解析スコアの各々を意味解析精度に変換する。情報処理システム1は、式(1)を用いて、N×M個の意味解析対象文の意味解析精度を算出する。例えば、情報処理システム1は、第1意味解析対象文の意味解析スコア「0.9」と、第1意味解析対象文の言語「英語」を示す情報とを用いて、第1意味解析対象文の意味解析精度を算出する。情報処理システム1は、第1意味解析対象文の意味解析スコア「0.9」と、第1意味解析対象文の言語「英語」を示す情報とを式(1)に入力することにより、第1意味解析対象文の意味解析精度を算出する。図1の例では、情報処理システム1は、第1意味解析対象文の意味解析精度を「96.6(%)」と算出する。同様に、情報処理システム1は、第1意味解析対象文以外のN×M-1個の文(意味解析対象文)について、意味解析精度を算出する。これにより、情報処理システム1は、翻訳リスト中の全N×M個の文(意味解析対象文)について、意味解析精度を算出する。このように、情報処理システム1は、処理フェーズFS2で翻訳リスト中の文(N×M個の文)に対して意味解析処理を実行し、N×M個の文の意味解析精度を算出する。このように、処理フェーズFS2は、入力言語に非依存の処理である。 The information processing system 1 uses the equation (1) to convert each of the semantic analysis scores of N × M semantic analysis target sentences into the semantic analysis accuracy. The information processing system 1 calculates the semantic analysis accuracy of N × M semantic analysis target sentences by using the equation (1). For example, the information processing system 1 uses the semantic analysis score "0.9" of the first semantic analysis target sentence and the information indicating the language "English" of the first semantic analysis target sentence, and uses the information indicating the language "English" of the first semantic analysis target sentence. Calculate the semantic analysis accuracy of. The information processing system 1 inputs the semantic analysis score "0.9" of the first semantic analysis target sentence and the information indicating the language "English" of the first semantic analysis target sentence into the equation (1). 1 Semantic analysis Calculate the semantic analysis accuracy of the target sentence. In the example of FIG. 1, the information processing system 1 calculates the semantic analysis accuracy of the first semantic analysis target sentence as "96.6 (%)". Similarly, the information processing system 1 calculates the semantic analysis accuracy for N × M-1 sentences (semantic analysis target sentences) other than the first semantic analysis target sentence. As a result, the information processing system 1 calculates the semantic analysis accuracy of all N × M sentences (semantic analysis target sentences) in the translation list. In this way, the information processing system 1 executes the semantic analysis process on the sentences (N × M sentences) in the translation list in the processing phase FS2, and calculates the semantic analysis accuracy of the N × M sentences. .. As described above, the processing phase FS2 is an input language-independent process.
 そして、情報処理システム1は、処理フェーズFS3に示すように、応答生成に関する処理を実行する。図1では、情報処理システム1は、応答生成に先立って、応答生成等に用いる意味フレーム(意味解析対象文)を選択する。情報処理システム1は、翻訳リスト中の文のうち、応答生成等に用いる文(「処理対象文字情報」ともいう)を選択する。情報処理システム1は、N×M個の意味フレームのうち、応答生成等に用いる意味フレームを選択する。情報処理システム1は、N×M個の意味フレームのうち、意味解析精度が最大の意味フレーム(意味解析対象文)を選択する。図1の例では、情報処理システム1は、意味解析精度が「96.6(%)」である第1意味解析対象文を応答生成等に用いる意味フレーム(意味解析対象文)に選択する。情報処理システム1は、意味解析精度が「96.6(%)」である英語の翻訳文「Tell me the weather in Tokyo tomorrow」の意味解析結果に含まれる意味フレーム(「処理対象意味フレーム」ともいう)を応答生成等に用いる情報に選択する。 Then, the information processing system 1 executes the processing related to the response generation as shown in the processing phase FS3. In FIG. 1, the information processing system 1 selects a semantic frame (semantic analysis target sentence) to be used for response generation or the like prior to response generation. The information processing system 1 selects a sentence (also referred to as “character information to be processed”) used for response generation or the like from the sentences in the translation list. The information processing system 1 selects a meaning frame used for response generation or the like from N × M meaning frames. The information processing system 1 selects a semantic frame (semantic analysis target sentence) having the maximum semantic analysis accuracy from N × M semantic frames. In the example of FIG. 1, the information processing system 1 selects the first semantic analysis target sentence having a semantic analysis accuracy of "96.6 (%)" as the semantic frame (semantic analysis target sentence) used for response generation or the like. The information processing system 1 has a semantic frame included in the semantic analysis result of the English translation "Tell me the weather in Tokyo tomorrow" whose semantic analysis accuracy is "96.6 (%)" (also referred to as "processed semantic frame"). Is selected as the information used for response generation, etc.
 そして、情報処理システム1は、スロット逆変換を行う(ステップS5)。情報処理システム1は、処理対象意味フレーム中のスロット値を入力言語(対象言語)のスロット値に変換する。情報処理システム1は、特定言語(翻訳先言語)のスロット値を入力言語(対象言語)のスロット値に変換する。 Then, the information processing system 1 performs slot inverse transformation (step S5). The information processing system 1 converts the slot value in the processing target meaning frame into the slot value of the input language (target language). The information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language).
 図1では、情報処理システム1は、翻訳先言語である英語のスロット値を、入力言語である韓国語のスロット値に変換する。例えば、情報処理システム1は、Attribute「Date」のスロット値を「tomorrow」から「明日(韓国語)」に変換し、スロット値を英語から韓国語に変換する。例えば、情報処理システム1は、Attribute「Place」のスロット値を「Tokyo」から「東京(韓国語)」に変換し、スロット値を英語から韓国語に変換する。 In FIG. 1, the information processing system 1 converts the slot value of English, which is the translation destination language, into the slot value of Korean, which is the input language. For example, the information processing system 1 converts the slot value of the Attribute “Date” from “tomorrow” to “tomorrow (Korean)” and converts the slot value from English to Korean. For example, the information processing system 1 converts the slot value of the Attribute “Place” from “Tokyo” to “Tokyo (Korean)” and converts the slot value from English to Korean.
 そして、情報処理システム1は、開始するサービスを決定する(ステップS6)。例えば、情報処理システム1は、カレンダーサービスSV1、天気サービスSV2、アラームサービスSV3、音楽サービスSV4等の種々のサービスから、開始するサービスを決定する。図1では、情報処理システム1は、ユーザが天気を尋ねているため、天気サービスSV2を開始するサービスに決定する。そして、情報処理システム1は、応答を生成する。例えば、情報処理システム1は、明日の東京の天気を示す情報を韓国語で出力する。例えば、情報処理システム1は、「明日の東京の天気は晴れです(韓国語)」といった応答を、韓国語で音声により出力したり、韓国語で表示したりする。このように、処理フェーズFS3は、入力言語に依存する処理である。 Then, the information processing system 1 determines the service to be started (step S6). For example, the information processing system 1 determines a service to be started from various services such as a calendar service SV1, a weather service SV2, an alarm service SV3, and a music service SV4. In FIG. 1, since the user is asking for the weather, the information processing system 1 decides to start the weather service SV2. Then, the information processing system 1 generates a response. For example, the information processing system 1 outputs information indicating the weather in Tokyo tomorrow in Korean. For example, the information processing system 1 outputs a response such as "Tomorrow's weather in Tokyo is sunny (Korean)" by voice in Korean or displays it in Korean. As described above, the processing phase FS3 is a processing that depends on the input language.
 上述したように、図1では、情報処理システム1は、発話文展開及び言語展開の処理により生成された複数の言語の各々の複数の文を対象として意味解析の処理を行う。そして、情報処理システム1は、意味解析の処理により生成される意味解析スコアを複数言語間で比較可能にする精度指標値(意味解析精度)に変換する。これにより、情報処理システム1は、言語間における意味解析精度の比較を可能にすることができる。 As described above, in FIG. 1, the information processing system 1 performs semantic analysis processing for each of a plurality of sentences in a plurality of languages generated by the processing of utterance sentence expansion and language expansion. Then, the information processing system 1 converts the semantic analysis score generated by the processing of the semantic analysis into an accuracy index value (semantic analysis accuracy) that enables comparison between a plurality of languages. As a result, the information processing system 1 can compare the accuracy of semantic analysis between languages.
[1-1-1.概要、背景及び効果等]
 上述のように、情報処理システム1は、例えばスマートスピーカなどのデバイスに対し、ユーザの発話を受け付け、発話内容から意味フレームのテーブル構造を生成し、発話のドメインゴールを分類および、スロットを取り出す。これにより、情報処理システム1は、アプリやサービスの実行に必要な情報を抽出する意味解析処理の精度を改善するための仕組みを提供する。
[1-1-1. Overview, background and effects]
As described above, the information processing system 1 accepts a user's utterance to a device such as a smart speaker, generates a table structure of meaning frames from the utterance content, classifies the domain goals of the utterance, and takes out slots. As a result, the information processing system 1 provides a mechanism for improving the accuracy of the semantic analysis process for extracting information necessary for executing an application or service.
 情報処理システム1により実行される手法は、図1に示すように、音声認識後の発話文に対して、発話文展開により、言い回しやフレーズを別の表現によりバリエーションを増やした発話文リストを生成する。また、情報処理システム1により実行される手法は、発話文リストに対し、発話意味解析器で処理可能なすべての特定言語へ翻訳により言語変換を行う。これにより、情報処理システム1は、通常は入力言語のみの意味解析結果しか得られないところ、様々な表現および言語から判断が可能となる。 As shown in FIG. 1, the method executed by the information processing system 1 generates an utterance sentence list in which phrases and phrases are increased in variations by different expressions by expanding the utterance sentences for the utterance sentences after speech recognition. do. Further, the method executed by the information processing system 1 performs language conversion by translating the utterance sentence list into all specific languages that can be processed by the utterance meaning analyzer. As a result, the information processing system 1 can make a judgment from various expressions and languages, although normally only the semantic analysis result of only the input language can be obtained.
 このように、情報処理システム1は、意味解析器の解析精度が十分ではない入力言語の場合であっても、発話意味解析器で処理可能なすべての特定言語へ翻訳により、解析精度が十分な言語での解析が可能になる。また、言語変換における翻訳器の翻訳精度に大きく影響してしまうため、情報処理システム1は、発話文展開により、同じ意味を維持しながら、意味フレームの結果に影響しないように、表層の文字列表現を増やすことで吸収する。情報処理システム1は、発話文展開、言語展開により、意味解析結果が増えるため、翻訳精度の品質を推定するモジュールを用意し、一定以下の精度の発話は棄却してもよいが、この点については後述する。また、情報処理システム1は、事前に意味解析処理のスコア値と解析精度のパーセンテージの対応関数(スコア関数)を言語ごとに用意することで、通常のスコア値では比較に意味をなさない言語間での意味解析結果の比較が解析精度の視点で可能となる。 As described above, even in the case of an input language in which the analysis accuracy of the semantic analyzer is not sufficient, the information processing system 1 has sufficient analysis accuracy by translating into all specific languages that can be processed by the speech semantic analyzer. It enables analysis in language. In addition, since the translation accuracy of the translator in language conversion is greatly affected, the information processing system 1 maintains the same meaning by expanding the utterance sentence, and does not affect the result of the meaning frame, so that the character string on the surface layer is not affected. Absorb by increasing the expression. The information processing system 1 prepares a module for estimating the quality of translation accuracy because the semantic analysis results increase due to utterance sentence expansion and language expansion, and utterances with accuracy below a certain level may be rejected. Will be described later. In addition, the information processing system 1 prepares in advance a corresponding function (score function) between the score value of the semantic analysis process and the percentage of the analysis accuracy for each language, so that the normal score value does not make sense for comparison between languages. It is possible to compare the semantic analysis results in the above from the viewpoint of analysis accuracy.
 意味解析処理(意味解析器等)を作成するためには、ドメインゴールに応じた発話文を収集する必要があり、そこから、スロットとして切り出すためのラベリングを行う必要がある。そのために、対象言語を理解しドメインゴールの基準設計や、コーパスを収集しラベリングにかかるコストの面で、一つの言語へローカライズしていくのは、対訳収集が主なコストとなる機械翻訳と比べて負荷が高く、意味解析の多言語展開には時間と人手コストが一般的には必要とされる。グローバルに事業を展開していく際に、ローカライズゼーションは避けては通れない課題である。また、発話文収集が十分でない場合や、意味フレームの設計が十分でない言語がある場合、その言語に対しては十分な意味解析結果を得ることは難しい。 In order to create a semantic analysis process (semantic analyzer, etc.), it is necessary to collect utterance sentences according to the domain goal, and from there, it is necessary to perform labeling to cut out as a slot. Therefore, in terms of understanding the target language, designing the standard for domain goals, and collecting and labeling the corpus, localizing it into one language is compared to machine translation, where bilingual collection is the main cost. The load is high, and multilingual development of semantic analysis generally requires time and labor costs. Localization is an unavoidable issue when expanding our business globally. In addition, if the utterance sentence collection is insufficient or if there is a language in which the semantic frame design is not sufficient, it is difficult to obtain sufficient semantic analysis results for that language.
 一方、情報処理システム1は以下のような技術的な特徴を有する。情報処理システム1は、翻訳技術を利用することで、意味解析の精度を向上させたり、支援したりする。また、情報処理システム1は、入力発話文に対して、同じ意味、例えば意味フレームの出力結果に影響されない表現に変換する発話展開を行う。また、情報処理システム1は、発話展開による増加した発話文に対して、意味解析器が処理可能な言語に翻訳する。 On the other hand, the information processing system 1 has the following technical features. The information processing system 1 improves or supports the accuracy of semantic analysis by using translation technology. Further, the information processing system 1 performs utterance development that converts the input utterance sentence into an expression having the same meaning, for example, an expression that is not affected by the output result of the meaning frame. Further, the information processing system 1 translates the increased utterance sentences due to the utterance development into a language that can be processed by the semantic analyzer.
 また、情報処理システム1は、翻訳器で言語変換した発話文に対して、意味解析の結果である意味フレームを展開数×言語数ごとに得る。また、情報処理システム1は、事前に意味解析結果のスコアと、パーセンテージ表示の精度指標を評価データから算出して、スコア精度対応表または関数化する。また、情報処理システム1は、精度=f(スコア)の精度関数(言語別スコア関数等)を生成しておく。 In addition, the information processing system 1 obtains a semantic frame, which is the result of semantic analysis, for each utterance sentence that has been linguistically converted by a translator for each number of expansions x number of languages. Further, the information processing system 1 calculates the score of the semantic analysis result and the accuracy index of the percentage display from the evaluation data in advance, and converts it into a score accuracy correspondence table or a function. Further, the information processing system 1 generates an accuracy function (score function for each language, etc.) with accuracy = f (score).
 また、情報処理システム1は、言語及び展開の組合せごとに意味解析のスコアから、精度関数を通し、最大値の精度のものを意味解析結果とする。また、情報処理システム1は、入力言語で応答を返すために、意味フレームのスロット値を、その意味フレームに対応する意味解析を行った言語(特定言語)から入力言語へ逆変換する。 In addition, the information processing system 1 passes the accuracy function from the score of the semantic analysis for each combination of language and expansion, and obtains the one with the maximum accuracy as the semantic analysis result. Further, in order to return a response in the input language, the information processing system 1 reversely converts the slot value of the semantic frame from the language (specific language) for which the semantic analysis corresponding to the semantic frame is performed to the input language.
 情報処理システム1は、応答の際には入力時、解析時、出力時の3フェーズにて、どの言語かわかる形式で出力する。例えば、情報処理システム1は、画像(アイコンなど)、音声(効果音など)、テキスト(言語名/言語コード)等の形式で出力する。なお、この点についての詳細は後述する。 The information processing system 1 outputs in a format that can be understood which language it is in three phases of input, analysis, and output when responding. For example, the information processing system 1 outputs in the form of an image (icon or the like), a voice (sound effect or the like), a text (language name / language code) or the like. The details of this point will be described later.
 また、情報処理システム1は、翻訳精度、意味解析精度が一定以下の場合は処理を中断し、その中断理由を提示する。精度が低い状態でアプリケーションやサービスを実行してもユーザが期待する結果にならないことが多い。そのため、情報処理システム1は、ユーザに翻訳処理が上手くできなかったのか、翻訳は成功したが意味解析処理が上手くできなかったのかを伝えることで、ユーザ側が次に入力する際に調整(制御)することを可能にする。 Further, the information processing system 1 interrupts the processing when the translation accuracy and the semantic analysis accuracy are below a certain level, and presents the reason for the interruption. Executing an application or service with low accuracy often does not produce the results that the user expects. Therefore, the information processing system 1 tells the user whether the translation process was not successful or the translation was successful but the semantic analysis process was not successful, so that the user can adjust (control) the next time the input is performed. Allows you to.
 ここで、図19を用いて従来の翻訳の一例について簡単に説明する。図19は、他の言語を介した翻訳の一例を示す図である。図19は、多言語化における特定言語経由の方法の一例を示す。 Here, an example of conventional translation will be briefly described with reference to FIG. FIG. 19 is a diagram showing an example of translation via another language. FIG. 19 shows an example of a method via a specific language in multilingualization.
 翻訳技術において、翻訳のモデルを訓練するためには、大量の対訳文の組が必要とされ、特にマイナー言語Aからマイナー言語Bへの翻訳の際は特にコーパスの収集が困難になり、翻訳エンジンの実現が難しくなる。なお、ここでいうマイナー言語とは、例えば情報処理システム1の提供元の属する国において、十分な量のデータを収集することが難しい言語を意味する。一方、メジャー言語とは、例えば情報処理システム1の提供元の属する国において、十分な量のデータを収集することが比較的容易な言語を意味する。例えば、メジャー言語には、情報処理システム1の提供元の属する国で用いられる言語が含まれる。なお、マイナー言語とは、その言語を用いる人(話者)の数が比較的少ない言語であってもよく、メジャー言語とは、その言語を用いる人(話者)の数が比較的多い言語であってもよい。 In translation technology, a large number of pairs of bilingual sentences are required to train a translation model, and especially when translating from minor language A to minor language B, it becomes difficult to collect a corpus, and the translation engine. Will be difficult to realize. The minor language referred to here means a language in which it is difficult to collect a sufficient amount of data, for example, in the country to which the provider of the information processing system 1 belongs. On the other hand, the major language means a language in which it is relatively easy to collect a sufficient amount of data, for example, in the country to which the provider of the information processing system 1 belongs. For example, the major language includes a language used in the country to which the provider of the information processing system 1 belongs. A minor language may be a language in which the number of people (speakers) who use the language is relatively small, and a major language is a language in which the number of people (speakers) who use the language is relatively large. It may be.
 そこで、図19のように一度メジャー言語に変換する手法がある。図19の例では、マイナー言語Aから、メジャー言語に翻訳した後、メジャー言語からマイナー言語Bへ2回翻訳する。このように2回翻訳する理由は、以下の2つの点がある。一つ目は、マイナー言語Aからマイナー言語Bへの対訳文の収集・作成に要するコストに比べ、マイナー言語Aからメジャー言語、及びメジャー言語からマイナー言語Bのコストのほうが低いためである。二つ目は、メジャー言語はメジャー言語であるため、マイナー言語Aからメジャー言語、及びメジャー言語からマイナー言語Bの多言語対応としてのニーズがあり、既に対応済みで翻訳可能であることが多く、既存の翻訳器(翻訳処理)をそのまま利用可能(流用可能)である可能性が高いためである。 Therefore, as shown in Fig. 19, there is a method to convert to a major language once. In the example of FIG. 19, after the minor language A is translated into the major language, the major language is translated into the minor language B twice. There are two reasons for translating twice in this way. The first is that the cost from minor language A to major language and from major language to minor language B is lower than the cost required to collect and create bilingual sentences from minor language A to minor language B. Secondly, since the major language is a major language, there is a need for multilingual support from minor language A to major language, and from major language to minor language B, and in many cases it has already been supported and can be translated. This is because there is a high possibility that the existing translator (translation processing) can be used as it is (it can be diverted).
 上述のように、翻訳において特定の言語(例えばメジャー言語)へ翻訳し、更に目的の言語に再翻訳する手法が考えられる。上記は、目的が同じ翻訳器同士を単なる結合(単結合)する場合であり、性質・目的が同じもの同士であるため、上手く機能する可能性が高い。 As mentioned above, a method of translating into a specific language (for example, a major language) and then retranslating into the target language can be considered. The above is a case where translators having the same purpose are simply combined (single-bonded), and since they have the same properties and purposes, there is a high possibility that they will function well.
 一方で、図1に示す例での対象は、意味解析器の精度改善に、複数言語の翻訳器を使用する点である。しかしながら、音声認識後のテキストを特定言語に翻訳する際、翻訳器と意味解析器は性質や目的が違うものであり、単結合では上手く機能しない場合がある。また、各言語に変換した翻訳文に対する意味解析結果を比較して選択するのは難しい。その理由として、以下に従来技術の問題点として記載する。 On the other hand, the object in the example shown in FIG. 1 is that a multilingual translator is used to improve the accuracy of the semantic analyzer. However, when translating text after speech recognition into a specific language, the translator and the semantic analyzer have different properties and purposes, and may not function well with a single bond. In addition, it is difficult to compare and select the semantic analysis results for the translated sentences converted into each language. The reasons for this are described below as problems with the prior art.
 一般的に翻訳器は、書き言葉の文語調の傾向にあり、それは翻訳器が広く使われることを目指し、科学文書、ニュースなどの人が読み・正確に伝わる文として、主語省略がなく敬語表現などあいまい性がなく、文としての完成度が高いコーパスをベースに学習されることが多い点に起因する。一方で、スマートスピーカやAI(Artificial Intelligence)チャットボットなど、人が機械に依頼するようなスタイル、人と機械がコミュニケーションするスタイルの場合、人と人が会話するような言葉の表現が自然に用いられている。 In general, translators tend to have a literary tone of written words, which aims to be widely used by translators. This is due to the fact that it is often learned based on a corpus that is not ambiguous and has a high degree of perfection as a sentence. On the other hand, in the case of a style in which a person asks a machine, such as a smart speaker or an AI (Artificial Intelligence) chatbot, or a style in which a person communicates with a machine, the expression of words such as a person-to-person conversation is naturally used. Has been done.
 つまり、以下のような2つの点が、翻訳処理(翻訳器)と意味解析処理(意味解析器)とを単純に結合(単結合)した場合に上手く機能しない原因となり得る。 That is, the following two points can cause the translation process (translator) and the semantic analysis process (semantic analyzer) to not function well when simply combined (single bond).
 第1点としては、例えば入力される入力情報が、書き言葉の文語調ではなく、話し言葉の口語調である点が挙げられる。第2点としては、例えば言語特有の表現には適切な翻訳が得られない点が挙げられる。ただし、近年の音声翻訳器は話し言葉(口語調)も取り入れている(対応している)場合があり、以下第2点への対応が重要になる。 The first point is that, for example, the input information to be input is not the written language but the spoken language. The second point is that, for example, an appropriate translation cannot be obtained for a language-specific expression. However, recent speech translators may also incorporate (correspond to) spoken language (colloquial tone), and it is important to address the second point below.
 第2点については、「音楽をかけて」など動詞の「かけて」の表現が、翻訳器を通して正しく翻訳される保証はない。また、翻訳器単体として見た場合には適切に翻訳されている場合であっても、意味解析器から見た場合は想定外の表現である可能性がある。これは学習フェーズにおけるドメインゴールごとの訓練文に依存していることが要因である。翻訳器の正確さと、意味解析器の正確さにはギャップがある。 Regarding the second point, there is no guarantee that the expression of the verb "kake" such as "play music" will be translated correctly through the translator. Moreover, even if the translation is properly performed when viewed as a single translator, it may be an unexpected expression when viewed from the semantic analyzer. This is because it depends on the training sentences for each domain goal in the learning phase. There is a gap between the accuracy of the translator and the accuracy of the semantic analyzer.
 一方で、情報処理システム1は、上述したように、ユーザの発話を発話文展開により複数の表現(複数の発話展開文)に展開する。そして、情報処理システム1は、複数言語の翻訳器を用いて、複数の発話展開文の各々を言語展開する。そして、情報処理システム1は、複数言語間での意味解析精度を比較可能にするための精度指標値を算出し、算出した精度指標値を用いて、いずれの言語の意味解析の精度が良いかを比較可能にする。情報処理システム1は、各言語のうち精度よく処理可能な言語を用いて応答生成等の処理を行う。これにより、情報処理システム1は、翻訳器の正確さと、意味解析器の正確さにはギャップがある場合であっても、数多くの候補の中から精度が良いと想定される文を選択して処理することにより、精度よい意味解析を基に応答処理などを行うことができる。 On the other hand, as described above, the information processing system 1 expands the user's utterance into a plurality of expressions (plural utterance expansion sentences) by expanding the utterance sentence. Then, the information processing system 1 uses a translator of a plurality of languages to develop each of the plurality of utterance development sentences into a language. Then, the information processing system 1 calculates an accuracy index value for making it possible to compare the semantic analysis accuracy between a plurality of languages, and using the calculated accuracy index value, which language has the better semantic analysis accuracy. Can be compared. The information processing system 1 performs processing such as response generation using a language that can be processed accurately among each language. As a result, the information processing system 1 selects a sentence that is expected to have good accuracy from a large number of candidates even if there is a gap between the accuracy of the translator and the accuracy of the semantic analyzer. By processing, response processing and the like can be performed based on accurate semantic analysis.
[1-1-2.スコア関数]
 ここから、スコア関数について説明する。上述したように、式(1)のスコア関数は、意味解析結果のスコア(意味解析スコア)と解析言語から、解析精度をパーセンテージに変換する関数である。
[1-1-2. Score function]
From here, the score function will be described. As described above, the score function of the equation (1) is a function that converts the analysis accuracy into a percentage from the score of the semantic analysis result (semantic analysis score) and the analysis language.
 例えば、意味解析は言語ごとに収集したコーパスに基づいて学習され、モデル化される。すなわち、学習時に言語間を想定した学習されない。そのため、推論時の統計的な優位性を示すスコア値は、学習モデルの言語内では相対的に利用することが可能であるが、言語を超えて言語Aと言語B等のスコア値(意味解析スコア)を比較することは意味をなさない。例えば、スコア(意味解析スコア)について、英語が「0.4」であり、ドイツ語が「0.5」の場合に数値の高いドイツ語を選択する根拠がない。 For example, semantic analysis is learned and modeled based on the corpus collected for each language. That is, the learning is not performed assuming the inter-language at the time of learning. Therefore, the score value indicating the statistical superiority at the time of inference can be relatively used in the language of the learning model, but the score value of the language A and the language B, etc. beyond the language (semantic analysis). It doesn't make sense to compare scores). For example, regarding the score (semantic analysis score), when English is "0.4" and German is "0.5", there is no reason to select German with a high numerical value.
 そこで、情報処理システム1は、式(1)のようなスコア関数を用いて言語間での精度比較が可能な指標値(意味解析精度)を算出する。 Therefore, the information processing system 1 calculates an index value (semantic analysis accuracy) that enables accuracy comparison between languages using a score function as in Eq. (1).
[1-1-2-1.関数生成例]
 以下、図16~図18を用いて、スコア関数の生成例について説明する。図16及び図17は、解析精度の一例を示す図である。図16は、日本語の解析精度の一例を示す図である。図17は、英語の解析精度の一例を示す図である。図18は、解析精度とスコアとの関係の一例を示す図である。
[1-1-2-1. Function generation example]
Hereinafter, an example of generating a score function will be described with reference to FIGS. 16 to 18. 16 and 17 are diagrams showing an example of analysis accuracy. FIG. 16 is a diagram showing an example of Japanese analysis accuracy. FIG. 17 is a diagram showing an example of English analysis accuracy. FIG. 18 is a diagram showing an example of the relationship between the analysis accuracy and the score.
 例えば、情報処理システム1は、言語内において評価データから、ある値のスコア以上の値の結果を集めた場合の正解の確率(解析精度のパーセンテージ)を算出し、ある値を変えて再度評価し正解の確率を算出する。これにより、情報処理システム1は、図16や図17に示すような、言語ごとのスコアテーブルが作成可能である。 For example, the information processing system 1 calculates the probability of correct answer (percentage of analysis accuracy) when collecting the results of a value equal to or higher than the score of a certain value from the evaluation data in the language, changes a certain value, and evaluates again. Calculate the probability of correct answer. As a result, the information processing system 1 can create a score table for each language as shown in FIGS. 16 and 17.
 例えば、情報処理システム1は、日本語を対象として、ある値のスコア以上の値の結果を集めた場合の正解の確率(解析精度のパーセンテージ)を算出し、ある値を変えて再度評価し正解の確率を算出することで図16に示すようなスコアテーブルを生成する。情報処理システム1は、英語を対象として、ある値のスコア以上の値の結果を集めた場合の正解の確率(解析精度のパーセンテージ)を算出し、ある値を変えて再度評価し正解の確率を算出することで図17に示すようなスコアテーブルを生成する。なお、情報処理システム1は、中国語やスペイン語等、各言語(特定言語)について同様の処理を行うことで、各言語(特定言語)のスコアテーブルを生成する。 For example, the information processing system 1 calculates the probability of a correct answer (percentage of analysis accuracy) when collecting results of a value equal to or higher than a certain value for Japanese, changes a certain value, evaluates it again, and corrects the answer. By calculating the probability of, a score table as shown in FIG. 16 is generated. The information processing system 1 calculates the probability of a correct answer (percentage of analysis accuracy) when collecting results of a value equal to or higher than a certain value for English, changes a certain value, evaluates it again, and determines the probability of the correct answer. By calculating, a score table as shown in FIG. 17 is generated. The information processing system 1 generates a score table for each language (specific language) by performing the same processing for each language (specific language) such as Chinese and Spanish.
 日本語を対象とした場合を一例として説明すると、情報処理システム1は、日本語での音声認識後のテキスト(文字情報)と、その文字情報の意味解析の結果とが対応付けられた評価データを用いて、日本語のスコアテーブルを生成する。例えば、情報処理システム1は、日本語での音声認識後のテキスト(文字情報)及びその意味解析の結果と、その結果が正解か否かを示す情報(ラベル)とを対応付けた評価データを用いて、日本語のスコアテーブルを生成する。例えば、情報処理システム1は、各テキスト(文字情報)に対する意味解析の確信度を示す意味解析スコアと、各テキスト(文字情報)に対して特定したドメインゴール等が正解であったかを示すラベルとを対応付けた評価データを用いて、日本語のスコアテーブルを生成する。各テキスト(文字情報)の意味解析の結果に対応するラベルは、情報処理システム1の管理者等が設定してもよいし、自動で設定してもよい。また、ラベルは、情報処理システム1の管理者等が対応するテキスト(文字情報)の意味解析の結果に割り当ててもよいし、自動で割り当ててもよい。 To explain the case of targeting Japanese as an example, the information processing system 1 has evaluation data in which the text (character information) after voice recognition in Japanese and the result of semantic analysis of the character information are associated with each other. To generate a Japanese score table using. For example, the information processing system 1 provides evaluation data in which the text (character information) after voice recognition in Japanese and the result of its semantic analysis are associated with information (label) indicating whether or not the result is correct. Use to generate a Japanese score table. For example, the information processing system 1 has a semantic analysis score indicating the certainty of semantic analysis for each text (character information) and a label indicating whether the specified domain goal or the like for each text (character information) is correct. A Japanese score table is generated using the associated evaluation data. The label corresponding to the result of the semantic analysis of each text (character information) may be set by the administrator of the information processing system 1 or the like, or may be set automatically. Further, the label may be assigned to the result of the semantic analysis of the corresponding text (character information) by the administrator of the information processing system 1, or may be automatically assigned.
 情報処理システム1は、あるテキスト(文字情報)の意味解析スコアが「0.4」である意味解析の結果であるドメインゴールまたはスロットが不正解である場合、そのテキスト(文字情報)の意味解析スコア「0.4」に不正解を示すラベル「0」を対応付けた評価データを用いる。また、情報処理システム1は、あるテキスト(文字情報)の意味解析スコアが「0.9」である意味解析の結果であるドメインゴール及びスロットが正解である場合、そのテキスト(文字情報)の意味解析スコア「0.9」に正解を示すラベル「1」を対応付けた評価データを用いる。情報処理システム1は、このような各テキスト(文字情報)の意味解析スコアとラベルとが対応付けられた評価データを用いて、日本語のスコアテーブルを生成する。 The information processing system 1 analyzes the meaning of a text (character information) when the domain goal or slot, which is the result of the semantic analysis with a semantic analysis score of "0.4", is incorrect. Evaluation data in which the score "0.4" is associated with the label "0" indicating an incorrect answer is used. Further, in the information processing system 1, when the domain goal and the slot, which are the results of the semantic analysis in which the semantic analysis score of a certain text (character information) is "0.9", are correct, the meaning of the text (character information). Evaluation data in which the analysis score "0.9" is associated with the label "1" indicating the correct answer is used. The information processing system 1 generates a Japanese score table using the evaluation data in which the semantic analysis score of each text (character information) and the label are associated with each other.
 図16や図17に示すような各言語の意味解析スコアと正解率(意味解析精度)との関係を示す情報を基に、情報処理システム1は、図18に示すようなグラフGR1を生成する。情報処理システム1は、図18に示すようなグラフGR1を基に各言語のスコア関数(言語別スコア関数)を生成する。図18では、実線で結んだ黒丸が日本語の結果を示し、点線で結んだ白丸が英語の結果を示す。 The information processing system 1 generates the graph GR1 as shown in FIG. 18 based on the information showing the relationship between the semantic analysis score of each language and the correct answer rate (semantic analysis accuracy) as shown in FIGS. 16 and 17. .. The information processing system 1 generates a score function (score function for each language) of each language based on the graph GR1 as shown in FIG. In FIG. 18, black circles connected by solid lines show the results in Japanese, and white circles connected by dotted lines show the results in English.
 ここで、図18のグラフGR1は、横軸が「スコア閾値」、すなわち意味解析スコアの閾値を示し、縦軸が「正解の確率(単位:パーセンテージ)」、すなわち意味解析精度を示す。このように、図18のグラフGR1は、各言語について、意味解析スコアがスコア閾値以上である解析結果群のうち、推定した情報(ドメインゴール等)が正解である解析結果の割合を示す。例えば、日本語について、意味解析スコアが「0.6」以上であった解析結果群のうち、推定した情報が正解である解析結果の割合が60%であることを示す。例えば、情報処理システム1は、日本語について、意味解析スコアが「0.6」以上であった解析結果の数が「10000」であり、推定した情報が正解である解析結果の数が「6000」である場合、「正解の確率(単位:パーセンテージ)」、すなわち意味解析精度を「60%」を算出する。例えば、横軸が「スコア閾値」の低い所では、(統計的に)ばらつき、横軸が「スコア閾値」の高い所では安定して高くなる。例えば、横軸が「スコア閾値」が「0.5」を超える箇所では単調増加になる。 Here, in the graph GR1 of FIG. 18, the horizontal axis indicates the “score threshold”, that is, the threshold of the semantic analysis score, and the vertical axis indicates the “probability of correct answer (unit: percentage)”, that is, the semantic analysis accuracy. As described above, the graph GR1 of FIG. 18 shows the ratio of the analysis results in which the estimated information (domain goal, etc.) is correct in the analysis result group whose semantic analysis score is equal to or higher than the score threshold value for each language. For example, for Japanese, it is shown that 60% of the analysis result group whose semantic analysis score is "0.6" or more is the analysis result in which the estimated information is correct. For example, in the information processing system 1, for Japanese, the number of analysis results having a semantic analysis score of "0.6" or more is "10000", and the number of analysis results whose estimated information is correct is "6000". In the case of ", the probability of correct answer (unit: percentage)", that is, the semantic analysis accuracy is calculated as "60%". For example, where the horizontal axis has a low "score threshold", there is (statistically) variation, and where the horizontal axis has a high "score threshold", the value is stable and high. For example, when the horizontal axis is where the "score threshold" exceeds "0.5", the increase is monotonous.
 例えば、情報処理システム1は、横軸が「スコア閾値」、縦軸「正解の確率(単位:パーセンテージ)」のフィッティング関数を導出(生成)する。そして、情報処理システム1は、フィッティング関数をスコア関数として用いる。情報処理システム1は、言語ごとにフィッティング関数を算出することで、言語別スコア関数を生成する。 For example, the information processing system 1 derives (generates) a fitting function in which the horizontal axis is the "score threshold" and the vertical axis is the "probability of correct answer (unit: percentage)". Then, the information processing system 1 uses the fitting function as the score function. The information processing system 1 generates a score function for each language by calculating a fitting function for each language.
 そして、情報処理システム1は、言語別スコア関数を用いた関数(プログラム)を生成することで式(1)を生成する。これにより、情報処理システム1は、解析言語(lang)とその言語の意味解析スコア(score)から、意味解析精度(acc)を算出する式(1)に示すスコア関数(プログラム)を生成する。 Then, the information processing system 1 generates the equation (1) by generating a function (program) using the score function for each language. As a result, the information processing system 1 generates the score function (program) shown in the equation (1) for calculating the semantic analysis accuracy (acc) from the analysis language (lang) and the semantic analysis score (score) of the language.
[1-1-3.処理の主な流れ]
 以下、処理の主な流れについて簡単に記載する。
[1-1-3. Main flow of processing]
The main flow of processing will be briefly described below.
 情報処理システム1は、発話入力について以下のような処理を行う。情報処理システム1は、音声認識により発話からテキストへ変換する。ここで入力言語は、例えばスペイン語等、情報処理システム1の対応可能な言語として予め決定している。 The information processing system 1 performs the following processing for utterance input. The information processing system 1 converts utterances into texts by voice recognition. Here, the input language is predetermined as a language that can be supported by the information processing system 1, such as Spanish.
 情報処理システム1は、発話文の自動展開について以下のような処理を行う。まず、前提としては、翻訳器と意味解析器は目的が違う処理である。対訳文のコーパスと、ドメインゴールごとのコーパスとを別々に収集したり、作成したりされ、別々に学習されてモデル化される。そのため、翻訳器のモデルから出力される翻訳文と、意味解析器のモデルが想定している解析文とのギャップが生じる。 The information processing system 1 performs the following processing for the automatic expansion of utterance sentences. First, as a premise, the translator and the semantic analyzer have different purposes. The corpus of bilingual sentences and the corpus of each domain goal are collected and created separately, and are learned and modeled separately. Therefore, there is a gap between the translated sentence output from the translator model and the analysis sentence assumed by the semantic analyzer model.
 そこで、情報処理システム1は、意味フレームの出力が変わらない条件の下で、発話文の言い回しやフレーズを自動的に変えて、疑似発話文リストを生成する。例えば、情報処理システム1は、入力発話文「音楽をかけて」に対し、「音楽を再生して」、「音楽を再生してください」、「ミュージックをかけて」、「ミュージックプレイ」などのパラフレーズを生成する。 Therefore, the information processing system 1 automatically changes the wording and the phrase of the utterance sentence under the condition that the output of the meaning frame does not change, and generates a pseudo utterance sentence list. For example, the information processing system 1 responds to the input utterance "play music" by "play music", "play music", "play music", "music play", and the like. Generate paraphrases.
 情報処理システム1は、入力言語から翻訳による言語展開について以下のような処理を行う。情報処理システム1は、疑似発話文リストを翻訳器への入力として、疑似発話文それぞれに対して、意味解析器が処理可能なすべての言語に翻訳する。情報処理システム1は、一つの発話文から、N個の疑似発話文を生成し、M言語へ翻訳する。この場合、翻訳器から出力される翻訳文リストの要素は、N×M個になる。 The information processing system 1 performs the following processing from the input language to the language development by translation. The information processing system 1 uses the pseudo-utterance sentence list as input to the translator and translates each pseudo-utterance sentence into all languages that can be processed by the semantic analyzer. The information processing system 1 generates N pseudo utterance sentences from one utterance sentence and translates them into M language. In this case, the number of elements of the translated sentence list output from the translator is N × M.
 なお、上述したように、翻訳文リスト中には、入力言語が特定言語である場合、入力言語の文が含まれてもよい。例えば、情報処理システム1は、一つの発話文から、その一つの発話文を含むN個の疑似発話文を生成する。そして、情報処理システム1は、発話文の言語(入力言語)が特定言語である場合、M-1言語へ翻訳し、入力言語を含むM個の言語の各々のN個の文を生成する。この場合、翻訳文リストの要素はN×M個になる。 As described above, when the input language is a specific language, the translation sentence list may include sentences in the input language. For example, the information processing system 1 generates N pseudo utterance sentences including the one utterance sentence from one utterance sentence. Then, when the language (input language) of the utterance sentence is a specific language, the information processing system 1 translates it into the M-1 language and generates N sentences for each of the M languages including the input language. In this case, the number of elements in the translated sentence list is N × M.
 情報処理システム1は、意味解析処理(発話意味解析器)について以下のような処理を行う。情報処理システム1は、すべての言語および発話文展開ごとの発話意味解析処理を行う。情報処理システム1は、意味解析処理により、テキスト(文字列)から、意味フレームを生成する。例えば、情報処理システム1は、意味解析処理(発話意味解析器)の機能により、どのアプリやサービスを実行するかを判断するためのドメインゴールと、アプリやサービスを具体的に実行するために用いられるスロット情報を含めた表形式である意味フレームの情報を生成する。 The information processing system 1 performs the following processing on the semantic analysis processing (speech semantic analyzer). The information processing system 1 performs utterance semantic analysis processing for each language and utterance sentence development. The information processing system 1 generates a semantic frame from a text (character string) by a semantic analysis process. For example, the information processing system 1 is used to determine which application or service to execute by the function of the semantic analysis process (speech semantic analyzer) and to specifically execute the application or service. Generates semantic frame information in tabular form including slot information to be created.
 例えば、情報処理システム1は、図6に示すように、翻訳文リスト(図6では発話展開に対する各言語)の要素に対して、意味解析処理を実行し、意味フレームと解析スコア値を得る。図6の例では、情報処理システム1は、翻訳文リスト中の40個の文に対して、意味解析処理を実行し、意味フレームと解析スコア値を得る。すなわち、図6の例では、情報処理システム1は、発話展開文の数「4」と言語の数「10」とを乗算した40個の文に対して、意味解析処理を実行し、意味フレームと解析スコア値を得る。 For example, as shown in FIG. 6, the information processing system 1 executes a semantic analysis process on an element of a translated sentence list (in each language for utterance development in FIG. 6), and obtains a semantic frame and an analysis score value. In the example of FIG. 6, the information processing system 1 executes a semantic analysis process on 40 sentences in the translated sentence list, and obtains a semantic frame and an analysis score value. That is, in the example of FIG. 6, the information processing system 1 executes a semantic analysis process on 40 sentences obtained by multiplying the number of utterance development sentences "4" by the number of languages "10", and executes a semantic frame. And get the analysis score value.
 情報処理システム1は、スコア値(意味解析スコア)から解析精度(意味解析精度)に変換について以下のような処理を行う。まず、スコア値(意味解析スコア)は推定確率を表現しており、数値が高いほど、統計的に優位性が増す。つまり、スコア値(意味解析スコア)は数値が高いほど、推定精度が高いことを意味する。ここで、同じ言語においては疑似発話文リストから、同一モデルから判断されるため、最もスコア値の高い発話文と意味フレームを選択することが可能である。 The information processing system 1 performs the following processing for conversion from the score value (semantic analysis score) to the analysis accuracy (semantic analysis accuracy). First, the score value (semantic analysis score) expresses the estimated probability, and the higher the value, the more statistically superior it becomes. In other words, the higher the score value (semantic analysis score), the higher the estimation accuracy. Here, in the same language, since it is judged from the same model from the pseudo utterance sentence list, it is possible to select the utterance sentence and the meaning frame having the highest score value.
 しかしながら、言語ごとにモデルが異なるため、言語間でのスコア値を比較することは意味をなさない。そこで、情報処理システム1は、スコア関数から、解析言語とスコア値を入力として、図6のような解析精度(意味解析精度)を算出する。 However, since the model is different for each language, it does not make sense to compare the score values between the languages. Therefore, the information processing system 1 calculates the analysis accuracy (semantic analysis accuracy) as shown in FIG. 6 from the score function by inputting the analysis language and the score value.
 情報処理システム1は、解析言語および意味フレーム選択について以下のような処理を行う。情報処理システム1は、解析精度(意味解析精度)から、最も高い解析精度(意味解析精度)の解析言語、発話文、意味フレームの結果を選択する。 The information processing system 1 performs the following processing for analysis language and semantic frame selection. The information processing system 1 selects the result of the analysis language, utterance sentence, and semantic frame having the highest analysis accuracy (semantic analysis accuracy) from the analysis accuracy (semantic analysis accuracy).
 情報処理システム1は、意味フレームのスロット逆変換について以下のような処理を行う。最大解析精度の意味フレームは、特定言語による解析結果であり、ドメインゴールは言語依存しない抽象化された表現であるため、言語非依存である。一方、スロット情報の値は、特定言語の表現となり、アプリやサービスは一般的にはユーザが発話した入力言語で応答を返す必要がある。そのため、情報処理システム1は、正確にアプリやサービスを実行するために、スロット情報を特定言語から、入力言語に逆変換する場合がある。 The information processing system 1 performs the following processing for the slot inverse transformation of the semantic frame. The semantic frame of the maximum analysis accuracy is the analysis result in a specific language, and the domain goal is a language-independent abstract expression, so it is language-independent. On the other hand, the value of the slot information is expressed in a specific language, and the application or service generally needs to return a response in the input language spoken by the user. Therefore, the information processing system 1 may reversely convert the slot information from the specific language to the input language in order to accurately execute the application or service.
 情報処理システム1は、以下のような方法により逆変換を行う。1つ目の方法として、知識データベース(DB)から単語・フレーズの多言語変換辞書により変換する方法が挙げられる。2つ目の方法として、特定言語への翻訳器による逆翻訳する方法が挙げられる。情報処理システム1は、上述した方法に限らず、種々の方法により逆変換を行ってもよい。 The information processing system 1 performs inverse transformation by the following method. The first method is to convert words / phrases from a knowledge database (DB) using a multilingual conversion dictionary. The second method is to reverse-translate into a specific language with a translator. The information processing system 1 is not limited to the method described above, and may perform inverse transformation by various methods.
 なお、意味フレームのスロット情報は、基本的にはアーティスト名、曲名、地名、デバイス名などであり、文というよりは単語やフレーズであるため、知識DBで辞書に変換対象により精度よく変換することができる。したがって、スロット情報は、辞書により精度よく変換することができる。一方、情報処理システム1は、知識DBにない表記の場合、2つ目の方法の逆翻訳により、特定言語から入力言語への逆翻訳を行い、2段階の変換を行ってもよい。 Note that the slot information of the meaning frame is basically an artist name, song name, place name, device name, etc., and is a word or phrase rather than a sentence. Can be done. Therefore, the slot information can be accurately converted by the dictionary. On the other hand, in the case of a notation not found in the knowledge DB, the information processing system 1 may perform reverse translation from a specific language to an input language by reverse translation of the second method, and perform two-step conversion.
 情報処理システム1は、応答生成について以下のような処理を行う。例えば、情報処理システム1は、意味解析処理(発話意味解析器)が通常の対応言語ではなく、対象外の言語を翻訳器により、疑似的に処理したことがわかるように、応答の際に、入力時、解析時、出力時の3フェーズにて、どの言語で処理したかがわかる形式で出力する。なお、この点の例については図9で説明する。例えば、情報処理システム1は、画像(アイコンなど)、音声(効果音など)、テキスト(言語名/言語コード)等の形式で出力する。 The information processing system 1 performs the following processing for response generation. For example, in the information processing system 1, when responding, it can be seen that the semantic analysis process (speech semantic analyzer) is not a normal supported language but a pseudo-processed language that is not the target by the translator. Output in a format that shows which language was processed in three phases: input, analysis, and output. An example of this point will be described with reference to FIG. For example, the information processing system 1 outputs in the form of an image (icon or the like), a voice (sound effect or the like), a text (language name / language code) or the like.
[1-2.実施形態に係る情報処理システムの構成]
 図2に示す情報処理システム1について説明する。図2に示すように、情報処理システム1は、端末装置10と、情報処理装置100とが含まれる。端末装置10と、情報処理装置100とは所定の通信網(ネットワークN)を介して、有線または無線により通信可能に接続される。図2は、実施形態に係る情報処理システムの構成例を示す図である。なお、図2に示した情報処理システム1には、複数台の端末装置10や、複数台の情報処理装置100が含まれてもよい。例えば、情報処理システム1は、上述した対話システムを実現する。
[1-2. Configuration of information processing system according to the embodiment]
The information processing system 1 shown in FIG. 2 will be described. As shown in FIG. 2, the information processing system 1 includes a terminal device 10 and an information processing device 100. The terminal device 10 and the information processing device 100 are connected to each other via a predetermined communication network (network N) so as to be communicable by wire or wirelessly. FIG. 2 is a diagram showing a configuration example of an information processing system according to an embodiment. The information processing system 1 shown in FIG. 2 may include a plurality of terminal devices 10 and a plurality of information processing devices 100. For example, the information processing system 1 realizes the above-mentioned dialogue system.
 情報処理装置100は、対象言語によるユーザの発話に対応する入力言語文字情報を、翻訳先言語の翻訳文に変換し、意味解析処理を実行するコンピュータである。情報処理装置100は、翻訳先言語に対応する意味解析処理の結果を対象言語に変換する逆変換処理を行う。また、情報処理装置100は、各種情報を端末装置10に送信するコンピュータである。情報処理装置100は、各種機能に関するサービスを提供するために用いられるサーバ装置である。例えば、情報処理装置100は、ユーザに対話システムに関するサービスを提供するために用いられる。情報処理装置100は、ユーザに対話システムに関する各種情報処理を行う。 The information processing device 100 is a computer that converts input language character information corresponding to a user's utterance in the target language into a translated sentence of the translation destination language and executes a semantic analysis process. The information processing device 100 performs an inverse transformation process of converting the result of the semantic analysis process corresponding to the translation destination language into the target language. Further, the information processing device 100 is a computer that transmits various information to the terminal device 10. The information processing device 100 is a server device used to provide services related to various functions. For example, the information processing device 100 is used to provide the user with a service related to the dialogue system. The information processing device 100 performs various information processing related to the dialogue system to the user.
 また、情報処理装置100は、音声信号処理や音声認識や発話意味解析や対話制御等のソフトウェアモジュールを有してもよい。例えば、情報処理装置100は、自然言語理解(NLU:Natural Language Understanding)や自動音声認識(ASR:Automatic Speech Recognition)の機能を有してもよい。例えば、情報処理装置100は、ユーザの発話による入力情報からユーザのインテント(意図)やエンティティ(対象)に関する情報を推定してもよい。情報処理装置100は、自然言語理解や自動音声認識の機能を有する音声認識サーバとして機能する。 Further, the information processing device 100 may have software modules such as voice signal processing, voice recognition, utterance semantic analysis, and dialogue control. For example, the information processing apparatus 100 may have functions of natural language understanding (NLU: Natural Language Understanding) and automatic speech recognition (ASR: Automatic Speech Recognition). For example, the information processing device 100 may estimate information about a user's intent (intention) or entity (target) from input information uttered by the user. The information processing device 100 functions as a voice recognition server having functions of natural language understanding and automatic voice recognition.
 端末装置10は、ユーザの発話を検知し、ユーザの発話の音声等を情報処理装置100等へ送信するコンピュータである。また、端末装置10は、自然言語理解や自動音声認識等の機能を有してもよい。例えば、端末装置10は、ユーザの発話による入力情報からユーザのインテント(意図)やエンティティ(対象)に関する情報を推定してもよい。端末装置10は、ユーザによって利用されるデバイス装置である。端末装置10は、ユーザによる入力を受け付ける。端末装置10は、ユーザの発話による音声入力や、ユーザの操作による入力を受け付ける。端末装置10は、ユーザの入力に応じた情報を表示する。 The terminal device 10 is a computer that detects the user's utterance and transmits the voice of the user's utterance to the information processing device 100 or the like. Further, the terminal device 10 may have functions such as natural language understanding and automatic voice recognition. For example, the terminal device 10 may estimate information about a user's intent (intention) or entity (target) from input information uttered by the user. The terminal device 10 is a device device used by a user. The terminal device 10 accepts input by the user. The terminal device 10 accepts voice input by the user's utterance and input by the user's operation. The terminal device 10 displays information according to the input of the user.
 端末装置10は、ユーザによって利用される情報処理装置である。端末装置10は、ユーザの発話に対して応答を行う対話サービスの提供に用いられる。端末装置10は、マイク等の音を検知する音センサを有する。例えば、端末装置10は、音センサにより、端末装置10の周囲におけるユーザの発話を検知する。例えば、端末装置10は、周囲の音を検知し、検知した音に応じて種々の処理を行うデバイス(音声アシスト端末)であってもよい。端末装置10は、ユーザの発話に対して、処理を行うコンピュータである。 The terminal device 10 is an information processing device used by the user. The terminal device 10 is used to provide a dialogue service that responds to a user's utterance. The terminal device 10 has a sound sensor that detects the sound of a microphone or the like. For example, the terminal device 10 uses a sound sensor to detect a user's utterance around the terminal device 10. For example, the terminal device 10 may be a device (voice assist terminal) that detects ambient sounds and performs various processes according to the detected sounds. The terminal device 10 is a computer that processes a user's utterance.
 端末装置10は、実施形態における処理を実現可能であれば、どのような装置であってもよい。端末装置10は、ユーザの発話を検知し、情報処理装置100へ送信する機能を有する構成であれば、どのような装置であってもよい。端末装置10は、例えば、スマートフォンや、タブレット型端末や、ノート型PC(Personal Computer)や、デスクトップPCや、携帯電話機や、PDA(Personal Digital Assistant)等の装置であってもよい。端末装置10は、ユーザが身に着けるウェアラブル端末(Wearable Device)等であってもよい。例えば、端末装置10は、腕時計型端末やメガネ型端末等であってもよい。また、端末装置10は、テレビや冷蔵庫等のいわゆる家電製品であってもよい。例えば、端末装置10は、スマートスピーカやエンタテインメントロボットや家庭用ロボットと称されるような、人間(ユーザ)と対話するロボットであってもよい。また、端末装置10は、デジタルサイネージ等の所定の位置に配置される装置であってもよい。 The terminal device 10 may be any device as long as the processing in the embodiment can be realized. The terminal device 10 may be any device as long as it has a function of detecting the user's utterance and transmitting it to the information processing device 100. The terminal device 10 may be, for example, a device such as a smartphone, a tablet terminal, a notebook PC (Personal Computer), a desktop PC, a mobile phone, or a PDA (Personal Digital Assistant). The terminal device 10 may be a wearable terminal (Wearable Device) or the like that the user can wear. For example, the terminal device 10 may be a wristwatch-type terminal, a glasses-type terminal, or the like. Further, the terminal device 10 may be a so-called home electric appliance such as a television or a refrigerator. For example, the terminal device 10 may be a robot that interacts with a human (user), such as a smart speaker, an entertainment robot, or a domestic robot. Further, the terminal device 10 may be a device arranged at a predetermined position such as digital signage.
[1-3.実施形態に係る情報処理装置の構成]
 次に、実施形態に係る情報処理を実行する情報処理装置の一例である情報処理装置100の構成について説明する。図3は、本開示の実施形態に係る情報処理装置100の構成例を示す図である。
[1-3. Configuration of Information Processing Device According to Embodiment]
Next, the configuration of the information processing device 100, which is an example of the information processing device that executes the information processing according to the embodiment, will be described. FIG. 3 is a diagram showing a configuration example of the information processing device 100 according to the embodiment of the present disclosure.
 図3に示すように、情報処理装置100は、通信部110と、記憶部120と、制御部130とを有する。なお、情報処理装置100は、情報処理装置100の管理者等から各種操作を受け付ける入力部(例えば、キーボードやマウス等)や、各種情報を表示するための表示部(例えば、液晶ディスプレイ等)を有してもよい。 As shown in FIG. 3, the information processing device 100 includes a communication unit 110, a storage unit 120, and a control unit 130. The information processing device 100 includes an input unit (for example, a keyboard, a mouse, etc.) that receives various operations from the administrator of the information processing device 100, and a display unit (for example, a liquid crystal display, etc.) for displaying various information. You may have.
 通信部110は、例えば、NIC(Network Interface Card)等によって実現される。そして、通信部110は、ネットワークN(図2参照)と有線または無線で接続され、端末装置10等の他の情報処理装置との間で情報の送受信を行う。また、通信部110は、端末装置10との間で情報の送受信を行ってもよい。 The communication unit 110 is realized by, for example, a NIC (Network Interface Card) or the like. Then, the communication unit 110 is connected to the network N (see FIG. 2) by wire or wirelessly, and transmits / receives information to / from another information processing device such as the terminal device 10. Further, the communication unit 110 may send and receive information to and from the terminal device 10.
 記憶部120は、例えば、RAM(Random Access Memory)、フラッシュメモリ(Flash Memory)等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。実施形態に係る記憶部120は、図3に示すように、言語情報記憶部121と、意味フレーム情報記憶部122と、解析精度情報記憶部123と、閾値情報記憶部124と、知識情報記憶部125とを有する。なお、記憶部120は、上記に限らず、種々の情報を記憶する。記憶部120は、各言語に対応する特定言語を示す情報を記憶してもよい。記憶部120は、文字情報の入力に応じて、特定したドメインゴール等の意味フレームの情報と、その精度(確信度)を示すスコア(「意味解析スコア」ともいう)とを出力する意味解析器を記憶する。記憶部120は、意味解析が可能な特定言語ごとの意味解析器の情報を記憶する。例えば、記憶部120は、特定言語である英語の意味解析器や日本語の意味解析器等、特定言語ごとの意味解析器の情報を記憶する。 The storage unit 120 is realized by, for example, a semiconductor memory element such as a RAM (Random Access Memory) or a flash memory (Flash Memory), or a storage device such as a hard disk or an optical disk. As shown in FIG. 3, the storage unit 120 according to the embodiment includes a language information storage unit 121, a semantic frame information storage unit 122, an analysis accuracy information storage unit 123, a threshold information storage unit 124, and a knowledge information storage unit. It has 125 and. The storage unit 120 stores various information, not limited to the above. The storage unit 120 may store information indicating a specific language corresponding to each language. The storage unit 120 is a semantic analyzer that outputs information on a semantic frame such as a specified domain goal and a score (also referred to as a “semantic analysis score”) indicating its accuracy (confidence) in response to input of character information. Remember. The storage unit 120 stores the information of the semantic analyzer for each specific language capable of semantic analysis. For example, the storage unit 120 stores information of a semantic analyzer for each specific language, such as an English semantic analyzer or a Japanese semantic analyzer, which is a specific language.
 実施形態に係る言語情報記憶部121は、言語に関する各種情報を記憶する。例えば、言語情報記憶部121は、情報処理システム1が言語識別(音声認識)可能な言語の各種情報を記憶する。言語情報記憶部121は、各言語が意味解析可能な言語(特定言語)であるかを示す情報や、各言語を翻訳可能な言語(翻訳先言語)を示す情報を記憶する。図4は、実施形態に係る言語情報記憶部の一例を示す図である。図4に示す言語情報記憶部121には、「言語」、「翻訳先言語」といった項目が含まれる。また、「翻訳先言語」には、「#1」、「#2」等といった項目が含まれる。なお、図4では「#1」、「#2」のみを図示するが、「翻訳先言語」には、「#3」、「#4」等、翻訳先言語に対応する数の項目が含まれてもよい。 The language information storage unit 121 according to the embodiment stores various information related to the language. For example, the language information storage unit 121 stores various information in a language in which the information processing system 1 can identify the language (speech recognition). The language information storage unit 121 stores information indicating whether each language is a language capable of semantic analysis (specific language) and information indicating a language capable of translating each language (translation destination language). FIG. 4 is a diagram showing an example of the language information storage unit according to the embodiment. The language information storage unit 121 shown in FIG. 4 includes items such as "language" and "translation destination language". Further, the "translation destination language" includes items such as "# 1" and "# 2". Although only "# 1" and "# 2" are shown in FIG. 4, the "translation destination language" includes a number of items corresponding to the translation destination language such as "# 3" and "# 4". It may be.
 「言語」は、言語を示す。例えば、「言語」は、情報処理システム1が言語識別(音声認識)可能な言語を示す。また、言語を識別するための識別情報(言語ID)が各言語に対応付けて記憶されてもよい。例えば、各言語を識別する言語コードが記憶されてもよい。「翻訳先言語」は、その言語を翻訳可能な言語(翻訳先言語)を示す。例えば、「翻訳先言語」は、その言語を翻訳することができる翻訳先の言語(翻訳先言語)を示す。 "Language" indicates the language. For example, "language" indicates a language in which the information processing system 1 can identify the language (speech recognition). In addition, identification information (language ID) for identifying a language may be stored in association with each language. For example, a language code that identifies each language may be stored. "Translation destination language" indicates a language (translation destination language) capable of translating the language. For example, "translation destination language" indicates a translation destination language (translation destination language) in which the language can be translated.
 図4の例では、言語「英語」は、中国語やヒンディー語等に翻訳可能であることを示す。また、言語「中国語」は、英語やアラビア語等に翻訳可能であることを示す。 In the example of FIG. 4, it is shown that the language "English" can be translated into Chinese, Hindi, etc. In addition, the language "Chinese" indicates that it can be translated into English, Arabic, or the like.
 なお、言語情報記憶部121は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、言語情報記憶部121は、各言語が意味解析可能な言語(特定言語)であるかを示す情報を記憶してもよい。言語情報記憶部121は、各言語が意味解析可能な言語(特定言語)であるかを示すフラグを記憶してもよい。例えば、言語情報記憶部121は、その言語が特定言語である場合をフラグ「1」に対応付け、その言語が特定言語ではない場合をフラグ「0」に対応付けて記憶してもよい。 The language information storage unit 121 is not limited to the above, and may store various information depending on the purpose. For example, the language information storage unit 121 may store information indicating whether each language is a language (specific language) capable of semantic analysis. The language information storage unit 121 may store a flag indicating whether each language is a language (specific language) capable of semantic analysis. For example, the language information storage unit 121 may store the case where the language is a specific language in association with the flag "1" and the case where the language is not a specific language in association with the flag "0".
 この場合、情報処理装置100は、各言語に対応付けられた翻訳先言語から特定言語を抽出してもよい。例えば、情報処理装置100は、各言語に対応付けられた翻訳先言語のうち、フラグが「1」である言語を、特定言語として用いてもよい。 In this case, the information processing apparatus 100 may extract a specific language from the translation destination language associated with each language. For example, the information processing apparatus 100 may use a language whose flag is "1" among the translation destination languages associated with each language as a specific language.
 実施形態に係る意味フレーム情報記憶部122は、意味フレームに関する各種情報を記憶する。意味フレーム情報記憶部122は、言語ごとに意味フレームに関する各種情報を記憶する。意味フレーム情報記憶部122は、各言語に対応する意味フレームに関する情報を記憶する。例えば、意味フレーム情報記憶部122は、各特定言語に対応する意味フレームに関する情報を記憶する。 The semantic frame information storage unit 122 according to the embodiment stores various information related to the semantic frame. The semantic frame information storage unit 122 stores various information related to the semantic frame for each language. The semantic frame information storage unit 122 stores information about the semantic frame corresponding to each language. For example, the semantic frame information storage unit 122 stores information about the semantic frame corresponding to each specific language.
 図5の例では、意味フレーム情報記憶部122は、フレーム情報FM1やフレーム情報FM2等のように特定言語ごとに情報(テーブル)を記憶する。例えば、フレーム情報FM1は、言語「英語」の意味フレームに関する情報を示す。また、例えば、フレーム情報FM2は、言語「中国」の意味フレームに関する情報を示す。 In the example of FIG. 5, the semantic frame information storage unit 122 stores information (table) for each specific language, such as frame information FM1 and frame information FM2. For example, the frame information FM1 indicates information about a semantic frame of the language "English". Further, for example, the frame information FM2 indicates information regarding a semantic frame of the language "China".
 図5に示すフレーム情報FM1やフレーム情報FM2等は、「言語」、「Domain-Goal」、「Slot」といった項目が含まれる。また、「Slot」には、「Attribute」、「Value」といった項目が含まれる。 The frame information FM1 and the frame information FM2 shown in FIG. 5 include items such as "language", "Domain-Goal", and "Slot". Further, "Slot" includes items such as "Attribute" and "Value".
 「言語」は、言語を示す。例えば、「言語」は、情報処理システム1が言語識別(音声認識)可能な言語を示す。また、言語を識別するための識別情報(言語ID)が各言語に対応付けて記憶されてもよい。例えば、各言語を識別する言語コードが記憶されてもよい。 "Language" indicates the language. For example, "language" indicates a language in which the information processing system 1 can identify the language (speech recognition). In addition, identification information (language ID) for identifying a language may be stored in association with each language. For example, a language code that identifies each language may be stored.
 また、「Domain-Goal」は、意味フレームのDomain-Goal(ドメインゴール)を示す。例えば、「Domain-Goal」は、発話の目的(意図)等を示す。 Also, "Domain-Goal" indicates the domain goal of the semantic frame. For example, "Domain-Goal" indicates the purpose (intention) of the utterance.
 「Slot」は、対応するDomain-Goalのスロット(構成要素)に関する各種情報が記憶される。例えば、「Slot」は、対応するドメインゴールに含まれる属性(スロット名)やその値(スロット値)が記憶される。「Attribute」は、スロット(構成要素)の属性(スロット名)を示す。「Value」は、対応する属性(スロット名)のスロット値を示す。なお、意味フレーム情報記憶部122中の「Value」に示す「-(ハイフン)」は、「Value」に値が格納されていないことを示す。なお、「Value」には、ユーザの意味解析の処理に用いられる場合、ユーザの発話に対応して具体的な値(情報)が格納される。 "Slot" stores various information about the corresponding Domain-Goal slot (component). For example, in "Slot", an attribute (slot name) included in the corresponding domain goal and its value (slot value) are stored. "Attribute" indicates an attribute (slot name) of a slot (component). "Value" indicates the slot value of the corresponding attribute (slot name). The “− (hyphen)” indicated by “Value” in the semantic frame information storage unit 122 indicates that the value is not stored in “Value”. When used in the processing of the user's semantic analysis, the "Value" stores a specific value (information) corresponding to the user's utterance.
 図5の例では、言語「英語」には、Domain-Goalが「Weather-Check」や「Music-Play」である意味フレームが含まれることを示す。また、Domain-Goal「Weather-Check」には、「Attribute」が「Date」や「Place」であるSlotが含まれることを示す。すなわち、天気をチェックするDomain-Goal「Weather-Check」には、日時や場所に関するスロットが含まれる。 In the example of FIG. 5, it is shown that the language "English" includes a meaning frame in which Domain-Goal is "Weather-Check" or "Music-Play". Further, it is shown that the Domain-Goal "Weather-Check" includes a slot whose "Attribute" is "Date" or "Place". That is, the Domain-Goal "Weather-Check" for checking the weather includes slots related to the date and time and place.
 なお、意味フレーム情報記憶部122は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、意味フレーム情報記憶部122には、各「Value」に格納される値の形式が記憶されてもよい。例えば、「Value」に記憶される値が、数値か、それ以外の情報(文字列等)であるかを示す情報を記憶されてもよい。例えば、「Value」に記憶される値が、言語に共通して使用可能な情報であるかを示す情報が記憶されてもよい。例えば、「Value」に記憶される値が、数値である場合、言語に共通して使用可能な情報ことを示す情報が記憶されてもよい。 Note that the semantic frame information storage unit 122 is not limited to the above, and various information may be stored depending on the purpose. For example, the semantic frame information storage unit 122 may store the format of the value stored in each “Value”. For example, information indicating whether the value stored in "Value" is a numerical value or other information (character string or the like) may be stored. For example, information indicating whether the value stored in "Value" is information that can be commonly used in the language may be stored. For example, when the value stored in "Value" is a numerical value, information indicating that the information can be commonly used in the language may be stored.
 実施形態に係る解析精度情報記憶部123は、解析精度に関する各種情報を記憶する。解析精度情報記憶部123は、各文字情報に対してスコアや解析精度等に関する各種情報を対応付けて記憶する。解析精度情報記憶部123は、入力言語や翻訳先言語に対応する各文字情報に対してスコアや解析精度等に関する各種情報を対応付けて記憶する。図6は、実施形態に係る解析精度情報記憶部の一例を示す図である。図6に示す解析精度情報記憶部123には、「入力言語」、「発話展開」、「翻訳先特定言語」、「翻訳先翻訳文」、「意味解析スコア」、「意味解析精度(%)」といった項目が含まれる。 The analysis accuracy information storage unit 123 according to the embodiment stores various information related to the analysis accuracy. The analysis accuracy information storage unit 123 stores various information related to the score, analysis accuracy, and the like in association with each character information. The analysis accuracy information storage unit 123 stores various information related to the score, analysis accuracy, and the like in association with each character information corresponding to the input language and the translation destination language. FIG. 6 is a diagram showing an example of the analysis accuracy information storage unit according to the embodiment. In the analysis accuracy information storage unit 123 shown in FIG. 6, "input language", "utterance development", "translation destination specific language", "translation destination translation sentence", "semantic analysis score", "semantic analysis accuracy (%)) ] Is included.
 「入力言語」は、入力可能な言語を示す。例えば、「入力言語」は、情報処理システム1が言語識別(音声認識)可能な言語を示す。また、入力言語を識別するための識別情報(言語ID)が各入力言語に対応付けて記憶されてもよい。例えば、各入力言語を識別する言語コードが記憶されてもよい。 "Input language" indicates a language that can be input. For example, "input language" indicates a language in which the information processing system 1 can identify a language (speech recognition). In addition, identification information (language ID) for identifying the input language may be stored in association with each input language. For example, a language code that identifies each input language may be stored.
 「発話展開」は、ユーザの発話に対応する文字情報やその文字情報を言い換えたパラフレーズを示す。図6の例では、「発話展開」のうち、「ビートルズを再生して」である「A」がユーザの発話に対応する文字情報を示し、残りの「B」~「D」の3つが「A」を言い換えたパラフレーズを示す。 "Utterance development" indicates character information corresponding to the user's utterance and a paraphrase in which the character information is paraphrased. In the example of FIG. 6, among the "utterance development", "A" which is "play the Beatles" indicates the character information corresponding to the user's utterance, and the remaining three "B" to "D" are " Indicates a paraphrase that paraphrases "A".
 図6中の「ビートルズを再生して」である「A」は、「発話展開情報A」と記載する場合がある。例えば、発話展開情報Aは、ユーザの発話テキストのままの文字情報を示す。また、図6中の「ビートルズの曲を再生して」である「B」は、「発話展開情報B」と記載する場合がある。例えば、発話展開情報Bは、発話展開情報Aに「曲を」と対象を明確にし、詳細に言い換えた文字情報を示す。 "A", which is "Play the Beatles" in FIG. 6, may be described as "utterance development information A". For example, the utterance development information A indicates the character information of the user's utterance text as it is. Further, "B" which is "playing the Beatles song" in FIG. 6 may be described as "utterance development information B". For example, the utterance development information B clarifies the target as "song" in the utterance development information A, and indicates character information paraphrased in detail.
 また、図6中の「ビートルズの曲を流してください」である「C」は、「発話展開情報C」と記載する場合がある。図6中の「ビートルズの曲をかけてください」である「D」は、「発話展開情報D」と記載する場合がある。例えば、発話展開情報Cや発話展開情報Dは、発話展開情報Aをより日常で用いられる表現に言い換えた文字情報を示す。なお、図6では、発話展開情報Aを発話展開情報B、発話展開情報C、発話展開情報Dの3つのパラフレーズに言い換えた場合を示すが、パラフレーズの数は、4個以上であってもよいし、1個や2個であってもよい。 In addition, "C", which is "Please play the Beatles song" in Fig. 6, may be described as "Utterance development information C". "D", which is "Please play the Beatles song" in FIG. 6, may be described as "utterance development information D". For example, the utterance development information C and the utterance development information D indicate character information obtained by paraphrasing the utterance development information A into expressions used in more daily life. Note that FIG. 6 shows a case where the utterance development information A is paraphrased into three paraphrases of the utterance development information B, the utterance development information C, and the utterance development information D, but the number of paraphrases is four or more. It may be one or two.
 「翻訳先特定言語」は、翻訳先言語を示す。例えば、「翻訳先特定言語」は、翻訳先となる言語であって特定言語を示す。例えば、「翻訳先特定言語」には、特定言語以外の言語(非特定言語)が記憶される。また、翻訳先特定言語を識別するための識別情報(言語ID)が各翻訳先特定言語に対応付けて記憶されてもよい。例えば、各翻訳先特定言語を識別する言語コードが記憶されてもよい。 "Translation destination specific language" indicates the translation destination language. For example, "translation destination specific language" is a language to be translated and indicates a specific language. For example, a language other than the specific language (non-specific language) is stored in the "translation destination specific language". In addition, identification information (language ID) for identifying the translation destination specific language may be stored in association with each translation destination specific language. For example, a language code that identifies each translation destination specific language may be stored.
 なお、図6の例では、「翻訳先特定言語」には、入力言語も特定言語である場合、入力言語も翻訳先言語とともに一覧で示すが、「翻訳先特定言語」のうち、「入力言語」と同じ言語は、翻訳先言語ではなく、対象言語を示す。 In the example of FIG. 6, when the input language is also a specific language, the input language is also shown in a list together with the translation destination language in the "translation destination specific language". The same language as "indicates the target language, not the target language.
 「翻訳先翻訳文」は、翻訳先特定言語に対応する文字情報を示す。例えば、「翻訳先翻訳文」は、発話展開に示す文字情報が、対応する翻訳先言語に翻訳された文字情報を示す。例えば、「翻訳先翻訳文」のうち、「発話展開」が発話展開情報Aであり、「翻訳先特定言語」が「英語」に対応する行は、日本語の文字情報である発話展開情報Aが英語に翻訳された文字情報を示す。 "Translation destination translation" indicates character information corresponding to the translation destination specific language. For example, the "translation destination translation sentence" indicates the character information in which the character information shown in the utterance development is translated into the corresponding translation destination language. For example, in the "translation destination translation sentence", the line corresponding to "utterance development" is utterance development information A and "translation destination specific language" corresponds to "English" is utterance development information A which is Japanese character information. Indicates the textual information translated into English.
 なお、図6の例では、「翻訳先翻訳文」には、入力言語も特定言語である場合、入力言語も翻訳先言語とともに一覧で示すが、「翻訳先翻訳文」のうち、「入力言語」と同じ言語に対応する文字情報は、翻訳文ではなく、「発話展開」に示す文字次情報と同じ文字情報となる。 In the example of FIG. 6, when the input language is also a specific language, the input language is also shown in a list together with the translation destination language in the "translation destination translation sentence". The character information corresponding to the same language as "" is not the translated sentence, but the same character information as the character-order information shown in "Utterance development".
 「意味解析スコア」は、意味解析の確信度(精度)を示す。「意味解析スコア」は、意味解析処理において特定したドメインゴールの確信度(精度)を示す。 "Semantic analysis score" indicates the degree of certainty (accuracy) of semantic analysis. The "semantic analysis score" indicates the certainty (accuracy) of the domain goal specified in the semantic analysis process.
 「意味解析精度(%)」は、意味解析処理の精度を複数言語間で比較可能にする精度指標値を示す。「意味解析精度(%)」は、「意味解析スコア」の値を所定の関数により変換した指標値(スコア)を示す。 "Semantic analysis accuracy (%)" indicates an accuracy index value that makes it possible to compare the accuracy of semantic analysis processing between multiple languages. The “semantic analysis accuracy (%)” indicates an index value (score) obtained by converting the value of the “semantic analysis score” by a predetermined function.
 図6の例では、入力言語「日本語」が、ユーザの発話に対応する「ビートルズを再生して」である発話展開情報Aや、その発話展開情報Aを言い換えた発話展開情報B~Dの3つパラフレーズに発話展開された場合を示す。このように、情報処理装置100は、ユーザの発話に対応する1つの文字情報を複数のパラフレーズに変換する発話展開を行う。 In the example of FIG. 6, the utterance development information A in which the input language "Japanese" is "playing the Beatles" corresponding to the user's utterance, and the utterance development information B to D in which the utterance development information A is paraphrased. The case where the utterance is expanded into three paraphrases is shown. In this way, the information processing device 100 performs utterance development that converts one character information corresponding to the user's utterance into a plurality of paraphrases.
 また、入力言語(対象言語)である「日本語」の翻訳先特定言語が、英語、スペイン語、フランス語、ドイツ語、イタリア語、中国語(簡体字)、韓国語、ヒンディー語、アラビア語等であることを示す。このように、情報処理装置100は、対象言語「日本語」の文字情報を、英語、スペイン語、フランス語、ドイツ語、イタリア語、中国語(簡体字)、韓国語、ヒンディー語、アラビア語等の文字情報に変換する。 In addition, the translation destination specific language of "Japanese", which is the input language (target language), is English, Spanish, French, German, Italian, Chinese (simplified), Korean, Hindi, Arabic, etc. Indicates that there is. In this way, the information processing apparatus 100 uses the character information of the target language "Japanese" in English, Spanish, French, German, Italian, Chinese (simplified), Korean, Hindi, Arabic, and the like. Convert to textual information.
 また、日本語の発話展開情報Aである文字情報「ビートルズを再生して」は、意味解析スコアが「0.83」であり、意味解析精度が「82.9」であることを示す。また、日本語の発話展開情報Aである「ビートルズを再生して」が英語に翻訳された文字情報「Play the Beatles」は、意味解析スコアが「0.51」であり、意味解析精度が「49.8」であることを示す。 In addition, the character information "Play the Beatles", which is the Japanese utterance development information A, indicates that the semantic analysis score is "0.83" and the semantic analysis accuracy is "82.9". In addition, the character information "Play the Beatles", which is the Japanese utterance development information A "Play the Beatles" translated into English, has a semantic analysis score of "0.51" and a semantic analysis accuracy of "0.51". It shows that it is 49.8 ”.
 なお、解析精度情報記憶部123は、上記に限らず、目的に応じて種々の情報を記憶してもよい。 The analysis accuracy information storage unit 123 is not limited to the above, and may store various information depending on the purpose.
 実施形態に係る閾値情報記憶部124は、閾値に関する各種情報を記憶する。閾値情報記憶部124は、スコアとの比較に用いる閾値に関する各種情報を記憶する。図7は、実施形態に係る閾値情報記憶部の一例を示す図である。図7に示す閾値情報記憶部124には、「閾値ID」、「閾値」といった項目が含まれる。 The threshold information storage unit 124 according to the embodiment stores various information related to the threshold value. The threshold information storage unit 124 stores various information regarding the threshold value used for comparison with the score. FIG. 7 is a diagram showing an example of the threshold information storage unit according to the embodiment. The threshold information storage unit 124 shown in FIG. 7 includes items such as “threshold ID” and “threshold”.
 「閾値ID」は、閾値を識別するための識別情報を示す。また、「閾値」は、対応する閾値IDにより識別される閾値の具体的な値を示す。また、各閾値には、その用途を示す情報が対応付けて記憶される。 "Threshold ID" indicates identification information for identifying the threshold value. Further, the "threshold value" indicates a specific value of the threshold value identified by the corresponding threshold ID. In addition, information indicating its use is stored in association with each threshold value.
 図7の例では、閾値ID「TH1」により識別される閾値TH1の値は、「0.75」であることを示す。また、閾値TH1は、その用途が(例えば翻訳の)品質推定であることを示す情報が対応付けて記憶される。 In the example of FIG. 7, it is shown that the value of the threshold value TH1 identified by the threshold value ID “TH1” is “0.75”. Further, the threshold value TH1 is stored in association with information indicating that its use is quality estimation (for example, translation).
 なお、閾値情報記憶部124は、上記に限らず、目的に応じて種々の情報を記憶してもよい。例えば、閾値情報記憶部124は、閾値の用途を閾値IDに対応付けて記憶してもよい。例えば、閾値情報記憶部124は、閾値ID「TH1」に用途「品質推定」を対応付けて記憶してもよい。 The threshold information storage unit 124 is not limited to the above, and may store various information depending on the purpose. For example, the threshold information storage unit 124 may store the use of the threshold value in association with the threshold value ID. For example, the threshold information storage unit 124 may store the threshold ID “TH1” in association with the use “quality estimation”.
 実施形態に係る知識情報記憶部125は、知識に関する各種情報を記憶する。知識情報記憶部125は、知識データベース(知識DB)として機能する。知識情報記憶部125は、多言語辞書の情報を記憶する。例えば、知識情報記憶部125は、各言語間での各対象を示す文字列の対応関係を示す情報を記憶する。なお、情報処理装置100が外部から知識を取得する場合、情報処理装置100は、知識情報記憶部125を有しなくてもよい。 The knowledge information storage unit 125 according to the embodiment stores various information related to knowledge. The knowledge information storage unit 125 functions as a knowledge database (knowledge DB). The knowledge information storage unit 125 stores information in a multilingual dictionary. For example, the knowledge information storage unit 125 stores information indicating the correspondence between character strings indicating each object between languages. When the information processing device 100 acquires knowledge from the outside, the information processing device 100 does not have to have the knowledge information storage unit 125.
 図3に戻り、説明を続ける。制御部130は、例えば、CPU(Central Processing Unit)やMPU(Micro Processing Unit)等によって、情報処理装置100内部に記憶されたプログラム(例えば、本開示に係る情報処理プログラム等の決定プログラム)がRAM(Random Access Memory)等を作業領域として実行されることにより実現される。また、制御部130は、コントローラ(controller)であり、例えば、ASIC(Application Specific Integrated Circuit)やFPGA(Field Programmable Gate Array)等の集積回路により実現される。 Return to Fig. 3 and continue the explanation. In the control unit 130, for example, a program stored inside the information processing apparatus 100 by a CPU (Central Processing Unit), an MPU (Micro Processing Unit), or the like (for example, a determination program such as an information processing program according to the present disclosure) is stored in a RAM. It is realized by executing (Random Access Memory) etc. as a work area. Further, the control unit 130 is a controller, and is realized by, for example, an integrated circuit such as an ASIC (Application Specific Integrated Circuit) or an FPGA (Field Programmable Gate Array).
 図3に示すように、制御部130は、取得部131と、変換部132と、実行部133と、算出部134と、選択部135と、逆変換部136と、生成部137と、送信部138とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部130の内部構成は、図3に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。また、制御部130が有する各処理部の接続関係は、図3に示した接続関係に限られず、他の接続関係であってもよい。 As shown in FIG. 3, the control unit 130 includes an acquisition unit 131, a conversion unit 132, an execution unit 133, a calculation unit 134, a selection unit 135, an inverse conversion unit 136, a generation unit 137, and a transmission unit. It has 138 and realizes or executes the functions and actions of information processing described below. The internal configuration of the control unit 130 is not limited to the configuration shown in FIG. 3, and may be another configuration as long as it is a configuration for performing information processing described later. Further, the connection relationship of each processing unit included in the control unit 130 is not limited to the connection relationship shown in FIG. 3, and may be another connection relationship.
 取得部131は、各種情報を取得する。取得部131は、外部の情報処理装置から各種情報を取得する。取得部131は、端末装置10から各種情報を取得する。 The acquisition unit 131 acquires various information. The acquisition unit 131 acquires various information from an external information processing device. The acquisition unit 131 acquires various information from the terminal device 10.
 取得部131は、記憶部120から各種情報を取得する。取得部131は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125から各種情報を取得する。 The acquisition unit 131 acquires various information from the storage unit 120. The acquisition unit 131 acquires various information from the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125.
 取得部131は、変換部132が変換した各種情報を取得する。取得部131は、実行部133が実行した各種情報を取得する。取得部131は、算出部134が算出した各種情報を取得する。取得部131は、選択部135が選択した各種情報を取得する。取得部131は、逆変換部136が変換した各種情報を取得する。取得部131は、生成部137が生成した各種情報を取得する。 The acquisition unit 131 acquires various information converted by the conversion unit 132. The acquisition unit 131 acquires various information executed by the execution unit 133. The acquisition unit 131 acquires various information calculated by the calculation unit 134. The acquisition unit 131 acquires various information selected by the selection unit 135. The acquisition unit 131 acquires various information converted by the inverse conversion unit 136. The acquisition unit 131 acquires various information generated by the generation unit 137.
 例えば、取得部131は、モデル(関数)を取得してもよい。例えば、取得部131は、翻訳の品質を推定するモデル(品質推定モデル)を取得する。取得部131は、各種のモデル(関数)を提供する外部の情報処理装置や記憶部120から翻訳の品質を推定するモデル(品質推定モデル)を取得する。 For example, the acquisition unit 131 may acquire a model (function). For example, the acquisition unit 131 acquires a model for estimating the quality of translation (quality estimation model). The acquisition unit 131 acquires a model (quality estimation model) for estimating the translation quality from an external information processing device or a storage unit 120 that provides various models (functions).
 取得部131は、対象言語によるユーザの発話情報を取得する。取得部131は、対象言語によるユーザの発話に対応する文字情報を取得する。 The acquisition unit 131 acquires the user's utterance information in the target language. The acquisition unit 131 acquires character information corresponding to the user's utterance in the target language.
 変換部132は、各種情報を変換する。変換部132は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を変換する。変換部132は、記憶部120から、各種情報を変換する。変換部132は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を変換する。 The conversion unit 132 converts various information. The conversion unit 132 converts various information based on the information from the external information processing device and the information stored in the storage unit 120. The conversion unit 132 converts various information from the storage unit 120. The conversion unit 132 converts various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
 変換部132は、対象言語に対応する文字情報を、対象言語の翻訳先となる翻訳先言語に対応する文字情報に変換する。変換部132は、対象言語に対応する文字情報を、意味解釈可能な言語に対応する文字情報に変換する。 The conversion unit 132 converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language. The conversion unit 132 converts the character information corresponding to the target language into the character information corresponding to the language whose meaning can be interpreted.
 変換部132は、各種情報を決定する。変換部132は、各種情報を判定する。変換部132は、言語識別(音声認識)が可能な言語かを判定する。変換部132は、入力言語が対象外言語かを判定する。変換部132は、対象言語が対象外言語かを判定する。変換部132は、言語識別(音声認識)できない言語を対象外言語であると判定する。変換部132は、入力言語が特定言語かどうかを判定する。変換部132は、対象言語が特定言語かどうかを判定する。 The conversion unit 132 determines various information. The conversion unit 132 determines various information. The conversion unit 132 determines whether the language is capable of language identification (speech recognition). The conversion unit 132 determines whether the input language is a non-target language. The conversion unit 132 determines whether the target language is a non-target language. The conversion unit 132 determines that a language that cannot be language-identified (speech recognition) is a non-target language. The conversion unit 132 determines whether or not the input language is a specific language. The conversion unit 132 determines whether or not the target language is a specific language.
 実行部133は、各種処理を実行する。実行部133は、外部の情報処理装置からの情報に基づいて、各種処理を実行する。実行部133は、記憶部120に記憶された情報に基づいて、各種処理を実行する。実行部133は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種処理を実行する。実行部133は、処理の実行により各種情報を生成する。 Execution unit 133 executes various processes. The execution unit 133 executes various processes based on information from an external information processing device. The execution unit 133 executes various processes based on the information stored in the storage unit 120. The execution unit 133 executes various processes based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. .. The execution unit 133 generates various information by executing the process.
 実行部133は、取得部131により取得された各種情報に基づいて、各種処理を実行する。実行部133は、変換部132により変換された各種情報に基づいて、各種処理を実行する。実行部133は、算出部134が算出した各種情報に基づいて、各種処理を実行する。実行部133は、選択部135が選択した各種情報に基づいて、各種処理を実行する。実行部133は、逆変換部136により変換された各種情報に基づいて、各種処理を実行する。 The execution unit 133 executes various processes based on various information acquired by the acquisition unit 131. The execution unit 133 executes various processes based on various information converted by the conversion unit 132. The execution unit 133 executes various processes based on various information calculated by the calculation unit 134. The execution unit 133 executes various processes based on various information selected by the selection unit 135. The execution unit 133 executes various processes based on various information converted by the inverse conversion unit 136.
 実行部133は、各種情報を決定する。実行部133は、各種情報を判定する。実行部133は、各種処理の実行を決定する。実行部133は、各種処理の実行を判定する。実行部133は、各種情報を特定する。実行部133は、各種情報を推定する。実行部133は、各種情報を抽出する。実行部133は、各種情報を選択する。実行部133は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を抽出する。実行部133は、記憶部120から、各種情報を抽出する。実行部133は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125から、各種情報を抽出する。 Execution unit 133 determines various information. The execution unit 133 determines various information. The execution unit 133 determines the execution of various processes. The execution unit 133 determines the execution of various processes. The execution unit 133 specifies various types of information. The execution unit 133 estimates various information. The execution unit 133 extracts various information. The execution unit 133 selects various information. The execution unit 133 extracts various information based on the information from the external information processing device and the information stored in the storage unit 120. The execution unit 133 extracts various information from the storage unit 120. The execution unit 133 extracts various information from the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125.
 実行部133は、取得部131により取得された各種情報に基づいて、各種情報を抽出する。実行部133は、変換部132により変換された各種情報に基づいて、各種情報を抽出する。実行部133は、算出部134が算出した各種情報に基づいて、各種情報を抽出する。実行部133は、選択部135が選択した各種情報に基づいて、各種情報を抽出する。また、実行部133は、逆変換部136により変換された各種情報に基づいて、各種情報を抽出する。実行部133は、生成部137により生成された情報に基づいて、各種情報を抽出する。 The execution unit 133 extracts various information based on the various information acquired by the acquisition unit 131. The execution unit 133 extracts various information based on the various information converted by the conversion unit 132. The execution unit 133 extracts various information based on the various information calculated by the calculation unit 134. The execution unit 133 extracts various information based on the various information selected by the selection unit 135. Further, the execution unit 133 extracts various information based on the various information converted by the inverse conversion unit 136. The execution unit 133 extracts various information based on the information generated by the generation unit 137.
 実行部133は、ユーザの発話に対応する文字情報を、形態素解析等の自然言語処理技術を適宜用いた解析を実行する。実行部133は、ユーザの発話に対応する文字情報を用いて、意味解析により、ユーザの発話の内容を推定(特定)する。実行部133は、ユーザの発話に対応する文字情報を用いて、対話状態推定(Dialog State Tracking)により、ユーザの状況を推定(特定)してもよい。実行部133は、変換部132により変換された文字情報を、意味解析や対話状態推定を適宜用いて解析することにより、文字情報の内容を推定(特定)する。実行部133は、変換部132により対象言語から翻訳先言語に変換された文字情報を、形態素解析等の自然言語処理技術を適宜用いて解析する。例えば、実行部133は、文字情報を構文解析等の種々の従来技術を適宜用いて解析することにより、文字情報に対応するユーザの発話の内容を推定する。 Execution unit 133 executes analysis of character information corresponding to the user's utterance by appropriately using natural language processing technology such as morphological analysis. The execution unit 133 estimates (identifies) the content of the user's utterance by semantic analysis using the character information corresponding to the user's utterance. The execution unit 133 may estimate (specify) the user's situation by dialogue state estimation (Dialog State Tracking) using the character information corresponding to the user's utterance. The execution unit 133 estimates (identifies) the content of the character information by analyzing the character information converted by the conversion unit 132 by appropriately using semantic analysis and dialogue state estimation. The execution unit 133 analyzes the character information converted from the target language to the translation destination language by the conversion unit 132 by appropriately using a natural language processing technique such as morphological analysis. For example, the execution unit 133 estimates the content of the user's utterance corresponding to the character information by appropriately analyzing the character information by using various conventional techniques such as parsing.
 実行部133は、ユーザの発話を解析することにより、ユーザの発話の意図等の内容を推定する。実行部133は、種々の従来技術を適宜用いてユーザの発話の意図等の内容を推定する。実行部133は、種々の従来技術を適宜用いて、ユーザの発話を解析することにより、ユーザの発話の内容を推定する。実行部133は、ユーザの発話の文字情報から重要なキーワードを抽出し、抽出したキーワードに基づいてユーザの発話の内容を推定する。 The execution unit 133 estimates the content such as the intention of the user's utterance by analyzing the user's utterance. The execution unit 133 estimates the content such as the intention of the user's utterance by appropriately using various conventional techniques. The execution unit 133 estimates the content of the user's utterance by analyzing the user's utterance by appropriately using various conventional techniques. The execution unit 133 extracts important keywords from the character information of the user's utterance, and estimates the content of the user's utterance based on the extracted keywords.
 実行部133は、発話に対応する文字情報を解析することにより、ユーザの発話に対応するDomain-Goal(ドメインゴール)を特定する。実行部133は、発話に対応する文字情報を解析することにより、特定したDomain-Goal(ドメインゴール)に対応するスロット値等の属性情報を推定する。実行部133は、翻訳の品質推定(単に「品質推定」ともいう)を行う。実行部133は、翻訳された文字情報の品質を推定する。実行部133は、適宜の方法により文字情報(翻訳テキスト)の品質推定翻訳精度(品質スコア)を算出する。例えば、実行部133は、翻訳前の文字情報と、翻訳後の文字情報との入力に応じて、その品質推定翻訳精度を示すスコア(品質スコア)を出力するモデル(品質推定モデル)を用いて、品質推定翻訳精度(品質スコア)を算出してもよい。例えば、実行部133は、翻訳前の文字情報、翻訳後の文字情報及びそのスコア(正解スコア)の組合せを学習データとして学習された品質推定モデルを用いて、品質推定翻訳精度(品質スコア)を算出してもよい。実行部133は、翻訳前の文字情報と翻訳後の文字情報とを基に、情報処理システム1の管理者等により設定されたスコア(正解スコア)を含む学習データとして学習された品質推定モデルを用いて、品質推定翻訳精度(品質スコア)を算出してもよい。実行部133は、品質スコアと閾値(例えば0.75等)と比較する。実行部133は、文字情報の品質スコアが閾値以上であれば、その文字情報の品質が高い(ハイスコアである)と判定し、翻訳文の品質推定翻訳精度(品質スコア)が閾値未満であれば、その翻訳文の品質が低い(ロースコアである)と判定する。 The execution unit 133 identifies the Domain-Goal (domain goal) corresponding to the user's utterance by analyzing the character information corresponding to the utterance. The execution unit 133 estimates the attribute information such as the slot value corresponding to the specified Domain-Goal (domain goal) by analyzing the character information corresponding to the utterance. The execution unit 133 performs translation quality estimation (also simply referred to as “quality estimation”). Execution unit 133 estimates the quality of the translated character information. The execution unit 133 calculates the quality estimation translation accuracy (quality score) of the character information (translation text) by an appropriate method. For example, the execution unit 133 uses a model (quality estimation model) that outputs a score (quality score) indicating the quality estimation translation accuracy in response to the input of the character information before translation and the character information after translation. , Quality estimation translation accuracy (quality score) may be calculated. For example, the execution unit 133 uses a quality estimation model learned by using a combination of character information before translation, character information after translation, and its score (correct answer score) as learning data to determine the quality estimation translation accuracy (quality score). It may be calculated. The execution unit 133 uses a quality estimation model learned as learning data including a score (correct answer score) set by the administrator of the information processing system 1 based on the character information before translation and the character information after translation. It may be used to calculate the quality estimation translation accuracy (quality score). Execution unit 133 compares the quality score with the threshold value (for example, 0.75 etc.). If the quality score of the character information is equal to or higher than the threshold value, the execution unit 133 determines that the quality of the character information is high (high score), and if the quality estimation translation accuracy (quality score) of the translated text is less than the threshold value. If so, it is determined that the quality of the translated text is low (low score).
 実行部133は、ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する。実行部133は、対象言語によるユーザの発話に対応する一の文字情報を含む1以上の文字情報に対して意味解析処理を実行する。実行部133は、一の文字情報が対象言語の翻訳先となる翻訳先言語に変換された翻訳文字情報を含む1以上の文字情報に対して意味解析処理を実行する。 Execution unit 133 executes semantic analysis processing for one or more character information corresponding to each of one or more languages including the target language which is the language corresponding to the user's utterance. The execution unit 133 executes a semantic analysis process on one or more character information including one character information corresponding to the user's utterance in the target language. The execution unit 133 executes a semantic analysis process on one or more character information including the translated character information in which one character information is converted into the translation destination language to be the translation destination of the target language.
 実行部133は、対象言語の一の文字情報を対象言語の別の表現に言い換えたパラフレーズを含む1以上の文字情報に対して意味解析処理を実行する。実行部133は、対象言語のパラフレーズが対象言語の翻訳先となる翻訳先言語に変換された翻訳パラフレーズを含む1以上の文字情報に対して意味解析処理を実行する。 Execution unit 133 executes a semantic analysis process on one or more character information including a paraphrase in which one character information of the target language is paraphrased into another expression of the target language. The execution unit 133 executes a semantic analysis process on one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to which the target language is translated.
 実行部133は、変換部132により変換された文字情報に対して意味解析処理を実行する。実行部133は、対象言語に対応する精度指標値が所定値以上である場合、処理コストを削減するために、対象言語以外の言語の意味解析処理を実行しない。実行部133は、対象言語に対応する精度指標値が所定値以上である場合、対象言語が言語識別可能である場合、対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する。 The execution unit 133 executes a semantic analysis process on the character information converted by the conversion unit 132. When the accuracy index value corresponding to the target language is equal to or higher than a predetermined value, the execution unit 133 does not execute the semantic analysis process of a language other than the target language in order to reduce the processing cost. When the accuracy index value corresponding to the target language is equal to or higher than a predetermined value, and when the target language is language identifiable, the execution unit 133 provides one or more character information corresponding to each of the one or more languages including the target language. On the other hand, the semantic analysis process is executed.
 実行部133は、文字情報の翻訳の品質を示す品質スコアが品質閾値より低い場合、その文字情報を用いた意味解析処理を実行しない。実行部133は、人による後編集が所定の時間内に完了する場合、後編集により生成された文字情報を用いて、意味解析処理を実行する。実行部133は、人による後編集が所定の時間内に完了しない場合、処理中断に関する処理を実行する。実行部133は、処理中断を行うことを通知する処理を実行する。実行部133は、ユーザの発話に対応する言語が意味解析処理を実行可能である場合、ユーザの発話した対象言語の文字情報に対して意味解析処理を実行する。 If the quality score indicating the translation quality of the character information is lower than the quality threshold value, the execution unit 133 does not execute the semantic analysis process using the character information. When the post-editing by a person is completed within a predetermined time, the execution unit 133 executes the semantic analysis process using the character information generated by the post-editing. When the post-editing by a person is not completed within a predetermined time, the execution unit 133 executes the process related to the process interruption. The execution unit 133 executes a process of notifying that the process is interrupted. When the language corresponding to the user's utterance can execute the semantic analysis process, the execution unit 133 executes the semantic analysis process on the character information of the target language spoken by the user.
 実行部133は、各言語の意味解析器を用いて言語ごとに意味解析処理を実行する。実行部133は、文字情報の入力に応じて、特定したドメインゴール等の意味フレームの情報と、その精度(確信度)を示すスコア(意味解析スコア)とを出力する意味解析器を用いて、意味解析処理を実行する。 Execution unit 133 executes the semantic analysis process for each language using the semantic analyzer of each language. The execution unit 133 uses a semantic analyzer that outputs information on a semantic frame such as a specified domain goal and a score (semantic analysis score) indicating its accuracy (confidence) in response to input of character information. Executes semantic analysis processing.
 実行部133は、意味解析の精度を示すスコア(意味解析スコア)を算出する。実行部133は、意味解析処理において特定したドメインゴールの確信度(精度)を示す意味解析スコアを算出する。実行部133は、意味解析処理に用いる意味解析器が出力するスコアを意味解析スコアとして用いてもよい。実行部133は、各言語の意味解析処理に用いる言語ごとの意味解析器が出力するスコアを各言語の意味解析スコアとして用いる。なお、実行部133は、種々の技術を適宜用いて、意味解析スコアを算出してもよい。 The execution unit 133 calculates a score (semantic analysis score) indicating the accuracy of the semantic analysis. The execution unit 133 calculates a semantic analysis score indicating the certainty (accuracy) of the domain goal specified in the semantic analysis process. The execution unit 133 may use the score output by the semantic analyzer used for the semantic analysis process as the semantic analysis score. The execution unit 133 uses the score output by the semantic analyzer for each language used for the semantic analysis processing of each language as the semantic analysis score of each language. The execution unit 133 may calculate the semantic analysis score by appropriately using various techniques.
 算出部134は、各種情報を算出する。算出部134は、各種の値を算出する。算出部134は、各種スコアを算出する。例えば、算出部134は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を算出する。算出部134は、端末装置10等の他の情報処理装置からの情報に基づいて、各種情報を算出する。算出部134は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を算出する。 The calculation unit 134 calculates various information. The calculation unit 134 calculates various values. The calculation unit 134 calculates various scores. For example, the calculation unit 134 calculates various types of information based on information from an external information processing device and information stored in the storage unit 120. The calculation unit 134 calculates various types of information based on information from other information processing devices such as the terminal device 10. The calculation unit 134 calculates various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
 算出部134は、取得部131により取得された各種情報に基づいて、各種情報を算出する。算出部134は、変換部132により変換された各種情報に基づいて、各種情報を算出する。算出部134は、実行部133により実行された各種処理に基づいて、各種情報を算出する。算出部134は、実行部133により実行された意味解析の結果に基づいて、各種情報を算出する。算出部134は、選択部135により選択された各種情報に基づいて、各種情報を算出する。算出部134は、逆変換部136により変換された各種情報に基づいて、各種情報を算出する。算出部134は、生成部137により生成された各種情報に基づいて、各種情報を算出する。 The calculation unit 134 calculates various information based on various information acquired by the acquisition unit 131. The calculation unit 134 calculates various information based on the various information converted by the conversion unit 132. The calculation unit 134 calculates various information based on various processes executed by the execution unit 133. The calculation unit 134 calculates various information based on the result of the semantic analysis executed by the execution unit 133. The calculation unit 134 calculates various information based on the various information selected by the selection unit 135. The calculation unit 134 calculates various information based on the various information converted by the inverse conversion unit 136. The calculation unit 134 calculates various information based on various information generated by the generation unit 137.
 算出部134は、1以上の文字情報の各々に対応する意味解析処理の結果に基づいて、1以上の文字情報の各々に対して、意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する。算出部134は、意味解析処理の結果の情報を入力として精度指標値を出力する関数を用いて、1以上の文字情報の各々の精度指標値を算出する。算出部134は、意味解析処理の結果に含まれるスコアを入力として精度指標値を出力する関数を用いて、1以上の文字情報の各々の精度指標値を算出する。算出部134は、意味解析処理の結果に含まれるスコア(意味解析スコア)を入力として精度指標値(意味解析精度)を出力する関数を用いて、1以上の文字情報の各々の精度指標値を算出する。算出部134は、一の言語の文字情報に対する意味解析処理のスコアと、一の言語を示す情報とを入力として精度指標値を出力する関数を用いて、一の言語の文字情報の精度指標値を算出する。算出部134は、対象言語の文字情報に対応する精度指標値を算出する。 The calculation unit 134 makes it possible to compare the accuracy of the semantic analysis processing for each of the one or more character information between a plurality of languages based on the result of the semantic analysis processing corresponding to each of the one or more character information. Calculate the index value. The calculation unit 134 calculates each accuracy index value of one or more character information by using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process. The calculation unit 134 calculates each accuracy index value of one or more character information by using a function that outputs an accuracy index value by inputting a score included in the result of the semantic analysis process. The calculation unit 134 uses a function that inputs a score (semantic analysis score) included in the result of the semantic analysis process and outputs an accuracy index value (semantic analysis accuracy) to obtain each accuracy index value of one or more character information. calculate. The calculation unit 134 uses a function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating one language, and the accuracy index value of the character information of one language. Is calculated. The calculation unit 134 calculates the accuracy index value corresponding to the character information of the target language.
 選択部135は、各種情報を選択する。選択部135は、各種情報を抽出する。選択部135は、各種情報を特定する。選択部135は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を選択する。選択部135は、端末装置10等の他の情報処理装置からの情報に基づいて、各種情報を選択する。選択部135は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を選択する。 The selection unit 135 selects various information. The selection unit 135 extracts various information. The selection unit 135 specifies various types of information. The selection unit 135 selects various information based on the information from the external information processing device and the information stored in the storage unit 120. The selection unit 135 selects various types of information based on information from other information processing devices such as the terminal device 10. The selection unit 135 selects various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
 選択部135は、取得部131により取得された各種情報に基づいて、各種情報を選択する。選択部135は、変換部132により変換された各種情報に基づいて、各種情報を選択する。選択部135は、実行部133により実行された各種処理に基づいて、各種情報を選択する。選択部135は、実行部133により実行された意味解析の結果に基づいて、各種情報を選択する。選択部135は、算出部134により算出された各種情報に基づいて、各種情報を選択する。選択部135は、逆変換部136により変換された各種情報に基づいて、各種情報を選択する。選択部135は、生成部137により生成された各種情報に基づいて、各種情報を選択する。 The selection unit 135 selects various information based on various information acquired by the acquisition unit 131. The selection unit 135 selects various information based on the various information converted by the conversion unit 132. The selection unit 135 selects various information based on various processes executed by the execution unit 133. The selection unit 135 selects various information based on the result of the semantic analysis executed by the execution unit 133. The selection unit 135 selects various information based on the various information calculated by the calculation unit 134. The selection unit 135 selects various information based on the various information converted by the inverse conversion unit 136. The selection unit 135 selects various information based on the various information generated by the generation unit 137.
 選択部135は、算出部134により算出された1以上の文字情報の各々の精度指標値に基づいて、1以上の文字情報のうち、処理に用いる文字情報である処理対象文字情報を選択する。選択部135は、精度指標値が最大である文字情報を処理対象文字情報として選択する。選択部135は、対象言語の文字情報に対応する精度指標値が所定値以上である場合、対象言語の文字情報を処理対象文字情報として選択する。 The selection unit 135 selects the processing target character information, which is the character information used for processing, from the one or more character information based on each accuracy index value of the one or more character information calculated by the calculation unit 134. The selection unit 135 selects the character information having the maximum accuracy index value as the character information to be processed. When the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the selection unit 135 selects the character information of the target language as the character information to be processed.
 逆変換部136は、各種情報を変換する。例えば、逆変換部136は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を変換する。逆変換部136は、端末装置10等の他の情報処理装置からの情報に基づいて、各種情報を変換する。逆変換部136は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を変換する。 The inverse conversion unit 136 converts various information. For example, the inverse conversion unit 136 converts various information based on the information from the external information processing device and the information stored in the storage unit 120. The inverse conversion unit 136 converts various information based on the information from other information processing devices such as the terminal device 10. The inverse conversion unit 136 converts various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. do.
 逆変換部136は、取得部131により取得された各種情報に基づいて、各種情報を変換する。逆変換部136は、変換部132により変換された各種情報に基づいて、各種情報を変換する。逆変換部136は、実行部133により算出された各種情報に基づいて、各種情報を変換する。逆変換部136は、算出部134が算出した各種情報に基づいて、各種情報を変換する。逆変換部136は、選択部135が選択した各種情報に基づいて、各種情報を変換する。逆変換部136は、生成部137により生成された各種情報に基づいて、各種情報を変換する。逆変換部136は、変換に基づいて、各種情報を変更する。取得部131により取得された情報に基づいて、各種情報を更新する。 The inverse conversion unit 136 converts various information based on various information acquired by the acquisition unit 131. The inverse conversion unit 136 converts various information based on the various information converted by the conversion unit 132. The inverse conversion unit 136 converts various information based on the various information calculated by the execution unit 133. The inverse conversion unit 136 converts various information based on the various information calculated by the calculation unit 134. The inverse conversion unit 136 converts various information based on the various information selected by the selection unit 135. The inverse conversion unit 136 converts various information based on the various information generated by the generation unit 137. The inverse conversion unit 136 changes various information based on the conversion. Various information is updated based on the information acquired by the acquisition unit 131.
 逆変換部136は、選択部135により選択された処理対象文字情報の言語に対応する意味解析処理の結果を対象言語に変換する。逆変換部136は、処理対象文字情報の言語に対応する意味解析処理の結果を対象言語に変換する。逆変換部136は、処理対象文字情報の言語が対象言語以外である場合、意味解析処理の結果を対象言語に変換する。逆変換部136は、意味解析処理の結果のうち一部を対象言語に変換する。逆変換部136は、意味解析処理の結果のうちスロット値を対象言語に変換する。 The inverse transformation unit 136 converts the result of the semantic analysis processing corresponding to the language of the processing target character information selected by the selection unit 135 into the target language. The inverse transformation unit 136 converts the result of the semantic analysis processing corresponding to the language of the processing target character information into the target language. When the language of the character information to be processed is other than the target language, the inverse transformation unit 136 converts the result of the semantic analysis process into the target language. The inverse transformation unit 136 converts a part of the result of the semantic analysis process into the target language. The inverse conversion unit 136 converts the slot value of the result of the semantic analysis process into the target language.
 逆変換部136は、各種情報を決定する。逆変換部136は、各種情報を判定する。逆変換部136は、各種処理の実行を判定する。逆変換部136は、逆変換の実行要否を判定する。逆変換部136は、逆変換が不要な情報については、対象言語に変換しなくてもよい。逆変換部136は、意味解析処理の結果のうち、逆変換が不要な情報については、対象言語に変換しなくてもよい。逆変換部136は、意味解析処理の結果のうち、言語に共通する情報については、対象言語に変換しなくてもよい。逆変換部136は、スロット値のうち、数値等の言語に共通する情報については、対象言語に変換しなくてもよい。 The inverse conversion unit 136 determines various information. The inverse conversion unit 136 determines various information. The inverse conversion unit 136 determines the execution of various processes. The inverse transformation unit 136 determines whether or not the inverse transformation needs to be executed. The inverse transformation unit 136 does not have to convert the information that does not require inverse transformation into the target language. The inverse transformation unit 136 does not have to convert the information that does not require inverse transformation among the results of the semantic analysis process into the target language. The inverse transformation unit 136 does not have to convert the information common to the languages among the results of the semantic analysis processing into the target language. The inverse conversion unit 136 does not have to convert the information common to the languages such as numerical values among the slot values into the target language.
 生成部137は、各種情報を生成する。生成部137は、外部の情報処理装置からの情報や記憶部120に記憶された情報に基づいて、各種情報を生成する。生成部137は、端末装置10等の他の情報処理装置からの情報に基づいて、各種情報を生成する。生成部137は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を生成する。 Generation unit 137 generates various information. The generation unit 137 generates various information based on the information from the external information processing device and the information stored in the storage unit 120. The generation unit 137 generates various information based on the information from other information processing devices such as the terminal device 10. The generation unit 137 generates various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
 生成部137は、取得部131により取得された各種情報に基づいて、各種情報を生成する。生成部137は、変換部132により変換された各種情報に基づいて、各種情報を生成する。生成部137は、実行部133の処理実行により生成された各種情報に基づいて、各種情報を生成する。生成部137は、算出部134が算出した各種情報に基づいて、各種情報を生成する。生成部137は、選択部135が選択した各種情報に基づいて、各種情報を生成する。生成部137は、逆変換部136により変換された各種情報に基づいて、各種情報を生成する。 The generation unit 137 generates various information based on the various information acquired by the acquisition unit 131. The generation unit 137 generates various information based on the various information converted by the conversion unit 132. The generation unit 137 generates various information based on various information generated by the processing execution of the execution unit 133. The generation unit 137 generates various information based on the various information calculated by the calculation unit 134. The generation unit 137 generates various information based on the various information selected by the selection unit 135. The generation unit 137 generates various information based on the various information converted by the inverse conversion unit 136.
 生成部137は、種々の技術を適宜用いて、外部の情報処理装置へ提供する画面(画像情報)等の種々の情報を生成する。生成部137は、端末装置10へ提供する画面(画像情報)等を生成する。例えば、生成部137は、記憶部120に記憶された情報に基づいて、端末装置10へ提供する画面(画像情報)等を生成する。 The generation unit 137 appropriately uses various techniques to generate various information such as a screen (image information) to be provided to an external information processing device. The generation unit 137 generates a screen (image information) or the like to be provided to the terminal device 10. For example, the generation unit 137 generates a screen (image information) or the like to be provided to the terminal device 10 based on the information stored in the storage unit 120.
 生成部137は、外部の情報処理装置へ提供する画面(画像情報)等が生成可能であれば、どのような処理により画面(画像情報)等を生成してもよい。例えば、生成部137は、画像生成や画像処理等に関する種々の技術を適宜用いて、端末装置10へ提供する画面(画像情報)を生成する。例えば、生成部137は、Java(登録商標)等の種々の技術を適宜用いて、端末装置10へ提供する画面(画像情報)を生成する。なお、生成部137は、CSSやJavaScript(登録商標)やHTMLの形式に基づいて、端末装置10へ提供する画面(画像情報)を生成してもよい。また、例えば、生成部137は、JPEG(Joint Photographic Experts Group)やGIF(Graphics Interchange Format)やPNG(Portable Network Graphics)など様々な形式で画面(画像情報)を生成してもよい。 The generation unit 137 may generate the screen (image information) or the like by any process as long as the screen (image information) or the like to be provided to the external information processing device can be generated. For example, the generation unit 137 generates a screen (image information) to be provided to the terminal device 10 by appropriately using various techniques related to image generation, image processing, and the like. For example, the generation unit 137 appropriately uses various techniques such as Java (registered trademark) to generate a screen (image information) to be provided to the terminal device 10. The generation unit 137 may generate a screen (image information) to be provided to the terminal device 10 based on the format of CSS, Javascript (registered trademark), or HTML. Further, for example, the generation unit 137 may generate a screen (image information) in various formats such as JPEG (Joint Photographic Experts Group), GIF (Graphics Interchange Format), and PNG (Portable Network Graphics).
 例えば、生成部137は、翻訳前の文字情報、翻訳後の文字情報及びそのスコア(正解スコア)の組合せを含む学習データを用いて、品質推定モデルを生成する。生成部137は、翻訳前の文字情報と翻訳後の文字情報とを基に、情報処理システム1の管理者等により設定されたスコア(正解スコア)を含む学習データを用いて、品質推定モデルを生成する。 For example, the generation unit 137 generates a quality estimation model using learning data including a combination of character information before translation, character information after translation, and a score (correct answer score) thereof. The generation unit 137 uses learning data including a score (correct answer score) set by the administrator of the information processing system 1 based on the character information before translation and the character information after translation to generate a quality estimation model. Generate.
 送信部138は、各種情報を送信する。送信部138は、外部の情報処理装置へ各種情報を送信する。送信部138は、外部の情報処理装置へ各種情報を提供する。例えば、送信部138は、端末装置10等の他の情報処理装置へ各種情報を送信する。送信部138は、記憶部120に記憶された情報を提供する。送信部138は、記憶部120に記憶された情報を送信する。 The transmission unit 138 transmits various information. The transmission unit 138 transmits various information to an external information processing device. The transmission unit 138 provides various information to an external information processing device. For example, the transmission unit 138 transmits various information to another information processing device such as the terminal device 10. The transmission unit 138 provides the information stored in the storage unit 120. The transmission unit 138 transmits the information stored in the storage unit 120.
 送信部138は、端末装置10等の他の情報処理装置からの情報に基づいて、各種情報を提供する。送信部138は、記憶部120に記憶された情報に基づいて、各種情報を提供する。送信部138は、言語情報記憶部121や意味フレーム情報記憶部122や解析精度情報記憶部123や閾値情報記憶部124や知識情報記憶部125に記憶された情報に基づいて、各種情報を提供する。 The transmission unit 138 provides various types of information based on information from other information processing devices such as the terminal device 10. The transmission unit 138 provides various information based on the information stored in the storage unit 120. The transmission unit 138 provides various information based on the information stored in the language information storage unit 121, the semantic frame information storage unit 122, the analysis accuracy information storage unit 123, the threshold information storage unit 124, and the knowledge information storage unit 125. ..
 送信部138は、端末装置10に実行させる機能を示す情報を端末装置10に送信する。送信部138は、選択部135により選択された機能(サービス)を示す情報を端末装置10に送信する。送信部138は、実行部133による指示に応じて、端末装置10に各種の情報を送信する。送信部138は、端末装置10に機能(サービス)の実行を指示する情報を送信する。送信部138は、生成部137によって生成された画像情報を送信する。 The transmission unit 138 transmits information indicating a function to be executed by the terminal device 10 to the terminal device 10. The transmission unit 138 transmits information indicating the function (service) selected by the selection unit 135 to the terminal device 10. The transmission unit 138 transmits various information to the terminal device 10 in response to an instruction from the execution unit 133. The transmission unit 138 transmits information instructing the terminal device 10 to execute a function (service). The transmission unit 138 transmits the image information generated by the generation unit 137.
[1-4.実施形態に係る端末装置の構成]
 次に、実施形態に係る情報処理を実行する情報処理装置の一例である端末装置10の構成について説明する。図8は、本開示の実施形態に係る端末装置の構成例を示す図である。
[1-4. Configuration of terminal device according to embodiment]
Next, the configuration of the terminal device 10 which is an example of the information processing device that executes the information processing according to the embodiment will be described. FIG. 8 is a diagram showing a configuration example of the terminal device according to the embodiment of the present disclosure.
 図8に示すように、端末装置10は、通信部11と、入力部12と、出力部13と、記憶部14と、制御部15と、センサ部16と、表示部17とを有する。 As shown in FIG. 8, the terminal device 10 includes a communication unit 11, an input unit 12, an output unit 13, a storage unit 14, a control unit 15, a sensor unit 16, and a display unit 17.
 通信部11は、例えば、NICや通信回路等によって実現される。通信部11は、ネットワークN(インターネット等)と有線又は無線で接続され、ネットワークNを介して、情報処理装置100等の他の装置等との間で情報の送受信を行う。 The communication unit 11 is realized by, for example, a NIC or a communication circuit. The communication unit 11 is connected to the network N (Internet or the like) by wire or wirelessly, and transmits / receives information to / from other devices such as the information processing device 100 via the network N.
 入力部12は、各種入力を受け付ける。入力部12は、センサ部16による検知を入力として受け付ける。入力部12は、ユーザの発話情報の入力を受け付ける。入力部12は、ユーザの身体動作による入力を受け付ける。入力部12は、ユーザのジェスチャや視線を入力として受け付ける。 The input unit 12 accepts various inputs. The input unit 12 receives the detection by the sensor unit 16 as an input. The input unit 12 accepts the input of the user's utterance information. The input unit 12 accepts input by the user's physical movement. The input unit 12 accepts the user's gesture and line of sight as input.
 入力部12は、ユーザから各種操作が入力される。入力部12は、音声を検知する機能を有するセンサ部16により音を入力として受け付ける。入力部12は、音声を検知するマイク(音センサ)により検知された音声情報を入力情報として受け付ける。入力部12は、ユーザの発話による音声を入力情報として受け付ける。 Various operations are input from the user to the input unit 12. The input unit 12 receives sound as input by the sensor unit 16 having a function of detecting voice. The input unit 12 receives the voice information detected by the microphone (sound sensor) that detects the voice as the input information. The input unit 12 receives the voice spoken by the user as input information.
 また、入力部12は、ユーザが利用する端末装置10への操作(ユーザ操作)をユーザによる操作入力として受け付けてもよい。入力部12は、通信部11を介して、リモコン(リモートコントローラー:remote controller)を用いたユーザの操作に関する情報を受け付けてもよい。また、入力部12は、端末装置10に設けられたボタンや、端末装置10に接続されたキーボードやマウスを有してもよい。 Further, the input unit 12 may accept an operation (user operation) on the terminal device 10 used by the user as an operation input by the user. The input unit 12 may receive information regarding the operation of the user using the remote controller (remote controller) via the communication unit 11. Further, the input unit 12 may have a button provided on the terminal device 10 or a keyboard or mouse connected to the terminal device 10.
 例えば、入力部12は、リモコンやキーボードやマウスと同等の機能を実現できるタッチパネルを有してもよい。この場合、入力部12は、表示部17を介して各種情報が入力される。入力部12は、各種センサにより実現されるタッチパネルの機能により、表示画面を介してユーザから各種操作を受け付ける。すなわち、入力部12は、端末装置10の表示部17を介してユーザから各種操作を受け付ける。例えば、入力部12は、端末装置10の表示部17を介してユーザの指定操作等の操作を受け付ける。例えば、入力部12は、タッチパネルの機能によりユーザの操作を受け付ける受付部として機能する。この場合、入力部12と受付部153とは一体であってもよい。なお、入力部12によるユーザの操作の検知方式には、タブレット端末では主に静電容量方式が採用されるが、他の検知方式である抵抗膜方式、表面弾性波方式、赤外線方式、電磁誘導方式など、ユーザの操作を検知できタッチパネルの機能が実現できればどのような方式を採用してもよい。 For example, the input unit 12 may have a touch panel capable of realizing functions equivalent to those of a remote controller, a keyboard, and a mouse. In this case, various information is input to the input unit 12 via the display unit 17. The input unit 12 receives various operations from the user via the display screen by the function of the touch panel realized by various sensors. That is, the input unit 12 receives various operations from the user via the display unit 17 of the terminal device 10. For example, the input unit 12 receives an operation such as a user's designated operation via the display unit 17 of the terminal device 10. For example, the input unit 12 functions as a reception unit that receives a user's operation by the function of the touch panel. In this case, the input unit 12 and the reception unit 153 may be integrated. As the detection method of the user's operation by the input unit 12, the capacitance method is mainly adopted in the tablet terminal, but other detection methods such as the resistance film method, the surface acoustic wave method, the infrared method, and the electromagnetic induction method are used. Any method may be adopted as long as the user's operation can be detected and the touch panel function can be realized.
 例えば、入力部12は、ユーザの発話を入力として受け付ける。入力部12は、センサ部16により検知されたユーザの発話を入力として受け付ける。入力部12は、センサ部16の音センサにより検知されたユーザの発話を入力として受け付ける。 For example, the input unit 12 accepts a user's utterance as an input. The input unit 12 receives the user's utterance detected by the sensor unit 16 as input. The input unit 12 receives the user's utterance detected by the sound sensor of the sensor unit 16 as an input.
 出力部13は、各種情報を出力する。出力部13は、音声を出力する機能を有する。例えば、出力部13は、音声を出力するスピーカーを有する。出力部13は、実行部152による制御に応じて、各種情報を音声出力する。出力部13は、ユーザに対して音声による情報の出力を行う。出力部13は、表示部17に表示される情報を音声により出力する。 The output unit 13 outputs various information. The output unit 13 has a function of outputting audio. For example, the output unit 13 has a speaker that outputs sound. The output unit 13 outputs various information by voice according to the control by the execution unit 152. The output unit 13 outputs information by voice to the user. The output unit 13 outputs the information displayed on the display unit 17 by voice.
 記憶部14は、例えば、RAM、フラッシュメモリ等の半導体メモリ素子、または、ハードディスク、光ディスク等の記憶装置によって実現される。記憶部14は、情報の表示に用いる各種情報を記憶する。 The storage unit 14 is realized by, for example, a semiconductor memory element such as a RAM or a flash memory, or a storage device such as a hard disk or an optical disk. The storage unit 14 stores various information used for displaying the information.
 図8に戻り、説明を続ける。制御部15は、例えば、CPUやMPU等によって、端末装置10内部に記憶されたプログラム(例えば、本開示に係る情報処理プログラム等の表示プログラム)がRAM等を作業領域として実行されることにより実現される。また、制御部15は、コントローラであり、例えば、ASICやFPGA等の集積回路により実現されてもよい。 Return to Fig. 8 and continue the explanation. The control unit 15 is realized by, for example, a CPU, an MPU, or the like executing a program stored inside the terminal device 10 (for example, a display program such as an information processing program according to the present disclosure) with a RAM or the like as a work area. Will be done. Further, the control unit 15 is a controller, and may be realized by an integrated circuit such as an ASIC or FPGA.
 図8に示すように、制御部15は、受信部151と、実行部152と、受付部153と、送信部154とを有し、以下に説明する情報処理の機能や作用を実現または実行する。なお、制御部15の内部構成は、図8に示した構成に限られず、後述する情報処理を行う構成であれば他の構成であってもよい。 As shown in FIG. 8, the control unit 15 includes a reception unit 151, an execution unit 152, a reception unit 153, and a transmission unit 154, and realizes or executes the information processing functions and operations described below. .. The internal configuration of the control unit 15 is not limited to the configuration shown in FIG. 8, and may be another configuration as long as it is a configuration for performing information processing described later.
 受信部151は、各種情報を受信する。受信部151は、外部の情報処理装置から各種情報を受信する。受信部151は、情報処理装置100等の他の情報処理装置から各種情報を受信する。 The receiving unit 151 receives various information. The receiving unit 151 receives various information from an external information processing device. The receiving unit 151 receives various information from other information processing devices such as the information processing device 100.
 受信部151は、情報処理装置100から機能(サービス)の実行を指示する情報を受信する。受信部151は、情報処理装置100から各種機能(サービス)の実行指示を受信する。例えば、受信部151は、情報処理装置100から機能(サービス)を指定する情報を機能の実行指示として受信する。受信部151は、コンテンツを受信する。受信部151は、情報処理装置100から表示するコンテンツを受信する。 The receiving unit 151 receives information instructing the execution of a function (service) from the information processing device 100. The receiving unit 151 receives execution instructions of various functions (services) from the information processing device 100. For example, the receiving unit 151 receives information specifying a function (service) from the information processing device 100 as a function execution instruction. The receiving unit 151 receives the content. The receiving unit 151 receives the content to be displayed from the information processing device 100.
 実行部152は、各種処理を実行する。実行部152は、各種処理の実行を決定する。実行部152は、外部の情報処理装置からの情報に基づいて、各種処理を実行する。実行部152は、情報処理装置100からの情報に基づいて、各種処理を実行する。実行部152は、情報処理装置100からの指示に応じて、各種処理を実行する。実行部152は、記憶部14に記憶された情報に基づいて、各種処理を実行する。実行部152は、機能(サービス)を実行する。 Execution unit 152 executes various processes. The execution unit 152 determines the execution of various processes. The execution unit 152 executes various processes based on information from an external information processing device. The execution unit 152 executes various processes based on the information from the information processing device 100. The execution unit 152 executes various processes in response to an instruction from the information processing device 100. The execution unit 152 executes various processes based on the information stored in the storage unit 14. The execution unit 152 executes a function (service).
 実行部152は、各種出力を制御する。実行部152は、出力部13による音声出力を制御する。実行部152は、各種表示を制御する。実行部152は、表示部17の表示を制御する。実行部152は、受信部151による受信に応じて、表示部17の表示を制御する。実行部152は、受信部151により受信された情報に基づいて、表示部17の表示を制御する。実行部152は、受付部153により受け付けられた情報に基づいて、表示部17の表示を制御する。実行部152は、受付部153による受付けに応じて、表示部17の表示を制御する。 The execution unit 152 controls various outputs. The execution unit 152 controls the audio output by the output unit 13. The execution unit 152 controls various displays. The execution unit 152 controls the display of the display unit 17. The execution unit 152 controls the display of the display unit 17 in response to the reception by the reception unit 151. The execution unit 152 controls the display of the display unit 17 based on the information received by the reception unit 151. The execution unit 152 controls the display of the display unit 17 based on the information received by the reception unit 153. The execution unit 152 controls the display of the display unit 17 in response to the reception by the reception unit 153.
 受付部153は、各種情報を受け付ける。受付部153は、入力部12を介してユーザによる入力を受け付ける。受付部153は、ユーザによる発話を入力として受け付ける。受付部153は、ユーザによる操作を受け付ける。受付部153は、表示部17により表示された情報に対するユーザの操作を受け付ける。受付部153は、ユーザによる文字入力を受け付ける。 Reception department 153 receives various information. The reception unit 153 receives input by the user via the input unit 12. The reception unit 153 accepts the utterance by the user as an input. The reception unit 153 accepts operations by the user. The reception unit 153 accepts the user's operation on the information displayed by the display unit 17. The reception unit 153 accepts character input by the user.
 送信部154は、外部の情報処理装置へ各種情報を送信する。例えば、送信部154は、情報処理装置100等の他の情報処理装置へ各種情報を送信する。送信部154は、記憶部14に記憶された情報を送信する。 The transmission unit 154 transmits various information to an external information processing device. For example, the transmission unit 154 transmits various information to another information processing device such as the information processing device 100. The transmission unit 154 transmits the information stored in the storage unit 14.
 送信部154は、情報処理装置100等の他の情報処理装置からの情報に基づいて、各種情報を送信する。送信部154は、記憶部14に記憶された情報に基づいて、各種情報を送信する。 The transmission unit 154 transmits various types of information based on information from other information processing devices such as the information processing device 100. The transmission unit 154 transmits various types of information based on the information stored in the storage unit 14.
 送信部154は、センサ部16により検知されたセンサ情報を情報処理装置100へ送信する。送信部154は、センサ部16の音センサにより検知されたユーザの発話情報を情報処理装置100へ送信する。 The transmission unit 154 transmits the sensor information detected by the sensor unit 16 to the information processing device 100. The transmission unit 154 transmits the user's utterance information detected by the sound sensor of the sensor unit 16 to the information processing device 100.
 送信部154は、ユーザにより入力された入力情報を情報処理装置100へ送信する。送信部154は、ユーザにより音声入力された入力情報を情報処理装置100へ送信する。送信部154は、ユーザの操作により入力された入力情報を情報処理装置100へ送信する。送信部154は、対象言語によるユーザの発話情報を情報処理装置100へ送信する。送信部154は、対象言語によるユーザの発話に対応する文字情報を情報処理装置100へ送信する。 The transmission unit 154 transmits the input information input by the user to the information processing device 100. The transmission unit 154 transmits the input information voice-input by the user to the information processing device 100. The transmission unit 154 transmits the input information input by the user's operation to the information processing device 100. The transmission unit 154 transmits the user's utterance information in the target language to the information processing device 100. The transmission unit 154 transmits the character information corresponding to the user's utterance in the target language to the information processing device 100.
 センサ部16は、種々のセンサ情報を検知する。センサ部16は、音を検知する音センサ(スピーカ)を有する。センサ部16は、画像を撮像する撮像部としての機能を有する。センサ部16は、画像センサの機能を有し、画像情報を検知する。センサ部16は、画像を入力として受け付ける画像入力部として機能する。なお、センサ部16は、上記に限らず、種々のセンサを有してもよい。センサ部16は、位置センサ、加速度センサ、ジャイロセンサ、温度センサ、湿度センサ、照度センサ、圧力センサ、近接センサ、ニオイや汗や心拍や脈拍や脳波等の生体情報を受信のためのセンサ等の種々のセンサを有してもよい。また、センサ部16における上記の各種情報を検知するセンサは共通のセンサであってもよいし、各々異なるセンサにより実現されてもよい。 The sensor unit 16 detects various sensor information. The sensor unit 16 has a sound sensor (speaker) that detects sound. The sensor unit 16 has a function as an imaging unit for capturing an image. The sensor unit 16 has an image sensor function and detects image information. The sensor unit 16 functions as an image input unit that receives an image as an input. The sensor unit 16 is not limited to the above, and may have various sensors. The sensor unit 16 is a position sensor, an acceleration sensor, a gyro sensor, a temperature sensor, a humidity sensor, an illuminance sensor, a pressure sensor, a proximity sensor, a sensor for receiving biological information such as odor, sweat, heartbeat, pulse, and brain wave. It may have various sensors. Further, the sensors that detect the above-mentioned various information in the sensor unit 16 may be common sensors, or may be realized by different sensors.
 表示部17は、端末装置10に設けられ各種情報を表示する。表示部17は、例えば液晶ディスプレイや有機EL(Electro-Luminescence)ディスプレイ等によって実現される。表示部17は、情報処理装置100から提供される情報を表示可能であれば、どのような手段により実現されてもよい。表示部17は、実行部152による制御に応じて、各種情報を表示する。 The display unit 17 is provided on the terminal device 10 and displays various information. The display unit 17 is realized by, for example, a liquid crystal display, an organic EL (Electro-Luminescence) display, or the like. The display unit 17 may be realized by any means as long as the information provided by the information processing device 100 can be displayed. The display unit 17 displays various information according to the control by the execution unit 152.
 表示部17は、受信部151により受信された各種情報を表示する。表示部17は、情報処理装置100から受信した応答を表示する。表示部17は、言語変換に関する情報を表示する。 The display unit 17 displays various information received by the reception unit 151. The display unit 17 displays the response received from the information processing device 100. The display unit 17 displays information related to language conversion.
[1-5.応答例]
 ここで、図9を用いて応答例を説明する。図9は、本開示の実施形態に係る応答の一例を示す図である。図9は、言語が認識できる形式の応答例を示す。
[1-5. Response example]
Here, a response example will be described with reference to FIG. FIG. 9 is a diagram showing an example of the response according to the embodiment of the present disclosure. FIG. 9 shows an example of a response in a language-recognizable format.
 情報処理システム1は、入力言語と意味解析処理を行った言語が異なる場合、そのことをユーザに認識させる情報を出力してもよい。例えば、情報処理システム1は、入力言語を翻訳して、翻訳した情報を用いて意味解析処理したことがわかるように、応答の際に、入力時、解析時、出力時の3フェーズにて、どの言語で処理したかがわかる形式で出力する。 The information processing system 1 may output information for the user to recognize when the input language and the language for which the semantic analysis processing is performed are different. For example, the information processing system 1 translates the input language and performs semantic analysis processing using the translated information in three phases of input, analysis, and output at the time of response. Output in a format that shows which language was processed.
 例えば、端末装置10は、表示部17に言語が認識できる形式の応答を表示する。図9の例では、情報処理システム1は、入力言語(対象言語)がミャンマー語であり、意味解析を行った特定言語(翻訳先言語)が日本語であり、出力言語(対象言語)がミャンマー語であることを示す情報を出力する。このように、情報処理システム1は、応答の際に、入力時、解析時、出力時の3フェーズにて、どの言語で処理したかがわかる形式で出力する。 For example, the terminal device 10 displays a response in a language-recognizable format on the display unit 17. In the example of FIG. 9, in the information processing system 1, the input language (target language) is Burmese, the specific language (translation destination language) for which semantic analysis is performed is Japanese, and the output language (target language) is Myanmar. Outputs information indicating that it is a word. In this way, the information processing system 1 outputs a response in a format that indicates in which language the processing was performed in three phases of input, analysis, and output.
[1-6.実施形態に係る情報処理の手順]
 次に、図10~図12を用いて、実施形態に係る各種情報処理の手順について説明する。
[1-6. Information processing procedure according to the embodiment]
Next, various information processing procedures according to the embodiment will be described with reference to FIGS. 10 to 12.
[1-6-1.情報処理装置に係る処理の手順]
 まず、図10を用いて、本開示の実施形態に係る情報処理装置に係る処理の流れについて説明する。図10は、本開示の実施形態に係る情報処理装置の処理を示すフローチャートである。具体的には、図10は、情報処理装置100による情報処理の手順を示すフローチャートである。
[1-6-1. Procedure for processing related to information processing equipment]
First, the flow of processing related to the information processing apparatus according to the embodiment of the present disclosure will be described with reference to FIG. FIG. 10 is a flowchart showing processing of the information processing apparatus according to the embodiment of the present disclosure. Specifically, FIG. 10 is a flowchart showing a procedure of information processing by the information processing apparatus 100.
 図10に示すように、情報処理装置100は、ユーザの発話に対応する対象言語を含む言語の各々に対応する文字情報に対して意味解析処理を実行する(ステップS101)。そして、情報処理装置100は、意味解析処理の結果に基づいて、文字情報の各々に対して、精度指標値を算出する(ステップS102)。例えば、情報処理装置100は、文字情報の各々に対して、意味解析処理の精度を複数言語間で比較可能にする精度指標値(意味解析精度)を算出する。 As shown in FIG. 10, the information processing apparatus 100 executes a semantic analysis process on the character information corresponding to each of the languages including the target language corresponding to the user's utterance (step S101). Then, the information processing apparatus 100 calculates an accuracy index value for each of the character information based on the result of the semantic analysis process (step S102). For example, the information processing apparatus 100 calculates an accuracy index value (semantic analysis accuracy) that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the character information.
[1-6-2.情報処理システムに係る処理の手順]
 次に、図11を用いて、情報処理システムに係る具体的な処理の一例について説明する。図11は、本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。なお、以下では、情報処理システム1が処理を行う場合を一例として説明するが、図11に示す処理は、情報処理システム1に含まれる情報処理装置100及び端末装置10のいずれの装置が行ってもよい。
[1-6-2. Procedure for processing related to information processing system]
Next, an example of specific processing related to the information processing system will be described with reference to FIG. FIG. 11 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure. In the following, a case where the information processing system 1 performs processing will be described as an example, but the processing shown in FIG. 11 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good.
 図11に示すように、情報処理システム1は、ユーザによる発話の音声情報を取得する(ステップS201)。例えば、情報処理システム1は、入力言語(対象言語)でのユーザによる発話の音声情報を取得する。 As shown in FIG. 11, the information processing system 1 acquires the voice information of the utterance by the user (step S201). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
 そして、情報処理システム1は、音声認識の処理を行う(ステップS202)。情報処理システム1は、ユーザによる発話の音声情報に対する音声認識の処理を行う。例えば、情報処理システム1は、音声認識により入力言語(対象言語)でのユーザによる発話の発話テキスト(文字情報)を発話情報として取得する。例えば、情報処理システム1は、言語識別(音声認識)が可能な言語かを判定し、可能な場合、音声認識の処理を行う。なお、情報処理システム1は、言語識別(音声認識)が可能な言語ではない場合、処理を終了してもよい。この場合、情報処理システム1は、対応可能な言語ではないことをユーザに通知してもよい。例えば、情報処理システム1は、発話情報を入力言語文字情報として用いる。なお、情報処理システム1は、発話情報の正規化が必要な場合、発話情報を入力言語文字情報に正規化してもよい。 Then, the information processing system 1 performs voice recognition processing (step S202). The information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the utterance text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition. For example, the information processing system 1 determines whether the language is capable of language identification (speech recognition), and if possible, performs voice recognition processing. If the information processing system 1 is not a language capable of language identification (speech recognition), the processing may be terminated. In this case, the information processing system 1 may notify the user that the language is not compatible. For example, the information processing system 1 uses the utterance information as input language character information. When it is necessary to normalize the utterance information, the information processing system 1 may normalize the utterance information to the input language character information.
 そして、情報処理システム1は、発話文展開を行う(ステップS203)。情報処理システム1は、ユーザの発話に対応する入力言語文字情報を言い換えたパラフレーズを生成する。情報処理システム1は、ユーザの発話に対応する入力言語文字情報とそのパラフレーズを含む疑似発話テキストリストを生成する。例えば、情報処理システム1は、図6に示すようにユーザの発話に対応する発話展開情報Aを言い換えた発話展開情報B、発話展開情報C、及び発話展開情報Dの3つ(複数)のパラフレーズを生成する。これにより、情報処理システム1は、ユーザの発話に対応する入力言語文字情報とそのパラフレーズを含む疑似発話テキストリストを生成する。 Then, the information processing system 1 develops the utterance sentence (step S203). The information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased. The information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase. For example, as shown in FIG. 6, the information processing system 1 has three (plurality) parameters of utterance development information B, utterance development information C, and utterance development information D, which are paraphrases of utterance development information A corresponding to the user's utterance. Generate a phrase. As a result, the information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
 そして、情報処理システム1は、言語展開を行う(ステップS204)。情報処理システム1は、入力言語文字情報やそのパラフレーズを、他の言語に翻訳(変換)する。情報処理システム1は、対象言語の入力言語文字情報を他の言語に翻訳する。情報処理システム1は、入力言語文字情報を翻訳先言語の文字情報(翻訳先文字情報)に変換する。例えば、情報処理システム1は、図6に示すように日本語の入力言語文字情報である発話展開情報Aを、英語、スペイン語、フランス語等に翻訳する。また、情報処理システム1は、対象言語の各パラフレーズを他の言語に翻訳する。情報処理システム1は、各パラフレーズを翻訳先言語の文字情報(翻訳先文字情報)に変換する。例えば、情報処理システム1は、図6に示すように日本語のパラフレーズである発話展開情報Bを、英語、スペイン語、フランス語等に翻訳する。同様に、情報処理システム1は、図6に示すように日本語のパラフレーズである発話展開情報Cや発話展開情報Dを、英語、スペイン語、フランス語等に翻訳する。これにより、情報処理システム1は、入力言語文字情報、そのパラフレーズ、入力言語文字情報を各言語に翻訳した翻訳文、及びパラフレーズを翻訳した翻訳文(翻訳パラフレーズ)を含む翻訳文リストを生成する。 Then, the information processing system 1 develops the language (step S204). The information processing system 1 translates (converts) the input language character information and its paraphrase into another language. The information processing system 1 translates the input language character information of the target language into another language. The information processing system 1 converts the input language character information into the character information of the translation destination language (translation destination character information). For example, as shown in FIG. 6, the information processing system 1 translates the utterance development information A, which is Japanese input language character information, into English, Spanish, French, and the like. Further, the information processing system 1 translates each paraphrase of the target language into another language. The information processing system 1 converts each paraphrase into character information (translation destination character information) of the translation destination language. For example, the information processing system 1 translates the utterance development information B, which is a Japanese paraphrase, into English, Spanish, French, and the like as shown in FIG. Similarly, as shown in FIG. 6, the information processing system 1 translates the utterance development information C and the utterance development information D, which are Japanese paraphrases, into English, Spanish, French, and the like. As a result, the information processing system 1 provides a list of translated sentences including the input language character information, its paraphrase, the translated sentence obtained by translating the input language character information into each language, and the translated sentence (translated paraphrase) obtained by translating the paraphrase. Generate.
 そして、情報処理システム1は、翻訳文リストのループ処理を行う(ステップS205)。情報処理システム1は、翻訳文リスト中の各文(翻訳文等)を1つずつ選択して、選択した文(「選択文」ともいう)に対して処理を行う。 Then, the information processing system 1 performs loop processing of the translated sentence list (step S205). The information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (also referred to as "selected sentence").
 情報処理システム1は、選択文に対して発話意味解析処理を行う(ステップS206)。情報処理システム1は、選択文に対する意味解析処理の結果を生成する。例えば、情報処理システム1は、発話意味解析処理により、選択文のドメインゴール等の情報を推定し、その推定した情報の確信度を示す意味解析スコア等の情報を含む意味フレームの情報を生成する。 The information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S206). The information processing system 1 generates the result of the semantic analysis process for the selected sentence. For example, the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
 そして、情報処理システム1は、発話意味解析処理の結果とスコア関数とに基づいて、解析精度変換を行う(ステップS207)。例えば、情報処理システム1は、式(1)を用いて、解析精度変換を行う。情報処理システム1は、意味解析スコアと選択文の言語を示す情報とを、式(1)に入力することにより、選択文の意味解析精度を示す精度指標値(意味解析精度)を算出する。また、情報処理システム1は、翻訳文リスト中の文のうち、精度指標値(意味解析精度)を算出が完了した文にフラグを付すなどして、処理済み文として識別可能にする。なお、情報処理システム1は、精度指標値(意味解析精度)を算出が完了した文を、翻訳文リストから除いてもよい。 Then, the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S207). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1). The information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed. The information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
 情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文を選択文として、ステップS206~S207の処理を行う。これにより、情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文の精度指標値(意味解析精度)を算出する。 The information processing system 1 performs the processes of steps S206 to S207 with each sentence in the translated sentence list as a selected sentence by the loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
 情報処理システム1は、翻訳文リスト中に未処理の文がある場合(ステップS208:No)、翻訳文リスト中の未処理の文を1つ選択して、ステップS206~S207の処理を行う。 When there is an unprocessed sentence in the translated sentence list (step S208: No), the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S206 to S207.
 一方、情報処理システム1は、翻訳文リスト中に未処理の文が無い場合(ステップS208:Yes)、翻訳文リストのループ処理を終了する。 On the other hand, the information processing system 1 ends the loop processing of the translated sentence list when there is no unprocessed sentence in the translated sentence list (step S208: Yes).
 そして、情報処理システム1は、意味フレームを選択する(ステップS209)。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文の意味解析結果を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文に対応する意味フレームを、以降の処理に用いる文に選択する。 Then, the information processing system 1 selects a semantic frame (step S209). The information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing. The information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing. The information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
 そして、情報処理システム1は、スロット逆変換を行う(ステップS210)。情報処理システム1は、精度指標値(意味解析精度)が最大である文の意味フレーム(精度最大の意味フレーム)を用いて、スロット逆変換を行う。情報処理システム1は、精度最大の意味フレーム中のスロット値を入力言語(対象言語)のスロット値に変換する。情報処理システム1は、特定言語(翻訳先言語)のスロット値を入力言語(対象言語)のスロット値に変換する。なお、情報処理システム1は、精度指標値(意味解析精度)が最大の文の言語が対象言語である場合など、逆変換が不要の場合はステップS210を行わなくてもよい。 Then, the information processing system 1 performs slot inverse transformation (step S210). The information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy). The information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language). The information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language). The information processing system 1 does not need to perform step S210 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
 そして、情報処理システム1は、応答生成を行う(ステップS211)。情報処理システム1は、画像や音やテキストなど、出力態様に応じた情報の生成を行う。そして、情報処理システム1は、生成した情報を出力する(ステップS212)。情報処理システム1は、画像やテキストを表示したり、音を出力したりする。 Then, the information processing system 1 generates a response (step S211). The information processing system 1 generates information such as images, sounds, and texts according to the output mode. Then, the information processing system 1 outputs the generated information (step S212). The information processing system 1 displays images and texts and outputs sounds.
[1-6-3.情報処理システムに係る処理の他の手順その1]
 次に、図12を用いて、情報処理システムに係る具体的な処理の他の一例について説明する。図12は、本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。なお、以下では、情報処理システム1が処理を行う場合を一例として説明するが、図12に示す処理は、情報処理システム1に含まれる情報処理装置100及び端末装置10のいずれの装置が行ってもよい。図11と同様の点については適宜説明を省略する。
[1-6-3. Other Procedures for Processing Related to Information Processing Systems Part 1]
Next, another example of specific processing related to the information processing system will be described with reference to FIG. FIG. 12 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure. In the following, a case where the information processing system 1 performs processing will be described as an example, but the processing shown in FIG. 12 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good. The same points as in FIG. 11 will be omitted as appropriate.
 図12に示すように、情報処理システム1は、ユーザによる発話の音声情報を取得する(ステップS301)。例えば、情報処理システム1は、入力言語(対象言語)でのユーザによる発話の音声情報を取得する。 As shown in FIG. 12, the information processing system 1 acquires the voice information of the utterance by the user (step S301). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
 そして、情報処理システム1は、音声認識の処理を行う(ステップS302)。情報処理システム1は、ユーザによる発話の音声情報に対する音声認識の処理を行う。例えば、情報処理システム1は、音声認識により入力言語(対象言語)でのユーザによる発話の発話テキスト(文字情報)を発話情報として取得する。例えば、情報処理システム1は、言語識別(音声認識)が可能な言語かを判定し、可能な場合、音声認識の処理を行う。なお、情報処理システム1は、言語識別(音声認識)が可能な言語ではない場合、処理を終了してもよい。この場合、情報処理システム1は、対応可能な言語ではないことをユーザに通知してもよい。例えば、情報処理システム1は、発話情報を入力言語文字情報として用いる。なお、情報処理システム1は、発話情報の正規化が必要な場合、発話情報を入力言語文字情報に正規化してもよい。 Then, the information processing system 1 performs voice recognition processing (step S302). The information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the utterance text (character information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition. For example, the information processing system 1 determines whether the language is capable of language identification (speech recognition), and if possible, performs voice recognition processing. If the information processing system 1 is not a language capable of language identification (speech recognition), the processing may be terminated. In this case, the information processing system 1 may notify the user that the language is not compatible. For example, the information processing system 1 uses the utterance information as input language character information. When it is necessary to normalize the utterance information, the information processing system 1 may normalize the utterance information to the input language character information.
 情報処理システム1は、発話情報を入力言語文字情報に対して発話意味解析処理を行う(ステップS303)。情報処理システム1は、入力言語文字情報に対する意味解析処理の結果を生成する。例えば、情報処理システム1は、発話意味解析処理により、入力言語文字情報のドメインゴール等の情報を推定し、その推定した情報の確信度を示す意味解析スコア等の情報を含む意味フレームの情報を生成する。 The information processing system 1 performs an utterance semantic analysis process on the input language character information for the utterance information (step S303). The information processing system 1 generates the result of the semantic analysis process for the input language character information. For example, the information processing system 1 estimates information such as a domain goal of input language character information by utterance meaning analysis processing, and provides information on a meaning frame including information such as a meaning analysis score indicating the certainty of the estimated information. Generate.
 そして、情報処理システム1は、入力言語文字情報を用いた意味解析処理の精度が低い場合(ステップS304:Yes)、発話文展開を行う(ステップS305)。例えば、情報処理システム1は、入力言語文字情報を用いた意味解析処理の意味解析スコアと所定の閾値との比較に基づいて、意味解析処理の精度が低いかどうかを判定する。情報処理システム1は、入力言語文字情報を用いた意味解析処理の意味解析スコアが所定の閾値以下である場合、意味解析処理の精度が低いと判定する。なお、情報処理システム1は、意味解析スコアを変換した精度指標値(意味解析精度)を用いて、意味解析処理の精度が低いかどうかを判定してもよい。この場合、情報処理システム1は、式(1)により算出された入力言語文字情報の精度指標値(意味解析精度)が所定の閾値以下である場合、意味解析処理の精度が低いと判定する。 Then, when the accuracy of the semantic analysis process using the input language character information is low (step S304: Yes), the information processing system 1 develops the utterance sentence (step S305). For example, the information processing system 1 determines whether or not the accuracy of the semantic analysis process is low based on the comparison between the semantic analysis score of the semantic analysis process using the input language character information and a predetermined threshold value. When the semantic analysis score of the semantic analysis process using the input language character information is equal to or less than a predetermined threshold value, the information processing system 1 determines that the accuracy of the semantic analysis process is low. The information processing system 1 may determine whether or not the accuracy of the semantic analysis process is low by using the accuracy index value (semantic analysis accuracy) obtained by converting the semantic analysis score. In this case, the information processing system 1 determines that the accuracy of the semantic analysis process is low when the accuracy index value (semantic analysis accuracy) of the input language character information calculated by the equation (1) is equal to or less than a predetermined threshold value.
 情報処理システム1は、ステップS305において、ユーザの発話に対応する入力言語文字情報を言い換えたパラフレーズを生成する。この点は、図11中の発話文展開と同様の為説明を省略する。情報処理システム1は、ユーザの発話に対応する入力言語文字情報とそのパラフレーズを含む疑似発話テキストリストを生成する。 In step S305, the information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased. Since this point is the same as the utterance sentence development in FIG. 11, the description thereof will be omitted. The information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
 そして、情報処理システム1は、言語展開を行う(ステップS306)。情報処理システム1は、入力言語文字情報やそのパラフレーズを、他の言語に翻訳(変換)する。図12の例では、情報処理システム1は、特定言語の情報を用いて、入力言語文字情報やそのパラフレーズを、他の言語に翻訳(変換)する。情報処理システム1は、特定言語の情報を用いて、入力言語文字情報やそのパラフレーズを、意味解析処理可能な他の言語に翻訳(変換)する。この点は、図11中の言語展開と同様の為説明を省略する。情報処理システム1は、入力言語文字情報、そのパラフレーズ、入力言語文字情報を各言語に翻訳した翻訳文、及びパラフレーズを翻訳した翻訳文(翻訳パラフレーズ)を含む翻訳文リストを生成する。 Then, the information processing system 1 develops the language (step S306). The information processing system 1 translates (converts) the input language character information and its paraphrase into another language. In the example of FIG. 12, the information processing system 1 translates (converts) the input language character information and its paraphrase into another language by using the information of the specific language. The information processing system 1 uses information in a specific language to translate (convert) input language character information and its paraphrases into another language capable of semantic analysis processing. Since this point is the same as the language development in FIG. 11, the description thereof will be omitted. The information processing system 1 generates a translation sentence list including input language character information, its paraphrase, a translation sentence obtained by translating the input language character information into each language, and a translation sentence (translation paraphrase) obtained by translating the paraphrase.
 そして、情報処理システム1は、翻訳文リストのループ処理を行う(ステップS307)。情報処理システム1は、翻訳文リスト中の各文(翻訳文等)を1つずつ選択して、選択した文(選択文)に対して処理を行う。 Then, the information processing system 1 performs loop processing of the translated sentence list (step S307). The information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (selected sentence).
 情報処理システム1は、選択文に対して発話意味解析処理を行う(ステップS308)。情報処理システム1は、選択文に対する意味解析処理の結果を生成する。例えば、情報処理システム1は、発話意味解析処理により、選択文のドメインゴール等の情報を推定し、その推定した情報の確信度を示す意味解析スコア等の情報を含む意味フレームの情報を生成する。 The information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S308). The information processing system 1 generates the result of the semantic analysis process for the selected sentence. For example, the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
 そして、情報処理システム1は、発話意味解析処理の結果とスコア関数とに基づいて、解析精度変換を行う(ステップS309)。例えば、情報処理システム1は、式(1)を用いて、解析精度変換を行う。情報処理システム1は、意味解析スコアと選択文の言語を示す情報とを、式(1)に入力することにより、選択文の意味解析精度を示す精度指標値(意味解析精度)を算出する。また、情報処理システム1は、翻訳文リスト中の文のうち、精度指標値(意味解析精度)を算出が完了した文にフラグを付すなどして、処理済み文として識別可能にする。なお、情報処理システム1は、精度指標値(意味解析精度)を算出が完了した文を、翻訳文リストから除いてもよい。 Then, the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S309). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1). The information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed. The information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
 情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文を選択文として、ステップS308~S309の処理を行う。これにより、情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文の精度指標値(意味解析精度)を算出する。 The information processing system 1 performs the processes of steps S308 to S309 with each sentence in the translated sentence list as a selected sentence by the loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
 情報処理システム1は、翻訳文リスト中に未処理の文がある場合(ステップS310:No)、翻訳文リスト中の未処理の文を1つ選択して、ステップS308~S309の処理を行う。 When there is an unprocessed sentence in the translated sentence list (step S310: No), the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S308 to S309.
 一方、情報処理システム1は、翻訳文リスト中に未処理の文が無い場合(ステップS310:Yes)、翻訳文リストのループ処理を終了する。 On the other hand, when there is no unprocessed sentence in the translated sentence list (step S310: Yes), the information processing system 1 ends the loop processing of the translated sentence list.
 そして、情報処理システム1は、意味フレームを選択する(ステップS311)。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文の意味解析結果を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文に対応する意味フレームを、以降の処理に用いる文に選択する。 Then, the information processing system 1 selects a semantic frame (step S311). The information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing. The information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing. The information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
 そして、情報処理システム1は、スロット逆変換を行う(ステップS312)。情報処理システム1は、精度指標値(意味解析精度)が最大である文の意味フレーム(精度最大の意味フレーム)を用いて、スロット逆変換を行う。情報処理システム1は、精度最大の意味フレーム中のスロット値を入力言語(対象言語)のスロット値に変換する。情報処理システム1は、特定言語(翻訳先言語)のスロット値を入力言語(対象言語)のスロット値に変換する。なお、情報処理システム1は、精度指標値(意味解析精度)が最大の文の言語が対象言語である場合など、逆変換が不要の場合はステップS312を行わなくてもよい。 Then, the information processing system 1 performs slot inverse transformation (step S312). The information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy). The information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language). The information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language). Note that the information processing system 1 does not have to perform step S312 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
 そして、情報処理システム1は、応答生成を行う(ステップS313)。情報処理システム1は、画像や音やテキストなど、出力態様に応じた情報の生成を行う。例えば、情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文の意味解析処理の結果を用いて、応答生成を行う。 Then, the information processing system 1 generates a response (step S313). The information processing system 1 generates information such as images, sounds, and texts according to the output mode. For example, the information processing system 1 generates a response using the result of the semantic analysis process of the sentence having the maximum accuracy index value (semantic analysis accuracy) of each sentence.
 なお、情報処理システム1は、入力言語文字情報を用いた意味解析処理の精度が低くない場合(ステップS304:No)、入力言語文字情報の意味解析処理の結果を用いて、応答生成を行う。例えば、情報処理システム1は、入力言語文字情報を用いた意味解析処理の精度が高い場合、ステップS305~S312の処理を行うことなく、入力言語文字情報の意味解析処理の結果を用いて、応答生成を行う。 If the accuracy of the semantic analysis process using the input language character information is not low (step S304: No), the information processing system 1 generates a response using the result of the semantic analysis process of the input language character information. For example, when the accuracy of the semantic analysis process using the input language character information is high, the information processing system 1 responds by using the result of the semantic analysis process of the input language character information without performing the processes of steps S305 to S312. Generate.
 そして、情報処理システム1は、生成した情報を出力する(ステップS314)。情報処理システム1は、画像やテキストを表示したり、音を出力したりする。 Then, the information processing system 1 outputs the generated information (step S314). The information processing system 1 displays images and texts and outputs sounds.
[1-6-4.情報処理システムに係る処理の他の手順その2]
 次に、図13を用いて、情報処理システムに係る具体的な処理の他の一例について説明する。図13は、本開示の実施形態に係る情報処理システムの処理を示すフローチャートである。なお、以下では、情報処理システム1が処理を行う場合を一例として説明するが、図13に示す処理は、情報処理システム1に含まれる情報処理装置100及び端末装置10のいずれの装置が行ってもよい。図11や図12と同様の点については適宜説明を省略する。
[1-6-4. Other Procedures for Processing Related to Information Processing Systems Part 2]
Next, another example of specific processing related to the information processing system will be described with reference to FIG. FIG. 13 is a flowchart showing processing of the information processing system according to the embodiment of the present disclosure. In the following, a case where the information processing system 1 performs processing will be described as an example, but the processing shown in FIG. 13 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good. The same points as those in FIGS. 11 and 12 will be omitted as appropriate.
 図13に示すように、情報処理システム1は、ユーザによる発話の音声情報を取得する(ステップS401)。例えば、情報処理システム1は、入力言語(対象言語)でのユーザによる発話の音声情報を取得する。 As shown in FIG. 13, the information processing system 1 acquires the voice information of the utterance by the user (step S401). For example, the information processing system 1 acquires voice information of a user's utterance in an input language (target language).
 そして、情報処理システム1は、音声認識の処理を行う(ステップS402)。情報処理システム1は、ユーザによる発話の音声情報に対する音声認識の処理を行う。例えば、情報処理システム1は、音声認識により入力言語(対象言語)でのユーザによる発話のテキスト(発話情報)を発話情報として取得する。 Then, the information processing system 1 performs voice recognition processing (step S402). The information processing system 1 performs voice recognition processing for voice information spoken by the user. For example, the information processing system 1 acquires the text (utterance information) of the utterance by the user in the input language (target language) as the utterance information by voice recognition.
 そして、情報処理システム1は、対象外言語かを判定する(ステップS403)。例えば、情報処理システム1は、言語識別(音声認識)できない言語を対象外言語であると判定する。 Then, the information processing system 1 determines whether the language is not the target language (step S403). For example, the information processing system 1 determines that a language that cannot be language-identified (speech recognition) is a non-target language.
 情報処理システム1は、対象外言語ではないと判定した場合(ステップS403:Yes)、発話文展開を行う(ステップS404)。情報処理システム1は、ユーザの発話に対応する入力言語文字情報を言い換えたパラフレーズを生成する。この点は、図11中の発話文展開と同様の為説明を省略する。情報処理システム1は、ユーザの発話に対応する入力言語文字情報とそのパラフレーズを含む疑似発話テキストリストを生成する。 When the information processing system 1 determines that the language is not a non-target language (step S403: Yes), the information processing system 1 expands the utterance sentence (step S404). The information processing system 1 generates a paraphrase in which the input language character information corresponding to the user's utterance is paraphrased. Since this point is the same as the utterance sentence development in FIG. 11, the description thereof will be omitted. The information processing system 1 generates a pseudo-utterance text list including input language character information corresponding to the user's utterance and its paraphrase.
 そして、情報処理システム1は、言語展開を行う(ステップS405)。情報処理システム1は、入力言語文字情報やそのパラフレーズを、他の言語に翻訳(変換)する。図12の例では、情報処理システム1は、特定言語の情報を用いて、入力言語文字情報やそのパラフレーズを、他の言語に翻訳(変換)する。情報処理システム1は、特定言語の情報を用いて、入力言語文字情報やそのパラフレーズを、意味解析処理可能な他の言語に翻訳(変換)する。この点は、図11中の言語展開と同様の為説明を省略する。情報処理システム1は、入力言語文字情報、そのパラフレーズ、入力言語文字情報を各言語に翻訳した翻訳文、及びパラフレーズを翻訳した翻訳文(翻訳パラフレーズ)を含む翻訳文リストを生成する。 Then, the information processing system 1 develops the language (step S405). The information processing system 1 translates (converts) the input language character information and its paraphrase into another language. In the example of FIG. 12, the information processing system 1 translates (converts) the input language character information and its paraphrase into another language by using the information of the specific language. The information processing system 1 uses information in a specific language to translate (convert) input language character information and its paraphrases into another language capable of semantic analysis processing. Since this point is the same as the language development in FIG. 11, the description thereof will be omitted. The information processing system 1 generates a translation sentence list including input language character information, its paraphrase, a translation sentence obtained by translating the input language character information into each language, and a translation sentence (translation paraphrase) obtained by translating the paraphrase.
 そして、情報処理システム1は、翻訳文リストのループ処理を行う(ステップS406)。情報処理システム1は、翻訳文リスト中の各文(翻訳文等)を1つずつ選択して、選択した文(選択文)に対して処理を行う。 Then, the information processing system 1 performs loop processing of the translated sentence list (step S406). The information processing system 1 selects each sentence (translated sentence, etc.) in the translated sentence list one by one, and processes the selected sentence (selected sentence).
 情報処理システム1は、選択文に対して品質推定を行う(ステップS407)。例えば、情報処理システム1は、選択文が翻訳文である場合、選択文(翻訳テキスト)の品質を推定する。例えば、情報処理システム1は、適宜の方法により翻訳文(翻訳テキスト)の品質推定翻訳精度(品質スコア)を算出し、その品質推定翻訳精度(品質スコア)と閾値(例えば0.75等)と比較する。そして、情報処理システム1は、翻訳文の品質推定翻訳精度(品質スコア)が閾値以上であれば、その選択文の品質が高い(ハイスコアである)と判定し、選択文の品質推定翻訳精度(品質スコア)が閾値未満であれば、その選択文の品質が低い(ロースコアである)と判定する。 The information processing system 1 estimates the quality of the selected sentence (step S407). For example, the information processing system 1 estimates the quality of the selected sentence (translated text) when the selected sentence is a translated sentence. For example, the information processing system 1 calculates the quality estimated translation accuracy (quality score) of the translated text (translated text) by an appropriate method, and sets the quality estimated translation accuracy (quality score) and the threshold value (for example, 0.75, etc.). compare. Then, if the quality estimated translation accuracy (quality score) of the translated sentence is equal to or higher than the threshold value, the information processing system 1 determines that the quality of the selected sentence is high (high score), and determines that the quality estimated translation accuracy of the selected sentence is high. If (quality score) is less than the threshold value, it is determined that the quality of the selected sentence is low (low score).
 情報処理システム1は、品質が低いと推定(判定)した場合(ステップS407:LOW)、その選択文について人手での編集が所定の時間内に完了するか判定する(ステップS408)。 When the information processing system 1 estimates (determines) that the quality is low (step S407: LOW), it determines whether manual editing of the selected sentence is completed within a predetermined time (step S408).
 情報処理システム1は、人手での編集が所定の時間内に完了しないと判定した場合(ステップS408:LONG TIME)、その選択文については処理を棄却する(ステップS415)。例えば、情報処理システム1は、その選択文について人手での編集が所定の時間内に完了しないと判定した場合、その選択文については処理を中断する。そして、情報処理システム1は、その選択文については中断理由を用いて、ステップS415の応答生成を行う。なお、情報処理システム1は、ある選択文について、処理を棄却した場合であっても、残りの文(未処理の文)に対して、ステップS406の翻訳文リストのループ処理を継続する。 When the information processing system 1 determines that the manual editing is not completed within a predetermined time (step S408: LONG TIME), the information processing system 1 rejects the processing of the selected sentence (step S415). For example, when the information processing system 1 determines that the manual editing of the selected sentence is not completed within a predetermined time, the processing of the selected sentence is interrupted. Then, the information processing system 1 generates a response in step S415 by using the reason for interruption for the selected sentence. The information processing system 1 continues the loop processing of the translated sentence list in step S406 for the remaining sentences (unprocessed sentences) even when the processing of a certain selected sentence is rejected.
 情報処理システム1は、その選択文について人手での編集が所定の時間内に完了すると判定した場合(ステップS408:SHORT TIME)、その選択文については人手により編集された文字情報を用いてステップS409~S410の処理を行う。情報処理システム1は、その選択文について人手での編集が所定の時間内に完了すると判定した場合、人手で編集した文(編集文)に選択文を置き換える。 When the information processing system 1 determines that the manual editing of the selected sentence is completed within a predetermined time (step S408: SHORT TIME), the selected sentence is used in the manually edited character information in step S409. ~ S410 is processed. When the information processing system 1 determines that the manual editing of the selected sentence is completed within a predetermined time, the information processing system 1 replaces the selected sentence with the manually edited sentence (edited sentence).
 また、情報処理システム1は、品質が高いと推定(判定)した場合(ステップS407:HIGH)、選択文を用いてステップS409~S410の処理を行う。 Further, when the information processing system 1 estimates (determines) that the quality is high (step S407: HIGH), the information processing system 1 performs the processes of steps S409 to S410 using the selection statement.
 情報処理システム1は、選択文に対して発話意味解析処理を行う(ステップS409)。情報処理システム1は、選択文に対する意味解析処理の結果を生成する。例えば、情報処理システム1は、発話意味解析処理により、選択文のドメインゴール等の情報を推定し、その推定した情報の確信度を示す意味解析スコア等の情報を含む意味フレームの情報を生成する。 The information processing system 1 performs utterance semantic analysis processing on the selected sentence (step S409). The information processing system 1 generates the result of the semantic analysis process for the selected sentence. For example, the information processing system 1 estimates information such as a domain goal of a selected sentence by a speech semantic analysis process, and generates information of a semantic frame including information such as a semantic analysis score indicating the certainty of the estimated information. ..
 そして、情報処理システム1は、発話意味解析処理の結果とスコア関数とに基づいて、解析精度変換を行う(ステップS410)。例えば、情報処理システム1は、式(1)を用いて、解析精度変換を行う。情報処理システム1は、意味解析スコアと選択文の言語を示す情報とを、式(1)に入力することにより、選択文の意味解析精度を示す精度指標値(意味解析精度)を算出する。また、情報処理システム1は、翻訳文リスト中の文のうち、精度指標値(意味解析精度)を算出が完了した文にフラグを付すなどして、処理済み文として識別可能にする。なお、情報処理システム1は、精度指標値(意味解析精度)を算出が完了した文を、翻訳文リストから除いてもよい。 Then, the information processing system 1 performs analysis accuracy conversion based on the result of the utterance semantic analysis process and the score function (step S410). For example, the information processing system 1 performs analysis accuracy conversion using the equation (1). The information processing system 1 calculates the accuracy index value (semantic analysis accuracy) indicating the semantic analysis accuracy of the selected sentence by inputting the semantic analysis score and the information indicating the language of the selected sentence into the equation (1). Further, the information processing system 1 makes it possible to identify the sentence in the translated sentence list as a processed sentence by adding a flag to the sentence for which the calculation of the accuracy index value (semantic analysis accuracy) has been completed. The information processing system 1 may exclude sentences for which the accuracy index value (semantic analysis accuracy) has been calculated from the translated sentence list.
 情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文を選択文として、ステップS409~S410の処理を行う。これにより、情報処理システム1は、翻訳文リストのループ処理により、翻訳文リスト中の各文の精度指標値(意味解析精度)を算出する。 The information processing system 1 performs the processes of steps S409 to S410 by selecting each sentence in the translated sentence list by loop processing of the translated sentence list. As a result, the information processing system 1 calculates the accuracy index value (semantic analysis accuracy) of each sentence in the translated sentence list by the loop processing of the translated sentence list.
 情報処理システム1は、翻訳文リスト中に未処理の文がある場合(ステップS411:No)、翻訳文リスト中の未処理の文を1つ選択して、ステップS409~S410の処理を行う。 When there is an unprocessed sentence in the translated sentence list (step S411: No), the information processing system 1 selects one unprocessed sentence in the translated sentence list and performs the processes of steps S409 to S410.
 一方、情報処理システム1は、翻訳文リスト中に未処理の文が無い場合(ステップS411:Yes)、翻訳文リストのループ処理を終了する。 On the other hand, the information processing system 1 ends the loop processing of the translated sentence list when there is no unprocessed sentence in the translated sentence list (step S411: Yes).
 そして、情報処理システム1は、意味フレームを選択する(ステップS412)。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文の意味解析結果を、以降の処理に用いる文に選択する。情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文に対応する意味フレームを、以降の処理に用いる文に選択する。 Then, the information processing system 1 selects a semantic frame (step S412). The information processing system 1 selects the sentence having the maximum accuracy index value (semantic analysis accuracy) from each sentence as the sentence to be used in the subsequent processing. The information processing system 1 selects the semantic analysis result of the sentence having the maximum accuracy index value (semantic analysis accuracy) among the sentences as the sentence to be used in the subsequent processing. The information processing system 1 selects, among the sentences, the semantic frame corresponding to the sentence having the maximum accuracy index value (semantic analysis accuracy) as the sentence to be used in the subsequent processing.
 そして、情報処理システム1は、スロット逆変換を行う(ステップS413)。情報処理システム1は、精度指標値(意味解析精度)が最大である文の意味フレーム(精度最大の意味フレーム)を用いて、スロット逆変換を行う。情報処理システム1は、精度最大の意味フレーム中のスロット値を入力言語(対象言語)のスロット値に変換する。情報処理システム1は、特定言語(翻訳先言語)のスロット値を入力言語(対象言語)のスロット値に変換する。なお、情報処理システム1は、精度指標値(意味解析精度)が最大の文の言語が対象言語である場合など、逆変換が不要の場合はステップS413を行わなくてもよい。 Then, the information processing system 1 performs slot inverse transformation (step S413). The information processing system 1 performs slot inverse conversion using a sentence meaning frame (semantic frame with maximum accuracy) having the maximum accuracy index value (semantic analysis accuracy). The information processing system 1 converts the slot value in the meaning frame having the maximum accuracy into the slot value of the input language (target language). The information processing system 1 converts the slot value of the specific language (translation destination language) into the slot value of the input language (target language). Note that the information processing system 1 does not have to perform step S413 when the inverse transformation is unnecessary, such as when the language of the sentence having the maximum accuracy index value (semantic analysis accuracy) is the target language.
 そして、情報処理システム1は、応答生成を行う(ステップS414)。情報処理システム1は、画像や音やテキストなど、出力態様に応じた情報の生成を行う。例えば、情報処理システム1は、各文のうち、精度指標値(意味解析精度)が最大の文の意味解析処理の結果を用いて、応答生成を行う。 Then, the information processing system 1 generates a response (step S414). The information processing system 1 generates information such as images, sounds, and texts according to the output mode. For example, the information processing system 1 generates a response using the result of the semantic analysis process of the sentence having the maximum accuracy index value (semantic analysis accuracy) of each sentence.
 また、情報処理システム1は、対象外言語であると判定した場合(ステップS403:No)、処理を棄却する(ステップS415)。例えば、情報処理システム1は、対象外言語であると判定した場合、処理を中断する。そして、情報処理システム1は、中断理由を用いて、ステップS415の応答生成を行う。例えば、情報処理システム1は、中断理由を用いて「対象外の言語のため処理を中断します」といった応答生成を行う。 Further, when the information processing system 1 determines that the language is not the target language (step S403: No), the information processing system 1 rejects the process (step S415). For example, when the information processing system 1 determines that the language is not the target language, the information processing system 1 interrupts the process. Then, the information processing system 1 generates a response in step S415 using the reason for interruption. For example, the information processing system 1 uses the reason for interruption to generate a response such as "processing is interrupted because of a language other than the target".
 そして、情報処理システム1は、生成した情報を出力する(ステップS416)。情報処理システム1は、画像やテキストを表示したり、音を出力したりする。 Then, the information processing system 1 outputs the generated information (step S416). The information processing system 1 displays images and texts and outputs sounds.
[1-7.情報処理システムによる処理の概念図]
 ここで、図14を用いて、情報処理システム1における各機能やハードウェア構成や処理を概念的に示す。図14は、情報処理システムによる処理の一例を示す概念図である。図14に示すシステム処理PS1は、情報処理システム1により実現される処理の一例を示す。例えば、図14は、情報処理システム1の構成概略図を示す。図14中の発話文展開、精度変換、スコア関数、意味フレーム言語選択、言語展開翻訳器、スロット逆変換、応答生成が意味解析の精度向上や支援を実現する上で重要なポイントとなる、例えば、発話文展開、精度変換、スコア関数、意味フレーム言語選択は処理を行う上で非常に重要な部分となる。
[1-7. Conceptual diagram of processing by information processing system]
Here, with reference to FIG. 14, each function, hardware configuration, and processing in the information processing system 1 are conceptually shown. FIG. 14 is a conceptual diagram showing an example of processing by the information processing system. The system process PS1 shown in FIG. 14 shows an example of the process realized by the information processing system 1. For example, FIG. 14 shows a schematic configuration diagram of the information processing system 1. Speech expansion, accuracy conversion, score function, semantic frame language selection, language expansion translator, slot inverse conversion, and response generation in FIG. 14 are important points for improving the accuracy and support of semantic analysis, for example. , Speech expansion, precision conversion, score function, and semantic frame language selection are very important parts of the process.
 図14に示すシステム処理PS1は、ユーザの発話が入力されてから、応答が出力されるまでの各処理や、各処理を実現するための機能やハードウェア構成を概念的に示す図である。例えば、システム処理PS1に示す各処理は、情報処理装置100により実行される。また、例えば、システム処理PS1中の言語展開翻訳器は、情報処理装置100の変換部132の機能により実現される。例えば、システム処理PS1中の発話意味解析器は、情報処理装置100の実行部133の機能により実現される。なお、以下では、情報処理システム1が処理を行う場合を一例として説明するが、図14に示す処理は、情報処理システム1に含まれる情報処理装置100及び端末装置10のいずれの装置が行ってもよい。図11~図13と同様の点については適宜説明を省略する。 The system process PS1 shown in FIG. 14 is a diagram conceptually showing each process from the input of the user's utterance to the output of the response, and the functions and hardware configurations for realizing each process. For example, each process shown in the system process PS1 is executed by the information processing apparatus 100. Further, for example, the language expansion translator in the system processing PS1 is realized by the function of the conversion unit 132 of the information processing device 100. For example, the utterance meaning analyzer in the system processing PS1 is realized by the function of the execution unit 133 of the information processing device 100. In the following, a case where the information processing system 1 performs processing will be described as an example, but the processing shown in FIG. 14 is performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. May be good. The same points as those in FIGS. 11 to 13 will be omitted as appropriate.
 システム処理PS1に示すように、情報処理システム1は、ユーザによる発話に対して言語識別の処理を行う。そして、情報処理システム1は、入力された発話の言語(入力言語)について言語識別ができた場合、発話文展開を行う。例えば、情報処理システム1は、発話文展開によりユーザの発話に対応する文字情報と、その文字情報のパラフレーズを含む発話文リストを生成する。例えば、情報処理システム1は、発話文展開により、図6中の発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dを含む発話文リストを生成する。この場合、情報処理システム1は、ユーザの発話が文字列化された発話展開情報Aや、発話展開情報Aのパラフレーズである発話展開情報B、発話展開情報C及び発話展開情報D等を含む発話文リストを生成する。 System processing As shown in PS1, the information processing system 1 performs language identification processing for utterances by the user. Then, when the information processing system 1 can identify the language of the input utterance language (input language), the information processing system 1 develops the utterance sentence. For example, the information processing system 1 generates a utterance sentence list including character information corresponding to the user's utterance and a paraphrase of the character information by expanding the utterance sentence. For example, the information processing system 1 generates an utterance sentence list including the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 by the utterance sentence expansion. In this case, the information processing system 1 includes the utterance development information A in which the user's utterance is converted into a character string, the utterance development information B which is a paraphrase of the utterance development information A, the utterance development information C, the utterance development information D, and the like. Generate an utterance list.
 そして、情報処理システム1は、言語展開翻訳器により、翻訳リストを生成する。なお、言語展開翻訳器は、入力言語を複数の言語の各々へ翻訳(変換)する複数の翻訳器であってもよい。図6の例では、言語展開翻訳器は、日本語を英語に翻訳する第1翻訳器、日本語をスペイン語に翻訳する第2翻訳器、日本語を英フランス語に翻訳する第3翻訳器等の複数の翻訳器により構成されてもよい。情報処理システム1は、発話文リスト中の各文(発話展開文)を各言語の翻訳器に入力することにより、発話展開文が各言語に変換された文字情報を生成する。 Then, the information processing system 1 generates a translation list by the language expansion translator. The language expansion translator may be a plurality of translators that translate (convert) an input language into each of a plurality of languages. In the example of FIG. 6, the language expansion translator is a first translator that translates Japanese into English, a second translator that translates Japanese into Spanish, a third translator that translates Japanese into English-French, and the like. It may be composed of a plurality of translators of. The information processing system 1 generates character information in which the utterance development sentence is converted into each language by inputting each sentence (utterance development sentence) in the utterance sentence list into the translator of each language.
 例えば、情報処理システム1は、発話文リストに含まれる各文(発話展開文)を各翻訳先言語の文に変換することにより、各発話展開文に対応する各翻訳先言語の文(翻訳文)を生成する。情報処理システム1は、各発話展開文に対応する各翻訳先言語の文(翻訳文)を含む翻訳リストを生成する。例えば、情報処理システム1は、図6中の発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dの各々について、各言語の翻訳器を用いて、発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dの各々に対応する各言語の翻訳文を生成する。具体的には、情報処理システム1は、図6中の発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dの各々について、図6中の翻訳先翻訳文に一覧で示すような各言語の翻訳文を生成する。英語を例とした場合、情報処理システム1は、図6中の発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dの各々を、英語への翻訳器(第1翻訳器)を用いて、発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dの各々に対応する4つの英文を生成する。また、情報処理システム1は、ユーザの発話に対応する言語も意味解析処理可能な言語(特定言語)である場合、発話文リストの各文も翻訳リストに追加して、翻訳リストを生成する。情報処理システム1は、図6の例では、日本語も特定言語であるため、発話展開情報A、発話展開情報B、発話展開情報C、発話展開情報Dを含む、翻訳リストを生成する。 For example, the information processing system 1 converts each sentence (speech expansion sentence) included in the utterance sentence list into a sentence of each translation destination language, thereby converting a sentence (translation sentence) of each translation destination language corresponding to each utterance development sentence. ) Is generated. The information processing system 1 generates a translation list including sentences (translated sentences) of each translation destination language corresponding to each utterance development sentence. For example, the information processing system 1 uses a translator of each language for each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. A translation of each language corresponding to each of the development information B, the utterance development information C, and the utterance development information D is generated. Specifically, the information processing system 1 lists each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 in the translated text in FIG. Generate translations for each language. Taking English as an example, the information processing system 1 translates each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D in FIG. 6 into English (first translator). ) Is used to generate four English sentences corresponding to each of the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D. Further, when the language corresponding to the user's utterance is also a language (specific language) capable of semantic analysis processing, the information processing system 1 adds each sentence of the utterance sentence list to the translation list to generate a translation list. In the example of FIG. 6, the information processing system 1 generates a translation list including the utterance development information A, the utterance development information B, the utterance development information C, and the utterance development information D because Japanese is also a specific language in the example of FIG.
 そして、情報処理システム1は、翻訳結果の品質推定を行う。例えば、情報処理システム1は、翻訳リスト中の翻訳文の品質を推定する。 Then, the information processing system 1 estimates the quality of the translation result. For example, the information processing system 1 estimates the quality of the translated text in the translation list.
 また、情報処理システム1は、少なくとも1つの翻訳文がハイスコアであると判定した場合、ハイスコアである翻訳文を用いて発話意味解析器により意味解析処理を行う。情報処理システム1は、翻訳リスト中の翻訳文のうち、少なくとも1つの品質が高いと判定した場合、品質が高い翻訳文を用いて発話意味解析器により意味解析処理を行う。この場合、全ての翻訳文がロースコアであると判定された言語(特定言語)がある場合、情報処理システム1は、その言語については処理を中断することを示す応答を生成する。すなわち、情報処理システム1は、ある特定言語へ翻訳された全翻訳文の品質が低いと判定した場合、処理を中断することを示す応答を生成する。この場合、情報処理システム1は、「XX言語からYY言語への翻訳精度が低いため中断します」といった応答生成を行う。例えば、情報処理システム1は、「入力言語で韓国語から意味解析用の英語への翻訳への翻訳精度が低いため中断します」といった応答生成を行う。 Further, when the information processing system 1 determines that at least one translated sentence has a high score, the information processing system 1 performs a semantic analysis process by the utterance semantic analyzer using the translated sentence having a high score. When the information processing system 1 determines that at least one of the translated sentences in the translation list is of high quality, the information processing system 1 performs a semantic analysis process by the utterance semantic analyzer using the translated sentence of high quality. In this case, if there is a language (specific language) in which all the translated sentences are determined to have a low score, the information processing system 1 generates a response indicating that the processing is interrupted for that language. That is, the information processing system 1 generates a response indicating that the processing is interrupted when it is determined that the quality of all the translated sentences translated into a specific language is low. In this case, the information processing system 1 generates a response such as "The translation accuracy from the XX language to the YY language is low, so the process is interrupted." For example, the information processing system 1 generates a response such as "The input language is interrupted because the translation accuracy from Korean to English for semantic analysis is low."
 なお、情報処理システム1は、入力言語が特定言語である場合、入力言語の発話展開文と、ハイスコアである翻訳文を用いて発話意味解析器により意味解析処理を行う。また、入力言語が特定言語である場合、情報処理システム1は、ハイスコアである翻訳文が無い場合であっても、発話展開文を用いて発話意味解析器により意味解析処理を行ってもよい。 When the input language is a specific language, the information processing system 1 performs semantic analysis processing by the utterance meaning analyzer using the utterance development sentence of the input language and the translated sentence having a high score. Further, when the input language is a specific language, the information processing system 1 may perform semantic analysis processing by the utterance meaning analyzer using the utterance expansion sentence even when there is no translated sentence having a high score. ..
 情報処理システム1は、全ての言語(特定言語)がロースコアであると判定した場合、後編集に関する処理を行う。例えば、情報処理システム1は、全ての翻訳文の品質が低いと判定した場合、後編集に関する処理を行う。なお、入力言語が特定言語である場合、情報処理システム1は、全ての翻訳文の品質が低いと判定した場合であっても、発話展開文を用いて発話意味解析器により意味解析処理を行い、後編集に関する処理を行わなくてもよい。 The information processing system 1 performs processing related to post-editing when it is determined that all languages (specific languages) have low scores. For example, when the information processing system 1 determines that the quality of all the translated sentences is low, the information processing system 1 performs a process related to post-editing. When the input language is a specific language, the information processing system 1 performs semantic analysis processing by the utterance semantic analyzer using the utterance expansion sentence even when it is determined that the quality of all the translated sentences is low. , It is not necessary to perform the processing related to post-editing.
 例えば、情報処理システム1は、入力言語が特定言語ではない場合であって、全ての言語の翻訳文がロースコアであると判定し、ユーザの発話に対する処理について即時性が必要ない場合、翻訳結果を人手で編集させる処理を実行する。情報処理システム1は、クラウドワーカに人手での編集を行わせる。例えば、情報処理システム1は、発話展開文とその翻訳文とをクラウドワーカが利用するデバイス(端末装置10等)に送信することにより、クラウドワーカに翻訳結果を人手で編集させる処理を行わせる。 For example, when the information processing system 1 determines that the input language is not a specific language and the translated sentences of all the languages have a low score, and the processing for the user's utterance does not require immediacy, the translation result is obtained. Executes the process of manually editing. The information processing system 1 causes a cloud worker to perform manual editing. For example, the information processing system 1 causes the cloud worker to manually edit the translation result by transmitting the utterance development sentence and the translated sentence to a device (terminal device 10 or the like) used by the cloud worker.
 また、情報処理システム1は、後編集の処理時間が長い(ロングタイム)である場合、処理を棄却する。例えば、情報処理システム1は、クラウドワーカ等による人手での編集が所定の時間内に完了しない場合、処理を棄却する。この場合、情報処理システム1は、「XX言語からYY言語への翻訳時間が長いため、中断します」といった応答生成を行う。例えば、情報処理システム1は、「入力言語のオランダ語から意味解析用の日本語への翻訳時間が長いため、中断します」といった応答生成を行う。 Further, the information processing system 1 rejects the processing when the post-editing processing time is long (long time). For example, the information processing system 1 rejects the process if the manual editing by a cloud worker or the like is not completed within a predetermined time. In this case, the information processing system 1 generates a response such as "The translation time from the XX language to the YY language is long, so the process is interrupted." For example, the information processing system 1 generates a response such as "The translation time from the input language Dutch to Japanese for semantic analysis is long, so the process is interrupted."
 また、情報処理システム1は、後編集の処理時間が短い(ショートタイム)である場合、人手により編集された翻訳文を用いて発話意味解析器により意味解析処理を行う。 Further, when the post-editing processing time is short (short time), the information processing system 1 performs the semantic analysis processing by the utterance semantic analyzer using the translated sentence edited by hand.
 上述のように、情報処理システム1は、翻訳リスト中の発話展開文やハイスコアである翻訳文について、発話意味解析器により意味解析処理を行うことにより、各文(意味解析対象文)の出力を得る。例えば、情報処理システム1は、各意味解析対象文について意味解析の結果を得る。例えば、情報処理システム1は、意味解析処理により各意味解析対象文に対応する意味フレームの情報を生成する。例えば、情報処理システム1は、意味解析処理により、各意味解析対象文に対応するDomain-Goal(ドメインゴール)が特定され、スロット値が設定された意味フレームの情報を生成する。また、情報処理システム1は、各意味解析対象文に対応するDomain-Goal(ドメインゴール)やスロット値の確信度を示すスコア(意味解析スコア)を含む意味フレームの情報を生成する。 As described above, the information processing system 1 outputs each sentence (semantic analysis target sentence) by performing semantic analysis processing on the utterance development sentence and the translated sentence having a high score in the translation list by the utterance semantic analyzer. To get. For example, the information processing system 1 obtains the result of the semantic analysis for each sentence to be analyzed. For example, the information processing system 1 generates information of a semantic frame corresponding to each semantic analysis target sentence by a semantic analysis process. For example, the information processing system 1 specifies the Domain-Goal (domain goal) corresponding to each semantic analysis target sentence by the semantic analysis process, and generates information of the semantic frame in which the slot value is set. Further, the information processing system 1 generates information of a semantic frame including a Domain-Goal (domain goal) corresponding to each semantic analysis target sentence and a score (semantic analysis score) indicating the certainty of the slot value.
 そして、情報処理システム1は、各意味解析対象文の意味解析スコア(解析スコア)を精度指標値である意味解析精度(%)に変換する。情報処理システム1は、意味解析スコア(解析スコア)とスコア関数とを用いて、意味解析精度(%)を算出する。情報処理システム1は、言語を示す情報と、意味解析スコア(解析スコア)とを入力とし、意味解析精度を出力するスコア関数とを用いて、意味解析精度を算出する。 Then, the information processing system 1 converts the semantic analysis score (analysis score) of each semantic analysis target sentence into the semantic analysis accuracy (%) which is an accuracy index value. The information processing system 1 calculates the semantic analysis accuracy (%) by using the semantic analysis score (analysis score) and the score function. The information processing system 1 calculates the semantic analysis accuracy by using the information indicating the language and the score function that outputs the semantic analysis accuracy by inputting the semantic analysis score (analysis score).
 そして、情報処理システム1は、意味フレームの言語の選択を行う。例えば、情報処理システム1は、発話意味解析処理が行われた意味解析対象文が複数ある場合、意味解析精度を基に意味解析対象文を選択することにより、意味解析対象文の意味フレームの言語をその後の処理に用いる言語に選択する。情報処理システム1は、意味解析精度が最大の意味解析対象文を選択することにより、意味解析対象文の意味フレームの言語をその後の処理に用いる言語に選択する。なお、情報処理システム1は、発話意味解析処理が行われた意味解析対象文が1つである場合、その意味解析対象文の意味フレームの言語をその後の処理に用いる言語に選択する。 Then, the information processing system 1 selects the language of the semantic frame. For example, in the information processing system 1, when there are a plurality of semantic analysis target sentences for which utterance semantic analysis processing has been performed, the language of the semantic frame of the semantic analysis target sentence is selected by selecting the semantic analysis target sentence based on the semantic analysis accuracy. Is selected as the language to be used for the subsequent processing. The information processing system 1 selects the language of the semantic frame of the semantic analysis target sentence as the language to be used for the subsequent processing by selecting the semantic analysis target sentence having the maximum semantic analysis accuracy. When the information processing system 1 has one semantic analysis target sentence for which the utterance semantic analysis process has been performed, the information processing system 1 selects the language of the semantic frame of the semantic analysis target sentence as the language to be used for the subsequent processing.
 そして、情報処理システム1は、知識DBを用いてスロット逆変換を行う。情報処理システム1は、逆翻訳または多言語辞書を利用してスロット逆変換を行う。例えば、情報処理システム1は、外部の知識情報提供サーバや知識情報記憶部125等の知識DBを用いてスロット逆変換を行う。例えば、情報処理システム1は、意味フレーム中の翻訳先言語のスロット値を入力言語(対象言語)のスロット値に逆変換する。なお、情報処理システム1は、意味フレームの言語が入力言語(対象言語)である場合など、逆変換が不要の場合はスロット逆変換の処理を行わなくてもよい。 Then, the information processing system 1 performs slot inverse transformation using the knowledge DB. The information processing system 1 performs reverse slot conversion using reverse translation or a multilingual dictionary. For example, the information processing system 1 performs slot inverse conversion using a knowledge database such as an external knowledge information providing server or a knowledge information storage unit 125. For example, the information processing system 1 reversely converts the slot value of the translation destination language in the semantic frame into the slot value of the input language (target language). The information processing system 1 does not have to perform the slot inverse transformation process when the inverse transformation is unnecessary, such as when the language of the semantic frame is the input language (target language).
 そして、情報処理システム1は、応答生成を行う。情報処理システム1は、意味解析処理の結果に対応する情報の生成を行う。情報処理システム1は、意味フレームのDomain-Goal(ドメインゴール)やスロット値の情報を基に、応答する情報の生成を行う。 Then, the information processing system 1 generates a response. The information processing system 1 generates information corresponding to the result of the semantic analysis process. The information processing system 1 generates response information based on the domain-goal (domain goal) of the semantic frame and the slot value information.
[1-7-1.情報処理システムによる処理の具体例]
 図15を用いて処理の具体例について説明する。図15は、情報処理システムによる具体的な処理の一例を示す図である。例えば、図15に示す処理例は、図12に示すフローに対応する。図15に示す処理は、情報処理システム1に含まれる情報処理装置100及び端末装置10のいずれの装置が行ってもよい。図11~図14と同様の点については適宜説明を省略する。
[1-7-1. Specific example of processing by information processing system]
A specific example of the process will be described with reference to FIG. FIG. 15 is a diagram showing an example of specific processing by the information processing system. For example, the processing example shown in FIG. 15 corresponds to the flow shown in FIG. The process shown in FIG. 15 may be performed by any of the information processing device 100 and the terminal device 10 included in the information processing system 1. The same points as those in FIGS. 11 to 14 will be omitted as appropriate.
 図15では、ユーザは、「XXXの曲をかけて(韓国語)」と韓国語で発話を行う。これにより、情報処理システム1は、ユーザの韓国語による入力IN1を取得する。なお、「XXX」は、所定のアーティスト名(歌手名)であるものとする。 In FIG. 15, the user speaks in Korean, saying "Play a song of XXX (Korean)". As a result, the information processing system 1 acquires the input IN1 of the user in Korean. It is assumed that "XXX" is a predetermined artist name (singer name).
 そして、情報処理システム1は、韓国語による意味解析処理を行う。これにより、情報処理システム1は、結果RS1に示すように、韓国語による意味解析の結果を取得する。図15例では、情報処理システム1は、韓国語による意味解析ではドメインが「料理」であり、そのスコア値が「低」であることを示す結果を取得する。情報処理システム1は、韓国語の意味解析器を用いて、ドメインが「料理」に特定され、その意味解析スコアが「0.2」等の低い数値である意味解析の結果を生成する。 Then, the information processing system 1 performs semantic analysis processing in Korean. As a result, the information processing system 1 acquires the result of the semantic analysis in Korean as shown in the result RS1. In the example of FIG. 15, the information processing system 1 acquires a result indicating that the domain is "cooking" and the score value is "low" in the semantic analysis in Korean. The information processing system 1 uses a Korean semantic analyzer to generate a semantic analysis result in which the domain is specified as "cooking" and the semantic analysis score is a low numerical value such as "0.2".
 韓国語による意味解析のスコアが低く、低精度であるため、情報処理システム1は、韓国語である「XXXの曲をかけて(韓国語)」の英語の翻訳文を用いて意味解析処理を行う。情報処理システム1は、英語の翻訳文「Play XXX song」を用いて、意味解析処理を行うことにより、結果RS2に示すように、ドメイン(ドメイン・ゴール)が「音楽再生」であり、そのスコア値が「高」であることを示す結果を取得する。情報処理システム1は、英語の意味解析器を用いて、ドメインが「音楽再生」に特定され、その意味解析スコアが「0.9」等の高い数値である意味解析の結果を生成する。 Since the score of the semantic analysis in Korean is low and the accuracy is low, the information processing system 1 performs the semantic analysis process using the English translation of the Korean word "Korean song (Korean)". conduct. The information processing system 1 performs a semantic analysis process using the English translation "Play XXX song", and as shown in the result RS2, the domain (domain goal) is "music playback" and its score. Get the result indicating that the value is "high". The information processing system 1 uses an English semantic analyzer to generate a semantic analysis result in which the domain is specified as "music reproduction" and the semantic analysis score is a high numerical value such as "0.9".
 情報処理システム1は、英語の意味解析結果を用いて、ユーザへの応答を決定する。情報処理システム1は、英語の意味解析結果を用いて、アーティスト「XXX」の曲を再生すると決定する。なお、「AAA & BBB」アーティスト「XXX」の曲名であるものとする。図15では、情報処理システム1は、「AAA & BBBを再生します(韓国語)」と韓国語で応答出力を行う。そして、情報処理システム1は、アーティスト「XXX」の曲「AAA & BBB」を再生する処理を実行する。 The information processing system 1 determines the response to the user using the English semantic analysis result. The information processing system 1 determines to play the song of the artist "XXX" using the English semantic analysis result. It is assumed that the song title is "AAA & BBB" artist "XXX". In FIG. 15, the information processing system 1 outputs a response in Korean saying "AAA & BBB is reproduced (Korean)". Then, the information processing system 1 executes a process of playing the song "AAA & BBB" of the artist "XXX".
 情報処理システム1は、入力の言語(韓国語)と解析の言語(英語)をユーザに通知してもよい。また、情報処理システム1は、ユーザに対してフィードバックを行う。情報処理システム1は、フィードバックFB1に示すように、「韓国語の精度が低いため、英語で解析しました(韓国語)」とユーザへ韓国語で通知する。なお、通知態様は、音声出力や画面への表示等の種々の態様であってもよい。また、情報処理システム1は、英語解析であることを示す色(例えば緑色等)の光を点灯させてもよいし、翻訳を意味する効果音(例えばピピ等)を出力したりしてもよい。 The information processing system 1 may notify the user of the input language (Korean) and the analysis language (English). In addition, the information processing system 1 provides feedback to the user. As shown in the feedback FB1, the information processing system 1 notifies the user in Korean that "the accuracy of Korean is low, so the analysis was performed in English (Korean)". The notification mode may be various modes such as voice output and display on the screen. Further, the information processing system 1 may turn on a light of a color (for example, green or the like) indicating that the analysis is in English, or may output a sound effect (for example, pip or the like) meaning translation. ..
[1-8.処理の詳細等]
 ここから、上述した各種処理の詳細について説明する。
[1-8. Details of processing, etc.]
From here, the details of the various processes described above will be described.
[1-8-1.言語識別]
 まず、上述した言語識別について記載する。言語識別とは、複数の入力言語を想定し、自動的に発話者の言語を特定する公知の技術であり、近年では音声認識処理と同時に用いられることが多い。また、言語識別は、音声認識の音響モデルから判断する場合と、言語モデルから判断する場合に大きく分かれる。前者は言語識別にかかる時間(遅延)を少なくすることができる点が利点であり、後者はより正確な言語識別が可能となり、遅延と精度のトレードオフの関係である。情報処理システム1は、音声認識の音響モデルから判断する処理と、言語モデルから判断する処理とのいずれにより、言語識別を行ってもよい。
[1-8-1. Language identification]
First, the above-mentioned language identification will be described. Language identification is a known technique that assumes a plurality of input languages and automatically identifies the language of the speaker, and is often used at the same time as voice recognition processing in recent years. In addition, language identification is roughly divided into a case of judging from an acoustic model of speech recognition and a case of judging from a language model. The former has the advantage that the time (delay) required for language identification can be reduced, and the latter enables more accurate language identification, which is a trade-off relationship between delay and accuracy. The information processing system 1 may perform language identification by either a process of determining from an acoustic model of voice recognition or a process of determining from a language model.
 また、入力言語が翻訳器の対応外の言語や、意味解析器の特定言語外の場合、処理を中断し、応答に中断理由を含める。例えば、情報処理システム1は、「ZZ言語は翻訳および意味解析で対応しておりません」などと言った出力を行う。 Also, if the input language is not supported by the translator or a specific language of the semantic analyzer, the processing is interrupted and the reason for the interruption is included in the response. For example, the information processing system 1 outputs such as "ZZ language does not support translation and semantic analysis".
[1-8-2.低精度の意味解析器の改善]
 次に、低精度の意味解析器の改善について、説明する。例えば、情報処理システム1は、発話展開数「N」と言語展開数「M」の掛け算した「N×M」個の文を処理する。
[1-8-2. Improvement of low-precision semantic analyzer]
Next, the improvement of the low-precision semantic analyzer will be described. For example, the information processing system 1 processes "N × M" sentences obtained by multiplying the number of utterance developments "N" and the number of language developments "M".
 なお、情報処理システム1は、例えば時間的な制約の観点がある場合には、まず入力発話と入力言語のみの意味解析処理を行ってもよい。この場合、情報処理システム1は、入力発話と入力言語のみの意味解析処理の結果が低精度の場合、発話文展開や言語展開の処理のいずれか、またはその両方を実行可能とする。例えば、情報処理システム1は、入力発話と入力言語のみの意味解析処理のスコア値(意味解析スコア)または意味解析精度が一定値(閾値)以下である場合、発話文展開や言語展開の処理のいずれか、またはその両方を実行可能とする。この場合、一定値(閾値)が低いほど、入力内容に対して、より低精度の意味解析処理を許容することになるが、この一定値(閾値)の値は適宜設定可能であるものとする。例えば、この一定値(閾値)の値は、処理の用途等に応じて適宜設定されてもよい。例えば、処理に要求される精度が高い程、一定値(閾値)の値を大きく設定してもよい。 Note that the information processing system 1 may first perform semantic analysis processing of only input utterance and input language when there is a viewpoint of time constraints, for example. In this case, the information processing system 1 makes it possible to execute either the utterance sentence expansion process, the language expansion process, or both when the result of the semantic analysis process of only the input utterance and the input language is low accuracy. For example, when the score value (semantic analysis score) or the semantic analysis accuracy of the semantic analysis processing of only the input utterance and the input language is less than a certain value (threshold), the information processing system 1 performs the processing of the utterance sentence expansion and the language expansion. Make either or both feasible. In this case, the lower the constant value (threshold value), the more inaccurate semantic analysis processing is allowed for the input content, but the value of this constant value (threshold value) can be set as appropriate. .. For example, the value of this constant value (threshold value) may be appropriately set according to the purpose of processing and the like. For example, the higher the accuracy required for processing, the larger the constant value (threshold value) may be set.
 図14の例は、入力言語が韓国語で意味解析処理が低精度によって料理ドメインとなり、その後、英語翻訳で変換した発話文によって、再度意味解析処理では高精度になり、音楽ドメインに変更される。この場合、情報処理システム1は、応答に「低精度のため、英語で翻訳して解析しました」などの応答文を含めることが可能である。 In the example of FIG. 14, the input language is Korean and the semantic analysis process becomes the cooking domain due to low accuracy, and then the utterance sentence converted by the English translation makes the semantic analysis process highly accurate again and is changed to the music domain. .. In this case, the information processing system 1 can include a response sentence such as "translated and analyzed in English because of low accuracy" in the response.
[1-8-3.品質推定]
 次に、品質推定について、説明する。品質推定は、翻訳器の出力がどの程度の精度であるかを推定する処理(モジュール)である。精度が低い場合、意味解析処理しても所望の結果を得ることが難しく、意味解析が別のドメインゴールを推定することによる、ユーザにとって不利益になることを避けるために、翻訳結果の段階で処理を中止させるためのものである。つまり、情報処理システム1は、品質推定のスコア値(品質スコア)が一定以上高いもの、通常の意味解析処理を行い、一定以下のものは処理を中断して、中断理由を応答に含める。例えば、英語、フランス語及びドイツ語の品質スコアのみが閾値「0.75」である場合、情報処理システム1は、閾値「0.75」以上の言語である英語、フランス語及びドイツ語の3つの言語は、通常の意味解析処理を行い、その他の言語は処理を中断して、中断理由を応答に含める。
[1-8-3. Quality estimation]
Next, quality estimation will be described. Quality estimation is a process (module) for estimating the accuracy of the output of a translator. If the accuracy is low, it is difficult to obtain the desired result even with the semantic analysis process, and in order to avoid the disadvantage to the user due to the semantic analysis estimating another domain goal, at the stage of the translation result. This is to stop the processing. That is, the information processing system 1 performs a normal semantic analysis process when the score value (quality score) of the quality estimation is higher than a certain level, interrupts the process when the score value is lower than a certain level, and includes the reason for the interruption in the response. For example, when only the quality scores of English, French and German have a threshold value of "0.75", the information processing system 1 has three languages of English, French and German, which are languages having a threshold value of "0.75" or higher. Performs normal semantic analysis processing, other languages interrupt the processing and include the reason for the interruption in the response.
 例えば、テキスト表示や音声応答の場合、情報処理システム1は、「オランダ語から日本語への翻訳精度が低いため、処理を中断します」などの出力を行う。これにより、ユーザは、どの言語が翻訳処理を正常にできなかったことがわかる。なお、情報処理システム1は、エラー音や、識別可能な色をデバイスで表示するなど、ユーザが認知可能であれば、種々の表現形式により出力(通知)を行ってもよい。 For example, in the case of text display or voice response, the information processing system 1 outputs such as "The processing is interrupted because the translation accuracy from Dutch to Japanese is low". This allows the user to know which language was unable to successfully translate. The information processing system 1 may output (notify) in various expression formats as long as it can be recognized by the user, such as displaying an error sound or an identifiable color on the device.
 また、情報処理システム1は、発話文展開および言語展開で生成したすべての翻訳文リストの各文に対して、品質推定のスコア値(品質スコア)を算出し、どれか一つでも一定以上の値なら、一定以上の特定言語すべて意味解析処理を実行する。一方、情報処理システム1は、すべての特定言語が一定以下の値である場合は、処理を中断して中断理由を応答に含める。例えば、情報処理システム1は、「処理可能な言語へ翻訳ですべて精度が低いため、処理を中断します」などの出力を行う。 Further, the information processing system 1 calculates a quality estimation score value (quality score) for each sentence of all the translated sentence lists generated by the utterance sentence expansion and the language expansion, and any one of them is above a certain level. If it is a value, the semantic analysis process is executed for all specific languages above a certain level. On the other hand, when all the specific languages have values below a certain level, the information processing system 1 interrupts the processing and includes the reason for the interruption in the response. For example, the information processing system 1 outputs such as "Translation to a processable language has low accuracy, so processing is interrupted."
[1-8-4.クラウドワーカ]
 次に、クラウドワーカについて説明する。品質推定追加の延長として、情報処理システム1は、すべての特定言語の翻訳結果が一定以下の場合、クラウドワーカ(翻訳編集者)により、人手による翻訳結果を修正する処理(後編集)を行わせることも可能である。ただし、人手編集のため、処理時間がかかるため、この処理は即時性が要求されない用途を想定される。そのため、一定以上の時間を経過しても処理が終わらない場合は、情報処理システム1は、処理を中断して応答に中断理由を含める。例えば、情報処理システム1は、「ヒンディー語から日本語への人手翻訳の時間が長いため、中断します」などの出力を行う。また、情報処理システム1は、ドメインゴールによって、一定以上の時間を調整してもよい。
[1-8-4. Cloud worker]
Next, the cloud worker will be described. As an extension of the addition of quality estimation, the information processing system 1 causes a cloud worker (translation editor) to manually correct the translation result (post-editing) when the translation result of all specific languages is below a certain level. It is also possible. However, since it is manually edited and it takes a long time to process, this process is expected to be used for which immediacy is not required. Therefore, if the processing is not completed even after a certain period of time has passed, the information processing system 1 interrupts the processing and includes the reason for the interruption in the response. For example, the information processing system 1 outputs such as "It will be interrupted because the manual translation from Hindi to Japanese takes a long time." Further, the information processing system 1 may adjust a certain time or more according to the domain goal.
[1-8-5.翻訳精度・時間の調整]
 情報処理システム1は、閾値等の種々の値を変更してもよい。例えば、情報処理システム1は、一定以上の翻訳精度や待ち時間を、システムのデフォルト値以外に、ユーザが自由に設定させてもよい。ユーザによっては品質が高いものだけを得たい場合は精度の閾値は高く設定し、時間がかかっても結果が欲しい場合は、待ち時間を長く設定することで、自由にカスタイマイズすることが可能であり、ユーザごとに最適化することが可能となる。
[1-8-5. Translation accuracy and time adjustment]
The information processing system 1 may change various values such as a threshold value. For example, in the information processing system 1, the user may freely set the translation accuracy and the waiting time above a certain level in addition to the default values of the system. Depending on the user, if you want to obtain only high quality products, set a high accuracy threshold, and if you want results even if it takes time, you can freely customize by setting a long waiting time. , It is possible to optimize for each user.
[2.その他の実施形態]
 上述した各実施形態に係る処理は、上記各実施形態や変形例以外にも種々の異なる形態(変形例)にて実施されてよい。
[2. Other embodiments]
The processing according to each of the above-described embodiments may be carried out in various different forms (modifications) in addition to the above-mentioned embodiments and modifications.
[2-1.クライアント側で意味解析処理等を行う構成例]
 実施形態においては、システム構成の一例として、情報処理装置100が変換処理や意味解析処や逆変換理等を行う場合を示したが、端末装置10が変換処理や意味解析処や逆変換理等を行ってもよい。すなわち、クライアント側の装置である端末装置10が上述した変換処理や意味解析処や逆変換理等を行う情報処理装置であってもよい。このように、情報処理システム1のシステム構成は、サーバ側の装置である情報処理装置100が変換処理や意味解析処や逆変換理等を行う構成に限らず、クライアント側の装置である端末装置10が上述した変換処理や意味解析処や逆変換理等を行う構成であってもよい。
[2-1. Configuration example of performing semantic analysis processing on the client side]
In the embodiment, as an example of the system configuration, the case where the information processing apparatus 100 performs conversion processing, semantic analysis processing, inverse transformation theory, etc. is shown, but the terminal device 10 performs conversion processing, semantic analysis processing, inverse transformation theory, etc. May be done. That is, the terminal device 10 which is a device on the client side may be an information processing device that performs the above-mentioned conversion processing, semantic analysis processing, inverse conversion theory, and the like. As described above, the system configuration of the information processing system 1 is not limited to the configuration in which the information processing device 100, which is a device on the server side, performs conversion processing, semantic analysis processing, inverse transformation, and the like, and is a terminal device which is a device on the client side. 10 may be configured to perform the above-mentioned conversion processing, semantic analysis processing, inverse transformation theory, and the like.
 端末装置10が上述した変換処理や意味解析処や逆変換理等を行う情報処理装置である場合、情報処理システム1では、クライアント側(端末装置10)で翻訳や意味解析や逆変換を行う。そして、サーバ側(情報処理装置100)は、その意味解析結果や逆変換結果の情報を端末装置10から取得して、各種の処理を行う。この場合、端末装置10の実行部152は、情報処理装置100の実行部133と同様の機能を有してもよい。また、端末装置10は、上述した変換部132と同様の機能を実現する変換部や、逆変換部136と同様の機能を実現する逆変換部を有してもよい。また、この場合、情報処理装置100は、変換部132や逆変換部136を有しなくてもよい。 When the terminal device 10 is an information processing device that performs the above-mentioned conversion processing, semantic analysis processing, inverse conversion theory, etc., the information processing system 1 performs translation, semantic analysis, and inverse conversion on the client side (terminal device 10). Then, the server side (information processing device 100) acquires the information of the semantic analysis result and the inverse conversion result from the terminal device 10 and performs various processes. In this case, the execution unit 152 of the terminal device 10 may have the same function as the execution unit 133 of the information processing device 100. Further, the terminal device 10 may have a conversion unit that realizes the same function as the conversion unit 132 described above, and an inverse conversion unit that realizes the same function as the inverse conversion unit 136. Further, in this case, the information processing apparatus 100 does not have to have the conversion unit 132 and the inverse conversion unit 136.
 また、情報処理システム1は、クライアント側(端末装置10)で発話の意味解析までを行い、サーバ側(情報処理装置100)で逆変換を行うシステム構成であってもよい。この場合、クライアント側の装置である端末装置10が上述した変換処理や意味解析処理を行う情報処理装置であり、サーバ側の装置である情報処理装置100が上述した逆変換処理を行う情報処理装置であってもよい。この場合、端末装置10の変換部や実行部152が変換処理や意味解析処理を行い、情報処理装置100の逆変換部136が逆変換処理を行う。 Further, the information processing system 1 may have a system configuration in which the client side (terminal device 10) analyzes the meaning of the utterance and the server side (information processing device 100) performs inverse conversion. In this case, the terminal device 10 which is a device on the client side is an information processing device which performs the above-mentioned conversion processing and the semantic analysis processing, and the information processing device 100 which is a device on the server side performs the above-mentioned inverse conversion processing. It may be. In this case, the conversion unit and the execution unit 152 of the terminal device 10 perform conversion processing and semantic analysis processing, and the inverse conversion unit 136 of the information processing device 100 performs reverse conversion processing.
 なお、上記は一例であり、情報処理システム1においては、各処理をいずれの装置が行ってもよい。このように、情報処理システム1は、各処理について、クライアント側の装置(端末装置10)及びサーバ側の装置(情報処理装置100)のいずれが行うシステム構成であってもよい。 Note that the above is an example, and in the information processing system 1, any device may perform each process. As described above, the information processing system 1 may have a system configuration in which either the client-side device (terminal device 10) or the server-side device (information processing device 100) performs each process.
[2-2.その他の構成例]
 なお、上記の例では、情報処理装置100と端末装置10とが別体である場合を示したが、これらの装置は一体であってもよい。
[2-2. Other configuration examples]
In the above example, the case where the information processing device 100 and the terminal device 10 are separate bodies is shown, but these devices may be integrated.
[2-3.その他]
 また、上記各実施形態において説明した各処理のうち、自動的に行われるものとして説明した処理の全部または一部を手動的に行うこともでき、あるいは、手動的に行われるものとして説明した処理の全部または一部を公知の方法で自動的に行うこともできる。この他、上記文書中や図面中で示した処理手順、具体的名称、各種のデータやパラメータを含む情報については、特記する場合を除いて任意に変更することができる。例えば、各図に示した各種情報は、図示した情報に限られない。
[2-3. others]
Further, among the processes described in each of the above embodiments, all or a part of the processes described as being automatically performed can be manually performed, or the processes described as being manually performed. It is also possible to automatically perform all or part of the above by a known method. In addition, the processing procedure, specific name, and information including various data and parameters shown in the above document and drawings can be arbitrarily changed unless otherwise specified. For example, the various information shown in each figure is not limited to the illustrated information.
 また、図示した各装置の各構成要素は機能概念的なものであり、必ずしも物理的に図示の如く構成されていることを要しない。すなわち、各装置の分散・統合の具体的形態は図示のものに限られず、その全部または一部を、各種の負荷や使用状況などに応じて、任意の単位で機能的または物理的に分散・統合して構成することができる。 Further, each component of each device shown in the figure is a functional concept, and does not necessarily have to be physically configured as shown in the figure. That is, the specific form of distribution / integration of each device is not limited to the one shown in the figure, and all or part of the device is functionally or physically dispersed / physically distributed in any unit according to various loads and usage conditions. Can be integrated and configured.
 また、上述してきた各実施形態及び変形例は、処理内容を矛盾させない範囲で適宜組み合わせることが可能である。 Further, each of the above-described embodiments and modifications can be appropriately combined as long as the processing contents do not contradict each other.
 また、本明細書に記載された効果はあくまで例示であって限定されるものでは無く、他の効果があってもよい。 Further, the effects described in the present specification are merely examples and are not limited, and other effects may be obtained.
[3.本開示に係る効果]
 上述のように、本開示に係る情報処理装置(実施形態では情報処理装置100)は、実行部(実施形態では実行部133)と、算出部(実施形態では算出部134)とを備える。実行部は、ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する。算出部は、1以上の文字情報の各々に対応する意味解析処理の結果に基づいて、1以上の文字情報の各々に対して、意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する。
[3. Effect of this disclosure]
As described above, the information processing device (information processing device 100 in the embodiment) according to the present disclosure includes an execution unit (execution unit 133 in the embodiment) and a calculation unit (calculation unit 134 in the embodiment). The execution unit executes a semantic analysis process on one or more character information corresponding to each of one or more languages including a target language which is a language corresponding to the user's utterance. The calculation unit is an accuracy index that makes it possible to compare the accuracy of the semantic analysis processing for each of the one or more character information among a plurality of languages based on the result of the semantic analysis processing corresponding to each of the one or more character information. Calculate the value.
 このように、本開示に係る情報処理装置は、1以上の文字情報の各々の意味解析処理の結果に基づいて、1以上の文字情報の各々に対して、意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出することで、言語間における意味解析精度の比較を可能にすることができる。 As described above, the information processing apparatus according to the present disclosure determines the accuracy of the semantic analysis processing for each of the one or more character information between a plurality of languages based on the result of the semantic analysis processing of each of the one or more character information. By calculating the accuracy index value that can be compared with, it is possible to compare the semantic analysis accuracy between languages.
 また、算出部は、意味解析処理の結果の情報を入力として精度指標値を出力する関数を用いて、1以上の文字情報の各々の精度指標値を算出する。このように、情報処理装置は、意味解析処理の結果の情報を入力として精度指標値を出力する関数を用いることで適切に精度指標値を算出することができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 Further, the calculation unit calculates each accuracy index value of one or more character information by using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process. As described above, the information processing apparatus can appropriately calculate the accuracy index value by using the function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、算出部は、意味解析処理の結果に含まれるスコアを入力として精度指標値を出力する関数を用いて、1以上の文字情報の各々の精度指標値を算出する。このように、情報処理装置は、意味解析処理の結果に含まれるスコアを入力として精度指標値を出力する関数を用いることで、スコアを適切に精度指標値に変換することができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 In addition, the calculation unit calculates the accuracy index value of each of one or more character information by using a function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process. As described above, the information processing apparatus can appropriately convert the score into the accuracy index value by using the function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、算出部は、一の言語の文字情報に対する意味解析処理のスコアと、一の言語を示す情報とを入力として精度指標値を出力する関数を用いて、一の言語の文字情報の精度指標値を算出する。このように、情報処理装置は、スコアと言語の情報とを入力として精度指標値を出力する関数を用いることで、スコアを適切に精度指標値に変換することができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 In addition, the calculation unit uses a function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating one language, and the accuracy index of the character information of one language. Calculate the value. As described above, the information processing apparatus can appropriately convert the score into the accuracy index value by using the function that outputs the accuracy index value by inputting the score and the language information. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、実行部は、対象言語によるユーザの発話に対応する一の文字情報を含む1以上の文字情報に対して意味解析処理を実行する。このように、情報処理装置は、対象言語の文字情報を含む1以上の文字情報の各々を対象として、精度指標値を算出することで、対象言語を含む言語間における意味解析精度の比較を可能にすることができる。 In addition, the execution unit executes a semantic analysis process on one or more character information including one character information corresponding to the user's utterance in the target language. In this way, the information processing device can compare the semantic analysis accuracy between languages including the target language by calculating the accuracy index value for each of one or more character information including the character information of the target language. Can be.
 また、実行部は、一の文字情報が対象言語の翻訳先となる翻訳先言語に変換された翻訳文字情報を含む1以上の文字情報に対して意味解析処理を実行する。このように、情報処理装置は、翻訳先言語に変換された文字情報を含む1以上の文字情報の各々を対象として、精度指標値を算出することで、翻訳先言語を含む言語間における意味解析精度の比較を可能にすることができる。 In addition, the execution unit executes a semantic analysis process on one or more character information including the translated character information in which one character information is converted into the translation destination language which is the translation destination of the target language. In this way, the information processing device calculates the accuracy index value for each of one or more character information including the character information converted into the translation destination language, thereby performing semantic analysis between the languages including the translation destination language. It is possible to compare the accuracy.
 また、実行部は、対象言語の一の文字情報を対象言語の別の表現に言い換えたパラフレーズを含む1以上の文字情報に対して意味解析処理を実行する。このように、情報処理装置は、対象言語の文字情報の別の表現に言い換えたパラフレーズを含む1以上の文字情報の各々を対象として、精度指標値を算出することで、ユーザの発話を複数の表現に展開した上で、その表現の各々の精度の比較を行うことができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 In addition, the execution unit executes a semantic analysis process on one or more character information including a paraphrase in which one character information of the target language is paraphrased into another expression of the target language. In this way, the information processing device calculates the accuracy index value for each of one or more character information including the paraphrase paraphrased into another expression of the character information of the target language, so that the user can make a plurality of utterances. After expanding to the expression of, it is possible to compare the accuracy of each of the expressions. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、実行部は、対象言語のパラフレーズが対象言語の翻訳先となる翻訳先言語に変換された翻訳パラフレーズを含む1以上の文字情報に対して意味解析処理を実行する。このように、情報処理装置は、対象言語の文字情報の別の表現に言い換えたパラフレーズが翻訳先言語に変換された文字情報を含む1以上の文字情報の各々を対象として、精度指標値を算出することで、ユーザの発話を複数の表現に展開し、かつその各表現を翻訳した上で、その表現の各々の精度の比較を行うことができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 In addition, the execution unit executes a semantic analysis process on one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to which the target language is translated. In this way, the information processing device sets the accuracy index value for each of one or more character information including the character information in which the paraphrase paraphrased into another expression of the character information of the target language is converted into the translation destination language. By calculating, it is possible to expand the user's utterance into a plurality of expressions, translate each expression, and then compare the accuracy of each expression. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、本開示に係る情報処理装置は、変換部(実施形態では変換部132)を備える。変換部は、対象言語に対応する文字情報を、対象言語の翻訳先となる翻訳先言語に対応する文字情報に変換する。実行部は、変換部により変換された文字情報に対して意味解析処理を実行する。このように、情報処理装置は、対象言語に対応する文字情報を、対象言語の翻訳先となる翻訳先言語に対応する文字情報に変換し、意味解析の処理を行うことで、各翻訳先言語に対応する精度指標値を算出することができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 Further, the information processing device according to the present disclosure includes a conversion unit (conversion unit 132 in the embodiment). The conversion unit converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language. The execution unit executes a semantic analysis process on the character information converted by the conversion unit. In this way, the information processing device converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language, and performs semantic analysis processing to perform the semantic analysis of each translation destination language. The accuracy index value corresponding to can be calculated. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、変換部は、対象言語に対応する文字情報を、意味解釈可能な言語に対応する文字情報に変換する。このように、情報処理装置は、対象言語に対応する文字情報を、意味解釈可能な言語(特定言語)に対応する文字情報に変換し、意味解析の処理を行うことで、各翻訳先言語に対応する精度指標値を算出することができる。したがって、情報処理装置は、言語間における意味解析精度の比較を可能にすることができる。 In addition, the conversion unit converts the character information corresponding to the target language into the character information corresponding to the language whose meaning can be interpreted. In this way, the information processing device converts the character information corresponding to the target language into the character information corresponding to the language that can interpret the meaning (specific language), and performs the semantic analysis process to convert the character information into each translation destination language. The corresponding accuracy index value can be calculated. Therefore, the information processing device can make it possible to compare the accuracy of semantic analysis between languages.
 また、本開示に係る情報処理装置は、選択部(実施形態では選択部135)を備える。選択部は、算出部により算出された1以上の文字情報の各々の精度指標値に基づいて、1以上の文字情報のうち、処理に用いる文字情報である処理対象文字情報を選択する。このように、情報処理装置は、精度指標値に基づいて、1以上の文字情報のうち、処理に用いる文字情報である処理対象文字情報を選択することで、処理に用いる文字情報を適切に選択することができる。したがって、情報処理装置は、言語間における意味解析精度の比較し、その比較結果を基に適切に処理を行うことができる。 Further, the information processing apparatus according to the present disclosure includes a selection unit (selection unit 135 in the embodiment). The selection unit selects the processing target character information, which is the character information used for processing, from the one or more character information based on each accuracy index value of the one or more character information calculated by the calculation unit. In this way, the information processing apparatus appropriately selects the character information used for processing by selecting the processing target character information which is the character information used for processing from among one or more character information based on the accuracy index value. can do. Therefore, the information processing apparatus can compare the accuracy of semantic analysis between languages and appropriately perform processing based on the comparison result.
 また、選択部は、精度指標値が最大である文字情報を処理対象文字情報として選択する。このように、情報処理装置は、精度指標値が最大である文字情報を処理対象文字情報として選択することで、精度が最大の文字情報を用いて処理を行うことができる。したがって、情報処理装置は、言語間における意味解析精度の比較し、その比較結果を基に適切に処理を行うことができる。 In addition, the selection unit selects the character information having the maximum accuracy index value as the character information to be processed. As described above, the information processing apparatus can perform processing using the character information having the maximum accuracy by selecting the character information having the maximum accuracy index value as the character information to be processed. Therefore, the information processing apparatus can compare the accuracy of semantic analysis between languages and appropriately perform processing based on the comparison result.
 また、算出部は、対象言語の文字情報に対応する精度指標値を算出する。選択部は、対象言語の文字情報に対応する精度指標値が所定値以上である場合、対象言語の文字情報を処理対象文字情報として選択する。このように、情報処理装置は、対象言語の文字情報に対応する精度指標値が所定値以上である場合、対象言語の文字情報を処理対象文字情報として選択することで、対象言語の精度が高い場合にその言語を用いて処理を行うことができる。したがって、情報処理装置は、対象言語を用いて適切に処理を行うことができる。 In addition, the calculation unit calculates the accuracy index value corresponding to the character information of the target language. When the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the selection unit selects the character information of the target language as the processing target character information. In this way, when the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the information processing apparatus selects the character information of the target language as the processing target character information, so that the accuracy of the target language is high. In some cases, the language can be used for processing. Therefore, the information processing apparatus can appropriately perform processing using the target language.
 また、実行部は、対象言語に対応する精度指標値が所定値以上である場合、対象言語以外の言語の意味解析処理を実行しない。このように、情報処理装置は、対象言語に対応する精度指標値が所定値以上である場合、対象言語以外の言語の意味解析処理を実行しないことで、処理負荷の増大を抑制することができる。したがって、情報処理装置は、処理負荷の増大を抑制しつつ、適切に処理を行うことができる。 Further, when the accuracy index value corresponding to the target language is equal to or higher than the predetermined value, the execution unit does not execute the semantic analysis process of a language other than the target language. As described above, when the accuracy index value corresponding to the target language is equal to or higher than the predetermined value, the information processing apparatus can suppress an increase in the processing load by not executing the semantic analysis process of a language other than the target language. .. Therefore, the information processing apparatus can appropriately perform processing while suppressing an increase in processing load.
 また、本開示に係る情報処理装置は、逆変換部(実施形態では逆変換部136)を備える。逆変換部は、処理対象文字情報の言語に対応する意味解析処理の結果を対象言語に変換する。このように、情報処理装置は、処理対象文字情報の言語に対応する意味解析処理の結果を対象言語に変換することで、対象言語に対応する意味解析の情報を得ることができる。したがって、情報処理装置は、対象言語を用いて適切に処理を行うことができる。 Further, the information processing apparatus according to the present disclosure includes an inverse conversion unit (in the embodiment, an inverse conversion unit 136). The inverse transformation unit converts the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language. In this way, the information processing apparatus can obtain the information of the semantic analysis corresponding to the target language by converting the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
 また、逆変換部は、処理対象文字情報の言語が対象言語以外である場合、意味解析処理の結果を対象言語に変換する。このように、情報処理装置は、処理対象文字情報の言語が対象言語以外である場合、意味解析処理の結果を対象言語に変換することで、対象言語以外で意味解析された結果を、対象言語で利用可能にした情報を得ることができる。したがって、情報処理装置は、対象言語を用いて適切に処理を行うことができる。 In addition, the inverse conversion unit converts the result of the semantic analysis process into the target language when the language of the processing target character information is other than the target language. In this way, when the language of the character information to be processed is other than the target language, the information processing device converts the result of the semantic analysis process into the target language, and the result of the semantic analysis other than the target language is converted into the target language. You can get the information available in. Therefore, the information processing apparatus can appropriately perform processing using the target language.
 また、逆変換部は、意味解析処理の結果のうち一部を対象言語に変換する。このように、情報処理装置は、意味解析処理の結果のうち一部を対象言語に変換することで、必要な情報のみを対象言語に変換することで、対象言語以外で意味解析された結果を、対象言語で利用可能にした情報を得ることができる。したがって、情報処理装置は、対象言語を用いて適切に処理を行うことができる。 In addition, the inverse transformation unit converts a part of the result of the semantic analysis process into the target language. In this way, the information processing device converts a part of the result of the semantic analysis process into the target language, and by converting only the necessary information into the target language, the result of the semantic analysis other than the target language is obtained. , You can get the information available in the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
 また、逆変換部は、意味解析処理の結果のうちスロット値を対象言語に変換する。このように、情報処理装置は、意味解析処理の結果のうちスロット値を対象言語に変換することで、例えばサービスの実行などに必要な情報(スロット値)のみを対象言語に変換することで、対象言語以外で意味解析された結果を、対象言語で利用可能にした情報を得ることができる。したがって、情報処理装置は、対象言語を用いて適切に処理を行うことができる。 In addition, the inverse conversion unit converts the slot value of the result of the semantic analysis process into the target language. In this way, the information processing device converts the slot value of the result of the semantic analysis process into the target language, thereby converting only the information (slot value) necessary for executing the service, for example, into the target language. It is possible to obtain information that makes the results of semantic analysis in a language other than the target language available in the target language. Therefore, the information processing apparatus can appropriately perform processing using the target language.
 また、実行部は、対象言語が言語識別可能である場合、対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する。このように、情報処理装置は、対象言語が言語識別可能である場合、意味解析処理を実行することで、言語識別可能な言語に対して適切に処理を行うことができる。また、情報処理装置は、言語識別可能ではない言語に対して処理を行い、適切ではない処理を行なったり、不適切な結果を出力したりすることを抑制することができる。 Further, when the target language is language identifiable, the execution unit executes a semantic analysis process on one or more character information corresponding to each of the one or more languages including the target language. As described above, when the target language is language-identifiable, the information processing apparatus can appropriately perform processing on the language-identifiable language by executing the semantic analysis process. In addition, the information processing device can perform processing on a language that is not language-identifiable, and can suppress improper processing or output of inappropriate results.
[4.ハードウェア構成]
 上述してきた各実施形態や変形例に係る情報処理装置100や端末装置10等の情報機器は、例えば図20に示すような構成のコンピュータ1000によって実現される。図20は、情報処理装置100や端末装置10等の情報処理装置の機能を実現するコンピュータ1000の一例を示すハードウェア構成図である。以下、実施形態に係る情報処理装置100を例に挙げて説明する。コンピュータ1000は、CPU1100、RAM1200、ROM(Read Only Memory)1300、HDD(Hard Disk Drive)1400、通信インターフェイス1500、及び入出力インターフェイス1600を有する。コンピュータ1000の各部は、バス1050によって接続される。
[4. Hardware configuration]
Information devices such as the information processing device 100 and the terminal device 10 according to each of the above-described embodiments and modifications are realized by, for example, a computer 1000 having a configuration as shown in FIG. FIG. 20 is a hardware configuration diagram showing an example of a computer 1000 that realizes the functions of information processing devices such as the information processing device 100 and the terminal device 10. Hereinafter, the information processing apparatus 100 according to the embodiment will be described as an example. The computer 1000 includes a CPU 1100, a RAM 1200, a ROM (Read Only Memory) 1300, an HDD (Hard Disk Drive) 1400, a communication interface 1500, and an input / output interface 1600. Each part of the computer 1000 is connected by a bus 1050.
 CPU1100は、ROM1300又はHDD1400に格納されたプログラムに基づいて動作し、各部の制御を行う。例えば、CPU1100は、ROM1300又はHDD1400に格納されたプログラムをRAM1200に展開し、各種プログラムに対応した処理を実行する。 The CPU 1100 operates based on the program stored in the ROM 1300 or the HDD 1400, and controls each part. For example, the CPU 1100 expands the program stored in the ROM 1300 or the HDD 1400 into the RAM 1200 and executes processing corresponding to various programs.
 ROM1300は、コンピュータ1000の起動時にCPU1100によって実行されるBIOS(Basic Input Output System)等のブートプログラムや、コンピュータ1000のハードウェアに依存するプログラム等を格納する。 The ROM 1300 stores a boot program such as a BIOS (Basic Input Output System) executed by the CPU 1100 when the computer 1000 is started, a program that depends on the hardware of the computer 1000, and the like.
 HDD1400は、CPU1100によって実行されるプログラム、及び、かかるプログラムによって使用されるデータ等を非一時的に記録する、コンピュータが読み取り可能な記録媒体である。具体的には、HDD1400は、プログラムデータ1450の一例である本開示に係る情報処理プログラムを記録する記録媒体である。 The HDD 1400 is a computer-readable recording medium that non-temporarily records a program executed by the CPU 1100 and data used by the program. Specifically, the HDD 1400 is a recording medium for recording an information processing program according to the present disclosure, which is an example of program data 1450.
 通信インターフェイス1500は、コンピュータ1000が外部ネットワーク1550(例えばインターネット)と接続するためのインターフェイスである。例えば、CPU1100は、通信インターフェイス1500を介して、他の機器からデータを受信したり、CPU1100が生成したデータを他の機器へ送信したりする。 The communication interface 1500 is an interface for the computer 1000 to connect to an external network 1550 (for example, the Internet). For example, the CPU 1100 receives data from another device or transmits data generated by the CPU 1100 to another device via the communication interface 1500.
 入出力インターフェイス1600は、入出力デバイス1650とコンピュータ1000とを接続するためのインターフェイスである。例えば、CPU1100は、入出力インターフェイス1600を介して、キーボードやマウス等の入力デバイスからデータを受信する。また、CPU1100は、入出力インターフェイス1600を介して、ディスプレイやスピーカーやプリンタ等の出力デバイスにデータを送信する。また、入出力インターフェイス1600は、所定の記録媒体(メディア)に記録されたプログラム等を読み取るメディアインターフェイスとして機能してもよい。メディアとは、例えばDVD(Digital Versatile Disc)、PD(Phase change rewritable Disk)等の光学記録媒体、MO(Magneto-Optical disk)等の光磁気記録媒体、テープ媒体、磁気記録媒体、または半導体メモリ等である。例えば、コンピュータ1000が実施形態に係る情報処理装置100として機能する場合、コンピュータ1000のCPU1100は、RAM1200上にロードされた情報処理プログラムを実行することにより、制御部130等の機能を実現する。また、HDD1400には、本開示に係る情報処理プログラムや、記憶部120内のデータが格納される。なお、CPU1100は、プログラムデータ1450をHDD1400から読み取って実行するが、他の例として、外部ネットワーク1550を介して、他の装置からこれらのプログラムを取得してもよい。 The input / output interface 1600 is an interface for connecting the input / output device 1650 and the computer 1000. For example, the CPU 1100 receives data from an input device such as a keyboard or mouse via the input / output interface 1600. Further, the CPU 1100 transmits data to an output device such as a display, a speaker, or a printer via the input / output interface 1600. Further, the input / output interface 1600 may function as a media interface for reading a program or the like recorded on a predetermined recording medium (media). The media is, for example, an optical recording medium such as DVD (Digital Versatile Disc) or PD (Phase change rewritable Disk), a magneto-optical recording medium such as MO (Magneto-Optical disk), a tape medium, a magnetic recording medium, or a semiconductor memory. Is. For example, when the computer 1000 functions as the information processing device 100 according to the embodiment, the CPU 1100 of the computer 1000 realizes the functions of the control unit 130 and the like by executing the information processing program loaded on the RAM 1200. Further, the information processing program according to the present disclosure and the data in the storage unit 120 are stored in the HDD 1400. The CPU 1100 reads the program data 1450 from the HDD 1400 and executes the program, but as another example, these programs may be acquired from another device via the external network 1550.
 なお、本技術は以下のような構成も取ることができる。
(1)
 ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する実行部と、
 前記1以上の文字情報の各々に対応する前記意味解析処理の結果に基づいて、前記1以上の文字情報の各々に対して、前記意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する算出部と、
 を備える情報処理装置。
(2)
 前記算出部は、
 前記意味解析処理の前記結果の情報を入力として前記精度指標値を出力する関数を用いて、前記1以上の文字情報の各々の前記精度指標値を算出する、
 (1)に記載の情報処理装置。
(3)
 前記算出部は、
 前記意味解析処理の前記結果に含まれるスコアを入力として前記精度指標値を出力する前記関数を用いて、前記1以上の文字情報の各々の前記精度指標値を算出する、
 (2)に記載の情報処理装置。
(4)
 前記算出部は、
 一の言語の文字情報に対する前記意味解析処理の前記スコアと、前記一の言語を示す情報とを入力として前記精度指標値を出力する前記関数を用いて、前記一の言語の文字情報の前記精度指標値を算出する、
 (3)に記載の情報処理装置。
(5)
 前記実行部は、
 前記対象言語による前記ユーザの発話に対応する一の文字情報を含む前記1以上の文字情報に対して前記意味解析処理を実行する、
 (1)~(4)のいずれか1つに記載の情報処理装置。
(6)
 前記実行部は、
 前記一の文字情報が前記対象言語の翻訳先となる翻訳先言語に変換された翻訳文字情報を含む前記1以上の文字情報に対して前記意味解析処理を実行する、
 (5)に記載の情報処理装置。
(7)
 前記実行部は、
 前記対象言語の前記一の文字情報を前記対象言語の別の表現に言い換えたパラフレーズを含む前記1以上の文字情報に対して前記意味解析処理を実行する、
 (5)または(6)に記載の情報処理装置。
(8)
 前記実行部は、
 前記対象言語の前記パラフレーズが前記対象言語の翻訳先となる翻訳先言語に変換された翻訳パラフレーズを含む前記1以上の文字情報に対して前記意味解析処理を実行する、
 (7)に記載の情報処理装置。
(9)
 前記対象言語に対応する文字情報を、前記対象言語の翻訳先となる翻訳先言語に対応する文字情報に変換する変換部、
 をさらに備え、
 前記実行部は、
 前記変換部により変換された文字情報に対して前記意味解析処理を実行する、
 (6)~(8)のいずれか1つに記載の情報処理装置。
(10)
 前記変換部は、
 前記対象言語に対応する文字情報を、意味解釈可能な言語に対応する文字情報に変換する、
 (9)に記載の情報処理装置。
(11)
 前記算出部により算出された前記1以上の文字情報の各々の前記精度指標値に基づいて、前記1以上の文字情報のうち、処理に用いる文字情報である処理対象文字情報を選択する選択部、
 をさらに備える(1)~(10)のいずれか1つに記載の情報処理装置。
(12)
 前記選択部は、
 前記精度指標値が最大である文字情報を前記処理対象文字情報として選択する、
 (11)に記載の情報処理装置。
(13)
 前記算出部は、
 前記対象言語の文字情報に対応する前記精度指標値を算出し、
 前記選択部は、
 前記対象言語の文字情報に対応する前記精度指標値が所定値以上である場合、前記対象言語の文字情報を前記処理対象文字情報として選択する、
 (11)または(12)に記載の情報処理装置。
(14)
 前記実行部は、
 前記対象言語に対応する前記精度指標値が所定値以上である場合、前記対象言語以外の言語の前記意味解析処理を実行しない、
 (13)に記載の情報処理装置。
(15)
 前記処理対象文字情報の言語に対応する前記意味解析処理の結果を前記対象言語に変換する逆変換部、
 をさらに備える(11)~(14)のいずれか1つに記載の情報処理装置。
(16)
 前記逆変換部は、
 前記処理対象文字情報の言語が前記対象言語以外である場合、前記意味解析処理の結果を前記対象言語に変換する、
 (15)に記載の情報処理装置。
(17)
 前記逆変換部は、
 前記意味解析処理の前記結果のうち一部を前記対象言語に変換する、
 (15)または(16)に記載の情報処理装置。
(18)
 前記逆変換部は、
 前記意味解析処理の前記結果のうちスロット値を前記対象言語に変換する、
 (15)~(17)のいずれか1つに記載の情報処理装置。
(19)
 前記実行部は、
 前記対象言語が言語識別可能である場合、前記対象言語を含む前記1以上の言語の各々に対応する前記1以上の文字情報に対して前記意味解析処理を実行する、
 (1)~(18)のいずれか1つに記載の情報処理装置。
(20)
 ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行し、
 前記1以上の文字情報の各々に対応する前記意味解析処理の結果に基づいて、前記1以上の文字情報の各々に対して、前記意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する、
 処理を実行する情報処理方法。
The present technology can also have the following configurations.
(1)
An execution unit that executes semantic analysis processing for one or more character information corresponding to each of one or more languages including a target language that is a language corresponding to a user's utterance.
An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information. The calculation unit that calculates the value and
Information processing device equipped with.
(2)
The calculation unit
Using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
The information processing device according to (1).
(3)
The calculation unit
Using the function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
The information processing device according to (2).
(4)
The calculation unit
Using the function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating the one language, the accuracy of the character information of the one language. Calculate the index value,
The information processing device according to (3).
(5)
The execution unit
The semantic analysis process is executed on the one or more character information including one character information corresponding to the utterance of the user in the target language.
The information processing device according to any one of (1) to (4).
(6)
The execution unit
The semantic analysis process is executed on the one or more character information including the translated character information in which the one character information is converted into the translation destination language to be the translation destination of the target language.
The information processing device according to (5).
(7)
The execution unit
The semantic analysis process is executed on the one or more character information including the paraphrase in which the one character information of the target language is paraphrased into another expression of the target language.
The information processing device according to (5) or (6).
(8)
The execution unit
The semantic analysis process is executed on the one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to be the translation destination of the target language.
The information processing device according to (7).
(9)
A conversion unit that converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language.
With more
The execution unit
The semantic analysis process is executed on the character information converted by the conversion unit.
The information processing device according to any one of (6) to (8).
(10)
The conversion unit
The character information corresponding to the target language is converted into the character information corresponding to the language that can interpret the meaning.
The information processing device according to (9).
(11)
A selection unit that selects processing target character information, which is character information used for processing, from the one or more character information based on the accuracy index value of each of the one or more character information calculated by the calculation unit.
The information processing apparatus according to any one of (1) to (10).
(12)
The selection unit
The character information having the maximum accuracy index value is selected as the processing target character information.
The information processing device according to (11).
(13)
The calculation unit
The accuracy index value corresponding to the character information of the target language is calculated, and the accuracy index value is calculated.
The selection unit
When the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the character information of the target language is selected as the processing target character information.
The information processing device according to (11) or (12).
(14)
The execution unit
When the accuracy index value corresponding to the target language is equal to or higher than a predetermined value, the semantic analysis process of a language other than the target language is not executed.
The information processing device according to (13).
(15)
An inverse conversion unit that converts the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language.
The information processing apparatus according to any one of (11) to (14).
(16)
The inverse conversion unit
When the language of the processing target character information is other than the target language, the result of the semantic analysis processing is converted into the target language.
The information processing device according to (15).
(17)
The inverse conversion unit
A part of the result of the semantic analysis process is converted into the target language.
The information processing apparatus according to (15) or (16).
(18)
The inverse conversion unit
Of the results of the semantic analysis process, the slot value is converted into the target language.
The information processing device according to any one of (15) to (17).
(19)
The execution unit
When the target language is language identifiable, the semantic analysis process is executed for the one or more character information corresponding to each of the one or more languages including the target language.
The information processing device according to any one of (1) to (18).
(20)
Semantic analysis processing is executed for one or more character information corresponding to each of one or more languages including the target language which is the language corresponding to the user's utterance.
An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information. Calculate the value,
An information processing method that executes processing.
 1 情報処理システム
 100 情報処理装置
 110 通信部
 120 記憶部
 121 言語情報記憶部
 122 意味フレーム情報記憶部
 123 解析精度情報記憶部
 124 閾値情報記憶部
 125 知識情報記憶部
 130 制御部
 131 取得部
 132 変換部
 133 実行部
 134 算出部
 135 選択部
 136 逆変換部
 137 生成部
 138 送信部
 10 端末装置
 11 通信部
 12 入力部
 13 出力部
 14 記憶部
 15 制御部
 151 受信部
 152 実行部
 153 受付部
 154 送信部
 16 センサ部
 17 表示部
1 Information processing system 100 Information processing device 110 Communication unit 120 Storage unit 121 Language information storage unit 122 Semantic frame information storage unit 123 Analysis accuracy information storage unit 124 Threshold information storage unit 125 Knowledge information storage unit 130 Control unit 131 Acquisition unit 132 Conversion unit 133 Execution unit 134 Calculation unit 135 Selection unit 136 Inverse conversion unit 137 Generation unit 138 Transmission unit 10 Terminal device 11 Communication unit 12 Input unit 13 Output unit 14 Storage unit 15 Control unit 151 Reception unit 152 Execution unit 153 Reception unit 154 Transmission unit 16 Sensor unit 17 Display unit

Claims (20)

  1.  ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行する実行部と、
     前記1以上の文字情報の各々に対応する前記意味解析処理の結果に基づいて、前記1以上の文字情報の各々に対して、前記意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する算出部と、
     を備える情報処理装置。
    An execution unit that executes semantic analysis processing for one or more character information corresponding to each of one or more languages including a target language that is a language corresponding to a user's utterance.
    An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information. The calculation unit that calculates the value and
    Information processing device equipped with.
  2.  前記算出部は、
     前記意味解析処理の前記結果の情報を入力として前記精度指標値を出力する関数を用いて、前記1以上の文字情報の各々の前記精度指標値を算出する、
     請求項1に記載の情報処理装置。
    The calculation unit
    Using a function that outputs the accuracy index value by inputting the information of the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
    The information processing device according to claim 1.
  3.  前記算出部は、
     前記意味解析処理の前記結果に含まれるスコアを入力として前記精度指標値を出力する前記関数を用いて、前記1以上の文字情報の各々の前記精度指標値を算出する、
     請求項2に記載の情報処理装置。
    The calculation unit
    Using the function that outputs the accuracy index value by inputting the score included in the result of the semantic analysis process, the accuracy index value of each of the one or more character information is calculated.
    The information processing device according to claim 2.
  4.  前記算出部は、
     一の言語の文字情報に対する前記意味解析処理の前記スコアと、前記一の言語を示す情報とを入力として前記精度指標値を出力する前記関数を用いて、前記一の言語の文字情報の前記精度指標値を算出する、
     請求項3に記載の情報処理装置。
    The calculation unit
    Using the function that outputs the accuracy index value by inputting the score of the semantic analysis process for the character information of one language and the information indicating the one language, the accuracy of the character information of the one language. Calculate the index value,
    The information processing device according to claim 3.
  5.  前記実行部は、
     前記対象言語による前記ユーザの発話に対応する一の文字情報を含む前記1以上の文字情報に対して前記意味解析処理を実行する、
     請求項1に記載の情報処理装置。
    The execution unit
    The semantic analysis process is executed on the one or more character information including one character information corresponding to the utterance of the user in the target language.
    The information processing device according to claim 1.
  6.  前記実行部は、
     前記一の文字情報が前記対象言語の翻訳先となる翻訳先言語に変換された翻訳文字情報を含む前記1以上の文字情報に対して前記意味解析処理を実行する、
     請求項5に記載の情報処理装置。
    The execution unit
    The semantic analysis process is executed on the one or more character information including the translated character information in which the one character information is converted into the translation destination language to be the translation destination of the target language.
    The information processing device according to claim 5.
  7.  前記実行部は、
     前記対象言語の前記一の文字情報を前記対象言語の別の表現に言い換えたパラフレーズを含む前記1以上の文字情報に対して前記意味解析処理を実行する、
     請求項5に記載の情報処理装置。
    The execution unit
    The semantic analysis process is executed on the one or more character information including the paraphrase in which the one character information of the target language is paraphrased into another expression of the target language.
    The information processing device according to claim 5.
  8.  前記実行部は、
     前記対象言語の前記パラフレーズが前記対象言語の翻訳先となる翻訳先言語に変換された翻訳パラフレーズを含む前記1以上の文字情報に対して前記意味解析処理を実行する、
     請求項7に記載の情報処理装置。
    The execution unit
    The semantic analysis process is executed on the one or more character information including the translation paraphrase in which the paraphrase of the target language is converted into the translation destination language to be the translation destination of the target language.
    The information processing device according to claim 7.
  9.  前記対象言語に対応する文字情報を、前記対象言語の翻訳先となる翻訳先言語に対応する文字情報に変換する変換部、
     をさらに備え、
     前記実行部は、
     前記変換部により変換された文字情報に対して前記意味解析処理を実行する、
     請求項6に記載の情報処理装置。
    A conversion unit that converts the character information corresponding to the target language into the character information corresponding to the translation destination language to be the translation destination of the target language.
    With more
    The execution unit
    The semantic analysis process is executed on the character information converted by the conversion unit.
    The information processing device according to claim 6.
  10.  前記変換部は、
     前記対象言語に対応する文字情報を、意味解釈可能な言語に対応する文字情報に変換する、
     請求項9に記載の情報処理装置。
    The conversion unit
    The character information corresponding to the target language is converted into the character information corresponding to the language that can interpret the meaning.
    The information processing device according to claim 9.
  11.  前記算出部により算出された前記1以上の文字情報の各々の前記精度指標値に基づいて、前記1以上の文字情報のうち、処理に用いる文字情報である処理対象文字情報を選択する選択部、
     をさらに備える請求項1に記載の情報処理装置。
    A selection unit that selects processing target character information, which is character information used for processing, from the one or more character information based on the accuracy index value of each of the one or more character information calculated by the calculation unit.
    The information processing apparatus according to claim 1.
  12.  前記選択部は、
     前記精度指標値が最大である文字情報を前記処理対象文字情報として選択する、
     請求項11に記載の情報処理装置。
    The selection unit
    The character information having the maximum accuracy index value is selected as the processing target character information.
    The information processing device according to claim 11.
  13.  前記算出部は、
     前記対象言語の文字情報に対応する前記精度指標値を算出し、
     前記選択部は、
     前記対象言語の文字情報に対応する前記精度指標値が所定値以上である場合、前記対象言語の文字情報を前記処理対象文字情報として選択する、
     請求項11に記載の情報処理装置。
    The calculation unit
    The accuracy index value corresponding to the character information of the target language is calculated, and the accuracy index value is calculated.
    The selection unit
    When the accuracy index value corresponding to the character information of the target language is equal to or higher than a predetermined value, the character information of the target language is selected as the processing target character information.
    The information processing device according to claim 11.
  14.  前記実行部は、
     前記対象言語に対応する前記精度指標値が所定値以上である場合、前記対象言語以外の言語の前記意味解析処理を実行しない、
     請求項13に記載の情報処理装置。
    The execution unit
    When the accuracy index value corresponding to the target language is equal to or higher than a predetermined value, the semantic analysis process of a language other than the target language is not executed.
    The information processing device according to claim 13.
  15.  前記処理対象文字情報の言語に対応する前記意味解析処理の結果を前記対象言語に変換する逆変換部、
     をさらに備える請求項11に記載の情報処理装置。
    An inverse conversion unit that converts the result of the semantic analysis process corresponding to the language of the character information to be processed into the target language.
    11. The information processing apparatus according to claim 11.
  16.  前記逆変換部は、
     前記処理対象文字情報の言語が前記対象言語以外である場合、前記意味解析処理の結果を前記対象言語に変換する、
     請求項15に記載の情報処理装置。
    The inverse conversion unit
    When the language of the processing target character information is other than the target language, the result of the semantic analysis processing is converted into the target language.
    The information processing device according to claim 15.
  17.  前記逆変換部は、
     前記意味解析処理の前記結果のうち一部を前記対象言語に変換する、
     請求項15または請求項16に記載の情報処理装置。
    The inverse conversion unit
    A part of the result of the semantic analysis process is converted into the target language.
    The information processing apparatus according to claim 15 or 16.
  18.  前記逆変換部は、
     前記意味解析処理の前記結果のうちスロット値を前記対象言語に変換する、
     請求項15に記載の情報処理装置。
    The inverse conversion unit
    Of the results of the semantic analysis process, the slot value is converted into the target language.
    The information processing device according to claim 15.
  19.  前記実行部は、
     前記対象言語が言語識別可能である場合、前記対象言語を含む前記1以上の言語の各々に対応する前記1以上の文字情報に対して前記意味解析処理を実行する、
     請求項1に記載の情報処理装置。
    The execution unit
    When the target language is language identifiable, the semantic analysis process is executed for the one or more character information corresponding to each of the one or more languages including the target language.
    The information processing device according to claim 1.
  20.  ユーザの発話に対応する言語である対象言語を含む1以上の言語の各々に対応する1以上の文字情報に対して意味解析処理を実行し、
     前記1以上の文字情報の各々に対応する前記意味解析処理の結果に基づいて、前記1以上の文字情報の各々に対して、前記意味解析処理の精度を複数言語間で比較可能にする精度指標値を算出する、
     処理を実行する情報処理方法。
    Semantic analysis processing is executed for one or more character information corresponding to each of one or more languages including the target language which is the language corresponding to the user's utterance.
    An accuracy index that makes it possible to compare the accuracy of the semantic analysis process between a plurality of languages for each of the one or more character information based on the result of the semantic analysis process corresponding to each of the one or more character information. Calculate the value,
    An information processing method that executes processing.
PCT/JP2021/004278 2020-02-14 2021-02-05 Information processing device and information processing method WO2021161908A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2020-023768 2020-02-14
JP2020023768A JP2021128629A (en) 2020-02-14 2020-02-14 Information processing apparatus and information processing method

Publications (1)

Publication Number Publication Date
WO2021161908A1 true WO2021161908A1 (en) 2021-08-19

Family

ID=77292201

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2021/004278 WO2021161908A1 (en) 2020-02-14 2021-02-05 Information processing device and information processing method

Country Status (2)

Country Link
JP (1) JP2021128629A (en)
WO (1) WO2021161908A1 (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021018442A (en) * 2019-07-17 2021-02-15 日本電信電話株式会社 Sentence comparison device, sentence comparison method, and sentence comparison program

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2021018442A (en) * 2019-07-17 2021-02-15 日本電信電話株式会社 Sentence comparison device, sentence comparison method, and sentence comparison program

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
LUO WENTAO; YOSHIHIKO HAYASHI: "Computing Cross-Lingual Semantic Textual Similarities based on Machine Learning", IEICE TECHNICAL REPORT, NLC, vol. 114, no. 81 (NLC2014-16), 7 June 2014 (2014-06-07), JP, pages 85 - 90, XP009530454, ISSN: 0913-5685 *
TAKAAKI TANAKA, MASAAKI NAGATA, YUKI ARASE, MAKOTO ONIZUKA: "3G1-02: Semantic Graph-based Simple Semantic Textual Similarity Metrics", PROCEEDINGS OF THE 32ND NATIONAL CONFERENCE OF THE JAPANESE SOCIETY FOR ARTIFICIAL INTELLIGENCE DVD 2018; JUNE 05, 2018 - JUNE 08, 2018, 8 June 2018 (2018-06-08) - 8 June 2018 (2018-06-08), JP, pages 1 - 4, XP009530453, DOI: 10.11517/pjsai.JSAI2018.0_3G102 *

Also Published As

Publication number Publication date
JP2021128629A (en) 2021-09-02

Similar Documents

Publication Publication Date Title
US9805718B2 (en) Clarifying natural language input using targeted questions
US11049493B2 (en) Spoken dialog device, spoken dialog method, and recording medium
US9548052B2 (en) Ebook interaction using speech recognition
CN105426362A (en) Speech Translation Apparatus And Method
JPH1078952A (en) Voice synthesizing method and device therefor and hypertext control method and controller
US11586689B2 (en) Electronic apparatus and controlling method thereof
CN109543021B (en) Intelligent robot-oriented story data processing method and system
CN109256133A (en) A kind of voice interactive method, device, equipment and storage medium
EP4322029A1 (en) Method and apparatus for generating video corpus, and related device
CN115082602B (en) Method for generating digital person, training method, training device, training equipment and training medium for model
US20180288109A1 (en) Conference support system, conference support method, program for conference support apparatus, and program for terminal
KR20220127361A (en) Video translation method and apparatus, storage medium and electronic device
KR101104777B1 (en) System and Method for generating sign language animation
KR102345625B1 (en) Caption generation method and apparatus for performing the same
US11620328B2 (en) Speech to media translation
JP6676093B2 (en) Interlingual communication support device and system
WO2021161908A1 (en) Information processing device and information processing method
KR20120077794A (en) Method, translation apparatus and terminal for providing meaning of word of chinese language sentence in automatic translation system, and record medium storing program for executing the same
WO2021161856A1 (en) Information processing device and information processing method
CN112951274A (en) Voice similarity determination method and device, and program product
JP2004118720A (en) Translating device, translating method, and translating program
US11798542B1 (en) Systems and methods for integrating voice controls into applications
JP7257010B2 (en) SEARCH SUPPORT SERVER, SEARCH SUPPORT METHOD, AND COMPUTER PROGRAM
JP7297266B2 (en) SEARCH SUPPORT SERVER, SEARCH SUPPORT METHOD, AND COMPUTER PROGRAM
TWI767197B (en) Method and server for providing interactive voice tutorial

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21753763

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21753763

Country of ref document: EP

Kind code of ref document: A1