WO2019053878A1 - Knowledge generation device, knowledge generation method, and computer-readable recording medium - Google Patents

Knowledge generation device, knowledge generation method, and computer-readable recording medium Download PDF

Info

Publication number
WO2019053878A1
WO2019053878A1 PCT/JP2017/033450 JP2017033450W WO2019053878A1 WO 2019053878 A1 WO2019053878 A1 WO 2019053878A1 JP 2017033450 W JP2017033450 W JP 2017033450W WO 2019053878 A1 WO2019053878 A1 WO 2019053878A1
Authority
WO
WIPO (PCT)
Prior art keywords
knowledge
data
input data
learning
appropriate
Prior art date
Application number
PCT/JP2017/033450
Other languages
French (fr)
Japanese (ja)
Inventor
大地 木村
正明 土田
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Priority to JP2019541596A priority Critical patent/JP6828823B2/en
Priority to PCT/JP2017/033450 priority patent/WO2019053878A1/en
Publication of WO2019053878A1 publication Critical patent/WO2019053878A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N5/00Computing arrangements using knowledge-based models
    • G06N5/02Knowledge representation; Symbolic representation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N99/00Subject matter not provided for in other groups of this subclass

Definitions

  • the present invention relates to a knowledge generation apparatus and a knowledge generation method for generating knowledge for use in inference, and further relates to a computer readable recording medium recording a program for realizing these.
  • Hypothetical reasoning is a reasoning method for deriving hypotheses that explains the observed facts (input data) based on existing knowledge, and in recent years, it has become possible to use a computer to dramatically improve processing speed. It has become. According to the hypothesis inference, for example, with respect to an abnormal state occurring in an object, a manufacturing system, a running software program, etc., related facts obtained among hypotheses based on several possibilities leading to the state are obtained. You can get the best explained hypothesis.
  • Patent Document 1 a technique for performing such implication determination is disclosed, for example, in Patent Document 1.
  • a common partial structure is extracted for the first text and the second text to be determined.
  • any one of feature quantities based on dependencies between common partial structures and feature quantities based on dependencies between common partial structures and other structures is extracted.
  • the implication relationship between the first text and the second text is determined based on the extracted feature quantity.
  • the implication relationship between a plurality of sentences is determined by using information representing the structure of the sentences.
  • Patent Document 1 As described above, using the technology disclosed in Patent Document 1, it is possible to automatically create a large number of pieces of knowledge using data determined to be implication. However, there is a problem that the knowledge generated in this way is not all correct. And, if hypothesis inference is performed using incorrect knowledge, wrong inference results will be obtained.
  • An example of the object of the present invention is to provide a knowledge generation device, a knowledge generation method, and a computer readable recording medium which can solve the above problems and suppress the generation of false knowledge.
  • a knowledge generating device in one aspect of the present invention is: Statistically predicts whether input data and previously prepared knowledge data are acquired for generation of knowledge used in inference, and the acquired input data and knowledge generated using the knowledge data are appropriate.
  • a knowledge determination unit that determines using a model;
  • a knowledge generation unit that generates knowledge used in the inference using the input data determined by the knowledge determination unit as appropriate and the knowledge data;
  • the prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data. It is characterized by
  • a knowledge generation method in one aspect of the present invention is: (A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model, (B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
  • the prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data. It is characterized by
  • a computer readable recording medium in one aspect of the present invention is: On the computer (A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model, (B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a); Record the program, including instructions to execute
  • the prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data. It is characterized by
  • FIG. 1 is a block diagram showing a schematic configuration of a knowledge generation apparatus according to an embodiment of the present invention.
  • FIG. 2 is a block diagram specifically showing the configuration of the knowledge generation apparatus according to the embodiment of the present invention.
  • FIGS. 3 (a) and 3 (b) are explanatory diagrams showing learning processing performed by the knowledge generation apparatus in the embodiment of the present invention, and FIG. 3 (a) shows learning processing in the knowledge generation unit. 3 (b) shows a learning process in the model generation unit.
  • FIG. 4 is a flowchart showing an operation at the time of creation of a prediction model of the knowledge generation device in the embodiment of the present invention.
  • FIG. 5 is a flow chart showing an operation at the time of knowledge generation of the knowledge generation device in the embodiment of the present invention.
  • FIG. 6 is a block diagram showing an example of a computer for realizing the knowledge generation device in the embodiment of the present invention.
  • FIGS. 1 to 6 Embodiment Hereinafter, a knowledge generation apparatus, a knowledge generation method, and a computer readable recording medium according to an embodiment of the present invention will be described with reference to FIGS. 1 to 6.
  • FIG. 1 is a block diagram showing a schematic configuration of a knowledge generation apparatus according to an embodiment of the present invention.
  • the knowledge generation device 100 is a device for generating knowledge used in inference. As shown in FIG. 1, the knowledge generation device 100 includes a knowledge determination unit 10 and a knowledge generation unit 20.
  • the knowledge determination unit 10 acquires input data and knowledge data 40 prepared in advance for generation of knowledge used in inference. Also, the knowledge data 40 is composed of a set of inferences that can be used for inference. Next, the knowledge determination unit 10 determines, using the statistical prediction model 30, whether the knowledge generated using the acquired input data and the knowledge data 40 is appropriate.
  • the prediction model 30 is a model that outputs an appropriate degree of knowledge obtained from both in accordance with a combination of input data and knowledge data. Also, the knowledge generation unit 20 generates the knowledge to be used in the inference by using the input data and the knowledge data 40 determined by the knowledge determination unit 10 to be appropriate.
  • the present embodiment it is determined in advance whether the knowledge to be created is appropriate or not, and the knowledge is generated only when it is determined that the knowledge is appropriate, and hypothesis inference is performed. For this reason, according to the present embodiment, it is possible to suppress the generation of the wrong knowledge in the generation of the knowledge used for the hypothesis inference.
  • FIG. 2 is a block diagram specifically showing the configuration of the knowledge generation apparatus according to the embodiment of the present invention.
  • the knowledge generation apparatus 100 in addition to the knowledge determination unit 10 and the knowledge generation unit 20 described above, the knowledge generation apparatus 100 generates a model generation unit 50 that generates a prediction model 30, and input data And a data extraction unit 60 for extracting data used to generate knowledge from the above.
  • input data for example, text data making up an electronic mail, a blog, writing on an electronic bulletin board, news and the like can be mentioned.
  • the model generation unit 50 applies a combination of the knowledge generated by the knowledge generation unit 10, the input data for learning, and the knowledge for learning. And the knowledge of the correct answer given is obtained as learning data. Then, the model generation unit 50 performs a learning process using the acquired learning data to generate the prediction model 30.
  • the knowledge determination unit 10 applies the input data and the knowledge data 40 to the prediction model 30, and extracts the data extracted by the data extraction unit 60 (hereinafter referred to as “extracted data”) and the knowledge data 40. It is determined whether the knowledge generated by the knowledge generation unit 20 is appropriate using
  • the knowledge generation unit 20 generates a learning model (not shown in FIG. 2) for knowledge generation in advance, and generates extraction data and knowledge data that the knowledge determination unit 10 has determined to be appropriate. Apply it to the learned model to generate the knowledge used in inference.
  • FIGS. 3 (a) and 3 (b) are explanatory diagrams showing learning processing performed by the knowledge generation apparatus in the embodiment of the present invention, and FIG. 3 (a) shows learning processing in the knowledge generation unit. 3 (b) shows a learning process in the model generation unit.
  • the knowledge generation unit 20 generates knowledge by implication determination using a learning model. Therefore, the data extraction unit 60 performs predicate term structure analysis on the sentences included in the input data so as to enable implication determination, and extracts, for example, a subject, a predicate, and an object word in the input data.
  • the data extraction unit 60 extracts text data (extraction data) of a subject, a predicate, and an object from input data to be learning data.
  • the data is extracted and input to the knowledge generation unit 20.
  • the knowledge generation unit 20 determines whether the knowledge implies the extraction data and determines that the knowledge is implied. Generates new knowledge from the extracted data and the corresponding knowledge. Then, the knowledge generation unit 20 compares the generated knowledge with the correct knowledge (teacher data) prepared in advance, learns the comparison result (correct or incorrect answer), and creates a learning model.
  • the model generation unit 50 in the learning process of the model generation unit 50, acquires the entire input data to be learning data. That is, the model generation unit 50 acquires not only a subject, a predicate, an object in input data, but also a modifier.
  • the same input data is also input to the data extraction unit 60, and the data extraction unit 60 extracts a subject, a predicate, and an object from the input data, and passes them to the knowledge generation unit 20 as extraction data. .
  • the knowledge generation unit 20 applies the extraction data and the knowledge data 40 to a learning model to generate knowledge. After that, the generated knowledge is given a label indicating whether or not it is the correct solution based on the correct solution knowledge prepared in advance by an external system or the like.
  • the model generation unit 50 predicts, using the input data and the knowledge data 40, whether or not the label of the knowledge generated by the knowledge generation unit 20 from the same input data is correct. Then, the model generation unit 50 compares the prediction result with the label of the generated knowledge, and learns the comparison result (the prediction is correct or incorrect) to create the prediction model 30. Thereafter, the created prediction model 30 is used for determination by the knowledge determination unit 10.
  • FIGS. 4 and 5 the operation of the knowledge generation apparatus according to the embodiment of the present invention will be described using FIGS. 4 and 5.
  • FIG. 1 to 3 will be referred to as appropriate.
  • the knowledge generation method is implemented by operating the knowledge generation apparatus 100. Therefore, the explanation of the knowledge generation method in the present embodiment is replaced with the following explanation of the operation of the knowledge generation apparatus 100.
  • FIG. 4 is a flowchart showing an operation at the time of creation of a prediction model of the knowledge generation device in the embodiment of the present invention.
  • learning processing is executed in the knowledge generation unit 20 (see FIG. 3A), and a learning model has already been created.
  • the model generation unit 50 acquires the entire input data to be learning data (step A1).
  • the model generation unit 50 predicts, using the input data and the knowledge data 40, whether the label of the knowledge generated by the knowledge generation unit 20 from the same input data is a correct answer (step A2).
  • the same input data is also input to the data extraction unit 60 in synchronization with steps A1 and A2, and the data extraction unit 60 extracts a subject, a predicate, and an object from the input data, and uses these as extraction data. Pass it to the knowledge generation unit 20. Then, the knowledge generation unit 20 applies the extraction data and the knowledge data 40 to a learning model to generate knowledge. Thereafter, a label indicating whether or not the correct answer is given is added to the generated knowledge based on the correct answer knowledge prepared in advance by an external system or the like (see FIG. 3B).
  • step A2 the model generation unit 50 compares and contrasts the prediction result (correct or incorrect answer) of step A2 with the label (correct or incorrect answer) given to the knowledge generated by the knowledge generation unit 20.
  • the result is learned to create a prediction model 30 (step A3).
  • the prediction model 30 created in step A3 is used for the determination by the knowledge determination unit 10.
  • FIG. 5 is a flow chart showing an operation at the time of knowledge generation of the knowledge generation device in the embodiment of the present invention.
  • the data extraction unit 60 acquires input data, and inputs the acquired input data to the knowledge determination unit 10 (step B1).
  • the data extraction unit 60 extracts data used to generate knowledge from the acquired input data, and inputs the extracted data (extracted data) to the knowledge generation unit 20 (step B2). Specifically, for example, when the input data is an e-mail, the data extraction unit 60 executes a predicate term structure analysis for each sentence constituting the e-mail, and forms a subject of each sentence, The predicate and the object are extracted, and these are input to the knowledge generation unit 20.
  • the knowledge determination unit 10 applies the input data and knowledge data 40 acquired in step B1 to the prediction model 30, and uses the extraction data and knowledge data 40 extracted in step B2 to generate the knowledge generation unit 20. It is determined whether the knowledge generated at step S is proper (step B3).
  • step B5 is executed.
  • the knowledge generation unit 20 determines that the extraction data extracted in step B2 and the knowledge data 40 are learning models (FIGS. 3A and 3B) ) (See B.4) to generate knowledge to be used in inference.
  • the generated knowledge is used, for example, for inference performed in another system.
  • step B5 the knowledge determination unit 10 determines whether or not step B3 is performed on all the texts included in the input data acquired in step B1. As a result of the determination in step B5, when step B3 is not executed for all the text, step B3 is executed again. On the other hand, as a result of the determination in step B5, when step B3 is executed for all the texts, the processing in the knowledge generation device 100 ends.
  • steps B1 to B5 shown in FIG. 5 will be described using a specific example.
  • the input data acquired in step B1 is an e-mail, and that the sentence “We will unify the fare as the assumption” is included.
  • the data extraction unit 60 performs a predicate term structure analysis on "assuming that we have unified the fare as an assumption," and obtains the following result.
  • Predicate term structure We unify fares (to cases) into (to cases) (predicates) (In addition, there is no "case” in this sentence.)
  • the knowledge generation unit 20 normally calculates “unify the fare” from the knowledge and the extraction data. It is judged that it is implication to "adjust the price", and the following knowledge is newly generated. Knowledge: Unify fares ⁇ adjust prices
  • the knowledge determination unit 10 uses the entire input data, that is, a portion that is not extracted as extraction data to determine whether the generated knowledge is appropriate.
  • the data extraction unit 60 applies not only the extraction data but also the word “assumption” to the prediction model.
  • the prediction model outputs a low value (for example, 10% or less) as the appropriate degree when the word “assumption” is included.
  • the knowledge determination unit 10 determines that the knowledge generated by the knowledge generation unit 20 from the above-described input data is not appropriate.
  • the knowledge generation unit generates knowledge from input data only when it is determined that the data is appropriate.
  • the knowledge determination unit 10 makes the determination using the entire input data, the context is reflected in the knowledge generation by the knowledge generation unit 20, and the validity of the knowledge is enhanced.
  • the accuracy in the knowledge generation unit 20 is not very high, that is, even when the learning model is simple, it is possible to generate appropriate knowledge.
  • the program in the present embodiment may be a program that causes a computer to execute steps A1 to A3 shown in FIG.
  • the processor of the computer functions as the knowledge determination unit 10, the knowledge generation unit 20, the model generation unit 50, and the data extraction unit 60 to perform processing.
  • the prediction model 30 and the knowledge data 40 are realized by storing data files that configure these in a storage device such as a hard disk provided in a computer.
  • the program in the present embodiment may be executed by a computer system constructed by a plurality of computers.
  • each computer may function as any of the knowledge determination unit 10, the knowledge generation unit 20, the model generation unit 50, and the data extraction unit 60.
  • the prediction model 30 and the knowledge data 40 may be stored in a storage device of a computer different from the computer that executes the program in the present embodiment.
  • FIG. 6 is a block diagram showing an example of a computer for realizing the knowledge generation device in the embodiment of the present invention.
  • the computer 110 includes a CPU 111, a main memory 112, a storage device 113, an input interface 114, a display controller 115, a data reader / writer 116, and a communication interface 117. These units are communicably connected to each other via a bus 121.
  • the CPU 111 develops the program (code) in the present embodiment stored in the storage device 113 in the main memory 112 and executes various operations by executing these in a predetermined order.
  • the main memory 112 is typically a volatile storage device such as a dynamic random access memory (DRAM).
  • DRAM dynamic random access memory
  • the program in the present embodiment is provided in the state of being stored in computer readable recording medium 120.
  • the program in the present embodiment may be distributed on the Internet connected via communication interface 117.
  • the storage device 113 besides a hard disk drive, a semiconductor storage device such as a flash memory may be mentioned.
  • the input interface 114 mediates data transmission between the CPU 111 and an input device 118 such as a keyboard and a mouse.
  • the display controller 115 is connected to the display device 119 and controls the display on the display device 119.
  • the data reader / writer 116 mediates data transmission between the CPU 111 and the recording medium 120, and executes reading of a program from the recording medium 120 and writing of the processing result in the computer 110 to the recording medium 120.
  • the communication interface 117 mediates data transmission between the CPU 111 and another computer.
  • the recording medium 120 include general-purpose semiconductor storage devices such as CF (Compact Flash (registered trademark)) and SD (Secure Digital), magnetic recording media such as flexible disk (Flexible Disk), or CD- An optical recording medium such as a ROM (Compact Disk Read Only Memory) may be mentioned.
  • CF Compact Flash
  • SD Secure Digital
  • magnetic recording media such as flexible disk (Flexible Disk)
  • CD- An optical recording medium such as a ROM (Compact Disk Read Only Memory) may be mentioned.
  • the knowledge generation device 100 in the present embodiment can also be realized by using hardware corresponding to each unit, not a computer on which a program is installed. Furthermore, the knowledge generation device 100 may be partially realized by a program, and the remaining portion may be realized by hardware.
  • a knowledge determination unit that determines using a model
  • a knowledge generation unit that generates knowledge used in the inference using the input data determined by the knowledge determination unit as appropriate and the knowledge data
  • the prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
  • a knowledge generation apparatus characterized in that.
  • the apparatus further comprises a model generation unit that generates the prediction model.
  • the model generation unit is a combination of knowledge generated by the knowledge generation unit, input data for learning, and knowledge for learning when using a combination of input data for learning and knowledge data for learning.
  • the learning process is performed using the knowledge of the correct answer given to the symbol as learning data to generate the prediction model.
  • the knowledge generation device according to appendix 1.
  • the knowledge generation unit (Supplementary Note 3) It further comprises a data extraction unit for extracting data used for generating knowledge from the input data, The knowledge determination unit determines, using the prediction model, whether the data extracted by the data extraction unit and the knowledge generated using the knowledge data are appropriate. The knowledge generation unit generates knowledge used in the inference by using the data extracted by the data extraction unit and the knowledge data, which the knowledge determination unit has determined to be appropriate.
  • the knowledge generation device according to appendix 1 or 2.
  • (Supplementary Note 4) (A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model, (B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
  • the prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
  • a knowledge generation method characterized by
  • (Supplementary Note 6) extracting the data used to generate knowledge from the input data;
  • step (a) it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
  • step (b) using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
  • the knowledge generation method according to Appendix 4 or 5.
  • the program is stored in the computer (C) generating an instruction to further execute the step of generating the prediction model;
  • the knowledge generated by the execution of the step (b) the input data for the learning and the data Using the knowledge of the correct answer given to the combination of knowledge for learning as learning data, the learning process is performed to generate the prediction model.
  • the computer-readable recording medium according to appendix 7.
  • the program is stored in the computer (D) extracting from the input data data to be used for generating knowledge, further including instructions for further executing steps;
  • step (a) it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
  • step (b) using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
  • the computer-readable recording medium according to appendix 7 or 8.
  • the present invention generation of false knowledge can be suppressed in generation of knowledge used for hypothesis inference.
  • the present invention is useful for the efficiency of work to clarify the situation or cause of an accident, a crime, a cyber attack or the like.
  • the present invention is also useful for cause analysis and countermeasure investigation of disasters and system failures.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Physics & Mathematics (AREA)
  • Computing Systems (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A knowledge generation device 100 is provided with: a knowledge determination unit 10 for acquiring input data and knowledge data prepared in advance for generating knowledge to be used for an inference, and determining, using a statistical prediction model 30, whether knowledge generated using the acquired input data and knowledge data is appropriate or not; and a knowledge generation unit 20 for generating knowledge to be used for the inference, using the input data and knowledge data that the knowledge determination unit 10 has determined to be appropriate. The prediction model is a model for outputting the level of appropriateness of the knowledge acquired from the input data and the knowledge data in accordance with a combination of both.

Description

知識生成装置、知識生成方法、及びコンピュータ読み取り可能な記録媒体Knowledge generation apparatus, knowledge generation method, and computer readable recording medium
 本発明は、推論に用いるための知識を生成するための、知識生成装置、及び知識生成方法に関し、更には、これらを実現するためのプログラムを記録したコンピュータ読み取り可能な記録媒体に関する。 The present invention relates to a knowledge generation apparatus and a knowledge generation method for generating knowledge for use in inference, and further relates to a computer readable recording medium recording a program for realizing these.
 仮説推論は、既存の知識に基づいて、観測した事実(入力データ)を説明付ける、仮説を導く推理方法であり、近年においては、処理速度の飛躍的向上により、計算機を用いて行なわれるようになっている。仮説推論によれば、例えば、物体、製造システム、実行中のソフトウェアプログラムなどに生じた異常状態に対して、その状態に至る幾つかの可能性に基づく仮説のうち、得られている関連事実を最もよく説明できる仮説を得ることができる。 Hypothetical reasoning is a reasoning method for deriving hypotheses that explains the observed facts (input data) based on existing knowledge, and in recent years, it has become possible to use a computer to dramatically improve processing speed. It has become. According to the hypothesis inference, for example, with respect to an abnormal state occurring in an object, a manufacturing system, a running software program, etc., related facts obtained among hypotheses based on several possibilities leading to the state are obtained. You can get the best explained hypothesis.
 ところで、仮説推論にて用いられる知識を全て静的なデータベースによって用意しようとすると、データベースには大量の知識を格納しておく必要がある。これは、仮説を導くためには、入力データに対して、コンテキストに依存して解釈する必要があること、及び厳密には同一の入力データが入力されることが殆どないこと、による。 By the way, when it is going to prepare all the knowledge used by hypothesis inference by a static database, it is necessary to store a large amount of knowledge in the database. This is because, in order to derive a hypothesis, it is necessary to interpret the input data depending on the context, and strictly speaking, the same input data is rarely input.
 従って、大量の知識を予め用意することには限界があるため、既存の知識と入力データとに対して含意判定を行ない、含意判定の結果を用いることで、新たな知識を生成する試みがなされている。例えば、入力データが「運賃を統一する」であり、知識が「価格を調整する→カルテル発生」であるとする。この場合において、含意判定によって、前者が後者を含意すると判定されると、「運賃を統一する→価格を調整する」という新たな知識が生成される。 Therefore, there is a limit in preparing a large amount of knowledge in advance, so an implication determination is performed on existing knowledge and input data, and an attempt is made to generate new knowledge by using the result of the implication determination. ing. For example, it is assumed that the input data is "unify the fare" and the knowledge is "adjust the price → carte". In this case, if it is determined by the implication determination that the former implies the latter, new knowledge “unify the fare → adjust the price” is generated.
 また、このような含意判定を行なうための技術は、例えば、特許文献1に開示されている。特許文献1に開示された技術においては、まず、判定対象となる第1のテキストと第2のテキストとについて、共通部分構造が抽出される。続いて、共通部分構造間の依存関係に基づく特徴量、共通部分構造とそれ以外の構造との間の依存関係に基づく特徴量のうちいずれかが抽出される。その後、抽出された特徴量に基づいて、第1のテキストと第2のテキストとの間の含意関係が判定される。特許文献1に開示された技術では、文章の構造を表す情報を用いることで、複数の文章間の含意関係が判定される。 Further, a technique for performing such implication determination is disclosed, for example, in Patent Document 1. In the technology disclosed in Patent Document 1, first, a common partial structure is extracted for the first text and the second text to be determined. Subsequently, any one of feature quantities based on dependencies between common partial structures and feature quantities based on dependencies between common partial structures and other structures is extracted. Thereafter, the implication relationship between the first text and the second text is determined based on the extracted feature quantity. In the technique disclosed in Patent Document 1, the implication relationship between a plurality of sentences is determined by using information representing the structure of the sentences.
国際公開第2015/004155号公報International Publication No. 2015/004155
 上述のように、特許文献1に開示された技術を用いれば、含意と判断されたデータを用いて、自動的に多数の知識を新たに作成することが可能となる。しかしながら、このようにして生成された知識には、全て正しいとは限らないという問題が存在する。そして、正しくない知識を用いて仮説推論が行なわれた場合は、間違った推論結果が得られることになる。 As described above, using the technology disclosed in Patent Document 1, it is possible to automatically create a large number of pieces of knowledge using data determined to be implication. However, there is a problem that the knowledge generated in this way is not all correct. And, if hypothesis inference is performed using incorrect knowledge, wrong inference results will be obtained.
 本発明の目的の一例は、上記問題を解消し、間違った知識の生成を抑制し得る、知識生成装置、知識生成方法、及びコンピュータ読み取り可能な記録媒体を提供することにある。 An example of the object of the present invention is to provide a knowledge generation device, a knowledge generation method, and a computer readable recording medium which can solve the above problems and suppress the generation of false knowledge.
 上記目的を達成するため、本発明の一側面における知識生成装置は、
 推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、知識判定部と、
 前記知識判定部が適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、知識生成部と、を備え、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とする。
In order to achieve the above object, a knowledge generating device in one aspect of the present invention is:
Statistically predicts whether input data and previously prepared knowledge data are acquired for generation of knowledge used in inference, and the acquired input data and knowledge generated using the knowledge data are appropriate. A knowledge determination unit that determines using a model;
A knowledge generation unit that generates knowledge used in the inference using the input data determined by the knowledge determination unit as appropriate and the knowledge data;
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
It is characterized by
 また、上記目的を達成するため、本発明の一側面における知識生成方法は、
(a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
(b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、を有し、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とする。
Further, in order to achieve the above object, a knowledge generation method in one aspect of the present invention is:
(A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
(B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
It is characterized by
 更に、上記目的を達成するため、本発明の一側面におけるコンピュータ読み取り可能な記録媒体は、
コンピュータに、
(a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
(b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、
を実行させる命令を含む、プログラムを記録し、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とする。
Furthermore, in order to achieve the above object, a computer readable recording medium in one aspect of the present invention is:
On the computer
(A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
(B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
Record the program, including instructions to execute
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
It is characterized by
 以上のように本発明によれば、仮説推論に用いる知識の生成において、間違った知識の生成を抑制することができる。 As described above, according to the present invention, generation of false knowledge can be suppressed in generation of knowledge used for hypothesis inference.
図1は、本発明の実施の形態における知識生成装置の概略構成を示すブロック図である。FIG. 1 is a block diagram showing a schematic configuration of a knowledge generation apparatus according to an embodiment of the present invention. 図2は、本発明の実施の形態における知識生成装置の構成を具体的に示すブロック図である。FIG. 2 is a block diagram specifically showing the configuration of the knowledge generation apparatus according to the embodiment of the present invention. 図3(a)及び(b)は、本発明の実施の形態における知識生成装置で行われる学習処理を示す説明図であり、図3(a)は知識生成部での学習処理を示し、図3(b)はモデル生成部での学習処理を示している。FIGS. 3 (a) and 3 (b) are explanatory diagrams showing learning processing performed by the knowledge generation apparatus in the embodiment of the present invention, and FIG. 3 (a) shows learning processing in the knowledge generation unit. 3 (b) shows a learning process in the model generation unit. 図4は、本発明の実施の形態における知識生成装置の予測モデルの作成時の動作を示すフロー図である。FIG. 4 is a flowchart showing an operation at the time of creation of a prediction model of the knowledge generation device in the embodiment of the present invention. 図5は、本発明の実施の形態における知識生成装置の知識生成時の動作を示すフロー図である。FIG. 5 is a flow chart showing an operation at the time of knowledge generation of the knowledge generation device in the embodiment of the present invention. 図6は、本発明の実施の形態における知識生成装置を実現するコンピュータの一例を示すブロック図である。FIG. 6 is a block diagram showing an example of a computer for realizing the knowledge generation device in the embodiment of the present invention.
(実施の形態)
 以下、本発明の実施の形態における、知識生成装置、知識生成方法、及びコンピュータ読み取り可能な記録媒体について、図1~図6を参照しながら説明する。
Embodiment
Hereinafter, a knowledge generation apparatus, a knowledge generation method, and a computer readable recording medium according to an embodiment of the present invention will be described with reference to FIGS. 1 to 6.
[装置構成]
 最初に、本実施の形態における知識生成装置の概略構成について図1を用いて説明する。図1は、本発明の実施の形態における知識生成装置の概略構成を示すブロック図である。
[Device configuration]
First, a schematic configuration of a knowledge generation apparatus according to the present embodiment will be described with reference to FIG. FIG. 1 is a block diagram showing a schematic configuration of a knowledge generation apparatus according to an embodiment of the present invention.
 図1に示す、本実施の形態における知識生成装置100は、推論で用いる知識を生成するための装置である。図1に示すように、知識生成装置100は、知識判定部10と、知識生成部20とを備えている。 The knowledge generation device 100 according to the present embodiment shown in FIG. 1 is a device for generating knowledge used in inference. As shown in FIG. 1, the knowledge generation device 100 includes a knowledge determination unit 10 and a knowledge generation unit 20.
 知識判定部10は、まず、推論で用いる知識の生成のための、入力データ及び予め用意された知識データ40を取得する。また、知識データ40は、推論に用いることができる推論の集合で構成されている。次いで、知識判定部10は、取得した入力データ及び知識データ40を用いて生成される知識が、適正かどうかを、統計的な予測モデル30を用いて判定する。 First, the knowledge determination unit 10 acquires input data and knowledge data 40 prepared in advance for generation of knowledge used in inference. Also, the knowledge data 40 is composed of a set of inferences that can be used for inference. Next, the knowledge determination unit 10 determines, using the statistical prediction model 30, whether the knowledge generated using the acquired input data and the knowledge data 40 is appropriate.
 予測モデル30は、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである。また、知識生成部20は、知識判定部10が適正と判断した入力データ及び知識データ40を用いて、推論で用いる知識を生成する。 The prediction model 30 is a model that outputs an appropriate degree of knowledge obtained from both in accordance with a combination of input data and knowledge data. Also, the knowledge generation unit 20 generates the knowledge to be used in the inference by using the input data and the knowledge data 40 determined by the knowledge determination unit 10 to be appropriate.
 このように、本実施の形態においては、作成される知識について、予め適正かどうかが判定され、適正と判定される場合にのみ、この知識が生成され、仮説推論が行われる。このため、本実施の形態によれば、仮説推論に用いる知識の生成において、間違った知識の生成を抑制することができる。 As described above, in the present embodiment, it is determined in advance whether the knowledge to be created is appropriate or not, and the knowledge is generated only when it is determined that the knowledge is appropriate, and hypothesis inference is performed. For this reason, according to the present embodiment, it is possible to suppress the generation of the wrong knowledge in the generation of the knowledge used for the hypothesis inference.
 続いて、図2を用いて、本実施の形態における知識生成装置100の構成についてより具体的に説明する。図2は、本発明の実施の形態における知識生成装置の構成を具体的に示すブロック図である。 Subsequently, the configuration of the knowledge generation apparatus 100 according to the present embodiment will be more specifically described with reference to FIG. FIG. 2 is a block diagram specifically showing the configuration of the knowledge generation apparatus according to the embodiment of the present invention.
 図2に示すように、本実施の形態においては、知識生成装置100は、上述した、知識判定部10及び知識生成部20に加えて、予測モデル30を生成するモデル生成部50と、入力データから知識の生成に用いるデータを抽出するデータ抽出部60とを更に備えている。入力データとしては、例えば、電子メール、ブログ、電子掲示板の書き込み、ニュース等を構成しているテキストデータが挙げられる。 As shown in FIG. 2, in the present embodiment, in addition to the knowledge determination unit 10 and the knowledge generation unit 20 described above, the knowledge generation apparatus 100 generates a model generation unit 50 that generates a prediction model 30, and input data And a data extraction unit 60 for extracting data used to generate knowledge from the above. As input data, for example, text data making up an electronic mail, a blog, writing on an electronic bulletin board, news and the like can be mentioned.
 モデル生成部50は、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、知識生成部10によって生成された知識と、学習用の入力データ及び学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして取得する。そして、モデル生成部50は、取得した学習データを用いて、学習処理を行なって、予測モデル30を生成する。 When a combination of input data for learning and knowledge data for learning is used, the model generation unit 50 applies a combination of the knowledge generated by the knowledge generation unit 10, the input data for learning, and the knowledge for learning. And the knowledge of the correct answer given is obtained as learning data. Then, the model generation unit 50 performs a learning process using the acquired learning data to generate the prediction model 30.
 知識判定部10は、本実施の形態では、入力データ及び知識データ40を予測モデル30に適用して、データ抽出部60によって抽出されたデータ(以下「抽出データ」と表記する)及び知識データ40を用いて知識生成部20で生成される知識が、適正かどうかを判定する。 In the present embodiment, the knowledge determination unit 10 applies the input data and the knowledge data 40 to the prediction model 30, and extracts the data extracted by the data extraction unit 60 (hereinafter referred to as “extracted data”) and the knowledge data 40. It is determined whether the knowledge generated by the knowledge generation unit 20 is appropriate using
 知識生成部20は、本実施の形態では、予め、知識生成のための学習モデル(図2において図示せず)を生成し、知識判定部10が適正と判断した抽出データ及び知識データを、生成した学習モデルに適用して、推論で用いる知識を生成する。 In the present embodiment, the knowledge generation unit 20 generates a learning model (not shown in FIG. 2) for knowledge generation in advance, and generates extraction data and knowledge data that the knowledge determination unit 10 has determined to be appropriate. Apply it to the learned model to generate the knowledge used in inference.
 ここで、図3を用いて、モデル生成部50による学習処理と、知識生成部20による学習処理とについて具体的に説明する。図3(a)及び(b)は、本発明の実施の形態における知識生成装置で行われる学習処理を示す説明図であり、図3(a)は知識生成部での学習処理を示し、図3(b)はモデル生成部での学習処理を示している。 Here, the learning process by the model generation unit 50 and the learning process by the knowledge generation unit 20 will be specifically described using FIG. 3. FIGS. 3 (a) and 3 (b) are explanatory diagrams showing learning processing performed by the knowledge generation apparatus in the embodiment of the present invention, and FIG. 3 (a) shows learning processing in the knowledge generation unit. 3 (b) shows a learning process in the model generation unit.
 また、本実施の形態では、知識生成部20は、学習モデルを用いた含意判定によって、知識を生成するとする。このため、含意判定が可能となるように、データ抽出部60は、入力データに含まれる文章に対して述語項構造解析を行い、例えば、入力データ中の主語、述語、目的語を抽出する。 Further, in the present embodiment, the knowledge generation unit 20 generates knowledge by implication determination using a learning model. Therefore, the data extraction unit 60 performs predicate term structure analysis on the sentences included in the input data so as to enable implication determination, and extracts, for example, a subject, a predicate, and an object word in the input data.
 図3(a)に示すように、知識生成部20の学習処理においては、まず、データ抽出部60が、学習データとなる入力データから、主語、述語、目的語のテキストデータ(抽出データ)を抽出し、これを知識生成部20に入力する。 As shown in FIG. 3A, in the learning process of the knowledge generation unit 20, first, the data extraction unit 60 extracts text data (extraction data) of a subject, a predicate, and an object from input data to be learning data. The data is extracted and input to the knowledge generation unit 20.
 また、知識生成部20は、抽出データを取得すると、知識データ40を構成している知識毎に、当該知識が、抽出データを含意しているかどうかを判定し、含意していると判定した場合は、抽出データと該当する知識とで、新たな知識を生成する。そして、知識生成部20は、生成した知識と、予め用意されている正解知識(教師データ)とを対比し、対比結果(正解又は不正解)を学習して、学習モデルを作成する。 In addition, when the knowledge generation unit 20 acquires the extraction data, for each of the knowledge constituting the knowledge data 40, the knowledge generation unit 20 determines whether the knowledge implies the extraction data and determines that the knowledge is implied. Generates new knowledge from the extracted data and the corresponding knowledge. Then, the knowledge generation unit 20 compares the generated knowledge with the correct knowledge (teacher data) prepared in advance, learns the comparison result (correct or incorrect answer), and creates a learning model.
 また、図3(b)に示すように、モデル生成部50の学習処理においては、モデル生成部50は、知識生成部20と異なり、学習データとなる入力データの全体を取得する。つまり、モデル生成部50は、入力データ中の主語、述語、目的語だけでなく、修飾語等も取得する。 Further, as shown in FIG. 3B, in the learning process of the model generation unit 50, the model generation unit 50, unlike the knowledge generation unit 20, acquires the entire input data to be learning data. That is, the model generation unit 50 acquires not only a subject, a predicate, an object in input data, but also a modifier.
 また、このとき、同じ入力データがデータ抽出部60にも入力され、データ抽出部60は、入力データから、主語、述語、目的語を抽出し、これらを抽出データとして、知識生成部20に渡す。そして、知識生成部20は、抽出データと知識データ40とを、学習モデルに適用して、知識を生成する。その後、生成された知識には、外部のシステム等によって、予め用意された正解知識に基づいて、正解であるか否かを示すラベルが付与される。 At this time, the same input data is also input to the data extraction unit 60, and the data extraction unit 60 extracts a subject, a predicate, and an object from the input data, and passes them to the knowledge generation unit 20 as extraction data. . Then, the knowledge generation unit 20 applies the extraction data and the knowledge data 40 to a learning model to generate knowledge. After that, the generated knowledge is given a label indicating whether or not it is the correct solution based on the correct solution knowledge prepared in advance by an external system or the like.
 続いて、モデル生成部50は、入力データ及び知識データ40を用いて、同じ入力データから知識生成部20によって生成される知識のラベルが、正解であるかどうかを予測する。そして、モデル生成部50は、予測結果と、生成された知識のラベルとを対比し、対比結果(予測が正解又は不正解)を学習して、予測モデル30を作成する。その後、作成された予測モデル30は、知識判定部10による判定に用いられる。 Subsequently, the model generation unit 50 predicts, using the input data and the knowledge data 40, whether or not the label of the knowledge generated by the knowledge generation unit 20 from the same input data is correct. Then, the model generation unit 50 compares the prediction result with the label of the generated knowledge, and learns the comparison result (the prediction is correct or incorrect) to create the prediction model 30. Thereafter, the created prediction model 30 is used for determination by the knowledge determination unit 10.
[装置動作]
 次に、本発明の実施の形態における知識生成装置の動作について図4及び図5を用いて説明する。以下の説明においては、適宜図1~図3を参酌する。また、本実施の形態では、知識生成装置100を動作させることによって、知識生成方法が実施される。よって、本実施の形態における知識生成方法の説明は、以下の知識生成装置100の動作説明に代える。
[Device operation]
Next, the operation of the knowledge generation apparatus according to the embodiment of the present invention will be described using FIGS. 4 and 5. FIG. In the following description, FIGS. 1 to 3 will be referred to as appropriate. Further, in the present embodiment, the knowledge generation method is implemented by operating the knowledge generation apparatus 100. Therefore, the explanation of the knowledge generation method in the present embodiment is replaced with the following explanation of the operation of the knowledge generation apparatus 100.
 まず、図4を用いて、予測モデルの作成処理、即ち、モデル作成部50による学習処理について説明する。図4は、本発明の実施の形態における知識生成装置の予測モデルの作成時の動作を示すフロー図である。前提として、知識生成部20においては学習処理が実行され(図3(a)参照)、既に学習モデルが作成されているとする。 First, the process of creating a prediction model, that is, the learning process by the model creating unit 50 will be described using FIG. 4. FIG. 4 is a flowchart showing an operation at the time of creation of a prediction model of the knowledge generation device in the embodiment of the present invention. As a premise, it is assumed that learning processing is executed in the knowledge generation unit 20 (see FIG. 3A), and a learning model has already been created.
 図4に示すように、まず、モデル生成部50は、学習データとなる入力データの全体を取得する(ステップA1)。次に、モデル生成部50は、入力データ及び知識データ40を用いて、同じ入力データから知識生成部20によって生成される知識のラベルが、正解であるかどうかを予測する(ステップA2)。 As shown in FIG. 4, first, the model generation unit 50 acquires the entire input data to be learning data (step A1). Next, the model generation unit 50 predicts, using the input data and the knowledge data 40, whether the label of the knowledge generated by the knowledge generation unit 20 from the same input data is a correct answer (step A2).
 また、ステップA1及びA2に同期して、同じ入力データがデータ抽出部60にも入力され、データ抽出部60は、入力データから、主語、述語、目的語を抽出し、これらを抽出データとして、知識生成部20に渡す。そして、知識生成部20は、抽出データと知識データ40とを、学習モデルに適用して、知識を生成する。その後、生成された知識には、外部のシステム等によって、予め用意された正解知識に基づいて、正解であるか否かを示すラベルが付与される(図3(b)参照)。 The same input data is also input to the data extraction unit 60 in synchronization with steps A1 and A2, and the data extraction unit 60 extracts a subject, a predicate, and an object from the input data, and uses these as extraction data. Pass it to the knowledge generation unit 20. Then, the knowledge generation unit 20 applies the extraction data and the knowledge data 40 to a learning model to generate knowledge. Thereafter, a label indicating whether or not the correct answer is given is added to the generated knowledge based on the correct answer knowledge prepared in advance by an external system or the like (see FIG. 3B).
 ステップA2の実行後、モデル生成部50は、ステップA2の予測結果(正解又は不正解)と、知識生成部20が生成した知識に付与されたラベル(正解又は不正解)とを対比し、対比結果を学習して、予測モデル30を作成する(ステップA3)。ステップA3で作成された予測モデル30は、知識判定部10による判定に用いられる。 After execution of step A2, the model generation unit 50 compares and contrasts the prediction result (correct or incorrect answer) of step A2 with the label (correct or incorrect answer) given to the knowledge generated by the knowledge generation unit 20. The result is learned to create a prediction model 30 (step A3). The prediction model 30 created in step A3 is used for the determination by the knowledge determination unit 10.
 続いて、図5を用いて、知識の作成処理について説明する。図5は、本発明の実施の形態における知識生成装置の知識生成時の動作を示すフロー図である。 Subsequently, the process of creating knowledge will be described with reference to FIG. FIG. 5 is a flow chart showing an operation at the time of knowledge generation of the knowledge generation device in the embodiment of the present invention.
 図5に示すように、最初に、データ抽出部60は、入力データを取得し、取得した入力データを知識判定部10に入力する(ステップB1)。 As shown in FIG. 5, first, the data extraction unit 60 acquires input data, and inputs the acquired input data to the knowledge determination unit 10 (step B1).
 続いて、データ抽出部60は、取得した入力データから、知識の生成に用いるデータを抽出し、抽出したデータ(抽出データ)を知識生成部20に入力する(ステップB2)。具体的には、例えば、入力データが電子メールであった場合は、データ抽出部60は、電子メールを構成している文毎に、述語項構造解析を実行し、各文を構成する主語、述語、目的語を抽出し、これらを知識生成部20に入力する。 Subsequently, the data extraction unit 60 extracts data used to generate knowledge from the acquired input data, and inputs the extracted data (extracted data) to the knowledge generation unit 20 (step B2). Specifically, for example, when the input data is an e-mail, the data extraction unit 60 executes a predicate term structure analysis for each sentence constituting the e-mail, and forms a subject of each sentence, The predicate and the object are extracted, and these are input to the knowledge generation unit 20.
 次に、知識判定部10は、ステップB1で取得された入力データと知識データ40とを予測モデル30に適用して、ステップB2で抽出された抽出データ及び知識データ40を用いて知識生成部20で生成される知識が、適正かどうかを判定する(ステップB3)。 Next, the knowledge determination unit 10 applies the input data and knowledge data 40 acquired in step B1 to the prediction model 30, and uses the extraction data and knowledge data 40 extracted in step B2 to generate the knowledge generation unit 20. It is determined whether the knowledge generated at step S is proper (step B3).
 ステップB3の判定の結果、適正でないと判定された場合は、ステップB5が実行される。一方、ステップB3の判定の結果、適正であると判定された場合は、知識生成部20は、ステップB2で抽出された抽出データと知識データ40とを学習モデル(図3(a)及び(b)参照)に適用して、推論で用いる知識を生成する(ステップB4)。生成された知識は、例えば、別のシステムで行われる推論に用いられる。 As a result of the determination in step B3, when it is determined not to be appropriate, step B5 is executed. On the other hand, if it is determined as a result of the determination in step B3 that the knowledge generation unit 20 determines that the extraction data extracted in step B2 and the knowledge data 40 are learning models (FIGS. 3A and 3B) ) (See B.4) to generate knowledge to be used in inference. The generated knowledge is used, for example, for inference performed in another system.
 次に、ステップB5では、知識判定部10は、ステップB1で取得された入力データに含まれる全てのテキストに対して、ステップB3が実行されているかどうかを判定する。ステップB5の判定の結果、全てのテキストに対してステップB3が実行されていない場合は、再度ステップB3を実行する。一方、ステップB5の判定の結果、全てのテキストに対してステップB3が実行されている場合は、知識生成装置100における処理は終了する。 Next, in step B5, the knowledge determination unit 10 determines whether or not step B3 is performed on all the texts included in the input data acquired in step B1. As a result of the determination in step B5, when step B3 is not executed for all the text, step B3 is executed again. On the other hand, as a result of the determination in step B5, when step B3 is executed for all the texts, the processing in the knowledge generation device 100 ends.
 ここで、具体例を用いて、図5に示したステップB1~B5について説明する。まず、ステップB1で取得された入力データが電子メールであり、その中に「仮定の話として、我々が運賃を統一したとしましょう。」という文が含まれていたとする。 Here, steps B1 to B5 shown in FIG. 5 will be described using a specific example. First, it is assumed that the input data acquired in step B1 is an e-mail, and that the sentence “We will unify the fare as the assumption” is included.
 この場合、データ抽出部60は、「仮定の話として、我々が運賃を統一したとしましょう。」に対して述語項構造解析を行い、下記の結果を得る。
 述語項構造:我々(が格)が 運賃(を格)を (に格)に 統一(述語)
(なお、この文には「に格」は存在していない。)
In this case, the data extraction unit 60 performs a predicate term structure analysis on "assuming that we have unified the fare as an assumption," and obtains the following result.
Predicate term structure: We unify fares (to cases) into (to cases) (predicates)
(In addition, there is no "case" in this sentence.)
 また、知識データ40に知識として、「価格を調整する→カルテル発生」が含まれているとすると、知識生成部20は、通常、この知識と抽出データとから、「運賃を統一する」は「価格を調整する」に対して含意であると判断し、下記の知識を新たに生成する。
 知識:運賃を統一する→価格を調整する
In addition, assuming that “adjust price → cartel generation” is included in the knowledge data 40 as knowledge, the knowledge generation unit 20 normally calculates “unify the fare” from the knowledge and the extraction data. It is judged that it is implication to "adjust the price", and the following knowledge is newly generated.
Knowledge: Unify fares → adjust prices
 しかしながら、本実施の形態では、知識判定部10は、入力データ全体、即ち、抽出データとして抽出されていない部分も用いて、生成される知識が適正かどうかを判定する。上述の例であれば、データ抽出部60は、抽出データだけでなく、「仮定」という文言も予測モデルに適用する。 However, in the present embodiment, the knowledge determination unit 10 uses the entire input data, that is, a portion that is not extracted as extraction data to determine whether the generated knowledge is appropriate. In the above-described example, the data extraction unit 60 applies not only the extraction data but also the word “assumption” to the prediction model.
 このとき、予測モデルが、仮定という文言が含まれた場合に、適正度合として低い値(例えば、10%以下)を出力するとする。知識判定部10は、上述の入力データから知識生成部20によって生成される知識は適正でないと判定する。 At this time, it is assumed that the prediction model outputs a low value (for example, 10% or less) as the appropriate degree when the word “assumption” is included. The knowledge determination unit 10 determines that the knowledge generated by the knowledge generation unit 20 from the above-described input data is not appropriate.
 以上のように、本実施の形態では、入力データから作成されると予測される知識が、予め適正かどうかが判定される。そして、適正と判定される場合にのみ、知識生成部は、入力データから知識を生成する。 As described above, in the present embodiment, it is determined in advance whether knowledge predicted to be created from input data is appropriate. Then, the knowledge generation unit generates knowledge from input data only when it is determined that the data is appropriate.
 このため、本実施の形態によれば、仮説推論に用いる知識の生成において、間違った知識の生成を抑制でき、仮説推論の精度の向上を図ることができる。また、知識判定部10は、入力データ全体を用いて判定を行うため、知識生成部20による知識生成においてコンテキストが反映され、知識の妥当性が高まることになる。また、知識生成部20における精度がそれほど高くない場合、即ち、学習モデルが単純な場合であっても、適正な知識の生成が可能となる。 Therefore, according to the present embodiment, it is possible to suppress the generation of the wrong knowledge in the generation of the knowledge used for the hypothesis inference, and to improve the accuracy of the hypothesis inference. Further, since the knowledge determination unit 10 makes the determination using the entire input data, the context is reflected in the knowledge generation by the knowledge generation unit 20, and the validity of the knowledge is enhanced. In addition, even when the accuracy in the knowledge generation unit 20 is not very high, that is, even when the learning model is simple, it is possible to generate appropriate knowledge.
[プログラム]
 本実施の形態におけるプログラムは、コンピュータに、図4に示すステップA1~A3を実行させるプログラムであれば良い。このプログラムをコンピュータにインストールし、実行することによって、本実施の形態における知識生成装置と知識生成方法とを実現することができる。この場合、コンピュータのプロセッサは、知識判定部10、知識生成部20、モデル生成部50及びデータ抽出部60として機能し、処理を行なう。
[program]
The program in the present embodiment may be a program that causes a computer to execute steps A1 to A3 shown in FIG. By installing this program in a computer and executing it, the knowledge generation device and the knowledge generation method in the present embodiment can be realized. In this case, the processor of the computer functions as the knowledge determination unit 10, the knowledge generation unit 20, the model generation unit 50, and the data extraction unit 60 to perform processing.
 また、本実施の形態では、予測モデル30及び知識データ40は、コンピュータに備えられたハードディスク等の記憶装置に、これらを構成するデータファイルを格納することによって実現される。 Further, in the present embodiment, the prediction model 30 and the knowledge data 40 are realized by storing data files that configure these in a storage device such as a hard disk provided in a computer.
 また、本実施の形態におけるプログラムは、複数のコンピュータによって構築されたコンピュータシステムによって実行されても良い。この場合は、例えば、各コンピュータが、それぞれ、知識判定部10、知識生成部20、モデル生成部50及びデータ抽出部60のいずれかとして機能しても良い。また、予測モデル30及び知識データ40は、は、本実施の形態におけるプログラムを実行するコンピュータとは別のコンピュータの記憶装置に格納されていても良い。 Also, the program in the present embodiment may be executed by a computer system constructed by a plurality of computers. In this case, for example, each computer may function as any of the knowledge determination unit 10, the knowledge generation unit 20, the model generation unit 50, and the data extraction unit 60. Further, the prediction model 30 and the knowledge data 40 may be stored in a storage device of a computer different from the computer that executes the program in the present embodiment.
 ここで、本実施の形態におけるプログラムを実行することによって、知識生成装置100を実現するコンピュータについて図6を用いて説明する。図6は、本発明の実施の形態における知識生成装置を実現するコンピュータの一例を示すブロック図である。 Here, a computer for realizing the knowledge generation device 100 by executing the program according to the present embodiment will be described with reference to FIG. FIG. 6 is a block diagram showing an example of a computer for realizing the knowledge generation device in the embodiment of the present invention.
 図6に示すように、コンピュータ110は、CPU111と、メインメモリ112と、記憶装置113と、入力インターフェイス114と、表示コントローラ115と、データリーダ/ライタ116と、通信インターフェイス117とを備える。これらの各部は、バス121を介して、互いにデータ通信可能に接続される。 As shown in FIG. 6, the computer 110 includes a CPU 111, a main memory 112, a storage device 113, an input interface 114, a display controller 115, a data reader / writer 116, and a communication interface 117. These units are communicably connected to each other via a bus 121.
 CPU111は、記憶装置113に格納された、本実施の形態におけるプログラム(コード)をメインメモリ112に展開し、これらを所定順序で実行することにより、各種の演算を実施する。メインメモリ112は、典型的には、DRAM(Dynamic Random Access Memory)等の揮発性の記憶装置である。また、本実施の形態におけるプログラムは、コンピュータ読み取り可能な記録媒体120に格納された状態で提供される。なお、本実施の形態におけるプログラムは、通信インターフェイス117を介して接続されたインターネット上で流通するものであっても良い。 The CPU 111 develops the program (code) in the present embodiment stored in the storage device 113 in the main memory 112 and executes various operations by executing these in a predetermined order. The main memory 112 is typically a volatile storage device such as a dynamic random access memory (DRAM). In addition, the program in the present embodiment is provided in the state of being stored in computer readable recording medium 120. The program in the present embodiment may be distributed on the Internet connected via communication interface 117.
 また、記憶装置113の具体例としては、ハードディスクドライブの他、フラッシュメモリ等の半導体記憶装置が挙げられる。入力インターフェイス114は、CPU111と、キーボード及びマウスといった入力機器118との間のデータ伝送を仲介する。表示コントローラ115は、ディスプレイ装置119と接続され、ディスプレイ装置119での表示を制御する。 Further, as a specific example of the storage device 113, besides a hard disk drive, a semiconductor storage device such as a flash memory may be mentioned. The input interface 114 mediates data transmission between the CPU 111 and an input device 118 such as a keyboard and a mouse. The display controller 115 is connected to the display device 119 and controls the display on the display device 119.
 データリーダ/ライタ116は、CPU111と記録媒体120との間のデータ伝送を仲介し、記録媒体120からのプログラムの読み出し、及びコンピュータ110における処理結果の記録媒体120への書き込みを実行する。通信インターフェイス117は、CPU111と、他のコンピュータとの間のデータ伝送を仲介する。 The data reader / writer 116 mediates data transmission between the CPU 111 and the recording medium 120, and executes reading of a program from the recording medium 120 and writing of the processing result in the computer 110 to the recording medium 120. The communication interface 117 mediates data transmission between the CPU 111 and another computer.
 また、記録媒体120の具体例としては、CF(Compact Flash(登録商標))及びSD(Secure Digital)等の汎用的な半導体記憶デバイス、フレキシブルディスク(Flexible Disk)等の磁気記録媒体、又はCD-ROM(Compact DiskRead Only Memory)などの光学記録媒体が挙げられる。 Further, specific examples of the recording medium 120 include general-purpose semiconductor storage devices such as CF (Compact Flash (registered trademark)) and SD (Secure Digital), magnetic recording media such as flexible disk (Flexible Disk), or CD- An optical recording medium such as a ROM (Compact Disk Read Only Memory) may be mentioned.
 なお、本実施の形態における知識生成装置100は、プログラムがインストールされたコンピュータではなく、各部に対応したハードウェアを用いることによっても実現可能である。更に、知識生成装置100は、一部がプログラムで実現され、残りの部分がハードウェアで実現されていてもよい。 The knowledge generation device 100 in the present embodiment can also be realized by using hardware corresponding to each unit, not a computer on which a program is installed. Furthermore, the knowledge generation device 100 may be partially realized by a program, and the remaining portion may be realized by hardware.
 上述した実施の形態の一部又は全部は、以下に記載する(付記1)~(付記9)によって表現することができるが、以下の記載に限定されるものではない。 A part or all of the embodiment described above can be expressed by (Appendix 1) to (Appendix 9) described below, but is not limited to the following description.
(付記1)
 推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、知識判定部と、
 前記知識判定部が適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、知識生成部と、を備え、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とする知識生成装置。
(Supplementary Note 1)
Statistically predicts whether input data and previously prepared knowledge data are acquired for generation of knowledge used in inference, and the acquired input data and knowledge generated using the knowledge data are appropriate. A knowledge determination unit that determines using a model;
A knowledge generation unit that generates knowledge used in the inference using the input data determined by the knowledge determination unit as appropriate and the knowledge data;
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
A knowledge generation apparatus characterized in that.
(付記2)
 前記予測モデルを生成する、モデル生成部を更に備え、
 前記モデル生成部は、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記知識生成部によって生成された知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
付記1に記載の知識生成装置。
(Supplementary Note 2)
The apparatus further comprises a model generation unit that generates the prediction model.
The model generation unit is a combination of knowledge generated by the knowledge generation unit, input data for learning, and knowledge for learning when using a combination of input data for learning and knowledge data for learning. The learning process is performed using the knowledge of the correct answer given to the symbol as learning data to generate the prediction model.
The knowledge generation device according to appendix 1.
(付記3)
 前記入力データから、知識の生成に用いるデータを抽出する、データ抽出部を更に備え、
 前記知識判定部は、前記データ抽出部によって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
 前記知識生成部は、前記知識判定部が適正と判断した、前記データ抽出部によって抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
付記1または2に記載の知識生成装置。
(Supplementary Note 3)
It further comprises a data extraction unit for extracting data used for generating knowledge from the input data,
The knowledge determination unit determines, using the prediction model, whether the data extracted by the data extraction unit and the knowledge generated using the knowledge data are appropriate.
The knowledge generation unit generates knowledge used in the inference by using the data extracted by the data extraction unit and the knowledge data, which the knowledge determination unit has determined to be appropriate.
The knowledge generation device according to appendix 1 or 2.
(付記4)
(a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
(b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、を有し、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とする知識生成方法。
(Supplementary Note 4)
(A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
(B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
A knowledge generation method characterized by
(付記5)
(c)前記予測モデルを生成する、ステップを更に有し、
 前記(c)のステップにおいて、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記(b)のステップの実行によって生成される知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
付記4に記載の知識生成方法。
(Supplementary Note 5)
(C) generating the prediction model;
When a combination of input data for learning and knowledge data for learning is used in the step (c), the knowledge generated by the execution of the step (b), the input data for the learning and the data Using the knowledge of the correct answer given to the combination of knowledge for learning as learning data, the learning process is performed to generate the prediction model.
The knowledge generation method according to appendix 4.
(付記6)
(d)前記入力データから、知識の生成に用いるデータを抽出する、ステップを更に有し、
 前記(a)のステップにおいて、前記(d)のステップによって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
 前記(b)のステップにおいて、前記(a)のステップで適正と判断した、前記(d)のステップで抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
付記4または5に記載の知識生成方法。
(Supplementary Note 6)
(D) extracting the data used to generate knowledge from the input data;
In the step (a), it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
In the step (b), using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
The knowledge generation method according to Appendix 4 or 5.
(付記7)
コンピュータに、
(a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
(b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、
を実行させる命令を含む、プログラムを記録し、
 前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
ことを特徴とするコンピュータ読み取り可能な記録媒体。
(Appendix 7)
On the computer
(A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
(B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
Record the program, including instructions to execute
The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
A computer readable recording medium characterized in that.
(付記8)
前記プログラムが、前記コンピュータに、
(c)前記予測モデルを生成する、ステップを更に実行させる命令を含み、
 前記(c)のステップにおいて、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記(b)のステップの実行によって生成される知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
付記7に記載のコンピュータ読み取り可能な記録媒体。
(Supplementary Note 8)
The program is stored in the computer
(C) generating an instruction to further execute the step of generating the prediction model;
When a combination of input data for learning and knowledge data for learning is used in the step (c), the knowledge generated by the execution of the step (b), the input data for the learning and the data Using the knowledge of the correct answer given to the combination of knowledge for learning as learning data, the learning process is performed to generate the prediction model.
The computer-readable recording medium according to appendix 7.
(付記9)
前記プログラムが、前記コンピュータに、
(d)前記入力データから、知識の生成に用いるデータを抽出する、ステップを更に実行させる命令を含み、
 前記(a)のステップにおいて、前記(d)のステップによって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
 前記(b)のステップにおいて、前記(a)のステップで適正と判断した、前記(d)のステップで抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
付記7または8に記載のコンピュータ読み取り可能な記録媒体。
(Appendix 9)
The program is stored in the computer
(D) extracting from the input data data to be used for generating knowledge, further including instructions for further executing steps;
In the step (a), it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
In the step (b), using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
The computer-readable recording medium according to appendix 7 or 8.
 以上のように本発明によれば、仮説推論に用いる知識の生成において、間違った知識の生成を抑制することができる。本発明は、事故、犯罪、サイバー攻撃などの状況または原因を明らかにする作業の効率化に有用である。同様に、本発明は、災害及びシステム障害の原因分析と対策検討にも有用である。 As described above, according to the present invention, generation of false knowledge can be suppressed in generation of knowledge used for hypothesis inference. The present invention is useful for the efficiency of work to clarify the situation or cause of an accident, a crime, a cyber attack or the like. Similarly, the present invention is also useful for cause analysis and countermeasure investigation of disasters and system failures.
 10 知識判定部
 20 知識生成部
 30 予測モデル
 40 知識データ
 50 モデル生成部
 60 データ抽出部
 100 知識生成装置
 110 コンピュータ
 111 CPU
 112 メインメモリ
 113 記憶装置
 114 入力インターフェイス
 115 表示コントローラ
 116 データリーダ/ライタ
 117 通信インターフェイス
 118 入力機器
 119 ディスプレイ装置
 120 記録媒体
 121 バス
DESCRIPTION OF REFERENCE NUMERALS 10 knowledge determination unit 20 knowledge generation unit 30 prediction model 40 knowledge data 50 model generation unit 60 data extraction unit 100 knowledge generation device 110 computer 111 CPU
112 main memory 113 storage device 114 input interface 115 display controller 116 data reader / writer 117 communication interface 118 input device 119 display device 120 recording medium 121 bus

Claims (9)

  1.  推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、知識判定部と、
     前記知識判定部が適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、知識生成部と、を備え、
     前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
    ことを特徴とする知識生成装置。
    Statistically predicts whether input data and previously prepared knowledge data are acquired for generation of knowledge used in inference, and the acquired input data and knowledge generated using the knowledge data are appropriate. A knowledge determination unit that determines using a model;
    A knowledge generation unit that generates knowledge used in the inference using the input data determined by the knowledge determination unit as appropriate and the knowledge data;
    The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
    A knowledge generation apparatus characterized in that.
  2.  前記予測モデルを生成する、モデル生成部を更に備え、
     前記モデル生成部は、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記知識生成部によって生成された知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
    請求項1に記載の知識生成装置。
    The apparatus further comprises a model generation unit that generates the prediction model.
    The model generation unit is a combination of knowledge generated by the knowledge generation unit, input data for learning, and knowledge for learning when using a combination of input data for learning and knowledge data for learning. The learning process is performed using the knowledge of the correct answer given to the symbol as learning data to generate the prediction model.
    The knowledge generation device according to claim 1.
  3.  前記入力データから、知識の生成に用いるデータを抽出する、データ抽出部を更に備え、
     前記知識判定部は、前記データ抽出部によって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
     前記知識生成部は、前記知識判定部が適正と判断した、前記データ抽出部によって抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
    請求項1または2に記載の知識生成装置。
    It further comprises a data extraction unit for extracting data used for generating knowledge from the input data,
    The knowledge determination unit determines, using the prediction model, whether the data extracted by the data extraction unit and the knowledge generated using the knowledge data are appropriate.
    The knowledge generation unit generates knowledge used in the inference by using the data extracted by the data extraction unit and the knowledge data, which the knowledge determination unit has determined to be appropriate.
    The knowledge generation device according to claim 1.
  4. (a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
    (b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、を有し、
     前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
    ことを特徴とする知識生成方法。
    (A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
    (B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
    The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
    A knowledge generation method characterized by
  5. (c)前記予測モデルを生成する、ステップを更に有し、
     前記(c)のステップにおいて、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記(b)のステップの実行によって生成される知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
    請求項4に記載の知識生成方法。
    (C) generating the prediction model;
    When a combination of input data for learning and knowledge data for learning is used in the step (c), the knowledge generated by the execution of the step (b), the input data for the learning and the data Using the knowledge of the correct answer given to the combination of knowledge for learning as learning data, the learning process is performed to generate the prediction model.
    The knowledge generation method according to claim 4.
  6. (d)前記入力データから、知識の生成に用いるデータを抽出する、ステップを更に有し、
     前記(a)のステップにおいて、前記(d)のステップによって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
     前記(b)のステップにおいて、前記(a)のステップで適正と判断した、前記(d)のステップで抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
    請求項4または5に記載の知識生成方法。
    (D) extracting the data used to generate knowledge from the input data;
    In the step (a), it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
    In the step (b), using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
    The knowledge generation method according to claim 4 or 5.
  7. コンピュータに、
    (a)推論で用いる知識の生成のための、入力データ及び予め用意された知識データを取得し、取得した前記入力データ及び前記知識データを用いて生成される知識が、適正かどうかを、統計的な予測モデルを用いて判定する、ステップと、
    (b)前記(a)のステップで適正と判断した前記入力データ及び前記知識データを用いて、前記推論で用いる知識を生成する、ステップと、
    を実行させる命令を含む、プログラムを記録し、
     前記予測モデルは、入力データ及び知識データの組み合わせに応じて、両者から得られる知識の適正度合を出力するモデルである、
    ことを特徴とするコンピュータ読み取り可能な記録媒体。
    On the computer
    (A) Acquisition of input data and knowledge data prepared in advance for generation of knowledge used in inference, and statistics as to whether the acquired input data and knowledge generated using the knowledge data are appropriate Determining using a typical prediction model,
    (B) generating knowledge to be used in the inference using the input data and the knowledge data determined to be appropriate in the step (a);
    Record the program, including instructions to execute
    The prediction model is a model that outputs an appropriate degree of knowledge obtained from both according to a combination of input data and knowledge data.
    A computer readable recording medium characterized in that.
  8. 前記プログラムが、前記コンピュータに、
    (c)前記予測モデルを生成する、ステップを更に実行させる命令を含み、
     前記(c)のステップにおいて、学習用の入力データ及び学習用の知識データの組み合せを用いた場合に、前記(b)のステップの実行によって生成される知識と、前記学習用の入力データ及び前記学習用の知識の組み合せに対して与えられている正解の知識とを、学習データとして用いて、学習処理を行なって、前記予測モデルを生成する、
    請求項7に記載のコンピュータ読み取り可能な記録媒体。
    The program is stored in the computer
    (C) generating an instruction to further execute the step of generating the prediction model;
    When a combination of input data for learning and knowledge data for learning is used in the step (c), the knowledge generated by the execution of the step (b), the input data for the learning and the data Using the knowledge of the correct answer given to the combination of knowledge for learning as learning data, the learning process is performed to generate the prediction model.
    The computer readable recording medium according to claim 7.
  9. 前記プログラムが、前記コンピュータに、
    (d)前記入力データから、知識の生成に用いるデータを抽出する、ステップを更に実行させる命令を含み、
     前記(a)のステップにおいて、前記(d)のステップによって抽出されたデータ及び前記知識データを用いて生成される知識が、適正かどうかを、前記予測モデルを用いて判定し、
     前記(b)のステップにおいて、前記(a)のステップで適正と判断した、前記(d)のステップで抽出されたデータ及び前記知識データを用いて、前記推論で用いる知識を生成する、
    請求項7または8に記載のコンピュータ読み取り可能な記録媒体。
    The program is stored in the computer
    (D) extracting from the input data data to be used for generating knowledge, further including instructions for further executing steps;
    In the step (a), it is determined using the prediction model whether the data extracted in the step (d) and the knowledge generated using the knowledge data are appropriate;
    In the step (b), using the data extracted in the step (d) and the knowledge data determined to be appropriate in the step (a), the knowledge used in the inference is generated.
    A computer readable recording medium according to claim 7 or 8.
PCT/JP2017/033450 2017-09-15 2017-09-15 Knowledge generation device, knowledge generation method, and computer-readable recording medium WO2019053878A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2019541596A JP6828823B2 (en) 2017-09-15 2017-09-15 Knowledge generator, knowledge generation method, and program
PCT/JP2017/033450 WO2019053878A1 (en) 2017-09-15 2017-09-15 Knowledge generation device, knowledge generation method, and computer-readable recording medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2017/033450 WO2019053878A1 (en) 2017-09-15 2017-09-15 Knowledge generation device, knowledge generation method, and computer-readable recording medium

Publications (1)

Publication Number Publication Date
WO2019053878A1 true WO2019053878A1 (en) 2019-03-21

Family

ID=65723352

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2017/033450 WO2019053878A1 (en) 2017-09-15 2017-09-15 Knowledge generation device, knowledge generation method, and computer-readable recording medium

Country Status (2)

Country Link
JP (1) JP6828823B2 (en)
WO (1) WO2019053878A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07105159A (en) * 1993-09-30 1995-04-21 Omron Corp Information processor
WO2004036497A1 (en) * 2002-10-18 2004-04-29 Japan Science And Technology Agency Learning/thinking machine and learning/thinking method based on structured knowledge, computer system, and information generation method
WO2016035273A1 (en) * 2014-09-05 2016-03-10 日本電気株式会社 Text processing system, text processing method and storage medium storing computer program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07105159A (en) * 1993-09-30 1995-04-21 Omron Corp Information processor
WO2004036497A1 (en) * 2002-10-18 2004-04-29 Japan Science And Technology Agency Learning/thinking machine and learning/thinking method based on structured knowledge, computer system, and information generation method
WO2016035273A1 (en) * 2014-09-05 2016-03-10 日本電気株式会社 Text processing system, text processing method and storage medium storing computer program

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
OHNISHI YOSHIAKI ET AL.: "Creation and application to .....", PROCEEDINGS OF THE 14TH ANNUAL MEETING OF THE NATURAL LANGUAGE PROCESSING SOCIETY, 17 March 2008 (2008-03-17), pages 1152 - 1155 *

Also Published As

Publication number Publication date
JP6828823B2 (en) 2021-02-10
JPWO2019053878A1 (en) 2020-10-15

Similar Documents

Publication Publication Date Title
CN108089974B (en) Testing applications with defined input formats
CN112016274B (en) Medical text structuring method, device, computer equipment and storage medium
US11176019B2 (en) Automated breakpoint creation
CN112445775B (en) Fault analysis method, device, equipment and storage medium of photoetching machine
EP3640814A1 (en) User-friendly explanation production using generative adversarial networks
US20210157553A1 (en) Source code auto-suggestion based on structural and semantic features
CN110688111A (en) Configuration method, device, server and storage medium of business process
Saberi et al. A passive online technique for learning hybrid automata from input/output traces
US20230186212A1 (en) System, method, electronic device, and storage medium for identifying risk event based on social information
US20200364104A1 (en) Identifying a problem based on log data analysis
US10318650B2 (en) Identifying corrupted text segments
KR102553061B1 (en) Homepage layout providing system using artificial intelligence and the operating method thereof
CN116561298A (en) Title generation method, device, equipment and storage medium based on artificial intelligence
US20230419036A1 (en) Random token segmentation for training next token prediction models
US20160103811A1 (en) Enhanced documentation validation
CN110765069A (en) Data migration method, device and system
CN115687136A (en) Script program processing method, system, computer equipment and medium
KR102363958B1 (en) Method, apparatus and program for analyzing customer perception based on double clustering
JP6828823B2 (en) Knowledge generator, knowledge generation method, and program
US20220405473A1 (en) Machine learning for training nlp agent
US11765193B2 (en) Contextual embeddings for improving static analyzer output
CN112925523A (en) Object comparison method, device, equipment and computer readable medium
CN111679862A (en) Cloud host shutdown method and device, electronic equipment and medium
CN110659501A (en) Vulnerability processing tracking method and device, computer system and readable storage medium
CN111080433A (en) Credit risk assessment method and device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17924945

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 2019541596

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17924945

Country of ref document: EP

Kind code of ref document: A1