WO2021084807A1 - Information-providing system - Google Patents

Information-providing system Download PDF

Info

Publication number
WO2021084807A1
WO2021084807A1 PCT/JP2020/026693 JP2020026693W WO2021084807A1 WO 2021084807 A1 WO2021084807 A1 WO 2021084807A1 JP 2020026693 W JP2020026693 W JP 2020026693W WO 2021084807 A1 WO2021084807 A1 WO 2021084807A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
basic information
basic
target
similarity
Prior art date
Application number
PCT/JP2020/026693
Other languages
French (fr)
Japanese (ja)
Inventor
黒田 聡
Original Assignee
株式会社 情報システムエンジニアリング
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社 情報システムエンジニアリング filed Critical 株式会社 情報システムエンジニアリング
Publication of WO2021084807A1 publication Critical patent/WO2021084807A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/194Calculation of difference between files

Definitions

  • the present invention relates to an information providing system.
  • the information processing device of Patent Document 1 includes a reading means for reading a completed document, a recognition means for recognizing a fixed type and information structure of the document, a storage means for storing the contents written in the document, and a read document. If it is not the latest version, the information structure in the latest version of the pre-registered standard type corresponding to the recognized standard type and the comparison means for comparing the recognized information structure are changed as a result of comparison.
  • a generation means for generating a difference document having an information structure, an output means for outputting the generated difference document, and a document entered in the difference document and a document stored in the storage means when the difference document entered by the user is read.
  • a merging means for merging the entered contents is provided, and the storage means replaces the stored contents with the contents merged by the merging means and stores the contents.
  • Patent Document 1 recognizes a fixed type and an information structure, and compares an old manual which is an old document with a new manual which is a new document having the same attributes. It is a thing. Therefore, there is a problem that the edited part in the manual cannot be easily grasped from the specifications having different attributes.
  • an object of the present invention is to provide an information providing system capable of easily grasping an edited part.
  • the information providing system associates a plurality of basic information in which the basic content is divided into chunk structures with a plurality of target information in which the target content having an attribute different from the basic content is divided into chunk structures.
  • the relevance database stored in the above, the database for calculating the similarity of basic information constructed by machine learning using the plurality of basic information, the basic information acquisition means for acquiring specific basic information, the basic information, and the basic information.
  • the basic information comparison means for comparing the specific basic information and the basic information comparison means do not match the basic information with the specific basic information
  • the basic information similarity calculation database is referred to and the basics are referred to.
  • Basic information indicating the similarity between the information and the specific basic information
  • Basic information similarity calculation means for calculating the similarity, and the first basic information from a plurality of the basic information based on the basic information similarity. It is characterized by comprising a target information extraction means for selecting, referring to the relevance database, and extracting the target information corresponding to the first basic information as the first target information.
  • FIG. 1 is a schematic diagram showing an example of the configuration of the information providing system according to the present embodiment.
  • FIG. 2 is a schematic diagram showing an example of using the information providing system in the present embodiment.
  • FIG. 3 is a schematic diagram showing an example of the relevance database of the information providing system in the present embodiment.
  • FIG. 4 is a schematic diagram showing an example of a database for calculating the basic information similarity of the information providing system according to the present embodiment.
  • FIG. 5 is a schematic diagram showing an example of a database for calculating the target information similarity of the information providing system according to the present embodiment.
  • FIG. 6 is a schematic diagram showing an example of the configuration of the information providing device of the information providing system according to the present embodiment.
  • FIG. 1 is a schematic diagram showing an example of the configuration of the information providing system according to the present embodiment.
  • FIG. 2 is a schematic diagram showing an example of using the information providing system in the present embodiment.
  • FIG. 3 is a schematic diagram showing an example of the relevance database of the information providing system in
  • FIG. 7 is a schematic diagram showing an example of the function of the information providing device of the information providing system according to the present embodiment.
  • FIG. 8 is a flowchart showing an example of the operation of the information providing system according to the present embodiment.
  • FIG. 9 is a schematic diagram showing a second example of the information providing system according to the present embodiment.
  • FIG. 1 is a block diagram showing the overall configuration of the information providing system 100 according to the present embodiment.
  • the information providing system 100 is used by a user such as a manual creator who newly creates a manual for the device based on the specifications of the device, for example.
  • the information providing system 100 includes an information providing device 1.
  • the information providing device 1 may be connected to the user terminal 5 or the server 6 via, for example, the public communication network 7.
  • FIG. 2 is a schematic diagram showing an example using the information providing system 100 in the present embodiment.
  • the information providing device 1 acquires specific basic information x in the basic content X.
  • the information providing device 1 calculates the basic information similarity with respect to the acquired specific basic information x.
  • the information providing device 1 selects the first basic information b1 from a plurality of basic information based on the calculated basic information similarity.
  • the information providing device 1 refers to the relevance database and extracts the target information B1 corresponding to the selected first basic information b1 as the first target information.
  • the target information B1 corresponding to the acquired basic information b1 similar to the specific basic information x is an edited part based on the specific basic information x. Therefore, when editing the manual or the like as the target content from the specifications or the like as the basic content, it is only necessary to edit the target information B1 and the editing work of the target content can be performed in a short time.
  • the information providing device 1 refers to the target information similarity estimation processing database and calculates the target information similarity with respect to the first target information B1.
  • the information providing device 1 extracts the second target information B2 different from the first target information B1 based on the calculated target information similarity.
  • the second target information B2 which is similar to the first target information B1
  • FIG. 3 is a schematic diagram showing an example of the relevance database of the information providing system in the present embodiment.
  • the relevance database corresponds to a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content having attributes different from the basic content related to the basic content is divided into chunk structures. Let me remember.
  • the basic content and the target content include text information, and may further include chart information related to charts.
  • the target content is the content to be edited based on the basic content.
  • the different attributes referred to here mean, for example, that when the basic content is a specification of the device, the target content is information other than the specification of the device, for example, a manual, an advertisement, or FAQ (Frequently) of the device. Asked Questions) etc.
  • the target content may be information created based on the basic content.
  • the target content when the basic content is content in a first language such as English, the target content may be content other than the first language such as Japanese.
  • the basic content is resource data (data such as images, texts, icons, dialogs, menu contents, etc.) referenced by a program executed on the device, the target content is information other than the resource data, for example, Information such as manuals, specifications, messages, and icons related to resource data may be used.
  • Basic information includes text information.
  • the basic information may further include chart information regarding charts.
  • the basic information may include a basic information label consisting of a character string for identifying the basic information.
  • the basic information is, for example, when the basic content is a specification of a device such as a medical device, the basic information is information divided into chunk structures in which this specification is a mass of data.
  • the basic information is information in which basic contents such as specifications are divided into chunk structures such as sentences, chapters, paragraphs, and pages.
  • Basic information is information used to create the target content, in addition to the specifications divided into chunk structures, for example, incident information, various treatises, information that is the source of the target content, etc. divided into chunk structures. It may be.
  • Target information includes text information.
  • the target information may further include chart information related to charts.
  • the target information may include a target information label consisting of a character string for identifying the target information.
  • the target information for example, when the basic content is a specification of a device such as a medical device, the manual as the target content created based on this specification is a mass of data in which meaningful information is collected. Information divided into chunk structures.
  • the target information is information divided into chunk structures such as sentences, chapters, paragraphs, and pages of manuals and the like.
  • the target information may be created in a second language such as Japanese, which is different from the first language.
  • FIG. 4 is a schematic diagram showing an example of a database for calculating the basic information similarity of the information providing system according to the present embodiment.
  • FIG. 5 is a schematic diagram showing an example of a database for calculating the target information similarity of the information providing system according to the present embodiment.
  • the database for calculating the similarity of basic information is constructed by machine learning using the basic information.
  • basic information is vectorized and learned using a learning program as teacher data.
  • the basic information is stored in the basic information similarity calculation database as a parameter in a vectorized state in correspondence with the basic information label in the basic information.
  • the basic information may be stored in the basic information similarity calculation database as a parameter in a vectorized state in correspondence with the basic information.
  • the database for target information similarity estimation processing is constructed by machine learning using the target information.
  • target information is vectorized and learned using a learning program as teacher data.
  • the target information is stored in the target information similarity estimation processing database as a parameter in a vectorized state in correspondence with the target information label in the target information.
  • the target information may be stored in the target information similarity estimation processing database as a parameter in a vectorized state in correspondence with the target information.
  • FIG. 6 is a schematic diagram showing an example of the configuration of the information providing device 1 of the information providing system according to the present embodiment.
  • the information providing device 1 in addition to a personal computer (PC), an electronic device such as a smartphone or a tablet terminal may be used.
  • the information providing device 1 includes a housing 10, a CPU 101, a ROM 102, a RAM 103, a storage unit 104, and I / F 105 to 107. Each configuration 101 to 107 is connected by an internal bus 110.
  • the CPU (Central Processing Unit) 101 controls the entire information providing device 1.
  • the ROM (Read Only Memory) 102 stores the operation code of the CPU 101.
  • the RAM (Random Access Memory) 103 is a work area used during the operation of the CPU 101.
  • the storage unit 104 stores various information such as basic information, target information, basic information similarity calculation database, and target information similarity calculation database. As the storage unit 104, for example, in addition to an HDD (Hard Disk Drive), an SSD (solid state drive) or the like is used.
  • HDD Hard Disk Drive
  • SSD solid state drive
  • the I / F 105 is an interface for transmitting and receiving various information to and from the user terminal 5 and the like via the public communication network 7.
  • the I / F 106 is an interface for transmitting and receiving various information to and from the input portion 108.
  • a keyboard is used as the input portion 108, and a user who uses the information providing system 100 inputs or selects various information or a control command of the information providing device 1 via the input portion 108.
  • the I / F 107 is an interface for transmitting and receiving various information to and from the output portion 109.
  • the output unit 109 outputs various information stored in the storage unit 104, the processing status of the information providing device 1, and the like.
  • a display is used as the output portion 109, and a touch panel type may be used, for example. In this case, the output portion 109 may be configured to include the input portion 108.
  • FIG. 7 is a schematic diagram showing an example of the function of the information providing device 1 of the information providing system according to the present embodiment.
  • the information providing device 1 includes a basic information acquisition unit 31, a basic information comparison unit 32, a basic information similarity calculation unit 33, a target information extraction unit 34, a target information similarity calculation unit 35, and an input unit 15. It includes an output unit 16, a storage unit 17, and a control unit 18.
  • Each function shown in FIG. 7 is realized by the CPU 101 executing a program stored in the storage unit 104 or the like with the RAM 103 as a work area.
  • each function may be controlled by artificial intelligence, for example.
  • the "artificial intelligence" may be based on any well-known artificial intelligence technology.
  • the basic information acquisition unit 31 acquires various types of information such as basic information and specific basic information.
  • the specific basic information is the basic information for which the basic information similarity should be calculated from now on.
  • the basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31. The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
  • the specific basic information in the basic content X acquired by the basic information acquisition unit 31 includes “basic information x", “basic information a1", and “basic information c1".
  • the basic information comparison unit 32 compares the "basic information x", “basic information a1", and “basic information c1" included in the specific basic information with the basic information stored in the relevance database. It is assumed that "basic information a1” and “basic information c1" are stored in the relevance database, and "basic information x" is not stored.
  • the basic information comparison unit 32 determines that the "basic information a1" and “basic information c1" included in the specific basic information match the basic information stored in the relevance database database, and the determination ends. Further, the basic information comparison unit 32 determines that the "basic information x" does not match the basic information stored in the relevance database.
  • the basic information similarity calculation unit 33 refers to the basic information similarity calculation database and stores it in the basic information similarity calculation database.
  • the basic information similarity degree indicating the similarity between the basic information and the specific basic information acquired by the basic information acquisition unit 31 is calculated.
  • the basic information similarity calculation unit 33 calculates the basic information similarity using the feature amount of the basic information. As a feature amount of basic information, for example, basic information may be vectorized and expressed.
  • the basic information similarity calculation unit 33 vectorizes specific basic information and then performs vector calculation with the vectorized basic information in the basic information similarity calculation database to obtain the specific basic information and the basic information. Calculate the basic information similarity.
  • the basic information similarity calculation unit 33 does not calculate the basic information similarity when the basic information and the specific basic information match by the basic information comparison unit 32.
  • the basic information similarity indicates the degree of similarity between the specific basic information and the basic information, for example, a decimal number of 100 steps from 0 to 1 such as "0.98", a percentage, 10 steps, or 5 steps. It is shown in three or more stages such as.
  • the basic information similarity calculation unit 33 refers to the basic information similarity calculation database, and the "basic information x" included in the specific basic information and the "basic” stored in the basic information similarity calculation database.
  • the basic information similarity is calculated for each of "information a1", “basic information b1", “basic information c1", and "basic information b2".
  • the inner product of the "feature amount q2 of the basic information x" and the “feature amount p1 of the basic information a1" is calculated, for example, “0. It is calculated as “20”.
  • the basic information similarity between the "basic information x" and the “basic information a1” is “0.98”.
  • the basic information similarity between the "basic information x" and the “part information a1” is "0.33”.
  • the basic information similarity between the "basic information x" and the “basic information a1” is "0.85".
  • “basic information x" indicates that it is more similar to "basic information b1" than, for example, “basic information a1".
  • the target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to the relevance database, and selects the target information corresponding to the selected first basic information. Extract as the first target information. When one first basic information is selected from a plurality of basic information, the target information extraction unit 34 extracts one target information corresponding to the selected first basic information as the first target information. Further, when the target information extraction unit 34 selects a plurality of first basic information, the target information corresponding to each of the selected first basic information may be extracted as the first target information.
  • the target information extraction unit 34 may select as the first basic information from the respective basic information labels included in the plurality of basic information based on the calculated basic information similarity.
  • the target information extraction unit 34 may extract the target information corresponding to the basic information label stored in the relevance database as the first target information from the selected basic information label (first basic information). For example, the target information extraction unit 34 selects the basic information label 21, and extracts the target information B1 corresponding to the basic information label 21 stored in the relevance database as the first target information from the selected basic information label 21. You may. Since the basic information label is composed of a character string, the capacity of the basic information similarity calculation database can be reduced as compared with storing the basic information having the text information.
  • the target information extraction unit 34 has the highest basic of "basic information a1", “basic information b1", “basic information c1", and “basic information b2".
  • the "basic information b1" for which the information similarity is calculated is selected as the first basic information.
  • a threshold value may be set for the basic information similarity, and the basic information obtained by calculating the basic information similarity equal to or lower than the threshold value may be selected. This threshold value can be appropriately set on the user side.
  • the target information extraction unit 34 refers to the relevance database and extracts the "target information B1" corresponding to the "basic information b1" selected as the first basic information as the first target information.
  • the target information extraction unit 34 further extracts one or more second target information different from the first target information from the relevance database based on the target information similarity to be described later.
  • the target information extraction unit 34 may select one or a plurality of target information labels from the target information labels included in the plurality of target information based on the calculated target information similarity.
  • the target information extraction unit 34 may extract the target information corresponding to the target information label stored in the relevance database from the selected target information label as the second target information. For example, the target information extraction unit 34 selects the target information label 122, and extracts the target information B2 corresponding to the target information label 122 stored in the relevance database as the second target information from the selected basic information label 122. You may. Since the target information label is composed of a character string, the capacity of the target information similarity calculation database can be reduced as compared with storing the target information having the text information.
  • the target information similarity calculation unit 35 refers to the target information similarity estimation processing database, and determines the target information similarity indicating the similarity between the target information and the first target information extracted by the target information extraction unit 34. calculate.
  • the target information similarity calculation unit 35 calculates the target information similarity using the feature amount of the target information. As the feature amount of the target information, for example, the target information may be vectorized and expressed.
  • the target information similarity calculation unit 35 vectorizes the specific target information, and then performs a vector operation with the vectorized target information in the target information similarity estimation processing database to obtain the specific target information and the target information. Calculate the similarity of the target information of.
  • the target information similarity indicates the degree of similarity between the first target information and the target information, for example, a decimal number of 100 steps from 0 to 1 such as "0.95", a percentage, 10 steps, or 5 steps. It is shown in three or more stages such as.
  • the target information similarity calculation unit 35 refers to the target information similarity calculation database, and is similar to the target information “target information B1” extracted as the first target information by the target information extraction unit 34.
  • the degree of similarity of the target information is calculated with “target information A1", “target information B1", “target information C1”, and “target information B2” stored in the degree calculation database.
  • the inner product of the "feature amount Q1 of the target information B1" and the “feature amount P1 of the target information A1" is calculated, for example, "0. It is calculated as "30".
  • target information similarity between the "target information B1" and the “target information B1” is “1.00".
  • the target information similarity between the "target information B1” and the “target information C1” is “0.20”.
  • the target information similarity between the "target information B1" and the “target information B2” is "0.95".
  • target information B1" indicates that it is more similar to “target information B2" than, for example, "target information A1”.
  • the target information extraction unit 34 further extracts one or more second target information different from the first target information based on the degree of similarity of the target information.
  • the target information extraction unit 34 determines a predetermined target among "target information A1", “target information B1", “target information C1", and “target information B2".
  • the "target information B2" for which the information similarity is calculated is extracted as the second target information.
  • a threshold value may be set for the target information similarity, and the target information for which the target information similarity equal to or less than the threshold value is calculated may be selected. This threshold value can be appropriately set on the user side. Since the target information for which the target information similarity degree "1.00" is calculated matches the first target information, it may be excluded from being selected as the second target information.
  • the input unit 15 inputs various information to the information providing device 1.
  • the input unit 15 inputs various information such as learning data, basic information, and basic contents via the I / F 105, and also inputs various information from the input portion 108 via the I / F 106, for example.
  • the output unit 16 outputs various information such as target information to the output unit 109 or the like.
  • the output unit 16 transmits various information such as target information to the user terminal 5 or the like via, for example, the public communication network 7.
  • the storage unit 17 stores various information such as basic information and target information in the storage unit 104, and retrieves various information stored in the storage unit 104 as needed. Further, the storage unit 17 stores various databases such as a database for calculating the basic information similarity and a database for calculating the target information similarity in the storage unit 104, and retrieves various databases stored in the storage unit 104 as needed. ..
  • Control unit 18 executes machine learning for constructing a database for calculating basic information similarity using a plurality of basic information. Further, the control unit 18 executes machine learning for constructing a database for calculating the similarity of target information using a plurality of target information. The control unit 18 executes machine learning by linear regression, logistic regression, support vector machine, decision tree, regression tree, random forest, gradient boosting tree, neural network, bays, time series, clustering, ensemble learning, and the like.
  • the user terminal 5 indicates a terminal owned by the user.
  • the user terminal 5 in addition to electronic devices such as mobile phones (mobile terminals), smartphones, tablet terminals, wearable terminals, personal computers, and IoT (Internet of Things) devices, those embodied in all kinds of electronic devices are used. You may.
  • a holo lens registered trademark
  • HMD head-mounted display
  • the user terminal 5 may be connected to the information providing device 1 via, for example, the public communication network 7, or may be directly connected to, for example, the information providing device 1.
  • the user can acquire the first target information from the information providing device 1 by using the user terminal 5, and can display various acquired information on the display unit of the user terminal 5. Further, the user may use the user terminal 5 to perform various controls of the information providing device 1.
  • the server 6 stores the above-mentioned various information.
  • Various information sent via, for example, the public communication network 7 is stored in the server 6.
  • the server 6 stores the same information as the storage unit 104, and may send and receive various information to and from the information providing device 1 via the public communication network 7. That is, the information providing device 1 may use the server 6 instead of the storage unit 104.
  • the public communication network 7 is an Internet network or the like to which the information providing device 1 and the like are connected via a communication circuit.
  • the public communication network 7 may be composed of a so-called optical fiber communication network. Further, the public communication network 7 is not limited to the wired communication network, and may be realized by a known communication network such as a wireless communication network.
  • FIG. 8 is a flowchart showing an example of the operation of the information providing system 100 in the present embodiment.
  • the basic information acquisition unit 31 acquires one or more basic information in which basic contents such as specifications are divided into chunk structures as specific basic information (basic information acquisition step S31).
  • the basic information acquisition unit 31 may acquire specific basic contents including specific basic information.
  • the basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31 (basic information comparison step S32). The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
  • the target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to the relevance database, and selects the target information corresponding to the selected first basic information. It is extracted as the first target information (first target information extraction step S34).
  • the target information similarity calculation unit 35 refers to the target information similarity estimation processing database, and the target information stored in the target information similarity estimation processing database and the target information extracted by the target information extraction unit 34. 1 The target information similarity indicating the similarity with the target information is calculated (target information similarity calculation step S35).
  • the target information extraction unit 34 further extracts one or a plurality of second target information different from the first target information based on the target information similarity (second target information extraction step S36).
  • a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which target content having an attribute different from the basic content is divided into chunk structures are stored in correspondence with each other.
  • Relevance database basic information similarity calculation database constructed by machine learning using a plurality of basic information
  • basic information acquisition unit 31 for acquiring specific basic information, basic information, and specific basic information.
  • the basic information similarity calculation database is referred to, and the basic information and the specific basic information are displayed.
  • the basic information similarity calculation unit 33 that calculates the basic information similarity indicating the similarity of, and the first basic information are selected from a plurality of basic information based on the basic information similarity, and the relevance database is referred to. It includes a target information extraction unit 34 that extracts target information corresponding to the first basic information as the first target information.
  • the basic information similarity calculation unit 33 calculates the basic information similarity for specific basic information that does not match the basic information stored in the relevance database by the basic information comparison unit 32. That is, it is not necessary to calculate the basic information similarity for the specific basic information that matches the basic information stored in the relevance database by the basic information comparison unit 32. Therefore, the calculation of the basic information similarity can be performed more efficiently.
  • the first basic information is selected from a plurality of basic information based on the similarity of the basic information, the relevance database is referred to, and the target information corresponding to the first basic information is the first target. Extract as information.
  • the accuracy of selecting the first basic information can be improved.
  • the relevance database is referred to, and the target information corresponding to the first basic information is extracted as the first target information.
  • the part extracted as the first target information corresponds to the edited part in the target content such as the manual. Therefore, when editing the target content such as a manual, it is only necessary to edit the target information extracted as the first target information, and the editing work of the target content can be performed in a short time.
  • the past of the product created based on the past specifications The manual also needs to be created in a new manual.
  • the past specifications that are candidates for editing are selected from the new specifications as the basic contents, and the past manuals corresponding to the past specifications are the new specifications. It can be grasped that it is the target content to be edited by the book.
  • a new specification as specific basic content acquired by the basic information acquisition unit 31 a past specification which is a set of a plurality of basic information stored in the relevance database, and a set of a plurality of target information
  • a past manual is divided into chunk structures. Therefore, it is possible to efficiently extract only the parts changed by the new specifications from the past manuals. Therefore, based on the new specifications, the user can easily grasp the edited part in the past manual. Therefore, for example, when creating a new manual, the past manual can be used as it is for the parts that are not changed in the specifications, and only the parts that are changed in the new specifications can be newly created. So to speak, it is only necessary to edit the difference in the changed part in the specifications. Therefore, the manual editing work can be easily performed.
  • the target information similarity estimation processing database constructed by machine learning using a plurality of target information and the target information similarity estimation processing database are referred to, and the target information and the first.
  • the target information similarity calculation unit 35 for calculating the similarity between the target information and the target information, and the target information extraction unit 34 is different from the first target information based on the target information similarity.
  • the second target information is further extracted.
  • the second target information different from the first target information is further extracted based on the similarity of the target information.
  • the accuracy of selecting the second target information can be improved by selecting the second target information similar to the first target information based on the quantitatively evaluated degree of similarity of the target information. Therefore, when new information is included in the specific basic information or when there is a change, the second target information similar to the first target information is also extracted, so that any part of the target information in which the target content is divided is extracted. The user can immediately grasp whether or not the above applies. Therefore, when editing the target content, it is only necessary to edit the first target information and the target information extracted as the second target information, and the editing work of the target content can be performed in a short time.
  • a past specification that is a candidate to be changed is selected from the new specifications, and a past manual corresponding to this past specification and another past manual similar to the past manual are selected.
  • changes are necessary due to the new specifications.
  • the new specifications, the past specifications, and the past manuals are each divided into chunk structures. Therefore, it is possible to efficiently extract only the parts changed by the new specifications from the past manuals. At this time, a plurality of similar past manuals can be extracted as targets.
  • the user can easily and simultaneously grasp the relevant parts of a plurality of past manuals that should be based on the new specifications. Therefore, for example, when creating a new manual, the past manual can be used as it is for the parts that are not changed in the specifications, and only the parts that are changed in the specifications can be newly created. So to speak, it is only necessary to edit the difference in the changed part in the specifications. Therefore, the manual editing work can be easily performed.
  • the basic information acquisition step S31 is performed after the target information selection step S14.
  • the user can compare the first target information selected by the target information selection unit 14 and the first target information and the second target information extracted by the target information extraction unit 34. Therefore, in the first target information such as a manual, the relevant part to be edited can be immediately grasped.
  • FIG. 9 is a schematic view showing a second example of the information providing system 100 according to the present embodiment.
  • a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content classified by the attribute is divided into chunk units are stored. It has multiple relevance databases.
  • the information providing system 100 includes a first relevance database, a second relevance database, a third relevance database, and a fourth relevance database.
  • a plurality of target information regarding the target content as a manual of the device is stored in the first relevance database.
  • the second relevance database stores a plurality of target information regarding the target content as an advertisement of the device.
  • a plurality of target information regarding the target content as FAQ of the device is stored in the third relevance database.
  • the fourth relevance database stores a plurality of target information regarding the target content as resource data (data such as images, texts, icons, dialogs, menu contents, etc.) referred to by the program executed by the device.
  • the basic information acquisition unit 31 includes specific basic information x1 to x5, specific basic information e1, specific basic information f2, specific basic information g3, and specific basic information h2, which are included in the specific basic content X. (Basic information acquisition step S31).
  • the basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31. The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
  • the basic information comparison unit 32 includes "basic information x1" to "basic information x5", "basic information e1", “basic information f2", “basic information g3”, which are included in the specific basic information.
  • the “basic information h2" is compared with the basic information stored in the relevance database.
  • “Basic information e1” is stored in the first relevance database
  • “basic information f2” is stored in the second relevance database
  • “basic information g3" is stored in the third relevance database.
  • “Basic information h2” is stored in the fourth relevance database
  • “basic information x1" to "basic information x5" are not stored in any of these relevance databases.
  • the basic information comparison unit 32 stores the "basic information e1", “basic information f2", “basic information g3", and “basic information h2" included in the specific basic information in the relevance database. Is determined to match. Further, the basic information comparison unit 32 determines that the "basic information x1" to "basic information x5" do not match the basic information stored in the relevance database.
  • ⁇ Basic information similarity calculation step S33> When the basic information and the specific basic information do not match by the basic information comparison unit 32, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database and stores it in the basic information similarity calculation database. The basic information similarity degree indicating the similarity between the basic information and the specific basic information acquired by the basic information acquisition unit 31 is calculated.
  • the basic information similarity calculation unit 33 does not calculate the basic information similarity when the basic information and the specific basic information match by the basic information comparison unit 32.
  • the basic information similarity calculation unit 33 refers to the basic information similarity calculation database, and refers to the basic information similarity calculation database for "basic information x1" to "basic information x5" included in the specific basic information. Calculate the degree of similarity of basic information with each basic information stored in.
  • the target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to each relevance database, and corresponds to the selected first basic information. Information is extracted as the first target information. When one first basic information is selected from a plurality of basic information, the target information extraction unit 34 extracts one target information corresponding to the selected first basic information as the first target information. Further, when the target information extraction unit 34 selects a plurality of first basic information, the target information corresponding to each of the selected first basic information may be extracted as the first target information.
  • the target information extraction unit 34 may select as the first basic information from the respective basic information labels included in the plurality of basic information based on the calculated basic information similarity.
  • the target information extraction unit 34 may extract the target information corresponding to the basic information label stored in the relevance database as the first target information from the selected basic information label (first basic information). For example, the target information extraction unit 34 selects the basic information label 21, and extracts the target information B1 corresponding to the basic information label 21 stored in the relevance database as the first target information from the selected basic information label 21. You may. Since the basic information label is composed of a character string, the capacity of the basic information similarity calculation database can be reduced as compared with storing the basic information having the text information.
  • the target information extraction unit 34 uses the "basic information e1" for which the highest basic information similarity is calculated as the first basic information for the "basic information x1". select.
  • a threshold value may be set for the basic information similarity, and the basic information obtained by calculating the basic information similarity equal to or lower than the threshold value may be selected. This threshold value can be appropriately set on the user side.
  • the target information extraction unit 34 has "basic information f1" for "basic information x2", “basic information f3" for “basic information x3", and “basic information g2" for "basic information x4".
  • “basic information h2" is selected as the first basic information.
  • the target information extraction unit 34 refers to the first relevance database and extracts the "target information E1" corresponding to the "basic information e1" selected as the first basic information as the first target information. Similarly, the target information extraction unit 34 refers to the second relevance database and selects the "target information F1" corresponding to the “basic information f1" selected as the first basic information as the first basic information "basic”. The “target information F3" corresponding to the "information f3" is extracted as the first target information, respectively.
  • the target information extraction unit 34 refers to the third relevance database and extracts the "target information G2" corresponding to the "basic information g2" selected as the first basic information as the first target information.
  • the target information extraction unit 34 refers to the fourth relevance database and extracts the "target information H2" corresponding to the "basic information h2" selected as the first basic information as the first target information.
  • the target information extraction unit 34 may select one or a plurality of target information labels from the target information labels included in the plurality of target information based on the calculated target information similarity.
  • a relevance database that stores a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content classified by attribute is divided into chunk units. It is provided with a plurality of.
  • the target information corresponding to the first basic information is extracted as the first target information by referring to each relevance database classified for each attribute. Therefore, when new information is included in the specific basic information or when there is a change, the part extracted as the first target information corresponds to the edited part in the target content such as the manual. Can easily grasp the edited part in the target content. Therefore, when editing the target content such as a manual, it is only necessary to edit the target information extracted as the first target information, and the editing work of the target content can be performed in a short time.
  • the part extracted as the first target information is edited in each target content as manual, advertisement, or FAQ. It will correspond to the place. Therefore, the user can easily grasp the edited part in each target content for each attribute such as the manual, the advertisement, and the FAQ.
  • the third example of the information providing device 1 is different from the first example in that it further includes an access control unit.
  • the access control unit is realized, for example, by the CPU 101 executing a program stored in the storage unit 104 or the like with the RAM 103 as a work area.
  • the access control unit controls access to various databases and target contents. Access includes full access, read and write access, comment-only access, read-only access, and line-of-sight access prohibition.
  • the access control unit is performed based on the access control information.
  • the access control information includes a user name and the access assigned to each user name.
  • the access control information is stored in, for example, the storage unit 104.
  • the user When a user is assigned full access, the user has full read and write access to various databases and target content, and the user can use any aspect of the user interface. For example, for full access, the user can change the database configuration. If the user has read and write access, the user has read and write to the target content, but cannot change the database configuration. In the case of comment-only access, the user can insert comments into the target content, but cannot change various databases or target content. With read-only access, the user can view the target content, but cannot make changes to the various databases or the target content, and cannot insert any comments.
  • an access control unit is further provided.
  • a specific one or a plurality of users among the plurality of users can perform a predetermined access based on the access control information. That is, for multiple users who use various databases and target contents, the control of the editing type such as read-only and full access is linked with the authority based on the user's attributes, and for each of various databases and target contents. Can be managed. In particular, it is possible to prevent unintended editing by allowing only authorized users to edit such as writing while making only browsing accessible at the same time.
  • Information providing device 5 User terminal 6: Server 7: Public communication network 10: Housing 15: Input unit 16: Output unit 17: Storage unit 18: Control unit 31: Basic information acquisition unit 32: Basic information comparison unit 33 : Basic information similarity calculation unit 34: Target information extraction unit 35: Target information similarity calculation unit 100: Information providing system 101: CPU 102: ROM 103: RAM 104: Preservation unit 105: I / F 106: I / F 107: I / F 108: Input part 109: Output part 110: Internal bus S31: Basic information acquisition unit Step S32: Basic information comparison step S33: Basic information similarity calculation step S34: First target information extraction step S35: Target information similarity calculation step S36 : Second target information extraction step

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

[Problem] To provide a technique that enables facilitation of comprehension of a portion to be edited. [Solution] The information-providing system according to the present invention comprises: a relational database that stores a plurality of pieces of basic information obtained by dividing a basic content into chunk structures, and a plurality of pieces of object information obtained by dividing an object content into chunk structures, in association with each other; a database that is for calculation of basic information similarity and that is constructed by machine learning using the plurality of pieces of basic information; a basic information-obtaining means for obtaining a specific piece of basic information; a basic information-comparing means for comparing the pieces of basic information with the specific piece of basic information; a basic information similarity-calculating means for, when the specific piece of basic information is not identical to any of the pieces of basic information, calculating the basic information similarities indicating similarities between the pieces of basic information and the specific piece of basic information by referring to the database for calculation of the basic information similarity; and an object information-extraction means for extracting, as a first piece of object information, one of the pieces of object information corresponding to a selected first piece of basic information, on the basis of the basic information similarities.

Description

情報提供システムInformation provision system
 本発明は、情報提供システムに関する。 The present invention relates to an information providing system.
 特許文献1の情報処理装置は、記入済みの文書を読み込む読込手段、前記文書の定型種類及び情報構造を認識する認識手段、前記文書に記入された内容を記憶する記憶手段、読み込まれた文書が最新版でなかった場合に、認識された定型種類に対応する予め登録された定型種類の最新版における情報構造と、認識された情報構造との比較を行う比較手段、比較した結果、変更されている情報構造の差分文書を生成する生成手段、生成した差分文書を出力する出力手段、ユーザーが記入した差分文書が読み取られると、差分文書への記入内容と記憶手段に記憶している文書への記入内容をマージするマージ手段を具備し、前記記憶手段は、前記マージ手段がマージした内容で記憶済みの内容を置き換えて記憶するものである。 The information processing device of Patent Document 1 includes a reading means for reading a completed document, a recognition means for recognizing a fixed type and information structure of the document, a storage means for storing the contents written in the document, and a read document. If it is not the latest version, the information structure in the latest version of the pre-registered standard type corresponding to the recognized standard type and the comparison means for comparing the recognized information structure are changed as a result of comparison. A generation means for generating a difference document having an information structure, an output means for outputting the generated difference document, and a document entered in the difference document and a document stored in the storage means when the difference document entered by the user is read. A merging means for merging the entered contents is provided, and the storage means replaces the stored contents with the contents merged by the merging means and stores the contents.
特開2017-22431号公報Japanese Unexamined Patent Publication No. 2017-22431
 ところで、例えばある装置が古いバージョンから新しいバージョンにバージョンアップした際には、その装置マニュアルについても古いものから新しいものに編集する必要がある。装置マニュアルは、その製品の仕様書に基づいて作成されることから、装置マニュアルの作成者は、新しい仕様書に基づいて、新たなマニュアルを作成する必要がある。 By the way, for example, when a device is upgraded from an old version to a new version, it is necessary to edit the device manual from the old one to the new one. Since the device manual is created based on the specifications of the product, the creator of the device manual needs to create a new manual based on the new specifications.
 しかしながら、特許文献1に開示された情報処理装置は、定型種類と情報構造を認識するものであり、旧文書である古いマニュアルと、これと同じ属性の新文書である新しいマニュアルと、を比較するものである。このため、異なる属性を有する仕様書等からは、マニュアルにおける編集箇所を容易に把握することができないという問題点があった。 However, the information processing apparatus disclosed in Patent Document 1 recognizes a fixed type and an information structure, and compares an old manual which is an old document with a new manual which is a new document having the same attributes. It is a thing. Therefore, there is a problem that the edited part in the manual cannot be easily grasped from the specifications having different attributes.
 そこで本発明は、上述した問題に鑑みて案出されたものであり、その目的とするところは、編集箇所を容易に把握することが可能となる情報提供システムを提供することにある。 Therefore, the present invention was devised in view of the above-mentioned problems, and an object of the present invention is to provide an information providing system capable of easily grasping an edited part.
 本発明に係る情報提供システムは、基礎コンテンツがチャンク構造に分割された複数の基礎情報と、前記基礎コンテンツとは異なる属性の対象コンテンツがチャンク構造に分割された複数の対象情報と、を対応させて記憶される関連性データベースと、複数の前記基礎情報を用いて機械学習により構築される基礎情報類似度算出用データベースと、特定の基礎情報を取得する基礎情報取得手段と、前記基礎情報と、前記特定の基礎情報とを比較する基礎情報比較手段と、前記基礎情報比較手段により前記基礎情報と前記特定の基礎情報とが一致しない場合、前記基礎情報類似度算出用データベースを参照し、前記基礎情報と、前記特定の基礎情報と、の類似度を示す基礎情報類似度を算出する基礎情報類似度算出手段と、前記基礎情報類似度に基づいて、複数の前記基礎情報から第1基礎情報を選択し、前記関連性データベースを参照し、前記第1基礎情報に対応する前記対象情報を第1対象情報として抽出する対象情報抽出手段と、を備えることを特徴とする。 The information providing system according to the present invention associates a plurality of basic information in which the basic content is divided into chunk structures with a plurality of target information in which the target content having an attribute different from the basic content is divided into chunk structures. The relevance database stored in the above, the database for calculating the similarity of basic information constructed by machine learning using the plurality of basic information, the basic information acquisition means for acquiring specific basic information, the basic information, and the basic information. When the basic information comparison means for comparing the specific basic information and the basic information comparison means do not match the basic information with the specific basic information, the basic information similarity calculation database is referred to and the basics are referred to. Basic information indicating the similarity between the information and the specific basic information Basic information similarity calculation means for calculating the similarity, and the first basic information from a plurality of the basic information based on the basic information similarity. It is characterized by comprising a target information extraction means for selecting, referring to the relevance database, and extracting the target information corresponding to the first basic information as the first target information.
 本発明によれば、編集すべき箇所を容易に把握することが可能となる技術を提供することができる。 According to the present invention, it is possible to provide a technique capable of easily grasping a part to be edited.
図1は、本実施形態における情報提供システムの構成の一例を示す模式図である。FIG. 1 is a schematic diagram showing an example of the configuration of the information providing system according to the present embodiment. 図2は、本実施形態における情報提供システムを使用した一例を示す模式図である。FIG. 2 is a schematic diagram showing an example of using the information providing system in the present embodiment. 図3は、本実施形態における情報提供システムの関連性データベースの一例を示す模式図である。FIG. 3 is a schematic diagram showing an example of the relevance database of the information providing system in the present embodiment. 図4は、本実施形態における情報提供システムの基礎情報類似度算出用データベースの一例を示す模式図である。FIG. 4 is a schematic diagram showing an example of a database for calculating the basic information similarity of the information providing system according to the present embodiment. 図5は、本実施形態における情報提供システムの対象情報類似度算出用データベースの一例を示す模式図である。FIG. 5 is a schematic diagram showing an example of a database for calculating the target information similarity of the information providing system according to the present embodiment. 図6は、本実施形態における情報提供システムの情報提供装置の構成の一例を示す模式図である。FIG. 6 is a schematic diagram showing an example of the configuration of the information providing device of the information providing system according to the present embodiment. 図7は、本実施形態における情報提供システムの情報提供装置の機能の一例を示す模式図である。FIG. 7 is a schematic diagram showing an example of the function of the information providing device of the information providing system according to the present embodiment. 図8は、本実施形態における情報提供システムの動作の一例を示すフローチャートである。FIG. 8 is a flowchart showing an example of the operation of the information providing system according to the present embodiment. 図9は、本実施形態における情報提供システムにおける第2例を示す模式図である。FIG. 9 is a schematic diagram showing a second example of the information providing system according to the present embodiment.
 以下、本発明の実施形態における情報提供システムの一例について、図面を参照しながら説明する。 Hereinafter, an example of the information providing system according to the embodiment of the present invention will be described with reference to the drawings.
(情報提供システム100の構成)
 図1は、本実施形態における情報提供システム100の全体の構成を示すブロック図である。
(Configuration of information providing system 100)
FIG. 1 is a block diagram showing the overall configuration of the information providing system 100 according to the present embodiment.
 情報提供システム100は、例えば、装置の仕様書に基づいて装置のマニュアルを新たに作成するマニュアル作成者等のユーザに利用される。 The information providing system 100 is used by a user such as a manual creator who newly creates a manual for the device based on the specifications of the device, for example.
 図1に示すように、情報提供システム100は、情報提供装置1を備える。情報提供装置1は、例えば公衆通信網7を介してユーザ端末5やサーバ6に接続されてもよい。 As shown in FIG. 1, the information providing system 100 includes an information providing device 1. The information providing device 1 may be connected to the user terminal 5 or the server 6 via, for example, the public communication network 7.
 図2は、本実施形態における情報提供システム100を使用した一例を示す模式図である。情報提供装置1は、基礎コンテンツXにおける特定の基礎情報xを取得する。情報提供装置1は、取得した特定の基礎情報xに対する基礎情報類似度を算出する。情報提供装置1は、算出された基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報b1を選択する。情報提供装置1は、関連性データベースを参照し、選択された第1基礎情報b1に対応する対象情報B1を、第1対象情報として抽出する。これにより、取得した特定の基礎情報xに類似する基礎情報b1に対応する対象情報B1が、特定の基礎情報xに基づく編集箇所であることを把握することができる。このため、基礎コンテンツとしての仕様書等から対象コンテンツとしてのマニュアル等を編集する際に、対象情報B1を編集するだけでよく、対象コンテンツの編集作業を短時間で行うことができる。 FIG. 2 is a schematic diagram showing an example using the information providing system 100 in the present embodiment. The information providing device 1 acquires specific basic information x in the basic content X. The information providing device 1 calculates the basic information similarity with respect to the acquired specific basic information x. The information providing device 1 selects the first basic information b1 from a plurality of basic information based on the calculated basic information similarity. The information providing device 1 refers to the relevance database and extracts the target information B1 corresponding to the selected first basic information b1 as the first target information. As a result, it is possible to grasp that the target information B1 corresponding to the acquired basic information b1 similar to the specific basic information x is an edited part based on the specific basic information x. Therefore, when editing the manual or the like as the target content from the specifications or the like as the basic content, it is only necessary to edit the target information B1 and the editing work of the target content can be performed in a short time.
 また、情報提供装置1は、対象情報類似度推定処理用データベースを参照し、第1対象情報B1に対する対象情報類似度を算出する。情報提供装置1は、算出された対象情報類似度に基づいて、第1対象情報B1とは異なる第2対象情報B2を抽出する。これにより、第1対象情報B1に類似する第2対象情報B2も、特定の基礎情報xに基づく編集箇所であることを把握することができる。このため、基礎コンテンツとしての仕様書等から対象コンテンツとしてのマニュアル等を編集する際に、第1対象情報と第2対象情報とを編集するだけでよく、対象コンテンツの編集作業を短時間で行うことができる。 Further, the information providing device 1 refers to the target information similarity estimation processing database and calculates the target information similarity with respect to the first target information B1. The information providing device 1 extracts the second target information B2 different from the first target information B1 based on the calculated target information similarity. As a result, it can be understood that the second target information B2, which is similar to the first target information B1, is also an edited part based on the specific basic information x. Therefore, when editing a manual or the like as a target content from a specification or the like as a basic content, it is only necessary to edit the first target information and the second target information, and the editing work of the target content is performed in a short time. be able to.
 <関連性データベース>
 図3は、本実施形態における情報提供システムの関連性データベースの一例を示す模式図である。関連性データベースは、基礎情報と対象情報とが1対1で対応して、複数記憶される。関連性データベースは、基礎コンテンツがチャンク構造に分割された複数の基礎情報と、基礎コンテンツと関連して基礎コンテンツとは異なる属性の対象コンテンツがチャンク構造に分割された複数の対象情報を、それぞれ対応させて記憶される。基礎コンテンツと対象コンテンツとは、文章情報を含み、更に図表に関する図表情報を含んでいてもよい。対象コンテンツは、基礎コンテンツに基づいて編集すべき対象となるコンテンツである。
<Relevance database>
FIG. 3 is a schematic diagram showing an example of the relevance database of the information providing system in the present embodiment. In the relevance database, a plurality of basic information and target information are stored in a one-to-one correspondence. The relevance database corresponds to a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content having attributes different from the basic content related to the basic content is divided into chunk structures. Let me remember. The basic content and the target content include text information, and may further include chart information related to charts. The target content is the content to be edited based on the basic content.
 ここでいう異なる属性とは、例えば基礎コンテンツが装置の仕様書である場合には、対象コンテンツが当該装置の仕様書以外の情報であることをいい、例えば、装置のマニュアル、広告、FAQ(Frequently Asked Questions)等であればよい。このように、対象コンテンツは、基礎コンテンツに基づいて作成される情報であってもよい。例えば、基礎コンテンツが英語等の第1言語におけるコンテンツである場合には、対象コンテンツは日本語等の第1言語以外のコンテンツであってもよい。例えば、基礎コンテンツが装置で実行されるプログラムが参照するリソースデータ(画像、テキスト、アイコン、ダイアログ、メニュー内容などのデータ)である場合には、対象コンテンツはリソースデータ以外の情報であり、例えば、リソースデータに関するマニュアル、仕様書、メッセージ、アイコン等の情報であればよい。 The different attributes referred to here mean, for example, that when the basic content is a specification of the device, the target content is information other than the specification of the device, for example, a manual, an advertisement, or FAQ (Frequently) of the device. Asked Questions) etc. As described above, the target content may be information created based on the basic content. For example, when the basic content is content in a first language such as English, the target content may be content other than the first language such as Japanese. For example, when the basic content is resource data (data such as images, texts, icons, dialogs, menu contents, etc.) referenced by a program executed on the device, the target content is information other than the resource data, for example, Information such as manuals, specifications, messages, and icons related to resource data may be used.
 基礎情報は、文章情報を含む。基礎情報は、更に図表に関する図表情報を含んでもよい。基礎情報は、基礎情報を識別するための文字列からなる基礎情報ラベルを含んでいてもよい。基礎情報は、例えば基礎コンテンツが医療機器等の装置の仕様書である場合、この仕様書がひとまとまりのデータの塊となったチャンク構造に分割された情報である。基礎情報は、例えば仕様書等の基礎コンテンツが文章毎、章毎、段落毎、ページ毎等のチャンク構造に分割された情報である。基礎情報は、対象コンテンツの作成に用いられる情報として、仕様書がチャンク構造に分割されたもののほか、例えば、インシデント情報、各種論文、対象コンテンツの原典となる情報等がチャンク構造に分割されたものであってもよい。 Basic information includes text information. The basic information may further include chart information regarding charts. The basic information may include a basic information label consisting of a character string for identifying the basic information. The basic information is, for example, when the basic content is a specification of a device such as a medical device, the basic information is information divided into chunk structures in which this specification is a mass of data. The basic information is information in which basic contents such as specifications are divided into chunk structures such as sentences, chapters, paragraphs, and pages. Basic information is information used to create the target content, in addition to the specifications divided into chunk structures, for example, incident information, various treatises, information that is the source of the target content, etc. divided into chunk structures. It may be.
 対象情報は、文章情報を含む。対象情報は、更に図表に関する図表情報を含んでもよい。対象情報は、対象情報を識別するための文字列からなる対象情報ラベルを含んでいてもよい。対象情報は、例えば基礎コンテンツが医療機器等の装置の仕様書である場合、この仕様書に基づいて作成される対象コンテンツとしてのマニュアルが、意味のある情報がひとまとまりのデータの塊となったチャンク構造に分割された情報である。対象情報は、例えばマニュアル等の文章毎、章毎、段落毎、ページ毎等のチャンク構造に分割された情報である。また、基礎情報が英語等の第1言語で作成されている場合、対象情報は第1言語とは異なる日本語等の第2言語で作成されるものであってもよい。 Target information includes text information. The target information may further include chart information related to charts. The target information may include a target information label consisting of a character string for identifying the target information. As for the target information, for example, when the basic content is a specification of a device such as a medical device, the manual as the target content created based on this specification is a mass of data in which meaningful information is collected. Information divided into chunk structures. The target information is information divided into chunk structures such as sentences, chapters, paragraphs, and pages of manuals and the like. When the basic information is created in a first language such as English, the target information may be created in a second language such as Japanese, which is different from the first language.
 図4は、本実施形態における情報提供システムの基礎情報類似度算出用データベースの一例を示す模式図である。図5は、本実施形態における情報提供システムの対象情報類似度算出用データベースの一例を示す模式図である。 FIG. 4 is a schematic diagram showing an example of a database for calculating the basic information similarity of the information providing system according to the present embodiment. FIG. 5 is a schematic diagram showing an example of a database for calculating the target information similarity of the information providing system according to the present embodiment.
 <基礎情報類似度算出用データベース>
 基礎情報類似度算出用データベースは、基礎情報を用いて機械学習により構築される。機械学習の方法として、例えば基礎情報を教師データとして学習用プログラムを使いベクトル化して学習させる。基礎情報は、基礎情報における基礎情報ラベルに対応させて、ベクトル化された状態でパラメータとして基礎情報類似度算出用データベースに記憶される。基礎情報は、基礎情報に対応させて、ベクトル化された状態でパラメータとして基礎情報類似度算出用データベースに記憶されてもよい。
<Database for calculating basic information similarity>
The database for calculating the similarity of basic information is constructed by machine learning using the basic information. As a method of machine learning, for example, basic information is vectorized and learned using a learning program as teacher data. The basic information is stored in the basic information similarity calculation database as a parameter in a vectorized state in correspondence with the basic information label in the basic information. The basic information may be stored in the basic information similarity calculation database as a parameter in a vectorized state in correspondence with the basic information.
 <対象情報類似度推定処理用データベース>
 対象情報類似度推定処理用データベースは、対象情報を用いて機械学習により構築される。機械学習の方法として、例えば対象情報を教師データとして学習用プログラムを使いベクトル化して学習させる。対象情報は、対象情報における対象情報ラベルに対応させて、ベクトル化された状態でパラメータとして対象情報類似度推定処理用データベースに記憶される。対象情報は、対象情報に対応させて、ベクトル化された状態でパラメータとして対象情報類似度推定処理用データベースに記憶されてもよい。
<Database for target information similarity estimation processing>
The database for target information similarity estimation processing is constructed by machine learning using the target information. As a method of machine learning, for example, target information is vectorized and learned using a learning program as teacher data. The target information is stored in the target information similarity estimation processing database as a parameter in a vectorized state in correspondence with the target information label in the target information. The target information may be stored in the target information similarity estimation processing database as a parameter in a vectorized state in correspondence with the target information.
 <情報提供装置1>
 図6は、本実施形態における情報提供システムの情報提供装置1の構成の一例を示す模式図である。情報提供装置1として、パーソナルコンピュータ(PC)のほか、スマートフォンやタブレット端末等の電子機器が用いられてもよい。情報提供装置1は、筐体10と、CPU101と、ROM102と、RAM103と、保存部104と、I/F105~107とを備える。各構成101~107は、内部バス110により接続される。
<Information providing device 1>
FIG. 6 is a schematic diagram showing an example of the configuration of the information providing device 1 of the information providing system according to the present embodiment. As the information providing device 1, in addition to a personal computer (PC), an electronic device such as a smartphone or a tablet terminal may be used. The information providing device 1 includes a housing 10, a CPU 101, a ROM 102, a RAM 103, a storage unit 104, and I / F 105 to 107. Each configuration 101 to 107 is connected by an internal bus 110.
 CPU(Central Processing Unit)101は、情報提供装置1全体を制御する。ROM(Read Only Memory)102は、CPU101の動作コードを格納する。RAM(Random Access Memory)103は、CPU101の動作時に使用される作業領域である。保存部104は、基礎情報、対象情報、基礎情報類似度算出用データベース、対象情報類似度算出用データベース等の各種情報が保存される。保存部104として、例えばHDD(Hard Disk Drive)のほか、SSD(solid state drive)等が用いられる。 The CPU (Central Processing Unit) 101 controls the entire information providing device 1. The ROM (Read Only Memory) 102 stores the operation code of the CPU 101. The RAM (Random Access Memory) 103 is a work area used during the operation of the CPU 101. The storage unit 104 stores various information such as basic information, target information, basic information similarity calculation database, and target information similarity calculation database. As the storage unit 104, for example, in addition to an HDD (Hard Disk Drive), an SSD (solid state drive) or the like is used.
 I/F105は、公衆通信網7を介してユーザ端末5等との各種情報の送受信を行うためのインターフェースである。I/F106は、入力部分108との各種情報の送受信を行うためのインターフェースである。入力部分108として、例えばキーボードが用いられ、情報提供システム100を利用するユーザは、入力部分108を介して、各種情報又は情報提供装置1の制御コマンド等を入力又は選択する。I/F107は、出力部分109との各種情報の送受信を行うためのインターフェースである。出力部分109は、保存部104に保存された各種情報、又は情報提供装置1の処理状況等を出力する。出力部分109として、ディスプレイが用いられ、例えばタッチパネル式でもよい。この場合、出力部分109が入力部分108を含む構成としてもよい。 The I / F 105 is an interface for transmitting and receiving various information to and from the user terminal 5 and the like via the public communication network 7. The I / F 106 is an interface for transmitting and receiving various information to and from the input portion 108. For example, a keyboard is used as the input portion 108, and a user who uses the information providing system 100 inputs or selects various information or a control command of the information providing device 1 via the input portion 108. The I / F 107 is an interface for transmitting and receiving various information to and from the output portion 109. The output unit 109 outputs various information stored in the storage unit 104, the processing status of the information providing device 1, and the like. A display is used as the output portion 109, and a touch panel type may be used, for example. In this case, the output portion 109 may be configured to include the input portion 108.
 図7は、本実施形態における情報提供システムの情報提供装置1の機能の一例を示す模式図である。情報提供装置1は、基礎情報取得部31と、基礎情報比較部32と、基礎情報類似度算出部33と、対象情報抽出部34と、対象情報類似度算出部35と、入力部15と、出力部16と、記憶部17と、制御部18とを備える。なお、図7に示した各機能は、CPU101が、RAM103を作業領域として、保存部104等に記憶されたプログラムを実行することにより実現される。また、各機能は、例えば人工知能により制御されてもよい。ここで、「人工知能」は、いかなる周知の人工知能技術に基づくものであってもよい。 FIG. 7 is a schematic diagram showing an example of the function of the information providing device 1 of the information providing system according to the present embodiment. The information providing device 1 includes a basic information acquisition unit 31, a basic information comparison unit 32, a basic information similarity calculation unit 33, a target information extraction unit 34, a target information similarity calculation unit 35, and an input unit 15. It includes an output unit 16, a storage unit 17, and a control unit 18. Each function shown in FIG. 7 is realized by the CPU 101 executing a program stored in the storage unit 104 or the like with the RAM 103 as a work area. Moreover, each function may be controlled by artificial intelligence, for example. Here, the "artificial intelligence" may be based on any well-known artificial intelligence technology.
 <基礎情報取得部31>
 基礎情報取得部31は、基礎情報、特定の基礎情報等の各種情報を取得する。特定の基礎情報は、これから基礎情報類似度を算出すべき対象となる基礎情報である。
<Basic information acquisition department 31>
The basic information acquisition unit 31 acquires various types of information such as basic information and specific basic information. The specific basic information is the basic information for which the basic information similarity should be calculated from now on.
 <基礎情報比較部32>
 基礎情報比較部32は、関連性データベースに記憶された基礎情報と、基礎情報取得部31により取得した特定の基礎情報と、を比較する。基礎情報比較部32は、基礎情報と、特定の基礎情報と、が一致するか、一致しないか、を判定する。
<Basic information comparison unit 32>
The basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31. The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
 図4の例では、基礎情報取得部31により取得した、基礎コンテンツXにおける特定の基礎情報が「基礎情報x」、「基礎情報a1」、「基礎情報c1」を含むとする。そして、基礎情報比較部32は、特定の基礎情報に含まれる「基礎情報x」、「基礎情報a1」、「基礎情報c1」と、関連性データベースに記憶された基礎情報と、を比較する。関連性データベースには、「基礎情報a1」「基礎情報c1」が記憶され、「基礎情報x」が記憶されていないとする。このとき、基礎情報比較部32は、特定の基礎情報に含まれる「基礎情報a1」「基礎情報c1」が関連性データベースデータベースに記憶された基礎情報に一致すると判定し、判定終了となる。また、基礎情報比較部32は、「基礎情報x」が関連性データベースデータベースに記憶された基礎情報に一致しないと判定する。 In the example of FIG. 4, it is assumed that the specific basic information in the basic content X acquired by the basic information acquisition unit 31 includes "basic information x", "basic information a1", and "basic information c1". Then, the basic information comparison unit 32 compares the "basic information x", "basic information a1", and "basic information c1" included in the specific basic information with the basic information stored in the relevance database. It is assumed that "basic information a1" and "basic information c1" are stored in the relevance database, and "basic information x" is not stored. At this time, the basic information comparison unit 32 determines that the "basic information a1" and "basic information c1" included in the specific basic information match the basic information stored in the relevance database database, and the determination ends. Further, the basic information comparison unit 32 determines that the "basic information x" does not match the basic information stored in the relevance database.
 <基礎情報類似度算出部33>
 基礎情報類似度算出部33は、基礎情報比較部32により基礎情報と特定の基礎情報とが一致しない場合、基礎情報類似度算出用データベースを参照し、基礎情報類似度算出用データベースに記憶された基礎情報と、基礎情報取得部31により取得された特定の基礎情報と、の類似度を示す基礎情報類似度を算出する。基礎情報類似度算出部33は、基礎情報の特徴量を用いて、基礎情報類似度算出する。基礎情報の特徴量として、例えば基礎情報がベクトル化されて表現されてもよい。基礎情報類似度算出部33は、特定の基礎情報をベクトル化した上で、基礎情報類似度算出用データベース内でベクトル化された基礎情報とのベクトル演算により、特定の基礎情報と基礎情報との基礎情報類似度を算出する。
<Basic information similarity calculation unit 33>
When the basic information and the specific basic information do not match by the basic information comparison unit 32, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database and stores it in the basic information similarity calculation database. The basic information similarity degree indicating the similarity between the basic information and the specific basic information acquired by the basic information acquisition unit 31 is calculated. The basic information similarity calculation unit 33 calculates the basic information similarity using the feature amount of the basic information. As a feature amount of basic information, for example, basic information may be vectorized and expressed. The basic information similarity calculation unit 33 vectorizes specific basic information and then performs vector calculation with the vectorized basic information in the basic information similarity calculation database to obtain the specific basic information and the basic information. Calculate the basic information similarity.
 なお、基礎情報類似度算出部33は、基礎情報比較部32により基礎情報と特定の基礎情報とが一致する場合、基礎情報類似度の算出を行わない。 Note that the basic information similarity calculation unit 33 does not calculate the basic information similarity when the basic information and the specific basic information match by the basic information comparison unit 32.
 基礎情報類似度は、特定の基礎情報と、基礎情報とが類似する度合いを示しており、例えば「0.98」等の0~1までの100段階の小数、百分率、10段階、又は5段階等の3段階以上で示される。 The basic information similarity indicates the degree of similarity between the specific basic information and the basic information, for example, a decimal number of 100 steps from 0 to 1 such as "0.98", a percentage, 10 steps, or 5 steps. It is shown in three or more stages such as.
 図4の例では、基礎情報比較部32により特定の基礎情報に含まれる「基礎情報x」と関連性データベースに記憶された基礎情報と一致しない。かかる場合、基礎情報類似度算出部33は、基礎情報類似度算出用データベースを参照し、特定の基礎情報に含まれる「基礎情報x」と、基礎情報類似度算出用データベースに記憶された「基礎情報a1」、「基礎情報b1」、「基礎情報c1」、「基礎情報b2」と、それぞれ基礎情報類似度を算出する。「基礎情報x」と、「基礎情報a1」との基礎情報類似度は、「基礎情報xの特徴量q2」と「基礎情報a1の特徴量p1」の内積を演算して、例えば「0.20」として算出される。同様に「基礎情報x」と、「基礎情報a1」との基礎情報類似度は、「0.98」である。「基礎情報x」と、「部情報a1」との基礎情報類似度は、「0.33」である。「基礎情報x」と、「基礎情報a1」との基礎情報類似度は、「0.85」である。この場合、「基礎情報x」は、例えば「基礎情報a1」に比べて「基礎情報b1」と類似していることを示す。 In the example of FIG. 4, the "basic information x" included in the specific basic information by the basic information comparison unit 32 does not match the basic information stored in the relevance database. In such a case, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database, and the "basic information x" included in the specific basic information and the "basic" stored in the basic information similarity calculation database. The basic information similarity is calculated for each of "information a1", "basic information b1", "basic information c1", and "basic information b2". For the basic information similarity between the "basic information x" and the "basic information a1", the inner product of the "feature amount q2 of the basic information x" and the "feature amount p1 of the basic information a1" is calculated, for example, "0. It is calculated as "20". Similarly, the basic information similarity between the "basic information x" and the "basic information a1" is "0.98". The basic information similarity between the "basic information x" and the "part information a1" is "0.33". The basic information similarity between the "basic information x" and the "basic information a1" is "0.85". In this case, "basic information x" indicates that it is more similar to "basic information b1" than, for example, "basic information a1".
 <対象情報抽出部34>
 対象情報抽出部34は、算出された基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報を選択し、関連性データベースを参照し、選択した第1基礎情報に対応する対象情報を第1対象情報として抽出する。対象情報抽出部34は、複数の基礎情報から1つの第1基礎情報を選択したとき、選択した1つの第1基礎情報に対応する1つの対象情報を第1対象情報として抽出する。また、対象情報抽出部34は、複数の第1基礎情報を選択したとき、選択したそれぞれの第1基礎情報に対応する対象情報をそれぞれ第1対象情報として抽出してもよい。
<Target information extraction unit 34>
The target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to the relevance database, and selects the target information corresponding to the selected first basic information. Extract as the first target information. When one first basic information is selected from a plurality of basic information, the target information extraction unit 34 extracts one target information corresponding to the selected first basic information as the first target information. Further, when the target information extraction unit 34 selects a plurality of first basic information, the target information corresponding to each of the selected first basic information may be extracted as the first target information.
 対象情報抽出部34は、算出された基礎情報類似度に基づいて、複数の基礎情報に含まれるそれぞれの基礎情報ラベルから、第1基礎情報として選択してもよい。対象情報抽出部34は、選択した基礎情報ラベル(第1基礎情報)から、関連性データベースに記憶された基礎情報ラベルに対応する対象情報を第1対象情報として抽出してもよい。例えば、対象情報抽出部34は、基礎情報ラベル21を選択し、選択した基礎情報ラベル21から、関連性データベースに記憶された基礎情報ラベル21に対応する対象情報B1を第1対象情報として抽出してもよい。基礎情報ラベルは、文字列からなるため、文章情報を有する基礎情報を記憶させるよりも、基礎情報類似度算出用データベースの容量を低減することができる。 The target information extraction unit 34 may select as the first basic information from the respective basic information labels included in the plurality of basic information based on the calculated basic information similarity. The target information extraction unit 34 may extract the target information corresponding to the basic information label stored in the relevance database as the first target information from the selected basic information label (first basic information). For example, the target information extraction unit 34 selects the basic information label 21, and extracts the target information B1 corresponding to the basic information label 21 stored in the relevance database as the first target information from the selected basic information label 21. You may. Since the basic information label is composed of a character string, the capacity of the basic information similarity calculation database can be reduced as compared with storing the basic information having the text information.
 図4の例では、対象情報抽出部34は、基礎情報類似度を算出した結果、「基礎情報a1」、「基礎情報b1」、「基礎情報c1」、「基礎情報b2」のうち最も高い基礎情報類似度を算出した「基礎情報b1」を第1基礎情報として選択する。第1基礎情報として選択するとき、基礎情報類似度に閾値を設定し、その閾値以上又は以下の基礎情報類似度を算出した基礎情報を選択してもよい。この閾値は、ユーザ側で適宜設定することができる。 In the example of FIG. 4, as a result of calculating the basic information similarity, the target information extraction unit 34 has the highest basic of "basic information a1", "basic information b1", "basic information c1", and "basic information b2". The "basic information b1" for which the information similarity is calculated is selected as the first basic information. When selecting as the first basic information, a threshold value may be set for the basic information similarity, and the basic information obtained by calculating the basic information similarity equal to or lower than the threshold value may be selected. This threshold value can be appropriately set on the user side.
 そして、対象情報抽出部34は、関連性データベースを参照し、第1基礎情報として選択した「基礎情報b1」に対応する「対象情報B1」を第1対象情報として抽出する。 Then, the target information extraction unit 34 refers to the relevance database and extracts the "target information B1" corresponding to the "basic information b1" selected as the first basic information as the first target information.
 更に、対象情報抽出部34は、後述する対象情報類似度に基づいて、関連性データベースから、第1対象情報とは異なる第2対象情報を更に1又は複数抽出する。 Further, the target information extraction unit 34 further extracts one or more second target information different from the first target information from the relevance database based on the target information similarity to be described later.
 対象情報抽出部34は、算出された対象情報類似度に基づいて、複数の対象情報に含まれる対象情報ラベルから、1又は複数の対象情報ラベルを選択してもよい。対象情報抽出部34は、選択した対象情報ラベルから、関連性データベースに記憶された対象情報ラベルに対応する対象情報を、第2対象情報として抽出してもよい。例えば、対象情報抽出部34は、対象情報ラベル122を選択し、選択した基礎情報ラベル122から、関連性データベースに記憶された対象情報ラベル122に対応する対象情報B2を第2対象情報として抽出してもよい。対象情報ラベルは、文字列からなるため、文章情報を有する対象情報を記憶させるよりも、対象情報類似度算出用データベースの容量を低減することができる。 The target information extraction unit 34 may select one or a plurality of target information labels from the target information labels included in the plurality of target information based on the calculated target information similarity. The target information extraction unit 34 may extract the target information corresponding to the target information label stored in the relevance database from the selected target information label as the second target information. For example, the target information extraction unit 34 selects the target information label 122, and extracts the target information B2 corresponding to the target information label 122 stored in the relevance database as the second target information from the selected basic information label 122. You may. Since the target information label is composed of a character string, the capacity of the target information similarity calculation database can be reduced as compared with storing the target information having the text information.
 <対象情報類似度算出部35>
 対象情報類似度算出部35は、対象情報類似度推定処理用データベースを参照し、対象情報と、対象情報抽出部34により抽出された第1対象情報と、の類似度を示す対象情報類似度を算出する。対象情報類似度算出部35は、対象情報の特徴量を用いて、対象情報類似度を算出する。対象情報の特徴量として、例えば対象情報がベクトル化されて表現されてもよい。対象情報類似度算出部35は、特定の対象情報をベクトル化した上で、対象情報類似度推定処理用データベース内でベクトル化された対象情報とのベクトル演算により、特定の対象情報と対象情報との対象情報類似度を算出する。
<Target information similarity calculation unit 35>
The target information similarity calculation unit 35 refers to the target information similarity estimation processing database, and determines the target information similarity indicating the similarity between the target information and the first target information extracted by the target information extraction unit 34. calculate. The target information similarity calculation unit 35 calculates the target information similarity using the feature amount of the target information. As the feature amount of the target information, for example, the target information may be vectorized and expressed. The target information similarity calculation unit 35 vectorizes the specific target information, and then performs a vector operation with the vectorized target information in the target information similarity estimation processing database to obtain the specific target information and the target information. Calculate the similarity of the target information of.
 対象情報類似度は、第1対象情報と、対象情報とが類似する度合いを示しており、例えば「0.95」等の0~1までの100段階の小数、百分率、10段階、又は5段階等の3段階以上で示される。 The target information similarity indicates the degree of similarity between the first target information and the target information, for example, a decimal number of 100 steps from 0 to 1 such as "0.95", a percentage, 10 steps, or 5 steps. It is shown in three or more stages such as.
 図5の例では、対象情報類似度算出部35は、対象情報類似度算出用データベースを参照し、対象情報抽出部34により第1対象情報として抽出された「対象情報B1」と、対象情報類似度算出用データベースに記憶された「対象情報A1」、「対象情報B1」、「対象情報C1」、「対象情報B2」と、それぞれ対象情報類似度を算出する。「対象情報B1」と、「対象情報A1」との対象情報類似度は、「対象情報B1の特徴量Q1」と「対象情報A1の特徴量P1」の内積を演算して、例えば「0.30」と算出される。同様に、「対象情報B1」と、「対象情報B1」との対象情報類似度は、「1.00」である。「対象情報B1」と、「対象情報C1」との対象情報類似度は、「0.20」である。「対象情報B1」と、「対象情報B2」との対象情報類似度は、「0.95」である。この場合、「対象情報B1」は、例えば「対象情報A1」に比べて「対象情報B2」と類似していることを示す。 In the example of FIG. 5, the target information similarity calculation unit 35 refers to the target information similarity calculation database, and is similar to the target information “target information B1” extracted as the first target information by the target information extraction unit 34. The degree of similarity of the target information is calculated with "target information A1", "target information B1", "target information C1", and "target information B2" stored in the degree calculation database. For the target information similarity between the "target information B1" and the "target information A1", the inner product of the "feature amount Q1 of the target information B1" and the "feature amount P1 of the target information A1" is calculated, for example, "0. It is calculated as "30". Similarly, the target information similarity between the "target information B1" and the "target information B1" is "1.00". The target information similarity between the "target information B1" and the "target information C1" is "0.20". The target information similarity between the "target information B1" and the "target information B2" is "0.95". In this case, "target information B1" indicates that it is more similar to "target information B2" than, for example, "target information A1".
 上述したとおり、対象情報抽出部34は、対象情報類似度に基づいて、第1対象情報とは異なる第2対象情報を更に1又は複数抽出する。 As described above, the target information extraction unit 34 further extracts one or more second target information different from the first target information based on the degree of similarity of the target information.
 図5の例では、対象情報抽出部34は、対象情報類似度を算出した結果、「対象情報A1」、「対象情報B1」、「対象情報C1」、「対象情報B2」のうち所定の対象情報類似度を算出した「対象情報B2」を第2対象情報として抽出する。第2対象情報を選択するとき、対象情報類似度に閾値を設定し、その閾値以上又は以下の対象情報類似度を算出した対象情報を選択してもよい。この閾値は、ユーザ側で適宜設定することができる。なお、対象情報類似度「1.00」を算出した対象情報については、第1対象情報に一致することになるため、第2対象情報として選択されるのを除外してもよい。 In the example of FIG. 5, as a result of calculating the target information similarity, the target information extraction unit 34 determines a predetermined target among "target information A1", "target information B1", "target information C1", and "target information B2". The "target information B2" for which the information similarity is calculated is extracted as the second target information. When selecting the second target information, a threshold value may be set for the target information similarity, and the target information for which the target information similarity equal to or less than the threshold value is calculated may be selected. This threshold value can be appropriately set on the user side. Since the target information for which the target information similarity degree "1.00" is calculated matches the first target information, it may be excluded from being selected as the second target information.
 <入力部15>
 入力部15は、情報提供装置1に各種情報を入力する。入力部15は、I/F105を介して学習データ、基礎情報、基礎コンテンツ等の各種情報を入力するほか、例えばI/F106を介して入力部分108から各種情報を入力する。
<Input unit 15>
The input unit 15 inputs various information to the information providing device 1. The input unit 15 inputs various information such as learning data, basic information, and basic contents via the I / F 105, and also inputs various information from the input portion 108 via the I / F 106, for example.
 <出力部16>
 出力部16は、対象情報等の各種情報を出力部分109等に出力する。出力部16は、例えば公衆通信網7を介して、ユーザ端末5等に対象情報等の各種情報を送信する。
<Output unit 16>
The output unit 16 outputs various information such as target information to the output unit 109 or the like. The output unit 16 transmits various information such as target information to the user terminal 5 or the like via, for example, the public communication network 7.
 <記憶部17>
 記憶部17は、基礎情報や対象情報等の各種情報を保存部104に記憶し、必要に応じて保存部104に記憶された各種情報を取出す。また、記憶部17は、基礎情報類似度算出用データベース、対象情報類似度算出用データベース等の各種データベースを、保存部104に記憶し、必要に応じて保存部104に記憶された各種データベースを取出す。
<Memory unit 17>
The storage unit 17 stores various information such as basic information and target information in the storage unit 104, and retrieves various information stored in the storage unit 104 as needed. Further, the storage unit 17 stores various databases such as a database for calculating the basic information similarity and a database for calculating the target information similarity in the storage unit 104, and retrieves various databases stored in the storage unit 104 as needed. ..
 <制御部18>
 制御部18は、複数の基礎情報を用いて基礎情報類似度算出用データベースを構築するための機械学習を実行する。また、制御部18は、複数の対象情報を用いて対象情報類似度算出用データベースを構築するための機械学習を実行する。制御部18は、線形回帰、ロジスティック回帰、サポートベクターマシーン、決定木、回帰木、ランダムフォレスト、勾配ブースティング木、ニューラルネットワーク、ベイズ、時系列、クラスタリング、アンサンブル学習等により機械学習を実行する。
<Control unit 18>
The control unit 18 executes machine learning for constructing a database for calculating basic information similarity using a plurality of basic information. Further, the control unit 18 executes machine learning for constructing a database for calculating the similarity of target information using a plurality of target information. The control unit 18 executes machine learning by linear regression, logistic regression, support vector machine, decision tree, regression tree, random forest, gradient boosting tree, neural network, bays, time series, clustering, ensemble learning, and the like.
 <ユーザ端末5>
 ユーザ端末5は、ユーザが保有する端末を示す。ユーザ端末5は、携帯電話(携帯端末)、スマートフォン、タブレット型端末、ウェアラブル端末、パーソナルコンピュータ、IoT(Internet of Things)デバイス等の電子機器のほか、あらゆる電子機器で具現化されたものが用いられてもよい。ユーザ端末5は、HMD(ヘッドマウントディスプレイ)の1種類であるホロレンズ(登録商標)が用いられてもよい。ユーザ端末5は、例えば公衆通信網7を介して情報提供装置1と接続されるほか、例えば情報提供装置1と直接接続されてもよい。ユーザは、ユーザ端末5を用いて、情報提供装置1から第1対象情報を取得するほか、ユーザ端末5の表示部に取得した各種情報を表示させることができる。また、ユーザは、ユーザ端末5を用いて、情報提供装置1の各種制御を行ってもよい。
<User terminal 5>
The user terminal 5 indicates a terminal owned by the user. As the user terminal 5, in addition to electronic devices such as mobile phones (mobile terminals), smartphones, tablet terminals, wearable terminals, personal computers, and IoT (Internet of Things) devices, those embodied in all kinds of electronic devices are used. You may. As the user terminal 5, a holo lens (registered trademark), which is one type of HMD (head-mounted display), may be used. The user terminal 5 may be connected to the information providing device 1 via, for example, the public communication network 7, or may be directly connected to, for example, the information providing device 1. The user can acquire the first target information from the information providing device 1 by using the user terminal 5, and can display various acquired information on the display unit of the user terminal 5. Further, the user may use the user terminal 5 to perform various controls of the information providing device 1.
 <サーバ6>
 サーバ6には、上述した各種情報が記憶される。サーバ6には、例えば公衆通信網7を介して送られてきた各種情報が蓄積される。サーバ6には、例えば保存部104と同様の情報が記憶され、公衆通信網7を介して情報提供装置1と各種情報の送受信が行われてもよい。すなわち、情報提供装置1は、保存部104の代わりにサーバ6を用いてもよい。
<Server 6>
The server 6 stores the above-mentioned various information. Various information sent via, for example, the public communication network 7 is stored in the server 6. For example, the server 6 stores the same information as the storage unit 104, and may send and receive various information to and from the information providing device 1 via the public communication network 7. That is, the information providing device 1 may use the server 6 instead of the storage unit 104.
 <公衆通信網7>
 公衆通信網7は、情報提供装置1等が通信回路を介して接続されるインターネット網等である。公衆通信網7は、いわゆる光ファイバ通信網で構成されてもよい。また、公衆通信網7は、有線通信網には限定されず、無線通信網等の公知の通信網で実現してもよい。
<Public communication network 7>
The public communication network 7 is an Internet network or the like to which the information providing device 1 and the like are connected via a communication circuit. The public communication network 7 may be composed of a so-called optical fiber communication network. Further, the public communication network 7 is not limited to the wired communication network, and may be realized by a known communication network such as a wireless communication network.
(情報提供システム100の動作の第1例)
 次に、本実施形態における情報提供システム100の動作の一例について説明する。図8は、本実施形態における情報提供システム100の動作の一例を示すフローチャートである。
(First example of operation of information providing system 100)
Next, an example of the operation of the information providing system 100 in this embodiment will be described. FIG. 8 is a flowchart showing an example of the operation of the information providing system 100 in the present embodiment.
 <基礎情報取得ステップS31>
 基礎情報取得部31は、例えば仕様書等の基礎コンテンツがチャンク構造に分割された基礎情報を特定の基礎情報として1又は複数取得する(基礎情報取得ステップS31)。基礎情報取得部31は、特定の基礎情報を含む特定の基礎コンテンツを取得してもよい。
<Basic information acquisition step S31>
The basic information acquisition unit 31 acquires one or more basic information in which basic contents such as specifications are divided into chunk structures as specific basic information (basic information acquisition step S31). The basic information acquisition unit 31 may acquire specific basic contents including specific basic information.
 <基礎情報比較ステップS32>
 次に、基礎情報比較部32は、関連性データベースに記憶された基礎情報と、基礎情報取得部31により取得した特定の基礎情報と、を比較する(基礎情報比較ステップS32)。基礎情報比較部32は、基礎情報と、特定の基礎情報と、が一致するか、一致しないか、を判定する。
<Basic information comparison step S32>
Next, the basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31 (basic information comparison step S32). The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
 <基礎情報類似度算出ステップS33>
 次に、基礎情報類似度算出部33は、基礎情報比較部32により比較した結果、基礎情報と特定の基礎情報とが一致しない場合、基礎情報類似度算出用データベースを参照し、基礎情報類似度算出用データベースに記憶された基礎情報と、基礎情報取得部31により取得された特定の基礎情報と、の類似度を示す基礎情報類似度を算出する(基礎情報類似度算出ステップS33)。
<Basic information similarity calculation step S33>
Next, when the basic information similarity calculation unit 33 does not match the basic information and the specific basic information as a result of comparison by the basic information comparison unit 32, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database and the basic information similarity degree. The basic information similarity degree indicating the similarity between the basic information stored in the calculation database and the specific basic information acquired by the basic information acquisition unit 31 is calculated (basic information similarity calculation step S33).
 <第1対象情報抽出ステップS34>
 対象情報抽出部34は、算出された基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報を選択し、関連性データベースを参照し、選択した第1基礎情報に対応する対象情報を第1対象情報として抽出する(第1対象情報抽出ステップS34)。
<First target information extraction step S34>
The target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to the relevance database, and selects the target information corresponding to the selected first basic information. It is extracted as the first target information (first target information extraction step S34).
 <対象情報類似度算出ステップS35>
 次に、対象情報類似度算出部35は、対象情報類似度推定処理用データベースを参照し、対象情報類似度推定処理用データベースに記憶された対象情報と、対象情報抽出部34により抽出された第1対象情報と、の類似度を示す対象情報類似度を算出する(対象情報類似度算出ステップS35)。
<Target information similarity calculation step S35>
Next, the target information similarity calculation unit 35 refers to the target information similarity estimation processing database, and the target information stored in the target information similarity estimation processing database and the target information extracted by the target information extraction unit 34. 1 The target information similarity indicating the similarity with the target information is calculated (target information similarity calculation step S35).
 <第2対象情報抽出ステップS36>
 次に、対象情報抽出部34は、対象情報類似度に基づいて、第1対象情報とは異なる第2対象情報を更に1又は複数抽出する(第2対象情報抽出ステップS36)。
<Second target information extraction step S36>
Next, the target information extraction unit 34 further extracts one or a plurality of second target information different from the first target information based on the target information similarity (second target information extraction step S36).
 以上で、情報提供システム100の動作の一例が完了する。 This completes an example of the operation of the information providing system 100.
 本実施形態によれば、基礎コンテンツがチャンク構造に分割された複数の基礎情報と、基礎コンテンツとは異なる属性の対象コンテンツがチャンク構造に分割された複数の対象情報と、を対応させて記憶される関連性データベースと、複数の基礎情報を用いて機械学習により構築される基礎情報類似度算出用データベースと、特定の基礎情報を取得する基礎情報取得部31と、基礎情報と、特定の基礎情報とを比較する基礎情報比較部32と、基礎情報比較部32により基礎情報と特定の基礎情報とが一致しない場合、基礎情報類似度算出用データベースを参照し、基礎情報と、特定の基礎情報と、の類似度を示す基礎情報類似度を算出する基礎情報類似度算出部33と、基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報を選択し、関連性データベースを参照し、第1基礎情報に対応する対象情報を第1対象情報として抽出する対象情報抽出部34と、を備える。 According to the present embodiment, a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which target content having an attribute different from the basic content is divided into chunk structures are stored in correspondence with each other. Relevance database, basic information similarity calculation database constructed by machine learning using a plurality of basic information, basic information acquisition unit 31 for acquiring specific basic information, basic information, and specific basic information. When the basic information and the specific basic information do not match by the basic information comparison unit 32 and the basic information comparison unit 32, the basic information similarity calculation database is referred to, and the basic information and the specific basic information are displayed. The basic information similarity calculation unit 33 that calculates the basic information similarity indicating the similarity of, and the first basic information are selected from a plurality of basic information based on the basic information similarity, and the relevance database is referred to. It includes a target information extraction unit 34 that extracts target information corresponding to the first basic information as the first target information.
 本実施形態によれば、基礎情報類似度算出部33は、基礎情報比較部32により関連性データベースに記憶された基礎情報に一致しない特定の基礎情報について、基礎情報類似度の算出を行う。すなわち、基礎情報比較部32により関連性データベースに記憶された基礎情報に一致する特定の基礎情報については、基礎情報類似度の算出を行う必要がない。このため、基礎情報類似度の算出をより効率的に行うことができる。 According to the present embodiment, the basic information similarity calculation unit 33 calculates the basic information similarity for specific basic information that does not match the basic information stored in the relevance database by the basic information comparison unit 32. That is, it is not necessary to calculate the basic information similarity for the specific basic information that matches the basic information stored in the relevance database by the basic information comparison unit 32. Therefore, the calculation of the basic information similarity can be performed more efficiently.
 特に、本実施形態によれば、基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報を選択し、関連性データベースを参照し、第1基礎情報に対応する対象情報を第1対象情報として抽出する。これにより、定量的に評価された基礎情報類似度に基づいて、特定の基礎情報に類似する第1基礎情報を選択することで、第1基礎情報の選択の精度を向上させることができる。 In particular, according to the present embodiment, the first basic information is selected from a plurality of basic information based on the similarity of the basic information, the relevance database is referred to, and the target information corresponding to the first basic information is the first target. Extract as information. Thereby, by selecting the first basic information similar to the specific basic information based on the quantitatively evaluated basic information similarity, the accuracy of selecting the first basic information can be improved.
 特に、本実施形態によれば、関連性データベースを参照し、第1基礎情報に対応する対象情報を第1対象情報として抽出する。特定の基礎情報に新しい情報が含まれる場合や変更があった場合には、第1対象情報として抽出した箇所が、マニュアル等の対象コンテンツにおける編集箇所に該当することになる。このため、マニュアル等の対象コンテンツを編集する際には、第1対象情報として抽出された対象情報を編集するだけでよく、対象コンテンツの編集作業を短時間で行うことができる。 In particular, according to the present embodiment, the relevance database is referred to, and the target information corresponding to the first basic information is extracted as the first target information. When new information is included in the specific basic information or when there is a change, the part extracted as the first target information corresponds to the edited part in the target content such as the manual. Therefore, when editing the target content such as a manual, it is only necessary to edit the target information extracted as the first target information, and the editing work of the target content can be performed in a short time.
 例えば、ある装置が古いバージョンから新しいバージョンにバージョンアップして、過去の仕様書から一部が変更されて新しい仕様書となった場合には、過去の仕様書に基づいて作成した製品の過去のマニュアルも、新しいマニュアルに作成する必要がある。このとき、従来では、新しい仕様書から過去のマニュアルにおける編集箇所を把握することが難しかった。上記したように、本実施形態によれば、基礎コンテンツとしての新しい仕様書から、編集すべき候補となる過去の仕様書を選択し、この過去の仕様書に対応する過去のマニュアルが、新しい仕様書によって編集すべき対象となる対象コンテンツであると把握することができる。このとき、基礎情報取得部31により取得される特定の基礎コンテンツとして新しい仕様書、関連性データベースに記憶される複数の基礎情報の集合である過去の仕様書、及び、複数の対象情報の集合である過去のマニュアル、がそれぞれチャンク構造に分割されている。このため、過去のマニュアルから、新しい仕様書によって変更が生じた部分のみ、を効率よく抽出することができる。このため、新しい仕様書に基づいて、過去のマニュアルにおける編集箇所をユーザは容易に把握できる。よって、例えば新しいマニュアルを作成する際、仕様書で変更の無い部分については過去のマニュアルをそのまま流用し、新しい仕様書において変更のあった部分についてのみ、新たに作成することができる。いわば、仕様書で変更のあった部分のみを差分編集すればよいこととなる。このため、マニュアルの編集作業を容易に行うことが可能となる。 For example, if a device is upgraded from an old version to a new version and a part of the past specifications is changed to become a new specification, the past of the product created based on the past specifications The manual also needs to be created in a new manual. At this time, in the past, it was difficult to grasp the edited part in the past manual from the new specification. As described above, according to the present embodiment, the past specifications that are candidates for editing are selected from the new specifications as the basic contents, and the past manuals corresponding to the past specifications are the new specifications. It can be grasped that it is the target content to be edited by the book. At this time, a new specification as specific basic content acquired by the basic information acquisition unit 31, a past specification which is a set of a plurality of basic information stored in the relevance database, and a set of a plurality of target information A past manual is divided into chunk structures. Therefore, it is possible to efficiently extract only the parts changed by the new specifications from the past manuals. Therefore, based on the new specifications, the user can easily grasp the edited part in the past manual. Therefore, for example, when creating a new manual, the past manual can be used as it is for the parts that are not changed in the specifications, and only the parts that are changed in the new specifications can be newly created. So to speak, it is only necessary to edit the difference in the changed part in the specifications. Therefore, the manual editing work can be easily performed.
 また、本実施形態によれば、複数の対象情報を用いて機械学習により構築された対象情報類似度推定処理用データベースと、対象情報類似度推定処理用データベースを参照し、対象情報と、第1対象情報と、の類似度を示す対象情報類似度を算出する対象情報類似度算出部35と、を備え、対象情報抽出部34は、対象情報類似度に基づいて、第1対象情報とは異なる第2対象情報を更に抽出する。 Further, according to the present embodiment, the target information similarity estimation processing database constructed by machine learning using a plurality of target information and the target information similarity estimation processing database are referred to, and the target information and the first. The target information similarity calculation unit 35 for calculating the similarity between the target information and the target information, and the target information extraction unit 34 is different from the first target information based on the target information similarity. The second target information is further extracted.
 本実施形態によれば、対象情報類似度に基づいて、第1対象情報とは異なる第2対象情報を更に抽出する。これにより、定量的に評価された対象情報類似度に基づいて、第1対象情報に類似する第2対象情報を選択することで、第2対象情報の選択の精度を向上させることができる。このため、特定の基礎情報に新しい情報が含まれる場合や変更があった場合には、第1対象情報に類似する第2対象情報も抽出するため、対象コンテンツが分割された対象情報のどの部分に該当するかを、ユーザは即座に把握することができる。このため、対象コンテンツを編集する際には、第1対象情報と第2対象情報として抽出された対象情報を編集するだけでよく、対象コンテンツの編集作業を短時間で行うことができる。 According to this embodiment, the second target information different from the first target information is further extracted based on the similarity of the target information. As a result, the accuracy of selecting the second target information can be improved by selecting the second target information similar to the first target information based on the quantitatively evaluated degree of similarity of the target information. Therefore, when new information is included in the specific basic information or when there is a change, the second target information similar to the first target information is also extracted, so that any part of the target information in which the target content is divided is extracted. The user can immediately grasp whether or not the above applies. Therefore, when editing the target content, it is only necessary to edit the first target information and the target information extracted as the second target information, and the editing work of the target content can be performed in a short time.
 すなわち、ある装置が複数のバージョンを有しており、複数の過去の仕様書から一部が変更されて新しい仕様書となった場合には、複数の過去の仕様書に基づいて作成した製品のそれぞれの過去のマニュアルも、新しいマニュアルに作成する必要がある。本実施形態によれば、新しい仕様書から、変更すべき候補となる過去の仕様書を選択し、この過去の仕様書に対応する過去のマニュアルと、過去のマニュアルに類似する他の過去のマニュアルとが、新しい仕様書によって変更が必要であると把握することができる。このとき、新しい仕様書、過去の仕様書、過去のマニュアルがそれぞれチャンク構造に分割されている。このため、過去のマニュアルから、新しい仕様書によって変更が生じた部分のみ、を効率よく抽出することができる。このとき、類似する複数の過去のマニュアルを対象として抽出することができる。このため、新しい仕様書に基づいてすべき複数の過去のマニュアルの該当部分を、ユーザは容易に、かつ同時に把握できる。よって、例えば新しいマニュアルを作成する際、仕様書で変更の無い部分については過去のマニュアルをそのまま流用し、仕様書で変更のあった部分についてのみ、新たに作成することができる。いわば、仕様書で変更のあった部分のみを差分編集すればよいこととなる。このため、マニュアルの編集作業を容易に行うことが可能となる。 That is, if a device has multiple versions and some of the past specifications are changed to new specifications, the product created based on the multiple past specifications. Each past manual also needs to be created in a new manual. According to the present embodiment, a past specification that is a candidate to be changed is selected from the new specifications, and a past manual corresponding to this past specification and another past manual similar to the past manual are selected. However, it can be understood that changes are necessary due to the new specifications. At this time, the new specifications, the past specifications, and the past manuals are each divided into chunk structures. Therefore, it is possible to efficiently extract only the parts changed by the new specifications from the past manuals. At this time, a plurality of similar past manuals can be extracted as targets. Therefore, the user can easily and simultaneously grasp the relevant parts of a plurality of past manuals that should be based on the new specifications. Therefore, for example, when creating a new manual, the past manual can be used as it is for the parts that are not changed in the specifications, and only the parts that are changed in the specifications can be newly created. So to speak, it is only necessary to edit the difference in the changed part in the specifications. Therefore, the manual editing work can be easily performed.
 本実施形態によれば、対象情報選択ステップS14の後に、基礎情報取得ステップS31を行う。これにより、ユーザは、対象情報選択部14により選択した第1対象情報、並びに、対象情報抽出部34により抽出した第1対象情報及び第2対象情報を比較することができる。このため、マニュアル等の第1対象情報において、編集すべき該当箇所を即座に把握することができる。 According to this embodiment, the basic information acquisition step S31 is performed after the target information selection step S14. As a result, the user can compare the first target information selected by the target information selection unit 14 and the first target information and the second target information extracted by the target information extraction unit 34. Therefore, in the first target information such as a manual, the relevant part to be edited can be immediately grasped.
 <情報提供装置1の第2例>
 図9は、本実施形態における情報提供システム100における第2例を示す模式図である。情報提供システム100の第2例では、基礎コンテンツがチャンク構造に分割された複数の基礎情報と、属性ごとに分類された対象コンテンツがチャンク単位に分割された複数の対象情報と、が記憶される関連性データベースを複数備える。
<Second example of information providing device 1>
FIG. 9 is a schematic view showing a second example of the information providing system 100 according to the present embodiment. In the second example of the information providing system 100, a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content classified by the attribute is divided into chunk units are stored. It has multiple relevance databases.
 図9の例では、情報提供システム100は、第1関連性データベース、第2関連性データベース、第3関連性データベース、第4関連性データベース、を備える。例えば、基礎コンテンツが装置の仕様書である場合、第1関連性データベースには、装置のマニュアルとしての対象コンテンツに関する複数の対象情報が記憶される。第2関連性データベースには、装置の広告としての対象コンテンツに関する複数の対象情報が記憶される。第3関連性データベースには、装置のFAQとしての対象コンテンツに関する複数の対象情報が記憶される。第4関連性データベースには、装置で実行されるプログラムが参照するリソースデータ(画像、テキスト、アイコン、ダイアログ、メニュー内容などのデータ)としての対象コンテンツに関する複数の対象情報が記憶される。 In the example of FIG. 9, the information providing system 100 includes a first relevance database, a second relevance database, a third relevance database, and a fourth relevance database. For example, when the basic content is a specification of the device, a plurality of target information regarding the target content as a manual of the device is stored in the first relevance database. The second relevance database stores a plurality of target information regarding the target content as an advertisement of the device. A plurality of target information regarding the target content as FAQ of the device is stored in the third relevance database. The fourth relevance database stores a plurality of target information regarding the target content as resource data (data such as images, texts, icons, dialogs, menu contents, etc.) referred to by the program executed by the device.
 第2例に係る情報提供システム100の動作について説明する。 The operation of the information providing system 100 according to the second example will be described.
 <基礎情報取得ステップS31>
 先ず、基礎情報取得部31は、特定の基礎コンテンツXに含まれる、特定の基礎情報x1~x5、特定の基礎情報e1、特定の基礎情報f2、特定の基礎情報g3、特定の基礎情報h2、を取得する(基礎情報取得ステップS31)。
<Basic information acquisition step S31>
First, the basic information acquisition unit 31 includes specific basic information x1 to x5, specific basic information e1, specific basic information f2, specific basic information g3, and specific basic information h2, which are included in the specific basic content X. (Basic information acquisition step S31).
 <基礎情報比較ステップS32>
 基礎情報比較部32は、関連性データベースに記憶された基礎情報と、基礎情報取得部31により取得した特定の基礎情報と、を比較する。基礎情報比較部32は、基礎情報と、特定の基礎情報と、が一致するか、一致しないか、を判定する。
<Basic information comparison step S32>
The basic information comparison unit 32 compares the basic information stored in the relevance database with the specific basic information acquired by the basic information acquisition unit 31. The basic information comparison unit 32 determines whether the basic information and the specific basic information match or do not match.
 図9の例では、基礎情報比較部32は、特定の基礎情報に含まれる「基礎情報x1」~「基礎情報x5」、「基礎情報e1」、「基礎情報f2」、「基礎情報g3」、「基礎情報h2」と、関連性データベースに記憶された基礎情報と、を比較する。第1関連性データベースには、「基礎情報e1」が記憶され、第2関連性データベースには、「基礎情報f2」が記憶され、第3関連性データベースには、「基礎情報g3」が記憶され、第4関連性データベースには、「基礎情報h2」が記憶され、これら何れの関連性データベースには、「基礎情報x1」~「基礎情報x5」、が記憶されていないとする。このとき、基礎情報比較部32は、特定の基礎情報に含まれる「基礎情報e1」、「基礎情報f2」、「基礎情報g3」、「基礎情報h2」が関連性データベースに記憶された基礎情報に一致すると判定する。また、基礎情報比較部32は、「基礎情報x1」~「基礎情報x5」が関連性データベースデータベースに記憶された基礎情報に一致しないと判定する。 In the example of FIG. 9, the basic information comparison unit 32 includes "basic information x1" to "basic information x5", "basic information e1", "basic information f2", "basic information g3", which are included in the specific basic information. The "basic information h2" is compared with the basic information stored in the relevance database. "Basic information e1" is stored in the first relevance database, "basic information f2" is stored in the second relevance database, and "basic information g3" is stored in the third relevance database. , "Basic information h2" is stored in the fourth relevance database, and "basic information x1" to "basic information x5" are not stored in any of these relevance databases. At this time, the basic information comparison unit 32 stores the "basic information e1", "basic information f2", "basic information g3", and "basic information h2" included in the specific basic information in the relevance database. Is determined to match. Further, the basic information comparison unit 32 determines that the "basic information x1" to "basic information x5" do not match the basic information stored in the relevance database.
 <基礎情報類似度算出ステップS33>
 基礎情報類似度算出部33は、基礎情報比較部32により基礎情報と特定の基礎情報とが一致しない場合、基礎情報類似度算出用データベースを参照し、基礎情報類似度算出用データベースに記憶された基礎情報と、基礎情報取得部31により取得された特定の基礎情報と、の類似度を示す基礎情報類似度を算出する。
<Basic information similarity calculation step S33>
When the basic information and the specific basic information do not match by the basic information comparison unit 32, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database and stores it in the basic information similarity calculation database. The basic information similarity degree indicating the similarity between the basic information and the specific basic information acquired by the basic information acquisition unit 31 is calculated.
 なお、基礎情報類似度算出部33は、基礎情報比較部32により基礎情報と特定の基礎情報とが一致する場合、基礎情報類似度の算出を行わない。 Note that the basic information similarity calculation unit 33 does not calculate the basic information similarity when the basic information and the specific basic information match by the basic information comparison unit 32.
 図9の例では、基礎情報比較部32により特定の基礎情報に含まれる「基礎情報x1」~「基礎情報x5」と、それぞれの関連性データベースに記憶された基礎情報と一致しない。かかる場合、基礎情報類似度算出部33は、基礎情報類似度算出用データベースを参照し、特定の基礎情報に含まれる「基礎情報x1」~「基礎情報x5」について、基礎情報類似度算出用データベースに記憶されたそれぞれの基礎情報との基礎情報類似度を算出する。 In the example of FIG. 9, the "basic information x1" to "basic information x5" included in the specific basic information by the basic information comparison unit 32 do not match the basic information stored in the respective relevance databases. In such a case, the basic information similarity calculation unit 33 refers to the basic information similarity calculation database, and refers to the basic information similarity calculation database for "basic information x1" to "basic information x5" included in the specific basic information. Calculate the degree of similarity of basic information with each basic information stored in.
 <第1対象情報抽出ステップS34>
 対象情報抽出部34は、算出された基礎情報類似度に基づいて、複数の基礎情報から第1基礎情報を選択し、それぞれの関連性データベースを参照し、選択した第1基礎情報に対応する対象情報を第1対象情報として抽出する。対象情報抽出部34は、複数の基礎情報から1つの第1基礎情報を選択したとき、選択した1つの第1基礎情報に対応する1つの対象情報を第1対象情報として抽出する。また、対象情報抽出部34は、複数の第1基礎情報を選択したとき、選択したそれぞれの第1基礎情報に対応する対象情報をそれぞれ第1対象情報として抽出してもよい。
<First target information extraction step S34>
The target information extraction unit 34 selects the first basic information from a plurality of basic information based on the calculated basic information similarity, refers to each relevance database, and corresponds to the selected first basic information. Information is extracted as the first target information. When one first basic information is selected from a plurality of basic information, the target information extraction unit 34 extracts one target information corresponding to the selected first basic information as the first target information. Further, when the target information extraction unit 34 selects a plurality of first basic information, the target information corresponding to each of the selected first basic information may be extracted as the first target information.
 対象情報抽出部34は、算出された基礎情報類似度に基づいて、複数の基礎情報に含まれるそれぞれの基礎情報ラベルから、第1基礎情報として選択してもよい。対象情報抽出部34は、選択した基礎情報ラベル(第1基礎情報)から、関連性データベースに記憶された基礎情報ラベルに対応する対象情報を第1対象情報として抽出してもよい。例えば、対象情報抽出部34は、基礎情報ラベル21を選択し、選択した基礎情報ラベル21から、関連性データベースに記憶された基礎情報ラベル21に対応する対象情報B1を第1対象情報として抽出してもよい。基礎情報ラベルは、文字列からなるため、文章情報を有する基礎情報を記憶させるよりも、基礎情報類似度算出用データベースの容量を低減することができる。 The target information extraction unit 34 may select as the first basic information from the respective basic information labels included in the plurality of basic information based on the calculated basic information similarity. The target information extraction unit 34 may extract the target information corresponding to the basic information label stored in the relevance database as the first target information from the selected basic information label (first basic information). For example, the target information extraction unit 34 selects the basic information label 21, and extracts the target information B1 corresponding to the basic information label 21 stored in the relevance database as the first target information from the selected basic information label 21. You may. Since the basic information label is composed of a character string, the capacity of the basic information similarity calculation database can be reduced as compared with storing the basic information having the text information.
 図9の例では、対象情報抽出部34は、基礎情報類似度を算出した結果、「基礎情報x1」については、最も高い基礎情報類似度を算出した「基礎情報e1」を第1基礎情報として選択する。第1基礎情報として選択するとき、基礎情報類似度に閾値を設定し、その閾値以上又は以下の基礎情報類似度を算出した基礎情報を選択してもよい。この閾値は、ユーザ側で適宜設定することができる。同様に、対象情報抽出部34は、「基礎情報x2」については「基礎情報f1」を、「基礎情報x3」については「基礎情報f3」を、「基礎情報x4」については「基礎情報g2」を、「基礎情報x5」については「基礎情報h2」を、それぞれ第1基礎情報として選択する。 In the example of FIG. 9, as a result of calculating the basic information similarity, the target information extraction unit 34 uses the "basic information e1" for which the highest basic information similarity is calculated as the first basic information for the "basic information x1". select. When selecting as the first basic information, a threshold value may be set for the basic information similarity, and the basic information obtained by calculating the basic information similarity equal to or lower than the threshold value may be selected. This threshold value can be appropriately set on the user side. Similarly, the target information extraction unit 34 has "basic information f1" for "basic information x2", "basic information f3" for "basic information x3", and "basic information g2" for "basic information x4". , And for "basic information x5", "basic information h2" is selected as the first basic information.
 そして、対象情報抽出部34は、第1関連性データベースを参照し、第1基礎情報として選択した「基礎情報e1」に対応する「対象情報E1」を第1対象情報として抽出する。同様に、対象情報抽出部34は、第2関連性データベースを参照し、第1基礎情報として選択した「基礎情報f1」に対応する「対象情報F1」を、第1基礎情報として選択した「基礎情報f3」に対応する「対象情報F3」を、それぞれ第1対象情報として抽出する。対象情報抽出部34は、第3関連性データベースを参照し、第1基礎情報として選択した「基礎情報g2」に対応する「対象情報G2」を第1対象情報として抽出する。対象情報抽出部34は、第4関連性データベースを参照し、第1基礎情報として選択した「基礎情報h2」に対応する「対象情報H2」を第1対象情報として抽出する。 Then, the target information extraction unit 34 refers to the first relevance database and extracts the "target information E1" corresponding to the "basic information e1" selected as the first basic information as the first target information. Similarly, the target information extraction unit 34 refers to the second relevance database and selects the "target information F1" corresponding to the "basic information f1" selected as the first basic information as the first basic information "basic". The "target information F3" corresponding to the "information f3" is extracted as the first target information, respectively. The target information extraction unit 34 refers to the third relevance database and extracts the "target information G2" corresponding to the "basic information g2" selected as the first basic information as the first target information. The target information extraction unit 34 refers to the fourth relevance database and extracts the "target information H2" corresponding to the "basic information h2" selected as the first basic information as the first target information.
 対象情報抽出部34は、算出された対象情報類似度に基づいて、複数の対象情報に含まれる対象情報ラベルから、1又は複数の対象情報ラベルを選択してもよい。 The target information extraction unit 34 may select one or a plurality of target information labels from the target information labels included in the plurality of target information based on the calculated target information similarity.
 本実施形態によれば、基礎コンテンツがチャンク構造に分割された複数の基礎情報と、属性毎に分類された対象コンテンツがチャンク単位に分割された複数の対象情報と、が記憶される関連性データベースを複数備える。 According to the present embodiment, a relevance database that stores a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which the target content classified by attribute is divided into chunk units. It is provided with a plurality of.
 これにより、本実施形態によれば、属性毎に分類されたそれぞれの関連性データベースを参照し、第1基礎情報に対応する対象情報をそれぞれ第1対象情報として抽出する。このため、特定の基礎情報に新しい情報が含まれる場合や変更があった場合には、第1対象情報として抽出した箇所が、マニュアル等の対象コンテンツにおける編集箇所に該当することになるため、ユーザは対象コンテンツにおける編集箇所を容易に把握することができる。このため、マニュアル等の対象コンテンツを編集する際には、第1対象情報として抽出された対象情報を編集するだけでよく、対象コンテンツの編集作業を短時間で行うことができる。 Thereby, according to the present embodiment, the target information corresponding to the first basic information is extracted as the first target information by referring to each relevance database classified for each attribute. Therefore, when new information is included in the specific basic information or when there is a change, the part extracted as the first target information corresponds to the edited part in the target content such as the manual. Can easily grasp the edited part in the target content. Therefore, when editing the target content such as a manual, it is only necessary to edit the target information extracted as the first target information, and the editing work of the target content can be performed in a short time.
 特に、基礎コンテンツとして取得した仕様書から、マニュアル、広告、FAQとしてのそれぞれの対象コンテンツを編集する場合、第1対象情報として抽出した箇所が、マニュアル、広告、FAQとしてのそれぞれの対象コンテンツにおける編集箇所に該当することになる。このため、ユーザはマニュアル、広告、FAQ等の属性毎に、それぞれの対象コンテンツにおける編集箇所を容易に把握することができる。 In particular, when editing each target content as a manual, advertisement, or FAQ from the specifications acquired as basic content, the part extracted as the first target information is edited in each target content as manual, advertisement, or FAQ. It will correspond to the place. Therefore, the user can easily grasp the edited part in each target content for each attribute such as the manual, the advertisement, and the FAQ.
 <情報提供装置1の第3例>
 情報提供装置1の第3例では、更にアクセス制御部を備える点で、第1例と相違する。アクセス制御部は、例えば、CPU101が、RAM103を作業領域として、保存部104等に記憶されたプログラムを実行することにより実現される。
<Third example of information providing device 1>
The third example of the information providing device 1 is different from the first example in that it further includes an access control unit. The access control unit is realized, for example, by the CPU 101 executing a program stored in the storage unit 104 or the like with the RAM 103 as a work area.
 アクセス制御部は、各種データベースや対象コンテンツに対するアクセスを制御する。アクセスは、完全アクセス、読み取りアクセス及び書き込みアクセス、コメント専用アクセス、読み取り専用アクセス、並びアクセス禁止を含む。アクセス制御部は、アクセス制御情報に基づいて行われる。アクセス制御情報は、ユーザ名と、各ユーザ名に割り当てられるアクセスと、を含む。アクセス制御情報は、例えば、保存部104に保存される。 The access control unit controls access to various databases and target contents. Access includes full access, read and write access, comment-only access, read-only access, and line-of-sight access prohibition. The access control unit is performed based on the access control information. The access control information includes a user name and the access assigned to each user name. The access control information is stored in, for example, the storage unit 104.
 ユーザが完全アクセスを割当てられると、そのユーザは各種データベースや対象コンテンツに対して完全な読み取り及び書き込みアクセスを有し、さらにそのユーザは、ユーザインターフェースの任意の態様を使用できる。例えば、完全アクセスの場合、ユーザはデータベース構成を変更できる。ユーザが読み取り及び書き込みアクセスを有している場合、ユーザは読み取り及び書き込みを対象コンテンツに対して有するが、データベース構成を変更できない。コメント専用アクセスの場合、ユーザはコメントを対象コンテンツに挿入できるが、各種データベースや対象コンテンツを変更できない。読み取り専用アクセスの場合、ユーザは対象コンテンツを閲覧できるが、各種データベースやその対象コンテンツに変更を加えることはできず、またいかなるコメントも挿入できない。 When a user is assigned full access, the user has full read and write access to various databases and target content, and the user can use any aspect of the user interface. For example, for full access, the user can change the database configuration. If the user has read and write access, the user has read and write to the target content, but cannot change the database configuration. In the case of comment-only access, the user can insert comments into the target content, but cannot change various databases or target content. With read-only access, the user can view the target content, but cannot make changes to the various databases or the target content, and cannot insert any comments.
 例えば特定の基礎情報に基づいて新たな対象コンテンツを生成し、生成した新たな対象コンテンツを編集するとする。このとき、本実施形態によれば、アクセス制御部を更に備える。これにより、アクセス制御情報に基づいて、複数のユーザのうち特定の1又は複数のユーザが所定のアクセスを行うことができる。すなわち、各種データベースや対象コンテンツを利用する複数のユーザに対して、読み取り専用、完全アクセスが可能等の編集種別のコントロールと、ユーザの属性に基づく権限とを結び付けて、各種データベースや対象コンテンツ毎に管理することができる。特に、閲覧のみは同時にアクセス可能としつつ、書き込み等の編集に関しては権限を有するユーザにのみ許可することによって、意図しない編集を防ぐことができる。 For example, suppose that a new target content is generated based on specific basic information and the generated new target content is edited. At this time, according to the present embodiment, an access control unit is further provided. Thereby, a specific one or a plurality of users among the plurality of users can perform a predetermined access based on the access control information. That is, for multiple users who use various databases and target contents, the control of the editing type such as read-only and full access is linked with the authority based on the user's attributes, and for each of various databases and target contents. Can be managed. In particular, it is possible to prevent unintended editing by allowing only authorized users to edit such as writing while making only browsing accessible at the same time.
 本発明の実施形態を説明したが、実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 Although the embodiment of the present invention has been described, the embodiment is presented as an example and is not intended to limit the scope of the invention. These novel embodiments can be implemented in various other embodiments, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are also included in the scope of the invention described in the claims and the equivalent scope thereof.
1   :情報提供装置
5   :ユーザ端末
6   :サーバ
7   :公衆通信網
10  :筐体
15  :入力部
16  :出力部
17  :記憶部
18  :制御部
31  :基礎情報取得部
32  :基礎情報比較部
33  :基礎情報類似度算出部
34  :対象情報抽出部
35  :対象情報類似度算出部
100 :情報提供システム
101 :CPU
102 :ROM
103 :RAM
104 :保存部
105 :I/F
106 :I/F
107 :I/F
108 :入力部分
109 :出力部分
110 :内部バス
S31 :基礎情報取得部ステップ
S32 :基礎情報比較ステップ
S33 :基礎情報類似度算出ステップ
S34 :第1対象情報抽出ステップ
S35 :対象情報類似度算出ステップ
S36 :第2対象情報抽出ステップ
1: Information providing device 5: User terminal 6: Server 7: Public communication network 10: Housing 15: Input unit 16: Output unit 17: Storage unit 18: Control unit 31: Basic information acquisition unit 32: Basic information comparison unit 33 : Basic information similarity calculation unit 34: Target information extraction unit 35: Target information similarity calculation unit 100: Information providing system 101: CPU
102: ROM
103: RAM
104: Preservation unit 105: I / F
106: I / F
107: I / F
108: Input part 109: Output part 110: Internal bus S31: Basic information acquisition unit Step S32: Basic information comparison step S33: Basic information similarity calculation step S34: First target information extraction step S35: Target information similarity calculation step S36 : Second target information extraction step

Claims (3)

  1.  基礎コンテンツがチャンク構造に分割された複数の基礎情報と、前記基礎コンテンツとは異なる属性の対象コンテンツがチャンク構造に分割された複数の対象情報と、を対応させて記憶される関連性データベースと、
     複数の前記基礎情報を用いて機械学習により構築される基礎情報類似度算出用データベースと、
     特定の基礎情報を取得する基礎情報取得手段と、
     前記基礎情報と、前記特定の基礎情報とを比較する基礎情報比較手段と、
     前記基礎情報比較手段により前記基礎情報と前記特定の基礎情報とが一致しない場合、前記基礎情報類似度算出用データベースを参照し、前記基礎情報と、前記特定の基礎情報と、の類似度を示す基礎情報類似度を算出する基礎情報類似度算出手段と、
     前記基礎情報類似度に基づいて、複数の前記基礎情報から第1基礎情報を選択し、前記関連性データベースを参照し、前記第1基礎情報に対応する前記対象情報を第1対象情報として抽出する対象情報抽出手段と、を備えること
     を特徴とする情報提供システム。
    A relevance database in which a plurality of basic information in which the basic content is divided into chunk structures and a plurality of target information in which target content having an attribute different from the basic content is divided into chunk structures are stored in association with each other.
    A database for calculating basic information similarity constructed by machine learning using a plurality of the basic information,
    Basic information acquisition means to acquire specific basic information,
    A basic information comparison means for comparing the basic information with the specific basic information,
    When the basic information and the specific basic information do not match by the basic information comparison means, the basic information similarity calculation database is referred to, and the similarity between the basic information and the specific basic information is shown. Basic information similarity calculation means for calculating basic information similarity,
    Based on the basic information similarity, the first basic information is selected from the plurality of basic information, the relevance database is referred to, and the target information corresponding to the first basic information is extracted as the first target information. An information providing system characterized by having a target information extraction means.
  2.  複数の前記対象情報を用いて機械学習により構築された対象情報類似度推定処理用データベースと、
     前記対象情報類似度推定処理用データベースを参照し、前記対象情報と、前記対象情報抽出手段により抽出された前記第1対象情報と、の類似度を示す対象情報類似度を算出する対象情報類似度算出手段と、を備え、
     前記対象情報抽出手段は、前記対象情報類似度に基づいて、前記第1対象情報とは異なる第2対象情報を更に抽出すること
     を特徴とする請求項1記載の情報提供システム。
    A database for target information similarity estimation processing constructed by machine learning using a plurality of the target information,
    Target information similarity that refers to the target information similarity estimation processing database and calculates the target information similarity that indicates the similarity between the target information and the first target information extracted by the target information extraction means. With a calculation means,
    The information providing system according to claim 1, wherein the target information extracting means further extracts a second target information different from the first target information based on the target information similarity.
  3.  複数の前記基礎情報と、属性ごとに分類された前記対象コンテンツがチャンク構造に分割された複数の前記対象情報と、を対応させて記憶される前記関連性データベースを複数備えること
     を特徴とする請求項2記載の情報提供システム。
    A claim characterized by including a plurality of the relevance databases stored in association with a plurality of the basic information and a plurality of the target information in which the target content classified by an attribute is divided into chunk structures. The information providing system described in Item 2.
PCT/JP2020/026693 2019-10-31 2020-07-08 Information-providing system WO2021084807A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2019-199259 2019-10-31
JP2019199259A JP6703698B1 (en) 2019-10-31 2019-10-31 Information provision system

Publications (1)

Publication Number Publication Date
WO2021084807A1 true WO2021084807A1 (en) 2021-05-06

Family

ID=70858073

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2020/026693 WO2021084807A1 (en) 2019-10-31 2020-07-08 Information-providing system

Country Status (2)

Country Link
JP (1) JP6703698B1 (en)
WO (1) WO2021084807A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1145238A (en) * 1997-07-28 1999-02-16 Just Syst Corp Document management system and computer readable recording medium for recording program for having computer function as the same system
JP2010191519A (en) * 2009-02-16 2010-09-02 Nec Corp Document management device, method, and program
JP2012185667A (en) * 2011-03-04 2012-09-27 Takara Printing Co Ltd Financial document parallel translation display system
US20140372452A1 (en) * 2012-06-28 2014-12-18 International Business Machines Corporation Managing changes to one or more files via linked mapping records
JP6651189B1 (en) * 2019-03-29 2020-02-19 株式会社 情報システムエンジニアリング Data structure, learning method and information providing system for machine learning

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1145238A (en) * 1997-07-28 1999-02-16 Just Syst Corp Document management system and computer readable recording medium for recording program for having computer function as the same system
JP2010191519A (en) * 2009-02-16 2010-09-02 Nec Corp Document management device, method, and program
JP2012185667A (en) * 2011-03-04 2012-09-27 Takara Printing Co Ltd Financial document parallel translation display system
US20140372452A1 (en) * 2012-06-28 2014-12-18 International Business Machines Corporation Managing changes to one or more files via linked mapping records
JP6651189B1 (en) * 2019-03-29 2020-02-19 株式会社 情報システムエンジニアリング Data structure, learning method and information providing system for machine learning

Also Published As

Publication number Publication date
JP6703698B1 (en) 2020-06-03
JP2021071992A (en) 2021-05-06

Similar Documents

Publication Publication Date Title
US11657231B2 (en) Capturing rich response relationships with small-data neural networks
CN108287858B (en) Semantic extraction method and device for natural language
US20240028651A1 (en) System and method for processing documents
US20170116203A1 (en) Method of automated discovery of topic relatedness
CN104679826B (en) The method and system of context identification based on disaggregated model
KR102285142B1 (en) Apparatus and method for recommending learning data for chatbots
CN112749547A (en) Generation of text classifier training data
KR20210090576A (en) A method, an apparatus, an electronic device, a storage medium and a program for controlling quality
JP2017041171A (en) Test scenario generation support device and test scenario generation support method
US11520835B2 (en) Learning system, learning method, and program
CA3207902A1 (en) Auditing citations in a textual document
US20230030086A1 (en) System and method for generating ontologies and retrieving information using the same
CN117420998A (en) Client UI interaction component generation method, device, terminal and medium
CN117077679A (en) Named entity recognition method and device
WO2021084807A1 (en) Information-providing system
CN107908792B (en) Information pushing method and device
JP6936370B1 (en) Information processing system and information processing program
CN111753199B (en) User portrait construction method and device, electronic device and medium
JP6817690B2 (en) Extraction device, extraction method and its program, support device, display control device
CN114676155A (en) Code prompt information determining method, data set determining method and electronic equipment
CN117501283A (en) Text-to-question model system
CN113901793A (en) Event extraction method and device combining RPA and AI
JPWO2020099986A1 (en) Content classification method
CN113688615A (en) Method, device and storage medium for generating field annotation and understanding character string
Mande et al. Regular Expression Rule-Based Algorithm for Multiple Documents Key Information Extraction

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20882640

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20882640

Country of ref document: EP

Kind code of ref document: A1