WO2015056699A1 - Information extraction device and information extraction program - Google Patents

Information extraction device and information extraction program Download PDF

Info

Publication number
WO2015056699A1
WO2015056699A1 PCT/JP2014/077403 JP2014077403W WO2015056699A1 WO 2015056699 A1 WO2015056699 A1 WO 2015056699A1 JP 2014077403 W JP2014077403 W JP 2014077403W WO 2015056699 A1 WO2015056699 A1 WO 2015056699A1
Authority
WO
WIPO (PCT)
Prior art keywords
component
extraction
associative
mapped
output
Prior art date
Application number
PCT/JP2014/077403
Other languages
French (fr)
Japanese (ja)
Inventor
太輔 小田嶋
健一 新城
ヴェトハ グェン
Original Assignee
株式会社エーエヌラボ
太輔 小田嶋
健一 新城
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 株式会社エーエヌラボ, 太輔 小田嶋, 健一 新城 filed Critical 株式会社エーエヌラボ
Publication of WO2015056699A1 publication Critical patent/WO2015056699A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing

Definitions

  • a conventional search system simply outputs information obtained by keyword search and filtering as a list, and then leaves it to the user to select. In other words, there are many similar search results, and many similar words are included in the search results. It was a burden on the user side. In this way, even though the search results are presented, the reason why the search results have come out and the story are not presented, so you can select the one that suits your purpose from the filtered search results It was something that had to be done.
  • a keyword input means for inputting a keyword for searching for data by keyword
  • a data input means for inputting data for inputting data
  • a data storage means for storing a data feature amount and a keyword together with registered data
  • the keyword input A keyword search unit that searches for data by combining keywords input from the unit, a feature amount / keyword extraction unit that automatically extracts keywords and feature amounts from registered data, and a result of search performed by the keyword search unit
  • search result narrowing means for narrowing down without using a keyword (see Patent Document 1).
  • the problem to be solved by the present invention is not only the output of the result, but also the reason why the result is output for improving the user's satisfaction with the output of the result. It is an object to provide an information extraction apparatus and an information extraction program capable of presenting whether or not it has been reached.
  • the invention according to claim 1 of the present invention is an information extraction device, wherein a start component specifying means for specifying a start component to be extracted, and the start component specifying means Mapping means for mapping from the start component specified by the extraction result to the extraction result, reason extraction means for extracting the reason why the extraction result is mapped from the start component by the mapping means, and mapping by the mapping means And an output means for outputting the extraction result obtained and the reason extracted by the reason extraction means.
  • the mapping means extracts an associative component extraction means for extracting an associative component mapped to the start component specified by the start component specifying means, and the association Goal component extraction means for extracting a goal component mapped to the associative component extracted by the component extraction means, and an extraction result mapped to the goal component extracted by the goal component extraction means It is characterized by comprising a result extracting means for extracting.
  • the associative component extracting means is an information table in which an associative component associated with a start component is mapped, and the goal component extracting means is associated with an associative component.
  • the target component is extracted based on the mapped information table, and the output is performed on the basis of at least the associative component among a series of mapped start component, associative component, and goal component associative flow. It outputs by a means.
  • a weighting setting unit for which a predetermined weighting is set for each viewpoint, and the priority of each reason is totaled after reflecting the weighting set by the weighting setting unit.
  • a priority totalization unit and an output rank determination unit that determines an output rank of reasons in descending order of priorities tabulated by the priority tabulation unit, and the output unit outputs a reason for each extraction result, The reason for outputting for each extraction result is output according to the output order determined by the output order determining means.
  • a computer includes a start component specifying step for specifying a start component to be extracted, and a start component specified by the start component specifying step to an extraction result.
  • An output step for outputting the extracted reason is realized.
  • the associative component extracting step of extracting an associative component mapped to the start component specified by the start component specifying step, and the association A goal component extraction step for extracting a goal component mapped to the associative component extracted by the component extraction step, and an extraction result mapped to the goal component extracted by the goal component extraction step. And a result extracting step for extracting.
  • the associative component extracting step is an information table in which an associative component associated with a start component is mapped, and the goal component extracting step is associated with an associative component.
  • the target component is extracted based on the mapped information table, and the output is performed on the basis of at least the associative component among a series of mapped start component, associative component, and goal component associative flow. It outputs by a step.
  • the information table is prepared for each viewpoint, and the extraction in the associative component extraction step and the goal component extraction step uses only information tables belonging to the same viewpoint.
  • the reason why the data is output by the output step includes the above viewpoint.
  • the associative component associated with the start component is mapped in addition to the association component associated with the start component.
  • at least one or more other associative components are mapped, and the associative component extraction step is mapped to the start component specified by the start component specifying step. Extracting the associated associative component, using the information table in which other associative components associated with the associative component are mapped, extracting the associated associative component, and the goal component extracting step Is the association component finally extracted by the association component extraction step. Characterized in that extracts a goal component that is mappings.
  • a weight setting step in which a predetermined weight is set for each of the viewpoints, and the priority of each reason is totaled after reflecting the weight set by the weight setting step.
  • the present invention in order to output not only a simple extraction result but also a series of associative flows from which the extraction result is derived as a reason, the user's satisfaction with the output of the extraction result is improved. There is an effect that can be made.
  • FIG. 1 is a block diagram illustrating an internal configuration of an information extraction apparatus 101 in Embodiment 1.
  • FIG. It is the flowchart figure which showed the process sequence which outputs the extraction result in the information extraction device 101 in Example 1, and the reason that the extraction result came to be derived. It is explanatory drawing which showed mapping of the component etc. to the reason which came to derive the extraction result from the start component in Example 1.
  • FIG. FIG. 6 is an explanatory diagram showing an overall system configuration in Embodiment 2. It is the block diagram which showed the internal structure of the information extraction apparatus 101 in Example 2.
  • FIG. It is the flowchart figure which showed the process sequence which outputs the extraction result in the information extraction device 101 in Example 2, and the reason that the extraction result came to be derived. The output example of the reason which came to derive the extraction result in Example 2 and the extraction result is shown.
  • the information extraction apparatus 101 is a computer including a control unit such as a CPU and a storage unit such as a RAM and a ROM.
  • FIG. 1 is a block diagram showing the internal configuration of the information extraction apparatus 101 in this embodiment.
  • the specific internal configuration is associated with the target information specifying unit 201 that specifies information to be extracted, the start component specifying unit 202 that specifies the start component from the information to be extracted, the start component of the information table, and the like.
  • An associative component extraction unit 203 that extracts an associative component mapped to the start component based on the mapping information of the component, and an associative component based on the mapping information of the associative component and the goal component of the information table
  • the goal component extraction unit 204 that extracts the mapped goal component, the extraction result mapped to the goal component based on the mapping information of the goal component and the extraction result, and the reason why the extraction result has been derived Extraction result and reason extraction unit 205, extraction results and extraction results are derived
  • An output table 206 that outputs the reason for reaching the screen by screen display, printing, etc., an information table prepared for each viewpoint by mapping the start component and the associative component, and the associative component and the goal component,
  • the information table management unit 207 manages information on the mapping between the goal component and the extraction result.
  • FIG. 2 is a flowchart showing a processing procedure for outputting the extraction result and the reason why the extraction result has been derived in the information extraction apparatus 101 according to the present embodiment.
  • the information to be extracted is specified by the target information specifying unit 201 (step S102).
  • the start component specifying unit 202 specifies the start component from the information to be extracted (step S103).
  • an associative component associated with the start component is mapped and an information table prepared for each viewpoint is searched. It is determined whether or not the identified start component is hit (step S104).
  • step S109 the process proceeds to step S109, and the process ends.
  • the associative component associated with the start component is mapped and the associative component mapped to the start component is displayed in the information table prepared for each viewpoint. Extracted by the associative component extraction unit 203 (step S105).
  • the goal component associated with the association component is mapped and mapped to the association component in the information table prepared for each viewpoint.
  • the goal component extracted by the goal component extraction unit 204 is extracted (step S106).
  • the result and reason extraction unit 205 extracts and maps the extraction result mapped to the goal component based on the mapping information of the goal component managed by the information table management unit 207 and the extraction result.
  • a series of association flows and viewpoints of the start component, the association component, and the goal component are extracted as reasons (step S107).
  • the start component, the associative component, the goal component, and the viewpoint are not limited to all reasons, and some of them may be extracted as the reason. For example, only the associative component, only the associative component and the viewpoint may be extracted as the reason.
  • the extraction result and the reason why the extraction result has been derived are output by the output unit 206 (step S108), and the process ends (step S109).
  • the start component is specified by the start component specifying unit 202, and from the specified start component to the extraction result. Mapping is performed, the extraction result and the reason why the extraction result is derived are extracted by the result and reason extraction unit 205, and output by the output unit 206.
  • the mapping from the start component to the extraction result is performed by the associative component extraction unit 203, the goal component extraction unit 204, and the result and reason extraction unit 205.
  • FIG. 3 is an explanatory diagram showing mapping of components and the like up to the reason why the extraction result is derived from the start component.
  • FIG. 3 there are four start components A1, A3, D1, and D3.
  • B1 associated with A1, B2 and B3 associated with A2, B3 associated with A3, and B4 associated with A4 are mapped.
  • the start component A1 since A1 is included in the information table AB, A1 in the information table AB that is associated with the information table AB is first mapped to B1 associated with A1, B1 is mapped to C1 associated with B1 in the information table BC.
  • B1 is an associative component
  • C1 is a goal component.
  • the start component A1, the associative component B1, and the goal component C1 are mapped to form a series of associative flows.
  • C1 is mapped to the extraction result X
  • X is extracted as the extraction result
  • the reason why the extraction result X is derived is that the viewpoint 1 and the associative component B1 are extracted as C1 related items. Is done.
  • the reason for the extraction result X includes C3 and G1 related reasons as shown in FIG.
  • a tag may be extracted by text analysis of the newspaper article and specified as a start component.
  • some start components included in the information table managed by the information table etc. management unit 207 and mapped to the associative components associated with the start components are presented and directly started.
  • the component may be specified by the user.
  • the identification of the start component means that the meta information attached to the extraction target information is decomposed into text data elements.
  • the start component is configured by a tag that defines meta information of content that is extraction target information.
  • an information table (information table AB / information table DE) that is prepared for each viewpoint, such as a place, a landscape, an attraction, a restaurant, etc., mapped with an associative component associated with a start component is searched. Thus, it is determined whether there are start components A1, A3, D1, and D3.
  • an associative component mapped to the start component is extracted. For example, in the case of the start component A1, since the start component A1 exists in the information table AB, B1 mapped to A1 in the information table AB is extracted as an associative component.
  • associative component extraction is performed on the basis of information in the information table that is prepared for each viewpoint and that maps the associative component associated with the start component managed by the information table management unit 207.
  • the unit 203 extracts an associative component mapped to the start component specified by the start component specifying unit 202.
  • the start component and the associative component are not only mapped 1: 1 but also mapped 1: N. May be.
  • B2 and B3 are mapped to A2. If A2 is a shop with a good atmosphere, out of the information tables prepared for each viewpoint, if viewpoint A is the atmosphere, in information table AB, for example, a shop with a beautiful night view of B2, a quiet meal of B3 It may be mapped to two of the shops that can.
  • the information table in which the start component and the associative component are mapped is not included in the same viewpoint. Since there is only one, it is mapped in the same information table. For example, any of E1 to E4 in the information table DE included in the viewpoint B having a different viewpoint is not mapped to A2 in the information table AB included in the viewpoint i.
  • the information table existing from a certain viewpoint includes an information table in which an associative component associated with the start component is mapped, and an information table in which the goal component associated with the associative component is mapped. It is not limited to two.
  • viewpoint B the association between the information table DE to which the associative component associated with the start component is mapped and the information table FG to which the goal component associated with the associative component is mapped.
  • the associative component constituting the reason why G1 is selected is not one associative component, but a set of a plurality of associative components E1 and F1.
  • the associative components and the other associative components are not only mapped 1: 1 but also 1 : May be mapped to N.
  • F2 and F3 are mapped to E2 in the information table EF.
  • the information table used for extraction of the associative component managed by the information table management unit 207 includes the associative component in addition to the mapping of the associative component associated with the start component. From the same viewpoint, at least one or more may be prepared in which other associative components associated with the above are mapped.
  • the associative component extraction unit 203 extracts the associative component mapped to the start component specified by the start component specifying unit 202
  • another associative component associated with the associative component is displayed.
  • the mapped associative component is further extracted using the mapped information table, and the goal component extracting unit 204 is mapped to the associative component finally extracted by the associative component extracting unit 203. Extract elements.
  • the goal component extraction unit 204 includes: The goal component mapped to the associative component extracted by the associative component extracting unit 203 is extracted.
  • the associative component and the goal associative component are not only mapped 1: 1 but also 1: N. It may be mapped.
  • C1 and C2 are mapped to B2 in the information table BC. For example, this is the case where ribs and loin are mapped to yakiniku.
  • the information table in which the associative component and the goal associative component are mapped is not included in the same viewpoint. Since there is only one, it is mapped in the same information table.
  • the result and reason extraction unit 205 extracts the extraction result mapped to the goal component extracted by the goal component extraction unit 204. More specifically, when the meta information attached to the content decomposed into the goal component and the text data element matches, the content is output as an extraction result. However, content that is obtained by storing mapping information between content as an extraction result and meta information attached to the content in the information table management unit 207 in advance and searching for the mapping information using the goal component as a keyword. May be output as an extraction result.
  • the extraction result is mapped to the goal component and is not directly mapped to the start component or the associative component. Further, the goal association component and the extraction result are not limited to the case of 1: 1 mapping. In the case of FIG. 3, the goal component C3 is mapped not only to the extraction result X but also to Y, and the extraction result X is mapped not only to the goal component C3 but also to C1 and G1.
  • ⁇ Output> The mapping from the start component specified by the start component specifying unit 202 to the extraction result is performed, and not only the extraction result but also the reason that the extraction result is mapped from the start component is the result and reason extraction unit 205. And output by the output unit 206.
  • X is extracted as a result of extracting the ramen shop X as an extraction result.
  • the reason is that the reason is that the store owner of X trained in the same store as the store owner of the ramen shop that the user likes in the past.
  • the reason for searching for the extraction result in other words, the connection between the information tables, is more important than searching for the extraction result from among many.
  • the associative component linking the information table DE and the information table EF but also the information table EF and the information table FG are linked.
  • the associative component is also output as the reason.
  • the output content only needs to include the extraction result and the reason why the extraction result has been derived, and various outputs are possible. In the case of FIG. 3, three extraction results of X, Y, and Z are output.
  • the reason for X the viewpoint i and the associative component B1
  • the viewpoint i and the associative component B3, and the viewpoint b and the associative components E1 and F1 are output.
  • the element B3 and two of viewpoint B and associative components E1 and F1 are output, and as the reason of Z, one of viewpoint B and associative components E3 and F3 is output.
  • the reason for output includes a viewpoint, an associative component, and the like.
  • the present invention is not limited to the case where viewpoints, associative components, etc. are simply displayed side by side, and a sentence including these may be generated and output as a reason.
  • the output result to be output is not limited to text, but may be contents such as newspaper articles, photos, videos, music, and the like. In addition to the content that is the extraction result, content related to the content may be output together.
  • the information table in the present invention is a table in which a certain word and an associative word associated with the word are mapped and registered.
  • An information table exists for each of one or more viewpoints.
  • the information table management unit 207 manages at least two information tables each mapping the associative component associated with the start component and the goal component associated with the associative component for each viewpoint.
  • the information table for each viewpoint is not limited to these two, and the information table in which other associative components associated with the associative components are mapped between the two information tables is 1 or Multiple may be included. For example, as in the example shown in FIG. 3, there are two information tables for viewpoint A, but there are three information tables for viewpoint B.
  • 3 is an information table DE in which an associative component associated with the start component is mapped, and an information table in which other associative components associated with the associative component are mapped.
  • the associative configuration in addition to the two information tables in which the associative component associated with the start component and the goal component associated with the associative component are respectively mapped, the associative configuration
  • the goal component is a tag that defines the meta information of the extraction result, so the extraction result is the goal component It is only associated with an element and not directly with an intermediate associative component.
  • the information table managed by the information table management unit 207 is not limited to the case where the information table is stored and saved in the information extraction apparatus 101.
  • the mapping information itself of the information table is the information of the information extraction apparatus 101. It may exist outside.
  • FIG. 4 is an explanatory diagram showing the overall system configuration in the present embodiment.
  • a user terminal 103 installed on the user side is connected to the information extraction device 101 via a communication network 102.
  • the information extraction apparatus 101 is shown as one and the user terminal 103 is shown as two, but each of these may have a larger number.
  • the communication network 102 can use various networks such as the Internet, a LAN (local area network), and a WAN (wide area network).
  • networks such as the Internet, a LAN (local area network), and a WAN (wide area network).
  • the information extraction apparatus 101 is a computer including a control unit such as a CPU and a storage unit such as a RAM and a ROM.
  • the user terminal 103 includes a control unit such as a CPU, a storage unit such as a RAM and a ROM, a display unit such as a liquid crystal screen, an input unit such as a keyboard, a communication unit that controls communication with the Internet, and the like.
  • FIG. 5 is a block diagram showing the internal configuration of the information extraction apparatus 101 in this embodiment.
  • the specific internal configuration is associated with a target information specifying unit 301 that specifies information to be extracted, a start component specifying unit 302 that specifies a start component from the information to be extracted, a start component of an information table, and the like.
  • An associative component extracting unit 303 that extracts an associative component mapped to the start component based on the mapping information of the component, and an associative component based on the mapping information of the associative component and the goal component of the information table
  • the goal component extraction unit 304 that extracts the mapped goal component, the extraction result mapped to the goal component based on the mapping information of the goal component and the extraction result, and the reason why the extraction result has been derived Results and reason extraction unit 305, between viewpoints, between start components, associations Priority aggregation unit 306 that aggregates the priority of extraction results and reasons based on weighting between components or between extraction results, and output rank determination unit 307 that determines output ranks in descending order of the aggregated priorities
  • the output unit 308 for outputting the extraction result and the reason that the extraction result has been derived according to the output order that has been output by screen display or printing, the start component and the associative component, and the associative component and the goal component, respectively Information table that is mapped and prepared for each viewpoint
  • FIG. 6 is a flowchart showing a processing procedure for outputting the extraction result and the reason why the extraction result is derived in the information extraction apparatus 101 in this embodiment.
  • the target information specifying unit 301 specifies information to be extracted (step S202).
  • the user information management unit 310 determines the user's orientation. Specify (step S203).
  • the user's orientation may be specified by, for example, presenting a plurality of user's orientations and having the user select and input them.
  • the user orientation in the present embodiment means the direction in which the user's consciousness to extract information is directed, and more specifically, the user's extraction motivation, the user Refers to the attributes of Based on the user's orientation, the weighting setting unit 311 sets the weighting that varies for each viewpoint (step S204).
  • the weighting is not limited to between the viewpoints, and may be set between the start components, between the associative components, or between the extraction results.
  • the start component may be weighted by asking the user which of the start components is important. Further, the weighting may vary depending on user attributes and the like.
  • the start component is specified by the start component specifying unit 302 from the extraction target information specified by the target information specifying unit 301 (step S205).
  • the information table managed by the information table management unit 309 is searched for an information table prepared for each viewpoint by mapping the associative component associated with the start component, in step S205. It is determined whether or not the identified start component is hit (step S206).
  • step S211 the process proceeds to step S211, and the process ends.
  • the associative component associated with the start component is mapped and the associative component mapped to the start component is displayed in the information table prepared for each viewpoint. Extracted by the associative component extraction unit 303.
  • the goal component associated with the associative component is mapped and mapped to the associated component in the information table prepared for each viewpoint.
  • the goal component extraction unit 304 extracts the goal component that has been set.
  • the extraction result mapped to the goal component, the mapped start component, associative component, and goal extracts the result of the series of associations and viewpoints of the constituent elements (step S207).
  • the start component, the associative component, the goal component, and the viewpoint are not limited to all reasons, and some of them may be extracted as the reason. For example, only the associative component, only the associative component and the viewpoint may be extracted as the reason.
  • the priority of the extraction result and the reason is totalized by the priority totaling unit 306 based on the weighting between the viewpoints, between the start components, between the associative components, or between the extraction results (step S208).
  • the output rank determination unit 307 determines the output rank of the extraction result and the reason in descending order of the aggregated priority (step S209). Then, according to the determined output order, the extraction result and the reason why the extraction result has been derived are output by the output unit 308 (step S210), and the process ends (step S211).
  • the information extraction apparatus 101 stores and stores information on user orientation, including user extraction motives and user attributes, and specifies or stores user information.
  • an output rank determination unit 307 for determining the output rank of the extraction results and reasons in descending order of the priority tabulated by the priority tabulation unit 306, and the output unit 308 outputs the reasons for each extraction result, The reason for outputting for each extraction result is output according to the output order determined by the output order determining unit 307.
  • the extraction result output by the output unit 308 is output in accordance with the output order determined by the output order determination unit 307. Further, the priority aggregation by the priority aggregation unit 306 is performed by a kind of point calculation based on the weighting between viewpoints set by the weight setting unit 311. The weighting may be dynamically changed by the user, and may be further changed by the information extraction apparatus 101 learning depending on the usage status of the user.
  • FIG. 7 shows an output example of the extraction result in the second embodiment and the reason why the extraction result has been derived.
  • the target of the extraction is “I want to find a restaurant.”
  • the intention of the user is “To date her on a diet. Both of them work in a place where there is a lot of overtime, and they live in different locations but live along the Tokaido Line.
  • store A and store B are output as extraction results, and the reason why these extraction results have been derived is as follows. Is an example of output from the viewpoint of “location” and “atmosphere”.
  • the reason why the store A is extracted is output from three viewpoints, but in order of “accompanying person”, “location”, and “atmosphere”, reflecting the weighting for each viewpoint.
  • the information extraction apparatus 101 is a user concerned.
  • the user's intentions about what kind of object he wants to convey for what purpose is asked, and the photographer's name is output as the extraction result based on the answer.
  • the photographer's photograph can logically convey the importance of environmental issues to men and women in their 30s, it is possible to output the photographer's photograph and so on.
  • the photographer's name may be output as an extraction result, and the photograph taken by the photographer may be output together.
  • the result based on the user's intention is not presented, and the search result is presented based on a judgment criterion completely unrelated to the user's intention.
  • the extraction result and the priority of the reason can be flexibly changed, and extraction based on the user's orientation Present results and reasons. That is, it is possible to automatically change the viewpoint according to the user's intentionality and output.
  • the process until obtaining the extraction result is close to human thought, and there is an effect that the user's satisfaction with the output result is further improved. It is possible to present a series of associative flow of thinking up to the extraction result to the user as a story.
  • the object of the present invention is to supply a software program that implements the functions of the first and second embodiments to a system or apparatus, and the computer of the system or apparatus reads and executes the software program recorded on a storage medium. Is also achieved.
  • the program itself read from the storage medium realizes the novel function of the present invention, and the program and the storage medium storing the program constitute the present invention.
  • a storage medium for recording a software program for example, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.
  • the program read from the storage medium is written to a memory provided in a function expansion board inserted into the computer of the system or apparatus or a function expansion unit connected to the computer
  • the program This includes the case where the CPU or the like provided in the function expansion board or unit performs part or all of the actual processing based on the above instruction and the above functions are realized by the processing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

[Problem] To provide an information extraction device and an information extraction program capable of not only outputting a result, but also presenting information about a reason which has led to the outputting of the result, in order to improve the understanding of how the result was outputted. [Solution] The information extraction device is provided with: a start constituent element specification means for specifying a start constituent element to be extracted; a mapping means for mapping from the start constituent element specified by the start constituent element specification means to an extraction result; a reason extraction means for extracting a reason which has led to the mapping of the extraction result, by the mapping means, from the start constituent element; and an output means for outputting the extraction result mapped by the mapping means and the reason extracted by the extraction means.

Description

情報抽出装置及び情報抽出プログラムInformation extraction apparatus and information extraction program
 本発明は、情報抽出処理を行う情報抽出装置及び情報抽出プログラムに関するものである。詳しくは、抽出結果だけでなく、その結果が導かれた理由も出力する情報抽出装置及び情報抽出プログラムに関するものである。 The present invention relates to an information extraction apparatus and an information extraction program for performing an information extraction process. Specifically, the present invention relates to an information extraction apparatus and an information extraction program that output not only the extraction result but also the reason why the result is derived.
 従来の検索システムは、情報をキーワード検索してフィルタリングしたものをリストとして出力するだけで、後は利用者の選択に任せるというものであった。つまり、検索すると似たようなものが検索結果として多く出てきて、かつ、その結果には多くの同じような言葉が含まれているので、利用者がそれらを見て自分が求めているものかどうかを自分で判断しなければならず、利用者側の負担が大きいものであった。
 このように、検索結果は提示されても、その検索結果が出てきた理由やストーリーといったものが提示されないので、フィルタリングして絞った検索結果の中から自分の目的に合ったものを自分で選択しなければならないものであった。
A conventional search system simply outputs information obtained by keyword search and filtering as a list, and then leaves it to the user to select. In other words, there are many similar search results, and many similar words are included in the search results. It was a burden on the user side.
In this way, even though the search results are presented, the reason why the search results have come out and the story are not presented, so you can select the one that suits your purpose from the filtered search results It was something that had to be done.
 例えば、キーワードによりデータを検索するデータ検索システムにおいて、キーワードを入力するキーワード入力手段と、データを入力するデータ入力手段と、登録データとともにデータ特徴量とキーワードを記憶するデータ記憶手段と、前記キーワード入力手段から入力されたキーワードを組み合わせてデータを検索するキーワード検索手段と、登録データからキーワードや特徴量を自動的に抽出する特徴量・キーワード抽出手段と、前記キーワード検索手段によって検索した結果に対し、キーワードを使わず絞り込む検索結果絞り込み手段とを備えたことを特徴とするデータ検索システムが提供されている(特許文献1を参照)。 For example, in a data search system for searching for data by keyword, a keyword input means for inputting a keyword, a data input means for inputting data, a data storage means for storing a data feature amount and a keyword together with registered data, and the keyword input A keyword search unit that searches for data by combining keywords input from the unit, a feature amount / keyword extraction unit that automatically extracts keywords and feature amounts from registered data, and a result of search performed by the keyword search unit, There has been provided a data search system including search result narrowing means for narrowing down without using a keyword (see Patent Document 1).
特開2000-48041号公報JP 2000-48041 A
 従来の検索システムでは、結果や関連キーワードは出力されても、その結果が出力されるまでの理由が提示されない。そのために、出力された結果が利用者にとって本当に意味のあるものかどうかがなかなか判断できないという問題があった。
 そこで本発明が解決しようとする課題は、単なる結果の出力だけでなく、その結果が出力されたことに対する利用者の納得性を向上させるため、結果がどのような理由に基づいて出力されるに至ったのかを提示することができる情報抽出装置及び情報抽出プログラムを提供することである。
In a conventional search system, even if a result or a related keyword is output, the reason until the result is output is not presented. Therefore, there is a problem that it is difficult to judge whether the output result is really meaningful to the user.
Therefore, the problem to be solved by the present invention is not only the output of the result, but also the reason why the result is output for improving the user's satisfaction with the output of the result. It is an object to provide an information extraction apparatus and an information extraction program capable of presenting whether or not it has been reached.
 上記の課題を解決するために本発明の請求項1に記載の発明は、情報抽出装置であって、抽出対象となるスタート構成要素を特定するスタート構成要素特定手段と、前記スタート構成要素特定手段によって特定されたスタート構成要素から抽出結果までのマッピングを行うマッピング手段と、前記マッピング手段によってスタート構成要素から抽出結果がマッピングされるに至った理由を抽出する理由抽出手段と、前記マッピング手段によってマッピングが行われた抽出結果及び前記理由抽出手段によって抽出された理由を出力する出力手段を備えていることを特徴とする。 In order to solve the above-mentioned problem, the invention according to claim 1 of the present invention is an information extraction device, wherein a start component specifying means for specifying a start component to be extracted, and the start component specifying means Mapping means for mapping from the start component specified by the extraction result to the extraction result, reason extraction means for extracting the reason why the extraction result is mapped from the start component by the mapping means, and mapping by the mapping means And an output means for outputting the extraction result obtained and the reason extracted by the reason extraction means.
 本発明の請求項2に記載の発明は、前記マッピング手段は、前記スタート構成要素特定手段によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出する連想構成要素抽出手段と、前記連想構成要素抽出手段によって抽出された連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出手段と、前記ゴール構成要素抽出手段によって抽出されたゴール構成要素にマッピングされている抽出結果を抽出する結果抽出手段とから構成されていることを特徴とする。 According to a second aspect of the present invention, the mapping means extracts an associative component extraction means for extracting an associative component mapped to the start component specified by the start component specifying means, and the association Goal component extraction means for extracting a goal component mapped to the associative component extracted by the component extraction means, and an extraction result mapped to the goal component extracted by the goal component extraction means It is characterized by comprising a result extracting means for extracting.
 本発明の請求項3に記載の発明は、前記連想構成要素抽出手段はスタート構成要素から連想される連想構成要素がマッピングされている情報テーブル、前記ゴール構成要素抽出手段は連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルに基づいて抽出を行い、マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れの内、少なくとも連想構成要素を理由として前記出力手段によって出力することを特徴とする。 According to a third aspect of the present invention, the associative component extracting means is an information table in which an associative component associated with a start component is mapped, and the goal component extracting means is associated with an associative component. The target component is extracted based on the mapped information table, and the output is performed on the basis of at least the associative component among a series of mapped start component, associative component, and goal component associative flow. It outputs by a means.
 本発明の請求項4に記載の発明は、前記情報テーブルは観点毎に用意されており、前記連想構成要素抽出手段及び前記ゴール構成要素抽出手段における抽出は同じ観点に属する情報テーブルだけを用いて行われ、前記出力手段によって出力される理由には前記観点も含まれていることを特徴とする。 According to a fourth aspect of the present invention, the information table is prepared for each viewpoint, and the associative component extraction means and the goal component extraction means use only information tables belonging to the same viewpoint. The reason why the data is output by the output means includes the above-mentioned viewpoint.
 本発明の請求項5に記載の発明は、前記連想構成要素抽出手段に用いられる情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされているものの他に、該連想構成要素から連想される他の連想構成要素がマッピングされているものが同一の観点でさらに少なくとも1以上用意されており、前記連想構成要素抽出手段は前記スタート構成要素特定手段によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出した後に、該連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルを用いてさらにマッピングされた連想構成要素を抽出し、前記ゴール構成要素抽出手段は前記連想構成要素抽出手段によって最終的に抽出された連想構成要素にマッピングされているゴール構成要素を抽出するものであることを特徴とする。 According to a fifth aspect of the present invention, in the information table used for the associative component extracting means, the associative component associated with the start component is mapped in addition to the associative component associated with the start component. In the same viewpoint, at least one or more other associative components mapped are prepared, and the associative component extracting means is mapped to the start component specified by the start component specifying means. After extracting the associated associative component, using the information table in which other associative components associated with the associative component are mapped, the mapped associative component is extracted, and the goal component extracting means Is mapped to the associative component finally extracted by the associative component extracting means. Characterized in that it is intended to extract the Le component.
 本発明の請求項6に記載の発明は、前記観点毎に所定の重み付けが設定される重み付け設定手段と、前記重み付け設定手段によって設定された重み付けを反映した上で各理由の優先度を集計する優先度集計手段と、前記優先度集計手段によって集計された優先度の高い順に理由の出力順位を決定する出力順位決定手段がさらに備えられ、前記出力手段は理由を抽出結果毎に出力し、該抽出結果毎に出力される理由は前記出力順位決定手段によって決定された出力順位に従って出力されることを特徴とする。 According to a sixth aspect of the present invention, a weighting setting unit for which a predetermined weighting is set for each viewpoint, and the priority of each reason is totaled after reflecting the weighting set by the weighting setting unit. A priority totalization unit; and an output rank determination unit that determines an output rank of reasons in descending order of priorities tabulated by the priority tabulation unit, and the output unit outputs a reason for each extraction result, The reason for outputting for each extraction result is output according to the output order determined by the output order determining means.
 本発明の請求項7に記載の発明は、コンピュータに、抽出対象となるスタート構成要素を特定するスタート構成要素特定ステップと、前記スタート構成要素特定ステップによって特定されたスタート構成要素から抽出結果までのマッピングを行うマッピングステップと、前記マッピングステップによってスタート構成要素から抽出結果がマッピングされるに至った理由を抽出する理由抽出ステップと、前記マッピングステップによってマッピングが行われた抽出結果及び前記理由抽出ステップによって抽出された理由を出力する出力ステップを実現させることを特徴とする。 According to a seventh aspect of the present invention, a computer includes a start component specifying step for specifying a start component to be extracted, and a start component specified by the start component specifying step to an extraction result. A mapping step for performing mapping, a reason extraction step for extracting the reason why the extraction result has been mapped from the start component by the mapping step, an extraction result mapped by the mapping step, and the reason extraction step. An output step for outputting the extracted reason is realized.
 本発明の請求項8に記載の発明は、前記マッピングステップは、前記スタート構成要素特定ステップによって特定されたスタート構成要素にマッピングされている連想構成要素を抽出する連想構成要素抽出ステップと、前記連想構成要素抽出ステップによって抽出された連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出ステップと、前記ゴール構成要素抽出ステップによって抽出されたゴール構成要素にマッピングされている抽出結果を抽出する結果抽出ステップとから構成されていることを特徴とする。 According to an eighth aspect of the present invention, in the mapping step, the associative component extracting step of extracting an associative component mapped to the start component specified by the start component specifying step, and the association A goal component extraction step for extracting a goal component mapped to the associative component extracted by the component extraction step, and an extraction result mapped to the goal component extracted by the goal component extraction step. And a result extracting step for extracting.
 本発明の請求項9に記載の発明は、前記連想構成要素抽出ステップはスタート構成要素から連想される連想構成要素がマッピングされている情報テーブル、前記ゴール構成要素抽出ステップは連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルに基づいて抽出を行い、マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れの内、少なくとも連想構成要素を理由として前記出力ステップによって出力することを特徴とする。 According to a ninth aspect of the present invention, the associative component extracting step is an information table in which an associative component associated with a start component is mapped, and the goal component extracting step is associated with an associative component. The target component is extracted based on the mapped information table, and the output is performed on the basis of at least the associative component among a series of mapped start component, associative component, and goal component associative flow. It outputs by a step.
 本発明の請求項10に記載の発明は、前記情報テーブルは観点毎に用意されており、前記連想構成要素抽出ステップ及び前記ゴール構成要素抽出ステップにおける抽出は同じ観点に属する情報テーブルだけを用いて行われ、前記出力ステップによって出力される理由には前記観点も含まれていることを特徴とする。 In the invention according to claim 10 of the present invention, the information table is prepared for each viewpoint, and the extraction in the associative component extraction step and the goal component extraction step uses only information tables belonging to the same viewpoint. The reason why the data is output by the output step includes the above viewpoint.
 本発明の請求項11に記載の発明は、前記連想構成要素抽出ステップに用いられる情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされているものの他に、該連想構成要素から連想される他の連想構成要素がマッピングされているものが同一の観点でさらに少なくとも1以上用意されており、前記連想構成要素抽出ステップは前記スタート構成要素特定ステップによって特定されたスタート構成要素にマッピングされている連想構成要素を抽出した後に、該連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルを用いてさらにマッピングされた連想構成要素を抽出し、前記ゴール構成要素抽出ステップは前記連想構成要素抽出ステップによって最終的に抽出された連想構成要素にマッピングされているゴール構成要素を抽出するものであることを特徴とする。 According to an eleventh aspect of the present invention, in the information table used in the associative component extracting step, the associative component associated with the start component is mapped in addition to the association component associated with the start component. In the same viewpoint, at least one or more other associative components are mapped, and the associative component extraction step is mapped to the start component specified by the start component specifying step. Extracting the associated associative component, using the information table in which other associative components associated with the associative component are mapped, extracting the associated associative component, and the goal component extracting step Is the association component finally extracted by the association component extraction step. Characterized in that extracts a goal component that is mappings.
 本発明の請求項12に記載の発明は、前記観点毎に所定の重み付けが設定される重み付け設定ステップと、前記重み付け設定ステップによって設定された重み付けを反映した上で各理由の優先度を集計する優先度集計ステップと、前記優先度集計ステップによって集計された優先度の高い順に理由の出力順位を決定する出力順位決定ステップをさらに実現させ、前記出力ステップは理由を抽出結果毎に出力し、該抽出結果毎に出力される理由は前記出力順位決定ステップによって決定された出力順位に従って出力されることを特徴とする。 According to a twelfth aspect of the present invention, a weight setting step in which a predetermined weight is set for each of the viewpoints, and the priority of each reason is totaled after reflecting the weight set by the weight setting step. A priority totalization step, and an output rank determination step for determining the output rank of the reasons in descending order of priority calculated by the priority totalization step, wherein the output step outputs a reason for each extraction result, The reason for outputting for each extraction result is output according to the output order determined by the output order determining step.
 本願発明によれば、単なる抽出結果だけでなく、その抽出結果が導き出された一連の連想の流れに関する情報を理由として出力するために、抽出結果が出力されたことに対する利用者の納得性を向上させることができるという効果がある。 According to the present invention, in order to output not only a simple extraction result but also a series of associative flows from which the extraction result is derived as a reason, the user's satisfaction with the output of the extraction result is improved. There is an effect that can be made.
実施例1における情報抽出装置101の内部構成を示したブロック図である。1 is a block diagram illustrating an internal configuration of an information extraction apparatus 101 in Embodiment 1. FIG. 実施例1における情報抽出装置101において抽出結果及びその抽出結果が導き出されるに至った理由を出力する処理手順を示したフローチャート図である。It is the flowchart figure which showed the process sequence which outputs the extraction result in the information extraction device 101 in Example 1, and the reason that the extraction result came to be derived. 実施例1におけるスタート構成要素から抽出結果が導き出されるに至った理由までの構成要素等のマッピングを示した説明図である。It is explanatory drawing which showed mapping of the component etc. to the reason which came to derive the extraction result from the start component in Example 1. FIG. 実施例2における全体のシステム構成を示した説明図である。FIG. 6 is an explanatory diagram showing an overall system configuration in Embodiment 2. 実施例2における情報抽出装置101の内部構成を示したブロック図である。It is the block diagram which showed the internal structure of the information extraction apparatus 101 in Example 2. FIG. 実施例2における情報抽出装置101において抽出結果及びその抽出結果が導き出されるに至った理由を出力する処理手順を示したフローチャート図である。It is the flowchart figure which showed the process sequence which outputs the extraction result in the information extraction device 101 in Example 2, and the reason that the extraction result came to be derived. 実施例2における抽出結果及びその抽出結果が導き出されるに至った理由の出力例を示したものである。The output example of the reason which came to derive the extraction result in Example 2 and the extraction result is shown.
 次に、本発明に係る情報抽出装置及び情報抽出プログラムの実施例につき、図面を参照しながら、以下に詳しく説明する。 Next, embodiments of the information extraction apparatus and the information extraction program according to the present invention will be described in detail below with reference to the drawings.
 情報抽出装置101は、図示しないが、CPU等の制御部とRAMやROM等の記憶部等を備えたコンピュータである。
 図1は、本実施例における情報抽出装置101の内部構成を示したブロック図である。
 その具体的な内部構成は、抽出対象となる情報を特定する対象情報特定部201、抽出対象となる情報からスタート構成要素を特定するスタート構成要素特定部202、情報テーブルのスタート構成要素等と連想構成要素のマッピング情報に基づいてスタート構成要素等にマッピングされている連想構成要素を抽出する連想構成要素抽出部203、情報テーブルの連想構成要素とゴール構成要素のマッピング情報に基づいて連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出部204、ゴール構成要素と抽出結果のマッピング情報に基づいてゴール構成要素とマッピングされている抽出結果及びその抽出結果が導かれるに至った理由を抽出する結果及び理由抽出部205、抽出結果やその抽出結果が導き出されるに至った理由等を画面表示や印刷等により出力する出力部206、スタート構成要素と連想構成要素、及び連想構成要素とゴール構成要素が各々マッピングされて観点毎に用意されている情報テーブル、並びにゴール構成要素と抽出結果のマッピングに関する情報等を管理する情報テーブル等管理部207、から構成されている。
Although not shown, the information extraction apparatus 101 is a computer including a control unit such as a CPU and a storage unit such as a RAM and a ROM.
FIG. 1 is a block diagram showing the internal configuration of the information extraction apparatus 101 in this embodiment.
The specific internal configuration is associated with the target information specifying unit 201 that specifies information to be extracted, the start component specifying unit 202 that specifies the start component from the information to be extracted, the start component of the information table, and the like. An associative component extraction unit 203 that extracts an associative component mapped to the start component based on the mapping information of the component, and an associative component based on the mapping information of the associative component and the goal component of the information table The goal component extraction unit 204 that extracts the mapped goal component, the extraction result mapped to the goal component based on the mapping information of the goal component and the extraction result, and the reason why the extraction result has been derived Extraction result and reason extraction unit 205, extraction results and extraction results are derived An output table 206 that outputs the reason for reaching the screen by screen display, printing, etc., an information table prepared for each viewpoint by mapping the start component and the associative component, and the associative component and the goal component, The information table management unit 207 manages information on the mapping between the goal component and the extraction result.
 次に、本発明における情報抽出装置101が有する機能について詳しく説明する。
 図2は、本実施例における情報抽出装置101において抽出結果及びその抽出結果が導き出されるに至った理由を出力する処理手順を示したフローチャート図である。
 実施の開始後(ステップS101)、対象情報特定部201により抽出対象となる情報を特定する(ステップS102)。
 スタート構成要素特定部202により、抽出対象となる情報からスタート構成要素を特定する(ステップS103)。
 次に、情報テーブル等管理部207で管理されている情報テーブルの内、スタート構成要素から連想される連想構成要素がマッピングされて観点毎に用意されている情報テーブルを検索して、ステップS103で特定したスタート構成要素がヒットするどうかを判断する(ステップS104)。
 もし、スタート構成要素がヒットしなかった場合は、判断結果は「No」となり、ステップS109へ進み、終了となる。
 一方、スタート構成要素がヒットした場合には、スタート構成要素から連想される連想構成要素がマッピングされて観点毎に用意されている情報テーブルにおいて、該スタート構成要素にマッピングされている連想構成要素を連想構成要素抽出部203により抽出する(ステップS105)。
 次に、情報テーブル等管理部207で管理されている情報テーブルの内、連想構成要素から連想されるゴール構成要素がマッピングされて観点毎に用意されている情報テーブルにおいて、該連想構成要素にマッピングされているゴール構成要素をゴール構成要素抽出部204により抽出する(ステップS106)。
 結果及び理由抽出部205により、情報テーブル等管理部207で管理されているゴール構成要素と抽出結果のマッピング情報に基づいて、ゴール構成要素にマッピングされている抽出結果を抽出すると共に、マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れや観点を理由として抽出する(ステップS107)。なお、スタート構成要素、連想構成要素、ゴール構成要素、及び観点を全て理由とする場合に限られるものではなく、その一部を理由として抽出しても良い。例えば、連想構成要素のみ、連想構成要素及び観点のみ、を理由として抽出しても良い。
 そして、抽出結果及びその抽出結果が導かれるに至った理由を出力部206により出力し(ステップS108)、終了となる(ステップS109)。
Next, functions of the information extraction apparatus 101 according to the present invention will be described in detail.
FIG. 2 is a flowchart showing a processing procedure for outputting the extraction result and the reason why the extraction result has been derived in the information extraction apparatus 101 according to the present embodiment.
After the start of implementation (step S101), the information to be extracted is specified by the target information specifying unit 201 (step S102).
The start component specifying unit 202 specifies the start component from the information to be extracted (step S103).
Next, in the information table managed by the information table management unit 207, an associative component associated with the start component is mapped and an information table prepared for each viewpoint is searched. It is determined whether or not the identified start component is hit (step S104).
If the start component is not hit, the determination result is “No”, the process proceeds to step S109, and the process ends.
On the other hand, when the start component is hit, the associative component associated with the start component is mapped and the associative component mapped to the start component is displayed in the information table prepared for each viewpoint. Extracted by the associative component extraction unit 203 (step S105).
Next, in the information table managed by the information table management unit 207, the goal component associated with the association component is mapped and mapped to the association component in the information table prepared for each viewpoint. The goal component extracted by the goal component extraction unit 204 is extracted (step S106).
The result and reason extraction unit 205 extracts and maps the extraction result mapped to the goal component based on the mapping information of the goal component managed by the information table management unit 207 and the extraction result. A series of association flows and viewpoints of the start component, the association component, and the goal component are extracted as reasons (step S107). The start component, the associative component, the goal component, and the viewpoint are not limited to all reasons, and some of them may be extracted as the reason. For example, only the associative component, only the associative component and the viewpoint may be extracted as the reason.
Then, the extraction result and the reason why the extraction result has been derived are output by the output unit 206 (step S108), and the process ends (step S109).
 このように、本実施例では、対象情報特定部201によって抽出対象となる情報を特定した後、スタート構成要素特定部202でスタート構成要素を特定し、特定されたスタート構成要素から抽出結果までのマッピングを行って、抽出結果及びその抽出結果が導き出されるに至った理由を結果及び理由抽出部205によって抽出し、出力部206によって出力する。
 なお、スタート構成要素から抽出結果までのマッピングは、連想構成要素抽出部203、ゴール構成要素抽出部204、結果及び理由抽出部205によって行われる。
As described above, in this embodiment, after the information to be extracted is specified by the target information specifying unit 201, the start component is specified by the start component specifying unit 202, and from the specified start component to the extraction result. Mapping is performed, the extraction result and the reason why the extraction result is derived are extracted by the result and reason extraction unit 205, and output by the output unit 206.
The mapping from the start component to the extraction result is performed by the associative component extraction unit 203, the goal component extraction unit 204, and the result and reason extraction unit 205.
 図3は、スタート構成要素から抽出結果が導き出されるに至った理由までの構成要素等のマッピングを示した説明図である。
 図3におけるスタート構成要素はA1、A3、D1、D3の4つである。観点はイ、ロの2つである。観点イにおける情報テーブルはAB及びBCの2つ、観点ロにおける情報テーブルはDE、EF及びFGの3つである。そして、情報テーブルAB内は、A1から連想されるB1、A2から連想されるB2及びB3、A3から連想されるB3、A4から連想されるB4、がそれぞれマッピングされている。
 スタート構成要素A1の例で説明すると、A1は情報テーブルAB内に含まれているので、まず情報テーブルABと結びつく、情報テーブルAB内のA1は、A1から連想されるB1とマッピングされており、B1は情報テーブルBC内において、B1から連想されるC1とマッピングされている。ここで、B1は連想構成要素、C1はゴール構成要素である。このように、スタート構成要素A1、連想構成要素B1、及びゴール構成要素C1がマッピングされて一連の連想の流れとなっている。そして、C1は抽出結果であるXとマッピングされているので、抽出結果としてはX、その抽出結果Xが導かれるに至った理由として、C1関連のものとしては観点イと連想構成要素B1が抽出される。ただし、抽出結果Xの理由としては、図3に示されているように、他にC3及びG1関連のものも含まれる。
FIG. 3 is an explanatory diagram showing mapping of components and the like up to the reason why the extraction result is derived from the start component.
In FIG. 3, there are four start components A1, A3, D1, and D3. There are two viewpoints, i and b. There are two information tables AB and BC in viewpoint A, and three information tables DE, EF, and FG in viewpoint B. In the information table AB, B1 associated with A1, B2 and B3 associated with A2, B3 associated with A3, and B4 associated with A4 are mapped.
In the example of the start component A1, since A1 is included in the information table AB, A1 in the information table AB that is associated with the information table AB is first mapped to B1 associated with A1, B1 is mapped to C1 associated with B1 in the information table BC. Here, B1 is an associative component, and C1 is a goal component. Thus, the start component A1, the associative component B1, and the goal component C1 are mapped to form a series of associative flows. Since C1 is mapped to the extraction result X, X is extracted as the extraction result, and the reason why the extraction result X is derived is that the viewpoint 1 and the associative component B1 are extracted as C1 related items. Is done. However, the reason for the extraction result X includes C3 and G1 related reasons as shown in FIG.
<抽出対象情報及びスタート構成要素の特定>
 まず、新聞記事、自然言語の文章、写真、質問に対する回答等の抽出対象情報となるコンテンツを準備する。このコンテンツの内容は限定されないが、最終的にスタート構成要素となるタグに落とせるものでなければならない。
 対象情報特定部201による抽出対象情報の特定には、様々な方法を用いることができる。例えば、利用者に対して質問して、「映画を見たい」と答えたら、予め用意したサンプルから見たいと思う映画を選ばせる、あるいは好きな映画のタイトルを入れてもらう、等が可能である。
 次に、スタート構成要素特定部202により、抽出対象となる情報からスタート構成要素を特定する。例えば、抽出対象となる情報が新聞記事であれば、該新聞記事をテキスト解析することによりタグを抽出して、スタート構成要素として特定しても良い。
 さらには、情報テーブル等管理部207で管理されている、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブルに含まれているスタート構成要素をいくつか提示して、直接、スタート構成要素を利用者に選んでもらって特定しても良い。
 ここで、スタート構成要素の特定とは、抽出対象情報に付属するメタ情報より、テキストデータの要素に分解することを指す。なお、スタート構成要素は、抽出対象情報であるコンテンツのメタ情報を定義するタグによって構成されている。
<Identification of extraction target information and start components>
First, contents to be extracted information such as newspaper articles, natural language sentences, photos, answers to questions, etc. are prepared. The content of this content is not limited, but it must be able to be dropped to the tag that will eventually become the starting component.
Various methods can be used for specifying the extraction target information by the target information specifying unit 201. For example, if you ask a user and answer "I want to watch a movie", you can select a movie you want to watch from a sample that you have prepared in advance, or you can enter the title of your favorite movie. is there.
Next, the start component specifying unit 202 specifies the start component from the information to be extracted. For example, if the information to be extracted is a newspaper article, a tag may be extracted by text analysis of the newspaper article and specified as a start component.
Furthermore, some start components included in the information table managed by the information table etc. management unit 207 and mapped to the associative components associated with the start components are presented and directly started. The component may be specified by the user.
Here, the identification of the start component means that the meta information attached to the extraction target information is decomposed into text data elements. The start component is configured by a tag that defines meta information of content that is extraction target information.
<スタート構成要素と連想構成要素のマッピング>
 先ず、場所・風景・アトラクション・レストラン等のような観点毎に用意されている、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブル(情報テーブルAB・情報テーブルDE)を検索して、スタート構成要素A1・A3・D1・D3があるかどうかを判断する。
 そして、スタート構成要素がヒットした場合には、該スタート構成要素にマッピングされる連想構成要素を抽出することになる。例えば、スタート構成要素A1の場合であれば、情報テーブルABにスタート構成要素A1が存在しているので、情報テーブルABにおいてA1にマッピングされているB1を連想構成要素として抽出する。
 このように、観点毎に用意されている、情報テーブル等管理部207で管理されているスタート構成要素から連想される連想構成要素がマッピングされている情報テーブルの情報に基づいて、連想構成要素抽出部203が、スタート構成要素特定部202によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出する。
<Mapping of start components and associative components>
First, an information table (information table AB / information table DE) that is prepared for each viewpoint, such as a place, a landscape, an attraction, a restaurant, etc., mapped with an associative component associated with a start component is searched. Thus, it is determined whether there are start components A1, A3, D1, and D3.
When the start component is hit, an associative component mapped to the start component is extracted. For example, in the case of the start component A1, since the start component A1 exists in the information table AB, B1 mapped to A1 in the information table AB is extracted as an associative component.
In this manner, associative component extraction is performed on the basis of information in the information table that is prepared for each viewpoint and that maps the associative component associated with the start component managed by the information table management unit 207. The unit 203 extracts an associative component mapped to the start component specified by the start component specifying unit 202.
 なお、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブル内において、スタート構成要素と連想構成要素とは1:1にマッピングされている場合だけでなく、1:Nにマッピングされていても良い。例えば、図3の情報テーブルABにおいて、A2に対してはB2及びB3がマッピングされている。A2が雰囲気が良いお店であれば、観点毎に用意されている情報テーブルの内、観点イが雰囲気であれば情報テーブルABにおいて、例えばB2の夜景のきれいなお店、B3の静かに食事ができるお店、の2つにマッピングされていても良い。
 ただし、観点を超えてマッピングされることはなく、複数の連想構成要素にマッピングされるとしても同じ観点内、即ちスタート構成要素と連想構成要素がマッピングされている情報テーブルは同じ観点内には一つしか存在していないので同じ情報テーブル内、でマッピングされる。例えば、観点イに含まれている情報テーブルAB内のA2に対して、観点が異なる観点ロに含まれている情報テーブルDE内のE1~E4のいずれかがマッピングされるということはない。
In the information table in which the associative component associated with the start component is mapped, the start component and the associative component are not only mapped 1: 1 but also mapped 1: N. May be. For example, in the information table AB of FIG. 3, B2 and B3 are mapped to A2. If A2 is a shop with a good atmosphere, out of the information tables prepared for each viewpoint, if viewpoint A is the atmosphere, in information table AB, for example, a shop with a beautiful night view of B2, a quiet meal of B3 It may be mapped to two of the shops that can.
However, it is not mapped beyond the viewpoint, and even if it is mapped to a plurality of associative components, the information table in which the start component and the associative component are mapped is not included in the same viewpoint. Since there is only one, it is mapped in the same information table. For example, any of E1 to E4 in the information table DE included in the viewpoint B having a different viewpoint is not mapped to A2 in the information table AB included in the viewpoint i.
 また、ある観点に存在している情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブル、及び連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルの2つに限られない。観点ロの場合のように、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブルDE及び連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルFGの間に、連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルEFが存在していても良い。D1の場合であれば、情報テーブルDEにおいてE1とマッピングされているが、その後、E1は中間の情報テーブルEFにおいてF1とマッピングされ、さらにF1は情報テーブルFGにおいてG1とマッピングされている。例えば、D1がデートしたい、E1が雰囲気が良い、F1が夜景がきれい、G1が高層階にあるレストランのような場合である。この場合、G1が選ばれた理由を構成する連想構成要素は、1つの連想構成要素ではなく、E1及びF1の複数の連想構成要素のセットとなる。
 なお、連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブル内において、連想構成要素と他の連想構成要素とは、1:1にマッピングされている場合だけでなく、1:Nにマッピングされていても良い。情報テーブルEF内のE2に対しては、F2及びF3がマッピングされている。
 このように、情報テーブル等管理部207で管理されている連想構成要素の抽出に用いられる情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされているものの他に、該連想構成要素から連想される他の連想構成要素がマッピングされているものが同一の観点でさらに少なくとも1以上用意されていても良い。この場合、連想構成要素抽出部203はスタート構成要素特定部202によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出した後に、該連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルを用いてさらにマッピングされた連想構成要素を抽出し、ゴール構成要素抽出部204は連想構成要素抽出部203によって最終的に抽出された連想構成要素にマッピングされているゴール構成要素を抽出する。
In addition, the information table existing from a certain viewpoint includes an information table in which an associative component associated with the start component is mapped, and an information table in which the goal component associated with the associative component is mapped. It is not limited to two. As in the case of viewpoint B, the association between the information table DE to which the associative component associated with the start component is mapped and the information table FG to which the goal component associated with the associative component is mapped. There may be an information table EF in which other associative components associated with the components are mapped. In the case of D1, it is mapped to E1 in the information table DE, but then E1 is mapped to F1 in the intermediate information table EF, and F1 is further mapped to G1 in the information table FG. For example, D1 wants to date, E1 has a good atmosphere, F1 has a beautiful night view, and G1 is a restaurant on a higher floor. In this case, the associative component constituting the reason why G1 is selected is not one associative component, but a set of a plurality of associative components E1 and F1.
In the information table in which other associative components associated with the associative components are mapped, the associative components and the other associative components are not only mapped 1: 1 but also 1 : May be mapped to N. F2 and F3 are mapped to E2 in the information table EF.
As described above, the information table used for extraction of the associative component managed by the information table management unit 207 includes the associative component in addition to the mapping of the associative component associated with the start component. From the same viewpoint, at least one or more may be prepared in which other associative components associated with the above are mapped. In this case, after the associative component extraction unit 203 extracts the associative component mapped to the start component specified by the start component specifying unit 202, another associative component associated with the associative component is displayed. The mapped associative component is further extracted using the mapped information table, and the goal component extracting unit 204 is mapped to the associative component finally extracted by the associative component extracting unit 203. Extract elements.
<連想構成要素とゴール構成要素のマッピング>
 観点毎に用意されている、情報テーブル等管理部207で管理されている連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルの情報に基づいて、ゴール構成要素抽出部204が、連想構成要素抽出部203によって抽出された連想構成要素にマッピングされているゴール構成要素を抽出する。
 なお、連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブル内において、連想構成要素とゴール連想構成要素とは、1:1にマッピングされている場合だけでなく、1:Nにマッピングされていても良い。情報テーブルBC内のB2に対しては、C1及びC2がマッピングされている。例えば、焼肉に対して、カルビ及びロースがマッピングされているような場合である。
 ただし、観点を超えてマッピングされることはなく、複数のゴール構成要素にマッピングされるとしても同じ観点内、即ち連想構成要素とゴール連想構成要素がマッピングされている情報テーブルは同じ観点内には一つしか存在していないので同じ情報テーブル内、でマッピングされる。
<Mapping of associative components and goal components>
Based on the information in the information table prepared by each viewpoint and mapping the goal component associated with the association component managed by the information table management unit 207, the goal component extraction unit 204 includes: The goal component mapped to the associative component extracted by the associative component extracting unit 203 is extracted.
In the information table in which the goal component associated with the associative component is mapped, the associative component and the goal associative component are not only mapped 1: 1 but also 1: N. It may be mapped. C1 and C2 are mapped to B2 in the information table BC. For example, this is the case where ribs and loin are mapped to yakiniku.
However, it is not mapped beyond the viewpoint, and even if it is mapped to a plurality of goal components, the information table in which the associative component and the goal associative component are mapped is not included in the same viewpoint. Since there is only one, it is mapped in the same information table.
<ゴール構成要素と抽出結果のマッピング>
 ゴール構成要素抽出部204によって抽出されたゴール構成要素にマッピングされている抽出結果を、結果及び理由抽出部205によって抽出する。より詳しくは、ゴール構成要素とテキストデータの要素に分解されたコンテンツに付属するメタ情報が一致する場合に、該コンテンツを抽出結果として出力する。 ただし、抽出結果となるコンテンツと該コンテンツに付属するメタ情報とのマッピング情報を予め情報テーブル等管理部207に記憶保存しておき、ゴール構成要素をキーワードに該マッピング情報を検索してヒットしたコンテンツを抽出結果として出力しても良い。
 なお、抽出結果はゴール構成要素とマッピングされ、スタート構成要素や連想構成要素と直接にマッピングされることはない。
 また、ゴール連想構成要素と抽出結果は、1:1にマッピングされている場合に限られない。図3の場合で説明すると、ゴール構成要素C3は抽出結果XだけでなくYにもマッピングされており、また抽出結果Xはゴール構成要素C3だけでなくC1やG1ともマッピングされている。
<Mapping of goal components and extraction results>
The result and reason extraction unit 205 extracts the extraction result mapped to the goal component extracted by the goal component extraction unit 204. More specifically, when the meta information attached to the content decomposed into the goal component and the text data element matches, the content is output as an extraction result. However, content that is obtained by storing mapping information between content as an extraction result and meta information attached to the content in the information table management unit 207 in advance and searching for the mapping information using the goal component as a keyword. May be output as an extraction result.
The extraction result is mapped to the goal component and is not directly mapped to the start component or the associative component.
Further, the goal association component and the extraction result are not limited to the case of 1: 1 mapping. In the case of FIG. 3, the goal component C3 is mapped not only to the extraction result X but also to Y, and the extraction result X is mapped not only to the goal component C3 but also to C1 and G1.
<出力>
 スタート構成要素特定部202によって特定されたスタート構成要素から抽出結果までのマッピングを行い、その抽出結果だけでなく、スタート構成要素から抽出結果がマッピングされるに至った理由を結果及び理由抽出部205によって抽出し、出力部206によって出力する。例えば、利用者の好きなラーメン屋の情報を元にお勧めのラーメン屋を抽出する場合であれば、Xというラーメン屋を抽出結果として抽出すると同時に、抽出結果としてXが導き出されるに至ったのは、Xの店主が利用者の好きなラーメン屋の店主と同じ店で昔修行していたからである、といった理由も併せて抽出して出力する。
 本発明では、数ある中から抽出結果を探し出して来たことより、その抽出結果を探し出して来たことに対する理由付け、言い換えると情報テーブル間の結び付きが重要である。そして、図3の観点ロの場合のように同じ観点内の情報テーブルが3以上だった場合、情報テーブルDEと情報テーブルEFを結び付ける連想構成要素だけでなく、情報テーブルEFと情報テーブルFGを結び付ける連想構成要素も理由として出力される。
 出力内容としては、抽出結果及びその抽出結果が導き出されるに至った理由が含まれているものであればよく、様々な出力が可能である。図3の場合であれば、抽出結果としてX、Y、Zの3つを出力している。そして、Xの理由としては観点イ及び連想構成要素B1、観点イ及び連想構成要素B3、並びに観点ロ及び連想構成要素E1・F1の3つを出力し、Yの理由としては観点イ及び連想構成要素B3、並びに観点ロ及び連想構成要素E1・F1の2つを出力し、Zの理由としては観点ロ及び連想構成要素E3・F3、の1つを出力している。
 このように、出力される理由には、観点や連想構成要素等が含まれる。また、観点、連想構成要素等を単に並べて示す場合だけに限られるものではなく、これらを含む文章を生成して理由として出力しても良い。
 また、出力される抽出結果は、テキストに限られるものではなく、新聞記事、写真、映像、音楽等のコンテンツであっても良い。また、抽出結果であるコンテンツの他に、該コンテンツに関連するコンテンツも併せて出力しても良い。
<Output>
The mapping from the start component specified by the start component specifying unit 202 to the extraction result is performed, and not only the extraction result but also the reason that the extraction result is mapped from the start component is the result and reason extraction unit 205. And output by the output unit 206. For example, in the case of extracting a recommended ramen shop based on the information of a user's favorite ramen shop, X is extracted as a result of extracting the ramen shop X as an extraction result. The reason is that the reason is that the store owner of X trained in the same store as the store owner of the ramen shop that the user likes in the past.
In the present invention, the reason for searching for the extraction result, in other words, the connection between the information tables, is more important than searching for the extraction result from among many. Then, when there are three or more information tables in the same viewpoint as in the viewpoint B of FIG. 3, not only the associative component linking the information table DE and the information table EF but also the information table EF and the information table FG are linked. The associative component is also output as the reason.
The output content only needs to include the extraction result and the reason why the extraction result has been derived, and various outputs are possible. In the case of FIG. 3, three extraction results of X, Y, and Z are output. Then, as the reason for X, the viewpoint i and the associative component B1, the viewpoint i and the associative component B3, and the viewpoint b and the associative components E1 and F1 are output. The element B3 and two of viewpoint B and associative components E1 and F1 are output, and as the reason of Z, one of viewpoint B and associative components E3 and F3 is output.
Thus, the reason for output includes a viewpoint, an associative component, and the like. Further, the present invention is not limited to the case where viewpoints, associative components, etc. are simply displayed side by side, and a sentence including these may be generated and output as a reason.
The output result to be output is not limited to text, but may be contents such as newspaper articles, photos, videos, music, and the like. In addition to the content that is the extraction result, content related to the content may be output together.
<情報テーブル>
 スタート構成要素と連想構成要素、及び連想構成要素とゴール構成要素が各々マッピングされて、観点毎に用意されている情報テーブルについて説明する。本発明における情報テーブルとは、ある語とその語から連想される連想語がマッピングされて登録されているテーブルである。
 情報テーブルは1又は複数の観点毎に存在する。そして、観点毎にスタート構成要素から連想される連想構成要素、連想構成要素から連想されるゴール構成要素が各々マッピングされている少なくとも2つの情報テーブルが、情報テーブル等管理部207によって管理されている。
 ただし、観点毎の情報テーブルは、この2つに限られるものではなく、この2つの情報テーブルの間に連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルが、1または複数入っていても良い。例えば図3に示した例のように、観点イの情報テーブルは2つであるが、観点ロの情報テーブルは3つである。
<Information table>
An information table prepared for each viewpoint by mapping the start component and the associative component, and the associative component and the goal component will be described. The information table in the present invention is a table in which a certain word and an associative word associated with the word are mapped and registered.
An information table exists for each of one or more viewpoints. In addition, the information table management unit 207 manages at least two information tables each mapping the associative component associated with the start component and the goal component associated with the associative component for each viewpoint. .
However, the information table for each viewpoint is not limited to these two, and the information table in which other associative components associated with the associative components are mapped between the two information tables is 1 or Multiple may be included. For example, as in the example shown in FIG. 3, there are two information tables for viewpoint A, but there are three information tables for viewpoint B.
 なお、図3の観点ロの情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされている情報テーブルDE、連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルEF、連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルFGの3つであるが、ゴール構成要素は連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルFGからのみ抽出される。つまり、あるゴール構成要素は情報テーブルFGから抽出されるが、別のゴール構成要素は中間の情報テーブルEFから抽出されるということはない。観点によっては、図3の観点ロのように、スタート構成要素から連想される連想構成要素、連想構成要素から連想されるゴール構成要素が各々マッピングされている2つの情報テーブルの他に、連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルが間に入っている場合はあるものの、ゴール構成要素は抽出結果のメタ情報を定義するタグであるので、抽出結果はゴール構成要素とのみ結び付き、中間の連想構成要素と直接結び付くということはない。
 また、情報テーブル等管理部207で管理されている情報テーブルは、情報抽出装置101の内部に記憶保存されている場合に限られるものではなく、該情報テーブルのマッピング情報自体は情報抽出装置101の外部に存在していても良い。
3 is an information table DE in which an associative component associated with the start component is mapped, and an information table in which other associative components associated with the associative component are mapped. There are three EFs, an information table FG to which the goal components associated with the associative components are mapped, but the goal components are from the information table FG to which the goal components associated with the associative components are mapped. Only extracted. That is, one goal component is extracted from the information table FG, but another goal component is not extracted from the intermediate information table EF. Depending on the viewpoint, as shown in FIG. 3B, in addition to the two information tables in which the associative component associated with the start component and the goal component associated with the associative component are respectively mapped, the associative configuration Although there may be an information table mapped between other associative components associated with the element, the goal component is a tag that defines the meta information of the extraction result, so the extraction result is the goal component It is only associated with an element and not directly with an intermediate associative component.
Further, the information table managed by the information table management unit 207 is not limited to the case where the information table is stored and saved in the information extraction apparatus 101. The mapping information itself of the information table is the information of the information extraction apparatus 101. It may exist outside.
 本実施例は、利用者の志向性をも反映して抽出結果や理由が出力される点、設定された重み付けに基づいて理由等の優先度を集計して出力順位を決定する点、利用者端末103と通信を行う点等が追加されているのが、実施例1の場合と異なる。他の部分は、実施例1と同様なので適宜説明を省略する。
 図4は、本実施例における全体のシステム構成を示した説明図である。
 本願の情報抽出装置は、図4に示すように、利用者側に設置された利用者端末103が、通信ネットワーク102を介して情報抽出装置101と接続されている。なお、図4に示したシステム構成において、情報抽出装置101は1台、利用者端末103は2台として示されているが、これらは各々それ以上の台数があってもよい。
In this embodiment, the extraction result and the reason are output reflecting the user's intentionality, the priority of the reason is calculated based on the set weight, and the output order is determined. The point that communication with the terminal 103 is added is different from the case of the first embodiment. Since other parts are the same as those in the first embodiment, description thereof will be omitted as appropriate.
FIG. 4 is an explanatory diagram showing the overall system configuration in the present embodiment.
As shown in FIG. 4, in the information extraction device of the present application, a user terminal 103 installed on the user side is connected to the information extraction device 101 via a communication network 102. In the system configuration shown in FIG. 4, the information extraction apparatus 101 is shown as one and the user terminal 103 is shown as two, but each of these may have a larger number.
 通信ネットワーク102は、インターネット、LAN(ローカル・エリア・ネットワーク)、WAN(広域ネットワーク)等、種々のネットワークを利用することが可能である。 The communication network 102 can use various networks such as the Internet, a LAN (local area network), and a WAN (wide area network).
 情報抽出装置101は、図示しないが、CPU等の制御部とRAMやROM等の記憶部等を備えたコンピュータである。
 そして、利用者端末103は、図示しないが、CPU等の制御部とRAMやROM等の記憶部と液晶画面等の表示部やキーボード等の入力部、インターネット等との通信を制御する通信部等を備えた情報処理機器であり、この中にはパーソナルコンピュータ、スマートフォン、携帯電話、タブレット端末、PHS、PDA、専用端末等が含まれる。
Although not shown, the information extraction apparatus 101 is a computer including a control unit such as a CPU and a storage unit such as a RAM and a ROM.
Although not shown, the user terminal 103 includes a control unit such as a CPU, a storage unit such as a RAM and a ROM, a display unit such as a liquid crystal screen, an input unit such as a keyboard, a communication unit that controls communication with the Internet, and the like. This includes information processing devices including personal computers, smartphones, mobile phones, tablet terminals, PHS, PDAs, dedicated terminals, and the like.
 図5は、本実施例における情報抽出装置101の内部構成を示したブロック図である。
 その具体的な内部構成は、抽出対象となる情報を特定する対象情報特定部301、抽出対象となる情報からスタート構成要素を特定するスタート構成要素特定部302、情報テーブルのスタート構成要素等と連想構成要素のマッピング情報に基づいてスタート構成要素等にマッピングされている連想構成要素を抽出する連想構成要素抽出部303、情報テーブルの連想構成要素とゴール構成要素のマッピング情報に基づいて連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出部304、ゴール構成要素と抽出結果のマッピング情報に基づいてゴール構成要素とマッピングされている抽出結果及びその抽出結果が導かれるに至った理由を抽出する結果及び理由抽出部305、観点間、スタート構成要素間、連想構成要素間、または抽出結果間等の重み付けに基づいて抽出結果や理由の優先度を集計する優先度集計部306、集計された優先度の高い順に出力順位を決定する出力順位決定部307、決定された出力順位に従って抽出結果やその抽出結果が導き出されるに至った理由等を画面表示や印刷等により出力する出力部308、スタート構成要素と連想構成要素、及び連想構成要素とゴール構成要素が各々マッピングされて観点毎に用意されている情報テーブル、並びにゴール構成要素と抽出結果のマッピングに関する情報等を管理する情報テーブル等管理部309、利用者の抽出動機や利用者の属性を含む利用者の志向性に関する情報を記憶保存したり特定したりする利用者情報管理部310、観点間、スタート構成要素間、連想構成要素間、または抽出結果間等の重み付けを設定する重み付け設定部311、通信ネットワーク102を介して利用者端末103との送受信を行う通信部312、から構成されている。
FIG. 5 is a block diagram showing the internal configuration of the information extraction apparatus 101 in this embodiment.
The specific internal configuration is associated with a target information specifying unit 301 that specifies information to be extracted, a start component specifying unit 302 that specifies a start component from the information to be extracted, a start component of an information table, and the like. An associative component extracting unit 303 that extracts an associative component mapped to the start component based on the mapping information of the component, and an associative component based on the mapping information of the associative component and the goal component of the information table The goal component extraction unit 304 that extracts the mapped goal component, the extraction result mapped to the goal component based on the mapping information of the goal component and the extraction result, and the reason why the extraction result has been derived Results and reason extraction unit 305, between viewpoints, between start components, associations Priority aggregation unit 306 that aggregates the priority of extraction results and reasons based on weighting between components or between extraction results, and output rank determination unit 307 that determines output ranks in descending order of the aggregated priorities The output unit 308 for outputting the extraction result and the reason that the extraction result has been derived according to the output order that has been output by screen display or printing, the start component and the associative component, and the associative component and the goal component, respectively Information table that is mapped and prepared for each viewpoint, information table management unit 309 that manages information on the mapping between goal components and extraction results, etc., including user extraction motives and user attributes User information management unit 310 that stores and stores information on orientation, between viewpoints, between start components, between association components, Others are constituted extracted weight setter 311 sets the weights of the results between the like, from the communication unit 312, for transmitting and receiving user terminal 103 via the communication network 102.
 次に、情報抽出装置101が有する機能について詳しく説明する。
 図6は、本実施例における情報抽出装置101において抽出結果及びその抽出結果が導き出されるに至った理由を出力する処理手順を示したフローチャート図である。
 実施の開始後(ステップS201)、対象情報特定部301により抽出対象となる情報を特定する(ステップS202)。
 次に、対象情報特定部301によって特定された抽出対象情報や利用者情報管理部310に記憶保存されている利用者の属性に関する情報等から、利用者情報管理部310によって利用者の志向性を特定する(ステップS203)。また、通信部312によって情報抽出装置101と利用者端末103の間で送受信を行い、情報抽出装置101からネットワーク102を介して利用者端末103に送信した質問に対して利用者が入力した回答情報を受信することにより、例えば、利用者の志向性を複数提示して利用者に選択入力してもらうなどして、利用者の志向性を特定しても良い。なお、本実施例における利用者の志向性とは、情報を抽出しようとする利用者の意識が向けられている方向性の意味であり、より具体的には、利用者の抽出動機、利用者の属性等を指す。
 そして、利用者の志向性に基づき、重み付け設定部311において、観点毎に変動する重み付けを設定する(ステップS204)。ただし、重み付けは、観点間だけに限られるものではなく、スタート構成要素間、連想構成要素間、または抽出結果間等で設定されていても良い。例えば、利用者にスタート構成要素の内、どれを重要視するのかの質問を行ってスタート構成要素間の重み付けを行っても良い。また、重み付けは、利用者の属性等によって変動しても良い。
 そして、対象情報特定部301によって特定された抽出対象情報から、スタート構成要素特定部302によりスタート構成要素を特定する(ステップS205)。
 次に、情報テーブル等管理部309で管理されている情報テーブルの内、スタート構成要素から連想される連想構成要素がマッピングされて観点毎に用意されている情報テーブルを検索して、ステップS205で特定したスタート構成要素がヒットするどうかを判断する(ステップS206)。
 もし、スタート構成要素がヒットしなかった場合は、判断結果は「No」となり、ステップS211へ進み、終了となる。
 一方、スタート構成要素がヒットした場合には、スタート構成要素から連想される連想構成要素がマッピングされて観点毎に用意されている情報テーブルにおいて、該スタート構成要素にマッピングされている連想構成要素を連想構成要素抽出部303により抽出する。次に、情報テーブル等管理部309で管理されている情報テーブルの内、連想構成要素から連想されるゴール構成要素がマッピングされて観点毎に用意されている情報テーブルにおいて、該連想構成要素にマッピングされているゴール構成要素をゴール構成要素抽出部304により抽出する。更に、情報テーブル等管理部309で管理されているゴール構成要素と抽出結果のマッピング情報に基づいて、ゴール構成要素にマッピングされている抽出結果と共に、マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れや観点を理由として、結果及び理由抽出部305により抽出する(ステップS207)。なお、スタート構成要素、連想構成要素、ゴール構成要素、及び観点を全て理由とする場合に限られるものではなく、その一部を理由として抽出しても良い。例えば、連想構成要素のみ、連想構成要素及び観点のみ、を理由として抽出しても良い。
 次に、観点間、スタート構成要素間、連想構成要素間、または抽出結果間等の重み付けに基づいて、優先度集計部306により抽出結果や理由の優先度を集計する(ステップS208)。
 集計された優先度の高い順に、出力順位決定部307により抽出結果や理由の出力順位を決定する(ステップS209)。
 そして、決定された出力順位に従って、抽出結果及びその抽出結果が導き出されるに至った理由を出力部308により出力し(ステップS210)、終了となる(ステップS211)。
Next, functions of the information extraction apparatus 101 will be described in detail.
FIG. 6 is a flowchart showing a processing procedure for outputting the extraction result and the reason why the extraction result is derived in the information extraction apparatus 101 in this embodiment.
After the start of implementation (step S201), the target information specifying unit 301 specifies information to be extracted (step S202).
Next, based on the extraction target information specified by the target information specifying unit 301 and information on user attributes stored and stored in the user information management unit 310, the user information management unit 310 determines the user's orientation. Specify (step S203). Also, reply information input by the user in response to the question transmitted from the information extraction apparatus 101 to the user terminal 103 via the network 102 by transmitting and receiving between the information extraction apparatus 101 and the user terminal 103 by the communication unit 312. , The user's orientation may be specified by, for example, presenting a plurality of user's orientations and having the user select and input them. Note that the user orientation in the present embodiment means the direction in which the user's consciousness to extract information is directed, and more specifically, the user's extraction motivation, the user Refers to the attributes of
Based on the user's orientation, the weighting setting unit 311 sets the weighting that varies for each viewpoint (step S204). However, the weighting is not limited to between the viewpoints, and may be set between the start components, between the associative components, or between the extraction results. For example, the start component may be weighted by asking the user which of the start components is important. Further, the weighting may vary depending on user attributes and the like.
Then, the start component is specified by the start component specifying unit 302 from the extraction target information specified by the target information specifying unit 301 (step S205).
Next, the information table managed by the information table management unit 309 is searched for an information table prepared for each viewpoint by mapping the associative component associated with the start component, in step S205. It is determined whether or not the identified start component is hit (step S206).
If the start component is not hit, the determination result is “No”, the process proceeds to step S211, and the process ends.
On the other hand, when the start component is hit, the associative component associated with the start component is mapped and the associative component mapped to the start component is displayed in the information table prepared for each viewpoint. Extracted by the associative component extraction unit 303. Next, in the information table managed by the information table management unit 309, the goal component associated with the associative component is mapped and mapped to the associated component in the information table prepared for each viewpoint. The goal component extraction unit 304 extracts the goal component that has been set. Further, based on the mapping information of the goal component managed by the information table management unit 309 and the extraction result, the extraction result mapped to the goal component, the mapped start component, associative component, and goal The result and reason extraction unit 305 extracts the result of the series of associations and viewpoints of the constituent elements (step S207). The start component, the associative component, the goal component, and the viewpoint are not limited to all reasons, and some of them may be extracted as the reason. For example, only the associative component, only the associative component and the viewpoint may be extracted as the reason.
Next, the priority of the extraction result and the reason is totalized by the priority totaling unit 306 based on the weighting between the viewpoints, between the start components, between the associative components, or between the extraction results (step S208).
The output rank determination unit 307 determines the output rank of the extraction result and the reason in descending order of the aggregated priority (step S209).
Then, according to the determined output order, the extraction result and the reason why the extraction result has been derived are output by the output unit 308 (step S210), and the process ends (step S211).
 本実施例では実施例1の場合とは異なり、情報抽出装置101に、利用者の抽出動機や利用者の属性を含む利用者の志向性に関する情報を記憶保存したり特定したりする利用者情報管理部310と、観点毎に所定の重み付けが設定される重み付け設定部311と、重み付け設定部311によって設定された重み付けを反映した上で抽出結果や理由の優先度を集計する優先度集計部306と、優先度集計部306によって集計された優先度の高い順に抽出結果や理由の出力順位を決定する出力順位決定部307等がさらに備えられ、出力部308は理由を抽出結果毎に出力し、該抽出結果毎に出力される理由は出力順位決定部307によって決定された出力順位に従って出力される。
 なお、出力部308によって出力される抽出結果は、出力順位決定部307によって決定された出力順位に従って出力される。
 また、優先度集計部306による優先度の集計は、重み付け設定部311によって設定される観点間の重み付け等に基づいて一種のポイント計算によって行われる。該重み付けは、利用者により動的に変動しても良く、さらには利用者の利用状況により情報抽出装置101が学習して変動させても良い。
In this embodiment, unlike the case of the first embodiment, the information extraction apparatus 101 stores and stores information on user orientation, including user extraction motives and user attributes, and specifies or stores user information. The management unit 310, the weight setting unit 311 in which a predetermined weight is set for each viewpoint, and the priority aggregation unit 306 that aggregates the priority of the extraction result and the reason after reflecting the weight set by the weight setting unit 311 And an output rank determination unit 307 for determining the output rank of the extraction results and reasons in descending order of the priority tabulated by the priority tabulation unit 306, and the output unit 308 outputs the reasons for each extraction result, The reason for outputting for each extraction result is output according to the output order determined by the output order determining unit 307.
The extraction result output by the output unit 308 is output in accordance with the output order determined by the output order determination unit 307.
Further, the priority aggregation by the priority aggregation unit 306 is performed by a kind of point calculation based on the weighting between viewpoints set by the weight setting unit 311. The weighting may be dynamically changed by the user, and may be further changed by the information extraction apparatus 101 learning depending on the usage status of the user.
 図7は、本実施例2における抽出結果及びその抽出結果が導き出されるに至った理由の出力例を示したものである。
 抽出対象が「レストランを探したい。」、利用者の志向性が「ダイエット中の彼女とデートするため。二人共、残業の多い職場で働き、場所は異なるが東海道線沿線に住んでいる。」の場合である。
 この場合、抽出結果としてA店とB店を出力し、更にそれらの抽出結果が導き出されるに至った理由を、A店については「ロケーション」「雰囲気」「同伴者」の観点から、B店については「ロケーション」「雰囲気」の観点から、出力している例である。A店が抽出された理由としては3つの観点から出力されているが、観点毎の重み付けを反映して、「同伴者」「ロケーション」「雰囲気」の順となっている。より具体的には、この場合はレストランを探す目的が「彼女とデートするため」であるので、「同伴者」の観点が一番重くなっている。次に、「二人共、残業の多い職場で働き、場所は異なるが東海道線沿線に住んでいる。」ので、「ロケーション」の観点が次に重くなっている。
FIG. 7 shows an output example of the extraction result in the second embodiment and the reason why the extraction result has been derived.
The target of the extraction is “I want to find a restaurant.” The intention of the user is “To date her on a diet. Both of them work in a place where there is a lot of overtime, and they live in different locations but live along the Tokaido Line. Is the case.
In this case, store A and store B are output as extraction results, and the reason why these extraction results have been derived is as follows. Is an example of output from the viewpoint of “location” and “atmosphere”. The reason why the store A is extracted is output from three viewpoints, but in order of “accompanying person”, “location”, and “atmosphere”, reflecting the weighting for each viewpoint. More specifically, in this case, since the purpose of searching for a restaurant is “to date her”, the viewpoint of “accompanied person” is the heaviest. Next, “They work in a workplace with a lot of overtime and live in different locations but live along the Tokaido Line.” The next point of view is “location”.
 他の出力例としては、例えば、ある企業がWEBサイトを作りたいが、そのWEBサイトに掲載する写真としてどういう写真を使って良いかがわからないという場合、情報抽出装置101が利用者である当該企業に対し、どのような対象にどのような目的で何を伝えたいのかという利用者の志向性等を質問して行き、その回答を基に抽出結果として写真家の名前を出力し、その理由として、この写真家の写真を使えば30代の男女にロジカルに環境問題の重要性を伝えることができるので、この写真家の写真を採用してはどうか、等と出力することが可能である。
 なお、抽出結果として写真家の名前を出力すると共に、その写真家が撮影した写真を併せて出力しても良い。
As another output example, for example, when a certain company wants to create a WEB site but does not know what kind of photograph can be used as a photograph to be posted on the WEB site, the information extraction apparatus 101 is a user concerned. On the other hand, the user's intentions about what kind of object he wants to convey for what purpose is asked, and the photographer's name is output as the extraction result based on the answer. Because the photographer's photograph can logically convey the importance of environmental issues to men and women in their 30s, it is possible to output the photographer's photograph and so on.
Note that the photographer's name may be output as an extraction result, and the photograph taken by the photographer may be output together.
 従来の検索システムでは、利用者の志向性に基づいた結果は提示されず、利用者の志向性と全く無関係の判断基準によって検索結果が提示されていた。しかし、本実施例では、利用者の志向性に応じて観点間等の重み付けをすることにより、抽出結果や理由の優先度を柔軟に変化させて対応し、利用者の志向性に基づいた抽出結果や理由を提示する。つまり、利用者の志向性に応じて自動で観点を変えて出力したりすることができる。
 本実施例によれば、抽出結果を得るまでの過程が人間の思考に近く、出力された結果に対する利用者の納得性がより向上するという効果がある。抽出結果に至るまでの思考の一連の連想の流れを、言わば一つのストーリーとして利用者に提示することができるのである。
In the conventional search system, the result based on the user's intention is not presented, and the search result is presented based on a judgment criterion completely unrelated to the user's intention. However, in this embodiment, by weighting between viewpoints according to the user's orientation, the extraction result and the priority of the reason can be flexibly changed, and extraction based on the user's orientation Present results and reasons. That is, it is possible to automatically change the viewpoint according to the user's intentionality and output.
According to the present embodiment, the process until obtaining the extraction result is close to human thought, and there is an effect that the user's satisfaction with the output result is further improved. It is possible to present a series of associative flow of thinking up to the extraction result to the user as a story.
 なお、本発明の目的は、上記の実施例1及び2の機能を実現するソフトウェアのプログラムをシステム又は装置に供給し、そのシステム又は装置のコンピュータが記憶媒体に記録されたソフトウェアのプログラムを読出し実行することによっても達成される。 The object of the present invention is to supply a software program that implements the functions of the first and second embodiments to a system or apparatus, and the computer of the system or apparatus reads and executes the software program recorded on a storage medium. Is also achieved.
 この場合、記憶媒体から読出されたプログラム自体が本発明の新規な機能を実現することになり、当該プログラムやそれを記憶した記憶媒体は本発明を構成することになる。 In this case, the program itself read from the storage medium realizes the novel function of the present invention, and the program and the storage medium storing the program constitute the present invention.
 ソフトウェアのプログラムを記録するための記憶媒体としては、例えば、ハードディスク、光ディスク、光磁気ディスク、CD-ROM、CD-R、磁気テープ、不揮発性のメモリカード,ROM等を用いることができる。 As a storage medium for recording a software program, for example, a hard disk, an optical disk, a magneto-optical disk, a CD-ROM, a CD-R, a magnetic tape, a nonvolatile memory card, a ROM, or the like can be used.
 さらに、記憶媒体から読出されたプログラムが、システム又は装置のコンピュータに挿入された機能拡張用のボードやコンピュータに接続された機能拡張用のユニットに備えられたメモリに書込まれた後、当該プログラムの指示に基づき、その機能拡張用のボードやユニットに備えられたCPU等が実際の処理の一部または全部を行ない、その処理によって上記の機能が実現される場合も含まれるものとする。 Further, after the program read from the storage medium is written to a memory provided in a function expansion board inserted into the computer of the system or apparatus or a function expansion unit connected to the computer, the program This includes the case where the CPU or the like provided in the function expansion board or unit performs part or all of the actual processing based on the above instruction and the above functions are realized by the processing.
101 情報抽出装置
102 通信ネットワーク
103 利用者端末
201、301 対象情報特定部
202、302 スタート構成要素特定部
203、303 連想構成要素抽出部
204、304 ゴール構成要素抽出部
205、305 結果及び理由抽出部
306 優先度集計部
307 出力順位決定部
206、308 出力部
207、309 情報テーブル等管理部
310 利用者情報管理部
311 重み付け設定部
312 通信部
DESCRIPTION OF SYMBOLS 101 Information extraction apparatus 102 Communication network 103 User terminal 201, 301 Target information specific | specification part 202, 302 Start component specific part 203, 303 Associative component extractor 204, 304 Goal component extractor 205, 305 Result and reason extraction part 306 Priority calculation unit 307 Output order determination unit 206, 308 Output unit 207, 309 Information table management unit 310 User information management unit 311 Weight setting unit 312 Communication unit

Claims (12)

  1. 抽出対象となるスタート構成要素を特定するスタート構成要素特定手段と、
    前記スタート構成要素特定手段によって特定されたスタート構成要素から抽出結果までのマッピングを行うマッピング手段と、
    前記マッピング手段によってスタート構成要素から抽出結果がマッピングされるに至った理由を抽出する理由抽出手段と、
    前記マッピング手段によってマッピングが行われた抽出結果及び前記理由抽出手段によって抽出された理由を出力する出力手段
    を備えていることを特徴とする情報抽出装置。
    Start component specifying means for specifying a start component to be extracted;
    Mapping means for performing mapping from the start component specified by the start component specifying means to the extraction result;
    Reason extraction means for extracting the reason why the extraction result is mapped from the start component by the mapping means;
    An information extraction apparatus comprising: an output unit that outputs an extraction result mapped by the mapping unit and a reason extracted by the reason extraction unit.
  2. 前記マッピング手段は、
    前記スタート構成要素特定手段によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出する連想構成要素抽出手段と、
    前記連想構成要素抽出手段によって抽出された連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出手段と、
    前記ゴール構成要素抽出手段によって抽出されたゴール構成要素にマッピングされている抽出結果を抽出する結果抽出手段
    とから構成されていることを特徴とする請求項1に記載の情報抽出装置。
    The mapping means includes
    An associative component extracting means for extracting an associative component mapped to the start component specified by the start component specifying means;
    Goal component extraction means for extracting a goal component mapped to the association component extracted by the associative component extraction means;
    2. The information extracting apparatus according to claim 1, further comprising: a result extracting unit that extracts an extraction result mapped to the goal component extracted by the goal component extracting unit.
  3. 前記連想構成要素抽出手段はスタート構成要素から連想される連想構成要素がマッピングされている情報テーブル、前記ゴール構成要素抽出手段は連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルに基づいて抽出を行い、
    マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れの内、少なくとも連想構成要素を理由として前記出力手段によって出力することを特徴とする請求項2に記載の情報抽出装置。
    The associative component extracting means is an information table in which an associative component associated with a start component is mapped, and the goal component extracting means is in an information table in which a goal component associated with an associative component is mapped. Extraction based on
    3. The information extracting apparatus according to claim 2, wherein the output means outputs at least the associative component among a series of associative flows of the mapped start component, associative component, and goal component. .
  4. 前記情報テーブルは観点毎に用意されており、
    前記連想構成要素抽出手段及び前記ゴール構成要素抽出手段における抽出は同じ観点に属する情報テーブルだけを用いて行われ、
    前記出力手段によって出力される理由には前記観点も含まれていることを特徴とする請求項3に記載の情報抽出装置。
    The information table is prepared for each viewpoint.
    The extraction in the associative component extraction means and the goal component extraction means is performed using only information tables belonging to the same viewpoint,
    The information extraction apparatus according to claim 3, wherein the reason for the output by the output means includes the viewpoint.
  5. 前記連想構成要素抽出手段に用いられる情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされているものの他に、該連想構成要素から連想される他の連想構成要素がマッピングされているものが同一の観点でさらに少なくとも1以上用意されており、
    前記連想構成要素抽出手段は前記スタート構成要素特定手段によって特定されたスタート構成要素にマッピングされている連想構成要素を抽出した後に、該連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルを用いてさらにマッピングされた連想構成要素を抽出し、
    前記ゴール構成要素抽出手段は前記連想構成要素抽出手段によって最終的に抽出された連想構成要素にマッピングされているゴール構成要素を抽出するものであることを特徴とする請求項4に記載の情報抽出装置。
    In the information table used for the associative component extracting means, in addition to the mapping of the associative component associated with the start component, other associative components associated with the associative component are mapped. At least one thing is prepared from the same viewpoint,
    The associative component extraction unit extracts an associative component mapped to the start component specified by the start component specifying unit, and then maps other associative components associated with the associative component. Extract the mapped associative components using the existing information table,
    5. The information extraction according to claim 4, wherein the goal component extraction unit extracts a goal component mapped to the association component finally extracted by the association component extraction unit. apparatus.
  6. 前記観点毎に所定の重み付けが設定される重み付け設定手段と、
    前記重み付け設定手段によって設定された重み付けを反映した上で各理由の優先度を集計する優先度集計手段と、
    前記優先度集計手段によって集計された優先度の高い順に理由の出力順位を決定する出力順位決定手段がさらに備えられ、
    前記出力手段は理由を抽出結果毎に出力し、該抽出結果毎に出力される理由は前記出力順位決定手段によって決定された出力順位に従って出力されることを特徴とする請求項4または5に記載の情報抽出装置。
    A weight setting means for setting a predetermined weight for each viewpoint;
    Priority counting means for counting the priority of each reason after reflecting the weight set by the weight setting means;
    An output rank determining means for determining an output rank of reasons in descending order of priorities counted by the priority counting means;
    6. The output means according to claim 4, wherein the output means outputs a reason for each extraction result, and the reason output for each extraction result is output according to the output order determined by the output order determination means. Information extraction device.
  7. コンピュータに、
    抽出対象となるスタート構成要素を特定するスタート構成要素特定ステップと、
    前記スタート構成要素特定ステップによって特定されたスタート構成要素から抽出結果までのマッピングを行うマッピングステップと、
    前記マッピングステップによってスタート構成要素から抽出結果がマッピングされるに至った理由を抽出する理由抽出ステップと、
    前記マッピングステップによってマッピングが行われた抽出結果及び前記理由抽出ステップによって抽出された理由を出力する出力ステップ
    を実現させることを特徴とする情報抽出プログラム。
    On the computer,
    A start component specifying step for specifying a start component to be extracted; and
    A mapping step for performing mapping from the start component specified by the start component specifying step to the extraction result;
    A reason extraction step for extracting the reason why the extraction result is mapped from the start component by the mapping step;
    An information extraction program that realizes an output step of outputting an extraction result mapped by the mapping step and a reason extracted by the reason extraction step.
  8. 前記マッピングステップは、
    前記スタート構成要素特定ステップによって特定されたスタート構成要素にマッピングされている連想構成要素を抽出する連想構成要素抽出ステップと、
    前記連想構成要素抽出ステップによって抽出された連想構成要素にマッピングされているゴール構成要素を抽出するゴール構成要素抽出ステップと、
    前記ゴール構成要素抽出ステップによって抽出されたゴール構成要素にマッピングされている抽出結果を抽出する結果抽出ステップ
    とから構成されていることを特徴とする請求項7に記載の情報抽出プログラム。
    The mapping step includes
    An associative component extraction step of extracting an associative component mapped to the start component specified by the start component specifying step;
    A goal component extraction step for extracting a goal component mapped to the associative component extracted by the associative component extraction step;
    8. The information extraction program according to claim 7, comprising a result extraction step of extracting an extraction result mapped to the goal component extracted by the goal component extraction step.
  9. 前記連想構成要素抽出ステップはスタート構成要素から連想される連想構成要素がマッピングされている情報テーブル、前記ゴール構成要素抽出ステップは連想構成要素から連想されるゴール構成要素がマッピングされている情報テーブルに基づいて抽出を行い、
    マッピングされたスタート構成要素、連想構成要素及びゴール構成要素の一連の連想の流れの内、少なくとも連想構成要素を理由として前記出力ステップによって出力することを特徴とする請求項8に記載の情報抽出プログラム。
    The associative component extracting step is an information table in which an associative component associated with a start component is mapped, and the goal component extracting step is in an information table in which a goal component associated with an associative component is mapped. Extraction based on
    9. The information extraction program according to claim 8, wherein the output step outputs the result of at least the associative component among a series of associative flows of the mapped start component, associative component, and goal component. .
  10. 前記情報テーブルは観点毎に用意されており、
    前記連想構成要素抽出ステップ及び前記ゴール構成要素抽出ステップにおける抽出は同じ観点に属する情報テーブルだけを用いて行われ、
    前記出力ステップによって出力される理由には前記観点も含まれていることを特徴とする請求項9に記載の情報抽出プログラム。
    The information table is prepared for each viewpoint.
    The extraction in the associative component extraction step and the goal component extraction step is performed using only information tables belonging to the same viewpoint,
    The information extraction program according to claim 9, wherein the reason for outputting in the output step includes the viewpoint.
  11. 前記連想構成要素抽出ステップに用いられる情報テーブルは、スタート構成要素から連想される連想構成要素がマッピングされているものの他に、該連想構成要素から連想される他の連想構成要素がマッピングされているものが同一の観点でさらに少なくとも1以上用意されており、
    前記連想構成要素抽出ステップは前記スタート構成要素特定ステップによって特定されたスタート構成要素にマッピングされている連想構成要素を抽出した後に、該連想構成要素から連想される他の連想構成要素がマッピングされている情報テーブルを用いてさらにマッピングされた連想構成要素を抽出し、
    前記ゴール構成要素抽出ステップは前記連想構成要素抽出ステップによって最終的に抽出された連想構成要素にマッピングされているゴール構成要素を抽出するものであることを特徴とする請求項10に記載の情報抽出プログラム。
    In the information table used in the associative component extraction step, in addition to the mapping of the associative component associated with the start component, other associative components associated with the associative component are mapped. At least one thing is prepared from the same viewpoint,
    The associative component extracting step extracts an associative component mapped to the start component identified by the start component identifying step, and then maps other associative components associated with the associative component. Extract the mapped associative components using the existing information table,
    The information extraction according to claim 10, wherein the goal component extraction step extracts a goal component mapped to the association component finally extracted by the association component extraction step. program.
  12. 前記観点毎に所定の重み付けが設定される重み付け設定ステップと、
    前記重み付け設定ステップによって設定された重み付けを反映した上で各理由の優先度を集計する優先度集計ステップと、
    前記優先度集計ステップによって集計された優先度の高い順に理由の出力順位を決定する出力順位決定ステップをさらに実現させ、
    前記出力ステップは理由を抽出結果毎に出力し、該抽出結果毎に出力される理由は前記出力順位決定ステップによって決定された出力順位に従って出力されることを特徴とする請求項10または11に記載の情報抽出プログラム。
    A weight setting step in which a predetermined weight is set for each viewpoint;
    A priority counting step of counting the priority of each reason after reflecting the weight set in the weight setting step;
    Further realizing an output rank determination step for determining the output rank of the reasons in descending order of priority calculated by the priority calculation step,
    12. The output step according to claim 10, wherein the output step outputs a reason for each extraction result, and the reason output for each extraction result is output according to the output order determined by the output order determination step. Information extraction program.
PCT/JP2014/077403 2013-10-18 2014-10-15 Information extraction device and information extraction program WO2015056699A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2013-216879 2013-10-18
JP2013216879A JP5522813B1 (en) 2013-10-18 2013-10-18 Information extraction apparatus and information extraction program

Publications (1)

Publication Number Publication Date
WO2015056699A1 true WO2015056699A1 (en) 2015-04-23

Family

ID=51175658

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2014/077403 WO2015056699A1 (en) 2013-10-18 2014-10-15 Information extraction device and information extraction program

Country Status (2)

Country Link
JP (1) JP5522813B1 (en)
WO (1) WO2015056699A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2022079010A (en) * 2020-11-15 2022-05-26 株式会社Personal AI Information selection support system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0644074A (en) * 1991-09-06 1994-02-18 Hitachi Ltd Knowledge base, inference method and descriptive text preparation method
JP2010204735A (en) * 2009-02-27 2010-09-16 Toshiba Corp Information recommendation device, information recommendation method, and information recommendation program
WO2011013191A1 (en) * 2009-07-27 2011-02-03 株式会社 東芝 Relevancy presentation device, method, and program

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH0644074A (en) * 1991-09-06 1994-02-18 Hitachi Ltd Knowledge base, inference method and descriptive text preparation method
JP2010204735A (en) * 2009-02-27 2010-09-16 Toshiba Corp Information recommendation device, information recommendation method, and information recommendation program
WO2011013191A1 (en) * 2009-07-27 2011-02-03 株式会社 東芝 Relevancy presentation device, method, and program

Also Published As

Publication number Publication date
JP2015079401A (en) 2015-04-23
JP5522813B1 (en) 2014-06-18

Similar Documents

Publication Publication Date Title
Chinsamy-Turan The microstructure of dinosaur bone
CN110457504B (en) Digital asset search techniques
JP5581408B2 (en) Information processing system, information processing apparatus, information processing method, and program
CN108959394A (en) The search result of cluster
CN102893280B (en) Data serching device, data search method and program
US20110282855A1 (en) Scoring relationships between objects in information retrieval
US20140379719A1 (en) System and method for tagging and searching documents
WO2014206151A1 (en) System and method for tagging and searching documents
CN103744887B (en) It is a kind of for the method for people search, device and computer equipment
JP2008234550A (en) Expert information retrieval device, expert information retrieval method and program
CN107451139A (en) File resource methods of exhibiting, device and corresponding smart machine
CN109753504A (en) Data query method and device
JP2015527677A (en) Social network search result presentation method and apparatus, and storage medium
CN107315735B (en) Method and equipment for note arrangement
JP5522813B1 (en) Information extraction apparatus and information extraction program
CN107562954A (en) Recommendation searching method, device and mobile terminal based on mobile terminal
JP2005128872A (en) Document retrieving system and document retrieving program
Sochenkova et al. Design and implementation of information retrieval mechanism for the virtual museum creation
KR20220004278A (en) Method for providing search service and system for the same
Lösch et al. Mapping microblog posts to encyclopedia articles.
JP2017076345A (en) Information processing device, information processing system and program
Fischer et al. Transgender Visibility, Abolitionism, and Resistive Organizing in the Age of Trump: A Conversation with CeCe McDonald and Joshua Allen
CN103593112B (en) A kind of information processing method and electronic equipment
KR102173685B1 (en) Apparatus for providing book-rating based on emotion tag
Wilkens et al. Document Ranking Applied to Second Language Learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14853560

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14853560

Country of ref document: EP

Kind code of ref document: A1