JP5485236B2 - FAQ creation support system and program - Google Patents

FAQ creation support system and program Download PDF

Info

Publication number
JP5485236B2
JP5485236B2 JP2011189366A JP2011189366A JP5485236B2 JP 5485236 B2 JP5485236 B2 JP 5485236B2 JP 2011189366 A JP2011189366 A JP 2011189366A JP 2011189366 A JP2011189366 A JP 2011189366A JP 5485236 B2 JP5485236 B2 JP 5485236B2
Authority
JP
Japan
Prior art keywords
sentence
query
representative sentence
representative
answer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2011189366A
Other languages
Japanese (ja)
Other versions
JP2013050896A (en
Inventor
哲男 小川
早織 新田
頌之 小松原
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Toshiba Digital Solutions Corp
Original Assignee
Toshiba Corp
Toshiba Solutions Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp, Toshiba Solutions Corp filed Critical Toshiba Corp
Priority to JP2011189366A priority Critical patent/JP5485236B2/en
Priority to CN201210298999.3A priority patent/CN103020035B/en
Publication of JP2013050896A publication Critical patent/JP2013050896A/en
Application granted granted Critical
Publication of JP5485236B2 publication Critical patent/JP5485236B2/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Description

本発明の実施形態は、FAQ作成支援に関する。   Embodiments described herein relate generally to FAQ creation support.

従来から、問合せとその回答をまとめたFAQ(Frequently Asked Questions)が作成され、問合せユーザに対するコミュニケーションツールとして、また、問合せに対して回答をする情報提供者の業務ツールとして活用されている。   Conventionally, FAQ (Frequently Asked Questions) that summarizes inquiries and their answers has been created and used as a communication tool for inquiring users and as a business tool for information providers who answer inquiries.

一般的にFAQは、多数の問合せ内容とそれに対する回答を分析し、例えば、FAQ作成者が問合せ頻度の高い問合せを抽出したり、問合せ頻度の高い問合せの中から代表的な問合せを作成し、抽出又は作成した問合せとそれに対する適切な回答とを組み合わせて作成している。   In general, the FAQ analyzes a large number of query contents and answers to them, for example, a FAQ creator extracts a query with a high query frequency, or creates a representative query from queries with a high query frequency, Created by combining the extracted or created query with the appropriate answer.

特開2009−15769号公報JP 2009-15769 A

問合せ文及びその回答文を含む文書の文書群からFAQの候補となる問合せと回答の対を定量的に評価したFAQ作成環境を実現するFAQ作成支援システム及びプログラムを提供することを目的とする。   It is an object of the present invention to provide a FAQ creation support system and program for realizing a FAQ creation environment that quantitatively evaluates a query and answer pair that is a candidate for a FAQ from a document group including a query sentence and an answer sentence.

実施形態のFAQ作成支援システムは、問合せ文とその回答文を含む文書の文書集合において各文書それぞれの問合せ文から抽出される複数の問合せ代表文のうち同一の問合せ代表文に基づいて、一の問合せ代表文に複数の抽出元の問合せ文に対応する文書が関連付けられた問合せ代表文を記憶する第1記憶部と、各文書それぞれの回答文から抽出される複数の回答代表文のうち同一の回答代表文に基づいて、一の回答代表文に複数の抽出元の回答文に対応する文書が関連付けられた回答代表文を記憶する第2記憶部と、一の問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、カウントされた各文書数を用いて一の問合せ代表文と一の回答代表文とのペアに対するFAQ候補評価情報を生成するFAQ候補制御部と、FAQ候補評価情報に対応する問合せ代表文と回答代表文との対に基づいて、問合せとその回答で構成されるFAQを生成するFAQ作成制御部と、を有し、FAQ候補制御部は、問合せ代表文及び回答代表文を縦横に配置したマトリクス図であって、問合せ代表文と回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示されたFAQ候補マトリクス図を生成し、FAQ作成制御部は、所定のコンピュータに表示されたFAQ候補マトリクス図上のFAQ候補評価情報が選択された場合に、問合せ文とその回答文を含む文書の文書集合の中から選択されたFAQ候補評価情報に対応する問合せ代表文に関連付く各抽出元の問合せ文それぞれを抽出し、選択されたFAQ候補評価情報に対応する問合せ代表文及び回答代表文と、抽出された各抽出元の問合せ文とを含むFAQ作成画面を生成しコンピュータに伝送する。 The FAQ creation support system of the embodiment is based on the same query representative sentence among a plurality of query representative sentences extracted from the query sentences of each document in the document set of documents including the query sentence and the answer sentence. A first storage unit that stores a query representative sentence in which a document corresponding to a plurality of source query sentences is associated with the query representative sentence; and a plurality of answer representative sentences extracted from the answer sentences of each document Based on the answer representative sentence, a second storage unit that stores an answer representative sentence in which a document corresponding to a plurality of source answer sentences is associated with one answer representative sentence, and each document associated with the one inquiry representative sentence Counts the number of documents that match each document associated with each of the answer representative sentences, and using the counted number of documents, FAQ candidates for a pair of one inquiry representative sentence and one answer representative sentence A FAQ candidate control unit for generating price information, a FAQ creation control unit for generating a FAQ composed of an inquiry and its answer based on a pair of an inquiry representative sentence and an answer representative sentence corresponding to the FAQ candidate evaluation information, It has a, FAQ candidate control unit is a matrix diagram of arranging the query representative sentence and answer representative sentence vertically and horizontally to a position intersecting the query representative sentence and the answer representative sentence aspect, query representative sentence applicable and answers A FAQ candidate matrix diagram in which FAQ candidate evaluation information corresponding to a pair with the representative sentence is displayed is generated, and the FAQ creation control unit selects the FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a predetermined computer. Each of the source query sentences associated with the query representative sentence corresponding to the FAQ candidate evaluation information selected from the document set of the document including the query sentence and the answer sentence. Extracts and transmits the FAQ creation screen generates a computer including a query representative sentence and answer representative sentence corresponding to FAQ candidate evaluation information selected and extracted and the query statement of the extraction source.

第1実施形態のFAQ作成支援システムの適用例を示す図である。It is a figure which shows the example of application of the FAQ preparation assistance system of 1st Embodiment. 第1実施形態のFAQ作成支援機能を説明する図である。It is a figure explaining the FAQ preparation assistance function of 1st Embodiment. 第1実施形態のFAQ作成支援システムの構成ブロック図である。It is a block diagram of the FAQ creation support system of the first embodiment. 第1実施形態の問合せ−対応内容2軸マトリクス図の一例を示す図である。It is a figure which shows an example of the inquiry-correspondence content 2-axis matrix figure of 1st Embodiment. 第1実施形態のFAQ作成画面例を示す図である。It is a figure which shows the example of FAQ preparation screen of 1st Embodiment. 第1実施形態のFAQ情報例を示す図である。It is a figure which shows the FAQ information example of 1st Embodiment. 第1実施形態のFAQ作成支援処理フローを示す図である。It is a figure which shows the FAQ creation assistance process flow of 1st Embodiment. 第1実施形態の代表文生成部のブロック構成図(a)、代表文生成処理フロー(b)を示す図である。It is a figure which shows the block block diagram (a) of the representative sentence production | generation part of 1st Embodiment, and the representative sentence production | generation process flow (b). 第1実施形態の問合せ内容及び回答内容の代表文を生成する構文解析(依存構造木)処理を説明する図である。It is a figure explaining the parsing (dependency structure tree) process which produces | generates the typical sentence of the inquiry content and reply content of 1st Embodiment. 第1実施形態の構文解析処理における部分依存構造木の抽出例を示す図である。It is a figure which shows the example of extraction of the partial dependence structure tree in the syntax analysis process of 1st Embodiment. 第1実施形態の構文解析処理を通じて得られる代表文の一例を示す図である。It is a figure which shows an example of the representative sentence obtained through the syntax analysis process of 1st Embodiment. 第1実施形態の代表文候補情報の一例を示す図である。It is a figure which shows an example of the representative sentence candidate information of 1st Embodiment. 第1実施形態の集約代表文候補情報の一例を示す図である。It is a figure which shows an example of the aggregation representative sentence candidate information of 1st Embodiment. 第1実施形態の問合せ代表文情報の一例を示す図である。It is a figure which shows an example of the inquiry representative sentence information of 1st Embodiment. 第1実施形態の回答代表文情報の一例を示す図である。It is a figure which shows an example of the reply representative text information of 1st Embodiment. 第1実施形態の問合せ代表文階層情報の一例を示す図である。It is a figure which shows an example of the inquiry representative sentence hierarchy information of 1st Embodiment. 第1実施形態のケース情報入力画面例を示す図である。It is a figure which shows the example of a case information input screen of 1st Embodiment. 第1実施形態のケース情報の一例を示す図である。It is a figure which shows an example of case information of 1st Embodiment.

以下、実施形態につき、図面を参照して説明する。   Hereinafter, embodiments will be described with reference to the drawings.

(第1実施形態)
図1から図18は、第1実施形態のFAQ作成支援システムに係る図である。本実施形態に係るFAQ作成支援システムは、一例として、コンタクトセンターシステムに外的又は内的に適用することができ、顧客からの問合せ内容及びオペレータの回答(対応)内容に基づくFAQの作成支援機能を提供する。
(First embodiment)
FIG. 1 to FIG. 18 are diagrams related to the FAQ creation support system of the first embodiment. As an example, the FAQ creation support system according to the present embodiment can be applied to a contact center system externally or internally, and a FAQ creation support function based on the contents of an inquiry from a customer and the contents of an operator's response (response). I will provide a.

図1は、FAQ作成支援システム3がコンタクトセンターシステム1に適用された例である。コンタクトセンターシステム1は、顧客Cからの公衆回線網Nを介した着信(入電)をACDシステム2が各オペレータに分配し、顧客の問合せに対してオペレータが応答する。   FIG. 1 is an example in which the FAQ creation support system 3 is applied to the contact center system 1. In the contact center system 1, the ACD system 2 distributes incoming calls (incoming calls) from the customer C via the public line network N to each operator, and the operator responds to the customer's inquiry.

オペレータは、オペレータ端末5から顧客の問合せ内容(問合せ文)を入力し、また、問合せに対する回答内容(回答文)を入力する。顧客からの各問合せ及びその回答は、ケース情報としてそれぞれ所定のデータベースに登録され、対応履歴として蓄積される。   The operator inputs the customer inquiry content (inquiry text) from the operator terminal 5 and also inputs the response content (answer text) to the inquiry. Each inquiry and answer from the customer is registered as case information in a predetermined database and stored as a response history.

図17は、オペレータ端末5のディスプレイ装置に表示されるケース情報入力画面の一例であり、オペレータはケース情報入力画面を通じて顧客からの問合せ内容及び対応内容を含む各種の対応履歴情報を入力する。   FIG. 17 is an example of a case information input screen displayed on the display device of the operator terminal 5, and the operator inputs various response history information including inquiry contents and response contents from the customer through the case information input screen.

オペレータ端末5からケース情報入力画面を通じて入力されたケース情報は、所定のデータベース等の記憶部に記憶される。図18は、ケース情報の一例であり、1つの問合せに対して一意に割り当てられるケース番号(文書ID)、問合せタイプ、問合せ内容、対応内容、付帯情報等を含む。ケース情報は、同じ又は異なる顧客からの1つの問合せに対して1つのケース情報が生成されて登録される。なお、付帯情報は、例えば、問合せ内容を区分する製品分類や問合せ分類等の各ケース情報に付帯する情報である。   Case information input from the operator terminal 5 through the case information input screen is stored in a storage unit such as a predetermined database. FIG. 18 is an example of case information, and includes a case number (document ID) uniquely assigned to one inquiry, an inquiry type, inquiry contents, corresponding contents, incidental information, and the like. Case information is generated and registered for one inquiry from the same or different customers. Note that the incidental information is information incidental to each case information such as a product classification and an inquiry classification for classifying inquiry contents.

本実施形態のFAQ作成支援システム3は、オペレータの対応履歴として蓄積されたケース情報の問合せとその回答をそれぞれ分析し、問合せ内容及び回答内容の各代表文を生成(抽出)し、生成された代表文からFAQの候補となる問合せと回答の対を定量的に評価したFAQ作成環境を提供する。   The FAQ creation support system 3 according to the present embodiment analyzes case information queries and responses stored as operator response history, generates (extracts) each query statement and each representative sentence of the response content, and generates Provide a FAQ creation environment that quantitatively evaluates query and answer pairs that are FAQ candidates from representative sentences.

なお、オペレータ端末5を介したケース情報の登録処理は、コンタクトセンターシステム1の不図示の制御部が遂行することができ、また、FAQ作成支援システム3が遂行するように構成することもできる。   The case information registration process via the operator terminal 5 can be performed by a control unit (not shown) of the contact center system 1 or can be configured to be performed by the FAQ creation support system 3.

図2は、本実施形態のFAQ作成支援機能を説明する図である。オペレータ端末5からオペレータによって入力された顧客からの問合せ内容とその回答内容とを含むケース情報から、各ケース番号(文書ID)別に問合せ文の構文解析を通じた代表文生成処理を行い、問合せ代表文を生成する。また、対応内容についても各文書ID別に対応内容の構文解析を通じて代表文生成処理を行い、回答代表文を生成する。   FIG. 2 is a diagram for explaining the FAQ creation support function of this embodiment. From the case information including the inquiry contents from the customer inputted by the operator from the operator terminal 5 and the answer contents, a representative sentence generation process is performed through a syntax analysis of the inquiry sentence for each case number (document ID). Is generated. In addition, for the correspondence content, a representative sentence generation process is performed through syntactic analysis of the correspondence contents for each document ID to generate a representative answer sentence.

本実施形態の問合せ代表文は、問合せ文及びその回答文を含む文書の文書集合において複数の問合せ文から生成される要約文であり、抽出元の問合せ文それぞれが同じ要約文で集約され、1つの要約文に複数の問合せ文(及び、抽出元の文書ID)が関連付けられるFAQの候補文となる代表要約文である。回答代表文も同様である。   The query representative sentence of the present embodiment is a summary sentence generated from a plurality of query sentences in the document set of the document including the query sentence and its answer sentence. This is a representative summary sentence that is a candidate sentence of FAQ in which a plurality of query sentences (and document IDs of extraction sources) are associated with one summary sentence. The same is true for the answer representative.

また、本実施形態では、生成された代表文に紐付く生成元の文書群を対象としてさらに構文解析を遂行し、1つの代表文に紐付くサブ代表文を生成し、代表文とそのサブ代表文で構成される階層化された代表文を生成することができる。つまり、1つの代表文に関連付けられる複数の問合せ内容を対象に代表文生成処理を遂行することで、サブ代表文を当該代表文の下位階層として生成する。   Further, in the present embodiment, the syntactic analysis is further performed on the document group of the generation source associated with the generated representative sentence, a sub representative sentence associated with one representative sentence is generated, and the representative sentence and its sub representative are generated. A hierarchical representative sentence composed of sentences can be generated. In other words, by executing the representative sentence generation process for a plurality of inquiry contents associated with one representative sentence, the sub representative sentence is generated as a lower hierarchy of the representative sentence.

そして、問合せ文とその回答文を含む文書の文書集合において、各文書の問合せ文で構成される第1文書群を適切に表す複数の問合せ代表文(問合せ要約文)と、各文書の回答文で構成される第2文書群を適切に表す複数の回答代表文(回答要約文)とをマッチングし、問合せ代表文と回答代表文との関係を抽出元の文書ID数(文書数)で表した問合せ−回答マトリクス図を生成する。   In the document set of the document including the query sentence and the answer sentence, a plurality of query representative sentences (query summary sentences) appropriately representing the first document group composed of the query sentence of each document, and the answer sentence of each document Matching a plurality of response representative sentences (answer summary sentences) that appropriately represent the second document group composed of, the relationship between the query representative sentence and the answer representative sentence is represented by the number of document IDs (number of documents) of the extraction source A query-answer matrix diagram is generated.

具体的には、問合せ代表文を縦軸、回答代表文を横軸とし、問合せ代表文に紐付く文書IDと回答代表文に紐付く文書IDとの間のマッチング数(文書ID数、つまり文書数)をマトリクス表示した2軸表示画面を生成し、問合せ代表文と代表回答文との対を、抽出元の文書ID数で評価した問合せ−回答2軸マトリクス図を、FAQを作成する管理者等の管理者端末4に提供する。   Specifically, the query representative sentence is the vertical axis and the answer representative sentence is the horizontal axis, and the number of matching between the document ID associated with the query representative sentence and the document ID associated with the reply representative sentence (the document ID number, that is, the document 2) A matrix that displays a two-axis display screen, and a query-response two-axis matrix diagram in which a pair of a query representative sentence and a representative answer sentence is evaluated by the number of document IDs of the extraction source. Etc. to the administrator terminal 4.

また、問合せ−回答2軸マトリクス図の縦軸及び横軸の各要素が交差する文書ID数が表示される各表示ブロックが選択された場合、当該クロスする問合せ代表文とその回答代表文とを含むFAQ作成画面を管理者端末4に提供し、選択された表示ブロックに対応する問合せ代表文とその代表回答文の対に関連付けられる抽出元の文書群(各文書の問合せ文)が提供されるFAQの作成環境を実現する。   In addition, when each display block that displays the number of document IDs where the vertical and horizontal axes of the query-answer two-axis matrix diagram intersect is selected, the query representative sentence that intersects and the answer representative sentence are displayed. An FAQ creation screen including the same is provided to the administrator terminal 4 and a source document group (query text of each document) associated with a pair of the query representative sentence corresponding to the selected display block and the representative answer sentence is provided. Realize FAQ creation environment.

FAQ作成画面を通じて作成されたFAQは、FAQ情報として登録される。このFAQ情報は、オペレータの問合せ対応業務のフィードバック情報として活用でき、例えば、図17のケース情報入力画面の「FAQ検索」ボタンを選択すると、登録されたFAQ情報をオペレータ端末に表示させ、問合せ内容の入力作業の負担を低減させつつ、問合せに対する回答の迅速化及び回答内容の入力作業の負担も低減させることができる。   The FAQ created through the FAQ creation screen is registered as FAQ information. This FAQ information can be used as feedback information for the operator's inquiry handling work. For example, when the “FAQ Search” button on the case information input screen of FIG. 17 is selected, the registered FAQ information is displayed on the operator terminal, and the inquiry content is displayed. In addition, it is possible to reduce the burden of the input work, and to speed up the answer to the inquiry and to reduce the burden of the answer content input work.

図3は、FAQ作成支援システム3の構成ブロック図である。FAQ作成支援システム3は、FAQ作成支援サーバ100とDBサーバ200とを含んで構成される。FAQ作成支援サーバ100がFAQ作成支援機能全体を制御する制御部として機能し、DBサーバ200が、FAQ作成に用いられる各種情報を記憶する記憶部として機能する。本実施形態のFAQ作成支援システムは、1つ又は複数のコンピュータで構成することができる。   FIG. 3 is a configuration block diagram of the FAQ creation support system 3. The FAQ creation support system 3 includes a FAQ creation support server 100 and a DB server 200. The FAQ creation support server 100 functions as a control unit that controls the entire FAQ creation support function, and the DB server 200 functions as a storage unit that stores various types of information used for FAQ creation. The FAQ creation support system of this embodiment can be configured by one or a plurality of computers.

FAQ作成支援サーバ100は、管理者端末4(FAQの作成者が操作するFAQ作成者端末)との間の通信制御を遂行する通信制御部120、メモリ130、FAQ作成支援サーバ全体の制御を司るCPU(制御部)110とを含む。   The FAQ creation support server 100 controls the communication control unit 120 that performs communication control with the administrator terminal 4 (the FAQ creator terminal operated by the FAQ creator), the memory 130, and the entire FAQ creation support server. CPU (control unit) 110.

制御部110は、認証部111、代表文生成部112、FAQ候補制御部113及びFAQ作成制御部114を含んで構成される。   The control unit 110 includes an authentication unit 111, a representative sentence generation unit 112, an FAQ candidate control unit 113, and an FAQ creation control unit 114.

DBサーバ200は、画面情報210、ケース情報220、代表文情報230、FAQ情報240をそれぞれ記憶し、代表文情報230は、代表文生成部112による代表文生成処理を通じて生成される問合せ代表文231、階層化問合せ代表文232及び回答代表文233を含む。   The DB server 200 stores screen information 210, case information 220, representative sentence information 230, and FAQ information 240, respectively. The representative sentence information 230 is a query representative sentence 231 generated through representative sentence generation processing by the representative sentence generation unit 112. , A hierarchical inquiry representative sentence 232 and an answer representative sentence 233.

<代表文生成処理>
図8から図16を参照して、代表文生成処理について説明する。本実施形態の代表文生成処理は、問合せ文とその回答文を含む文書(ケース情報)の文書集合を入力データとして、問合せの代表文(要約)と回答の代表文をそれぞれ生成する。
<Representative sentence generation processing>
The representative sentence generation processing will be described with reference to FIGS. The representative sentence generation processing of this embodiment generates a representative sentence (summary) of an inquiry and a representative sentence of an answer by using a document set of documents (case information) including the inquiry sentence and the answer sentence as input data.

図8に示すように、代表文生成部112は、構文解析部1121、代表文候補抽出部1122、文生成集約部1123及び代表文決定部1124を含んで構成され、また、抽出ルール記憶部250を含む。なお、抽出ルール記憶部250は、DBサーバ200に含まれるように構成してもよく、また別途の記憶部としてFAQ作成支援システム3に含まれてもよい。   As shown in FIG. 8, the representative sentence generation unit 112 includes a syntax analysis unit 1121, a representative sentence candidate extraction unit 1122, a sentence generation aggregation unit 1123, and a representative sentence determination unit 1124, and an extraction rule storage unit 250. including. The extraction rule storage unit 250 may be configured to be included in the DB server 200 or may be included in the FAQ creation support system 3 as a separate storage unit.

代表文生成部112は、図18に示したケース情報220を参照し、各ケース情報の問合せフィールドに記憶されるオペレータが入力した顧客から受付けた問合せ文(テキスト形式)を1つの文書とし、分析対象元全ての問合せ文を集めた問合せの第1文書集合を生成する。同様に、分析対象元全ての回答を集めた回答の第2文書集合を生成する。各文書集合の問合せ文及び回答文は、元の文書IDが関連付けられている。   The representative sentence generation unit 112 refers to the case information 220 shown in FIG. 18, analyzes the inquiry sentence (text format) received from the customer input by the operator stored in the inquiry field of each case information as one document, and analyzes it. A first document set of a query in which query statements of all target sources are collected is generated. Similarly, a second document set of answers obtained by collecting all answers of the analysis source is generated. The original sentence ID is associated with the inquiry sentence and the answer sentence of each document set.

本実施形態では、構文解析処理と抽出ルールを用いて、問合せの文書集合から問合せの代表文を、回答の文書集合から回答の代表文をそれぞれ生成する。   In this embodiment, using a parsing process and an extraction rule, a query representative sentence is generated from the query document set, and a reply representative sentence is generated from the answer document set.

本実施形態の構文解析処理は、文を構成する複数の自立語および当該自立語間の係り受け関係を解析する。解析結果は、複数の自立語および当該自立語間の係り受け関係がノードおよびアークを用いて表現される。本実施形態の構文解析結果は、依存構造木として生成される。   The parsing process of this embodiment analyzes a plurality of independent words constituting a sentence and a dependency relationship between the independent words. In the analysis result, a plurality of independent words and dependency relationships between the independent words are expressed using nodes and arcs. The parsing result of this embodiment is generated as a dependency structure tree.

ノードは、依存構造木において自立語を表す。このノードには、当該自立語の見出し語、当該見出し語の品詞および当該見出し語の付属語が付与される。ノードに付与される自立語の見出し語は、当該自立語の文字列を示す。ノードに付与される見出し語の品詞は、当該見出し語(つまり、ノードによって表される自立語)の品詞を表す。ノードに付与される品詞には、例えば名詞、サ変名詞、動詞、形容詞、副詞および連体詞等が含まれる。   A node represents an independent word in the dependency structure tree. This node is given a headword of the independent word, a part of speech of the headword, and an adjunct to the headword. The headword of the independent word given to the node indicates a character string of the independent word. The part of speech of the headword given to the node represents the part of speech of the headword (that is, the independent word represented by the node). The part of speech given to a node includes, for example, a noun, a sa variable noun, a verb, an adjective, an adverb and a conjunction.

ノードに付与される見出し語の付属語は、当該見出し語に付随する付属語を表す。ノードに付与される見出し語の付属語には、例えば「が」、「を」、「の」および「に」のような助詞等が含まれる。   An adjunct word attached to a node represents an adjunct word accompanying the entry word. The adjunct to the headword given to the node includes particles such as “GA”, “NO”, “NO” and “NI”.

アークは、依存構造木においてノード間の構文的な係り受け関係を表す。このアークには、ノード間(自立語間)の係り受け関係の種類が付与される。アークに付与される係り受け関係の種類には、例えばガ格、ヲ格、連体修飾および隣接等が含まれる。なお、依存構造木においては、アークは例えば矢印により記述される。このアークの矢印は、ノード間の係り受け関係における係り元のノードから係り先のノードに向かうものとする。   An arc represents a syntactic dependency between nodes in a dependency structure tree. This arc is given a dependency type between nodes (independent words). The types of dependency relationships given to the arc include, for example, ga rating, wo rating, linkage modification, and adjacency. In the dependency structure tree, the arc is described by an arrow, for example. It is assumed that the arc arrow points from the source node to the destination node in the dependency relationship between the nodes.

以降の説明では、1つのアークを用いて表される2つのノード間の係り受け関係において、当該アークにおける係り先のノード(つまり、1つのアークにおける終点となるノード)を親ノードと称する。一方、1つのアークを用いて表される2つのノード間の係り受け関係において、当該アークにおける係り元ノード(つまり、1つのアークにおける始点となるノード)を子ノードと称する。   In the following description, in a dependency relationship between two nodes represented by using one arc, a node at the destination of the arc (that is, a node that is an end point in one arc) is referred to as a parent node. On the other hand, in a dependency relationship between two nodes represented by using one arc, a dependency source node in the arc (that is, a node that is a starting point in one arc) is referred to as a child node.

図9(a)は、2つのノードおよび当該ノード間の係り受け関係を表すアークを用いて表現される依存構造木の一例である。図9(a)の依存構文木は、ノード201およびノード202がアーク203によって繋がれ、ノード201が親ノード、ノード202が子ノードに相当する。図9(a)に示すような依存構造木を組み合せることにより、複数の自立語を含む文の構文解析結果(依存構造木)が表現される。   FIG. 9A is an example of a dependency structure tree expressed using arcs representing two nodes and a dependency relationship between the nodes. In the dependency syntax tree of FIG. 9A, the node 201 and the node 202 are connected by the arc 203, the node 201 corresponds to the parent node, and the node 202 corresponds to the child node. By combining dependency structure trees as shown in FIG. 9A, a syntax analysis result (dependency structure tree) of a sentence including a plurality of independent words is expressed.

図9(b)は、図18の文書ID「1」が付与されている文書を構成する問合せ文のうちの1つ目の文「急に黒のインクが出なくなり印刷ができません」の構文解析結果の一例である。   FIG. 9B shows a syntax analysis of the first sentence “suddenly black ink does not come out and cannot be printed” of the query sentences constituting the document with the document ID “1” in FIG. It is an example of a result.

ルートノードとは、親ノードを持たないノードであり、図9(b)の例では、見出し語が「できません」のノードである。また、ルートノードに対する子ノードを、第一世代子ノードと定義し、図9(b)の例では、第一世代子ノードは、見出し語が「出なくなり」のノードと、見出し語が「印刷」であるノードである。そして、子ノードが存在しない(つまり、アークにより子ノードとつながっていない)ノードをリーフノードと定義し、図9(b)の例では、見出し語が「急に」、「黒」「印刷」の各ノードが、リーフノードに相当する。   The root node is a node that does not have a parent node. In the example of FIG. 9B, the root word is a node that cannot be entered. Further, a child node for the root node is defined as a first generation child node. In the example of FIG. 9B, the first generation child node has a headword of “no longer appearing” and a headword of “print”. Is a node. Then, a node having no child node (that is, not connected to the child node by an arc) is defined as a leaf node. In the example of FIG. 9B, the headwords are “suddenly”, “black”, “print”. Each node corresponds to a leaf node.

本実施形態では、文書集合を構成する各文書内の各問合せ文に対して個別に構文解析を行い、解析結果として得られた依存構造木から、代表文の候補(代表文候補)を抽出する。代表文候補を抽出するルールは、代表文候補抽出ルールとして抽出ルール記憶部250に予め記憶されている。本実施形態では、解析結果として得られる依存構造木に対し、代表文候補抽出ルールを適用することにより、問合せ文から1つ又は複数の代表文候補を抽出する。   In the present embodiment, each query statement in each document constituting the document set is individually parsed, and representative sentence candidates (representative sentence candidates) are extracted from the dependency structure tree obtained as an analysis result. . Rules for extracting representative sentence candidates are stored in advance in the extraction rule storage unit 250 as representative sentence candidate extraction rules. In the present embodiment, one or more representative sentence candidates are extracted from the query sentence by applying the representative sentence candidate extraction rule to the dependency structure tree obtained as an analysis result.

代表文候補抽出ルールを用いた代表文候補の抽出処理について説明する。本実施形態では、1つの依存構造木に対して複数の抽出ルールの各々が適用されることにより、当該抽出ルール毎に当該依存構造木から部分依存構造木が抽出される。さらに、抽出された部分依存構造木に対して抽出ルールの各々が適用されることにより、部分依存構造木が抽出される。つまり、代表文候補の抽出処理においては、抽出ルール毎に抽出された各部分構造木が代表文候補となる。   The representative sentence candidate extraction process using the representative sentence candidate extraction rule will be described. In the present embodiment, by applying each of a plurality of extraction rules to one dependency structure tree, a partial dependency structure tree is extracted from the dependency structure tree for each extraction rule. Further, each of the extraction rules is applied to the extracted partial dependency structure tree, whereby the partial dependency structure tree is extracted. That is, in the representative sentence candidate extraction process, each partial structure tree extracted for each extraction rule becomes a representative sentence candidate.

図10は、代表文候補抽出ルールを説明する図であり、図10(a)は、1つの動詞を含む部分依存構造木を抽出するルール(抽出ルール1)、図10(b)は、分岐なし部分依存構造木を抽出するルール(抽出ルール2)の説明図である。   10A and 10B are diagrams for explaining representative sentence candidate extraction rules. FIG. 10A shows a rule for extracting a partial dependency structure tree including one verb (extraction rule 1), and FIG. 10B shows a branch. It is explanatory drawing of the rule (extraction rule 2) which extracts a none part dependence structure tree.

抽出ルール1は、1つの動詞を含む部分依存構造木のルールであり、図10(a)に示すように、依存構造木によって表される複数の自立語のうちの動詞に着目する。抽出ルール1によれば、依存構造木によって表される複数の自立語のうちの動詞に基づいて当該依存構造木が分割される。   The extraction rule 1 is a partial dependency structure tree rule including one verb, and as shown in FIG. 10A, focuses on a verb among a plurality of independent words represented by the dependency structure tree. According to the extraction rule 1, the dependency structure tree is divided based on a verb among a plurality of independent words represented by the dependency structure tree.

具体的には、抽出ルール1が適用される依存構造木において、ノードに付与されている見出し語の品詞が動詞であるノードおよび当該動詞ノードの親ノード間のアークが切断されることによって当該依存構造木が分割される。つまり、抽出ルール1では、分割された依存構造木の各々が部分依存構造木として抽出される。   Specifically, in the dependency structure tree to which the extraction rule 1 is applied, the dependency is obtained by cutting the arc between the node whose part of speech of the headword given to the node is a verb and the parent node of the verb node. The structural tree is split. That is, in the extraction rule 1, each divided dependency structure tree is extracted as a partial dependency structure tree.

抽出ルール2は、分岐なし部分依存構造木を抽出するルールであり、図10(b)に示すように、依存構造木(または、部分依存構造木)におけるルートノードおよび第1世代子ノード(つまり、ルートノードの子ノード)間の全てのアークの種類に着目する。これらのアークの中に、アークの種類がガ格、ヲ格、ニ格、カラ格、場所格および道具格であるアークが存在する場合に、抽出ルール2を適用する。なお、この抽出ルール2が適用されるアークの種類は予め設定されている。   The extraction rule 2 is a rule for extracting a partial dependency structure tree without a branch, and as shown in FIG. 10B, the root node and the first generation child node (that is, the dependency tree) (that is, the partial dependency structure tree). Pay attention to all types of arcs between child nodes of the root node). In these arcs, the extraction rule 2 is applied when there are arcs of which the types of arc are ga, wo, ni, kara, place, and tool. The type of arc to which this extraction rule 2 is applied is set in advance.

まず、部分依存構造木におけるルートノードおよび第1世代子ノード間の全てのアークの中から、アークの種類がガ格、ヲ格、ニ格、カラ格、場所格および道具格であるアークが探索される。   First, arcs whose arc types are ga, wo, ni, kara, place, and tool are searched from all arcs between the root node and the first generation child nodes in the partial dependency structure tree. Is done.

次に、ルートノードおよび第1世代子ノード間の全てのアークのうちアークの種類がガ格、ヲ格、ニ格、カラ格、場所格および道具格以外であるアーク(つまり、探索されたアーク以外のアーク)が切断される。   Next, among all arcs between the root node and the first generation child node, arcs whose arc types are other than ga, wo, ni, kara, place, and tool (that is, arcs searched for) Other arcs) are cut.

この後、アークが切断された後の部分依存構造木において、ルートノードおよび各リーフノード間における全てのノードおよびアークを含む部分依存構造木が抽出される。図10(b)の例では、3つの部分依存構造木が抽出される。   Thereafter, in the partial dependency structure tree after the arc is cut, a partial dependency structure tree including all nodes and arcs between the root node and each leaf node is extracted. In the example of FIG. 10B, three partial dependency structure trees are extracted.

また、抽出ルール2は、部分依存構造木から分岐のない部分依存構造木が抽出され、抽出ルール2が適用されて抽出された部分依存構造木は、分岐なし依存構造木となる。   In addition, in the extraction rule 2, a partial dependency structure tree without a branch is extracted from the partial dependency structure tree, and the partial dependency structure tree extracted by applying the extraction rule 2 becomes a branch-less dependency structure tree.

図11は、図9(b)の示した文「急に黒のインクが出なくなり印刷ができません」の依存構造木に、抽出ルール1及び/又は抽出ルール2を適用して抽出された部分依存構造木の一例である。このように1つの問合せ文から、構文解析と抽出ルールを用いて、部分依存構造木、すなわち、依存構造木形式の代表文候補が1つ又は複数抽出される。これらの代表文候補には、代表文候補ID、抽出元の文書ID及び抽出元フィールド(問合せ、回答等)が関連付けられる。   FIG. 11 shows a partial dependency extracted by applying the extraction rule 1 and / or the extraction rule 2 to the dependency structure tree of the sentence “suddenly black ink does not come out and cannot be printed” shown in FIG. 9B. It is an example of a structural tree. In this way, one or more partial dependency structure trees, that is, representative sentence candidates in the dependency structure tree format, are extracted from one query sentence using syntax analysis and extraction rules. These representative sentence candidates are associated with a representative sentence candidate ID, an extraction source document ID, and an extraction source field (inquiry, answer, etc.).

図11に示すように、図9(b)の構文解析結果(依存構造木)に、抽出ルール1を適用すると、2つの動詞「出なくなり」と「できません」で依存構造木が分割され、2つの部分依存構造木A、Bが抽出される。また、図9(b)の構文解析結果(依存構造木)に、抽出ルール1及び抽出ルール2を適用する(抽出ルール1を適用して得られた部分依存構造木に対して抽出ルール2を適用する)と、部分依存構造木Aから部分依存構造木Cが抽出され、部分依存構造木Cから部分依存構造木Dが抽出される。   As shown in FIG. 11, when the extraction rule 1 is applied to the parsing result (dependency structure tree) of FIG. 9B, the dependency structure tree is divided by two verbs “cannot be output” and “cannot be performed”. Two partial dependency structure trees A and B are extracted. Further, the extraction rule 1 and the extraction rule 2 are applied to the syntax analysis result (dependency structure tree) of FIG. When applied, the partial dependency structure tree C is extracted from the partial dependency structure tree A, and the partial dependency structure tree D is extracted from the partial dependency structure tree C.

そして、これら複数の部分依存構造木A、B、C、Dそれぞれを、1つの問合せ文の代表文候補として抽出し、代表文候補から代表文候補文を生成する。つまり、ノード間の構文的な係り受け関係(アーク)に基づいて依存構造木形式から文形式に変換した各代表文候補文を生成する。   Then, each of the plurality of partial dependency structure trees A, B, C, and D is extracted as a representative sentence candidate of one query sentence, and a representative sentence candidate sentence is generated from the representative sentence candidate. That is, each representative sentence candidate sentence converted from the dependency structure tree form into the sentence form is generated based on the syntactic dependency relation (arc) between the nodes.

例えば、図11に示すように、動詞ノードの見出し語を終止形とすることによって各部分依存構造木から代表文候補文を生成することができ、部分依存構造木Aから代表文候補文「急に黒のインクが出なくなる」、部分依存構造木Bから代表文候補文「印刷ができません」、部分依存構造木Cから代表文候補文「黒のインクが出なくなる」、部分依存構造木Dから代表文候補文「印刷ができません」が、それぞれ生成される。   For example, as shown in FIG. 11, a representative sentence candidate sentence can be generated from each partial dependency structure tree by setting the head word of the verb node as a final form. From the partial dependency structure tree B, the representative sentence candidate sentence “cannot print”, from the partial dependency structure tree C, the representative sentence candidate sentence “no black ink comes out”, from the partial dependency structure tree D A representative sentence candidate sentence “cannot be printed” is generated.

図12は、生成された代表文候補文の一例を示す図であり、生成された代表文候補文に、代表文候補文ID、抽出元の文書ID、及び抽出元フィールドを関連付けて、代表文候補文ID別に記憶する。   FIG. 12 is a diagram illustrating an example of a generated representative sentence candidate sentence. A representative sentence candidate sentence is associated with a representative sentence candidate sentence ID, an extraction source document ID, and an extraction source field. Stored by candidate sentence ID.

例えば、代表文候補文ID1の代表文候補文は「急に黒のインクが出なくなる」であり、文書ID「1」が関連付けられ、文書ID「1」が関連付けられる代表文候補文ID2の代表文候補文は「印刷ができません」となる。また、文書ID「1」が関連付けられる代表文候補文ID4の代表文候補文は「印刷ができません」となる。このように1つの文書IDに複数の代表文候補文が関連付けられ、1つの文書(問合せ文)から複数生成される代表文候補(代表文候補文)が、抽出元の文書IDで管理される。   For example, the representative sentence candidate sentence of the representative sentence candidate sentence ID1 is “no black ink suddenly comes out”, the document ID “1” is associated, and the representative sentence candidate sentence ID2 is associated with the document ID “1”. The sentence candidate sentence is “cannot print”. Further, the representative sentence candidate sentence of the representative sentence candidate sentence ID4 associated with the document ID “1” is “cannot be printed”. In this way, a plurality of representative sentence candidate sentences are associated with one document ID, and a plurality of representative sentence candidates (representative sentence candidate sentences) generated from one document (query sentence) are managed by the document ID of the extraction source. .

次に、代表文候補文を構成する文字列が同じである問合せ文同士を集約する処理を行い、集約代表文候補文を生成する。集約代表文候補文に関連付けられる文書IDの集合(文書ID数)は、集約元の代表文候補文に関連付けられている文書ID群の和集合となる。   Next, a process of aggregating query sentences having the same character string constituting the representative sentence candidate sentence is performed to generate an aggregated representative sentence candidate sentence. A set of document IDs (number of document IDs) associated with the aggregated representative sentence candidate sentence is a union of document ID groups associated with the aggregated representative sentence candidate sentence.

図13は、図12に示した複数の代表文候補文を集約した集約代表文候補文の一例を示す図であり、集約代表文候補文ID「2」の集約代表文候補文「印刷ができません」に関連付けられる文書IDの集合は「1,2,・・・」となる。   FIG. 13 is a diagram illustrating an example of an aggregate representative sentence candidate sentence in which a plurality of representative sentence candidate sentences shown in FIG. 12 are aggregated, and the aggregate representative sentence candidate sentence “2” cannot be printed with the aggregate representative sentence candidate sentence ID “2”. The set of document IDs associated with "is" 1, 2, ... ".

すなわち、代表文候補文を集約する処理は、一の文書IDに関連付けられる代表文候補文を構成する文字列を用いて他の文書IDに関連付けられる代表文候補文を検索し、異なる文書ID同士を1つ代表文候補文に関連付けて集約する処理である。   That is, the process of aggregating representative sentence candidate sentences is performed by searching for representative sentence candidate sentences associated with other document IDs using character strings constituting representative sentence candidate sentences associated with one document ID, and different document IDs. Is associated with one representative sentence candidate sentence.

そして、集約された集約代表文候補文を用い、抽出元の文書集合の代表的な内容を表す代表文を抽出することにより、代表文を生成する。   Then, the representative sentence is generated by extracting the representative sentence representing the representative contents of the extraction source document set using the aggregated representative sentence candidate sentence.

例えば、代表的な内容の度合い(代表度)の一例として説明すると、文書IDが異なる数、すなわち、代表文候補文に関連付けられる抽出元の文書ID数(文書数)を用いることができ、集約代表文候補文に紐付いている文書ID数が多いものほど、代表度が高い文書集合の内容を表している文として、抽出(決定)することができる。   For example, as an example of a representative content level (representativeness), the number of different document IDs, that is, the number of source document IDs (number of documents) associated with the representative sentence candidate sentence can be used and aggregated. As the number of document IDs associated with the representative sentence candidate sentence is larger, it can be extracted (determined) as a sentence representing the contents of the document set having a higher representative degree.

そこで、図13の問合せの集約代表文候補文の各々に紐付いている文書IDの数を用いて、文書数(代表度)が多い順に所定数(例えば10個)の集約代表文候補文を抽出し、抽出された所定数の集約代表文候補文を代表文として決定することができる。図14は、集約代表文候補文の中から決定された問合せの代表文(要約)の一例を示す図であり、それぞれの代表文は、代表文ID別に、生成元の文書ID(代表文として決定された集約代表文候補文に紐付いている各文書ID)が関連付けられている。   Accordingly, a predetermined number (for example, 10) of aggregated representative sentence candidate sentences are extracted in descending order of the number of documents (representativeness) using the number of document IDs associated with each of the aggregated representative sentence candidate sentences of the query in FIG. Then, the predetermined number of extracted representative representative sentence candidate sentences can be determined as representative sentences. FIG. 14 is a diagram illustrating an example of a representative sentence (summary) of a query determined from the aggregated representative sentence candidate sentences. Each representative sentence is generated by document ID (representative sentence) for each representative sentence ID. Each document ID) associated with the determined aggregated representative sentence candidate sentence is associated.

このように本実施形態の代表文生成処理は、問合せ文とその回答文を含む文書の文書集合(複数の、問合せ文とその回答文を含む文書)から、当該文書集合の問合せ文及び回答文それぞれを適切に表す各代表文を生成する。各代表文は、生成元である文書集合の各文書が各々関連付けられ、異なる各文書が代表文で関連付けられる(異なる各文書が代表文によって集約(分類)される)。   As described above, the representative sentence generation processing according to the present embodiment performs the query sentence and the answer sentence of the document set from the document set (a plurality of documents including the query sentence and the answer sentence) including the query sentence and the answer sentence. Each representative sentence that appropriately represents each is generated. Each representative sentence is associated with each document of the document set as a generation source, and each different document is associated with the representative sentence (different documents are aggregated (classified) by the representative sentence).

図8(b)は、本実施形態の代表文生成処理フローを示す図である。   FIG. 8B is a diagram showing a representative sentence generation processing flow of the present embodiment.

構文解析部1121は、ケース情報220の問合せ文及びその回答文を対として含む各文書を文書ID別に抽出し(S301)、複数の自立語を含む問合せ文によって構成される文書集合を対象に、各問合せ文の構文解析処理を遂行する(S302)。   The syntax analysis unit 1121 extracts each document including the query sentence of the case information 220 and the answer sentence as a pair by document ID (S301), and targets a document set including query sentences including a plurality of independent words. The parsing process of each query statement is performed (S302).

構文解析部1121は、対象の文書集合に含まれる各文書を構成する問合せ文、つまり、当該文書集合に含まれる各文書中の全ての問合せ文それぞれについて構文解析を行う。構文解析部1121の構文解析の結果は、依存構造木によって表現される。なお、1つの問合せ文が構文解析された結果は、1つの依存構造木であり、問合せ文毎に依存構造木を生成する。   The syntax analysis unit 1121 performs syntax analysis on each query sentence that constitutes each document included in the target document set, that is, each query sentence included in each document included in the document set. The result of the syntax analysis by the syntax analysis unit 1121 is expressed by a dependency structure tree. Note that the result of parsing one query statement is one dependency structure tree, and a dependency structure tree is generated for each query statement.

代表文候補抽出部1122は、構文解析部1122によって生成された依存構造木の一部である部分構造木である代表文候補を、当該依存構造木から抽出する(ステップS303)。代表文候補抽出部1122は、抽出ルール記憶部250に格納されている抽出ルールを用いて代表文候補を抽出する。なお、代表文抽出部1122によって抽出される代表文候補(部分依存構造木)は、少なくとも2つの自立語および当該自立語間の係り受け関係を表す構造木である。また、代表文候補抽出部1122は、構文解析部1121によって問合せ文毎に生成された依存構造木の各々から代表文候補を抽出する。   The representative sentence candidate extraction unit 1122 extracts a representative sentence candidate that is a partial structure tree that is a part of the dependency structure tree generated by the syntax analysis unit 1122 from the dependency structure tree (step S303). The representative sentence candidate extraction unit 1122 extracts representative sentence candidates using the extraction rules stored in the extraction rule storage unit 250. The representative sentence candidate (partial dependency structure tree) extracted by the representative sentence extracting unit 1122 is a structure tree that represents at least two independent words and a dependency relationship between the independent words. The representative sentence candidate extraction unit 1122 extracts representative sentence candidates from each of the dependency structure trees generated for each query sentence by the syntax analysis unit 1121.

抽出ルール記憶部250に記憶されている抽出ルールは、依存構造木に適用され、当該依存構造木から代表文候補を抽出することができるルールである。本実施形態では異なる複数の抽出ルール1、2が抽出ルール記憶部250に記憶され、本実施形態の代表文候補抽出部1122は、抽出ルール1を適用した後に抽出ルール2を適用し、各部分構造木である代表文候補を対象依存構造木から抽出する。   The extraction rules stored in the extraction rule storage unit 250 are rules that can be applied to the dependency structure tree and extract representative sentence candidates from the dependency structure tree. In the present embodiment, a plurality of different extraction rules 1 and 2 are stored in the extraction rule storage unit 250, and the representative sentence candidate extraction unit 1122 of the present embodiment applies the extraction rule 2 after applying the extraction rule 1, and each part A representative sentence candidate that is a structure tree is extracted from the object-dependent structure tree.

文生成集約部1123は、代表文候補抽出部1122によって抽出された代表文候補(部分構造木)によって表される複数の自立語および当該自立語間の係り受け関係に基づいて、当該代表文候補から代表文候補文(平文)を生成する(S304)。   The sentence generation / aggregation unit 1123 determines the representative sentence candidate based on a plurality of independent words represented by the representative sentence candidate (partial structure tree) extracted by the representative sentence candidate extraction unit 1122 and the dependency relationship between the independent words. The representative sentence candidate sentence (plain text) is generated from (S304).

次に、文生成集約部1123は、生成された代表文候補文を集約することによって、集約代表文候補文を生成する(ステップS305)。文生成集約部1123は、生成された代表文候補文のうち、同一の代表文候補文を1つの集約代表文候補文に集約する。このとき、文生成集約部1123は、集約代表文候補文を識別するための集約代表文候補文ID別に、当該集約代表文候補文に集約された代表文候補文に関連付けられた各文書IDを関連付ける。   Next, the sentence generation / aggregation unit 1123 generates an aggregated representative sentence candidate sentence by aggregating the generated representative sentence candidate sentences (step S305). The sentence generation / aggregation unit 1123 aggregates the same representative sentence candidate sentences into one aggregated representative sentence candidate sentence among the generated representative sentence candidate sentences. At this time, the sentence generation / aggregation unit 1123 sets each document ID associated with the representative sentence candidate sentence aggregated in the aggregated representative sentence candidate sentence for each aggregated representative sentence candidate sentence ID for identifying the aggregated representative sentence candidate sentence. Associate.

代表文決定部1124は、文生成集約部1123によって生成された集約代表文候補文の中から代表文を決定(選択)する(ステップS306)。このとき、代表文決定部1124は、文生成集約部1123によって生成された集約代表文候補文に付与された文書IDの数(つまり、当該集約代表文候補文に集約された代表文候補文の数)に基づいて代表文を決定する。   The representative sentence determination unit 1124 determines (selects) a representative sentence from the aggregated representative sentence candidate sentences generated by the sentence generation / aggregation unit 1123 (step S306). At this time, the representative sentence determination unit 1124 determines the number of document IDs assigned to the aggregated representative sentence candidate sentences generated by the sentence generation / aggregation part 1123 (that is, the representative sentence candidate sentences aggregated in the aggregated representative sentence candidate sentences). The representative sentence is determined based on the number.

代表文決定部1124は、代表文として決定された複数の集約代表文候補文に代表文識別IDを割り当てるとともに、当該集約代表文候補文に集約された代表文候補文に関連付けられる各文書IDを関連付けてDBサーバ200(所定の記憶領域)に記憶する(S307)。   The representative sentence determination unit 1124 assigns representative sentence identification IDs to a plurality of aggregate representative sentence candidate sentences determined as representative sentences, and assigns each document ID associated with the representative sentence candidate sentences aggregated in the aggregate representative sentence candidate sentences. The database is associated and stored in the DB server 200 (predetermined storage area) (S307).

また、代表文生成部112は、ケース情報220の問合せ文及びその回答文を対として含む各文書を文書ID別に抽出し、複数の自立語を含む回答文によって構成される文書群を対象に、代表文生成処理を遂行し、生成元の文書IDが紐付けられた回答の代表文(回答の要約文)を生成し、回答代表文として決定された複数の集約代表文候補文に代表文識別IDを割り当てるとともに、当該集約代表文候補文に集約された代表文候補文に関連付けられる各文書IDを関連付けてDBサーバ200(所定の記憶領域)に記憶する(図15)。   In addition, the representative sentence generation unit 112 extracts each document including the query sentence of the case information 220 and the answer sentence as a pair for each document ID, and targets a document group composed of answer sentences including a plurality of independent words. Performs representative sentence generation processing, generates a representative sentence (answer summary sentence) associated with the document ID of the generation source, and identifies a representative sentence to a plurality of aggregate representative sentence candidate sentences determined as the representative sentence In addition to assigning an ID, each document ID associated with the representative sentence candidate sentence aggregated with the aggregated representative sentence candidate sentence is associated and stored in the DB server 200 (predetermined storage area) (FIG. 15).

なお、本実施形態では、生成された問合せ代表文に紐付く生成元の文書群を対象にさらに代表文生成処理を遂行し、生成された一の問合せ代表文に対するサブクラスの問合せ代表文(サブ問合せ代表文)を生成する。   In the present embodiment, a representative sentence generation process is further performed on a generation source document group associated with the generated query representative sentence, and a subclass query representative sentence (sub-query) for the generated one query representative sentence is processed. Representative sentence).

例えば、図14の代表文ID「1」が付与されている代表文「印刷ができません」に紐付いている各文書IDの複数の文書を対象として代表文生成処理を遂行することで、図16に示すように、代表文「印刷ができません」に対して「黒のインクが出なくなる」、「黒のみできません」といった、サブ代表文を生成する。すなわち、代表文「印刷ができません」が含まれている文書群を対象として生成される代表文が、上位階層の代表文「印刷ができません」の下位階層のサブ代表文として関連付けられ、問合せ代表文の階層構造を含む階層化問合せ代表文を生成することができる。なお、生成されるサブ代表文は、サブ代表文ID別に上位階層の代表文IDに関連付けられて、DBサーバ200(所定の記憶領域)に記憶される。   For example, by performing the representative sentence generation processing for a plurality of documents with respective document IDs associated with the representative sentence “cannot be printed” assigned the representative sentence ID “1” in FIG. As shown, sub representative sentences such as “black ink cannot be produced” and “black cannot be produced” are generated for the representative sentence “cannot print”. In other words, a representative sentence generated for a group of documents containing the representative sentence “Cannot print” is associated as a sub representative sentence in the lower hierarchy of the representative sentence “Cannot print” in the upper hierarchy, and the query representative sentence. It is possible to generate a hierarchical query representative sentence including the hierarchical structure. The generated sub representative sentence is stored in the DB server 200 (predetermined storage area) in association with the upper representative sentence ID for each sub representative sentence ID.

<FAQ作成支援機能>
図4から図7を参照して、本実施形態のFAQ作成支援機能について説明する。本実施形態のFAQ作成支援機能は、代表文生成処理で生成された階層化問合せ代表文及び回答代表文を用いて、問合せ−回答マトリクス図(FAQ候補マトリクス図)及びFAQ作成画面を通じたFAQ作成環境を管理者端末4(FAQ作成者)に提供する。
<FAQ creation support function>
The FAQ creation support function of this embodiment will be described with reference to FIGS. The FAQ creation support function of the present embodiment uses the hierarchical query representative sentence and the answer representative sentence generated by the representative sentence generation process, and creates a FAQ through an inquiry-response matrix diagram (FAQ candidate matrix diagram) and a FAQ creation screen. The environment is provided to the administrator terminal 4 (FAQ creator).

FAQ作成制御部114は、認証部111による認証処理を経た管理者端末4から伝送されるFAQ作成要求に基づいて、FAQ作成処理を遂行する。   The FAQ creation control unit 114 performs FAQ creation processing based on the FAQ creation request transmitted from the administrator terminal 4 that has undergone authentication processing by the authentication unit 111.

FAQ作成制御部114は、FAQ作成要求を受信した場合、FAQ候補制御部113に、階層化問合せ代表文及び回答代表文を用いた問合せ−回答マトリクス図の生成命令を出力する。   When the FAQ creation control unit 114 receives the FAQ creation request, the FAQ creation control unit 114 outputs, to the FAQ candidate control unit 113, an instruction to generate a query-answer matrix diagram using the hierarchical query representative sentence and the answer representative sentence.

生成命令が入力されたFAQ候補制御部113は、FAQ候補評価情報の生成処理及び問合せ−回答マトリクス図の生成処理を遂行する。FAQ候補制御部113は、DBサーバ200に記憶されている階層化問合せ代表文233及び回答代表文232を参照して、一の問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、カウントされた各文書数を用いて一の問合せ代表文と一の回答代表文とのペアに対するFAQ候補評価情報を生成する。   The FAQ candidate control unit 113 to which the generation instruction is input performs processing for generating FAQ candidate evaluation information and processing for generating an inquiry-answer matrix diagram. The FAQ candidate control unit 113 refers to the hierarchical query representative sentence 233 and the answer representative sentence 232 stored in the DB server 200, and associates each document related to one query representative sentence with each of the answer representative sentences. The number of documents matching each existing document is counted, and FAQ candidate evaluation information for a pair of one inquiry representative sentence and one answer representative sentence is generated using each counted document number.

本実施形態のFAQ候補評価情報は、カウントされた文書数であり、例えば、問合せ代表文「印刷ができません」に関連付けられる文書IDが、「1、2、5、8、9、10、11・・・・」であり、回答代表文「印刷エラーの自動修復を実施する」に関連付けられる文書IDが「8、11、25、31、・・・」である場合、2つの文書ID「8」と「11」がマッチングし、文書ID数を「2」とカウントする。
る。
The FAQ candidate evaluation information of this embodiment is the number of counted documents. For example, the document ID associated with the inquiry representative sentence “cannot be printed” is “1, 2, 5, 8, 9, 10, 11,. .., And the document ID associated with the reply representative sentence “Perform automatic error correction of printing” is “8, 11, 25, 31,...”, Two document IDs “8” And “11” are matched, and the number of document IDs is counted as “2”.
The

また、FAQ候補制御部113は、一のサブ問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、カウントされた各文書数を用いてサブ問合せ代表文と回答代表文とのペアに対するFAQ候補評価情報も生成する。   Further, the FAQ candidate control unit 113 counts the number of documents in which each document associated with one sub-query representative sentence matches each document associated with each answer representative sentence, and uses the counted number of documents. FAQ candidate evaluation information for a pair of a sub inquiry representative sentence and an answer representative sentence is also generated.

本実施形態のFAQ候補評価情報の生成処理によって生成されるFAQ候補評価情報(文書ID数)は、一の問合せ代表文と一の回答代表文との関係を表す情報であり、FAQ候補制御部113は、生成されたFAQ候補評価情報(カウントした文書ID数)を、該当の問合せ代表文と回答代表文の行と列が交差する位置の表示ブロックに表示し、問合せ代表文と回答代表文との対を定量的に評価した評価情報として提供する。なお、カウント数が「0」である場合は、「0」を表示ブロックに表示する。   The FAQ candidate evaluation information (number of document IDs) generated by the FAQ candidate evaluation information generation process of the present embodiment is information indicating the relationship between one inquiry representative sentence and one answer representative sentence, and is an FAQ candidate control unit. 113 displays the generated FAQ candidate evaluation information (the number of document IDs counted) in a display block at a position where the row and column of the corresponding query representative sentence and the reply representative sentence intersect, and the query representative sentence and the reply representative sentence It is provided as evaluation information that quantitatively evaluates the pair. When the count number is “0”, “0” is displayed on the display block.

FAQ候補制御部113は、複数の問合せ代表文別及びサブ問合せ代表文別に全ての回答代表文との間のFAQ候補評価情報を生成し、問合せ代表文及び回答代表文を縦横に配置したマトリクス図であって、問合せ代表文と回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示された問合せ−回答マトリクス図を生成する。このとき、FAQ候補制御部113は、問合せ代表文のサブ問合せ代表文を該当の問合せ代表文と共に、マトリクス図に配置し、サブ問合せ代表文と回答代表文とが縦横で交わる位置に、サブ問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示されるように、問合せ−回答マトリクス図を生成する。   The FAQ candidate control unit 113 generates FAQ candidate evaluation information between all answer representative sentences for each query representative sentence and sub-query representative sentence, and the query representative sentence and the answer representative sentence are arranged vertically and horizontally. A query-response matrix diagram is generated in which FAQ candidate evaluation information corresponding to a pair of the query representative sentence and the answer representative sentence is displayed at a position where the query representative sentence and the answer representative sentence intersect vertically and horizontally. . At this time, the FAQ candidate control unit 113 arranges the sub-query representative sentence of the query representative sentence together with the corresponding query representative sentence in the matrix diagram, and the sub-query at the position where the sub-query representative sentence and the answer representative sentence intersect vertically and horizontally. An inquiry-answer matrix diagram is generated so that FAQ candidate evaluation information corresponding to a pair of representative sentence and answer representative sentence is displayed.

例えば、縦軸に図16の問合せ代表文を代表文ID別及びサブ代表文ID別に配列し、同様に横軸に図15の回答代表文を回答代表文ID別に配列する。なお、図16の例では、代表文ID「1」に紐付く下位階層のサブ代表文は、代表文ID「1」と代表文ID「2」との間に配列する。   For example, the inquiry representative sentences in FIG. 16 are arranged by representative sentence ID and sub representative sentence ID on the vertical axis, and similarly, the answer representative sentences in FIG. 15 are arranged by answer representative sentence ID on the horizontal axis. In the example of FIG. 16, the sub-representative sentences in the lower hierarchy linked to the representative sentence ID “1” are arranged between the representative sentence ID “1” and the representative sentence ID “2”.

図4は、管理者端末4の表示される2次元の問合せ−回答マトリクス図の一例であり、縦軸(列)に各問合せ代表文(サブ問合せ代表文を含む)、横軸(行)に各回答代表文がそれぞれ配置され、問合せ代表文と回答代表文との対の関係を文書ID数で定量的に表した格子状の各表示ブロックが含まれる。   FIG. 4 is an example of a two-dimensional query-answer matrix diagram displayed on the administrator terminal 4. Each query representative sentence (including sub-query representative sentences) is shown on the vertical axis (column), and each horizontal axis (row) is shown on the horizontal axis (row). Each answer representative sentence is arranged, and each grid-like display block that quantitatively represents the relationship between the query representative sentence and the answer representative sentence in terms of the number of document IDs is included.

管理者端末4のFAQ作成者は、マウス等の操作入力手段を用いて、問合せ−回答マトリクス図上の問合せ代表文と回答代表文との関係を表した文書ID数が表示される格子状の各表示ブロックを選択することができる。FAQ作成者によって格子状の各表示ブロックを選択されると、FAQ作成制御部114は、選択された表示ブロックに対応する問合せ代表文及び回答代表文に基づくFAQ作成画面を管理者端末4に提供する。   The FAQ creator of the manager terminal 4 uses an operation input means such as a mouse to display a grid-like number in which the number of document IDs representing the relationship between the query representative sentence and the reply representative sentence on the query-answer matrix diagram is displayed. Each display block can be selected. When each of the grid-like display blocks is selected by the FAQ creator, the FAQ creation control unit 114 provides the administrator terminal 4 with a FAQ creation screen based on the inquiry representative sentence and the answer representative sentence corresponding to the selected display block. To do.

FAQ作成制御部114は、問合せ−回答マトリクス図上の格子状の表示ブロック(FAQ候補評価情報)が選択された場合、選択された表示ブロックの文書ID数が「1」以上であるか否かを判別する。   If the grid-like display block (FAQ candidate evaluation information) on the inquiry-answer matrix diagram is selected, the FAQ creation control unit 114 determines whether the number of document IDs of the selected display block is “1” or more. Is determined.

選択された表示ブロックの文書ID数が「1」以上であると判別された場合、FAQ作成制御部114は、選択された表示ブロックに対応する問合せ代表文及び回答代表文を含むFAQ作成画面を生成し、管理者端末4に伝送する。選択された表示ブロックの文書ID数が「1」以上でない、すなわち、選択された表示ブロックの文書ID数が「0」であると判別された場合、FAQ作成制御部114は、FAQ作成画面を通じた問合せ代表文及び回答代表文に基づくFAQ作成処理を遂行しない。   When it is determined that the number of document IDs of the selected display block is “1” or more, the FAQ creation control unit 114 displays an FAQ creation screen including an inquiry representative sentence and an answer representative sentence corresponding to the selected display block. It is generated and transmitted to the administrator terminal 4. When it is determined that the number of document IDs of the selected display block is not “1” or more, that is, the number of document IDs of the selected display block is “0”, the FAQ creation control unit 114 passes through the FAQ creation screen. The FAQ preparation process based on the inquiry representative sentence and the answer representative sentence is not performed.

このとき、FAQ作成制御部114は、ケース情報220を参照して問合せ文とその回答文を含む文書の文書集合の中から選択された表示ブロック(FAQ候補評価情報)に対応する問合せ代表文に関連付く各抽出元の問合せ文それぞれを文書IDに基づいて抽出し、選択された選択された表示ブロックに対応する問合せ代表文及び回答代表文と、抽出された各抽出元の問合せ文とを含むFAQ作成画面を生成する。   At this time, the FAQ creation control unit 114 refers to the case information 220 and displays the query representative sentence corresponding to the display block (FAQ candidate evaluation information) selected from the document set of the document including the query sentence and the answer sentence. Each query source of each extraction source to be associated is extracted based on the document ID, and includes a query representative sentence and an answer representative sentence corresponding to the selected selected display block, and a query sentence of each extracted source. Generate a FAQ creation screen.

図5は、本実施形態のFAQ作成画面例である。図5の例では、図4の問合せ−回答マトリクス図における問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」とに対応する表示ブロック「25」が選択された場合のFAQ作成画面である。問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」との対の定量的な評価(関係)を示す25個の問合せ文(25個の文書IDに紐付く問合せ文)を表示する問合せ一覧ブロックA、問合せ一覧ブロックAで選択された1の文書(文書IDに紐付く文書)の問合せ文とその回答文を表示する詳細情報表示ブロックB、問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」それぞれが、各入力欄に表示された新規FAQ作成ブロックCを含んで構成されている。   FIG. 5 is an example of the FAQ creation screen of the present embodiment. In the example of FIG. 5, the display block “25” corresponding to the query representative sentence “cannot print” and the reply representative sentence “check ink remaining amount” in the query-answer matrix diagram of FIG. 4 is selected. It is a FAQ creation screen. Twenty-five query sentences (inquiry sentences associated with 25 document IDs) showing a quantitative evaluation (relationship) between the inquiry representative sentence “cannot print” and the answer representative sentence “check remaining ink level” Query list block A for displaying, query information of one document (document linked to document ID) selected in query list block A and the detailed information display block B for displaying the response text, query representative sentence "cannot print" Each of the reply representative sentences “confirm ink remaining amount” includes a new FAQ creation block C displayed in each input field.

FAQ作成者は、FAQ作成画面において問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」とで分類される25個の抽出元の複数の各文書を問合せ一覧ブロックで見ることができ、問合せ一覧ブロックで選択した抽出元の1の文書の問合せ文とその回答文を詳細情報表示ブロックで見ることができる。FAQ作成制御部114は、問合せ一覧ブロックAに対する表示及び問合せ一覧ブロックAで選択された1の文書の問合せ文とその回答文を詳細情報表示ブロックBに表示する表示の各制御を遂行する。   The FAQ creator sees a plurality of 25 source documents in the query list block that are categorized as a query representative sentence “cannot print” and a reply representative sentence “check remaining ink” on the FAQ creation screen. It is possible to view the inquiry sentence and the answer sentence of the one source document selected in the inquiry list block in the detailed information display block. The FAQ creation control unit 114 performs each control of display on the query list block A and display of the query text of one document selected in the query list block A and its answer text on the detailed information display block B.

また、FAQ作成者は、新規FAQ作成ブロックの問合せ内容入力欄に表示された問合せ代表文「印刷できません」を編集したり、対応内容入力欄に表示された回答代表文「インクの残量を確認する」を編集することができ、登録ボタンを選択することで、FAQ作成者は、問合せ内容入力欄及び対応内容入力欄のそれぞれに表示(入力)されている問合せ文とその回答文を対としたFAQを登録(作成)することができる。   In addition, the FAQ creator edits the query representative sentence “Cannot print” displayed in the inquiry content input field of the new FAQ creation block, or the answer representative sentence “Check ink remaining amount” displayed in the corresponding content input field. By selecting the registration button, the FAQ creator makes a pair of the query text and the answer text displayed (input) in each of the query content input field and the corresponding content input field. FAQ can be registered (created).

図6は、本実施形態のFAQ作成画面から登録されたFAQ情報の一例を示す図であり、FAQ作成制御部114は、登録ボタンの選択操作に基づいて、作成されたFAQ毎にFAQ識別ID、件名、問合せ内容、問合せ代表文ID、対応内容、回答代表文IDをFAQ情報240として記憶する。なお、本実施形態では、問合せ−回答マトリクス図上の問合せ代表文及び回答代表文との関係に基づくFAQ作成環境を提供するので、FAQ作成画面が問合せ代表文及び回答代表文に紐付くことになる。このため、FAQ作成画面を通じて作成されたFAQ情報には、問合せ−回答マトリクス図上の該当する問合せ代表文ID及び回答代表文IDがそれぞれ含まれて、FAQ情報240に記憶される。   FIG. 6 is a diagram illustrating an example of FAQ information registered from the FAQ creation screen according to the present embodiment, and the FAQ creation control unit 114 uses a FAQ identification ID for each created FAQ based on a selection button selection operation. The subject name, the inquiry content, the inquiry representative sentence ID, the correspondence contents, and the answer representative sentence ID are stored as FAQ information 240. In the present embodiment, an FAQ creation environment based on the relationship between the query representative sentence and the reply representative sentence on the query-answer matrix diagram is provided, so that the FAQ creation screen is associated with the query representative sentence and the reply representative sentence. Become. For this reason, the FAQ information created through the FAQ creation screen includes the corresponding query representative sentence ID and answer representative sentence ID on the query-answer matrix diagram, respectively, and is stored in the FAQ information 240.

なお、FAQ作成制御部114は、選択された表示ブロックに対応する問合せ代表文及び回答代表文を含むFAQ作成画面して管理者端末4に提供し、問合せ一覧ブロックAにする選択操作に基づいて、ケース情報220を参照して問合せ文とその回答文を含む文書の文書集合の中から選択された表示ブロック(FAQ候補評価情報)に対応する問合せ代表文に関連付く各抽出元の問合せ文それぞれを文書IDに基づいて抽出し、問合せ一覧ブロックAに、抽出した該当の問合せ文を表示するように制御できる。   The FAQ creation control unit 114 provides an FAQ creation screen including an inquiry representative sentence and an answer representative sentence corresponding to the selected display block to be provided to the administrator terminal 4 based on a selection operation to make the inquiry list block A. , Each extraction source query sentence associated with the query representative sentence corresponding to the display block (FAQ candidate evaluation information) selected from the document set of the document including the query sentence and the answer sentence with reference to the case information 220 Can be extracted based on the document ID, and the corresponding query sentence extracted can be displayed in the query list block A.

また、図4の例において問合せ−回答マトリクス図における問合せ代表文「印刷できません」のサブ問い合わせ代表文「ドキュメントが保留状態となる」と回答代表文「インクの残量を確認する」とに対応する表示ブロックが選択された場合、FAQ作成制御部114は、図5の新規FAQ作成ブロックCの問合せ内容入力欄に、選択されたサブ問合せ代表文とその上位層の問合せ代表文とを組み合わせた「印刷できません。ドキュメント保留状態となる」をFAQ候補として自動生成して表示することができる。つまり、サブクラスの問合せ代表文が選択された場合、上位層の問合せ代表文とその下位層の問合せ代表文とを組み合わせたFAQ作成候補(問合せ内容)を生成し、新規FAQ作成ブロックCに自動的に表示させることができる。   Further, in the example of FIG. 4, it corresponds to the sub-representative representative sentence “document is put on hold” of the inquiry representative sentence “cannot print” in the inquiry-answer matrix diagram and the reply representative sentence “check remaining ink level”. When the display block is selected, the FAQ creation control unit 114 combines the selected sub-query representative sentence and the query representative sentence of the higher layer in the inquiry content input column of the new FAQ creation block C in FIG. "Cannot print. Document will be on hold" can be automatically generated and displayed as a FAQ candidate. That is, when a subclass query representative sentence is selected, a FAQ creation candidate (query content) is generated by combining the query representative sentence of the upper layer and the query representative sentence of the lower layer, and is automatically sent to the new FAQ creation block C. Can be displayed.

図7は、本実施形態のFAQ作成支援サーバ100のFAQ作成支援処理フローを示す図である。   FIG. 7 is a diagram illustrating a FAQ creation support process flow of the FAQ creation support server 100 according to the present embodiment.

認証部111は、管理者端末4からFAQ作成要求を受信すると(S201)、認証処理を遂行する(S101)。   When receiving the FAQ creation request from the administrator terminal 4 (S201), the authentication unit 111 performs an authentication process (S101).

認証処理を経たFAQ作成要求に基づいて、FAQ候補制御部113は、FAQ候補制御部113は、DBサーバ200に記憶されている階層化問合せ代表文233及び回答代表文232を取得し(S102)、複数の問合せ代表文別及び/又はサブ問合せ代表文別に全ての回答代表文との間のFAQ候補評価情報を生成するFAQ候補評価情報の生成処理(S103)、及び問合せ代表文と回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示された問合せ−回答マトリクス図の生成処理を遂行する(S104)。   Based on the FAQ creation request that has undergone the authentication process, the FAQ candidate control unit 113 acquires the hierarchical query representative sentence 233 and the answer representative sentence 232 stored in the DB server 200 (S102). , FAQ candidate evaluation information generation processing for generating FAQ candidate evaluation information between all answer representative sentences for each of a plurality of query representative sentences and / or sub-query representative sentences (S103), and a query representative sentence and a reply representative sentence A query-response matrix diagram is generated in which FAQ candidate evaluation information corresponding to the pair of the corresponding query representative sentence and the answer representative sentence is displayed at a position where the above and the other intersect each other vertically and horizontally (S104).

FAQ候補制御部113は、生成した問合せ−回答マトリクス図を管理者端末4に伝送する(S105)。   The FAQ candidate control unit 113 transmits the generated inquiry-answer matrix diagram to the administrator terminal 4 (S105).

管理者端末4に表示された問合せ−回答マトリクス図上の問合せ代表文と回答代表文との関係を表した文書ID数が表示される格子状の各表示ブロックが選択されると(S106、S202)、FAQ作成制御部114は、ケース情報220を参照して選択された表示ブロックに対応する問合せ代表文に関連付く各抽出元の問合せ文それぞれを文書IDに基づいて抽出し(S107)、選択された表示ブロックに対応する問合せ代表文及び回答代表文と、抽出された各抽出元の問合せ文とを含むFAQ作成画面を生成する(S108)。   When each grid-like display block displaying the number of document IDs representing the relationship between the inquiry representative sentence and the answer representative sentence on the inquiry-answer matrix diagram displayed on the administrator terminal 4 is selected (S106, S202). ), The FAQ creation control unit 114 extracts each source query sentence associated with the query representative sentence corresponding to the display block selected with reference to the case information 220 based on the document ID (S107). A FAQ creation screen including the inquiry representative sentence and the answer representative sentence corresponding to the displayed display block and the extracted inquiry sentence of each extraction source is generated (S108).

FAQ作成制御部114は、生成されたFAQ作成画面を管理者端末4に伝送するとともに、FAQ作成者による操作入力に基づくFAQ作成画面の表示制御を遂行する(S109)。FAQ作成制御部114は、登録ボタンが選択された場合(S203)、問合せ内容入力欄及び対応内容入力欄のそれぞれに表示(入力)されている問合せ文とその回答文を対としたFAQをFAQ情報240に登録する(S110)。   The FAQ creation control unit 114 transmits the generated FAQ creation screen to the administrator terminal 4 and performs display control of the FAQ creation screen based on an operation input by the FAQ creator (S109). When the registration button is selected (S203), the FAQ creation control unit 114 sets a FAQ that is a pair of the query text and the answer text displayed (input) in each of the query content input field and the corresponding content input field. The information 240 is registered (S110).

本実施形態のFAQ作成支援システムは、問合せ文とその回答文を含む文書の文書集合において、各文書の問合せ文で構成される第1文書群を適切に表す複数の問合せ代表文(問合せ要約文)と、各文書の回答文で構成される第2文書群を適切に表す複数の回答代表文(回答要約文)とをマッチングし、問合せ代表文と回答代表文と対の関係を抽出元の文書ID数で定量的に表すことにより、FAQ作成要否を容易かつ適切に判断できるFAQ作成環境を提供することが可能となる。   The FAQ creation support system according to the present embodiment includes a plurality of query representative sentences (query summary sentences) that appropriately represent the first document group composed of the query sentences of each document in a document set of documents including a query sentence and its answer sentence. ) And multiple answer representative sentences (answer summary sentences) that appropriately represent the second document group composed of the answer sentences of each document, and the relationship between the query representative sentence and the answer representative sentence is extracted By representing quantitatively by the number of document IDs, it is possible to provide an FAQ creation environment in which it is possible to easily and appropriately determine whether or not an FAQ is necessary.

例えば、FAQ作成者は、問合せとその回答に関し、同一内容の事象がどれだけ登録されているかを定量的に知ることができ、FAQ作成要否を容易かつ適切に判断することができる。   For example, the FAQ creator can quantitatively know how many events of the same content are registered regarding the inquiry and the answer, and can easily and appropriately determine whether or not the FAQ needs to be created.

このため、従来、人手で大量の対応履歴を読んで、一から(何もない状態から)FAQ候補を作成していた作業に比べ、作業時間及び作業負担を低減できる。特に、FAQ候補が文で表現されていることから、キーワードの場合に比べてFAQに仕上げる作業、例えば、キーワードからFAQ候補となる文書を起こす(作成する)手間を低減でき、FAQ作成の作業効率を向上させることができる。   For this reason, it is possible to reduce the work time and work load compared to the work of conventionally reading a large number of response histories manually and generating FAQ candidates from scratch (from an empty state). In particular, since the FAQ candidates are expressed in sentences, it is possible to reduce the work of finishing the FAQ compared to the case of keywords, for example, the trouble of generating (creating) a document that is a FAQ candidate from the keywords, and the efficiency of FAQ creation Can be improved.

さらに、本実施形態では、問合せ代表文及び回答代表文それぞれを縦横に配置し、問合せ代表文と回答代表文との対の定量的な関係を示す格子状の各表示ブロックを含む問合せ−回答マトリクス図をFAQ作成者に提供するので、FAQを作成するための問合せ文とその回答文を含む文書それぞれが適切に分類された定量的な評価を実現できるとともに、問合せ文とその回答文を含む文書の文書集合から作成するFAQ候補文の全体像(例えば、多い問合せとその回答の傾向など)を、容易に一目で把握することができる。   Further, in the present embodiment, the query-answer matrix including each of the query representative sentences and the answer representative sentences arranged vertically and horizontally, and each grid-like display block indicating the quantitative relationship between the query representative sentences and the answer representative sentences. Since the figure is provided to the FAQ creator, it is possible to realize a quantitative evaluation in which the query sentence for creating the FAQ and the document including the answer sentence are appropriately classified, and the document including the query sentence and the answer sentence. It is possible to easily grasp at a glance the overall image of FAQ candidate sentences created from a set of documents (for example, the tendency of many queries and their responses).

また、問合せ−回答マトリクス図上の表示ブロックの選択に基づくFAQ作成画面を提供するので、定量的な評価で集約(分類)された、問合せ文とその回答文を含む文書の文書集合に対するFAQ作成環境を提供することができる。つまり、FAQ作成画面自体が問合せ代表文及び回答代表文によって文書集合に集約したFAQ作成環境となるので、FAQの主題を外れることなく、適切なFAQを作成することができる。   In addition, since the FAQ creation screen is provided based on the selection of the display block on the query-answer matrix diagram, the FAQ creation is performed on the document set including the query sentence and the answer sentence aggregated (classified) by quantitative evaluation. An environment can be provided. That is, since the FAQ creation screen itself is a FAQ creation environment that is aggregated into a document set by an inquiry representative sentence and an answer representative sentence, an appropriate FAQ can be created without departing from the subject matter of the FAQ.

また、FAQ作成画面に問合せ代表文及び回答代表文とともに、問合せ代表文に関連付く各抽出元の問合せ文それぞれを抽出して表示するので、要約された文書(代表文)の細かいニュアンスや背景を把握しながら、FAQを作成することができる。   In addition, since each query source of each source associated with the query representative sentence is extracted and displayed along with the query representative sentence and answer representative sentence on the FAQ creation screen, the detailed nuances and background of the summarized document (representative sentence) can be displayed. FAQ can be created while grasping.

以上、上述の実施形態では、階層化問合せ代表文233及び回答代表文232に基づいて、FAQ候補評価情報の生成処理及び問合せ−回答マトリクス図の生成処理を遂行する一例を説明したが、問合せ代表文231及び回答代表文232に基づいて、FAQ候補評価情報の生成処理及び問合せ−回答マトリクス図の生成処理を遂行することできる。   As described above, in the above-described embodiment, an example in which the processing for generating the FAQ candidate evaluation information and the processing for generating the query-response matrix diagram based on the hierarchical query representative sentence 233 and the answer representative sentence 232 has been described. Based on the sentence 231 and the answer representative sentence 232, the FAQ candidate evaluation information generation process and the inquiry-answer matrix diagram generation process can be performed.

また、問合せ−回答マトリクス図は、図4に示した複数の問合せ代表文と複数の回答代表文で構成されたマトリクス図以外にも、例えば、1つの問合せ代表文と複数の回答代表文で構成されたマトリクス図やFAQ候補評価情報が所定値以上(文書ID数が所定数以上)の問合せ代表文と回答代表文との対のみを含むマトリクス図を生成することもできる。   In addition to the matrix diagram composed of a plurality of query representative sentences and a plurality of reply representative sentences shown in FIG. 4, the query-answer matrix diagram is composed of, for example, one query representative sentence and a plurality of reply representative sentences. It is also possible to generate a matrix diagram including only a pair of a query representative sentence and a reply representative sentence in which the matrix diagram and FAQ candidate evaluation information are equal to or greater than a predetermined value (the number of document IDs is equal to or greater than a predetermined number).

また、問合せ代表文に紐付く生成元の文書群を対象にさらに代表文生成処理を遂行してサブクラスの問合せ代表文を生成する2段階のサブクラス化の一例を示したが、これに限らず、サブクラスの問合せ代表文に対してさらに代表文生成処理を遂行して3段階、4段階・・・と、2階層以上の多段のサブクラス化された問合せ代表文を生成することもできる。   In addition, although an example of two-stage subclassing that generates a subclass query representative sentence by performing a representative sentence generation process on a generation source document group associated with the query representative sentence, the present invention is not limited to this. It is also possible to perform a representative sentence generation process on the query representative sentences of the subclass to generate multi-stage subclassified query representative sentences of three levels, four levels,...

また、上述の実施形態では、問合せ代表文のサブクラス化を一例に説明したが、例えば、回答代表文をサブクラス化することもできる。つまり、図15の示した回答代表文それぞれに紐付く生成元の文書群を対象にさらに代表文生成処理を遂行し、各回答代表文に対応したサブクラスのサブ回答代表文を生成することができ、問合せ代表文及び回答代表文の両方又はいずれか一方をサブクラス化したFAQ作成環境を提供することができる。なお、回答代表文の階層化したマトリクス図表示においても、問合せ代表文の階層化表示と同様に、図4の例の横軸に回答代表文を代表文ID別及びサブ代表文ID別に配列し、回答代表文IDに紐付く下位階層のサブ代表文を、隣り合う代表文IDの間に配列することができる。   In the above-described embodiment, the subclassification of the inquiry representative sentence has been described as an example. However, for example, the answer representative sentence can be subclassified. That is, it is possible to further generate a representative sentence generation process for the source document group associated with each of the answer representative sentences shown in FIG. 15, and generate a sub-answer representative sentence of a subclass corresponding to each answer representative sentence. In addition, it is possible to provide an FAQ creation environment in which either or both of the inquiry representative sentence and the answer representative sentence are subclassified. Note that, in the hierarchical matrix display of the answer representative sentences, the answer representative sentences are arranged by representative sentence ID and sub representative sentence ID on the horizontal axis in the example of FIG. The sub-representative sentences in the lower hierarchy associated with the answer representative sentence ID can be arranged between adjacent representative sentence IDs.

また、回答代表文のサブクラス化も、2階層以上の多段のサブクラス化を行うことができる。さらに、問合せ−回答マトリクス図において問合せ代表文の下位階層であるサブ問合せ代表文が選択された場合の処理と同様に、図5において上位階層の回答代表文とその下位階層のサブ回答代表文とを組み合わせたFAQ作成候補(対応内容)を生成し、新規FAQ作成ブロックCに自動的に表示させることもできる。   In addition, subclassification of answer representative sentences can be performed in multiple stages of two or more layers. Further, similar to the processing when the sub-query representative sentence that is the lower hierarchy of the query representative sentence is selected in the query-answer matrix diagram, the upper-level answer representative sentence and the sub-answer representative sentence of the lower hierarchy in FIG. Can be generated and automatically displayed on the new FAQ creation block C.

また、ケース情報220の文書ID毎の各付帯情報に含まれる製品分類や問合せ分類等を用い、特定の製品や問合せタイプに関する問合せ文とその回答文を含む文書の文書集合を対象にした、代表文生成処理、FAQ候補評価情報の生成処理、又は問合せ−回答マトリクス図の生成処理を遂行するように構成することができ、特定の製品や問合せタイプに応じた個別にFAQ作成環境を提供することもできる。   In addition, a representative for a document set of documents including a query sentence related to a specific product or a query type and an answer sentence using a product classification or a query classification included in each incidental information for each document ID of the case information 220. It can be configured to perform sentence generation processing, FAQ candidate evaluation information generation processing, or query-response matrix diagram generation processing, and provide an FAQ creation environment individually according to a specific product or inquiry type You can also.

また、FAQ作成支援サーバ100の代表文生成部112は、個別の代表文生成装置として構成することができ、FAQ作成支援サーバ100と代表文生成装置とが連動したFAQ作成支援システムとして構成することもできる。   The representative sentence generation unit 112 of the FAQ creation support server 100 can be configured as an individual representative sentence generation apparatus, and is configured as an FAQ creation support system in which the FAQ creation support server 100 and the representative sentence generation apparatus are linked. You can also.

また、上述の実施形態の各処理は、コンピュータで実行可能なプログラムとして実現することも可能であり、当該プログラムがインストールされたコンピュータは、実施形態に係るFAQ作成支援機能の各処理を遂行する情報処理装置として動作することが可能である。例えば、不図示の補助記憶装置に当該プログラムが格納され、CPU等の制御部が補助記憶装置に格納されたプログラムを主記憶装置に読み出し、主記憶装置に読み出された該プログラムを制御部が実行し、コンピュータに実施形態に係る各処理を動作させることができる。   Each process of the above-described embodiment can also be realized as a computer-executable program, and the computer in which the program is installed is information for performing each process of the FAQ creation support function according to the embodiment. It is possible to operate as a processing device. For example, the program is stored in an auxiliary storage device (not shown), and a control unit such as a CPU reads the program stored in the auxiliary storage device to the main storage device, and the control unit reads the program read to the main storage device. It is possible to execute and cause the computer to operate each process according to the embodiment.

また、上記プログラムは、コンピュータ読取可能な記録媒体に記録された状態で、コンピュータに適用することも可能であり、インターネット等のネットワークを通じてコンピュータにダウンロードすることも可能である。コンピュータ読取可能な記録媒体としては、CD−ROM等の光ディスク、DVD−ROM等の相変化型光ディスク、MO(Magnet Optical)やMD(Mini Disk)などの光磁気ディスク、フロッピー(登録商標)ディスクやリムーバブルハードディスクなどの磁気ディスク、コンパクトフラッシュ(登録商標)、スマートメディア、SDメモリカード、メモリスティック等のメモリカードが挙げられる。また、特別に設計されて構成された集積回路(ICチップ等)等のハードウェア装置も記録媒体として含まれる。   The program can be applied to a computer in a state where the program is recorded on a computer-readable recording medium, or can be downloaded to a computer through a network such as the Internet. Computer-readable recording media include optical disks such as CD-ROM, phase change optical disks such as DVD-ROM, magneto-optical disks such as MO (Magnet Optical) and MD (Mini Disk), floppy (registered trademark) disks, Examples include magnetic disks such as removable hard disks, memory cards such as compact flash (registered trademark), smart media, SD memory cards, and memory sticks. A hardware device such as an integrated circuit (IC chip or the like) specially designed and configured is also included as a recording medium.

なお、本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。   In addition, although some embodiment of this invention was described, these embodiment is shown as an example and is not intending limiting the range of invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the scope of the invention. These embodiments and modifications thereof are included in the scope and gist of the invention, and are included in the invention described in the claims and the equivalents thereof.

1 コンタクトセンターシステム
2 ACDシステム
3 FAQ作成支援システム
4 管理者端末
5 オペレータ端末
100 FAQ作成支援サーバ
110 CPU(制御部)
111 認証部
112 代表文生成部
1121 構文解析部
1122 代表文候補抽出部
1123 文生成集約部
1124 代表文決定部
113 FAQ候補制御部
114 FAQ作成制御部
120 通信制御部
130 メモリ
200 DBサーバ
210 画面情報
220 ケース情報
230 代表文情報
231 問合せ代表文情報
232 階層化問合せ代表文情報
233 回答代表文情報
240 FAQ情報
250 抽出ルール記憶部
1 Contact Center System 2 ACD System 3 FAQ Creation Support System 4 Administrator Terminal 5 Operator Terminal 100 FAQ Creation Support Server 110 CPU (Control Unit)
111 Authentication Unit 112 Representative Sentence Generation Unit 1121 Syntax Analysis Unit 1122 Representative Sentence Candidate Extraction Unit 1123 Sentence Generation Aggregation Unit 1124 Representative Sentence Determination Unit 113 FAQ Candidate Control Unit 114 FAQ Creation Control Unit 120 Communication Control Unit 130 Memory 200 DB Server 210 Screen Information 220 Case information 230 Representative sentence information 231 Inquiry representative sentence information 232 Hierarchical inquiry representative sentence information 233 Answer representative sentence information 240 FAQ information 250 Extraction rule storage unit

Claims (3)

問合せ文とその回答文を含む文書の文書集合において各文書それぞれの問合せ文から抽出される複数の問合せ代表文のうち同一の問合せ代表文に基づいて、一の問合せ代表文に複数の抽出元の問合せ文に対応する文書が関連付けられた前記問合せ代表文を記憶する第1記憶部と、
前記各文書それぞれの回答文から抽出される複数の回答代表文のうち同一の回答代表文に基づいて、一の回答代表文に複数の抽出元の回答文に対応する文書が関連付けられた前記回答代表文を記憶する第2記憶部と、
一の問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、前記カウントされた各文書数を用いて一の問合せ代表文と一の回答代表文とのペアに対するFAQ候補評価情報を生成するFAQ候補制御部と、
前記FAQ候補評価情報に対応する問合せ代表文と回答代表文との対に基づいて、問合せとその回答で構成されるFAQを生成するFAQ作成制御部と、を有し
前記FAQ候補制御部は、前記問合せ代表文及び前記回答代表文を縦横に配置したマトリクス図であって、前記問合せ代表文と前記回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示されたFAQ候補マトリクス図を生成し、
前記FAQ作成制御部は、所定のコンピュータに表示された前記FAQ候補マトリクス図上の前記FAQ候補評価情報が選択された場合に、前記問合せ文とその回答文を含む文書の文書集合の中から選択されたFAQ候補評価情報に対応する問合せ代表文に関連付く前記各抽出元の問合せ文それぞれを抽出し、前記選択されたFAQ候補評価情報に対応する問合せ代表文及び回答代表文と、前記抽出された各抽出元の問合せ文とを含むFAQ作成画面を生成し、前記コンピュータに伝送することを特徴とするFAQ作成支援システム。
Based on the same query representative sentence among a plurality of query representative sentences extracted from the query sentences of each document in the document set of the document including the query sentence and the answer sentence, a plurality of extraction sources are included in one query representative sentence. A first storage unit for storing the query representative sentence associated with the document corresponding to the query sentence;
The answer in which the documents corresponding to the answer sentences of a plurality of extraction sources are associated with one answer representative sentence based on the same answer representative sentence among the answer representative sentences extracted from the answer sentences of the respective documents. A second storage unit for storing a representative sentence;
Each document associated with one query representative sentence counts the number of documents that match each document associated with each answer representative sentence, and one query representative sentence and one answer are counted using the counted number of each document. FAQ candidate control unit for generating FAQ candidate evaluation information for a pair with a representative sentence;
A FAQ creation control unit that generates a FAQ composed of a query and its answer based on a pair of a query representative sentence and a reply representative sentence corresponding to the FAQ candidate evaluation information. It is a matrix diagram in which the query representative sentence and the answer representative sentence are arranged vertically and horizontally, and at the position where the query representative sentence and the answer representative sentence intersect vertically and horizontally, the corresponding query representative sentence and answer representative sentence are paired. Generate a FAQ candidate matrix diagram displaying corresponding FAQ candidate evaluation information,
When the FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a predetermined computer is selected, the FAQ creation control unit selects from the document set of documents including the inquiry sentence and the answer sentence. Each of the extraction source query sentences associated with the query representative sentence corresponding to the FAQ candidate evaluation information is extracted, and the query representative sentence and answer representative sentence corresponding to the selected FAQ candidate evaluation information are extracted. A FAQ creation support system that generates a FAQ creation screen including a query sentence of each extraction source and transmits it to the computer.
前記第1記憶部は、前記問合せ代表文に関連付く複数の各文書の問合せ文から抽出される前記問合せ代表文に対するサブ問合せ代表文のうち同一のサブ問合せ代表文に基づいて、一のサブ問合せ代表文に複数の抽出元の問合せ文に対応する文書が関連付けられた前記副問合せ代表文をさらに記憶し、
前記FAQ候補制御部は、一のサブ問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、前記カウントされた各文書数を用いて一のサブ問合せ代表文と一の回答代表文とのペアに対するFAQ候補評価情報を生成するとともに、
前記問合せ代表文及び前記問合せ代表文の前記サブ問合せ代表文と、前記回答代表文とを縦横に配置し、前記問合せ代表文と前記回答代表文とが縦横で交わる位置及び前記サブ問合せ代表文と前記回答代表文とが縦横で交わる位置それぞれに、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報及びサブ問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示された前記FAQ候補マトリクス図を生成することを特徴とする請求項に記載のFAQ作成支援システム。
The first storage unit includes one subquery based on the same subquery representative sentence among the subquery representative sentences for the query representative sentence extracted from the query sentences of a plurality of documents associated with the query representative sentence. Further storing the subquery representative sentence in which a document corresponding to the query sentence of a plurality of extraction sources is associated with the representative sentence;
The FAQ candidate control unit counts the number of documents in which each document associated with one subquery representative sentence matches each document associated with each answer representative sentence, and uses the counted number of documents to determine one. In addition to generating FAQ candidate evaluation information for a pair of a subquery representative sentence and one answer representative sentence,
The query representative sentence and the sub-query representative sentence of the query representative sentence and the answer representative sentence are arranged vertically and horizontally, the position where the query representative sentence and the answer representative sentence intersect vertically and horizontally, and the sub-query representative sentence FAQ candidate evaluation information corresponding to a pair of corresponding query representative sentence and answer representative sentence and FAQ candidate evaluation corresponding to a pair of sub-query representative sentence and answer representative sentence at each position where the answer representative sentence intersects vertically and horizontally The FAQ creation support system according to claim 1 , wherein the FAQ candidate matrix diagram displaying information is generated.
問合せ文とその回答文を含む文書の文書集合において各文書それぞれの問合せ文から抽出される複数の問合せ代表文のうち同一の問合せ代表文に基づいて、一の問合せ代表文に複数の抽出元の問合せ文に対応する文書が関連付けられた前記問合せ代表文を記憶する第1記憶部と、前記各文書それぞれの回答文から抽出される複数の回答代表文のうち同一の回答代表文に基づいて、一の回答代表文に複数の抽出元の回答文に対応する文書が関連付けられた前記回答代表文を記憶する第2記憶部とに接続可能なコンピュータに、
一の問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、前記カウントされた各文書数を用いて一の問合せ代表文と一の回答代表文とのペアに対するFAQ候補評価情報を生成する機能と、前記FAQ候補評価情報に対応する問合せ代表文と回答代表文との対に基づいて、問合せとその回答で構成されるFAQを生成する機能と、を実現させ、
前記FAQ候補評価情報を生成する機能は、前記問合せ代表文及び前記回答代表文を縦横に配置したマトリクス図であって、前記問合せ代表文と前記回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するFAQ候補評価情報が表示されたFAQ候補マトリクス図を生成し、
前記FAQを生成する機能は、所定のコンピュータに表示された前記FAQ候補マトリクス図上の前記FAQ候補評価情報が選択された場合に、前記問合せ文とその回答文を含む文書の文書集合の中から選択されたFAQ候補評価情報に対応する問合せ代表文に関連付く前記各抽出元の問合せ文それぞれを抽出し、前記選択されたFAQ候補評価情報に対応する問合せ代表文及び回答代表文と、前記抽出された各抽出元の問合せ文とを含むFAQ作成画面を生成し、前記コンピュータに伝送することを特徴とするFAQ作成支援プログラム。
Based on the same query representative sentence among a plurality of query representative sentences extracted from the query sentences of each document in the document set of the document including the query sentence and the answer sentence, a plurality of extraction sources are included in one query representative sentence. Based on the same answer representative sentence among a plurality of answer representative sentences extracted from the answer sentence of each of the documents, the first storage unit that stores the query representative sentence associated with the document corresponding to the query sentence, A computer connectable to a second storage unit that stores the answer representative sentences in which documents corresponding to a plurality of answer sentences of a plurality of extraction sources are associated with one answer representative sentence;
Each document associated with one query representative sentence counts the number of documents that match each document associated with each answer representative sentence, and one query representative sentence and one answer are counted using the counted number of each document. Based on a function for generating FAQ candidate evaluation information for a pair with a representative sentence and a pair of an inquiry representative sentence and an answer representative sentence corresponding to the FAQ candidate evaluation information, an FAQ including an inquiry and an answer is generated. And realize the function,
The function for generating the FAQ candidate evaluation information is a matrix diagram in which the inquiry representative sentence and the answer representative sentence are arranged vertically and horizontally, and at the position where the inquiry representative sentence and the answer representative sentence intersect vertically and horizontally, Generate a FAQ candidate matrix diagram displaying FAQ candidate evaluation information corresponding to a pair of inquiry representative sentence and answer representative sentence,
When the FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a predetermined computer is selected, the function for generating the FAQ is selected from a document set of documents including the inquiry sentence and the answer sentence. Each of the extraction source query sentences associated with the query representative sentence corresponding to the selected FAQ candidate evaluation information is extracted, and the query representative sentence and answer representative sentence corresponding to the selected FAQ candidate evaluation information, and the extraction A FAQ creation support program that generates a FAQ creation screen including each extracted query message and transmits it to the computer.
JP2011189366A 2011-08-31 2011-08-31 FAQ creation support system and program Active JP5485236B2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2011189366A JP5485236B2 (en) 2011-08-31 2011-08-31 FAQ creation support system and program
CN201210298999.3A CN103020035B (en) 2011-08-31 2012-08-21 FAQ makes accessory system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2011189366A JP5485236B2 (en) 2011-08-31 2011-08-31 FAQ creation support system and program

Publications (2)

Publication Number Publication Date
JP2013050896A JP2013050896A (en) 2013-03-14
JP5485236B2 true JP5485236B2 (en) 2014-05-07

Family

ID=47968654

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2011189366A Active JP5485236B2 (en) 2011-08-31 2011-08-31 FAQ creation support system and program

Country Status (2)

Country Link
JP (1) JP5485236B2 (en)
CN (1) CN103020035B (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109947905B (en) * 2017-08-15 2023-02-21 富士通株式会社 Method and equipment for generating question and answer pairs
CN109325040B (en) * 2018-07-13 2020-11-10 众安信息技术服务有限公司 FAQ question-answer library generalization method, device and equipment
JP7202224B2 (en) * 2018-09-13 2023-01-11 シャープ株式会社 Information processing device, user terminal device, control method and control program
JP6743108B2 (en) * 2018-10-31 2020-08-19 西日本電信電話株式会社 PATTERN RECOGNITION MODEL AND PATTERN LEARNING DEVICE, GENERATION METHOD THEREOF, FAQ EXTRACTION METHOD USING THE SAME, PATTERN RECOGNITION DEVICE, AND PROGRAM
JP7267714B2 (en) * 2018-11-06 2023-05-02 株式会社東芝 Knowledge information creation support device

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002230012A (en) * 2000-12-01 2002-08-16 Sumitomo Electric Ind Ltd Document clustering device
JP2003030224A (en) * 2001-07-17 2003-01-31 Fujitsu Ltd Device for preparing document cluster, system for retrieving document and system for preparing faq
JP4081065B2 (en) * 2004-10-22 2008-04-23 クオリカ株式会社 FAQ data creation apparatus, method, and program
CN1794233A (en) * 2005-12-28 2006-06-28 刘文印 Network user interactive asking answering method and its system
JP2008046852A (en) * 2006-08-15 2008-02-28 Oki Electric Ind Co Ltd Helpdesk system and analysis server
CN100416570C (en) * 2006-09-22 2008-09-03 浙江大学 FAQ based Chinese natural language ask and answer method
JP2008084151A (en) * 2006-09-28 2008-04-10 Just Syst Corp Information display device and information display method
CN101630312A (en) * 2009-08-19 2010-01-20 腾讯科技(深圳)有限公司 Clustering method for question sentences in question-and-answer platform and system thereof
JP5075953B2 (en) * 2009-10-30 2012-11-21 株式会社東芝 Representative sentence extraction device and program

Also Published As

Publication number Publication date
CN103020035B (en) 2016-05-11
JP2013050896A (en) 2013-03-14
CN103020035A (en) 2013-04-03

Similar Documents

Publication Publication Date Title
US11100124B2 (en) Systems and methods for similarity and context measures for trademark and service mark analysis and repository searches
CN105069560B (en) The record information of a kind of knowledge based storehouse and rule base extracts and signature identification analysis system and method
US8935197B2 (en) Systems and methods for facilitating open source intelligence gathering
US9672283B2 (en) Structured and social data aggregator
JP5879260B2 (en) Method and apparatus for analyzing content of microblog message
JP5536851B2 (en) Method and system for symbolic linking and intelligent classification of information
JP2000348041A (en) Document retrieval method, device therefor and mechanically readable recording medium
US9646246B2 (en) System and method for using a statistical classifier to score contact entities
US20080065630A1 (en) Method and Apparatus for Assessing Similarity Between Online Job Listings
US20040249796A1 (en) Query classification
US20100079464A1 (en) Information processing apparatus capable of easily generating graph for comparing of a plurality of commercial products
CN101911069A (en) Method and system for discovery and modification of data clusters and synonyms
KR20140045452A (en) Summarization of conversation threads
US20140195449A1 (en) System and method for automatic building of business contacts temporal social network using corporate emails and internet
KR20160124079A (en) Systems and methods for in-memory database search
JP5485236B2 (en) FAQ creation support system and program
JP5237353B2 (en) SEARCH DEVICE, SEARCH SYSTEM, SEARCH METHOD, SEARCH PROGRAM, AND COMPUTER-READABLE RECORDING MEDIUM CONTAINING SEARCH PROGRAM
JP4962980B2 (en) Search result classification apparatus and method using click log
US20200394194A1 (en) Multi-vertical entity-based search system
CN102591897A (en) Apparatus and method for searching document
EP3432161A1 (en) Information processing system and information processing method
Perot et al. LMDX: Language Model-based Document Information Extraction and Localization
Putra et al. Tourists perception in Bali using social media and online media sentiment analysis
KR101589626B1 (en) Method for establishing start-up data or management data from big data based on lexico semantic pattern analysis
He et al. Mining feature-opinion from reviews based on dependency parsing

Legal Events

Date Code Title Description
A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20130430

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20130628

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20130723

A521 Written amendment

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20131023

A911 Transfer to examiner for re-examination before appeal (zenchi)

Free format text: JAPANESE INTERMEDIATE CODE: A911

Effective date: 20131031

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20140124

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20140219

R150 Certificate of patent or registration of utility model

Ref document number: 5485236

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

S533 Written request for registration of change of name

Free format text: JAPANESE INTERMEDIATE CODE: R313533

R350 Written notification of registration of transfer

Free format text: JAPANESE INTERMEDIATE CODE: R350