JP5485236B2

JP5485236B2 - FAQ creation support system and program

Info

Publication number: JP5485236B2
Application number: JP2011189366A
Authority: JP
Inventors: 哲男小川; 早織新田; 頌之小松原
Original assignee: Toshiba Corp; Toshiba Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2011-08-31
Filing date: 2011-08-31
Publication date: 2014-05-07
Anticipated expiration: 2031-08-31
Also published as: CN103020035B; JP2013050896A; CN103020035A

Description

Embodiments of the present invention relate to FAQ creation support.

Conventionally, FAQs (Frequently Asked Questions), which compile inquiries and their answers, have been created and used both as a communication tool for users who make inquiries and as a business tool for information providers who answer them.

In general, an FAQ is created by analyzing a large number of inquiries and their answers. For example, the FAQ creator extracts frequently asked inquiries, or writes a representative inquiry based on the frequently asked ones, and then combines each extracted or written inquiry with an appropriate answer.

Japanese Unexamined Patent Application Publication No. 2009-15769 (JP 2009-15769 A)

An object is to provide an FAQ creation support system and program that realize an FAQ creation environment in which pairs of inquiries and answers that are FAQ candidates are quantitatively evaluated from a document group of documents each containing an inquiry sentence and its answer sentence.

The FAQ creation support system of the embodiment includes a first storage unit, a second storage unit, an FAQ candidate control unit, and an FAQ creation control unit. For a document set of documents each containing an inquiry sentence and its answer sentence, the first storage unit stores inquiry representative sentences in which, based on identical inquiry representative sentences among the plurality of inquiry representative sentences extracted from the inquiry sentences of the respective documents, the documents corresponding to the plurality of source inquiry sentences are associated with one inquiry representative sentence. The second storage unit stores answer representative sentences in which, based on identical answer representative sentences among the plurality of answer representative sentences extracted from the answer sentences of the respective documents, the documents corresponding to the plurality of source answer sentences are associated with one answer representative sentence. The FAQ candidate control unit counts, for the documents associated with one inquiry representative sentence, the number of documents that match the documents associated with each answer representative sentence, and uses each count to generate FAQ candidate evaluation information for the pair of that inquiry representative sentence and that answer representative sentence. The FAQ creation control unit generates an FAQ consisting of an inquiry and its answer, based on the pair of inquiry representative sentence and answer representative sentence corresponding to the FAQ candidate evaluation information. The FAQ candidate control unit generates an FAQ candidate matrix diagram in which inquiry representative sentences and answer representative sentences are arranged along the vertical and horizontal axes, and in which the FAQ candidate evaluation information for each pair is displayed at the position where the corresponding inquiry representative sentence and answer representative sentence intersect. When FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a given computer is selected, the FAQ creation control unit extracts, from the document set, each source inquiry sentence associated with the inquiry representative sentence corresponding to the selected FAQ candidate evaluation information, generates an FAQ creation screen containing the inquiry representative sentence and answer representative sentence corresponding to the selected FAQ candidate evaluation information together with the extracted source inquiry sentences, and transmits the screen to the computer.

FIG. 1 is a diagram showing an application example of the FAQ creation support system of the first embodiment.
FIG. 2 is a diagram illustrating the FAQ creation support function of the first embodiment.
FIG. 3 is a block diagram of the configuration of the FAQ creation support system of the first embodiment.
FIG. 4 is a diagram showing an example of the inquiry-response two-axis matrix diagram of the first embodiment.
FIG. 5 is a diagram showing an example of the FAQ creation screen of the first embodiment.
FIG. 6 is a diagram showing an example of the FAQ information of the first embodiment.
FIG. 7 is a diagram showing the FAQ creation support processing flow of the first embodiment.
FIG. 8 is a diagram showing (a) the block configuration of the representative sentence generation unit and (b) the representative sentence generation processing flow of the first embodiment.
FIG. 9 is a diagram illustrating the syntactic analysis (dependency structure tree) processing that generates representative sentences of inquiry content and answer content in the first embodiment.
FIG. 10 is a diagram showing examples of the extraction of partial dependency structure trees in the syntactic analysis processing of the first embodiment.
FIG. 11 is a diagram showing an example of representative sentences obtained through the syntactic analysis processing of the first embodiment.
FIG. 12 is a diagram showing an example of the representative sentence candidate information of the first embodiment.
FIG. 13 is a diagram showing an example of the aggregated representative sentence candidate information of the first embodiment.
FIG. 14 is a diagram showing an example of the inquiry representative sentence information of the first embodiment.
FIG. 15 is a diagram showing an example of the answer representative sentence information of the first embodiment.
FIG. 16 is a diagram showing an example of the inquiry representative sentence hierarchy information of the first embodiment.
FIG. 17 is a diagram showing an example of the case information input screen of the first embodiment.
FIG. 18 is a diagram showing an example of the case information of the first embodiment.

Embodiments are described below with reference to the drawings.

(First embodiment)
FIGS. 1 to 18 relate to the FAQ creation support system of the first embodiment. As one example, the FAQ creation support system according to this embodiment can be applied to a contact center system, either externally or internally, and provides an FAQ creation support function based on the content of customer inquiries and the content of operator answers (responses).

FIG. 1 shows an example in which the FAQ creation support system 3 is applied to a contact center system 1. In the contact center system 1, an ACD system 2 distributes incoming calls from customers C over a public network N to the operators, and an operator responds to each customer inquiry.

The operator enters the content of the customer's inquiry (inquiry sentence) and the content of the answer to it (answer sentence) at an operator terminal 5. Each customer inquiry and its answer are registered in a predetermined database as case information and accumulated as a response history.

FIG. 17 shows an example of the case information input screen displayed on the display device of the operator terminal 5. Through this screen, the operator enters various kinds of response history information, including the content of the customer's inquiry and the content of the response.

Case information entered from the operator terminal 5 through the case information input screen is stored in a storage unit such as a predetermined database. FIG. 18 shows an example of case information, which includes a case number (document ID) uniquely assigned to each inquiry, an inquiry type, inquiry content, response content, supplementary information, and so on. One item of case information is generated and registered for each inquiry, whether the inquiries come from the same customer or from different customers. The supplementary information is information attached to each item of case information, such as a product category or inquiry category used to classify the inquiry.

The FAQ creation support system 3 of this embodiment analyzes the inquiries and the answers in the case information accumulated as the operators' response history, generates (extracts) representative sentences for the inquiry content and for the answer content, and provides an FAQ creation environment in which pairs of inquiries and answers that are FAQ candidates, drawn from the generated representative sentences, are quantitatively evaluated.

The registration of case information via the operator terminal 5 can be performed by a control unit (not shown) of the contact center system 1, or the FAQ creation support system 3 can be configured to perform it.

FIG. 2 illustrates the FAQ creation support function of this embodiment. From the case information, which contains the customer inquiry content and the answer content entered by the operator at the operator terminal 5, representative sentence generation processing based on syntactic analysis of the inquiry sentences is performed for each case number (document ID) to generate inquiry representative sentences. Likewise, representative sentence generation processing based on syntactic analysis of the response content is performed for each document ID to generate answer representative sentences.

An inquiry representative sentence in this embodiment is a summary sentence generated from a plurality of inquiry sentences in the document set of documents each containing an inquiry sentence and its answer sentence. Source inquiry sentences that yield the same summary sentence are aggregated, so that one summary sentence is associated with a plurality of inquiry sentences (and with the document IDs of their sources). This representative summary sentence serves as a candidate sentence for an FAQ. The same applies to answer representative sentences.
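The aggregation described above can be sketched in Python as a grouping of summary sentences by identical text. This is only an illustration under stated assumptions: the function name `aggregate_representatives`, the English summary strings, and the document IDs are hypothetical sample values, not data from the embodiment.

```python
from collections import defaultdict


def aggregate_representatives(candidates):
    """Group (doc_id, summary) pairs so that each distinct summary sentence
    is associated with the set of source document IDs that produced it."""
    by_summary = defaultdict(set)
    for doc_id, summary in candidates:
        by_summary[summary].add(doc_id)
    return dict(by_summary)


# Hypothetical summaries extracted from the inquiry sentences of four documents.
inquiry_candidates = [
    (1, "black ink stops coming out"),
    (1, "cannot print"),
    (2, "cannot print"),
    (3, "black ink stops coming out"),
    (4, "cannot print"),
]

inquiry_reps = aggregate_representatives(inquiry_candidates)
# "cannot print" is now linked to documents 1, 2 and 4.
```

The same function would apply unchanged to summaries extracted from answer sentences.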

In this embodiment, syntactic analysis can further be performed on the group of source documents linked to a generated representative sentence to generate sub-representative sentences linked to that representative sentence, yielding a hierarchical representative sentence made up of the representative sentence and its sub-representative sentences. In other words, by performing the representative sentence generation processing on the plurality of inquiry contents associated with one representative sentence, sub-representative sentences are generated as a lower level of that representative sentence.

Then, in the document set of documents each containing an inquiry sentence and its answer sentence, the plurality of inquiry representative sentences (inquiry summary sentences) that appropriately represent the first document group, made up of the inquiry sentences of the documents, are matched against the plurality of answer representative sentences (answer summary sentences) that appropriately represent the second document group, made up of the answer sentences of the documents. An inquiry-answer matrix diagram is generated in which the relationship between each inquiry representative sentence and each answer representative sentence is expressed as the number of source document IDs (number of documents).

Specifically, a two-axis display screen is generated with the inquiry representative sentences on the vertical axis and the answer representative sentences on the horizontal axis, showing in matrix form the number of matches (number of document IDs, that is, number of documents) between the document IDs linked to each inquiry representative sentence and the document IDs linked to each answer representative sentence. This inquiry-answer two-axis matrix diagram, in which each pair of inquiry representative sentence and answer representative sentence is evaluated by the number of source document IDs, is provided to the administrator terminal 4 of an administrator or other person who creates FAQs.
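The counting behind the two-axis matrix can be sketched as a set intersection per pair of sentences. The sketch assumes that each representative sentence is already linked to a set of source document IDs; the function name `build_faq_matrix` and all sample sentences and IDs are hypothetical.

```python
def build_faq_matrix(inquiry_reps, answer_reps):
    """For every (inquiry, answer) pair, count the document IDs that are
    linked to both the inquiry representative sentence and the answer
    representative sentence."""
    return {
        (q, a): len(q_docs & a_docs)
        for q, q_docs in inquiry_reps.items()
        for a, a_docs in answer_reps.items()
    }


# Hypothetical representative sentences with their source document IDs.
inquiry_reps = {
    "cannot print": {1, 2, 4},
    "black ink stops coming out": {1, 3},
}
answer_reps = {
    "replace the ink cartridge": {1, 3, 4},
    "reinstall the driver": {2},
}

matrix = build_faq_matrix(inquiry_reps, answer_reps)

# Inquiries as rows (vertical axis), answers as columns (horizontal axis).
for q in inquiry_reps:
    print(q, [matrix[(q, a)] for a in answer_reps])
```

A larger count in a cell suggests that the pair occurs together in more cases and is therefore a stronger FAQ candidate.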

When a display block showing the number of document IDs at the intersection of a vertical-axis element and a horizontal-axis element of the inquiry-answer two-axis matrix diagram is selected, an FAQ creation screen containing the intersecting inquiry representative sentence and answer representative sentence is provided to the administrator terminal 4. This realizes an FAQ creation environment that presents the group of source documents (the inquiry sentence of each document) associated with the pair of inquiry representative sentence and answer representative sentence corresponding to the selected display block.

An FAQ created through the FAQ creation screen is registered as FAQ information. This FAQ information can be used as feedback for the operators' inquiry handling work. For example, selecting the "FAQ Search" button on the case information input screen of FIG. 17 displays the registered FAQ information on the operator terminal, which reduces the burden of entering the inquiry content, speeds up the answer to the inquiry, and also reduces the burden of entering the answer content.

FIG. 3 is a block diagram of the configuration of the FAQ creation support system 3. The FAQ creation support system 3 includes an FAQ creation support server 100 and a DB server 200. The FAQ creation support server 100 functions as a control unit that controls the FAQ creation support function as a whole, and the DB server 200 functions as a storage unit that stores the various kinds of information used for FAQ creation. The FAQ creation support system of this embodiment can be configured with one or more computers.

The FAQ creation support server 100 includes a communication control unit 120 that controls communication with the administrator terminal 4 (the FAQ creator terminal operated by the FAQ creator), a memory 130, and a CPU (control unit) 110 that controls the FAQ creation support server as a whole.

The control unit 110 includes an authentication unit 111, a representative sentence generation unit 112, an FAQ candidate control unit 113, and an FAQ creation control unit 114.

The DB server 200 stores screen information 210, case information 220, representative sentence information 230, and FAQ information 240. The representative sentence information 230 includes inquiry representative sentences 231, hierarchical inquiry representative sentences 232, and answer representative sentences 233, which are generated through the representative sentence generation processing performed by the representative sentence generation unit 112.

<Representative sentence generation processing>
The representative sentence generation processing is described with reference to FIGS. 8 to 16. The representative sentence generation processing of this embodiment takes as input the document set of documents (case information) each containing an inquiry sentence and its answer sentence, and generates representative sentences (summaries) of the inquiries and representative sentences of the answers.

As shown in FIG. 8, the representative sentence generation unit 112 includes a syntax analysis unit 1121, a representative sentence candidate extraction unit 1122, a sentence generation and aggregation unit 1123, and a representative sentence determination unit 1124, and also includes an extraction rule storage unit 250. The extraction rule storage unit 250 may be included in the DB server 200, or may be included in the FAQ creation support system 3 as a separate storage unit.

The representative sentence generation unit 112 refers to the case information 220 shown in FIG. 18 and treats the inquiry sentence (in text form) stored in the inquiry field of each item of case information, that is, the customer inquiry as entered by the operator, as one document. It generates a first document set of inquiries by collecting the inquiry sentences of all cases to be analyzed. Similarly, it generates a second document set of answers by collecting the answers of all cases to be analyzed. Each inquiry sentence and answer sentence in these document sets is associated with its original document ID.
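As a rough illustration of how the two document sets could be derived from case information while keeping the document ID attached, consider the following Python sketch. The class `CaseInfo`, its field names, the function `build_document_sets`, and the sample records are assumptions made for this example; the actual layout of the case information is the one shown in FIG. 18.

```python
from dataclasses import dataclass


@dataclass
class CaseInfo:
    doc_id: int        # case number (document ID), unique per inquiry
    inquiry_type: str  # inquiry type (hypothetical values below)
    inquiry: str       # inquiry content entered by the operator
    response: str      # response (answer) content entered by the operator


def build_document_sets(cases):
    """Return (inquiry_set, answer_set), each mapping document ID -> text,
    so that every sentence stays linked to its original document ID."""
    inquiry_set = {c.doc_id: c.inquiry for c in cases}
    answer_set = {c.doc_id: c.response for c in cases}
    return inquiry_set, answer_set


# Hypothetical case records.
cases = [
    CaseInfo(1, "phone", "Black ink suddenly stopped coming out.",
             "Asked the customer to replace the ink cartridge."),
    CaseInfo(2, "email", "The printer does not power on.",
             "Asked the customer to check the power cable."),
]

inquiry_set, answer_set = build_document_sets(cases)
```

Keying both sets by the same document ID is what later allows an inquiry representative sentence and an answer representative sentence to be matched through their shared source documents.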

In this embodiment, syntactic analysis and extraction rules are used to generate inquiry representative sentences from the inquiry document set and answer representative sentences from the answer document set.

The syntactic analysis of this embodiment analyzes the plurality of independent words that make up a sentence and the dependency relations among them. In the analysis result, the independent words and the dependency relations among them are expressed with nodes and arcs. The syntactic analysis result of this embodiment is generated as a dependency structure tree.

A node represents an independent word in the dependency structure tree. Each node is given the headword of the independent word, the part of speech of the headword, and the attached words of the headword. The headword given to a node is the character string of the independent word. The part of speech given to a node is the part of speech of the headword (that is, of the independent word the node represents). Parts of speech given to nodes include, for example, nouns, sa-hen nouns (verbal nouns), verbs, adjectives, adverbs, and adnominals.

The attached words given to a node are the function words that accompany the headword. They include, for example, particles such as "ga" (が), "wo" (を), "no" (の), and "ni" (に).

An arc represents a syntactic dependency relation between nodes in the dependency structure tree. Each arc is given the type of dependency relation between the nodes (between the independent words). Types of dependency relations given to arcs include, for example, the ga-case, the wo-case, adnominal modification, and adjacency. In a dependency structure tree, an arc is drawn, for example, as an arrow, which points from the node at the source of the dependency to the node at the destination of the dependency.

In the following description, for a dependency relation between two nodes expressed by one arc, the node at the destination of the dependency (that is, the node at the end point of the arc) is called the parent node, and the node at the source of the dependency (that is, the node at the start point of the arc) is called the child node.

FIG. 9(a) shows an example of a dependency structure tree expressed with two nodes and an arc representing the dependency relation between them. In the dependency structure tree of FIG. 9(a), a node 201 and a node 202 are connected by an arc 203; the node 201 is the parent node and the node 202 is the child node. By combining dependency structure trees like the one in FIG. 9(a), the syntactic analysis result (dependency structure tree) of a sentence containing a plurality of independent words is expressed.

FIG. 9(b) shows an example of the syntactic analysis result for the first of the inquiry sentences making up the document with document ID "1" in FIG. 18: 「急に黒のインクが出なくなり印刷ができません」 ("Black ink suddenly stopped coming out, and I cannot print").

A root node is a node that has no parent node; in the example of FIG. 9(b), it is the node whose headword is 「できません」 ("cannot"). A child node of the root node is defined as a first-generation child node; in the example of FIG. 9(b), the first-generation child nodes are the node whose headword is 「出なくなり」 ("stops coming out") and the node whose headword is 「印刷」 ("printing"). A node that has no child node (that is, a node not connected to any child node by an arc) is defined as a leaf node; in the example of FIG. 9(b), the nodes whose headwords are 「急に」 ("suddenly"), 「黒」 ("black"), and 「印刷」 ("printing") are leaf nodes.
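The node, arc, root, and leaf terminology can be made concrete with a small Python sketch of the example sentence. The `Node` class and its field names are hypothetical. The headwords of the root, first-generation child, and leaf nodes follow the description of FIG. 9(b); the intermediate node 「インク」 and the arc type labels are assumptions inferred from the sentence, since the figure itself is not reproduced here.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    headword: str                # character string of the independent word
    pos: str                     # part of speech of the headword
    particle: str = ""           # attached word, e.g. a particle
    arc: Optional[str] = None    # type of the arc to the parent; None for the root
    children: List["Node"] = field(default_factory=list)

    def add(self, child: "Node") -> "Node":
        """Attach a child (dependent) node and return it."""
        self.children.append(child)
        return child


def leaves(node):
    """Headwords of all nodes below (or at) `node` that have no child node."""
    if not node.children:
        return [node.headword]
    return [h for c in node.children for h in leaves(c)]


# Assumed tree for 「急に黒のインクが出なくなり印刷ができません」.
root = Node("できません", "verb")                              # no parent: root node
stop = root.add(Node("出なくなり", "verb", arc="adjacent"))    # arc type assumed
root.add(Node("印刷", "sa-hen noun", "が", arc="ga"))
stop.add(Node("急に", "adverb", arc="adverbial"))              # arc type assumed
ink = stop.add(Node("インク", "noun", "が", arc="ga"))         # node assumed
ink.add(Node("黒", "noun", "の", arc="adnominal"))

first_generation = [c.headword for c in root.children]
leaf_headwords = leaves(root)
```

With this tree, `first_generation` holds 「出なくなり」 and 「印刷」, and `leaf_headwords` holds 「急に」, 「黒」, and 「印刷」, in line with the description above.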

In this embodiment, each inquiry sentence in each document of the document set is parsed individually, and candidates for representative sentences (representative sentence candidates) are extracted from the resulting dependency structure tree. The rules for extracting representative sentence candidates are stored in advance in the extraction rule storage unit 250 as representative sentence candidate extraction rules. By applying these rules to the dependency structure tree obtained as the analysis result, one or more representative sentence candidates are extracted from an inquiry sentence.

The extraction of representative sentence candidates using the representative sentence candidate extraction rules is described next. In this embodiment, each of a plurality of extraction rules is applied to one dependency structure tree, so that partial dependency structure trees are extracted from that tree for each extraction rule. Each extraction rule is then applied to the extracted partial dependency structure trees to extract further partial dependency structure trees. In the extraction of representative sentence candidates, each partial dependency structure tree extracted by each extraction rule thus becomes a representative sentence candidate.

FIG. 10 illustrates the representative sentence candidate extraction rules. FIG. 10(a) illustrates the rule for extracting partial dependency structure trees that each contain one verb (extraction rule 1), and FIG. 10(b) illustrates the rule for extracting branch-free partial dependency structure trees (extraction rule 2).

Extraction rule 1 extracts partial dependency structure trees that each contain one verb. As shown in FIG. 10(a), it focuses on the verbs among the independent words represented by the dependency structure tree, and divides the tree on the basis of those verbs.

Specifically, in the dependency structure tree to which extraction rule 1 is applied, the tree is divided by cutting the arc between each node whose headword has the part of speech "verb" and the parent node of that verb node. Under extraction rule 1, each of the resulting pieces is extracted as a partial dependency structure tree.
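A minimal Python sketch of extraction rule 1 follows: it cuts the arc between every verb node and its parent and returns the resulting pieces. The `Node` class, the function names, and the arc labels are hypothetical, and the tree is the assumed reading of the example sentence 「急に黒のインクが出なくなり印刷ができません」 rather than the exact content of the figures.

```python
import copy
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Node:
    headword: str
    pos: str                     # part of speech of the headword
    arc: Optional[str] = None    # type of the arc to the parent; None for a root
    children: List["Node"] = field(default_factory=list)


def split_at_verbs(tree):
    """Extraction rule 1: cut the arc between each verb node and its parent.
    Works on a copy and returns the roots of the resulting partial trees."""
    root = copy.deepcopy(tree)
    parts = [root]

    def walk(node):
        kept = []
        for child in node.children:
            walk(child)
            if child.pos == "verb":
                child.arc = None       # arc to the parent is cut
                parts.append(child)    # the verb becomes the root of a new piece
            else:
                kept.append(child)
        node.children = kept

    walk(root)
    return parts


def headwords(node):
    """Headwords of a tree in pre-order (parent before its children)."""
    return [node.headword] + [h for c in node.children for h in headwords(c)]


# Assumed dependency structure tree of the example sentence.
tree = Node("できません", "verb", children=[
    Node("出なくなり", "verb", arc="adjacent", children=[
        Node("急に", "adverb", arc="adverbial"),
        Node("インク", "noun", arc="ga", children=[
            Node("黒", "noun", arc="adnominal"),
        ]),
    ]),
    Node("印刷", "noun", arc="ga"),
])

parts = split_at_verbs(tree)
```

Here `parts` contains two pieces, one rooted at 「できません」 with 「印刷」 and one rooted at 「出なくなり」 with 「急に」, 「インク」, and 「黒」, so each piece contains exactly one verb.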

Extraction rule 2 extracts branch-free partial dependency structure trees. As shown in FIG. 10(b), it focuses on the types of all arcs between the root node and the first-generation child nodes (that is, the child nodes of the root node) of a dependency structure tree (or partial dependency structure tree). Extraction rule 2 is applied when these arcs include an arc whose type is the ga-case, wo-case, ni-case, kara-case, locative case, or instrumental case. The arc types to which extraction rule 2 applies are set in advance.

First, among all arcs between the root node and the first-generation child nodes of the partial dependency structure tree, the arcs whose type is the ga-case, wo-case, ni-case, kara-case, locative case, or instrumental case are searched for.

Next, among all arcs between the root node and the first-generation child nodes, the arcs whose type is none of the ga-case, wo-case, ni-case, kara-case, locative case, or instrumental case (that is, the arcs other than those found by the search) are cut.

After that, from the partial dependency structure tree with those arcs cut, partial dependency structure trees containing all nodes and arcs between the root node and each leaf node are extracted. In the example of FIG. 10(b), three partial dependency structure trees are extracted.

Because extraction rule 2 extracts branch-free partial dependency structure trees from a partial dependency structure tree, every partial dependency structure tree extracted by applying extraction rule 2 is a branch-free dependency structure tree.
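The three steps of extraction rule 2 (search for case arcs at the root, cut the other root arcs, extract one chain per leaf) can be sketched in Python as follows. This reflects one reading of the rule, in which each extracted tree is the chain from the root to a single leaf. For brevity each chain is returned as a list of headwords rather than as a tree. The `Node` class, the function name, the arc labels, and the two sample trees are assumptions.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Arc types to which extraction rule 2 applies (labels are hypothetical).
CASE_ARCS = {"ga", "wo", "ni", "kara", "place", "tool"}


@dataclass
class Node:
    headword: str
    arc: Optional[str] = None    # type of the arc to the parent; None for the root
    children: List["Node"] = field(default_factory=list)


def branch_free_paths(root):
    """Extraction rule 2: keep only the root arcs whose type is in CASE_ARCS,
    then return every root-to-leaf chain as a list of headwords."""
    kept = [c for c in root.children if c.arc in CASE_ARCS]
    if not kept:
        return []  # no qualifying arc at the root: the rule is not applied
    paths = []

    def walk(node, path):
        path = path + [node.headword]
        if not node.children:
            paths.append(path)       # reached a leaf: one branch-free chain
        for child in node.children:
            walk(child, path)

    for child in kept:
        walk(child, [root.headword])
    return paths


# Assumed partial trees obtained from the example sentence by extraction rule 1.
tree_a = Node("出なくなり", children=[
    Node("急に", arc="adverbial"),                       # not a case arc: cut
    Node("インク", arc="ga", children=[Node("黒", arc="adnominal")]),
])
tree_b = Node("できません", children=[Node("印刷", arc="ga")])

paths_a = branch_free_paths(tree_a)
paths_b = branch_free_paths(tree_b)
```

For `tree_a` the adverbial arc to 「急に」 is cut and the single chain 「出なくなり」, 「インク」, 「黒」 remains, which corresponds to the candidate 「黒のインクが出なくなる」.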

FIG. 11 shows an example of the partial dependency structure trees extracted by applying extraction rule 1 and/or extraction rule 2 to the dependency structure tree of the sentence 「急に黒のインクが出なくなり印刷ができません」 shown in FIG. 9(b). In this way, one or more partial dependency structure trees, that is, representative sentence candidates in dependency structure tree form, are extracted from one inquiry sentence using syntactic analysis and the extraction rules. Each representative sentence candidate is associated with a representative sentence candidate ID, the document ID of its source, and its source field (inquiry, answer, and so on).

図１１に示すように、図９（ｂ）の構文解析結果（依存構造木）に、抽出ルール１を適用すると、２つの動詞「出なくなり」と「できません」で依存構造木が分割され、２つの部分依存構造木Ａ、Ｂが抽出される。また、図９（ｂ）の構文解析結果（依存構造木）に、抽出ルール１及び抽出ルール２を適用する（抽出ルール１を適用して得られた部分依存構造木に対して抽出ルール２を適用する）と、部分依存構造木Ａから部分依存構造木Ｃが抽出され、部分依存構造木Ｃから部分依存構造木Ｄが抽出される。 As shown in FIG. 11, applying extraction rule 1 to the syntax analysis result (dependency structure tree) of FIG. 9(b) splits the dependency structure tree at the two verbs “stops coming out” and “cannot”, and two partial dependency structure trees A and B are extracted. Applying extraction rule 1 and extraction rule 2 to the syntax analysis result of FIG. 9(b) (that is, applying extraction rule 2 to the partial dependency structure trees obtained with extraction rule 1) extracts partial dependency structure tree C from partial dependency structure tree A, and partial dependency structure tree D from partial dependency structure tree C.

そして、これら複数の部分依存構造木Ａ、Ｂ、Ｃ、Ｄそれぞれを、１つの問合せ文の代表文候補として抽出し、代表文候補から代表文候補文を生成する。つまり、ノード間の構文的な係り受け関係（アーク）に基づいて依存構造木形式から文形式に変換した各代表文候補文を生成する。 Then, each of the plurality of partial dependency structure trees A, B, C, and D is extracted as a representative sentence candidate of one query sentence, and a representative sentence candidate sentence is generated from the representative sentence candidate. That is, each representative sentence candidate sentence converted from the dependency structure tree form into the sentence form is generated based on the syntactic dependency relation (arc) between the nodes.

例えば、図１１に示すように、動詞ノードの見出し語を終止形とすることによって各部分依存構造木から代表文候補文を生成することができ、部分依存構造木Ａから代表文候補文「急に黒のインクが出なくなる」、部分依存構造木Ｂから代表文候補文「印刷ができません」、部分依存構造木Ｃから代表文候補文「黒のインクが出なくなる」、部分依存構造木Ｄから代表文候補文「印刷ができません」が、それぞれ生成される。 For example, as shown in FIG. 11, a representative sentence candidate sentence can be generated from each partial dependency structure tree by putting the headword of the verb node into its terminal form: “black ink suddenly stops coming out” from partial dependency structure tree A, “cannot print” from partial dependency structure tree B, “black ink stops coming out” from partial dependency structure tree C, and “cannot print” from partial dependency structure tree D.

図１２は、生成された代表文候補文の一例を示す図であり、生成された代表文候補文に、代表文候補文ＩＤ、抽出元の文書ＩＤ、及び抽出元フィールドを関連付けて、代表文候補文ＩＤ別に記憶する。 FIG. 12 is a diagram illustrating an example of a generated representative sentence candidate sentence. A representative sentence candidate sentence is associated with a representative sentence candidate sentence ID, an extraction source document ID, and an extraction source field. Stored by candidate sentence ID.

例えば、代表文候補文ＩＤ１の代表文候補文は「急に黒のインクが出なくなる」であり、文書ＩＤ「１」が関連付けられ、文書ＩＤ「１」が関連付けられる代表文候補文ＩＤ２の代表文候補文は「印刷ができません」となる。また、文書ＩＤ「１」が関連付けられる代表文候補文ＩＤ４の代表文候補文は「印刷ができません」となる。このように１つの文書ＩＤに複数の代表文候補文が関連付けられ、１つの文書（問合せ文）から複数生成される代表文候補（代表文候補文）が、抽出元の文書ＩＤで管理される。 For example, the representative sentence candidate sentence with representative sentence candidate sentence ID 1 is “black ink suddenly stops coming out” and is associated with document ID “1”; the representative sentence candidate sentence with ID 2, also associated with document ID “1”, is “cannot print”. The representative sentence candidate sentence with ID 4, likewise associated with document ID “1”, is also “cannot print”. In this way, multiple representative sentence candidate sentences are associated with one document ID, and the representative sentence candidates (representative sentence candidate sentences) generated from one document (query sentence) are managed by the document ID of their extraction source.

次に、代表文候補文を構成する文字列が同じである問合せ文同士を集約する処理を行い、集約代表文候補文を生成する。集約代表文候補文に関連付けられる文書ＩＤの集合（文書ＩＤ数）は、集約元の代表文候補文に関連付けられている文書ＩＤ群の和集合となる。 Next, query sentences whose representative sentence candidate sentences consist of the same character string are aggregated to generate aggregated representative sentence candidate sentences. The set of document IDs (number of document IDs) associated with an aggregated representative sentence candidate sentence is the union of the document ID groups associated with the representative sentence candidate sentences that were aggregated into it.

図１３は、図１２に示した複数の代表文候補文を集約した集約代表文候補文の一例を示す図であり、集約代表文候補文ＩＤ「２」の集約代表文候補文「印刷ができません」に関連付けられる文書ＩＤの集合は「１,２,・・・」となる。 FIG. 13 shows an example of aggregated representative sentence candidate sentences obtained by aggregating the representative sentence candidate sentences shown in FIG. 12. The set of document IDs associated with the aggregated representative sentence candidate sentence “cannot print”, which has aggregated representative sentence candidate sentence ID “2”, is “1, 2, ...”.

すなわち、代表文候補文を集約する処理は、一の文書ＩＤに関連付けられる代表文候補文を構成する文字列を用いて他の文書ＩＤに関連付けられる代表文候補文を検索し、異なる文書ＩＤ同士を１つ代表文候補文に関連付けて集約する処理である。 In other words, the aggregation process takes the character string of a representative sentence candidate sentence associated with one document ID, uses it to search for representative sentence candidate sentences associated with other document IDs, and associates the different document IDs with a single representative sentence candidate sentence.
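A minimal sketch of this aggregation step, assuming each candidate is given as a `(text, document_id)` pair; the function name and data layout are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def aggregate_candidates(candidates):
    """Merge candidate sentences with identical text.

    `candidates` is an iterable of (text, document_id) pairs. The result maps
    each distinct text to the union of the document IDs it was extracted from.
    """
    doc_ids_by_text = defaultdict(set)
    for text, doc_id in candidates:
        doc_ids_by_text[text].add(doc_id)
    return dict(doc_ids_by_text)
```

Because the document IDs are held in a set, a sentence that appears twice in the same document (as with candidate sentence IDs 2 and 4 in the example above) contributes that document only once.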

そして、集約された集約代表文候補文を用い、抽出元の文書集合の代表的な内容を表す代表文を抽出することにより、代表文を生成する。 Then, the representative sentence is generated by extracting the representative sentence representing the representative contents of the extraction source document set using the aggregated representative sentence candidate sentence.

例えば、代表的な内容の度合い（代表度）の一例として説明すると、文書ＩＤが異なる数、すなわち、代表文候補文に関連付けられる抽出元の文書ＩＤ数（文書数）を用いることができ、集約代表文候補文に紐付いている文書ＩＤ数が多いものほど、代表度が高い文書集合の内容を表している文として、抽出（決定）することができる。 For example, as an example of a representative content level (representativeness), the number of different document IDs, that is, the number of source document IDs (number of documents) associated with the representative sentence candidate sentence can be used and aggregated. As the number of document IDs associated with the representative sentence candidate sentence is larger, it can be extracted (determined) as a sentence representing the contents of the document set having a higher representative degree.

そこで、図１３の問合せの集約代表文候補文の各々に紐付いている文書ＩＤの数を用いて、文書数（代表度）が多い順に所定数（例えば１０個）の集約代表文候補文を抽出し、抽出された所定数の集約代表文候補文を代表文として決定することができる。図１４は、集約代表文候補文の中から決定された問合せの代表文（要約）の一例を示す図であり、それぞれの代表文は、代表文ＩＤ別に、生成元の文書ＩＤ（代表文として決定された集約代表文候補文に紐付いている各文書ＩＤ）が関連付けられている。 Accordingly, using the number of document IDs linked to each aggregated representative sentence candidate sentence of the queries in FIG. 13, a predetermined number (for example, 10) of aggregated representative sentence candidate sentences can be extracted in descending order of document count (representativeness) and determined to be the representative sentences. FIG. 14 shows an example of query representative sentences (summaries) determined from the aggregated representative sentence candidate sentences. Each representative sentence is associated, by representative sentence ID, with its source document IDs (the document IDs linked to the aggregated representative sentence candidate sentence that was determined to be that representative sentence).
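The selection by document count can be illustrated with a short sketch. It assumes the aggregated candidates are held as a mapping from sentence text to a set of document IDs; the function name and the alphabetical tie-break are assumptions not stated in the text.

```python
def select_representatives(aggregated, limit=10):
    """Pick up to `limit` sentences with the most associated document IDs.

    `aggregated` maps sentence text to a set of document IDs. Ties are broken
    alphabetically, which is an arbitrary choice made for this sketch.
    """
    ranked = sorted(aggregated.items(),
                    key=lambda item: (-len(item[1]), item[0]))
    # Assign representative sentence IDs 1, 2, ... in ranking order.
    return [
        {"id": rank, "text": text, "doc_ids": sorted(doc_ids)}
        for rank, (text, doc_ids) in enumerate(ranked[:limit], start=1)
    ]
```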

このように本実施形態の代表文生成処理は、問合せ文とその回答文を含む文書の文書集合（複数の、問合せ文とその回答文を含む文書）から、当該文書集合の問合せ文及び回答文それぞれを適切に表す各代表文を生成する。各代表文は、生成元である文書集合の各文書が各々関連付けられ、異なる各文書が代表文で関連付けられる（異なる各文書が代表文によって集約（分類）される）。 As described above, the representative sentence generation processing according to the present embodiment performs the query sentence and the answer sentence of the document set from the document set (a plurality of documents including the query sentence and the answer sentence) including the query sentence and the answer sentence. Each representative sentence that appropriately represents each is generated. Each representative sentence is associated with each document of the document set as a generation source, and each different document is associated with the representative sentence (different documents are aggregated (classified) by the representative sentence).

図８（ｂ）は、本実施形態の代表文生成処理フローを示す図である。 FIG. 8B is a diagram showing a representative sentence generation processing flow of the present embodiment.

構文解析部１１２１は、ケース情報２２０の問合せ文及びその回答文を対として含む各文書を文書ＩＤ別に抽出し（Ｓ３０１）、複数の自立語を含む問合せ文によって構成される文書集合を対象に、各問合せ文の構文解析処理を遂行する（Ｓ３０２）。 The syntax analysis unit 1121 extracts each document including the query sentence of the case information 220 and the answer sentence as a pair by document ID (S301), and targets a document set including query sentences including a plurality of independent words. The parsing process of each query statement is performed (S302).

構文解析部１１２１は、対象の文書集合に含まれる各文書を構成する問合せ文、つまり、当該文書集合に含まれる各文書中の全ての問合せ文それぞれについて構文解析を行う。構文解析部１１２１の構文解析の結果は、依存構造木によって表現される。なお、１つの問合せ文が構文解析された結果は、１つの依存構造木であり、問合せ文毎に依存構造木を生成する。 The syntax analysis unit 1121 performs syntax analysis on each query sentence that constitutes each document included in the target document set, that is, each query sentence included in each document included in the document set. The result of the syntax analysis by the syntax analysis unit 1121 is expressed by a dependency structure tree. Note that the result of parsing one query statement is one dependency structure tree, and a dependency structure tree is generated for each query statement.

代表文候補抽出部１１２２は、構文解析部１１２２によって生成された依存構造木の一部である部分構造木である代表文候補を、当該依存構造木から抽出する（ステップＳ３０３）。代表文候補抽出部１１２２は、抽出ルール記憶部２５０に格納されている抽出ルールを用いて代表文候補を抽出する。なお、代表文抽出部１１２２によって抽出される代表文候補（部分依存構造木）は、少なくとも２つの自立語および当該自立語間の係り受け関係を表す構造木である。また、代表文候補抽出部１１２２は、構文解析部１１２１によって問合せ文毎に生成された依存構造木の各々から代表文候補を抽出する。 The representative sentence candidate extraction unit 1122 extracts, from a dependency structure tree generated by the syntax analysis unit 1121, representative sentence candidates, each of which is a partial structure tree forming part of that dependency structure tree (step S303). The representative sentence candidate extraction unit 1122 extracts the representative sentence candidates using the extraction rules stored in the extraction rule storage unit 250. Each representative sentence candidate (partial dependency structure tree) extracted by the representative sentence candidate extraction unit 1122 is a structure tree representing at least two independent words and the dependency relationship between them. The representative sentence candidate extraction unit 1122 extracts representative sentence candidates from each of the dependency structure trees generated for each query sentence by the syntax analysis unit 1121.

抽出ルール記憶部２５０に記憶されている抽出ルールは、依存構造木に適用され、当該依存構造木から代表文候補を抽出することができるルールである。本実施形態では異なる複数の抽出ルール１、２が抽出ルール記憶部２５０に記憶され、本実施形態の代表文候補抽出部１１２２は、抽出ルール１を適用した後に抽出ルール２を適用し、各部分構造木である代表文候補を対象依存構造木から抽出する。 The extraction rules stored in the extraction rule storage unit 250 are rules that can be applied to the dependency structure tree and extract representative sentence candidates from the dependency structure tree. In the present embodiment, a plurality of different extraction rules 1 and 2 are stored in the extraction rule storage unit 250, and the representative sentence candidate extraction unit 1122 of the present embodiment applies the extraction rule 2 after applying the extraction rule 1, and each part A representative sentence candidate that is a structure tree is extracted from the object-dependent structure tree.

文生成集約部１１２３は、代表文候補抽出部１１２２によって抽出された代表文候補（部分構造木）によって表される複数の自立語および当該自立語間の係り受け関係に基づいて、当該代表文候補から代表文候補文（平文）を生成する（Ｓ３０４）。 The sentence generation / aggregation unit 1123 determines the representative sentence candidate based on a plurality of independent words represented by the representative sentence candidate (partial structure tree) extracted by the representative sentence candidate extraction unit 1122 and the dependency relationship between the independent words. The representative sentence candidate sentence (plain text) is generated from (S304).

次に、文生成集約部１１２３は、生成された代表文候補文を集約することによって、集約代表文候補文を生成する（ステップＳ３０５）。文生成集約部１１２３は、生成された代表文候補文のうち、同一の代表文候補文を１つの集約代表文候補文に集約する。このとき、文生成集約部１１２３は、集約代表文候補文を識別するための集約代表文候補文ＩＤ別に、当該集約代表文候補文に集約された代表文候補文に関連付けられた各文書ＩＤを関連付ける。 Next, the sentence generation / aggregation unit 1123 generates an aggregated representative sentence candidate sentence by aggregating the generated representative sentence candidate sentences (step S305). The sentence generation / aggregation unit 1123 aggregates the same representative sentence candidate sentences into one aggregated representative sentence candidate sentence among the generated representative sentence candidate sentences. At this time, the sentence generation / aggregation unit 1123 sets each document ID associated with the representative sentence candidate sentence aggregated in the aggregated representative sentence candidate sentence for each aggregated representative sentence candidate sentence ID for identifying the aggregated representative sentence candidate sentence. Associate.

代表文決定部１１２４は、文生成集約部１１２３によって生成された集約代表文候補文の中から代表文を決定（選択）する（ステップＳ３０６）。このとき、代表文決定部１１２４は、文生成集約部１１２３によって生成された集約代表文候補文に付与された文書ＩＤの数（つまり、当該集約代表文候補文に集約された代表文候補文の数）に基づいて代表文を決定する。 The representative sentence determination unit 1124 determines (selects) a representative sentence from the aggregated representative sentence candidate sentences generated by the sentence generation / aggregation unit 1123 (step S306). At this time, the representative sentence determination unit 1124 determines the number of document IDs assigned to the aggregated representative sentence candidate sentences generated by the sentence generation / aggregation part 1123 (that is, the representative sentence candidate sentences aggregated in the aggregated representative sentence candidate sentences). The representative sentence is determined based on the number.

代表文決定部１１２４は、代表文として決定された複数の集約代表文候補文に代表文識別ＩＤを割り当てるとともに、当該集約代表文候補文に集約された代表文候補文に関連付けられる各文書ＩＤを関連付けてＤＢサーバ２００（所定の記憶領域）に記憶する（Ｓ３０７）。 The representative sentence determination unit 1124 assigns representative sentence identification IDs to a plurality of aggregate representative sentence candidate sentences determined as representative sentences, and assigns each document ID associated with the representative sentence candidate sentences aggregated in the aggregate representative sentence candidate sentences. The database is associated and stored in the DB server 200 (predetermined storage area) (S307).

また、代表文生成部１１２は、ケース情報２２０の問合せ文及びその回答文を対として含む各文書を文書ＩＤ別に抽出し、複数の自立語を含む回答文によって構成される文書群を対象に、代表文生成処理を遂行し、生成元の文書ＩＤが紐付けられた回答の代表文（回答の要約文）を生成し、回答代表文として決定された複数の集約代表文候補文に代表文識別ＩＤを割り当てるとともに、当該集約代表文候補文に集約された代表文候補文に関連付けられる各文書ＩＤを関連付けてＤＢサーバ２００（所定の記憶領域）に記憶する（図１５）。 In addition, the representative sentence generation unit 112 extracts each document including the query sentence of the case information 220 and the answer sentence as a pair for each document ID, and targets a document group composed of answer sentences including a plurality of independent words. Performs representative sentence generation processing, generates a representative sentence (answer summary sentence) associated with the document ID of the generation source, and identifies a representative sentence to a plurality of aggregate representative sentence candidate sentences determined as the representative sentence In addition to assigning an ID, each document ID associated with the representative sentence candidate sentence aggregated with the aggregated representative sentence candidate sentence is associated and stored in the DB server 200 (predetermined storage area) (FIG. 15).

なお、本実施形態では、生成された問合せ代表文に紐付く生成元の文書群を対象にさらに代表文生成処理を遂行し、生成された一の問合せ代表文に対するサブクラスの問合せ代表文（サブ問合せ代表文）を生成する。 In the present embodiment, a representative sentence generation process is further performed on a generation source document group associated with the generated query representative sentence, and a subclass query representative sentence (sub-query) for the generated one query representative sentence is processed. Representative sentence).

例えば、図１４の代表文ＩＤ「１」が付与されている代表文「印刷ができません」に紐付いている各文書ＩＤの複数の文書を対象として代表文生成処理を遂行することで、図１６に示すように、代表文「印刷ができません」に対して「黒のインクが出なくなる」、「黒のみできません」といった、サブ代表文を生成する。すなわち、代表文「印刷ができません」が含まれている文書群を対象として生成される代表文が、上位階層の代表文「印刷ができません」の下位階層のサブ代表文として関連付けられ、問合せ代表文の階層構造を含む階層化問合せ代表文を生成することができる。なお、生成されるサブ代表文は、サブ代表文ＩＤ別に上位階層の代表文ＩＤに関連付けられて、ＤＢサーバ２００（所定の記憶領域）に記憶される。 For example, by performing the representative sentence generation process on the documents whose document IDs are linked to the representative sentence “cannot print” with representative sentence ID “1” in FIG. 14, sub representative sentences such as “black ink stops coming out” and “only black does not print” are generated for the representative sentence “cannot print”, as shown in FIG. 16. In other words, the representative sentences generated from the group of documents containing the representative sentence “cannot print” are associated, as lower-level sub representative sentences, with the upper-level representative sentence “cannot print”, so that hierarchical query representative sentences containing the hierarchical structure of the query representative sentences can be generated. Each generated sub representative sentence is associated, by sub representative sentence ID, with the representative sentence ID of its upper level and stored in the DB server 200 (a predetermined storage area).
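One way to picture the sub representative sentence step is to rerun the aggregation and ranking on only the documents linked to the parent sentence. The sketch below assumes candidates are `(text, document_id)` pairs and that the parent sentence itself is excluded from its own sub-sentences; both the function name and that exclusion are assumptions for illustration.

```python
from collections import defaultdict


def sub_representatives(parent_text, parent_doc_ids, candidates, limit=10):
    """Rank candidate sentences within the documents of one parent sentence."""
    doc_ids_by_text = defaultdict(set)
    for text, doc_id in candidates:
        # Only documents linked to the parent sentence are considered.
        if doc_id in parent_doc_ids and text != parent_text:
            doc_ids_by_text[text].add(doc_id)
    ranked = sorted(doc_ids_by_text.items(),
                    key=lambda item: (-len(item[1]), item[0]))
    return ranked[:limit]
```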

＜ＦＡＱ作成支援機能＞ <FAQ creation support function>
図４から図７を参照して、本実施形態のＦＡＱ作成支援機能について説明する。本実施形態のＦＡＱ作成支援機能は、代表文生成処理で生成された階層化問合せ代表文及び回答代表文を用いて、問合せ−回答マトリクス図（ＦＡＱ候補マトリクス図）及びＦＡＱ作成画面を通じたＦＡＱ作成環境を管理者端末４（ＦＡＱ作成者）に提供する。 The FAQ creation support function of this embodiment will be described with reference to FIGS. 4 to 7. Using the hierarchical query representative sentences and answer representative sentences generated by the representative sentence generation process, the FAQ creation support function of this embodiment provides the administrator terminal 4 (FAQ creator) with a FAQ creation environment through a query-answer matrix diagram (FAQ candidate matrix diagram) and a FAQ creation screen.

ＦＡＱ作成制御部１１４は、認証部１１１による認証処理を経た管理者端末４から伝送されるＦＡＱ作成要求に基づいて、ＦＡＱ作成処理を遂行する。 The FAQ creation control unit 114 performs FAQ creation processing based on the FAQ creation request transmitted from the administrator terminal 4 that has undergone authentication processing by the authentication unit 111.

ＦＡＱ作成制御部１１４は、ＦＡＱ作成要求を受信した場合、ＦＡＱ候補制御部１１３に、階層化問合せ代表文及び回答代表文を用いた問合せ−回答マトリクス図の生成命令を出力する。 When the FAQ creation control unit 114 receives the FAQ creation request, the FAQ creation control unit 114 outputs, to the FAQ candidate control unit 113, an instruction to generate a query-answer matrix diagram using the hierarchical query representative sentence and the answer representative sentence.

生成命令が入力されたＦＡＱ候補制御部１１３は、ＦＡＱ候補評価情報の生成処理及び問合せ−回答マトリクス図の生成処理を遂行する。ＦＡＱ候補制御部１１３は、ＤＢサーバ２００に記憶されている階層化問合せ代表文２３３及び回答代表文２３２を参照して、一の問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、カウントされた各文書数を用いて一の問合せ代表文と一の回答代表文とのペアに対するＦＡＱ候補評価情報を生成する。 The FAQ candidate control unit 113 to which the generation instruction is input performs processing for generating FAQ candidate evaluation information and processing for generating an inquiry-answer matrix diagram. The FAQ candidate control unit 113 refers to the hierarchical query representative sentence 233 and the answer representative sentence 232 stored in the DB server 200, and associates each document related to one query representative sentence with each of the answer representative sentences. The number of documents matching each existing document is counted, and FAQ candidate evaluation information for a pair of one inquiry representative sentence and one answer representative sentence is generated using each counted document number.

本実施形態のＦＡＱ候補評価情報は、カウントされた文書数であり、例えば、問合せ代表文「印刷ができません」に関連付けられる文書ＩＤが、「１、２、５、８、９、１０、１１・・・・」であり、回答代表文「印刷エラーの自動修復を実施する」に関連付けられる文書ＩＤが「８、１１、２５、３１、・・・」である場合、２つの文書ＩＤ「８」と「１１」がマッチングし、文書ＩＤ数を「２」とカウントする。 The FAQ candidate evaluation information of this embodiment is the counted number of documents. For example, if the document IDs associated with the query representative sentence “cannot print” are “1, 2, 5, 8, 9, 10, 11, ...” and the document IDs associated with the answer representative sentence “perform automatic repair of print errors” are “8, 11, 25, 31, ...”, the two document IDs “8” and “11” match, and the number of document IDs is counted as “2”.
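The count in this example is simply the size of the intersection of the two document ID sets. A minimal sketch, using only the document IDs explicitly listed in the example (the elided IDs are left out) and a hypothetical function name:

```python
def faq_candidate_score(query_doc_ids, answer_doc_ids):
    """Number of documents linked to both the query and the answer sentence."""
    return len(set(query_doc_ids) & set(answer_doc_ids))


# Document IDs explicitly listed in the example above.
query_ids = {1, 2, 5, 8, 9, 10, 11}
answer_ids = {8, 11, 25, 31}
score = faq_candidate_score(query_ids, answer_ids)  # documents 8 and 11 match
```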

また、ＦＡＱ候補制御部１１３は、一のサブ問合せ代表文に関連付く各文書が回答代表文それぞれに関連付いている各文書とマッチングする文書数をカウントし、カウントされた各文書数を用いてサブ問合せ代表文と回答代表文とのペアに対するＦＡＱ候補評価情報も生成する。 Further, the FAQ candidate control unit 113 counts the number of documents in which each document associated with one sub-query representative sentence matches each document associated with each answer representative sentence, and uses the counted number of documents. FAQ candidate evaluation information for a pair of a sub inquiry representative sentence and an answer representative sentence is also generated.

本実施形態のＦＡＱ候補評価情報の生成処理によって生成されるＦＡＱ候補評価情報（文書ＩＤ数）は、一の問合せ代表文と一の回答代表文との関係を表す情報であり、ＦＡＱ候補制御部１１３は、生成されたＦＡＱ候補評価情報（カウントした文書ＩＤ数）を、該当の問合せ代表文と回答代表文の行と列が交差する位置の表示ブロックに表示し、問合せ代表文と回答代表文との対を定量的に評価した評価情報として提供する。なお、カウント数が「０」である場合は、「０」を表示ブロックに表示する。 The FAQ candidate evaluation information (number of document IDs) generated by the FAQ candidate evaluation information generation process of the present embodiment is information indicating the relationship between one inquiry representative sentence and one answer representative sentence, and is an FAQ candidate control unit. 113 displays the generated FAQ candidate evaluation information (the number of document IDs counted) in a display block at a position where the row and column of the corresponding query representative sentence and the reply representative sentence intersect, and the query representative sentence and the reply representative sentence It is provided as evaluation information that quantitatively evaluates the pair. When the count number is “0”, “0” is displayed on the display block.

ＦＡＱ候補制御部１１３は、複数の問合せ代表文別及びサブ問合せ代表文別に全ての回答代表文との間のＦＡＱ候補評価情報を生成し、問合せ代表文及び回答代表文を縦横に配置したマトリクス図であって、問合せ代表文と回答代表文とが縦横で交わる位置に、該当の問合せ代表文と回答代表文との対に対応するＦＡＱ候補評価情報が表示された問合せ−回答マトリクス図を生成する。このとき、ＦＡＱ候補制御部１１３は、問合せ代表文のサブ問合せ代表文を該当の問合せ代表文と共に、マトリクス図に配置し、サブ問合せ代表文と回答代表文とが縦横で交わる位置に、サブ問合せ代表文と回答代表文との対に対応するＦＡＱ候補評価情報が表示されるように、問合せ−回答マトリクス図を生成する。 The FAQ candidate control unit 113 generates FAQ candidate evaluation information between all answer representative sentences for each query representative sentence and sub-query representative sentence, and the query representative sentence and the answer representative sentence are arranged vertically and horizontally. A query-response matrix diagram is generated in which FAQ candidate evaluation information corresponding to a pair of the query representative sentence and the answer representative sentence is displayed at a position where the query representative sentence and the answer representative sentence intersect vertically and horizontally. . At this time, the FAQ candidate control unit 113 arranges the sub-query representative sentence of the query representative sentence together with the corresponding query representative sentence in the matrix diagram, and the sub-query at the position where the sub-query representative sentence and the answer representative sentence intersect vertically and horizontally. An inquiry-answer matrix diagram is generated so that FAQ candidate evaluation information corresponding to a pair of representative sentence and answer representative sentence is displayed.
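The matrix generation can be sketched as filling one cell per pair of row and column. The sketch assumes rows and columns are lists of `(label, document_id_set)` pairs already arranged in display order, with sub-query representative sentences placed directly after their parent; the function name and layout are hypothetical.

```python
def build_matrix(query_rows, answer_columns):
    """Return a grid of match counts, one row per query representative sentence.

    Cell [i][j] holds the number of document IDs shared by query row i and
    answer column j; a pair with no shared document yields 0.
    """
    return [
        [len(q_ids & a_ids) for _, a_ids in answer_columns]
        for _, q_ids in query_rows
    ]
```

Keeping the rows as an ordered list makes it straightforward to insert the sub-query rows between their parent and the next top-level representative sentence.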

例えば、縦軸に図１６の問合せ代表文を代表文ＩＤ別及びサブ代表文ＩＤ別に配列し、同様に横軸に図１５の回答代表文を回答代表文ＩＤ別に配列する。なお、図１６の例では、代表文ＩＤ「１」に紐付く下位階層のサブ代表文は、代表文ＩＤ「１」と代表文ＩＤ「２」との間に配列する。 For example, the inquiry representative sentences in FIG. 16 are arranged by representative sentence ID and sub representative sentence ID on the vertical axis, and similarly, the answer representative sentences in FIG. 15 are arranged by answer representative sentence ID on the horizontal axis. In the example of FIG. 16, the sub-representative sentences in the lower hierarchy linked to the representative sentence ID “1” are arranged between the representative sentence ID “1” and the representative sentence ID “2”.

図４は、管理者端末４の表示される２次元の問合せ−回答マトリクス図の一例であり、縦軸（列）に各問合せ代表文（サブ問合せ代表文を含む）、横軸（行）に各回答代表文がそれぞれ配置され、問合せ代表文と回答代表文との対の関係を文書ＩＤ数で定量的に表した格子状の各表示ブロックが含まれる。 FIG. 4 is an example of the two-dimensional query-answer matrix diagram displayed on the administrator terminal 4. The query representative sentences (including sub-query representative sentences) are arranged along the vertical axis (column) and the answer representative sentences along the horizontal axis (row), and the diagram contains grid-like display blocks, each of which quantitatively expresses the relationship of a query representative sentence and answer representative sentence pair as a number of document IDs.

管理者端末４のＦＡＱ作成者は、マウス等の操作入力手段を用いて、問合せ−回答マトリクス図上の問合せ代表文と回答代表文との関係を表した文書ＩＤ数が表示される格子状の各表示ブロックを選択することができる。ＦＡＱ作成者によって格子状の各表示ブロックを選択されると、ＦＡＱ作成制御部１１４は、選択された表示ブロックに対応する問合せ代表文及び回答代表文に基づくＦＡＱ作成画面を管理者端末４に提供する。 Using an operation input means such as a mouse, the FAQ creator at the administrator terminal 4 can select any of the grid-like display blocks on the query-answer matrix diagram, each of which shows the number of document IDs representing the relationship between a query representative sentence and an answer representative sentence. When the FAQ creator selects a display block, the FAQ creation control unit 114 provides the administrator terminal 4 with a FAQ creation screen based on the query representative sentence and answer representative sentence corresponding to the selected display block.

ＦＡＱ作成制御部１１４は、問合せ−回答マトリクス図上の格子状の表示ブロック（ＦＡＱ候補評価情報）が選択された場合、選択された表示ブロックの文書ＩＤ数が「１」以上であるか否かを判別する。 If the grid-like display block (FAQ candidate evaluation information) on the inquiry-answer matrix diagram is selected, the FAQ creation control unit 114 determines whether the number of document IDs of the selected display block is “1” or more. Is determined.

選択された表示ブロックの文書ＩＤ数が「１」以上であると判別された場合、ＦＡＱ作成制御部１１４は、選択された表示ブロックに対応する問合せ代表文及び回答代表文を含むＦＡＱ作成画面を生成し、管理者端末４に伝送する。選択された表示ブロックの文書ＩＤ数が「１」以上でない、すなわち、選択された表示ブロックの文書ＩＤ数が「０」であると判別された場合、ＦＡＱ作成制御部１１４は、ＦＡＱ作成画面を通じた問合せ代表文及び回答代表文に基づくＦＡＱ作成処理を遂行しない。 When it is determined that the number of document IDs of the selected display block is “1” or more, the FAQ creation control unit 114 displays an FAQ creation screen including an inquiry representative sentence and an answer representative sentence corresponding to the selected display block. It is generated and transmitted to the administrator terminal 4. When it is determined that the number of document IDs of the selected display block is not “1” or more, that is, the number of document IDs of the selected display block is “0”, the FAQ creation control unit 114 passes through the FAQ creation screen. The FAQ preparation process based on the inquiry representative sentence and the answer representative sentence is not performed.

このとき、ＦＡＱ作成制御部１１４は、ケース情報２２０を参照して問合せ文とその回答文を含む文書の文書集合の中から選択された表示ブロック（ＦＡＱ候補評価情報）に対応する問合せ代表文に関連付く各抽出元の問合せ文それぞれを文書ＩＤに基づいて抽出し、選択された選択された表示ブロックに対応する問合せ代表文及び回答代表文と、抽出された各抽出元の問合せ文とを含むＦＡＱ作成画面を生成する。 At this time, the FAQ creation control unit 114 refers to the case information 220 and, from the document set of documents containing query sentences and their answer sentences, extracts by document ID each source query sentence associated with the query representative sentence corresponding to the selected display block (FAQ candidate evaluation information). It then generates a FAQ creation screen containing the query representative sentence and answer representative sentence corresponding to the selected display block together with the extracted source query sentences.
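The handling of a selected display block might look like the following sketch. It assumes the case information is a mapping from document ID to a record with `"query"` and `"answer"` fields, and that the listed source queries are those of the documents matched by both sentences; the function name, the record layout, and that choice of documents are assumptions made for illustration.

```python
def on_block_selected(query_text, query_doc_ids,
                      answer_text, answer_doc_ids, case_info):
    """Build the data for a FAQ creation screen, or None for a zero-count block."""
    matched = sorted(set(query_doc_ids) & set(answer_doc_ids))
    if not matched:
        # Count is 0: no FAQ creation screen is produced for this block.
        return None
    return {
        "query_representative": query_text,
        "answer_representative": answer_text,
        # Source query sentences looked up by document ID in the case data.
        "source_queries": [case_info[doc_id]["query"] for doc_id in matched],
    }
```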

図５は、本実施形態のＦＡＱ作成画面例である。図５の例では、図４の問合せ−回答マトリクス図における問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」とに対応する表示ブロック「２５」が選択された場合のＦＡＱ作成画面である。問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」との対の定量的な評価（関係）を示す２５個の問合せ文（２５個の文書ＩＤに紐付く問合せ文）を表示する問合せ一覧ブロックＡ、問合せ一覧ブロックＡで選択された１の文書（文書ＩＤに紐付く文書）の問合せ文とその回答文を表示する詳細情報表示ブロックＢ、問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」それぞれが、各入力欄に表示された新規ＦＡＱ作成ブロックＣを含んで構成されている。 FIG. 5 is an example of the FAQ creation screen of the present embodiment. In the example of FIG. 5, the display block “25” corresponding to the query representative sentence “cannot print” and the reply representative sentence “check ink remaining amount” in the query-answer matrix diagram of FIG. 4 is selected. It is a FAQ creation screen. Twenty-five query sentences (inquiry sentences associated with 25 document IDs) showing a quantitative evaluation (relationship) between the inquiry representative sentence “cannot print” and the answer representative sentence “check remaining ink level” Query list block A for displaying, query information of one document (document linked to document ID) selected in query list block A and the detailed information display block B for displaying the response text, query representative sentence "cannot print" Each of the reply representative sentences “confirm ink remaining amount” includes a new FAQ creation block C displayed in each input field.

ＦＡＱ作成者は、ＦＡＱ作成画面において問合せ代表文「印刷できません」と回答代表文「インクの残量を確認する」とで分類される２５個の抽出元の複数の各文書を問合せ一覧ブロックで見ることができ、問合せ一覧ブロックで選択した抽出元の１の文書の問合せ文とその回答文を詳細情報表示ブロックで見ることができる。ＦＡＱ作成制御部１１４は、問合せ一覧ブロックＡに対する表示及び問合せ一覧ブロックＡで選択された１の文書の問合せ文とその回答文を詳細情報表示ブロックＢに表示する表示の各制御を遂行する。 The FAQ creator sees a plurality of 25 source documents in the query list block that are categorized as a query representative sentence “cannot print” and a reply representative sentence “check remaining ink” on the FAQ creation screen. It is possible to view the inquiry sentence and the answer sentence of the one source document selected in the inquiry list block in the detailed information display block. The FAQ creation control unit 114 performs each control of display on the query list block A and display of the query text of one document selected in the query list block A and its answer text on the detailed information display block B.

また、ＦＡＱ作成者は、新規ＦＡＱ作成ブロックの問合せ内容入力欄に表示された問合せ代表文「印刷できません」を編集したり、対応内容入力欄に表示された回答代表文「インクの残量を確認する」を編集することができ、登録ボタンを選択することで、ＦＡＱ作成者は、問合せ内容入力欄及び対応内容入力欄のそれぞれに表示（入力）されている問合せ文とその回答文を対としたＦＡＱを登録（作成）することができる。 In addition, the FAQ creator edits the query representative sentence “Cannot print” displayed in the inquiry content input field of the new FAQ creation block, or the answer representative sentence “Check ink remaining amount” displayed in the corresponding content input field. By selecting the registration button, the FAQ creator makes a pair of the query text and the answer text displayed (input) in each of the query content input field and the corresponding content input field. FAQ can be registered (created).

図６は、本実施形態のＦＡＱ作成画面から登録されたＦＡＱ情報の一例を示す図であり、ＦＡＱ作成制御部１１４は、登録ボタンの選択操作に基づいて、作成されたＦＡＱ毎にＦＡＱ識別ＩＤ、件名、問合せ内容、問合せ代表文ＩＤ、対応内容、回答代表文ＩＤをＦＡＱ情報２４０として記憶する。なお、本実施形態では、問合せ−回答マトリクス図上の問合せ代表文及び回答代表文との関係に基づくＦＡＱ作成環境を提供するので、ＦＡＱ作成画面が問合せ代表文及び回答代表文に紐付くことになる。このため、ＦＡＱ作成画面を通じて作成されたＦＡＱ情報には、問合せ−回答マトリクス図上の該当する問合せ代表文ＩＤ及び回答代表文ＩＤがそれぞれ含まれて、ＦＡＱ情報２４０に記憶される。 FIG. 6 shows an example of the FAQ information registered from the FAQ creation screen of this embodiment. When the registration button is selected, the FAQ creation control unit 114 stores, for each created FAQ, the FAQ identification ID, subject, query content, query representative sentence ID, response content, and answer representative sentence ID as FAQ information 240. Because this embodiment provides a FAQ creation environment based on the relationship between the query representative sentences and answer representative sentences on the query-answer matrix diagram, each FAQ creation screen is tied to a query representative sentence and an answer representative sentence. The FAQ information created through the FAQ creation screen therefore includes the corresponding query representative sentence ID and answer representative sentence ID from the query-answer matrix diagram when it is stored in the FAQ information 240.
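The stored fields listed above can be pictured as a simple record. The class name, field names, and the sequential ID assignment below are hypothetical; only the set of fields comes from the text.

```python
from dataclasses import dataclass


@dataclass
class FaqRecord:
    faq_id: int                    # FAQ identification ID
    subject: str                   # subject
    query_content: str             # query content (editable by the creator)
    query_representative_id: int   # ID of the query representative sentence
    response_content: str          # response content (editable by the creator)
    answer_representative_id: int  # ID of the answer representative sentence


def register_faq(store, subject, query_content, query_representative_id,
                 response_content, answer_representative_id):
    """Append a new FAQ record to `store`, numbering records from 1."""
    record = FaqRecord(len(store) + 1, subject, query_content,
                       query_representative_id, response_content,
                       answer_representative_id)
    store.append(record)
    return record
```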

Note that the FAQ creation control unit 114 may first provide the administrator terminal 4 with a FAQ creation screen containing the query representative sentence and the answer representative sentence corresponding to the selected display block. Then, in response to a selection operation on the inquiry list block A, it may refer to the case information 220, extract by document ID each source query sentence associated with the query representative sentence corresponding to the selected display block (FAQ candidate evaluation information) from the document set of documents containing query sentences and their answer sentences, and display the extracted query sentences in the inquiry list block A.

In the example of FIG. 4, suppose that the display block corresponding to the sub-query representative sentence "Document is put on hold" under the query representative sentence "Cannot print" and to the answer representative sentence "Check ink remaining amount" is selected on the query-answer matrix diagram. In that case, the FAQ creation control unit 114 can automatically generate "Cannot print. Document is put on hold", which combines the selected sub-query representative sentence with its upper-layer query representative sentence, and display it as a FAQ candidate in the inquiry content input field of the new FAQ creation block C in FIG. 5. In other words, when a subclass query representative sentence is selected, a FAQ creation candidate (inquiry content) that combines the upper-layer query representative sentence with the lower-layer query representative sentence can be generated and automatically displayed in the new FAQ creation block C.
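The combination of an upper-layer sentence with its sub-query sentence can be sketched as a simple string join. The function name `combine_inquiry` and the choice of ". " as separator are assumptions for illustration; the embodiment only shows the resulting text.

```python
def combine_inquiry(parent: str, sub: str, separator: str = ". ") -> str:
    """Join an upper-layer query representative sentence with a
    sub-query representative sentence into one FAQ candidate."""
    # Drop trailing periods or spaces so the separator is not doubled.
    return f"{parent.rstrip('. ')}{separator}{sub}"


candidate = combine_inquiry("Cannot print", "Document is put on hold")
```

The same helper could serve the answer side, where an upper-layer answer representative sentence is combined with a sub-answer representative sentence.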

図７は、本実施形態のＦＡＱ作成支援サーバ１００のＦＡＱ作成支援処理フローを示す図である。 FIG. 7 is a diagram illustrating a FAQ creation support process flow of the FAQ creation support server 100 according to the present embodiment.

認証部１１１は、管理者端末４からＦＡＱ作成要求を受信すると（Ｓ２０１）、認証処理を遂行する（Ｓ１０１）。 When receiving the FAQ creation request from the administrator terminal 4 (S201), the authentication unit 111 performs an authentication process (S101).

Based on the FAQ creation request that has passed authentication, the FAQ candidate control unit 113 acquires the hierarchical query representative sentences 233 and the answer representative sentences 232 stored in the DB server 200 (S102). It then performs FAQ candidate evaluation information generation processing, which generates FAQ candidate evaluation information between each query representative sentence and/or sub-query representative sentence and every answer representative sentence (S103), and generation processing for the query-answer matrix diagram, in which the FAQ candidate evaluation information for each pair of a query representative sentence and an answer representative sentence is displayed at the position where the two intersect vertically and horizontally (S104).
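Steps S103 and S104 can be illustrated with a minimal sketch in which the FAQ candidate evaluation information for a pair is the number of document IDs shared by the query representative sentence and the answer representative sentence. The function name, the dictionary layout, and the sample document IDs are assumptions for illustration, not the stored format of the DB server 200.

```python
from typing import Dict, Set, Tuple


def build_candidate_matrix(
    query_docs: Dict[str, Set[str]],
    answer_docs: Dict[str, Set[str]],
) -> Dict[Tuple[str, str], int]:
    """For every (query, answer) pair, count the documents that are
    associated with both representative sentences."""
    matrix: Dict[Tuple[str, str], int] = {}
    for query, q_ids in query_docs.items():
        for answer, a_ids in answer_docs.items():
            # Set intersection = documents matching on both sides.
            matrix[(query, answer)] = len(q_ids & a_ids)
    return matrix


# Hypothetical sample: representative sentence -> source document IDs.
sample_queries = {"Cannot print": {"D1", "D2", "D3"}, "Paper jams": {"D4"}}
sample_answers = {
    "Check ink remaining amount": {"D1", "D2"},
    "Reinstall the driver": {"D3", "D4"},
}
sample_matrix = build_candidate_matrix(sample_queries, sample_answers)
```

Each value in `sample_matrix` corresponds to the number that would be shown in one display block of the matrix diagram.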

ＦＡＱ候補制御部１１３は、生成した問合せ−回答マトリクス図を管理者端末４に伝送する（Ｓ１０５）。 The FAQ candidate control unit 113 transmits the generated inquiry-answer matrix diagram to the administrator terminal 4 (S105).

Each grid-like display block on the query-answer matrix diagram displayed on the administrator terminal 4 shows the number of document IDs that represents the relationship between a query representative sentence and an answer representative sentence. When one of these display blocks is selected (S106, S202), the FAQ creation control unit 114 refers to the case information 220 and extracts, by document ID, each source query sentence associated with the query representative sentence corresponding to the selected display block (S107). It then generates a FAQ creation screen containing the query representative sentence and the answer representative sentence corresponding to the selected display block together with the extracted source query sentences (S108).
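Steps S107 and S108 can be sketched as a lookup by document ID followed by assembly of the screen contents. The function name, the `"inquiry"` key in the case records, and the returned dictionary keys are assumptions for illustration; the actual layout of the case information 220 and of the screen is not specified here.

```python
from typing import Dict, List, Set


def build_faq_creation_screen(
    selected_query: str,
    selected_answer: str,
    query_docs: Dict[str, Set[str]],
    case_info: Dict[str, Dict[str, str]],
) -> Dict[str, object]:
    """Collect the source query sentences behind the selected block (S107)
    and bundle them with the two representative sentences (S108)."""
    doc_ids = sorted(query_docs.get(selected_query, set()))
    source_queries: List[str] = [
        case_info[doc_id]["inquiry"] for doc_id in doc_ids if doc_id in case_info
    ]
    return {
        "inquiry_content": selected_query,    # initial value, editable
        "response_content": selected_answer,  # initial value, editable
        "source_queries": source_queries,     # shown in the inquiry list block
    }


# Hypothetical case records keyed by document ID.
demo_cases = {
    "D1": {"inquiry": "The printer will not print anything."},
    "D2": {"inquiry": "Printing fails after I replaced the cartridge."},
}
demo_screen = build_faq_creation_screen(
    "Cannot print", "Check ink remaining amount",
    {"Cannot print": {"D1", "D2"}}, demo_cases,
)
```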

The FAQ creation control unit 114 transmits the generated FAQ creation screen to the administrator terminal 4 and controls the display of the FAQ creation screen according to operation inputs by the FAQ creator (S109). When the registration button is selected (S203), the FAQ creation control unit 114 registers, in the FAQ information 240, a FAQ that pairs the query sentence and the answer sentence displayed (entered) in the inquiry content input field and the response content input field, respectively (S110).

In a document set of documents each containing a query sentence and its answer sentence, the FAQ creation support system of the present embodiment matches a plurality of query representative sentences (query summary sentences), which appropriately represent a first document group made up of the query sentences of the documents, against a plurality of answer representative sentences (answer summary sentences), which appropriately represent a second document group made up of the answer sentences of the documents. By expressing the relationship of each pair of a query representative sentence and an answer representative sentence quantitatively as the number of source document IDs, the system can provide a FAQ creation environment in which the need for a FAQ can be judged easily and appropriately.

例えば、ＦＡＱ作成者は、問合せとその回答に関し、同一内容の事象がどれだけ登録されているかを定量的に知ることができ、ＦＡＱ作成要否を容易かつ適切に判断することができる。 For example, the FAQ creator can quantitatively know how many events of the same content are registered regarding the inquiry and the answer, and can easily and appropriately determine whether or not the FAQ needs to be created.

Working time and workload can therefore be reduced compared with the conventional practice of manually reading a large volume of response histories and creating FAQ candidates from scratch. In particular, because the FAQ candidates are expressed as sentences, less work is needed to finish them into FAQs than when only keywords are available, for example the effort of writing a FAQ candidate text from keywords, and the efficiency of FAQ creation improves.

Furthermore, the present embodiment provides the FAQ creator with a query-answer matrix diagram in which the query representative sentences and the answer representative sentences are arranged vertically and horizontally and in which each grid-like display block shows the quantitative relationship of one pair of a query representative sentence and an answer representative sentence. This yields a quantitative evaluation in which the documents containing query sentences and their answer sentences are appropriately classified for FAQ creation, and it lets the FAQ creator grasp at a glance the overall picture of the FAQ candidates derived from the document set (for example, which inquiries are frequent and how they tend to be answered).

また、問合せ−回答マトリクス図上の表示ブロックの選択に基づくＦＡＱ作成画面を提供するので、定量的な評価で集約（分類）された、問合せ文とその回答文を含む文書の文書集合に対するＦＡＱ作成環境を提供することができる。つまり、ＦＡＱ作成画面自体が問合せ代表文及び回答代表文によって文書集合に集約したＦＡＱ作成環境となるので、ＦＡＱの主題を外れることなく、適切なＦＡＱを作成することができる。 In addition, since the FAQ creation screen is provided based on the selection of the display block on the query-answer matrix diagram, the FAQ creation is performed on the document set including the query sentence and the answer sentence aggregated (classified) by quantitative evaluation. An environment can be provided. That is, since the FAQ creation screen itself is a FAQ creation environment that is aggregated into a document set by an inquiry representative sentence and an answer representative sentence, an appropriate FAQ can be created without departing from the subject matter of the FAQ.

In addition, because the FAQ creation screen displays, together with the query representative sentence and the answer representative sentence, each source query sentence associated with the query representative sentence, the FAQ creator can create a FAQ while grasping the fine nuances and background behind the summarized text (representative sentence).

The embodiment above describes an example in which the FAQ candidate evaluation information generation processing and the query-answer matrix diagram generation processing are performed based on the hierarchical query representative sentences 233 and the answer representative sentences 232. These processes can also be performed based on the query representative sentences 231 and the answer representative sentences 232.

Besides the matrix diagram of FIG. 4, which is composed of a plurality of query representative sentences and a plurality of answer representative sentences, the query-answer matrix diagram can also be generated as, for example, a matrix diagram composed of one query representative sentence and a plurality of answer representative sentences, or a matrix diagram containing only those pairs of a query representative sentence and an answer representative sentence whose FAQ candidate evaluation information is equal to or greater than a predetermined value (whose number of document IDs is equal to or greater than a predetermined number).
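The two matrix variants mentioned above can be sketched as filters over a dictionary that maps (query, answer) pairs to document counts. The function names and the sample counts are assumptions for illustration.

```python
from typing import Dict, Tuple

Matrix = Dict[Tuple[str, str], int]


def filter_by_min_docs(matrix: Matrix, min_docs: int) -> Matrix:
    """Keep only the pairs whose document count reaches the threshold."""
    return {pair: n for pair, n in matrix.items() if n >= min_docs}


def row_for_query(matrix: Matrix, query: str) -> Matrix:
    """Keep only the pairs that belong to one query representative sentence."""
    return {pair: n for pair, n in matrix.items() if pair[0] == query}


# Hypothetical counts for illustration.
full_matrix: Matrix = {
    ("Cannot print", "Check ink remaining amount"): 12,
    ("Cannot print", "Reinstall the driver"): 3,
    ("Paper jams", "Reinstall the driver"): 1,
}
frequent_pairs = filter_by_min_docs(full_matrix, min_docs=3)
print_row = row_for_query(full_matrix, "Cannot print")
```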

The example above shows two-level subclassing, in which representative sentence generation processing is performed again on the group of source documents associated with a query representative sentence to generate subclass query representative sentences. The embodiment is not limited to this. Representative sentence generation processing can be performed again on the subclass query representative sentences to obtain three levels, four levels, and so on, that is, query representative sentences subclassed over two or more levels.

The embodiment above describes subclassing of the query representative sentences as an example, but the answer representative sentences can be subclassed as well. That is, representative sentence generation processing can be performed again on the group of source documents associated with each answer representative sentence shown in FIG. 15 to generate subclass sub-answer representative sentences for each answer representative sentence. A FAQ creation environment can thus be provided in which the query representative sentences, the answer representative sentences, or both are subclassed. When the matrix diagram displays hierarchical answer representative sentences, they can be handled in the same way as the hierarchical display of query representative sentences: on the horizontal axis of the example of FIG. 4, the answer representative sentences are arranged by representative sentence ID and sub-representative sentence ID, and the lower-layer sub-representative sentences associated with an answer representative sentence ID are placed between adjacent representative sentence IDs.

The answer representative sentences can likewise be subclassed over two or more levels. Furthermore, in the same way as when a sub-query representative sentence below a query representative sentence is selected on the query-answer matrix diagram, a FAQ creation candidate (response content) that combines an upper-layer answer representative sentence with its lower-layer sub-answer representative sentence can be generated and automatically displayed in the new FAQ creation block C of FIG. 5.

In addition, the product classification, inquiry classification, and similar items contained in the incidental information for each document ID in the case information 220 can be used so that the representative sentence generation processing, the FAQ candidate evaluation information generation processing, or the query-answer matrix diagram generation processing is performed on a document set limited to documents whose query sentences and answer sentences concern a specific product or inquiry type. A separate FAQ creation environment can thus be provided for each product or inquiry type.
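Restricting the document set by incidental information can be sketched as a filter applied before any of the three processes run. The function name and the keys `"product"` and `"inquiry_type"` are assumptions for illustration; the actual field names of the case information 220 are not given here.

```python
from typing import Dict, Optional


def select_cases(
    case_info: Dict[str, Dict[str, str]],
    product: Optional[str] = None,
    inquiry_type: Optional[str] = None,
) -> Dict[str, Dict[str, str]]:
    """Return only the cases matching the given product classification
    and/or inquiry classification; None means no restriction."""
    selected = {}
    for doc_id, case in case_info.items():
        if product is not None and case.get("product") != product:
            continue
        if inquiry_type is not None and case.get("inquiry_type") != inquiry_type:
            continue
        selected[doc_id] = case
    return selected


# Hypothetical case records for illustration.
all_cases = {
    "D1": {"product": "Printer", "inquiry_type": "Trouble"},
    "D2": {"product": "Scanner", "inquiry_type": "Trouble"},
    "D3": {"product": "Printer", "inquiry_type": "Purchase"},
}
printer_trouble = select_cases(all_cases, product="Printer", inquiry_type="Trouble")
```

The filtered dictionary would then be the input to representative sentence generation, so the resulting matrix diagram covers only that product or inquiry type.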

The representative sentence generation unit 112 of the FAQ creation support server 100 can also be configured as a separate representative sentence generation device, in which case the FAQ creation support system is configured so that the FAQ creation support server 100 and the representative sentence generation device operate in conjunction with each other.

また、上述の実施形態の各処理は、コンピュータで実行可能なプログラムとして実現することも可能であり、当該プログラムがインストールされたコンピュータは、実施形態に係るＦＡＱ作成支援機能の各処理を遂行する情報処理装置として動作することが可能である。例えば、不図示の補助記憶装置に当該プログラムが格納され、ＣＰＵ等の制御部が補助記憶装置に格納されたプログラムを主記憶装置に読み出し、主記憶装置に読み出された該プログラムを制御部が実行し、コンピュータに実施形態に係る各処理を動作させることができる。 Each process of the above-described embodiment can also be realized as a computer-executable program, and the computer in which the program is installed is information for performing each process of the FAQ creation support function according to the embodiment. It is possible to operate as a processing device. For example, the program is stored in an auxiliary storage device (not shown), and a control unit such as a CPU reads the program stored in the auxiliary storage device to the main storage device, and the control unit reads the program read to the main storage device. It is possible to execute and cause the computer to operate each process according to the embodiment.

The program can be supplied to a computer in a state recorded on a computer-readable recording medium, or it can be downloaded to a computer through a network such as the Internet. Examples of computer-readable recording media include optical discs such as CD-ROM, phase change optical discs such as DVD-ROM, magneto-optical discs such as MO (Magneto-Optical) and MD (MiniDisc), magnetic disks such as floppy (registered trademark) disks and removable hard disks, and memory cards such as CompactFlash (registered trademark), SmartMedia, SD memory cards, and Memory Stick. Hardware devices such as specially designed and configured integrated circuits (IC chips and the like) are also included as recording media.

Although several embodiments of the present invention have been described, these embodiments are presented as examples and are not intended to limit the scope of the invention. These novel embodiments can be implemented in various other forms, and various omissions, replacements, and changes can be made without departing from the gist of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are included in the invention described in the claims and its equivalents.

1 Contact center system
2 ACD system
3 FAQ creation support system
4 Administrator terminal
5 Operator terminal
100 FAQ creation support server
110 CPU (control unit)
111 Authentication unit
112 Representative sentence generation unit
1121 Syntax analysis unit
1122 Representative sentence candidate extraction unit
1123 Sentence generation aggregation unit
1124 Representative sentence determination unit
113 FAQ candidate control unit
114 FAQ creation control unit
120 Communication control unit
130 Memory
200 DB server
210 Screen information
220 Case information
230 Representative sentence information
231 Query representative sentence information
232 Hierarchical query representative sentence information
233 Answer representative sentence information
240 FAQ information
250 Extraction rule storage unit

Claims

A FAQ creation support system comprising: a first storage unit that stores query representative sentences in which, based on identical query representative sentences among a plurality of query representative sentences extracted from the query sentences of the respective documents in a document set of documents each including a query sentence and its answer sentence, the documents corresponding to a plurality of source query sentences are associated with one query representative sentence;
a second storage unit that stores answer representative sentences in which, based on identical answer representative sentences among a plurality of answer representative sentences extracted from the answer sentences of the respective documents, the documents corresponding to a plurality of source answer sentences are associated with one answer representative sentence;
a FAQ candidate control unit that counts, for each answer representative sentence, the number of documents associated with one query representative sentence that match the documents associated with that answer representative sentence, and generates, using the counted numbers of documents, FAQ candidate evaluation information for each pair of one query representative sentence and one answer representative sentence; and
a FAQ creation control unit that generates a FAQ composed of a query and its answer based on the pair of a query representative sentence and an answer representative sentence corresponding to the FAQ candidate evaluation information, wherein the FAQ candidate control unit generates a FAQ candidate matrix diagram, which is a matrix diagram in which the query representative sentences and the answer representative sentences are arranged vertically and horizontally and in which the FAQ candidate evaluation information corresponding to each pair of a query representative sentence and an answer representative sentence is displayed at the position where that query representative sentence and that answer representative sentence intersect vertically and horizontally, and
when FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a predetermined computer is selected, the FAQ creation control unit extracts, from the document set of documents each including a query sentence and its answer sentence, each source query sentence associated with the query representative sentence corresponding to the selected FAQ candidate evaluation information, generates a FAQ creation screen including the query representative sentence and the answer representative sentence corresponding to the selected FAQ candidate evaluation information and the extracted source query sentences, and transmits the FAQ creation screen to the computer.

The FAQ creation support system according to claim 1, wherein the first storage unit further stores sub-query representative sentences in which, based on identical sub-query representative sentences among a plurality of sub-query representative sentences extracted, for a query representative sentence, from the query sentences of the plurality of documents associated with that query representative sentence, the documents corresponding to a plurality of source query sentences are associated with one sub-query representative sentence;
the FAQ candidate control unit counts, for each answer representative sentence, the number of documents associated with one sub-query representative sentence that match the documents associated with that answer representative sentence, generates, using the counted numbers of documents, FAQ candidate evaluation information for each pair of one sub-query representative sentence and one answer representative sentence, and
generates the FAQ candidate matrix diagram in which the query representative sentences, the sub-query representative sentences of the query representative sentences, and the answer representative sentences are arranged vertically and horizontally, and in which the FAQ candidate evaluation information corresponding to each pair of a query representative sentence and an answer representative sentence and the FAQ candidate evaluation information corresponding to each pair of a sub-query representative sentence and an answer representative sentence are displayed at the position where the query representative sentence and the answer representative sentence intersect and at the position where the sub-query representative sentence and the answer representative sentence intersect, respectively.

A FAQ creation support program for a computer connectable to a first storage unit and a second storage unit, the first storage unit storing query representative sentences in which, based on identical query representative sentences among a plurality of query representative sentences extracted from the query sentences of the respective documents in a document set of documents each including a query sentence and its answer sentence, the documents corresponding to a plurality of source query sentences are associated with one query representative sentence, and the second storage unit storing answer representative sentences in which, based on identical answer representative sentences among a plurality of answer representative sentences extracted from the answer sentences of the respective documents, the documents corresponding to a plurality of source answer sentences are associated with one answer representative sentence, the program causing the computer to realize:
a function of counting, for each answer representative sentence, the number of documents associated with one query representative sentence that match the documents associated with that answer representative sentence, and generating, using the counted numbers of documents, FAQ candidate evaluation information for each pair of one query representative sentence and one answer representative sentence; and a function of generating a FAQ composed of a query and its answer based on the pair of a query representative sentence and an answer representative sentence corresponding to the FAQ candidate evaluation information,
wherein the function of generating the FAQ candidate evaluation information generates a FAQ candidate matrix diagram, which is a matrix diagram in which the query representative sentences and the answer representative sentences are arranged vertically and horizontally and in which the FAQ candidate evaluation information corresponding to each pair of a query representative sentence and an answer representative sentence is displayed at the position where that query representative sentence and that answer representative sentence intersect vertically and horizontally, and
when FAQ candidate evaluation information on the FAQ candidate matrix diagram displayed on a predetermined computer is selected, the function of generating the FAQ extracts, from the document set of documents each including a query sentence and its answer sentence, each source query sentence associated with the query representative sentence corresponding to the selected FAQ candidate evaluation information, generates a FAQ creation screen including the query representative sentence and the answer representative sentence corresponding to the selected FAQ candidate evaluation information and the extracted source query sentences, and transmits the FAQ creation screen to the computer.