JP4008313B2

JP4008313B2 - Question type learning device, question type learning program, recording medium recording the program, recording medium recording a learning sample, question type identification device, question type identification program, recording medium recording the program

Info

Publication number: JP4008313B2
Application number: JP2002253853A
Authority: JP
Inventors: 潤鈴木; 裕佐々木; 英作前田
Original assignee: Nippon Telegraph and Telephone Corp
Current assignee: Nippon Telegraph and Telephone Corp
Priority date: 2002-08-30
Filing date: 2002-08-30
Publication date: 2007-11-14
Anticipated expiration: 2022-08-30
Also published as: JP2004094521A

Description

【０００１】
【発明の属する技術分野】
本発明は質問タイプ学習方法、装置、プログラム、および同プログラムを記録した記録媒体、ならびに質問タイプ同定方法、装置、プログラム、および同プログラムを記録した記録媒体に関する。
【０００２】
【従来の技術】
近年、情報検索／抽出、計算機との対話、質問応答など、人間から計算機に対して自然文で質問して計算機に解答させる技術が使用される場面が増えてきている。質問タイプ同定とは、質問が与えられたときに、その質問が「何を訊いているのか」という意図を解析することである。質問タイプは、質問の意図を表すクラスであり、質問の意図を解析することは、質問タイプを同定する分類問題と捉えることができることから、質問タイプ同定と呼ぶ。
【０００３】
計算機にとって人間が与える質問の意図を理解するのは非常に難しい問題である。人間と同様に、計算機にとっても、与えられた質問の意図を的確に理解できないと、質問に答えることはできない。
【０００４】
従来、質問のパターンを限定し意味的制約のもとで解析を行う手法、人手によりルールを作成する方法などが提案されている。意味制約を用いる方法は、各質問タイプを表す表現を事前に決定しておき、その表現にあった質問しか受け付けられないという問題がある。ルールによる方法の場合は、質問文に出現する表現をルールに変換することで理論上は、多種多様な表現の質問を扱うことが可能であるが、ルールを作成するのに人手が非常にかかる点が問題となる。また、仮にコストをかけて作成する場合でも、様々なパラメータを調整し最適な同定ルールを決定するのは極めて困難である。
【０００５】
このような問題を解決する一つの方法として、機械学習方法を用いて学習サンプルから自動で質問タイプの同定方法を学習する方法がある。機械学習方法では、その問題で分類すべき対象の特徴である「素性」を抽出し、その素性の集合を用いて分類器を学習するという手順で問題に適用する。
【０００６】
【発明が解決しようとする課題】
このような機械学習方法では、素性の数が重要な要素となる。本発明で取り扱う質問タイプ同定問題は、質問の意図を判別する問題であるため、高精度な識別を行うためには多種多様な情報が必要となる。特に、質問文中にどのような単語が出現するかという情報と、出現した単語同士がどのような関係で使われているかという情報が重要になる。つまり、質問タイプ同定問題は、質問タイプを特徴付けるために必要な素性数は非常に多くなる。機械学習方法では、学習サンプルが無限に存在するならば、素性数が多くなっても問題にはならないが、実際には学習サンプルは限られた数しか存在しないため、学習サンプル数に対して素性数が多いと、学習サンプルに対して過学習してしまい、未知のデータに対しての精度が低くなるという問題がある。
【０００７】
一般的に、機械学習方法をもちいる場合は、分類を行うのに最も有効であると考えられる必要最小限の素性のみを用いることが望ましい。しかし、質問タイプ同定では、素性の集合の中で、実際にどれが有効であるかを人間が直観的に判断するのは非常に難しい。このため、分類を行うのに最も有効であると考えられる必要最小限の素性を抽出するのは非常に困難であり、従来提案されている機械学習方法をそのまま適用すると高精度な分類器が学習できないという問題がある。
【０００８】
本発明の目的は、質問タイプ同定のための高精度な分類器を構成する質問タイプ学習方法、装置、プログラム、同プログラムを記録した記録媒体および学習データを記録した記録媒体を提供することにある。
【０００９】
本発明の他の目的は、前記分類器を用いて未知の質問から質問タイプを同定する質問タイプ同定方法、装置、プログラム、および記録媒体を提供することにある。
【００１０】
【課題を解決するための手段】
まず、特徴抽出部により、質問の学習サンプルから質問タイプを特徴付ける素性を抽出する。質問タイプを特徴付ける素性を抽出する方法として、単語属性Ｎ−ｇｒａｍを用いる。単語属性とは、単語、品詞、意味情報のことであり、Ｎ−ｇｒａｍとは、それらの連鎖を意味する。
【００１１】
この単語属性Ｎ−ｇｒａｍを用いることにより、質問タイプを特徴付ける素性を自動的に、かつ、網羅的に抽出することができる。例えば、人手により有効と考えられる素性を列挙する場合に比べてタイプを特徴付ける素性を効率的に抽出できる。
【００１２】
表１に単語属性Ｎ−ｇｒａｍの例を示す。
【００１３】
【表１】

ｗ，ｐ，ｓはそれぞれ単語、品詞、意味情報を表し、質問はｗ₁，ｗ₂，．．．，ｗ_nで構成されているとする。この場合、抽出される特徴は、単語属性Ｎ−ｇｒａｍに基づき、表２のようになる。ここで、Ｎ−ｇｒａｍの要素が１つのものを１−ｇｒａｍ、要素が２つの連鎖を２−ｇｒａｍと呼び、以下、要素３を３−ｇｒａｍ、要素４を４−ｇｒａｍ、．．．と呼ぶ。
【００１４】
【表２】

ここで、抽出された一つの単語属性の連鎖が一つの素性となる。
【００１５】
次に、素性ベクトル作成部により、単語属性Ｎ−ｇｒａｍにより抽出された素性から素性空間を構成し、各質問を素性ベクトルに変換する。最後に、分類器構成部により、抽出された特徴と質問の学習サンプルから、大量の素性を用いても高精度な分類器を学習可能な統計的機械学習方法を用いて学習する。サンプル数に対して素性数が多い問題でも、高精度な分類器を学習できる機械学習手法としてＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ（参考文献：V.Vapnik, The Nature of Statistical Learning Theory. Spring-Verlag, New York, 1995. 参照）を用いる。
【００１６】
学習時のＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅの入力は、クラスラベルと素性ベクトルのペアで表される。
【００１７】
【数１】

ここで、
【００１８】
【外１】

はｉ番目のサンプルの特徴ベクトルでｎ次元ベクトルであり、ｙ_iはサンプルｉのクラスを表すスカラー変数である。ただし、ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅは二クラス分類器なので、分類すべきクラスは正例（＋１）、負例（−１）の２つである。
【００１９】
学習サンプルの例を表３に示す。このような学習サンプルは記録媒体に記録して質問タイプの同定の際に利用することが可能であり、データのみ流通させることができる。
【００２０】
【表３】

素性数ｎの場合は、ｎ次元の素性空間を考えることにより、各質問から作成される素性ベクトルはｎ次元素性空間の一点を表すことになる。ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅは、図３のように、ｎ次元素性空間内で、２クラスを分離する超平面のうち、２クラス間のマージンが最大になる平面を最適識別平面として選択する学習手法である。
【００２１】
また、ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅは二クラス分類器であるので、多クラス分類手法を用いて適用する。多クラス分類手法としては、例えばｏｎｅｖｓ．ｒｅｓｔ法があり、分類対象となるクラスがｎ個存在するとき、任意の１クラスとそれ以外のｎ−１クラスを分類する分類器をｎ個作成することで、二クラス分類器で多クラスの分類を行う。この場合の識別境界は図４のようになる。
【００２２】
最後に、以上のようにして作成された分類器を用いて、未知の質問が入力されたときに、該質問がどの質問タイプかを識別する。これは、未知の質問を表３と同様の方法で素性ベクトルに変換し、ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅにより作成した分類器を用いて行う。識別ステップにより決定される質問タイプは、未知の質問から作成される素性ベクトルが素性空間上のどの位置に配置されるかで決定される。
【００２３】
【発明の実施の形態】
次に、本発明の実施の形態について図面を参照して説明する。
【００２４】
図１（１），（２）はそれぞれ本発明の一実施形態の質問タイプ学習装置、質問タイプ同定装置の構成図である。
【００２５】
質問タイプ学習装置は特徴抽出部１と素性ベクトル作成部２と分類器構成部３を有している。質問タイプ同定装置は素性ベクトル作成部４と質問タイプ同定部５を有している。
【００２６】
図２は本実施形態の質問タイプ学習装置と質問タイプ同定装置の処理の流れを示すフローチャートである。
【００２７】
ステップ１１に、特徴抽出部１は、質問の学習サンプルから質問タイプを特徴付ける素性を、単語Ｎ−ｇｒａｍを用いて抽出する。ステップ１２に、素性ベクトル作成部２は単語属性Ｎ−ｇｒａｍにより抽出された素性から素性空間を構成し、各質問を素性ベクトルに変換する。ステップ１３に、分類器構成部３は素性ベクトルからＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅを用いて、質問タイプを同定する分類器を構成する。
【００２８】
未知の質問が入力されたとき、素性ベクトル作成部４で、未知の質問から質問タイプを特徴づける素性を、単語Ｎ-ｇｒａｍを用いて抽出し、抽出された素性を素性ベクトルに変換し、質問タイプ同定部５で、分類器構成部３で作成された分類器を用いて質問タイプを同定する。
【００２９】
次に、本実施形態の具体例として、質問応答システムにおいて、質問タイプ同定問題を扱う場合を示す。
【００３０】
質問応答システムの場合は、質問タイプは主に解答の種類の分類を表すタイプとなる。例えば、「人名」「地名」「人数」「日時」等である。
【００３１】
まず、質問タイプ学習ステップを説明する。
【００３２】
質問応答システムの場合の学習サンプルの例として、「日本の首都はどこですか」を考える。この質問は場所を訊いている質問なので、識別すべき質問タイプは「場所」となる。このように、学習サンプルは表４に示すように質問と識別すべき質問タイプのペアで与えられる。
【００３３】
【表４】

このような学習サンプルから、各質問タイプを特徴付ける素性を抽出する。まず、質問を単語単位に分割し、各単語に品詞、意味情報を付加する。抽出される単語属性Ｎ−ｇｒａｍとは、例えば「日本の首都はどこですか」という質問なら、図５で示すように各単語自身とその品詞、意味情報を要素とした組み合わせの連鎖である。考えられる全ての単語属性Ｎ−ｇｒａｍを抽出する。
【００３４】
上記の「日本の首都はどこですか」という質問の場合、例えば表５のようになる。
【００３５】
【表５】

抽出された単語属性Ｎ−ｇｒａｍを素性として、素性空間を作成し、この素性空間を用いて統計的機械学習手法ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅを用いて最適識別平面を決定し分類器を作成する。
【００３６】
次に、質問タイプ同定ステップについて説明する。
【００３７】
これは、ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅにより作成された分類器を用いて、新たに入力される未知のサンプルの質問タイプを識別するステップである。
【００３８】
入力された未知の質問に対して、素性ベクトルを作成し、出力される質問タイプをその質問の質問タイプと判定する。
【００３９】
質問応答システムに適用する場合と同様に、対話システムや情報検索／抽出システムにも、それぞれのシステムに適した質問タイプを事前に決定すれば、それ以外は全く同様の方法で適用することができる。例えば、対話システムの場合で、スケジュール管理のタスクを取り扱っている場合は、質問タイプを「新規のスケジュール入力」「スケジュール変更」「スケジュール確認」等と決めることで適用することが可能である。また、情報検索／抽出システムの場合は、テキスト分類のカテゴリを質問タイプにすることで、検索範囲を限定することが可能になり、より高精度な検索が行えることができるようになる。
【００４０】
なお、本発明の質問学習方法および質問タイプ同定方法は専用のハードウェアにより実現されるもの以外に、その機能を実現するためのプログラムを、コンピュータ読み取り可能な記録媒体に記録して、この記録媒体に記録されたプログラムをコンピュータシステムに読み込ませ、実行するものであってもよい。コンピュータ読み取り可能な記録媒体とは、フロッピーディスク、光磁気ディスク、ＣＤ−ＲＯＭ等の記録媒体、コンピュータシステムに内蔵されるハードディスク装置等の記憶装置を指す。さらに、コンピュータ読み取り可能な記録媒体は、インターネットを介してプログラムを送信する場合のように、短時間の間、動的にプログラムを保持するもの（伝送媒体もしくは伝送波）、その場合のサーバとなるコンピュータシステム内部の揮発性メモリのように、一定時間プログラムを保持しているものも含む。
【００４１】
【発明の効果】
以上説明したように、本発明によれば、質問タイプを特徴付ける単語の属性の構造を抽出する単語属性Ｎ−ｇｒａｍと、サンプル数に対して素性数が多い問題設定でも高精度な分類器を学習できる統計的機械学習手法ＳＶＭ（ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅ）を組み合わせることにより、高精度な分類器を作成することが可能になり、これにより、質問のサンプルさえ作成すれば、質問タイプ同定のための高精度な分類器が作成可能となる。
【図面の簡単な説明】
【図１】本発明の一実施形態の質問学習装置と質問タイプ同定装置の構成図である。
【図２】図１中の質問学習装置と質問タイプ同定装置の処理を示すフローチャートである。
【図３】ＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅの概念図である。
【図４】多クラス分類により拡張されたＳｕｐｐｏｒｔＶｅｃｔｏｒＭａｃｈｉｎｅの概念図である。
【図５】単語属性Ｎ−ｇｒａｍの抽出例を示す図である。
【符号の説明】
１特徴抽出部
２素性ベクトル作成部
３分類器構成部
４素性ベクトル作成部
５質問タイプ同定部
１１〜１５ステップ
[0001]
[Technical field to which the invention belongs]
The present invention relates to a question type learning method, apparatus, program, and a recording medium recording the program, and a question type identification method, apparatus, program, and a recording medium recording the program.
[0002]
[Prior art]
In recent years, technologies in which a person asks a computer a question in natural language and the computer answers it, such as information retrieval / extraction, dialogue with a computer, and question answering, have come to be used in a growing number of situations. Question type identification means analyzing, when a question is given, the intent of that question, that is, what it is asking. A question type is a class representing the intent of a question. Because analyzing the intent of a question can be treated as a classification problem of identifying its question type, the task is called question type identification.
[0003]
Understanding the intent of a question posed by a human is a very difficult problem for a computer. Like a human, a computer cannot answer a question unless it accurately understands the intent of the given question.
[0004]
Conventionally, methods that restrict the question patterns and analyze them under semantic constraints, methods that rely on manually created rules, and the like have been proposed. The methods using semantic constraints have the problem that the expressions representing each question type are fixed in advance, so only questions that match those expressions can be accepted. The rule-based methods can in theory handle questions with a wide variety of expressions by converting the expressions that appear in question sentences into rules, but creating the rules requires a great deal of manual labor. Moreover, even if that cost is accepted, it is extremely difficult to tune the various parameters and determine the optimal identification rules.
[0005]
As one way of solving these problems, a question type identification method can be learned automatically from learning samples using a machine learning method. A machine learning method is applied to a problem by extracting “features”, that is, the characteristics of the objects to be classified in that problem, and then training a classifier on the set of those features.
[0006]
[Problems to be solved by the invention]
In such a machine learning method, the number of features is an important factor. Since the question type identification problem handled in the present invention is a problem of determining the intent of a question, a wide variety of information is required to perform highly accurate identification. In particular, information on which words appear in the question sentence and information on how the words that appear are related to one another are important. In other words, the question type identification problem requires a very large number of features to characterize the question types. In a machine learning method, a large number of features would not be a problem if an infinite number of learning samples existed. In practice, however, only a limited number of learning samples exist, so when the number of features is large relative to the number of learning samples, the classifier overfits the learning samples and its accuracy on unknown data drops.
[0007]
Generally, when using a machine learning method, it is desirable to use only the minimum necessary features that are considered most effective for performing the classification. In question type identification, however, it is very difficult for a human to judge intuitively which features in a set are actually effective. For this reason, it is very difficult to extract the minimum necessary features that are considered most effective for classification, and a highly accurate classifier cannot be learned if conventionally proposed machine learning methods are applied as they are.
[0008]
An object of the present invention is to provide a question type learning method, apparatus, and program that constitute a highly accurate classifier for question type identification, a recording medium recording the program, and a recording medium recording learning data.
[0009]
Another object of the present invention is to provide a question type identification method, apparatus, program, and recording medium for identifying a question type from an unknown question using the classifier.
[0010]
[Means for Solving the Problems]
First, a feature extraction unit extracts features that characterize question types from learning samples of questions. Word attribute N-grams are used as the method for extracting these features. The word attributes are the word itself, its part of speech, and its semantic information, and an N-gram is a chain of such attributes.
[0011]
By using this word attribute N-gram, the features that characterize the question types can be extracted automatically and exhaustively. For example, such features can be extracted more efficiently than when a person manually lists the features that seem effective.
[0012]
Table 1 shows an example of the word attribute N-gram.
[0013]
[Table 1]

w, p, and s represent the word, part of speech, and semantic information, respectively, and a question is assumed to consist of the words $w_1, w_2, \ldots, w_n$. In this case, the features extracted on the basis of the word attribute N-gram are as shown in Table 2. Here, an N-gram with one element is called a 1-gram and a chain of two elements is called a 2-gram. Likewise, a chain of three elements is called a 3-gram, a chain of four elements a 4-gram, and so on.
[0014]
[Table 2]

Here, one extracted word attribute chain is one feature.
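Since Tables 1 and 2 are not reproduced in this text, the following Python sketch illustrates one plausible reading of word attribute N-gram extraction: for every window of consecutive words, one attribute (word, part of speech, or semantic information) is chosen per position, and each resulting chain is one feature. The function name `word_attribute_ngrams`, the `w:`/`p:`/`s:` prefixes, and the example tags are illustrative assumptions, not taken from the patent.

```python
from itertools import product

def word_attribute_ngrams(tokens, max_n=2):
    """Return every chain of 1..max_n attributes over consecutive tokens.

    Each token is a (word, part_of_speech, semantic_info) triple. At each
    position of a chain exactly one of the three attributes is chosen, so a
    window of n tokens yields up to 3**n distinct chains.
    """
    features = set()
    for n in range(1, max_n + 1):
        for start in range(len(tokens) - n + 1):
            window = tokens[start:start + n]
            # Prefix each attribute with its kind so that a word and a tag
            # with the same spelling remain distinct features.
            choices = [(f"w:{w}", f"p:{p}", f"s:{s}") for (w, p, s) in window]
            for chain in product(*choices):
                features.add(chain)
    return features

# Hypothetical analysis of a three-word question (tags are placeholders).
question = [
    ("Japan", "noun", "COUNTRY"),
    ("capital", "noun", "PLACE"),
    ("where", "interrogative", "PLACE_Q"),
]
feats = word_attribute_ngrams(question, max_n=2)
print(sorted(feats)[:5])
```

Because every combination is enumerated, the number of features grows quickly with N, which is the situation the description addresses by choosing a learner that tolerates many features.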
[0015]
Next, a feature vector creation unit constructs a feature space from the features extracted by the word attribute N-gram, and converts each question into a feature vector. Finally, a classifier configuration unit learns a classifier from the extracted features and the learning samples of questions, using a statistical machine learning method that can learn a highly accurate classifier even when a large number of features is used. As a machine learning method that can learn a highly accurate classifier even for problems in which the number of features is large relative to the number of samples, the Support Vector Machine (reference: V. Vapnik, The Nature of Statistical Learning Theory. Springer-Verlag, New York, 1995) is used.
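The conversion from extracted features to feature vectors can be sketched as follows. This is a minimal illustration that assumes binary vector components (1 if the question contains the feature, 0 otherwise); the patent text available here does not state the component values, and the helper names `build_feature_space` and `to_feature_vector` are hypothetical.

```python
def build_feature_space(feature_sets):
    """Assign one dimension index to each distinct feature seen in training."""
    vocabulary = sorted(set().union(*feature_sets))
    return {feature: index for index, feature in enumerate(vocabulary)}

def to_feature_vector(features, feature_space):
    """Binary vector: 1.0 where the question contains the feature, else 0.0.

    Features that never occurred in the learning samples have no dimension
    in the feature space and are skipped.
    """
    vector = [0.0] * len(feature_space)
    for feature in features:
        index = feature_space.get(feature)
        if index is not None:
            vector[index] = 1.0
    return vector

# Feature sets of two hypothetical learning samples.
training_features = [
    {("w:where",), ("s:PLACE",), ("w:capital", "w:where")},
    {("w:who",), ("s:PERSON",)},
]
space = build_feature_space(training_features)
vectors = [to_feature_vector(f, space) for f in training_features]

# An unknown question is mapped into the same space at identification time.
unknown = to_feature_vector({("w:where",), ("w:never_seen",)}, space)
print(vectors, unknown)
```

Fixing the feature space at learning time means that an unknown question is always mapped to a vector of the same dimension as the learning samples.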
[0016]
The Support Vector Machine input at the time of learning is represented by a pair of a class label and a feature vector.
[0017]
[Expression 1]

[0018]
Here, the symbol shown as [Outside 1] is the feature vector of the i-th sample and is an n-dimensional vector, and $y_i$ is a scalar variable representing the class of sample i. Since the Support Vector Machine is a two-class classifier, there are two classes to be classified: positive examples (+1) and negative examples (-1).
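The expression itself is not reproduced in this text. A form consistent with the surrounding definitions (pairs of a class label and an n-dimensional feature vector) would be the following, where $\mathbf{x}_i$ for the feature vector and $m$ for the number of learning samples are assumed symbols that do not appear in the text above.

```latex
(y_1, \mathbf{x}_1), \ldots, (y_m, \mathbf{x}_m), \qquad
\mathbf{x}_i \in \mathbb{R}^n, \quad y_i \in \{+1, -1\}
```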
[0019]
Table 3 shows examples of learning samples. Such learning samples can be recorded on a recording medium and used when identifying question types, so the data alone can be distributed.
[0020]
[Table 3]

When the number of features is n, an n-dimensional feature space is considered, and the feature vector created from each question represents one point in that n-dimensional feature space. As shown in FIG. 3, the Support Vector Machine is a learning method that, among the hyperplanes separating the two classes in the n-dimensional feature space, selects the one that maximizes the margin between the two classes as the optimal discrimination plane.
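For linearly separable data, the maximum-margin selection described above is commonly written as the optimization problem below. The symbols $\mathbf{w}$ (normal vector of the hyperplane), $b$ (offset), and $\mathbf{x}_i$ (feature vector of sample i) follow standard textbook notation and are assumptions here, since the text does not name them.

```latex
\min_{\mathbf{w},\, b} \ \frac{1}{2}\lVert \mathbf{w} \rVert^{2}
\quad \text{subject to} \quad
y_i\,(\mathbf{w} \cdot \mathbf{x}_i + b) \ge 1 \quad \text{for all } i
```

The discrimination plane is $\mathbf{w} \cdot \mathbf{x} + b = 0$ and the margin between the two classes equals $2 / \lVert \mathbf{w} \rVert$, so minimizing $\lVert \mathbf{w} \rVert$ maximizes the margin.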
[0021]
In addition, since the Support Vector Machine is a two-class classifier, it is applied in combination with a multi-class classification method. One such method is the one-vs-rest method: when there are n classes to be classified, n classifiers are created, each of which separates one class from the remaining n-1 classes, so that multi-class classification is performed with two-class classifiers. The identification boundary in this case is as shown in FIG. 4.
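The one-vs-rest scheme can be sketched in Python as follows. To keep the example short and dependency-free, a simple perceptron stands in for the two-class learner; it is not a Support Vector Machine, and any binary learner returning a `(weights, bias)` pair could be passed in instead. The function names and the rule of picking the highest-scoring classifier are illustrative assumptions.

```python
def train_perceptron(vectors, labels, epochs=20):
    """Stand-in binary learner (NOT an SVM): returns (weights, bias)."""
    weights = [0.0] * len(vectors[0])
    bias = 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            score = sum(w * xi for w, xi in zip(weights, x)) + bias
            if y * score <= 0:  # misclassified: move the plane toward x
                weights = [w + y * xi for w, xi in zip(weights, x)]
                bias += y
    return weights, bias

def train_one_vs_rest(vectors, question_types, train_binary=train_perceptron):
    """Train one binary classifier per question type (+1 = that type, -1 = rest)."""
    classifiers = {}
    for target in sorted(set(question_types)):
        labels = [1 if t == target else -1 for t in question_types]
        classifiers[target] = train_binary(vectors, labels)
    return classifiers

def identify(vector, classifiers):
    """Return the question type whose classifier gives the highest score."""
    def score(model):
        weights, bias = model
        return sum(w * xi for w, xi in zip(weights, vector)) + bias
    return max(classifiers, key=lambda t: score(classifiers[t]))

# Toy binary feature vectors over three hypothetical features.
vectors = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
types = ["place", "person", "date"]
models = train_one_vs_rest(vectors, types)
print(identify([0, 1, 0], models))  # prints "person"
```

With n question types this trains n classifiers, matching the description above; the relabelling step is the only part that is specific to one-vs-rest.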
[0022]
Finally, the classifier created as described above is used to identify which question type the question is when an unknown question is input. This is done by converting an unknown question into a feature vector in the same manner as in Table 3 and using a classifier created by the Support Vector Machine. The question type determined by the identification step is determined by where in the feature space the feature vector created from the unknown question is placed.
[0023]
DETAILED DESCRIPTION OF THE INVENTION
Next, embodiments of the present invention will be described with reference to the drawings.
[0024]
FIGS. 1(1) and 1(2) are configuration diagrams of a question type learning device and a question type identification device, respectively, according to an embodiment of the present invention.
[0025]
The question type learning apparatus includes a feature extraction unit 1, a feature vector creation unit 2, and a classifier configuration unit 3. The question type identification device has a feature vector creation unit 4 and a question type identification unit 5.
[0026]
FIG. 2 is a flowchart showing a processing flow of the question type learning device and the question type identification device of this embodiment.
[0027]
In step 11, the feature extraction unit 1 extracts features that characterize the question type from the question learning sample using the word N-gram. In step 12, the feature vector creation unit 2 constructs a feature space from the features extracted by the word attribute N-gram, and converts each question into a feature vector. In step 13, the classifier configuration unit 3 configures a classifier that identifies the question type using the Support Vector Machine from the feature vector.
[0028]
When an unknown question is input, the feature vector creation unit 4 extracts features characterizing the question type from the unknown question using the word N-gram and converts the extracted features into a feature vector, and the question type identification unit 5 identifies the question type using the classifier created by the classifier configuration unit 3.
[0029]
Next, as a specific example of this embodiment, a case where a question type identification problem is handled in a question answering system will be described.
[0030]
In the case of a question answering system, the question types mainly represent a classification of the kinds of answers. Examples are “person name”, “place name”, “number of people”, and “date and time”.
[0031]
First, the question type learning step will be described.
[0032]
As an example of a learning sample for a question answering system, consider "Where is the capital of Japan?" Since this question asks for a place, the question type to be identified is “place”. In this way, the learning samples are given as pairs of a question and the question type to be identified, as shown in Table 4.
[0033]
[Table 4]

Features that characterize each question type are extracted from such learning samples. First, the question is divided into words, and part-of-speech and semantic information is attached to each word. For the question "Where is the capital of Japan?", for example, the extracted word attribute N-grams are chains of combinations whose elements are each word itself, its part of speech, and its semantic information, as shown in FIG. 5. All conceivable word attribute N-grams are extracted.
[0034]
In the case of the above-mentioned question "Where is the capital of Japan?", the result is, for example, as shown in Table 5.
[0035]
[Table 5]

A feature space is created with the extracted word attribute N-grams as features, and in this feature space the optimal discrimination plane is determined by the statistical machine learning method Support Vector Machine to create a classifier.
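The text does not describe how the optimal discrimination plane is computed. As a rough illustration only, the sketch below trains a linear soft-margin classifier by stochastic subgradient descent on the regularized hinge loss, which is one simple way to approximate a linear Support Vector Machine. The solver choice, the function names, and the parameters `reg`, `lr`, and `epochs` are all assumptions and not the patent's method.

```python
def train_linear_svm(vectors, labels, reg=0.01, lr=0.1, epochs=200):
    """Stochastic subgradient descent on reg/2 * ||w||^2 + hinge loss.

    Returns (w, b) describing the plane w . x + b = 0.
    """
    w = [0.0] * len(vectors[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(vectors, labels):
            margin = y * (sum(wi * xi for wi, xi in zip(w, x)) + b)
            # L2 shrinkage keeps ||w|| small, which keeps the margin wide.
            w = [wi * (1 - lr * reg) for wi in w]
            if margin < 1:  # sample is inside the margin or misclassified
                w = [wi + lr * y * xi for wi, xi in zip(w, x)]
                b += lr * y
    return w, b

def predict(model, x):
    """Return +1 or -1 depending on which side of the plane x lies."""
    w, b = model
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1

# Toy learning samples: binary feature vectors with labels +1 / -1.
X = [[1, 1, 0, 0], [1, 0, 0, 0], [0, 0, 1, 1], [0, 0, 0, 1]]
y = [1, 1, -1, -1]
model = train_linear_svm(X, y)
print([predict(model, x) for x in X])
```

In practice an established SVM library would normally be used instead of a hand-written loop; the sketch is only meant to show how class labels and feature vectors enter the learning step.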
[0036]
Next, the question type identification step will be described.
[0037]
This is a step of identifying the question type of a newly input unknown sample using the classifier created by the Support Vector Machine.
[0038]
A feature vector is created for the input unknown question, and the question type output by the classifier is taken as the question type of that question.
[0039]
As with a question answering system, the invention can be applied to a dialogue system or an information retrieval / extraction system in exactly the same way, provided that question types suited to each system are determined in advance. For example, for a dialogue system that handles a schedule management task, the invention can be applied by defining the question types as “new schedule input”, “schedule change”, “schedule confirmation”, and the like. For an information retrieval / extraction system, using text classification categories as the question types makes it possible to limit the search range and thus perform a more accurate search.
[0040]
The question learning method and question type identification method of the present invention may be realized not only by dedicated hardware but also by recording a program for realizing their functions on a computer-readable recording medium and having a computer system read and execute the program recorded on that medium. The computer-readable recording medium refers to a recording medium such as a floppy disk, a magneto-optical disk, or a CD-ROM, or a storage device such as a hard disk device built into the computer system. Furthermore, the computer-readable recording medium also includes media that hold a program dynamically for a short time (a transmission medium or transmission wave), as when a program is transmitted via the Internet, and media that hold a program for a certain period of time, such as the volatile memory inside the computer system that serves as the server in that case.
[0041]
[Effects of the invention]
As described above, according to the present invention, a highly accurate classifier can be created by combining the word attribute N-gram, which extracts the structure of the word attributes that characterize a question type, with the SVM (Support Vector Machine), a statistical machine learning method that can learn a highly accurate classifier even in problem settings where the number of features is large relative to the number of samples. As a result, a highly accurate classifier for question type identification can be created simply by preparing sample questions.
[Brief description of the drawings]
FIG. 1 is a configuration diagram of a question learning device and a question type identification device according to an embodiment of the present invention.
FIG. 2 is a flowchart showing processing of the question learning device and the question type identification device in FIG. 1.
FIG. 3 is a conceptual diagram of a Support Vector Machine.
FIG. 4 is a conceptual diagram of Support Vector Machine extended by multi-class classification.
FIG. 5 is a diagram illustrating an extraction example of a word attribute N-gram.
[Explanation of symbols]
1 Feature extraction unit
2 Feature vector creation unit
3 Classifier configuration unit
4 Feature vector creation unit
5 Question type identification unit
11-15 Steps

Claims

Means for extracting features that characterize each question type from a sample of question sentences prepared in advance using a word attribute N-gram;
Means for constructing a feature space from the extracted features and converting each question into a feature vector;
A question type learning apparatus comprising means for configuring a classifier that identifies a question type when an unknown question is input using a support vector machine from the feature vector.

A question type learning program for causing a computer to function as each means constituting the question type learning device according to claim 1.

A recording medium on which the question type learning program according to claim 2 is recorded.

A recording medium in which a feature sample obtained by the question type learning apparatus according to claim 1 and a learning sample composed of a pair of a class including a positive example and a negative example are recorded.

Means for extracting features that characterize the question type from unknown questions using the word attribute N-gram;
Means for converting the extracted features into feature vectors;
A question type identification device comprising means for identifying a question type using a classifier that identifies a question type when an unknown question is input using a support vector machine from the feature vector.

A question type identification program for causing a computer to function as each means constituting the question type identification device according to claim 5.

A recording medium on which the question type identification program according to claim 6 is recorded.