JP4935243B2

JP4935243B2 - Search program, information search device, and information search method

Info

Publication number: JP4935243B2
Application number: JP2006229048A
Authority: JP
Inventors: 哲朗 ▲高▼橋; 寛治内野; 青史岡本
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2006-08-25
Filing date: 2006-08-25
Publication date: 2012-05-23
Anticipated expiration: 2026-08-25
Also published as: JP2008052548A

Description

本発明は、ウェブページなどのテキストを対象とした検索プログラム、情報検索装置及び情報検索方法に関し、特にユーザが入力したクエリによる検索失敗時に候補クエリを自動生成して再検索可能とする検索プログラム、情報検索装置及び情報検索方法に関する。
The present invention relates to a search program, an information search device, and an information search method for text such as a web page, and in particular, a search program that automatically generates a candidate query and makes it possible to search again when a search fails due to a query input by a user, The present invention relates to an information search apparatus and an information search method.

従来、ウェブページなどの文書検索にあっては、ユーザは検索目的に合わせてクエリを作成して検索を行う対話型の情報検索が行われている。 Conventionally, when searching for a document such as a web page, an interactive information search is performed in which a user creates a query according to a search purpose and performs a search.

ここで、クエリとはデータベース管理システムに対しデータの抽出や更新といった問合せを文字列として表したものであり、例えば「今日の東京地方の天気はどうなりそう？」といった文字列であり、近年、ウェブページの検索にも利用されている。 Here, a query represents a query such as data extraction or update for a database management system as a character string. For example, a query such as “What is the weather in Tokyo today?” It is also used for searching web pages.

このようなクエリを使用したウェブページ等の情報検索によれば、１又は複数の単語をキーワードとして行う検索に比べ、ユーザの検索したい内容が適確に表現でき、ユーザが期待している内容に絞り込まれた検索結果が得られ、キーワード検索にように多数の検索結果の中から改めてユーザがページを開きながら希望する検索結果を探し出すという手間を解消することができる。

特開平１１−０５３３８２号公報特開２００４−１３９１５４号公報特開２００５−２５１０９２号公報 According to the information search such as a web page using such a query, the content that the user wants to search can be expressed more accurately than the search that uses one or more words as keywords, and the content that the user expects Narrowed search results are obtained, and it is possible to eliminate the trouble of the user searching for a desired search result while opening a page again from a large number of search results as in keyword search.

JP-A-11-053382 JP 2004-139154 A JP 2005-251092 A

しかしながら、このような従来のクエリを使用した対話型の情報検索にあっては、適切なクエリの作成にはスキルが必要であり、クエリが適切でない場合には検索結果の件数が非常に少ないか又は０件の検索結果になり、必要とするページにたどり着くために手間と時間がかかるという問題がある。 However, in such interactive information search using conventional queries, skill is required to create an appropriate query, and if the query is not appropriate, the number of search results is very small. Alternatively, there is a problem that it takes time and labor to reach the required page because there are no search results.

このように入力されたクエリでは１件も見付からない原因、所謂０件ヒットの原因は，その大部分はクエリが長すぎることが挙げられる。一般的な検索エンジンでは、入力されたクエリの文字列を含むページが検索結果として返されるが、例えば「今日の東京地方の天気はどうなりそう？」などのような長いクエリが入力された場合、その文字列と同じ文字列が含まれる文書がない場合に、０件ヒットとなる。 The reason why no query is found in the query input in this way, that is, the cause of the so-called zero hit is that the query is too long. In general search engines, pages containing the query string entered are returned as search results. For example, when a long query such as "What is the weather in Tokyo today?" When there is no document containing the same character string as that character string, 0 hits are found.

本発明は、ユーザが入力したクエリが適切でない場合に、適切な候補クエリをユーザに提示して検索のヒット件数をコントロールし、少ない手間と時間でユーザを適切なページへ導くことを可能とする検索プログラム、情報検索装置及び情報検索方法を提供する。 The present invention makes it possible to control the number of search hits by presenting an appropriate candidate query to the user when the query inputted by the user is not appropriate, and to guide the user to an appropriate page with less effort and time. A search program, an information search device, and an information search method are provided.

また本発明は、過去のクエリ変換履歴を元にユーザへの候補クエリの提示を行なうことにより、ユーザのクエリを作成するスキルを向上させる検索プログラム、情報検索装置及び情報検索方法を提供する。
The present invention also provides a search program, an information search device, and an information search method that improve the skill of creating a user query by presenting candidate queries to the user based on past query conversion history.

（プログラム）
本発明は検索プログラムを提供する。本発明の検索プログラムは、コンピュータに、
ユーザが指定したクエリを入力するクエリ入力ステップと、
クエリを使用して検索する検索ステップと、
検索ステップで検索に失敗した場合、前記クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索ステップで検索させる候補クエリ生成ステップと、
を実行させ、
更に、候補クエリ生成ステップは、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、前記設定した分解条件に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする。 (program)
The present invention provides a search program. The search program of the present invention is stored in a computer.
A query input step for entering a query specified by the user;
A search step to search using a query;
If the search fails in the search step, a candidate query generation step that presents to the user a plurality of candidate queries generated by decomposing the query and causes the search step to search using the query selected by the user;
And execute
Furthermore, the candidate query generation step sets in advance any one of extraction of nouns, extraction of only nouns and verbs, and extraction of noun phrases as a decomposition method of the pre-decomposition query, and pre-decomposition based on the set decomposition conditions. A combination of a phrase and a phrase extracted from the query is generated as a candidate query.

候補クエリ生成ステップは、検索ステップにより検索結果が１件も得られない０件ヒットの場合に検索失敗と判定して候補クエリ生成処理を実行する。 The candidate query generation step executes a candidate query generation process by determining that the search has failed when no search result is obtained by the search step and no hit is found.

本発明の検索プログラムは、コンピュータに、更に、
クエリ入力ステップで入力した分解前クエリと前記候補クエリ生成ステップでユーザが選択した分解後クエリの組について、各々の特徴量を抽出してクエリ分解知識データベースに登録するステップを実行させ、
候補クエリ生成ステップは、処理対象としている分解前クエリと分解後クエリの組について各々の特徴量を抽出し、知識データベースに登録されている分解前クエリと分解後クエリの組の各特徴量との類似度からスコアを算出して処理対象の分解後クエリに付与し、スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させる。 The search program of the present invention further includes a computer,
For the set of the pre-decomposition query input in the query input step and the post-decomposition query selected by the user in the candidate query generation step, the step of extracting each feature quantity and registering it in the query decomposition knowledge database is executed,
In the candidate query generation step, each feature amount is extracted for a set of pre-decomposition query and post-decomposition query to be processed, and each feature amount of the pre-decomposition query and post-decomposition query pair registered in the knowledge database is extracted. A score is calculated from the similarity and given to the post-decomposition query to be processed, and the post-decomposition query sorted in order of score is presented to the user as a candidate query for selection.

また本発明の検索プログラムは、コンピュータに、更に、
検索ステップで検索に成功した分解後クエリの特徴量を抽出してクエリ選考知識データベースに登録するステップを実行させ、
候補クエリ生成ステップは、処理対象としている分解後クエリの特徴量を抽出し、クエリ選考知識データベースに登録されている分解後クエリとの特徴量との類似度からスコアを算出して処理対象の分解後クエリに付与し、スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させる。 The search program of the present invention is further stored in a computer.
The feature of the post-decomposition query that was successfully searched in the search step is extracted and registered in the query selection knowledge database.
In the candidate query generation step, the feature quantity of the decomposed query to be processed is extracted, the score is calculated from the similarity to the feature quantity with the decomposed query registered in the query selection knowledge database, and the processing target is decomposed A post-decomposition query that is given to the post-query and sorted in the order of score is presented to the user as a candidate query for selection.

分解前クエリの特徴量は、形態素解析された構成要素、単語数、特定の意味をもつ内容語の１又は複数の組合せを含み、
分解後クエリの特徴量は、名詞のみの抽出、名詞と動詞のみの抽出又は名詞句の抽出の分解手法、抽出した単語の分解前クエリ上の位置、抽出した単語の位置、抽出した単語数を示す要素数、分解前クエリの単語数に対する分解後の要素数の割合、又は抽出した単語の意味を示す意味ラベルの１又は複数の組合せを含む。 The feature amount of the query before decomposition includes one or a plurality of combinations of the constituent element subjected to morphological analysis, the number of words, and the content word having a specific meaning,
The feature value of the query after decomposition is the decomposition method of extracting nouns only, extracting nouns and verbs only, or extracting noun phrases, the position of extracted words on the query before decomposition, the position of extracted words, the number of extracted words 1 or a combination of one or more semantic labels indicating the meaning of the extracted word, or the ratio of the number of elements after decomposition to the number of words in the query before decomposition.

（装置）
本発明は情報検索装置を提供する。本発明の情報検索装置は、
ユーザが指定したクエリを入力するクエリ入力部と、
クエリを使用して検索する検索部と、
検索部で検索に失敗した場合、クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索部に検索させる候補クエリ生成部と、
を備え、
更に、候補クエリ生成部は、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、設定した分解手法に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする。 (apparatus)
The present invention provides an information retrieval apparatus. The information retrieval apparatus of the present invention is
A query input part for inputting a query specified by the user;
A search part that searches using a query;
If the search unit fails to search, a candidate query generation unit that presents a plurality of candidate queries generated by disassembling the query and causes the search unit to search using a query selected by the user;
With
Further, the candidate query generation unit presets any of noun extraction, noun and verb extraction, or noun phrase extraction as a decomposition method of the pre-decomposition query, and the pre-decomposition query based on the set decomposition method A phrase extracted from the above and a combination of phrases are generated as candidate queries.

（方法）
本発明は情報検索方法を提供する。本発明の情報検索方法は、
ユーザが指定したクエリを入力するクエリ入力ステップと、
クエリを使用して検索する検索ステップと、
前記検索ステップで検索に失敗した場合、クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索ステップで検索させる候補クエリ生成ステップと、
を備え、
更に、候補クエリ生成ステップは、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、設定した分解条件に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする。
(Method)
The present invention provides an information retrieval method. The information retrieval method of the present invention is
A query input step for entering a query specified by the user;
A search step to search using a query;
If the search fails in the search step, a candidate query generation step of presenting a plurality of candidate queries generated by decomposing the query to the user and causing the search step to search using the query selected by the user;
With
In addition, the candidate query generation step presets any of noun extraction, noun and verb extraction, and noun phrase extraction as a decomposition method of the pre-decomposition query, and the pre-decomposition query based on the set decomposition conditions A phrase extracted from the above and a combination of phrases are generated as candidate queries.

本発明によれば、検索結果が０件ヒットとなるユーザが入力したクエリが適切でない場合に、ユーザが入力したクエリを例えば名詞と動詞のみを抽出するという分解手法に基づいて分解し、分解した単語またはその組合せを候補クエリとして生成してユーザに提示し、ユーザが選択した候補クエリによる再検索により、少ない手間と時間（少ないコスト）で例えばユーザを適切なページへ導くような検索結果を得ることができる。 According to the present invention, when a query input by a user whose search result is 0 hits is not appropriate, the query input by the user is decomposed and decomposed based on, for example, a decomposition method of extracting only nouns and verbs. A word or a combination thereof is generated as a candidate query and presented to the user, and a search result that leads the user to an appropriate page, for example, is obtained with less effort and time (less cost) by re-searching with the candidate query selected by the user. be able to.

また過去のユーザによる選択した分解前クエリと分解後クエリの組合せや、検索に成功した分解後クエリの履歴を知識データベースとして保存し、新たに候補クエリを生成する際に、知識データベースの履歴との類似度に応じたコストを候補クエリに付与してユーザに提示することで、候補クエリをコストにより評価可能とし、コストの高い候補クエリを選択する過程を通じた学習効果により、ユーザのクエリを作成するスキルを向上させることができる。
Also, the combination of the pre-decomposition query and post-decomposition query selected by the past user and the history of the post-decomposition query that was successfully searched are saved as a knowledge database, and when a new candidate query is generated, By assigning the cost according to the similarity to the candidate query and presenting it to the user, the candidate query can be evaluated by the cost, and the user's query is created by the learning effect through the process of selecting the high-cost candidate query. Skills can be improved.

図１は本発明による情報検索装置の実施形態を示した機能構成のブロック図である。図１において、本実施形態の情報検索装置は、クエリ入力部１０、検索制御部１２、検索部として機能する検索エンジン１４、検索結果表示部１８、候補クエリ処理部２０、分解候補表示指定部２２、知識記録部２４、クエリ分解知識データベース３０及びクエリ選考知識データベース３２を備えている。 FIG. 1 is a block diagram of a functional configuration showing an embodiment of an information retrieval apparatus according to the present invention. In FIG. 1, the information search apparatus of the present embodiment includes a query input unit 10, a search control unit 12, a search engine 14 that functions as a search unit, a search result display unit 18, a candidate query processing unit 20, and a decomposition candidate display designation unit 22. A knowledge recording unit 24, a query decomposition knowledge database 30, and a query selection knowledge database 32.

また候補クエリ処理部２０には、分解候補生成部３４、スコア付与部３６及び分解候補提示部３８の機能が設けられている。また知識記録部２４にはクエリ分解知識記録部２６とクエリ選考知識記録部２８が設けられている。 Further, the candidate query processing unit 20 is provided with functions of a decomposition candidate generation unit 34, a score assignment unit 36, and a decomposition candidate presentation unit 38. The knowledge recording unit 24 includes a query decomposition knowledge recording unit 26 and a query selection knowledge recording unit 28.

クエリ分解知識記録部２６はクエリ分解知識データベース３０に対し登録処理を行うもので、分解前クエリ特徴抽出部４０、検索成功クエリ特徴抽出部４２及びデータベース登録部４４を備えている。クエリ選考知識記録部２８はクエリ選考知識データベース３２に対する登録を行うもので、検索成功クエリ特徴抽出部４６及びデータベース登録部４８を備えている。 The query decomposition knowledge recording unit 26 performs registration processing for the query decomposition knowledge database 30 and includes a pre-decomposition query feature extraction unit 40, a search success query feature extraction unit 42, and a database registration unit 44. The query selection knowledge recording unit 28 performs registration with respect to the query selection knowledge database 32 and includes a search success query feature extraction unit 46 and a database registration unit 48.

クエリ入力部１０はユーザが指定した検索のための文字列であるクエリを入力する。検索制御部１２はクエリ入力部１０から入力されたクエリを検索エンジン１４に出力し、これを受けて検索エンジン１４がインターネット１６上に存在するウェブページの検索を行い、検索結果を検索結果表示部１８に表示することになる。 The query input unit 10 inputs a query that is a character string for search specified by the user. The search control unit 12 outputs the query input from the query input unit 10 to the search engine 14, and in response to this, the search engine 14 searches a web page existing on the Internet 16, and the search result is displayed as a search result display unit. 18 will be displayed.

検索制御部１２は、クエリ入力部１０からの入力クエリを使用した検索エンジン１４による検索結果が失敗した場合、本実施形態にあっては検索結果が１件も得られずに０件ヒットとなった場合には候補クエリ処理部２０を動作する。 When the search result by the search engine 14 using the input query from the query input unit 10 fails, the search control unit 12 hits 0 hits without obtaining any search results in this embodiment. If it does, the candidate query processing unit 20 operates.

候補クエリ処理部２０は、検索エンジン１４で０件ヒットとなった場合に、クエリ入力部１０から入力されたクエリを分解して複数の候補クエリを生成し、この候補クエリを分解候補表示指定部２２に表示し、ユーザが選択したクエリを使用して検索エンジン１４に検索を行わせる。 The candidate query processing unit 20 generates a plurality of candidate queries by decomposing the query input from the query input unit 10 when the search engine 14 hits 0 hits, and the candidate query is displayed as a decomposition candidate display designation unit. The search engine 14 performs a search using the query selected by the user and selected by the user.

候補クエリ処理部２０による候補クエリの生成は、分解前のクエリ（入力クエリ）の分解手法として例えば次の手法を設定している。
（１）名詞のみ抽出
（２）名詞と動詞のみ抽出
（３）名詞句の抽出 The candidate query generation by the candidate query processing unit 20 sets, for example, the following method as a decomposition method of the query (input query) before decomposition.
(1) Extract nouns only (2) Extract nouns and verbs only (3) Extract noun phrases

このような分解手法につき、いずれかの分解手法を予め設定しておくことで、設定した分解手法に基づいて分解前クエリから語句を抽出し、抽出した語句及び語句の組合せを候補クエリとして生成する。この処理は分解候補生成部３４により行われる。 By setting one of the decomposition methods in advance for such a decomposition method, a phrase is extracted from the pre-decomposition query based on the set decomposition method, and a combination of the extracted word and phrase is generated as a candidate query. . This process is performed by the decomposition candidate generation unit 34.

スコア付与部３６は、クエリ分解知識データベース３０またはクエリ選考知識データベース３２に保存されている履歴知識を利用して、分解候補生成部３４で生成した各分解候補にスコアを付与する。分解候補提示部３８は、スコアが付与された分解候補をスコア順にソートした後に、分解候補表示指定部２２に表示し、ユーザの指定を受ける。 The score assigning unit 36 assigns a score to each decomposition candidate generated by the decomposition candidate generating unit 34 using the history knowledge stored in the query decomposition knowledge database 30 or the query selection knowledge database 32. The decomposition candidate presentation unit 38 sorts the decomposition candidates to which scores have been assigned in the order of the scores, and then displays them on the decomposition candidate display designation unit 22 to receive user designation.

ここでクエリ分解知識データベース３０には、クエリ入力部１０で入力した分解前クエリと、候補クエリ処理部２０で生成してユーザが選択した分解後クエリの組（ペア）について、各々の特徴量を抽出してクエリ分解知識として登録されている。このクエリ分解知識データベース３０に対する登録は、知識記録部２４に設けたクエリ分解知識記録部２６により行われる。 Here, in the query decomposition knowledge database 30, each feature amount is set for a pair (pair) of the query before decomposition input by the query input unit 10 and the query generated by the candidate query processing unit 20 and selected by the user. Extracted and registered as query decomposition knowledge. Registration with the query decomposition knowledge database 30 is performed by a query decomposition knowledge recording unit 26 provided in the knowledge recording unit 24.

一方、クエリ選考知識データベース３２には、検索エンジン１４による検索で検索に成功した分解後クエリの特徴量を抽出してクエリ選考知識として登録している。このクエリ選考知識データベース３２の登録は、知識記録部２４に設けたクエリ選考知識記録部２８により行われる。 On the other hand, in the query selection knowledge database 32, the feature amount of the decomposed query that has been successfully searched by the search by the search engine 14 is extracted and registered as query selection knowledge. The registration of the query selection knowledge database 32 is performed by the query selection knowledge recording unit 28 provided in the knowledge recording unit 24.

クエリ分解知識データベース３０を利用したスコア付与部３６によるスコアの付与は、現在処理対象としている分解前のクエリと分解後のクエリの組につき各々の特徴量を抽出し、クエリ分解知識データベース３０に登録されている分解前クエリと分解後クエリの各特徴量との類似度からスコアを算出して、分解候補である分解後クエリに付与する。 The score assignment by the score assigning unit 36 using the query decomposition knowledge database 30 is performed by extracting each feature amount for a set of a query before decomposition and a query after decomposition that are currently processed and registering them in the query decomposition knowledge database 30. A score is calculated from the similarity between the pre-decomposition query and the feature values of the post-decomposition query, and is given to the post-decomposition query that is a decomposition candidate.

またスコア付与部３６によるクエリ選考知識データベース３２を利用したスコア付与にあっては、処理対象としている検索に成功した分解後クエリの特徴量を抽出し、クエリ選考知識データベース３２に登録されている分解後クエリの特徴量との類似度からスコアを計算して付与する。 In addition, in the score assignment using the query selection knowledge database 32 by the score assignment unit 36, the feature amount of the decomposed query that has succeeded in the search to be processed is extracted, and the decomposition registered in the query selection knowledge database 32 A score is calculated and given from the similarity to the feature value of the post-query.

図２は図１の実施形態の機能を実現する本実施形態の検索プログラムが実行されるコンピュータのハードウェア環境のブロック図である。図２において、ＣＰＵ５０のバス５２に対しては、ＲＡＭ５４、ＲＯＭ５５、ハードディスクドライブ５６、キーボード６０，マウス６２及びディスプレイ６４を接続するデバイスインタフェース５８、更にインターネットと接続されるネットワークアダプタ６６が接続されている。 FIG. 2 is a block diagram of a hardware environment of a computer in which the search program of this embodiment that implements the functions of the embodiment of FIG. 1 is executed. In FIG. 2, a RAM 54, a ROM 55, a hard disk drive 56, a keyboard 60, a mouse 62 and a device interface 58 for connecting a display 64 and a network adapter 66 connected to the Internet are connected to the bus 52 of the CPU 50. .

本実施形態による検索プログラムはハードディスクドライブ５６に格納されており、コンピュータを起動すると、ＢＩＯＳの動作により、ハードディスクドライブ５６からブートプログラムがＲＡＭ５４に読み出されて実行されることで起動し、ハードディスクドライブ５６からＲＡＭ５４にＯＳを読出し配置して実行した後に、ハードディスクドライブ５６から本実施形態の検索プログラムをＲＯＭ５４に読出し配置して、ＣＰＵ５０により実行する。 The search program according to the present embodiment is stored in the hard disk drive 56. When the computer is started, the boot program is read from the hard disk drive 56 to the RAM 54 and executed by the operation of the BIOS. After the OS is read from the RAM 54 to the RAM 54 and executed, the search program of the present embodiment is read from the hard disk drive 56 to the ROM 54 and executed by the CPU 50.

図３は本実施形態による情報検索処理のフローチャートである。図３において、図１の実施形態を参照して処理を説明すると次のようになる。 FIG. 3 is a flowchart of the information search process according to this embodiment. In FIG. 3, the process will be described with reference to the embodiment of FIG.

まずステップＳ１でユーザが作成した入力クエリをクエリ入力部１０から読み込み、ステップＳ２で検索エンジン１４により入力クエリを使用したインターネット１６上のウェブページの検索を行う。 First, in step S1, an input query created by the user is read from the query input unit 10, and in step S2, a search engine 14 searches a web page on the Internet 16 using the input query.

この検索結果につき、ステップＳ３で０件ヒットか否かを判定する。０件ヒットでなければステップＳ９に進み、検索結果を検索結果表示部１８に出力した後、ステップＳ１０で知識記録部２４に設けているクエリ選考知識記録部２８によりクエリ選考知識記録処理をクエリ選考知識データベース３２に対し行う。 For this search result, it is determined in step S3 whether or not there are zero hits. If it is not 0 hits, the process proceeds to step S9, the search result is output to the search result display unit 18, and the query selection knowledge recording process provided in the knowledge recording unit 24 in step S10 is used for query selection knowledge recording processing. This is performed on the knowledge database 32.

一方、ステップＳ３で検索に失敗して０件ヒットが判別された場合には、ステップＳ４に進み、候補クエリ処理部２０による候補クエリ生成処理を行い、分解候補表示指定部２２に候補クエリを表示する。 On the other hand, if the search fails in step S3 and 0 hits are determined, the process proceeds to step S4, the candidate query generation process is performed by the candidate query processing unit 20, and the candidate query is displayed on the decomposition candidate display designating unit 22. To do.

続いてステップＳ５でユーザの分解候補選択を判別すると、ステップＳ６に進み、ユーザが選択した分解クエリである候補クエリを使用した検索を検索エンジン１４で行い、ステップＳ７で検索結果を検索結果表示部１８に出力する。続いてステップＳ８で、知識記録部２４に設けているクエリ分解知識記録部２６によりクエリ分解知識記録処理を実行し、クエリ分解知識データベース３０に処理結果を登録する。 Subsequently, when the user's decomposition candidate selection is determined in step S5, the process proceeds to step S6, and a search using the candidate query that is the decomposition query selected by the user is performed by the search engine 14, and the search result is displayed in the search result display unit in step S7. 18 is output. Subsequently, in step S8, the query decomposition knowledge recording unit 26 provided in the knowledge recording unit 24 executes query decomposition knowledge recording processing, and registers the processing result in the query decomposition knowledge database 30.

図４は図３のステップＳ８のクエリ分解知識記録処理の詳細を示したフローチャートである。図４において、クエリ分解知識記録処理は、ステップＳ１で現在処理対象となっている分解前クエリ（入力クエリ）の特徴量を抽出し、次にステップＳ２でユーザが選択した分解クエリの特徴量を抽出し、ステップＳ３で分解前クエリの特徴量と分解後クエリの特徴量のペアをクエリ分解知識としてクエリ分解知識データベース３０に登録する。 FIG. 4 is a flowchart showing details of the query decomposition knowledge recording process in step S8 of FIG. In FIG. 4, in the query decomposition knowledge recording process, the feature amount of the pre-decomposition query (input query) that is currently processed in step S1 is extracted, and the feature amount of the decomposition query selected by the user in step S2 is then extracted. In step S3, a pair of the feature quantity of the query before decomposition and the feature quantity of the query after decomposition is registered in the query decomposition knowledge database 30 as query decomposition knowledge.

図５は図４に対応した図１のクエリ分解知識記録部２６の具体的な処理動作を示した説明図である。図５にあっては、ユーザが作成して入力した分解前クエリ６８として「今日の東京地方の天気はどうなりそう？」を処理対象としており、この分解前クエリ６８からユーザが分解クエリとして「今日＋東京地方＋天気」を選択して検索を行って検索に成功し、これが検索成功クエリ７０として得られている。 FIG. 5 is an explanatory diagram showing a specific processing operation of the query decomposition knowledge recording unit 26 of FIG. 1 corresponding to FIG. In FIG. 5, “What will happen to the weather in Tokyo today?” Is processed as a pre-decomposition query 68 created and input by the user. A search is performed by selecting “today + Tokyo region + weather” and the search is successful, and this is obtained as a search success query 70.

クエリ分解知識記録部２６にあっては、まず分解前クエリ特徴抽出部４０により分解前クエリ６８の特徴量７２を抽出する。分解前クエリ６８の特徴量７２としては、その文字列に含んでいる要素を形態素解析により分類し、それぞれの品詞を抽出する。 In the query decomposition knowledge recording unit 26, first, the feature amount 72 of the pre-decomposition query 68 is extracted by the pre-decomposition query feature extraction unit 40. As the feature amount 72 of the query 68 before decomposition, elements included in the character string are classified by morphological analysis, and each part of speech is extracted.

また分解前クエリ６８の文字列を構成する「単語数」を抽出しており、この場合には「単語数：９」となっている。また入力前クエリ６８に含まれる特定の意味を持つ内容語につき「内容語数」を抽出しており、この場合には「内容語数：３」となっている。更に、分解前クエリ６８の文末のフレーズとして「述語」を抽出している。 Further, the “number of words” constituting the character string of the query 68 before decomposition is extracted, and in this case, “number of words: 9”. Further, the “number of content words” is extracted for the content words having a specific meaning included in the pre-input query 68, and in this case, “number of content words: 3”. Further, “predicate” is extracted as a phrase at the end of the sentence of the query 68 before decomposition.

次に検索成功クエリ特徴量抽出部４２において、検索成功クエリ７０から特徴量７４を抽出する。検索成功クエリ７０における特徴量７４の抽出は、検索成功クエリ７０の分解手法として、この例では「名詞のみ」、分解前クエリ６８における検索成功クエリ７０の各単語の位置、この実施形態にあっては「今日」は１番目、「東京地方」は３番目、「天気」は５番目であることから、「位置（単語）：１，３，５」を抽出している。 Next, the search success query feature quantity extraction unit 42 extracts the feature quantity 74 from the search success query 70. Extraction of the feature quantity 74 in the search success query 70 is, as a decomposition method of the search success query 70, “noun only” in this example, the position of each word of the search success query 70 in the pre-decomposition query 68, in this embodiment. Since “Today” is the first, “Tokyo region” is the third, and “weather” is the fifth, “position (word): 1, 3, 5” is extracted.

また検索成功クエリ７０における各単語の位置として「位置（内容語）：１，２，３」を抽出している。また検索成功クエリ７０の単語数として「単語数：３」を抽出している。また検索成功クエリ７０における要素数の割合（要素数／単語数）として「３／９」を抽出している。更に検索成功クエリ７０における単語の意味ラベルとして、この例では「東京地方」に対応して「ＬＯＣＡＴＩＯＮ」を抽出している。 Further, “position (content word): 1, 2, 3” is extracted as the position of each word in the search success query 70. Further, “word count: 3” is extracted as the word count of the search success query 70. Further, “3/9” is extracted as the ratio of the number of elements in the search success query 70 (number of elements / number of words). Further, in this example, “LOCATION” is extracted corresponding to “Tokyo region” as the meaning label of the word in the search success query 70.

図５に示す特徴量７２，７４の抽出は一例であり、これ以外にも必要に応じて、分解前クエリ６８及び検索成功クエリ７０における各文字列に関する適宜の特徴を抽出することができる。 Extraction of the feature amounts 72 and 74 shown in FIG. 5 is an example, and other appropriate features regarding the character strings in the pre-decomposition query 68 and the search success query 70 can be extracted as necessary.

図６は図３のステップＳ１０のクエリ選考知識記録処理の詳細を示したフローチャートである。図６において、クエリ選考知識記録処理部２８は検索成功となった状態で、まずステップＳ１で検索成功クエリが分解後クエリか否かチェックする。分解後クエリであった場合にはステップＳ２に進み、分解後クエリの特徴量の抽出を実行し、ステップＳ３で抽出した特徴量をクエリ選考知識データベース３２に登録する。一方、ステップＳ１で検索成功クエリが分解前クエリ、即ち入力クエリであった場合には、ステップＳ２，Ｓ３の特徴量抽出と登録はスキップし、クエリ選考知識の登録は行わない。 FIG. 6 is a flowchart showing details of the query selection knowledge recording process in step S10 of FIG. In FIG. 6, the query selection knowledge recording processing unit 28 first checks whether or not the search success query is a post-decomposition query in step S1 in a state where the search is successful. If the query is a post-decomposition query, the process proceeds to step S2, the feature quantity of the post-decomposition query is extracted, and the feature quantity extracted in step S3 is registered in the query selection knowledge database 32. On the other hand, if the search success query is a pre-decomposition query, that is, an input query in step S1, the feature amount extraction and registration in steps S2 and S3 are skipped, and the query selection knowledge is not registered.

図７は図６に対応した図１のクエリ選考知識記録部２８による具体的な処理動作を示した説明図である。図７にあっては、検索成功クエリ７０として分解後クエリである「今日＋東京地方＋天気」を対象としており、これを検索成功クエリ特徴抽出部４６で特徴量の抽出を行い、特徴量７４を取得する。この特徴量７４の内容は、図５の検索成功クエリ特徴抽出部４２で抽出した特徴量７４と同じである。続いて、データベース登録部４８が特徴量７４をクエリ選考知識データベース３２に登録する。 FIG. 7 is an explanatory diagram showing a specific processing operation by the query selection knowledge recording unit 28 of FIG. 1 corresponding to FIG. In FIG. 7, “Today + Tokyo region + weather”, which is a post-decomposition query, is targeted as the search success query 70, and the feature quantity is extracted by the search success query feature extraction unit 46. To get. The content of the feature quantity 74 is the same as the feature quantity 74 extracted by the search success query feature extraction unit 42 of FIG. Subsequently, the database registration unit 48 registers the feature quantity 74 in the query selection knowledge database 32.

図８は図３のステップＳ４の候補クエリ生成処理の詳細を示したフローチャートである。図８において、候補クエリ生成処理は、まずステップＳ１で予め設定している候補クエリの分解手法を取得する。この分解手法は
（１）名詞抽出のみ
（２）名詞及び動詞抽出のみ
（３）名詞句抽出のみ
のいずれかである。続いてステップＳ２に進み、分解候補生成部３４によりステップＳ１で取得した分解手法を使用して候補クエリを分解して、単語及び単語の組合せからなる分解候補を生成する。次にステップＳ３で分解前クエリと分解候補の各特徴量を抽出し、ステップＳ４で分解前クエリと分解後クエリのペアごとにクエリ分解知識データベース３０の保存知識との類似度に基づくスコアを計算する。 FIG. 8 is a flowchart showing details of the candidate query generation process in step S4 of FIG. In FIG. 8, in the candidate query generation process, first, a candidate query decomposition method preset in step S1 is acquired. This decomposition method is either (1) noun extraction only, (2) noun and verb extraction only, or (3) noun phrase extraction only. Then, it progresses to step S2 and decomposes | disassembles a candidate query using the decomposition | disassembly method acquired by step S1 by the decomposition | disassembly candidate production | generation part 34, and the decomposition | disassembly candidate which consists of a word and the combination of a word is produced | generated. Next, in step S3, the feature quantities of the pre-decomposition query and the decomposition candidate are extracted, and in step S4, a score based on the similarity between the stored knowledge in the query decomposition knowledge database 30 is calculated for each pair of the pre-decomposition query and the post-decomposition query. To do.

このステップＳ３の特徴量抽出とステップＳ４のスコア計算は、スコア付与部３６により行われる。最終的に、ステップＳ５で分解候補提示部３８がスコア順にソートした後に分解候補をユーザに提示することになる。 The feature amount extraction in step S3 and the score calculation in step S4 are performed by the score assigning unit 36. Finally, after the decomposition candidate presentation unit 38 sorts in order of score in step S5, the decomposition candidates are presented to the user.

図９は図８に対応した図１の候補クエリ処理部２０の具体的な処理動作を示した説明図である。図９において、分解前クエリ７６として「今日の川崎の天気は？」が分解候補生成部３４に入力されている。分解候補生成部３４は、そのとき設定されている分解手法に基づき、例えば「今日」、「川崎」、「天気」、「今日の川崎」、川崎の天気」、「今日＋川崎」、「今日＋川崎＋天気」などの分解候補を生成した分解候補一覧７８を生成する。 FIG. 9 is an explanatory diagram showing a specific processing operation of the candidate query processing unit 20 of FIG. 1 corresponding to FIG. In FIG. 9, “What is the weather in Kawasaki today?” Is input to the decomposition candidate generation unit 34 as the pre-decomposition query 76. For example, “Today”, “Kawasaki”, “Weather”, “Today's Kawasaki”, “Today's Kawasaki”, “Today + Kawasaki”, “Today” A decomposition candidate list 78 that generates decomposition candidates such as “+ Kawasaki + weather” is generated.

ここで分解候補一覧７８の「川崎の天気」まではスコア付与部３６によるスコア計算で求めたスコアがカッコ内の数値で示すように既に付与されている。 Here, up to “Kawasaki weather” in the decomposition candidate list 78, the score obtained by the score calculation by the score assigning unit 36 is already given as indicated by the numerical value in parentheses.

次に分解候補８０についてスコアを計算する場合を説明すると次のようになる。まず分解前クエリ７６について特徴量７６−１を抽出する。これは最初の分解候補「今日」の際に予め行われていることから、それを使用する。 Next, the case where the score is calculated for the decomposition candidate 80 will be described as follows. First, the feature value 76-1 is extracted from the pre-decomposition query 76. Since this is performed in advance at the time of the first decomposition candidate “today”, it is used.

一方、現在処理対象となっている分解候補８０である「今日＋川崎」につき特徴量８０−１を抽出する。この特徴量８０−１の抽出処理は、図５に示したクエリ分解知識記録部２６に設けた分解前クエリ特徴抽出部４０及び検索成功クエリ特徴抽出部４２による処理と同じになる。 On the other hand, the feature quantity 80-1 is extracted for “today + kawasaki”, which is the decomposition candidate 80 currently being processed. The extraction process of the feature amount 80-1 is the same as the process performed by the pre-decomposition query feature extraction unit 40 and the search success query feature extraction unit 42 provided in the query decomposition knowledge recording unit 26 shown in FIG.

このようにして分解前クエリ７６の特徴量７６−１と現在処理対象としている分解候補８０の特徴量８０−１が得られたならば、特徴量７６−１と特徴量８０−１をペアとし、クエリ分解知識データベース３０に、このとき登録されている保存知識８２−１〜８２−３のそれぞれとの類似度を計算する。 When the feature amount 76-1 of the pre-decomposition query 76 and the feature amount 80-1 of the decomposition candidate 80 currently being processed are obtained in this way, the feature amount 76-1 and the feature amount 80-1 are paired. The similarity with each of the stored knowledge 82-1 to 82-3 registered at this time in the query decomposition knowledge database 30 is calculated.

この例では、保存知識８２−１に含まれる分解前クエリ特徴量８４と分解後クエリ特徴量８６のペアとの間で特徴量７６−１と特徴量８０−１の類似度ｓｉｍ１，ｓｉｍ２をそれぞれ計算し、両者の乗算結果として、保存知識８２−１との間のスコアを求める。このようなスコア計算を残りの保存知識８２−２，８２−３との間で行って、その総和をスコアとして求め、分解候補８０に算出したスコアを付与する。 In this example, the similarities sim1 and sim2 of the feature quantity 76-1 and the feature quantity 80-1 are respectively determined between the pair of the pre-decomposition query feature quantity 84 and the post-decomposition query feature quantity 86 included in the stored knowledge 82-1. A score between the saved knowledge 82-1 is obtained as a result of multiplication between the two. Such score calculation is performed between the remaining stored knowledge 82-2 and 82-3, the sum is obtained as a score, and the calculated score is given to the decomposition candidate 80.

スコア付与部３６における類似度計算８８は次式で行われる。
The similarity calculation 88 in the score assigning unit 36 is performed by the following equation.

スコア付与部３６による分解候補一覧７８のそれぞれに対するスコア付与が済むと、分解候補一覧７８をスコア順にソートした後、分解候補一覧７８を分解候補表示指定部２２に表示し、ユーザによる分解候補の選択指定を行わせることになる。 When the score assignment unit 36 finishes assigning scores to each of the decomposition candidate lists 78, the decomposition candidate list 78 is sorted in the order of score, and then the decomposition candidate list 78 is displayed on the decomposition candidate display designating unit 22 and the user selects a decomposition candidate. You will be asked to specify.

図１０は図３のステップＳ４の候補クエリ生成処理の他の実施形態を示したフローチャートであり、この実施形態にあっては、図１のクエリ選考知識データベース３２を使用して、分解候補につきスコアを付与するようにしたことを特徴とする。 FIG. 10 is a flowchart showing another embodiment of the candidate query generation process in step S4 of FIG. 3. In this embodiment, the query selection knowledge database 32 of FIG. It is characterized by providing the above.

図１０において、候補クエリ生成処理のステップＳ１，Ｓ２，Ｓ３については、図８のクエリ分解知識データベース３０を使用する場合と同じであるが、ステップＳ４にあっては分解候補ごとにクエリ選考知識データベース３２の保存知識との類似度に基づくスコアを計算して付与しており、ステップＳ５でスコア順にソートしてユーザに分解候補を提示して選択させることになる。 10, steps S1, S2, and S3 of the candidate query generation process are the same as when the query decomposition knowledge database 30 of FIG. 8 is used, but in step S4, the query selection knowledge database for each decomposition candidate. The score based on the similarity to the stored knowledge of 32 is calculated and assigned, and in step S5, the score is sorted in the order of the score and the user is presented with the decomposition candidates to be selected.

図１１は図１０に対応した図１の候補クエリ処理部２０による他の実施形態の具体的動作である。図１１において、分解候補生成部３４は図９の場合と同様、分解前クエリ７６に設定された分解手法を適用して分解候補一覧７８を生成している。分解候補一覧７８にあっては、４行目の「川崎の天気」まではスコアの付与が終了しており、次の分解候補８０である「今日＋川崎」のスコア計算を例にとっている。 FIG. 11 shows a specific operation of another embodiment by the candidate query processing unit 20 of FIG. 1 corresponding to FIG. In FIG. 11, as in the case of FIG. 9, the decomposition candidate generation unit 34 generates a decomposition candidate list 78 by applying the decomposition method set in the pre-decomposition query 76. In the decomposition candidate list 78, score assignment has been completed up to "Kawasaki weather" on the fourth line, and the score calculation of "Today + Kawasaki", which is the next decomposition candidate 80, is taken as an example.

このスコア計算はスコア付与部３６で行われる。スコア付与部３６は、対象とする分解候補８０から特徴量８０−１を抽出し、クエリ選考知識データベース３２に登録されている保存知識９０−１〜９０−３のそれぞれとの間で類似度を計算して、その総和をスコアとして求める。 This score calculation is performed by the score assigning unit 36. The score assigning unit 36 extracts the feature value 80-1 from the target decomposition candidate 80, and calculates the similarity with each of the stored knowledge 90-1 to 90-3 registered in the query selection knowledge database 32. Calculate and calculate the sum as a score.

例えば保存知識９０−１については検索成功クエリ特徴量９２が保存されており、これと分解候補８０の特徴量８０−１との間で類似度計算９４を行ってスコアｓｉｍ３を求める。そして、このような類似度計算を残りの保存知識９０−２，９０−３との間で行って、その総和をスコアとする。この場合の類似度計算９４は次式で行われる。 For example, for the saved knowledge 90-1, the search success query feature value 92 is stored, and the similarity calculation 94 is performed between this and the feature value 80-1 of the decomposition candidate 80 to obtain the score sim3. Then, such similarity calculation is performed with the remaining stored knowledge 90-2 and 90-3, and the sum is used as a score. The similarity calculation 94 in this case is performed by the following equation.

図１２は図１のクエリ分解知識記録部２６による処理動作の他の具体例を示した説明図である。図１２において、この具体例にあっては、分解前クエリ９６が「立川にある災害医療センターの電話番号」であり、検索成功クエリ９８が「立川＋災害医療センター＋電話番号」の場合であり、この分解前クエリ９６と検索成功クエリ９８をクエリ分解知識記録部２６に入力して処理する。 FIG. 12 is an explanatory diagram showing another specific example of the processing operation by the query decomposition knowledge recording unit 26 of FIG. In FIG. 12, in this specific example, the query 96 before decomposition is “the telephone number of the disaster medical center in Tachikawa”, and the search success query 98 is “Tachikawa + disaster medical center + phone number”. The pre-decomposition query 96 and the search success query 98 are input to the query decomposition knowledge recording unit 26 for processing.

まず分解前クエリ特徴抽出部４０は、分解前クエリ９６から特徴量１００を抽出している。続いて検索成功クエリ特徴抽出部４２が検索成功クエリ９８から特徴量１０２を抽出している。このようにして抽出した特徴量１００，１０２をペアとして、データベース登録部４４がクエリ分解知識データベース３０にクエリ分解知識として登録することになる。 First, the pre-decomposition query feature extraction unit 40 extracts the feature quantity 100 from the pre-decomposition query 96. Subsequently, the search success query feature extraction unit 42 extracts the feature quantity 102 from the search success query 98. The database registration unit 44 registers the feature quantities 100 and 102 extracted in this way as a pair in the query decomposition knowledge database 30 as query decomposition knowledge.

図１３は図１のクエリ選考知識記録部２８による処理動作の他の具体例を示した説明図である。図１３にあっては、検索成功クエリ１０４として「立川＋災害医療センター＋電話番号」を対象としており、この検索成功クエリ１０４を検索成功クエリ特徴抽出部４６で処理して特徴量１０２を抽出する。 FIG. 13 is an explanatory diagram showing another specific example of the processing operation by the query selection knowledge recording unit 28 of FIG. In FIG. 13, “Tachikawa + disaster medical center + phone number” is targeted as the search success query 104, and the search success query feature extraction unit 46 processes this search success query 104 to extract the feature quantity 102. .

この特徴量１０２は図１２の検索成功クエリ特徴抽出部４６による特徴量と同じである。続いてデータベース登録部４８が、抽出した特徴量１０２をクエリ選考知識データベース３２に登録する。 This feature quantity 102 is the same as the feature quantity by the search success query feature extraction unit 46 of FIG. Subsequently, the database registration unit 48 registers the extracted feature quantity 102 in the query selection knowledge database 32.

図１４は図１の候補クエリ処理部２０による処理動作の他の具体例を示した説明図である。図１４にあっては、分解前クエリ９６として「立川にある災害医療センターの電話番号」を対象としており、まず分解候補生成部３４で分解候補一覧１０５を生成する。 FIG. 14 is an explanatory diagram showing another specific example of the processing operation by the candidate query processing unit 20 of FIG. In FIG. 14, “disaster medical center telephone number in Tachikawa” is targeted as the pre-decomposition query 96, and the decomposition candidate list 105 is first generated by the decomposition candidate generation unit 34.

続いてスコア付与部３６が分解前クエリ９６から特徴量９６−１を抽出し、また処理対象としている分解候補１０６である「立川＋電話番号」から特徴量１０６−１を抽出する。続いて、クエリ分解知識データベース３０に登録されている保存知識１０８−１，１０８−２，１０８−３のそれぞれの保存ペア、例えば保存知識１０８−１にあっては分解前クエリ特徴量１１０と分解後クエリ特徴量１１１との間で類似度計算１１４により類似度ｓｉｍ１，ｓｉｍ２を計算して乗算し、これを全ての保存知識につき繰り返す。 Subsequently, the score assigning unit 36 extracts the feature quantity 96-1 from the pre-decomposition query 96, and also extracts the feature quantity 106-1 from “Tachikawa + phone number” which is the decomposition candidate 106 to be processed. Subsequently, each saved pair of saved knowledge 108-1, 108-2, 108-3 registered in the query decomposition knowledge database 30, for example, in the saved knowledge 108-1, is decomposed with the query feature quantity 110 before decomposition. Similarities sim1 and sim2 are calculated and multiplied by the similarity calculation 114 with the post-query feature quantity 111, and this is repeated for all stored knowledge.

具体的には前記（１）式に示した類似度計算を行ってスコアを求め、分解候補１０６に付与する。全ての分解候補につきスコア計算が終了したならば、分解候補提示部３８がスコア順にソートして分解候補表示指定部２２に提示してユーザの指定を受けることになる。 Specifically, the score is obtained by performing the similarity calculation shown in the equation (1) and given to the decomposition candidate 106. If the score calculation is completed for all the decomposition candidates, the decomposition candidate presenting unit 38 sorts them in the order of the scores and presents them to the decomposition candidate display designating unit 22 to receive the user's designation.

図１５は図１の候補クエリ処理部２０の他の実施形態による処理動作の他の具体例を示した説明図である。図１５にあっては、クエリ選考知識データベース３２を利用したスコア計算であり、分解前クエリ９６に対し分解候補生成部３４で分解候補一覧１０５を生成する点は図１４と同じである。 FIG. 15 is an explanatory diagram showing another specific example of the processing operation according to another embodiment of the candidate query processing unit 20 of FIG. In FIG. 15, score calculation is performed using the query selection knowledge database 32, and the decomposition candidate list 105 is generated by the decomposition candidate generation unit 34 for the pre-decomposition query 96 as in FIG. 14.

続いてスコア付与部３６が現在処理対象としている分解候補１０６から特徴量１０６−１を抽出し、クエリ選考知識データベース３２に保存されている保存知識１１６−１〜１１６−３のクエリ選考知識、例えば保存知識１１６−１における検索成功クエリ特徴量１１８との間で類似度計算１２０を行い、これを全ての保存知識１１６−１〜１１６−３について行って総和を求める前記（１）を実行することでスコアを求め、分解候補１０６に付与する。 Subsequently, the score assigning unit 36 extracts the feature quantity 106-1 from the decomposition candidate 106 that is currently processed, and the query selection knowledge of the stored knowledge 116-1 to 116-3 stored in the query selection knowledge database 32, for example, The similarity calculation 120 is performed with the search success query feature value 118 in the stored knowledge 116-1, and this is performed for all the stored knowledge 116-1 to 116-3 to execute the above (1) to obtain the sum. To obtain a score and assign it to the decomposition candidate 106.

全ての分解文字列のスコア計算が済んだならば、分解候補提示部３８でスコア順にソートした後に、分解候補表示指定部２２に表示してユーザによる選択指定を受けることになる。 When the scores of all the decomposed character strings have been calculated, after sorting in the order of score by the decomposition candidate presenting section 38, they are displayed on the decomposition candidate display designating section 22 and selected by the user.

また本発明は情報検索用のプログラムを提供するものであり、このプログラムは図３，図４，図６，図８，図１０のフローチャートに示した処理内容を持つことになる。 Further, the present invention provides a program for information retrieval, and this program has the processing contents shown in the flowcharts of FIGS. 3, 4, 6, 8, and 10.

また本発明はコンピュータにより読取可能な検索プログラムを格納した記録媒体を提供する。ここでＣＤ−ＲＯＭ、フロッピィディスク（Ｒ）、ＤＶＤディスク、光磁気ディスク、ＩＣカードなどの可搬型記憶媒体や、コンピュータシステムの内外に備えられたハードディスクドライブなどの記憶装置の他、回線を介してプログラムを保持するデータベース、あるいは他のコンピュータシステム並びにそのデータベースや、更に回線上の伝送媒体を含むものである。 The present invention also provides a recording medium storing a computer-readable search program. Here, portable storage media such as CD-ROM, floppy disk (R), DVD disk, magneto-optical disk, and IC card, and storage devices such as hard disk drives provided inside and outside the computer system, as well as via lines It includes a database holding the program, other computer systems, the database, and a transmission medium on a line.

また図１に示した本発明の情報検索装置としての機能は、スタンドアローンでネットワークに接続されて個人的に使用されるパーソナルコンピュータに組み込まれてもよいし、企業内の内部ネットワークに設置しているサーバに組み込まれ、サーバに対し接続しているクライアントからのクエリを使用した検索要求に対し検索処理を実行して検索結果を返すようにしてもよい。 Further, the function as the information retrieval apparatus of the present invention shown in FIG. 1 may be incorporated into a personal computer that is connected to a network in a stand-alone manner and used personally, or installed in an internal network in a company. A search process may be executed in response to a search request using a query from a client connected to the server and returned to the search result.

なお本発明は、その目的と利点を損なうことのない適宜の変形を含み、更に上記の実施形態に示した数値による限定は受けない。 The present invention includes appropriate modifications that do not impair the objects and advantages thereof, and is not limited by the numerical values shown in the above embodiments.

ここで本発明の特徴をまとめて列挙すると次の付記のようになる。

（付記）
（付記１）
コンピュータに、
ユーザが指定したクエリを入力するクエリ入力ステップと、
前記クエリを使用して検索する検索ステップと、
前記検索ステップで検索に失敗した場合、前記クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索ステップで検索させる候補クエリ生成ステップと、
を実行させ、
更に、前記候補クエリ生成ステップは、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、前記設定した分解条件に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする検索プログラム。（１） Here, the features of the present invention are enumerated as follows.

(Appendix)
(Appendix 1)
On the computer,
A query input step for entering a query specified by the user;
A search step for searching using the query;
If the search fails in the search step, a candidate query generation step of presenting a plurality of candidate queries generated by decomposing the query to the user and causing the search step to search using the query selected by the user;
And execute
Further, in the candidate query generation step, any one of noun extraction, noun and verb extraction, and noun phrase extraction is set in advance as a decomposition method of the pre-decomposition query, and decomposition is performed based on the set decomposition condition. A search program that generates a combination of a phrase and a phrase extracted from a previous query as a candidate query. (1)

（付記２）
付記１記載の検索プログラムに於いて、前記候補クエリ生成ステップは、前記検索ステップにより検索結果が１件も得られない０件ヒットの場合に検索失敗と判定して候補クエリ生成処理を実行することを特徴とする検索プログラム。 (Appendix 2)
In the search program according to attachment 1, the candidate query generation step executes a candidate query generation process by determining that the search has failed when no search result is obtained by the search step and no hit is found. A search program characterized by

（付記３）
付記１記載の検索プログラムに於いて、前記コンピュータに、更に、前記クエリ入力ステップで入力した分解前クエリと前記候補クエリ生成ステップでユーザが選択した分解後クエリの組について、各々の特徴量を抽出してクエリ分解知識データベースに登録するステップを実行させ、
前記候補クエリ生成ステップは、処理対象としている分解前クエリと分解後クエリの組について各々の特徴量を抽出し、前記知識データベースに登録されている分解前クエリと分解後クエリの組の各特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする検索プログラム。（２） (Appendix 3)
In the search program according to appendix 1, each feature amount is further extracted from the computer into a set of the pre-decomposition query input in the query input step and the post-decomposition query selected by the user in the candidate query generation step. And execute the step of registering in the query decomposition knowledge database,
The candidate query generation step extracts each feature quantity for a set of pre-decomposition query and post-decomposition query to be processed, and each feature quantity of the pre-decomposition query and post-decomposition query registered in the knowledge database A search program characterized in that a score is calculated from the similarity to and given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the score is presented to the user as a candidate query and selected. (2)

（付記４）
付記１記載の検索プログラムに於いて、前記コンピュータに、更に、前記検索ステップで検索に成功した分解後クエリの特徴量を抽出してクエリ選考知識データベースに登録するステップを実行させ、
前記候補クエリ生成ステップは、処理対象としている分解後クエリの特徴量を抽出し、前記クエリ選考知識データベースに登録されている分解後クエリとの特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする検索プログラム。（３） (Appendix 4)
In the search program according to appendix 1, the computer is further caused to execute a step of extracting a feature quantity of the decomposed query that has been successfully searched in the search step and registering it in a query selection knowledge database,
The candidate query generation step extracts a feature amount of a post-decomposition query to be processed, calculates a score from a similarity with a post-decomposition query registered in the query selection knowledge database, and performs the processing A search program characterized in that a post-decomposition query that is given to a target post-decomposition query and sorted in the order of the score is presented to a user as a candidate query and selected. (3)

（付記５）
付記１又は２記載の検索プログラムに於いて、
前記分解前クエリの特徴量は、形態素解析された構成要素、単語数、特定の意味をもつ内容語の１又は複数の組合せを含み、
前記分解後クエリの特徴量は、名詞のみの抽出、名詞と動詞のみの抽出又は名詞句の抽出の分解手法、抽出した単語の分解前クエリ上の位置、抽出した単語の位置、抽出した単語数を示す要素数、分解前クエリの単語数に対する分解後の要素数の割合、又は抽出した単語の意味を示す意味ラベルの１又は複数の組合せを含むことを特徴とする検索プログラム。 (Appendix 5)
In the search program described in Appendix 1 or 2,
The feature amount of the query before decomposition includes one or a plurality of combinations of a constituent element subjected to morphological analysis, the number of words, and a content word having a specific meaning,
The feature value of the post-decomposition query is a noun only extraction, noun and verb only extraction or noun phrase extraction decomposition method, extracted word position on the query before decomposition, extracted word position, number of extracted words , A ratio of the number of elements after decomposition to the number of words of a query before decomposition, or a combination of one or more meaning labels indicating the meaning of extracted words.

（付記６）
ユーザが指定したクエリを入力するクエリ入力部と、
前記クエリを使用して検索する検索部と、
前記検索部で検索に失敗した場合、前記クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索部に検索させる候補クエリ生成部と、
を備え、
更に、前記候補クエリ生成部は、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、前記設定した分解手法に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする情報検索装置。（４） (Appendix 6)
A query input part for inputting a query specified by the user;
A search unit for searching using the query;
A candidate query generation unit that presents to the user a plurality of candidate queries generated by decomposing the query when the search fails in the search unit, and causes the search unit to search using the query selected by the user;
With
Further, the candidate query generation unit preliminarily sets any one of extraction of nouns, extraction of only nouns and verbs, and extraction of noun phrases as a decomposition method of the query before decomposition, and decomposes based on the set decomposition method. An information search apparatus that generates a combination of a phrase and a phrase extracted from a previous query as a candidate query. (4)

（付記７）
付記６記載の情報検索装置に於いて、前記候補クエリ生成部は、前記検索部により検索結果が１件も得られない０件ヒットの場合に検索失敗と判定して候補クエリ生成処理を実行することを特徴とする情報検索装置。 (Appendix 7)
In the information search device according to attachment 6, the candidate query generation unit performs a candidate query generation process by determining that the search has failed when no search result is obtained by the search unit and no hit is found. An information retrieval apparatus characterized by that.

（付記８）
付記６記載の情報検索装置に於いて、更に、前記クエリ入力部で入力した分解前クエリと前記候補クエリ生成部でユーザが選択した分解後クエリの組について、各々の特徴量を抽出して登録したクエリ分解知識データベースを備え、
前記候補クエリ生成部は、処理対象としている分解前クエリと分解後クエリの組について各々の特徴量を抽出し、前記クエリ分解知識データベースに登録されている分解前クエリと分解後クエリの組の各特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする情報検索装置。 (Appendix 8)
The information search device according to appendix 6, wherein each feature is extracted and registered for a set of the pre-decomposition query input by the query input unit and the post-decomposition query selected by the user by the candidate query generation unit. Query decomposition knowledge database
The candidate query generation unit extracts each feature amount of a set of a pre-decomposition query and a post-decomposition query to be processed, and each of the set of the pre-decomposition query and the post-decomposition query registered in the query decomposition knowledge database An information search apparatus characterized in that a score is calculated from a similarity to a feature amount, is given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the score is presented to the user as a candidate query and selected. .

（付記９）
付記６記載の情報検索装置に於いて、更に、前記検索部で検索に成功した分解後クエリの特徴量を抽出して登録したクエリ選考知識データベースを備え、
前記候補クエリ生成部は、処理対象としている分解後クエリの特徴量を抽出し、前記クエリ選考知識データベースに登録されている分解後クエリとの特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする情報検索装置。 (Appendix 9)
The information search device according to appendix 6, further comprising a query selection knowledge database in which feature values of the decomposed query that has been successfully searched by the search unit are extracted and registered,
The candidate query generation unit extracts a feature amount of a decomposed query to be processed, calculates a score from a similarity to a feature amount of the decomposed query registered in the query selection knowledge database, and performs the processing An information search apparatus characterized in that a post-decomposition query given to a target post-decomposition query and sorted in the order of scores is presented to a user as a candidate query and selected.

（付記１０）
付記６又は７記載の情報検索装置に於いて、
前記分解前クエリの特徴量は、形態素解析された構成要素、単語数、特定の意味をもつ内容語の１又は複数の組合せを含み、
前記分解後クエリの特徴量は、名詞のみの抽出、名詞と動詞のみの抽出又は名詞句の抽出の分解手法、抽出した単語の分解前クエリ上の位置、抽出した単語の位置、抽出した単語数を示す要素数、分解前クエリの単語数に対する分解後の要素数の割合、又は抽出した単語の意味を示す意味ラベルの１又は複数の組合せを含むことを特徴とする情報検索装置。 (Appendix 10)
In the information search device according to appendix 6 or 7,
The feature amount of the query before decomposition includes one or a plurality of combinations of a constituent element subjected to morphological analysis, the number of words, and a content word having a specific meaning,
The feature value of the post-decomposition query is a noun only extraction, noun and verb only extraction or noun phrase extraction decomposition method, extracted word position on the query before decomposition, extracted word position, number of extracted words An information search apparatus comprising one or more combinations of the number of elements indicating the ratio of the number of elements after decomposition with respect to the number of words in the query before decomposition, or the meaning label indicating the meaning of the extracted word.

（付記１１）
ユーザが指定したクエリを入力するクエリ入力ステップと、
前記クエリを使用して検索する検索ステップと、
前記検索ステップで検索に失敗した場合、前記クエリを分解して生成した複数の候補クエリをユーザに提示し、ユーザが選択したクエリを使用して前記検索ステップで検索させる候補クエリ生成ステップと、
を備え、
更に、前記候補クエリ生成ステップは、分解前クエリの分解手法として、名詞のみの抽出、名詞と動詞のみの抽出、名詞句の抽出のいずれかを予め設定し、前記設定した分解条件に基づいて分解前クエリから抽出した語句及び語句の組合わせを候補クエリとして生成することを特徴とする情報検索方法。たことを特徴とする情報検索方法。（５） (Appendix 11)
A query input step for entering a query specified by the user;
A search step for searching using the query;
If the search fails in the search step, a candidate query generation step of presenting a plurality of candidate queries generated by decomposing the query to the user and causing the search step to search using the query selected by the user;
With
Further, in the candidate query generation step, any one of noun extraction, noun and verb extraction, and noun phrase extraction is set in advance as a decomposition method of the pre-decomposition query, and decomposition is performed based on the set decomposition condition. An information search method, wherein a phrase extracted from a previous query and a combination of phrases are generated as a candidate query. An information search method characterized by that. (5)

（付記１２）
付記１１記載の情報検索方法に於いて、前記候補クエリ生成ステップは、前記検索ステップにより検索結果が１件も得られない０件ヒットの場合に検索失敗と判定して候補クエリ生成処理を実行することを特徴とする情報検索方法。 (Appendix 12)
In the information search method according to attachment 11, in the candidate query generation step, if no search result is obtained by the search step and no hit is found, it is determined that the search has failed and a candidate query generation process is executed. An information search method characterized by that.

（付記１３）
付記１１記載の情報検索方法に於いて、更に、前記クエリ入力ステップで入力した分解前クエリと前記候補クエリ生成ステップでユーザが選択した分解後クエリの組について、各々の特徴量を抽出してクエリ分解知識データベースに登録するステップを備え、
前記候補クエリ生成ステップは、処理対象としている分解前クエリと分解後クエリの組について各々の特徴量を抽出し、前記クエリ分解知識データベースに登録されている分解前クエリと分解後クエリの組の各特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする情報検索方法。 (Appendix 13)
The information search method according to claim 11, further comprising: extracting a feature amount of each of the pair of the pre-decomposition query input in the query input step and the post-decomposition query selected by the user in the candidate query generation step, and executing the query Comprising the step of registering in the decomposition knowledge database;
In the candidate query generation step, each feature amount is extracted for a set of pre-decomposition query and post-decomposition query to be processed, and each set of pre-decomposition query and post-decomposition query registered in the query decomposition knowledge database An information search method characterized in that a score is calculated from a similarity with a feature quantity, given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the score is presented to the user as a candidate query and selected. .

（付記１４）
付記１１記載の情報検索方法に於いて、更に、前記検索ステップで検索に成功した分解後クエリの特徴量を抽出してクエリ選考知識データベースに登録するステップを備え、
前記候補クエリ生成ステップは、処理対象としている分解後クエリの特徴量を抽出し、前記クエリ選考知識データベースに登録されている分解後クエリとの特徴量との類似度からスコアを算出して前記処理対象の分解後クエリに付与し、前記スコア順にソートした分解後クエリを候補クエリとしてユーザに提示して選択させることを特徴とする情報検索方法。 (Appendix 14)
The information search method according to appendix 11, further comprising a step of extracting a feature amount of the decomposed query that has been successfully searched in the search step and registering it in a query selection knowledge database.
The candidate query generation step extracts a feature amount of a post-decomposition query to be processed, calculates a score from a similarity with a post-decomposition query registered in the query selection knowledge database, and performs the processing An information search method, characterized in that a post-decomposition query given to a target post-decomposition query and sorted in the score order is presented to a user as a candidate query and selected.

（付記１５）
付記１１又は１２記載の情報検索方法に於いて、
前記分解前クエリの特徴量は、形態素解析された構成要素、単語数、特定の意味をもつ内容語の１又は複数の組合せを含み、
前記分解後クエリの特徴量は、名詞のみの抽出、名詞と動詞のみの抽出又は名詞句の抽出の分解手法、抽出した単語の分解前クエリ上の位置、抽出した単語の位置、抽出した単語数を示す要素数、分解前クエリの単語数に対する分解後の要素数の割合、又は抽出した単語の意味を示す意味ラベルの１又は複数の組合せを含むことを特徴とする情報検索方法。
(Appendix 15)
In the information retrieval method according to appendix 11 or 12,
The feature amount of the query before decomposition includes one or a plurality of combinations of a constituent element subjected to morphological analysis, the number of words, and a content word having a specific meaning,
The feature value of the post-decomposition query is a noun only extraction, noun and verb only extraction or noun phrase extraction decomposition method, extracted word position on the query before decomposition, extracted word position, number of extracted words And a ratio of the number of elements after decomposition to the number of words in a query before decomposition, or a combination of one or more meaning labels indicating the meaning of extracted words.

本発明による情報検索装置の実施形態を示した機能構成のブロック図The block diagram of the function structure which showed embodiment of the information search device by this invention 本実施形態の検索プログラムが実行されるコンピュータのハードウェア環境のブロック図Block diagram of the hardware environment of a computer on which the search program of this embodiment is executed 本発明による情報検索処理のフローチャートFlowchart of information retrieval processing according to the present invention 図３のステップＳ８のクエリ分解知識記録処理の詳細を示したフローチャートThe flowchart which showed the detail of the query decomposition | disassembly knowledge recording process of step S8 of FIG. 図４に対応した図１のクエリ分解知識記録処理部の動作を示した説明図Explanatory drawing which showed operation | movement of the query decomposition | disassembly knowledge recording process part of FIG. 1 corresponding to FIG. 図３のステップＳ１０のクエリ選考知識記録処理の詳細を示したフローチャートThe flowchart which showed the detail of the query selection knowledge recording process of step S10 of FIG. 図６に対応した図１のクエリ選考知識記録部の動作を示した説明図Explanatory drawing which showed operation | movement of the query selection knowledge recording part of FIG. 1 corresponding to FIG. 図３のステップＳ４の候補クエリ生成処理の詳細を示したフローチャートThe flowchart which showed the detail of the candidate query production | generation process of step S4 of FIG. 図８に対応した図１の候補クエリ処理部の動作を示した説明図Explanatory drawing which showed operation | movement of the candidate query process part of FIG. 1 corresponding to FIG. 図３のステップＳ４の候補クエリ生成処理の他の実施形態を示したフローチャートThe flowchart which showed other embodiment of the candidate query production | generation process of step S4 of FIG. 図１０に対応した図１の候補クエリ処理部の動作を示した説明図Explanatory drawing which showed operation | movement of the candidate query process part of FIG. 1 corresponding to FIG. 図１のクエリ分解知識記録処理部による動作の他の具体例を示した説明図Explanatory drawing which showed the other specific example of operation | movement by the query decomposition knowledge recording process part of FIG. 図１のクエリ選考知識記録部による動作の他の具体例を示した説明図Explanatory drawing which showed the other specific example of operation | movement by the query selection knowledge recording part of FIG. 図１の候補クエリ処理部による動作の他の具体例を示した説明図Explanatory drawing which showed the other specific example of operation | movement by the candidate query process part of FIG. 図１の候補クエリ処理部の他の実施形態による動作の他の具体例を示した説明図Explanatory drawing which showed the other specific example of operation | movement by other embodiment of the candidate query process part of FIG.

Explanation of symbols

１０：クエリ入力部
１２：検索制御部
１４：検索エンジン
１６：インターネット
１８：検索結果表示部
２０：候補クエリ処理部
２２：分解候補表示指定部
２４：知識記録部
２６：クエリ分解知識記録部
２８：クエリ選考知識記録部
３０：クエリ分解知識データベース
３２：クエリ選考知識データベース
３４：分解候補生成部
３６：スコア付与部
３８：分解候補提示部
４０：分解前クエリ特徴抽出部
４２，４６：検索成功クエリ特徴抽出部
４４，４８：データベース登録部
５０：ＣＰＵ
５２：バス
５４：ＲＡＭ
５５：ＲＯＭ
５６：ハードディスクドライブ
５８：デバイスインタフェース
６０：キーボード
６２：マウス
６４：ディスプレイ
６６：ネットワークアダプタ
６８，７６，９６：分解前クエリ
７０，９８，１０４：検索成功クエリ
７２，７４，７６−１，８０−１，９６−１，１００，１０２，１０６−１：特徴量
７８，１０５：分解候補一覧
８０，１０６：分解後クエリ（候補クエリ）
８２−１〜８２−３，９０−１〜９０−３，１０８−１〜１０８−３，１１６−１〜１１６−３：保存知識
８４，１１０：分解前クエリ特徴量
８６，１１２：分解後クエリ特徴量
８８，９４，１１４：類似度計算
９２：検索成功クエリ特徴量 10: Query input unit 12: Search control unit 14: Search engine 16: Internet 18: Search result display unit 20: Candidate query processing unit 22: Decomposition candidate display designation unit 24: Knowledge recording unit 26: Query decomposition knowledge recording unit 28: Query selection knowledge recording unit 30: Query decomposition knowledge database 32: Query selection knowledge database 34: Decomposition candidate generation unit 36: Score assignment unit 38: Decomposition candidate presentation unit 40: Predecomposition query feature extraction unit 42, 46: Search success query feature Extraction units 44 and 48: database registration unit 50: CPU
52: Bus 54: RAM
55: ROM
56: Hard disk drive 58: Device interface 60: Keyboard 62: Mouse 64: Display 66: Network adapter 68, 76, 96: Pre-disassembly queries 70, 98, 104: Search success queries 72, 74, 76-1, 80-1 , 96-1, 100, 102, 106-1: feature amount 78, 105: decomposition candidate list 80, 106: post-decomposition query (candidate query)
82-1 to 82-3, 90-1 to 90-3, 108-1 to 108-3, 116-1 to 116-3: stored knowledge 84, 110: pre-decomposition query feature value 86, 112: post-decomposition query Feature amount 88, 94, 114: Similarity calculation 92: Search success query feature amount

Claims

On the computer,
A query input step for entering a query specified by the user;
A search step for searching using the query;
If the search fails in the search step, a candidate query generation step of presenting a plurality of candidate queries generated by decomposing the query to the user and causing the search step to search using the query selected by the user;
Query decomposition knowledge database registration step of extracting each feature quantity and registering it in the query decomposition knowledge database for a set of the pre-decomposition query input in the query input step and the post-decomposition query selected by the user in the candidate query generation step; ,
And execute
Further, in the candidate query generation step, any one of noun extraction, noun and verb extraction, and noun phrase extraction is set in advance as a decomposition method of the pre-decomposition query, and decomposition is performed based on the set decomposition condition. extracting each feature quantity for the set of decomposition after the query and degradation before a query that is a product to Rutotomoni processed combinations of words and phrases extracted from the previous query as a candidate query, registered in the query decomposition knowledge database The score is calculated from the similarity between each feature quantity of the combination of the pre-decomposition query and the post-decomposition query and is given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the scores is presented to the user as a candidate query The search program characterized by making it select .

The search program according to claim 1, further comprising: a query selection knowledge database registration step for extracting, in the computer, a feature amount of a post-decomposition query that has been successfully searched in the search step and registering it in a query selection knowledge database ;
And execute
The candidate query generation step extracts a feature amount of a post-decomposition query to be processed, calculates a score from a similarity with a post-decomposition query registered in the query selection knowledge database, and performs the processing A search program characterized in that a post-decomposition query that is given to a target post-decomposition query and sorted in the order of the score is presented to a user as a candidate query and selected.

A query input part for inputting a query specified by the user;
A search unit for searching using the query;
A candidate query generation unit that presents to the user a plurality of candidate queries generated by decomposing the query when the search fails in the search unit, and causes the search unit to search using the query selected by the user;
A query decomposition knowledge database registration unit for extracting each feature quantity and registering it in a query decomposition knowledge database for a set of a pre-decomposition query input in the query input step and a post-decomposition query selected by the user in the candidate query generation step; ,
With
Further, the candidate query generation unit preliminarily sets any one of extraction of nouns, extraction of only nouns and verbs, and extraction of noun phrases as a decomposition method of the query before decomposition, and decomposes based on the set decomposition method. A phrase extracted from the previous query and a combination of phrases are generated as candidate queries, and each feature amount is extracted for the combination of the pre-decomposition query and the post-decomposition query to be processed, and is registered in the query decomposition knowledge database. A score is calculated from the similarity between each feature quantity of the pair of the pre-decomposition query and the post-decomposition query and is given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the scores is presented to the user as a candidate query. An information search device characterized by being selected by the user.

Computer
A query input step for entering a query specified by the user;
A search step for searching using the query;
If the search fails in the search step, a candidate query generation step of presenting a plurality of candidate queries generated by decomposing the query to the user and causing the search step to search using the query selected by the user;
Query decomposition knowledge database registration step of extracting each feature quantity and registering it in the query decomposition knowledge database for a set of the pre-decomposition query input in the query input step and the post-decomposition query selected by the user in the candidate query generation step; ,
With
Further, in the candidate query generation step, any one of noun extraction, noun and verb extraction, and noun phrase extraction is set in advance as a decomposition method of the pre-decomposition query, and decomposition is performed based on the set decomposition condition. A phrase extracted from the previous query and a combination of phrases are generated as candidate queries, and each feature amount is extracted for the combination of the pre-decomposition query and the post-decomposition query to be processed, and is registered in the query decomposition knowledge database. A score is calculated from the similarity between each feature quantity of the pair of the pre-decomposition query and the post-decomposition query and is given to the post-decomposition query to be processed, and the post-decomposition query sorted in the order of the scores is presented to the user as a candidate query. And an information retrieval method characterized by the fact that it is selected .