JP2020173611A

JP2020173611A - Search processing program, search processing method, and search processing device

Info

Publication number: JP2020173611A
Application number: JP2019075082A
Authority: JP
Inventors: 拡山下; Hiroshi Yamashita; 浩太夏目; Kota Natsume; 尚洋伊藤; Naohiro Ito; 彩山口; Aya Yamaguchi
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2019-04-10
Filing date: 2019-04-10
Publication date: 2020-10-22
Anticipated expiration: 2039-04-10
Also published as: JP7279481B2

Abstract

To execute search to which an attribute is reflected for such a subject to be searched having a keyword being set for each attribute, even when input of a keyword is received in a state without having the attribute being designated.SOLUTION: A search processing device refers to an attribute candidate table 800, for identifying attribute keywords (words 711 to 714) from words 711 to 715 included in an input sentence 710. Next, the search processing device identifies registered words corresponding to the identified attribute keywords. Then, the search processing device extracts a document group 1110 including the identified registered words, as to attributes corresponding to the identified attribute keywords. Next, the search processing device identifies the word 715 other than the attribute keywords included in the input sentence 710 as a search keyword. Then, the search processing device identifies a search subject document 1120 from the document group 1110, based on the identified search keyword.SELECTED DRAWING: Figure 11

Description

本発明は、検索処理プログラム、検索処理方法および検索処理装置に関する。 The present invention relates to a search processing program, a search processing method, and a search processing apparatus.

近年、企業内に蓄積されているテキストや画像などの非構造のデータをナレッジ活用することが流行している。その中で、企業内に蓄積されているドキュメント（例えば、製品マニュアル、論文など）を、効率的かつ簡単に検索することができる仕組みが求められている。 In recent years, it has become popular to utilize unstructured data such as texts and images accumulated in a company for knowledge. Under such circumstances, there is a demand for a mechanism that enables efficient and easy search of documents (for example, product manuals, papers, etc.) accumulated in a company.

先行技術としては、ユーザが入力した分野キーワードに基づいて、分野・フォルダ対応情報データベースを参照して分野キーワードを属性として含むフォルダを選択し、検索インデックスを参照して選択されたフォルダを対象として検索を実行するものがある。 As a prior art, based on the field keyword entered by the user, the field / folder correspondence information database is referred to to select a folder containing the field keyword as an attribute, and the search index is referred to to search the selected folder. There is something to do.

特開２００６−７２７０５号公報Japanese Unexamined Patent Publication No. 2006-72705

しかしながら、従来技術では、検索対象のドキュメントの属性を考慮した検索を行うことが難しい。例えば、全文検索により検索した検索結果の集合に対して、ユーザに属性を指定させることで、検索結果を絞り込むことが考えられるが、検索キーワードの入力とは別に、属性を指定する操作が必要となり手間がかかる。 However, in the prior art, it is difficult to perform a search in consideration of the attributes of the document to be searched. For example, it is conceivable to narrow down the search results by letting the user specify attributes for the set of search results searched by full-text search, but it is necessary to specify the attributes separately from the input of the search keyword. It takes time and effort.

一つの側面では、本発明は、属性を指定しない状態でキーワードの入力を受け付けても、キーワードが属性毎に設定されている検索対象について、属性を反映した検索を実行することを目的とする。 On one aspect, it is an object of the present invention to execute a search reflecting an attribute for a search target in which a keyword is set for each attribute even if an input of a keyword is accepted without specifying an attribute.

１つの実施態様では、検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定し、前記第１のワードが存在すると判定した場合に、前記第１のワードと一致又は類似するワードと対応付けられている属性において前記第１のワードを含む検索対象群を抽出し、前記複数のワードに含まれる前記第１のワード以外の第２のワードと前記検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、前記検索対象群の中から前記検索対象を特定する、検索処理プログラムが提供される。 In one embodiment, among the plurality of words received for the search for the search target, one or a plurality of words associated with each attribute of the search target previously stored in the storage unit are matched or similar. When it is determined whether or not the first word exists and it is determined that the first word exists, the first word is in the attribute associated with the word that matches or is similar to the first word. A search target group including a word is extracted, and the search is performed based on a comparison result between a second word other than the first word included in the plurality of words and a word included in the data of each of the search target groups. A search processing program that identifies the search target from the target group is provided.

本発明の一側面によれば、属性を指定しない状態でキーワードの入力を受け付けても、キーワードが属性毎に設定されている検索対象について、属性を反映した検索を実行することができる。 According to one aspect of the present invention, even if the input of the keyword is accepted without specifying the attribute, the search target in which the keyword is set for each attribute can be searched by reflecting the attribute.

図１は、実施の形態にかかる検索処理方法の一実施例を示す説明図である。FIG. 1 is an explanatory diagram showing an embodiment of a search processing method according to an embodiment. 図２は、検索処理システム２００のシステム構成例を示す説明図である。FIG. 2 is an explanatory diagram showing a system configuration example of the search processing system 200. 図３は、検索処理装置１０１のハードウェア構成例を示すブロック図である。FIG. 3 is a block diagram showing a hardware configuration example of the search processing device 101. 図４は、ドキュメント管理ＤＢ２２０の記憶内容の一例を示す説明図である。FIG. 4 is an explanatory diagram showing an example of the stored contents of the document management DB 220. 図５は、製品属性テーブル２３０の記憶内容の一例を示す説明図である。FIG. 5 is an explanatory diagram showing an example of the stored contents of the product attribute table 230. 図６は、検索処理装置１０１の機能的構成例を示すブロック図である。FIG. 6 is a block diagram showing a functional configuration example of the search processing device 101. 図７は、単語の比較例を示す説明図である。FIG. 7 is an explanatory diagram showing a comparative example of words. 図８は、属性候補テーブル８００の記憶内容の一例を示す説明図である。FIG. 8 is an explanatory diagram showing an example of the stored contents of the attribute candidate table 800. 図９は、属性キーワードに対応する属性の絞り込み例を示す説明図（その１）である。FIG. 9 is an explanatory diagram (No. 1) showing an example of narrowing down the attributes corresponding to the attribute keywords. 図１０は、属性キーワードに対応する属性の絞り込み例を示す説明図（その２）である。FIG. 10 is an explanatory diagram (No. 2) showing an example of narrowing down the attributes corresponding to the attribute keywords. 図１１は、ドキュメントの検索例を示す説明図である。FIG. 11 is an explanatory diagram showing a document search example. 図１２は、検索画面の画面例を示す説明図である。FIG. 12 is an explanatory diagram showing a screen example of the search screen. 図１３は、検索処理装置１０１の検索処理手順の一例を示すフローチャート（その１）である。FIG. 13 is a flowchart (No. 1) showing an example of the search processing procedure of the search processing device 101. 図１４は、検索処理装置１０１の検索処理手順の一例を示すフローチャート（その２）である。FIG. 14 is a flowchart (No. 2) showing an example of the search processing procedure of the search processing device 101. 図１５は、マッチング処理の具体的な処理手順の一例を示すフローチャート（その１）である。FIG. 15 is a flowchart (No. 1) showing an example of a specific processing procedure of the matching process. 図１６は、マッチング処理の具体的な処理手順の一例を示すフローチャート（その２）である。FIG. 16 is a flowchart (No. 2) showing an example of a specific processing procedure of the matching process. 図１７は、検索処理の具体的な処理手順の一例を示すフローチャート（その１）である。FIG. 17 is a flowchart (No. 1) showing an example of a specific processing procedure of the search process. 図１８は、検索処理の具体的な処理手順の一例を示すフローチャート（その２）である。FIG. 18 is a flowchart (No. 2) showing an example of a specific processing procedure of the search process. 図１９は、検索条件の一例を示す説明図である。FIG. 19 is an explanatory diagram showing an example of search conditions.

以下に図面を参照して、本発明にかかる検索処理プログラム、検索処理方法および検索処理装置の実施の形態を詳細に説明する。 Hereinafter, embodiments of a search processing program, a search processing method, and a search processing apparatus according to the present invention will be described in detail with reference to the drawings.

（実施の形態）
図１は、実施の形態にかかる検索処理方法の一実施例を示す説明図である。図１において、検索処理装置１０１は、属性を指定しない状態で入力されたキーワードに基づいて、属性を反映した検索を実行するコンピュータである。属性は、検索対象を特徴付ける情報である。 (Embodiment)
FIG. 1 is an explanatory diagram showing an embodiment of a search processing method according to an embodiment. In FIG. 1, the search processing device 101 is a computer that executes a search reflecting the attributes based on the keywords input in a state where the attributes are not specified. Attributes are information that characterizes the search target.

検索対象は、例えば、製品マニュアルや論文などのドキュメント（文書データ）である。検索対象には、属性を示す属性情報が付与されている。例えば、検索対象が製品マニュアルの場合、属性は、製品名、エディション（版数）、ＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）、バージョン情報などである。 The search target is, for example, a document (document data) such as a product manual or a paper. Attribute information indicating the attribute is given to the search target. For example, when the search target is a product manual, the attributes are the product name, edition (version number), OS (Operating System), version information, and the like.

既存の検索手法として、検索対象のドキュメントに属性情報を付与しておき、検索時は、入力されたキーワードが、本文や属性情報に含まれるドキュメントを検索結果として出力するものがある。ただし、本文と属性情報を区別せずに、入力されたキーワードで検索を行うと、検索結果に含まれるノイズ（ユーザが意図しない情報）が多くなってしまう。 As an existing search method, attribute information is added to a document to be searched, and at the time of search, the input keyword outputs a document included in the text or attribute information as a search result. However, if the search is performed using the entered keyword without distinguishing between the text and the attribute information, the noise (information not intended by the user) included in the search result will increase.

また、検索結果に含まれるノイズを減らす手法として、例えば、本文に対して検索した検索結果の集合に対して、ユーザに属性を指定させることで、検索結果を絞り込むものがある。しかし、検索キーワードの入力とは別に、属性を指定する操作が必要となり、手間がかかるという問題がある。また、ユーザが属性の詳細を把握していないと、検索結果を絞り込むための属性を指定することができない。 Further, as a method of reducing noise included in the search result, for example, there is a method of narrowing down the search result by having the user specify an attribute for a set of search results searched for the text. However, there is a problem that it takes time and effort because an operation of specifying an attribute is required separately from inputting a search keyword. In addition, if the user does not know the details of the attribute, the attribute for narrowing down the search result cannot be specified.

そこで、本実施の形態では、属性を指定しない状態でキーワードの入力を受け付けても、キーワードが属性毎に設定されている検索対象について、属性を反映した検索を可能にする検索処理方法について説明する。以下、検索処理装置１０１の処理例について説明する。 Therefore, in the present embodiment, even if the input of the keyword is accepted without specifying the attribute, the search processing method that enables the search reflecting the attribute for the search target in which the keyword is set for each attribute will be described. .. Hereinafter, a processing example of the search processing device 101 will be described.

（１）検索処理装置１０１は、検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部１１０に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定する。 (1) The search processing device 101 includes one or a plurality of words associated with each attribute of the search target previously stored in the storage unit 110 among the plurality of words received for the search for the search target. Determine if there is a first word that matches or is similar.

ここで、複数のワードは、検索時にキーワードとして入力される２以上のワードである。検索対象の属性毎に対応付けられた１又は複数のワードは、当該属性に対応するキーワードとして予め登録された語句に含まれる１又は複数のワードである。例えば、属性「製品名」のキーワードとしては、実際に存在する製品名が設定される。 Here, the plurality of words are two or more words that are input as keywords at the time of search. The one or more words associated with each attribute to be searched are one or more words included in the words and phrases registered in advance as keywords corresponding to the attribute. For example, as a keyword of the attribute "product name", a product name that actually exists is set.

具体的には、例えば、検索処理装置１０１は、受け付けた複数のワードの各ワードと、検索対象の属性毎に対応付けられた各ワードとをそれぞれ比較する。そして、検索処理装置１０１は、比較した結果に基づいて、受け付けた複数のワードの中に、検索対象の属性毎に対応付けられたいずれかのワードと一致又は類似する第１のワードが存在するか否かを判定する。 Specifically, for example, the search processing device 101 compares each word of the received plurality of words with each word associated with each attribute of the search target. Then, the search processing device 101 has a first word that matches or is similar to any word associated with each attribute of the search target among the plurality of received words based on the comparison result. Judge whether or not.

図１の例では、検索対象に対する検索のために、ワード１１１（ｘｘｘ）とワード１１２（ａａａ）とを受け付けた場合を想定する。また、記憶部１１０には、検索対象の属性１に対応付けられたワード「ｘｘｘ」と、検索対象の属性２に対応付けられたワード「ｙｙｙ」と、検索対象の属性３に対応付けられたワード「ｚｚｚ」とが記憶されている場合を想定する。 In the example of FIG. 1, it is assumed that the word 111 (xxx) and the word 112 (aaa) are accepted for the search for the search target. Further, the storage unit 110 is associated with the word "xxx" associated with the search target attribute 1, the word "yyy" associated with the search target attribute 2, and the search target attribute 3. It is assumed that the word "zzz" is stored.

この場合、検索処理装置１０１は、ワード１１１，１１２の各ワードと、検索対象の属性１〜３毎に対応付けられた各ワードとをそれぞれ比較する。ここでは、受け付けたワード１１１（ｘｘｘ）が、検索対象の属性１に対応付けられたワード「ｘｘｘ」と一致する。このため、第１のワード「ｘｘｘ」が存在すると判定される。 In this case, the search processing device 101 compares each of the words 111 and 112 with each word associated with each of the attributes 1 to 3 of the search target. Here, the received word 111 (xxx) matches the word "xxx" associated with the attribute 1 to be searched. Therefore, it is determined that the first word "xxx" exists.

（２）検索処理装置１０１は、第１のワードが存在すると判定した場合に、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象群を抽出する。ここで、抽出元となる複数の検索対象は、例えば、企業内に蓄積されているドキュメントである。 (2) When the search processing device 101 determines that the first word exists, it extracts a search target group including the first word in the attribute associated with the word that matches or is similar to the first word. To do. Here, the plurality of search targets that are the extraction sources are, for example, documents accumulated in the company.

また、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象とは、検索対象を特徴付ける属性のうち、第１のワードと一致又は類似するワードと対応付けられている属性の属性値に、第１のワードを含む検索対象である。 Further, in the attribute associated with the word that matches or is similar to the first word, the search target including the first word is a word that matches or is similar to the first word among the attributes that characterize the search target. The search target includes the first word in the attribute value of the associated attribute.

すなわち、検索処理装置１０１は、複数の検索対象の中から、第１のワードと一致又は類似するワードと対応付けられている属性の属性値に、第１のワードが含まれる検索対象群を抽出する。これにより、第１のワードと一致又は類似するワードと対応付けられている属性の属性値に、第１のワードを含む検索対象群を絞り込むことができる。 That is, the search processing device 101 extracts a search target group in which the first word is included in the attribute value of the attribute associated with the word that matches or is similar to the first word from the plurality of search targets. To do. As a result, the search target group including the first word can be narrowed down to the attribute value of the attribute associated with the word that matches or is similar to the first word.

図１の例では、検索処理装置１０１は、抽出元１２０の中から、属性１の属性値に、第１のワード「ｘｘｘ」が含まれる検索対象群を抽出する。なお、抽出元１２０に含まれる各検索対象には、属性１〜３の属性値を示す属性情報が付与されている。ここでは、検索対象群１２１が抽出された場合を想定する。 In the example of FIG. 1, the search processing device 101 extracts a search target group in which the first word “xxx” is included in the attribute value of the attribute 1 from the extraction source 120. In addition, attribute information indicating the attribute values of attributes 1 to 3 is given to each search target included in the extraction source 120. Here, it is assumed that the search target group 121 is extracted.

（３）検索処理装置１０１は、受け付けた複数のワードに含まれる第１のワード以外の第２のワードと、抽出した検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、検索対象群の中から検索対象を特定する。ここで、検索対象群それぞれのデータとは、例えば、ドキュメントの本文情報である。 (3) The search processing device 101 searches based on the comparison result between the second word other than the first word included in the received plurality of words and the word included in the data of each of the extracted search target groups. Identify the search target from the target group. Here, the data of each search target group is, for example, the text information of the document.

具体的には、例えば、検索処理装置１０１は、受け付けた複数のワードのうち第１のワード以外の第２のワードを特定する。つぎに、検索処理装置１０１は、特定した第２のワードと、抽出した検索対象群それぞれのデータに含まれるワードとを比較する。そして、検索処理装置１０１は、抽出した検索対象群の中から、第２のワードと一致するワードがデータに含まれる検索対象を特定する。 Specifically, for example, the search processing device 101 identifies a second word other than the first word among the plurality of received words. Next, the search processing device 101 compares the specified second word with the word included in the data of each of the extracted search target groups. Then, the search processing device 101 identifies a search target whose data includes a word that matches the second word from the extracted search target group.

図１の例では、検索処理装置１０１は、受け付けたワード１１１，１１２のうち第１のワード「ｘｘｘ」以外の第２のワード「ａａａ」を特定する。つぎに、検索処理装置１０１は、第２のワード「ａａａ」と、抽出した検索対象群１２１それぞれのデータに含まれるワードとを比較する。そして、検索処理装置１０１は、検索対象群１２１の中から、第２のワード「ａａａ」と一致するワードがデータに含まれる検索対象を特定する。ここでは、検索対象集合１２２が特定されている。 In the example of FIG. 1, the search processing device 101 identifies a second word “aaa” other than the first word “xxx” among the received words 111 and 112. Next, the search processing device 101 compares the second word “aaa” with the word included in the data of each of the extracted search target group 121. Then, the search processing device 101 identifies a search target whose data includes a word that matches the second word "aaa" from the search target group 121. Here, the search target set 122 is specified.

このように、検索処理装置１０１によれば、属性を指定しない状態でキーワードの入力を受け付けても、キーワードが属性毎に設定されている検索対象について、属性を反映した検索を実行することができる。 As described above, according to the search processing device 101, even if the input of the keyword is accepted without specifying the attribute, the search target in which the keyword is set for each attribute can be searched by reflecting the attribute. ..

図１の例では、ユーザが意識して入力しなくても、検索対象に対する検索のために受け付けたワード１１１，１１２を、絞り込み用のワード（第１のワード）と、全文検索用のワード（第２のワード）とに分類することができる。これにより、ユーザが指定したい属性情報を推定して検索対象群１２１を絞り込んだ上で全文検索を行うことができ、検索結果に含まれるノイズを減らして、ユーザが欲しい情報を見つけやすくすることができる。 In the example of FIG. 1, the words 111 and 112 received for the search for the search target are narrowed down (first word) and full-text search words (words for full-text search) without the user having to consciously input the words. It can be classified into the second word). As a result, it is possible to estimate the attribute information that the user wants to specify, narrow down the search target group 121, and then perform a full-text search, reduce noise included in the search results, and make it easier for the user to find the information that the user wants. it can.

（検索処理システム２００のシステム構成例）
つぎに、実施の形態にかかる検索処理システム２００のシステム構成例について説明する。検索処理システム２００は、図１に示した検索処理装置１０１を含むコンピュータシステムであり、例えば、企業内に蓄積されているドキュメントをナレッジ活用するためのシステムに適用される。 (System configuration example of search processing system 200)
Next, a system configuration example of the search processing system 200 according to the embodiment will be described. The search processing system 200 is a computer system including the search processing device 101 shown in FIG. 1, and is applied to, for example, a system for utilizing knowledge of documents stored in a company.

図２は、検索処理システム２００のシステム構成例を示す説明図である。図２において、検索処理システム２００は、検索処理装置１０１と、クライアント装置２０１と、を含む。検索処理システム２００において、検索処理装置１０１およびクライアント装置２０１は、有線又は無線のネットワーク２１０を介して接続される。ネットワーク２１０は、例えば、ＬＡＮ（ＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ）、ＷＡＮ（ＷｉｄｅＡｒｅａＮｅｔｗｏｒｋ）、インターネットなどである。 FIG. 2 is an explanatory diagram showing a system configuration example of the search processing system 200. In FIG. 2, the search processing system 200 includes a search processing device 101 and a client device 201. In the search processing system 200, the search processing device 101 and the client device 201 are connected via a wired or wireless network 210. The network 210 is, for example, a LAN (Local Area Network), a WAN (Wide Area Network), the Internet, or the like.

ここで、検索処理装置１０１は、ドキュメント管理ＤＢ（Ｄａｔａｂａｓｅ）２２０および製品属性テーブル２３０を有する。例えば、検索処理装置１０１は、サーバである。検索処理装置１０１は、複数のサーバにより実現されることにしてもよい。各種ＤＢ等２２０，２３０の記憶内容については、図４および図５を用いて後述する。図１に示した記憶部１１０は、例えば、製品属性テーブル２３０に相当する。 Here, the search processing device 101 has a document management DB (Database) 220 and a product attribute table 230. For example, the search processing device 101 is a server. The search processing device 101 may be realized by a plurality of servers. The stored contents of 220 and 230 of various DBs and the like will be described later with reference to FIGS. 4 and 5. The storage unit 110 shown in FIG. 1 corresponds to, for example, the product attribute table 230.

ただし、各種ＤＢ等２２０，２３０は、検索処理装置１０１がアクセス可能な他のコンピュータが有することにしてもよい。この場合、検索処理装置１０１は、他のコンピュータにアクセスして、各種ＤＢ等２２０，２３０の記憶内容を参照することができる。 However, the various DBs 220 and 230 may be provided by another computer that can be accessed by the search processing device 101. In this case, the search processing device 101 can access another computer and refer to the stored contents of 220 and 230 such as various DBs.

クライアント装置２０１は、検索処理システム２００のユーザが使用するコンピュータである。ユーザは、例えば、検索処理システム２００が導入された企業の従業員である。クライアント装置２０１は、例えば、ＰＣ（ＰｅｒｓｏｎａｌＣｏｍｐｕｔｅｒ）、タブレットＰＣ、スマートフォンなどである。 The client device 201 is a computer used by the user of the search processing system 200. The user is, for example, an employee of a company in which the search processing system 200 is introduced. The client device 201 is, for example, a PC (Personal Computer), a tablet PC, a smartphone, or the like.

なお、図２の例では、クライアント装置２０１を１台のみ表示したが、検索処理システム２００には複数のクライアント装置２０１が含まれていてもよい。 In the example of FIG. 2, only one client device 201 is displayed, but the search processing system 200 may include a plurality of client devices 201.

（検索処理装置１０１のハードウェア構成例）
図３は、検索処理装置１０１のハードウェア構成例を示すブロック図である。図３において、検索処理装置１０１は、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）３０１と、メモリ３０２と、ディスクドライブ３０３と、ディスク３０４と、通信Ｉ／Ｆ（Ｉｎｔｅｒｆａｃｅ）３０５と、可搬型記録媒体Ｉ／Ｆ３０６と、を有する。また、各構成部は、バス３００によってそれぞれ接続される。 (Example of hardware configuration of search processing device 101)
FIG. 3 is a block diagram showing a hardware configuration example of the search processing device 101. In FIG. 3, the search processing device 101 includes a CPU (Central Processing Unit) 301, a memory 302, a disk drive 303, a disk 304, a communication I / F (Interface) 305, and a portable recording medium I / F 306. Have. Further, each component is connected by a bus 300.

ここで、ＣＰＵ３０１は、検索処理装置１０１の全体の制御を司る。ＣＰＵ３０１は、複数のコアを有していてもよい。メモリ３０２は、例えば、ＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）およびフラッシュＲＯＭなどを有する。具体的には、例えば、フラッシュＲＯＭがＯＳ（ＯｐｅｒａｔｉｎｇＳｙｓｔｅｍ）のプログラムを記憶し、ＲＯＭがアプリケーションプログラムを記憶し、ＲＡＭがＣＰＵ３０１のワークエリアとして使用される。メモリ３０２に記憶されるプログラムは、ＣＰＵ３０１にロードされることで、コーディングされている処理をＣＰＵ３０１に実行させる。 Here, the CPU 301 controls the entire search processing device 101. The CPU 301 may have a plurality of cores. The memory 302 includes, for example, a ROM (Read Only Memory), a RAM (Random Access Memory), a flash ROM, and the like. Specifically, for example, the flash ROM stores the OS (Operating System) program, the ROM stores the application program, and the RAM is used as the work area of the CPU 301. The program stored in the memory 302 is loaded into the CPU 301 to cause the CPU 301 to execute the coded process.

ディスクドライブ３０３は、ＣＰＵ３０１の制御に従ってディスク３０４に対するデータのリード／ライトを制御する。ディスク３０４は、ディスクドライブ３０３の制御で書き込まれたデータを記憶する。ディスク３０４としては、例えば、磁気ディスク、光ディスクなどが挙げられる。 The disk drive 303 controls data read / write to the disk 304 according to the control of the CPU 301. The disk 304 stores the data written under the control of the disk drive 303. Examples of the disk 304 include a magnetic disk and an optical disk.

通信Ｉ／Ｆ３０５は、通信回線を通じてネットワーク２１０に接続され、ネットワーク２１０を介して外部のコンピュータ（例えば、図２に示したクライアント装置２０１）に接続される。そして、通信Ｉ／Ｆ３０５は、ネットワーク２１０と装置内部とのインターフェースを司り、外部のコンピュータからのデータの入出力を制御する。通信Ｉ／Ｆ３０５には、例えば、モデムやＬＡＮアダプタなどを採用することができる。 The communication I / F 305 is connected to the network 210 through a communication line, and is connected to an external computer (for example, the client device 201 shown in FIG. 2) via the network 210. The communication I / F 305 controls the interface between the network 210 and the inside of the device, and controls the input / output of data from an external computer. For the communication I / F 305, for example, a modem, a LAN adapter, or the like can be adopted.

可搬型記録媒体Ｉ／Ｆ３０６は、ＣＰＵ３０１の制御に従って可搬型記録媒体３０７に対するデータのリード／ライトを制御する。可搬型記録媒体３０７は、可搬型記録媒体Ｉ／Ｆ３０６の制御で書き込まれたデータを記憶する。可搬型記録媒体３０７としては、例えば、ＣＤ（ＣｏｍｐａｃｔＤｉｓｃ）−ＲＯＭ、ＤＶＤ（ＤｉｇｉｔａｌＶｅｒｓａｔｉｌｅＤｉｓｋ）、ＵＳＢ（ＵｎｉｖｅｒｓａｌＳｅｒｉａｌＢｕｓ）メモリなどが挙げられる。 The portable recording medium I / F 306 controls data read / write to the portable recording medium 307 according to the control of the CPU 301. The portable recording medium 307 stores the data written under the control of the portable recording medium I / F 306. Examples of the portable recording medium 307 include a CD (Compact Disc) -ROM, a DVD (Digital Versaille Disk), and a USB (Universal Serial Bus) memory.

なお、検索処理装置１０１は、上述した構成部のほかに、例えば、ＳＳＤ（ＳｏｌｉｄＳｔａｔｅＤｒｉｖｅ）、入力装置、ディスプレイ等を有することにしてもよい。また、検索処理装置１０１は、上述した構成部のうち、例えば、ディスクドライブ３０３、ディスク３０４、可搬型記録媒体Ｉ／Ｆ３０６、可搬型記録媒体３０７を有していなくてもよい。 In addition to the above-mentioned components, the search processing device 101 may include, for example, an SSD (Solid State Drive), an input device, a display, and the like. Further, the search processing device 101 does not have to have, for example, a disk drive 303, a disk 304, a portable recording medium I / F 306, and a portable recording medium 307 among the above-described components.

また、図２に示したクライアント装置２０１についても、検索処理装置１０１と同様のハードウェア構成により実現することができる。ただし、クライアント装置２０１は、上述した構成部のほかに、入力装置およびディスプレイを有する。入力装置は、文字、数字、各種指示などの入力のためのキーを有し、データの入力を行う。入力装置は、キーボードやマウスなどであってもよく、また、タッチパネル式の入力パッドやテンキーなどであってもよい。ディスプレイは、カーソル、アイコンあるいはツールボックスをはじめ、文書、画像、機能情報などのデータを表示する表示装置である。ディスプレイとしては、例えば、液晶ディスプレイや有機ＥＬ（Ｅｌｅｃｔｒｏｌｕｍｉｎｅｓｃｅｎｃｅ）ディスプレイなどを採用することができる。 Further, the client device 201 shown in FIG. 2 can also be realized by the same hardware configuration as the search processing device 101. However, the client device 201 has an input device and a display in addition to the above-described components. The input device has keys for inputting characters, numbers, various instructions, etc., and inputs data. The input device may be a keyboard, a mouse, or the like, or may be a touch panel type input pad, a numeric keypad, or the like. A display is a display device that displays data such as a cursor, an icon, a toolbox, a document, an image, and functional information. As the display, for example, a liquid crystal display, an organic EL (Electroluminescence) display, or the like can be adopted.

（各種ＤＢ等２２０，２３０の記憶内容）
つぎに、図４および図５を用いて、検索処理装置１０１が有する各種ＤＢ等２２０，２３０の記憶内容について説明する。各種ＤＢ等２２０，２３０は、例えば、図３に示した検索処理装置１０１のメモリ３０２、ディスク３０４などの記憶装置により実現される。 (Memory contents of 220 and 230 such as various DBs)
Next, the stored contents of various DBs and the like 220 and 230 included in the search processing device 101 will be described with reference to FIGS. 4 and 5. The various DBs 220 and 230 are realized, for example, by storage devices such as the memory 302 and the disk 304 of the search processing device 101 shown in FIG.

図４は、ドキュメント管理ＤＢ２２０の記憶内容の一例を示す説明図である。図４において、ドキュメント管理ＤＢ２２０は、ドキュメントＩＤ、製品名、エディション、ファミリ製品名、ＯＳおよび本文情報のフィールドを有する。各フィールドに情報を設定することで、ドキュメント管理情報（例えば、ドキュメント管理情報４００−１，４００−２）がレコードとして記憶される。 FIG. 4 is an explanatory diagram showing an example of the stored contents of the document management DB 220. In FIG. 4, the document management DB 220 has fields for document ID, product name, edition, family product name, OS, and text information. By setting information in each field, document management information (for example, document management information 400-1, 400-2) is stored as a record.

ここで、ドキュメントＩＤは、検索対象のドキュメントを一意に識別する識別子である。ドキュメントは、例えば、１ページ１ファイルのＨＴＭＬ（ＨｙｐｅｒＴｅｘｔＭａｒｋｕｐＬａｎｇｕａｇｅ）形式のデータである。ここでは、検索対象のドキュメントが「製品マニュアル」である場合を例に挙げて説明する。 Here, the document ID is an identifier that uniquely identifies the document to be searched. The document is, for example, HTML (HyperText Markup Language) format data of one file per page. Here, a case where the document to be searched is a "product manual" will be described as an example.

製品名、エディション、ファミリ製品名およびＯＳは、検索対象のドキュメントを特徴付ける属性の一例である。製品名は、製品の名称である。エディションは、製品のエディションである。ファミリ製品名は、関連製品の名称である。ＯＳは、製品が動作するＯＳである。 The product name, edition, family product name, and OS are examples of attributes that characterize the document to be searched. The product name is the name of the product. The edition is the edition of the product. Family product names are the names of related products. The OS is the OS on which the product operates.

本文情報は、検索対象のドキュメントの本文を示す。ただし、本文情報は、ドキュメントの格納場所を示す情報（例えば、ＵＲＬ：ＵｎｉｆｏｒｍＲｅｓｏｕｒｃｅＬｏｃａｔｏｒ）であってもよい。この場合、クライアント装置２０１は、本文情報が示す格納場所から本文を読み出すことで、本文を参照することができる。 The text information indicates the text of the document to be searched. However, the text information may be information indicating the storage location of the document (for example, URL: Uniform Resource Locator). In this case, the client device 201 can refer to the text by reading the text from the storage location indicated by the text information.

例えば、ドキュメント管理情報４００−１は、ドキュメントＤ１の製品名「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」、エディション「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」、ファミリ製品名「ｘｘｘ」、ＯＳ「Ｗｉｎｄｏｗｓ（登録商標）」および本文情報「・・・」を示す。 For example, the document management information 400-1 includes the product name "Interstage Charge Manager" of the document D1, the edition "Enterprise Edition", the family product name "xxx", the OS "Windows (registered trademark)", and the text information "...". Is shown.

また、ドキュメント管理情報４００−２は、ドキュメントＤ２の製品名「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」、エディション「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」、ファミリ製品名「ｘｘｘ」、ＯＳ「Ｌｉｎｕｘ（登録商標）」および本文情報「・・・」を示す。 In addition, the document management information 400-2 includes the product name "Interstage Charge Manager" of the document D2, the edition "Enterprise Edition", the family product name "xxx", the OS "Linux (registered trademark)", and the text information "...". Is shown.

図５は、製品属性テーブル２３０の記憶内容の一例を示す説明図である。図５において、製品属性テーブル２３０は、属性名および登録語のフィールドを有し、各フィールドに情報を設定することで、製品属性情報（例えば、製品属性情報５００−１〜５００−３）をレコードとして記憶する。 FIG. 5 is an explanatory diagram showing an example of the stored contents of the product attribute table 230. In FIG. 5, the product attribute table 230 has fields for attribute names and registered words, and by setting information in each field, product attribute information (for example, product attribute information 500-1 to 500-3) is recorded. Remember as.

ここで、属性名は、検索対象のドキュメントを特徴付ける属性の名称を示す。登録語は、属性名の属性に対応付けられた１又は複数のワードを含む語句である。例えば、属性名「製品名」の登録語としては、実際に存在する製品名が設定される。また、属性名「エディション」の登録語としては、実際に存在する製品のエディションが設定される。属性名「ファミリ製品名」の登録語としては、実際に存在する関連製品名が設定される。属性名「ＯＳ」の登録語としては、実際に存在するＯＳ名が設定される。 Here, the attribute name indicates the name of the attribute that characterizes the document to be searched. A registered word is a phrase containing one or more words associated with an attribute of an attribute name. For example, as a registered word of the attribute name "product name", a product name that actually exists is set. In addition, the edition of the product that actually exists is set as the registered word of the attribute name "edition". As the registered word of the attribute name "family product name", the related product name that actually exists is set. As the registered word of the attribute name "OS", the OS name that actually exists is set.

例えば、製品属性情報５００−１は、属性名「製品名」の登録語「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」を示す。 For example, the product attribute information 500-1 indicates the registered word “Interstage Charge Manager” of the attribute name “product name”.

（検索処理装置１０１の機能的構成例）
図６は、検索処理装置１０１の機能的構成例を示すブロック図である。図６において、検索処理装置１０１は、受付部６０１と、判定部６０２と、抽出部６０３と、特定部６０４と、出力部６０５と、記憶部６１０と、を含む。具体的には、例えば、受付部６０１〜出力部６０５は、図３に示したメモリ３０２、ディスク３０４、可搬型記録媒体３０７などの記憶装置に記憶されたプログラムをＣＰＵ３０１に実行させることにより、または、通信Ｉ／Ｆ３０５により、その機能を実現する。各機能部の処理結果は、例えば、メモリ３０２、ディスク３０４などの記憶装置に記憶される。記憶部６１０は、例えば、メモリ３０２、ディスク３０４などの記憶装置により実現される。具体的には、例えば、記憶部６１０は、図４に示したドキュメント管理ＤＢ２２０や、図５に示した製品属性テーブル２３０を記憶する。 (Example of Functional Configuration of Search Processing Device 101)
FIG. 6 is a block diagram showing a functional configuration example of the search processing device 101. In FIG. 6, the search processing device 101 includes a reception unit 601, a determination unit 602, an extraction unit 603, a specific unit 604, an output unit 605, and a storage unit 610. Specifically, for example, the reception units 601 to the output units 605 may cause the CPU 301 to execute a program stored in a storage device such as the memory 302, the disk 304, or the portable recording medium 307 shown in FIG. , Communication I / F 305 realizes the function. The processing result of each functional unit is stored in a storage device such as a memory 302 or a disk 304, for example. The storage unit 610 is realized by a storage device such as a memory 302 or a disk 304, for example. Specifically, for example, the storage unit 610 stores the document management DB 220 shown in FIG. 4 and the product attribute table 230 shown in FIG.

受付部６０１は、検索対象に対する検索のための複数のワードを受け付ける。ここで、複数のワードは、検索時にキーワードとして入力される２以上のワードである。複数のワードの入力は、例えば、図２に示したクライアント装置２０１において、入力装置（不図示）を用いたユーザの操作入力により行われる。 The reception unit 601 accepts a plurality of words for searching for the search target. Here, the plurality of words are two or more words that are input as keywords at the time of search. The input of the plurality of words is performed, for example, by the user's operation input using the input device (not shown) in the client device 201 shown in FIG.

具体的には、例えば、受付部６０１は、クライアント装置２０１から検索依頼を受信することにより、受信した検索依頼に含まれる入力文を受け付ける。入力文は、キーワードとして入力された複数のワードを含む。検索依頼は、例えば、後述の図１２に示す検索画面１２００において行われる。ただし、受付部６０１は、自装置の入力装置（不図示）を用いたユーザの操作入力により、複数のワードを受け付けることにしてもよい。 Specifically, for example, the reception unit 601 receives an input sentence included in the received search request by receiving the search request from the client device 201. The input sentence includes a plurality of words input as keywords. The search request is made, for example, on the search screen 1200 shown in FIG. 12, which will be described later. However, the reception unit 601 may accept a plurality of words by the user's operation input using the input device (not shown) of the own device.

判定部６０２は、受け付けた複数のワードの中に、記憶部６１０に予め記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと、一致又は類似する第１のワードが存在するか否かを判定する。ここで、検索対象の属性毎に対応付けられた１又は複数のワードは、当該属性に対応するキーワードとして予め登録された語句（登録語）に含まれる１又は複数のワードである。 Among the plurality of received words, the determination unit 602 includes a first word that matches or is similar to one or a plurality of words associated with each attribute of the search target stored in advance in the storage unit 610. Determine whether or not to do so. Here, the one or more words associated with each attribute to be searched are one or a plurality of words included in a word (registered word) registered in advance as a keyword corresponding to the attribute.

以下の説明では、ワードを「単語」と表記する場合がある。 In the following description, a word may be referred to as a "word".

具体的には、例えば、判定部６０２は、受け付けた入力文を単語（ワード）に分割する。この際、判定部６０２は、例えば、入力文から分割した単語の語順も保持する。これにより、検索対象に対する検索のための複数のワードが得られる。なお、単語に分割する処理は、例えば、辞書情報や形態素解析等の技術を利用して行うことができる。 Specifically, for example, the determination unit 602 divides the received input sentence into words. At this time, the determination unit 602 also holds, for example, the word order of the words divided from the input sentence. As a result, a plurality of words for searching for the search target can be obtained. The process of dividing into words can be performed by using techniques such as dictionary information and morphological analysis.

また、判定部６０２は、図５に示した製品属性テーブル２３０を参照して、検索対象の属性に対応付けられた登録語を単語に分割する。この際、判定部６０２は、例えば、登録語から分割した単語の語順も保持する。これにより、検索対象の属性に対応付けられた１又は複数のワードが得られる。 Further, the determination unit 602 divides the registered word associated with the attribute to be searched into words by referring to the product attribute table 230 shown in FIG. At this time, the determination unit 602 also holds, for example, the word order of the words divided from the registered words. As a result, one or more words associated with the attribute to be searched can be obtained.

つぎに、判定部６０２は、分割した単語毎に、入力文から分割した単語と、登録語から分割した単語とを比較する。属性に対応付けられた登録語は、複数の単語で構成されているものもある。このため、判定部６０２は、例えば、入力文に含まれる単語列と、登録語に含まれる単語列とを比較することにしてもよい。単語列は、複数の単語を語順どおりに並べたものである。 Next, the determination unit 602 compares the word divided from the input sentence with the word divided from the registered word for each divided word. Some registered words associated with attributes are composed of a plurality of words. Therefore, the determination unit 602 may, for example, compare the word string included in the input sentence with the word string included in the registered word. A word sequence is a sequence of a plurality of words arranged in word order.

ここで、比較した単語（あるいは、単語列）の文字列が完全一致した場合、判定部６０２は、単語（あるいは、単語列）が一致すると判定する。一方、比較した単語（あるいは、単語列）の文字列が完全一致しない場合には、判定部６０２は、単語（あるいは、単語列）が一致しないと判定する。 Here, when the character strings of the compared words (or word strings) completely match, the determination unit 602 determines that the words (or word strings) match. On the other hand, when the character strings of the compared words (or word strings) do not completely match, the determination unit 602 determines that the words (or word strings) do not match.

ただし、入力文は自由入力のため、入力時にスペルミスが発生する場合がある。ある程度のスペルミスを許容すべく、判定部６０２は、数文字程度の違いであれば、単語（あるいは、単語列）が類似すると判定することにしてもよい。許容する文字数は、単語（あるいは、単語列）の文字数に応じて定めることにしてもよい。 However, since the input text is free input, spelling mistakes may occur during input. In order to allow a certain amount of spelling mistakes, the determination unit 602 may determine that the words (or word strings) are similar if the difference is about several characters. The number of characters allowed may be determined according to the number of characters in the word (or word string).

例えば、４文字以上の単語については、許容する文字数を「２」とし、４文字未満の単語については、許容する文字数を「０」とする。この場合、判定部６０２は、４文字以上の単語については、比較した単語の文字列が完全一致しない場合であっても、不一致の文字数が２以下であれば、単語が類似すると判定する。また、判定部６０２は、４文字未満の単語については、不一致の文字数が「１」であっても、単語が類似しないと判定する。 For example, for words of 4 or more characters, the allowable number of characters is "2", and for words of less than 4 characters, the allowable number of characters is "0". In this case, the determination unit 602 determines that the words are similar for words having 4 or more characters, even if the character strings of the compared words do not exactly match, if the number of mismatched characters is 2 or less. Further, the determination unit 602 determines that the words having less than 4 characters are not similar even if the number of mismatched characters is "1".

また、比較した単語（あるいは、単語列）の文字列が完全一致しない場合、判定部６０２は、単語同士（あるいは、単語列同士）の編集距離を算出することにしてもよい。編集距離（レーベンシュタイン距離）は、二つの文字列がどの程度異なっているかを示す指標値である。そして、判定部６０２は、算出した編集距離が予め決められた閾値以下であれば、比較した単語（あるいは、単語列同士）の文字列が完全一致しない場合であっても、単語（あるいは、単語列同士）が類似すると判定することにしてもよい。 Further, when the character strings of the compared words (or word strings) do not completely match, the determination unit 602 may calculate the editing distance between the words (or the word strings). The edit distance (Levenshtein distance) is an index value indicating how different the two character strings are. Then, if the calculated editing distance is equal to or less than a predetermined threshold value, the determination unit 602 may use the word (or word) even if the character strings of the compared words (or word strings) do not completely match. It may be determined that the columns) are similar.

また、属性名（例えば、製品名）によっては、入力文において略語で表される場合がある。このため、判定部６０２は、例えば、略語と正式名との対応関係を示す辞書情報を使用して、略語を正式名に変換した上で、変換後の単語（正式名）と、登録語から分割した単語とを比較することにしてもよい。 Further, depending on the attribute name (for example, the product name), it may be represented by an abbreviation in the input sentence. Therefore, the determination unit 602 converts the abbreviation into a formal name by using, for example, dictionary information indicating the correspondence between the abbreviation and the formal name, and then uses the converted word (formal name) and the registered word. You may want to compare it with the divided words.

そして、判定部６０２は、入力文から分割した複数の単語（あるいは、単語列）のうち、登録語から分割した少なくともいずれかの単語（あるいは、単語列）と一致又は類似する単語（あるいは、単語列）を、第１のワードとして特定する。なお、単語の比較例については、図７を用いて後述する。 Then, the determination unit 602 matches or resembles at least one word (or word string) divided from the registered word among the plurality of words (or word strings) divided from the input sentence (or word). Column) is specified as the first word. A comparative example of words will be described later with reference to FIG. 7.

抽出部６０３は、第１のワードが存在すると判定された場合に、複数の検索対象の中から、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象群を抽出する。ここで、複数の検索対象は、例えば、企業内に蓄積されているドキュメントである。 When it is determined that the first word exists, the extraction unit 603 includes the first word in the attribute associated with the word that matches or is similar to the first word from among a plurality of search targets. Extract the search target group. Here, the plurality of search targets are, for example, documents accumulated in the company.

第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象とは、第１のワードと一致又は類似するワードと対応付けられている属性の属性値に、第１のワードを含む検索対象である。すなわち、第１のワードを、属性により検索対象を絞り込む際のキーワード（属性キーワード）として用いる。 In the attribute associated with the word matching or similar to the first word, the search target including the first word is the attribute value of the attribute associated with the word matching or similar to the first word. , Is a search target including the first word. That is, the first word is used as a keyword (attribute keyword) when narrowing down the search target by the attribute.

ただし、第１のワードには、スペルミスが含まれている場合がある。また、第１のワードは、略語で表されている場合がある。このため、抽出部６０３は、第１のワードと一致又は類似するワードと対応付けられている属性において、当該ワード（第１のワードと一致又は類似するワード）を含む検索対象群を抽出することにしてもよい。 However, the first word may contain misspellings. In addition, the first word may be represented by an abbreviation. Therefore, the extraction unit 603 extracts a search target group including the word (word that matches or is similar to the first word) in the attribute associated with the word that matches or is similar to the first word. It may be.

また、抽出部６０３は、第１のワードが存在すると判定された場合に、複数の検索対象の中から、第１のワードと一致又は類似するワードと対応付けられている属性において、当該属性に対応付けられた１又は複数のワードを含む検索対象群を抽出することにしてもよい。 Further, when it is determined that the first word exists, the extraction unit 603 sets the attribute in the attribute associated with the word that matches or is similar to the first word from among a plurality of search targets. A search target group including one or a plurality of associated words may be extracted.

より具体的には、例えば、抽出部６０３は、図４に示したドキュメント管理ＤＢ２２０を参照して、第１のワードと一致又は類似するワードと対応付けられている属性において、当該属性に対応付けられた登録語を含むドキュメント群を抽出する。 More specifically, for example, the extraction unit 603 refers to the document management DB 220 shown in FIG. 4 and associates it with the attribute in the attribute associated with the word that matches or is similar to the first word. Extract the document group including the registered words.

ここで、第１のワードと一致又は類似するワードと対応付けられている属性を属性「ＯＳ」とし、当該属性「ＯＳ」に対応付けられた登録語を「Ｗｉｎｄｏｗｓ」とする。この場合、判定部６０２は、ドキュメント管理ＤＢ２２０を参照して、属性「製品名」の属性値に、登録語「Ｗｉｎｄｏｗｓ」を含むドキュメント群（例えば、ドキュメントＤ１）を抽出する。 Here, the attribute associated with the word that matches or is similar to the first word is defined as the attribute "OS", and the registered word associated with the attribute "OS" is defined as "Windows". In this case, the determination unit 602 refers to the document management DB 220 and extracts a document group (for example, document D1) including the registered word "Windows" in the attribute value of the attribute "product name".

これにより、入力文からユーザが指定したい属性（属性値）を推定して、検索対象のドキュメントを絞り込むことができる。 As a result, the attribute (attribute value) that the user wants to specify can be estimated from the input sentence, and the documents to be searched can be narrowed down.

受け付けた複数のワードの中に、第１のワードが複数存在する場合がある。この場合、抽出部６０３は、例えば、複数の第１のワードの少なくともいずれかのワードと一致又は類似するワードと対応付けられている属性において、当該属性に対応付けられた登録語を含む検索対象群を抽出することにしてもよい。また、抽出部６０３は、複数の第１のワードの各ワードと一致又は類似するワードと対応付けられている各属性において、当該各属性に対応付けられた登録語を含む検索対象群を抽出することにしてもよい。 There may be a plurality of first words among the plurality of received words. In this case, the extraction unit 603 is, for example, a search target including a registered word associated with the attribute in an attribute associated with a word that matches or is similar to at least one of the plurality of first words. A group may be extracted. Further, the extraction unit 603 extracts a search target group including a registered word associated with each attribute in each attribute associated with a word that matches or is similar to each word of the plurality of first words. You may decide.

特定部６０４は、受け付けた複数のワードに含まれる第１のワード以外の第２のワードと、抽出された検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、検索対象群の中から検索対象を特定する。 The specific unit 604 sets the search target group based on the comparison result between the second word other than the first word included in the received plurality of words and the word included in the data of each of the extracted search target groups. Identify the search target from the inside.

具体的には、例えば、特定部６０４は、入力文のうちの第１ワード以外の単語を第２のワードとして特定する。また、特定部６０４は、ドキュメント管理ＤＢ２２０を参照して、抽出されたドキュメント群それぞれの本文情報を取得する。つぎに、特定部６０４は、特定した第２のワードと、ドキュメント群それぞれの本文情報に含まれるワードとを比較する。 Specifically, for example, the identification unit 604 specifies a word other than the first word in the input sentence as the second word. In addition, the specific unit 604 refers to the document management DB 220 and acquires the text information of each of the extracted document groups. Next, the specific unit 604 compares the specified second word with the word included in the text information of each document group.

そして、特定部６０４は、抽出されたドキュメント群の中から、第２のワードと一致するワードが本文情報に含まれるドキュメントを特定する。また、特定部６０４は、ドキュメント群の中から、第２のワードと類似するワードが本文情報に含まれるドキュメントを特定することにしてもよい。 Then, the identification unit 604 identifies a document in which the word matching the second word is included in the text information from the extracted document group. Further, the specifying unit 604 may specify a document in which a word similar to the second word is included in the text information from the document group.

すなわち、特定部６０４は、入力文のうちの第１ワード以外の単語（第２のワード）を検索キーワードとして、抽出されたドキュメント群の中からドキュメントを特定する。 That is, the identification unit 604 specifies a document from the extracted document group using a word other than the first word (second word) in the input sentence as a search keyword.

また、第２のワードが複数存在する場合がある。この場合、特定部６０４は、ドキュメント群の中から、複数の第２のワードの少なくともいずれかのワードと一致又は類似するワードが本文情報に含まれるドキュメントを特定することにしてもよい（ＯＲ検索）。また、特定部６０４は、ドキュメント群の中から、複数の第２のワードの各ワードと一致又は類似するワードが本文情報に全て含まれるドキュメントを特定することにしてもよい（ＡＮＤ検索）。 In addition, there may be a plurality of second words. In this case, the specific unit 604 may specify a document whose body information includes a word that matches or is similar to at least one of a plurality of second words from the document group (OR search). ). Further, the identification unit 604 may specify a document in which all the words that match or are similar to each word of the plurality of second words are included in the text information from the document group (AND search).

出力部６０５は、受け付けた複数のワードに対して、特定された検索対象を出力する。具体的には、例えば、出力部６０５は、クライアント装置２０１からの検索依頼に対する検索結果として、特定されたドキュメントを示す情報（例えば、タイトル、要約など）を出力する。この結果、クライアント装置２０１において、検索依頼に対する検索結果が、ディスプレイ（不図示）に表示される。 The output unit 605 outputs the specified search target for the plurality of received words. Specifically, for example, the output unit 605 outputs information (for example, a title, a summary, etc.) indicating the specified document as a search result for the search request from the client device 201. As a result, in the client device 201, the search result for the search request is displayed on the display (not shown).

また、出力部６０５は、受け付けた複数のワードに対して、特定された検索対象とともに、第１のワードと一致又は類似するワードと対応付けられている属性に対応付けられた語句を出力することにしてもよい。ここで、属性に対応付けられた語句は、属性に対応付けられた１又は複数のワードからなる語句であり、例えば、属性に対応付けられた登録語（図５参照）である。 Further, the output unit 605 outputs the words and phrases associated with the attributes associated with the first word and the words that match or are similar to the first word, together with the specified search target, for the received plurality of words. You may do it. Here, the phrase associated with the attribute is a phrase composed of one or a plurality of words associated with the attribute, and is, for example, a registered word associated with the attribute (see FIG. 5).

より具体的には、例えば、出力部６０５は、特定されたドキュメントを示す情報とともに、第１のワードと一致又は類似するワードと対応付けられている属性に対応付けられた登録語を表示する検索画面を、クライアント装置２０１に表示することにしてもよい。これにより、検索依頼に対する検索結果を表示する際にどのような属性で検索結果を絞り込んだのかを特定可能な情報を提示することができる。検索画面の画面例については、図１２を用いて後述する。 More specifically, for example, the output unit 605 is a search that displays the registered word associated with the attribute associated with the word that matches or is similar to the first word, together with the information indicating the specified document. The screen may be displayed on the client device 201. As a result, it is possible to present information that can identify the attributes used to narrow down the search results when displaying the search results for the search request. A screen example of the search screen will be described later with reference to FIG.

なお、上述した説明では、第１のワードが存在すると判定された場合に、抽出部６０３が、複数の検索対象の中から、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象群を抽出することにしたが、これに限らない。 In the above description, when it is determined that the first word exists, the extraction unit 603 has an attribute associated with a word that matches or is similar to the first word from among a plurality of search targets. In, it was decided to extract the search target group including the first word, but the present invention is not limited to this.

例えば、第１のワードが存在すると判定された場合に、抽出部６０３が、受け付けた複数のワードに含まれる第１のワード以外の第２のワードと、複数の検索対象それぞれのデータに含まれるワードとの比較結果に基づいて、複数の検索対象の中から検索対象群を抽出することにしてもよい。すなわち、抽出部６０３は、複数のワードのうちの第１ワード以外の第２のワードを検索キーワードとして、複数の検索対象の中から検索対象群を抽出する。そして、特定部６０４が、抽出された検索対象群の中から、第１のワードと一致又は類似するワードと対応付けられている属性において、第１のワードを含む検索対象を特定することにしてもよい。 For example, when it is determined that the first word exists, the extraction unit 603 is included in the second word other than the first word included in the received plurality of words and in the data of each of the plurality of search targets. A search target group may be extracted from a plurality of search targets based on the comparison result with the word. That is, the extraction unit 603 extracts a search target group from a plurality of search targets using a second word other than the first word among the plurality of words as a search keyword. Then, the specific unit 604 identifies the search target including the first word in the attribute associated with the word that matches or is similar to the first word from the extracted search target group. May be good.

（単語の比較例）
つぎに、入力文から分割した単語（ワード）と、検索対象の属性毎に対応付けられた登録語から分割した単語との比較例について説明する。 (Example of word comparison)
Next, a comparison example of a word divided from the input sentence and a word divided from the registered words associated with each attribute to be searched will be described.

図７は、単語の比較例を示す説明図である。図７において、入力文７１０は、クライアント装置２０１からの検索依頼に含まれる入力文の一例である。判定部６０２は、入力文７１０を単語に分割する。図７の例では、入力文７１０が、単語７１１（Ｃｈａｒｓｅｔ）、単語７１２（Ｅｎｔｅｒｐｒｉｓｅ）、単語７１３（Ｅｄｉｔｉｏｎ）、単語７１４（Ｗｉｎｄｏｗｓ）および単語７１５（インストール）に分割される。 FIG. 7 is an explanatory diagram showing a comparative example of words. In FIG. 7, the input sentence 710 is an example of the input sentence included in the search request from the client device 201. The determination unit 602 divides the input sentence 710 into words. In the example of FIG. 7, the input sentence 710 is divided into the word 711 (Charset), the word 712 (Enterprise), the word 713 (Edition), the word 714 (Windows), and the word 715 (installation).

また、判定部６０２は、製品属性テーブル２３０を参照して、検索対象の属性毎に、当該属性に対応付けられた登録語それぞれを単語に分割する。ここでは、属性を「製品名」、「エディション」、「ファミリ製品名」および「ＯＳ」とする。また、属性「製品名」に対応付けられた登録語を「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」と「Ｅｎｔｅｒｐｒｉｓｅ」とする。 Further, the determination unit 602 refers to the product attribute table 230 and divides each registered word associated with the attribute to be searched into a word for each attribute. Here, the attributes are "product name", "edition", "family product name", and "OS". Further, the registered words associated with the attribute "product name" are "Interstage Charge Manager" and "Enterpress".

この場合、判定部６０２は、登録語「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」を単語に分割する。ここでは、単語７２１（Ｉｎｔｅｒｓｔａｇｅ）、単語７２２（Ｃｈａｒｓｅｔ）および単語７２３（Ｍａｎａｇｅｒ）に分割される。登録語「Ｅｎｔｅｒｐｒｉｓｅ」については、１単語のため分割されず、単語７３１（Ｅｎｔｅｒｐｒｉｓｅ）となる。 In this case, the determination unit 602 divides the registered word "Interstage Charge Manager" into words. Here, it is divided into the word 721 (Interstage), the word 722 (Charset), and the word 723 (Manager). The registered word "Enterprise" is not divided because it is one word, and becomes the word 731 (Enterprise).

また、属性「エディション」に対応付けられた登録語を「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」とする。この場合、判定部６０２は、登録語「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」を単語に分割する。ここでは、単語７４１（Ｅｎｔｅｒｐｒｉｓｅ）および単語７４２（Ｅｄｉｔｉｏｎ）に分割される。 Further, the registered word associated with the attribute "edition" is defined as "Enterprise Edition". In this case, the determination unit 602 divides the registered word "Enterprise Edition" into words. Here, it is divided into the word 741 (Enterprise) and the word 742 (Edition).

また、属性「ファミリ製品名」に対応付けられた登録語を「開発パッケージｆｏｒＷｉｎｄｏｗｓ」とする。この場合、判定部６０２は、登録語「開発パッケージｆｏｒＷｉｎｄｏｗｓ」を単語に分割する。ここでは、単語７５１（開発）、単語７５２（パッケージ）、単語７５３（ｆｏｒ）および単語７５４（Ｗｉｎｄｏｗｓ）に分割される。また、属性「ＯＳ」に対応付けられた登録語を「Ｗｉｎｄｏｗｓ」とする。登録語「Ｗｉｎｄｏｗｓ」については、１単語のため分割されず、単語７６１（Ｗｉｎｄｏｗｓ）となる。 In addition, the registered word associated with the attribute "family product name" is defined as "development package for Windows". In this case, the determination unit 602 divides the registered word "development package for Windows" into words. Here, it is divided into word 751 (development), word 752 (package), word 753 (for) and word 754 (Windows). Further, the registered word associated with the attribute "OS" is defined as "Windows". The registered word "Windows" is not divided because it is one word, and becomes the word 761 (Windows).

つぎに、判定部６０２は、分割した単語毎に、入力文から分割した単語７１１〜７１５と、登録語から分割した単語７２１〜７２３，７３１，７４１，７４２，７５１〜７５４，７６１とを比較する。例えば、単語７１１について、単語７２２との比較結果は「一致」であり、それ以外の単語との比較結果は「不一致かつ非類似」である。 Next, the determination unit 602 compares the words 711 to 715 divided from the input sentence with the words 721 to 723, 731, 741, 742, 751 to 754, 761 divided from the registered words for each divided word. .. For example, for word 711, the comparison result with word 722 is "match", and the comparison result with other words is "mismatch and dissimilarity".

また、単語７１２について、単語７３１，７４１との比較結果は「一致」であり、それ以外の単語との比較結果は「不一致かつ非類似」である。また、単語７１３について、単語７４２との比較結果は「一致」であり、それ以外の単語との比較結果は「不一致かつ非類似」である。また、単語７１４について、単語７５４，７６１との比較結果は「一致」であり、それ以外の単語との比較結果は「不一致かつ非類似」である。また、単語７１５については、全単語との比較結果が「不一致かつ非類似」である。 Further, regarding the word 712, the comparison result with the words 731 and 741 is "match", and the comparison result with the other words is "disagreement and dissimilarity". Further, regarding the word 713, the comparison result with the word 742 is "match", and the comparison result with the other words is "disagreement and dissimilarity". Further, regarding the word 714, the comparison result with the words 754 and 761 is "match", and the comparison result with the other words is "mismatch and dissimilarity". Further, for the word 715, the comparison result with all the words is "mismatch and dissimilar".

比較の結果、入力文から分割した単語７１１〜７１５のうち、登録語から分割した少なくともいずれかの単語と一致又は類似すると判定された単語（第１のワード）は、例えば、図８に示すような属性候補テーブル８００に記憶される。属性候補テーブル８００は、例えば、メモリ３０２、ディスク３０４などの記憶装置により実現される。 As a result of comparison, among the words 711 to 715 divided from the input sentence, the word (first word) determined to match or be similar to at least one of the words divided from the registered words is, for example, as shown in FIG. It is stored in the attribute candidate table 800. The attribute candidate table 800 is realized by a storage device such as a memory 302 or a disk 304, for example.

図８は、属性候補テーブル８００の記憶内容の一例を示す説明図である。図８において、属性候補テーブル８００は、属性名、属性キーワードおよび登録語のフィールドを有し、各フィールドに情報を設定することで、属性候補情報８００−１〜８００−５がレコードとして記憶される。 FIG. 8 is an explanatory diagram showing an example of the stored contents of the attribute candidate table 800. In FIG. 8, the attribute candidate table 800 has fields of attribute name, attribute keyword, and registered word, and by setting information in each field, attribute candidate information 800-1 to 800-5 is stored as a record. ..

ここで、属性名は、検索対象のドキュメントを特徴付ける属性の名称を示す。属性キーワードは、入力文から分割した単語のうち、属性名の属性に対応付けられた登録語から分割した少なくともいずれかの単語と一致又は類似する単語（第１のワード）を示す。属性キーワードは、属性により検索対象を絞り込む際に用いるキーワードに相当する。登録語は、属性名の属性に対応付けられた登録語を示す。 Here, the attribute name indicates the name of the attribute that characterizes the document to be searched. The attribute keyword indicates a word (first word) that matches or is similar to at least one word divided from the registered word associated with the attribute of the attribute name among the words divided from the input sentence. Attribute keywords correspond to keywords used when narrowing down the search target by attribute. The registered word indicates a registered word associated with the attribute of the attribute name.

例えば、属性候補情報８００−１は、属性「製品名」の属性キーワード「Ｃｈａｒｓｅｔ」および登録語「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」を示す。属性候補テーブル８００によれば、入力文（例えば、入力文７１０）に含まれる単語のうち、検索対象の属性に対応付けられた登録語に含まれる単語と一致又は類似する単語（第１のワード）を特定することができる。 For example, the attribute candidate information 800-1 indicates the attribute keyword “Charset” of the attribute “product name” and the registered word “Interstage Charge Manager”. According to the attribute candidate table 800, among the words included in the input sentence (for example, the input sentence 710), the words that match or are similar to the words included in the registered words associated with the attribute to be searched (first word). ) Can be specified.

（属性キーワードに対応する属性の絞り込み）
ここで、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合がある。すなわち、属性キーワードに対応する属性が複数存在する場合がある。この場合、複数の属性の中から、属性キーワード（第１のワード）に対応する属性を選定することにしてもよい。 (Narrowing down the attributes corresponding to the attribute keywords)
Here, there may be a plurality of attributes associated with a word that matches or is similar to the first word. That is, there may be a plurality of attributes corresponding to the attribute keywords. In this case, the attribute corresponding to the attribute keyword (first word) may be selected from the plurality of attributes.

例えば、抽出部６０３は、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、受け付けた複数のワードのうち、各属性に対応付けられた１又は複数のワードと一致又は類似するワードの数を算出することにしてもよい。この際、抽出部６０３は、各属性について、受け付けた複数のワードのうち、各属性に対応付けられた１又は複数のワードと連続して一致又は類似するワードの数を算出することにしてもよい。そして、抽出部６０３は、複数の属性の中から、算出したワードの数が最大の属性を特定することにしてもよい。この場合、抽出部６０３は、特定した属性において、第１のワード（または、当該属性に対応付けられた登録語）を含む検索対象群を抽出する。 For example, when the extraction unit 603 has a plurality of attributes associated with a word that matches or is similar to the first word, for each attribute of the plurality of attributes, among the plurality of words received, each attribute The number of words that match or are similar to one or more associated words may be calculated. At this time, the extraction unit 603 may calculate the number of words that continuously match or are similar to one or more words associated with each attribute among the plurality of received words for each attribute. Good. Then, the extraction unit 603 may specify the attribute having the largest number of calculated words from the plurality of attributes. In this case, the extraction unit 603 extracts the search target group including the first word (or the registered word associated with the attribute) in the specified attribute.

より詳細に説明すると、例えば、抽出部６０３は、図８に示した属性候補テーブル８００を参照して、属性キーワード（第１のワード）と一致又は類似するワードと対応付けられている属性が複数存在するか否かを判断する。図８の例では、属性キーワード「Ｅｎｔｅｒｐｒｉｓｅ」は、属性「製品名」および属性「エディション」に対応している。 More specifically, for example, the extraction unit 603 refers to the attribute candidate table 800 shown in FIG. 8 and has a plurality of attributes associated with words that match or are similar to the attribute keyword (first word). Determine if it exists. In the example of FIG. 8, the attribute keyword "Enterprise" corresponds to the attribute "product name" and the attribute "edition".

この場合、抽出部６０３は、例えば、属性「製品名」および属性「エディション」の各属性について、入力文に含まれる単語のうち、各属性に対応付けられた登録語に含まれる単語と連続して一致又は類似する単語の数を算出する。ここで、図９を用いて、単語の数（単語長）の算出例について説明する。 In this case, for example, for each attribute of the attribute "product name" and the attribute "edition", the extraction unit 603 is continuous with the word included in the registered word associated with each attribute among the words included in the input sentence. Calculate the number of matching or similar words. Here, an example of calculating the number of words (word length) will be described with reference to FIG.

図９は、属性キーワードに対応する属性の絞り込み例を示す説明図（その１）である。抽出部６０３は、例えば、属性候補情報８００−２を参照して、入力文７１０（図７参照）に含まれる単語７１１〜７１５のうち、属性「製品名」に対応付けられた登録語「Ｅｎｔｅｒｐｒｉｓｅ」に含まれる単語７３１と連続して一致又は類似する単語の数（単語長）を算出する。ここでは、単語長「１」が算出される。 FIG. 9 is an explanatory diagram (No. 1) showing an example of narrowing down the attributes corresponding to the attribute keywords. For example, the extraction unit 603 refers to the attribute candidate information 800-2, and among the words 711 to 715 included in the input sentence 710 (see FIG. 7), the registered word "Enterprise" associated with the attribute "product name". The number of words (word length) that continuously match or are similar to the word 731 included in "" is calculated. Here, the word length "1" is calculated.

また、抽出部６０３は、例えば、属性候補情報８００−３を参照して、入力文７１０に含まれる単語７１１〜７１５のうち、属性「エディション」に対応付けられた登録語「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」に含まれる単語７４１，７４２と連続して一致又は類似する単語の数（単語長）を算出する。ここでは、単語長「２」が算出される。 Further, the extraction unit 603 is included in the registered word "Enterprise Edition" associated with the attribute "edition" among the words 711 to 715 included in the input sentence 710 with reference to the attribute candidate information 800-3, for example. The number of words (word length) that continuously match or are similar to the words 741 and 742 that are used is calculated. Here, the word length "2" is calculated.

そして、抽出部６０３は、属性「製品名」および属性「エディション」の中から、算出した単語長が最大の属性「エディション」を特定する。この場合、抽出部６０３は、属性候補テーブル８００から、属性候補情報８００−２を削除する。これにより、属性キーワード「Ｅｎｔｅｒｐｒｉｓｅ」に対応する属性を一つに絞り込むことができる。 Then, the extraction unit 603 specifies the attribute "edition" having the maximum calculated word length from the attribute "product name" and the attribute "edition". In this case, the extraction unit 603 deletes the attribute candidate information 800-2 from the attribute candidate table 800. As a result, the attributes corresponding to the attribute keyword "Enterprise" can be narrowed down to one.

また、属性キーワード「Ｗｉｎｄｏｗｓ」は、属性「ファミリ製品名」および属性「ＯＳ」に対応している。ただし、属性キーワード「Ｗｉｎｄｏｗｓ」については、属性「ファミリ製品名」および属性「ＯＳ」の各属性の単語長がともに「１」となり、単語長から属性を一つに絞り込むことができない。 In addition, the attribute keyword "Windows" corresponds to the attribute "family product name" and the attribute "OS". However, for the attribute keyword "Windows", the word length of each attribute of the attribute "family product name" and the attribute "OS" is "1", and the attribute cannot be narrowed down to one from the word length.

そこで、抽出部６０３は、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、各属性に対応付けられた１又は複数のワードのうち、受け付けた複数のワードに含まれるワードと一致又は類似するワードが占める文字列比を算出することにしてもよい。そして、抽出部６０３は、複数の属性の中から、算出した文字列比が最大の属性を特定することにしてもよい。この場合、抽出部６０３は、特定した属性において、第１のワード（または、当該属性に対応付けられた登録語）を含む検索対象群を抽出する。 Therefore, when there are a plurality of attributes associated with a word that matches or is similar to the first word, the extraction unit 603 may use one or a plurality of attributes associated with each attribute for each attribute of the plurality of attributes. Among the words, the character string ratio occupied by words that match or are similar to the words included in the plurality of accepted words may be calculated. Then, the extraction unit 603 may specify the attribute having the maximum calculated character string ratio from the plurality of attributes. In this case, the extraction unit 603 extracts the search target group including the first word (or the registered word associated with the attribute) in the specified attribute.

より詳細に説明すると、例えば、抽出部６０３は、属性「ファミリ製品名」および属性「ＯＳ」の各属性について、各属性に対応付けられた登録語のうち、入力文に含まれる単語と一致又は類似する単語が占める文字列比を算出する。ここで、図１０を用いて、文字列比の算出例を説明する。 More specifically, for example, the extraction unit 603 matches each attribute of the attribute "family product name" and the attribute "OS" with the word included in the input sentence among the registered words associated with each attribute. Calculate the character string ratio occupied by similar words. Here, an example of calculating the character string ratio will be described with reference to FIG.

図１０は、属性キーワードに対応する属性の絞り込み例を示す説明図（その２）である。抽出部６０３は、例えば、属性候補情報８００−４を参照して、属性「ファミリ製品名」に対応付けられた登録語「開発パッケージｆｏｒＷｉｎｄｏｗｓ」のうち、入力文７１０（図７参照）に含まれる単語７１１〜７１５と一致又は類似する単語が占める文字列比を算出する。 FIG. 10 is an explanatory diagram (No. 2) showing an example of narrowing down the attributes corresponding to the attribute keywords. The extraction unit 603 is included in the input sentence 710 (see FIG. 7) of the registered word “development package for Windows” associated with the attribute “family product name” with reference to the attribute candidate information 800-4, for example. Calculate the character string ratio occupied by words that match or are similar to the words 711 to 715.

ここでは、登録語「開発パッケージｆｏｒＷｉｎｄｏｗｓ」のうち、単語７５４（Ｗｉｎｄｏｗｓ）が、入力文７１０に含まれる単語７１４と一致するため、文字列比「７／１９」が算出される。分母の「１９」は、登録語「開発パッケージｆｏｒＷｉｎｄｏｗｓ」の文字数である。分子の「７」は、「Ｗｉｎｄｏｗｓ」の文字数である。 Here, since the word 754 (Windows) in the registered word "development package for Windows" matches the word 714 included in the input sentence 710, the character string ratio "7/19" is calculated. The denominator "19" is the number of characters in the registered word "development package for Windows". The "7" in the numerator is the number of characters in "Windows".

また、抽出部６０３は、例えば、属性候補情報８００−５を参照して、属性「ＯＳ」に対応付けられた登録語「Ｗｉｎｄｏｗｓ」のうち、入力文７１０に含まれる単語７１１〜７１５と一致又は類似する単語が占める文字列比を算出する。ここでは、登録語「Ｗｉｎｄｏｗｓ」が、入力文７１０に含まれる単語７１４と一致するため、文字列比「７／７」が算出される。 Further, the extraction unit 603 refers to, for example, the attribute candidate information 800-5, and matches or matches the words 711 to 715 included in the input sentence 710 among the registered words "Windows" associated with the attribute "OS". Calculate the string ratio occupied by similar words. Here, since the registered word "Windows" matches the word 714 included in the input sentence 710, the character string ratio "7/7" is calculated.

そして、抽出部６０３は、属性「ファミリ製品名」および属性「ＯＳ」の中から、算出した文字列比が最大の属性「ＯＳ」を特定する。この場合、抽出部６０３は、属性候補テーブル８００から、属性候補情報８００−４を削除する。これにより、属性キーワード「Ｗｉｎｄｏｗｓ」に対応する属性を一つに絞り込むことができる。 Then, the extraction unit 603 specifies the attribute "OS" having the maximum calculated character string ratio from the attribute "family product name" and the attribute "OS". In this case, the extraction unit 603 deletes the attribute candidate information 800-4 from the attribute candidate table 800. As a result, the attributes corresponding to the attribute keyword "Windows" can be narrowed down to one.

なお、抽出部６０３は、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、予め決められた属性毎の優先度に従って、複数の属性のうちの最も優先度が高い属性を特定することにしてもよい。 When there are a plurality of attributes associated with a word that matches or is similar to the first word, the extraction unit 603 has the highest priority among the plurality of attributes according to a predetermined priority for each attribute. You may want to identify attributes with high.

また、属性キーワードとして使用しない単語を、ストップワードとして予め登録しておくことにしてもよい。例えば、多くの登録語に含まれるような単語を、ストップワードとして予め登録しておくことにしてもよい。この場合、判定部６０２は、入力文の中に、属性に対応付けられた登録語に含まれる単語と一致又は類似する単語が存在しても、その単語がストップワードであれば、第１のワードとして採用しないことにしてもよい。これにより、多くの登録語がヒットして、検索結果に含まれるノイズの数が増えるのを防ぐことができる。 Further, a word that is not used as an attribute keyword may be registered in advance as a stop word. For example, a word included in many registered words may be registered in advance as a stop word. In this case, even if the input sentence contains a word that matches or is similar to the word included in the registered word associated with the attribute, if the word is a stop word, the determination unit 602 first It may not be adopted as a ward. As a result, it is possible to prevent many registered words from being hit and increasing the number of noises included in the search results.

（ドキュメントの検索例）
つぎに、図１１を用いて、検索対象のドキュメントの検索例について説明する。 (Document search example)
Next, a search example of the document to be searched will be described with reference to FIG.

図１１は、ドキュメントの検索例を示す説明図である。図１１において、抽出部６０３は、例えば、図１０に示した属性候補テーブル８００を参照して、入力文７１０に含まれる単語７１１〜７１５のうちの属性キーワードを特定する。ここでは、単語７１１〜７１４が属性キーワードとして特定される。 FIG. 11 is an explanatory diagram showing a document search example. In FIG. 11, the extraction unit 603 specifies, for example, the attribute keyword among the words 711 to 715 included in the input sentence 710 with reference to the attribute candidate table 800 shown in FIG. Here, the words 711 to 714 are specified as attribute keywords.

つぎに、抽出部６０３は、属性候補テーブル８００を参照して、特定した属性キーワードに対応する登録語を特定する。ここでは、属性キーワード（単語７１１）に対応する登録語として、属性「製品名」の登録語「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」が特定される。また、属性キーワード（単語７１２，７１３）に対応する登録語として、属性「エディション」の登録語「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」が特定される。属性キーワード（単語７１４）に対応する登録語として、属性「ＯＳ」の登録語「Ｗｉｎｄｏｗｓ」が特定される。 Next, the extraction unit 603 refers to the attribute candidate table 800 and identifies the registered word corresponding to the specified attribute keyword. Here, as the registered word corresponding to the attribute keyword (word 711), the registered word "Interstage Charge Manager" of the attribute "product name" is specified. Further, as a registered word corresponding to the attribute keyword (words 712, 713), the registered word "Enterprise Edition" of the attribute "edition" is specified. As a registered word corresponding to the attribute keyword (word 714), the registered word "Windows" of the attribute "OS" is specified.

そして、抽出部６０３は、ドキュメント管理ＤＢ２２０を参照して、下記（ｉ）、（ｉｉ）および（ｉｉｉ）の少なくともいずれかの条件を満たすドキュメント群を抽出する。また、抽出部６０３は、ドキュメント管理ＤＢ２２０を参照して、下記（ｉ）、（ｉｉ）および（ｉｉｉ）の条件を全て満たすドキュメント群を抽出することにしてもよい。 Then, the extraction unit 603 extracts a document group satisfying at least one of the following (i), (ii), and (iii) with reference to the document management DB 220. Further, the extraction unit 603 may refer to the document management DB 220 and extract a document group that satisfies all of the following conditions (i), (ii), and (iii).

（ｉ）属性「製品名」の属性値に、登録語「ＩｎｔｅｒｓｔａｇｅＣｈａｒｓｅｔＭａｎａｇｅｒ」を含む。 (I) The attribute value of the attribute "product name" includes the registered word "Interstage Charge Manager".

（ｉｉ）属性「エディション」の属性値に、登録語「ＥｎｔｅｒｐｒｉｓｅＥｄｉｔｉｏｎ」を含む。 (Ii) The registered word "Enterprise Edition" is included in the attribute value of the attribute "edition".

（ｉｉｉ）属性「ＯＳ」の属性値に、登録語「Ｗｉｎｄｏｗｓ」を含む。 (Iii) The registered word "Windows" is included in the attribute value of the attribute "OS".

図１１の例では、上記（ｉ）、（ｉｉ）および（ｉｉｉ）の条件を全て満たすドキュメント群１１１０が抽出された場合を想定する。 In the example of FIG. 11, it is assumed that the document group 1110 satisfying all the above conditions (i), (ii) and (iii) is extracted.

つぎに、特定部６０４は、入力文７１０に含まれる単語７１１〜７１５のうちの属性キーワード以外の単語（第２のワード）を検索キーワードとして特定する。ここでは、単語７１５が検索キーワードとして特定される。そして、特定部６０４は、特定した検索キーワードと、抽出されたドキュメント群１１１０それぞれの本文情報に含まれるワードとの比較結果に基づいて、ドキュメント群１１１０の中から、検索対象のドキュメントを特定する。 Next, the specifying unit 604 specifies a word (second word) other than the attribute keyword among the words 711 to 715 included in the input sentence 710 as a search keyword. Here, the word 715 is specified as a search keyword. Then, the specific unit 604 identifies the document to be searched from the document group 1110 based on the comparison result between the specified search keyword and the word included in the text information of each of the extracted document group 1110.

図１１の例では、ドキュメント群１１１０の中から、本文情報に検索キーワード「インストール」を含むドキュメント集合１１２０が特定されている。ドキュメント集合１１２０は、ユーザが欲しい属性を持つドキュメント群１１１０を絞り込んだ上で全文検索を行った結果であり、ノイズの少ない検索結果であるといえる。 In the example of FIG. 11, a document set 1120 including the search keyword “install” in the text information is specified from the document group 1110. The document set 1120 is the result of performing a full-text search after narrowing down the document group 1110 having the attributes desired by the user, and can be said to be a search result with less noise.

（検索画面の画面例）
つぎに、図１２を用いて、クライアント装置２０１に表示される検索画面の画面例について説明する。 (Screen example of search screen)
Next, a screen example of the search screen displayed on the client device 201 will be described with reference to FIG.

図１２は、検索画面の画面例を示す説明図である。図１２において、検索画面１２００は、ボックス１２０１に入力された入力文を含む検索依頼に対する検索結果１２１０を表示する操作画面の一例である。検索結果１２１０は、検索対象情報１２１１〜１２１３を含む。 FIG. 12 is an explanatory diagram showing a screen example of the search screen. In FIG. 12, the search screen 1200 is an example of an operation screen for displaying the search result 1210 for the search request including the input sentence input in the box 1201. The search result 1210 includes search target information 121 to 1213.

検索対象情報１２１１〜１２１３は、例えば、検索対象のドキュメントのタイトルや要約を示す。検索画面１２００において、入力装置（不図示）を用いたユーザの操作入力により、検索対象情報１２１１〜１２１３のいずれかを選択すると、ドキュメントの本文情報を閲覧することができる。 The search target information 121 to 1213 indicates, for example, the title or summary of the document to be searched. On the search screen 1200, the text information of the document can be browsed by selecting any of the search target information 1211 to 1213 by the user's operation input using the input device (not shown).

また、検索画面１２００には、属性情報１２２０が表示されている。属性情報１２２０は、入力文に含まれる属性キーワード（第１のワード）と一致又は類似するワードと対応付けられている属性に対応付けられた登録語を示す。属性情報１２２０によれば、ユーザは、どのような属性で検索結果が絞り込まれたのかを特定することができる。 Further, the attribute information 1220 is displayed on the search screen 1200. The attribute information 1220 indicates a registered word associated with an attribute associated with a word that matches or is similar to the attribute keyword (first word) included in the input sentence. According to the attribute information 1220, the user can specify what kind of attribute the search result is narrowed down.

（検索処理装置１０１の検索処理手順）
つぎに、図１３および図１４を用いて、検索処理装置１０１の検索処理手順について説明する。 (Search processing procedure of the search processing device 101)
Next, the search processing procedure of the search processing device 101 will be described with reference to FIGS. 13 and 14.

図１３および図１４は、検索処理装置１０１の検索処理手順の一例を示すフローチャートである。図１３のフローチャートにおいて、まず、検索処理装置１０１は、クライアント装置２０１から検索依頼を受信したか否かを判断する（ステップＳ１３０１）。ここで、検索処理装置１０１は、検索依頼を受信するのを待つ（ステップＳ１３０１：Ｎｏ）。 13 and 14 are flowcharts showing an example of the search processing procedure of the search processing device 101. In the flowchart of FIG. 13, first, the search processing device 101 determines whether or not a search request has been received from the client device 201 (step S1301). Here, the search processing device 101 waits for receiving the search request (step S1301: No).

そして、検索処理装置１０１は、検索依頼を受信した場合（ステップＳ１３０１：Ｙｅｓ）、受信した検索依頼に含まれる入力文を単語（ワード）に分割する（ステップＳ１３０２）。つぎに、検索処理装置１０１は、製品属性テーブル２３０に属性名が登録されている属性のうち選択されていない未選択の属性を選択する（ステップＳ１３０３）。 Then, when the search processing device 101 receives the search request (step S1301: Yes), the search processing device 101 divides the input sentence included in the received search request into words (step S1302). Next, the search processing device 101 selects an unselected attribute among the attributes whose attribute names are registered in the product attribute table 230 (step S1303).

つぎに、検索処理装置１０１は、製品属性テーブル２３０を参照して、選択した属性の登録語のうち選択されていない未選択の登録語を選択する（ステップＳ１３０４）。そして、検索処理装置１０１は、選択した登録語を単語に分割する（ステップＳ１３０５）。つぎに、検索処理装置１０１は、入力文から分割した単語と、登録語から分割した単語とを比較するマッチング処理を実行する（ステップＳ１３０６）。 Next, the search processing device 101 refers to the product attribute table 230 and selects an unselected registered word among the registered words of the selected attribute (step S1304). Then, the search processing device 101 divides the selected registered word into words (step S1305). Next, the search processing device 101 executes a matching process of comparing the word divided from the input sentence with the word divided from the registered word (step S1306).

なお、マッチング処理の具体的な処理手順については、図１５および図１６を用いて後述する。 The specific processing procedure of the matching process will be described later with reference to FIGS. 15 and 16.

そして、検索処理装置１０１は、選択した属性に対応する属性キーワード検出フラグを参照して、属性キーワードがあるか否かを判断する（ステップＳ１３０７）。属性キーワードは、入力文から分割した単語のうち、登録語から分割した１又は複数の単語と、一致又は類似する単語（第１のワード）である。 Then, the search processing device 101 refers to the attribute keyword detection flag corresponding to the selected attribute, and determines whether or not there is an attribute keyword (step S1307). The attribute keyword is a word (first word) that matches or is similar to one or more words divided from the registered word among the words divided from the input sentence.

属性キーワード検出フラグは、入力文中から、登録語の属性キーワードが検出されたか否かを示すフラグである。属性キーワード検出フラグは、入力文中に、登録語を構成する単語が含まれていた場合、「ｔｒｕｅ」となり、登録語を構成する単語が含まれていなかった場合、「ｆａｌｓｅ」となる。 The attribute keyword detection flag is a flag indicating whether or not the attribute keyword of the registered word is detected in the input sentence. The attribute keyword detection flag is "true" when the input sentence contains a word constituting the registered word, and is "false" when the word constituting the registered word is not included.

ここで、属性キーワードがない場合（ステップＳ１３０７：Ｎｏ）、検索処理装置１０１は、ステップＳ１３０９に移行する。一方、属性キーワードがある場合（ステップＳ１３０７：Ｙｅｓ）、選択した属性の属性名と対応付けて、属性キーワードおよび当該属性キーワードと一致又は類似する単語を含む登録語を、属性候補テーブル８００に登録する（ステップＳ１３０８）。なお、属性キーワードは、選択した属性に対応する属性キーワードの単語リストに登録された単語である。 Here, if there is no attribute keyword (step S1307: No), the search processing device 101 shifts to step S1309. On the other hand, when there is an attribute keyword (step S1307: Yes), the attribute keyword and a registered word including a word that matches or is similar to the attribute keyword are registered in the attribute candidate table 800 in association with the attribute name of the selected attribute. (Step S1308). The attribute keyword is a word registered in the word list of the attribute keyword corresponding to the selected attribute.

つぎに、検索処理装置１０１は、製品属性テーブル２３０を参照して、選択した属性の登録語のうち選択されていない未選択の登録語があるか否かを判断する（ステップＳ１３０９）。ここで、未選択の登録語がある場合（ステップＳ１３０９：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１３０４に戻る。 Next, the search processing device 101 refers to the product attribute table 230 and determines whether or not there is an unselected registered word among the registered words of the selected attribute (step S1309). Here, if there is an unselected registered word (step S1309: Yes), the search processing device 101 returns to step S1304.

一方、未選択の登録語がない場合（ステップＳ１３０９：Ｎｏ）、検索処理装置１０１は、製品属性テーブル２３０に属性名が登録されている属性のうち選択されていない未選択の属性があるか否かを判断する（ステップＳ１３１０）。ここで、未選択の属性がある場合（ステップＳ１３１０：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１３０３に戻る。 On the other hand, when there is no unselected registered word (step S1309: No), the search processing device 101 determines whether or not there is an unselected attribute among the attributes whose attribute name is registered in the product attribute table 230. (Step S1310). Here, if there is an unselected attribute (step S1310: Yes), the search processing device 101 returns to step S1303.

一方、未選択の属性がない場合（ステップＳ１３１０：Ｎｏ）、検索処理装置１０１は、図１４に示すステップＳ１４０１に移行する。 On the other hand, when there is no unselected attribute (step S1310: No), the search processing device 101 shifts to step S1401 shown in FIG.

図１４のフローチャートにおいて、まず、検索処理装置１０１は、属性候補テーブル８００を参照して、属性キーワードに対応する属性が複数存在するか否かを判断する（ステップＳ１４０１）。ここで、複数の属性が存在しない場合（ステップＳ１４０１：Ｎｏ）、検索処理装置１０１は、ステップＳ１４０３に移行する。 In the flowchart of FIG. 14, first, the search processing device 101 refers to the attribute candidate table 800 and determines whether or not a plurality of attributes corresponding to the attribute keywords exist (step S1401). Here, when the plurality of attributes do not exist (step S1401: No), the search processing device 101 shifts to step S1403.

一方、属性キーワードに対応する属性が複数存在する場合（ステップＳ１４０１：Ｙｅｓ）、検索処理装置１０１は、複数の属性の中から、属性キーワードに対応する属性を特定する（ステップＳ１４０２）。属性の特定には、例えば、図９で説明した単語長や、図１０で説明した文字列比が用いられる。 On the other hand, when a plurality of attributes corresponding to the attribute keywords exist (step S1401: Yes), the search processing device 101 identifies the attribute corresponding to the attribute keyword from the plurality of attributes (step S1402). For the identification of the attribute, for example, the word length described in FIG. 9 and the character string ratio described in FIG. 10 are used.

つぎに、検索処理装置１０１は、入力文から属性キーワード以外の単語を検索キーワードとして特定する（ステップＳ１４０３）。そして、検索処理装置１０１は、属性キーワードに対応する属性に対応付けられた登録語と、特定した検索キーワードとに基づいて、検索条件を作成する（ステップＳ１４０４）。属性キーワードに対応する属性に対応付けられた登録語は、絞込条件となる登録語であり、属性候補テーブル８００から特定される。なお、検索条件の具体例については、図１９を用いて後述する。 Next, the search processing device 101 identifies a word other than the attribute keyword from the input sentence as a search keyword (step S1403). Then, the search processing device 101 creates a search condition based on the registered word associated with the attribute corresponding to the attribute keyword and the specified search keyword (step S1404). The registered word associated with the attribute corresponding to the attribute keyword is a registered word that is a narrowing condition, and is specified from the attribute candidate table 800. A specific example of the search condition will be described later with reference to FIG.

つぎに、検索処理装置１０１は、作成した検索条件に基づいて、検索対象のドキュメントを検索する検索処理を実行する（ステップＳ１４０５）。そして、検索処理装置１０１は、検索結果を出力して（ステップＳ１４０６）、本フローチャートによる一連の処理を終了する。検索結果は、後述する検索結果格納リストに格納されたレコードを示す情報である。 Next, the search processing device 101 executes a search process for searching the document to be searched based on the created search condition (step S1405). Then, the search processing device 101 outputs a search result (step S1406), and ends a series of processes according to this flowchart. The search result is information indicating a record stored in the search result storage list described later.

これにより、属性を指定しない状態で入力文（検索キーワード）の入力を受け付けても、属性を反映した検索を実行することができる。 As a result, even if the input of the input sentence (search keyword) is accepted without specifying the attribute, the search reflecting the attribute can be executed.

つぎに、図１５および図１６を用いて、図１３に示したステップＳ１３０６のマッチング処理の具体的な処理手順について説明する。 Next, a specific processing procedure of the matching process of step S1306 shown in FIG. 13 will be described with reference to FIGS. 15 and 16.

図１５および図１６は、マッチング処理の具体的な処理手順の一例を示すフローチャートである。図１５のフローチャートにおいて、まず、検索処理装置１０１は、図１３に示したステップＳ１３０３において選択した属性に対応する属性キーワード検出フラグを「ｆａｌｓｅ」に設定する（ステップＳ１５０１）。 15 and 16 are flowcharts showing an example of a specific processing procedure of the matching process. In the flowchart of FIG. 15, first, the search processing device 101 sets the attribute keyword detection flag corresponding to the attribute selected in step S1303 shown in FIG. 13 to “false” (step S1501).

つぎに、検索処理装置１０１は、選択した属性に対応する属性キーワードの単語リストを初期化する（ステップＳ１５０２）。そして、検索処理装置１０１は、「ｉ」を「ｉ＝０」で初期化し（ステップＳ１５０３）、「ｊ」を「ｊ＝０」で初期化する（ステップＳ１５０４）。 Next, the search processing device 101 initializes the word list of the attribute keywords corresponding to the selected attributes (step S1502). Then, the search processing device 101 initializes "i" with "i = 0" (step S1503) and initializes "j" with "j = 0" (step S1504).

つぎに、検索処理装置１０１は、図１３に示したステップＳ１３０４において選択した登録語のｉ番目の単語と、入力文のｊ番目の単語とを比較する（ステップＳ１５０５）。ただし、最初の単語を０番目の単語とする。そして、検索処理装置１０１は、登録語のｉ番目の単語と、入力文のｊ番目の単語とがマッチしたか否かを判断する（ステップＳ１５０６）。なお、単語がマッチするとは、単語が一致又は類似することである。 Next, the search processing device 101 compares the i-th word of the registered word selected in step S1304 shown in FIG. 13 with the j-th word of the input sentence (step S1505). However, the first word is the 0th word. Then, the search processing device 101 determines whether or not the i-th word of the registered word and the j-th word of the input sentence match (step S1506). Note that a word match means that the words match or are similar.

ここで、単語がマッチしない場合（ステップＳ１５０６：Ｎｏ）、検索処理装置１０１は、図１６に示すステップＳ１６０５に移行する。一方、単語がマッチする場合（ステップＳ１５０６：Ｙｅｓ）、検索処理装置１０１は、選択した属性に対応する属性キーワード検出フラグが「ｆａｌｓｅ」であるか否かを判断する（ステップＳ１５０７）。 Here, if the words do not match (step S1506: No), the search processing device 101 shifts to step S1605 shown in FIG. On the other hand, when the words match (step S1506: Yes), the search processing device 101 determines whether or not the attribute keyword detection flag corresponding to the selected attribute is "false" (step S1507).

ここで、属性キーワード検出フラグが「ｔｒｕｅ」の場合（ステップＳ１５０７：Ｎｏ）、検索処理装置１０１は、ステップＳ１５０９に移行する。一方、属性キーワード検出フラグが「ｆａｌｓｅ」の場合（ステップＳ１５０７：Ｙｅｓ）、検索処理装置１０１は、属性キーワード検出フラグに「ｔｒｕｅ」を設定する（ステップＳ１５０８）。 Here, when the attribute keyword detection flag is "true" (step S1507: No), the search processing device 101 shifts to step S1509. On the other hand, when the attribute keyword detection flag is "false" (step S1507: Yes), the search processing device 101 sets the attribute keyword detection flag to "true" (step S1508).

そして、検索処理装置１０１は、選択した属性に対応する属性キーワードの単語リストの末尾に、入力文のｊ番目の単語を格納して（ステップＳ１５０９）、図１６に示すステップＳ１６０１に移行する。 Then, the search processing device 101 stores the j-th word of the input sentence at the end of the word list of the attribute keywords corresponding to the selected attribute (step S1509), and proceeds to step S1601 shown in FIG.

図１６のフローチャートにおいて、まず、検索処理装置１０１は、「ｉ」を「ｉ＝ｉ＋１」とし（ステップＳ１６０１）、「ｊ」を「ｊ＝ｊ＋１」とする（ステップＳ１６０２）。そして、検索処理装置１０１は、「ｊ」が入力文を構成する単語数以上となったか否かを判断する（ステップＳ１６０３）。 In the flowchart of FIG. 16, first, the search processing device 101 sets “i” to “i = i + 1” (step S1601) and “j” to “j = j + 1” (step S1602). Then, the search processing device 101 determines whether or not "j" is equal to or greater than the number of words constituting the input sentence (step S1603).

ここで、「ｊ」が入力文を構成する単語数以上となった場合（ステップＳ１６０３：Ｙｅｓ）、検索処理装置１０１は、図１５に示したステップＳ１５０４に戻る。一方、「ｊ」が入力文を構成する単語数未満の場合（ステップＳ１６０３：Ｎｏ）、検索処理装置１０１は、「ｉ」が登録語を構成する単語数以上となったか否かを判断する（ステップＳ１６０４）。 Here, when "j" is equal to or greater than the number of words constituting the input sentence (step S1603: Yes), the search processing device 101 returns to step S1504 shown in FIG. On the other hand, when "j" is less than the number of words constituting the input sentence (step S1603: No), the search processing device 101 determines whether or not "i" is equal to or more than the number of words constituting the registered word (step S1603: No). Step S1604).

ここで、「ｉ」が登録語を構成する単語数未満の場合（ステップＳ１６０４：Ｎｏ）、検索処理装置１０１は、図１５に示したステップＳ１５０５に戻る。一方、「ｉ」が登録語を構成する単語数以上の場合（ステップＳ１６０４：Ｙｅｓ）、検索処理装置１０１は、マッチング処理を呼び出したステップに戻る。 Here, when "i" is less than the number of words constituting the registered word (step S1604: No), the search processing device 101 returns to step S1505 shown in FIG. On the other hand, when "i" is equal to or greater than the number of words constituting the registered word (step S1604: Yes), the search processing device 101 returns to the step in which the matching process is called.

また、検索処理装置１０１は、図１５に示したステップＳ１５０６から移行した場合には、「ｉ」を「ｉ＝ｉ＋１」とする（ステップＳ１６０５）。そして、検索処理装置１０１は、「ｊ」を「ｊ＝０」で初期化して（ステップＳ１６０６）、ステップＳ１６０４に移行する。 Further, when the search processing device 101 shifts from step S1506 shown in FIG. 15, “i” is set to “i = i + 1” (step S1605). Then, the search processing device 101 initializes "j" with "j = 0" (step S1606), and proceeds to step S1604.

これにより、入力文から分割された単語と登録語から分割された単語とを比較して、マッチする単語を、属性に対応する属性キーワードの単語リストに登録することができる。 As a result, the word divided from the input sentence and the word divided from the registered word can be compared, and the matching word can be registered in the word list of the attribute keyword corresponding to the attribute.

つぎに、図１７および図１８を用いて、図１４に示したステップＳ１４０５の検索処理の具体的な処理手順について説明する。 Next, a specific processing procedure of the search process of step S1405 shown in FIG. 14 will be described with reference to FIGS. 17 and 18.

図１７および図１８は、検索処理の具体的な処理手順の一例を示すフローチャートである。図１７のフローチャートにおいて、まず、検索処理装置１０１は、絞込合致レコード格納リストを初期化する（ステップＳ１７０１）。つぎに、検索処理装置１０１は、ドキュメント管理ＤＢ２２０から選択されていない未選択のレコードを選択する（ステップＳ１７０２）。 17 and 18 are flowcharts showing an example of a specific processing procedure of the search process. In the flowchart of FIG. 17, first, the search processing device 101 initializes the refined match record storage list (step S1701). Next, the search processing device 101 selects an unselected record that has not been selected from the document management DB 220 (step S1702).

つぎに、検索処理装置１０１は、検索対象のドキュメントを特徴付ける属性のうち選択されていない未選択の属性を選択する（ステップＳ１７０３）。つぎに、検索処理装置１０１は、図１４に示したステップＳ１４０４において作成された検索条件（または、属性候補テーブル８００）を参照して、選択した属性に対応する登録語のうち選択されていない未選択の登録語を、絞込条件の登録語として選択する（ステップＳ１７０４）。 Next, the search processing device 101 selects an unselected attribute that is not selected from the attributes that characterize the document to be searched (step S1703). Next, the search processing device 101 refers to the search condition (or the attribute candidate table 800) created in step S1404 shown in FIG. 14, and is not selected among the registered words corresponding to the selected attribute. The selected registered word is selected as the registered word of the narrowing condition (step S1704).

以下の説明では、ステップＳ１７０２において選択されたレコードを「対象レコード」と表記し、ステップＳ１７０３において選択された属性を「対象属性」と表記する場合がある。 In the following description, the record selected in step S1702 may be referred to as "target record", and the attribute selected in step S1703 may be referred to as "target attribute".

そして、検索処理装置１０１は、選択した絞込条件の登録語と、対象レコードの対象属性の属性値とが一致するか否かを判断する（ステップＳ１７０５）。ここで、絞込条件の登録語と対象レコードの属性値とが一致しない場合（ステップＳ１７０５：Ｎｏ）、検索処理装置１０１は、ステップＳ１７０７に移行する。 Then, the search processing device 101 determines whether or not the registered word of the selected narrowing-down condition and the attribute value of the target attribute of the target record match (step S1705). Here, when the registered word of the narrowing-down condition and the attribute value of the target record do not match (step S1705: No), the search processing device 101 proceeds to step S1707.

一方、絞込条件の登録語と対象レコードの属性値とが一致する場合（ステップＳ１７０５：Ｙｅｓ）、検索処理装置１０１は、絞込合致レコード格納リストに対象レコードを格納する（ステップＳ１７０６）。そして、検索処理装置１０１は、対象属性に対応する登録語のうち選択されていない未選択の登録語があるか否かを判断する（ステップＳ１７０７）。 On the other hand, when the registered word of the narrowing down condition and the attribute value of the target record match (step S1705: Yes), the search processing device 101 stores the target record in the narrowing down matching record storage list (step S1706). Then, the search processing device 101 determines whether or not there is an unselected registered word among the registered words corresponding to the target attribute (step S1707).

ここで、未選択の登録語がある場合（ステップＳ１７０７：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１７０４に戻る。一方、未選択の登録語がない場合（ステップＳ１７０７：Ｎｏ）、検索処理装置１０１は、検索対象のドキュメントを特徴付ける属性のうち選択されていない未選択の属性があるか否かを判断する（ステップＳ１７０８）。 Here, if there is an unselected registered word (step S1707: Yes), the search processing device 101 returns to step S1704. On the other hand, when there is no unselected registered word (step S1707: No), the search processing device 101 determines whether or not there is an unselected unselected attribute among the attributes that characterize the document to be searched (step). S1708).

ここで、未選択の属性がある場合（ステップＳ１７０８：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１７０３に戻る。一方、未選択の属性がない場合（ステップＳ１７０８：Ｎｏ）、検索処理装置１０１は、ドキュメント管理ＤＢ２２０から選択されていない未選択のレコードがあるか否かを判断する（ステップＳ１７０９）。 Here, if there is an unselected attribute (step S1708: Yes), the search processing device 101 returns to step S1703. On the other hand, when there is no unselected attribute (step S1708: No), the search processing device 101 determines whether or not there is an unselected record that has not been selected from the document management DB 220 (step S1709).

ここで、未選択のレコードがある場合（ステップＳ１７０９：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１７０２に戻る。一方、未選択のレコードがない場合には（ステップＳ１７０９：Ｎｏ）、検索処理装置１０１は、図１８に示すステップＳ１８０１に移行する。 Here, if there is an unselected record (step S1709: Yes), the search processing device 101 returns to step S1702. On the other hand, when there is no unselected record (step S1709: No), the search processing device 101 shifts to step S1801 shown in FIG.

図１８のフローチャートにおいて、まず、検索処理装置１０１は、検索結果格納リストを初期化する（ステップＳ１８０１）。つぎに、検索処理装置１０１は、絞込合致レコード格納リストから選択されていない未選択のレコードを選択する（ステップＳ１８０２）。 In the flowchart of FIG. 18, first, the search processing device 101 initializes the search result storage list (step S1801). Next, the search processing device 101 selects an unselected record that has not been selected from the refined match record storage list (step S1802).

そして、検索処理装置１０１は、検索条件を参照して、選択したレコードの本文情報に、検索キーワードが含まれるか否かを判断する（ステップＳ１８０３）。ここで、検索キーワードが含まれない場合（ステップＳ１８０３：Ｎｏ）、検索処理装置１０１は、ステップＳ１８０５に移行する。 Then, the search processing device 101 refers to the search condition and determines whether or not the search keyword is included in the body information of the selected record (step S1803). Here, when the search keyword is not included (step S1803: No), the search processing device 101 shifts to step S1805.

一方、検索キーワードが含まれる場合（ステップＳ１８０３：Ｙｅｓ）、検索処理装置１０１は、検索結果格納リストに、選択したレコードを格納する（ステップＳ１８０４）。そして、検索処理装置１０１は、絞込合致レコード格納リストから選択されていない未選択のレコードがあるか否かを判断する（ステップＳ１８０５）。 On the other hand, when the search keyword is included (step S1803: Yes), the search processing device 101 stores the selected record in the search result storage list (step S1804). Then, the search processing device 101 determines whether or not there is an unselected record that has not been selected from the narrowed-down match record storage list (step S1805).

ここで、未選択のレコードがある場合（ステップＳ１８０５：Ｙｅｓ）、検索処理装置１０１は、ステップＳ１８０２に戻る。一方、未選択のレコードがない場合（ステップＳ１８０５：Ｎｏ）、検索処理装置１０１は、検索処理を呼び出したステップに戻る。 Here, if there is an unselected record (step S1805: Yes), the search processing device 101 returns to step S1802. On the other hand, when there is no unselected record (step S1805: No), the search processing device 101 returns to the step in which the search processing is called.

これにより、既存の全文検索のような検索のやり方で、ユーザが欲しい属性を持つドキュメント集合に絞り込んだ検索を行うことができる。 This makes it possible to narrow down the search to a set of documents having the attributes that the user wants, using a search method similar to the existing full-text search.

なお、図１７の例では、各属性に対応する登録語のＯＲをとって、検索対象のドキュメントを絞り込むことにしたが、これに限らない。例えば、検索処理装置１０１は、各属性に対応する登録語のＡＮＤをとって、検索対象のドキュメントを絞り込むことにしてもよい。 In the example of FIG. 17, the OR of the registered word corresponding to each attribute is taken to narrow down the documents to be searched, but the present invention is not limited to this. For example, the search processing device 101 may narrow down the documents to be searched by ANDing the registered words corresponding to each attribute.

ここで、各属性に対応する登録語のＡＮＤをとって、検索対象のドキュメントを絞り込む場合の検索条件の具体例について説明する。 Here, a specific example of the search condition when narrowing down the documents to be searched by taking the AND of the registered words corresponding to each attribute will be described.

図１９は、検索条件の一例を示す説明図である。図１９において、検索条件１９００は、絞込条件１９１０と、全文検索条件１９２０とを含む。絞込条件１９１０は、１つの属性内の登録語同士でＯＲをとり、各属性の登録語同士でＡＮＤをとる条件である。全文検索条件１９２０は、検索キーワードで全文検索を行う条件である。 FIG. 19 is an explanatory diagram showing an example of search conditions. In FIG. 19, the search condition 1900 includes a narrowing condition 1910 and a full text search condition 1920. The narrowing-down condition 1910 is a condition in which OR is taken between the registered words in one attribute and AND is taken between the registered words of each attribute. The full-text search condition 1920 is a condition for performing a full-text search using a search keyword.

図１４に示したステップＳ１４０５の検索処理を検索条件１９００に基づき実行する場合、検索処理装置１０１は、例えば、絞込合致レコード格納リストを、属性毎に別リストで保持し、全属性の絞込合致レコード格納リストに格納されたレコードを、最終的な絞込合致レコードとする。 When the search process of step S1405 shown in FIG. 14 is executed based on the search condition 1900, the search processing device 101 holds, for example, a refined match record storage list for each attribute as a separate list, and narrows down all attributes. The record stored in the match record storage list is used as the final refined match record.

以上説明したように、実施の形態にかかる検索処理装置１０１によれば、検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部６１０に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定することができる。また、検索処理装置１０１によれば、第１のワードが存在すると判定した場合に、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象群を抽出することができる。そして、検索処理装置１０１によれば、受け付けた複数のワードに含まれる第１のワード以外の第２のワードと、抽出した検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、検索対象群の中から検索対象を特定することができる。 As described above, according to the search processing device 101 according to the embodiment, for each attribute of the search target stored in advance in the storage unit 610 among the plurality of words received for the search for the search target. It can be determined whether or not there is a first word that matches or is similar to one or more associated words. Further, according to the search processing device 101, when it is determined that the first word exists, the search target group including the first word in the attribute associated with the word that matches or is similar to the first word is selected. Can be extracted. Then, according to the search processing device 101, based on the comparison result between the second word other than the first word included in the received plurality of words and the word included in the data of each of the extracted search target groups, the search processing device 101 is used. The search target can be specified from the search target group.

これにより、属性を指定しない状態でキーワードの入力を受け付けても、キーワードが属性毎に設定されている検索対象について、属性を反映した検索を実行することができる。より詳細に説明すると、例えば、ユーザが意識して入力しなくても、入力文に含まれる複数の単語（ワード）を、絞り込み用のキーワード（属性キーワード）と、全文検索用のキーワード（検索キーワード）とに分類することができる。このため、ユーザが欲しい属性を持つ検索対象群を絞り込んだ上で全文検索を行うことが可能となり、検索結果に含まれるノイズ（ユーザが意図しない情報）を減らして、ユーザが欲しい情報を見つけやすくすることができる。 As a result, even if the input of the keyword is accepted without specifying the attribute, the search target in which the keyword is set for each attribute can be searched by reflecting the attribute. More specifically, for example, a keyword for narrowing down a plurality of words (words) included in an input sentence (attribute keyword) and a keyword for full-text search (search keyword) without the user having to consciously input the word (word). ) And can be classified. For this reason, it is possible to perform a full-text search after narrowing down the search target group having the attributes that the user wants, reducing noise (information not intended by the user) contained in the search results and making it easier for the user to find the information that the user wants. can do.

また、検索処理装置１０１によれば、第１のワードと一致又は類似するワードと対応付けられている属性において、当該属性に対応付けられた１又は複数のワードを含む検索対象群を抽出することができる。 Further, according to the search processing device 101, in the attribute associated with a word that matches or is similar to the first word, a search target group including one or a plurality of words associated with the attribute is extracted. Can be done.

これにより、予め属性に対応付けられた登録語（キーワード）を属性値に含む検索対象群を抽出することができ、属性による絞り込みがより利くようにして、ノイズを少なくすることができる。例えば、属性「製品名」の登録語として、実際に存在する特定の製品名が設定されていれば、属性「製品名」が、特定の製品名であるドキュメント群に絞り込むことができる。 As a result, it is possible to extract a search target group that includes a registered word (keyword) associated with an attribute in advance in the attribute value, and it is possible to narrow down by attribute more effectively and reduce noise. For example, if a specific product name that actually exists is set as a registered word of the attribute "product name", the attribute "product name" can be narrowed down to a document group that is a specific product name.

また、検索処理装置１０１によれば、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、受け付けた複数のワードのうち、各属性に対応付けられた１又は複数のワードと一致又は類似するワードの数（単語長）を算出することができる。そして、検索処理装置１０１によれば、複数の属性の中から、算出したワードの数が最大の属性を特定し、特定した属性において、第１のワードを含む検索対象群を抽出することができる。 Further, according to the search processing device 101, when there are a plurality of attributes associated with a word that matches or is similar to the first word, among the plurality of words received for each attribute of the plurality of attributes. The number of words (word length) that match or are similar to one or more words associated with each attribute can be calculated. Then, according to the search processing device 101, the attribute having the largest number of calculated words can be specified from the plurality of attributes, and the search target group including the first word can be extracted from the specified attributes. ..

これにより、複数の属性候補の中から、登録語に含まれる単語が入力文内により多く含まれる属性を、属性キーワード（第１のワード）に対応する属性として選定することができる。このため、検索対象群を絞り込む際に、属性による絞り込みがより利くようにして、ノイズを少なくすることができる。 As a result, from among the plurality of attribute candidates, the attribute in which the word included in the registered word is contained more in the input sentence can be selected as the attribute corresponding to the attribute keyword (first word). Therefore, when narrowing down the search target group, it is possible to make it easier to narrow down by attribute and reduce noise.

また、検索処理装置１０１によれば、第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、各属性に対応付けられた１又は複数のワードのうち、受け付けた複数のワードに含まれるワードと一致又は類似するワードが占める文字列比を算出し、複数の属性の中から、算出した文字列比が最大の属性を特定することができる。 Further, according to the search processing device 101, when there are a plurality of attributes associated with a word that matches or is similar to the first word, each attribute of the plurality of attributes is associated with each attribute1. Or, among a plurality of words, the character string ratio occupied by words that match or are similar to the words included in the received multiple words is calculated, and the attribute having the maximum calculated character string ratio is specified from the plurality of attributes. be able to.

これにより、複数の属性候補の中から、登録語の中で入力文に含まれる単語の文字列が占める割合（充填率）が高い属性を、属性キーワード（第１のワード）に対応する属性として選定することができる。このため、検索対象群を絞り込む際に、属性による絞り込みがより利くようにして、ノイズを少なくすることができる。 As a result, from among a plurality of attribute candidates, the attribute having a high ratio (filling rate) of the word string included in the input sentence among the registered words is set as the attribute corresponding to the attribute keyword (first word). Can be selected. Therefore, when narrowing down the search target group, it is possible to make it easier to narrow down by attribute and reduce noise.

また、検索処理装置１０１によれば、受け付けた複数のワードに対して、特定した検索対象を出力することができる。また、検索処理装置１０１によれば、特定した検索対象とともに、第１のワードと一致又は類似するワードと対応付けられている属性に対応付けられた登録語（語句）を出力することができる。 Further, according to the search processing device 101, it is possible to output a specified search target for a plurality of received words. Further, according to the search processing device 101, it is possible to output a registered word (phrase) associated with an attribute associated with a word that matches or is similar to the first word together with the specified search target.

これにより、ノイズの少ない検索結果を提供することができる。また、検索結果を表示する際に、どのような属性で検索結果を絞り込んだのかを特定可能な情報を提示することができる。 As a result, it is possible to provide search results with less noise. In addition, when displaying the search results, it is possible to present information that can identify the attributes used to narrow down the search results.

また、検索処理装置１０１によれば、第１のワードが存在すると判定した場合に、受け付けた複数のワードに含まれる第１のワード以外の第２のワードと、複数の検索対象それぞれのデータに含まれるワードとの比較結果に基づいて、複数の検索対象の中から検索対象群を抽出することができる。そして、検索処理装置１０１によれば、抽出した検索対象群の中から、第１のワードと一致又は類似するワードと対応付けられている属性において第１のワードを含む検索対象を特定することができる。 Further, according to the search processing device 101, when it is determined that the first word exists, the second word other than the first word included in the received plurality of words and the data of each of the plurality of search targets A search target group can be extracted from a plurality of search targets based on the comparison result with the included words. Then, according to the search processing device 101, it is possible to specify the search target including the first word in the attribute associated with the word that matches or is similar to the first word from the extracted search target group. it can.

これにより、入力文に含まれる複数の単語（ワード）を属性キーワードと検索キーワードとに分類し、検索キーワードを用いて全文検索を行った上で、属性キーワードを用いてユーザが欲しい属性を持つ検索対象群を絞り込むことができる。 As a result, multiple words included in the input sentence are classified into attribute keywords and search keywords, a full-text search is performed using the search keywords, and then a search with the attributes desired by the user is performed using the attribute keywords. The target group can be narrowed down.

これらのことから、実施の形態にかかる検索処理装置１０１によれば、ドキュメント検索において、ユーザが入力した入力文（検索文）から属性情報を自動抽出して、検索条件を最適化することができる。これにより、既存の全文検索のような検索のやり方で、ユーザが欲しい属性を持つドキュメント集合に絞り込んだ検索を行うことが可能となり、利便性を向上させることができる。 From these facts, according to the search processing apparatus 101 according to the embodiment, in the document search, the attribute information can be automatically extracted from the input sentence (search sentence) input by the user, and the search condition can be optimized. .. As a result, it is possible to perform a search narrowed down to a set of documents having the attributes desired by the user by a search method such as an existing full-text search, and it is possible to improve convenience.

なお、本実施の形態で説明した検索処理方法は、予め用意されたプログラムをパーソナル・コンピュータやワークステーション等のコンピュータで実行することにより実現することができる。本検索処理プログラムは、ハードディスク、フレキシブルディスク、ＣＤ−ＲＯＭ、ＤＶＤ、ＵＳＢメモリ等のコンピュータで読み取り可能な記録媒体に記録され、コンピュータによって記録媒体から読み出されることによって実行される。また、本検索処理プログラムは、インターネット等のネットワークを介して配布してもよい。 The search processing method described in the present embodiment can be realized by executing a program prepared in advance on a computer such as a personal computer or a workstation. This search processing program is recorded on a computer-readable recording medium such as a hard disk, flexible disk, CD-ROM, DVD, or USB memory, and is executed by being read from the recording medium by the computer. Further, the search processing program may be distributed via a network such as the Internet.

また、本実施の形態で説明した検索処理装置１０１は、スタンダードセルやストラクチャードＡＳＩＣ（ＡｐｐｌｉｃａｔｉｏｎＳｐｅｃｉｆｉｃＩｎｔｅｇｒａｔｅｄＣｉｒｃｕｉｔ）などの特定用途向けＩＣやＦＰＧＡなどのＰＬＤ（ＰｒｏｇｒａｍｍａｂｌｅＬｏｇｉｃＤｅｖｉｃｅ）によっても実現することができる。 Further, the search processing device 101 described in the present embodiment can also be realized by a standard cell, an IC for a specific purpose such as a structured ASIC (Application Specific Integrated Circuit), or a PLD (Programmable Logic Device) such as an FPGA.

上述した実施の形態に関し、さらに以下の付記を開示する。 The following additional notes are further disclosed with respect to the above-described embodiment.

（付記１）検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定し、
前記第１のワードが存在すると判定した場合に、前記第１のワードと一致又は類似するワードと対応付けられている属性において前記第１のワードを含む検索対象群を抽出し、
前記複数のワードに含まれる前記第１のワード以外の第２のワードと前記検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、前記検索対象群の中から前記検索対象を特定する、
処理をコンピュータに実行させることを特徴とする検索処理プログラム。 (Appendix 1) Among the plurality of words received for the search for the search target, the first word that matches or is similar to one or more words associated with each attribute of the search target stored in the storage unit in advance. Determine if the word exists and
When it is determined that the first word exists, a search target group including the first word in the attribute associated with the word matching or similar to the first word is extracted.
The search target is specified from the search target group based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups. To do
A search processing program characterized by having a computer execute processing.

（付記２）前記抽出する処理は、
前記第１のワードと一致又は類似するワードと対応付けられている属性において、当該属性に対応付けられた１又は複数のワードを含む検索対象群を抽出する、ことを特徴とする付記１に記載の検索処理プログラム。 (Appendix 2) The extraction process is
The description in Appendix 1, wherein a search target group including one or a plurality of words associated with the attribute is extracted from the attributes associated with the word matching or similar to the first word. Search processing program.

（付記３）前記抽出する処理は、
前記第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、受け付けた前記複数のワードのうち、前記各属性に対応付けられた１又は複数のワードと一致又は類似するワードの数を算出し、
前記複数の属性の中から、算出した前記ワードの数が最大の属性を特定し、
特定した前記属性において、前記第１のワードを含む検索対象群を抽出する、ことを特徴とする付記１または２に記載の検索処理プログラム。 (Appendix 3) The extraction process is
When there are a plurality of attributes associated with a word that matches or is similar to the first word, each attribute of the plurality of attributes is associated with each of the received plurality of words. Calculate the number of words that match or are similar to one or more words
From the plurality of attributes, the attribute having the largest number of calculated words is specified.
The search processing program according to Appendix 1 or 2, wherein a search target group including the first word is extracted from the specified attribute.

（付記４）前記抽出する処理は、
前記第１のワードと一致又は類似するワードと対応付けられている属性が複数存在する場合、当該複数の属性の各属性について、前記各属性に対応付けられた１又は複数のワードのうち、受け付けた前記複数のワードに含まれるワードと一致又は類似するワードが占める文字列比を算出し、
前記複数の属性の中から、算出した前記文字列比が最大の属性を特定し、
特定した前記属性において、前記第１のワードを含む検索対象群を抽出する、ことを特徴とする付記１〜３のいずれか一つに記載の検索処理プログラム。 (Appendix 4) The extraction process is
When there are a plurality of attributes associated with words that match or are similar to the first word, each attribute of the plurality of attributes is accepted from among one or more words associated with each attribute. Calculate the character string ratio occupied by words that match or are similar to the words included in the plurality of words.
From the plurality of attributes, the attribute having the maximum calculated character string ratio is specified.
The search processing program according to any one of Supplementary note 1 to 3, wherein a search target group including the first word is extracted from the specified attribute.

（付記５）受け付けた前記複数のワードに対して、特定した前記検索対象を出力する、処理を前記コンピュータに実行させることを特徴とする付記１〜４のいずれか一つに記載の検索処理プログラム。 (Supplementary Note 5) The search processing program according to any one of Supplementary note 1 to 4, wherein the computer is made to execute a process of outputting the specified search target for the plurality of received words. ..

（付記６）前記属性毎に対応付けられた１又は複数のワードは、前記属性毎に対応付けられた語句から分割された１又は複数のワードであり、
前記出力する処理は、
特定した前記検索対象とともに、前記第１のワードと一致又は類似するワードと対応付けられている属性に対応付けられた語句を出力する、ことを特徴とする付記５に記載の検索処理プログラム。 (Appendix 6) The one or more words associated with each attribute are one or more words divided from the words and phrases associated with each attribute.
The output process is
The search processing program according to Appendix 5, characterized in that, together with the specified search target, words and phrases associated with attributes associated with words that match or are similar to the first word are output.

（付記７）前記第１のワードが存在すると判定した場合に、前記複数のワードに含まれる前記第１のワード以外の第２のワードと、複数の検索対象それぞれのデータに含まれるワードとの比較結果に基づいて、前記複数の検索対象の中から検索対象群を抽出し、
抽出した前記検索対象群の中から、前記第１のワードと一致又は類似するワードと対応付けられている属性において前記第１のワードを含む検索対象を特定する、
処理を前記コンピュータに実行させることを特徴とする付記１に記載の検索処理プログラム。 (Appendix 7) When it is determined that the first word exists, the second word other than the first word included in the plurality of words and the word included in the data of each of the plurality of search targets Based on the comparison result, a search target group is extracted from the plurality of search targets, and the search target group is extracted.
From the extracted search target group, a search target including the first word is specified in an attribute associated with a word that matches or is similar to the first word.
The search processing program according to Appendix 1, wherein the processing is executed by the computer.

（付記８）検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定し、
前記第１のワードが存在すると判定した場合に、前記第１のワードと一致又は類似するワードと対応付けられている属性において前記第１のワードを含む検索対象群を抽出し、
前記複数のワードに含まれる前記第１のワード以外の第２のワードと前記検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、前記検索対象群の中から前記検索対象を特定する、
処理をコンピュータが実行することを特徴とする検索処理方法。 (Appendix 8) Among the plurality of words received for the search for the search target, the first word that matches or is similar to one or a plurality of words associated with each attribute of the search target stored in the storage unit in advance. Determine if the word exists and
When it is determined that the first word exists, a search target group including the first word in the attribute associated with the word matching or similar to the first word is extracted.
The search target is specified from the search target group based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups. To do
A search processing method characterized in that the processing is performed by a computer.

（付記９）検索対象に対する検索のために受け付けた複数のワードの中に、予め記憶部に記憶されていた検索対象の属性毎に対応付けられた１又は複数のワードと一致又は類似する第１のワードが存在するか否かを判定する判定部と、
前記判定部が前記第１のワードが存在すると判定した場合に、前記第１のワードと一致又は類似するワードと対応付けられている属性において前記第１のワードを含む検索対象群を抽出する抽出部と、
前記複数のワードに含まれる前記第１のワード以外の第２のワードと、前記抽出部が抽出した前記検索対象群それぞれのデータに含まれるワードとの比較結果に基づいて、前記検索対象群の中から前記検索対象を特定する特定部と、
を有することを特徴とする検索処理装置。 (Appendix 9) Among the plurality of words received for the search for the search target, the first word that matches or is similar to one or a plurality of words associated with each attribute of the search target stored in the storage unit in advance. Judgment unit that determines whether or not the word of
Extraction that extracts the search target group including the first word in the attribute associated with the word that matches or is similar to the first word when the determination unit determines that the first word exists. Department and
Based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups extracted by the extraction unit, the search target group A specific part that identifies the search target from among
A search processing device characterized by having.

１０１検索処理装置
１１０，６１０記憶部
２００検索処理システム
２０１クライアント装置
２１０ネットワーク
２２０ドキュメント管理ＤＢ
２３０製品属性テーブル
３００バス
３０１ＣＰＵ
３０２メモリ
３０３ディスクドライブ
３０４ディスク
３０５通信Ｉ／Ｆ
３０６可搬型記録媒体Ｉ／Ｆ
３０７可搬型記録媒体
６０１受付部
６０２判定部
６０３抽出部
６０４特定部
６０５出力部
７１０入力文
８００属性候補テーブル
１２００検索画面
１９００検索条件 101 Search processing device 110, 610 Storage unit 200 Search processing system 201 Client device 210 Network 220 Document management DB
230 Product attribute table 300 Bus 301 CPU
302 Memory 303 Disk drive 304 Disk 305 Communication I / F
306 Portable recording medium I / F
307 Portable recording medium 601 Reception unit 602 Judgment unit 603 Extraction unit 604 Specific unit 605 Output unit 710 Input text 800 Attribute candidate table 1200 Search screen 1900 Search conditions

Claims

Among the plurality of words received for the search for the search target, there is a first word that matches or is similar to one or more words associated with each attribute of the search target stored in the storage unit in advance. Judge whether to do or not,
When it is determined that the first word exists, a search target group including the first word in the attribute associated with the word matching or similar to the first word is extracted.
The search target is specified from the search target group based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups. To do
A search processing program characterized by having a computer execute processing.

The extraction process is
The first aspect of the present invention is to extract a search target group including one or a plurality of words associated with the attribute in an attribute associated with a word that matches or is similar to the first word. The search processing program described.

The extraction process is
When there are a plurality of attributes associated with a word that matches or is similar to the first word, each attribute of the plurality of attributes is associated with each of the received plurality of words. Calculate the number of words that match or are similar to one or more words
From the plurality of attributes, the attribute having the largest number of calculated words is specified.
The search processing program according to claim 1 or 2, wherein a search target group including the first word is extracted from the specified attribute.

The extraction process is
When there are a plurality of attributes associated with words that match or are similar to the first word, each attribute of the plurality of attributes is accepted from among one or more words associated with each attribute. Calculate the character string ratio occupied by words that match or are similar to the words included in the plurality of words.
From the plurality of attributes, the attribute having the maximum calculated character string ratio is specified.
The search processing program according to any one of claims 1 to 3, wherein a search target group including the first word is extracted from the specified attribute.

The search processing program according to any one of claims 1 to 4, wherein the computer executes a process of outputting the specified search target for the plurality of received words.

Among the plurality of words received for the search for the search target, there is a first word that matches or is similar to one or more words associated with each attribute of the search target stored in the storage unit in advance. Judge whether to do or not,
When it is determined that the first word exists, a search target group including the first word in the attribute associated with the word matching or similar to the first word is extracted.
The search target is specified from the search target group based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups. To do
A search processing method characterized in that the processing is performed by a computer.

Among the plurality of words received for the search for the search target, there is a first word that matches or is similar to one or more words associated with each attribute of the search target stored in the storage unit in advance. A judgment unit that determines whether or not to do
Extraction that extracts the search target group including the first word in the attribute associated with the word that matches or is similar to the first word when the determination unit determines that the first word exists. Department and
Based on the comparison result between the second word other than the first word included in the plurality of words and the word included in the data of each of the search target groups extracted by the extraction unit, the search target group A specific part that identifies the search target from among
A search processing device characterized by having.