JP7127351B2

JP7127351B2 - KANSEI SEARCH PROGRAM, INFORMATION PROCESSING DEVICE, AND KANSEI SEARCH METHOD

Info

Publication number: JP7127351B2
Application number: JP2018092079A
Authority: JP
Inventors: 健太郎村瀬
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2018-05-11
Filing date: 2018-05-11
Publication date: 2022-08-30
Anticipated expiration: 2038-05-11
Also published as: JP2019197464A

Description

本発明は、感性検索プログラム，情報処理装置および感性検索方法に関する。 The present invention relates to a sensitivity search program, an information processing apparatus, and a sensitivity search method.

近年、様々なＷｅｂサービスが提供されている。中でも検索サービスは、ホテルの検索や飲食店の検索、ＥＣ（Electronic Commerce）サイトの商品検索など、利用頻度の多いサービスである。 In recent years, various web services have been provided. Among them, search services are frequently used services such as hotel searches, restaurant searches, and product searches on EC (Electronic Commerce) sites.

しかし、食事や洋服など、スペックで定義される価値よりも雰囲気やイメージなどの感性的な価値が重視される商品を検索する場合、商品紹介文と検索する人物が利用する感性的なキーワードとは必ずしも一致しない。そのため、キーワードマッチングによる検索システムは有効に機能しないという問題がある。 However, when searching for products such as food and clothes, where emotional values such as atmosphere and image are more important than values defined by specs, what are the product descriptions and emotional keywords used by the person searching? not necessarily match. Therefore, there is a problem that a search system based on keyword matching does not function effectively.

そこで、感性的な価値が重視される商品を、直感的に検索できる感性検索システムが知られている。 Therefore, an emotional search system is known that can intuitively search for products whose emotional value is emphasized.

感性検索システムにおいては、検索対象のイメージの種類を示す側面情報（外見的特徴、人柄など）を定義する。また、単語に対して、検索対象が有するイメージを側面情報毎に定量的な値（感性ベクトル）で表した感性情報を定義する。 In the Kansei search system, side information (external features, personality, etc.) indicating the type of image to be searched is defined. For words, Kansei information is defined in which an image of a search target is represented by a quantitative value (kansei vector) for each aspect information.

そして、従来の感性検索システムにおいては、ユーザが検索のために入力したクエリに含まれる複数の語（各表現）を無条件に感性ベクトルに変換し、感性ベクトルの類似度に基づき検索結果を表示する。 In conventional emotion search systems, multiple words (each expression) included in a query input by a user for searching are unconditionally converted into emotion vectors, and search results are displayed based on the degree of similarity of the emotion vectors. do.

例えば、ユーザが「オフでも使えるブロードのワイシャツ」とのクエリを入力して検索する例について示す。このクエリ「オフでも使えるブロードのワイシャツ」においては、素材を示す「ブロードの」およびシャツの種類を特定する「ワイシャツ」の各語はユーザが望む外せない条件であると言える。これに対して、「オフでも使える」との語は曖昧性を含む表現となっている。 For example, an example in which a user enters a query of "broad dress shirt that can be used even when off" to perform a search will be described. In this query "broad shirt that can be used even when off", the words "broad shirt" that indicates the material and "shirt" that specifies the type of shirt can be said to be indispensable conditions desired by the user. On the other hand, the term "can be used even when turned off" is an expression containing ambiguity.

従来の感性検索システムにおいては、ユーザクエリに含まれる「ブロードの」，「ワイシャツ」および「オフでも使える」をそれぞれ感性ベクトルに置き換え、これらの感性ベクトルの値を総合した総合感性ベクトルを算出する。そして、この総合感性ベクトルの値との類似度に基づく検索結果を出力する。 In a conventional sensitivity search system, "broadcloth", "shirt", and "can be used even off" included in a user query are replaced with sensitivity vectors, and the values of these sensitivity vectors are combined to calculate a comprehensive sensitivity vector. Then, it outputs a search result based on the degree of similarity with the value of this comprehensive sensitivity vector.

特開２０１１－４８５２７号公報JP 2011-48527 A

しかしながら、従来の感性検索システムにおいては、上述の如く、ユーザが検索のために入力したクエリに含まれる複数の語（各表現）を無条件に感性ベクトルに変換し、感性ベクトルの類似度を主にして、検索結果を表示する。 However, as described above, in the conventional affective search system, a plurality of words (each expression) contained in a query input by a user for searching are unconditionally converted into affective vectors, and the similarity of the affective vectors is mainly used. to display the search results.

そのため、検索結果として、「ワイシャツ」ではない、「カジュアルシャツでよりドレス寄りのシャツ」が提示されてしまうことがあり、ユーザが外せない条件である、「ブロードの」「ドレスシャツ（ワイシャツ）」が反映されない結果が得られてしまう。すなわち、ユーザが意図しない検索結果が出力される。 As a result, the search result may show "a casual shirt that is more dress-like" instead of "a dress shirt". result is not reflected. That is, a search result not intended by the user is output.

１つの側面では、本発明は、感性検索においてユーザの意図が反映された検索結果を得ることができるようにすることを目的とする。 In one aspect, an object of the present invention is to enable a user's intention to be reflected in a search result to be obtained in an emotional search.

このため、この感性検索プログラムは、クエリに含まれる複数の語において、検索対象データベースの属性項目として用いられる語彙を事前に抽出した属性項目語彙集合に属す語彙を、ユーザが外せない必須条件を指定する語彙と推定し、前記複数の語において前記属性項目語彙集合以外に属す語彙について、出現頻度が所定の閾値を超えるものを、曖昧性を含む表現として感性情報に展開可能な複数の第１語をとして推定する一方、前記出現頻度が前記所定の閾値以下のものを、前記必須条件を指定する語彙と推定し、前記複数の第１語に優先順位を設定し、前記優先順位に応じて前記複数の第１語の中から選択した第１語について検索対象情報に対する感性検索を行なう処理を、コンピュータに実行させる。 For this reason, this sensitivity search program designates, in a plurality of words included in a query, terms that belong to an attribute item vocabulary set that is preliminarily extracted to be used as attribute items of a database to be searched, and that the user cannot remove the terms. and among the plurality of words belonging to other than the attribute item vocabulary set, a plurality of first words that can be developed into sensibility information as expressions containing ambiguity if the occurrence frequency exceeds a predetermined threshold on the other hand, presumes words whose frequency of appearance is equal to or less than the predetermined threshold as vocabulary specifying the essential condition, sets priorities to the plurality of first words, and according to the priorities, A computer is caused to execute a process of carrying out a sensitivity search on search target information for a first word selected from among a plurality of first words.

一実施形態によれば、感性検索においてユーザの意図が反映された検索結果を得ることができる。 According to one embodiment, it is possible to obtain a search result that reflects the user's intention in the sensitivity search.

説明文語彙集合における語彙の出現頻度の関係性を示す図である。FIG. 10 is a diagram showing the relationship of appearance frequencies of vocabularies in the explanatory sentence vocabulary set; 実施形態の一例としての情報処理システムの構成を模式的に示す図である。1 is a diagram schematically showing the configuration of an information processing system as an example of an embodiment; FIG. 実施形態の一例としての情報処理システムにおけるクライアント端末の表示画面を例示する図である。FIG. 4 is a diagram illustrating a display screen of a client terminal in the information processing system as one example of an embodiment; 実施形態の一例としての情報処理システムにおける管理サーバのハードウェア構成を例示する図である。1 is a diagram illustrating a hardware configuration of a management server in an information processing system as one example of an embodiment; FIG. 実施形態の一例としての情報処理システムにおける管理サーバの機能構成を示す図である。1 is a diagram showing a functional configuration of a management server in an information processing system as one example of an embodiment; FIG. 実施形態の一例としての情報処理システムにおける感性ベクトル定義辞書を例示する図である。FIG. 10 is a diagram illustrating a Kansei vector definition dictionary in the information processing system as one example of an embodiment; 実施形態の一例としての情報処理システムにおける同一のユーザクエリに対する再検索処理を説明するためのフローチャートである。9 is a flowchart for explaining re-search processing for the same user query in the information processing system as an example of the embodiment; 実施形態の一例としての情報処理システムにおける処理を説明するフローチャートである。4 is a flowchart illustrating processing in an information processing system as an example of an embodiment;

以下、図面を参照して本感性検索プログラム，情報処理装置および感性検索方法にかかる実施の形態を説明する。ただし、以下に示す実施形態はあくまでも例示に過ぎず、実施形態で明示しない種々の変形例や技術の適用を排除する意図はない。すなわち、本実施形態を、その趣旨を逸脱しない範囲で種々変形（実施形態および各変形例を組み合わせる等）して実施することができる。また、各図は、図中に示す構成要素のみを備えるという趣旨ではなく、他の機能等を含むことができる。 Hereinafter, embodiments of the present sensitivity search program, information processing apparatus, and sensitivity search method will be described with reference to the drawings. However, the embodiments shown below are merely examples, and are not intended to exclude the application of various modifications and techniques not explicitly described in the embodiments. That is, the present embodiment can be modified in various ways (such as by combining the embodiment and each modified example) without departing from the spirit of the embodiment. Also, each drawing does not mean that it has only the constituent elements shown in the drawing, but can include other functions and the like.

（Ａ）概要
最初に、実施形態の一例としての情報処理システム１（図２参照）による処理の概要を説明する。
本情報処理システム１においては、ユーザから入力される検索クエリ文中で、（１）曖昧性を含む表現として感性ベクトルに展開可能な語（語彙）、および、（２）ユーザが外せない条件を指定する語彙の推定をそれぞれ行なう。 (A) Overview First, an overview of processing by the information processing system 1 (see FIG. 2) as an example of the embodiment will be described.
In this information processing system 1, in the search query text input by the user, (1) words (vocabulary) that can be developed into sensitivity vectors as expressions containing ambiguity, and (2) conditions that the user cannot remove are specified. Each word is estimated.

例えば、衣類を検索するための「ブロード生地のお洒落なパンツ」という検索クエリには、「ブロード生地の」、「お洒落な」および「パンツ」の３つの語（語彙）が含まれる。 For example, the search query "broadcloth fashionable pants" for searching clothing includes three words (vocabulary): "broadcloth", "fashionable" and "pants".

これらの３つの語彙のうち、「お洒落な」は曖昧性を含み抽象度が高い表現（語彙）である。このような曖昧性を含む語彙は、商品の説明文中で多様な他の語の修飾にも用いられるため、統計的に出現頻度が高い傾向がある。以下、曖昧性を含み抽象度が高い表現の語彙を「曖昧な抽象表現」という場合がある。 Among these three vocabularies, "fashionable" is an expression (vocabulary) that contains ambiguity and has a high degree of abstraction. Vocabulary containing such ambiguity tends to appear statistically with high frequency because it is also used to modify various other words in descriptions of products. Hereinafter, the vocabulary of expressions that include ambiguity and have a high degree of abstraction may be referred to as "ambiguous abstract expressions".

これに対して、「ブロード生地の」および「パンツ」は、ユーザが外せない条件（必須条件）を指定する語彙であり、これらの語彙に基づいて感性検索を行なった結果として、例えば「サテン生地」や「スカート」が提示されることとなった場合、その検索結果にユーザが満足することはない。 On the other hand, "broad fabric" and "pants" are terms that specify conditions (essential conditions) that the user cannot remove. or "skirt", the user will not be satisfied with the search results.

このようなユーザが外せない条件を指定する語彙である「ブロード生地の」は、個々の商品の特徴を具体的に限定的に修飾するため、商品の説明文集合の中で、統計的に出現頻度が低い傾向がある。以下、特徴を具体的に限定的に修飾する語彙を、「具体的な限定的修飾表現」という場合がある。 The term “broad cloth”, which is a term that specifies conditions that users cannot exclude, is used to modify the characteristics of individual products in a specific and limited manner. It tends to be infrequent. Hereinafter, a term that specifically and restrictively modifies a feature may be referred to as a "specific restrictive modifying expression".

ただし、ユーザが外せない条件を指定する語彙であっても、検索ＤＢ（Data Base）において属性項目として設定される一般的な用語（例えば、「スカート」）は、使用頻度（出現頻度）が高い。 However, even if it is a vocabulary that specifies conditions that the user cannot exclude, general terms (for example, "skirt") that are set as attribute items in the search DB (Data Base) have a high frequency of use (appearance frequency). .

また、以下、ユーザが検索を行なうために入力するクエリ（検索クエリ）をユーザクエリという場合がある。 Further, hereinafter, a query (search query) input by a user to perform a search may be referred to as a user query.

図１は説明文語彙集合における語彙の出現頻度の関係性を示す図である。 FIG. 1 is a diagram showing the relationship of the frequency of occurrence of vocabularies in the explanatory sentence vocabulary set.

検索対象商品ＤＢ１０１には、商品の説明文を構成する説明文集合とＤＢの属性項目（ＤＢ属性項目）とが含まれる。ＤＢ属性項目を構成する語彙をＤＢ属性項目語彙集合という場合がある。 The search target product DB 101 includes a set of description texts that constitute description texts of products and attribute items of the DB (DB attribute items). A vocabulary that constitutes a DB attribute item may be referred to as a DB attribute item vocabulary set.

この図１に示す例においては、検索対象商品ＤＢ１０１に格納される商品の説明文に含まれる語彙（説明文集合）とＤＢの属性項目として用いられている語彙（ＤＢ属性項目，ＤＢ属性項目語彙集合）とを合わせて説明文語彙集合Ｘで表す。 In the example shown in FIG. 1, the vocabulary included in the product description stored in the search target product DB 101 (explanatory text set) and the vocabulary used as the attribute item of the DB (DB attribute item, DB attribute item vocabulary set) are collectively represented as an explanatory sentence vocabulary set X.

この説明文語彙集合Ｘには、曖昧な抽象表現の集合Ａと、具体的な限定的修飾表現の集合Ｂとが含まれる。また、具体的な限定的修飾表現の集合Ｂには、ＤＢ属性項目語彙集合Ｃが含まれる（集合ＣはＤＢ属性項目も含む）。 This descriptive text vocabulary set X includes a set A of ambiguous abstract expressions and a set B of specific restrictive modifying expressions. Further, the set B of specific restrictive modifying expressions includes a DB attribute item vocabulary set C (the set C also includes DB attribute items).

説明文語彙集合Ｘにおける語彙の出現頻度の関係性は以下のように表すことができる。 The relationship of the frequency of occurrence of vocabulary in the explanatory sentence vocabulary set X can be expressed as follows.

｛語彙集合Ａの頻度｝＞｛（語彙集合Ｂ－Ｃ）の頻度｝
本情報処理システム１においては、以下の手順１～４に従って、（１）曖昧性を含む表現として感性ベクトルに展開可能な語（語彙）と、（２）ユーザが外せない条件を指定する語彙との推定をそれぞれ行なう。 {Frequency of vocabulary set A}>{Frequency of (vocabulary set BC)}
In this information processing system 1, according to the following procedures 1 to 4, (1) words (vocabularies) that can be developed into sensitivity vectors as expressions including ambiguity, and (2) vocabulary that specifies conditions that the user cannot remove. are estimated respectively.

語彙の推定には、検索対象となる商品の説明文の集合全体の中で使われている語彙の頻度（出現頻度）を用いる。 For estimating the vocabulary, the frequency (appearance frequency) of the vocabulary used in the entire set of descriptions of the product to be searched is used.

手順１：本情報処理システム１において、事前に検索対象商品ＤＢ１０１（図１参照）中の商品紹介説明文中の語彙を抽出し集合Ｘとする。また、これらの語彙の頻度情報を求める。 Procedure 1: In the information processing system 1, the vocabulary in the product introduction description in the search target product DB 101 (see FIG. 1) is extracted in advance and made into a set X. Also, the frequency information of these vocabularies is obtained.

手順２：事前に検索対象商品ＤＢ１０１のＤＢ属性項目として用いられている語彙を抽出し、これらの語彙を集合Ｃとする。このような検索対象のＤＢにおいて項目情報として予め設定されている語彙は、無条件に（２）ユーザが外せない条件であると推定する。 Procedure 2: Vocabularies used as DB attribute items in the search target product DB 101 are extracted in advance, and these vocabularies are defined as a set C. Vocabulary preset as item information in such a DB to be searched is unconditionally presumed to be (2) the user's unavoidable condition.

手順３：ユーザクエリ中で集合Ｃに相当する語彙は（２）ユーザが外せない条件を指定する語彙であると推定する。 Procedure 3: Presume that the vocabulary corresponding to set C in the user query is (2) a vocabulary that specifies a condition that the user cannot omit.

手順４：ユーザクエリにおける残りの語彙で、集合Ｘ中の出現頻度が所定の閾値を超えるものを、（１）曖昧性を含む表現として感性ベクトルに展開可能な語彙と推定する。また、閾値以下のものを、（２）ユーザが外せない条件を指定する語彙と推定する。 Step 4: Among the remaining vocabularies in the user query, the vocabularies whose frequency of appearance in the set X exceeds a predetermined threshold are estimated as (1) vocabularies that can be developed into sensitivity vectors as expressions containing ambiguity. In addition, words below the threshold value are assumed to be (2) vocabulary specifying conditions that cannot be removed by the user.

このように、集合Ｃ以外の語彙について、頻度が所定の閾値以下のものを（１）曖昧性を含む表現として感性ベクトルに展開可能な語と推定し、閾値を超えるものを（２）ユーザが外せない条件を含む語と推定する。 In this way, with respect to vocabulary other than the set C, words whose frequency is less than a predetermined threshold are (1) words that can be developed into a sentiment vector as expressions containing ambiguity, and words that exceed the threshold are (2) It is presumed to be a word that includes an unavoidable condition.

例えば、検索対象商品ＤＢ１０１のＤＢ属性項目として「ワイシャツ」，「Ｔシャツ」，「ポロシャツ」等が含まれているものとし、ユーザクエリが「オフでも使えるブロードのワイシャツ」である例について示す。 For example, it is assumed that "shirt", "T-shirt", "polo shirt", etc. are included as DB attribute items of the search target product DB 101, and an example in which the user query is "a broad shirt that can be used even when off" is shown.

このユーザクエリには、「オフでも使える」，「ブロードの」および「ワイシャツ」の３つの語彙が含まれる。 This user query includes three terms: "can be used even when off", "broadcloth", and "shirt".

ここで、「ワイシャツ」は検索対象商品ＤＢ１０１のＤＢ属性項目として含まれているので、（２）ユーザが外せない条件を指定する語彙と特定される。 Here, since "shirt" is included as a DB attribute item in the search target product DB 101, it is identified as (2) a vocabulary specifying a condition that the user cannot remove.

また、集合Ｘにおいて、「オフでも使える」の出現頻度が閾値よりも大きく，「ブロードの」の出現頻度が閾値以下である場合には、「オフでも使える」は（１）曖昧性を含む表現として感性ベクトルに展開可能な語彙と推定され、「ブロードの」は（２）ユーザが外せない条件を指定する語彙と推定される。 Also, in the set X, if the appearance frequency of "can be used even when turned off" is greater than the threshold and the appearance frequency of "broad" is less than or equal to the threshold, "can be used even when turned off" is (1) an expression containing ambiguity , and "broad" is presumed to be a vocabulary specifying (2) a condition that the user cannot remove.

また、本情報処理システム１においては、（１）曖昧性を含む表現として感性ベクトルに展開可能な語彙については、検索対象商品ＤＢ１０１において出現頻度が多い語彙を優先して、順次、感性ベクトルに展開し、この展開の度に展開された感性ベクトルを考慮した全文検索を行なう。 In addition, in the information processing system 1, (1) words that can be developed into sensitivity vectors as expressions including ambiguity are developed into sensitivity vectors, with priority given to vocabulary that appears frequently in the search target product DB 101. Then, a full-text search is performed in consideration of the sensitivity vector developed each time this development is performed.

（Ｂ）構成
図２は実施形態の一例としての情報処理システム１のハードウェア構成を模式的に示す図である。 (B) Configuration FIG. 2 is a diagram schematically showing the hardware configuration of the information processing system 1 as an example of the embodiment.

情報処理システム１は、図２に示すように、管理サーバ１０，ネットワーク３０および１つ以上（図１に示す例では２つ）のクライアント端末２０－１，２０－２を備える。 As shown in FIG. 2, the information processing system 1 includes a management server 10, a network 30, and one or more (two in the example shown in FIG. 1) client terminals 20-1 and 20-2.

管理サーバ１０とクライアント端末２０－１，２０－２とはネットワーク３０を介して通信可能に接続されている。 The management server 10 and the client terminals 20-1 and 20-2 are connected via a network 30 so as to be communicable.

クライアント端末２０－１，２０－２は、ＰＣ（Personal Computer）やスマートフォン，タブレット等の情報処理装置（コンピュータ）であって、管理サーバ１０が提供するサービスをユーザが利用するために用いられる。本実施形態においては、ユーザがクライアント端末２０－１，２０－２を用いて管理サーバ１０が提供する検索システムを利用する。 The client terminals 20-1 and 20-2 are information processing devices (computers) such as PCs (Personal Computers), smart phones, tablets, etc., and are used by users to use services provided by the management server 10. FIG. In this embodiment, the user utilizes the search system provided by the management server 10 using the client terminals 20-1 and 20-2.

なお、以下、クライアント端末を示す符号としては、複数のクライアント端末のうち１つを特定する必要があるときには符号２０－１，２０－２を用いるが、任意のクライアント端末を指すときには符号２０を用いる。 Note that, hereinafter, as a code indicating a client terminal, codes 20-1 and 20-2 are used when it is necessary to specify one of a plurality of client terminals, but code 20 is used when an arbitrary client terminal is indicated. .

クライアント端末２０は、例えば、図示しないＣＰＵ（Central Processing Unit），メモリ，入力装置および出力装置を備え、ＣＰＵがプログラムを実行することで種々の機能を実現する。 The client terminal 20 includes, for example, a CPU (Central Processing Unit), a memory, an input device, and an output device (not shown), and implements various functions by the CPU executing programs.

例えば、クライアント端末２０において、ユーザがキーボードやタッチパネル等の入力装置を用いて検索を行なうためのクエリ（ユーザクエリ；検索条件）を入力し、管理サーバ１０からこの検索クエリに応じて送信された検索結果を取得する。この取得された検索結果は、ディスプレイやタッチパネル等の出力装置（表示装置）に出力される。 For example, in the client terminal 20, a user inputs a query (user query; search condition) for performing a search using an input device such as a keyboard or a touch panel, and a search transmitted from the management server 10 in response to this search query. Get results. The acquired search results are output to an output device (display device) such as a display or touch panel.

なお、ユーザクエリの入力は、図示しないマイクを用いた音声入力によって行なってもよい。また、検索結果の出力は図示しないスピーカを用いた音声出力によって行なってもよい。 Note that the user query may be input by voice input using a microphone (not shown). Also, the search results may be output by voice using a speaker (not shown).

また、本情報処理システム１においては、ユーザは、検索サービスを用いるために、入力装置を用いて検索対象に関するテキストを検索条件（ユーザクエリ）として入力する。ユーザクエリはキーワードであってもよく、また任意のテキスト文（自由文，フリーテキスト）であってもよい。 Further, in the information processing system 1, the user uses an input device to input a text related to a search target as a search condition (user query) in order to use the search service. A user query may be a keyword, or may be an arbitrary text sentence (free sentence, free text).

図３は実施形態の一例としての情報処理システム１におけるクライアント端末２０の表示画面を例示する図である。 FIG. 3 is a diagram illustrating a display screen of the client terminal 20 in the information processing system 1 as one example of the embodiment.

この図３に例示する表示画面は、管理サーバ１０がウェブページとして提供する検索サービスをブラウザによって表示するものである。 The display screen illustrated in FIG. 3 displays a search service provided by the management server 10 as a web page by a browser.

この表示画面は、インタフェース画面５０１と検索結果表示画面５０３とを備える。 This display screen includes an interface screen 501 and a search result display screen 503 .

インタフェース画面５０１は、ユーザと検索サービスとの対話に用いられる。例えば、図３に例示するインタフェース画面５０１においては、検索サービスからユーザに対する「どこに着ていく服を探していますか？」との質問が表示されている。 Interface screen 501 is used for interaction between the user and the search service. For example, on the interface screen 501 illustrated in FIG. 3, a query from the search service to the user, "Where are you looking for clothes to wear?" is displayed.

ユーザはこのインタフェース画面５０１に表示された検索サービスからの質問に答える形式で、ユーザクエリ入力ボックス５０２からユーザクエリ（検索クエリ）を入力する。この図３に示す例においては、ユーザは「オフでも使えるブロードのワイシャツ」とのユーザクエリを入力している。 A user inputs a user query (search query) from a user query input box 502 in the form of answering a question from the search service displayed on this interface screen 501 . In the example shown in FIG. 3, the user has entered a user query "broad dress shirt that can be used even when off".

検索サービスはこのユーザクエリ入力ボックス５０２から入力されたユーザクエリに基づき、検索対象商品ＤＢ１０１に対する検索を実行する。 The search service searches the search target product DB 101 based on the user query input from this user query input box 502 .

検索結果表示画面５０３には、検索システムが行なった検索の結果（検索結果）が出力される。図３に例示する検索結果表示画面５０３においては、入力されたユーザクエリに基づいて管理サーバ１０が検索を行なった結果として、複数のシャツの画像と説明文とが一覧として表示されている。 A search result display screen 503 outputs the search result (search result) performed by the search system. In the search result display screen 503 illustrated in FIG. 3, images and descriptions of a plurality of shirts are displayed as a list as a result of the search performed by the management server 10 based on the input user query.

ネットワーク３０は、例えば、イーサネット（登録商標）等のコンピュータネットワークである。クライアント端末２０や管理サーバ１０はネットワーク３０に対して、有線もしくは無線のいずれの形態で接続されてもよい。 The network 30 is, for example, a computer network such as Ethernet (registered trademark). The client terminal 20 and the management server 10 may be connected to the network 30 by wire or wirelessly.

管理サーバ１０は、例えばサーバ機能を有する情報処理装置（コンピュータ）である。管理サーバ１０は、検索対象商品ＤＢ１０１（図１参照）を備え、ユーザによりクライアント端末２０から入力されたユーザクエリに基づいて検索対象商品ＤＢ１０１に対して検索を行ない、その検索結果をクライアント端末２０に応答する。 The management server 10 is, for example, an information processing device (computer) having a server function. The management server 10 includes a search target product DB 101 (see FIG. 1), searches the search target product DB 101 based on a user query input from the client terminal 20 by the user, and sends the search result to the client terminal 20. respond.

本実施形態においては、本情報処理システム１が提供する検索サービスとして、衣料品を検索対象商品として取り扱う商品検索システムである例について示す。 In this embodiment, as a search service provided by the information processing system 1, an example of a product search system that handles clothing as search target products will be described.

図４は実施形態の一例としての情報処理システム１における管理サーバ１０のハードウェア構成を例示する図である。 FIG. 4 is a diagram illustrating the hardware configuration of the management server 10 in the information processing system 1 as an example of the embodiment.

管理サーバ１０は、サーバ機能を備えた情報処理装置（コンピュータ）であり、図４に示すように、ＣＰＵ１１，メモリ１２，記憶装置１３およびネットワークインタフェース（Ｉ／Ｆ）１４を備える。 The management server 10 is an information processing device (computer) having a server function, and as shown in FIG.

また管理サーバ１０には、図４に示すように、検索対象商品ＤＢ１０１，商品ＤＢ属性項目語彙ＤＢ１０２，感性ベクトル定義辞書１０３および説明文語彙頻度情報ＤＢ１０４が備えられている。なお、これらの検索対象商品ＤＢ１０１，商品ＤＢ属性項目語彙ＤＢ１０２，感性ベクトル定義辞書１０３および説明文語彙頻度情報ＤＢ１０４についての説明は後述する。 As shown in FIG. 4, the management server 10 is provided with a search target product DB 101, a product DB attribute item vocabulary DB 102, an affective vector definition dictionary 103, and an explanation sentence vocabulary frequency information DB 104. FIG. The search target product DB 101, product DB attribute item vocabulary DB 102, sensitivity vector definition dictionary 103, and explanation sentence vocabulary frequency information DB 104 will be described later.

記憶装置１３は、ＨＤＤ（Hard Disk Drive），ＳＳＤ（Solid State Drive），ＳＣＭ（Storage Class Memory）等の記憶装置であって、種々のデータやプログラムを格納するものである。 The storage device 13 is a storage device such as a HDD (Hard Disk Drive), SSD (Solid State Drive), SCM (Storage Class Memory), etc., and stores various data and programs.

ネットワークインタフェース１４は、ネットワーク３０と通信可能に接続するインタフェースであり、例えば、ＬＡＮ（Local Area Network）インタフェースである。 The network interface 14 is an interface communicably connected to the network 30, and is, for example, a LAN (Local Area Network) interface.

メモリ１２はＲＯＭ（Read Only Memory）およびＲＡＭ（Random Access Memory）を含む記憶メモリである。メモリ１２のＲＯＭには、ＯＳ（Operating System）や検索処理にかかるソフトウェアプログラムやこのプログラム用のデータ類が書き込まれている。メモリ１２上のソフトウェアプログラムは、ＣＰＵ１１に適宜読み込まれて実行される。また、メモリ１２のＲＡＭは、一次記憶メモリあるいはワーキングメモリとして利用される。 The memory 12 is a storage memory including ROM (Read Only Memory) and RAM (Random Access Memory). The ROM of the memory 12 is written with an OS (Operating System), a software program for search processing, and data for this program. A software program on the memory 12 is appropriately read and executed by the CPU 11 . Also, the RAM of the memory 12 is used as a primary storage memory or a working memory.

ＣＰＵ１１は、制御ユニット（制御回路）や演算ユニット（演算回路），キャッシュメモリ（レジスタ群）等を内蔵する処理装置（プロセッサ）であり、種々の制御や演算を行なう。ＣＰＵ１１は、メモリ１２に格納されたＯＳやプログラムを実行することにより、種々の機能を実現する。本情報処理システム１においては、ＣＰＵ１１が感性検索プログラムを実行することで検索システム（商品検索システム）が実現される。 The CPU 11 is a processing unit (processor) containing a control unit (control circuit), an arithmetic unit (arithmetic circuit), a cache memory (register group), etc., and performs various controls and operations. The CPU 11 implements various functions by executing the OS and programs stored in the memory 12 . In the information processing system 1, a search system (product search system) is realized by the CPU 11 executing a sensitivity search program.

図５は実施形態の一例としての情報処理システム１の管理サーバ１０の機能構成を示す図である。 FIG. 5 is a diagram showing the functional configuration of the management server 10 of the information processing system 1 as an example of the embodiment.

ＣＰＵ１１は、メモリ１２に格納されたＯＳやプログラムを実行することにより、図５に例示する、検索対象商品ＤＢ１０１，商品ＤＢ属性項目語彙ＤＢ１０２，感性ベクトル定義辞書１０３および説明文語彙頻度情報ＤＢ１０４としての各機能を実現する。 By executing the OS and programs stored in the memory 12, the CPU 11 creates a search target product DB 101, a product DB attribute item vocabulary DB 102, an affective vector definition dictionary 103, and a description sentence vocabulary frequency information DB 104, which are illustrated in FIG. Realize each function.

また、ＣＰＵ１１は、メモリ１２に格納されたＯＳやプログラムを実行することにより、図５に例示する、商品ＤＢ属性項目語彙抽出部１１１，説明文語彙抽出部１１２，説明文語彙頻度計算部１１３，感性ベクトル計算部１１４，ユーザクエリ入力部１１５，入力テキスト解析部１１６，感性ベクトル展開語彙特定部１１０，ハイブリッド検索部１２０および検索結果表示部１２６としての機能を実現する。 Further, by executing the OS and programs stored in the memory 12, the CPU 11 executes the product DB attribute item vocabulary extraction unit 111, the explanation sentence vocabulary extraction unit 112, the explanation sentence vocabulary frequency calculation unit 113, The functions of the sensitivity vector calculation unit 114, the user query input unit 115, the input text analysis unit 116, the sensitivity vector expansion vocabulary identification unit 110, the hybrid search unit 120, and the search result display unit 126 are realized.

また、感性ベクトル展開語彙特定部１１０は、ＤＢ属性項目語彙特定部１１７，語彙頻度設定部１１８，感性ベクトル展開語彙特定部１１９，商品ＤＢ属性項目語彙ＤＢ１０２および説明文語彙頻度情報ＤＢ１０４としての機能を備える。 In addition, the sensitivity vector expansion vocabulary identification unit 110 functions as a DB attribute item vocabulary identification unit 117, a vocabulary frequency setting unit 118, a sensitivity vector expansion vocabulary identification unit 119, a product DB attribute item vocabulary DB 102, and a description sentence vocabulary frequency information DB 104. Prepare.

さらに、ハイブリッド検索部１２０は、検索制御部１２１，感性ベクトル展開部１２２，感性検索実行部１２３，キーワード検索実行部１２４および検索結果統合部１２５としての機能を備える。 Further, the hybrid search unit 120 has functions as a search control unit 121 , a sensitivity vector expansion unit 122 , a sensitivity search execution unit 123 , a keyword search execution unit 124 and a search result integration unit 125 .

検索対象商品ＤＢ１０１は、検索対象となる商品に関する情報を格納するデータベースである。 The search target product DB 101 is a database that stores information about products to be searched.

検索対象商品ＤＢ１０１には、商品に関する情報として、例えば、商品ＩＤ，カテゴリ属性とその項目，商品の説明文および商品に対する感性ベクトルが格納される。 The search target product DB 101 stores, as product information, product IDs, category attributes and their items, product descriptions, and product sensitivity vectors, for example.

商品ＩＤは、商品を一意に特定する識別情報であり、例えば、“ID = 0237”のように、英数字や記号の組み合わせで構成される情報が商品ＩＤとして予め商品毎に設定される。 The product ID is identification information that uniquely identifies the product. For example, information composed of a combination of alphanumeric characters and symbols such as “ID=0237” is set in advance for each product as the product ID.

カテゴリ属性は、商品に関する情報の種類である。例えば、「シャツの種類（シャツ）」，「色」，「柄」等の情報がカテゴリ属性として用いられる。項目は、カテゴリ属性の具体的な値である。 A category attribute is a type of information about a product. For example, information such as "type of shirt (shirt)", "color", and "pattern" is used as category attributes. An item is a concrete value of a category attribute.

以下に、カテゴリ属性と項目との具体的な組み合わせを例示する。 Specific combinations of category attributes and items are exemplified below.

（カテゴリ属性：項目）＝（シャツ：ドレスシャツ），（色：白），（柄：ストライプ），・・・
説明文は、商品を説明する文章である。例えば、「ブロード生地のゆったりとしたワイシャツ」という文章が用いられる。 (Category attribute: item) = (shirt: dress shirt), (color: white), (pattern: stripe), ...
The description is text that describes the product. For example, the sentence "a loose shirt made of broadcloth" is used.

感性ベクトルとは、検索対象（本実施形態では商品）が有するイメージ（印象）を、側面情報毎にベクトル（感性ベクトル）を用いて定量的に表す情報である。 A sensitivity vector is information that quantitatively expresses an image (impression) of a search target (product in this embodiment) using a vector (sensitivity vector) for each aspect information.

すなわち、感性ベクトルは、語彙について、その語彙から人が受ける印象を、１つ以上（本実施形態では３種類）の感性軸での数値（感性スコア，ベクトル値）で表す
側面情報は、「外見的特徴」、「容姿」、「性格」など、感性表現が表すイメージの種類を示す情報であり、既存のオントロジ等の予め定めた分類体系に従い、１つの感性表現に対して少なくとも１つの側面情報を付与するものとする。 That is, the kansei vector expresses the impression that a person receives from the vocabulary by numerical values (kansei score, vector value) on one or more (three types in this embodiment) kansei axes. It is information indicating the type of image represented by the emotional expression, such as "characteristics", "appearance", "personality", etc., and at least one side information for one emotional expression according to a predetermined classification system such as an existing ontology. shall be given.

なお、側面情報として日本語語彙大系の一般名詞意味属性体系のカテゴリ（高級な，柔らかな：「品質」、明るい：「性格」など）を利用してもよい。 As side information, the categories of the general noun semantic attribute system of the Japanese lexical overview (high grade, soft: "quality", bright: "personality", etc.) may be used.

感性軸は側面情報に対応するものとする。また、感性ベクトルを感性情報といってもよい。 It is assumed that the sensitivity axis corresponds to side information. Also, the sensitivity vector may be referred to as sensitivity information.

感性ベクトルは、予め定められた次元数を有するベクトル空間の各次元の軸（感性軸）に対して予め定められた値を有する。例えば、感性ベクトルの次元数を３とし、第１次元の軸＝トラディショナル、第２次元の軸＝カジュアル、第３次元の軸＝こなれ感、と定めているものとする。 The sensitivity vector has a predetermined value for each dimensional axis (sensitivity axis) of a vector space having a predetermined number of dimensions. For example, it is assumed that the number of dimensions of the sensitivity vector is 3, the first dimension axis=traditional, the second dimension axis=casual, and the third dimension axis=familiar.

すなわち、本例においては、シャツに関する側面情報として、「トラディショナル」，「カジュアル」および「こなれ感」を用いる。感性ベクトルは、これらの側面情報を感性軸として、それぞれを数値である感性スコアを用いて表す。 That is, in this example, "traditional," "casual," and "comfortable" are used as side information about shirts. The kansei vector expresses each of these side information as a kansei axis using a kansei score, which is a numerical value.

感性スコアとしては、例えば、１以下の数値を用いてもよい。例えば、よりトラディショナルであるほど感性スコアの値が大きいものとする。同様に、よりカジュアルであるほど感性スコアの値が大きく、よりこなれ感があるほど感性スコアの値が大きいものとする。 As the sensitivity score, for example, a numerical value of 1 or less may be used. For example, it is assumed that the more traditional, the higher the value of the sensitivity score. Similarly, it is assumed that the more casual the user is, the higher the value of the sensitivity score, and the more sophisticated the user is, the higher the value of the sensitivity score.

本実施形態においては、感性ベクトルを以下のように、感性スコアを表す数値の組み合わせで表す。
（トラディショナル，カジュアル，こなれ感）＝（0.6, 0.7, 0.2）
検索対象商品ＤＢ１０１においては、商品毎にこのような感性ベクトルを備える。 In this embodiment, the sensitivity vector is represented by a combination of numerical values representing sensitivity scores as follows.
(traditional, casual, mature) = (0.6, 0.7, 0.2)
The search target product DB 101 has such a sensitivity vector for each product.

なお、検索対象商品ＤＢ１０１において、感性ベクトルは、外部から与えられた（例えば人手で定義された）ものを格納してもよい。また、後述する感性ベクトル定義辞書１０３を用いて感性ベクトル計算部１１４が算出した値を格納してもよい。 In addition, in the search target product DB 101, the sensitivity vector given from the outside (for example, manually defined) may be stored. Also, a value calculated by the sensitivity vector calculation unit 114 using the sensitivity vector definition dictionary 103, which will be described later, may be stored.

なお、感性ベクトルは既知の種々の手法を用いて実現することができ、その詳細な説明は省略する。 Note that the sensitivity vector can be realized using various known techniques, and detailed description thereof will be omitted.

感性ベクトル定義辞書１０３は、語彙に対して、１つ以上（本実施形態では、トラディショナル，カジュアルおよびこなれ感の３種類）の感性軸（側面情報）に対応する数値（感性スコア，ベクトル値）の組み合わせを対応付けて構成されている。 The kansei vector definition dictionary 103 stores numerical values (kansei scores, vector values) corresponding to one or more kansei axes (side information) (three types of traditional, casual, and sophisticated in this embodiment) for a vocabulary. It is configured by associating combinations.

感性ベクトル定義辞書１０３は、後述の如く、感性ベクトル計算部１１４が、検索対象商品ＤＢ１０１の各商品説明文に対して感性ベクトルを計算する際や、ユーザクエリ中に含まれる語彙（感性ベクトル展開語彙）に対して感性ベクトルを計算するために用いられる。 As will be described later, the sensitivity vector definition dictionary 103 is used when the sensitivity vector calculation unit 114 calculates the sensitivity vector for each product description in the search target product DB 101, and also includes vocabulary included in user queries (sensitivity vector expansion vocabulary). ) is used to calculate the sensitivity vector.

図６は実施形態の一例としての情報処理システム１における感性ベクトル定義辞書１０３を例示する図である。 FIG. 6 is a diagram exemplifying the emotion vector definition dictionary 103 in the information processing system 1 as an example of the embodiment.

図６に例示する感性ベクトル定義辞書１０３は、語彙に対して、トラディショナル，カジュアルおよびこなれ感の３つの感性軸の感性スコアを定義することで構成されている。 The sensitivity vector definition dictionary 103 illustrated in FIG. 6 is configured by defining sensitivity scores on three sensitivity axes of traditional, casual, and familiar with respect to vocabulary.

例えば、語彙「ドレスシャツ」について、感性軸「トラディショナル」の感性スコアは0.9であり、感性軸「カジュアル」の感性スコアは0.1であり、感性軸「こなれ感」の感性スコアは0.1である。 For example, for the vocabulary "dress shirt", the sensitivity score for the sensitivity axis "traditional" is 0.9, the sensitivity score for the sensitivity axis "casual" is 0.1, and the sensitivity score for the sensitivity axis "feeling mature" is 0.1.

感性ベクトル計算部１１４は、検索対象商品ＤＢ１０１の各商品説明文に対して、感性ベクトル定義辞書１０３を参照しながら、各商品ＩＤの商品の説明文に対応する感性ベクトルを計算する。感性ベクトル計算部１１４は、例えば、感性ベクトル定義辞書１０３に登録された語彙が説明文中に出現する度に、各感性軸（トラディショナル，カジュアル，こなれ感）毎に感性スコアを加算し、最後に各感性スコアの平均値を並べた値を感性ベクトルとして算出する。 The sensitivity vector calculation unit 114 refers to the sensitivity vector definition dictionary 103 for each product description in the search target product DB 101 to calculate a sensitivity vector corresponding to the product description of each product ID. For example, each time a vocabulary registered in the Kansei vector definition dictionary 103 appears in a description, the Kansei vector calculation unit 114 adds a Kansei score for each Kansei axis (traditional, casual, familiar). A value obtained by arranging the average values of the sensitivity scores is calculated as a sensitivity vector.

以下、語彙に対して感性ベクトルの算出を行なうことを感性ベクトルに展開するという場合がある。 In the following, there is a case where calculation of sensitivity vectors for a vocabulary is expanded into sensitivity vectors.

また、感性ベクトル計算部１１４は、ユーザクエリに含まれる語彙に対しても、同様の手法で感性ベクトルを計算する。 The sensitivity vector calculation unit 114 also calculates sensitivity vectors for the vocabulary included in the user query by the same method.

商品ＤＢ属性項目語彙ＤＢ１０２は、検索対象商品ＤＢ１０１中において、カテゴリ属性名として利用されている語彙および項目名として利用されている語彙の一覧を格納するデータベースである。 The product DB attribute item vocabulary DB 102 is a database that stores a list of vocabulary used as category attribute names and vocabulary used as item names in the search target product DB 101 .

以下、検索対象商品ＤＢ１０１においてカテゴリ属性名や項目名として用いられている語彙を商品ＤＢ属性項目語彙という場合がある。すなわち、商品ＤＢ属性項目語彙ＤＢ１０２は商品ＤＢ属性項目語彙を格納する。 Hereinafter, vocabulary used as category attribute names and item names in the search target product DB 101 may be referred to as product DB attribute item vocabulary. That is, the product DB attribute item vocabulary DB 102 stores product DB attribute item vocabulary.

商品ＤＢ属性項目語彙抽出部１１１は、検索対象商品ＤＢ１０１から、カテゴリ属性名と項目名として利用されている語彙の一覧を抽出しリスト化することで、商品ＤＢ属性項目語彙ＤＢ１０２を作成する。商品ＤＢ属性項目語彙抽出部１１１による商品ＤＢ属性項目語彙ＤＢ１０２の作成は、例えば、検索システムの初期設定時に行なってもよく、また、検索対象商品ＤＢ１０１の更新等が行なわれる度に行なってもよい。 The product DB attribute item vocabulary extraction unit 111 creates a product DB attribute item vocabulary DB 102 by extracting a list of vocabularies used as category attribute names and item names from the search target product DB 101 and listing them. The creation of the product DB attribute item vocabulary DB 102 by the product DB attribute item vocabulary extraction unit 111 may be performed, for example, when the search system is initialized, or may be performed each time the search target product DB 101 is updated. .

説明文語彙頻度情報ＤＢ１０４は、検索対象商品ＤＢ１０１に格納されている商品の各説明文中に出現する各語彙の出現頻度情報を格納するデータベースである。 The explanation sentence vocabulary frequency information DB 104 is a database that stores appearance frequency information of each vocabulary appearing in each explanation sentence of the product stored in the search target product DB 101 .

説明文語彙抽出部１１２は、検索対象商品ＤＢ１０１に格納されている商品の各説明文に対して、例えば形態素解析処理を行なうことで、説明文中に利用されている語彙を抽出しリスト（語彙リスト）を作成する。説明文語彙抽出部１１２は、説明文中において連続する名詞形態素を一つにまとめる複合名詞化の処理や、動詞や形容詞などの活用形を終止形に揃えるなどの整形処理を後処理として行なってもよい。この後処理により、頻度情報の精度を上げることができる。 The explanatory text vocabulary extraction unit 112 performs, for example, morphological analysis processing on each explanatory text of the product stored in the search target product DB 101, thereby extracting the vocabulary used in the description and creating a list (vocabulary list). ). The descriptive text vocabulary extraction unit 112 may also perform post-processing such as compound noun conversion processing that combines consecutive noun morphemes in the descriptive text, and shaping processing such as aligning conjugations of verbs and adjectives into final forms. good. This post-processing can improve the accuracy of the frequency information.

説明文語彙抽出部１１２による説明文語彙頻度情報ＤＢ１０４の作成は、例えば、検索システムの初期設定時に行なってもよく、また、検索対象商品ＤＢ１０１の更新等が行なわれる度に行なってもよい。 Creation of the explanatory sentence vocabulary frequency information DB 104 by the explanatory sentence vocabulary extraction unit 112 may be performed, for example, at the initial setting of the search system, or may be performed each time the search target product DB 101 is updated.

説明文語彙頻度計算部１１３は、説明文語彙抽出部１１２によって抽出された各語彙の出現頻度を計算する。この語彙の出現頻度の計算は既知の手法を用いることで実現でき、その説明は省略する。出現頻度の計算後、説明文語彙頻度計算部１１３は、最大頻度で各頻度を除算することで正規化を行なう。説明文語彙頻度計算部１１３は、正規化された頻度情報を説明文語彙頻度情報ＤＢ１０４に登録する。 The explanatory sentence vocabulary frequency calculation unit 113 calculates the appearance frequency of each vocabulary extracted by the explanatory sentence vocabulary extraction unit 112 . The calculation of the appearance frequency of this vocabulary can be realized by using a known technique, and the explanation thereof will be omitted. After calculating the frequency of occurrence, the explanatory sentence vocabulary frequency calculator 113 normalizes by dividing each frequency by the maximum frequency. The explanatory sentence vocabulary frequency calculation unit 113 registers the normalized frequency information in the explanatory sentence vocabulary frequency information DB 104 .

ユーザクエリ入力部１１５は、ユーザが入力するユーザクエリを受け付ける。例えば、図３に例示するウェブページにおいて、ユーザがインタフェース画面５０１のユーザクエリ入力ボックス５０２から入力した情報がユーザクエリ入力部１１５に入力される。ユーザクエリ入力部１１５は、クエリを受け付ける入力部の一例である。 User query input unit 115 receives a user query input by a user. For example, in the web page illustrated in FIG. 3 , information input by the user from the user query input box 502 of the interface screen 501 is input to the user query input unit 115 . The user query input unit 115 is an example of an input unit that receives queries.

入力テキスト解析部１１６は、ユーザクエリ入力部１１５が受け付けたユーザクエリを形態素解析する。入力テキスト解析部１１６は、説明文語彙抽出部１１２による処理と同様の手法を用いて、ユーザクエリに含まれている語彙を抽出する。入力テキスト解析部１１６は、ユーザクエリから抽出した語彙について、連続する名詞形態素を一つにまとめる複合名詞化の処理や、動詞や形容詞などの活用形を終止形に揃えるなどの整形処理を後処理として行なってもよい。 The input text analysis unit 116 morphologically analyzes the user query received by the user query input unit 115 . The input text analysis unit 116 extracts the vocabulary included in the user query using the same method as the processing by the explanatory text vocabulary extraction unit 112 . The input text analysis unit 116 post-processes the vocabulary extracted from the user query, such as compound noun processing that combines continuous noun morphemes into one, and shaping processing such as aligning conjugations such as verbs and adjectives into final forms. You can do it as

感性ベクトル展開語彙特定部１１９は、ユーザクエリに含まれる各語彙について、感性ベクトルに展開するか否かの特定を行なう。すなわち、感性ベクトル展開語彙特定部１１９は、後述するハイブリッド検索部１２０において感性ベクトルでマッチングすべき語彙と、キーワードでマッチングすべき語彙とを特定し、感性ベクトルへの展開の順位付けを行なう。 The sensitivity vector expansion vocabulary specifying unit 119 specifies whether or not to expand each vocabulary included in the user query into sensitivity vectors. That is, the sensitivity vector expansion vocabulary specifying unit 119 specifies vocabulary to be matched by sensitivity vectors and vocabulary to be matched by keywords in the hybrid search unit 120, which will be described later, and ranks the expansion into sensitivity vectors.

ＤＢ属性項目語彙特定部１１７は、商品ＤＢ属性項目語彙ＤＢ１０２を参照して、ユーザクエリに含まれる語彙が検索対象商品ＤＢ１０１においてカテゴリ属性名や項目名として用いられているか否かを判断する。ユーザクエリから抽出された語彙が商品ＤＢ属性項目語彙ＤＢ１０２に登録されている語彙と一致した場合に、ＤＢ属性項目語彙特定部１１７は、当該語彙を商品ＤＢ属性項目語彙と判断する。 The DB attribute item vocabulary identification unit 117 refers to the product DB attribute item vocabulary DB 102 to determine whether or not the vocabulary included in the user query is used as the category attribute name or item name in the search target product DB 101 . When the vocabulary extracted from the user query matches the vocabulary registered in the product DB attribute item vocabulary DB 102, the DB attribute item vocabulary identification unit 117 determines that the vocabulary is the product DB attribute item vocabulary.

すなわち、ＤＢ属性項目語彙特定部１１７は、商品ＤＢ属性項目語彙ＤＢ１０２を参照することで、ユーザクエリに含まれる１つ以上の語彙の中において、商品ＤＢ属性項目語彙に該当する語彙（第２語）を特定する。 That is, by referring to the product DB attribute item vocabulary DB 102, the DB attribute item vocabulary identification unit 117 identifies one or more terms included in the user query that correspond to the product DB attribute item vocabulary (second term ).

語彙頻度設定部１１８は、ユーザクエリに含まれる語彙であって、商品ＤＢ属性項目語彙に該当しない語彙について、説明文語彙頻度情報ＤＢ１０４を参照して、各語彙の出現頻度（出現頻度情報）を特定する。なお、出現頻度は既知の手法で取得することができ、その説明は省略する。 The vocabulary frequency setting unit 118 refers to the explanatory text vocabulary frequency information DB 104 for vocabulary that is included in the user query and does not correspond to the product DB attribute item vocabulary, and sets the appearance frequency (appearance frequency information) of each vocabulary. Identify. Note that the appearance frequency can be obtained by a known technique, and the explanation thereof is omitted.

頻度情報を正規化した値を感性ベクトル展開のための重みスコアとして各語に付与してもよい。 A value obtained by normalizing frequency information may be given to each word as a weight score for Kansei vector expansion.

感性ベクトル展開語彙特定部１１９は、ユーザクエリに含まれる語彙について、感性ベクトルによるマッチングで検索を行なうものであるか、キーワードマッチングによる検索を行なうものであるかの推定（切り分け）を行なう。感性ベクトル展開語彙特定部１１９は、第１推定部の一例である。 The sensitivity vector expansion vocabulary identification unit 119 estimates (separates) whether the vocabulary included in the user query is to be searched by matching by sensitivity vectors or by keyword matching. Kansei vector expansion vocabulary identifying unit 119 is an example of a first estimating unit.

感性ベクトル展開語彙特定部１１９は、入力テキスト解析部１１６によってユーザクエリから抽出された各語彙について、説明文語彙頻度情報ＤＢ１０４（集合Ｘ）を参照して、各語彙の、検索対象商品ＤＢ１０１に格納されている商品の各説明文中における出現頻度情報を取得する。 For each vocabulary extracted from the user query by the input text analysis unit 116, the emotion vector expansion vocabulary identification unit 119 refers to the explanatory sentence vocabulary frequency information DB 104 (set X), and stores the vocabulary in the search target product DB 101. Acquire the appearance frequency information in each description of the product.

感性ベクトル展開語彙特定部１１９は、ユーザクエリから抽出された各語彙の出現頻度を所定の閾値と比較することで、当該語彙が（１）曖昧性を含む表現として感性ベクトルに展開可能な語彙であるか、（２）ユーザが外せない条件を指定する語彙であるかを推定する。 The sensitivity vector expansion vocabulary identification unit 119 compares the frequency of appearance of each vocabulary extracted from the user query with a predetermined threshold value, and determines that the vocabulary is (1) a vocabulary that can be expanded to sensitivity vectors as an expression containing ambiguity. or (2) it is a vocabulary that specifies a condition that the user cannot omit.

以下、（１）曖昧性を含む表現として感性ベクトルに展開可能な語彙を、感性ベクトル展開語彙という場合がある。この感性ベクトル展開語彙が、クエリに含まれる複数の語（語彙）において、曖昧性を含む表現（感性表現）として感性情報（感性ベクトル）に展開可能な複数の第１語に相当する。 Hereinafter, (1) a vocabulary that can be expanded into an affective vector as an expression containing ambiguity is sometimes referred to as an affective vector expanded vocabulary. This sensitivity vector expansion vocabulary corresponds to a plurality of first words that can be developed into sensitivity information (sensitivity vector) as an expression including ambiguity (sensitivity expression) among the plurality of words (vocabulary) included in the query.

また、以下、（２）ユーザが外せない条件を指定する語彙を、必須条件語彙という場合がある。この必須条件語彙が、クエリに含まれる語彙のうち第１語以外の第２語に相当する。 In addition, hereinafter, (2) a vocabulary specifying a condition that a user cannot remove may be referred to as an essential condition vocabulary. This essential condition vocabulary corresponds to the second word other than the first word among the vocabulary included in the query.

感性ベクトル展開語彙特定部１１９は、ユーザクエリから抽出された語彙の出現頻度が所定の閾値よりも高い場合には、当該語彙を感性ベクトル展開語彙であると推定する。 If the appearance frequency of the vocabulary extracted from the user query is higher than a predetermined threshold, the sensitivity vector expanded vocabulary identification unit 119 estimates that the vocabulary is the sensitivity vector expanded vocabulary.

また、感性ベクトル展開語彙特定部１１９は、ユーザクエリから抽出された語彙の出現頻度が所定の閾値以下の場合には、当該語彙を必須条件語彙であると推定する。 Further, when the appearance frequency of a vocabulary extracted from a user query is equal to or less than a predetermined threshold value, the sensitivity vector expansion vocabulary identifying unit 119 estimates that the vocabulary is a prerequisite vocabulary.

さらに、感性ベクトル展開語彙特定部１１９は、ユーザクエリに含まれる語彙であって、ＤＢ属性項目語彙特定部１１７によって商品ＤＢ属性項目語彙に該当すると特定された語彙を必須条件語彙であると推定する。 Furthermore, the sensitivity vector expansion vocabulary identification unit 119 presumes that the vocabulary included in the user query and identified by the DB attribute item vocabulary identification unit 117 as corresponding to the product DB attribute item vocabulary is the essential condition vocabulary. .

感性ベクトル展開語彙は、後述するハイブリッド検索部１２０において、感性検索実行部１２３による感性ベクトル検索が行なわれる。一方、必須条件語彙は、ハイブリッド検索部１２０において、キーワード検索実行部１２４によるキーワード検索が行なわれる。 The sensitivity vector expanded vocabulary is subjected to sensitivity vector search by sensitivity search execution section 123 in hybrid search section 120, which will be described later. On the other hand, in the hybrid search unit 120, keyword search by the keyword search execution unit 124 is performed for the essential condition vocabulary.

また、感性ベクトル展開語彙特定部１１９は、感性ベクトルに展開する優先順位が高い語彙として、設定された頻度情報が最も大きい語彙から順にソートし、最後に、商品ＤＢ属性項目語彙に該当する語彙が並ぶように並び替える。感性ベクトル展開語彙特定部１１９は、設定部の一例である。 In addition, the sensitivity vector expansion vocabulary identification unit 119 sorts the vocabulary in descending order of the set frequency information as the vocabulary with the highest priority to be expanded to the sensitivity vector, and finally, the vocabulary corresponding to the product DB attribute item vocabulary. Arrange them so that they line up. The sensitivity vector expansion vocabulary specifying unit 119 is an example of a setting unit.

例えば、ユーザクエリに含まれる語彙において、感性ベクトル展開語彙が複数ある場合には、これらの複数の感性ベクトル展開語彙において、検索対象商品ＤＢ１０１における出現頻度が高い語彙を、感性ベクトルに展開する優先順位を高く設定する。 For example, if there are a plurality of emotion vector-expanded vocabularies in the vocabulary included in the user query, the order of priority for expanding the vocabulary with a high frequency of occurrence in the search target product DB 101 into emotion vectors among the plurality of emotion vector-expanded vocabularies. set higher.

これにより、後述するハイブリッド検索部１２０において、検索対象商品ＤＢ１０１の説明文中における出現頻度が高い語彙が優先して感性ベクトル検索されることとなる。 As a result, the hybrid search unit 120, which will be described later, preferentially searches for vocabularies with a high frequency of appearance in the descriptions of the search target product DB 101 for sensitivity vector search.

なお、優先順位を設定することを重み付けを行なうと表現してもよい。重みが大きい（重い）ことが優先順位が高いことに相当し、重みが小さい（軽い）ことが優先順位が低いことに相当する。 Note that setting the priority order may be expressed as weighting. A large (heavy) weight corresponds to a high priority, and a small (light) weight corresponds to a low priority.

ハイブリッド検索部１２０は、優先順位が高い語彙から順に感性ベクトルに展開し、展開されていないユーザクエリ語彙はキーワードマッチングで検索する。また、展開された語彙に対しては、感性ベクトルによるマッチングで検索を行なう。その後、これらの両者の検索結果の論理積（ＡＮＤ）を取って、最終的な検索結果とする。 The hybrid search unit 120 expands the vocabularies into sensitivity vectors in descending order of priority, and searches user query vocabularies that have not been expanded by keyword matching. Also, the expanded vocabulary is searched by matching using sensitivity vectors. After that, the logical product (AND) of these two search results is taken as the final search result.

ハイブリッド検索部１２０は、この検索を、優先順位が高い語彙から順に感性ベクトルに展開しながら、展開する語彙が無くなるまで続ける。 The hybrid search unit 120 continues this search until there are no more words to expand while expanding the vocabularies into sensitivity vectors in descending order of priority.

検索制御部１２１は、特定の順位までの語彙を感性ベクトルに展開した上で、感性ベクトルによる検索を実行し、キーワードによる全文検索を実行し、結果を統合処理するまでをワンセットとする一連の処理を、感性ベクトル展開の優先順位が上位の語彙から順に感性ベクトルに展開しながら、繰り返し行なう。 The search control unit 121 expands the vocabulary up to a specific rank into sensitivity vectors, executes a search using the sensitivity vectors, executes a full-text search using keywords, and integrates the results. The processing is repeated while developing the vocabularies into sensitivity vectors in order from the vocabulary having the highest priority of sensitivity vector development.

検索制御部１２１は、感性ベクトル展開のための重みが最も大きい物から順に感性ベクトルへ展開し、展開する度に全文検索を行なう。 The search control unit 121 develops the sensitivity vector in descending order of the weight for the sensitivity vector development, and performs a full-text search every time it is developed.

感性ベクトル展開部１２２は、検索制御部１２１が指定する順位までの語彙の感性ベクトルを、感性ベクトル定義辞書１０３を参照しながら感性ベクトル計算部１１４により計算し、求められた各語彙の感性ベクトルの平均値を求める。 The sensitivity vector expansion unit 122 calculates the sensitivity vectors of the vocabularies up to the rank specified by the search control unit 121 by the sensitivity vector calculation unit 114 while referring to the sensitivity vector definition dictionary 103, and converts the sensitivity vectors of the obtained vocabularies. Find the average value.

感性検索実行部１２３は、感性ベクトル展開部１２２によって算出された感性ベクトルの平均値に一定の閾値以上の類似度で類似する感性ベクトルを持った商品説明文（＝商品ＩＤ）を抽出する。類似度の判定は、コサイン距離等、既知の種々の手法を用いて行なうことができる。感性検索実行部１２３は感性検索部の一例である。 The emotion search execution unit 123 extracts a product description (=product ID) having an emotion vector similar to the average value of the emotion vectors calculated by the emotion vector development unit 122 with a degree of similarity greater than or equal to a certain threshold. Similarity determination can be performed using various known techniques such as cosine distance. The sensitivity search execution unit 123 is an example of a sensitivity search unit.

キーワード検索実行部１２４は、ユーザクエリに含まれる語彙のうち、感性ベクトルに展開されていない語彙をキーワードとして、検索対象商品ＤＢ１０１の説明文を対象に全文検索（キーワード検索）を実行する。キーワード検索実行部１２４は、キーワード検索部の一例である。 The keyword search execution unit 124 performs a full-text search (keyword search) on the descriptions of the search target product DB 101 using, as keywords, the vocabulary included in the user query that has not been expanded into the sensitivity vector. The keyword search execution unit 124 is an example of a keyword search unit.

検索結果統合部１２５は、感性検索実行部１２３による検索結果とキーワード検索実行部１２４による検索結果との論理積となる結果を抽出し、感性検索実行部１２３の類似度の高い結果から並べる。 The search result integration unit 125 extracts logical product results of the search results of the emotional search execution unit 123 and the search results of the keyword search execution unit 124, and arranges the results in descending order of similarity of the sensitivity search execution unit 123.

検索結果統合部１２５は、検索制御部１２１による２回目以降の検索である場合には、前回までの検索結果に後続するように（次になるように）、新しく検索した検索結果（今回の検索結果）を追加する。 In the case of the second or later search by the search control unit 121, the search result integration unit 125 arranges the newly searched search result (the current search result).

検索結果表示部１２６は、検索結果統合部１２５によって統合された検索結果一覧をユーザに表示する。検索結果統合部１２５および検索結果表示部１２６は、出力処理部の一例である。 The search result display unit 126 displays the search result list integrated by the search result integration unit 125 to the user. The search result integration unit 125 and the search result display unit 126 are examples of an output processing unit.

検索結果を上位から順に示すことで、あたかも、必須条件語彙に従って、検索結果をソートされた結果を得る。 By displaying the search results in descending order, it is as if the search results were sorted according to the essential condition vocabulary.

また、検索結果表示部１２６は、展開・検索を行なった順に、検索結果をユーザに提示することで、あたかも、曖昧性を含む表現の部分は感性的に合致する他の表現で、且つ、ユーザの所望する具体的な条件はそのままその条件に合致した結果がでるように、検索結果をソートされたようにユーザに提示する。なお、ユーザへの検索結果の表示は、インタラクティブに時系列で追加表示してもよく、また、一括表示しても良い。 In addition, the search result display unit 126 presents the search results to the user in the order of development and search, so that the part of the expression containing ambiguity is expressed as if it were another expression that matches the sensibility of the user. The search results are presented to the user in a sorted manner so that the results matching the desired specific conditions are obtained as they are. The display of the search results to the user may be interactively additionally displayed in chronological order, or may be collectively displayed.

例えば、検索結果表示部１２６は、図３に例示した表示画面の検索結果表示画面５０３に表示させる画像データを作成する。 For example, the search result display unit 126 creates image data to be displayed on the search result display screen 503 of the display screen illustrated in FIG.

また、本情報処理システム１においては、ユーザクエリに含まれる全ての語彙に対して感性ベクトル検索を行なってもよい。 Further, in the information processing system 1, the sensitivity vector search may be performed for all vocabulary included in the user query.

すなわち、ハイブリッド検索部１２０において、検索制御部１２１は、ユーザクエリに含まれる全ての語彙のうち、感性ベクトル展開語彙特定部１１９により必須条件語彙として判定された語彙を感性ベクトル展開語彙として再度の検索を行なってもよい。 That is, in the hybrid search unit 120, the search control unit 121 searches again for the vocabulary determined as the essential condition vocabulary by the emotion vector expansion vocabulary identification unit 119 among all the vocabulary included in the user query as the emotion vector expansion vocabulary. may be performed.

以下、ユーザクエリに含まれる全ての語彙のうち、感性ベクトル検索が行なわれていない語彙を未展開の語彙という場合がある。 Hereinafter, among all the vocabularies included in the user query, the vocabularies that have not undergone the emotion vector search may be referred to as unexpanded vocabularies.

検索制御部１２１は、未展開の語彙がある場合には、ユーザクエリに含まれる全ての語彙に対して感性ベクトル検索が行なわれるまで（未展開の語彙がなくなるまで）、未展開の語彙を１つずつ感性ベクトル展開語彙に切り替えながら、同じユーザクエリについて再検索を繰り返し行なってもよい。 If there is an unexpanded vocabulary, the search control unit 121 sets the unexpanded vocabulary to 1 until the sensitivity vector search is performed for all the vocabularies included in the user query (until there are no more unexpanded vocabularies). Re-searching for the same user query may be repeated while switching to the Kansei vector expansion vocabulary one by one.

図７は実施形態の一例としての情報処理システム１における同一のユーザクエリに対する再検索処理を説明するためのフローチャート（ステップＡ１～Ａ５）である。 FIG. 7 is a flowchart (steps A1 to A5) for explaining re-search processing for the same user query in the information processing system 1 as an example of the embodiment.

ステップＡ１において、ユーザがユーザクエリを入力する。本例においては、「オフでも使えるブロードのワイシャツ」とのテキスト文がユーザクエリである。 At step A1, a user enters a user query. In this example, the user query is a text sentence "a broad shirt that can be used even when off".

ステップＡ２において、感性ベクトル展開語彙特定部１１９がユーザクエリに含まれる全ての語彙について、感性ベクトル展開語彙（Ａ），必須条件語彙（Ｂ）および商品ＤＢ属性項目語彙の推定をそれぞれ行なう。 In step A2, the emotion vector-expanded vocabulary identification unit 119 estimates the emotion vector-expanded vocabulary (A), essential condition vocabulary (B), and product DB attribute item vocabulary for all the vocabularies included in the user query.

ステップＡ１において入力されたユーザクエリにおいて、検索対象商品ＤＢ１０１の説明文における語彙「オフでも使える」および「ブロードの」の各出現頻度は、0.2，0.01である。感性ベクトル展開語彙特定部１１９は、これらの出現頻度を予め設定された閾値と比較することで、語彙「オフでも使える」を感性ベクトル展開語彙と推定し、語彙「ブロードの」を必須条件語彙と推定する。また、語彙「ワイシャツ」については、ＤＢ属性項目語彙特定部１１７により商品ＤＢ属性項目語彙であると判断されたものとする。 In the user query input in step A1, the frequencies of appearance of the words "can be used even when turned off" and "broad" in the description of the search target product DB 101 are 0.2 and 0.01, respectively. The affective vector expansion vocabulary identification unit 119 compares these appearance frequencies with a preset threshold value to estimate the vocabulary "can be used even when turned off" as the affective vector expansion vocabulary, and the vocabulary "broad" as the essential condition vocabulary. presume. It is also assumed that the DB attribute item vocabulary specifying unit 117 has determined that the vocabulary "shirt" is the product DB attribute item vocabulary.

ステップＡ３において、ユーザクエリに基づく１回目の検索が実行される。
感性ベクトル展開部１２２は、語彙「オフでも使える」を感性ベクトル検索の対象とし、語彙「ブロードの」および語彙「ワイシャツ」をキーワード検索の対象とする。 At step A3, a first search based on the user query is performed.
The sensitivity vector development unit 122 searches for the vocabulary "can be used even when off" for sensitivity vector search, and searches for the vocabulary "broad" and "shirt" as keywords.

感性ベクトル展開部１２２は、感性ベクトル展開語彙と推定された語彙「オフでも使える」について、感性ベクトルに展開する。すなわち、感性ベクトル展開部１２２は語彙「オフでも使える」を感性ベクトル計算部１１４によって感性ベクトルに展開させる。 The sensitivity vector expansion unit 122 expands the vocabulary "can be used even when turned off" estimated as the sensitivity vector expansion vocabulary into sensitivity vectors. That is, the sensitivity vector developing unit 122 causes the sensitivity vector calculation unit 114 to develop the vocabulary "can be used even when off" into sensitivity vectors.

語彙「オフでも使える」を感性ベクトルに展開することで、感性ベクトルとして（トラディショナル，カジュアル，こなれ感）=（ 0.6, 0.7, 0.2 ）が得られたものとする。 It is assumed that (traditional, casual, mature) = (0.6, 0.7, 0.2) is obtained as a sensitivity vector by expanding the vocabulary "can be used even when off" to a sensitivity vector.

感性検索実行部１２３は、この得られた感性ベクトル（ 0.6, 0.7, 0.2 ）を用いて検索対象商品ＤＢ１０１に対する検索（感性ベクトル検索）を行なう（１回目の検索）。これにより、感性ベクトルの平均値に一定の閾値以上の類似度で類似する感性ベクトルを持った商品説明文（＝商品ＩＤ）が抽出される。 The sensitivity search execution unit 123 uses the obtained sensitivity vector (0.6, 0.7, 0.2) to search the search target product DB 101 (sensitivity vector search) (first search). As a result, the item description (=item ID) having an affective vector similar to the average value of the affective vector with a degree of similarity equal to or higher than a certain threshold is extracted.

また、キーワード検索実行部１２４が、検索対象商品ＤＢ１０１に対して、語彙「ブロードの」および「ワイシャツ」をキーワードとするキーワード検索を行なう。 In addition, the keyword search execution unit 124 performs a keyword search on the search target product DB 101 using the words “broad” and “shirt” as keywords.

検索結果統合部１２５は、これらの感性検索実行部１２３による検索結果とキーワード検索実行部１２４による検索結果とを統合する。検索結果表示部１２６は、この統合された検索結果をユーザに提示する。 The search result integration unit 125 integrates the search results from the emotional search execution unit 123 and the search results from the keyword search execution unit 124 . The search result display unit 126 presents the integrated search results to the user.

図７に示す例においては、ステップＡ３における１回目の検索においては、以下の順で検索結果が出力される。 In the example shown in FIG. 7, in the first search in step A3, search results are output in the following order.

ドレスシャツＸ（ブロード生地のゆったりとしたワイシャツ）
ドレスシャツＹ（着心地の良いブロード生地のワイシャツ）
ドレスシャツＺ（こなれ感のあるワイシャツ。ブロード生地が特徴。） Dress shirt X (a loose-fitting broadcloth dress shirt)
Dress shirt Y (comfortable broadcloth dress shirt)
Dress shirt Z (dress shirt with a sense of familiarity. Features broad fabric.)

この１回目の検索結果においては、必須条件語彙および商品ＤＢ属性項目語彙についてはキーワード検索を実行することで、ユーザが外せないと考える条件が満たされた検索結果を得ることができる。 By performing a keyword search on the essential condition vocabulary and the product DB attribute item vocabulary in the first search result, it is possible to obtain a search result that satisfies the conditions that the user considers indispensable.

検索結果表示部１２６は、例えば、図３に例示したウェブページの検索結果表示画面５０３に、上述の如く示される順序で検索結果を提示する。 The search result display unit 126 presents the search results in the above-described order on the search result display screen 503 of the web page illustrated in FIG. 3, for example.

また、検索結果表示部１２６は、検索結果表示画面５０３へ検索結果を表示させるとともに、例えば、「このような検索結果が得られました」のようなメッセージを提示してもよい。 In addition, the search result display unit 126 may display the search results on the search result display screen 503 and present a message such as "such search results have been obtained", for example.

ユーザは、検索結果表示画面５０３に表示された検索結果を見て満足できない場合には、インタフェース画面５０１において、例えば、「他には？」のような他の検索結果の表示を求める入力を行なってもよい。 If the user is not satisfied with the search results displayed on the search result display screen 503, the user makes an input requesting display of other search results, such as "What else?", on the interface screen 501. may

これにより、ステップＡ４において、同一のユーザクエリに基づく２回目の検索（再検索）が行なわれる。 As a result, in step A4, a second search (research) based on the same user query is performed.

同一ユーザクエリに基づく２回目の検索において、検索制御部１２１は、ユーザクエリに含まれる未展開の語彙であって、必須条件語彙であると判断された語彙のうち商品ＤＢ属性項目語彙でない語彙（本実施形態では「ブロードの」）を優先して、感性ベクトル検索の対象とする。 In the second search based on the same user query, the search control unit 121 selects the unexpanded vocabulary included in the user query that is determined to be the essential condition vocabulary and is not the product DB attribute item vocabulary ( In this embodiment, "broad") is preferentially targeted for the sensitivity vector search.

感性ベクトル展開部１２２は、語彙「ブロードの」および「オフでも使える」を感性ベクトル検索の対象とし、語彙「ワイシャツ」をキーワード検索の対象とする。 The affective vector expansion unit 122 searches for the words “broad” and “can be used even when off” for affective vector search, and searches for the word “shirt” as a keyword.

感性ベクトル展開部１２２は語彙「ブロードの」および語彙「オフでも使える」を感性ベクトル計算部１１４によって感性ベクトルに展開させる。 The sensitivity vector developing unit 122 develops the vocabulary “broad” and the vocabulary “can be used even when off” into sensitivity vectors by the sensitivity vector calculation unit 114 .

語彙「ブロードの」および語彙「オフでも使える」を感性ベクトルに展開することで、感性ベクトルとして（トラディショナル，カジュアル，こなれ感）=（ 0.7, 0.5, 0.2 ）が得られたものとする。 It is assumed that (traditional, casual, mature) = (0.7, 0.5, 0.2) is obtained as a sensitivity vector by developing the vocabulary "broad" and the vocabulary "can be used even off" into sensitivity vectors.

感性検索実行部１２３は、この得られた感性ベクトル（ 0.7, 0.5, 0.2 ）を用いて検索対象商品ＤＢ１０１に対する検索（感性ベクトル検索）を行なう（２回目の検索）。これにより、感性ベクトルの平均値に一定の閾値以上の類似度で類似する感性ベクトルを持った商品説明文（＝商品ＩＤ）が抽出される。 The sensitivity search execution unit 123 uses the obtained sensitivity vector (0.7, 0.5, 0.2) to search the search target product DB 101 (sensitivity vector search) (second search). As a result, the item description (=item ID) having an affective vector similar to the average value of the affective vector with a degree of similarity equal to or higher than a certain threshold is extracted.

また、キーワード検索実行部１２４が、検索対象商品ＤＢ１０１に対して、語彙「ワイシャツ」をキーワードとするキーワード検索を行なう。 In addition, the keyword search execution unit 124 performs a keyword search on the search target product DB 101 using the vocabulary "shirt" as a keyword.

検索結果統合部１２５は、これらの感性検索実行部１２３による検索結果とキーワード検索実行部１２４による検索結果とを統合して、検索結果を出力する。 The search result integration unit 125 integrates the search results from the emotional search execution unit 123 and the search results from the keyword search execution unit 124, and outputs the search results.

図７に示す例においては、ステップＡ４における２回目の検索においては、以下の順で検索結果が出力される。 In the example shown in FIG. 7, in the second search in step A4, search results are output in the following order.

ドレスシャツＸ（ブロード生地のゆったりとしたワイシャツ）
ドレスシャツＹ（着心地の良いブロード生地のワイシャツ）
ドレスシャツＺ（こなれ感のあるワイシャツ。ブロード生地が特徴。）
・・・
ドレスシャツα（明るめのカジュアル感のあるワイシャツ）
ドレスシャツβ（肌触りの良い形状記憶ワイシャツ） Dress shirt X (a loose-fitting broadcloth dress shirt)
Dress shirt Y (comfortable broadcloth dress shirt)
Dress shirt Z (dress shirt with a sense of familiarity. Features broad fabric.)
・・・
Dress shirt α (bright casual dress shirt)
Dress shirt β (a comfortable shape-memory dress shirt)

この２回目の検索結果においては、語彙「ブロードの」について感性ベクトル検索を行なうことで、素材がブロードでないドレスシャツα，βが検索結果に含まれる。すなわち、ユーザが外せない条件である語彙「ブロードの」を感性ベクトル検索することで検索結果にユーザが意図しない検索結果が含まれることとなる。 In the result of the second search, dress shirts α and β whose material is not broad are included in the search result by carrying out the sensitivity vector search for the word “broad”. In other words, the search results that are unintended by the user are included in the search results by performing the sensitivity vector search for the word "broad", which is a condition that the user cannot remove.

ただし、感性ベクトル検索による検索結果はユーザの意図に合わないものであるおそれがあるので、語彙「ブロードの」による検索結果を下位に表示させ、ユーザの意図に合う検索結果を上位に表示させる。 However, since there is a possibility that the search result of the sensitivity vector search does not match the user's intention, the search result of the word "broad" is displayed at the lower level, and the search result that matches the user's intention is displayed at the higher level.

これにより、検索結果の上位により、ユーザが外せないと考える条件が満たされた検索結果を得ることができるとともに、検索結果の下位において、ユーザが意図しない意外性のある検索結果を得ることもできる。 As a result, it is possible to obtain search results that satisfy the conditions that the user considers indispensable from the top of the search results, and to obtain unexpected search results that the user does not intend from the bottom of the search results. .

ユーザは、検索結果表示画面５０３に表示された検索結果を見て満足できない場合には、インタフェース画面５０１において、例えば、「他には？」のような他の検索結果の表示を求める入力を再度行なってもよい。 If the user is not satisfied with the search results displayed on the search result display screen 503, the user can re-enter an input requesting display of other search results, such as "What else?", on the interface screen 501. You can do it.

これにより、ステップＡ５において、同一のユーザクエリに基づく３回目の検索（再検索）が行なわれる。 As a result, in step A5, a third search (research) based on the same user query is performed.

本例において、同一ユーザクエリに基づく３回目の検索においては、必須条件語彙であると判断された、商品ＤＢ属性項目語彙でない未展開の語彙は残っていない。そこで、検索制御部１２１は、ユーザクエリに含まれる未展開の語彙であって、商品ＤＢ属性項目語彙であると判断された語彙（本実施形態では「ワイシャツ」）を、感性ベクトル検索の対象とする。 In this example, in the third search based on the same user query, there are no unexpanded vocabularies that are determined to be essential condition vocabularies and are not product DB attribute item vocabularies. Therefore, the search control unit 121 selects unexpanded vocabulary included in the user query and determined to be product DB attribute item vocabulary ("shirt" in this embodiment) as a subject of the emotion vector search. do.

感性ベクトル展開部１２２は、語彙「ワイシャツ」，語彙「ブロードの」および語彙「オフでも使える」を感性ベクトル検索の対象にする。 The sensitivity vector development unit 122 searches the vocabulary "shirt", the vocabulary "broad", and the vocabulary "can be used even when off" for sensitivity vector search.

感性ベクトル展開部１２２は、語彙「ワイシャツ」，語彙「ブロードの」および語彙「オフでも使える」を感性ベクトル計算部１１４によって感性ベクトルに展開させる。 The sensitivity vector developing unit 122 causes the sensitivity vector calculation unit 114 to develop the vocabulary “shirt”, the vocabulary “broad”, and the vocabulary “can be used even when off” into sensitivity vectors.

語彙「ワイシャツ」，語彙「ブロードの」および語彙「オフでも使える」の全てを感性ベクトルに展開することで、感性ベクトルとして（トラディショナル，カジュアル，こなれ感）=（0.8, 0.4, 0.2）が得られたものとする。 By expanding all of the vocabulary “shirt”, the vocabulary “broad”, and the vocabulary “can be used off-duty” into the Kansei vector, (traditional, casual, mature) = (0.8, 0.4, 0.2) is obtained as the Kansei vector. shall be assumed.

感性検索実行部１２３は、この得られた感性ベクトル（0.8, 0.4, 0.2）を用いて検索対象商品ＤＢ１０１に対する検索（感性ベクトル検索）を行なう（３回目の検索）。これにより、感性ベクトルの平均値に一定の閾値以上の類似度で類似する感性ベクトルを持った商品説明文（＝商品ＩＤ）が抽出される。 The sensitivity search execution unit 123 uses the obtained sensitivity vector (0.8, 0.4, 0.2) to search the search target product DB 101 (sensory vector search) (third search). As a result, the item description (=item ID) having an affective vector similar to the average value of the affective vector with a degree of similarity equal to or higher than a certain threshold is extracted.

検索結果統合部１２５は、この感性検索実行部１２３による検索結果を検索結果として出力する。 The search result integration unit 125 outputs the search result by the sensitivity search execution unit 123 as a search result.

図７に示す例においては、ステップＡ５における３回目の検索においては、以下の順で検索結果が出力される。 In the example shown in FIG. 7, in the third search in step A5, search results are output in the following order.

ドレスシャツＸ（ブロード生地のゆったりとしたワイシャツ）
ドレスシャツＹ（着心地の良いブロード生地のワイシャツ）
ドレスシャツＺ（こなれ感のあるワイシャツ。ブロード生地が特徴。）
・・・
ドレスシャツα（明るめのカジュアル感のあるワイシャツ）
ドレスシャツβ（肌触りの良い形状記憶ワイシャツ）
・・・
カジュアルシャツγ（ややカジュアルな・・・）
カジュアルシャツε（シルクの・・・） Dress shirt X (a loose-fitting broadcloth dress shirt)
Dress shirt Y (comfortable broadcloth dress shirt)
Dress shirt Z (dress shirt with a sense of familiarity. Features broad fabric.)
・・・
Dress shirt α (bright casual dress shirt)
Dress shirt β (a comfortable shape-memory dress shirt)
・・・
Casual shirt γ (somewhat casual...)
Casual shirt ε (silk...)

この３回目の検索結果においては、ユーザクエリに含まれる全ての語彙「ワイシャツ」，語彙「ブロードの」および語彙「オフでも使える」について感性ベクトル検索を行なうことで、ワイシャツでないでないカジュアルシャツγ，εが検索結果に含まれる。すなわち、ユーザが外せない条件である語彙「ワイシャツ」をも感性ベクトル検索することで検索結果にユーザが意図しない検索結果が含まれることとなる。 In this third search result, by performing an affective vector search on all the words "shirt", "broad shirt", and "can be used even when you are off" included in the user query, casual shirts γ, ε that are not white shirts is included in the search results. In other words, the search results that are not intended by the user will be included in the search results by performing the sensitivity vector search even for the word "shirt", which is a condition that the user cannot remove.

ただし、語彙「ワイシャツ」および語彙「ブロードの」を感性ベクトル検索による検索結果はユーザの意図に合わないおそれがあるので、下位に表示させ、ユーザの意図に合う検索結果を上位に表示させる。 However, since there is a possibility that the search result of the sensitivity vector search for the vocabulary "shirt" and the vocabulary "broad" may not match the user's intention, they are displayed at the lower level, and the search result that matches the user's intention is displayed at the higher level.

（Ｃ）動作
上述の如く構成された実施形態の一例としての情報処理システム１における処理を、図８に示すフローチャート（ステップＳ１～Ｓ１７）に従って説明する。 (C) Operation Processing in the information processing system 1 as an example of the embodiment configured as described above will be described according to the flowchart (steps S1 to S17) shown in FIG.

ステップＳ１において、商品ＤＢ属性項目語彙抽出部１１１が、検索対象商品ＤＢ１０１から、カテゴリ属性名と項目名として利用されている語彙の一覧を抽出する。 In step S<b>1 , the product DB attribute item vocabulary extraction unit 111 extracts a list of vocabulary used as category attribute names and item names from the search target product DB 101 .

ステップＳ２において、商品ＤＢ属性項目語彙抽出部１１１は、ステップＳ１において抽出した語彙をリスト化することで、ＤＢ属性項目語彙ＤＢ１０２を作成する。 In step S2, the product DB attribute item vocabulary extraction unit 111 creates a DB attribute item vocabulary DB 102 by listing the vocabulary extracted in step S1.

ステップＳ３において、説明文語彙抽出部１１２が、検索対象商品ＤＢ１０１に格納されている商品の各説明文に対して、例えば形態素解析処理を行なうことで、説明文中に利用されている語彙を抽出する。 In step S3, the description vocabulary extraction unit 112 extracts the vocabulary used in the description by performing, for example, morphological analysis processing on each description of the product stored in the search target product DB 101. .

ステップＳ４において、説明文語彙頻度計算部１１３は、説明文語彙抽出部１１２によって抽出された各語彙の出現頻度を計算する。出現頻度の計算後、説明文語彙頻度計算部１１３は、最大頻度で各頻度を除算することで正規化を行なう。 In step S<b>4 , the explanatory sentence vocabulary frequency calculation unit 113 calculates the appearance frequency of each vocabulary extracted by the explanatory sentence vocabulary extraction unit 112 . After calculating the frequency of occurrence, the explanatory sentence vocabulary frequency calculator 113 normalizes by dividing each frequency by the maximum frequency.

ステップＳ５において、説明文語彙頻度計算部１１３は、ステップＳ４において正規化し頻度情報を説明文語彙頻度情報ＤＢ１０４に登録することで説明文語彙頻度情報ＤＢ１０４を作成する。 In step S5, the explanatory sentence vocabulary frequency calculation unit 113 creates the explanatory sentence vocabulary frequency information DB 104 by registering the normalized frequency information in the explanatory sentence vocabulary frequency information DB 104 in step S4.

ステップＳ６において、感性ベクトル計算部１１４が、検索対象商品ＤＢ１０１の各商品説明文に対して、感性ベクトル定義辞書１０３を参照しながら、各商品ＩＤの商品の説明文に対応する感性ベクトルの値を計算する。 In step S6, the sensitivity vector calculation unit 114 refers to the sensitivity vector definition dictionary 103 for each product description in the search target product DB 101, and calculates the value of the sensitivity vector corresponding to the product description of each product ID. calculate.

感性ベクトル計算部１１４は、例えば、感性ベクトル定義辞書１０３に登録された語彙が説明文中に出現する度に、各感性軸（トラディショナル，カジュアル，こなれ感）毎に感性スコアを加算し、最後に各感性スコアの平均値を並べた値を感性ベクトルとして算出する。 For example, each time a vocabulary registered in the Kansei vector definition dictionary 103 appears in a description, the Kansei vector calculation unit 114 adds a Kansei score for each Kansei axis (traditional, casual, familiar). A value obtained by arranging the average values of the sensitivity scores is calculated as a sensitivity vector.

ステップＳ７において、ユーザからのユーザクエリが入力され、ユーザクエリ入力部１１５が、この入力されたユーザクエリを受け付ける。 In step S7, a user query is input from the user, and the user query input unit 115 accepts this input user query.

ステップＳ８において、入力テキスト解析部１１６が、ユーザが入力したユーザクエリを解析する。 In step S8, the input text analysis unit 116 analyzes the user query input by the user.

ステップＳ９において、ＤＢ属性項目語彙特定部１１７が、商品ＤＢ属性項目語彙ＤＢ１０２を参照して、ユーザクエリに含まれる語彙が検索対象商品ＤＢ１０１においてカテゴリ属性名や項目名として用いられているか否かを判断する。すなわち、ＤＢ属性項目語彙特定部１１７は、ユーザクエリに含まれる語彙が商品ＤＢ属性項目語彙であるか否かを判断する。 In step S9, the DB attribute item vocabulary identification unit 117 refers to the product DB attribute item vocabulary DB 102 to determine whether or not the vocabulary included in the user query is used as a category attribute name or item name in the search target product DB 101. to decide. That is, the DB attribute item vocabulary identification unit 117 determines whether or not the vocabulary included in the user query is the product DB attribute item vocabulary.

ステップＳ１０において、語彙頻度設定部１１８は、ユーザクエリに含まれる、商品ＤＢ属性項目語彙に該当しない語彙について、説明文語彙頻度情報ＤＢ１０４を参照して、各語彙の出現頻度（出現頻度情報）を設定する。 In step S10, the vocabulary frequency setting unit 118 refers to the description sentence vocabulary frequency information DB 104 for vocabulary that is included in the user query and does not correspond to the product DB attribute item vocabulary, and sets the appearance frequency (appearance frequency information) of each vocabulary. set.

ステップＳ１１において、感性ベクトル展開語彙特定部１１９は、ユーザクエリから抽出された各語彙の出現頻度を所定の閾値と比較することで、当該語彙が感性ベクトル展開語彙であるか必須条件語彙であるかを特定する。 In step S11, the sensitivity vector-expanded vocabulary identification unit 119 compares the appearance frequency of each vocabulary extracted from the user query with a predetermined threshold value to determine whether the vocabulary is an emotion vector-expanded vocabulary or an essential condition vocabulary. identify.

また、感性ベクトル展開語彙特定部１１９は、感性ベクトルに展開する優先順位が高い語彙として、設定された頻度情報が最も大きい語彙から順にソートし、最後に、商品ＤＢ属性項目語彙に該当する語彙が並ぶように並び替える。 In addition, the sensitivity vector expansion vocabulary identification unit 119 sorts the vocabulary in descending order of the set frequency information as the vocabulary with the highest priority to be expanded to the sensitivity vector, and finally, the vocabulary corresponding to the product DB attribute item vocabulary. Arrange them so that they line up.

ステップＳ１２において、検索制御部１２１は、ユーザクエリに含まれる語彙であって、感性ベクトルに未展開の語彙が残っているかを確認する。 In step S12, the search control unit 121 confirms whether or not there is a vocabulary included in the user query that has not yet been developed in the sensitivity vector.

確認の結果、未展開の語彙が残っている場合には（ステップＳ１２のＹｅｓルート参照）、ステップＳ１３に移行する。 As a result of the confirmation, if unexpanded vocabulary remains (see Yes route of step S12), the process proceeds to step S13.

ステップＳ１３においては、感性ベクトル展開部１２２は、ユーザクエリに含まれる語彙のうち未展開の語彙を感性ベクトル検索の対象に含めて、感性ベクトル計算部１１４に感性ベクトルに展開させる。 In step S13, the sensitivity vector expansion unit 122 includes the unexpanded vocabulary among the vocabulary included in the user query in the sensitivity vector search target, and causes the sensitivity vector calculation unit 114 to expand the sensitivity vector.

ステップＳ１４において、感性検索実行部１２３は、ステップＳ１３において得られた感性ベクトルにより検索対象商品ＤＢ１０１の商品説明文に対する感性ベクトル検索を実行する。 In step S14, the sensitivity search execution unit 123 executes sensitivity vector search for the product description in the search target product DB 101 using the sensitivity vector obtained in step S13.

ステップＳ１５において、キーワード検索実行部１２４が、検索対象商品ＤＢ１０１に対して、必須条件語彙や商品ＤＢ属性項目語彙を用いたキーワード検索（全文検索）を行なう。なお、キーワード検索の対象となる語彙が無い場合には、このステップＳ１５の処理はスキップされる。 In step S15, the keyword search execution unit 124 performs a keyword search (full-text search) on the search target product DB 101 using essential condition vocabulary and product DB attribute item vocabulary. Note that if there is no vocabulary to be searched for by keyword, the process of step S15 is skipped.

ステップＳ１６において、検索結果統合部１２５が、ステップＳ１４において行なった感性ベクトル検索の結果と、ステップＳ１５において行なったキーワード検索の結果とを統合する。その後、ステップＳ１２に戻る。 In step S16, the search result integration unit 125 integrates the result of the sensitivity vector search performed in step S14 and the result of the keyword search performed in step S15. After that, the process returns to step S12.

一方、ステップＳ１２における確認の結果、未展開の語彙が残っていない場合には（ステップＳ１２のＮｏルート参照）、ステップＳ１７に移行する。 On the other hand, as a result of the confirmation in step S12, if there is no unexpanded vocabulary left (see No route in step S12), the process proceeds to step S17.

ステップＳ１７においては、検索結果表示部１２６が、ステップＳ１６において検索結果統合部１２５が統合した検索結果を出力する。 In step S17, the search result display unit 126 outputs the search results integrated by the search result integration unit 125 in step S16.

（Ｄ）効果
このように、実施形態の一例としての情報処理システム１によれば、感性ベクトル展開語彙特定部１１９が、ユーザクエリに含まれる各語彙について、検索対象商品ＤＢ１０１の説明文中の出現頻度に基づき感性ベクトル展開語彙と必須条件語彙との推定を行なう。 (D) Effect As described above, according to the information processing system 1 as an example of the embodiment, the sensitivity vector expansion vocabulary identification unit 119 determines the appearance frequency in the description of the search target product DB 101 for each vocabulary included in the user query. Then, the Kansei vector expansion vocabulary and the essential condition vocabulary are estimated based on the above.

また、ＤＢ属性項目語彙特定部１１７が、ユーザクエリに含まれる語彙において、語彙が検索対象商品ＤＢ１０１においてカテゴリ属性名や項目名として用いられている商品ＤＢ属性項目語彙を特定する。 In addition, the DB attribute item vocabulary identification unit 117 identifies product DB attribute item vocabulary whose vocabulary is used as category attribute names and item names in the search target product DB 101 in the vocabulary included in the user query.

ハイブリッド検索部１２０において、感性ベクトル展開語彙について感性ベクトル検索を行ない、必須条件語彙および商品ＤＢ属性項目語彙に対してキーワード検索を行ない、検索結果統合部１２５がこれらの検索結果を統合して検索結果として出力する。 In the hybrid search unit 120, a sensitivity vector search is performed on the sensitivity vector expanded vocabulary, and a keyword search is performed on the essential condition vocabulary and the product DB attribute item vocabulary. output as

すなわち、ユーザが意図する条件と想定される必須条件語彙および商品ＤＢ属性項目語彙に対してキーワード検索を行なうことで、検索結果にユーザの検索意図を反映させることができる。また、感性ベクトル展開語彙に対して感性ベクトル検索を行なうことで、直感的な感性検索も行なうことができ、感性ベクトル展開語彙について曖昧性を持たせた検索結果を得ることができる。 That is, by performing a keyword search on essential condition vocabulary and product DB attribute item vocabulary assumed to be conditions intended by the user, the user's search intention can be reflected in the search results. In addition, by performing an affective vector search on the affective vector expanded vocabulary, it is possible to perform an intuitive affective search, and to obtain search results in which the affective vector expanded vocabulary is ambiguous.

感性ベクトル展開語彙特定部１１９が、感性ベクトルに展開する優先順位が高い語彙として、設定された頻度情報が最も大きい語彙から順にソートし、最後に、商品ＤＢ属性項目語彙に該当する語彙が並ぶように並び替える。そして、ハイブリッド検索部１２０において、検索対象商品ＤＢ１０１の説明文中における出現頻度が高い語彙が優先して感性ベクトル検索される。 The kansei vector expansion vocabulary identification unit 119 sorts the words in descending order of the set frequency information as the vocabularies with the highest priority to be expanded to the kansei vector, and finally, the vocabularies corresponding to the product DB attribute item vocabulary are arranged. Sort by Then, in the hybrid search unit 120, vocabularies with a high appearance frequency in the descriptions of the search target product DB 101 are preferentially searched for by sensitivity vectors.

（Ｅ）その他
開示の技術は上述した実施形態に限定されるものではなく、本実施形態の趣旨を逸脱しない範囲で種々変形して実施することができる。本実施形態の各構成および各処理は、必要に応じて取捨選択することができ、あるいは適宜組み合わせてもよい。 (E) Others The technology disclosed herein is not limited to the above-described embodiments, and various modifications can be made without departing from the spirit of the embodiments. Each configuration and each process of the present embodiment can be selected as required, or may be combined as appropriate.

例えば、上述した実施形態においては、検索対象商品ＤＢ１０１，商品ＤＢ属性項目語彙ＤＢ１０２，感性ベクトル定義辞書１０３および説明文語彙頻度情報ＤＢ１０４が管理サーバ１０に備えられた例を示したが（図４参照）、これに限定されるものではない。これらの検索対象商品ＤＢ１０１，商品ＤＢ属性項目語彙ＤＢ１０２，感性ベクトル定義辞書１０３および説明文語彙頻度情報ＤＢ１０４のうち、少なくとも一部を他のサーバコンピュータに備えてもよく種々変形して実施することができる。 For example, in the above-described embodiment, an example in which the search target product DB 101, the product DB attribute item vocabulary DB 102, the emotion vector definition dictionary 103, and the description sentence vocabulary frequency information DB 104 are provided in the management server 10 was shown (see FIG. 4). ), but not limited to: At least a part of the search target product DB 101, product DB attribute item vocabulary DB 102, emotion vector definition dictionary 103, and explanation sentence vocabulary frequency information DB 104 may be provided in another server computer, and various modifications may be made. can.

また、上述した実施形態においては、検索システムとして衣類を商品として取り扱う商品検索システムである例について示したが、これに限定されるものではなく、種々変形して実施することができる。 Also, in the above-described embodiment, an example of a product search system that handles clothes as products has been shown as a search system, but the present invention is not limited to this, and various modifications can be made.

さらに、上述した実施形態においては、商品の感性ベクトルとして、「トラディショナル」，「カジュアル」および「こなれ感」をそれぞれ感性軸とする３次元である例について示したが、これに限定されるものではなく、種々変形して実施することができる。例えば、次元数は２次元以下もしくは４次元以上であってもよく、また、「トラディショナル」，「カジュアル」および「こなれ感」以外の感性軸を備えてもよい。 Furthermore, in the above-described embodiment, an example of a three-dimensional product sensitivity vector having "traditional", "casual", and "familiar" as sensitivity axes is shown, but it is not limited to this. Instead, it can be implemented with various modifications. For example, the number of dimensions may be two or less or four or more, and sensitivity axes other than "traditional," "casual," and "familiar" may be provided.

また、上述した開示により本実施形態を当業者によって実施・製造することが可能である。 Moreover, the present embodiment can be implemented and manufactured by those skilled in the art based on the above disclosure.

（Ｆ）付記
以上の実施形態に関し、さらに以下の付記を開示する。 (F) Supplementary Note The following Supplementary Note will be disclosed with respect to the above embodiment.

（付記１）
クエリに含まれる複数の語において、曖昧性を含む表現として感性情報に展開可能な複数の第１語を推定し、
前記複数の第１語に優先順位を設定し、
前記優先順位に応じて前記複数の第１語の中から選択した第１語について検索対象情報に対する感性検索を行なう
処理を、コンピュータに実行させる、感性検索プログラム。 (Appendix 1)
estimating a plurality of first words that can be developed into emotional information as expressions containing ambiguity in the plurality of words included in the query;
Prioritizing the plurality of first words;
A feeling search program for causing a computer to execute a process of performing a feeling search on search target information for a first word selected from the plurality of first words according to the priority.

（付記２）
前記クエリに含まれる複数の語において、前記第１語以外の第２語を推定し、
前記第２語について前記検索対象情報に対するキーワード検索を行ない
前記感性検索の結果と前記キーワード検索の結果とを統合して出力する
処理を、前記コンピュータに実行させる、付記１記載の感性検索プログラム。 (Appendix 2)
estimating a second word other than the first word in a plurality of words included in the query;
The Kansei search program according to Supplementary Note 1, causing the computer to execute a process of performing a keyword search on the search target information for the second word, and integrating and outputting the result of the Kansei search and the result of the keyword search.

（付記３）
前記検索対象情報中における出現頻度に基づいて前記第１語の推定を行なう処理を、前記コンピュータに実行させる、付記１または２記載の感性検索プログラム。 (Appendix 3)
3. The sensibility search program according to appendix 1 or 2, causing the computer to execute a process of estimating the first word based on the appearance frequency in the search target information.

（付記４）
クエリを受け付ける入力部と、
前記クエリに含まれる複数の語において、曖昧性を含む表現として感性情報に展開可能な複数の第１語を推定する第１推定部と、
前記複数の第１語に優先順位を設定する設定部と、
前記優先順位に応じて前記複数の第１語の中から選択した第１語について検索対象情報に対する感性検索を行なう感性検索部と
を備える、情報処理装置。 (Appendix 4)
an input unit that accepts queries;
a first estimation unit for estimating a plurality of first words that can be developed into sensitivity information as expressions containing ambiguity in the plurality of words included in the query;
a setting unit that sets a priority order for the plurality of first words;
An information processing apparatus, comprising: a sensitivity search unit that performs a sensitivity search on search target information for a first word selected from the plurality of first words according to the priority order.

（付記５）
前記クエリに含まれる複数の語において、前記第１語以外の第２語を推定する第２推定部と、
前記第２語について前記検索対象情報に対するキーワード検索を行なうキーワード検索部と、
前記感性検索の結果と前記キーワード検索の結果とを統合して出力する出力処理部と
を備える、付記４記載の情報処理装置。 (Appendix 5)
a second estimation unit for estimating a second word other than the first word in a plurality of words included in the query;
a keyword search unit that performs a keyword search on the search target information for the second word;
5. The information processing apparatus according to appendix 4, further comprising an output processing unit that integrates and outputs the result of the sensitivity search and the result of the keyword search.

（付記６）
前記第１推定部が、
前記検索対象情報中における出現頻度に基づいて前記第１語の推定を行なう、付記４または５記載の情報処理装置。 (Appendix 6)
The first estimation unit
6. The information processing apparatus according to appendix 4 or 5, wherein the first word is estimated based on the appearance frequency in the search target information.

（付記７）
クエリに含まれる複数の語において、曖昧性を含む表現として感性情報に展開可能な複数の第１語を推定し、
前記複数の第１語に優先順位を設定する処理と、
前記優先順位に応じて前記複数の第１語の中から選択した第１語について検索対象情報に対する感性検索を行なう処理と
を備える、感性検索方法。 (Appendix 7)
estimating a plurality of first words that can be developed into emotional information as expressions containing ambiguity in the plurality of words included in the query;
a process of prioritizing the plurality of first words;
and performing a sensitivity search on search target information for a first word selected from the plurality of first words according to the priority.

（付記８）
前記クエリに含まれる複数の語において、前記第１語以外の第２語を推定する処理と、
前記第２語について前記検索対象情報に対するキーワード検索を行なう処理と、
前記感性検索の結果と前記キーワード検索の結果とを統合して出力する処理と
を備える、付記７記載の感性検索方法。 (Appendix 8)
A process of estimating a second word other than the first word in a plurality of words included in the query;
a process of performing a keyword search for the search target information for the second word;
7. The kansei search method according to appendix 7, further comprising a process of integrating and outputting the result of the kansei search and the result of the keyword search.

（付記９）
前記検索対象情報中における出現頻度に基づいて前記第１語の推定を行なう処理
を備える、付記７または８記載の感性検索方法。 (Appendix 9)
The sensibility search method according to appendix 7 or 8, comprising a process of estimating the first word based on the appearance frequency in the search target information.

１情報処理システム
１０管理サーバ
２０－１，２０－２，２０クライアント端末
３０ネットワーク
１１ＣＰＵ
１２メモリ
１３記憶装置
１４ネットワークインタフェース
１０１検索対象商品ＤＢ
１０２商品ＤＢ属性項目語彙
１０３感性ベクトル定義辞書
１０４説明文語彙頻度情報ＤＢ
１１１商品ＤＢ属性項目語彙抽出部
１１２説明文語彙抽出部
１１３説明文語彙頻度計算部
１１４感性ベクトル計算部
１１５ユーザクエリ入力部
１１６入力テキスト解析部
１１７ＤＢ属性項目語彙特定部
１１８語彙頻度設定部
１１９感性ベクトル展開語彙特定部
１２０ハイブリッド検索部
１２１検索制御部
１２２感性ベクトル展開部
１２３感性検索実行部
１２４キーワード検索実行部
１２５検索結果統合部
１２６検索結果表示部
５０１インタフェース画面
５０２ユーザクエリ入力ボックス
５０３検索結果表示画面 1 Information Processing System 10 Management Server 20-1, 20-2, 20 Client Terminal 30 Network 11 CPU
12 memory 13 storage device 14 network interface 101 search target product DB
102 Product DB attribute item vocabulary 103 Kansei vector definition dictionary 104 Description sentence vocabulary frequency information DB
111 Product DB attribute item vocabulary extraction unit 112 Description vocabulary extraction unit 113 Description sentence vocabulary frequency calculation unit 114 Kansei vector calculation unit 115 User query input unit 116 Input text analysis unit 117 DB attribute item vocabulary identification unit 118 Vocabulary frequency setting unit 119 Kansei Vector expansion vocabulary identification unit 120 Hybrid search unit 121 Search control unit 122 Kansei vector expansion unit 123 Kansei search execution unit 124 Keyword search execution unit 125 Search result integration unit 126 Search result display unit 501 Interface screen 502 User query input box 503 Search result display screen

Claims

Vocabularies belonging to an attribute item vocabulary set, which are pre-extracted vocabularies used as attribute items of a database to be searched in a plurality of words included in a query, are presumed to be vocabularies specifying essential conditions that a user cannot remove, and With respect to words belonging to the attribute item vocabulary set other than the attribute item vocabulary set, those whose frequency of appearance exceeds a predetermined threshold are estimated as a plurality of first words that can be developed into sensibility information as expressions containing ambiguity; estimating words below the predetermined threshold as vocabulary specifying the essential condition ;
Prioritizing the plurality of first words;
A feeling search program for causing a computer to execute a process of performing a feeling search on search target information for a first word selected from the plurality of first words according to the priority.

performing a keyword search on the search target information for the vocabulary specifying the essential condition ;
2. The Kansei search program according to claim 1, causing said computer to execute a process of integrating and outputting the result of the Kansei search and the result of the keyword search.

an input unit that accepts queries;
in the plurality of words included in the query, vocabularies belonging to an attribute item vocabulary set obtained by extracting in advance vocabularies used as attribute items of a database to be searched are presumed to be vocabularies specifying essential conditions that a user cannot remove; Among the words belonging to other than the attribute item vocabulary set, those whose appearance frequency exceeds a predetermined threshold are estimated as a plurality of first words that can be developed into sensitivity information as expressions containing ambiguity, while the appearance frequency is equal to or less than the predetermined threshold, an estimation unit for estimating vocabulary specifying the essential condition ;
a setting unit that sets a priority order for the plurality of first words;
An information processing apparatus, comprising: a sensitivity search unit that performs a sensitivity search on search target information for a first word selected from the plurality of first words according to the priority order.

the computer
Vocabularies belonging to an attribute item vocabulary set, which are pre-extracted vocabularies used as attribute items of a database to be searched in a plurality of words included in a query, are presumed to be vocabularies specifying essential conditions that a user cannot remove, and With respect to words belonging to the attribute item vocabulary set other than the attribute item vocabulary set, those whose frequency of appearance exceeds a predetermined threshold are estimated as a plurality of first words that can be developed into sensibility information as expressions containing ambiguity; A process of estimating words below the predetermined threshold as vocabulary specifying the essential condition ;
a process of prioritizing the plurality of first words;
and performing a sensitivity search on search target information for a first word selected from the plurality of first words according to the priority.