WO2010008047A1 - Integrated evaluation device, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium - Google Patents

Integrated evaluation device, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium Download PDF

Info

Publication number
WO2010008047A1
WO2010008047A1 PCT/JP2009/062892 JP2009062892W WO2010008047A1 WO 2010008047 A1 WO2010008047 A1 WO 2010008047A1 JP 2009062892 W JP2009062892 W JP 2009062892W WO 2010008047 A1 WO2010008047 A1 WO 2010008047A1
Authority
WO
WIPO (PCT)
Prior art keywords
evaluation
integrated
search
similarity
weight
Prior art date
Application number
PCT/JP2009/062892
Other languages
French (fr)
Japanese (ja)
Inventor
孝 白木
威 有熊
展久 白石
洋一 永井
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Publication of WO2010008047A1 publication Critical patent/WO2010008047A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Definitions

  • the present invention relates to an integrated evaluation apparatus, an integrated evaluation system, an integrated evaluation method, an integrated evaluation program, and a recording medium, and in particular, an integrated evaluation apparatus, an integrated evaluation system, an integrated evaluation method, and an integrated evaluation program that take into account the similarity between the evaluation apparatuses. And a recording medium.
  • a search method for extracting data satisfying a search request from data stored in a search target DB is known.
  • search for multimedia data such as images, videos, music, etc.
  • text search are performed.
  • a search may be performed by combining not only one feature quantity but also a plurality of feature quantities. Taking an image search as an example, this corresponds to the case of extracting an image that satisfies the search request for both color and pattern (see Patent Document 1).
  • Non-Patent Document 1 an integrated search called meta search and blend search that combine output results of a plurality of evaluation systems. Since the range of the evaluation target of each evaluation system may be different, the coverage of the evaluation target can be expanded by an integrated search combining a plurality. In addition, when relying on a certain search system, the evaluation system may censor arbitrarily lower the evaluation of an evaluation target, but it can be prevented by an integrated search.
  • Non-Patent Document 2 describes a point addition method for calculating importance based on a user's operation history. By using this method, points can be added to the search system that has influenced the item selected by the user. Furthermore, it is considered that the result of collecting and optimizing evaluation values by many users depends on the similarity of the evaluation system. It is also known that the similarity between ranking results output by each evaluation system is calculated using Kendall's rank correlation coefficient (Non-Patent Document 3) or the like.
  • the following cases can be considered when the point addition method for calculating the importance based on the user operation history is used.
  • the integration is optimized for this user In the search, it can be considered that the importance of the evaluation system A is raised and the importance of B and C is lowered (case 1).
  • the first problem was that there was no way to adjust the distribution in consideration of the algorithm relationship between the evaluation systems.
  • the problem is particularly serious when there are a large number of evaluation systems that can be used so much as spam.
  • the second problem is that there is no way to adjust the distribution of multiple evaluation systems in consideration of the similarity among the relationships of the evaluation systems. There was a tendency to raise and lower, and it had a sensitive effect on the overall result, and the integrated system could fall into a situation where it could not demonstrate the benefits of ensuring diversity. The situation where this problem occurs is almost the same as the first problem. For example, an integrated system generally recruits evaluation systems, and Company A uses Evaluation System A and Company B uses Evaluation System B. When Company C applies for evaluation system C, Company A is unfair by applying for A ', A' ', A' '' with the same content as A in order to increase their influence. There is a possibility of spam rating system to try to increase its influence on.
  • the third problem is that the distribution of a plurality of evaluation systems may be adjusted by reflecting the history of selecting user information, but accurate adjustment is difficult without the amount of user history, and the user When collecting a large number of histories, there is a scalability problem, and it was not possible to optimize effectively at the stage where the user's history is not acquired or at the stage where the history is small.
  • the fourth problem is that the evaluation system spam is effective in the same way as the second problem because the similarity of the evaluation system is not considered even when the user's history called relevance feedback is dealt with immediately. Is in an environment where
  • the fifth problem is that even when the user's history called relevance feedback is dealt with immediately, the similarity of the search device is not taken into consideration, so that the integrated evaluation system is the same as the third problem. Convergence to optimize according to was slow.
  • the present invention has been made in view of such a situation, and when the results of a plurality of evaluation devices are integrated to obtain a result, the similarity between the evaluation devices is taken into consideration and the importance is distributed at the time of integration.
  • the purpose of this method is to obtain an integrated evaluation result in which the influence of the evaluation device performing a more original evaluation is deepened by suppressing the weight of the result by the evaluation device having a high degree of similarity.
  • An integrated evaluation apparatus includes: means for connecting to a plurality of evaluation apparatuses that perform evaluation according to evaluation conditions; means for requesting the evaluation apparatus for evaluation under evaluation conditions input from a user terminal; and An integrated evaluation device comprising an integration means for combining and integrating the acquired evaluation results, considering the similarity between the evaluation device similarity calculation means for calculating the similarity between the plurality of evaluation devices, Weight setting means for setting the weight of each evaluation apparatus by pressing the weight of the evaluation apparatus having a high degree of similarity, and the integration means integrates the evaluation results in consideration of the set weight.
  • a first integrated evaluation system includes the integrated evaluation apparatus according to the present invention, a plurality of evaluation apparatuses, and a user terminal.
  • a second integrated evaluation system is the integrated evaluation apparatus according to the present invention, and further acquires user behavior information for the integrated evaluation result created by integrating the evaluation results by the integrating means.
  • a user action history storage means for storing the acquired user action information, wherein the integration means stores the user action history storage means for the user who has made an evaluation request under the evaluation condition. If it is stored in, further considering the stored user behavior information history, comprising an integrated evaluation device characterized by integrating the evaluation results, a plurality of evaluation devices, and a user terminal,
  • the user terminal includes user behavior input means for sending information of user behavior performed on the integrated evaluation result acquired from the integrated evaluation device to the integrated evaluation device. Characterized in that it obtain.
  • the integrated evaluation method includes a step of requesting a plurality of evaluation devices for evaluation under an evaluation condition input from a user terminal, and a step of obtaining an evaluation result corresponding to the request from the plurality of evaluation devices; The step of calculating the similarity between the plurality of evaluation devices, the step of setting the weight of each evaluation device by pressing the weight of the evaluation device having a high degree of similarity in consideration of the calculated similarity, and the set And integrating the evaluation results in consideration of the weights.
  • the integrated evaluation program includes a process for requesting a plurality of evaluation apparatuses to perform an evaluation under an evaluation condition input from a user terminal, and a process for obtaining an evaluation result corresponding to the request from the plurality of evaluation apparatuses.
  • a process for calculating the similarity between the plurality of evaluation apparatuses a process for setting the weight of each evaluation apparatus by pressing the weight of the evaluation apparatus having a high degree of similarity in consideration of the calculated similarity.
  • the recording medium according to the present invention is a computer-readable recording medium that records the processing of the integrated evaluation program of the present invention.
  • the similarity between the evaluation devices is taken into consideration, and the result of the evaluation device having a high similarity is used for the importance distribution at the time of integration.
  • FIG. 1 is a schematic configuration diagram of an integrated search system according to an embodiment of the present invention. It is a flowchart which shows the operation
  • the plurality of evaluation devices may be evaluation devices that perform evaluation with a broad meaning such as importance (score) and ordering of information according to evaluation conditions, such as a search engine and an inference engine.
  • the integrated evaluation device is a device that integrates outputs from the plurality of evaluation devices.
  • the system is not limited to an integrated device such as an evaluation device or an integrated evaluation device, and may be a system (evaluation system, integrated evaluation system) composed of a plurality of devices having these functions as a whole.
  • a plurality of search engines that output search results according to search conditions are connected, and an integrated search device that integrates such search results will be described as an example of an integrated evaluation device. It is not limited to.
  • the integrated search system includes at least a terminal, an integrated search device, and a search engine.
  • the integrated search device considers the similarity of each search engine when integrating search results from each search engine. By setting the weight of each search engine, it is possible to reduce the importance of search engines that have many similarities (search engines with high similarity) and to integrate the influence of search engines that perform more unique searches. The search result can be obtained.
  • the calculation of the relative importance between the search engines can also be used in the compatibility feedback system reflecting the result of the information selection of the user who receives the service, and the search engine is selected by the user in the past. Even if the importance of is increased, it can be considered that an appropriate integrated search result is obtained by suppressing the increase in importance of a search engine having many similar ones.
  • FIG. 1 is a schematic configuration diagram of an integrated search system according to the present embodiment.
  • the integrated search system includes a terminal 1, an integrated search device 2, and a plurality of search engines A to D (reference numerals 3 to 6).
  • the terminal 1 and the integrated search device 2, and the integrated search device 2 and each search engine are connected via a network as an example, but the present invention is not limited to this, What is necessary is just to be able to transmit and receive various information regardless of wireless.
  • search engines A to D four search engines are used (search engines A to D), but this is an example, and two or more different search engines may be used.
  • “different” means that the search method and the search target DB are different, for example, each is calculated by a different algorithm.
  • the terminal 1 includes a search request input unit 11 and a search result output unit 12. Although this embodiment has been described using one terminal, the present invention is not limited to this.
  • the search request input unit 11 included in the terminal 1 transmits the input search conditions to the search request reception unit 21 included in the integrated search device 2 via the network 9.
  • the search conditions input to the search request unit 11 include search information, profile information such as the user's gender and age, presence information such as the location information of the terminal 1 possessed by the user, and preferences representing user preferences. Information may be used.
  • the position information is used as a search condition, for example, the current position is acquired from GPS (Global Positioning Systems) of the terminal 1, and a search for the position is requested.
  • the preference information includes various information such as information acquired by the terminal 1 from past user operations, such as search requests made in the past, Internet browsing history, and other user preference information registered in advance, and is not particularly limited. Also, a combination of search conditions such as preference information and search keywords may be used.
  • each search engine is data similar to “Shinjuku”, and the search result further considers the preference information. Is output. For example, if the preference information is information such as “I like exercise”, “I like sweet things” or “I do n’t like classics”, it is information about sports facilities and sweetness information that satisfy the search condition of the keyword “Shinjuku”. Outputs event information etc.
  • the search result output unit 12 provided in the terminal 1 receives the search results in the integrated search device 2 from the search result integration unit 27 provided in the integrated search device 2, integrates these, and outputs them.
  • the integrated search device 2 is a device that integrates the output results of the search engine A (3), the search engine B (4), the search engine C (5), and the search engine D (6) and outputs them to the terminal 1.
  • the integrated search device 2 includes a search request reception unit 21, a search request unit 22, a search result reception unit 23, a search engine similarity calculation unit 24, a user behavior acquisition unit 25, a weight calculation unit 26, a search engine similarity DB 261, and a search result.
  • the integration unit 27 is configured.
  • FIG. 2 is a flowchart showing an operation process of the federated search apparatus according to the present embodiment.
  • the search request receiving unit 21 transmits the search conditions received from the search request input unit 11 included in the terminal 1 to the search request unit 22 (steps S1 and S2).
  • the search request unit 22 transmits the search request contents (search conditions) received from the search request receiving unit 21 to the search engines A to D to be connected via the network 10 (step S3).
  • the search result reception unit 23 transmits the search results received from the search engines A to D to the search engine similarity calculation unit 24 and the search result integration unit 27 (step S4). Details of the search results in the search engines A to D will be described later.
  • the search engine similarity calculation unit 24 calculates the similarity between search engines based on the search results of the search engines A to D. Details of similarity calculation between search engines will be described later.
  • the calculation result (similarity data between search engines) in the search engine similarity calculation unit 24 is input to the search engine similarity DB 261.
  • the search engine similarity DB 261 stores similarity data between search engines received from the search engine similarity calculation unit 24.
  • the search engine similarity calculation unit 24 inputs the “similarity data between search engines” to the search engine similarity DB 261
  • the search engine similarity calculation unit 24 confirms that there is update data in the search engine similarity DB 261 with respect to the weight calculation unit 26. Notification is made (step S5).
  • the weight calculation unit 26 notified of the presence of the update data inquires the search engine similarity DB 261 and acquires the similarity data between the search engines (step S6).
  • the weight calculation unit 26 calculates the weight of each search engine in consideration of the acquired similarity data between the search engines, and transmits the weight to the search result integration unit 27 (step S7). Details of weight calculation of each search engine will be described later.
  • the search result integration unit 27 integrates the search result of each search engine acquired from the search result reception unit 23 and the weight of each search engine acquired from the weight calculation unit 26, and calculates an integrated search result (step S8). . Details of the integrated search result calculation will be described later.
  • the calculated integrated search result is transmitted to the search result output unit 12 provided in the terminal 1 (step S9).
  • the search result output unit 12 of the terminal 1 outputs the integrated search result sent from the integrated search device 2 and provides it to the user.
  • search condition will be described assuming that, for example, a search keyword “mobile phone” is input.
  • the search engines A to D search for information to be searched for the search request by the search keyword “mobile phone” received from the search request unit 22 according to the respective algorithms, and attach the scores according to the scores or the scores.
  • the received order is transmitted to the search result receiving unit 23.
  • the same five filtered information (information a to e) is output in different orders as the result of the search engine A to D outputting information similar to the search keyword “mobile phone” (search result).
  • An example of each search result ordered by the search engines A to D is shown in FIG.
  • the ordering shown in FIG. 3 is information that is highly evaluated by each search engine as the value is smaller.
  • the search engine similarity calculation unit 24 calculates the similarity for each pair (A and B, A and C, A and D, etc.) of all search engines.
  • Various methods for calculating the degree of similarity can be considered.
  • the degree of correlation is represented by a real number between 0 and 1 using Kendall's rank correlation coefficient (see Non-Patent Document 2). 0 is uncorrelated (reverse), 1 is exactly the same.
  • FIG. 4 shows the results (degree of correlation) for the permutations of the search results of each search engine shown in FIG.
  • the search result of the search engine A is evaluated as “information a> b> c> d> e”
  • the search result of the search engine B is “information a> b> c”. > E> d ”.
  • the similarity between the search engines A and B is “0.9” (see FIG. 4).
  • the similarity data calculated by the search engine similarity calculation unit 24 is input to the search engine similarity DB 261.
  • the similarity calculation method is not limited to Kendall's rank correlation coefficient, and an inner product of vectors of each permutation may be used. Moreover, the real number from 0 to 1 is an example, and the present invention is not limited to this.
  • the vector w shown in Equation 2 is the weight of each search engine. That is, in the vector s, the weight is uniform and tied (for example, all search engines A to D are 0.25), but the weights corrected in consideration of the similarity of each search engine (search engine A; 0.243, B; 0.229, C; 0.243, D; 0.285).
  • the weight calculation unit 26 transmits the calculated weight data of each search engine to the search result integration unit 27.
  • FIG. 5 shows a matrix M ′ obtained by converting the permutation of information a to e in each search result shown in FIG. 3 into points.
  • the search result integration unit 27 calculates M′w and obtains the integration of search results considering the weight of each search engine.
  • information b, information a, information c, information e, and information d can be ordered in descending order.
  • the search result integration unit 27 transmits the integrated search result to the search result output unit 12.
  • the search result output unit 12 outputs the integrated search result sent from the integrated search device 2 as shown in FIG. 6, for example, and presents it to the user.
  • the importance of the search engines having many similarities is increased by considering the weights of the respective search engines.
  • FIG. 7 shows a schematic configuration of the integrated search system according to the present embodiment.
  • the integrated search device 2 includes a search history DB 262.
  • the search history DB 262 stores search results from each search engine for each search condition requested in the past.
  • the search result history may be stored in various units such as for each user context, for each user, for each user group, or all of them.
  • the search history DB may be stored for a certain period, or a certain amount or more may be transferred to another storage device for storage.
  • the search engine similarity calculation unit 24 refers to each search result in at least one or more search conditions executed in the past, in addition to the search results for the currently executed search conditions. Then, the similarity is calculated.
  • the search history filtered by the preference information and context of the user who uses the terminal 1 that has made the search request is extracted, and the similarity of the search engines is calculated.
  • the weight of the search engine is set in consideration of the similarity when matched to the user or the context. If the user preference information is acquired when a search request is received from the terminal 1, the latest preference information can be referred to.
  • the search engine similarity DB 261 may store a history of similarity data input in the past and refer to it when calculating weights.
  • the history of similarity data may be stored in various units such as for each user context, for each user, for each user group, or all of them. Further, in consideration of the storage capacity of the search engine similarity DB 261, the search engine similarity DB 261 may be stored for a certain period, or a certain amount or more may be stored in another storage device.
  • the search history filtered by the preference information or context of the user who uses the terminal 1 that has requested the search when extracting the history of similarity data may be extracted. This makes it possible to obtain an integrated search result in which the influence of a search engine that performs a unique search is deepened.
  • This embodiment is an integrated search device that uses relevance feedback, that is, a user operation (such as clicking on a search result list) performed on a search result output to the search result output unit 12 of the terminal 1 or purchasing activity.
  • a user operation such as clicking on a search result list
  • the weight setting that considers the similarity between the search engines is further integrated to easily increase the influence of the search engine that performs the original search. A search result is obtained.
  • the feedback to the weight setting of the user behavior information performed on the search result is obtained by, for example, each search engine that acquires and stores the click behavior when the user clicks on the search result displayed on the terminal 1
  • Relevance feedback is performed by reflecting it in the weight of. This makes the weight setting closer to the user's preference, but in order to obtain integrated search results that tend to deepen the influence of the search engine that performs the original search, suppress how to increase the importance (weight). This is also necessary.
  • FIG. 8 shows a schematic configuration of the integrated search system according to the present embodiment.
  • the present embodiment is particularly characterized in that the terminal 1 includes a user behavior input unit 13, and the integrated search device 2 includes a user behavior acquisition unit 25 and a user behavior history DB 263.
  • the user behavior input unit 13 included in the terminal 1 is configured to retrieve user behavior information when the user's operation performed on the search result output unit 12 or a user behavior such as a user purchase activity is performed. Is transmitted to the user action acquisition unit 25. Note that the user behavior information is transmitted from the terminal 1 in addition to the case where the user behavior is performed. The user behavior information is temporarily stored in the terminal 1, and the integrated search device 2 is used every predetermined period or at the time of a search request. May be communicated to.
  • the user behavior acquisition unit 25 included in the integrated search device 2 inputs the user behavior information received from the user behavior input unit 13 to the user behavior history DB 263. At the time of input, the fact that there is update data is transmitted to the weight calculation unit 26.
  • the storage in the user behavior history DB 263 may be provided with an identification number for each terminal, and user behavior information may be managed for each same identification number.
  • the user behavior history DB 262 provided in the integrated search device 2 stores the user behavior information received from the user behavior acquisition unit 25.
  • the operation process of this embodiment is as follows.
  • the general flow from acceptance of a search request to calculation of an integrated search result is the same as the processing shown in FIG. 2, but this embodiment is particularly characterized by the processing of the weight calculation unit 26 in steps S6 and S7 of FIG.
  • the weight calculation unit 26 acquires user behavior information from the user behavior history DB when notified by the user behavior acquisition unit 25 that there is update data.
  • the weight calculator 26 calculates the weight of each search engine based on the similarity data between the search engines and the user behavior information.
  • the calculation of the weight will be further described in detail. This embodiment will be described using the values in FIGS. 3 to 6 as an example.
  • the user behavior information is information that the information b ranked at the top in FIG. 6 is clicked
  • feedback is performed to increase the weight of the search engine that has affected the information b.
  • the weight is calculated in consideration of similar data between search engines.
  • Equation 4 The solution obtained by Equation 4 is normalized so that the sum of vector elements becomes 1, as shown in Equation 5 below, and set as the weight of each search engine.
  • FIG. 9 shows the calculation result (integrated search result) of the total score reflecting the weight shown in Equation 5 above. Comparing FIG. 6 with the new FIG. 9, in search engines A, B, and D where the score of information b is the same at 0.8, there are relatively few D similar search engines (see FIG. 4). Information e has moved to the top because it gained a large weight.
  • the program for the CPU to execute the processing shown in the flowchart of FIG. 2 and the processing in each embodiment constitutes a program according to the present invention.
  • a recording medium for recording the program a semiconductor storage unit, an optical and / or magnetic storage unit, or the like can be used.

Abstract

Provided are an integrated evaluation device, an integrated evaluation system, an integrated evaluation method, an integrated evaluation program, and a recording medium which are capable of obtaining the result of the integrated evaluation in which the influence by an evaluation device for performing more unique evaluation is emphasized by giving less weight to the results by evaluation devices with high similarity when allocating importance in the integration of the results of the evaluation. The integrated evaluation device is provided with a means connected to a plurality of evaluation devices for performing evaluation according to evaluation conditions, a means for requesting evaluation under evaluation conditions inputted from user terminals to the evaluation devices, and an integration means for integrating by combining the results of the evaluation obtained from the evaluation devices, characterized in that the integrated evaluation device comprises an evaluation-device-similarity calculating means for calculating the similarity between the plurality of evaluation devices, and a weight setting means for setting the weight of each of the evaluation devices with less weight given to evaluation devices with high similarity in consideration of the calculated similarity, and the integration means integrates the results of the evaluation in consideration of the set weights.

Description

統合評価装置、統合評価システム、統合評価方法、統合評価プログラム及び記録媒体Integrated evaluation apparatus, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium
 本発明は、統合評価装置、統合評価システム、統合評価方法、統合評価プログラム及び記録媒体に関し、特に各評価装置間の類似度を考慮した統合評価装置、統合評価システム、統合評価方法、統合評価プログラム及び記録媒体に関する。 The present invention relates to an integrated evaluation apparatus, an integrated evaluation system, an integrated evaluation method, an integrated evaluation program, and a recording medium, and in particular, an integrated evaluation apparatus, an integrated evaluation system, an integrated evaluation method, and an integrated evaluation program that take into account the similarity between the evaluation apparatuses. And a recording medium.
 検索対象DB(Date Base)に蓄積したデータ内から検索要求を満たすデータを抽出する検索方法が知られている。このような検索方法では、画像、映像、音楽等のマルチメディアデータに対する検索や、テキストの検索が行われている。また、検索では一つの特徴量だけではなく、複数の特徴量を組み合わせて検索を行うこともある。画像の検索を例にとると、色、模様等が共に検索要求を満たす画像の抽出を行うといった場合がそれに当たる(特許文献1参照)。 A search method for extracting data satisfying a search request from data stored in a search target DB (Date Base) is known. In such a search method, search for multimedia data such as images, videos, music, etc., and text search are performed. In the search, a search may be performed by combining not only one feature quantity but also a plurality of feature quantities. Taking an image search as an example, this corresponds to the case of extracting an image that satisfies the search request for both color and pattern (see Patent Document 1).
 また、検索の対象が例えば写真のみではなく、写真や図面、テキスト文書等が組み合わされた文書等、多様な種類の文書であっても、有用な検索結果を得るため、写真画像検索とテキスト検索との各スコアを統合し、得られた総合スコアは上位となる画像を候補として抽出する画像検索システムが提案されている(特許文献2参照)。また、事例ベースから最も要求を満たす事例を検索する場合において、問題と候補間の関係を計算する検索装置が提案されている(特許文献3参照)。 In addition, even if the search target is not only a photo, but also a variety of types of documents such as a combination of photos, drawings, text documents, etc., in order to obtain useful search results, photo image search and text search An image search system has been proposed in which the scores obtained from the above are integrated, and the obtained overall score is extracted as a candidate image (see Patent Document 2). Also, a search device that calculates a relationship between a problem and a candidate when searching for a case that satisfies the most demand from the case base has been proposed (see Patent Document 3).
 また、複数の評価システムの出力結果を組み合わせるメタサーチ、ブレンドサーチと呼ばれる統合サーチが知られている(非特許文献1)。各評価システムの評価対象の範囲が異なることもあるため、複数を組み合わせる統合サーチによって評価対象のcoverageを広げることが可能である。またあるひとつの検索システムに頼った場合には、その評価システムがある評価対象の評価を恣意的に下げる検閲をすることもあるが、統合サーチによって防ぐことができる。 Also, an integrated search called meta search and blend search that combine output results of a plurality of evaluation systems is known (Non-Patent Document 1). Since the range of the evaluation target of each evaluation system may be different, the coverage of the evaluation target can be expanded by an integrated search combining a plurality. In addition, when relying on a certain search system, the evaluation system may censor arbitrarily lower the evaluation of an evaluation target, but it can be prevented by an integrated search.
 統合サーチでは、各評価システムの及ぼす影響度をその精度により区別すること、個人や状況に合わせて設定することが考えられる。そのために重要度の算出方法が必要となる。非特許文献2では、ユーザの操作履歴により重要度を算出する加点法が記述されている。この方法を用いれば、ユーザが選択したアイテムに影響を与えた検索システムに加点することができる。さらに、数多くのユーザによる評価値を収集して最適化した結果が、評価システムの類似度に依存していることが考察されている。また、各評価システムが出力したランク付け結果間の類似度について、ケンドールの順位相関係数(非特許文献3)などを用いて算出することも知られている。 In integrated search, it is possible to distinguish the degree of influence of each evaluation system according to its accuracy and to set it according to the individual and the situation. Therefore, a calculation method of importance is required. Non-Patent Document 2 describes a point addition method for calculating importance based on a user's operation history. By using this method, points can be added to the search system that has influenced the item selected by the user. Furthermore, it is considered that the result of collecting and optimizing evaluation values by many users depends on the similarity of the evaluation system. It is also known that the similarity between ranking results output by each evaluation system is calculated using Kendall's rank correlation coefficient (Non-Patent Document 3) or the like.
特開2001-134584号公報JP 2001-134484 A 特開2007-172077号公報JP 2007-172077 A 特許第3311778号公報Japanese Patent No. 3311778
 上述した統合サーチにおいて、ユーザの操作履歴により重要度を算出する加点法を利用すると、以下のようなケースが考え得る。例えば、A、B、Cと3つのそれぞれ異なる評価システムが存在する時に、あるユーザの行動履歴が評価システムAを好み、B、Cを好まないと判断された時に、このユーザに最適化する統合サーチでは、評価システムAの重要度を上げ、B、Cの重要度を下げる修正がなされることが考えられる(ケース1)。しかしAとほぼ同じ評価システムA'、A''、A'''が存在し、A、A'、A''、A'''、B、Cと6つの異なる評価システムを統合するときに、あるユーザの行動履歴が評価システムAを好み、B、Cを好まないと判断された時には、AだけではなくAと同等であるA'、A''、A'''の重要度も上げ、B、Cの重要度を下げることになる(ケース2)。ケース1とケース2において、評価システムA、B、Cは個々のシステムの内容が変わらないにも関わらず、重要度の修正内容がA'、A''、A'''の有無により相対的に変化することは避けられない。この状況において次のような問題がある。 In the integrated search described above, the following cases can be considered when the point addition method for calculating the importance based on the user operation history is used. For example, when there are three different evaluation systems A, B, and C, when a user's action history likes evaluation system A, and it is determined that B and C are not preferred, the integration is optimized for this user In the search, it can be considered that the importance of the evaluation system A is raised and the importance of B and C is lowered (case 1). However, there are almost the same evaluation systems A ', A' ', A' '' as A, and when integrating six different evaluation systems with A, A ', A' ', A' '', B, C. When it is determined that a user's action history likes evaluation system A and does not like B or C, not only A but also A ', A' ', and A' '', which are equivalent to A, are increased in importance. , B and C will be reduced (Case 2). In cases 1 and 2, evaluation systems A, B, and C are relative to each other depending on the presence / absence of A ′, A ″, and A ′ ″, although the contents of individual systems remain unchanged. It is inevitable to change. There are the following problems in this situation.
 第1の問題点は、評価システム間のアルゴリズムの関係性を考慮して配分を調整する方法がなかった。特にスパムといわれるほど利用できる複数の評価システムが大量に存在する場合に問題が大きくなる。 The first problem was that there was no way to adjust the distribution in consideration of the algorithm relationship between the evaluation systems. The problem is particularly serious when there are a large number of evaluation systems that can be used so much as spam.
 第2の問題点は、評価システムの関係性の中でも類似度を考慮して複数の評価システムの配分を調整する方法がなかったため、似た評価システムの数が多い評価システムが、同時に重要度を上げ下げする傾向にあり、総合結果に対して過敏に影響を与え、統合システムが多様性を確保するメリットを発揮できない状況に陥ることがあった。本問題点が起こす状況は第1の問題点とほぼ同じであり、例えば、ある統合システムが、一般に評価システムを公募して、それらをA社が評価システムAを、B社が評価システムBを、C社が評価システムCを、応募した時に、A社が自分の影響力を高めるためにAと同じ内容のA’、A’’、A’’’をも、応募することにより、不公正に自らの影響力を高めようとする評価システムスパムを行う可能性がある。 The second problem is that there is no way to adjust the distribution of multiple evaluation systems in consideration of the similarity among the relationships of the evaluation systems. There was a tendency to raise and lower, and it had a sensitive effect on the overall result, and the integrated system could fall into a situation where it could not demonstrate the benefits of ensuring diversity. The situation where this problem occurs is almost the same as the first problem. For example, an integrated system generally recruits evaluation systems, and Company A uses Evaluation System A and Company B uses Evaluation System B. When Company C applies for evaluation system C, Company A is unfair by applying for A ', A' ', A' '' with the same content as A in order to increase their influence. There is a possibility of spam rating system to try to increase its influence on.
 第3の問題点は、ユーザの情報を選択した履歴を反映させて複数の評価システムの配分を調整することはあるが、ユーザの履歴の量がないと正確な調整が困難であり、またユーザの履歴を大量に集める場合にはスケーラビリティの問題があり、ユーザの履歴を取得しない段階や、履歴の量が少ない段階は効果的に最適化を行うことが出来なかった。 The third problem is that the distribution of a plurality of evaluation systems may be adjusted by reflecting the history of selecting user information, but accurate adjustment is difficult without the amount of user history, and the user When collecting a large number of histories, there is a scalability problem, and it was not possible to optimize effectively at the stage where the user's history is not acquired or at the stage where the history is small.
 第4の問題点は、適合性フィードバックと呼ばれるユーザの履歴に即時に対応する場合においても、評価システムの類似度を考慮することはなかったため、第2の問題点と同様に評価システムスパムが効果を発揮する環境にある。 The fourth problem is that the evaluation system spam is effective in the same way as the second problem because the similarity of the evaluation system is not considered even when the user's history called relevance feedback is dealt with immediately. Is in an environment where
 第5の課題は、適合性フィードバックと呼ばれるユーザの履歴に即時に対応する場合においても、検索装置の類似度を考慮することはなかったため、第3の問題点と同様に統合した評価システムがユーザに合わせて最適化する収束性が遅かった。 The fifth problem is that even when the user's history called relevance feedback is dealt with immediately, the similarity of the search device is not taken into consideration, so that the integrated evaluation system is the same as the third problem. Convergence to optimize according to was slow.
 本発明はこのような状況に鑑みてなされたものであり、複数の評価装置の結果を統合して結果を出す場合に、各評価装置間の類似度を考慮し、統合時の重要度の配分について、類似度の高い評価装置による結果の重みを押えることで、より独自の評価を行う評価装置の影響を濃くした統合評価結果を得ることを目的とする。 The present invention has been made in view of such a situation, and when the results of a plurality of evaluation devices are integrated to obtain a result, the similarity between the evaluation devices is taken into consideration and the importance is distributed at the time of integration. The purpose of this method is to obtain an integrated evaluation result in which the influence of the evaluation device performing a more original evaluation is deepened by suppressing the weight of the result by the evaluation device having a high degree of similarity.
 本発明に係る統合評価装置は、評価条件に従って評価を行う複数の評価装置と接続する手段と、ユーザ端末から入力された評価条件での評価を前記評価装置に要求する手段と、前記評価装置から取得した評価結果を組み合わせて統合する統合手段とを備える統合評価装置であって、前記複数の評価装置間の類似度を算出する評価装置類似度算出手段と、前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定するウェイト設定手段と、を備え、前記統合手段は、前記設定されたウェイトを考慮して前記評価結果を統合することを特徴とする。 An integrated evaluation apparatus according to the present invention includes: means for connecting to a plurality of evaluation apparatuses that perform evaluation according to evaluation conditions; means for requesting the evaluation apparatus for evaluation under evaluation conditions input from a user terminal; and An integrated evaluation device comprising an integration means for combining and integrating the acquired evaluation results, considering the similarity between the evaluation device similarity calculation means for calculating the similarity between the plurality of evaluation devices, Weight setting means for setting the weight of each evaluation apparatus by pressing the weight of the evaluation apparatus having a high degree of similarity, and the integration means integrates the evaluation results in consideration of the set weight. And
 本発明に係る第1の統合評価システムは、上記本発明に係る統合評価装置と、複数の評価装置と、ユーザ端末と、を備えることを特徴とする。 A first integrated evaluation system according to the present invention includes the integrated evaluation apparatus according to the present invention, a plurality of evaluation apparatuses, and a user terminal.
 本発明に係る第2の統合評価システムは、上記本発明に係る統合評価装置であってさらに前記統合手段により評価結果を統合して作成された統合評価結果に対するユーザ行動情報を取得するユーザ行動取得手段と、前記取得したユーザ行動情報を保管するユーザ行動履歴保管手段と、を備え、前記統合手段は、前記評価条件での評価要求を行ったユーザのユーザ行動情報履歴が前記ユーザ行動履歴保管手段に保管されている場合は前記保管されているユーザ行動情報履歴をさらに考慮し、前記評価結果を統合することを特徴とする統合評価装置と、複数の評価装置と、ユーザ端末と、を備え、前記ユーザ端末は、前記統合評価装置から取得した統合評価結果に対して行ったユーザ行動の情報を前記統合評価装置へ送るユーザ行動入力手段を備えることを特徴とする。 A second integrated evaluation system according to the present invention is the integrated evaluation apparatus according to the present invention, and further acquires user behavior information for the integrated evaluation result created by integrating the evaluation results by the integrating means. And a user action history storage means for storing the acquired user action information, wherein the integration means stores the user action history storage means for the user who has made an evaluation request under the evaluation condition. If it is stored in, further considering the stored user behavior information history, comprising an integrated evaluation device characterized by integrating the evaluation results, a plurality of evaluation devices, and a user terminal, The user terminal includes user behavior input means for sending information of user behavior performed on the integrated evaluation result acquired from the integrated evaluation device to the integrated evaluation device. Characterized in that it obtain.
 本発明に係る統合評価方法は、ユーザ端末から入力された評価条件での評価を、複数の評価装置に要求するステップと、前記複数の評価装置から前記要求に対応した評価結果を取得するステップと、前記複数の評価装置間の類似度を算出するステップと、前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定するステップと、前記設定されたウェイトを考慮して前記評価結果を統合するステップと、を備えることを特徴とする。 The integrated evaluation method according to the present invention includes a step of requesting a plurality of evaluation devices for evaluation under an evaluation condition input from a user terminal, and a step of obtaining an evaluation result corresponding to the request from the plurality of evaluation devices; The step of calculating the similarity between the plurality of evaluation devices, the step of setting the weight of each evaluation device by pressing the weight of the evaluation device having a high degree of similarity in consideration of the calculated similarity, and the set And integrating the evaluation results in consideration of the weights.
 本発明に係る統合評価プログラムは、ユーザ端末から入力された評価条件での評価を、複数の評価装置に要求する処理と、前記複数の評価装置から前記要求に対応した評価結果を取得する処理と、前記複数の評価装置間の類似度を算出する処理と、前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定する処理と、前記設定されたウェイトを考慮して前記評価結果を統合する処理と、を備えることを特徴とする。 The integrated evaluation program according to the present invention includes a process for requesting a plurality of evaluation apparatuses to perform an evaluation under an evaluation condition input from a user terminal, and a process for obtaining an evaluation result corresponding to the request from the plurality of evaluation apparatuses. A process for calculating the similarity between the plurality of evaluation apparatuses, a process for setting the weight of each evaluation apparatus by pressing the weight of the evaluation apparatus having a high degree of similarity in consideration of the calculated similarity. And a process of integrating the evaluation results in consideration of the weights.
 本発明に係る記録媒体は、上記本発明の統合評価プログラムの処理を記録するコンピュータ読取り可能な記録媒体である。 The recording medium according to the present invention is a computer-readable recording medium that records the processing of the integrated evaluation program of the present invention.
 本発明によれば、複数の評価装置の結果を統合して結果を出す場合に、各評価装置間の類似度を考慮し、統合時の重要度の配分について、類似度の高い評価装置による結果の重みを押えることで、より独自の評価を行う評価装置の影響を濃くした統合評価結果を得ることが出来る。 According to the present invention, when the results of a plurality of evaluation devices are integrated to obtain a result, the similarity between the evaluation devices is taken into consideration, and the result of the evaluation device having a high similarity is used for the importance distribution at the time of integration. By suppressing the weight of, it is possible to obtain an integrated evaluation result in which the influence of the evaluation device that performs more independent evaluation is deepened.
本発明の実施形態に係る統合検索システムの概略構成図である。1 is a schematic configuration diagram of an integrated search system according to an embodiment of the present invention. 本発明の実施形態に係る統合検索装置の動作処理を示すフローチャートである。It is a flowchart which shows the operation | movement process of the integrated search apparatus which concerns on embodiment of this invention. 本発明の実施形態に係る各検索エンジンが出力した情報の順序付けを示す図である。It is a figure which shows the ordering of the information which each search engine which concerns on embodiment of this invention output. 図3の順序付けから算出された、各検索エンジン間の類似度を示す図である。It is a figure which shows the similarity between each search engine calculated from the ordering of FIG. 図3の順序付けをスコアに変換した図である。It is the figure which converted the ordering of FIG. 3 into the score. 本発明の実施形態に係る統合検索結果の出力を示す図である。It is a figure which shows the output of the integrated search result which concerns on embodiment of this invention. 本発明の他の実施形態に係る統合検索システムの概略構成図である。It is a schematic block diagram of the integrated search system which concerns on other embodiment of this invention. 本発明の他の実施形態に係る統合検索システムの概略構成図である。It is a schematic block diagram of the integrated search system which concerns on other embodiment of this invention. 本発明の他の実施形態に係る統合検索結果の出力を示す図である。It is a figure which shows the output of the integrated search result which concerns on other embodiment of this invention.
 以下に、本発明の実施形態について図面を用いて詳細に説明する。なお、以下に述べる実施形態は、本発明の好適な実施形態であるから、技術的に好ましい種々の限定が付されているが、本発明の範囲は、以下の説明において特に本発明を限定する旨の記載がない限り、これらの態様に限られるものではない。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings. The embodiments described below are preferred embodiments of the present invention, and thus various technically preferable limitations are given. However, the scope of the present invention is particularly limited in the following description. As long as there is no description of the effect, it is not restricted to these aspects.
 本明細書において、複数の評価装置とは、検索エンジン、推論エンジン等、評価条件に従って情報に対する重要度(スコア)や、その順序付け等の広い意での評価を行う評価装置であれば良い。統合評価装置とは、これら複数の評価装置からの出力を統合する装置である。なお、評価装置、統合評価装置といった一体の装置に限られず、これらの機能を全体として有する複数の装置から成るシステム(評価システム、統合評価システム)でも良い。なお、以下の説明においては、検索条件に従って検索結果を出力する検索エンジンが複数接続され、かかる検索結果を統合する統合検索装置を、統合評価装置の一例として用いて説明するが、本発明はこれに限られるものではない。 In this specification, the plurality of evaluation devices may be evaluation devices that perform evaluation with a broad meaning such as importance (score) and ordering of information according to evaluation conditions, such as a search engine and an inference engine. The integrated evaluation device is a device that integrates outputs from the plurality of evaluation devices. The system is not limited to an integrated device such as an evaluation device or an integrated evaluation device, and may be a system (evaluation system, integrated evaluation system) composed of a plurality of devices having these functions as a whole. In the following description, a plurality of search engines that output search results according to search conditions are connected, and an integrated search device that integrates such search results will be described as an example of an integrated evaluation device. It is not limited to.
 本実施形態に係る統合検索システムは、少なくとも端末、統合検索装置、検索エンジンから構成され、統合検索装置は、各検索エンジンからの検索結果を統合する際に、各検索エンジンの類似度を考慮して各検索エンジンのウェイトを設定することにより、類似するものが多い検索エンジン(類似性の高い検索エンジン)の重要度の上げ方を抑え、より独自の検索を行う検索エンジンの影響を濃くした統合検索結果を得ることが出来ることを特徴とする。 The integrated search system according to the present embodiment includes at least a terminal, an integrated search device, and a search engine. The integrated search device considers the similarity of each search engine when integrating search results from each search engine. By setting the weight of each search engine, it is possible to reduce the importance of search engines that have many similarities (search engines with high similarity) and to integrate the influence of search engines that perform more unique searches. The search result can be obtained.
 また、各検索エンジン間の相対的重要度の計算は、サービスを受けるユーザの情報選択の結果を反映する適合性フィードバックシステムにおいても用いることが可能であり、ユーザが過去に選択することにより検索エンジンの重要度が上がっている場合でも、類似するものが多い検索エンジンの重要度の上げ方を抑えて適切な統合検索結果を得ることも考え得る。 The calculation of the relative importance between the search engines can also be used in the compatibility feedback system reflecting the result of the information selection of the user who receives the service, and the search engine is selected by the user in the past. Even if the importance of is increased, it can be considered that an appropriate integrated search result is obtained by suppressing the increase in importance of a search engine having many similar ones.
 以下、さらに本発明の実施形態に係る統合検索システムについて詳述する。 Hereinafter, the integrated search system according to the embodiment of the present invention will be described in detail.
[第1の実施形態]
 図1は、本実施形態に係る統合検索システムの概略構成図である。統合検索システムは、端末1と、統合検索装置2と、複数の検索エンジンA~D(符号3~6)とから構成される。なお本実施形態では、端末1と統合検索装置2、統合検索装置2と各検索エンジンとは、一例としてネットワークを介して接続されているが、本発明はこれに限られることはなく、有線・無線を問わず各種情報の伝達、送受信が出来れば良い。
[First Embodiment]
FIG. 1 is a schematic configuration diagram of an integrated search system according to the present embodiment. The integrated search system includes a terminal 1, an integrated search device 2, and a plurality of search engines A to D (reference numerals 3 to 6). In the present embodiment, the terminal 1 and the integrated search device 2, and the integrated search device 2 and each search engine are connected via a network as an example, but the present invention is not limited to this, What is necessary is just to be able to transmit and receive various information regardless of wireless.
 複数の検索エンジンは、本実施形態では4つ用いているが(検索エンジンA~D)、これは一例であり、異なる2以上の検索エンジンであれば良い。ここで、異なるとは、それぞれが別のアルゴリズムにより計算する等、検索方法や検索対象とするDBが異なることを言う。 In the present embodiment, four search engines are used (search engines A to D), but this is an example, and two or more different search engines may be used. Here, “different” means that the search method and the search target DB are different, for example, each is calculated by a different algorithm.
 端末1は、検索要求入力部11と、検索結果出力部12と、から構成される。本実施形態では端末を1つ用いて説明しているが、これに限られることはない。 The terminal 1 includes a search request input unit 11 and a search result output unit 12. Although this embodiment has been described using one terminal, the present invention is not limited to this.
 端末1が備える検索要求入力部11は、入力された検索条件を、ネットワーク9を介し、統合検索装置2が備える検索要求受付部21へ送信する。 The search request input unit 11 included in the terminal 1 transmits the input search conditions to the search request reception unit 21 included in the integrated search device 2 via the network 9.
 ここで検索要求部11に入力する検索条件とは、検索キーワードの他、ユーザの性別・年齢などのプロファイル情報や、ユーザが所持する端末1の位置情報等のプレゼンス情報、ユーザ嗜好を表すプリファレンス情報などでも良い。位置情報を検索条件とする場合は、例えば端末1が有するGPS(Global Positioning Systems)から現在位置を取得し、位置に関する検索を要求する。プリファレンス情報とは、端末1が過去のユーザ操作から取得した情報、例えば過去に行った検索要求や、インターネット閲覧履歴、その他予め登録したユーザの嗜好情報等、様々考えられ、特に限定しない。また、このようなプリファレンス情報と検索キーワード等、組み合わせた検索条件でも良い。例えば検索キーワード「新宿」を要求する際に、プリファレンス情報も検索要求受付部21へ送信した場合、各検索エンジンは「新宿」に類似するデータであって、さらにプリファレンス情報を考慮した検索結果を出力する。プリファレンス情報が例えば「運動が好き」「甘いものが好き」「クラシックが嫌い」というような情報である場合、キーワード「新宿」という検索条件を満たす運動施設や甘み処情報であり、クラシックコンサートを除くイベント情報等を出力する。 Here, the search conditions input to the search request unit 11 include search information, profile information such as the user's gender and age, presence information such as the location information of the terminal 1 possessed by the user, and preferences representing user preferences. Information may be used. When the position information is used as a search condition, for example, the current position is acquired from GPS (Global Positioning Systems) of the terminal 1, and a search for the position is requested. The preference information includes various information such as information acquired by the terminal 1 from past user operations, such as search requests made in the past, Internet browsing history, and other user preference information registered in advance, and is not particularly limited. Also, a combination of search conditions such as preference information and search keywords may be used. For example, when requesting the search keyword “Shinjuku”, if the preference information is also transmitted to the search request receiving unit 21, each search engine is data similar to “Shinjuku”, and the search result further considers the preference information. Is output. For example, if the preference information is information such as “I like exercise”, “I like sweet things” or “I do n’t like classics”, it is information about sports facilities and sweetness information that satisfy the search condition of the keyword “Shinjuku”. Outputs event information etc.
 端末1が備える検索結果出力部12は、統合検索装置2が備える検索結果統合部27から統合検索装置2における検索結果を受信し、これらを統合し、出力する。 The search result output unit 12 provided in the terminal 1 receives the search results in the integrated search device 2 from the search result integration unit 27 provided in the integrated search device 2, integrates these, and outputs them.
 統合検索装置2は、検索エンジンA(3)、検索エンジンB(4)、検索エンジンC(5)、検索エンジンD(6)の出力結果を統合して、端末1へ出力する装置である。統合検索装置2は、検索要求受付部21、検索要求部22、検索結果受付部23、検索エンジン類似度算出部24、ユーザ行動取得部25、ウェイト算出部26、検索エンジン類似度DB261、検索結果統合部27、から構成される。 The integrated search device 2 is a device that integrates the output results of the search engine A (3), the search engine B (4), the search engine C (5), and the search engine D (6) and outputs them to the terminal 1. The integrated search device 2 includes a search request reception unit 21, a search request unit 22, a search result reception unit 23, a search engine similarity calculation unit 24, a user behavior acquisition unit 25, a weight calculation unit 26, a search engine similarity DB 261, and a search result. The integration unit 27 is configured.
 次に、上記構成を用いて、本実施形態に係る統合検索装置の動作処理について図2を用いて説明する。図2は、本実施形態に係る統合検索装置の動作処理を示すフローチャートである。 Next, the operation processing of the integrated search device according to the present embodiment will be described with reference to FIG. FIG. 2 is a flowchart showing an operation process of the federated search apparatus according to the present embodiment.
 検索要求受付部21は、端末1が備える検索要求入力部11から受信した検索条件を、検索要求部22へ伝達する(ステップS1、S2)。 The search request receiving unit 21 transmits the search conditions received from the search request input unit 11 included in the terminal 1 to the search request unit 22 (steps S1 and S2).
 検索要求部22は、検索要求受付部21から受け取った検索要求内容(検索条件)を、接続する検索エンジンA~Dにネットワーク10を介して送信する(ステップS3)。 The search request unit 22 transmits the search request contents (search conditions) received from the search request receiving unit 21 to the search engines A to D to be connected via the network 10 (step S3).
 検索結果受付部23は、各検索エンジンA~Dから受け取った検索結果を、検索エンジン類似度算出部24及び検索結果統合部27へ伝達する(ステップS4)。検索エンジンA~Dにおける検索結果の詳細は後述する。 The search result reception unit 23 transmits the search results received from the search engines A to D to the search engine similarity calculation unit 24 and the search result integration unit 27 (step S4). Details of the search results in the search engines A to D will be described later.
 検索エンジン類似度算出部24は、各検索エンジンA~Dの検索結果に基づいて検索エンジン間の類似度を算出する。検索エンジン間の類似度算出の詳細は後述する。 The search engine similarity calculation unit 24 calculates the similarity between search engines based on the search results of the search engines A to D. Details of similarity calculation between search engines will be described later.
 検索エンジン類似度算出部24での算出結果(検索エンジン間の類似度データ)を検索エンジン類似度DB261へ入力する。検索エンジン類似度DB261は、検索エンジン類似度算出部24から受け取った検索エンジン間の類似度データを保管する。検索エンジン類似度算出部24は、上記「検索エンジン間の類似度データ」を検索エンジン類似度DB261へ入力した際に、ウェイト算出部26に対し、検索エンジン類似度DB261に更新データがあることを通知する(ステップS5)。 The calculation result (similarity data between search engines) in the search engine similarity calculation unit 24 is input to the search engine similarity DB 261. The search engine similarity DB 261 stores similarity data between search engines received from the search engine similarity calculation unit 24. When the search engine similarity calculation unit 24 inputs the “similarity data between search engines” to the search engine similarity DB 261, the search engine similarity calculation unit 24 confirms that there is update data in the search engine similarity DB 261 with respect to the weight calculation unit 26. Notification is made (step S5).
 更新データがあることを通知されたウェイト算出部26は、検索エンジン類似度DB261に問い合せて各検索エンジン間の類似度データを取得する(ステップS6)。ウェイト算出部26は、取得した各検索エンジン間の類似度データを考慮して各検索エンジンのウェイトを算出し、検索結果統合部27へ伝達する(ステップS7)。各検索エンジンのウェイト算出の詳細は後述する。 The weight calculation unit 26 notified of the presence of the update data inquires the search engine similarity DB 261 and acquires the similarity data between the search engines (step S6). The weight calculation unit 26 calculates the weight of each search engine in consideration of the acquired similarity data between the search engines, and transmits the weight to the search result integration unit 27 (step S7). Details of weight calculation of each search engine will be described later.
 検索結果統合部27は、検索結果受付部23から取得した各検索エンジンの検索結果と、ウェイト算出部26から取得した各検索エンジンのウェイトとを統合し、統合検索結果を算出する(ステップS8)。統合検索結果算出の詳細は後述する。 The search result integration unit 27 integrates the search result of each search engine acquired from the search result reception unit 23 and the weight of each search engine acquired from the weight calculation unit 26, and calculates an integrated search result (step S8). . Details of the integrated search result calculation will be described later.
 算出した統合検索結果を、端末1が備える検索結果出力部12へ送信する(ステップS9)。端末1の検索結果出力部12は、統合検索装置2から送られた統合検索結果を出力し、ユーザに提供する。 The calculated integrated search result is transmitted to the search result output unit 12 provided in the terminal 1 (step S9). The search result output unit 12 of the terminal 1 outputs the integrated search result sent from the integrated search device 2 and provides it to the user.
 次に、各処理について詳述する。なお本実施形態では、検索条件について、例えば検索キーワード「携帯電話」が入力されたとして説明する。 Next, each process will be described in detail. In the present embodiment, the search condition will be described assuming that, for example, a search keyword “mobile phone” is input.
[検索エンジンA~Dにおける検索結果(図2、ステップS3、4)]
 検索エンジンA~Dは、検索要求部22から受信した検索キーワード「携帯電話」による検索要求について、それぞれが持つアルゴリズムによって、検索対象の情報を検索し、それらのスコア、もしくはそのスコアに応じて付けられた順序を、検索結果受付部23へ伝達する。本実施形態では、検索エンジンA~Dが検索キーワード「携帯電話」に類似する情報を出力した結果(検索結果)として、フィルタリングされた同じ情報5つ(情報a~e)を異なる順序付けで出力したとする。検索エンジンA~Dが順序付けした各検索結果の一例を図3に示す。図3に示す順序付けは、値が小さいほど各検索エンジンが高く評価した情報とする。
[Search Results in Search Engines A to D (FIG. 2, Steps S3, 4)]
The search engines A to D search for information to be searched for the search request by the search keyword “mobile phone” received from the search request unit 22 according to the respective algorithms, and attach the scores according to the scores or the scores. The received order is transmitted to the search result receiving unit 23. In the present embodiment, the same five filtered information (information a to e) is output in different orders as the result of the search engine A to D outputting information similar to the search keyword “mobile phone” (search result). And An example of each search result ordered by the search engines A to D is shown in FIG. The ordering shown in FIG. 3 is information that is highly evaluated by each search engine as the value is smaller.
[各検索エンジン間の類似度算出(図2、ステップS5)]
 検索エンジン類似度算出部24は、すべての検索エンジンの各ペア(AとB、AとC、AとD等)に対してその類似度を算出する。類似度の算出手法については様々考え得るが、ここでは例えばケンドールの順位相関係数(非特許文献2参照)を用いて0から1の間の実数で、相関の程度を表す。0が相関なし(逆側)、1が全く一致する時である。
[Similarity calculation between search engines (FIG. 2, step S5)]
The search engine similarity calculation unit 24 calculates the similarity for each pair (A and B, A and C, A and D, etc.) of all search engines. Various methods for calculating the degree of similarity can be considered. Here, for example, the degree of correlation is represented by a real number between 0 and 1 using Kendall's rank correlation coefficient (see Non-Patent Document 2). 0 is uncorrelated (reverse), 1 is exactly the same.
 図3に示した各検索エンジンの検索結果の順列に対する結果(相関の程度)を図4に示す。例えば図3に示すように検索エンジンAの検索結果は「情報a>b>c>d>e」と評価しているのに対し、検索エンジンBの検索結果は、「情報a>b>c>e>d」と評価している。この場合ケンドールの順位相関係数を用いると、検索エンジンAとBの類似度は「0.9」となる(図4参照)。検索エンジン類似度算出部24で算出した類似度データは、検索エンジン類似度DB261へ入力される。 FIG. 4 shows the results (degree of correlation) for the permutations of the search results of each search engine shown in FIG. For example, as shown in FIG. 3, the search result of the search engine A is evaluated as “information a> b> c> d> e”, whereas the search result of the search engine B is “information a> b> c”. > E> d ”. In this case, using Kendall's rank correlation coefficient, the similarity between the search engines A and B is “0.9” (see FIG. 4). The similarity data calculated by the search engine similarity calculation unit 24 is input to the search engine similarity DB 261.
 なお、類似度の算出方法は、ケンドールの順位相関係数に限定することはなく、それぞれの順列のベクトルの内積などを用いても良い。また0から1までの実数で表すことは一例であり、これに限られることはない。 The similarity calculation method is not limited to Kendall's rank correlation coefficient, and an inner product of vectors of each permutation may be used. Moreover, the real number from 0 to 1 is an example, and the present invention is not limited to this.
[ウェイトの算出(図2、ステップS7)]
 ウェイト算出部26は、ベクトルsをs=(1/4、 1/4、 1/4、 1/4)と設定し、s-cMsを求める。Mは図4で示す各検索エンジン間の類似度のマトリクスであり、cは定数である。ここでc=0.2とすれば、下記数1のように算出できる。
[Calculation of Weight (FIG. 2, Step S7)]
The weight calculation unit 26 sets the vector s as s = (1/4, 1/4, 1/4, 1/4) T, and obtains s-cMs. M is a matrix of similarity between the search engines shown in FIG. 4, and c is a constant. If c = 0.2, it can be calculated as in the following formula 1.
Figure JPOXMLDOC01-appb-M000001
Figure JPOXMLDOC01-appb-M000001
 上記数1で算出した要素の合計が1になるように正規化すると、ベクトルwは下記数2となる。 When normalization is performed so that the sum of the elements calculated in the above equation 1 becomes 1, the vector w becomes the following equation 2.
Figure JPOXMLDOC01-appb-M000002
Figure JPOXMLDOC01-appb-M000002
 上記数2に示すベクトルwが各検索エンジンのウェイトとなる。つまりベクトルsではウェイト均一でタイであったものが(例えば検索エンジンA~D全て0.25)、各検索エンジンの類似度を考慮して補正したウェイト(検索エンジンA;0.243、B;0.229、C;0.243、D;0.285)にできる。ウェイト算出部26は、算出した各検索エンジンのウェイトデータを検索結果統合部27へ伝達する。 The vector w shown in Equation 2 is the weight of each search engine. That is, in the vector s, the weight is uniform and tied (for example, all search engines A to D are 0.25), but the weights corrected in consideration of the similarity of each search engine (search engine A; 0.243, B; 0.229, C; 0.243, D; 0.285). The weight calculation unit 26 transmits the calculated weight data of each search engine to the search result integration unit 27.
[統合検索結果の算出(図2、ステップS8)]
 図3に示す各検索結果における情報a~eの順列をポイントに変換したマトリクスM’を図5に示す。検索結果統合部27は、M’wを計算し、各検索エンジンのウェイトを考慮した検索結果の統合を得る。
[Calculation of Integrated Search Result (FIG. 2, Step S8)]
FIG. 5 shows a matrix M ′ obtained by converting the permutation of information a to e in each search result shown in FIG. 3 into points. The search result integration unit 27 calculates M′w and obtains the integration of search results considering the weight of each search engine.
Figure JPOXMLDOC01-appb-M000003
Figure JPOXMLDOC01-appb-M000003
 上記数3により、値が大きい順に、情報b、情報a、情報c、情報e、情報dと順序付けができる。検索結果統合部27は、かかる統合検索結果を検索結果出力部12へ伝達する。 According to the above formula 3, information b, information a, information c, information e, and information d can be ordered in descending order. The search result integration unit 27 transmits the integrated search result to the search result output unit 12.
 検索結果出力部12は、統合検索装置2から送られた統合検索結果を、例えば図6に示すように出力し、ユーザに提示する。 The search result output unit 12 outputs the integrated search result sent from the integrated search device 2 as shown in FIG. 6, for example, and presents it to the user.
 上記本実施形態により、各検索エンジンからの検索結果を統合する際に、各検索エンジンのウェイトを考慮することにより、類似するものが多い検索エンジン(類似性の高い検索エンジン)の重要度の上げ方を抑え(ウェイトを小さくし)、より独自の検索を行う検索エンジンの影響を濃くした統合検索結果を得ることが出来る。また、これにより検索エンジンの影響力を高めるだけのためのスパム的な登録に対して悪影響を減らす効果がある。 By integrating the search results from the respective search engines according to the present embodiment, the importance of the search engines having many similarities (highly similar search engines) is increased by considering the weights of the respective search engines. You can obtain integrated search results that reduce the influence of the search engine (reduce the weight) and intensify the influence of search engines that perform more unique searches. This also has the effect of reducing adverse effects on spam-like registrations that only increase the influence of search engines.
[第2の実施形態]
 本発明の他の実施形態としては、図2ステップS5の各検索エンジン間の類似度算出において、現在実行した検索条件に対する検索結果からだけではなく、過去に実行した少なくとも1以上の検索結果を参照して類似度を算出する。
[Second Embodiment]
As another embodiment of the present invention, in the similarity calculation between the search engines in step S5 in FIG. 2, not only the search result for the currently executed search condition but also at least one or more search results executed in the past are referred to. Then, the similarity is calculated.
 本実施形態に係る統合検索システムの概略構成を図7に示す。図1との相違は、統合検索装置2が検索履歴DB262を備える点である。検索履歴DB262は、過去に要求された検索条件毎の各検索エンジンからの検索結果を保管する。なお、検索結果の履歴は、ユーザコンテキスト毎、ユーザ毎、ユーザグループ毎等、様々な単位で保管しても良いし、全て保管しても良い。また、検索履歴DBの記憶容量を考慮し、一定期間だけ保管したり、一定量以上は他の記憶装置に移して保管したりしても良い。 FIG. 7 shows a schematic configuration of the integrated search system according to the present embodiment. The difference from FIG. 1 is that the integrated search device 2 includes a search history DB 262. The search history DB 262 stores search results from each search engine for each search condition requested in the past. Note that the search result history may be stored in various units such as for each user context, for each user, for each user group, or all of them. In consideration of the storage capacity of the search history DB, the search history DB may be stored for a certain period, or a certain amount or more may be transferred to another storage device for storage.
 検索エンジン類似度算出部24は、各検索エンジン間の類似度を算出する際に、現在実行した検索条件に対する検索結果の他に、過去に実行した少なくとも1以上の検索条件における各検索結果を参照して類似度を算出することを特徴とする。 When calculating the similarity between the search engines, the search engine similarity calculation unit 24 refers to each search result in at least one or more search conditions executed in the past, in addition to the search results for the currently executed search conditions. Then, the similarity is calculated.
 これにより、さらに、類似性の高い検索エンジンが多い検索エンジンの重要度の上げ方を抑え、より独自の検索を行う検索エンジンの影響を濃くした統合検索結果を得ることが出来る。 This can further reduce the importance of search engines with many similar search engines, and obtain integrated search results with a deeper influence from search engines that perform more unique searches.
 また、検索履歴DB262から複数の検索履歴を抽出する場合に、検索要求を行った端末1を利用するユーザのプリファレンス情報やコンテキストによりフィルタリングした検索履歴を抽出し、検索エンジンの類似度を算出することにより、ユーザやコンテキストに合わせた場合の類似性を考慮した検索エンジンのウェイトを設定することも可能である。なお、ユーザのプリファレンス情報は、端末1から検索要求を受け付けた際に取得すれば、最新のプリファレンス情報を参照することが出来る。 When a plurality of search histories are extracted from the search history DB 262, the search history filtered by the preference information and context of the user who uses the terminal 1 that has made the search request is extracted, and the similarity of the search engines is calculated. Thus, it is also possible to set the weight of the search engine in consideration of the similarity when matched to the user or the context. If the user preference information is acquired when a search request is received from the terminal 1, the latest preference information can be referred to.
[第3の実施形態]
 上記第2の実施形態では、検索要求を行って検索結果を受け付ける度に、検索履歴DBから検索履歴を取得し、各検索エンジン間の類似度を算出しているが、その他、例えば図1に示す構成において、検索エンジン類似度DB261が過去に入力された類似度データの履歴を保管し、ウェイト算出の際に参照することも考え得る。なお、類似度データの履歴は、ユーザコンテキスト毎、ユーザ毎、ユーザグループ毎等、様々な単位で保管しても良いし、全て保管しても良い。また、検索エンジン類似度DB261の記憶容量を考慮し、一定期間だけ保管したり、一定量以上は他の記憶装置に移して保管したりしても良い。また、類似度データの履歴を抽出する際に検索要求を行った端末1を利用するユーザのプリファレンス情報やコンテキストによりフィルタリングした検索履歴を抽出しても良い。これにより、より独自の検索を行う検索エンジンの影響を濃くした統合検索結果を得ることが出来る。
[Third Embodiment]
In the second embodiment, every time a search request is made and a search result is received, the search history is acquired from the search history DB and the similarity between the search engines is calculated. In the configuration shown, the search engine similarity DB 261 may store a history of similarity data input in the past and refer to it when calculating weights. The history of similarity data may be stored in various units such as for each user context, for each user, for each user group, or all of them. Further, in consideration of the storage capacity of the search engine similarity DB 261, the search engine similarity DB 261 may be stored for a certain period, or a certain amount or more may be stored in another storage device. Alternatively, the search history filtered by the preference information or context of the user who uses the terminal 1 that has requested the search when extracting the history of similarity data may be extracted. This makes it possible to obtain an integrated search result in which the influence of a search engine that performs a unique search is deepened.
[第4の実施形態]
 また、検索条件による絞込みを行わず、各検索エンジンの検索対象である全情報の順序付けに基づき、各検索エンジン間の類似度を算出することも可能である。また、各検索エンジンの検索対象の範囲に基づき、各検索エンジン間の類似度を算出することも可能である。
[Fourth Embodiment]
It is also possible to calculate the similarity between the search engines based on the ordering of all information that is the search target of each search engine without narrowing down by the search condition. It is also possible to calculate the similarity between the search engines based on the search target range of each search engine.
 以上、第2から第4の実施形態に示したように、各検索装置間の類似度の算出は様々考え得る。 As described above, as shown in the second to fourth embodiments, there are various ways of calculating the similarity between the search devices.
[第5の実施形態]
 次に、本発明の他の実施形態について説明する。本実施形態は、適合性フィードバックを用いる統合検索装置、すなわち、端末1の検索結果出力部12に出力された検索結果に対して行ったユーザによる操作(検索結果リストのクリック等)や、購買活動等のユーザ行動情報を、ウェイト設定にフィードバックする統合検索装置において、さらに各検索エンジン間の類似度を考慮したウェイト設定を行うことで、より独自の検索を行う検索エンジンの影響を濃くしやすい統合検索結果を得ることを特徴とする。
[Fifth Embodiment]
Next, another embodiment of the present invention will be described. This embodiment is an integrated search device that uses relevance feedback, that is, a user operation (such as clicking on a search result list) performed on a search result output to the search result output unit 12 of the terminal 1 or purchasing activity. In an integrated search device that feeds back user behavior information to the weight setting, the weight setting that considers the similarity between the search engines is further integrated to easily increase the influence of the search engine that performs the original search. A search result is obtained.
 検索結果に対して行ったユーザ行動情報のウェイト設定へのフィードバックは、例えば端末1に表示した検索結果に対してユーザがクリックしたとき、当該クリック行動を取得して、保管していた各検索エンジンのウェイトに反映させることにより適合性フィードバックを行う。これにより、よりユーザのプリファレンスに近いウェイト設定となるが、より独自の検索を行う検索エンジンの影響を濃くしやすい統合検索結果を得るためには、かかる重要度(ウェイト)の上げ方を抑えることも必要となるためである。 The feedback to the weight setting of the user behavior information performed on the search result is obtained by, for example, each search engine that acquires and stores the click behavior when the user clicks on the search result displayed on the terminal 1 Relevance feedback is performed by reflecting it in the weight of. This makes the weight setting closer to the user's preference, but in order to obtain integrated search results that tend to deepen the influence of the search engine that performs the original search, suppress how to increase the importance (weight). This is also necessary.
 本実施形態に係る統合検索システムの概略構成を図8に示す。図8に示すように、本実施形態では特に、端末1においてユーザ行動入力部13、統合検索装置2においてユーザ行動取得部25及びユーザ行動履歴DB263を備えることを特徴とする。 FIG. 8 shows a schematic configuration of the integrated search system according to the present embodiment. As shown in FIG. 8, the present embodiment is particularly characterized in that the terminal 1 includes a user behavior input unit 13, and the integrated search device 2 includes a user behavior acquisition unit 25 and a user behavior history DB 263.
 端末1が備えるユーザ行動入力部13は、検索結果出力部12に対して行ったユーザによる操作や、ユーザの購買活動等のユーザ行動が行われた場合に、ユーザ行動情報を、統合検索装置2が備えるユーザ行動取得部25へ伝達する。なお、端末1からのユーザ行動情報の伝達タイミングは、ユーザ行動が行われた場合の他、一旦はユーザ行動情報を端末1で保管しておき、一定期間毎や、検索要求時に統合検索装置2に伝達しても良い。 The user behavior input unit 13 included in the terminal 1 is configured to retrieve user behavior information when the user's operation performed on the search result output unit 12 or a user behavior such as a user purchase activity is performed. Is transmitted to the user action acquisition unit 25. Note that the user behavior information is transmitted from the terminal 1 in addition to the case where the user behavior is performed. The user behavior information is temporarily stored in the terminal 1, and the integrated search device 2 is used every predetermined period or at the time of a search request. May be communicated to.
 統合検索装置2が備えるユーザ行動取得部25は、ユーザ行動入力部13から受け取ったユーザ行動情報を、ユーザ行動履歴DB263へ入力する。入力時にウェイト算出部26に対して更新データがあることを伝達する。ユーザ行動履歴DB263での保管は、端末ごとに識別番号を付与し、同じ識別番号毎にユーザ行動情報を管理しても良い。 The user behavior acquisition unit 25 included in the integrated search device 2 inputs the user behavior information received from the user behavior input unit 13 to the user behavior history DB 263. At the time of input, the fact that there is update data is transmitted to the weight calculation unit 26. The storage in the user behavior history DB 263 may be provided with an identification number for each terminal, and user behavior information may be managed for each same identification number.
 統合検索装置2が備えるユーザ行動履歴DB262は、ユーザ行動取得部25から受け取ったユーザ行動情報を保管する。 The user behavior history DB 262 provided in the integrated search device 2 stores the user behavior information received from the user behavior acquisition unit 25.
 本実施形態の動作処理は以下の通りである。検索要求の受け付けから統合検索結果の算出までの大まかな流れは図2に示す処理と同様だが、本実施形態は特に図2のステップS6、S7のウェイト算出部26の処理に特徴を有する。ウェイト算出部26は、各検索エンジンのウェイトを算出する際、ユーザ行動取得部25から更新データがあることを通知されていた場合はユーザ行動履歴DBからユーザ行動情報を取得する。そしてウェイト算出部26は、各検索エンジン間の類似度データ及びユーザ行動情報に基づき、各検索エンジンのウェイトを算出する。 The operation process of this embodiment is as follows. The general flow from acceptance of a search request to calculation of an integrated search result is the same as the processing shown in FIG. 2, but this embodiment is particularly characterized by the processing of the weight calculation unit 26 in steps S6 and S7 of FIG. When calculating the weight of each search engine, the weight calculation unit 26 acquires user behavior information from the user behavior history DB when notified by the user behavior acquisition unit 25 that there is update data. The weight calculator 26 calculates the weight of each search engine based on the similarity data between the search engines and the user behavior information.
 ウェイトの算出についてさらに詳述する。なお本実施形態は、例として図3~図6の値を用いて説明する。例えば、ユーザ行動情報が、図6で一番上位にランクされている情報bをクリックしたという情報であった場合、これにより、情報bに影響を与えた検索エンジンのウェイトを上げるフィードバックが行われるが、さらに検索エンジン間の類似データを考慮してウェイトを算出する。ここでは、既に検索エンジン間の類似データを考慮して算出した各検索エンジンのウェイトw(上記数2参照)を用いて算出する。すなわち、情報bに対する各検索エンジンのスコアベクトルsb=(0.8、 0.8、 0.4、 0.8)の場合(図6参照)、上記数2に示すwの値と、c'1=0.1、c'2=0.06として、w+c'1s-c'2M'sを算出する。 The calculation of the weight will be further described in detail. This embodiment will be described using the values in FIGS. 3 to 6 as an example. For example, if the user behavior information is information that the information b ranked at the top in FIG. 6 is clicked, feedback is performed to increase the weight of the search engine that has affected the information b. However, the weight is calculated in consideration of similar data between search engines. Here, the calculation is performed using the weight w of each search engine that has already been calculated in consideration of similar data between search engines (see Equation 2 above). That is, when the score vector sb = (0.8, 0.8, 0.4, 0.8) T of each search engine for the information b (see FIG. 6), the value of w shown in the above equation 2 and c ′ 1 = 0.1, c ′ 2 Assuming = 0.06, w + c ′ 1 s−c ′ 2 M ′s is calculated.
Figure JPOXMLDOC01-appb-M000004
Figure JPOXMLDOC01-appb-M000004
 上記数4で得られた解を、下記数5に示すように、ベクトル要素の総和が1になるように正規化し、各検索エンジンのウェイトとして設定する。 The solution obtained by Equation 4 is normalized so that the sum of vector elements becomes 1, as shown in Equation 5 below, and set as the weight of each search engine.
Figure JPOXMLDOC01-appb-M000005
Figure JPOXMLDOC01-appb-M000005
 この結果は、事前の各検索エンジンのウェイトベクトルw(上記数2参照)と比較すると、類似性を考慮していない場合は、下記数6に示すように検索エンジンA、B、Dは同じ加点をされるはずが、本実施形態は類似性を考慮することにより、互いに類似度の高い検索エンジンがあるA、Bよりも独自性の強いDがより大きな加点がされている。 This result is the same as the weight vector w of each search engine in advance (see Equation 2 above). However, in the present embodiment, D is more unique than A and B, which have search engines with a high degree of similarity, in consideration of the similarity.
Figure JPOXMLDOC01-appb-M000006
Figure JPOXMLDOC01-appb-M000006
 上記数5に示すウェイトを反映した総合スコアの計算結果(統合検索結果)を図9に示す。図6と新たな図9とを比較すると、情報bのスコアが0.8で同じであった検索エンジンA、B、Dにおいて、類似する検索エンジンが相対的に少ないDが(図4参照)、より大きなウェイトを得たため、情報eが上位に移動している。 FIG. 9 shows the calculation result (integrated search result) of the total score reflecting the weight shown in Equation 5 above. Comparing FIG. 6 with the new FIG. 9, in search engines A, B, and D where the score of information b is the same at 0.8, there are relatively few D similar search engines (see FIG. 4). Information e has moved to the top because it gained a large weight.
 図9において、情報b以外の情報を選択(クリック、購買等)した場合も、同様の処理を繰り返し、ユーザ行動と類似度を考慮したウェイト配分にすることができる。 In FIG. 9, when information other than the information b is selected (click, purchase, etc.), the same processing can be repeated, and weight distribution can be made in consideration of user behavior and similarity.
 なお、同時に複数のユーザ行動情報のスコアベクトルsを加算した和のベクトルで実行するなどして反映させても良く、特に制限することはない。 Note that it may be reflected by executing a sum vector obtained by adding score vectors s of a plurality of user behavior information at the same time, and is not particularly limited.
 上記実施形態により、適合性フィードバックを用いた統合検索装置で、ユーザの行動履歴が多く集まったとしてもその計算量のスケーラビリティがない可能性があるが、本発明でそれを回避することができる。また、ユーザによる情報選択などの好みや意思を間接的に記す行動履歴が多く集まらない段階においても、類似性を考慮したウェイト配合を実現することが出来る。 According to the above embodiment, even if a large number of user behavior histories are collected in the integrated search device using relevance feedback, there is a possibility that the calculation amount is not scalable, but this can be avoided by the present invention. In addition, weight blending considering similarity can be realized even in a stage where many action histories that indirectly describe preferences and intentions such as information selection by the user are not collected.
 なお、図2のフローチャートに示す処理や、各実施形態における処理を、CPUが実行するためのプログラムは本発明によるプログラムを構成する。このプログラムを記録する記録媒体としては、半導体記憶部や光学的及び/又は磁気的な記憶部等を用いることができる。このようなプログラム及び記録媒体を、前述した各実施形態とは異なる構成のシステム等で用い、そこのCPUで上記プログラムを実行させることにより、本発明と実質的に同じ効果を得ることができる。 The program for the CPU to execute the processing shown in the flowchart of FIG. 2 and the processing in each embodiment constitutes a program according to the present invention. As a recording medium for recording the program, a semiconductor storage unit, an optical and / or magnetic storage unit, or the like can be used. By using such a program and a recording medium in a system having a configuration different from that of each of the above-described embodiments and causing the CPU to execute the program, substantially the same effect as the present invention can be obtained.
 以上、本発明を好適な実施形態に基づき具体的に説明したが、本発明は上記のものに限定されるものではなく、その要旨を逸脱しない範囲で種々変更可能であることは言うまでもない。 As described above, the present invention has been specifically described based on the preferred embodiments, but the present invention is not limited to the above-described ones, and it is needless to say that various modifications can be made without departing from the scope of the present invention.
 この出願は、2008年7月17日に出願された日本出願特願2008-186483を基礎とする優先権を主張し、その開示の全てをここに取り込む。 This application claims priority based on Japanese Patent Application No. 2008-188483 filed on July 17, 2008, the entire disclosure of which is incorporated herein.
 1  端末
 2  統合検索装置
 3  検索エンジンA
 4  検索エンジンB
 5  検索エンジンC
 6  検索エンジンD
 9、10  ネットワーク
 11  検索要求入力部
 12  検索結果出力部
 13  ユーザ行動入力部
 21  検索要求受付部
 22  検索要求部
 23  検索結果受付部
 24  検索エンジン類似度算出部
 25  ユーザ行動取得部
 26  ウェイト算出部
 27  検索結果統合部
 261  検索エンジン類似度DB
 262  検索履歴DB
 263  ユーザ行動履歴DB
1 Terminal 2 Integrated Search Device 3 Search Engine A
4 Search Engine B
5 Search engine C
6 Search Engine D
9, 10 Network 11 Search request input unit 12 Search result output unit 13 User behavior input unit 21 Search request reception unit 22 Search request unit 23 Search result reception unit 24 Search engine similarity calculation unit 25 User behavior acquisition unit 26 Weight calculation unit 27 Search result integration unit 261 Search engine similarity DB
262 Search history DB
263 User behavior history DB

Claims (12)

  1.  評価条件に従って評価を行う複数の評価装置と接続する手段と、ユーザ端末から入力された評価条件での評価を前記評価装置に要求する手段と、前記評価装置から取得した評価結果を組み合わせて統合する統合手段とを備える統合評価装置であって、
     前記複数の評価装置間の類似度を算出する評価装置類似度算出手段と、
     前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定するウェイト設定手段と、を備え、
     前記統合手段は、前記設定されたウェイトを考慮して前記評価結果を統合することを特徴とする統合評価装置。
    The means for connecting to a plurality of evaluation devices that perform evaluation according to the evaluation conditions, the means for requesting the evaluation device for evaluation under the evaluation conditions input from the user terminal, and the evaluation results acquired from the evaluation device are combined and integrated. An integrated evaluation device comprising an integration means,
    Evaluation device similarity calculation means for calculating the similarity between the plurality of evaluation devices;
    Considering the calculated similarity, weight setting means for setting the weight of each evaluation device by pressing the weight of the evaluation device having a high similarity,
    The integrated evaluation apparatus, wherein the integration unit integrates the evaluation results in consideration of the set weight.
  2.  前記評価装置類似度算出手段は、前記各評価結果に基づいて前記類似度を算出することを特徴とする請求項1記載の統合評価装置。 2. The integrated evaluation apparatus according to claim 1, wherein the evaluation apparatus similarity calculation means calculates the similarity based on each evaluation result.
  3.  前記出力結果の履歴を保管する出力結果履歴保管手段を備え、
     前記評価装置類似度算出手段は、前記履歴保管手段に保管されている各評価結果の履歴に基づいて算出することを特徴とする請求項1記載の統合評価装置。
    An output result history storage means for storing the history of the output results;
    The integrated evaluation apparatus according to claim 1, wherein the evaluation apparatus similarity calculation unit calculates the evaluation result based on a history of each evaluation result stored in the history storage unit.
  4.  前記評価装置間の類似度の履歴を保管する類似度履歴保管手段を備え、
     前記ウェイト設定手段は、さらに、前記類似度履歴保管手段に保管されている各評価装置間の類似度を考慮することを特徴とする請求項1記載の統合評価装置。
    A similarity history storage means for storing a history of similarity between the evaluation devices;
    2. The integrated evaluation apparatus according to claim 1, wherein the weight setting unit further considers the similarity between the evaluation apparatuses stored in the similarity history storage unit.
  5.  前記評価装置類似度算出手段は、前記評価装置の各評価対象全体での評価結果に基づいて類似度を算出することを特徴とする請求項1記載の統合評価装置。 2. The integrated evaluation apparatus according to claim 1, wherein the evaluation apparatus similarity calculation means calculates the similarity based on an evaluation result of each evaluation object of the evaluation apparatus as a whole.
  6.  前記評価装置類似度算出手段は、前記評価装置の各評価対象の範囲に基づいて類似度を算出することを特徴とする請求項1記載の統合評価装置。 The integrated evaluation apparatus according to claim 1, wherein the evaluation apparatus similarity calculation means calculates a similarity based on a range of each evaluation object of the evaluation apparatus.
  7.  前記統合手段により評価結果を統合して作成された統合評価結果に対するユーザ行動情報を取得するユーザ行動取得手段と、
     前記取得したユーザ行動情報を保管するユーザ行動履歴保管手段と、を備え、
     前記統合手段は、前記評価条件での評価要求を行ったユーザのユーザ行動情報履歴が前記ユーザ行動履歴保管手段に保管されている場合は前記保管されているユーザ行動情報履歴をさらに考慮し、前記評価結果を統合することを特徴とする請求項1記載の統合評価装置。
    User action acquisition means for acquiring user action information for an integrated evaluation result created by integrating evaluation results by the integration means;
    User behavior history storage means for storing the acquired user behavior information,
    The integration means further considers the stored user behavior information history when the user behavior information history of the user who made the evaluation request under the evaluation condition is stored in the user behavior history storage means, The integrated evaluation apparatus according to claim 1, wherein the evaluation results are integrated.
  8.  請求項1から6の何れか1項記載の統合評価装置と、複数の評価装置と、ユーザ端末と、を備えることを特徴とする統合評価システム。 An integrated evaluation system comprising: the integrated evaluation device according to any one of claims 1 to 6, a plurality of evaluation devices, and a user terminal.
  9.  請求項7記載の統合評価装置と、複数の評価装置と、ユーザ端末と、を備え、
     前記ユーザ端末は、前記統合評価装置から取得した統合評価結果に対して行ったユーザ行動の情報を前記統合評価装置へ送るユーザ行動入力手段を備えることを特徴とする統合評価システム。
    An integrated evaluation device according to claim 7, a plurality of evaluation devices, and a user terminal,
    The said user terminal is provided with the user action input means which sends the information of the user action performed with respect to the integrated evaluation result acquired from the said integrated evaluation apparatus to the said integrated evaluation apparatus, The integrated evaluation system characterized by the above-mentioned.
  10.  ユーザ端末から入力された評価条件での評価を、複数の評価装置に要求するステップと、
     前記複数の評価装置から前記要求に対応した評価結果を取得するステップと、
     前記複数の評価装置間の類似度を算出するステップと、
     前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定するステップと、
     前記設定されたウェイトを考慮して前記評価結果を統合するステップと、を備えることを特徴とする統合評価方法。
    Requesting a plurality of evaluation devices for evaluation under evaluation conditions input from a user terminal;
    Obtaining an evaluation result corresponding to the request from the plurality of evaluation devices;
    Calculating a similarity between the plurality of evaluation devices;
    Taking into account the calculated similarity, pressing the weight of the evaluation device having a high similarity, and setting the weight of each evaluation device;
    Integrating the evaluation results in consideration of the set weight, and an integrated evaluation method.
  11.  ユーザ端末から入力された評価条件での評価を、複数の評価装置に要求する処理と、
     前記複数の評価装置から前記要求に対応した評価結果を取得する処理と、
     前記複数の評価装置間の類似度を算出する処理と、
     前記算出した類似度を考慮し、類似度の高い評価装置の重みを押えて各評価装置のウェイトを設定する処理と、
     前記設定されたウェイトを考慮して前記評価結果を統合する処理と、を備えることを特徴とする統合評価プログラム。
    A process of requesting evaluations from a plurality of evaluation devices under evaluation conditions input from a user terminal;
    Processing for obtaining an evaluation result corresponding to the request from the plurality of evaluation devices;
    Processing for calculating the similarity between the plurality of evaluation devices;
    In consideration of the calculated similarity, processing to set the weight of each evaluation device by pressing the weight of the evaluation device having a high similarity,
    And a process of integrating the evaluation results in consideration of the set weights.
  12.  請求項11記載の統合評価プログラムの処理を記録するコンピュータ読取り可能な記録媒体。 A computer-readable recording medium for recording the processing of the integrated evaluation program according to claim 11.
PCT/JP2009/062892 2008-07-17 2009-07-16 Integrated evaluation device, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium WO2010008047A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2008-186483 2008-07-17
JP2008186483 2008-07-17

Publications (1)

Publication Number Publication Date
WO2010008047A1 true WO2010008047A1 (en) 2010-01-21

Family

ID=41550447

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2009/062892 WO2010008047A1 (en) 2008-07-17 2009-07-16 Integrated evaluation device, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium

Country Status (1)

Country Link
WO (1) WO2010008047A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011210120A (en) * 2010-03-30 2011-10-20 Yahoo Japan Corp Device and method for retrieving multiple servers
JP2014035719A (en) * 2012-08-10 2014-02-24 Nippon Telegr & Teleph Corp <Ntt> Contribution information system, calculation device, and calculation method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004287532A (en) * 2003-03-19 2004-10-14 Ricoh Co Ltd Integral metal retrieving device and method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004287532A (en) * 2003-03-19 2004-10-14 Ricoh Co Ltd Integral metal retrieving device and method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
MENG W ET AL.: "Building Efficient and Effective Metasearch Engine", ACM COMPUTING SURVEYS, vol. 34, no. 1, March 2002 (2002-03-01), pages 48 - 89 *
SHIGEYOSHI ONO ET AL.: "Shako Kitei o Mochiita Meta-kensaku ni Okeru Rank List no Togo Hoho no Teian", THE TRANSACTIONS OF THE INSTITUTE OF ELECTRONICS, INFORMATION AND COMMUNICATION ENGINEERS, vol. J88-D- I, no. 3, 1 March 2005 (2005-03-01), pages 657 - 667 *
YASUAKI YOSHIDA ET AL.: "Shoyo Kensaku Engine no Ranking ni Kansuru Teiryoteki Hyoka to Tokucho Kaiseki", IPSJ SIG NOTES, vol. 2007, no. 65, 3 July 2007 (2007-07-03), pages 441 - 446 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2011210120A (en) * 2010-03-30 2011-10-20 Yahoo Japan Corp Device and method for retrieving multiple servers
JP2014035719A (en) * 2012-08-10 2014-02-24 Nippon Telegr & Teleph Corp <Ntt> Contribution information system, calculation device, and calculation method

Similar Documents

Publication Publication Date Title
US9569499B2 (en) Method and apparatus for recommending content on the internet by evaluating users having similar preference tendencies
US8566370B2 (en) Ranking media content for cloud-based searches
US8782035B2 (en) Syndicated data stream content provisioning
TWI636416B (en) Method and system for multi-phase ranking for content personalization
JP5687288B2 (en) Social and contextual search for enterprise business applications
US8370332B2 (en) Blending mobile search results
US10380649B2 (en) System and method for logistic matrix factorization of implicit feedback data, and application to media environments
KR20140091530A (en) Relevance of name and other search queries with social network features
CN107590243A (en) The personalized service recommendation method to be sorted based on random walk and diversity figure
US20120295633A1 (en) Using user&#39;s social connection and information in web searching
JP2019125313A (en) Learning device, learning method, and learning program
JP2007058398A (en) Content recommendation device, content recommendation method and computer program
JP6203304B2 (en) Information processing apparatus, information processing method, and information processing program
JP6312913B1 (en) Information processing apparatus, information processing method, and information processing program
JP6570978B2 (en) Cluster selection device
KR20180080175A (en) Optimized Digital Component Analysis System
KR101334096B1 (en) Item based recommendation engiine recommending highly associated item
WO2010008047A1 (en) Integrated evaluation device, integrated evaluation system, integrated evaluation method, integrated evaluation program, and recording medium
US20150206220A1 (en) Recommendation Strategy Portfolios
US20150039602A1 (en) Systems and methods for providing ordered results for search queries
JP2009301540A (en) Information retrieval device and information retrieval method
JP2016110260A (en) Content search result provision system and content search result provision method
JP5094915B2 (en) Search device
Sarala et al. A Consumption History and QoS based Web Service Ranking Technique
Slaimi et al. A multigraph approach for web services recommendation

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09797971

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09797971

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP