JP7393060B2

JP7393060B2 - Personalized search method and search system combining attention mechanism

Info

Publication number: JP7393060B2
Application number: JP2022578858A
Authority: JP
Inventors: 琳暴; 英磊宋; 春晋; 志強盖
Original assignee: Jiangsu University of Science and Technology
Current assignee: Jiangsu University of Science and Technology
Priority date: 2021-04-14
Filing date: 2022-03-28
Publication date: 2023-12-06
Anticipated expiration: 2042-03-28
Also published as: CN113127737A; CN113127737B; JP2023530370A; WO2022218139A1

Description

本発明はデータマイニングの技術分野に属し、具体的には、パーソナライズド検索方法及び検索システムに関する。 The present invention belongs to the technical field of data mining, and specifically relates to a personalized search method and search system.

ビッグデータ、クラウドコンピューティング、ユビキタスネットワーク等の技術の急速な発展に伴い、インターネットの規模及びユーザー数は急激に増加し、ユーザーは既にデータの能動的な創造者となり、大量のマルチソース異種ユーザー生成コンテンツが集まり、様々な情報が錯綜し且つ爆発的な増加を示している。ユーザー生成コンテンツは大量の動的に進化した複雑なマルチソース異種データを含み、ソース及び構造の多様化、スパース性、マルチモード、不完全性、社会的伝播等の特徴を有し、豊かな価値のある情報及び巨大なマイニング潜在力を秘めており、様々なインターネットプラットフォーム及びモバイルアプリケーション業者が情報を取得し、業績やサービスを向上させる重要なソースでもあり、典型的なビッグデータ環境となっている。しかしながら、これらの複雑なマルチソース異種ユーザー生成コンテンツはユーザーに新しい情報をもたらす一方、ユーザーが情報をフィルタリング、選別及び処理して最終的に決定を行う困難性の増加につながり、すなわち、「情報過負荷」の問題を引き起こしてしまう。パーソナライズド検索及び推薦アルゴリズムは、ユーザーと情報を結ぶ架け橋として、大量のマルチソース異種ユーザー生成データを十分に活用し、ユーザーの潜在的なニーズ及び認知の嗜好に応じてユーザーの行動及び発展動態を予測し、できるだけユーザーが大量の情報からユーザーのニーズ及び趣味嗜好に一致するコンテンツをフィルタリングすることを支援し、「情報過負荷」を効果的に軽減し、ユーザー体験及びウエブサイトプラットフォームの商業的利益を向上させることができる。 With the rapid development of big data, cloud computing, ubiquitous network and other technologies, the scale and number of users of the Internet have increased rapidly, and users have already become active creators of data, generating a large amount of multi-source heterogeneous users. Content is gathering, and various information is becoming complicated and increasing explosively. User-generated content contains a large amount of dynamically evolving, complex multi-source heterogeneous data, has characteristics such as source and structure diversification, sparsity, multi-mode, incompleteness, social propagation, etc., and has rich value. It has a large amount of information and huge mining potential, and is also an important source for various Internet platforms and mobile application companies to obtain information and improve their business performance and services, making it a typical big data environment. . However, while these complex multi-source heterogeneous user-generated contents bring new information to users, they also lead to increased difficulty for users to filter, sort and process information and ultimately make decisions, i.e., “information overload”. This will cause a load problem. Personalized search and recommendation algorithms, as a bridge between users and information, make full use of large amounts of multi-source heterogeneous user-generated data, and adjust user behavior and development dynamics according to users' latent needs and cognitive preferences. Anticipate and help users filter content that matches their needs and hobbies from a large amount of information as much as possible, effectively reducing "information overload" and improving the user experience and the commercial interests of the website platform. can be improved.

ユーザー生成コンテンツ向けのパーソナライズド検索タスクの本質は、マルチソース異種ユーザー生成データで構成される動的進化空間においてユーザーのニーズ及びパーソナライズド嗜好を満たす最適目標を探索することであり、すなわち、動的定性指標最適化問題である。この複雑な定性指標最適化問題は、その目的関数及び性能指標を数学関数で正確に記述できないだけでなく、その最適化問題の決定変数も単純な構造化データではなく、多くの場合、主観性、曖昧さ、不確実性及び不整合性が大きく、ユーザーは経験知識及び趣味嗜好に基づいて検索対象となるアイテムに対して定性分析、評価及び決定を行う必要があるため、特定の正確な数学モデルを構築して記述を行うことが困難である。近年、提案されている人間のインテリジェント評価を融合した対話型共進化計算は、ユーザーの主観的な認知経験、インテリジェント評価・決定及び従来の進化計算を組み合わせ、上記複雑なパーソナライズド検索の定性指標最適化問題を処理する効果的な方法である。 The essence of the personalized search task for user-generated content is to search for an optimal goal that satisfies users' needs and personalized preferences in a dynamic evolving space consisting of multi-source heterogeneous user-generated data, i.e., dynamic It is a qualitative index optimization problem. This complex qualitative index optimization problem not only cannot accurately describe its objective function and performance index with mathematical functions, but also the decision variables of the optimization problem are not simple structured data and are often subject to subjectivity. , since there is a large degree of ambiguity, uncertainty and inconsistency, and users need to perform qualitative analysis, evaluation and decisions on the items to be searched based on their experiential knowledge and hobbies and preferences, certain exact mathematics are not required. It is difficult to construct and describe a model. In recent years, an interactive co-evolutionary calculation that combines human intelligent evaluation has been proposed, which combines the user's subjective cognitive experience, intelligent evaluation/decision, and traditional evolutionary calculation to optimize the qualitative indicators for the above-mentioned complex personalized search. It is an effective way to handle the problem.

出願番号がＣＮ２０２０１０２１６５５７４の中国特許は、制限付きボルツマンマシン駆動ベースの対話型パーソナライズド検索方法を開示しているが、ユーザーの趣味嗜好モデルの構築は異なるアイテム属性の決定変数がユーザー嗜好に対して異なる影響力を持つことを考慮せず、使用されるアイテムの決定変数に同じ重みを適用するため、ユーザー嗜好に対する各決定変数の影響を十分に反映できず、それによって、より正確なユーザー嗜好モデルの構築が困難であり、さらにユーザーによるパーソナライズド検索の効果を損なってしまう。 The Chinese patent with application number CN2020102165574 discloses a restricted Boltzmann machine-driven based interactive personalized search method, in which the construction of a user's taste preference model is such that the decision variables of different item attributes are different for user preferences. Because it applies the same weight to the decision variables of the used items without considering their influence, the influence of each decision variable on user preferences cannot be fully reflected, thereby creating a more accurate user preference model. They are difficult to construct and further reduce the effectiveness of personalized searches for users.

発明の目的について、従来技術に存在する問題に対して、本発明は注意メカニズムを融合したパーソナライズド検索方法及び検索システムを提供し、検索方法は異なる決定成分がユーザー嗜好に対して異なる影響力を持つことを考慮することで、ユーザーがパーソナライズド検索をより効果的に行うことを支援できる。 Regarding the purpose of the invention, to address the problems existing in the prior art, the present invention provides a personalized search method and search system that integrates an attention mechanism, and the search method is designed so that different determining components have different influences on user preferences. By considering what you have, you can help users conduct personalized searches more effectively.

技術的解決手段について、本発明の一態様によれば、注意メカニズムを融合したパーソナライズド検索方法を開示し、
ユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含むユーザー生成コンテンツを収集して取得し、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得するステップ１と、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムでユーザー嗜好を含む優位アイテムグループＤを構成するステップ２であって、ＤにおけるアイテムはセットＳを構成し、Ｓ＝｛（ｕ，ｘ_ｉ，Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ）｝であり、ここで、ｘ_ｉ∈Ｄであり、Ｃ_ｉはアイテムｘ_ｉのカテゴリラベルベクトルであり、Ｔ_ｉはアイテムｘ_ｉに対するユーザーのテキストコメントのベクトル化表現であり、Ｇ_ｉはアイテムｘ_ｉの画像特徴のベクトル化表現であり、ｉ＝１，２，・・・，｜Ｄ｜であり、｜Ｄ｜はＤにおけるアイテム数を表すステップ２と、
注意メカニズムを融合したユーザー嗜好感知モデルを構築し、前記モデルは深層信念ネットワークに基づいて３層の制限付きボルツマンマシンから構成され、１層目の制限付きボルツマンマシンの可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目のＲＢＭを構成し、ｈ_２は可視層として隠れ層ｈ_３とともに３層目のＲＢＭを構成し、前記注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であり、
優位アイテムグループＤを使用して、コントラスティブダイバージェンス学習アルゴリズムで注意メカニズムを融合したユーザー嗜好感知モデルにおける１層目のＲＢＭを訓練し、そのモデルパラメータθ_１＝｛ｗ_１，ａ_１，ｂ_１｝を取得し、
１層目のＲＢＭモデルの訓練が完了した後、隠れユニットの状態が与えられると、各可視ユニットの活性化状態は条件付き独立であり、あるアイテムｘ_ｉのベクトル表現［Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ］は可視層に入力され、その第１組、第２組及び第３組の可視ユニットの活性化確率はそれぞれ

であり、ここで、ａ_１，ｊ、ａ_１，ｋ及びａ_１，ｌはそれぞれ第１組、第２組及び第３組の可視ユニットのバイアスを表し、
様々なマルチソース異種データの情報エントロピーを計算し、アイテムカテゴリラベルの情報エントロピーは、
であり、
テキストコメントベクトルの情報エントロピーは、
であり、
アイテム画像特徴ベクトルの情報エントロピーは、
であり、
ここで、ｃ_ｉｊはアイテムｘ_ｉのカテゴリラベルベクトルＣ_ｉのｊ番目の要素を表し、ｐ_（ｃ_ｉｊ）はＲＢＭ１においてアイテムカテゴリラベルのベクトル表現のｊ番目の要素に対応する可視ユニットの活性化確率を表し、
ｔ_ｉｋはアイテムｘ_ｉに対するユーザーｕのテキストコメントのベクトル化表現Ｔ_ｉのｋ番目の要素を表し、ｐ_（ｔ_ｉｋ）はＲＢＭ１においてユーザーのテキストコメントのベクトル表現のｋ番目の要素に対応する可視ユニットの活性化確率を表し、
ｇ_ｉｌは、アイテムｘ_ｉの画像特徴のベクトル化表現Ｇ_ｉのｌ番目の要素を表し、ｐ_{（ｇｉｌ）}はＲＢＭ１においてアイテム画像特徴のベクトル表現のｌ番目の要素に対応する可視ユニットの活性化確率を表し、
次に、総情報エントロピーに対する様々な情報エントロピーの割合を重み係数として計算し、
ここで、Ｈ（ｘ_ｉ）＝Ｈ（Ｃ_ｉ）＋Ｈ（Ｔ_ｉ）＋Ｈ（Ｇ_ｉ）であり、
ベクトルＣ_ｉ、Ｔ_ｉ、Ｇ_ｉを組み合わせてアイテムｘ_ｉの決定ベクトルΨ_ｉを構成してｖ_１、ｖ_２、ｖ_３における各可視ユニットに入力すると、隠れ層ｈ_１における各隠れユニットの活性化状態は条件付き独立であり、ｍ_１番目の隠れユニットの活性化確率は、
であり、
ここで、ｍ_１＝１，２，・・・，Ｍ_１であり、
はｈ_１におけるｍ_１番目の隠れユニットのバイアスであり、ｖ_１ｊはＲＭＢ１の第１組の可視ユニットｖ_１におけるｊ番目の可視ユニットの状態であり、ｖ_２ｋはＲＭＢ１の第２組の可視ユニットｖ_２におけるｋ番目の可視ユニットの状態であり、ｖ_３ｌはＲＭＢ１の第３組の可視ユニットｖ_３におけるｌ番目の可視ユニットの状態であり、
はｗ_１における要素値であり、ＲＢＭ１におけるｎ番目の可視ユニットとｍ_１番目の隠れユニットとの間の接続重みを表し、ｎ＝１，２，・・・，Φであり、
は隠れ層ｈ_１におけるｍ_１番目の隠れユニットの状態を表し、
はｓｉｇｍｏｉｄ活性化関数であり、
ＲＢＭ１の訓練が完了した後、式（９）に従ってアイテムｘ_ｉに対応する各隠れユニットの状態を取得し、さらに優位アイテムグループＤにおける各アイテムの各決定成分に対するユーザーの嗜好度、すなわち可視層のユニットの活性化確率を、注意重み係数ａｔ_ｎ（ｘ_ｉ）として取得し、
ここで、
はΨ_ｉがＲＢＭ１の可視層の各可視ユニットの状態とする場合、隠れ層ｈ_ｉにおけるｍ_ｉ番目の隠れユニットの状態を表し、ａｔ_ｎ（ｘ_ｉ）はアイテムｘ_ｉの各決定成分ψ_ｉｎの注意重みを表し、
注意重み係数ａｔ_ｎ（ｘ_ｉ）をアイテムｘ_ｉの各決定成分の重み係数として優位アイテムグループＤにおけるアイテムｘ_ｉに対して注意メカニズムに基づく符号化を行い、符号化後、ｘ_ａｔｉで表し、
ｘ_ａｔｉを事前訓練後のＲＢＭ１に入力して可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）を得て、
ここで、
はｘ_ａｔｉのｎ’番目の要素であり、
ＲＢＭ１の可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）で自己注意メカニズム演算を行い、アイテム個体のユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）を動的に学習し、
ここで、ｓｏｆｔｍａｘ（）関数はすべての重み係数の和が１であることを確保し、関数ａ（Ｖ_ＲＢＭ１（ｘ_ａｔｉ），ｗ_１）はユーザー嗜好特徴に対するアイテムｘ_ｉの注意重み係数を測定し、計算は
であり、
ユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）とアイテムｘ_ｉの元の決定ベクトルＣ_ｉ，Ｔ_ｉ，Ｇ_ｉとを組み合わせて、注意メカニズムを融合したアイテム決定ベクトルを生成し、
注意メカニズムを融合したアイテム決定ベクトル
で訓練セットを構成し、ＤＢＮにおけるＲＢＭ１、ＲＢＭ２、ＲＢＭ３モデルを層ごとに訓練し、訓練が完了した後、注意メカニズムを融合するＤＢＮベースのユーザー嗜好感知モデル及びその最適化モデルパラメータθを取得するステップ３と、
訓練済みの注意メカニズムを融合したＤＢＮベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表すステップ４と、
個体群のサイズＮを設定し、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズムを採用して、Ｎ個の新個体を生成し、各個体は１つのアイテムであり、ｖ番目の新個体のカテゴリラベルベクトル
（ｖ＝１，２，・・・，Ｎ）の設定ステップは、
（５．１）ｖ＝１とし、
（５．２）［０，１］の乱数ｚを生成し、ｚ≦Ｐ（ψ_ｊ＝１）である場合、ｖ番目の新個体のカテゴリラベルベクトル
のｊ番目の要素は１であり、そうでない場合、０であり、
（５．３）ｖに１を足し、ｖ＞Ｎになるまでステップ（５．２）を繰り返すことであるステップ５と、
検索空間においてＮ個の新個体のカテゴリラベルベクトル
の類似度と最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成するステップ６と、
推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値
を計算し、
ここで、
及び
はそれぞれ推薦対象アイテムセットＳ^ｕにおけるアイテムのエネルギー関数の最大値及び最小値を表し、
はアイテムｘ^＊のエネルギー関数であり、ｘ^＊∈Ｓ^ｕであり、その計算は、
であり、
ここで、
はアイテムｘ^＊のｎ番目の決定成分であるステップ７と、
Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎであるステップ８と、を含み、
ユーザーの対話型検索プロセスの推進及びユーザー行動の動的進化に伴い、現在のユーザーの最新の評価データに応じて優位アイテムグループＤを更新し、注意メカニズムを融合したユーザー嗜好感知モデルを再度訓練し、抽出されたユーザー嗜好特徴を動的に更新すると同時に、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を更新する。 Regarding the technical solution, according to one aspect of the present invention, a personalized search method integrated with an attention mechanism is disclosed,
Collect and retrieve user-generated content including all items that user u has rated, scores and text comments for each item, images for each item, and other users' usefulness evaluation scores for user u's ratings, and text comments. Step 1 of vectorizing the item image, extracting features from the item image, and obtaining the eigenvector;
Step 2 of configuring a dominant item group D including user preferences with items having a user score larger than a predetermined score threshold and a reliability larger than a predetermined confidence threshold, the items in D forming a set S; , S={(u, x _i , C _i , T _i , G _i )}, where x _i ∈D, C _i is the category label vector of item x _i , and T _i is the item is a vectorized representation of the user's text comment for x _i , G _i is a vectorized representation of the image features of item x _i , i = 1, 2, ..., |D|, and |D| Step 2 representing the number of items in D;
A user preference sensing model that combines attention mechanisms is constructed, and the model is composed of three layers of restricted Boltzmann machines based on a deep belief network, and the visible layer of the first layer of restricted Boltzmann machines is the first set of visible layers. unit v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is a visible layer that includes a second layer RBM together with a hidden layer h ₂ . The parameters of _the user preference _sensing model that combines the above attention mechanisms are θ={θ ₁ , θ ₂ , θ ₃ }={ w ₁ , a ₁ , b ₁ , w ₂ , a ₂ , b ₂ , w ₃ , a ₃ , b ₃ },
Using the dominant item group D, train the first layer RBM in the user preference sensing model that combines the attention mechanism with the contrastive divergence learning algorithm, and set the model parameters θ ₁ = {w ₁ , a ₁ , b ₁ } get
After the training of the first layer RBM model is completed, given the state of the hidden unit, the activation state of each visible unit is conditionally independent, and the vector representation of some item x _i [C _i , T _i , G _i ] is input to the visible layer, and the activation probabilities of the first set, second set, and third set of visible units are respectively

, where a _1,j , a _1,k and a _1,l represent the biases of the first, second and third sets of visible units, respectively;
The information entropy of various multi-source heterogeneous data is calculated, and the information entropy of item category label is
and
The information entropy of the text comment vector is
and
The information entropy of the item image feature vector is
and
Here, c _ij represents the j-th element of the category label vector C _i of item x _i , and p ₍ c _ij) is the activation of the visible unit corresponding to the j-th element of the vector representation of the item category label in RBM1. represents the probability,
t _ik represents the kth element of the vectorized representation T _i of user u's text comment for item x _i , and p ₍ t _ik) is the visible vector representation T i of the vectorized representation of user u's text comment in RBM1. represents the activation probability of the unit,
g _il represents the lth element of the vectorized representation G _i of the image features of item x _i , and p _(gil) is the activation of the visible unit corresponding to the lth element of the vectorized representation of the item image features in RBM1. represents the probability,
Next, calculate the ratio of various information entropies to the total information entropy as a weighting factor,
Here, H(x _i )=H(C _i )+H(T _i )+H(G _i ),
When vectors C _i , T _i , G _i are combined to form the decision vector Ψ _i of item x _i and inputted to each visible unit in v ₁ , v ₂ , v ₃ , the activation of each hidden unit in hidden layer h ₁ is The activation states are conditionally independent, and the activation probability of the m _- th hidden unit is
and
Here, m ₁ =1, 2,..., M ₁ ,
is the bias of the m _-th hidden unit in h ₁ , v _1j is the state of the j-th visible unit in the first set of visible units v ₁ of RMB1, and v _2k is the state of the j-th visible unit in the second set of visible units of RMB1. v ₂ is the state of the k-th visible unit, v _3l is the state of the l-th visible unit in the third set of visible units v ₃ of RMB1,
is the element value in w ₁ and represents the connection weight between the nth visible unit and m _1st hidden unit in RBM1, where n = 1, 2, ..., Φ,
represents the state of the m _- th hidden unit in the hidden layer _h1 ,
is the sigmoid activation function,
After the training of RBM1 is completed, the state of each hidden unit corresponding to item x _i is obtained according to equation (9), and the user's preference degree for each determining component of each item in dominant item group D, that is, the state of the visible layer Obtain the activation probability of the unit as an attention weighting coefficient at _n (x _i ),
here,
If Ψ _i is the state of each visible unit in the visible layer of RBM1, represents the state of the m _i -th hidden unit in the hidden layer h _i , and at _n (x _i ) is each determinant component ψ _in of item x _i represents the attention weight of
Encoding is performed on the item x _i in the dominant item group D based on the attention mechanism using the attention weight coefficient at _n (x _i ) as the weight coefficient of each determining component of the item x _i , and after encoding, it is expressed as x _ati ,
x _ati is input to RBM1 after pre-training to obtain the activation probability V _RBM1 (x _ati ) of the visible unit,
here,
is the n'th element of x _ati ,
Perform a self-attention mechanism calculation using the activation probability V _RBM1 (x _ati ) of the visible unit of RBM1, dynamically learn the user preference attention weight vector A (x _ati ) for each item,
Here, the softmax() function ensures that the sum of all weighting coefficients is 1, and the function a(V _RBM1 (x _ati ), w ₁ ) measures the attention weighting coefficient of item x _i with respect to user preference features. And the calculation is
and
Combining the user preference attention weight vector A(x _ati ) and the original decision vector C _i , T _i , G _i of item x _i to generate an item decision vector that fuses the attention mechanism;
Item decision vector fused with attention mechanism
construct a training set, train the RBM1, RBM2, and RBM3 models in the DBN layer by layer, and after the training is completed, obtain the DBN-based user preference sensing model that fuses the attention mechanism and its optimization model parameters θ. Step 3 and
Build a user preference-based distribution estimation probability model P(x) based on the DBN-based user preference sensing model fused with the trained attention mechanism and its model parameters;
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item Step 4 representing the preference probability of
Set the population size N, use the user preference-based distribution estimation probability model P(x), and adopt the distribution estimation algorithm to generate N new individuals, each individual with one item. Yes, the category label vector of the vth new individual
The setting step for (v=1, 2,...,N) is
(5.1) Let v=1,
(5.2) Generate a random number z of [0, 1], and if z≦P (ψ _j =1), the category label vector of the vth new individual
The jth element of is 1, otherwise 0,
(5.3) Step 5, which is to add 1 to v and repeat step (5.2) until v>N;
Category label vector of N new individuals in search space
Step 6 of selecting the N items with the highest similarity to constitute a recommendation target item set S ^u ;
Adaptation value of each item in the recommendation target item set S ^u
Calculate,
here,
as well as
respectively represent the maximum value and minimum value of the energy function of the item in the recommendation target item set S ^u ,
is the energy function of item x ^* , x ^* ∈S ^u , and its calculation is
and
here,
is the nth determining component of item x ^* , step 7;
Selecting the top N items with the highest adaptation value in S ^u as the search results, TopN<N, step 8;
With the promotion of the user's interactive search process and the dynamic evolution of user behavior, we will update the dominant item group D according to the latest evaluation data of the current user, and retrain the user preference sensing model that combines the attention mechanism. , dynamically updates the extracted user preference features, and at the same time updates the user preference-based distribution estimation probability model P(x).

別の態様によれば、本発明は上記パーソナライズド検索方法を実現する検索システムをさらに開示し、
ユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含むユーザーｕの生成コンテンツを収集して取得し、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得するユーザー生成コンテンツ取得モジュールと、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムでユーザー嗜好を含む優位アイテムグループＤを構成する優位アイテムグループ構築モジュールと、
注意メカニズムを融合したユーザー嗜好感知モデルを構築して訓練するユーザー嗜好感知モデル構築訓練モジュールであって、前記モデルは深層信念ネットワークに基づいて３層の制限付きボルツマンマシンから構成され、１層目の制限付きボルツマンマシンの可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目の制限付きボルツマンマシンを構成し、ｈ_２は可視層として、隠れ層ｈ_３とともに３層目の制限付きボルツマンマシンを構成し、前記注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であるユーザー嗜好感知モデル構築訓練モジュールと、
訓練済みの注意メカニズムを融合した深層信念ネットワークベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表すユーザー嗜好ベースの分布推定確率モデル構築モジュールと、
ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズムを採用して、それぞれ１つのアイテムであるＮ（Ｎは所定の個体群のサイズである）個の新個体を生成し、各新個体のカテゴリラベルベクトルを設定する個体群生成モジュールと、
検索空間においてＮ個の新個体のカテゴリラベルベクトル
との類似度が最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成する推薦対象アイテムセット構築モジュールと、
推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値を計算する適応値計算モジュールと、
Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎである検索結果選択モジュールと、を含む。 According to another aspect, the present invention further discloses a search system that implements the personalized search method,
Collecting and retrieving content generated by user u, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings; a user-generated content acquisition module that vectorizes text comments, performs feature extraction on item images, and acquires eigenvectors;
a superior item group construction module that configures a superior item group D including user preferences with items whose user score is greater than a predetermined score threshold and whose reliability is greater than a predetermined reliability threshold;
A user preference sensing model building and training module that builds and trains a user preference sensing model that integrates an attention mechanism, the model is composed of a three-layer restricted Boltzmann machine based on a deep belief network, and the first layer The visible layer of the restricted Boltzmann machine includes a first set of visible units v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is the visible layer. , a second-layer restricted Boltzmann machine is constructed with the hidden layer _h2 , _h2 is the visible layer, and a third-layer restricted Boltzmann machine is constructed with the hidden layer _h3 . The parameters of the preference sensing model are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w 2 , a ₂ , b ₂ _, w ₃ , a ₃ , b ₃ } A preference sensing model construction training module;
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item a user preference-based distribution estimation probability model construction module representing preference probabilities;
Using the user preference-based distribution estimation probability model P(x), a distribution estimation algorithm is adopted to generate N (N is the size of a given population) new individuals, each of which is one item. and a population generation module that sets a category label vector for each new individual;
Category label vector of N new individuals in search space
a recommendation target item set construction module that selects N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
an adaptation value calculation module that calculates the adaptation value of each item in the recommendation target item set S ^u ;
A search result selection module that selects TopN items with the highest adaptation value in ^Su as search results, and where TopN<N.

有益な効果について、本発明に開示されているパーソナライズド検索方法は、ユーザースコア、アイテムカテゴリラベル、ユーザーテキストコメント、評価信頼度及びアイテム画像情報を含むマルチソース異種ユーザー生成コンテンツを十分に活用し、注意メカニズムを融合したユーザー嗜好感知モデルを構築し、このユーザー嗜好感知モデルに基づいて、ユーザー嗜好ベースの分布推定確率モデルを構築し、ユーザー嗜好を含む新しい実行可能解アイテムを生成し、適応値が最も高い複数のアイテムを最終検索結果として選択する。該方法は、ビッグデータ環境におけるマルチソース異種ユーザー生成コンテンツ向けのパーソナライズド検索タスクをよく処理でき、ユーザーがパーソナライズド検索を行うように効果的にガイドし、ユーザーができるだけ早く満足の行く解を検索することを支援し、パーソナライズド検索アルゴリズムの総合的性能を向上させる。 For beneficial effects, the personalized search method disclosed in the present invention makes full use of multi-source heterogeneous user-generated content including user scores, item category labels, user text comments, rating reliability and item image information; Build a user preference sensing model that fuses the attention mechanism, and based on this user preference sensing model, build a user preference-based distribution estimation probability model, generate a new executable solution item that includes user preferences, and calculate the adaptive value. Select the highest multiple items as the final search result. The method can well handle the personalized search task for multi-source heterogeneous user-generated content in big data environment, effectively guide users to perform personalized searches, and help users search for satisfactory solutions as soon as possible. improve the overall performance of personalized search algorithms.

図１は本発明に開示されている注意メカニズムを融合したパーソナライズド検索方法のフローチャートである。FIG. 1 is a flowchart of the personalized search method integrated with the attention mechanism disclosed in the present invention. 図２は注意メカニズムを融合したユーザー嗜好感知モデルの構造模式図である。Figure 2 is a schematic structural diagram of a user preference sensing model that combines attention mechanisms. 図３は注意メカニズムを融合したパーソナライズド検索システムの構成模式図である。FIG. 3 is a schematic diagram of the configuration of a personalized search system that combines an attention mechanism.

以下、図面及び特定の実施形態を参照しながら本発明をさらに説明する。 The invention will now be further described with reference to the drawings and specific embodiments.

図１に示すように、本発明は注意メカニズムを融合したパーソナライズド検索方法を開示し、以下のステップ１～８を含む。 As shown in FIG. 1, the present invention discloses a personalized search method integrated with an attention mechanism, which includes the following steps 1-8.

ステップ１では、ユーザー生成コンテンツを収集して取得し、前記ユーザー生成コンテンツはユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、お及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含み、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得し、
本実施例では、テキストコメントのベクトル化表現のステップは、テキストコメントにおけるストップワード及び句読点等を除去し、データ前処理を行い、文献ＤｅｖｌｉｎＪ，ＣｈａｎｇＭＷ，ＬｅｅＫ，ｅｔａｌ．ＢＥＲＴ：Ｐｒｅ－ｔｒａｉｎｉｎｇｏｆＤｅｅｐＢｉｄｉｒｅｃｔｉｏｎａｌＴｒａｎｓｆｏｒｍｅｒｓｆｏｒＬａｎｇｕａｇｅＵｎｄｅｒｓｔａｎｄｉｎｇ［Ｊ］．ａｒＸｉｖ：１８１０．０４８０５ｖ２［ｃｓ．ＣＬ］２４Ｍａｙ２０１９．におけるＢＥＲＴモデルを使用して、ユーザーテキストコメントに対してベクトル化表現を行うことである。 In step 1, user-generated content is collected and obtained, the user-generated content includes all items rated by user u, scores and text comments for each item, images of each item, and other information for user u's ratings. Contains the user's usefulness evaluation score, vectorizes the text comment, performs feature extraction on the item image, and obtains the eigenvector.
In this embodiment, the step of vectorized representation of text comments involves removing stop words, punctuation marks, etc. in text comments, performing data preprocessing, and performing data preprocessing as described in the literature Devlin J, Chang M W, Lee K, et al. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding [J]. arXiv:1810.04805v2 [cs. CL] 24 May 2019. The first step is to perform a vectorized representation on user text comments using the BERT model.

アイテム画像の特徴抽出は、文献ＫｒｉｚｈｅｖｓｋｙＡ，ＳｕｔｓｋｅｖｅｒＩ，ＨｉｎｔｏｎＧＥ．ＩｍａｇｅＮｅｔｃｌａｓｓｉｆｉｃａｔｉｏｎｗｉｔｈｄｅｅｐｃｏｎｖｏｌｕｔｉｏｎａｌｎｅｕｒａｌｎｅｔｗｏｒｋｓ．Ｉｎ：Ｐｒｏｃｅｅｄｉｎｇｓｏｆｔｈｅ２５ｔｈＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＮｅｕｒａｌＩｎｆｏｒｍａｔｉｏｎＰｒｏｃｅｓｓｉｎｇＳｙｓｔｅｍｓ．ＬａｋｅＴａｈｏｅ，Ｎｅｖａｄａ，ＵＳＡ：ＣｕｒｒａｎＡｓｓｏｃｉａｔｅｓＩｎｃ．，２０１２．１０９７－１１０５．におけるＡｌｅｘＮｅｔモデルを使用して、アイテム画像に対して特徴抽出及びベクトル化表現を行う。 Feature extraction of item images is described in the literature Krizhevsky A, Sutskever I, Hinton G E. Image Net classification with deep convolutional neural networks. In: Proceedings of the 25th International Conference on Neural Information Processing Systems. Lake Tahoe, Nevada, USA: Curran Associates Inc. , 2012. 1097-1105. Feature extraction and vectorization expression are performed on item images using the AlexNet model.

ユーザーｕの評価に対するほかのユーザーの有用性評価とは、ほかのユーザーがあるアイテムに対する現在のユーザーｕの評価情報に対して有用性評価を行うことであり、有用であると判定する場合、１でマークし、そうでない場合、０でマークし、すべてのほかのユーザーがあるアイテムに対する現在のユーザーｕの評価情報に対して行う評価を統計し、１でマークされるものの総数がユーザーｕの評価に対するほかのユーザーの有用性評価スコアである。例えば、現在のユーザーｕがアイテムｘに対して評価を行い、ユーザーＡ及びユーザーＢは該評価に対して有用性評価を行い、該評価はアイテムｘに対する現在のユーザーの評価の信頼度を反映し、すべてのほかのユーザーがアイテムｘに対する現在のユーザーｕの評価に対して行う有用性評価を統計することで、無効な評価又は偽のコメントをフィルタリングできる。 Another user's usefulness evaluation of user u's evaluation means that other users evaluate the usefulness of user u's current evaluation information for a certain item, and when it is determined that it is useful, 1 If not, mark it with 0, and calculate the ratings that all other users give to the current user u's rating information for a certain item, and the total number of items marked with 1 is the user u's rating. This is the usefulness rating score of other users. For example, current user u makes a rating for item x, user A and user B give a usefulness rating for the rating, and the rating reflects the reliability of the current user's rating for item x. , invalid ratings or fake comments can be filtered by statistics on the usefulness ratings that all other users give to the current user u's ratings for item x.

ユーザーｕの評価に対するほかのユーザーの有用性評価スコアとユーザーｕが評価するアイテムの総数との比は、アイテムに対するユーザーｕの評価の信頼度である。 The ratio of the usefulness evaluation score of other users to the user u's evaluation and the total number of items evaluated by the user u is the reliability of the user u's evaluation of the item.

ステップ２では、ユーザー嗜好の優位アイテムグループＤを構築し、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムはユーザー嗜好アイテムである。ユーザーの嗜好が曖昧さ、不確実性及び動的変化の特性を有するため、本実施例は既存のユーザー嗜好アイテムグループに一定のランダム性を導入することでユーザーの選択範囲を拡大し、それによりユーザーの選択は現在の嗜好情報の範囲内に過度に制限されることなく、実際の環境及びユーザー嗜好の動的変化に適応できる。それによって、スコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテム、及び検索空間においてランダムにサンプリングされる複数の新アイテムで優位アイテムグループＤを構成する。優位アイテムグループＤに追加される新アイテムはユーザー嗜好を含む可能性もあれば、ユーザー嗜好を含まない可能性もあり、ランダムなものであり、アイテムグループの多様性を高める。優位アイテムグループＤに対する新アイテムの割合は３０％以下であり、本実施例では、新アイテムは優位アイテムグループＤのアイテムの総数の１０％を占める。 In step 2, construct a superior item group D of user preferences,
An item whose user score is greater than a predetermined score threshold and whose reliability is greater than a predetermined reliability threshold is a user preference item. Since user preferences have the characteristics of ambiguity, uncertainty and dynamic change, this embodiment expands the user selection range by introducing a certain randomness into the existing user preference item groups, thereby User choices are not unduly restricted within current preference information and can adapt to dynamic changes in the actual environment and user preferences. As a result, a dominant item group D is composed of items whose scores are larger than a predetermined score threshold and whose reliability is larger than a predetermined reliability threshold, and a plurality of new items randomly sampled in the search space. New items added to the superior item group D may or may not include user preferences, and are random, increasing the diversity of the item group. The ratio of new items to the dominant item group D is 30% or less, and in this embodiment, the new items account for 10% of the total number of items in the dominant item group D.

新アイテムは検索空間においてランダムにサンプリングされるため、現在のユーザーｕはそれを評価した可能性もあれば、評価していない可能性もある。現在のユーザーｕが新アイテムを評価していない場合、現在のユーザーｕの類似ユーザーｕ’が該新アイテムに対して行ったテキストコメントを該新アイテムに対するユーザーｕの評価として使用し、ユーザーｕの複数の類似ユーザーがいずれも該新アイテムを評価した場合、ユーザーｕとの類似度が最も大きいユーザーの評価を選択する。現在のユーザーｕの類似ユーザーがいずれも該新アイテムを評価していない場合、該新アイテムに対するユーザーｕの評価はランダムに値を割り当てる方式を使用する。 Since the new item is randomly sampled in the search space, the current user u may or may not have rated it. If the current user u has not rated the new item, the text comment made by the current user u's similar user u' to the new item is used as the user u's rating for the new item, and If a plurality of similar users have all evaluated the new item, the evaluation of the user with the greatest degree of similarity to user u is selected. If none of the users similar to the current user u have rated the new item, user u's evaluation of the new item uses a method of randomly assigning values.

ユーザーｕの類似ユーザーは、ユーザーｕと共通のスコアアイテムを有し且つ類似度が所定の類似度閾値よりも大きいユーザーである。ユーザーｕと共通のスコアアイテムを有するユーザーｕ’について、ｕ’≠ｕであり、ｕとｕ’との類似度Ｓｉｍ（ｕ，ｕ’）は、
であり、
ここで、
はユーザーｕ及びｕ’がいずれも評価したアイテムセットを表し、
は
におけるアイテムｘ’に対するユーザーｕのスコアであり、
はｘ’に対するユーザーｕ’のスコアであり、
はユーザーｕが評価したすべてのアイテムの平均スコアであり、
はユーザーｕ’が評価したすべてのアイテムの平均スコアである。 A user similar to user u is a user who has a common score item with user u and whose degree of similarity is greater than a predetermined similarity threshold. For user u' who has the same score item as user u, u'≠u, and the similarity Sim(u, u') between u and u' is
and
here,
represents the item set that both users u and u' evaluated,
teeth
is user u's score for item x' in
is the score of user u' for x',
is the average score of all items rated by user u,
is the average score of all items rated by user u'.

優位アイテムグループＤでセットＳを構成し、Ｓ＝｛（ｕ，ｘ_ｉ，Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ）｝であり、ｘ_ｉ∈Ｄであり、Ｃ_ｉはアイテムｘ_ｉのカテゴリラベルベクトルであり、長さがカテゴリの総数ｎ_ｉであり、Ｃ_ｉにおける各要素ｃ_ｉｊは二値変数であり、ｃ_ｉｊ＝１はアイテムｘ_ｉがｊ種のラベルを有することを表し、ｊ＝１，２，・・・，ｎ_１であり、且つ異なるカテゴリラベル同士は非相互排他的であり、１つのアイテムには複数のカテゴリラベルが同時に存在し得る。Ｔ_ｉはアイテムｘ_ｉに対するユーザーのテキストコメントのベクトル化表現であり、長さがｎ_２であり、Ｇ_ｉはアイテムｘ_ｉの画像特徴のベクトル化表現であり、長さがｎ_３であり、ｉ＝１，２，Ｌ，｜Ｄ｜であり、｜Ｄ｜はＤにおけるアイテム数を表す。 Dominant item group D constitutes set S, S={(u, x _i , C _i , T _i , G _i )}, x _i ∈D, and C _i is the category label vector of item x _i , the length is the total number of categories n _i , each element c _ij in C _i is a binary variable, c _ij =1 represents that the item x _i has j types of labels, and j = 1 , ₂ , . T _i is a vectorized representation of the user's text comment for item x _i and has length n ₂ ; G _i is a vectorized representation of the image features of item x _i and has length n ₃ ; i=1, 2, L, |D|, where |D| represents the number of items in D.

ベクトルＣ_ｉ、Ｔ_ｉ、Ｇ_ｉを組み合わせて長さがΦのベクトルΨ_ｉを形成し、アイテムｘ_ｉの元の決定ベクトルを構成し、その各要素Ψ_ｉｎはアイテムｘ_ｉの決定成分であり、Φ＝ｎ_１＋ｎ_２＋ｎ_３，ｎ＝１，２，・・・，Φである。 Vectors C _i , T _i , G _i are combined to form a vector Ψ _i of length Φ and constitute the original decision vector of item x _i , each element Ψ _in of which is a decision component of item x _i . , Φ=n ₁ +n ₂ +n ₃ , n=1, 2, . . . , Φ.

ステップ３では、注意メカニズムを融合したユーザー嗜好感知モデルを構築し、図２に示すように、該モデルは深層信念ネットワーク（ＤｅｅｐＢｅｌｉｅｆＮｅｔｗｏｒｋ、ＤＢＮ）に基づいて３層の制限付きボルツマンマシン（ＲｅｓｔｒｉｃｔｅｄＢｏｌｔｚｍａｎｎＭａｃｈｉｎｅ、ＲＢＭ）から構成され、１層目の制限付きボルツマンマシンＲＢＭ１の可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、第１組の可視ユニットｖ_１はｎ_１個のユニットを有し、各ユニットは二値変数であり、第２組及び第３組の可視ユニットｖ_２及びｖ_３はそれぞれｎ_２及びｎ_３個のユニットを有し、各ユニットはいずれも実数変数であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目の制限付きボルツマンマシンＲＢＭ２を構成し、ｈ_２は可視層として、隠れ層ｈ_３とともに３層目の制限付きボルツマンマシンＲＢＭ３を構成する。ｈ_１、ｈ_２、及びｈ_３はそれぞれＭ_１、Ｍ_２及びＭ_３個の隠れユニットを有し、各隠れユニットはいずれも実数変数であり、各ＲＢＭについて、隠れユニットの数は可視ユニットの総数の０．８～１．２倍から選択され、本実施例では、０．８倍に設定される。それによって、ｈ_１における隠れユニットの数Ｍ_１は、Ｍ_１＝┌０．８＊Φ┐であり、Φｎ_１＋ｎ_２＋ｎ_３であり、┌ｇ┐は切り上げ演算であり、ｈ_２における隠れユニットの数Ｍ_２は、Ｍ_２＝┌０．８＊Ｍ_１┐であり、ｈ_３における隠れユニットの数Ｍ_３は、Ｍ_３＝┌０．８＊Ｍ_２┐である。注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であり、ここで、｛ｗ_１，ａ_１，ｂ_１｝、｛ｗ_２，ａ_２，ｂ_２｝及び｛ｗ_３，ａ_３，ｂ_３｝はそれぞれＲＢＭ１、ＲＢＭ２、及びＲＢＭ３のモデルパラメータを表し、ｗ_τはτ層目のＲＢＭ可視ユニットと隠れユニットとの間の接続重みを表し、ａ_τ及びｂ_τそれぞれはτ層目のＲＢＭ可視ユニット及び隠れユニットのバイアスを表し、τ∈｛１，２，３｝である。 In step 3, a user preference sensing model that combines the attention mechanism is constructed, and as shown in Figure 2, the model is a three-layer Restricted Boltzmann machine based on a deep belief network (DBN). Machine, RBM), and the visible layer of the first layer restricted Boltzmann machine RBM1 includes a first set of visible units v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , The hidden layer is h ₁ , the first set of visible units v ₁ has n ₁ units, each unit is a binary variable, and the second and third sets of visible units v ₂ and v ₃ have n ₂ and n ₃ units, respectively, each unit is a real variable, h ₁ constitutes the second layer restricted Boltzmann machine RBM2 together with the hidden layer h ₂ as the visible layer, and h ₂ constitutes the third layer restricted Boltzmann machine RBM3 together with the hidden layer _h3 as a visible layer. h ₁ , h ₂ and h ₃ each have M ₁ , M ₂ and M ₃ hidden units, each hidden unit is a real variable, and for each RBM, the number of hidden units is equal to the number of visible units. It is selected from 0.8 to 1.2 times the total number, and in this embodiment, it is set to 0.8 times. Thereby, the number M ₁ of hidden units in h ₁ is M ₁ =┌0.8*Φ┐, Φn ₁ +n ₂ +n ₃ , ┌g┐ is the round-up operation, and the hidden units in h ₂ The number M ₂ of hidden units in h 3 is M ₂ =┌0.8*M ₁ ┐, and the number M ₃ of hidden units in h ₃ is M ₃ =┌0.8*M ₂ ┐. The parameters of the user preference sensing model that combines the attention mechanism are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w ₂ , a ₂ , b ₂ , w ₃ , a ₃ , b ₃ }, where {w ₁ , a ₁ , b ₁ }, {w ₂ , a ₂ , b ₂ } and {w ₃ , a ₃ , b ₃ } are RBM1, RBM2, and RBM3, respectively. represents the model parameters, w _τ represents the connection weight between the τ-th RBM visible unit and hidden unit, a _τ and b _τ represent the bias of the τ-th RBM visible unit and hidden unit, respectively, and τ ∈{1, 2, 3}.

優位アイテムグループＤを使用して、コントラスティブダイバージェンス学習アルゴリズムで注意メカニズムを融合したユーザー嗜好感知モデルにおける１層目の制限付きボルツマンマシンＲＢＭ１を訓練し、そのモデルパラメータθ_１＝｛ｗ_１，ａ_１，ｂ_１｝を取得する。このステップでは、ＲＢＭ１のみを訓練することは、ＲＢＭ１の事前訓練とみなされてもよく、後続のステップでは、ＲＢＭ１、ＲＢＭ２、ＲＢＭ３を再度層ごとに訓練する。アイテムｘ_１の決定ベクトルΨ_ｉはＣ_ｉ、Ｔ_ｉ、及びＧ_ｉを組み合わせてなり、Ｃ_ｉ、Ｔ_ｉ、及びＧ_ｉに含まれるユーザー嗜好情報は異なり、例えば、カテゴリラベルベクトルＣ_ｉの長さｎ_ｉは通常アイテムの画像特徴のベクトル化表現Ｇ_ｉの長さｎ_３未満であり、アイテムの決定ベクトルにおける各成分を平等に取り扱うと、多い情報量を含むデータは少ない嗜好情報を含むデータを埋没させ、このような少ない嗜好情報を含むデータはユーザー嗜好感知モデルの構築に対して有益な補充であり、無視できない。したがって、本発明は、各データタイプで表される情報エントロピーと組み合わせて、重みを利用してユーザー嗜好感知モデルの可視層のニューラルユニットに入力される様々なマルチソース異種データの成分を調整することを考慮することで、様々なタイプのデータがユーザー嗜好感知モデルの構築に効果的に貢献できることを確保する。 Using the dominant item group D, train the first-layer restricted Boltzmann machine RBM1 in the user preference sensing model that combines the attention mechanism with the contrastive divergence learning algorithm, and calculate its model parameters θ ₁ = {w ₁ , a ₁ , b ₁ }. In this step, training only RBM1 may be considered as pre-training of RBM1, and in subsequent steps, RBM1, RBM2, RBM3 are trained layer by layer again. The decision vector Ψ _i of item x ₁ is a combination of C _i , T _i , and G _i , and the user preference information contained in C _i , T _i , and G _i is different, for example, the length of the category label vector C _i s n _i is usually the length _n of the vectorized representation G _i of the item's image feature. If each component in the item's decision vector is treated equally, data containing a large amount of information will be compared to data containing a small amount of preference information. Data containing such little preference information is a useful addition to the construction of a user preference sensing model and cannot be ignored. Therefore, the present invention utilizes weights to adjust the components of various multi-source heterogeneous data input to the neural unit of the visible layer of the user preference sensing model, in combination with the information entropy represented by each data type. ensuring that different types of data can effectively contribute to building user preference sensing models.

１層目のＲＢＭモデルの訓練が完了した後、隠れユニットの状態が与えられると、各可視ユニットの活性化状態は条件付き独立であり、あるアイテムｘ_１のベクトル表現［Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ］は可視層に入力され、その第１組、第２組及び第３組の可視ユニットの活性化確率はそれぞれ
であり、ここで、ａ_１，ｊ、ａ_１，ｋ及びａ_１，ｌはそれぞれ第１組、第２組及び第３組の可視ユニットのバイアスを表し、ａ_１，ｊ、ａ_１，ｋ及びａ_１，ｌを組み合わせてａ_１を形成し、ｊ＝１，２，・・・，ｎ_１，ｋ＝１，２，・・・，ｎ_２，ｌ＝１，２，・・・，ｎ_３である。 After the training of the first layer RBM model is completed, given the state of hidden units, the activation state of each visible unit is conditionally independent, and the vector representation of some item x ₁ [C _i , T _i , G _i ] is input to the visible layer, and the activation probabilities of the first set, second set, and third set of visible units are respectively
, where a _1,j , a _1,k and a _1,l represent the biases of the first, second and third sets of visible units, respectively, and a _1,j , a _1,k and a _1,l are combined to form a ₁ , j=1,2,..., _n1 ,k=1,2,..., _n2 ,l=1,2,..., It is _n3 .

情報エントロピーの式
に従って様々なマルチソース異種データの情報エントロピーを計算し、
アイテムカテゴリラベルの情報エントロピーは、
であり、
テキストコメントベクトルの情報エントロピーは、
であり、
アイテム画像特徴ベクトルの情報エントロピーは、
であり、
ここで、ｃ_ｉｊはアイテムｘ_ｉのカテゴリラベルベクトルＣ_ｉのｊ番目の要素を表し、ｐ_（ｃ_ｉｊ）はＲＢＭ１においてアイテムカテゴリラベルのベクトル表現のｊ番目の要素に対応する可視ユニットの活性化確率を表し、
ｔ_ｉｋはアイテムｘ_ｉに対するユーザーｕのテキストコメントのベクトル化表現Ｔ_ｉのｋ番目の要素を表し、ｐ_（ｔ_ｉｋ）はＲＢＭ１においてユーザーのテキストコメントのベクトル表現のｋ番目の要素に対応する可視ユニットの活性化確率を表し、
ｇ_ｉｌは、アイテムｘ_ｉの画像特徴のベクトル化表現Ｇ_ｉのｌ番目の要素を表し、ｐ_{（ｇｉｌ）}はＲＢＭ１においてアイテム画像特徴のベクトル表現のｌ番目の要素に対応する可視ユニットの活性化確率を表し、
次に、さらに、総情報エントロピーに対する様々な情報エントロピーの割合を重み係数として計算し、
ここで、Ｈ（ｘ_ｉ）＝Ｈ（Ｃ_ｉ）＋Ｈ（Ｔ_ｉ）＋Ｈ（Ｇ_ｉ）であり、
可視ユニットの状態が与えられ、すなわち、ベクトルＣ_ｉ、Ｔ_ｉ、Ｇ_ｉを組み合わせてアイテムｘ_ｉの決定ベクトルΨ_ｉを構成してｖ_１、ｖ_２、ｖ_３における各可視ユニットに入力すると、隠れ層ｈ_１における各隠れユニットの活性化状態は条件付き独立であり、ｍ_１番目の隠れユニットの活性化確率は、
であり、
ここで、ｍ_１＝１，２，・・・，Ｍ_１であり、
はｈ_１におけるｍ_１番目の隠れユニットのバイアスであり、ｖ_１ｊはＲＭＢ１の第１組の可視ユニットｖ_１におけるｊ番目の可視ユニットの状態であり、すなわちＣ_ｉのｊ番目の要素の値であり、ｖ_２ｋはＲＭＢ１の第２組の可視ユニットｖ_２におけるｋ番目の可視ユニットの状態であり、すなわちＴ_iのｋ番目の要素の値であり、ｖ_３ｌはＲＭＢ１の第３組の可視ユニットｖ_３におけるｌ番目の可視ユニットの状態であり、すなわちＧ_ｉのｌ番目の要素の値であり、
はｗ_１における要素値であり、ＲＢＭ１におけるｎ番目の可視ユニットとｍ_１番目の隠れユニットとの間の接続重みを表し、ｎ＝１，２，・・・，Φであり、
は隠れ層ｈ_１にけるｍ_１番目の隠れユニットの状態を表し、
はｓｉｇｍｏｉｄ活性化関数である。 Information entropy formula
Compute the information entropy of various multi-source heterogeneous data according to
The information entropy of the item category label is
and
The information entropy of the text comment vector is
and
The information entropy of the item image feature vector is
and
Here, c _ij represents the j-th element of the category label vector C _i of item x _i , and p ₍ c _ij) is the activation of the visible unit corresponding to the j-th element of the vector representation of the item category label in RBM1. represents the probability,
t _ik represents the kth element of the vectorized representation T _i of user u's text comment for item x _i , and p ₍ t _ik) is the visible vector representation T i of the vectorized representation of user u's text comment in RBM1. represents the activation probability of the unit,
g _il represents the lth element of the vectorized representation G _i of the image features of item x _i , and p _(gil) is the activation of the visible unit corresponding to the lth element of the vectorized representation of the item image features in RBM1. represents the probability,
Next, further calculate the ratio of various information entropies to the total information entropy as a weighting factor,
Here, H(x _i )=H(C _i )+H(T _i )+H(G _i ),
Given the states of the visible units, i.e., combining the vectors C _i , T _i , G _i to form the decision vector Ψ _i of item x _i to input each visible unit in v ₁ , v ₂ , v ₃ , The activation state of each hidden unit in the hidden layer _h1 is conditionally independent, and the activation probability of the _m1th hidden unit is
and
Here, m ₁ =1, 2,..., M ₁ ,
is the bias of the m _-th hidden unit in h ₁ , and v _1j is the state of the j-th visible unit in the first set of visible units v ₁ of RMB1, i.e., the value of the j-th element of C _i . , v _2k is the state of the k-th visible unit in the second set of visible units v ₂ of RMB1, i.e. the value of the k-th element of T _i , and v _3l is the state of the k-th visible unit in the second set of visible units v 2 of RMB1; v is the state of the l-th visible unit at ₃ , i.e. the value of the l-th element of G _i ,
is the element value in w ₁ and represents the connection weight between the nth visible unit and m _1st hidden unit in RBM1, where n = 1, 2, ..., Φ,
represents the state of the m _- th hidden unit in the hidden layer _h1 ,
is the sigmoid activation function.

隠れユニットの状態が与えられると、各可視ユニットの活性化状態も条件付き独立であり、ｎ番目の可視ユニットの活性化確率は、
であり、
ここで、ａ_１，ｎは可視層におけるｎ番目の可視ユニットのバイアスを表す。 Given the state of the hidden unit, the activation state of each visible unit is also conditionally independent, and the activation probability of the nth visible unit is
and
Here, a _1,n represents the bias of the nth visible unit in the visible layer.

ＲＢＭ１の訓練が完了した後、式（５）に従ってアイテムｘ_ｉに対応する各隠れユニットの状態を取得でき、さらに優位アイテムグループＤにおける各アイテムの各決定成分に対するユーザーの嗜好度、すなわち可視層のユニットの活性化確率を、注意重み係数ａｔ_ｎ（ｘ_ｉ）として取得でき、
ここで、
はΨ_ｉがＲＢＭ１の可視層の各可視ユニットの状態とする場合、隠れ層ｈ_ｉにおけるｍ_ｉ番目の隠れユニットの状態を表し、ａｔ_ｎ（ｘ_ｉ）はアイテムｘ_ｉの各決定成分ψ_ｉｎの注意重みを表し、自己適応の特性を反映する。 After the training of RBM1 is completed, the state of each hidden unit corresponding to _item The activation probability of a unit can be obtained as an attention weighting coefficient at _n (x _i ),
here,
When Ψ _i is the state of each visible unit in the visible layer of RBM1, represents the state of the m _i -th hidden unit in the hidden layer h _i , and at _n (x _i ) is each determinant component ψ _in of item x _i represents the attentional weight of , and reflects the characteristics of self-adaptation.

注意重み係数ａｔ_ｎ（ｘ_ｉ）をアイテムｘ_ｉの各決定成分の重み係数として優位アイテムグループＤにおけるアイテムｘ_ｉに対して注意メカニズムに基づく符号化を行い、符号化後、ｘ_ａｔｉで表し、
ここで、ｉ＝１，２，Ｌ，｜Ｄ｜であり、
ｘ_ａｔｉを事前訓練後のＲＢＭ１に入力して可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）を得て、

ここで、
はｘ_ａｔｉのｎ’番目の要素である。 Encoding is performed on the item x _i in the dominant item group D based on the attention mechanism using the attention weight coefficient at _n (x _i ) as the weight coefficient of each determining component of the item x _i , and after encoding, it is expressed as x _ati ,
Here, i=1, 2, L, |D|,
x _ati is input to RBM1 after pre-training to obtain the activation probability V _RBM1 (x _ati ) of the visible unit,

here,
is the n'th element of x _ati .

式（９）は実際に隠れユニットの活性化確率と可視ユニットの活性化確率とを入れ子にし、すなわち、
であり、
取得されたＲＢＭ１モデルにおける可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）及び文献ＬｉＪ，ＷａｎｇＹ，ＭｃａｕｌｅｙＪ．ＴｉｍｅＩｎｔｅｒｖａｌＡｗａｒｅＳｅｌｆ－ＡｔｔｅｎｔｉｏｎｆｏｒＳｅｑｕｅｎｔｉａｌＲｅｃｏｍｍｅｎｄａｔｉｏｎ．Ｉｎ：ＷＳＤＭ ‘２０：ＴｈｅＴｈｉｒｔｅｅｎｔｈＡＣＭＩｎｔｅｒｎａｔｉｏｎａｌＣｏｎｆｅｒｅｎｃｅｏｎＷｅｂＳｅａｒｃｈａｎｄＤａｔａＭｉｎｉｎｇ．ＡＣＭ，２０２０．に提供されていた自己注意メカニズムを利用して、ＲＢＭ１の可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）で自己注意メカニズム演算を行い、アイテム個体のユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）を動的に学習し、
ここで、ｓｏｆｔｍａｘ（）関数はすべての重み係数の和が１であることを確保する。関数ａ（Ｖ_ＲＢＭ１（ｘ_ａｔｉ），ｗ_１）はユーザー嗜好特徴に対するアイテムｘ_ｉの注意重み係数を測定し、計算は、
であり、
ユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）とアイテムｘ_ｉの元の決定ベクトルＣ_ｉ，Ｔ_ｉ，Ｇ_ｉとを組み合わせて、注意メカニズムを融合したアイテム決定ベクトルを生成し、
注意メカニズムを融合したアイテム決定ベクトル
で訓練セットを構成し、ＤＢＮにおけるＲＢＭ１、ＲＢＭ２、ＲＢＭ３モデルを層ごとに訓練し、まず、ＲＢＭ１を訓練し、パラメータ｛ｗ_１，ａ_１，ｂ_１｝を得て、ｂ_１をＲＢＭ２におけるａ_２に伝達した上で、ＲＢＭ２を訓練し、最適化パラメータ｛ｗ_２，ａ_２，ｂ_２｝を得て、ｂ_２をＲＢＭ３におけるａ_３に伝達した上でＲＢＭ３を訓練し、最適化パラメータ｛ｗ_３，ａ_３，ｂ_３｝を取得し、それによってＤＢＮネットワークにおける３層のＲＢＭモデルの相互影響、相互関連付けを実現してネットワーク全体を形成する。訓練が完了した後、注意メカニズムを融合したＤＢＮベースのユーザー嗜好感知モデル及びその最適化モデルパラメータθを取得する。 Equation (9) actually nests the activation probabilities of hidden units and visible units, i.e.,
and
The activation probability V _RBM1 (x _ati ) of the visible unit in the obtained RBM1 model and the literature Li J, Wang Y, Mcauley J. Time Interval Aware Self-Attention for Sequential Recommendation. In: WSDM '20: The Thirteenth ACM International Conference on Web Search and Data Mining. ACM, 2020. Using _the _self _- attention mechanism provided in learn from
Here, the softmax() function ensures that the sum of all weighting factors is one. The function a(V _RBM1 (x _ati ), w ₁ ) measures the attention weighting coefficient of item x _i with respect to user preference features, and the calculation is
and
Combining the user preference attention weight vector A(x _ati ) and the original decision vector C _i , T _i , G _i of item x _i to generate an item decision vector that fuses the attention mechanism;
Item decision vector fused with attention mechanism
Construct a training set, and train RBM1, RBM2, and RBM3 models in DBN layer by layer. First, train RBM1, obtain parameters {w ₁ , a ₁ , b ₁ }, and set b ₁ to a in RBM2. ₂ , train RBM2, obtain the optimization parameters {w ₂ , a ₂ , b ₂ }, transfer b ₂ to a ₃ in RBM3, train RBM3, and obtain the optimization parameters { w ₃ , a ₃ , b ₃ }, thereby realizing the mutual influence and mutual association of the three-layer RBM model in the DBN network to form the entire network. After the training is completed, the DBN-based user preference sensing model fused with the attention mechanism and its optimization model parameters θ are obtained.

ここでのＤＢＮのモデル訓練方法は改良された注意メカニズムベースのＤＢＮモデル訓練方法であり、自己適応重み情報を使用してユーザー嗜好特徴を抽出し、注意を重要な特徴に集中させ、実際の応用シーンにおける各アイテムの異なるタイプの属性決定成分のユーザー嗜好特徴への影響をより適切に表現し、ユーザー嗜好特徴をより細かく表現することを目的とする。 The DBN model training method here is an improved attention mechanism-based DBN model training method, which uses self-adaptive weight information to extract user preference features, focus attention on important features, and improve practical application. The purpose is to more appropriately represent the influence of different types of attribute determining components of each item in a scene on user preference features, and to express user preference features in more detail.

ステップ４では、訓練済みの注意メカニズムを融合したＤＢＮベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表し、その計算は、
まず、優位アイテムグループＤに基づいてユーザー嗜好ベースの確率分布モデルＰ（ｘ）を計算し、
ここでｐ（ｘ）はΦ次元ベクトルであり、そのｎ番目の要素Ｐ（ψ_ｎ）はユーザー嗜好アイテムのｎ番目の決定成分の活性化確率であり、Ｐ（ψ_ｎ）に対して下界制約を行い、制約後の値はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率Ｐ（ψ_ｎ）であり、すなわち、
であり、
ここで、εは所定の下界閾値であり、本実施例では、ε＝０．１であり、すなわち、式（１８）に従って算出される活性化確率が０．１未満の決定成分に対して、その活性化確率値を０．１に設定し、該制約は決定成分の活性化確率が小さい場合、一定の確率値で該決定成分をランダムにサンプリングすることを考慮することで、生成される個体群の多様性を高め、進化最適化アルゴリズムの早期収束によって最適解を見逃さないことを防止する。 In step 4, a user preference-based distribution estimation probability model P(x) is constructed based on the DBN-based user preference sensing model that combines the trained attention mechanism and its model parameters;
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item represents the preference probability of, and its calculation is,
First, a user preference-based probability distribution model P(x) is calculated based on the dominant item group D,
Here, p(x) is a Φ-dimensional vector, its nth element P(ψ _n ) is the activation probability of the nth determining component of the user preference item, and a lower bound constraint is applied to P(ψ _n ). , and the value after the constraint is the user's preference probability P(ψ _n ) for the nth determining component of the item, that is,
and
Here, ε is a predetermined lower bound threshold, and in this example, ε=0.1, that is, for a determining component whose activation probability calculated according to equation (18) is less than 0.1, The activation probability value is set to 0.1, and the constraint is that when the activation probability of a determining component is small, the determined component is randomly sampled with a certain probability value. Increase the diversity of the group and prevent the optimal solution from being missed by early convergence of the evolutionary optimization algorithm.

ステップ５では、個体群のサイズＮを設定し、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズム（ＥｓｔｉｍａｔｉｏｎｏｆＤｉｓｔｒｉｂｕｔｉｏｎＡｌｇｏｒｉｔｈｍｓ、ＥＤＡ）でＮ個の新個体を生成し、各個体は１つのアイテムであり、ｖ番目の新個体のカテゴリラベルベクトル
（ｖ＝１，２，・・・，Ｎ）の設定ステップは、
（５．１）ｖ＝１とし、
（５．２）［０，１］の乱数ｚを生成し、ｚ≦Ｐ（ψ_ｊ＝１）である場合、ｖ番目の新個体のカテゴリラベルベクトル
のｊ番目の要素は１であり、そうでない場合、０であり、
（５．３）ｖに１を足し、ｖ＞Ｎになるまでステップ（５．２）を繰り返すことである。 In step 5, the population size N is set, and N new individuals are generated using a distribution estimation algorithm (EDA) using the user preference-based distribution estimation probability model P(x). , each individual is one item, and the category label vector of the vth new individual is
The setting step for (v=1, 2,..., N) is
(5.1) Let v=1,
(5.2) Generate a random number z of [0, 1], and if z≦P (ψ _j =1), the category label vector of the vth new individual
The jth element of is 1, otherwise 0,
(5.3) Add 1 to v and repeat step (5.2) until v>N.

ステップ６では、検索空間においてＮ個の新個体のカテゴリラベルベクトル
との類似度が最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成し、本実施例では、ユークリッド距離を使用して類似度計算を行い、すなわち、２つのベクトル間のユークリッド距離が小さいほど、両者の類似度が高い。 In step 6, category label vectors of N new individuals are added in the search space.
In this example, the Euclidean distance ^between two vectors is used to calculate the similarity. The smaller the distance, the higher the degree of similarity between the two.

ステップ７では、推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値を計算し、
本発明では、エネルギー関数を使用してアイテムの適応値を計算し、推薦対象アイテムセットＳ^ｕにおけるアイテムｘ^＊について、その適応値
の計算は、
であり、
ここで、
及び
はそれぞれ推薦対象アイテムセットＳ^ｕにおけるアイテムのエネルギー関数の最大値及び最小値を表し、
はアイテムｘ^＊のエネルギー関数（ｘ^＊∈Ｓ^ｕ）であり、その計算は、
であり、
ここで、ａ_１，ｎはＲＢＭ１の可視層におけるｎ番目の可視ユニットのバイアスを表し、
はアイテムｘ^＊のｎ番目の決定成分であり、
はｈ_１におけるｍ_１番目の隠れユニットのバイアスであり、
はｗ_１における要素値であり、ＲＢＭ１におけるｎ番目の可視ユニットｍ_１番目の隠れユニットとの間の接続重みを表す。 In step 7, the adaptation value of each item in the recommendation target item set S ^u is calculated,
In the present invention, the adaptive value of the item is calculated using the energy function, and the adaptive value of the item x ^* in the recommendation target item set S ^u is
The calculation of
and
here,
as well as
respectively represent the maximum value and minimum value of the energy function of the item in the recommendation target item set S ^u ,
is the energy function (x ^* ∈ S ^u ) of item x ^* , and its calculation is
and
Here, a _1,n represents the bias of the nth visible unit in the visible layer of RBM1,
is the nth determining component of item x ^* ,
is the bias of the m _- th hidden unit in h ₁ ,
is the element value in w ₁ and represents the connection weight between the nth visible unit m and the _m1th hidden unit in RBM1.

ステップ８では、Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎである。 In step 8, TopN items with the highest adaptation values in S ^u are selected as the search results, and TopN<N.

マルチソース異種ユーザー生成コンテンツの動的進化の特徴及びユーザー趣味嗜好の不確実性によって、パーソナライズド進化検索プロセスの早期段階では、優位アイテムグループＤに含まれるユーザー嗜好情報が不十分であり、したがって、この訓練されたユーザー嗜好感知モデルに基づいて抽出されるユーザー嗜好特徴は大まかである。これに伴ってユーザーの対話型検索プロセスの推進及びユーザー行動の動的進化に伴い、現在のユーザーの最新の評価データに応じて優位アイテムグループＤを更新し、注意メカニズムを融合したユーザー嗜好感知モデルを再度訓練し、抽出されたユーザー嗜好特徴を動的に更新し、ユーザー嗜好変化をタイムリーに追跡すると同時に、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を更新することで、パーソナライズド進化検索の進行方向を効果的にガイドし、ユーザーができるだけ早く満足の行く解を検索することを支援し、複雑な環境におけるパーソナライズド検索タスクをスムーズに完了する。 Due to the dynamic evolution characteristics of multi-source heterogeneous user-generated content and the uncertainty of user tastes and preferences, the user preference information contained in the dominant item group D is insufficient at the early stage of the personalized evolutionary search process, and therefore, The user preference features extracted based on this trained user preference sensing model are rough. Along with this, along with the promotion of the user's interactive search process and the dynamic evolution of user behavior, the dominant item group D is updated according to the latest evaluation data of the current user, and a user preference sensing model that combines an attention mechanism is developed. By re-training and dynamically updating the extracted user preference features to track changes in user preferences in a timely manner, and at the same time updating the user preference-based distribution estimation probability model P(x), personalized evolution can be achieved. It effectively guides the search direction, helps users search for satisfactory solutions as quickly as possible, and smoothly completes personalized search tasks in complex environments.

本実施例は上記パーソナライズド検索方法を実現する注意メカニズムを融合したパーソナライズド検索システムをさらに開示し、図３に示すように、
ユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含むユーザーｕ生成コンテンツを収集して取得し、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得するユーザー生成コンテンツ取得モジュール１と、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムでユーザー嗜好を含む優位アイテムグループＤを構成する優位アイテムグループ構築モジュール２と、
ステップ３に従って注意メカニズムを融合したユーザー嗜好感知モデルを構築して訓練するユーザー嗜好感知モデル構築訓練モジュールであって、前記モデルは深層信念ネットワークに基づいて３層の制限付きボルツマンマシンから構成され、１層目の制限付きボルツマンマシンの可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目の制限付きボルツマンマシンを構成し、ｈ_２は可視層として、隠れ層ｈ_３とともに３層目の制限付きボルツマンマシンを構成し、前記注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であるユーザー嗜好感知モデル構築訓練モジュール３と、
訓練済みの注意メカニズムを融合した深層信念ネットワークベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表すユーザー嗜好ベースの分布推定確率モデル構築モジュール４と、
ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズムを採用して、それぞれ１つのアイテムであるＮ（Ｎは所定の個体群のサイズである）個の新個体を生成し、各新個体のカテゴリラベルベクトルを設定する個体群生成モジュール５と、
検索空間においてＮ個の新個体のカテゴリラベルベクトル
との類似度が最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成する推薦対象アイテムセット構築モジュール６と、
ステップ７に従って推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値を計算する適応値計算モジュール７と、
Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎである検索結果選択モジュール８と、を含む。 This embodiment further discloses a personalized search system that combines an attention mechanism to realize the above-mentioned personalized search method, and as shown in FIG. 3,
Collect and retrieve user u-generated content, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings, and a user-generated content acquisition module 1 that vectorizes comments, performs feature extraction on item images, and acquires eigenvectors;
a superior item group construction module 2 that configures a superior item group D including user preferences with items whose user score is greater than a predetermined score threshold and whose reliability is greater than a predetermined reliability threshold;
A user preference sensing model building and training module for building and training a user preference sensing model fused with an attention mechanism according to step 3, the model comprising three layers of restricted Boltzmann machines based on a deep belief network; The visible layer of the Boltzmann machine with limited layers includes a first set of visible units v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , and the hidden layers are h ₁ and h ₁ constitutes _a second-layer restricted Boltzmann machine with hidden layer _h2 as a visible layer, and constitutes a third-layer restricted Boltzmann machine with hidden layer _h3 as a visible layer, and the above attention mechanism is The parameters of the fused user preference sensing model are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w ₂ , a ₂ , b ₂ , w ₃ , a ₃ , b ₃ } User preference sensing model construction training module 3, which is
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item a user preference-based distribution estimation probability model construction module 4 representing preference probabilities;
Using the user preference-based distribution estimation probability model P(x), a distribution estimation algorithm is adopted to generate N (N is the size of a given population) new individuals, each of which is one item. and a population generation module 5 that sets a category label vector for each new individual;
Category label vector of N new individuals in search space
a recommendation target item set construction module 6 that selects N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
an adaptation value calculation module 7 that calculates the adaptation value of each item in the recommendation target item set S ^u according to step 7;
A search result selection module 8 that selects TopN items with the highest adaptation value in ^Su as search results, and where TopN<N.

（付記）
（付記１）
注意メカニズムを融合したパーソナライズド検索方法であって、
ユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含むユーザーｕ生成コンテンツを収集して取得し、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得するステップ１と、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムでユーザー嗜好を含む優位アイテムグループＤを構成するステップ２であって、ＤにおけるアイテムはセットＳを構成し、Ｓ＝｛（ｕ，ｘ_ｉ，Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ）｝であり、ここで、ｘ_ｉ∈Ｄであり、Ｃ_ｉはアイテムｘ_ｉのカテゴリラベルベクトルであり、Ｔ_ｉはアイテムｘ_ｉに対するユーザーのテキストコメントのベクトル化表現であり、Ｇ_ｉはアイテムｘ_ｉの画像特徴のベクトル化表現であり、ｉ＝１，２，・・・，｜Ｄ｜であり、｜Ｄ｜はＤにおけるアイテム数を表すステップ２と、
注意メカニズムを融合したユーザー嗜好感知モデルを構築し、前記モデルは深層信念ネットワークに基づいて３層の制限付きボルツマンマシンから構成され、１層目の制限付きボルツマンマシンの可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目の制限付きボルツマンマシンを構成し、ｈ_２は可視層として、隠れ層ｈ_３とともに３層目の制限付きボルツマンマシンを構成し、前記注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であり、
優位アイテムグループＤを利用して、コントラスティブダイバージェンス学習アルゴリズムで注意メカニズムを融合したユーザー嗜好感知モデルにおける１層目の制限付きボルツマンマシンを訓練し、そのモデルパラメータθ１＝｛ｗ_１，ａ_１，ｂ_１｝を取得し、
１層目のＲＢＭモデルの訓練が完了した後、隠れユニットの状態が与えられると、各可視ユニットの活性化状態は条件付き独立であり、あるアイテムｘｉのベクトル表現［Ｃ_ｉ，Ｔ_ｉ，Ｇ_ｉ］は可視層に入力され、その第１組、第２組及び第３組の可視ユニットの活性化確率はそれぞれ
であり、
ここで、ａ_１，_ｊ、ａ_１，_ｋ及びａ_１，_ｌはそれぞれ第１組、第２組及び第３組の可視ユニットのバイアスを表し、
様々なマルチソース異種データの情報エントロピーを計算し、アイテムカテゴリラベルの情報エントロピーは、
であり、
テキストコメントベクトルの情報エントロピーは、
であり、
アイテム画像特徴ベクトルの情報エントロピーは、
であり、
ここで、ｃ_ｉｊはアイテムｘ_ｉのカテゴリラベルベクトルＣ_ｉのｊ番目の要素を表し、ｐ_{（ｃｉｊ）}はＲＢＭ１においてアイテムカテゴリラベルのベクトル表現のｊ番目の要素に対応する可視ユニットの活性化確率を表し、
ｔ_ｉｋはアイテムｘ_ｉに対するユーザーｕのテキストコメントのベクトル化表現Ｔ_ｉのｋ番目の要素を表し、ｐ（ｔ_ｉｋ）はＲＢＭ１においてユーザーのテキストコメントのベクトル表現のｋ番目の要素に対応する可視ユニットの活性化確率を表し、
ｇ_ｉｌは、アイテムｘ_ｉの画像特徴のベクトル化表現Ｇ_ｉのｌ番目の要素を表し、ｐ_{（ｇｉｌ）}はＲＢＭ１においてアイテム画像特徴のベクトル表現のｌ番目の要素に対応する可視ユニットの活性化確率を表し、
次に、総情報エントロピーに対する様々な情報エントロピーの割合を重み係数として計算し、
ここで、Ｈ（ｘ_ｉ）＝Ｈ（Ｃ_ｉ）＋Ｈ（Ｔ_ｉ）＋Ｈ（Ｇ_ｉ）であり、
ベクトルＣ_ｉ、Ｔ_ｉ、Ｇ_ｉを組み合わせてアイテムｘ_ｉの決定ベクトルΨ_ｉを構成してｖ_１、ｖ_２、ｖ_３における各可視ユニットに入力すると、隠れ層ｈ_１における各隠れユニットの活性化状態は条件付き独立であり、ｍ_１番目の隠れユニットの活性化確率は、
であり、
ここで、ｍ_１＝１，２，・・・，Ｍ１であり、
はｈ_１におけるｍ_１番目の隠れユニットのバイアスであり、ｖ_１ｊはＲＭＢ１の第１組の可視ユニットｖ_１におけるｊ番目の可視ユニットの状態であり、ｖ_２ｋはＲＭＢ１の第２組の可視ユニットｖ_２におけるｋ番目の可視ユニットの状態であり、ｖ_３ｌはＲＭＢ１の第３組の可視ユニットｖ_３におけるｌ番目の可視ユニットの状態であり、
はｗ_１における要素値であり、ＲＢＭ１におけるｎ番目の可視ユニットとｍ_１番目の隠れユニットとの間の接続重みを表し、ｎ＝１，２，・・・，Φであり、
は隠れ層ｈ_１におけるｍ_１番目の隠れユニットの状態を表し、
はｓｉｇｍｏｉｄ活性化関数であり、
ＲＢＭ１の訓練が完了した後、式（９）に従ってアイテムｘ_ｉに対応する各隠れユニットの状態を取得し、さらに優位アイテムグループＤにおける各アイテムの各決定成分に対するユーザーの嗜好度、すなわち可視層のユニットの活性化確率を、注意重み係数ａｔ_ｎ（ｘ_ｉ）として取得し、
ここで、
はΨ_ｉがＲＢＭ１の可視層の各可視ユニットの状態とする場合、隠れ層ｈ_ｉにおけるｍ_ｉ番目の隠れユニットの状態を表し、ａｔ_ｎ（ｘ_ｉ）はアイテムｘ_ｉの各決定成分ψ_ｉｎの注意重みを表し、
注意重み係数ａｔ_ｎ（ｘ_ｉ）をアイテムｘ_ｉの各決定成分の重み係数として優位アイテムグループＤにおけるアイテムｘ_ｉに対して注意メカニズムに基づく符号化を行い、符号化後、ｘ_ａｔｉで表し、
ｘ_ａｔｉを事前訓練後のＲＢＭ１に入力して可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）を得て、
ここで、
はｘ_ａｔｉのｎ’番目の要素であり、
ＲＢＭ１の可視ユニットの活性化確率Ｖ_ＲＢＭ１（ｘ_ａｔｉ）で自己注意メカニズム演算を行い、アイテム個体のユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）を動的に学習し、
ここで、ｓｏｆｔｍａｘ（）関数はすべての重み係数の和が１であることを確保し、関数ａ（Ｖ_ＲＢＭ１（ｘ_ａｔｉ），ｗ_１）はユーザー嗜好特徴に対するアイテムｘ_ｉの注意重み係数を測定し、計算は、
であり、
ユーザー嗜好注意重みベクトルＡ（ｘ_ａｔｉ）とアイテムｘ_ｉの元の決定ベクトルＣ_ｉ，Ｔ_ｉ，Ｇ_ｉとを組み合わせて注意メカニズムを融合したアイテム決定ベクトルを生成し、
注意メカニズムを融合したアイテム決定ベクトル
で訓練セットを構成し、ＤＢＮにおけるＲＢＭ１、ＲＢＭ２、ＲＢＭ３モデルを層ごとに訓練し、訓練が完了した後、注意メカニズムを融合した深層信念ネットワークベースのユーザー嗜好感知モデル及びその最適化モデルパラメータθを取得するステップ３と、
訓練済みの注意メカニズムを融合した深層信念ネットワークベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表すステップ４と、
個体群のサイズＮを設定し、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズムでＮ個の新個体を生成し、各個体は１つのアイテムであり、ｖ番目の新個体のカテゴリラベルベクトル
（ｖ＝１，２，・・・，Ｎ）の設定ステップは、
（５．１）ｖ＝１とし、
（５．２）［０，１］の乱数ｚを生成し、ｚ≦Ｐ（ψ_ｊ＝１）である場合、ｖ番目の新個体のカテゴリラベルベクトル
のｊ番目の要素は１であり、そうでない場合、０であり、
（５．３）ｖに１を足し、ｖ＞Ｎになるまでステップ（５．２）を繰り返すことであるステップ５と、
検索空間においてＮ個の新個体のカテゴリラベルベクトル
との類似度が最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成するステップ６と、
推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値
を計算し、
ここで、
及び
はそれぞれ推薦対象アイテムセットＳ^ｕにおけるアイテムのエネルギー関数の最大値及び最小値を表し、
はアイテムｘ^＊のエネルギー関数であり、ｘ^＊∈Ｓ^ｕであり、その計算は、
であり、
ここで、
はアイテムｘ^＊のｎ番目の決定成分であるステップ７と、
Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎであるステップ８と、を含み、
ユーザーの対話型検索プロセスの推進及びユーザー行動の動的進化に伴い、現在のユーザーの最新の評価データに応じて優位アイテムグループＤを更新し、注意メカニズムを融合したユーザー嗜好感知モデルを再度訓練し、抽出されたユーザー嗜好特徴を動的に更新すると同時に、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を更新する、
ことを特徴とする注意メカニズムを融合したパーソナライズド検索方法。 (Additional note)
(Additional note 1)
A personalized search method that combines an attention mechanism,
Collect and retrieve user u-generated content, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings, and Step 1 of vectorizing comments, extracting features from item images, and obtaining eigenvectors;
Step 2 of configuring a dominant item group D including user preferences with items having a user score larger than a predetermined score threshold and a reliability larger than a predetermined confidence threshold, the items in D forming a set S; , S={(u, x _i , C _i , T _i , G _i )}, where x _i ∈D, C _i is the category label vector of item x _i , and T _i is the item is a vectorized representation of the user's text comment for x _i , G _i is a vectorized representation of the image features of item x _i , i = 1, 2, ..., |D|, and |D| Step 2 representing the number of items in D;
A user preference sensing model that combines attention mechanisms is constructed, and the model is composed of three layers of restricted Boltzmann machines based on a deep belief network, and the visible layer of the first layer of restricted Boltzmann machines is the first set of visible layers. It includes a unit v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is the visible layer with the restriction of the second layer together with the hidden layer h ₂ A Boltzmann machine is constructed, _h2 is a visible layer, and a hidden layer _h3 constitutes a third layer restricted Boltzmann machine, and the parameters of the user preference sensing model that combines the above attention mechanism are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w ₂ , a ₂ , b ₂ , w ₃ , a ₃ , b ₃ },
Using the dominant item group D, train the first-layer restricted Boltzmann machine in the user preference sensing model that combines the attention mechanism with the contrastive divergence learning algorithm, and set the model parameters θ1={w ₁ , a ₁ , b ₁ },
After the training of the first layer RBM model is completed, given the state of hidden units, the activation state of each visible unit is conditionally independent, and the vector representation of some item xi [C _i , T _i , G _i ] is input to the visible layer, and the activation probabilities of the first, second, and third visible units are respectively
and
Here, a ₁ , _j , a ₁ , _k and a ₁ , _l represent the biases of the first set, second set and third set of visible units, respectively;
The information entropy of various multi-source heterogeneous data is calculated, and the information entropy of item category label is
and
The information entropy of the text comment vector is
and
The information entropy of the item image feature vector is
and
Here, c _ij represents the j-th element of the category label vector C _i of item x _i , and p _(cij) is the activation probability of the visible unit corresponding to the j-th element of the vector representation of the item category label in RBM1. represents,
t _ik represents the kth element of the vectorized representation T _i of user u's text comment for item x _i , and p(t _ik ) is the visible vector representation T i of the vectorized representation of user u's text comment in RBM1. represents the activation probability of the unit,
g _il represents the lth element of the vectorized representation G _i of the image features of item x _i , and p _(gil) is the activation of the visible unit corresponding to the lth element of the vectorized representation of the item image features in RBM1. represents the probability,
Next, calculate the ratio of various information entropies to the total information entropy as a weighting factor,
Here, H(x _i )=H(C _i )+H(T _i )+H(G _i ),
When vectors C _i , T _i , G _i are combined to form the decision vector Ψ _i of item x _i and inputted to each visible unit in v ₁ , v ₂ , v ₃ , the activation of each hidden unit in hidden layer h ₁ is The activation states are conditionally independent, and the activation probability of the m _- th hidden unit is
and
Here, m ₁ =1, 2,...,M1,
is the bias of the m _-th hidden unit in h ₁ , v _1j is the state of the j-th visible unit in the first set of visible units v ₁ of RMB1, and v _2k is the state of the j-th visible unit in the second set of visible units of RMB1. v ₂ is the state of the k-th visible unit, v _3l is the state of the l-th visible unit in the third set of visible units v ₃ of RMB1,
is the element value in w ₁ and represents the connection weight between the nth visible unit and m _1st hidden unit in RBM1, where n = 1, 2, ..., Φ,
represents the state of the m _- th hidden unit in the hidden layer _h1 ,
is the sigmoid activation function,
After the training of RBM1 is completed, the state of each hidden unit corresponding to item x _i is obtained according to equation (9), and the user's preference degree for each determining component of each item in dominant item group D, that is, the state of the visible layer Obtain the activation probability of the unit as an attention weighting coefficient at _n (x _i ),
here,
If Ψ _i is the state of each visible unit in the visible layer of RBM1, represents the state of the m _i -th hidden unit in the hidden layer h _i , and at _n (x _i ) is each determinant component ψ _in of item x _i represents the attention weight of
Encoding is performed on the item x _i in the dominant item group D based on the attention mechanism using the attention weight coefficient at _n (x _i ) as the weight coefficient of each determining component of the item x _i , and after encoding, it is expressed as x _ati ,
x _ati is input to RBM1 after pre-training to obtain the activation probability V _RBM1 (x _ati ) of the visible unit,
here,
is the n'th element of x _ati ,
Perform a self-attention mechanism calculation using the activation probability V _RBM1 (x _ati ) of the visible unit of RBM1, dynamically learn the user preference attention weight vector A (x _ati ) for each item,
Here, the softmax() function ensures that the sum of all weighting coefficients is 1, and the function a(V _RBM1 (x _ati ), w ₁ ) measures the attention weighting coefficient of item x _i with respect to user preference features. And the calculation is
and
Generating an item decision vector that combines the attention mechanism by combining the user preference attention weight vector A(x _ati ) and the original decision vectors C _i , T _i , G _i of item x _i ;
Item decision vector fused with attention mechanism
Construct a training set, train the RBM1, RBM2, and RBM3 models in DBN layer by layer, and after the training is completed, create a deep belief network-based user preference sensing model that integrates the attention mechanism and its optimized model parameters θ. Step 3 to obtain
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item Step 4 representing the preference probability of
Set the population size N, and use the user preference-based distribution estimation probability model P(x) to generate N new individuals with the distribution estimation algorithm, each individual being one item, and the v-th Category label vector of new individual
The setting step for (v=1, 2,...,N) is
(5.1) Let v=1,
(5.2) Generate a random number z of [0, 1], and if z≦P (ψ _j =1), the category label vector of the vth new individual
The jth element of is 1, otherwise 0,
(5.3) Step 5, which is to add 1 to v and repeat step (5.2) until v>N;
Category label vector of N new individuals in search space
a step 6 of selecting N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
Adaptation value of each item in the recommendation target item set S ^u
Calculate,
here,
as well as
respectively represent the maximum value and minimum value of the energy function of the item in the recommendation target item set S ^u ,
is the energy function of item x ^* , x ^* ∈S ^u , and its calculation is
and
here,
is the nth determining component of item x ^* , step 7;
Selecting the top N items with the highest adaptation value in S ^u as the search results, TopN<N, step 8;
With the promotion of users' interactive search process and the dynamic evolution of user behavior, we will update the dominant item group D according to the latest evaluation data of current users, and retrain the user preference sensing model that combines the attention mechanism. , dynamically updating the extracted user preference features and simultaneously updating the user preference-based distribution estimation probability model P(x);
A personalized search method that combines an attention mechanism characterized by:

（付記２）
前記優位アイテムグループＤには割合がηの新アイテムがさらに含まれ、前記新アイテムは検索空間においてランダムにサンプリングすることによって得られる、
ことを特徴とする付記１に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Additional note 2)
The superior item group D further includes new items with a ratio of η, and the new items are obtained by randomly sampling in the search space.
A personalized search method that combines the attention mechanism described in Appendix 1, characterized in that:

（付記３）
現在のユーザーｕが新アイテムを評価していない場合、現在のユーザーｕの類似ユーザーｕ’が該新アイテムに対して行ったテキストコメントを該新アイテムに対するユーザーｕの評価として使用し、ユーザーｕの複数の類似ユーザーがいずれも該新アイテムを評価した場合、ユーザーｕとの類似度が最も大きいユーザーの評価を選択し、現在のユーザーｕの類似ユーザーがいずれも該新アイテムを評価していない場合、該新アイテムに対するユーザーｕの評価はランダムに値を割り当てる方式を使用する、
ことを特徴とする付記２に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 3)
If the current user u has not rated the new item, the text comment made by the current user u's similar user u' to the new item is used as user u's rating for the new item, and If multiple similar users have all rated the new item, select the rating of the user with the highest degree of similarity to user u, and if none of the current user u's similar users have rated the new item. , user u's evaluation of the new item uses a method of randomly assigning values;
A personalized search method that combines the attention mechanism described in Appendix 2, characterized in that:

（付記４）
ユーザーｕの類似ユーザーは、ユーザーｕと共通のスコアアイテムを有し且つ類似度が所定の類似度閾値よりも大きいユーザーであり、ユーザーｕと共通のスコアアイテムを有するユーザーｕ’について、ｕ’≠ｕであり、ｕとｕ’との類似度Ｓｉｍ（ｕ，ｕ’）は、
であり、
ここで、
はユーザーｕ及びｕ’がいずれも評価したアイテムセットを表し、
は
におけるアイテムｘ’に対するユーザーｕのスコアであり、
はｘ’に対するユーザーｕ’のスコアであり、
はユーザーｕが評価したすべてのアイテムの平均スコアであり、
はユーザーｕ’が評価したすべてのアイテムの平均スコアである、
ことを特徴とする付記３に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Additional note 4)
A similar user to user u is a user who has a common score item with user u and whose degree of similarity is greater than a predetermined similarity threshold, and for user u' who has a common score item with user u, u'≠ u, and the similarity Sim(u, u') between u and u' is
and
here,
represents the item set that both users u and u' evaluated,
teeth
is user u's score for item x' in
is the score of user u' for x',
is the average score of all items rated by user u,
is the average score of all items rated by user u',
A personalized search method that combines the attention mechanism described in Appendix 3, characterized in that:

（付記５）
ＤＢＮにおけるＲＢＭ１、ＲＢＭ２、ＲＢＭ３モデルを層ごとに訓練する前記ステップは、具体的には、
まず、ＲＢＭ１を訓練し、パラメータ｛ｗ_１，ａ_１，ｂ_１｝を得て、ｂ_１をＲＢＭ２におけるａ_２に伝達した上でＲＢＭ２を訓練し、最適化パラメータ｛ｗ_２，ａ_２，ｂ_２｝を取得し、ｂ_２をＲＢＭ３におけるａ_３に伝達した上でＲＢＭ３を訓練し、最適化パラメータ｛ｗ_３，ａ_３，ｂ_３｝を取得することである、
ことを特徴とする付記１に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 5)
Specifically, the step of training the RBM1, RBM2, and RBM3 models in the DBN layer by layer includes:
First, train RBM1, obtain the parameters {w ₁ , a ₁ , b ₁ }, transfer b ₁ to a ₂ in RBM2, train RBM2, and obtain the optimization parameters {w ₂ , a ₂ , b ₂ }, transmit b ₂ to a ₃ in RBM3, train RBM3, and obtain optimization parameters {w ₃ , a ₃ , b ₃ }.
A personalized search method that combines the attention mechanism described in Appendix 1, characterized in that:

（付記６）
アイテムのｎ番目の決定成分に対する前記ユーザーの嗜好確率Ｐ（ψ_ｎ）の計算は、
まず、優位アイテムグループＤに基づいてユーザー嗜好ベースの確率分布モデルｐ（ｘ）を計算し、
ｐ（ｘ）はΦ次元ベクトルであり、そのｎ番目の要素Ｐ（ψ_ｎ）はユーザー嗜好アイテムのｎ番目の決定成分の活性化確率であり、Ｐ（ψ_ｎ）に対して下界制約を行い、制約後の値はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率Ｐ（ψ_ｎ）であり、すなわち、
であり、
εは所定の下界閾値である、
ことを特徴とする付記１に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 6)
The calculation of the user's preference probability P(ψ _n ) for the nth determining component of the item is
First, a user preference-based probability distribution model p(x) is calculated based on the dominant item group D,
p(x) is a Φ-dimensional vector, its nth element P(ψ _n ) is the activation probability of the nth determining component of the user preference item, and a lower bound constraint is applied to P(ψ _n ). , the value after the constraint is the user's preference probability P(ψ _n ) for the nth determining component of the item, i.e.
and
ε is a predetermined lower bound threshold;
A personalized search method that combines the attention mechanism described in Appendix 1, characterized in that:

（付記７）
前記３層の制限付きボルツマンマシンでは、各層の制限付きボルツマンマシンにおける隠れ層の隠れユニットの数は可視層における可視ユニットの数の０．８～１．２倍である、
ことを特徴とする付記１に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 7)
In the three-layer restricted Boltzmann machine, the number of hidden units in the hidden layer in each layer of the restricted Boltzmann machine is 0.8 to 1.2 times the number of visible units in the visible layer.
A personalized search method that combines the attention mechanism described in Appendix 1, characterized in that:

（付記８）
優位アイテムグループＤに対する新アイテムの割合はη＜３０％である、
ことを特徴とする付記２に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 8)
The ratio of new items to superior item group D is η<30%,
A personalized search method that combines the attention mechanism described in Appendix 2, characterized by the following.

（付記９）
前記ステップ６はユークリッド距離を使用して類似度計算を行い、すなわち、２つのベクトル間のユークリッド距離が小さいほど、両者の類似度が高い、
ことを特徴とする付記１に記載の注意メカニズムを融合したパーソナライズド検索方法。 (Appendix 9)
The step 6 calculates the similarity using Euclidean distance, that is, the smaller the Euclidean distance between two vectors, the higher the similarity between them.
A personalized search method that combines the attention mechanism described in Appendix 1, characterized in that:

（付記１０）
注意メカニズムを融合したパーソナライズド検索システムであって、
ユーザーｕが評価したすべてのアイテム、各アイテムに対するスコア及びテキストコメント、各アイテムの画像、及びユーザーｕの評価に対するほかのユーザーの有用性評価スコアを含むユーザーｕ生成コンテンツを収集して取得し、テキストコメントをベクトル化し、アイテム画像に対して特徴抽出を行い、固有ベクトルを取得するユーザー生成コンテンツ取得モジュールと、
ユーザースコアが所定のスコア閾値よりも大きく且つ信頼度が所定の信頼度閾値よりも大きいアイテムでユーザー嗜好を含む優位アイテムグループＤを構成する優位アイテムグループ構築モジュールと、
注意メカニズムを融合したユーザー嗜好感知モデルを構築して訓練するユーザー嗜好感知モデル構築訓練モジュールであって、前記モデルは深層信念ネットワークに基づいて３層の制限付きボルツマンマシンから構成され、１層目の制限付きボルツマンマシンの可視層は第１組の可視ユニットｖ_１、第２組の可視ユニットｖ_２及び第３組の可視ユニットｖ_３を含み、隠れ層はｈ_１であり、ｈ_１は可視層として、隠れ層ｈ_２とともに２層目の制限付きボルツマンマシンを構成し、ｈ_２は可視層として、隠れ層ｈ_３とともに３層目の制限付きボルツマンマシンを構成し、前記注意メカニズムを融合したユーザー嗜好感知モデルのパラメータはθ＝｛θ_１，θ_２，θ_３｝＝｛ｗ_１，ａ_１，ｂ_１，ｗ_２，ａ_２，ｂ_２，ｗ_３，ａ_３，ｂ_３｝であるユーザー嗜好感知モデル構築訓練モジュールと、
訓練済みの注意メカニズムを融合した深層信念ネットワークベースのユーザー嗜好感知モデル及びそのモデルパラメータに基づいて、ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を構築し、
Ｐ（ｘ）＝［Ｐ（ψ_１），Ｐ（ψ_２），・・・，Ｐ（ψ_ｎ），・・・，Ｐ（ψ_Φ）］（１７）
ここで、（ψ_１，ψ_２，・・・，ψ_ｎ，・・・，ψ_Φ）はアイテムｘの元の決定ベクトルであり、Ｐ（ψ_ｎ）はアイテムのｎ番目の決定成分に対するユーザーの嗜好確率を表すユーザー嗜好ベースの分布推定確率モデル構築モジュールと、
ユーザー嗜好ベースの分布推定確率モデルＰ（ｘ）を使用して、分布推定アルゴリズムを採用して、それぞれ１つのアイテムであるＮ（Ｎは所定の個体群のサイズである）個の新個体を生成し、各新個体のカテゴリラベルベクトルを設定する個体群生成モジュールと、
検索空間においてＮ個の新個体カテゴリラベルベクトル
との類似度が最も高いＮ個のアイテムを選択して推薦対象アイテムセットＳ^ｕを構成する推薦対象アイテムセット構築モジュールと、
推薦対象アイテムセットＳ^ｕにおける各アイテムの適応値を計算する適応値計算モジュールと、
Ｓ^ｕにおける適応値が最も高い上位ＴｏｐＮ個のアイテムを検索結果として選択し、ＴｏｐＮ＜Ｎである検索結果選択モジュールと、を含む、ことを特徴とする注意メカニズムを融合したパーソナライズド検索システム。 (Appendix 10)
A personalized search system that combines an attention mechanism,
Collect and retrieve user u-generated content, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings, and a user-generated content acquisition module that vectorizes comments, extracts features from item images, and acquires eigenvectors;
a superior item group construction module that configures a superior item group D including user preferences with items whose user score is greater than a predetermined score threshold and whose reliability is greater than a predetermined reliability threshold;
A user preference sensing model building and training module that builds and trains a user preference sensing model that integrates an attention mechanism, the model is composed of a three-layer restricted Boltzmann machine based on a deep belief network, and the first layer The visible layer of the restricted Boltzmann machine includes a first set of visible units v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is the visible layer. , a second-layer restricted Boltzmann machine is constructed with the hidden layer _h2 , _h2 is the visible layer, and a third-layer restricted Boltzmann machine is constructed with the hidden layer _h3 . The parameters of the preference sensing model are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w 2 , a ₂ , b ₂ _, w ₃ , a ₃ , b ₃ } A preference sensing model construction training module;
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item a user preference-based distribution estimation probability model construction module representing preference probabilities;
Using the user preference-based distribution estimation probability model P(x), a distribution estimation algorithm is adopted to generate N (N is the size of a given population) new individuals, each of which is one item. and a population generation module that sets a category label vector for each new individual;
N new individual category label vectors in the search space
a recommendation target item set construction module that selects N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
an adaptation value calculation module that calculates the adaptation value of each item in the recommendation target item set S ^u ;
A personalized search system integrating an attention mechanism, comprising: a search result selection module that selects TopN items with the highest adaptation value in ^Su as search results, and where TopN<N.

Claims

A personalized search method that combines an attention mechanism,
Collect and retrieve user u-generated content, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings, and Step 1 of vectorizing comments, extracting features from item images, and obtaining eigenvectors;
Step 2 of configuring a dominant item group D including user preferences with items having a user score larger than a predetermined score threshold and a reliability larger than a predetermined confidence threshold, the items in D forming a set S; , S={(u, x _i , C _i , T _i , G _i )}, where x _i ∈D, C _i is the category label vector of item x _i , and T _i is the item is a vectorized representation of the user's text comment for x _i , G _i is a vectorized representation of the image features of item x _i , i = 1, 2, ..., |D|, and |D| Step 2 representing the number of items in D;
A user preference sensing model that combines attention mechanisms is constructed, and the model is composed of three layers of restricted Boltzmann machines based on a deep belief network, and the visible layer of the first layer of restricted Boltzmann machines is the first set of visible layers. It includes a unit v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is the visible layer with the restriction of the second layer together with the hidden layer h ₂ A Boltzmann machine is constructed, _h2 is a visible layer, and a hidden layer _h3 constitutes a third layer restricted Boltzmann machine, and the parameters of the user preference sensing model that combines the above attention mechanism are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w ₂ , a ₂ , b ₂ , w ₃ , a ₃ , b ₃ },
Using the dominant item group D, train the first-layer restricted Boltzmann machine in the user preference sensing model that combines the attention mechanism with the contrastive divergence learning algorithm, and calculate the model parameters θ ₁ = {w ₁ , a ₁ , b ₁ },
After the training of the first layer RBM model is completed, given the state of the hidden unit, the activation state of each visible unit is conditionally independent, and the vector representation of some item x _i [C _i , T _i , G _i ] is input to the visible layer, and the activation probabilities of the first set, second set, and third set of visible units are respectively
and
Here, a _1,j , a _1,k and a _1,l represent the biases of the first set, second set and third set of visible units, respectively;
The information entropy of various multi-source heterogeneous data is calculated, and the information entropy of item category label is
and
The information entropy of the text comment vector is
and
The information entropy of the item image feature vector is
and
Here, c _ij represents the j-th element of the category label vector C _i of item x _i , and p ₍ c _ij) is the activation of the visible unit corresponding to the j-th element of the vector representation of the item category label in RBM1. represents the probability,
t _ik represents the kth element of the vectorized representation T _i of user u's text comment for item x _i , and p ₍ t _ik) is the visible vector representation T i of the vectorized representation of user u's text comment in RBM1. represents the activation probability of the unit,
g _il represents the lth element of the vectorized representation G _i of the image features of item x _i , and p _(gil) is the activation of the visible unit corresponding to the lth element of the vectorized representation of the item image features in RBM1. represents the probability,
Next, calculate the ratio of various information entropies to the total information entropy as a weighting factor,
Here, H(x _i )=H(C _i )+H(T _i )+H(G _i ),
When vectors C _i , T _i , G _i are combined to form the decision vector Ψ _i of item x _i and inputted to each visible unit in v ₁ , v ₂ , v ₃ , the activation of each hidden unit in hidden layer h ₁ is The activation states are conditionally independent, and the activation probability of the m _- th hidden unit is
and
Here, m ₁ =1, 2,..., M ₁ ,
is the bias of the m _-th hidden unit in h ₁ , v _1j is the state of the j-th visible unit in the first set of visible units v ₁ of RMB1, and v _2k is the state of the j-th visible unit in the second set of visible units of RMB1. v ₂ is the state of the k-th visible unit, v _3l is the state of the l-th visible unit in the third set of visible units v ₃ of RMB1,
is the element value in w ₁ and represents the connection weight between the nth visible unit and m _1st hidden unit in RBM1, where n = 1, 2, ..., Φ,
represents the state of the m _- th hidden unit in the hidden layer _h1 ,
is the sigmoid activation function,
After the training of RBM1 is completed, the state of each hidden unit corresponding to item x _i is obtained according to equation (9), and the user's preference degree for each determining component of each item in dominant item group D, that is, the state of the visible layer Obtain the activation probability of the unit as an attention weighting coefficient at _n (x _i ),
here,
If Ψ _i is the state of each visible unit in the visible layer of RBM1, represents the state of the m _i -th hidden unit in the hidden layer h _i , and at _n (x _i ) is each determinant component ψ _in of item x _i represents the attention weight of
Encoding is performed on the item x _i in the dominant item group D based on the attention mechanism using the attention weight coefficient at _n (x _i ) as the weight coefficient of each determining component of the item x _i , and after encoding, it is expressed as x _ati ,
x _ati is input to RBM1 after pre-training to obtain the activation probability V _RBM1 (x _ati ) of the visible unit,
here,
is the n'th element of x _ati ,
Perform a self-attention mechanism calculation using the activation probability V _RBM1 (x _ati ) of the visible unit of RBM1, dynamically learn the user preference attention weight vector A (x _ati ) for each item,
Here, the softmax() function ensures that the sum of all weighting coefficients is 1, and the function a(V _RBM1 (x _ati ), w ₁ ) measures the attention weighting coefficient of item x _i with respect to user preference features. And the calculation is
and
Generating an item decision vector that combines the attention mechanism by combining the user preference attention weight vector A(x _ati ) and the original decision vectors C _i , T _i , G _i of item x _i ;
Item decision vector fused with attention mechanism
Construct a training set, train the RBM1, RBM2, and RBM3 models in DBN layer by layer, and after the training is completed, create a deep belief network-based user preference sensing model that integrates the attention mechanism and its optimized model parameters θ. Step 3 to obtain
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item Step 4 representing the preference probability of
Set the population size N, and use the user preference-based distribution estimation probability model P(x) to generate N new individuals with the distribution estimation algorithm, each individual being one item, and the v-th Category label vector of new individual
The setting step for (v=1, 2,...,N) is
(5.1) Let v=1,
(5.2) Generate a random number z of [0, 1], and if z≦P (ψ _j =1), the category label vector of the vth new individual
The jth element of is 1, otherwise 0,
(5.3) Step 5, which is to add 1 to v and repeat step (5.2) until v>N;
Category label vector of N new individuals in search space
a step 6 of selecting N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
Adaptation value of each item in the recommendation target item set S ^u
Calculate,
here,
as well as
respectively represent the maximum value and minimum value of the energy function of the item in the recommendation target item set S ^u ,
is the energy function of item x ^* , x ^* ∈S ^u , and its calculation is
and
here,
is the nth determining component of item x ^* , step 7;
Selecting the top N items with the highest adaptation value in S ^u as the search results, TopN<N, step 8;
With the promotion of users' interactive search process and the dynamic evolution of user behavior, we will update the dominant item group D according to the latest evaluation data of current users, and retrain the user preference sensing model that combines the attention mechanism. , dynamically updating the extracted user preference features and simultaneously updating the user preference-based distribution estimation probability model P(x);
A personalized search method that combines an attention mechanism characterized by:

The superior item group D further includes new items with a ratio of η, and the new items are obtained by randomly sampling in the search space.
A personalized search method combining the attention mechanism according to claim 1.

If the current user u has not rated the new item, the text comment made by the current user u's similar user u' to the new item is used as user u's rating for the new item, and If multiple similar users have all rated the new item, select the rating of the user with the highest degree of similarity to user u, and if none of the current user u's similar users have rated the new item. , user u's evaluation of the new item uses a method of randomly assigning values;
A personalized search method combining the attention mechanism according to claim 2.

A similar user to user u is a user who has a common score item with user u and whose degree of similarity is greater than a predetermined similarity threshold, and for user u' who has a common score item with user u, u'≠ u, and the similarity Sim(u, u') between u and u' is
and
here,
represents the item set that both users u and u' evaluated,
teeth
is user u's score for item x' in
is the score of user u' for x',
is the average score of all items rated by user u,
is the average score of all items rated by user u',
A personalized search method combining the attention mechanism according to claim 3.

Specifically, the step of training the RBM1, RBM2, and RBM3 models in the DBN layer by layer includes:
First, train RBM1, obtain the parameters {w ₁ , a ₁ , b ₁ }, transfer b ₁ to a ₂ in RBM2, train RBM2, and obtain the optimization parameters {w ₂ , a ₂ , b ₂ }, transmit b ₂ to a ₃ in RBM3, train RBM3, and obtain optimization parameters {w ₃ , a ₃ , b ₃ }.
A personalized search method combining the attention mechanism according to claim 1.

The calculation of the user's preference probability P(ψ _n ) for the nth determining component of the item is
First, a user preference-based probability distribution model p(x) is calculated based on the dominant item group D,
p(x) is a Φ-dimensional vector, and its nth element P(ψ _n ) is the activation probability of the nth determining component of the user preference item, and a lower bound constraint is applied to P(ψ _n ). , the value after the constraint is the user's preference probability P(ψ _n ) for the nth determining component of the item, i.e.
and
ε is a predetermined lower bound threshold;
A personalized search method combining the attention mechanism according to claim 1.

In the three-layer restricted Boltzmann machine, the number of hidden units in the hidden layer in each layer of the restricted Boltzmann machine is 0.8 to 1.2 times the number of visible units in the visible layer.
A personalized search method combining the attention mechanism according to claim 1.

The ratio of new items to superior item group D is η<30%,
A personalized search method combining the attention mechanism according to claim 2.

The step 6 calculates the similarity using Euclidean distance, that is, the smaller the Euclidean distance between two vectors, the higher the similarity between them.
A personalized search method combining the attention mechanism according to claim 1.

A personalized search system that combines an attention mechanism,
Collect and retrieve user u-generated content, including all items rated by user u, scores and text comments for each item, images of each item, and other users' usefulness evaluation scores for user u's ratings, and a user-generated content acquisition module that vectorizes comments, extracts features from item images, and acquires eigenvectors;
a superior item group construction module that configures a superior item group D including user preferences with items whose user score is greater than a predetermined score threshold and whose reliability is greater than a predetermined reliability threshold;
A user preference sensing model building and training module that builds and trains a user preference sensing model that integrates an attention mechanism, the model is composed of a three-layer restricted Boltzmann machine based on a deep belief network, and the first layer The visible layer of the restricted Boltzmann machine includes a first set of visible units v ₁ , a second set of visible units v ₂ and a third set of visible units v ₃ , the hidden layer is h ₁ , and h ₁ is the visible layer. , a second-layer restricted Boltzmann machine is constructed with the hidden layer _h2 , _h2 is the visible layer, and a third-layer restricted Boltzmann machine is constructed with the hidden layer _h3 . The parameters of the preference sensing model are θ={θ ₁ , θ ₂ , θ ₃ }={w ₁ , a ₁ , b ₁ , w 2 , a ₂ , b ₂ _, w ₃ , a ₃ , b ₃ } A preference sensing model construction training module;
Build a user preference-based distribution estimation probability model P(x) based on a deep belief network-based user preference sensing model that combines a trained attention mechanism and its model parameters,
P(x)=[P(ψ ₁ ), P(ψ ₂ ), ..., P(ψ _n ), ..., P(ψ _Φ )] (17)
Here, (ψ ₁ , ψ ₂ , ..., ψ _n , ..., ψ _Φ ) is the original decision vector of item x, and P (ψ _n ) is the user's decision vector for the nth decision component of item a user preference-based distribution estimation probability model construction module representing preference probabilities;
Using the user preference-based distribution estimation probability model P(x), a distribution estimation algorithm is adopted to generate N (N is the size of a given population) new individuals, each of which is one item. and a population generation module that sets a category label vector for each new individual;
N new individual category label vectors in the search space
a recommendation target item set construction module that selects N items having the highest degree of similarity to constitute a recommendation target item set S ^u ;
an adaptation value calculation module that calculates the adaptation value of each item in the recommendation target item set S ^u ;
A personalized search system integrating an attention mechanism, comprising: a search result selection module that selects TopN items with the highest adaptation value in ^Su as search results, and where TopN<N.