JP7495363B2

JP7495363B2 - Personality information generation model, device and method using domain-independent RNN

Info

Publication number: JP7495363B2
Application number: JP2021015575A
Authority: JP
Inventors: 雄一石川
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2021-02-03
Filing date: 2021-02-03
Publication date: 2024-06-04
Anticipated expiration: 2041-02-03
Also published as: JP2022118820A

Description

本発明は、ユーザのパーソナリティや心理状態を推定する技術に関する。 The present invention relates to a technology for estimating a user's personality and psychological state.

人間の性格、人格、気質や、個性、さらには価値観といったような精神面の指標を全体的に捉えた特性であるパーソナリティは、各種商品・サービスの購入・選択といった様々な行動領域（行動ドメイン）における行動内容に少なからぬ影響を与えることが知られている。このため近年、ユーザの行動を予測したり、提供する商品・サービスをパーソナライズしたりすることを目的として、ユーザのパーソナリティを把握する試みが盛んに行われている。 Personality, which is a characteristic that holistically captures mental indicators such as a person's character, personality, temperament, individuality, and even values, is known to have a significant impact on behavior in various behavioral domains, such as the purchase and selection of various products and services. For this reason, in recent years, there have been many attempts to understand users' personalities in order to predict their behavior and personalize the products and services they provide.

今日、人間のパーソナリティは「特性論」と呼ばれる考え方に従い把握するのが一般的である。この特性論では、人間のパーソナリティは、複数の特性で構成されていて各特性の高低（スコア）で表現することができるとしている。例えば、特性論に基づくパーソナリティモデルの代表例である「Big Five」は、人間のパーソナリティが知的好奇心（O: Openness to experience）、誠実性（C: Conscientiousness）、外向性（E: Extraversion）、協調性（A: Agreeableness）、及び情緒安定性（N: Neuroticism）の５つの特性で構成されるとしており、人間の性格をこれら５特性（OCEAN）のスコアの組で表現する。 Today, human personality is generally understood according to a concept called "trait theory." This trait theory states that human personality is composed of multiple traits and can be expressed by the level (score) of each trait. For example, the "Big Five," a representative example of a personality model based on trait theory, states that human personality is composed of five traits: intellectual curiosity (O: Openness to experience), conscientiousness (C: Conscientiousness), extraversion (E: Extraversion), agreeableness (A: Agreeableness), and emotional stability (N: Neuroticism), and expresses human character as a set of scores for these five traits (OCEAN).

このようなパーソナリティを測定する手法として、質問紙調査がよく用いられる。例えば、Big Five測定用の質問紙として代表的な「Big Five Scales」は、合計60の項目からなる質問群を被験者に回答させるものであり、例えば知的好奇心（O）の測定においては、「独創的な」や「多才の」等の質問に対し「まったくあてはまらない（1点）」～「非常にあてはまる（７点）」の7段階で回答させ、全質問の合計得点で知的好奇心（O）のスコアを決定する仕組みとなっている。 Questionnaire surveys are often used as a method of measuring such personality traits. For example, the "Big Five Scales," a representative questionnaire for measuring the Big Five, has subjects answer a set of questions consisting of a total of 60 items. When measuring intellectual curiosity (O), for example, subjects are asked to answer questions such as "creative" and "versatile" on a seven-point scale from "not at all applicable (1 point)" to "very applicable (7 points)," and the score for intellectual curiosity (O) is determined by the total score for all questions.

一方、膨大なユーザに対し質問紙調査を実施することは多大なコストがかかることを踏まえ、最初に少数のユーザに対して質問紙調査を実施し、その回答結果と当該ユーザの行動データとを用いて当該ユーザのパーソナリティを推定する技術の研究も広く行われている。 On the other hand, given the enormous cost involved in conducting questionnaire surveys of a huge number of users, there is also widespread research into technology that first administers questionnaire surveys to a small number of users and then uses the responses to these surveys and the behavioral data of those users to estimate their personalities.

非特許文献１はこのような技術のサーベイ論文であり、例えば、ＳＮＳ（Social Networking Service）への投稿内容や、ＳＮＳで繋がっている友人の数、携帯端末の発信履歴や、端末位置情報に基づき導出される移動パターン、さらには通話における発話の速度・抑揚や、声の大小等、様々なユーザの行動データを用いて、当該ユーザのパーソナリティを推定する技術が紹介されている。 Non-Patent Document 1 is a survey paper on such technologies, and introduces a technology that estimates a user's personality using various user behavioral data, such as the content posted to SNS (Social Networking Service), the number of friends connected on SNS, the call history of a mobile device, movement patterns derived based on device location information, and even the speed and intonation of speech during phone calls, as well as the volume of the voice.

A. Vinciarelli and G. Mohammadi, "A Survey of Personality Computing,", IEEE Transactions on Affective Computing, vol. 5, no. 3, pp. 273-291, ２０１４年，＜https://doi.org/10.1109/TAFFC.2014.2330816＞A. Vinciarelli and G. Mohammadi, "A Survey of Personality Computing," IEEE Transactions on Affective Computing, vol. 5, no. 3, pp. 273-291, 2014, <https://doi.org/10.1109/TAFFC.2014.2330816> Donkers, T., Loepp, B., & Ziegler, J., "Sequential user-based recurrent neural network recommendations", RecSys'17 - Proceedings of the 11th ACM Conference on Recommender Systems, pp. 152-160, ２０１７年, ＜https://doi.org/10.1145/3109859.3109877＞Donkers, T., Loepp, B., & Ziegler, J., "Sequential user-based recurrent neural network recommendations", RecSys'17 - Proceedings of the 11th ACM Conference on Recommender Systems, pp. 152-160, 2017, ＜https://doi.org/10.1145/3109859.3109877＞

以上説明したように従来、パーソナリティの測定手法として様々な技術が開発されてきたが、これらの技術に共通する解決すべき課題として、（ａ）測定粒度（情報粒度）の粗さと、（ｂ）信頼性の低さとが挙げられる。 As explained above, various techniques have been developed to measure personality, but these techniques share common issues that need to be resolved: (a) coarse measurement granularity (information granularity) and (b) low reliability.

最初に（ａ）測定粒度の粗さであるが、これは、ユーザの行動データから当該ユーザのパーソナリティを推定する技術を含め、「特性論」に基づきパーソナリティを把握する技術に顕著な課題となっている。例えば、Big Fiveの５つの特性は元来、辞書から抜き出した膨大な数の性格表現単語を、統計的な手法によって５つの類似概念単語グループに集約し、この集約結果に基づいて人間が解釈し易いように決定されたものである。 First, (a) the coarseness of measurement granularity is a significant issue for technologies that understand personality based on "trait theory," including technologies that estimate a user's personality from the user's behavioral data. For example, the Big Five traits were originally determined by aggregating a huge number of personality-descriptive words extracted from dictionaries into five groups of words with similar concepts using statistical methods, and then based on the results of this aggregation, in a way that would be easy for humans to interpret.

ここで、この集約の過程において、性格表現単語の本来有する意味合いにおける細かな若しくは微妙な相違部分が失われてしまっている。例えば、Big Fiveのうちの１つである誠実性（C）は、「真面目」「勤勉」「計画性がある」等の性格表現を一特性に集約したものとなっており、この誠実性（C）の測定結果として得られるのは単一のスコアに過ぎず、集約前の性格表現１つ１つの意味合いの相違部分は、測定結果から失われているのである。 However, in the process of aggregation, small or subtle differences in the original meanings of the personality expressions are lost. For example, conscientiousness (C), one of the Big Five, is aggregating personality expressions such as "serious," "hardworking," and "organized" into a single trait, and the measurement result of this conscientiousness (C) is merely a single score, and the differences in meaning of each personality expression before aggregation are lost from the measurement results.

次に（ｂ）信頼性の低さであるが、これは特に、質問紙調査によるパーソナリティ測定を含む技術に顕著な課題となっている。例えば、質問紙の質問に対する被験者の回答態度には通常、相当のばらつきが存在する。具体的には、自分の本来の考えではなく「こうあるべき」との考えに基づいて回答を行うケースや、回答自体を面倒な作業と捉えて出鱈目な回答を行うケースが生じることも少なくない。その結果、パーソナリティの測定結果における信頼性が低下してしまうのである。 Next, (b) low reliability is a significant problem for techniques that involve measuring personality through questionnaire surveys. For example, there is usually considerable variation in the way subjects respond to questions in a questionnaire. Specifically, it is not rare for subjects to answer based on how they "should be" rather than on their own original thoughts, or to consider answering itself to be a tedious task and give false answers. As a result, the reliability of personality measurement results is reduced.

また、仮に本来想定する（素直な且つ真面目な）回答態度が担保されたとしても、そもそも質問紙調査で設定されるような質問項目や回答方法では、測定対象とする特性の領域全てを測定し切れない可能性が高く、測定結果における信頼性は低くならざるを得ない。例えば、Big Fiveの質問紙は20種以上存在するが、質問数、質問表現や、（5段階で回答するか、はい／いいえで回答するか等の）回答方法は、それぞれで相当に異なるものとなっている。またその影響もあって、同一の被験者であっても回答する質問紙の種別によって測定結果が相違することは広く知られており、２つの質問紙間において、協調性（A）の測定結果の相関rが0.40を下回るケースも発生している。 Even if the expected (honest and serious) response attitude is guaranteed, the questions and response methods set in a questionnaire survey are unlikely to measure all of the areas of the characteristics to be measured, and the reliability of the measurement results is inevitably low. For example, there are more than 20 types of Big Five questionnaires, but the number of questions, question wording, and response methods (such as answering on a 5-point scale or yes/no) are significantly different for each. As a result, it is widely known that the measurement results of the same subject differ depending on the type of questionnaire answered, and there have been cases where the correlation r of the measurement results of agreeableness (A) between two questionnaires is below 0.40.

さらに、ユーザの行動データから当該ユーザのパーソナリティを推定する技術においても、使用する推定モデルは通常、質問紙調査による測定結果を用いて訓練・構築されるので、以上に説明した（ｂ）信頼性の低さが、解決すべき重大な課題となっているのである。 Furthermore, even in technology that estimates a user's personality from the user's behavioral data, the estimation models used are usually trained and constructed using measurement results from questionnaire surveys, so the low reliability described above (b) is a serious issue that needs to be resolved.

このような現状を受け、本願発明者は、質問紙調査による測定結果を用いずに、商品・サービスのレコメンド（推薦）技術の分野において研究されている動的なユーザ表現学習（Sequential URL, Sequential User Representation Learning）技術を応用することによって、上記の２つの課題を解決するパーソナリティ推定が実現することに思い至った。 In light of this current situation, the inventors of the present application came up with the idea of realizing personality estimation that solves the above two problems by applying dynamic user representation learning (Sequential URL, Sequential User Representation Learning) technology, which is being researched in the field of product and service recommendation technology, without using measurement results from questionnaire surveys.

ここでSequential URL技術は、音楽再生や、動画閲覧、さらには広告クリックや、アイテム購買等に係る時系列の行動データから、(例えば音楽や映画の好みといったような)ユーザ属性の分散表現を獲得する技術である。例えば非特許文献２には、このSequential URL技術として、ユーザの映画視聴についての時系列行動データを、回帰ニューラルネットワーク（ＲＮＮ, Recurrent Neural Network）の一種であるＧＲＵ（Gated Recurrent Unit）へ入力し、ユーザ属性の分散表現を獲得する技術が開示されている。 The Sequential URL technology is a technology that acquires distributed representations of user attributes (such as preferences for music or movies) from time-series behavioral data related to music playback, video viewing, ad clicks, item purchases, etc. For example, Non-Patent Document 2 discloses a technology for this Sequential URL technology in which time-series behavioral data on a user's movie viewing is input into a GRU (Gated Recurrent Unit), which is a type of Recurrent Neural Network (RNN), to acquire distributed representations of user attributes.

具体的に、この非特許文献２に記載された技術では、ユーザの識別情報を入力とする分散表現抽出部から出力された当該ユーザの分散表現と、当該ユーザによって視聴された映画の識別情報を入力とする分散表現抽出器から出力された当該映画の分散表現とを入力とするＧＲＵセルが開示されている。ここで非特許文献２は、十分な量のデータによってこのＧＲＵセル及び分散表現抽出器を訓練することによって、分散表現抽出部から出力される当該ユーザの分散表現には、静的であって容易には変化しないユーザ属性、例えば当該ユーザの映画の好み、が反映されるとしている。 Specifically, the technology described in Non-Patent Document 2 discloses a GRU cell that receives as input the distributed representation of a user output from a distributed representation extraction unit that receives the user's identification information as input, and the distributed representation of a movie output from a distributed representation extractor that receives the identification information of a movie watched by the user as input. Non-Patent Document 2 claims that by training the GRU cell and the distributed representation extractor with a sufficient amount of data, the distributed representation of the user output from the distributed representation extraction unit reflects user attributes that are static and do not change easily, such as the user's movie preferences.

しかしながら、このようなSequential URL技術によって獲得されるユーザ属性の分散表現は当然ながら、使用する行動データの属する行動ドメイン（領域）に強く依存する量となっており、到底、ユーザの（精神面の指標を全体的に捉えた特性である）パーソナリティと解釈されるものとはなっていない。例えば、映画視聴との行動ドメインに係る行動データを用いた場合、例えば当該ユーザの映画の好み、といったような映画視聴に係るユーザ属性しか抽出することができない。またそれ故従来、Sequential URL技術を用いてユーザのパーソナリティを推定する試みは、何らなされてこなかったのである。 However, the distributed representation of user attributes obtained by such Sequential URL technology is naturally highly dependent on the behavioral domain (area) to which the behavioral data used belongs, and is in no way interpretable as the user's personality (a characteristic that captures mental indicators holistically). For example, when behavioral data related to the behavioral domain of movie viewing is used, only user attributes related to movie viewing, such as the user's movie preferences, can be extracted. For this reason, no attempt has been made to estimate a user's personality using Sequential URL technology to date.

そこで、本発明は、情報粒度及び信頼性のより高いパーソナリティ情報を生成することができるパーソナリティ情報生成モデル、パーソナリティ推定装置、及びパーソナリティ推定方法を提供することを目的とする。 The present invention aims to provide a personality information generation model, a personality estimation device, and a personality estimation method that can generate personality information with higher information granularity and reliability.

本発明によれば、複数の行動ドメインにおけるユーザの各行動ドメインに係る情報から、当該ユーザのパーソナリティを推定するコンピュータを機能させるパーソナリティ情報生成モデルであって、
当該行動ドメイン毎に設定された複数の行動表現生成部であって、各々が、当該行動ドメインにおける当該ユーザの行動の内容を識別する情報から、当該行動ドメインにおける当該ユーザの行動を表現する情報である行動表現情報を生成する、複数の行動表現生成部と、
当該行動ドメイン毎に設定された複数のドメイン特定回帰ニューラルネットワーク（ＲＮＮ）セルであって、各々が、当該行動ドメインにおける当該ユーザの当該行動表現情報を受け取り、当該ユーザの当該行動表現情報を前回受け取った時点で自ら生成した、ＲＮＮの隠れ状態情報に相当するドメイン特定隠れ状態情報に対し、今回受け取った当該ユーザの当該行動表現情報を反映させて、同じくＲＮＮの隠れ状態情報に相当する新たなドメイン特定隠れ状態情報を生成する、複数のドメイン特定ＲＮＮセルと、
当該ユーザを識別する情報から、当該ユーザを表現する情報であるユーザ表現情報を生成するユーザ表現生成部と、
生成された当該ドメイン特定隠れ状態情報と、生成された当該ユーザ表現情報とを受け取り、当該ユーザの当該ドメイン特定隠れ状態情報を前回受け取った時点で自ら生成した、ＲＮＮの隠れ状態情報に相当するドメイン非依存隠れ状態情報に対し、今回受け取った当該ユーザの当該ドメイン特定隠れ状態情報及び当該ユーザ表現情報を反映させて、同じくＲＮＮの隠れ状態情報に相当する新たなドメイン非依存隠れ状態情報を生成するドメイン非依存ＲＮＮセルと
してコンピュータを機能させ、
複数のドメイン特定ＲＮＮセル及びドメイン非依存ＲＮＮセルと合わせて、ＲＮＮの誤差逆伝播法によって訓練されたユーザ表現生成部は、当該ユーザの当該ユーザ表現情報を、当該ユーザに係るドメイン非依存的な且つ静的な情報という意味で当該ユーザのパーソナリティを表現した情報として生成する
ことを特徴とするパーソナリティ情報生成モデルが提供される。 According to the present invention, there is provided a personality information generation model for causing a computer to function to estimate a personality of a user from information related to each of a plurality of behavioral domains of the user, the personality information generation model comprising:
a plurality of behavioral expression generating units set for each of the behavioral domains, each generating behavioral expression information that is information expressing the behavior of the user in the behavioral domain from information that identifies the content of the behavior of the user in the behavioral domain;
a plurality of domain-specific regression neural network (RNN) cells set for each of the behavioral domains, each of which receives the behavioral expression information of the user in the behavioral domain, and generates new domain-specific hidden state information corresponding to the hidden state information of the RNN by reflecting the behavioral expression information of the user received this time on domain-specific hidden state information corresponding to the RNN hidden state information that the RNN cell itself generated when it previously received the behavioral expression information of the user ;
a user expression generating unit that generates user expression information that represents the user from information that identifies the user;
a computer is caused to function as a domain-independent RNN cell that receives the generated domain-specific hidden state information and the generated user expression information, and generates new domain-independent hidden state information corresponding to the RNN hidden state information by reflecting the currently received domain-specific hidden state information of the user and the user expression information on domain-independent hidden state information corresponding to the RNN hidden state information that the computer generated when it previously received the domain-specific hidden state information of the user ;
A user expression generating unit trained by the RNN backpropagation method in combination with a plurality of domain-specific RNN cells and domain-independent RNN cells generates user expression information of the user as information expressing the personality of the user in the sense of domain-independent and static information related to the user.
A personality information generation model is provided.

この本発明によるパーソナリティ情報生成モデルにおいて、ドメイン非依存ＲＮＮセルは、当該ドメイン特定ＲＮＮセルにおいて最近に生成された又は生成された直後の当該ドメイン特定隠れ状態情報を受け取り、当該ドメイン非依存隠れ状態情報を生成することも好ましい。 In the personality information generation model according to the present invention, it is also preferable that the domain-independent RNN cell receives the domain-specific hidden state information that has been recently generated or has just been generated in the domain-specific RNN cell, and generates the domain-independent hidden state information.

また、本発明によるパーソナリティ情報生成モデルの一実施形態として、本モデルは、当該ドメイン特定ＲＮＮセルから出力された当該ドメイン特定隠れ状態情報に対し所定の形の変換処理を施し、当該変換処理の施されたドメイン特定隠れ状態情報をドメイン非依存ＲＮＮセルへ入力する第１の変換処理部としてコンピュータを更に機能させることも好ましい。 In addition, as one embodiment of the personality information generation model according to the present invention, it is also preferable that the model further causes the computer to function as a first conversion processing unit that performs a predetermined type of conversion processing on the domain-specific hidden state information output from the domain-specific RNN cell, and inputs the domain-specific hidden state information that has been subjected to the conversion processing to a domain-independent RNN cell.

さらに、本発明によるパーソナリティ情報生成モデルにおいて、当該ドメイン特定ＲＮＮセルは、当該行動表現情報を受け取る時点からみて最近に生成された当該ドメイン非依存隠れ状態情報も受け取り、このドメイン非依存隠れ状態情報も用いて当該ドメイン特定隠れ状態情報を生成することも好ましい。 Furthermore, in the personality information generation model according to the present invention, it is also preferable that the domain-specific RNN cell also receives the domain-independent hidden state information that was most recently generated from the time of receiving the behavioral expression information, and generates the domain-specific hidden state information using this domain-independent hidden state information as well.

また、本発明によるパーソナリティ情報生成モデルの他の実施形態として、本モデルは、ドメイン非依存ＲＮＮセルから出力された当該ドメイン非依存隠れ状態情報に対し所定の形の変換処理を施し、当該変換処理の施されたドメイン非依存隠れ状態情報を当該ドメイン特定ＲＮＮセルへ入力する第２の変換処理部としてコンピュータを更に機能させることも好ましい。 In another embodiment of the personality information generation model according to the present invention, it is also preferable that the model further functions as a second conversion processing unit that performs a predetermined type of conversion processing on the domain-independent hidden state information output from the domain-independent RNN cell, and inputs the domain-independent hidden state information that has been subjected to the conversion processing to the domain-specific RNN cell.

さらに、本発明によるパーソナリティ情報生成モデルにおいて、ユーザ表現生成部は、生成した当該ユーザ表現情報を、当該ユーザのパーソナリティを表現した情報として出力することも好ましい。また、ドメイン非依存ＲＮＮセルは、生成した当該ドメイン非依存隠れ状態情報を、当該ユーザの動的な心理状態に係る情報である動的心理状態情報として出力することも好ましい。 Furthermore, in the personality information generation model according to the present invention, it is also preferable that the user expression generation unit outputs the generated user expression information as information expressing the personality of the user, and the domain-independent RNN cell outputs the generated domain-independent hidden state information as dynamic psychological state information, which is information related to the dynamic psychological state of the user.

また、本発明に係る複数のドメイン特定ＲＮＮセル及びドメイン非依存ＲＮＮセルの各々は、新たな隠れ状態情報を生成するべくセルに入力される情報に対し所定の処理を施す忘却ゲート、更新ゲート及びリセットゲートのうちの少なくとも１つを備えていることも好ましい。 It is also preferable that each of the multiple domain-specific RNN cells and domain-independent RNN cells of the present invention includes at least one of a forget gate, an update gate, and a reset gate that performs a predetermined process on the information input to the cell to generate new hidden state information.

本発明によれば、また、以上に述べたパーソナリティ情報生成モデルを訓練して得られた学習済みのユーザ表現生成部を用いて、当該ユーザのパーソナリティを表現した情報を生成し、出力するパーソナリティ推定装置が提供される。 According to the present invention, there is also provided a personality estimation device that generates and outputs information expressing the personality of a user using a learned user expression generation unit obtained by training the personality information generation model described above.

本発明によれば、さらに、
以上に述べたパーソナリティ情報生成モデルに対し、ＲＮＮの誤差逆伝播法によって訓練を行い、
当該訓練によって得られた訓練済みの前記ユーザ表現生成部を用いて、当該ユーザに係るドメイン非依存的な且つ静的な情報という意味で当該ユーザのパーソナリティを表現した情報となっている、当該ユーザの当該ユーザ表現情報を生成し、出力する
ことを特徴とするコンピュータにおけるパーソナリティ推定方法が提供される。 According to the present invention, further comprising:
The personality information generation model described above is trained using the RNN backpropagation method ,
A personality estimation method in a computer is provided, characterized in that, using the trained user expression generation unit obtained by the training, the user expression information of the user is information that expresses the personality of the user in the sense of domain-independent and static information related to the user , and is output.

本発明のパーソナリティ情報生成モデル、パーソナリティ推定装置、及びパーソナリティ推定方法によれば、測定粒度及び信頼性のより高いパーソナリティ情報を生成することができる。 The personality information generation model, personality estimation device, and personality estimation method of the present invention make it possible to generate personality information with higher measurement granularity and reliability.

本発明によるパーソナリティ情報生成モデルの一実施形態を示す模式図である。FIG. 2 is a schematic diagram showing an embodiment of a personality information generation model according to the present invention. 本発明に係るドメイン特定ＲＮＮセル及びドメイン非依存ＲＮＮセルにおける内部構成の一実施形態を示す機能ブロック図である。FIG. 2 is a functional block diagram showing an embodiment of the internal configuration of a domain-specific RNN cell and a domain-independent RNN cell according to the present invention. 本発明によるパーソナリティ情報生成モデルで生成される各種情報の特徴を説明するための模式図、及び本発明によるパーソナリティ推定装置の一実施形態を示す機能ブロック図である。FIG. 1 is a schematic diagram for explaining characteristics of various pieces of information generated by a personality information generation model according to the present invention, and a functional block diagram showing an embodiment of a personality estimation device according to the present invention. 本発明によるパーソナリティ情報生成モデルの他の実施形態を示す模式図である。FIG. 13 is a schematic diagram showing another embodiment of a personality information generation model according to the present invention.

以下、本発明の実施形態について、図面を用いて詳細に説明する。 The following describes in detail an embodiment of the present invention with reference to the drawings.

［パーソナリティ情報生成モデル］
図１は、本発明によるパーソナリティ情報生成モデルの一実施形態を示す模式図である。 [Personality information generation model]
FIG. 1 is a schematic diagram showing an embodiment of a personality information generation model according to the present invention.

図１に示した本実施形態のパーソナリティ情報生成モデル１は、
（ａ）パーソナリティ推定対象である推定対象ユーザにおける複数の行動ドメイン（図１では「アイテム購入」及び「動画閲覧」の２つ）に関する情報である「行動ドメイン情報」を用いて、
（ｂ）推定対象ユーザのパーソナリティを表現した情報である「パーソナリティ情報」を生成する
機械学習モデルである。 The personality information generation model 1 of the present embodiment shown in FIG.
(a) Using “behavioral domain information” which is information on a plurality of behavioral domains (two in FIG. 1: “item purchase” and “video viewing”) of a target user who is a personality estimation target,
(b) A machine learning model that generates “personality information”, which is information expressing the personality of the user to be estimated.

具体的に、パーソナリティ情報生成モデル１は、
（Ａ）複数の行動ドメインの行動ドメイン毎に設定されたドメイン特定回帰ニューラルネットワーク（ＲＮＮ）セルであって、当該行動ドメインにおける推定対象ユーザの「ドメイン行動情報」を受け取り、前の時点で自ら生成した隠れ状態情報である「ドメイン特定隠れ状態情報」に対し「ドメイン行動情報」を反映させて、新たな「ドメイン特定隠れ状態情報」を生成する複数のドメイン特定ＲＮＮセル（図１ではＤＳＬ(Domain Specific Layer)１セル１１及びＤＳＬ２セル１２の２つ）と、
（Ｂ）推定対象ユーザを識別する情報である「ユーザ識別情報」から、推定対象ユーザを表現する情報である「ユーザ表現情報」を生成するユーザ表現生成部（図１ではユーザ分散表現抽出部１０ｕ）と、
（Ｃ）上記（Ａ）で生成された「ドメイン特定隠れ状態情報」と、上記（Ｂ）で生成された「ユーザ表現情報」とを受け取り、前の時点で自ら生成した隠れ状態情報である「ドメイン非依存隠れ状態情報」に対し、「ドメイン特定隠れ状態情報」及び「ユーザ表現情報」を反映させて、新たな「ドメイン非依存隠れ状態情報」を生成するドメイン非依存ＲＮＮセル（図１ではＤＩＬ(Domain Independent layer)１０）と
してコンピュータを機能させる機械学習モデルとなっている。 Specifically, the personality information generation model 1 is as follows:
(A) A plurality of domain-specific regression neural network (RNN) cells (two in FIG. 1 , a DSL (Domain Specific Layer) 1 cell 11 and a DSL 2 cell 12) are set for each of a plurality of behavioral domains, and receive “domain behavior information” of a user to be estimated in the behavioral domain, and generate new “domain-specific hidden state information” by reflecting the “domain behavior information” on “domain-specific hidden state information” that is hidden state information generated by the RNN cells themselves at a previous point in time;
(B) a user expression generation unit (in FIG. 1, a user distributed expression extraction unit 10u) that generates “user expression information” that is information expressing a user to be estimated from “user identification information” that is information that identifies a user to be estimated;
(C) This is a machine learning model that causes a computer to function as a domain-independent RNN cell (DIL (Domain Independent layer) 10 in FIG. 1 ) that receives the “domain-specific hidden state information” generated in (A) above and the “user expression information” generated in (B) above, and generates new “domain-independent hidden state information” by reflecting the “domain-specific hidden state information” and the “user expression information” on the “domain-independent hidden state information” that is the hidden state information that the computer itself generated at a previous point in time.

ここで本実施形態において、上記（Ａ）の「行動ドメイン情報」は、推定対象ユーザの当該行動ドメインにおける（購入や閲覧といった）行動の内容を示す情報（例えば購入したアイテムや閲覧した動画の識別情報）を受け取った行動表現生成部（図１ではアイテム分散表現抽出部１１ｉや動画分散表現抽出部１２ｍ）において生成される行動表現情報となっている。したがって、「行動ドメイン情報」（行動表現情報）は、各行動ドメインにおいて行動が発生する度に生成され、その結果、全体として時系列データ群をなすものとなる。 In this embodiment, the "behavior domain information" in (A) above is behavior expression information generated by a behavior expression generation unit (in FIG. 1, the item dispersed representation extraction unit 11i and the video dispersed representation extraction unit 12m) that receives information indicating the content of the behavior (such as purchase or viewing) of the estimated target user in the behavior domain (for example, identification information of a purchased item or a viewed video). Therefore, "behavior domain information" (behavior expression information) is generated each time a behavior occurs in each behavior domain, and as a result, the information as a whole forms a time series data group.

また、上記（Ｂ）のユーザ表現生成部（ユーザ分散表現抽出部１０ｕ）は、後に詳細に説明するが、複数のドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２）と合せて訓練されるのであり、結果的に当該訓練後は、特定の行動ドメインに偏らないドメイン非依存の表現生成演算（分散表現抽出演算）を実行する。これにより、ここで生成される「ユーザ表現情報」は、推定対象ユーザの「パーソナリティ情報」として把握されるものとなるのである。 In addition, the user expression generation unit (user distributed expression extraction unit 10u) in (B) above is trained together with multiple domain-specific RNN cells (DSL1 cell 11 and DSL2 cell 12), which will be described in detail later, and as a result, after the training, it executes a domain-independent expression generation calculation (distributed expression extraction calculation) that is not biased toward a specific behavioral domain. As a result, the "user expression information" generated here is understood as the "personality information" of the estimated target user.

このように、パーソナリティ情報生成モデル１は、例えば従来使用されてきた、信頼性に関し課題を有する質問紙調査による測定結果を用いることなく又はそれに依存することなく、推定対象ユーザの「行動ドメイン情報」から、信頼性のより高い「パーソナリティ情報」を生成することができるのである。 In this way, the personality information generation model 1 can generate more reliable "personality information" from the "behavioral domain information" of the estimated target user without using or relying on measurement results from questionnaire surveys, which have traditionally been used but have issues with reliability.

またさらに、このような「パーソナリティ情報」は本実施形態において、推定対象ユーザの「ユーザ識別情報」（例えば当該ユーザを示すone-hotベクトル）を受け取ったユーザ表現生成部（ユーザ分散表現抽出部１０ｕ）が出力する、多次元の（例えば数十～数百次元の）ユーザ表現ベクトルであり、例えば多数の数値の羅列となっている。 Furthermore, in this embodiment, such "personality information" is a multidimensional (e.g., tens to hundreds of dimensions) user expression vector output by a user expression generation unit (user distributed expression extraction unit 10u) that receives "user identification information" of the user to be estimated (e.g., a one-hot vector indicating the user), and is, for example, a list of numerous numerical values.

このユーザ表現ベクトルは、本願発明者が、単語分散表現に代表される公知の分散表現（埋め込み（embedding））技術にヒントを得て創作したものであり、以後、パーソナリティ表現ベクトル、又はpersonality embedding vectorとも称することとする。 This user expression vector was created by the inventor of this application, inspired by well-known embedding techniques such as word embedding, and will hereafter be referred to as a personality expression vector or personality embedding vector.

ここで、人工知能による自然言語処理において欠かせない技術となっている単語分散表現では、１つ１つの単語が、例えば数百次元の（多数の数値の羅列である）word embedding vectorで表現され、意味の近い単語同士ほどこのベクトル間の距離が小さくなる。すなわちword embedding vectorは、対応する単語の意味が埋め込まれたものと捉えることができる。また、word embedding vector同士の演算も可能となっており、例えば有名な例として、"king"－"man"＋"woman"＝"queen"といったような加減算を行うことが可能となっている。 Here, in word embedding representation, which is an essential technique in natural language processing using artificial intelligence, each word is represented by a word embedding vector of, for example, several hundred dimensions (a string of many numerical values), and the closer the words are in meaning, the smaller the distance between these vectors. In other words, a word embedding vector can be thought of as having the meaning of the corresponding word embedded in it. It is also possible to perform calculations between word embedding vectors, and one famous example is addition and subtraction, such as "king" - "man" + "woman" = "queen."

これと同様にして、本発明に係るpersonality embedding vectorも、対応するユーザのパーソナリティの特徴が埋め込まれたものと捉えることができ、それ故、ベクトル間距離をパーソナリティの類似度としたり、またベクトル同士の演算を行ったりして、パーソナリティに関しさらに有益な情報が導出・生成可能となることも期待されるのである。 In a similar manner, the personality embedding vector of the present invention can be considered to have the personality characteristics of the corresponding user embedded in it, and therefore it is expected that by using the distance between vectors as a measure of personality similarity or by performing calculations between vectors, it will be possible to derive and generate even more useful information regarding personality.

いずれにしても、パーソナリティ情報生成モデル１で生成される「パーソナリティ情報」（personality embedding vector）は、分散表現による表現情報となっているので、その情報粒度は、（パーソナリティを５次元で表現する）Big Fiveに代表される従来のパーソナリティ指標と比較して各段に高いものとなっている。 In any case, the "personality information" (personality embedding vector) generated by the personality information generation model 1 is representation information based on distributed representation, so the information granularity is much higher than that of conventional personality indices such as the Big Five (which express personality in five dimensions).

さらに、分散表現としての「パーソナリティ情報」は、人間が直接に解釈できるものではないが、例えば、様々な用途の機械学習モデルにおける入力データとして利用され、人工知能がユーザのパーソナリティを理解して、当該ユーザの行動予測やサービスのパーソナライズ等を実施するようなことも可能になると期待される。 Furthermore, although "personality information" as a distributed representation cannot be directly interpreted by humans, it is expected that it can be used, for example, as input data in machine learning models for various purposes, enabling artificial intelligence to understand a user's personality and predict the user's behavior and personalize services, etc.

なお、本発明によるパーソナリティ情報生成モデルにおいて、ドメイン特定ＲＮＮセルの数、すなわち取り扱う行動ドメインの数は、当然、図１に示したような２つに限定されるものではなく、３つ以上とすることも可能である。また、取り扱う行動ドメインも、図１に示したアイテム購入や動画閲覧に限定されるものではなく、行動主体のパーソナリティに依存する又はその影響を受け得る行動に係るドメインであれば種々様々なものが、本発明に係る行動ドメインとして採用可能である。例えば、行動ドメイン「広告クリック」の時系列データが取得される場合に、「広告クリック」に係るドメイン行動情報を取り込むドメイン特定ＤＮＮセルを追加して、パーソナリティ情報生成モデルを構成してもよい。 In the personality information generation model according to the present invention, the number of domain-specific RNN cells, i.e., the number of behavioral domains handled, is naturally not limited to two as shown in FIG. 1, and can be three or more. In addition, the behavioral domains handled are not limited to the item purchases and video viewing shown in FIG. 1, and various domains related to behaviors that depend on or can be influenced by the personality of the behavior subject can be adopted as behavioral domains according to the present invention. For example, when time-series data of the behavioral domain "ad click" is acquired, a personality information generation model may be constructed by adding a domain-specific DNN cell that incorporates domain behavior information related to "ad click".

ここで、「パーソナリティ情報」や上述した「ドメイン非依存隠れ状態情報」は、このように互いに異なる複数の又は多数の行動ドメインに係るセルからの「ドメイン特定隠れ状態情報」を受けてドメイン非依存化しているのであり、このうち特に「パーソナリティ情報」は、後に図３を用いて詳細に説明するが、推定対象ユーザにおける静的且つドメイン非依存の特性としてのパーソナリティに係る情報と捉えることが可能となるのである。 Here, the "personality information" and the above-mentioned "domain-independent hidden state information" are made domain-independent by receiving "domain-specific hidden state information" from cells related to multiple or many different behavioral domains. Of these, the "personality information" in particular can be considered as information related to the personality of the estimated target user as a static, domain-independent characteristic, as will be explained in detail later using Figure 3.

［モデル構成，パーソナリティ推定方法］
以下、本実施形態のパーソナリティ情報生成モデル１の構成について、より詳細に説明を行う。同じく図１によれば、パーソナリティ情報生成モデル１は、
（ア）行動ドメイン「アイテム購入」に係るドメイン特定ＲＮＮセルとしてのＤＳＬ１セル１１と、ユーザ分散表現抽出部１１ｕと、アイテム分散表現抽出部１１ｉと、出力層１１ｏと、
（イ）行動ドメイン「動画閲覧」に係るドメイン特定ＲＮＮセルとしてのＤＳＬ２セル１２と、ユーザ分散表現抽出部１２ｕと、動画分散表現抽出部１２ｍと、出力層１２ｏと、
（ウ）ドメイン非依存ＲＮＮセルとしてのＤＩＬセル１０と、ユーザ表現生成部としてのユーザ分散表現抽出部１０ｕと、出力層１０ｏと
を、コンピュータ（に搭載されたプログラム）によって具現される機能構成部として備えている。 [Model construction, personality estimation method]
The configuration of the personality information generation model 1 of this embodiment will be described in more detail below.
(A) A DSL1 cell 11 as a domain-specific RNN cell related to the behavior domain “item purchase”, a user distributed representation extraction unit 11u, an item distributed representation extraction unit 11i, and an output layer 11o;
(a) A DSL2 cell 12 as a domain-specific RNN cell related to the behavior domain “video viewing”, a user distributed expression extraction unit 12u, a video distributed expression extraction unit 12m, and an output layer 12o;
(c) It has a DIL cell 10 as a domain-independent RNN cell, a user distributed expression extraction unit 10u as a user expression generation unit, and an output layer 10o as functional components embodied by a computer (a program installed in the computer).

ここで図１には、上述した（ア）～（ウ）の機能構成部（図１の左端側の機能ブロック群）が実行する処理を、時間経過の向きが右向きとなっている時間軸上で展開した様子が示されている。なお、上記（ア）～（ウ）の各々について設定された計３つの時間軸は、それぞれ独自の値をとる時点についての時間軸となっている。 Figure 1 shows the processing executed by the functional components (a) to (c) (the functional blocks on the left side of Figure 1) as described above, deployed on a time axis with time progression pointing to the right. Note that the three time axes set for each of (a) to (c) above are time axes for points in time when each has its own unique value.

また、各時間軸における時点の表記であるが、例えばＤＳＬ２セル１２に係る時点(t2+1)は、ＤＳＬ２セル１２が時点t2でドメイン行動情報（動画embedding vector）を受け取った後、次にドメイン行動情報（動画embedding vector）を受け取った時点を意味するものとする。ここでその次に受け取った時点は当然、(t2+2)となる。さらに、時点(t2+1)から見て、時点t2は「前の時点」となるのである。また、この時点(t2+1)で処理を行うＤＳＬ２セル１２を、以後行う説明の便宜上、ＤＳＬ２セル１２(t2+1)と称することにする。またさらに、ＤＳＬ１セル１１やＤＩＬセル１０についても以後、同様の処理時点の表記、及び処理時点を含む表記を行うこととする。 As for the notation of time points on each time axis, for example, time point (t2+1) for DSL2 cell 12 means the time point at which DSL2 cell 12 next receives domain behavior information (video embedding vector) after receiving domain behavior information (video embedding vector) at time point t2. Here, the next reception time point is naturally (t2+2). Furthermore, from the perspective of time point (t2+1), time point t2 is the "previous time point." Furthermore, for the convenience of the following explanation, the DSL2 cell 12 that performs processing at this time point (t2+1) will be referred to as DSL2 cell 12(t2+1). Furthermore, similar notations of processing time points and notations including the processing time will be used hereafter for DSL1 cell 11 and DIL cell 10.

以下、上述した各機能構成部について具体的に説明を行う。同じく図１において、時点t1におけるアイテム分散表現抽出部１１ｉであるアイテム分散表現抽出部１１ｉ(t1)は、時点t1において「（推定対象ユーザである）ユーザUserAによってアイテムitem1が購入された」とのイベントを受けて、
（ａ）アイテムitem1のアイテム識別情報であるone-hotベクトルi1^(t1)を受け取り、
（ｂ）受け取ったone-hotベクトルi1^(t1)に対し、アイテム分散表現抽出演算子としての行列W_i ^DSL1を作用させて（積算して）、アイテムitem1のアイテム表現ベクトル（ドメイン行動情報）r_i1^(t1)を生成する。 Each of the above-mentioned functional components will be specifically described below. In FIG. 1, the item distributed expression extraction unit 11i(t1), which is the item distributed expression extraction unit 11i at time t1, receives an event that "item item1 has been purchased by user UserA (the user to be estimated)" at time t1,
(a) Receive a one-hot vector i1 ^(t1) which is the item identification information of item item1,
(b) The received one-hot vector i1 ^(t1) is applied (multiplied) by the matrix W _i ^DSL1 as an item distributed representation extraction operator to generate an item representation vector (domain behavior information) r_i1 ^(t1) for item item1.

また、時点t1におけるユーザ分散表現抽出部１１ｕであるユーザ分散表現抽出部１１ｕ(t1)は、上記（ａ）のone-hotベクトルi1^(t1)の受け取りに合わせ、
（ｃ）ユーザUserAのユーザ識別情報であるone-hotベクトルua^(t1)を受け取り、
（ｄ）受け取ったone-hotベクトルua^(t1)に対し、ユーザ分散表現抽出演算子としての行列W_u ^DSL1を作用させて（積算して）、ユーザUserAの行動ドメイン「アイテム購入」に係るユーザ表現ベクトルr_ua^(t1)を生成する。 In addition, the user distributed expression extraction unit 11u(t1), which is the user distributed expression extraction unit 11u at the time point t1, receives the one-hot vector i1 ^(t1) in (a) above,
(c) receiving a one-hot vector u a ^(t1) which is the user identification information of user UserA;
(d) The received one-hot vector ua ^(t1) is applied (multiplied) by the matrix W _u ^DSL1 as a user distributed expression extraction operator to generate a user expression vector r_ua ^(t1) related to the behavioral domain “item purchase” of user UserA.

同じく図１において、時点t1におけるＤＳＬ１セル１１であるＤＳＬ１セル１１(t1)は、
（ｅ）上記（ｂ）で生成されたアイテム表現ベクトル（ドメイン行動情報）r_i1^(t1)と、上記（ｄ）で生成されたユーザ表現ベクトルr_ua^(t1)とを受け取り、さらに、
（ｆ）（この後詳細に説明するが、）上記（ｅ）のドメイン行動情報（r_i1^(t1)）を受け取る時点t1からみて最近に（図１では時点tIに）生成されたドメイン非依存隠れ状態情報h^DIL,(tI)も受け取り、
（ｇ）前の時点(t1-1)で自ら生成したドメイン特定隠れ状態情報h^DSL1,(t1-1)に対し、受け取ったアイテム表現ベクトル（ドメイン行動情報）r_i1^(t1)、ユーザ表現ベクトルr_ua^(t1)及びドメイン非依存隠れ状態情報h^DIL,(tI)を反映させて、新たなドメイン特定隠れ状態情報h^DSL1,(t1)を生成する（この生成処理については、後に図２を用いてより具体的に説明する）。 Also in FIG. 1, DSL1 cell 11(t1) at time t1 is
(e) receiving the item expression vector (domain behavior information) r_i1 ^(t1) generated in (b) above and the user expression vector r_ua ^(t1) generated in (d) above,
(f) (as will be described in detail later) also receiving domain-independent hidden state information h DIL,(tI) generated most recently (at time tI in FIG. 1) from time t1 when the domain action information (r_i1 ^(t1 ⁾⁾ of (e) is received;
(g) Generate new domain-specific hidden state information h ^DSL1,(t1) by reflecting the received item representation vector (domain behavior information) r_i1 ^(t1) , user representation vector r_ua ^(t1) , and domain-independent hidden state information h ^DIL,(tI) to the domain-specific hidden state information h ^{DSL1,(t1-1) generated by itself at the previous point in time (t1-1)} (this generation process will be explained in more detail later using Figure 2).

以上、アイテム分散表現抽出部１１ｉ、ユーザ分散表現抽出部１１ｕ、及びＤＳＬ１セル１１における時点t1での処理について説明を行ったが、勿論他の時点での処理、例えば次の時点(t1+1)に係るアイテム分散表現抽出部１１ｉ(t1+1)、ユーザ分散表現抽出部１１ｕ(t1+1)、及びＤＳＬ１セル１１(t1+1)での処理、についても上記（ａ）～（ｇ）と同様の処理が実行される。 The above describes the processing at time t1 in the item shared representation extraction unit 11i, the user shared representation extraction unit 11u, and the DSL1 cell 11. Of course, the same processing as (a) to (g) above is also performed for processing at other times, for example, the processing at the item shared representation extraction unit 11i(t1+1), the user shared representation extraction unit 11u(t1+1), and the DSL1 cell 11(t1+1) relating to the next time (t1+1).

さらに、他の行動ドメイン（動画閲覧）に係る動画分散表現抽出部１２ｍ、ユーザ分散表現抽出部１２ｕ、及びＤＳＬ２セル１２における各時点での処理、例えば時点t2に係る動画分散表現抽出部１２ｍ(t2)、ユーザ分散表現抽出部１２ｕ(t2)、及びＤＳＬ２セル１２(t2)での処理、についても、上記（ａ）～（ｇ）と同様の処理が実行される。ただし、この場合、例えば時点t2での動画分散表現抽出部１２ｍ(t2)は、
（ｂ’）受け取った（ユーザUserAによって閲覧された動画Mov1を示す）one-hotベクトルm1^(t2)に対し、動画分散表現抽出演算子としての行列W_m ^DSL2を作用させて（積算して）、動画Mov1の動画表現ベクトル（ドメイン行動情報）r_m1^(t2)を生成する
のである。また、ＤＳＬ２セル１２(t2)は、このドメイン行動情報r_m1^(t2)を取り入れて処理を行うことになる。 Furthermore, the same processes as those described above (a) to (g) are also executed for the video dispersed expression extraction unit 12m, the user dispersed expression extraction unit 12u, and the DSL2 cell 12 related to other behavioral domains (video viewing), for example, the video dispersed expression extraction unit 12m(t2), the user dispersed expression extraction unit 12u(t2), and the DSL2 cell 12(t2) related to time t2. However, in this case, for example, the video dispersed expression extraction unit 12m(t2) at time t2,
(b') The matrix W _m ^DSL2 as a video distributed expression extraction operator is applied (accumulated) to the received one-hot vector m1 ^(t2) (indicating the video Mov1 viewed by user UserA) to generate a video expression vector (domain behavior information) r_m1 ^(t2) of the video Mov1. The DSL2 cell 12 (t2) incorporates this domain behavior information r_m1 ^(t2) and performs processing.

同じく図１において、時点tIにおけるユーザ分散表現抽出部（ユーザ表現生成部）１０ｕであるユーザ分散表現抽出部１0ｕ(tI)は、
（ｈ）ユーザUserAのユーザ識別情報であるone-hotベクトルua^(tI)を受け取り、
（ｉ）受け取ったone-hotベクトルua^(tI)に対し、ユーザ分散表現抽出演算子としての行列W_u ^DILを作用させて（積算して）、ユーザUserAの行動ドメインに依存しないユーザ表現ベクトルr_ua^(tI)を生成する。 Similarly, in FIG. 1, the user distributed expression extraction unit 10u(tI), which is the user distributed expression extraction unit (user expression generation unit) 10u at time tI,
(h) Receive a one-hot vector u a ^{(t I )} , which is the user identification information of user UserA,
(i) The received one-hot vector ua ^(tI) is applied (multiplied) by a matrix W _u ^DIL as a user distributed expression extraction operator to generate a user expression vector r_ua ^{(tI) that} is independent of the behavioral domain of user UserA.

また、時点tIにおけるＤＩＬセル１０であるＤＩＬセル１０(tI)は、
（ｊ）上記（ｉ）で生成されたユーザ表現ベクトルr_ua^(tI)を受け取り、さらに、
（ｋ）時点tIからみて最近に（図１では時点t2に）生成されたドメイン特定隠れ状態情報h^DSL2,(t2)も受け取り、
（ｌ）前の時点(tI-1)で自ら生成したドメイン非依存隠れ状態情報h^DIL,(tI-1)に対し、受け取ったユーザ表現ベクトルr_ua^(tI)及びドメイン特定隠れ状態情報h^DSL2,(t2)を反映させて、新たなドメイン非依存隠れ状態情報h^DIL,(tI)を生成する（この生成処理についても、後に図２を用いてより具体的に説明する）。 Furthermore, DIL cell 10(tI), which is the DIL cell 10 at time tI, is
(j) receiving the user expression vector r_ua ^(tI) generated in (i) above; and
(k) also receiving domain-specific hidden state information h ^{DSL2,(t 2 )} generated most recently (at time t 2 in FIG. 1 ) from time t I ;
(l) Generate new domain-independent hidden state information h ^DIL,(tI) by reflecting the received user expression vector r_ua ^(tI) and the domain-specific hidden state information h ^DSL2,(t2) on the domain-independent hidden state information h ^DIL,(tI-1) generated by itself at the previous point in time (tI-1) (this generation process will also be explained in more detail later using Figure 2).

なお、以上説明した処理（ｈ）～（ｌ）に係る時点tIは、例えば定期的に（所定時間経過毎に）設定された時点（の１つ）とすることも可能ではあるが、本実施形態ではより好適な設定として、いずれかのドメイン特定ＲＮＮセル（図１ではＤＳＬ１セル１１及びＤＳＬ２セル１２のいずれか）においてドメイン特定隠れ状態情報が生成された直後の時点となっている。 Note that the time tI related to the above-described processes (h) to (l) can be, for example, a time (one of) that is set periodically (every predetermined time has elapsed), but in this embodiment, a more preferable setting is the time immediately after domain-specific hidden state information is generated in one of the domain-specific RNN cells (in FIG. 1, either DSL1 cell 11 or DSL2 cell 12).

言い換えると、例えば以上に説明した場合においては、時点t2においてＤＳＬ２セル１２(t2)がドメイン特定隠れ状態情報h^DSL2,(t2)を生成したのを受けて、その直後（すなわち時点tI）に、上記処理（ｈ）～（ｌ）が発動するのである。またさらに、同じく図１に示したように、時点t1においてＤＳＬ１セル１１(t1)がドメイン特定隠れ状態情報h^DSL1,(t1)を生成したのを受け、その直後（時点tI+1）に、生成された直後のドメイン特定隠れ状態情報h^DSL1,(t1)を受け取って、上記処理（ｈ）～（ｌ）と同様の処理が実行されるのである。 In other words, in the case described above, for example, when the DSL2 cell 12(t2) generates the domain-specific hidden state information h ^DSL2,(t2) at time t2, the above-mentioned processes (h) to (l) are initiated immediately thereafter (i.e., at time tI).Furthermore, as also shown in FIG. 1, when the DSL1 cell 11(t1) generates the domain-specific hidden state information h ^DSL1,(t1) at time t1, the immediately-generated domain-specific hidden state information h ^DSL1,(t1) is received immediately thereafter (at time tI+1), and the same processes as the above-mentioned processes (h) to (l) are executed.

このように本実施形態では、ＤＩＬセル１０は、いずれかのドメイン特定ＲＮＮセルでドメイン特定隠れ状態情報が生成される度に、当該ドメイン特定隠れ状態情報を受け取って、「ドメイン非依存隠れ状態情報」を生成する。 In this manner, in this embodiment, each time domain-specific hidden state information is generated in any domain-specific RNN cell, the DIL cell 10 receives the domain-specific hidden state information and generates "domain-independent hidden state information."

ここで図１において、推定対象であるユーザUserAは、「(時点t2で)動画Mov1を閲覧」→「(時点t1で)アイテムitem1を購入」→「(時点t2+1で)動画Mov2を閲覧」をこの順で行っている。ＤＩＬセル１０は、これらの行動に係る表現ベクトル（ドメイン行動情報）を順次受け取ったドメイン特定ＲＮＮセルで生成されたドメイン特定隠れ状態情報を順次受け取り、これにより、これらのドメイン特定隠れ状態情報の生成（受け取り）順序や前後関係の情報も加味されており、特定の行動ドメインでの行動内容だけに依存することのない（ドメイン非依存化した）「ドメイン非依存隠れ状態」を生成することが可能となるのである。 In FIG. 1, the user UserA, who is the subject of estimation, performs the following actions in this order: "viewing video Mov1 (at time t2)" → "purchasing item item1 (at time t1)" → "viewing video Mov2 (at time t2+1)". The DIL cell 10 sequentially receives domain-specific hidden state information generated by the domain-specific RNN cell which sequentially receives the expression vectors (domain behavior information) related to these actions. This takes into account the order in which these domain-specific hidden state information was generated (received) and the contextual information, making it possible to generate "domain-independent hidden states" that are not dependent solely on the content of actions in a specific behavioral domain (domain-independent).

さらに本実施形態では、上記（ｆ）で述べたように、このような特徴的な「ドメイン非依存隠れ状態」が適宜、ドメイン特定ＲＮＮセル（ＤＳＬ１セル１１やＤＳＬ２セル１２）への入力として用いられる。その結果、後にＤＩＬセル１０が受け取るドメイン特定隠れ状態情報も、上述したような特徴が反映されたものとなり、その結果、「ドメイン非依存隠れ状態」のドメイン非依存化が促進する。言い換えると、ＤＩＬセル１０はさらに、特定の行動ドメインに依存しない、推定対象ユーザの全体的な属性を反映した情報処理を実行することができるようになるのである。 Furthermore, in this embodiment, as described in (f) above, such characteristic "domain-independent hidden states" are appropriately used as inputs to domain-specific RNN cells (DSL1 cell 11 and DSL2 cell 12). As a result, the domain-specific hidden state information later received by DIL cell 10 also reflects the above-mentioned characteristics, which promotes the domain independence of the "domain-independent hidden states." In other words, DIL cell 10 can further perform information processing that reflects the overall attributes of the estimated target user, independent of a specific behavioral domain.

図２は、本発明に係るドメイン特定ＲＮＮセル及びドメイン非依存ＲＮＮセルにおける内部構成の一実施形態を示す機能ブロック図である。 Figure 2 is a functional block diagram showing one embodiment of the internal configuration of a domain-specific RNN cell and a domain-independent RNN cell according to the present invention.

最初に図２（Ａ）には、時点tpでのドメイン特定ＲＮＮセルであるＤＳＬ(p)セル(１０+p)(tp)（p＝1, 2, ・・・）の内部構成が示されている。具体的に、ＤＳＬ(p)セル(１０+p)(tp)は、ＲＮＮを構成可能な公知のＧＲＵ（Gated Recurrent Unit）を採用した構成となっており、以下に示す機能構成部（ａ）～（ｆ）を備えている。
（ａ）[Ｔ]：複数の入力ベクトルに重み行列を積算し、さらに積算結果にバイアスベクトルを加算して、得られたベクトルを出力する。なお、図２（Ａ）における２つの[Ｔ]はそれぞれ独立した（互いに異なる）重み行列及びバイアスベクトルを有する。
（ｂ）[σ]：入力ベクトルの各要素をsigmoid関数へ入力し、得られたベクトルを出力する。図２（Ａ）における２つの[σ]は、それぞれリセットゲートr及び更新ゲートuとして機能する。
（ｃ）[○の中に・]：２つの入力ベクトルの要素同士を乗じ、得られたベクトルを出力する。
（ｄ）[－１]：入力ベクトルの各要素に－1を乗じ、得られたベクトルを出力する。
（ｅ）[tanh]：入力ベクトルの各要素をtanh関数へ入力し、得られたベクトル（隠れ状態候補k）を出力する。
（ｆ）［＋］：２つの入力ベクトルの要素同士を足し合わせ、得られたベクトルを出力する。 2A shows the internal configuration of a DSL(p) cell (10+p)(tp) (p = 1, 2, ...), which is a domain-specific RNN cell at time tp. Specifically, the DSL(p) cell (10+p)(tp) is configured using a known GRU (Gated Recurrent Unit) capable of configuring an RNN, and includes the following functional components (a) to (f).
(a) [T]: Multiply multiple input vectors by a weight matrix, add a bias vector to the multiplication result, and output the resulting vector. Note that the two [T] in Fig. 2(A) each have an independent (different) weight matrix and bias vector.
(b) [σ]: Each element of the input vector is input to a sigmoid function, and the resulting vector is output. The two [σ] in Fig. 2(A) function as a reset gate r and an update gate u, respectively.
(c) [• in a circle]: Multiply the elements of two input vectors and output the resulting vector.
(d) [-1]: Multiply each element of the input vector by -1 and output the resulting vector.
(e) [tanh]: Input each element of the input vector into the tanh function, and output the resulting vector (hidden state candidate k).
(f) [+]: Add the elements of two input vectors together and output the resulting vector.

ここで、更新ゲートu（一方の[σ]）は、入力されたr_ua^(tp)、r_ip^(tp)や、h^DIL,(tI-1)を変換した情報に基づき、１つ前の（最後の）ドメイン特定隠れ状態情報h^DSLp,(tp-1)からどれだけの情報を保持し取り込むかを決定する。また、リセットゲートr（他方の[σ]）は、１つ前の（最後の）ドメイン特定隠れ状態情報h^DSLp,(tp-1)から捨てるべき情報を決定する。ＤＳＬ(p)セル(１０+p)(tp)は、このような処理によって、対応する行動ドメインに係る行動の時系列データとなっているドメイン行動情報における動的な（時間に依存する）情報を反映した「ドメイン特定隠れ状態情報」を生成することが可能となるのである。 Here, the update gate u (one side [σ]) decides how much information to retain and take in from the previous ( ^last ) domain-specific hidden state information h ^DSLp,(tp-1) based on the information obtained by converting the input r_ua ^{(tp), r_ip(tp)} and h ^DIL,(tI-1) . Also, the reset gate r (the other side [σ]) decides how much information to discard from the previous (last) domain-specific hidden state information h ^DSLp,(tp-1) . Through such processing, the DSL(p) cell (10+p)(tp) can generate "domain-specific hidden state information" that reflects the dynamic (time-dependent) information in the domain behavior information, which is the time-series data of behavior related to the corresponding behavior domain.

次いで図２（Ｂ）には、時点tIでのドメイン非依存ＲＮＮセルであるＤＩＬセル１０(tI)の内部構成が示されている。このＤＩＬセル１０(tI)は、上述したＤＳＬ(p)セル(１０+p)(tp)と同様の構成となっており、具体的には上記機能構成部（ａ）～（ｆ）を含むＧＲＵを採用した構成となっている。 Next, FIG. 2(B) shows the internal configuration of DIL cell 10(tI), which is a domain-independent RNN cell at time tI. This DIL cell 10(tI) has a configuration similar to the above-mentioned DSL(p) cell (10+p)(tp), and more specifically, it is configured to employ a GRU that includes the above-mentioned functional components (a) to (f).

これにより、ＤＩＬセル１０(tI)は、複数のドメイン行動情報における動的な（時間に依存する）情報を反映した複数のドメイン特定隠れ状態情報を取り込んでドメイン非依存化した、推定対象ユーザの属性に関する動的な（時間に依存する）情報となっている「ドメイン非依存隠れ状態情報」を生成することが可能となるのである。 This enables the DIL cell 10(tI) to generate "domain-independent hidden state information" that is dynamic (time-dependent) information about the attributes of the estimated target user by incorporating multiple domain-specific hidden state information that reflects dynamic (time-dependent) information in multiple domain behavior information and making it domain-independent.

なお、図２（Ａ）に示したドメイン特定ＲＮＮセル（ＤＳＬ(p)セル(１０+p)(tp)）においても、図２（Ｂ）に示したドメイン非依存ＲＮＮセル（ＤＩＬセル１０(tI)）においても、内部構成としてＧＲＵ以外の様々なＲＮＮユニットを採用することが可能であり、例えば、入力ゲート、出力ゲート及び忘却ゲートを備えたＬＳＴＭ（Long-Short Term Memory）を採用してもよい。いずれにしても、パーソナリティ情報生成モデル１における複数のドメイン特定ＲＮＮセル及びドメイン非依存ＲＮＮセルの各々については、新たな隠れ状態情報を生成するべくセルに入力される情報に対し所定の処理を施す忘却ゲート、更新ゲート及びリセットゲートのうちの少なくとも１つを備えた構成とすることも好ましいのである。 In addition, in both the domain-specific RNN cell (DSL(p) cell (10+p)(tp)) shown in FIG. 2(A) and the domain-independent RNN cell (DIL cell 10(tI)) shown in FIG. 2(B), various RNN units other than GRUs can be adopted as the internal configuration, for example, a long-short term memory (LSTM) equipped with an input gate, an output gate, and a forget gate may be adopted. In any case, it is also preferable that each of the multiple domain-specific RNN cells and domain-independent RNN cells in the personality information generation model 1 is configured to have at least one of a forget gate, an update gate, and a reset gate that perform a predetermined process on the information input to the cell to generate new hidden state information.

＜パーソナリティ情報生成モデルの訓練＞
図１に戻って、ここでパーソナリティ情報生成モデル１の訓練（トレーニング）について説明を行う。 <Training the personality information generation model>
Returning to FIG. 1, the training of the personality information generation model 1 will now be described.

最初に、ＤＳＬ１セル１１、アイテム分散表現抽出部１１ｉ及びユーザ分散表現抽出部１１ｕの訓練（トレーニング）においては、多数のユーザについて各ユーザが購入したアイテムの識別情報であるone-hotベクトルの時系列データを準備し、これを順次アイテム分散表現抽出部１１ｉへ入力する。また合わせて、当該ユーザのユーザ識別情報であるone-hotベクトルをユーザ分散表現抽出部１１ｕへ入力する。これにより、ＤＳＬ１セル１１は、各時点において前の時点でのドメイン特定隠れ状態情報を更新して当該時点でのドメイン特定隠れ状態情報を生成し、これを受けて出力層１１ｏが、購入アイテムの予測結果（次の時点に当該ユーザが購入すると予測されるアイテムの識別情報、又は各候補アイテムの購入される確率）を出力する（本実施形態において、出力層１１ｏはそのような出力を行うように設定されている）。 First, in training the DSL1 cell 11, the item distributed representation extraction unit 11i, and the user distributed representation extraction unit 11u, time series data of one-hot vectors, which are identification information of items purchased by each user for a large number of users, are prepared and input sequentially to the item distributed representation extraction unit 11i. In addition, the one-hot vector, which is the user identification information of the user, is input to the user distributed representation extraction unit 11u. As a result, the DSL1 cell 11 updates the domain-specific hidden state information at the previous time point at each time point to generate domain-specific hidden state information at the current time point, and in response, the output layer 11o outputs a prediction result of the purchased item (identification information of the item predicted to be purchased by the user at the next time point, or the probability of each candidate item being purchased) (in this embodiment, the output layer 11o is configured to perform such output).

ここで、出力されたアイテムの予測結果と正解のアイテムの識別情報（実際に購入されたアイテムのone-hotベクトル）との差異を損失とし、公知のＲＮＮの誤差逆伝播法を用いて、ＤＳＬ１セル１１内の各パラメータ（[Ｔ]の重み行列やバイアスベクトルを決めるパラメータ等）や、行列W_i ^DSL1及び行列W_u ^DSL1を決めるパラメータを調整する訓練を行うのである。 Here, the difference between the output item prediction result and the correct item identification information (the one-hot vector of the item actually purchased) is considered as the loss, and training is performed using the well-known RNN error backpropagation method to adjust each parameter in DSL1 cell 11 (such as parameters that determine the weight matrix and bias vector of [T]) and the parameters that determine the matrix W _i ^DSL1 and matrix W _u ^DSL1 .

なお、ＤＳＬ２セル１２、動画分散表現抽出部１２ｍ及びユーザ分散表現抽出部１２ｕの訓練についても、上記のＤＳＬ１セル１１等の訓練と同様にして実施することができる。具体的にこの場合、出力層１２ｏから出力された閲覧動画の予測結果（次の時点に当該ユーザが閲覧すると予測される動画の識別情報、又は各候補動画の閲覧される確率）と正解の動画の識別情報（実際に閲覧された動画のone-hotベクトル）との差異を損失とし、公知のＲＮＮの誤差逆伝播法を用いて、ＤＳＬ２セル１２内の各パラメータ（[Ｔ]の重み行列やバイアスベクトルを決めるパラメータ等）や、行列W_m ^DSL2及び行列W_u ^DSL2を決めるパラメータを調整する訓練を行うのである。さらに、３つ目又はそれ以降のドメイン特定ＲＮＮセル系が設定されている場合も、同様にして訓練することが可能である。 Training of the DSL2 cell 12, the video distributed expression extraction unit 12m, and the user distributed expression extraction unit 12u can be performed in the same manner as the training of the DSL1 cell 11, etc. Specifically, in this case, the difference between the prediction result of the viewed video output from the output layer 12o (identification information of the video predicted to be viewed by the user at the next time point, or the viewing probability of each candidate video) and the correct video identification information (one-hot vector of the video actually viewed) is treated as a loss, and training is performed to adjust each parameter in the DSL2 cell 12 (parameters determining the weight matrix and bias vector of [T], etc.) and parameters determining the matrix W _m ^DSL2 and matrix W _u ^DSL2 using a known RNN backpropagation method. Furthermore, training can be performed in the same manner when a third or subsequent domain-specific RNN cell system is set.

次いで、ＤＩＬセル１０及びユーザ分散表現抽出部１０ｕの訓練においては、各ドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２の各々）における上述した損失を、各ドメイン特定ＲＮＮセルとＤＩＬセル１０との間のリンクを通して、ＤＩＬセル１０及びユーザ分散表現抽出部１０ｕへ逆伝播させて訓練を行う。例えば、図１において、ＤＳＬ１セル１１での時点t1までの入力に基づく購入アイテムの予測結果と正解アイテムの識別情報との差異である損失は、ドメイン非依存隠れ状態情報h^DIL,(tI)の伝達されるリンクを通して逆伝播し、ＤＩＬセル１０内の各パラメータ（[Ｔ]の重み行列やバイアスベクトルを決めるパラメータ等）や、ユーザ分散表現抽出部１０ｕの行列W_u ^DILを決めるパラメータの訓練に用いられるのである。 Next, in training the DIL cell 10 and the user distributed expression extraction unit 10u, the above-mentioned loss in each domain-specific RNN cell (each of the DSL1 cell 11 and the DSL2 cell 12) is back-propagated to the DIL cell 10 and the user distributed expression extraction unit 10u through the link between each domain-specific RNN cell and the DIL cell 10 to perform training. For example, in Fig. 1, the loss, which is the difference between the predicted result of the purchase item based on the input up to time t1 in the DSL1 cell 11 and the identification information of the correct item, is back-propagated through the link that transmits the domain-independent hidden state information ^hDIL,(tI) and is used to train each parameter in the DIL cell 10 (parameters that determine the weight matrix and bias vector of [T], etc.) and parameters that determine the ^matrix _WuDIL of the user distributed expression extraction unit 10u.

すなわち本実施形態において、ＤＩＬセル１０及びユーザ分散表現抽出部１０ｕの訓練は、ドメイン非依存ＲＮＮセル独自の教師データを用いることなく、ドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２）からの誤差逆伝播のみで実施されるのである。 In other words, in this embodiment, training of the DIL cell 10 and the user distributed expression extraction unit 10u is performed only by backpropagation of errors from the domain-specific RNN cells (DSL1 cell 11 and DSL2 cell 12) without using teacher data unique to the domain-independent RNN cell.

図３は、本発明によるパーソナリティ情報生成モデルで生成される各種情報の特徴を説明するための模式図、及び本発明によるパーソナリティ推定装置の一実施形態を示す機能ブロック図である。 Figure 3 is a schematic diagram for explaining the characteristics of various pieces of information generated by the personality information generation model according to the present invention, and a functional block diagram showing one embodiment of a personality estimation device according to the present invention.

図３の上方に示したように、パーソナリティを含む広い意味でのユーザの属性は、
（ａ）動的な（時間に依存する、又は時間変化し易い）属性か、静的な（時間に依存しない、又は時間変化し難い）属性か、及び
（ｂ）ドメイン依存的な（特定の行動ドメインにのみ影響を与える、又は特定の行動ドメインに係るものである）属性か、ドメイン非依存的な（様々な行動ドメインの行動に影響し得る、又は特定の行動ドメインにのみ係るものではない）属性か
といった観点から、４つのグループに分類することができる。具体的に図３において、当該４つのグループはそれぞれ、上記（ａ）に係る横軸と上記（ｂ）に係る縦軸とで構成されるユーザ属性グラフにおける第１～第４象限に係るグループとなっている。 As shown in the upper part of Figure 3, user attributes in a broad sense, including personality, are as follows:
They can be classified into four groups from the viewpoint of whether they are (a) dynamic (time-dependent or easily changed over time) or static (time-independent or not easily changed over time) and (b) domain-dependent (affecting only a specific behavioral domain or relating to a specific behavioral domain) or domain-independent (affecting behaviors in various behavioral domains or not relating to only a specific behavioral domain). Specifically, in FIG. 3, the four groups are groups related to the first to fourth quadrants of a user attribute graph composed of the horizontal axis related to (a) and the vertical axis related to (b).

ここでユーザ属性の１つであるパーソナリティは、静的であって且つドメイン非依存であるので、第１象限に係るグループに属することになる。したがって、あるユーザについての行動ドメインに係る時系列データを用いて、当該ユーザのパーソナリティを推定する場合には、その推定の過程で、ユーザ属性情報からこの第１象限に相当する成分のみを分離抽出する必要が生じる。しかしながら、例えば上述した従来技術としてのSequential URL技術によっても、このような分離抽出を行うことはできなかった。 Here, personality, which is one of the user attributes, is static and domain-independent, and therefore belongs to the group related to the first quadrant. Therefore, when estimating the personality of a certain user using time-series data related to the behavioral domain of that user, it becomes necessary to separate and extract only the components corresponding to this first quadrant from the user attribute information during the estimation process. However, such separation and extraction could not be performed, for example, even with the Sequential URL technology described above as a conventional technology.

これに対し、パーソナリティ情報生成モデル１における学習済み（訓練済み）のＤＩＬセル１０及びユーザ分散表現抽出部１０ｕは、ＤＩＬセル１０が複数のドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２）からドメイン特定隠れ状態情報を適宜受け取り、さらに各ドメイン特定ＲＮＮセルへ適宜ドメイン非依存隠れ状態情報を与えてきた過程でドメイン非依存化しており、このうち特に、ユーザ分散表現抽出部１０ｕ（の行列W_u ^DIL）は、ユーザの識別情報を受けて当該ユーザの静的な属性に係る情報を出力するようになっている。 In contrast, the learned (trained) DIL cell 10 and user distributed expression extraction unit 10u in the personality information generation model 1 have been made domain-independent in the process in which the DIL cell 10 appropriately receives domain-specific hidden state information from multiple domain-specific RNN cells (DSL1 cell 11 and DSL2 cell 12) and further appropriately provides domain-independent hidden state information to each domain-specific RNN cell. In particular, the user distributed expression extraction unit 10u (its matrix W _u ^DIL ) is configured to receive user identification information and output information related to the static attributes of the user.

その結果、ユーザ分散表現抽出部１０ｕ（の行列W_u ^DIL）の出力するユーザ表現ベクトルr_uaは、静的且つドメイン非依存に対応する第１象限に属するユーザ属性の情報となり、したがって、「パーソナリティ情報」であるパーソナリティ表現ベクトル（personality embedding vector）とみなすことができるのである。ここで勿論、ユーザ分散表現抽出部１０ｕは、生成したパーソナリティ表現ベクトル（personality embedding vector）を、「パーソナリティ情報」としてモデルの外部へ出力してもよい。さらに、この学習済み（訓練済み）のユーザ分散表現抽出部１０ｕ（の行列W_u ^DIL）を取り出し、パーソナリティ表現抽出器として利用することも可能である。 As a result, the user expression vector r_ua output by the user distributed expression extraction unit 10u (its matrix _Wu ^DIL ) becomes information on user attributes belonging to the first quadrant, which is static and domain-independent, and can therefore be regarded as a personality expression vector (personality embedding vector), which is "personality information". Here, the user distributed expression extraction unit 10u may of course output the generated personality expression vector (personality embedding vector) to the outside of the model as "personality information". Furthermore, it is also possible to take out this learned (trained) user distributed expression extraction unit 10u (its matrix _Wu ^DIL ) and use it as a personality expression extractor.

また、学習済み（訓練済み）のＤＩＬセル１０は、上述したようにドメイン非依存化しているので、このＤＩＬセル１０から出力されるドメイン非依存隠れ状態情報h^DILは、行動ドメイン全般に影響し得る又は特定の行動ドメインに依存しない、処理時点に係るユーザ属性を反映した情報となっている。すなわち、このドメイン非依存隠れ状態情報h^DILは、動的且つドメイン非依存に対応する第２象限に属するユーザ属性の情報となり、したがって、ユーザの動的な心理状態に係る情報である動的心理状態情報とみなすことができるのである。ここで勿論、ＤＩＬセル１０は、生成したメイン非依存隠れ状態情報h^DILを、動的心理状態情報としてモデルの外部へ出力してもよい。 In addition, since the learned (trained) DIL cell 10 is domain-independent as described above, the domain-independent hidden state information h ^DIL output from this DIL cell 10 is information reflecting user attributes related to the processing time that may affect the behavioral domain in general or is independent of a specific behavioral domain. In other words, this domain-independent hidden state information h ^DIL is information of user attributes belonging to the second quadrant corresponding to dynamic and domain independence, and therefore can be regarded as dynamic psychological state information that is information related to the dynamic psychological state of the user. Here, the DIL cell 10 may of course output the generated main-independent hidden state information h ^DIL to the outside of the model as dynamic psychological state information.

ちなみに、この動的心理状態情報には、例えばユーザの疲労度、感情状態や、ムード(忙しい／暇である、気分が良い／悪い等)が該当する。ＤＩＬセル１０は、このような動的に変化し且つ様々な行動ドメインの行動に影響し得るユーザ属性を生成することも可能となっているのである。ここで、所定期間において複数生成された時系列データとしての動的心理状態情報は、当該所定期間におけるユーザの動的心理状態の変遷を表す情報と捉えることもできるのである。 Incidentally, this dynamic psychological state information includes, for example, the user's fatigue level, emotional state, and mood (busy/bored, feeling good/bad, etc.). The DIL cell 10 is also capable of generating user attributes that change dynamically and can affect behavior in various behavioral domains. Here, the dynamic psychological state information generated multiple times over a given period as time-series data can also be considered as information that represents the changes in the user's dynamic psychological state over that given period.

また、以上説明したことをまとめると、パーソナリティ情報生成モデル１は、ＤＩＬセル１０及び複数のドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２）を、互いに隠れ状態情報をやり取りさせつつ稼働させることによって、ドメイン依存の（第３及び第４象限の）ユーザ属性と、ドメイン非依存の（第１及び第２象限の）ユーザ属性とを分離していることが理解されるのである。 To summarize the above explanation, it can be understood that the personality information generation model 1 separates domain-dependent (third and fourth quadrants) user attributes from domain-independent (first and second quadrants) user attributes by operating the DIL cell 10 and multiple domain-specific RNN cells (DSL1 cell 11 and DSL2 cell 12) while exchanging hidden state information with each other.

さらにそのやり取りにおいて、自身の生成した隠れ状態情報を、その順序や前後関係も反映した形で（すなわち各種行動発生の時系列性も反映した形で）相手のセルに取り込ませることによって、結局、ＤＩＬセル１０とそれに繋がったユーザ分散表現抽出部１０ｕとの間における動的な（第２象限の）ユーザ属性と、静的な（第１象限の）ユーザ属性との分離がより促進されるようになることも理解される。 Furthermore, in this exchange, by incorporating the hidden state information generated by oneself into the other cell in a manner that reflects its order and context (i.e., in a manner that reflects the chronological order of the occurrence of various actions), it is understood that this ultimately promotes the separation of dynamic (second quadrant) user attributes from static (first quadrant) user attributes between the DIL cell 10 and the user distributed expression extraction unit 10u connected to it.

ちなみに、各ドメイン特定ＲＮＮセル（ＤＳＬ１セル１１及びＤＳＬ２セル１２の各々）が出力するドメイン特定隠れ状態情報は、該当する行動ドメインの行動についての動的な（時間に依存する、又は時間変化し易い）ユーザ属性を反映した情報となっており、すなわち第３象限のユーザ属性に係る情報であると理解される。ここで具体的に、この第３象限のユーザ属性は、アイテム購入履歴や動画閲覧履歴等に係る心理状態情報、例えば「見続けた特定ジャンルの映画に飽きてしまったという感情」や「別のジャンルの映画を見たいとの感情」、さらには「見続けた映画の中で気に入った俳優が出演している他の映画に興味を抱くという感情」といったような、ユーザの経た履歴故に生じる心理状態の情報と捉えることができる。 Incidentally, the domain-specific hidden state information output by each domain-specific RNN cell (each of DSL1 cell 11 and DSL2 cell 12) is information reflecting dynamic (time-dependent or time-varying) user attributes for the behavior of the corresponding behavioral domain, that is, it is understood to be information related to the user attributes in the third quadrant. Specifically, the user attributes in the third quadrant can be understood as psychological state information related to item purchase history, video viewing history, etc., such as psychological state information arising from the user's history, such as "the feeling of being bored with a certain genre of movies that you have been watching", "the feeling of wanting to watch movies of a different genre", and even "the feeling of becoming interested in other movies that feature a favorite actor from among the movies you have been watching".

また、合わせて訓練されたユーザ分散表現抽出部（１１ｕ，１２ｕ）が出力するユーザ表現ベクトルr_uaは、該当する行動ドメインの行動についての静的な（時間に依存しない、又は時間変化し難い）ユーザ属性を反映した情報となっており、すなわち第４象限のユーザ属性（例えば購入アイテムの傾向や動画の好み）に係る情報であると理解されるのである。 The user expression vector r_ua output by the user distributed representation extraction unit (11u, 12u) that has been trained in conjunction with the above is information that reflects static (time-independent or time-unchanging) user attributes for the behavior in the corresponding behavioral domain, i.e., it is understood to be information related to user attributes in the fourth quadrant (e.g., tendencies in purchasing items and video preferences).

ここで勿論、このような第３象限のユーザ属性に係る情報も、第４象限のユーザ属性に係る情報も、モデル外に出力されて、様々な場面で（例えば、該当する行動ドメインに関係する企業でのマーケティング活動において）利用されることも可能となる。 Of course, this information relating to user attributes in both the third and fourth quadrants can be output outside the model and used in various situations (for example, in marketing activities at companies related to the corresponding behavioral domain).

［モデルの他の実施形態］
図４は、本発明によるパーソナリティ情報生成モデルの他の実施形態を示す模式図である。 Other embodiments of the model
FIG. 4 is a schematic diagram showing another embodiment of a personality information generation model according to the present invention.

図４に示すように、本実施形態のパーソナリティ情報生成モデル１’は、
（ａ１）ＤＳＬ１セル１１から出力されたドメイン特定隠れ状態情報h^DSL1に対し所定の形の変換処理（f1out）を施し、変換処理（f1out）の施されたドメイン特定隠れ状態情報f1out(h^DSL1)をＤＩＬセル１０へ入力する、第１の変換処理部としての変換処理部１１f1outと、
（ａ２）ＤＳＬ２セル１２から出力されたドメイン特定隠れ状態情報h^DSL2に対し所定の形の変換処理（f2out）を施し、変換処理（f2out）の施されたドメイン特定隠れ状態情報f2out(h^DSL2)をＤＩＬセル１０へ入力する、第１の変換処理部としての変換処理部１２f2outと
を備えている。 As shown in FIG. 4, the personality information generation model 1′ of this embodiment is as follows:
(a1) a conversion processing unit 11f1out as a first conversion processing unit, which performs a predetermined type of conversion processing (f1out) on the domain-specific hidden state information h ^DSL1 output from the DSL1 cell 11, and inputs the domain-specific hidden state information f1out(h ^DSL1 ) that has been subjected to the conversion processing (f1out) to the DIL cell 10;
(a2) A conversion processing unit 12f2out as a first conversion processing unit is provided, which performs a predetermined form of conversion processing (f2out) on the domain-specific hidden state information h ^DSL2 output from the DSL2 cell 12, and inputs the domain-specific hidden state information f2out(h ^DSL2 ) that has been subjected to the conversion processing (f2out) to the DIL cell 10.

ちなみに図４では、変換処理部１１f1out及び変換処理部１２f2outがそれぞれ、時点t1及び時点(t2+1)に展開された場合の状況が示されている。また、このような第１の変換処理部は、設定されたドメイン特定ＲＮＮセルの数（図４では２つ）だけ用意されることも好ましい。 Incidentally, FIG. 4 shows the situation when the conversion processing unit 11f1out and the conversion processing unit 12f2out are deployed at time t1 and time (t2+1), respectively. It is also preferable to prepare as many such first conversion processing units as there are configured domain-specific RNN cells (two in FIG. 4).

さらに、各変換処理（f1out，f2out）は例えば、ドメイン特定隠れ状態情報に対し重み行列を積算する処理であってもよく、それにバイアスベクトルを更に足し合わせる処理であってもよい。またさらに、それをsigmoid関数、tanh関数や、ReLu関数等へ入力して得られた値を出力する処理とすることも可能である。いずれにしても、第１の変換処理部（変換処理部１１f1out及び変換処理部１２f2out）を設定してこのような変換処理を実行することによって、各行動ドメインにおけるユーザの行動がドメイン非依存の動的なユーザ属性（図３における第２象限のユーザ属性）に与える影響を調整することができるのである。 Furthermore, each conversion process (f1out, f2out) may be, for example, a process of multiplying a weight matrix by domain-specific hidden state information, or may be a process of further adding a bias vector to it. Furthermore, it is also possible to input it into a sigmoid function, a tanh function, a ReLu function, or the like, and output the obtained value. In any case, by setting the first conversion processing unit (conversion processing unit 11f1out and conversion processing unit 12f2out) and executing such a conversion process, it is possible to adjust the influence that the user's behavior in each behavior domain has on the domain-independent dynamic user attributes (user attributes in the second quadrant in Figure 3).

ここで、このドメイン非依存の動的なユーザ属性は、疲労度、眠気、感情や、ムードといったような動的心理特性となるが、このような特性に対し各行動ドメインにおけるユーザの行動がどの程度の影響を与えるかは、行動ドメイン毎に異なってくると考えらえる。例えば、日用品のショッピングサイトにおける購買行動よりも、動画配信サービスにおける動画閲覧行動のほうが、疲労度、感情や、ムードに与える影響は大きいと考えられる。したがって、ドメイン特定ＲＮＮセル毎に（異なる変換処理を実行する）第１の変換処理部を設定することで、このような影響の調整が可能となるのである。 Here, these domain-independent dynamic user attributes are dynamic psychological characteristics such as fatigue level, drowsiness, emotions, and mood, and the extent to which user behavior in each behavioral domain affects such characteristics is thought to differ for each behavioral domain. For example, video viewing behavior on a video distribution service is thought to have a greater impact on fatigue level, emotions, and mood than purchasing behavior on a shopping site for daily necessities. Therefore, by setting a first conversion processing unit (which executes different conversion processing) for each domain-specific RNN cell, it is possible to adjust such influences.

また、本実施形態のパーソナリティ情報生成モデル１’は、さらに、
（ｂ１）ＤＩＬセル１０から出力されたドメイン非依存隠れ状態情報h^DILに対し所定の形の変換処理（f1in）を施し、変換処理（f1in）の施されたドメイン非依存隠れ状態情報f1in(h^DIL)をＤＳＬ１セル１１へ入力する、第２の変換処理部としての変換処理部１１f1inと、
（ｂ２）ＤＩＬセル１０から出力されたドメイン非依存隠れ状態情報h^DILに対し所定の形の変換処理（f2in）を施し、変換処理（f2in）の施されたドメイン非依存隠れ状態情報f2in(h^DIL)をＤＳＬ２セル１２へ入力する、第２の変換処理部としての変換処理部１２f2inと、
を備えている。 In addition, the personality information generation model 1′ of this embodiment further includes:
(b1) a conversion processing unit 11f1in as a second conversion processing unit, which performs a predetermined type of conversion processing (f1in) on the domain-independent hidden state information h ^DIL output from the DIL cell 10, and inputs the domain-independent hidden state information f1in(h ^DIL ) subjected to the conversion processing (f1in) to the DSL1 cell 11;
(b2) a conversion processing unit 12f2in as a second conversion processing unit, which performs a predetermined type of conversion processing (f2in) on the domain-independent hidden state information h ^DIL output from the DIL cell 10, and inputs the domain-independent hidden state information f2in(h ^DIL ) subjected to the conversion processing (f2in) to the DSL2 cell 12;
It is equipped with:

ちなみに図４では、変換処理部１１f1in及び変換処理部１２f2inについてもそれぞれ、時点t1及び時点(t2+1)に展開された場合の状況が示されている。また、このような第２の変換処理部についても、設定されたドメイン特定ＲＮＮセルの数（図４では２つ）だけ用意されることも好ましい。 Incidentally, FIG. 4 also shows the situation when the conversion processing unit 11f1in and the conversion processing unit 12f2in are deployed at time t1 and time (t2+1), respectively. It is also preferable to prepare as many such second conversion processing units as there are configured domain-specific RNN cells (two in FIG. 4).

さらに各変換処理（f1in，f2in）も、第１変換処理部における変換処理（f1inやf2in）と同様の形の処理とすることができる。いずれにしても、第２の変換処理部（変換処理部１１f1in及び変換処理部１２f2in）を設定してこのような変換処理を実行することによっても、結局、行動ドメインに依存しないユーザ属性が、各行動ドメインに依存する動的なユーザ属性（図３における第３象限のユーザ属性，動的心理状態情報）に与える影響を調整することが可能となるのである。 Furthermore, each conversion process (f1in, f2in) can be a process similar to the conversion process (f1in and f2in) in the first conversion processing unit. In any case, by setting the second conversion processing unit (conversion processing unit 11f1in and conversion processing unit 12f2in) and executing such a conversion process, it is possible to adjust the influence of user attributes that are independent of the behavioral domain on dynamic user attributes that are dependent on each behavioral domain (user attributes in the third quadrant in Figure 3, dynamic psychological state information).

また、以上に説明した第１の変換処理部も第２の変換処理部も、ＤＳＬ１セル１１やＤＳＬ２セル１２と合わせて訓練されることも好ましい。これにより、上述したような各行動ドメインの行動が及ぼす影響の調整を、より確実に実施することが担保される。 It is also preferable that the first conversion processing unit and the second conversion processing unit described above are trained together with the DSL1 cell 11 and the DSL2 cell 12. This ensures that the effects of the behavior of each behavioral domain as described above can be adjusted more reliably.

［パーソナリティ推定装置，パーソナリティ推定プログラム］
以下、図３に戻って、以上に説明したようなパーソナリティ情報生成モデル１又はパーソナリティ情報生成モデル１’（図４）を搭載しており、推定対象ユーザのパーソナリティを推定するパーソナリティ推定装置９について説明する。ちなみに以下、本装置はパーソナリティ情報生成モデル１を搭載したものとして説明を行う。 [Personality estimation device, personality estimation program]
3, a personality estimation device 9 that is equipped with the personality information generation model 1 or the personality information generation model 1' (FIG. 4) as described above and estimates the personality of a user to be estimated will be described below. Incidentally, the following description will be given assuming that this device is equipped with the personality information generation model 1.

図３の下部に示した本実施形態のパーソナリティ推定装置９は、搭載したパーソナリティ情報生成モデル１を訓練して得られた学習済みのユーザ分散表現抽出部１０ｕ（ユーザ表現生成部）を用いて、推定対象ユーザの「パーソナリティ情報」を生成し、出力する。 The personality estimation device 9 of this embodiment shown in the lower part of Figure 3 generates and outputs "personality information" of the user to be estimated using a learned user distributed expression extraction unit 10u (user expression generation unit) obtained by training the installed personality information generation model 1.

具体的に図３において、パーソナリティ推定装置９の入力部９１は、通信機能を備えていて、例えば外部に設置された行動ドメイン関連の管理サーバ（例えばウェブショッピング管理サーバや動画配信管理サーバ）から、多数のユーザにおける各行動ドメインでの行動に係る時系列情報を取得し、訓練部９２へ出力する。また、推定対象ユーザに係る情報を受け取り、パーソナリティ推定部９３へ出力する。 Specifically, in FIG. 3, the input unit 91 of the personality estimation device 9 has a communication function, and acquires time-series information on the behavior of a large number of users in each behavioral domain from, for example, an externally installed behavioral domain-related management server (e.g., a web shopping management server or a video distribution management server), and outputs the information to the training unit 92. It also receives information on the estimation target user, and outputs the information to the personality estimation unit 93.

訓練部９２は、受け取った多数のユーザにおける各行動ドメインでの行動に係る時系列情報から教師データを生成し、これを用いてパーソナリティ情報生成モデル１の訓練を実施する。 The training unit 92 generates teacher data from the received time-series information relating to the behavior of the multiple users in each behavioral domain, and uses this to train the personality information generation model 1.

パーソナリティ推定部９３は、受け取った推定対象ユーザに係る情報から、推定対象ユーザのユーザ識別情報（one-hotベクトル）を生成して、これを、訓練を受けたパーソナリティ情報生成モデル１における学習済みのユーザ分散表現抽出部１０ｕ（ユーザ表現生成部）へ入力し、このユーザ分散表現抽出部１０ｕから、推定対象ユーザのユーザ表現ベクトルを取得する。このユーザ表現ベクトルはすで説明したように、推定対象ユーザのパーソナリティの分散表現と捉えることができ、すなわちパーソナリティ表現ベクトル（personality embedding vector）とみなすことができるものとなっている。 The personality estimation unit 93 generates user identification information (one-hot vector) of the estimated user from the received information related to the estimated user, inputs this to the learned user distributed expression extraction unit 10u (user expression generation unit) in the trained personality information generation model 1, and obtains the user expression vector of the estimated user from the user distributed expression extraction unit 10u. As already explained, this user expression vector can be regarded as a distributed expression of the estimated user's personality, that is, as a personality expression vector (personality embedding vector).

出力部９４は、受け取ったパーソナリティ表現ベクトル（personality embedding vector）を、推定対象ユーザの「パーソナリティ情報」として（通信機能を備えている場合に）外部へ送信したり、（表示機能を備えている場合に）表示したりする。 The output unit 94 transmits the received personality embedding vector to the outside as "personality information" of the estimated target user (if it has a communication function) or displays it (if it has a display function).

ここで、訓練部９２及びパーソナリティ推定部９３は、本発明によるパーソナリティ推定方法の一実施形態を実施する主要機能構成部であり、また、本発明によるパーソナリティ推定プログラムの一実施形態を保存したプロセッサ・メモリの機能と捉えることもできる。またこのことから、パーソナリティ推定装置９は、パーソナリティ推定の専用装置であってもよいが、本発明によるパーソナリティ推定プログラムを搭載した、例えばクラウドサーバ、非クラウドのサーバ装置、パーソナル・コンピュータ（ＰＣ）、ノート型若しくはタブレット型コンピュータ、又はスマートフォン等とすることも可能である。 The training unit 92 and the personality estimation unit 93 are the main functional components that implement one embodiment of the personality estimation method according to the present invention, and can also be considered as functions of a processor/memory that stores one embodiment of the personality estimation program according to the present invention. As a result, the personality estimation device 9 may be a dedicated device for personality estimation, but it can also be, for example, a cloud server, a non-cloud server device, a personal computer (PC), a notebook or tablet computer, or a smartphone, etc., that is equipped with the personality estimation program according to the present invention.

以上詳細に説明したように、本発明によれば、例えば従来使用されてきた質問紙調査による測定結果を用いることなく又はそれに依存することなく、推定対象ユーザの行動ドメイン情報から、情報粒度及び信頼性のより高いパーソナリティ情報を生成することができる。さらに、このように生成したユーザのパーソナリティ情報を用いて、例えばマーケティングの分野において、提供する商品・サービスをパーソナライズし、例えば好適な又は有効なレコメンド等を実施することも可能となるのである。 As described above in detail, according to the present invention, it is possible to generate personality information with higher information granularity and reliability from behavioral domain information of the estimated target user without using or relying on measurement results from, for example, a questionnaire survey that has been used in the past. Furthermore, by using the user's personality information generated in this way, it is possible to personalize the products and services provided, for example, in the field of marketing, and to implement suitable or effective recommendations, etc.

また、例えば子供達に対し質の高い、且つ個々の性格に合った教育を提供するために、本発明によって生成した当該子供達の（その行動内容から推定した）パーソナリティ情報を活用することもできる。すなわち本発明によれば、国連が主導する持続可能な開発目標（ＳＤＧｓ）の目標４「すべての人々に包摂的かつ公平で質の高い教育を提供し、生涯学習の機会を促進する」に貢献することも可能となるのである。 In addition, for example, the personality information of children (estimated from their behavior) generated by the present invention can be used to provide them with a high-quality education that is tailored to their individual personalities. In other words, the present invention can contribute to Goal 4 of the United Nations-led Sustainable Development Goals (SDGs), which is to "Ensure inclusive and equitable quality education and promote lifelong learning opportunities for all."

さらに、例えば大人達に対し、環境に害を及ぼさないディーセント・ワーク（働きがいのある人間らしい仕事）や、質の高い、且つ個々の性格に適した仕事を提供するために、本発明によって生成した当該大人達の（その行動内容から推定した）パーソナリティ情報を活用することもできる。すなわち本発明によれば、国連が主導するＳＤＧｓの目標８「すべての人々のための包摂的かつ持続可能な経済成長、雇用およびディーセント・ワークを推進する」に貢献することも可能となるのである。 Furthermore, the personality information of adults (estimated from their behavior) generated by the present invention can be used to provide them with decent work that does not harm the environment, and high-quality work that suits their individual personalities. In other words, the present invention can contribute to Goal 8 of the United Nations-led SDGs, "Promote inclusive and sustainable economic growth, employment and decent work for all."

またさらに、例えば消費者に対し、当該消費者の性格や生活行動の現状に沿った、持続可能な消費とライフスタイルについての教育を提供するために、本発明によって生成した当該消費者の（その行動内容から推定した）パーソナリティ情報や生活行動履歴・消費活動履歴を活用することもできる。すなわち本発明によれば、国連が主導するＳＤＧｓの目標１２「持続可能な消費と生産のパターンを確保する」に貢献することも可能となるのである。 Furthermore, for example, the personality information (estimated from the consumer's behavior) generated by the present invention, as well as the consumer's lifestyle history and consumption activity history, can be utilized to provide the consumer with education on sustainable consumption and lifestyles that are in line with the consumer's personality and current lifestyle. In other words, the present invention can contribute to the achievement of Goal 12 of the United Nations-led SDGs, "Ensure sustainable consumption and production patterns."

上述した本発明の種々の実施形態について、本発明の技術思想及び見地の範囲の種々の変更、修正及び省略は、当業者によれば容易に行うことができる。上述の説明はあくまで例であって、何ら制約しようとするものではない。本発明は、特許請求の範囲及びその均等物として限定するものにのみ制約される。 With respect to the various embodiments of the present invention described above, various changes, modifications, and omissions within the scope of the technical ideas and viewpoints of the present invention can be easily made by a person skilled in the art. The above description is merely an example and is not intended to be restrictive in any way. The present invention is limited only by the scope of the claims and their equivalents.

１、１’ パーソナリティ情報生成モデル
１０ＤＩＬセル（ドメイン非依存ＲＮＮモデル）
１０ｏ、１１ｏ、１２ｏ出力層
１０ｕユーザ分散表現抽出部（ユーザ表現生成部）
１１ＤＳＬ１セル（ドメイン特定ＲＮＮモデル）
１１f1in、１１f1out、１２f2in、１２f2out 変換処理部
１１ｉアイテム分散表現抽出部
１１ｕ、１２ｕユーザ分散表現抽出部
１２ＤＳＬ２セル（ドメイン特定ＲＮＮモデル）
１２ｍ動画分散表現抽出部
９パーソナリティ推定装置
９１入力部
９２訓練部
９３パーソナリティ推定部
９４出力部 1, 1' Personality information generation model 10 DIL cell (domain-independent RNN model)
10o, 11o, 12o Output layer 10u User distributed expression extraction unit (user expression generation unit)
11 DSL1 cell (domain specific RNN model)
11f1in, 11f1out, 12f2in, 12f2out Conversion processing unit 11i Item distributed representation extraction unit 11u, 12u User distributed representation extraction unit 12 DSL2 cell (domain-specific RNN model)
12m Video distributed expression extraction unit 9 Personality estimation device 91 Input unit 92 Training unit 93 Personality estimation unit 94 Output unit

Claims

A personality information generation model that causes a computer to function to estimate a personality of a user from information related to each of a plurality of behavioral domains of the user, the personality information generation model comprising:
a plurality of behavioral expression generating units set for each of the behavioral domains, each generating behavioral expression information that is information expressing the behavior of the user in the behavioral domain from information that identifies the content of the behavior of the user in the behavioral domain;
a plurality of domain-specific regression neural network (RNN) cells set for each of the behavioral domains, each of which receives the behavioral expression information of the user in the behavioral domain, and generates new domain-specific hidden state information corresponding to the hidden state information of the RNN by reflecting the behavioral expression information of the user received this time on domain-specific hidden state information corresponding to the RNN hidden state information that the RNN cell itself generated when it previously received the behavioral expression information of the user ;
a user expression generating unit that generates user expression information that represents the user from information that identifies the user;
a computer is caused to function as a domain-independent RNN cell that receives the generated domain-specific hidden state information and the generated user expression information, and generates new domain-independent hidden state information corresponding to the RNN hidden state information by reflecting the currently received domain-specific hidden state information of the user and the currently received user expression information on domain-independent hidden state information corresponding to the RNN hidden state information that the computer generated when the domain-specific hidden state information of the user was previously received ;
The user expression generating unit, trained by the backpropagation method of the RNN together with the plurality of domain-specific RNN cells and the domain-independent RNN cells, generates the user expression information of the user as information expressing the personality of the user in the sense of domain-independent and static information related to the user.
A personality information generation model characterized by :

The personality information generation model according to claim 1, characterized in that the domain-independent RNN cell receives domain-specific hidden state information that has been recently generated or has just been generated in the domain-specific RNN cell, and generates the domain-independent hidden state information.

The personality information generation model according to claim 1 or 2, further comprising a computer that functions as a first conversion processing unit that performs a predetermined type of conversion processing on the domain-specific hidden state information output from the domain-specific RNN cell, and inputs the domain-specific hidden state information that has been subjected to the conversion processing to the domain-independent RNN cell.

The personality information generation model according to any one of claims 1 to 3, characterized in that the domain-specific RNN cell also receives the domain-independent hidden state information that was generated most recently from the time of receiving the behavioral expression information, and generates the domain-specific hidden state information using the domain-independent hidden state information.

The personality information generation model according to claim 4, further comprising a computer that functions as a second conversion processing unit that performs a predetermined type of conversion processing on the domain-independent hidden state information output from the domain-independent RNN cell and inputs the domain-independent hidden state information that has been subjected to the conversion processing to the domain-specific RNN cell.

6. The personality information generation model according to claim 1, wherein the user expression generation unit outputs the generated user expression information as information expressing the personality of the user.

The personality information generation model according to any one of claims 1 to 6, characterized in that the domain-independent RNN cell outputs the generated domain-independent hidden state information as dynamic psychological state information, which is information related to the dynamic psychological state of the user.

The personality information generation model according to any one of claims 1 to 7, characterized in that each of the plurality of domain-specific RNN cells and the domain-independent RNN cell has at least one of a forget gate, an update gate, and a reset gate that perform a predetermined process on information input to the cell to generate new hidden state information.

A personality estimation device, characterized in that it generates and outputs information expressing the personality of a user using the learned user expression generation unit obtained by training a personality information generation model described in any one of claims 1 to 8.

The personality information generation model according to any one of claims 1 to 8 is trained by a backpropagation method of an RNN ;
A personality estimation method in a computer, characterized in that the trained user expression generation unit obtained by the training is used to generate and output user expression information of the user, the user expression information being information that expresses the personality of the user in the sense of being domain-independent and static information related to the user .