JP6400445B2

JP6400445B2 - Conversation analyzer, conversation analysis system, conversation analysis method, and conversation analysis program

Info

Publication number: JP6400445B2
Application number: JP2014239953A
Authority: JP
Inventors: 大作若松; 啓一郎帆足
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2014-11-27
Filing date: 2014-11-27
Publication date: 2018-10-03
Anticipated expiration: 2034-11-27
Also published as: JP2016103081A

Description

本発明は、人間同士のリアルなコミュニケーションの状況を分析し、リアルな人間同士の情報伝達の好ましさを評価可能とし、更に、特定の話題に関する他人への影響が大きいユーザ（インフルエンサー）を抽出可能とする会話分析装置、会話分析システム、会話分析方法及び会話分析プログラムに関する。 The present invention analyzes the situation of real communication between humans, makes it possible to evaluate the preference of information transmission between real humans, and further enables users (influencers) who have a great influence on others on a specific topic. The present invention relates to a conversation analysis apparatus, a conversation analysis system, a conversation analysis method, and a conversation analysis program that can be extracted.

人間同士のリアルなコミュニケーションの状況を分析する手法としては、例えば特許文献１〜３に記載の技術が開示されている。
特許文献１では、ネットワーク上におけるバーチャルな人間関係が構築できるSNS(Social Networking Service)があり、このSNSに登録された人間関係とその投稿履歴から、どのSNSユーザがネットワーク上で影響力があるかを抽出することが行われる。 For example, techniques disclosed in Patent Documents 1 to 3 are disclosed as techniques for analyzing the situation of real communication between humans.
In Patent Document 1, there is SNS (Social Networking Service) that can build a virtual human relationship on the network, and which SNS user has influence on the network from the human relationship registered in this SNS and its posting history Is performed.

特許文献２では、複数の一般ユーザによる会話の開始と終了とを正確に認識し、必要な会話のみを記録する技術が開示されている。具体的には、会話の相手を識別しておき、会話の開始と終了をウェアラブル装置により記録し、サーバ装置で前記記録した会話をテキストに変換し、後から検索できる会話管理システムが提供されている。 Patent Document 2 discloses a technique for accurately recognizing the start and end of a conversation by a plurality of general users and recording only the necessary conversation. Specifically, a conversation management system is provided in which a conversation partner is identified, the start and end of a conversation are recorded by a wearable device, the recorded conversation is converted into text by a server device, and can be searched later. Yes.

特許文献３では、作業者に装着される複数のセンサと、前記複数のセンサから送信されたデータを受信する基地局と、ネットワークを介して前記基地局から前記データを受信する管理計算機とを備えるセンサネットワークシステムにおいて、センサに割り当てた識別子を赤外線通信により互いのセンサで検出し、その検出した時間を計測することで、作業者が対面していたとしてグループ化し、さらにそのグループを作業者のコミュニティとして対応付ける情報を保持するシステムが提供されている。 Patent Document 3 includes a plurality of sensors attached to an operator, a base station that receives data transmitted from the plurality of sensors, and a management computer that receives the data from the base station via a network. In the sensor network system, the identifier assigned to the sensor is detected by each other sensor by infrared communication, and the detected time is measured to group the workers as if they are facing each other. A system for holding information to be associated is provided.

人間同士のコミュニケーションには発声による会話だけではなく、表情やしぐさ等のノンバーバル情報もコミュニケーションを活性化させる要素であり、話す側のスキルに関わらず、聞く側のスキルにも関係する。聞く側のスキルとして、発話以外のノンバーバル情報として「表情」や「しぐさ」が注目されており、具体的には話し相手を認めコミュニケーションが促進される要素として、表情としては「笑顔」、しぐさとしては「頷き」が知られている（非特許文献１及び非特許文献２参照）。
更に、人に幸福感を与える良いコミュニケーションとして「親切と感謝の行動」が知られている（非特許文献３参照）。 For communication between humans, not only verbal conversation but also non-verbal information such as facial expressions and gestures is an element that activates communication, which is related to the listening skill regardless of the speaking skill. As a listening skill, attention has been paid to facial expressions and gestures as non-verbal information other than utterances. Specifically, facial expressions include smiles and facial expressions as elements that promote communication. “Golding” is known (see Non-Patent Document 1 and Non-Patent Document 2).
Furthermore, “kindness and gratitude” is known as a good communication that gives people happiness (see Non-Patent Document 3).

特許第５４８９９２４号公報Japanese Patent No. 5499924 特開２０１１−１８０６４５号公報JP 2011-180645 A 特許第５４１７６２７号公報Japanese Patent No. 5417627

『対話の流れと頷きパターン変化』（HAIシンポジウム2010、井上雅史ら）“Dialogue flow and whispering pattern change” (HAI Symposium 2010, Masafumi Inoue et al.) 『コミュニケーションにおける表情および身体動作の役割』（高木幸子）https://dspace.wul.waseda.ac.jp/dspace/handle/2065/27539“The role of facial expression and body movement in communication” (Sachiko Takagi) https://dspace.wul.waseda.ac.jp/dspace/handle/2065/27539 『親切と感謝の行動が幸福感に及ぼす影響』（北村瑞穂）“Effects of kindness and gratitude on happiness” (Mizuho Kitamura)

しかしながら、特許文献１に記載の技術は、あくまでSNS上のユーザ関係からインフルエンサーを効率よく発見するのみに留まり、リアルな人間関係からのインフルエンサー抽出には言及していない。 However, the technique described in Patent Document 1 is not limited to extracting influencers from real human relationships, but only to efficiently find influencers from user relationships on SNS.

特許文献２の技術によれば、リアルな会話の開始と終了をウェアラブル装置で推測し、会話を記録し管理することができる。しかしながら、会話を記録するのみでコミュニケーションの巧みさを分析することはできない。 According to the technique of Patent Document 2, it is possible to guess the start and end of a real conversation with a wearable device, and record and manage the conversation. However, it is not possible to analyze the skill of communication only by recording conversations.

特許文献３においては、リアルな対面の記録をウェアラブル装置で推測し、コミュニティとして人間関係の作図を行っていたが、その一方、その人がある話題でインフルエンサーの役割を示すかについての言及はない。 In Patent Document 3, a real face-to-face recording was inferred with a wearable device, and human relations were drawn as a community. On the other hand, a reference to whether the person shows the role of an influencer in a certain topic is Absent.

このように上述した従来技術では、SNS等のビッグデータを用いて人間関係を分析する技術は存在するものの、リアルな人間関係を分析し、ある話題に関してインフルエンサーを抽出することはできなかった。
また、会話やその時の感情を記録する技術はあったものの、会話の巧みさを分析する技術は存在しなかった。 As described above, in the related art described above, although there is a technique for analyzing human relationships using big data such as SNS, it has been impossible to analyze real human relationships and extract influencers on a certain topic.
Although there was a technique for recording conversation and feelings at that time, there was no technique for analyzing the skill of conversation.

本発明は、上記実情に鑑みて提案されたもので、ウェアラブル端末を用いてリアルな会話を記録分析し、その利用者の会話相手の受容度および利用者の会話の巧みさを分析し、ある話題について他人への影響が大きいユーザ（インフルエンサー）を抽出する会話分析装置、会話分析システム、会話分析方法及び会話分析プログラムを提供することを目的としている。 The present invention has been proposed in view of the above circumstances, and records and analyzes a real conversation using a wearable terminal, analyzes the user's conversation partner acceptance and the user's conversation skill, It is an object of the present invention to provide a conversation analysis device, a conversation analysis system, a conversation analysis method, and a conversation analysis program that extract users (influencers) who have a great influence on other people about a topic.

上記目的を達成するため請求項１の会話分析装置は、人同士の会話を検出するため利用者の頭部に装着するウェアラブル端末であって、
前記会話の中で発声する重要な言葉抽出のために複数の重要語を記憶する重要語データベースと、
前記会話の音声情報を収集するマイクロフォンと、
前記音声情報から重要語を認識する音声評価部と、
前記音声評価部の認識結果から所定の重み付けをして会話に対する相手の受容度を分析する会話分析部と、
前記受容度を前記利用者にフィードバックする告知部と、
を備えたことを特徴としている。 In order to achieve the above object, the conversation analyzer according to claim 1 is a wearable terminal that is worn on a user's head to detect a conversation between people,
An important word database storing a plurality of important words for extracting important words uttered in the conversation;
A microphone for collecting voice information of the conversation;
A voice evaluation unit for recognizing important words from the voice information;
A conversation analysis unit that analyzes a partner's acceptability for a conversation with a predetermined weighting from the recognition result of the voice evaluation unit;
A notification unit that feeds back the acceptance to the user;
It is characterized by having.

請求項２は、請求項１の会話分析装置において、
会話相手の顔表情と頭部の動きを映像として収集するカメラと、
前記映像から単位会話時間あたりの会話相手の笑顔時間および頷き回数を認識する映像評価部と、を備え、
前記会話分析部は、前記音声評価部及び映像評価部の認識結果から一定の重み付けをして、その会話に対する相手の受容度を分析することを特徴としている。 Claim 2 is the conversation analyzer of claim 1 ,
A camera that collects the facial expression of the conversation partner and the movement of the head as a video,
A video evaluation unit for recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video,
The conversation analysis unit is characterized in that it receives a certain weight from the recognition results of the voice evaluation unit and the video evaluation unit, and analyzes the acceptability of the other party for the conversation.

請求項３は、請求項１又は請求項２に記載の会話分析装置において、
前記告知部は、前記受容度を聴覚で認識できる音声再生部で構成されたことを特徴としている。 Claim 3 is the conversation analysis device according to claim 1 or claim 2 ,
The notification unit is configured by a voice reproduction unit that can recognize the acceptance level by hearing.

請求項４は、請求項１又は請求項２に記載の会話分析装置において、
前記告知部は、前記受容度を視覚で認識できる表示装置で構成されたことを特徴としている。
請求項５は、請求項１又は請求項２に記載の会話分析装置において、
前記音声評価部は、会話のトーンおよび／またはテンポを分析する機能を備え、
前記会話分析部は、相手と利用者の会話のトーンおよび／またはテンポの差を集計し一定の重み付けをして会話の巧みさを分析する機能を備え、
前記告知部は、前記分析した会話の巧みさを利用者にフィードバックする機能を備えることを特徴としている。 Claim 4 is the conversation analyzer according to claim 1 or claim 2 ,
The notification unit is configured by a display device that can visually recognize the acceptability.
Claim 5 is the conversation analysis apparatus according to claim 1 or claim 2 ,
The voice evaluation unit has a function of analyzing the tone and / or tempo of conversation,
The conversation analysis unit has a function of analyzing the skill of the conversation by summing up the difference in tone and / or tempo of the conversation between the other party and the user and giving a certain weight.
The notification unit has a function of feeding back the skill of the analyzed conversation to a user.

請求項６は、請求項５の会話分析装置において、
前記利用者の頭部の動きを認識する加速度センサと、
前記加速度センサから利用者の単位会話時間当たりの頷き回数を分析する頷き認識部と、を備え、
前記会話分析部は、前記頷き認識部による認識結果に所定の重み付けをして前記会話の巧みさを分析することを特徴としている。 Claim 6 is the conversation analyzer of claim 5 ,
An acceleration sensor for recognizing the movement of the user's head;
A whisker recognition unit that analyzes the number of whistling per unit conversation time of the user from the acceleration sensor,
The conversation analysis unit analyzes the skill of the conversation by giving a predetermined weight to the recognition result by the whisper recognition unit.

請求項７は、請求項５又は請求項６の会話分析装置において、
前記会話分析部は、会話中の単語の繰り返しを集計して一定の重み付けをして分析する機能を備えて前記会話の巧みさを分析することを特徴としている。 Claim 7 is the conversation analysis device according to claim 5 or claim 6 ,
The conversation analysis unit is characterized by analyzing the skill of the conversation with a function of counting and repeating the words in the conversation and performing a constant weighting.

請求項８は、請求項１乃至請求項７のいずれか１項に記載の会話分析装置において、
前記会話分析部による分析は、一つの会話の音声認識結果から会話中に出現した単語がどのくらい特徴的であるかを識別するための指標（TF-IDF）を利用して特徴語を分析して、前記受容度および/または前記会話の巧みさと共に、一会話毎に記録する記憶部を備えることを特徴としている。 Claim 8 is the conversation analysis apparatus according to any one of claims 1 to 7,
The analysis by the conversation analysis unit analyzes a feature word using an index (TF-IDF) for identifying how characteristic the word that appears in the conversation is from the speech recognition result of one conversation. A storage unit is provided for recording each conversation together with the degree of acceptance and / or the skill of the conversation.

請求項９の会話分析システムは、請求項１乃至請求項８のいずれか１項に記載の会話分析装置と、前記会話分析装置の会話ログを集約して管理する会話評価サーバ装置とを備え、
前記会話評価サーバ装置は、
前記会話分析装置の全利用者の会話ログを分析して会話評価を行う制御部と、
前記会話ログを会話ログデータベースに記憶する記憶部と、
を備えることを特徴としている。
請求項１０は、請求項９の会話分析システムにおいて、
前記会話分析装置は、前記会話中で発声するある特定の話題に対するインフルエンサー抽出のために複数の話題語を記憶する話題語データベースを備え、
前記音声評価部は、更に前記音声情報から話題語を認識し、
前記記憶部は、一会話毎に前記認識した話題語を会話ログへ記録し、
前記会話評価サーバ装置の制御部は、複数利用者の会話ログを分析し前記話題語に対する会話評価を行うに際し、評価したい話題語が含まれる会話ログを抽出し、
抽出した会話ログの複数利用者の中から会話の受容度または会話の巧みさが平均値より高い場合に前記話題語に対して影響力のある利用者をインフルエンサーとして抽出することを特徴としている。 A conversation analysis system according to claim 9 includes the conversation analysis apparatus according to any one of claims 1 to 8 , and a conversation evaluation server apparatus that collects and manages conversation logs of the conversation analysis apparatus,
The conversation evaluation server device
A control unit that analyzes a conversation log of all users of the conversation analyzer and evaluates a conversation;
A storage unit for storing the conversation log in a conversation log database;
It is characterized by having.
Claim 10 is the conversation analysis system of claim 9 ,
The conversation analysis device comprises a topic word database for storing a plurality of topic words for influencer extraction for a specific topic uttered during the conversation,
The voice evaluation unit further recognizes a topic word from the voice information,
The storage unit records the recognized topic word for each conversation in a conversation log,
The control unit of the conversation evaluation server device extracts a conversation log including a topic word to be evaluated when analyzing a conversation log of a plurality of users and performing a conversation evaluation on the topic word.
It is characterized in that users who have an influence on the topic word are extracted as influencers when the conversation acceptance or skill of conversation is higher than the average value from a plurality of users of the extracted conversation logs. .

請求項１１は、請求項１０の会話分析システムにおいて、
前記会話分析装置は、会話開始において自身の利用者の識別子と、会話相手となる利用者の識別子とを交換する通信部を備え、
前記会話評価サーバの制御部は、前記会話ログを分析して、会話相手となる利用者の識別子と会話時間を更に記録し、前記利用者に付与された識別子間の会話ログについて、総会話時間を集計した利用者間の結び付き度と、受容度を集計し利用者間の大小関係を利用者の人間関係として管理する人間関係管理機能を備え、
前記会話評価サーバの記憶部は、前記利用者の人間関係を紐づけて記憶する人間関係データベースを備えることを特徴としている。 Claim 11 is the conversation analysis system of claim 10,
The conversation analysis device comprises a communication unit for exchanging an identifier of a user of the user at the start of conversation and an identifier of a user who is a conversation partner,
The control unit of the conversation evaluation server analyzes the conversation log, further records the identifier of the user who is a conversation partner and the conversation time, and the total conversation time for the conversation log between the identifiers assigned to the user. It is equipped with a human relationship management function that tabulates the degree of connection between users and the degree of acceptance, and manages the magnitude relationship between users as the user's human relationship,
The storage unit of the conversation evaluation server includes a human relationship database that stores the human relationships of the users in association with each other.

請求項１２は、会話分析装置と会話評価サーバとを備えた会話分析システムに対して、コンピュータソフトウエアによる情報処理により会話を分析評価する方法であって、
利用者と会話相手との会話中に生じる音声情報から予め設定された話題語および重要語を認識する手順と、
前記会話相手の顔表情と頭部の動きを映像として取得し、該映像から単位会話時間あたりの会話相手の笑顔時間および頷き回数を認識する手順と、
前記各認識結果から会話の話題に対する前記会話相手の受容度を分析する手順と、
前記受容度を前記利用者にフィードバックする手順と、
を含むことを特徴としている。 Claim 12 is a method for analyzing and evaluating a conversation by information processing by computer software for a conversation analysis system including a conversation analysis device and a conversation evaluation server,
A procedure for recognizing preset topic words and important words from voice information generated during conversation between a user and a conversation partner;
Acquiring the facial expression of the conversation partner and the movement of the head as a video, and recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video;
A procedure for analyzing the degree of acceptance of the conversation partner with respect to the topic of conversation from each recognition result;
Feeding back the acceptance to the user;
It is characterized by including.

請求項１３は、会話分析装置と会話評価サーバとを備えた会話分析システムに対して、コンピュータソフトウエアによる情報処理により会話を分析評価する方法であって、
利用者と会話相手との会話中に生じる音声情報から予め設定された話題語および重要語を認識する手順と、
前記会話相手の顔表情と頭部の動きを映像として取得し、該映像から単位会話時間あたりの会話相手の笑顔時間および頷き回数を認識する手順と、
前記会話相手と利用者の会話のトーンおよび／またはテンポの差を集計する手順と、
前記各認識結果及び前記集計結果から所定の重み付けを行って会話の話題に対する会話の巧みさを分析する手順と、
前記会話の巧みさを前記利用者にフィードバックする手順と、
を含むことを特徴としている。 Claim 13 is a method for analyzing and evaluating a conversation by information processing by computer software for a conversation analysis system including a conversation analysis device and a conversation evaluation server,
A procedure for recognizing preset topic words and important words from voice information generated during conversation between a user and a conversation partner;
Acquiring the facial expression of the conversation partner and the movement of the head as a video, and recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video;
A procedure for counting the difference in tone and / or tempo of conversation between the conversation partner and the user;
A procedure for analyzing the skill of the conversation with respect to the topic of conversation by performing a predetermined weighting from each recognition result and the total result,
A procedure for feeding back the skill of the conversation to the user;
It is characterized by including.

請求項１４は、請求項１２又は請求項１３の会話分析方法において、  Claim 14 is the conversation analysis method of claim 12 or claim 13,
前記複数利用者と各会話相手との会話について、予め付与された利用者を識別するための識別子と、前記認識した話題語および重要語との前記分析した会話の話題に対する前記会話相手の受容度または会話の巧みさを会話ログとして記憶し、当該複数の会話ログを会話評価サーバ装置で集約して管理する手順と、  Conversation of the conversation partner with respect to the topic of the analyzed conversation of the identifier for identifying the user given in advance and the recognized topic word and important word for the conversation between the plurality of users and each conversation partner Alternatively, the skill of conversation is stored as a conversation log, and the plurality of conversation logs are aggregated and managed by the conversation evaluation server device, and
前記複数利用者の会話ログを分析し前記話題語に対する会話評価を行うに際し、評価したい話題語が含まれる会話ログを抽出する手順と、  When analyzing the conversation log of the plurality of users and performing conversation evaluation on the topic word, a procedure for extracting a conversation log including the topic word to be evaluated;
抽出した会話ログの複数利用者の中から会話の受容度または会話の巧みさが平均値より高い場合に前記話題語に対して影響力のある利用者をインフルエンサーとして抽出する手順と、  A procedure for extracting a user who has an influence on the topic word as an influencer when the conversation acceptance or skill of the conversation is higher than an average value from a plurality of users of the extracted conversation log;
を含むことを特徴としている。It is characterized by including.
請求項１５は、請求項１４の会話分析方法において、  Claim 15 is the conversation analysis method of claim 14,
会話開始において自身の利用者の識別子と、会話相手となる利用者の識別子とを交換する手順と、  A procedure for exchanging the identifier of the user and the identifier of the user who is the conversation partner at the start of the conversation;
前記会話評価サーバで集約して管理する複数利用者の会話ログには、会話相手となる利用者の識別子と会話時間を更に記録する手順と、  In the conversation log of a plurality of users that are aggregated and managed by the conversation evaluation server, a procedure for further recording an identifier and a conversation time of a user who is a conversation partner;
前記利用者に付与された識別子間の会話ログについて、総会話時間を集計した利用者間の結び付き度と、受容度を集計し利用者間の大小関係とを人間関係データベースとして記憶する手順と、  For the conversation log between the identifiers given to the user, a procedure for storing the degree of connection between the users who totaled the total conversation time, and the degree of acceptance and the magnitude relationship between the users as a human relationship database;
前記人間関係データベースの情報を、評価したい話題で抽出する手順と、  A procedure for extracting information of the human relation database by a topic to be evaluated;
当該抽出した人間関係を人間関係マップとして表示する手順と、  A procedure for displaying the extracted human relationship as a human relationship map;
を含むことを特徴としている。It is characterized by including.

請求項１６の会話分析プログラムは、
請求項１２乃至請求項１５のいずれか１項に記載の会話分析方法の各手順をコンピュータ上に構築することを特徴としている。 The conversation analysis program according to claim 16 comprises:
Each procedure of the conversation analysis method according to any one of claims 12 to 15 is constructed on a computer.

請求項１の会話分析装置によれば、相手との会話の音声から重要語を検出し、音声分析結果から会話に対する相手の受容度を計算し、相手が会話に関する受容度（受け入れ度合い）をリアルタイムに利用者にフィードバックすることができる。
また、重要語データベースにより、相手が会話を受け入れていると判定でき、会話が意味あるものであったかを確認するための重要語を設定することができる。 According to the conversation analysis apparatus of claim 1, an important word is detected from the voice of the conversation with the other party, the other party's acceptance level for the conversation is calculated from the voice analysis result, and the other party's degree of acceptance (acceptance level) regarding the conversation is real-time You can give feedback to users.
In addition, it is possible to determine from the important word database that the other party has accepted the conversation, and it is possible to set an important word for confirming whether the conversation is meaningful.

請求項２の会話分析装置によれば、会話中の話題が相手に伝わったかを相手の頷く映像、笑顔の映像から検出し、音声分析結果と映像分析結果から会話に対する相手の受容度を計算し、相手の受容度（受け入れ度合い）をリアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis apparatus of claim 2 , it is detected from the other person's whispering video and smiley video whether the conversation topic is transmitted to the other party, and the other party's acceptability for the conversation is calculated from the voice analysis result and the video analysis result. It is possible to feed back the acceptability (acceptance level) of the other party to the user in real time.

請求項３の会話分析装置によれば、告知部を音声再生部で構成することで、会話中における相手の受容度を聴覚で把握することができる。 According to the conversation analysis device of claim 3 , by configuring the notification unit with the voice reproduction unit, it is possible to grasp the degree of acceptance of the partner during the conversation by hearing.

請求項４の会話分析装置によれば、告知部を表示部で構成することで、会話中における相手の受容度を視覚で把握することができる。 According to the conversation analysis device of the fourth aspect , by configuring the notification unit with the display unit, it is possible to visually grasp the acceptability of the partner during the conversation.

請求項５の会話分析装置によれば、会話の調子（トーンおよび／またはテンポ）を分析することで、相手と同調していて気持ちよく会話を進めているかといった会話の巧みさについて、リアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis device of claim 5 , by analyzing the tone of the conversation (tone and / or tempo), it is possible to determine in real time the skill of the conversation, such as whether or not the conversation is proceeding comfortably in synchronization with the other party. Can provide feedback.

請求項６の会話分析装置によれば、利用者自身の単位会話時間あたりの頷きの回数を分析することで、相手の言葉を利用者が一旦受け止めていることを相手に示しながら会話を進めているかといった会話の巧みさについて、リアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis device of claim 6 , by analyzing the number of whispering per unit conversation time of the user himself, the conversation can be advanced while showing the partner that the user has received the word of the partner once. It is possible to provide feedback to the user in real time about the skill of the conversation.

請求項７の会話分析装置によれば、利用者と会話の相手の言葉のやり取りの中で単語を繰り返して発言することで、相手の言葉を利用者が一旦受け止めしっかりと理解していることを相手に示しながら会話を進めているかの会話の巧みさについて、リアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis device of claim 7 , by repeatedly speaking a word in the exchange of words between the user and the conversation partner, the user once accepts and firmly understands the partner's word. It is possible to provide feedback to the user in real time about the skill of the conversation as to whether the conversation is progressing while showing to the other party.

請求項８の会話分析装置によれば、利用者と会話の相手の言葉のやり取りの分析結果を１つの会話毎に記録し、更にその会話内容の特徴的な単語を記録するので、利用者が振り返りする際に会話の記憶を手繰り寄せ易くすることができる。 According to the conversation analysis apparatus of claim 8 , the analysis result of the exchange of words between the user and the conversation partner is recorded for each conversation, and the characteristic words of the conversation contents are further recorded. When looking back, it is possible to make the memory of conversation easier.

請求項９の会話分析システムによれば、会話分析装置の全利用者の会話ログを分析して管理可能なシステムとすることで、利用者が行った会話の受容度や会話の巧みさについて利用者全体に対して秀でているか利用者毎に評価できる他、特定の話題に関してどの利用者が他人に対し、受け入れられているか、会話が巧みであるかについて評価できる。
請求項１０の会話分析システムによれば、音声分析結果から相手の受容度を計算するに際して、話題語を認識するシステムとすることで、特定の話題に関する利用者による会話の受容度を評価することができる。また、話題語データベースにより、会話を評価したい特定の話題に関する話題語を設定することができる。 According to the conversation analysis system of claim 9 , the conversation log of all users of the conversation analysis apparatus is analyzed and managed, and the system is used for acceptability of conversation performed by the user and skill of conversation. In addition to being able to evaluate for each user whether it is superior to the entire user, it is possible to evaluate which user is accepted by others regarding a specific topic and whether the conversation is skillful.
According to the conversation analysis system of claim 10, when calculating the acceptability of the other party from the voice analysis result, the acceptability of the conversation regarding the specific topic by the user is evaluated by using a system that recognizes the topic word. Can do. Further, topic words relating to a specific topic for which conversation is to be evaluated can be set by the topic word database.

請求項１１の会話分析システムによれば、会話分析装置の全利用者の会話ログを分析し、人間関係を紐づけることにより、どの利用者がどの話題でどの人間関係の中で中心的か（ある話題に関してインフルエンサーの抽出する）、又は、受け入れられているかを管理可能なシステムとすることができる。 According to the conversation analysis system of claim 11, by analyzing the conversation logs of all users of the conversation analysis apparatus and associating the human relations, which user is central in which human relations in which topics ( It can be a system that can manage whether an influencer is extracted or accepted for a topic.

請求項１２の会話分析方法によれば、相手との会話の音声から話題語と重要語を検出し、会話中の話題が相手に伝わったかを相手の頷く映像、笑顔の映像から検出し、音声分析結果と映像分析結果から話題に対する相手の受容度を計算し、相手が話題に関する受容度（受け入れ度合い）をリアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis method of claim 12, the topic word and the important word are detected from the voice of the conversation with the other party, and whether the topic during the conversation is transmitted to the other party is detected from the other person's whispering video and the smiling face video, and the voice. From the analysis result and the video analysis result, it is possible to calculate the acceptability of the other party with respect to the topic, and the other party can feed back the acceptability (acceptance degree) regarding the topic to the user in real time.

請求項１３の会話分析方法によれば、会話相手と利用者の会話のトーンおよび／またはテンポの差を集計し分析することで、話題に対する会話の巧みさをリアルタイムに利用者にフィードバックすることができる。 According to the conversation analysis method of the thirteenth aspect, by summing up and analyzing the difference in the tone and / or tempo of the conversation between the conversation partner and the user, it is possible to feed back the skill of the conversation to the topic in real time to the user. it can.

請求項１４の会話分析方法によれば、複数利用者と各会話相手との会話ログを会話評価サーバ装置で集約して管理し、複数利用者の会話ログを分析することで、リアルな人間関係におけるある話題に関するインフルエンサーを抽出することができる。
また、請求項１５の会話分析方法によれば、会話ログについての分析を行うことで、利用者間の人間関係としての利用者間の結び付き度と、利用者間の受容度の大小関係とを人間関係データベースとして記憶でき、この人間関係データベースの情報から人間関係マップとして表示することもできる他、評価したい話題で抽出した人間関係マップとして表示することができる。 According to the conversation analysis method of claim 14, the conversation log of a plurality of users and each conversation partner is collected and managed by the conversation evaluation server device, and the conversation log of the plurality of users is analyzed, thereby realizing a realistic human relationship. It is possible to extract influencers related to certain topics.
According to the conversation analysis method of claim 15, by analyzing the conversation log, the degree of connection between users as a human relation between users and the magnitude relation of the acceptability between users are obtained. It can be stored as a human relationship database, can be displayed as a human relationship map from the information in this human relationship database, and can also be displayed as a human relationship map extracted on the topic to be evaluated.

請求項１６の会話分析プログラムによれば、前記会話分析方法の各手順をコンピュータ上で行わせることができる。 According to the conversation analysis program of the sixteenth aspect , each procedure of the conversation analysis method can be performed on a computer .

会話分析システムの構成説明図である。1 is a configuration explanatory diagram of a conversation analysis system. FIG. 会話分析装置（ウェアラブル端末）のユーザへの装着状態を示すモデル図である。It is a model figure which shows the mounting state to the user of a conversation analyzer (wearable terminal). 会話分析装置（ウェアラブル端末）の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of a conversation analyzer (wearable terminal). 会話評価サーバ装置の構成の一例を示すブロック図である。It is a block diagram which shows an example of a structure of a conversation evaluation server apparatus. 会話分析方法の手順を示すフローチャートである。It is a flowchart which shows the procedure of the conversation analysis method. （ａ）（ｂ）はカメラ映像から会話相手の頷きを検出する場合の正面顔表情説明図である。(A) (b) is a front face facial expression explanatory drawing in the case of detecting a whisper of a conversation partner from a camera image. （ａ）（ｂ）はカメラ映像から会話相手の笑顔を検出する場合の正面顔表情説明図である。(A) and (b) are front face facial expression explanatory drawings in the case of detecting the smile of the conversation partner from the camera video. （ａ）（ｂ）はカメラ映像から利用者自身の頷きを検出する場合の側面顔表情説明図である。(A) and (b) are side face facial expression explanatory drawings in the case of detecting the user's own whisper from the camera video. 図４の会話評価サーバ装置の記憶部が記憶する会話ログデータベースのレコード表である。It is a record table of the conversation log database which the memory | storage part of the conversation evaluation server apparatus of FIG. 4 memorize | stores. 会話分析装置（ウェアラブル端末）の構成の他の例を示すブロック図である。It is a block diagram which shows the other example of a structure of a conversation analyzer (wearable terminal). 会話評価サーバ装置の構成の他の例を示すブロック図である。It is a block diagram which shows the other example of a structure of a conversation evaluation server apparatus. 図１１の会話評価サーバ装置の記憶部が記憶する会話ログデータベースのレコード表である。It is a record table of the conversation log database which the memory | storage part of the conversation evaluation server apparatus of FIG. 11 memorize | stores. （ａ）（ｂ）は会話分析システムの稼働中における利用者１，２の通信可能範囲と、利用者２が取得するカメラ映像を説明するためのモデル図である。(A) (b) is a model figure for demonstrating the communicable range of the users 1 and 2 during the operation | movement of a conversation analysis system, and the camera image | video which the user 2 acquires. ユーザ同士の人間関係を示す人間関係マップのモデル図である。It is a model figure of the human relationship map which shows the human relationship between users.

本発明の一実施形態に係る会話分析システムについて、図１〜図９を参照しながら説明する。
会話分析システムは、図１に示すように、ウェアラブル端末で構成される会話分析装置１と、このウェアラブル端末に対してインターネット等のネットワーク２を介して接続された会話評価サーバ装置３とから構成されている。
会話分析装置（ウェアラブル端末）１及び会話評価サーバ３は、基本デバイスが記憶されたＲＯＭ、各種のプログラムやデータが記憶されるハードディスクドライブ装置（ＨＤＤ）、プログラムを実行するＣＰＵ等を主要部分とするコンピュータを備え、ＨＤＤに会話分析プログラムが格納されることで、会話分析システムを構築している。 A conversation analysis system according to an embodiment of the present invention will be described with reference to FIGS.
As shown in FIG. 1, the conversation analysis system includes a conversation analysis apparatus 1 composed of a wearable terminal and a conversation evaluation server apparatus 3 connected to the wearable terminal via a network 2 such as the Internet. ing.
The conversation analysis apparatus (wearable terminal) 1 and the conversation evaluation server 3 are mainly composed of a ROM that stores basic devices, a hard disk drive (HDD) that stores various programs and data, a CPU that executes the programs, and the like. A conversation analysis system is constructed by including a computer and storing a conversation analysis program in the HDD.

会話分析装置（ウェアラブル端末）１の外観は、図２に示すように、利用者の両耳にテンプル部５を架け渡すことで利用者の顔前面に装着可能に構成され、会話相手の音声を収集する相手用マイク１１ａ及びカメラ１２が利用者の眼の高さ近くに装着されるとともに、利用者の口近くの位置にユーザ用マイク１１ｂが装着されている。また、ウェアラブル端末１には、利用者の視覚前方に虚像が結像する表示部４０が設けられている。 As shown in FIG. 2, the appearance of the conversation analysis device (wearable terminal) 1 is configured so that it can be worn on the front of the user's face by laying the temple portions 5 on both ears of the user. The collecting microphone 11a and the camera 12 to be collected are mounted near the height of the user's eyes, and the user microphone 11b is mounted at a position near the user's mouth. Further, the wearable terminal 1 is provided with a display unit 40 on which a virtual image is formed in front of the user's vision.

会話分析装置（ウェアラブル端末）１は、図３に示すように、利用者が相手と会話する場合の音声情報や映像情報等の各種情報を取得するセンサ部１０と、各種情報の分析を行う制御部２０と、分析結果を記憶する記憶部３０と、分析結果を表示する表示部４０と、分析結果を情報として外部へ送信する第１の通信部５０とを備えた会話分析装置（ウェアラブル端末）１を備えている。 As shown in FIG. 3, the conversation analysis device (wearable terminal) 1 includes a sensor unit 10 that acquires various types of information such as voice information and video information when the user has a conversation with the other party, and a control that analyzes the various types of information. Conversation analyzer (wearable terminal) comprising: unit 20, storage unit 30 that stores analysis results, display unit 40 that displays analysis results, and first communication unit 50 that transmits the analysis results to the outside as information 1 is provided.

センサ部１０は、周囲の音声情報を取得するマイクロフォン１１、会話相手の顔の表情や頭の動きを映像情報として取得するカメラ１２、利用者（ユーザ）の頭の動きを取得するジャイロ・加速度センサ１３、装着センサ１４から構成されている。
マイクロフォン１１は、相手用マイク１１ａとユーザ用マイク１１ｂとの２つを備え、それぞれ指向性マイクを利用している。 The sensor unit 10 includes a microphone 11 that acquires surrounding voice information, a camera 12 that acquires facial expressions and head movements of conversation partners as video information, and a gyro / acceleration sensor that acquires user (user) head movements. 13 and a mounting sensor 14.
The microphone 11 includes two microphones 11a and 11b, and each uses a directional microphone.

相手用マイク１１ａは、カメラ１２の撮像方向に指向性をもつ受話マイクから構成されている。
ユーザ用マイク１１ｂは、利用者の口側に向けた送話マイクとその反対面に環境マイクの２つを備えたダブルマイク（送話用／環境音の２マイク）で構成され、利用者が発話した音声信号を１つのマイクで取得したときよりもSN比の高い信号を得るようになっている。 The counterpart microphone 11 a is composed of a receiving microphone having directivity in the imaging direction of the camera 12.
The user microphone 11b is composed of a double microphone (two microphones for transmitting / environmental sound) having two microphones, one for transmitting to the mouth side of the user and the other for environmental microphones on the opposite side. A signal with a higher S / N ratio is obtained than when a spoken voice signal is acquired by a single microphone.

ユーザ用マイク１１ｂのダブルマイクに代わるマイクの構成としては、直接利用者の肌に当接させる方式のマイクロフォンとして、喉の声帯（咽喉）マイクや、頬や顎骨や耳の後ろの乳様突起などの皮膚に当接させる骨伝導マイクなどを利用して利用者が発話していることを認識させても良い。 As a microphone configuration that replaces the double microphone of the user microphone 11b, as a microphone that directly contacts the user's skin, a vocal cord (throat) microphone, a cheek, a jawbone, or a mastoid behind the ear, etc. The user may recognize that the user is speaking using a bone conduction microphone or the like that contacts the skin.

カメラ１２は、テンプル部５の前端等、ウェアラブル端末１の適宜位置に装着されることで、利用者の顔が向く前方を撮像できるようになっている。 The camera 12 can be imaged in front of the user's face by being mounted at an appropriate position of the wearable terminal 1 such as the front end of the temple unit 5.

ジャイロ・加速度センサ１３は、ユーザ自身の頷きを検出するため、利用者の頭部の傾きを検出するためのものである。
このジャイロ・加速度センサ１３は、３軸（Ｘ，Ｙ，Ｚ方向）の加速度センサを使用しても良いが、利用者の「頷き」を正確に検出するために、３軸ジャイロを検出するとともに３軸（Ｘ，Ｙ，Ｚ方向）の加速度を検出する６軸センサを使用するのが好ましい。 The gyro / acceleration sensor 13 is for detecting the inclination of the user's head in order to detect the user's own whispering.
The gyro / acceleration sensor 13 may use a triaxial (X, Y, Z direction) acceleration sensor, but detects a triaxial gyro in order to accurately detect the user's “whit”. It is preferable to use a six-axis sensor that detects acceleration in three axes (X, Y, and Z directions).

装着センサ１４は、利用者がウェアラブル端末１を頭部に到着したことを検出するセンサであり、例えばテンプル部５を押し広げることによってセンサのスイッチが入る仕組みで構成されている。 The wearing sensor 14 is a sensor that detects that the user has arrived at the head of the wearable terminal 1, and has a mechanism in which the sensor is switched on by, for example, expanding the temple unit 5.

制御部２０は、マイクロフォン１１からの発話を検出する音声評価部２１と、カメラ１２の映像情報から会話相手の顔表情（笑顔や頷き）を認識する映像評価部２２と、ジャイロ・加速度センサ１３から利用者の頷きを認識する頷き認識部２３と、利用者への装着を検知する装着認識部２４と、各部からの検出結果を総合的に評価して会話分析を行う会話分析部２５とを備えている。 The control unit 20 includes a voice evaluation unit 21 that detects an utterance from the microphone 11, a video evaluation unit 22 that recognizes the facial expression (smile or whisper) of the conversation partner from the video information of the camera 12, and the gyro / acceleration sensor 13. A whisker recognizing unit 23 that recognizes a user's whistling, a wear recognizing unit 24 that detects wearing by a user, and a conversation analyzing unit 25 that comprehensively evaluates detection results from each unit and performs conversation analysis. ing.

音声評価部２１は、発話検出機能と、発話者認識機能と、音声認識機能と、形態素解析機能と、トーン／テンポ計測機能と、を有している。
発話検出機能は、マイクロフォン１１（相手用マイク１１ａ及びユーザ用マイク１１ｂ）から検出した音声情報から会話の開始と終了を検出する。
音声認識機能は、検出された会話をテキスト化する。
形態素解析機能は、テキストを形態素化して単語に分ける。
トーン／テンポ計測機能は、会話における会話トーン（大きさ、高さ）の一致性、会話テンポの一致性を測定する。 The voice evaluation unit 21 has a speech detection function, a speaker recognition function, a voice recognition function, a morphological analysis function, and a tone / tempo measurement function.
The utterance detection function detects the start and end of a conversation from voice information detected from the microphone 11 (the partner microphone 11a and the user microphone 11b).
The voice recognition function converts the detected conversation into text.
The morphological analysis function morphizes the text and divides it into words.
The tone / tempo measurement function measures the consistency of conversation tones (size, height) and conversation tempo in a conversation.

発話者認識機能は、ウェアラブル端末１の利用者の発話であることを認識するものである。すなわち、発話された音声を基に、話者認識により利用者が発話したか判定（発話者判定）する。発話者認識には、発話者を照合するための発話者モデルが必要となる。この発話者モデルは、ウェアラブル端末利用前に利用者に発声させ事前に作成し、後述する利用者登録データ部３１に話者認識用のデータとして登録させておく。また、ウェアラブル端末１の利用中に、発声された音声から発話者モデルを随時更新しておくようにしても良い。 The speaker recognition function recognizes that the user of the wearable terminal 1 is speaking. That is, based on the spoken voice, it is determined whether the user has spoken by speaker recognition (speaker determination). Speaker recognition requires a speaker model for collating speakers. This speaker model is created in advance by making the user speak before using the wearable terminal, and is registered as data for speaker recognition in a user registration data unit 31 described later. Further, while using the wearable terminal 1, the speaker model may be updated as needed from the spoken voice.

映像評価部２２は、カメラ１２からの映像情報から相手の顔情報を検出し、相手の笑顔を検出する笑顔認識機能と、相手の頷きを検出する頷き認識機能を備えている。 The video evaluation unit 22 has a smile recognition function that detects the other party's smile by detecting the other party's face information from the video information from the camera 12, and a whip recognition function that detects the other party's whisper.

頷き認識部２３は、ジャイロ・加速度センサ１３からの情報により利用者の頷きを検出する。 The whispering recognition unit 23 detects the whispering of the user based on information from the gyro / acceleration sensor 13.

装着認識部２４は、装着センサ１４からの信号により、利用者へのウェアラブル端末１の装着を検知する。 The wearing recognition unit 24 detects wearing of the wearable terminal 1 to the user based on a signal from the wearing sensor 14.

会話分析部２５は、音声評価部２１からのマイクロフォン１１の音声情報による分析データ、映像評価部２２からのカメラ１２の映像情報による分析データ、頷き認識部２３からのジャイロ・加速度センサ１３による分析データを総合的に評価して会話分析を行い、その分析結果は記憶部３０の会話ログ３４に記憶される。 The conversation analysis unit 25 is analysis data based on voice information of the microphone 11 from the voice evaluation unit 21, analysis data based on video information of the camera 12 from the video evaluation unit 22, and analysis data based on the gyro / acceleration sensor 13 from the whisper recognition unit 23. Are comprehensively evaluated and conversation analysis is performed, and the analysis result is stored in the conversation log 34 of the storage unit 30.

記憶部３０には、話者認識用のデータが登録される利用者登録データ部３１、重要語データベース３２、話題語データベース３３を備えるとともに、制御部２０からの会話分析の分析結果が会話ログ３４に記録されている。
利用者登録データ部３１に登録される話者認識用のデータにより、会話の発話者の認識が可能となる。 The storage unit 30 includes a user registration data unit 31 in which speaker recognition data is registered, an important word database 32, and a topic word database 33. The analysis result of the conversation analysis from the control unit 20 is a conversation log 34. Is recorded.
The speaker recognition data registered in the user registration data unit 31 enables the recognition of the speaker of the conversation.

重要語データベース３２には、会話の巧みさを評価するために円滑な会話を進める上で重要な言葉（会話が意味あるものであったかを確認するための重要語）が予め登録されている。重要語データベースに登録されている重要語は、例えば、「ありがとう」「サンキュー」「感謝」等の感謝の言葉、「凄い」「良い」「カッコいい」「欲しい」「かわいい」「素晴らしい」「クール」等のポジティブ表現、「わかった」「了解」「OK」「はい」「そうします」「そうです」「その通り」等の肯定応答である言葉が登録されている。 In the important word database 32, important words (important words for confirming whether the conversation is meaningful) are registered in advance in order to promote smooth conversation in order to evaluate the skill of the conversation. Important words registered in the important word database are words of thanks such as “Thank you”, “Thank you”, “Thank you”, “Terrible” “Good” “Cool” “Want” “Cute” “Great” “Cool” "," "OK", "OK", "Yes", "Yes", "Yes", and "Yes" are registered.

話題語データベース３３には、ある話題の会話を分析し、その話題に対して影響力の強い利用者（インフルエンサー）を見つけるための単語（話題語）がデータリストとして登録されている。例えば、通信キャリアの場合、「携帯電話」「スマートホン」「ネットワーク」「電波」「通信会社名」「サービス名」等を話題語として登録しておく。様々な分野での話題語を予め登録しておくことが好ましい。 In the topic word database 33, words (topic words) for analyzing a conversation of a certain topic and finding a user (influencer) having a strong influence on the topic are registered as a data list. For example, in the case of a communication carrier, “mobile phone”, “smart phone”, “network”, “radio wave”, “communication company name”, “service name”, etc. are registered as topic words. It is preferable to register topic words in various fields in advance.

表示部４０は、制御部２０による分析結果に応じた文字やアイコン等によるアドバイスが利用者の視界前方に虚像として表示されるように、ウェアラブル端末１に装着されている。具体的には、表示部４０に「もっとゆっくり」「もっと大きく」「笑ってみて」等の文字やアイコン等がアドバイスとして表示される。
なお本例では、表示により視覚的に告知するようにしたが、スピーカ等の音声再生部で発する音声により聴覚を通じてアドバイスを告知するようにしても良い。 The display unit 40 is attached to the wearable terminal 1 so that advice based on characters, icons, and the like corresponding to the analysis result by the control unit 20 is displayed as a virtual image in front of the user's field of view. Specifically, characters, icons, and the like such as “more slowly”, “larger”, and “smile” are displayed on the display unit 40 as advice.
In this example, the notification is visually notified by display, but the advice may be notified through hearing by sound generated by a sound reproduction unit such as a speaker.

第１の通信部５０は、後述する会話評価サーバ装置３へ記録するためにデータ送受信を行う。 The first communication unit 50 performs data transmission and reception for recording in the conversation evaluation server device 3 to be described later.

続いて、会話評価サーバ装置３の構成について説明する。
会話評価サーバ装置３は、ウェアラブル端末１から会話分析結果をデータ受信する第１の通信部６０と、会話評価を行う制御部７０と、受信したデータを記録する記録部８０とから構成されている。
制御部７０は、第１の通信部６０を介してウェアラブル端末１における会話分析結果の情報を評価する会話評価部７１を備えている。
記録部８０は、前記分析評価結果の情報を会話ログデータベース８１に記録し管理する。また、会話ログデータベース８１を使用することで、様々な問合せに対応することができる。様々な問合せの詳細については後述する。 Next, the configuration of the conversation evaluation server device 3 will be described.
The conversation evaluation server device 3 includes a first communication unit 60 that receives data of a conversation analysis result from the wearable terminal 1, a control unit 70 that performs conversation evaluation, and a recording unit 80 that records the received data. .
The control unit 70 includes a conversation evaluation unit 71 that evaluates information of a conversation analysis result in the wearable terminal 1 via the first communication unit 60.
The recording unit 80 records and manages the information of the analysis evaluation result in the conversation log database 81. Further, by using the conversation log database 81, various inquiries can be handled. Details of various inquiries will be described later.

次に、上述した会話分析システムを使用して会話分析装置（ウェアラブル端末）１で会話分析及び会話評価を行う場合の手順について、図５のフローチャートを参照して説明する。
先ず、ウェアラブル端末１が利用者の頭部に装着されているかどうかを装着認識部２４で判断する（ステップ１０１）。
ウェアラブル端末１が装着されている場合は次の処理を行い、装着されていないと判断すると処理を終了する（ステップ１０２）。 Next, a procedure for performing conversation analysis and conversation evaluation with the conversation analysis apparatus (wearable terminal) 1 using the conversation analysis system described above will be described with reference to the flowchart of FIG.
First, the wear recognition unit 24 determines whether or not the wearable terminal 1 is worn on the user's head (step 101).
If the wearable terminal 1 is worn, the following process is performed. If it is determined that the wearable terminal 1 is not worn, the process is terminated (step 102).

ウェアラブル端末１が装着されている場合、会話開始の検出の有無を判定する（ステップ１０３）。具体的には、ユーザ用マイク（送話用マイク／環境音用マイクの２マイク）１１ｂから音声を収集し、利用者の周囲の環境について、騒音レベルを10秒毎に評価する（環境騒音キャリブレーション）。環境音（環境ノイズ）は、様々な方向から発生することを考えると拡散性があると考えられるので、２つのマイクで検出する騒音レベルの差は少ないものと判断する。
利用者が発話すると、ユーザ用マイク１１ｂの送話用マイクへ環境音用マイクよりも大きな音声信号が入る。送話用マイクと環境音用マイクの音声の差を測定し、6dB以上の差がある場合、信号が入った（利用者が発話した）として会話分析を開始する。 When the wearable terminal 1 is worn, it is determined whether or not the conversation start is detected (step 103). Specifically, voices are collected from the user microphone (two microphones for transmitting microphone / microphone for environmental sound) 11b, and the noise level is evaluated every 10 seconds for the environment around the user (environmental noise calibration). ) Considering that the environmental sound (environmental noise) is generated from various directions, it is considered that the sound is diffusive. Therefore, it is determined that the difference in the noise level detected by the two microphones is small.
When the user speaks, a voice signal larger than the environmental sound microphone is input to the transmission microphone of the user microphone 11b. Measure the voice difference between the microphone for sending and the microphone for environmental sound, and if there is a difference of 6dB or more, start conversation analysis as if there was a signal (the user uttered).

「会話分析を開始」の場合、以下の処理が行われる。
・会話開始時刻を記憶し、会話時間計測を開始する。
・会話終了時を検出するための会話終了検出タイマを起動する。
・相手用マイク１１ａをONにし、相手の声の音声取得を開始する。
・カメラ１２をONにして映像取得を開始する。
・ジャイロ・加速度センサ１３をONにし、センサからのデータ収集を開始する。 In the case of “start conversation analysis”, the following processing is performed.
・ Memorize conversation start time and start conversation time measurement.
・ Start the conversation end detection timer to detect when the conversation ends.
・ Turn on the other party's microphone 11a and start acquiring the voice of the other party's voice.
・ Turn on camera 12 and start video acquisition.
・ Turn on the gyro / acceleration sensor 13 and start collecting data from the sensor.

続いて、ウェアラブル端末１の制御部２０が上述した各情報を取得することで会話分析を行う（ステップ１０４）。
会話分析については、後述する。 Subsequently, the control unit 20 of the wearable terminal 1 performs the conversation analysis by acquiring the above-described information (step 104).
Conversation analysis will be described later.

会話終了を判定する（ステップ１０５）。
会話終了は、例えば会話終了タイマを30秒とし、ユーザ発話が30秒間検出しなかった場合、会話終了と判定する。
・会話時間を確定する。
・相手用マイク１１ａをOFFにする。
・カメラ１２をOFFにする。
・ジャイロ・加速度センサ１３をOFFにする。 The end of the conversation is determined (step 105).
For example, when the conversation end timer is set to 30 seconds and no user utterance is detected for 30 seconds, the conversation end is determined to be ended.
・ Confirm conversation time.
・ Turn off the partner microphone 11a.
・ Turn off the camera 12.
・ Turn off the gyro / acceleration sensor 13.

会話終了と判定した場合、会話の評価・記録を行う（ステップ１０６）。
会話の評価・記録については、後述する。 If it is determined that the conversation has ended, the conversation is evaluated and recorded (step 106).
Conversation evaluation and recording will be described later.

前記会話分析（ステップ１０４）は、次の手順で行われる。
「音声のテキスト化、会話のテンポ（発話速度）の分析」
先ず、ユーザ用マイク１１ｂで集音した音声と相手用マイク１１ａで集音した音声をそれぞれ音声認識によるテキスト化（ディクテーション）する。
また、音声認識したテキストのスピード（話す速度）をユーザと相手の発声毎に分析する。
ここで、「会話のテンポが一致する」とは、音声認識機能により音声が一度音素化されるが、この音素化された数とその時間の比をテンポとする。１回発声される毎にこのテンポを計算し、１つ前の発声のテンポと差を取り、１会話が終わるまで差を加算し続ける。また加算回数も計測する。 The conversation analysis (step 104) is performed according to the following procedure.
“Speech of speech, analysis of conversation tempo”
First, the voice collected by the user microphone 11b and the voice collected by the counterpart microphone 11a are converted into text (dictation) by voice recognition.
In addition, the speed of the voice-recognized text (speaking speed) is analyzed for each utterance of the user and the partner.
Here, “the conversation tempo matches” means that the speech is once phonemeized by the speech recognition function, and the ratio between the number of phonemes and the time is defined as the tempo. This tempo is calculated for each utterance, and the difference from the tempo of the previous utterance is taken, and the difference is continuously added until the end of one conversation. The number of additions is also measured.

「会話テキストの分析、同一語の繰り返し回数分析」
前記テキストを形態素解析して単語に分割し、会話が終わるまで記憶する。
ここで、「同一語の繰り返し回数」は、音声認識機能により音声がテキスト化されて、１発声毎に音声認識されたテキスト中の名詞と、その前の発声で音声認識したテキスト中の名詞とを比較し、同一語があれば同一語の繰り返しとして「＋１」加算する。 "Analysis of conversation text, number of repetitions of the same word"
The text is morphologically analyzed and divided into words and stored until the conversation ends.
Here, “the number of repetitions of the same word” is defined as the noun in the text that is voice-recognized for each utterance and the noun in the text that is voice-recognized by the previous utterance. If there is the same word, “+1” is added as a repetition of the same word.

「会話のトーンの分析」
ここで、「会話のトーンが一致する」とは、相手の声量と自身の声量がほぼ一致していることである。
マイクロフォン１１で収録された音声の１発声毎のA特性音圧レベルの平均値を計測し、１つ前の相手の発声の音圧レベルとの差を取り、１会話が終わるまで差を加算する。加算回数もカウントしておく。 "Analysis of conversation tones"
Here, “the conversation tone matches” means that the voice volume of the other party and the voice volume of the other party are almost the same.
Measure the average value of the A-weighted sound pressure level for each utterance of the sound recorded by the microphone 11, take the difference from the sound pressure level of the previous partner's utterance, and add the difference until the end of one conversation . Also count the number of additions.

「カメラ映像からの相手の顔検出時間の計算」
カメラ１２から映像を取得し、映像の中に顔のある位置をパターン認識で検出する。
パターン認識する特徴点は、頭部輪郭に対する両目の位置とその間の鼻、口である。
顔を検出していた時間を積算する。 "Calculation of opponent's face detection time from camera video"
An image is acquired from the camera 12, and a position where a face is present in the image is detected by pattern recognition.
The feature points for pattern recognition are the positions of both eyes relative to the head contour and the nose and mouth between them.
Accumulate the time during which the face was detected.

「カメラ映像からの相手の頷き回数の計算」
カメラ１２の映像から検出した顔の位置を更に分析し、頷いたかどうかを判定する。
映像スライス毎に目と口を囲む四角形の縦横の比を算出し、その比の時系列変化から推定する。
すなわち、図６（ａ）に示すような、目と口を囲む四角（点線で囲まれた領域）が正方形に近い（縦：横の比が大きい）状態から、図６（ｂ）に示すような、目と口を囲む四角（点線で囲まれた領域）が潰れた長方形（縦：横の比が小さい）状態に徐々に変化し、その後、また元の大きさ（図６（ａ））に戻る変化を１つの「頷き」と判断する。カメラ１２の映像から、「頷き」と判定した時刻と回数を記録する。 "Calculating the number of whispering of the other party from the camera video"
The position of the face detected from the image of the camera 12 is further analyzed, and it is determined whether or not it has been whispered.
For each video slice, the ratio of the vertical and horizontal dimensions of the rectangle surrounding the eyes and mouth is calculated and estimated from the time series change of the ratio.
That is, as shown in FIG. 6B, from the state where the square (the area surrounded by the dotted line) surrounding the eyes and mouth is close to a square (longitudinal: horizontal ratio is large) as shown in FIG. The square surrounding the eyes and mouth (the area surrounded by the dotted line) gradually changes to a crushed rectangle (vertical: horizontal ratio is small), and then the original size (FIG. 6 (a)). The change that returns to is judged as one “whisper”. From the video of the camera 12, the time and number of times determined as “whispering” are recorded.

「カメラ映像からの相手の笑顔判定時間の計算」
カメラ１２の映像から検出した顔の位置を更に分析し、笑顔かどうかを判定する。
笑顔は、図７に示すように、普通の表情の状態（図７（ａ））と、笑顔の状態（図７（ｂ））との目尻の下がりと口角の上がりやしわ（ホウレイ線の深さ）の関係を取得し、予め決めておいた閾値と比較することで笑顔を推定する。笑顔と判定していた時間を積算する。 "Calculating the smile time of the other party from the camera video"
The position of the face detected from the image of the camera 12 is further analyzed to determine whether it is a smile.
As shown in FIG. 7, the smile is a normal expression state (FIG. 7 (a)) and a smile state (FIG. 7 (b)). ) Is acquired and compared with a predetermined threshold value to estimate a smile. Accumulate the time that was determined to be a smile.

「ジャイロ・加速度センサからの頷きの計算」
利用者自身の「頷き」は、ジャイロ・加速度センサ１３からのデータにより判定する。
すなわち、１回の頷きとは、図８に示すように、利用者が正面を向いた状態（図８（ａ））から首を前に傾けた状態（図８（ｂ））へ、そして再び正面を向いた状態（図８（ａ））に戻る動作とする。
例えば、６軸のジャイロ・加速度センサ１３が眼鏡のテンプル部（つる）５の部分に配置されているとした場合、各軸を以下のように定義する。
X軸：利用者の左右方向に貫く軸。X軸を中心にした回転を「ピッチ」とする。
Y軸：利用者の前後方向に貫く軸。Y軸を中心にした回転を「ロール」とする。
Z軸：利用者の上下方向に貫く軸。Z軸を中心にした回転を「ヨー」とする。
まず重力により下方向へ重力ベクトルを３軸加速度センサから認識できる。つまりZ軸下方向への加速度が恒常的にある状態である。
利用者が、頷き首を前に倒すと、図８（ｂ）の状態になる。ジャイロセンサのピッチ角が推定位置より下がる。また、Z軸下方への加速度が一時的に減少し、Y軸前方向に重力成分が表れる。
利用者が、首を元倒すと、図８（ａ）の状態に戻る。
ジャイロセンサのピッチ角が水平位置にもとに戻り。また、Z軸下方への加速度が一時的に増大し、Y軸前方向に重力成分が減少する。
この一連の変化をパターン認識し、１回の頷き回数とする。 "Calculation of whispering from gyroscope and acceleration sensor"
The user's own “whispering” is determined based on data from the gyro / acceleration sensor 13.
That is, as shown in FIG. 8, the one-time whispering is from a state where the user is facing the front (FIG. 8A) to a state where the neck is tilted forward (FIG. 8B), and again. The operation returns to the state of facing the front (FIG. 8A).
For example, when the 6-axis gyro / acceleration sensor 13 is arranged in the temple portion (vine) 5 of the glasses, each axis is defined as follows.
X axis: An axis that penetrates the user in the horizontal direction. Rotation around the X axis is called “pitch”.
Y axis: An axis that penetrates the user in the longitudinal direction. The rotation around the Y axis is called “roll”.
Z axis: An axis that penetrates the user in the vertical direction. The rotation around the Z axis is called “Yaw”.
First, the gravity vector can be recognized downward from the triaxial acceleration sensor by gravity. That is, there is a constant acceleration in the downward direction of the Z axis.
When the user tilts the whispered neck forward, the state shown in FIG. The pitch angle of the gyro sensor falls below the estimated position. In addition, the acceleration downward in the Z axis temporarily decreases, and a gravity component appears in the forward direction of the Y axis.
When the user pulls the neck down, the state returns to the state of FIG.
The pitch angle of the gyro sensor returns to the horizontal position. Further, the acceleration downward in the Z axis temporarily increases, and the gravity component decreases in the forward direction of the Y axis.
This series of changes is pattern-recognized, and the number of times of singing is one.

ウェアラブル端末（会話分析装置）１の制御部２０の分析結果のデータ（センサ部１０の各データ）に基づいて、会話分析部２５は会話の受容度および会話の巧みさを評価する指標を計算し、会話ログ３４に記録し会話が継続する間は更新をする。指標の計算の詳細については後述する。 Based on the analysis result data (each data of the sensor unit 10) of the control unit 20 of the wearable terminal (conversation analysis device) 1, the conversation analysis unit 25 calculates an index for evaluating the acceptance of the conversation and the skill of the conversation. It is recorded in the conversation log 34 and updated while the conversation continues. Details of the index calculation will be described later.

続いて、会話の受容度および会話の巧みさを評価する各指標の計算方法について説明する。
これらの指標を求めるために、以下の単位時間あたりの各行動を計測する。
会話が終わるまで待たずに例えば15秒周期で指標を再計算し更新する。
・頷かれ回数（相手の頷き回数）
・頷き回数（自身の頷き回数）
・重要語の回数（自身及び相手による発話）
・笑い時間
・対向時間
・会話トーンの一致性
・会話テンポの一致性
・繰り返し回数 Then, the calculation method of each parameter | index which evaluates the acceptance degree of conversation and the skill of a conversation is demonstrated.
In order to obtain these indices, the following behaviors per unit time are measured.
For example, the index is recalculated and updated every 15 seconds without waiting for the conversation to end.
・ Number of beatings (number of whispering of opponent)
・ Number of whispers (number of whispers)
・ Number of important words (utterances by yourself and the other party)
・ Laughter time, opposite time, conversation tone consistency, conversation tempo consistency, number of repetitions

頷き回数：NnrおよびNnd
頷きは相手を認め、肯定している行動である。会話単位時間あたりで頷かれの多い人は、相手にわかりやすい話し方をしており、相手に受け入れられていると考えられる。
Nnr = (相手の頷き回数)／（会話時間）
Nnd = (自身の頷き回数)／（会話時間） Number of whirlings: Nnr and Nnd
Whispering is an action that recognizes and affirms the other party. A person who is often asked about the unit time of the conversation has an easy-to-understand way of speaking to the other party and is considered accepted by the other party.
Nnr = (number of whisperings of the other party) / (conversation time)
Nnd = (Number of whispers) / (Conversation time)

ここで、頷かれ回数（相手の頷き回数）は、カメラ１２に相手の顔が映り、頷き検出機能により「頷き」と推定していた回数とする。
ここで、頷き回数（自身の頷き回数）は、ジャイロ加速度センサ１３からの頷き検出機能により「頷き」と推定していた回数とする。
会話を進める中で、予め決めていた閾値よりNnrが低い場合は、表示部４０に相手の頷きが足りない旨を表示する。
会話を進める中で、予め決めていた閾値よりNndが低い場合は、表示部４０に自身の頷きが足りない旨を表示する。 Here, the number of hits (the number of times the other party whispered) is the number of times that the other party's face was reflected on the camera 12 and was estimated to be “whited” by the whisper detection function.
Here, the number of times of whispering (the number of times of whispering) is the number of times that the whispering is estimated by the whispering detection function from the gyro acceleration sensor 13.
When Nnr is lower than a predetermined threshold during the conversation, the display unit 40 displays that the other party's whisper is insufficient.
If Nnd is lower than a predetermined threshold value during the conversation, the display unit 40 displays that it is not enough.

笑われた時間:Tsr
笑いは会話を円滑に進めるための行動である。
会話単位時間あたりで笑われの多い人は、楽しく円滑に会話を進めていることを示す。
Tsr = (相手が笑った時間)／（相手の顔を検出していた時間） Laughed time: Tsr
Laughter is an action to facilitate conversation.
A person who is often laughed per conversation time unit shows that the conversation is proceeding happily and smoothly.
Tsr = (time when the other party laughed) / (time when the other party's face was detected)

ここで、「相手が笑った時間」は、カメラに相手の顔が映り、笑顔検出機能により笑顔と推定していた時間とする。
会話を進める中で、予め決めていた閾値よりTsrが低い場合は、表示部に相手の笑顔が足りない旨を表示する。 Here, “the time when the other party laughed” is the time when the other party's face was reflected on the camera and the smile was detected by the smile detection function.
If Tsr is lower than a predetermined threshold during the conversation, the display unit displays that the other party is not smiling.

重要語の回数：NtrおよびNtd
重要語とは、重要語データベースの単語と一致した回数を記録する。会話単位時間あたりで重要後を発言する場合、重要語を発言する側にとって会話の内容が有益で肯定的な内容であることを示す。
Ntr = (相手の重要語の発声回数)／（会話時間）
Ntd = (自身の重要語の発声回数)／（会話時間） Number of important words: Ntr and Ntd
The important word is recorded as the number of matches with the word in the important word database. When saying important after a conversation unit time, it shows that the content of the conversation is useful and positive for the person who speaks the important word.
Ntr = (number of utterances of the partner's important words) / (conversation time)
Ntd = (Number of utterances of own important words) / (Conversation time)

ここで、「重要語の発声回数」は、音声認識機能により音声がテキスト化されて、そのテキスト中重要語が含まれた数とする。
会話を進める中で、予め決めていた閾値よりNtrが低い場合は、表示部４０に会話内容を変更する旨を表示する。 Here, the “number of utterances of important words” is a number in which the speech is converted into text by the speech recognition function and the important words are included in the text.
When Ntr is lower than a predetermined threshold during the conversation, the display unit 40 displays that the conversation content is to be changed.

対向時間:Ted
会話をする時は、相手の目を見ることで信頼感が得られる。
会話単位時間あたりで相手を見ている人は、集中して、あるいは興味を持って会話していることを示す。
Ted = (相手の顔を見ていた時間)／（会話時間） Opposite time: Ted
When talking, you can gain confidence by looking at the other person's eyes.
A person who is looking at the other person per unit time of conversation shows that he / she is concentrating or talking with interest.
Ted = (Time spent watching the other person's face) / (Conversation time)

ここで、「相手の顔を見ていた時間」は、カメラに相手の顔が映り、顔検出機能により目と口の顔検出ができていた時間とする。
会話を進める中で、予め決めていた閾値よりTedが低い場合は、表示部４０に相手をもっと見る旨を表示する。 Here, “the time when the other party's face was being seen” is the time when the other party's face was reflected on the camera and the face detection function was able to detect the eyes and mouth.
When Ted is lower than a predetermined threshold during the conversation, the display unit 40 displays that the other party is seen more.

会話のトーンが一致している時間:Rst
会話のトーンとは、声の張り具合である。声の張り具合が異なる会話は心理的にお互いの同調感が感じられない。
Rst= (会話のトーンの差の総和)／（加算した回数） Time when conversation tone matches: Rst
The tone of conversation is the tone of the voice. Conversations with different voice tensions are psychologically insensitive to each other.
Rst = (sum of conversation tone differences) / (number of additions)

加算したものを加算回数で割ると、差の平均値が出る。
Rstが大きいほど声量の差があり、会話が巧みでないことを示す。
会話を進める中で、予め決めていた閾値よりRstが大きい場合は、表示部４０に相手の声の調子をまねる旨を表示する。 Divide the sum by the number of additions to get the average difference.
The larger Rst, the greater the difference in voice volume, indicating that the conversation is not skillful.
When Rst is larger than a predetermined threshold during the conversation, the display unit 40 displays that the other party's voice is imitated.

会話のテンポが一致している時間:Rss
会話のテンポとは、発声のスピードである。発声のスピード感が異なる会話は心理的に同調感が感じられない。 Time when conversation tempo matches: Rss
The conversation tempo is the speed of speaking. Conversations with different utterance speeds are not psychologically synchronized.

Rss = (会話のテンポの差の総和)／（加算した回数）
Rssが大きいほどテンポがズレていて、会話が巧みでないことを示す。
会話を進める中で、予め決めていた閾値よりRssが大きい場合は、表示部４０に相手の声の調子をまねる旨を表示する。 Rss = (sum of differences in conversation tempo) / (number of additions)
The larger Rss, the more tempo is shifted, indicating that the conversation is not skillful.
When Rss is larger than a predetermined threshold during the conversation, the display unit 40 displays that the other party's voice is imitated.

同一語の繰り返し発生回数:Nrd
会話である言葉が繰り返される際、両者から同一な言葉が発声されるとその言葉について理解されたと感じる。
Nrd = (同一語の発声回数)／（会話時間）
同一語の発声回数が多い程、相手の話を傾聴し、一度受け止めていると感じさせる。
つまりNrdが大きい程会話が巧みであるとする。
会話を進める中で、予め決めていた閾値よりNrdが小さい場合は、表示部４０に繰り返しをする旨を表示する。 Repeated occurrences of the same word: Nrd
When a word that is a conversation is repeated, if the same word is uttered from both, it is felt that the word has been understood.
Nrd = (Number of utterances of the same word) / (Conversation time)
The more the same word is spoken, the more the listener listens and feels like they are once.
In other words, the larger the Nrd, the better the conversation.
When Nrd is smaller than a predetermined threshold during the conversation, the display unit 40 is displayed to repeat.

これらの値を以下の式で計算して、その１会話の相手受容度（Rc）と会話の巧みさ（Cｃ）を数値化する。
Rcが高い程、会話が受け入れられていることを示し、Ccが高い程会話が巧みだったことを示す。
会話の相手受容度
Rc= a1×Nnr + a2×Ntr + a3×Tsr
会話の巧みさ
Cc= a1×Nnr + a2×Ntr + a3×Tsr + a4×Nnd + a5×Ntd + a6×Ted - a7×Rst - a8×Rss + a9×Nrd
それぞれの項のa1〜a9は、各項の所定の重み付け定数（0以上の値）である。 These values are calculated by the following formula, and the other party's acceptance (Rc) and skill of the conversation (Cc) are quantified.
A higher Rc indicates that the conversation is accepted, and a higher Cc indicates that the conversation is more skillful.
Conversation acceptability of conversation
Rc = a1 × Nnr + a2 × Ntr + a3 × Tsr
Skill of conversation
Cc = a1 × Nnr + a2 × Ntr + a3 × Tsr + a4 × Nnd + a5 × Ntd + a6 × Ted-a7 × Rst-a8 × Rss + a9 × Nrd
A1 to a9 of each term are predetermined weighting constants (values of 0 or more) of each term.

また、音声評価部２１においては、音声認識結果のテキストを形態素解析し，名詞に分類された単語を全て文書として会話ログ３４に記憶する。
以上の処理を会話が終了したと判定されまで継続する。 The voice evaluation unit 21 performs morphological analysis on the text of the voice recognition result, and stores all words classified as nouns in the conversation log 34 as documents.
The above processing is continued until it is determined that the conversation has ended.

前記の会話の評価・記録（ステップ１０６）については、以下の手順で行う。
会話が終了したと判定されると、ウェアラブル端末（会話分析装置）１の制御部２０の会話分析部２５は、前記会話の受容度と会話の巧みさの指標を確定した会話時間で更新し、会話ログ３４の全会話文書を用いて以下の方法で、その会話の特徴語を求める。 The conversation evaluation / recording (step 106) is performed in the following procedure.
When it is determined that the conversation has ended, the conversation analysis unit 25 of the control unit 20 of the wearable terminal (conversation analysis device) 1 updates the conversation acceptance and the conversation skill index with the determined conversation time, Using all the conversation documents in the conversation log 34, the characteristic words of the conversation are obtained by the following method.

文書中の全単語のTF(term frequency)値を計算し、TF値の最も大きい単語をその会話の特徴語としてもよい。TF値は、その文章の中でその単語が出現した頻度を表す。 The TF (term frequency) value of all the words in the document may be calculated, and the word having the largest TF value may be used as the feature word of the conversation. The TF value represents the frequency of occurrence of the word in the sentence.

また、最頻出している単語の代わりに、そのユーザの会話の文書の全単語と調査したい期間の全会話文書を対象として、TF-IDF値を計算し、TF-IDF値の最も高い単語を特徴語として会話記録する。
出現頻度は、TF-IDF（Term Frequency - Inverse Document Frequency：単語の出現頻度−逆出現頻度）値であってもよい。TF-IDFとは、文書中に出現した単語がどのくらい特徴的であるかを識別するための指標をいう。TF(term frequency)は、その文書の中でその単語が出現した頻度を表す。
IDF(inverse document frequency)は、全文書の中でその単語を含む文書数の自然対数で表し、この値が高い程その単語が全文書内で普く頻出していない単語であることを現す。
そして、TF値×IDF値が高い値を持つ単語ほど、重要であるとする。 Also, instead of the most frequently occurring word, calculate the TF-IDF value for all the words in the user's conversation document and all the conversation documents for the period you want to investigate, and select the word with the highest TF-IDF value. Record conversations as feature words.
The appearance frequency may be a TF-IDF (Term Frequency-Inverse Document Frequency: word appearance frequency-inverse appearance frequency) value. TF-IDF is an index for identifying how characteristic a word that appears in a document is. TF (term frequency) represents the frequency of occurrence of the word in the document.
IDF (inverse document frequency) is represented by the natural logarithm of the number of documents including the word in all documents, and the higher this value, the more frequently the word is a word that does not appear frequently in all documents.
A word having a higher value of TF value × IDF value is more important.

また、話題語データベース３３に予め登録しておいた話題語が含まれる場合、その話題語を会話ログ３４に記録する。
以上が１会話のみに着目し、その会話の受容度および会話の巧みさを評価する場合の手順となる。
１つの会話についての会話ログの生成が完了したので、ウェアラブル端末１の会話分析部２５は第１の通信部５０を介して音声評価サーバ３に会話ログを転送する。
会話評価サーバ３の制御部７０は第1の通信部６０から会話ログを受信すると、会話ログを記憶部８０の会話ログデータベース８１に記録する。 When a topic word registered in advance in the topic word database 33 is included, the topic word is recorded in the conversation log 34.
The above is the procedure for focusing on only one conversation and evaluating the acceptance of the conversation and the skill of the conversation.
Since the generation of the conversation log for one conversation is completed, the conversation analysis unit 25 of the wearable terminal 1 transfers the conversation log to the voice evaluation server 3 via the first communication unit 50.
When receiving the conversation log from the first communication unit 60, the control unit 70 of the conversation evaluation server 3 records the conversation log in the conversation log database 81 of the storage unit 80.

１会話の分析結果を記録する会話ログデータベース８１のレコード値を図９に示す。
会話ログは、ユーザID毎の１会話で管理され、会話開始時刻及び会話時間とともに、上述した各指標が記憶されている。
また、TF値等を計算するため、会話中の全名詞のリストが記憶されるとともに、会話中に存在した特徴語のリスト及び話題語のリストが記憶されている。 FIG. 9 shows record values of the conversation log database 81 that records the analysis result of one conversation.
The conversation log is managed in one conversation for each user ID, and each index described above is stored together with the conversation start time and conversation time.
In addition, in order to calculate the TF value and the like, a list of all nouns in the conversation is stored, and a list of feature words and a list of topic words that existed in the conversation are stored.

会話評価サーバ装置３の会話評価部７１では、この会話のログを更に分析し、利用者の全会話を分析し、全利用者のRcとCcと比較（平均や標準偏差）することで、普く会話の受容度の指標Rcと会話の巧みさの指標Ccが高い利用者は、会話が受け入れられており会話が巧みと評価できる。 The conversation evaluation unit 71 of the conversation evaluation server apparatus 3 further analyzes this conversation log, analyzes all the conversations of the users, and compares (averages and standard deviations) the Rc and Cc of all the users. A user who has a high conversation acceptance index Rc and a conversation skill index Cc can accept that the conversation is accepted and that the conversation is skillful.

また、この会話のログの話題語毎に更に分析することで、どのような会話の話題が相手に受け入れられたかを評価できる。
例えば、話題語が含まれる会話記録を参照し、会話の巧みさ度Ccを評価する。
話題語が会話に含まれた全ユーザを検索し、そのユーザ毎に話題語が出現した全会話数でCcの統計値（平均値や標準偏差）を計算する。会話の巧みさ度Ccが比較的高ければ、その話題に対して影響力のあるユーザであるといえる。 Further, by analyzing each topic word of the conversation log, it is possible to evaluate what conversation topic has been accepted by the other party.
For example, referring to a conversation record including a topic word, the skill level Cc of the conversation is evaluated.
All users whose topic words are included in the conversation are searched, and the statistical value (average value or standard deviation) of Cc is calculated by the total number of conversations in which the topic words appear for each user. If conversation skill level Cc is relatively high, it can be said that the user has an influence on the topic.

上述した会話分析システムによれば、ユーザが行った会話の巧みさの評価が可能になるので、例えば以下のようなケースに使用することができる。
使用例１：接客時の店員の会話の巧みさの評価に利用する。店員と顧客の会話が顧客に受け入れられており店員の会話が巧みで巧く進んでいるか店員はウェアラブル端末を使用することでリアルタイムに確認することもでき、店員の管理者はどの店員の会話が巧みだったか、会話評価サーバに問合せすることにより確認することができる。
使用例２：まだウェアラブル端末が一般ユーザに普及していない時にでも、そのウェアラブル端末を使用しているユーザについて、ある話題語に対する他人への影響力を推測することができる。 According to the conversation analysis system described above, the skill of the conversation performed by the user can be evaluated, so that it can be used in the following cases, for example.
Example of use 1: Used to evaluate cleverness of a store clerk during customer service. The store clerk and customer conversations are accepted by the customer, and the store clerk can check in real time by using a wearable device whether the store clerk's conversation is clever and skillful. It can be confirmed by inquiring to the conversation evaluation server whether it was skillful.
Usage example 2: Even when wearable terminals are not yet widespread among general users, it is possible to estimate the influence of a wearable terminal on another topic word for a user using the wearable terminal.

次に、本発明に係る会話分析システムの実施形態の他の例について、図１０〜１４を参照しながら説明する。
この会話分析システムでは、図１０の会話分析装置（ウェアラブル端末）と、図１１の会話評価サーバ装置３を使用することで、インフルエンサー（他人への影響が大きいユーザ）の抽出を行う。
すなわち、この会話分析システムでは、会話の相手を特定し、リアルな人間関係を自動的に推測可能としている。会話相手との人間関係を知るため、利用者自身の他、会話相手も識別するようになっている。 Next, another example of the embodiment of the conversation analysis system according to the present invention will be described with reference to FIGS.
In this conversation analysis system, influencers (users who have a great influence on others) are extracted by using the conversation analysis apparatus (wearable terminal) of FIG. 10 and the conversation evaluation server apparatus 3 of FIG.
That is, in this conversation analysis system, it is possible to automatically identify realistic human relationships by specifying a conversation partner. In addition to the user himself / herself, the conversation partner is also identified in order to know the human relationship with the conversation partner.

この会話分析システムでは、会話分析装置（ウェアラブル端末）１を利用者自身の他、会話の相手も使用していることが前提となる。会話の両者がウェアラブル端末１を使用することで、システムで一意なユーザ識別子（ユーザID）を各利用者に付与する。ウェアラブル端末１は会話中の相手のユーザIDを知ることで、図１〜４の会話分析システムにおける相手側の「頷き」や「笑顔」の度合いがどのユーザから得られたのか具体的に知ることができる。 In this conversation analysis system, it is assumed that the conversation analysis apparatus (wearable terminal) 1 is used not only by the user itself but also by a conversation partner. By using the wearable terminal 1 for both conversations, a unique user identifier (user ID) is given to each user in the system. The wearable terminal 1 knows the user ID of the other party during the conversation, and specifically knows from which user the degree of “swing” and “smile” on the other side in the conversation analysis system of FIGS. Can do.

図１０の会話分析装置（ウェアラブル端末）１と、図１１の会話評価サーバ装置３を利用した会話分析システムにおいて、図３及び図４と同一構成を採る部分については同一符号を付している。以下、図３及び図４の会話分析システムと異なる構成を中心に説明する。
この会話分析システムのウェアラブル端末１では、ユーザIDをウェアラブル端末間で交換するための第２の通信部５１を備える。
具体的には、IR(赤外線)通信のIrDA規格であるIrSS（登録商標）を用いても良く、赤外線の指向性を制御して、例えば正面中心から左右30°（合計60°）指向性で送受信する。また、赤外線通信の代わり超音波通信でも良いし、GHz帯以上の指向性を持つ電波無線通信でも良い。 In the conversation analysis system using the conversation analysis apparatus (wearable terminal) 1 in FIG. 10 and the conversation evaluation server apparatus 3 in FIG. 11, parts having the same configurations as those in FIG. 3 and FIG. In the following, a description will be given focusing on a configuration different from the conversation analysis system of FIGS.
The wearable terminal 1 of this conversation analysis system includes a second communication unit 51 for exchanging user IDs between wearable terminals.
Specifically, IrSS (registered trademark), which is an IrDA standard for IR (infrared) communication, may be used. For example, by controlling the directivity of infrared light, for example, 30 ° left and right (total 60 °) directivity from the front center. Send and receive. Also, ultrasonic communication may be used instead of infrared communication, and radio wave communication having directivity of GHz band or higher may be used.

また、図９に対応する会話ログデータベースのレコード値（図１２）では、相手ユーザを特定するための相手ユーザIDが追加されている。
ユーザIDは、ウェアラブル端末１に予め製造シリアル番号を記録して使用しても良いし、ユーザ登録の際にシステムで一意になるような任意なコードを登録しても良い。
第２の通信部５１を用いることで、図１３に示すように、互いに対面した時に相手側のウェアラブル端末１との通信可能範囲における直接的な通信が可能となり、ID交換し会話の相手を自動的に特定することができる。 Further, in the record value (FIG. 12) of the conversation log database corresponding to FIG. 9, a partner user ID for identifying the partner user is added.
The user ID may be recorded and used in advance on the wearable terminal 1, or an arbitrary code that is unique in the system at the time of user registration may be registered.
By using the second communication unit 51, as shown in FIG. 13, when facing each other, direct communication within the communicable range with the wearable terminal 1 on the other side is possible, and ID exchange is performed to automatically communicate with the other party. Can be identified.

この会話分析システムの会話評価サーバ装置３では、制御部１０がユーザIDから会話ログを分析して利用者の人間関係を管理するための人間関係管理機能７２を備えている。また、記憶部３０は、利用者の人間関係を紐づけて記憶する人間関係データベース８２を備えている。
人間関係管理機能では、会話の相手を特定することで互いのウェアラブル端末で計測した会話の巧みさ指標Ccを算出するための指標を、会話ログを記録する毎に自身のIDと相手のユーザIDと会話開始時刻から照合し会話ログが一致するものであれば、それぞれの自身の笑顔の時間（Tsd:図１２）も相手のウェアラブル端末の計測結果の会話ログから知ることができ、指針のデータとして補完することができる。 The conversation evaluation server device 3 of this conversation analysis system includes a human relationship management function 72 for the control unit 10 to analyze a conversation log from a user ID and manage a user's human relationship. The storage unit 30 also includes a human relationship database 82 that stores the user's human relationships in association with each other.
In the human relationship management function, each time a conversation log is recorded, an ID for calculating the skill level Cc of the conversation measured by each wearable terminal by specifying the conversation partner, its own ID and the other party's user ID If the conversation logs match each other from the conversation start time, their own smile time (Tsd: Fig. 12) can also be known from the conversation log of the measurement result of the other wearable terminal, and the guideline data Can be complemented as

さらに、図１４に示すような、ユーザ同士のリアルな人間関係マップを自動的に構築する。
図１４の人間関係マップでは、会話ログの自身のIDと相手のIDとの結び付き度を、分析したい任意の期間の会話ログにおけるユーザ間の総会話時間でリンクの太さで示し、リンクの方向を全Rcの差で示している。また、受け入れ度である相手受容度（Rc）が大きい方へ情報が流れていることを意味する。
例えば、図１４のID:W1とID:W2のユーザを例に説明すると、他と比較してユーザ同士のリンクが太いので強い結び付きがあり、矢印の向きがユーザW2の方向がより大きいので、ユーザW2がユーザW1をより受け入れている（受容度が高い）ことを示している。また、情報の流れがユーザW2の方向にあるものと推測できる。 Further, a real human relationship map between users as shown in FIG. 14 is automatically constructed.
In the human relationship map of FIG. 14, the degree of connection between the ID of the conversation log and the ID of the other party is indicated by the total conversation time between users in the conversation log for an arbitrary period to be analyzed, and the link direction. Is shown by the difference of total Rc. In addition, it means that information is flowing toward the person with the higher acceptance level (Rc).
For example, if the user of ID: W1 and ID: W2 in FIG. 14 is described as an example, since the link between the users is thicker than others, there is a strong connection, and the direction of the arrow is larger in the direction of the user W2, This shows that the user W2 has more accepted the user W1 (higher acceptance). Further, it can be estimated that the information flow is in the direction of the user W2.

また、人間関係マップにおいては、ある話題毎にフィルタリングして俯瞰することもできる。
このように、全ユーザの人間関係を図１４のようにマッピングして、ある話題でフィルタリングすることで、その話題で中心的なユーザを見つけることができる。即ち、リアルな人間関係における特定の話題に関するインフルエンサーを見つけることができる。 In addition, in the human relationship map, it is possible to provide a bird's-eye view by filtering for each topic.
Thus, by mapping the human relationships of all users as shown in FIG. 14 and filtering on a certain topic, it is possible to find a central user on that topic. That is, an influencer related to a specific topic in a real human relationship can be found.

上述した会話分析装置によれば、人同士のリアルな会話を分析し、ある分野のインフルエンサーを抽出することができるので、インフルエンサーとされた利用者に対してダイレクトメール等で話題語に関連する最新情報や広告を送りこむこと等で、そのユーザの周りのリアルな人間関係を通じて情報伝達が期待できる。 According to the conversation analysis device described above, it is possible to analyze a real conversation between people and extract influencers in a certain field, so it is related to topic words by direct mail etc. for users who are regarded as influencers By sending the latest information and advertisements to be transmitted, information transmission can be expected through real human relations around the user.

また、店舗接客において会話分析装置を使用することで、店員と客とのリアルなミュニケーションについて、店員がどれだけ客に認められて納得される会話を実施したかを計測できるので、定員の客観的な接客力を判定することができる。 In addition, by using a conversation analysis device at the store customer service, it is possible to measure how much the store clerk recognizes and convinces with the customer about the realistic communication between the store clerk and the customer. Can determine the customer service.

１…会話分析装置（ウェアラブル端末）、２…インターネット、３…会話評価サーバ装置、５…テンプル部、１０…センサ部、１１…マイクロフォン、１１ａ…相手用マイク、１１ｂ…ユーザ用マイク、１２…カメラ、１３…ジャイロ・加速度センサ、１４…装着センサ、２０…制御部、２１…音声評価部、２２…映像評価部、２３…頷き認識部、２４…装着認識部、２５…会話分析部、３０…記憶部、３１…利用者登録データ部、３２…重要語データベース、３３…話題語データベース、３４…会話ログ、４０…表示部、５０…第１の通信部、５１…第２の通信部、６０…第１の通信部、７０…制御部、７１…会話評価部、７２…人間関係管理部、８０…記憶部、８１…会話ログデータベース、８２…人間関係データベース。 DESCRIPTION OF SYMBOLS 1 ... Conversation analysis apparatus (wearable terminal), 2 ... Internet, 3 ... Conversation evaluation server apparatus, 5 ... Temple part, 10 ... Sensor part, 11 ... Microphone, 11a ... Microphone for other party, 11b ... Microphone for user, 12 ... Camera 13 ... Gyro / acceleration sensor, 14 ... Mounting sensor, 20 ... Control unit, 21 ... Speech evaluation unit, 22 ... Video evaluation unit, 23 ... Glow recognition unit, 24 ... Mounting recognition unit, 25 ... Conversation analysis unit, 30 ... Storage unit 31 User registration data unit 32 Key word database 33 Topic word database 34 Conversation log 40 Display unit 50 First communication unit 51 Second communication unit 60 ... 1st communication part, 70 ... Control part, 71 ... Conversation evaluation part, 72 ... Human relationship management part, 80 ... Memory | storage part, 81 ... Conversation log database 82 ... human relations database.

Claims

A wearable device that is worn on the user's head to detect human conversations,
An important word database storing a plurality of important words for extracting important words uttered in the conversation;
A microphone for collecting voice information of the conversation;
A voice evaluation unit for recognizing important words from the voice information;
A conversation analysis unit that analyzes a partner's acceptability for a conversation with a predetermined weighting from the recognition result of the voice evaluation unit;
A notification unit that feeds back the acceptance to the user;
Conversation analyzer characterized by comprising.

A camera that collects the facial expression of the conversation partner and the movement of the head as a video,
A video evaluation unit for recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video,
The conversation analysis unit performs a predetermined weighting from the recognition results of the voice evaluation unit and the video evaluation unit, and analyzes a partner's acceptability for the conversation.
The conversation analysis device according to claim 1 .

The notification unit, conversation analysis apparatus according to claim 1 or claim 2 which is constituted by the audio reproduction unit that can be recognized by hearing the acceptance.

The notification unit, conversation analysis apparatus according to claim 1 or claim 2 which is composed of a display device capable of recognizing the acceptance visually.

The voice evaluation unit has a function of analyzing the tone and / or tempo of conversation,
The conversation analysis unit has a function of analyzing the skill of the conversation by summing up the difference in tone and / or tempo of the conversation between the other party and the user and giving a certain weight.
The conversation analysis apparatus according to claim 1 , wherein the notification unit has a function of feeding back the skill of the analyzed conversation to a user.

An acceleration sensor for recognizing the movement of the user's head;
A whisker recognition unit that analyzes the number of whistling per unit conversation time of the user from the acceleration sensor,
The conversation analysis apparatus according to claim 5 , wherein the conversation analysis unit analyzes the skill of the conversation by applying a predetermined weight to the recognition by the whisper recognition unit.

The conversation analysis unit, the user and the mating of both claim 5 or analyzing the skillfulness of the conversation with the ability to analyze by a constant weighting aggregates the repetition of the words uttered in the conversation Item 7. The conversation analysis device according to item 6 .

The analysis by the conversation analysis unit analyzes a feature word using an index (TF-IDF) for identifying how characteristic the word that appears in the conversation is from the speech recognition result of one conversation. The conversation analysis apparatus according to any one of claims 1 to 7 , further comprising a storage unit that records each conversation together with the acceptance level and / or skill of the conversation.

A conversation analysis apparatus according to any one of claims 1 to 8 , and a conversation evaluation server apparatus that collects and manages conversation logs of the conversation analysis apparatus,
The conversation evaluation server device
A control unit that analyzes a conversation log of all users of the conversation analyzer and evaluates a conversation;
A storage unit for storing the conversation log in a conversation log database;
A conversation analysis system characterized by comprising:

  The conversation analysis device comprises a topic word database for storing a plurality of topic words for influencer extraction for a specific topic uttered during the conversation,
  The voice evaluation unit further recognizes a topic word from the voice information,
  The storage unit records the recognized topic word for each conversation in a conversation log,
  The control unit of the conversation evaluation server device extracts a conversation log including a topic word to be evaluated when analyzing a conversation log of a plurality of users and performing a conversation evaluation on the topic word.
  Extract users who have an influence on the topic word as influencers when the acceptance or skill of the conversation is higher than the average value from the multiple users of the extracted conversation logs
The conversation analysis system according to claim 9.

  The conversation analysis device comprises a communication unit for exchanging an identifier of a user of the user at the start of conversation and an identifier of a user who is a conversation partner,
  The control unit of the conversation evaluation server analyzes the conversation log, further records the identifier of the user who is a conversation partner and the conversation time, and the total conversation time for the conversation log between the identifiers assigned to the user. It is equipped with a human relationship management function that tabulates the degree of connection between users and the degree of acceptance, and manages the magnitude relationship between users as the user's human relationship,
  The conversation analysis system according to claim 10, wherein the storage unit of the conversation evaluation server includes a human relationship database that stores the human relationships of the users in association with each other.

A method for analyzing and evaluating a conversation by information processing using computer software for a conversation analysis system including a conversation analysis device and a conversation evaluation server,
A procedure for recognizing preset topic words and important words from voice information generated during conversation between a user and a conversation partner;
Acquiring the facial expression of the conversation partner and the movement of the head as a video, and recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video;
A procedure for analyzing the degree of acceptance of the conversation partner with respect to the topic of conversation from each recognition result;
Feeding back the acceptance to the user;
Conversation analysis method characterized by including.

A method for analyzing and evaluating a conversation by information processing using computer software for a conversation analysis system including a conversation analysis device and a conversation evaluation server,
A procedure for recognizing preset topic words and important words from voice information generated during conversation between a user and a conversation partner;
Acquiring the facial expression of the conversation partner and the movement of the head as a video, and recognizing the smile time and number of whispering of the conversation partner per unit conversation time from the video;
A procedure for counting the difference in tone and / or tempo of conversation between the conversation partner and the user;
A procedure for analyzing the skill of the conversation with respect to the topic of conversation by performing a predetermined weighting from each recognition result and the total result,
A procedure for feeding back the skill of the conversation to the user;
Conversation analysis method characterized by including.

Conversation of the conversation partner with respect to the topic of the analyzed conversation of the identifier for identifying the user given in advance and the recognized topic word and important word for the conversation between the plurality of users and each conversation partner Alternatively, the skill of conversation is stored as a conversation log, and the plurality of conversation logs are aggregated and managed by the conversation evaluation server device, and
When analyzing the conversation log of the plurality of users and performing conversation evaluation on the topic word, a procedure for extracting a conversation log including the topic word to be evaluated;
A procedure for extracting a user who has an influence on the topic word as an influencer when the conversation acceptance or skill of the conversation is higher than an average value from a plurality of users of the extracted conversation log;
The conversation analysis method according to claim 12 or 13, characterized by comprising:

  A procedure for exchanging the identifier of the user and the identifier of the user who is the conversation partner at the start of the conversation;
  In the conversation log of a plurality of users that are aggregated and managed by the conversation evaluation server, a procedure for further recording an identifier and a conversation time of a user who is a conversation partner;
  For the conversation log between the identifiers given to the user, a procedure for storing the degree of connection between the users who totaled the total conversation time, and the degree of acceptance and the magnitude relationship between the users as a human relationship database;
  A procedure for extracting information of the human relation database by a topic to be evaluated;
  A procedure for displaying the extracted human relationship as a human relationship map;
The conversation analysis method according to claim 14, further comprising:

The conversation analysis program characterized by constructing each procedure of the conversation analysis method of any one of Claims 12 thru | or 15 on a computer.