JP2023178102A

JP2023178102A - Information display control device and information display method

Info

Publication number: JP2023178102A
Application number: JP2022091170A
Authority: JP
Inventors: 純一糟谷; Junichi Kasuya; 麻梨子楠美; Mariko Kusumi; 博一小池; Hiroichi Koike; 功大中村; Masahiro Nakamura; 雄大櫻田; Takehiro SAKURADA; 廉渡辺; Ren WATANABE; 早紀内田; Saki Uchida
Original assignee: JVCKenwood Corp
Current assignee: JVCKenwood Corp
Priority date: 2022-06-03
Filing date: 2022-06-03
Publication date: 2023-12-14

Abstract

To appropriately display reference information from a keyword based on contents of speeches made by a plurality of persons.SOLUTION: A control device 20 comprises: a speech recognition unit 21 which recognizes speeches of a plurality of participants participating in a communication with at least speeches; an identification unit 22 which identifies a main speaker among the participants; an extraction unit 26 which extracts a keyword needing reference information out of the speeches recognized by the speech recognition unit 21; an information acquisition unit 27 which acquires the reference information for the keyword extracted by the extraction unit 26; and a display control unit 28 which displays the reference information acquired by the information acquisition unit 27. The display control unit 28 displays a condition for displaying the reference condition by weighting the keyword based on a speech made by the main speaker.SELECTED DRAWING: Figure 3

Description

本発明は、情報表示制御装置および情報表示方法に関する。 The present invention relates to an information display control device and an information display method.

会議や会話において、相手が発言した語句の付加的情報を表示する技術が知られている（例えば、特許文献１参照）。 2. Description of the Related Art In a meeting or conversation, a technique for displaying additional information on words and phrases uttered by the other party is known (for example, see Patent Document 1).

特開２０２１－００７００３号公報JP 2021-007003 Publication

複数人数が発言する会議または会話などにおいては、様々な発言者の発言に基づく付加情報などの参照情報が多数表示される。このような場合、表示された情報を見ることにより、会議または会話における発言の聴取が疎かになるおそれがある。 In a meeting or conversation where multiple people speak, a large amount of reference information such as additional information based on the comments of various speakers is displayed. In such a case, viewing the displayed information may cause the user to ignore what is being said in the meeting or conversation.

本開示は、上記に鑑みてなされたものであって、複数人数での発話内容に基づくキーワードから、適切に参照情報を表示させることを目的とする。 The present disclosure has been made in view of the above, and an object of the present disclosure is to appropriately display reference information from keywords based on the content of utterances by a plurality of people.

上述した課題を解決し、目的を達成するために、本開示に係る情報表示制御装置は、少なくとも発話によるコミュニケーションに参加している複数の参加者の発話を認識する発話認識部と、前記複数の参加者における、主となる発言者を特定する特定部と、前記発話認識部が認識した発話から、参照情報を要するキーワードを抽出する抽出部と、前記抽出部が抽出した前記キーワードに対する前記参照情報を取得する情報取得部と、前記情報取得部が取得した前記参照情報を表示する表示制御部と、を備え、前記表示制御部は、前記参照情報を表示する条件を、前記主となる発言者の発話に基づくキーワードに対して重み付けを行って表示する。 In order to solve the above-mentioned problems and achieve the purpose, an information display control device according to the present disclosure includes at least a speech recognition unit that recognizes the utterances of a plurality of participants participating in speech communication; an identification unit that identifies a main speaker in a participant; an extraction unit that extracts a keyword requiring reference information from the utterance recognized by the utterance recognition unit; and reference information for the keyword extracted by the extraction unit. and a display control unit that displays the reference information acquired by the information acquisition unit, and the display control unit sets conditions for displaying the reference information according to the main speaker. Keywords based on utterances are weighted and displayed.

本開示に係る情報表示方法は、少なくとも発話によるコミュニケーションに参加している複数の参加者の発話を認識する発話認識ステップと、前記複数の参加者における、主となる発言者を特定する特定ステップと、前記発話認識ステップで認識した発話から、参照情報を要するキーワードを抽出する抽出ステップと、前記抽出ステップで抽出した前記キーワードに対する前記参照情報を取得する情報取得ステップと、前記情報取得ステップで取得した前記参照情報を表示する表示ステップと、を含み、前記表示ステップにおいては、前記参照情報を表示する条件を、前記主となる発言者の発話に基づくキーワードに対して重み付けを行って表示する。 The information display method according to the present disclosure includes at least a speech recognition step of recognizing utterances of a plurality of participants participating in speech communication, and a specifying step of identifying a main speaker among the plurality of participants. , an extraction step of extracting a keyword requiring reference information from the utterance recognized in the utterance recognition step, an information acquisition step of acquiring the reference information for the keyword extracted in the extraction step, and an information acquisition step of acquiring the reference information for the keyword extracted in the extraction step; a displaying step of displaying the reference information; in the displaying step, conditions for displaying the reference information are displayed by weighting keywords based on utterances of the main speaker.

本開示によれば、複数人数での発話内容に基づくキーワードから、適切に参照情報を表示させることができるという効果を奏する。 According to the present disclosure, it is possible to appropriately display reference information based on keywords based on the content of utterances by a plurality of people.

図１は、情報表示装置の使用例の一例を示す概略図である。FIG. 1 is a schematic diagram showing an example of how an information display device is used. 図２は、情報表示装置の使用例の他の例を示す概略図である。FIG. 2 is a schematic diagram showing another example of how the information display device is used. 図３は、第一実施形態に係る情報表示制御装置を有する情報表示装置の構成例を示すブロック図である。FIG. 3 is a block diagram showing a configuration example of an information display device having an information display control device according to the first embodiment. 図４は、キーワード情報の一例を示す図である。FIG. 4 is a diagram showing an example of keyword information. 図５は、キーワードの必要性レベルを説明する図である。FIG. 5 is a diagram illustrating the necessity level of keywords. 図６は、第一実施形態に係る情報表示制御装置における処理の流れの一例を示すフローチャートである。FIG. 6 is a flowchart showing an example of the flow of processing in the information display control device according to the first embodiment. 図７は、第二実施形態に係る情報表示制御装置における処理の流れの一例を示すフローチャートである。FIG. 7 is a flowchart illustrating an example of the flow of processing in the information display control device according to the second embodiment. 図８は、第三実施形態に係る情報表示制御装置における処理の流れの一例を示すフローチャートである。FIG. 8 is a flowchart showing an example of the flow of processing in the information display control device according to the third embodiment.

以下に添付図面を参照して、本開示に係る情報表示制御装置および情報表示方法の実施形態を詳細に説明する。なお、以下の実施形態により本発明が限定されるものではない。 Embodiments of an information display control device and an information display method according to the present disclosure will be described in detail below with reference to the accompanying drawings. Note that the present invention is not limited to the following embodiments.

［第一実施形態］
＜情報表示装置＞
図１は、情報表示装置の使用例の一例を示す概略図である。図２は、情報表示装置の使用例の他の例を示す概略図である。図３は、第一実施形態に係る情報表示制御装置（以下、「制御装置」という。）２０を有する情報表示装置１０の構成例を示すブロック図である。情報表示装置１０は、例えば、会議または会話のような、少なくとも発話によるコミュニケーションにおいて、参加者の発話に含まれるキーワードの参照情報を表示する。 [First embodiment]
<Information display device>
FIG. 1 is a schematic diagram showing an example of how an information display device is used. FIG. 2 is a schematic diagram showing another example of how the information display device is used. FIG. 3 is a block diagram showing a configuration example of an information display device 10 having an information display control device (hereinafter referred to as "control device") 20 according to the first embodiment. The information display device 10 displays reference information of keywords included in utterances of participants in at least utterance communication such as a meeting or conversation.

図１に示すように、情報表示装置１０は、複数人数の参加者が１か所に集まって行われる会議（以下、「リアル会議」という。）または会話において使用される。この場合に使用される情報表示装置１０は、例えば、参加者が使用するスマートフォンのような情報端末装置の機能の１つとして実装されていてもよい。図１に示すリアル会議は、発表者Ｍ１１と、他の参加者Ｍ１２、他の参加者Ｍ１３、他の参加者Ｍ１４および他の参加者Ｍ１５が参加している。他の参加者Ｍ１５は、情報表示装置１０を使用している。 As shown in FIG. 1, the information display device 10 is used in a conference (hereinafter referred to as a "real conference") or conversation in which a plurality of participants gather in one place. The information display device 10 used in this case may be implemented as one of the functions of an information terminal device such as a smartphone used by the participant, for example. The real conference shown in FIG. 1 includes a presenter M11, another participant M12, another participant M13, another participant M14, and another participant M15. Another participant M15 is using the information display device 10.

図２に示すように、情報表示装置１０は、複数人数の参加者が参加して、オンラインで行われる会議（以下、「ＷＥＢ会議」という。）または会話において使用される。この場合に使用される情報表示装置１０は、例えば、図示しないＷＥＢ会議システムを介してオンラインのＷＥＢ会議または会話に参加する際に使用されるノート型のパーソナルコンピュータやスマートフォンのような情報端末装置の機能の１つとして実装されていてもよい。 As shown in FIG. 2, the information display device 10 is used in an online conference (hereinafter referred to as a "WEB conference") or conversation in which a plurality of participants participate. The information display device 10 used in this case is, for example, an information terminal device such as a notebook personal computer or a smartphone used when participating in an online web conference or conversation via a web conference system (not shown). It may be implemented as one of the functions.

以下の説明では、リアル会議またはＷＥＢ会議のどちらかの会議において使用される場合について説明する。リアル会議とＷＥＢ会議との区別を特に要しない場合、会議という。 In the following description, a case will be described in which the system is used in either a real conference or a web conference. When there is no particular need to distinguish between a real conference and a web conference, they are referred to as a conference.

情報表示装置１０は、会議において、複数人数での発話内容に基づくキーワードを、主発言者の発話に対して重み付けを行って抽出する。情報表示装置１０は、図３に示すように、マイクロフォン１１と、表示部１２と、キーワード情報記憶部１８と、通信部１９と、制御装置２０とを備える。ＷＥＢ会議に使用する情報表示装置１０は、さらに図示しないカメラを備える。 The information display device 10 extracts keywords based on the content of utterances by a plurality of people in a conference by weighting the utterances of the main speaker. The information display device 10 includes a microphone 11, a display section 12, a keyword information storage section 18, a communication section 19, and a control device 20, as shown in FIG. The information display device 10 used for the web conference further includes a camera (not shown).

マイクロフォン１１は、情報表示装置１０を使用している会議の参加者の音声を収音するマイクロフォンである。マイクロフォン１１は、収音した音声データを制御装置２０の発話認識部２１に出力する。図１に示すように、リアル会議に使用される情報表示装置１０である場合、マイクロフォン１１は、発言者の音声を収音する。図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、マイクロフォン１１は、情報表示装置１０を使用する参加者自身の音声を収音する。 The microphone 11 is a microphone that picks up the voices of participants in the conference who are using the information display device 10 . The microphone 11 outputs the collected audio data to the speech recognition unit 21 of the control device 20 . As shown in FIG. 1, in the case of an information display device 10 used for a real conference, a microphone 11 picks up the voice of a speaker. As shown in FIG. 2, when the information display device 10 is used for a web conference, the microphone 11 picks up the voices of the participants using the information display device 10.

表示部１２は、例えば、液晶ディスプレイまたは有機ＥＬ（Ｅｌｅｃｔｒｏ－Ｌｕｍｉｎｅｓｃｅｎｃｅ）ディスプレイなどを含むディスプレイである。表示部１２は、制御装置２０の表示制御部２８から出力された映像信号に基づいて、映像を表示する。 The display unit 12 is a display including, for example, a liquid crystal display or an organic EL (Electro-Luminescence) display. The display unit 12 displays video based on the video signal output from the display control unit 28 of the control device 20.

表示部１２には、会議の参加者の発話に含まれるキーワードに対応する参照情報が表示される。表示部１２において参照情報が表示される範囲を、参照情報表示範囲１１０という。図１に示すように、リアル会議に使用される情報表示装置１０である場合、参照情報表示範囲１１０は、表示部１２の全面に表示されてもよい。図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、参照情報表示範囲１１０は、表示部１２の一部、例えば、下部、上部、左側部または右側部に表示されてもよい。ＷＥＢ会議に使用される情報表示装置１０である場合、参照情報表示範囲１１０は、表示部１２にポップアップ表示されてもよい。参照情報表示範囲１１０は、例えば、３行以下程度の文字が表示される範囲である。 The display unit 12 displays reference information corresponding to keywords included in the utterances of conference participants. The range in which reference information is displayed on the display unit 12 is referred to as a reference information display range 110. As shown in FIG. 1, in the case of the information display device 10 used for a real conference, the reference information display range 110 may be displayed on the entire surface of the display unit 12. As shown in FIG. 2, in the case of the information display device 10 used for a web conference, the reference information display range 110 is displayed on a part of the display unit 12, for example, at the bottom, top, left side, or right side. Good too. In the case of the information display device 10 used for a web conference, the reference information display range 110 may be displayed as a pop-up on the display unit 12. The reference information display range 110 is, for example, a range in which about three lines or less of characters are displayed.

図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、表示部１２は、参加者の映像、および、画面共有された資料などが表示される。表示部１２は、例えば、主となる発言者（以下、「主発言者」という。）の映像１０１が、他の参加者の映像１０２、映像１０３より大きく表示される。 As shown in FIG. 2, in the case of the information display device 10 used for a web conference, the display unit 12 displays images of participants, screen-shared materials, and the like. The display unit 12 displays, for example, a video 101 of a main speaker (hereinafter referred to as "main speaker") larger than videos 102 and 103 of other participants.

キーワード情報記憶部１８は、キーワード情報として、キーワードごとに参照情報を記憶する。キーワード情報記憶部１８は、例えば、ＲＡＭ（ＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ）、フラッシュメモリなどの半導体メモリ素子などの記録媒体である。キーワード情報記憶部１８は、通信部１９を介して無線接続される外部記録部であってもよい。 The keyword information storage unit 18 stores reference information for each keyword as keyword information. The keyword information storage unit 18 is, for example, a recording medium such as a semiconductor memory device such as a RAM (Random Access Memory) or a flash memory. The keyword information storage section 18 may be an external recording section wirelessly connected via the communication section 19.

キーワード情報記憶部１８は、キーワード情報として、キーワードごとに、必要性レベルおよび参照情報を記憶する。必要性レベルとは、キーワード毎に対する参照情報の表示が必要となる指針を示すレベルであり、例えば、難解性レベルおよび周知性レベルなどを含む。 The keyword information storage unit 18 stores, as keyword information, a necessity level and reference information for each keyword. The necessity level is a level indicating a guideline for displaying reference information for each keyword, and includes, for example, a difficulty level and a well-known level.

キーワードは、補足的に説明を行うことが必要とされる単語である。キーワードは、例えば、一般的に知られていない語句、専門用語、略語、難解語および流行語などである。難解性レベルは、難解語であるキーワードの難解性を示す。難解性レベルは、例えば、１から５までの数値によって５段階で示される。難解性レベルは、難解性が高いほど、つまり難解な単語であるほど数値が大きくなる。周知性レベルは、一般的に知られていない語句、専門用語、略語などのキーワードにおける周知性を示す。周知性レベルは、例えば、１から５までの数値によって５段階で示される。周知性レベルは、周知性が高いほど、つまり周知されている単語であるほど数値が大きくなる。 Keywords are words that require supplementary explanation. Keywords include, for example, words that are not generally known, technical terms, abbreviations, difficult words, and buzzwords. The difficulty level indicates the difficulty of a keyword that is a difficult word. For example, the difficulty level is indicated in five stages using numerical values from 1 to 5. Regarding the difficulty level, the higher the difficulty, that is, the more difficult the word is, the larger the numerical value becomes. The popularity level indicates the popularity of a keyword, such as a phrase, a technical term, or an abbreviation that is not generally known. The well-known level is indicated by a numerical value from 1 to 5, for example, in five stages. As for the well-known level, the higher the well-known word, that is, the more well-known the word, the larger the numerical value becomes.

参照情報は、例えば、キーワードの意味、具体的な説明説明である。キーワードが略語の場合、参照情報は、略していない正式名称でもよい。なお、説明は、文字以外に、例えば、写真または図などでもよい。参照情報は、例えば、キーワードの付加的な情報である付加情報でもよい。 The reference information is, for example, the meaning of the keyword and a specific explanation. If the keyword is an abbreviation, the reference information may be the full, abbreviated name. Note that the explanation may be, for example, a photograph or a diagram in addition to text. The reference information may be, for example, additional information that is additional information about the keyword.

参照情報は、人が数秒程度で読むことが可能な、例えば１００文字以下程度の文字数としてもよい。 The reference information may have a number of characters, for example, about 100 characters or less, which can be read by a person in about a few seconds.

図４は、キーワード情報の一例を示す図である。図４に示すキーワード情報は、キーワードごとに、必要性レベルが記憶され、キーワードに基づき、参照情報が検索される。参照情報は、後述する通信部１９を用いて検索される情報に加えて、予め記憶されていてもよい。例えば、キーワード「○○○」については、難解性レベル「５」、周知性レベル「１」、参照情報「○・・・○」と記憶されている。例えば、キーワード「△△△」については、難解性レベル「３」、周知性レベル「５」、参照情報「△・・・△」と記憶されている。 FIG. 4 is a diagram showing an example of keyword information. In the keyword information shown in FIG. 4, a necessity level is stored for each keyword, and reference information is searched based on the keyword. The reference information may be stored in advance in addition to information searched using the communication unit 19, which will be described later. For example, for the keyword "○○○", difficulty level "5", well-known level "1", and reference information "○...○" are stored. For example, for the keyword "△△△", difficulty level "3", well-known level "5", and reference information "△...△" are stored.

図４の例においては、例えば、様々なキーワードに対して５段階の必要性レベルが対応付けられている。この場合において、必要性レベルの一例である難解性レベルは、難解性レベル「１」が、難解性が低いことを示しており、言い換えると、分かりやすいキーワードである。また、難解性レベル「５」が、難解性が高いことを示しており、言い換えると、容易に理解できないキーワードである。また、必要性レベルの一例である周知性レベルは、周知性レベル「１」が、周知性が低いことを示しており、言い換えると、一般的に周知されていないキーワードである。また、周知性レベル「５」が、周知性が高いことを示しており、言い換えると、一般的に良く周知されているキーワードである。 In the example of FIG. 4, for example, five levels of necessity are associated with various keywords. In this case, regarding the difficulty level, which is an example of the necessity level, a difficulty level of "1" indicates that the difficulty is low; in other words, it is an easy-to-understand keyword. Further, the difficulty level "5" indicates that the keyword is highly difficult to understand, in other words, it is a keyword that cannot be easily understood. Furthermore, regarding the well-known level, which is an example of the necessity level, a well-known level "1" indicates that the well-known level is low. In other words, it is a keyword that is not generally known. Further, a well-known level of "5" indicates that the keyword is highly well-known; in other words, it is a keyword that is generally well-known.

参照情報が、通信部１９によってＷＥＢ上で検索される場合は、例えば、キーワードに対するウィキペディア（登録商標）のような特定のＷＥＢメディアによる情報、キーワードに関する最新のニュースなどが検索される。また、キーワードが商品名やサービス名である場合は、その商品やサービスを提供する提供元のＷＥＢサイトなどが検索される。さらには、ＷＥＢ上のサーチエンジンによって検索された結果に対し、広告を除く最上位に表示される内容を参照情報としてもよい。 When reference information is searched on the web by the communication unit 19, for example, information from a specific web media such as Wikipedia (registered trademark) for a keyword, the latest news regarding the keyword, etc. are searched. Furthermore, if the keyword is a product name or service name, the website of the provider that provides the product or service is searched. Furthermore, the content displayed at the top of the results searched by a search engine on the web, excluding advertisements, may be used as reference information.

通信部１９は、無線通信などを行うための通信ユニットである。通信部１９は、例えば、Ｗｉ－Ｆｉ（登録商標）や携帯電話回線などを用いた通信方法で通信を行い、イントラネットやインターネットに接続される。通信部１９は、制御装置２０の通信制御部２９によって通信を制御される。図１に示すように、リアル会議に使用される情報表示装置１０である場合、通信部１９は、外部記録部であるキーワード情報記憶部１８からキーワード情報を取得や、参照情報の検索を行う。図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、通信部１９は、他の参加者が使用する情報表示装置１０との間で、参加者の音声および映像を相互に通信するとともに、参照情報の検索を行う。より詳しくは、通信部１９は、マイクロフォン１１で収音した参加者の音声、および、カメラで撮影した参加者の映像を、他の参加者が使用する情報表示装置１０へ送信する。通信部１９は、他の参加者が使用する情報表示装置１０から、他の参加者の音声および映像を受信する。 The communication section 19 is a communication unit for performing wireless communication and the like. The communication unit 19 communicates using a communication method using, for example, Wi-Fi (registered trademark) or a mobile phone line, and is connected to an intranet or the Internet. Communication of the communication unit 19 is controlled by a communication control unit 29 of the control device 20 . As shown in FIG. 1, in the case of the information display device 10 used for a real conference, the communication section 19 acquires keyword information from the keyword information storage section 18, which is an external recording section, and searches for reference information. As shown in FIG. 2, in the case of the information display device 10 used for a web conference, the communication unit 19 communicates audio and video of participants with the information display device 10 used by other participants. and searches for reference information. More specifically, the communication unit 19 transmits the participant's voice captured by the microphone 11 and the participant's video captured by the camera to the information display device 10 used by other participants. The communication unit 19 receives audio and video of other participants from the information display device 10 used by the other participants.

以下の説明において、入力された音声とは、リアル会議に使用される情報表示装置１０である場合、マイクロフォン１１で収音した音声である。入力された音声とは、ＷＥＢ会議に使用される情報表示装置１０である場合、マイクロフォン１１で収音した音声、および、通信部１９を介して取得した、他の参加者が使用する情報表示装置１０から取得した他の参加者の音声である。 In the following description, the input voice is the voice picked up by the microphone 11 in the case of the information display device 10 used for a real conference. Inputted audio includes, in the case of the information display device 10 used for the web conference, the audio collected by the microphone 11 and the information display device used by other participants acquired via the communication unit 19. This is the voice of another participant obtained from 10.

＜情報表示制御装置＞
制御装置２０は、情報表示装置１０の各部を制御する。制御装置２０は、例えば、ＣＰＵ（ＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）やＭＰＵ（ＭｉｃｒｏＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）などで構成された演算処理装置（制御装置）であり、ＲＡＭ又はＲＯＭ（ＲｅａｄＯｎｌｙＭｅｍｏｒｙ）などの記憶装置を有する。制御装置２０は、記憶されているプログラムをメモリにロードして、プログラムに含まれる命令を実行する。制御装置２０には上述したＲＡＭなどの内部メモリが含まれ、内部メモリは制御装置２０におけるデータの一時記憶などに用いられる。制御装置２０は、プログラムの実行によって実現される機能ブロックなどの構成要素として、発話認識部２１と、特定部２２と、抽出部２６と、情報取得部２７と、表示制御部２８と、通信制御部２９とを有する。 <Information display control device>
The control device 20 controls each part of the information display device 10. The control device 20 is, for example, an arithmetic processing device (control device) configured with a CPU (Central Processing Unit), an MPU (Micro Processing Unit), etc., and has a storage device such as a RAM or a ROM (Read Only Memory). The control device 20 loads a stored program into memory and executes instructions included in the program. The control device 20 includes an internal memory such as the above-mentioned RAM, and the internal memory is used for temporary storage of data in the control device 20. The control device 20 includes a speech recognition section 21, an identification section 22, an extraction section 26, an information acquisition section 27, a display control section 28, and a communication control section, as components such as functional blocks realized by executing a program. 29.

発話認識部２１は、少なくとも発話によるコミュニケーションに参加している複数の参加者の発話を認識する。発話認識部２１は、入力された音声を分析し、音声に含まれる発話内容を認識する。発話認識部２１は、入力された音声に対して、音素毎または単語毎の音響モデル分析を行い、音素モデルや言語モデルとの対比を行うことで、発話内容を認識する。 The utterance recognition unit 21 recognizes at least the utterances of a plurality of participants participating in communication through speech. The speech recognition unit 21 analyzes the input speech and recognizes the content of the speech contained in the speech. The utterance recognition unit 21 performs acoustic model analysis for each phoneme or word on input speech, and recognizes the content of the utterance by comparing the acoustic model with a phoneme model and a language model.

特定部２２は、複数の参加者における、主発言者を特定する。特定部２２は、例えば、ＷＥＢ会議またはリアル会議における、ファシリテータ、発表者、リーダー、または、会議の主催者のような主たる参加者を主発言者として特定してもよい。 The specifying unit 22 specifies the main speaker among the plurality of participants. The specifying unit 22 may specify, for example, a main participant such as a facilitator, a presenter, a leader, or a conference organizer in a web conference or a real conference as the main speaker.

特定部２２は、入力された音声から、例えば、ＷＥＢ会議またはリアル会議において最初に発言した人、または、最初に例えば１０秒以上のように所定時間以上発言した人を、主発言者として特定してもよい。特定部２２は、入力された音声から、例えば、リアル会議の場合、リアル会議において最初に発言した人の声の特徴を声紋分析などによって特定して、主発言者として特定してもよい。 The identifying unit 22 identifies, from the input audio, the person who first spoke in the web conference or real conference, or the person who first spoke for a predetermined period of time, such as 10 seconds or more, as the main speaker. It's okay. For example, in the case of a real conference, the identification unit 22 may identify the characteristics of the voice of the person who spoke first in the real conference from the input voice by voiceprint analysis, and identify the person as the main speaker.

特定部２２は、例えば、ＷＥＢ会議の場合、ＷＥＢ会議システムから取得したＷＥＢ会議の開催情報から、ＷＥＢ会議を招集した人を主発言者として特定してもよい。特定部２２は、リアル会議の場合、参加者のメール情報またはスケジュール情報などからリアル会議の開催情報を取得して、主発言者を特定してもよい。 For example, in the case of a web conference, the identifying unit 22 may identify the person who convened the web conference as the main speaker from the web conference holding information acquired from the web conference system. In the case of a real conference, the identification unit 22 may acquire holding information of the real conference from participants' email information or schedule information, and identify the main speaker.

特定部２２は、ＷＥＢ会議の場合、ＷＥＢ会議システムから取得した情報から、画面共有、つまり発言するための資料などの表示を共有している人を主発言者として特定してもよい。 In the case of a web conference, the identifying unit 22 may identify the person who is sharing the screen, that is, the display of materials for speaking, as the main speaker from the information acquired from the web conference system.

特定部２２は、ＷＥＢ会議の場合、ＷＥＢ会議システムから取得した情報から、発言するために発言ボタンまたは挙手ボタンなどの所定の操作を行った人を、主発言者として特定してもよい。 In the case of a web conference, the identification unit 22 may identify, as the main speaker, a person who has performed a predetermined operation such as a speak button or a raise hand button in order to speak from information acquired from the web conference system.

特定部２２は、ＷＥＢ会議の場合、図２に示す映像１０１のように、最も大きく映像が表示されている他の参加者を、主発言者として特定してもよい。 In the case of a web conference, the identification unit 22 may identify the other participant whose image is displayed the largest, such as the image 101 shown in FIG. 2, as the main speaker.

特定部２２は、リアル会議の場合、カメラが撮影した参加者の映像から、立ち上がっている人、挙手した人、または、ホワイトボードまたはプロジェクタによって投影された投影画面の前に立っている人などを主発言者として特定してもよい。 In the case of a real conference, the identification unit 22 identifies people who are standing up, people who have raised their hands, people who are standing in front of a whiteboard or a projection screen projected by a projector, etc. from images of participants captured by a camera. It may also be specified as the main speaker.

抽出部２６は、発話認識部２１が認識した発話から、参照情報を要するキーワードを抽出する。抽出部２６は、キーワードを抽出する条件を、主発言者の発話に対して重み付けを行って抽出する。より詳しくは、抽出部２６は、主発言者の発話を、主発言者以外の他の発言者の発話よりも優先して、言い換えると、主発言者の発話の重みを重くしてキーワードを抽出する。 The extraction unit 26 extracts keywords requiring reference information from the utterance recognized by the utterance recognition unit 21. The extraction unit 26 extracts conditions for extracting keywords by weighting the utterances of the main speaker. More specifically, the extraction unit 26 extracts keywords by giving priority to the utterances of the main speaker over the utterances of other speakers other than the main speaker, in other words, giving higher weight to the utterances of the main speaker. do.

主発言者は、他の発言者に比べて、会議の議題に沿った、重要な発言を行うことが多い。そこで、主発言者の発話を、主発言者以外の他の発言者の発話よりも優先してキーワードを抽出することにより、会議の内容の理解に重要なキーワードを優先的に抽出する。 Compared to other speakers, the main speaker often makes important statements that are in line with the agenda of the meeting. Therefore, by extracting keywords by giving priority to the utterances of the main speaker over the utterances of other speakers other than the main speaker, keywords important for understanding the content of the meeting are preferentially extracted.

抽出部２６は、抽出部２６によって抽出されるキーワードが、特定部２２によって特定された主発言者の声の特徴に一致するか否かによって、抽出されたキーワードが主発言者による発話であるか否かを判定すればよい。 The extraction unit 26 determines whether the extracted keyword is uttered by the main speaker depending on whether the keyword extracted by the extraction unit 26 matches the voice characteristics of the main speaker identified by the identification unit 22. All you have to do is decide whether or not.

本実施形態では、抽出部２６は、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルに応じて、参照情報を要するキーワードとして抽出してもよい。より詳しくは、抽出部２６は、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルが所定の閾値以上である場合、参照情報を要するキーワードとして抽出する。抽出部２６は、主発言者の発話に基づくキーワードである場合、主発言者以外の他の発言者に対して必要性レベルの閾値を低くする。 In the present embodiment, the extraction unit 26 may extract keywords that require reference information depending on the level of necessity of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21. More specifically, when the necessity level of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21 is equal to or higher than a predetermined threshold, the extraction unit 26 extracts the keyword as a keyword requiring reference information. If the keyword is based on the utterance of the main speaker, the extraction unit 26 lowers the necessity level threshold for speakers other than the main speaker.

図５は、キーワードにおける参照情報の必要性レベルおよび必要レベルの一例である難解性レベルと周知性レベルを説明する図である。抽出部２６は、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルが閾値ｔ１以上である場合、参照情報を要するキーワードとして抽出する。抽出部２６は、主発言者の発話に基づくキーワードである場合、抽出したキーワードに対する参照情報の必要性レベルが閾値ｔ２（ｔ２＜ｔ１）以上である場合、参照情報を要するキーワードとして抽出する。例えば、必要性レベルが図５に示すように５段階に分類されている場合、抽出部２６は、主発言者の発話に基づくキーワードに対して、必要性レベルが例えば３以上である場合、参照情報を要するキーワードとして抽出し、主発言者以外の発話に基づくキーワードに対して、必要性レベルが例えば４以上である場合、参照情報を要するキーワードとして抽出する。 FIG. 5 is a diagram illustrating a difficulty level and a well-known level, which are examples of the necessity level and necessity level of reference information in a keyword. If the necessity level of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21 is equal to or higher than the threshold t1, the extraction unit 26 extracts the keyword as a keyword requiring reference information. In the case of a keyword based on the utterance of the main speaker, the extraction unit 26 extracts the keyword as a keyword requiring reference information if the necessity level of reference information for the extracted keyword is equal to or higher than a threshold value t2 (t2<t1). For example, when the necessity level is classified into five stages as shown in FIG. A keyword is extracted as a keyword requiring information, and if the necessity level is, for example, 4 or higher for a keyword based on an utterance by a person other than the main speaker, the keyword is extracted as a keyword requiring reference information.

本実施形態では、抽出部２６は、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルとして難解性レベルが所定閾値以上である場合、参照情報を要するキーワードとして抽出してもよい。例えば、難解性レベルが図５に示すように５段階に分類されている場合、抽出部２６は、主発言者の発話に基づくキーワードに対して、難解性レベルが例えば３以上である場合、参照情報を要するキーワードとして抽出し、主発言者以外の発話に基づくキーワードに対して、難解性レベルが例えば４以上である場合、参照情報を要するキーワードとして抽出する。言い換えると、主発言者の発話に基づくキーワードにおいては、難解性レベルが低いキーワード、つまり、より難解ではないキーワードも含めて、参照情報を要するキーワードとする。 In the present embodiment, when the difficulty level as the necessity level of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21 is equal to or higher than a predetermined threshold, the extraction unit 26 extracts the keyword as a keyword requiring reference information. Good too. For example, when the difficulty level is classified into five levels as shown in FIG. The keyword is extracted as a keyword requiring information, and if the difficulty level is, for example, 4 or higher for a keyword based on an utterance by a person other than the main speaker, the keyword is extracted as a keyword requiring reference information. In other words, keywords based on the utterances of the main speaker are keywords that require reference information, including keywords with a low difficulty level, that is, keywords that are less difficult to understand.

本実施形態では、抽出部２６は、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルとして周知性レベルが所定閾値未満である場合、参照情報を要するキーワードとして抽出してもよい。例えば、周知性レベルが図５に示すように５段階に分類されている場合、抽出部２６は、周知性レベルが例えば３未満である場合、参照情報を要するキーワードとして抽出する。抽出部２６は、主発言者の発話に基づくキーワードに対して、周知性レベルが例えば３以下である場合、参照情報を要するキーワードとして抽出し、主発言者以外の発話に基づくキーワードに対して、周知性レベルが例えば２以下である場合、参照情報を要するキーワードとして抽出する。言い換えると、主発言者の発話に基づくキーワードにおいては、周知性レベルが高いキーワード、つまり、より周知されているキーワードも含めて、参照情報を要するキーワードとする。 In the present embodiment, when the well-known level as the necessity level of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21 is less than a predetermined threshold, the extraction unit 26 extracts the keyword as a keyword requiring reference information. Good too. For example, when the well-known level is classified into five levels as shown in FIG. 5, the extraction unit 26 extracts the keyword as requiring reference information if the well-known level is less than 3, for example. The extraction unit 26 extracts a keyword based on an utterance by a person other than the main speaker as a keyword requiring reference information when the well-known level is, for example, 3 or less, and extracts the keyword based on an utterance by a person other than the main speaker as a keyword that requires reference information. If the well-known level is, for example, 2 or less, the keyword is extracted as requiring reference information. In other words, keywords based on the utterances of the main speaker are keywords that require reference information, including keywords with a high level of well-known knowledge, that is, keywords that are more well-known.

図５において、難解性レベルが閾値ｔ１以上であることは、必要性レベルが閾値ｔ１以上であることに対応し、難解性レベルが閾値ｔ２以上であることは、必要性レベルが閾値ｔ２以上であることに対応する。また、周知性レベルが閾値ｔ１以下であることは、必要性レベルが閾値ｔ１以上であることに対応し、周知性レベルが閾値ｔ２以下であることは、必要性レベルが閾値ｔ２以上であることに対応する。 In FIG. 5, the difficulty level being greater than or equal to the threshold t1 corresponds to the necessity level being greater than or equal to the threshold t1, and the difficulty level being greater than or equal to the threshold t2 means that the necessity level is greater than or equal to the threshold t2. respond to something. Furthermore, the fact that the well-known level is below the threshold t1 corresponds to the necessity level being above the threshold t1, and the well-known level below the threshold t2 means that the necessity level is above the threshold t2. corresponds to

情報取得部２７は、抽出部２６が抽出したキーワード、つまり、抽出部２６が参照情報を要すると判断したキーワードに対する参照情報を取得する。本実施形態では、情報取得部２７は、ＷＥＢ上の情報に基づき、キーワードに対する参照情報を取得する。 The information acquisition unit 27 acquires reference information for the keyword extracted by the extraction unit 26, that is, the keyword for which the extraction unit 26 determines that reference information is required. In this embodiment, the information acquisition unit 27 acquires reference information for keywords based on information on the web.

表示制御部２８は、表示部１２における映像の表示を制御する。表示制御部２８は、表示部１２に対して映像の映像信号を出力する。 The display control unit 28 controls the display of images on the display unit 12. The display control unit 28 outputs a video signal of a video to the display unit 12.

表示制御部２８は、情報取得部２７が取得した参照情報の表示部１２への表示を制御する。表示制御部２８は、抽出部２６によって抽出されたキーワードに対応する参照情報を表示部１２の参照情報表示範囲１１０に表示するための映像信号を出力する。表示制御部２８は、参照情報を、例えば５秒から１０秒間程度の所定期間表示する。表示制御部２８は、参照情報を表示中に、次の参照情報を表示する場合、所定期間の経過前でも次の参照情報に表示を変えてもよい。表示制御部２８は、参照情報を表示中に、次の参照情報を表示する場合、例えば、必要性レベルが高い参照情報を表示してもよい。 The display control unit 28 controls the display of the reference information acquired by the information acquisition unit 27 on the display unit 12. The display control unit 28 outputs a video signal for displaying the reference information corresponding to the keyword extracted by the extraction unit 26 in the reference information display range 110 of the display unit 12. The display control unit 28 displays the reference information for a predetermined period of time, for example, about 5 seconds to 10 seconds. When displaying the next reference information while displaying the reference information, the display control unit 28 may change the display to the next reference information even before the predetermined period has elapsed. When displaying the next reference information while displaying the reference information, the display control unit 28 may display reference information with a high necessity level, for example.

ＷＥＢ会議に使用される情報表示装置１０である場合、表示制御部２８は、ＷＥＢ会議の映像の表示を制御する。表示制御部２８は、カメラが撮影している映像を含むＷＥＢ会議の映像の映像信号を出力する。 When the information display device 10 is used for a web conference, the display control unit 28 controls the display of images of the web conference. The display control unit 28 outputs a video signal of the web conference video including the video captured by the camera.

通信制御部２９は、通信部１９による通信を制御する。図１に示すように、リアル会議に使用される情報表示装置１０である場合、通信制御部２９は、通信部１９を介して、外部記録部であるキーワード情報記憶部１８からキーワード情報の受信や、参照情報の検索を行うよう制御する。図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、通信制御部２９は、通信部１９を介して、他の参加者が使用する情報表示装置１０との間で、参加者の音声および映像を相互に通信するとともに、参照情報の検索を行うよう制御する。より詳しくは、通信制御部２９は、通信部１９を介して、マイクロフォン１１で収音した参加者の音声、および、カメラで撮影した参加者の映像を、他の参加者が使用する情報表示装置１０へ送信するよう制御する。通信制御部２９は、通信部１９を介して、他の参加者が使用する情報表示装置１０から、他の参加者の音声および映像を受信するよう制御する。 The communication control unit 29 controls communication by the communication unit 19. As shown in FIG. 1, in the case of the information display device 10 used in a real conference, the communication control unit 29 receives keyword information from the keyword information storage unit 18, which is an external recording unit, via the communication unit 19. , controls to search for reference information. As shown in FIG. 2, when the information display device 10 is used for a web conference, the communication control unit 29 communicates with the information display device 10 used by other participants via the communication unit 19. It communicates the audio and video of the participants with each other and controls the search for reference information. More specifically, the communication control unit 29 transmits, via the communication unit 19, the participant's voice captured by the microphone 11 and the participant's video captured by the camera to an information display device used by other participants. 10. The communication control unit 29 controls the communication unit 19 to receive audio and video of other participants from the information display device 10 used by the other participants.

＜情報表示制御装置における情報処理＞
次に、図６を用いて、制御装置２０における処理の流れについて説明する。図６に示す処理は、図１に示すように、リアル会議に使用される情報表示装置１０である場合、参照情報を表示させるためのアプリケーションを起動することで開始される。図２に示すように、ＷＥＢ会議に使用される情報表示装置１０である場合、ＷＥＢ会議を行うアプリケーションの起動、またはＷＥＢ会議の開始によって開始される。この場合、ＷＥＢ会議を行うアプリケーションに、参照情報を表示させる機能が含まれていてもよく、ユーザの操作によって、参照情報を表示させる機能のオン／オフを選択可能としてもよい。情報表示装置１０が起動されることで、図６に示すフローチャートの処理が開始される。 <Information processing in the information display control device>
Next, the flow of processing in the control device 20 will be explained using FIG. 6. The process shown in FIG. 6 is started by starting an application for displaying reference information in the case of the information display device 10 used for a real conference as shown in FIG. As shown in FIG. 2, in the case of the information display device 10 used for a web conference, the process starts by activating an application for holding a web conference or by starting a web conference. In this case, the application for conducting the web conference may include a function for displaying reference information, and the function for displaying reference information may be turned on or off by user operation. When the information display device 10 is activated, the processing of the flowchart shown in FIG. 6 is started.

制御装置２０は、主発言者を特定する（ステップＳ１０１）。より詳しくは、制御装置２０は、特定部２２によって、例えば、入力された音声から、ファシリテータ、発表者、リーダー、または、会議の主催者のような主たる参加者を主発言者として特定する。なお、主発言者を特定する方法はこれに限定されない。制御装置２０は、ステップＳ１０２へ進む。 The control device 20 identifies the main speaker (step S101). More specifically, the control device 20 uses the specifying unit 22 to specify, for example, a main participant such as a facilitator, a presenter, a leader, or a conference organizer as the main speaker from the input voice. Note that the method for identifying the main speaker is not limited to this. The control device 20 proceeds to step S102.

制御装置２０は、発話認識を開始する（ステップＳ１０２）。より詳しくは、制御装置２０は、発話認識部２１によって、入力された音声を分析し、音声に含まれる発話内容を認識する。制御装置２０は、ステップＳ１０３へ進む。 The control device 20 starts speech recognition (step S102). More specifically, the control device 20 uses the speech recognition unit 21 to analyze the input speech and recognize the speech content included in the speech. The control device 20 proceeds to step S103.

制御装置２０は、キーワードが抽出されたか否かを判定する（ステップＳ１０３）。より詳しくは、制御装置２０は、抽出部２６によって、発話認識部２１が認識した発話から、参照情報を要するキーワードを抽出する。制御装置２０は、抽出部２６によって、発話認識部２１が認識した発話から抽出したキーワードに対する参照情報の必要性レベルが所定の閾値以上である場合、参照情報を要するキーワードとして抽出する。制御装置２０は、キーワードが抽出されたと判定する場合（ステップＳ１０３でＹｅｓ）、ステップＳ１０４へ進む。制御装置２０は、キーワードが抽出されたと判定しない場合（ステップＳ１０３でＮｏ）、ステップＳ１０８へ進む。 The control device 20 determines whether a keyword has been extracted (step S103). More specifically, the control device 20 uses the extraction unit 26 to extract keywords requiring reference information from the utterance recognized by the utterance recognition unit 21. If the necessity level of reference information for the keyword extracted from the utterance recognized by the utterance recognition unit 21 is equal to or higher than a predetermined threshold, the control device 20 extracts the keyword as a keyword requiring reference information by the extraction unit 26 . If the control device 20 determines that the keyword has been extracted (Yes in step S103), the process proceeds to step S104. If the control device 20 does not determine that the keyword has been extracted (No in step S103), the process proceeds to step S108.

キーワードが抽出されたと判定する場合（ステップＳ１０３でＹｅｓ）、制御装置２０は、抽出されたキーワードは主発言者による発話であるか否かを判定する（ステップＳ１０４）。より詳しくは、制御装置２０は、抽出部２６によって、抽出されたキーワードが、特定部２２により特定された主発言者の声の特徴に一致するか否かに基づいて、抽出されたキーワードが主発言者による発話であるか否かを判定する。制御装置２０は、抽出されたキーワードは主発言者による発話であると判定する場合（ステップＳ１０４でＹｅｓ）、ステップＳ１０６へ進む。制御装置２０は、抽出されたキーワードは主発言者による発話であると判定しない場合（ステップＳ１０４でＮｏ）、ステップＳ１０５へ進む。 When determining that the keyword has been extracted (Yes in step S103), the control device 20 determines whether the extracted keyword is uttered by the main speaker (step S104). More specifically, the control device 20 causes the extraction unit 26 to determine whether the extracted keyword is the main speaker based on whether or not the extracted keyword matches the voice characteristics of the main speaker specified by the identification unit 22. It is determined whether the utterance is by the speaker. When the control device 20 determines that the extracted keyword is uttered by the main speaker (Yes in step S104), the process proceeds to step S106. If the control device 20 does not determine that the extracted keyword is uttered by the main speaker (No in step S104), the process proceeds to step S105.

抽出されたキーワードは主発言者による発話であると判定しない場合（ステップＳ１０４でＮｏ）、制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ１以上であるか否かを判定する（ステップＳ１０５）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報の必要性レベルを取得する。このように、制御装置２０は、抽出部２６によって、主発言者の発話に基づくキーワードである場合、主発言者以外の他の発言者に対して閾値を低くして参照情報を要するキーワードとして抽出する。制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ１以上であると判定する場合（ステップＳ１０５でＹｅｓ）、ステップＳ１０７へ進む。制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ１以上であると判定しない場合（ステップＳ１０５でＮｏ）、ステップＳ１０８へ進む。 If it is not determined that the extracted keyword is uttered by the main speaker (No in step S104), the control device 20 determines whether the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold value t1. (Step S105). More specifically, the control device 20 uses the information acquisition unit 27 to acquire the necessity level of reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . In this manner, when the keyword is based on the utterance of the main speaker, the control device 20 uses the extraction unit 26 to lower the threshold value for speakers other than the main speaker and extract the keyword as requiring reference information. do. When the control device 20 determines that the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold value t1 (Yes in step S105), the process proceeds to step S107. If the control device 20 does not determine that the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold value t1 (No in step S105), the process proceeds to step S108.

抽出されたキーワードは主発言者による発話であると判定する場合（ステップＳ１０４でＹｅｓ）、制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ２以上であるか否かを判定する（ステップＳ１０６）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報の必要性レベルを取得する。制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ２以上であると判定する場合（ステップＳ１０６でＹｅｓ）、ステップＳ１０７へ進む。制御装置２０は、抽出されたキーワードの参照情報の必要性レベルが閾値ｔ２以上であると判定しない場合（ステップＳ１０６でＮｏ）、ステップＳ１０８へ進む。 When determining that the extracted keyword is uttered by the main speaker (Yes in step S104), the control device 20 determines whether the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold t2. (Step S106). More specifically, the control device 20 uses the information acquisition unit 27 to acquire the necessity level of reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . When the control device 20 determines that the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold value t2 (Yes in step S106), the process proceeds to step S107. If the control device 20 does not determine that the necessity level of the reference information of the extracted keyword is equal to or higher than the threshold t2 (No in step S106), the process proceeds to step S108.

制御装置２０は、参照情報を取得して、所定期間表示する（ステップＳ１０７）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報を取得する。制御装置２０は、表示制御部２８によって、抽出部２６によって抽出されたキーワードに対応する参照情報の映像信号を所定期間出力する。制御装置２０は、ステップＳ１０８へ進む。 The control device 20 acquires the reference information and displays it for a predetermined period of time (step S107). More specifically, the control device 20 uses the information acquisition unit 27 to acquire reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . The control device 20 causes the display control unit 28 to output a video signal of reference information corresponding to the keyword extracted by the extraction unit 26 for a predetermined period of time. The control device 20 proceeds to step S108.

制御装置２０は、処理を終了するか否かを判定する（ステップＳ１０８）。例えば、情報表示装置１０の電源がＯＦＦにされたことや、図示しない操作部の操作がされたことなどで、図６に示すフローチャートを開始するトリガとなったアプリケーションなどを終了させる操作が行われたことで、処理を終了することが判定される。制御装置２０は、処理を終了すると判定された場合（ステップＳ１０８でＹｅｓ）、本処理を終了する。制御装置２０は、処理を終了すると判定されない場合（ステップＳ１０８でＮｏ）、ステップＳ１０３の処理を再度実行する。 The control device 20 determines whether to end the process (step S108). For example, the information display device 10 may be powered off or an operation unit (not shown) may be operated to terminate the application that triggered the start of the flowchart shown in FIG. 6. As a result, it is determined that the process is to be terminated. When the control device 20 determines to end the process (Yes in step S108), it ends the process. If the control device 20 does not determine to end the process (No in step S108), it executes the process in step S103 again.

ステップＳ１０５、ステップＳ１０６において、必要性レベルとして難解性レベルが所定閾値以上であるか否かを条件にする場合について説明する。難解性レベルを示す数値が大きいほど難解となる５段階で、様々なキーワードに難解性レベルが対応付けられている場合、ステップＳ１０５においては、制御装置２０は、抽出されたキーワードの参照情報の難解性レベルが４以上であるか否かを判定する。ステップＳ１０６においては、制御装置２０は、抽出されたキーワードの参照情報の難解性レベルが３以上であるか否かを判定する。 A case will be described in which, in steps S105 and S106, the necessity level is set to be whether the difficulty level is equal to or higher than a predetermined threshold. If difficulty levels are associated with various keywords in 5 levels, the larger the numerical value indicating the difficulty level, the more difficult it is, in step S105, the control device 20 controls the difficulty level of the reference information of the extracted keyword. It is determined whether the sex level is 4 or higher. In step S106, the control device 20 determines whether the difficulty level of the extracted keyword reference information is 3 or higher.

ステップＳ１０５、ステップＳ１０６において、必要性レベルとして周知性レベルが所定閾値未満であるか否かを条件にする場合について説明する。周知性レベルを示す数値が大きいほど周知となる５段階で、様々なキーワードに周知性レベルが対応付けられている場合、ステップＳ１０５においては、制御装置２０は、抽出されたキーワードの参照情報の周知性レベルが２以下であるか否かを判定する。ステップＳ１０６においては、制御装置２０は、抽出されたキーワードの参照情報の周知性レベルが３以下であるか否かを判定する。 A case will be described in which, in steps S105 and S106, whether or not the well-known level is less than a predetermined threshold is set as the necessity level. If various keywords are associated with a well-known level in five stages, the larger the value indicating the well-known level, the more well-known the keyword is, in step S105, the control device 20 controls the public knowledge of the reference information of the extracted keyword. It is determined whether the sex level is 2 or less. In step S106, the control device 20 determines whether the well-known level of the reference information of the extracted keyword is 3 or less.

＜効果＞
上述したように、本実施形態では、キーワードを抽出する条件を、主発言者の発話に対して重み付けを行って抽出する。本実施形態によれば、複数人数での発話内容に基づくキーワードから、主発言者の発話に対して重み付けを行って適切に参照情報を表示させることができる。本実施形態は、表示される参照情報を絞ることにより、参加者が表示される参照情報の確認に時間を要し、会議における発言の聴取が疎かになる可能性を抑えることができる。 <Effect>
As described above, in this embodiment, the conditions for extracting keywords are extracted by weighting the utterances of the main speaker. According to this embodiment, reference information can be appropriately displayed by weighting the utterances of the main speaker based on keywords based on the content of utterances by a plurality of people. In the present embodiment, by narrowing down the reference information to be displayed, it is possible to reduce the possibility that participants will take time to check the displayed reference information and will not be able to listen to what is being said in the meeting.

本実施形態では、認識した発話から抽出したキーワードに対する参照情報の必要性レベルが所定の閾値以上である場合に参照情報を要するキーワードとして抽出する。本実施形態では、主発言者の発話に基づくキーワードである場合は、主発言者以外の他の発言者に対して閾値を低くしてキーワードを抽出する。本実施形態は、主発言者の発話に基づくキーワードの参照情報を、他の発言者の発話に基づくキーワードの参照情報より優先して表示することができる。本実施形態は、主発言者を優先して適切に参照情報を表示させることができる。 In this embodiment, when the necessity level of reference information for a keyword extracted from a recognized utterance is equal to or higher than a predetermined threshold, the keyword is extracted as a keyword requiring reference information. In this embodiment, if the keyword is based on the utterance of the main speaker, the keyword is extracted by lowering the threshold value for speakers other than the main speaker. In this embodiment, keyword reference information based on the utterance of the main speaker can be displayed with priority over keyword reference information based on the utterances of other speakers. In this embodiment, reference information can be appropriately displayed with priority given to the main speaker.

本実施形態では、認識した発話から抽出したキーワードに対する参照情報の必要性レベルとして難解性レベルが所定閾値以上である場合に、参照情報を要するキーワードとして抽出する。本実施形態は、難解性レベルが高いキーワードの参照情報を適切に表示することができる。 In this embodiment, when the difficulty level as the necessity level of reference information for a keyword extracted from a recognized utterance is equal to or higher than a predetermined threshold, the keyword is extracted as a keyword requiring reference information. This embodiment can appropriately display reference information for keywords with a high level of difficulty.

本実施形態では、認識した発話から抽出したキーワードに対する参照情報の必要性レベルとして周知性レベルが所定閾値以下である場合に、参照情報を要するキーワードとして抽出する。本実施形態は、周知性レベルが低いキーワードの参照情報を適切に表示することができる。 In this embodiment, when the well-known level as the necessity level of reference information for a keyword extracted from a recognized utterance is less than or equal to a predetermined threshold, the keyword is extracted as a keyword requiring reference information. The present embodiment can appropriately display reference information for keywords with a low level of public awareness.

本実施形態においては、主発言者の発話に基づくキーワードと、主発言者以外の他の発言者の発話に基づくキーワードに対して、抽出部２６が、参照情報を要するキーワードを、異なる重み付け条件で抽出する構成とした。本実施形態においては、抽出部２６に代えて、表示制御部２８が同等の機能を有する構成としてもよい。 In this embodiment, the extraction unit 26 extracts keywords requiring reference information using different weighting conditions for keywords based on the utterances of the main speaker and keywords based on the utterances of speakers other than the main speaker. It was configured to extract. In this embodiment, instead of the extraction section 26, the display control section 28 may have an equivalent function.

この場合、表示制御部２８は、キーワードを表示する条件を、主となる発言者の発話に対して重み付けを行って表示させる。表示制御部２８は、例えば、主発言者の発話に基づくキーワードの必要性レベルが閾値ｔ２以上の場合に、参照情報を表示させ、主発言者以外の他の発言者の発話に基づくキーワードの必要性レベルが閾値ｔ１以上の場合に、参照情報を表示させる処理を行う。 In this case, the display control unit 28 weights the utterances of the main speaker to display the keyword display conditions. For example, when the necessity level of the keyword based on the utterance of the main speaker is equal to or higher than the threshold value t2, the display control unit 28 displays reference information, and displays the necessity level of the keyword based on the utterance of other speakers other than the main speaker. When the gender level is equal to or higher than the threshold t1, a process of displaying reference information is performed.

この場合、図６のステップＳ１０７において、抽出部２６は、必要性レベルがｔ２以上のキーワードに対する参照情報を取得する。また、ステップＳ１０７において、表示制御部２８は、ステップＳ１０４がＹｅｓの場合、必要性レベルがｔ２以上のキーワードに対する参照情報を表示させ、ステップＳ１０４がＮｏの場合、必要性レベルがｔ１以上のキーワードに対する参照情報を表示させる。 In this case, in step S107 of FIG. 6, the extraction unit 26 acquires reference information for keywords whose necessity level is t2 or higher. In addition, in step S107, the display control unit 28 displays reference information for keywords whose necessity level is t2 or higher when step S104 is Yes, and displays reference information for keywords whose necessity level is t1 or higher when step S104 is No. Display reference information.

［第二実施形態］
図７を参照しながら、本実施形態に係る情報表示装置１０について説明する。図７は、第二実施形態に係る情報表示制御装置における処理の流れの一例を示すフローチャートである。情報表示装置１０は、基本的な構成は第一実施形態の情報表示装置１０と同様である。以下の説明においては、情報表示装置１０と同様の構成要素には、同一の符号または対応する符号を付し、その詳細な説明は省略する。本実施形態では、情報表示装置１０は、会議において、複数人数での発話内容に基づくキーワードから、主発言者の発話に対して重み付けを行って表示態様を変えて参照情報を表示する。本実施形態では、情報表示装置１０は、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して長い時間表示する。本実施形態では、表示制御部２８における処理が第一実施形態と異なる。 [Second embodiment]
The information display device 10 according to this embodiment will be described with reference to FIG. 7. FIG. 7 is a flowchart illustrating an example of the flow of processing in the information display control device according to the second embodiment. The basic configuration of the information display device 10 is the same as that of the information display device 10 of the first embodiment. In the following description, components similar to those of the information display device 10 are denoted by the same or corresponding symbols, and detailed description thereof will be omitted. In the present embodiment, the information display device 10 displays reference information in a conference by weighting the utterances of the main speaker based on the content of utterances by a plurality of people and changing the display mode. In the present embodiment, the information display device 10 displays reference information for a keyword based on the utterance of the main speaker for a longer time than reference information for a keyword based on the utterances of other speakers. In this embodiment, the processing in the display control unit 28 is different from the first embodiment.

本実施形態では、キーワード情報記憶部１８は、キーワード情報として、キーワードごとに参照情報を記憶していればよい。本実施形態では、キーワード情報は必要性レベルを含まなくてよい。 In this embodiment, the keyword information storage unit 18 only needs to store reference information for each keyword as keyword information. In this embodiment, the keyword information does not need to include the necessity level.

表示制御部２８は、参照情報を表示する条件を、主発言者の発話に基づくキーワードに対して重み付けを行って表示する。より詳しくは、表示制御部２８は、主発言者の発話を、主発言者以外の他の発言者の発話よりも優先して、言い換えると、主発言者の発話の重みを重くしてキーワードの参照情報を表示する。 The display control unit 28 displays the conditions for displaying the reference information by weighting the keywords based on the utterances of the main speaker. More specifically, the display control unit 28 gives priority to the utterances of the main speaker over the utterances of other speakers other than the main speaker, in other words, gives higher weight to the utterances of the main speaker and displays keywords. Display reference information.

本実施形態では、表示制御部２８は、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合と表示態様を変えて表示する。表示制御部２８は、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に比べて、目立つように、または、確認しやすいように表示態様を変えて表示する。 In this embodiment, the display control unit 28 displays the reference information for the keyword based on the utterance of the main speaker in a different display manner than when the reference information is for the keyword based on the utterances of other speakers. . When the display control unit 28 is the reference information for the keyword based on the utterance of the main speaker, the display control unit 28 makes the display more conspicuous or easier to confirm than when the reference information is for the keyword based on the utterances of other speakers. Change the display mode to display.

表示態様を変えるとは、例えば、表示する時間を変えること、または、表示する情報量を変えることである。表示態様を変えるとは、例えば、表示する文字の大きさを変えること、色を変えること、または、参照情報表示範囲１１０の大きさを変えること、表示する位置を変えることでもよい。 Changing the display mode means, for example, changing the display time or changing the amount of information to be displayed. Changing the display mode may include, for example, changing the size or color of the displayed characters, changing the size of the reference information display range 110, or changing the display position.

本実施形態では、表示制御部２８は、参照情報を表示する時間を、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して長い時間表示させる。表示制御部２８は、主発言者の発話に基づくキーワードの参照情報の表示時間を例えば５秒間とし、他の発言者の発話に基づくキーワードの参照情報の表示時間を例えば３秒間として参照情報を表示する。 In the present embodiment, the display control unit 28 determines the time to display the reference information, when the reference information is for a keyword based on the utterance of the main speaker, and when the reference information is for the keyword based on the utterance of another speaker. to be displayed for a long time. The display control unit 28 displays the reference information by setting the display time of the keyword reference information based on the utterance of the main speaker to 5 seconds, for example, and setting the display time of the keyword reference information based on the utterances of other speakers to 3 seconds, for example. do.

主発言者は、他の発言者に比べて、会議の話題に沿った発言のような重要な発言を行うことが多い。そこで、主発言者の発話に基づくキーワードに対する参照情報を、主発言者以外の他の発言者の発話に基づくキーワードに対する参照情報よりも長い期間表示することにより、会議の内容の理解に重要なキーワードをより確認しやすくする。 The main speaker often makes important statements, such as statements that are related to the topic of the meeting, more often than other speakers. Therefore, by displaying reference information for keywords based on the utterances of the main speaker for a longer period of time than reference information for keywords based on the utterances of other speakers other than the main speaker, we can display reference information for keywords based on the utterances of other speakers other than the main speaker. Make it easier to check.

次に、図７を用いて、制御装置２０における処理の流れについて説明する。図７に示すステップＳ１１１からステップＳ１１４、ステップＳ１１７の処理は、図６に示すフローチャートのステップＳ１０１からステップＳ１０４、ステップＳ１０８と同様の処理を行う。 Next, the flow of processing in the control device 20 will be described using FIG. 7. The processing from step S111 to step S114 and step S117 shown in FIG. 7 is the same as the processing from step S101 to step S104 and step S108 of the flowchart shown in FIG.

抽出されたキーワードは主発言者による発話であると判定する場合（ステップＳ１１４でＹｅｓ）、制御装置２０は、参照情報を取得して、第一期間ｔｍ１（ｔｍ１＜ｔｍ２）表示する（ステップＳ１１５）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報を取得する。制御装置２０は、表示制御部２８によって、抽出部２６によって抽出されたキーワードに対応する参照情報の映像信号を第一期間ｔｍ１出力する。制御装置２０は、ステップＳ１１７へ進む。 When determining that the extracted keyword is uttered by the main speaker (Yes in step S114), the control device 20 acquires reference information and displays it for a first period tm1 (tm1<tm2) (step S115). . More specifically, the control device 20 uses the information acquisition unit 27 to acquire reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . The control device 20 causes the display control unit 28 to output a video signal of reference information corresponding to the keyword extracted by the extraction unit 26 for a first period tm1. The control device 20 proceeds to step S117.

抽出されたキーワードは主発言者による発話であると判定しない場合（ステップＳ１１４でＮｏ）、制御装置２０は、参照情報を取得して、第二期間ｔｍ２表示する（ステップＳ１１６）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報を取得する。制御装置２０は、表示制御部２８によって、抽出部２６によって抽出されたキーワードに対応する参照情報の映像信号を第二期間ｔｍ２出力する。制御装置２０は、ステップＳ１１７へ進む。 If it is not determined that the extracted keyword is uttered by the main speaker (No in step S114), the control device 20 acquires the reference information and displays it for the second period tm2 (step S116). More specifically, the control device 20 uses the information acquisition unit 27 to acquire reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . The control device 20 causes the display control unit 28 to output a video signal of reference information corresponding to the keyword extracted by the extraction unit 26 for a second period tm2. The control device 20 proceeds to step S117.

＜効果＞
上述したように、参照情報を表示する条件を、主発言者の発話に基づくキーワードに対して重み付けを行って表示する。本実施形態によれば、複数人数での発話内容に基づくキーワードから、適切に参照情報を表示させることができる。 <Effect>
As described above, the conditions for displaying reference information are displayed by weighting the keywords based on the utterances of the main speaker. According to this embodiment, reference information can be appropriately displayed based on keywords based on the content of utterances by a plurality of people.

本実施形態では、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合と表示態様を変えて表示する。本実施形態は、主発言者の発話に基づくキーワードの参照情報を、他の発言者の発話に基づくキーワードの参照情報に比べて、例えば、確認しやすい表示態様で表示することができる。 In this embodiment, when the reference information is for a keyword based on the utterance of the main speaker, the display format is changed from that when the reference information is for the keyword based on the utterances of other speakers. In this embodiment, the keyword reference information based on the utterance of the main speaker can be displayed in a display format that is easier to confirm, for example, than the keyword reference information based on the utterances of other speakers.

本実施形態では、参照情報を表示する時間を、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して長い時間表示させることができる。 In this embodiment, reference information is displayed for a longer time when the reference information is for a keyword based on the utterance of the main speaker than when the reference information is for a keyword based on the utterances of other speakers. can be done.

［第三実施形態］
図８は、第三実施形態に係る情報表示制御装置における処理の流れの一例を示すフローチャートである。情報表示装置１０は、基本的な構成は第二実施形態の情報表示装置１０と同様である。以下の説明においては、情報表示装置１０と同様の構成要素には、同一の符号または対応する符号を付し、その詳細な説明は省略する。本実施形態では、情報表示装置１０は、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して情報量を多く表示する。本実施形態では、表示制御部２８における処理が第二実施形態と異なる。 [Third embodiment]
FIG. 8 is a flowchart showing an example of the flow of processing in the information display control device according to the third embodiment. The information display device 10 has the same basic configuration as the information display device 10 of the second embodiment. In the following description, components similar to those of the information display device 10 are denoted by the same or corresponding symbols, and detailed description thereof will be omitted. In the present embodiment, the information display device 10 displays a larger amount of information when the reference information is for a keyword based on the utterance of the main speaker than when the reference information is for the keyword based on the utterances of other speakers. do. In this embodiment, the processing in the display control unit 28 is different from the second embodiment.

本実施形態では、キーワード情報記憶部１８は、キーワード情報の参照情報は、例えば、最大情報量ｌｖ１の参照情報と、最大情報量ｌｖ２の参照情報とを記憶してもよい。この場合、例えば、最大情報量ｌｖ１の参照情報は３行程度の参照情報であり、最大情報量ｌｖ２の参照情報は１行程度の参照情報である。 In the present embodiment, the keyword information storage unit 18 may store, for example, reference information for the maximum information amount lv1 and reference information for the maximum information amount lv2 as the reference information for the keyword information. In this case, for example, the reference information of the maximum information amount lv1 is about three lines of reference information, and the reference information of the maximum information amount lv2 is about one line of reference information.

本実施形態では、表示制御部２８は、参照情報を表示する情報量（以下、「最大情報量」という。）を、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して多く表示させる。表示制御部２８は、主発言者の発話に基づくキーワードの参照情報を表示する情報量を例えば最大３行（最大情報量３行）とし、他の発言者の発話に基づくキーワードの参照情報を表示する情報量を例えば最大１行（最大情報量１行）とする。 In the present embodiment, the display control unit 28 sets the amount of information to display reference information (hereinafter referred to as "maximum amount of information") to other utterances when the reference information is for a keyword based on the utterance of the main speaker. If the information is a reference to a keyword based on a person's utterance, it will be displayed more often. The display control unit 28 sets the amount of information for displaying the keyword reference information based on the utterance of the main speaker to a maximum of three lines (maximum amount of information 3 lines), and displays the keyword reference information based on the utterances of other speakers. The amount of information to be displayed is, for example, one line at most (maximum amount of information one line).

表示制御部２８は、例えば、取得した参照情報の先頭から、最大情報量分を表示してもよい。表示制御部２８は、例えば、取得した参照情報のうち、表示する最大情報量に応じて、最大情報量ｌｖ１の参照情報または最大情報量ｌｖ２の参照情報を表示してもよい。 For example, the display control unit 28 may display the maximum amount of information from the beginning of the acquired reference information. For example, the display control unit 28 may display the reference information with the maximum amount of information lv1 or the reference information with the maximum amount of information lv2, depending on the maximum amount of information to be displayed among the acquired reference information.

主発言者は、他の発言者に比べて、会議の話題に沿った発言のような重要な発言を行うことが多い。そこで、主発言者の発話に基づくキーワードに対する参照情報を、主発言者以外の他の発言者の発話に基づくキーワードに対する参照情報よりも情報量を多く表示することにより、会議の内容の理解に重要なキーワードの理解をより深めることが可能になる。 The main speaker often makes important statements, such as statements that are related to the topic of the meeting, more often than other speakers. Therefore, by displaying reference information for keywords based on the main speaker's utterances in a larger amount than reference information for keywords based on the utterances of other speakers other than the main speaker, we can display information that is important for understanding the content of the meeting. It is possible to deepen the understanding of keywords.

次に、図８を用いて、制御装置２０における処理の流れについて説明する。図８に示すステップＳ１２１からステップＳ１２４、ステップＳ１２７の処理は、図６に示すフローチャートのステップＳ１０１からステップＳ１０４、ステップＳ１０８と同様の処理を行う。 Next, the flow of processing in the control device 20 will be described using FIG. 8. The processing from step S121 to step S124 and step S127 shown in FIG. 8 is the same as the processing from step S101 to step S104 and step S108 of the flowchart shown in FIG. 6.

抽出されたキーワードは主発言者による発話であると判定する場合（ステップＳ１２４でＹｅｓ）、制御装置２０は、参照情報を取得して、最大情報量ｌｖ１（ｌｖ１＜ｌｖ２）を表示する（ステップＳ１２５）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報を取得する。制御装置２０は、表示制御部２８によって、抽出部２６によって抽出されたキーワードに対応する、最大情報量ｌｖ１の参照情報の映像信号を出力する。制御装置２０は、ステップＳ１２７へ進む。 When determining that the extracted keyword is uttered by the main speaker (Yes in step S124), the control device 20 acquires reference information and displays the maximum information amount lv1 (lv1<lv2) (step S125). ). More specifically, the control device 20 uses the information acquisition unit 27 to acquire reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . The control device 20 causes the display control unit 28 to output a video signal of reference information with a maximum information amount lv1 corresponding to the keyword extracted by the extraction unit 26. The control device 20 proceeds to step S127.

抽出されたキーワードは主発言者による発話であると判定しない場合（ステップＳ１２４でＮｏ）、制御装置２０は、参照情報を取得して、最大情報量ｌｖ２（ステップＳ１２６）。より詳しくは、制御装置２０は、情報取得部２７によって、キーワード情報記憶部１８のキーワード情報から、抽出部２６が抽出したキーワードに対する参照情報を取得する。制御装置２０は、表示制御部２８によって、抽出部２６によって抽出されたキーワードに対応する、最大情報量ｌｖ２の参照情報の映像信号を出力する。制御装置２０は、ステップＳ１２７へ進む。 If it is not determined that the extracted keyword is uttered by the main speaker (No in step S124), the control device 20 acquires the reference information and sets the maximum information amount lv2 (step S126). More specifically, the control device 20 uses the information acquisition unit 27 to acquire reference information for the keyword extracted by the extraction unit 26 from the keyword information in the keyword information storage unit 18 . The control device 20 causes the display control unit 28 to output a video signal of reference information with a maximum information amount lv2 corresponding to the keyword extracted by the extraction unit 26. The control device 20 proceeds to step S127.

＜効果＞
上述したように、本実施形態では、参照情報を表示する情報量を、主発言者の発話に基づくキーワードに対する参照情報である場合は、他の発言者の発話に基づくキーワードに対する参照情報である場合に対して多く表示させることができる。 <Effect>
As described above, in this embodiment, the amount of information to display the reference information is divided into two types: when the reference information is for a keyword based on the utterance of the main speaker, and when it is the reference information for the keyword based on the utterances of other speakers. can be displayed in large numbers.

本開示に係る情報表示装置は、上述した実施形態以外にも種々の異なる形態にて実施されてもよい。 The information display device according to the present disclosure may be implemented in various different forms other than the embodiments described above.

図示した情報表示装置の各構成要素は、機能概念的なものであり、必ずしも物理的に図示の如く構成されていなくてもよい。すなわち、各装置の具体的形態は、図示のものに限られず、各装置の処理負担や使用状況などに応じて、その全部または一部を任意の単位で機能的または物理的に分散または統合してもよい。 Each component of the illustrated information display device is functionally conceptual, and does not necessarily have to be physically configured as illustrated. In other words, the specific form of each device is not limited to what is shown in the diagram, and all or part of it may be functionally or physically distributed or integrated into arbitrary units depending on the processing load and usage status of each device. It's okay.

情報表示装置の構成は、例えば、ソフトウェアとして、メモリにロードされたプログラムなどによって実現される。上記実施形態では、これらのハードウェアまたはソフトウェアの連携によって実現される機能ブロックとして説明した。すなわち、これらの機能ブロックについては、ハードウェアのみ、ソフトウェアのみ、または、それらの組み合わせによって種々の形で実現できる。 The configuration of the information display device is realized by, for example, a program loaded into a memory as software. The above embodiments have been described as functional blocks realized by cooperation of these hardware or software. That is, these functional blocks can be realized in various forms using only hardware, only software, or a combination thereof.

上記した構成要素には、当業者が容易に想定できるもの、実質的に同一のものを含む。さらに、上記した構成は適宜組み合わせが可能である。また、本発明の要旨を逸脱しない範囲において構成の種々の省略、置換または変更が可能である。 The above-mentioned components include those that can be easily imagined by those skilled in the art and that are substantially the same. Furthermore, the above configurations can be combined as appropriate. Furthermore, various omissions, substitutions, or changes in the configuration are possible without departing from the gist of the present invention.

［変形例］
上記では、主発言者が使用する情報表示装置１０と、他の発言者が使用する情報表示装置１０とでは、参照情報の表示を変えてもよい。例えば、主発言者が使用する情報表示装置１０は、参照情報を表示しないようにしてもよい。 [Modified example]
In the above description, the display of reference information may be different between the information display device 10 used by the main speaker and the information display device 10 used by other speakers. For example, the information display device 10 used by the main speaker may not display reference information.

１０情報表示装置
１１マイクロフォン
１２表示部
１８キーワード情報記憶部
１９通信部
２０制御装置（情報表示制御装置）
２１発話認識部
２２特定部
２６抽出部
２７情報取得部
２８表示制御部
２９通信制御部 10 Information display device 11 Microphone 12 Display section 18 Keyword information storage section 19 Communication section 20 Control device (information display control device)
21 Speech recognition unit 22 Specification unit 26 Extraction unit 27 Information acquisition unit 28 Display control unit 29 Communication control unit

Claims

a speech recognition unit that recognizes utterances of at least a plurality of participants participating in communication by speech;
a specifying unit that specifies a main speaker among the plurality of participants;
an extraction unit that extracts a keyword requiring reference information from the utterance recognized by the utterance recognition unit;
an information acquisition unit that acquires the reference information for the keyword extracted by the extraction unit;
a display control unit that displays the reference information acquired by the information acquisition unit;
Equipped with
The display control unit displays conditions for displaying the reference information by weighting keywords based on utterances of the main speaker.
Information display control device.

The display control unit displays the reference information for a keyword based on the utterance of the main speaker in a different display manner than when the reference information is for the keyword based on the utterance of another speaker. ,
The information display control device according to claim 1.

The display control unit determines the time to display the reference information, if the reference information is for a keyword based on the utterance of the main speaker, the reference information is for a keyword based on the utterance of another speaker. Display for a long time,
The information display control device according to claim 2.

The display control unit sets the amount of information to display the reference information to, if the reference information is for a keyword based on the utterance of the main speaker, the reference information is for a keyword based on the utterance of another speaker. display more in a given case,
The information display control device according to claim 2.

a speech recognition step for recognizing utterances of a plurality of participants participating in at least speech communication;
a step of identifying a main speaker among the plurality of participants;
an extraction step of extracting keywords requiring reference information from the utterances recognized in the utterance recognition step;
an information acquisition step of acquiring the reference information for the keyword extracted in the extraction step;
a display step of displaying the reference information acquired in the information acquisition step;
including;
In the displaying step, conditions for displaying the reference information are displayed by weighting keywords based on the utterances of the main speaker.
An information display method executed by an information display control device.