JP2006251042A

JP2006251042A - Information processor, information processing method and program

Info

Publication number: JP2006251042A
Application number: JP2005064063A
Authority: JP
Inventors: Takeshi Mizunashi; 豪水梨
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 2005-03-08
Filing date: 2005-03-08
Publication date: 2006-09-21

Abstract

<P>PROBLEM TO BE SOLVED: To provide an information processor by which the circumstances of a conference can simply be known. <P>SOLUTION: A conference circumstances analysis apparatus 1 is provided with an uttering database 2 which stores conference data including uttering data, an analysis section 3 which adds up the number of uttered words from the uttering data stored in the uttering database 2 to obtain an average of the number of the uttered words and a standard deviation of the number of the uttered words, an annotation database 4 which stores the average of the number of the uttered words and the standard deviation of the uttered words obtained by the analysis section 3 while relating them to the uttering data and a display control section 5 which displays a graph in which the average of the number of the uttered words and the standard deviation of the number of the uttered words are made to correspond to the time axis of the uttering data. Thus, the respective amount of uttering and its leaning of the conference attendees can be computed and a large leaning case in which a single person utters continuously and a small leaning case in which a plurality of persons are uniformly uttering can be determined. <P>COPYRIGHT: (C)2006,JPO&NCIPI

Description

本発明は、情報処理装置、情報処理方法およびプログラムに関する。 The present invention relates to an information processing apparatus, an information processing method, and a program.

従来、会議において、互いに関連のある発言相互の関係が発言者チャートに表示できると便利である。例えば、一の会議参加者が、他の会議参加者に対して意見を述べたり、質問をぶつけた時に、他の会議参加者が、それに対して回答や反論を行うインタラクティブな場面が、チャートから判別できると、そこでは、何らかの議論があったことが分かり、記録された会議情報の検索者は、それを手掛かりとして、再生したい議論部分を、簡単に検索することができると期待される。これを実現する従来技術として以下のようなものが提案されている。 Conventionally, in a conference, it is convenient to be able to display the mutual relations of the comments that are related to each other on the speaker chart. For example, when one conference participant gives an opinion to another conference participant or asks a question, an interactive scene in which another conference participant responds or refutes is shown on the chart. If it can be discriminated, it will be understood that there was some discussion, and it is expected that the searcher of the recorded conference information can easily search for the discussion part to be reproduced by using it as a clue. The following are proposed as conventional techniques for realizing this.

特許文献１記載の装置は、発言者毎の発言区間の情報と、発言者毎の姿勢とを対応付けて記録し、さらに発言が誰に対するものであるかを特し、所定の時間区間における発言の流れを表示するので、発言相互の関係が表示できる。 The device described in Patent Document 1 records information on a speech section for each speaker and an attitude for each speaker in association with each other, further identifies who the speech is for, and a speech in a predetermined time section. Since the flow of the message is displayed, the relationship between the statements can be displayed.

特開平１１−２５９５０１号公報JP-A-11-259501

しかしながら、上記従来技術では、発言相互の関係を知ることができるものの、単一の発言者がしばらく発言を続けているプレゼンテーション的な場面や、多くの参加者が均一に話している議論的な場面などのように、発言者の偏り方、盛り上がり方などの会議の様態を知ることはできなかった。したがって、その会議記録のデータを初めから終わりまで流しながら確認しなければならなかった。 However, although the above-mentioned prior art can know the relationship between the statements, a presentation scene where a single speaker continues to speak for a while or a discussion scene where many participants speak uniformly. As such, I could not know the mode of the conference, such as how the speakers are biased and excited. Therefore, the meeting record data had to be confirmed while flowing from the beginning to the end.

そこで、本発明は、上記問題点に鑑みてなされたもので、会議の様態を簡単に知ることができる情報処理装置、情報処理方法およびプログラムを提供することを目的とする。 Therefore, the present invention has been made in view of the above problems, and an object thereof is to provide an information processing apparatus, an information processing method, and a program capable of easily knowing the state of a conference.

上記課題を解決するために、本発明は、発話データを記憶する記憶手段と、前記記憶手段に記憶された発話データから発話単語数を集計して、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求める解析手段とを備える情報処理装置である。本発明によれば、発話データから発話単語数を集計して、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めることで、たとえば、会議参加者それぞれの発話量やその偏りを算出することができるため、ひとりがずっと発話している場合のように大きな偏りがある場合や複数の人々が均等に発話している場合のように小さな偏りがある場合を判定することができる。このようにして、たとえば単一の発言者がしばらく発言を続けているプレゼンテーション的な場面や、多くの参加者が均一に話している議論的な場面など、発言者の偏り方、盛り上がり方などの会議の様態を簡単に知ることができる。 In order to solve the above problems, the present invention provides a storage means for storing utterance data, the number of utterance words from the utterance data stored in the storage means, an average of the utterance words, and a standard of the number of utterance words It is an information processing apparatus provided with the analysis means which calculates | requires at least one of a deviation. According to the present invention, the number of utterance words is totaled from the utterance data, and at least one of the average number of utterance words and the standard deviation of the number of utterance words is obtained. Since it is possible to calculate, it is possible to determine a case where there is a large bias such as when one person is uttering all the time or a case where there is a small bias such as when a plurality of people are speaking equally. In this way, for example, a presentation scene where a single speaker has been speaking for a while, a discussion scene where many participants are speaking uniformly, etc. You can easily know the mode of the meeting.

本発明の情報処理装置は、前記解析手段が求めた発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データに関連付けて記憶する記憶手段をさらに備える。本発明の情報処理装置は、前記解析手段が求めた発話単語数の平均または前記発話単語数の標準偏差を前記発話データに関連付けて記憶する記憶手段と、前記記憶手段の中から入力された検索条件を満たす発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を検索する検索手段とをさらに備える。 The information processing apparatus according to the present invention further includes storage means for storing at least one of the average number of utterance words obtained by the analysis means and the standard deviation of the number of utterance words in association with the utterance data. The information processing apparatus according to the present invention includes a storage unit that stores an average of the number of utterance words obtained by the analysis unit or a standard deviation of the number of utterance words in association with the utterance data, and a search input from the storage unit Search means for searching for at least one of the average number of spoken words satisfying the condition and the standard deviation of the number of spoken words is further provided.

本発明の情報処理装置は、前記発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データの時間軸に対応させたグラフを表示する表示制御手段をさらに備える。前記解析手段は、前記発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データのアノテーションとして付与することを特徴とする。これにより、付与されたアノテーションに基づいて後で検索を行うことができる。 The information processing apparatus of the present invention further includes display control means for displaying a graph in which at least one of the average number of spoken words and the standard deviation of the number of spoken words is associated with the time axis of the speech data. The analysis means assigns at least one of the average of the number of spoken words and the standard deviation of the number of spoken words as an annotation of the speech data. Thereby, a search can be performed later based on the assigned annotation.

本発明は、所定時間で切り出された発話データに基づいて、話者を特定するステップと、前記所定時間で切り出された発話データに基づいて、各話者の発話単語数を集計するステップと、前記各話者の発話単語数に基づいて、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めるステップとを有する情報処理方法である。本発明によれば、発話データから発話単語数を集計して、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めることで、たとえば、会議参加者それぞれの発話量やその偏りを算出することができるため、ひとりがずっと発話している場合のように大きな偏りがある場合や複数の人々が均等に発話している場合のように小さな偏りがある場合を判定することができる。 The present invention includes a step of identifying a speaker based on utterance data cut out at a predetermined time, and a step of counting the number of utterance words of each speaker based on the utterance data cut out at the predetermined time; And obtaining at least one of an average of the number of spoken words and a standard deviation of the number of spoken words based on the number of spoken words of each speaker. According to the present invention, the number of utterance words is totaled from the utterance data, and at least one of the average number of utterance words and the standard deviation of the number of utterance words is obtained. Since it is possible to calculate, it is possible to determine a case where there is a large bias such as when one person is uttering all the time or a case where there is a small bias such as when a plurality of people are speaking equally.

本発明の情報処理方法は、前記発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データの時間軸に対応させたグラフを表示するステップをさらに有する。本発明の情報処理方法は、前記話者単語数の平均および前記話者単語数の標準偏差の少なくとも一方を、前記発話データのアノテーションとして付与するステップをさらに有する。 The information processing method of the present invention further includes a step of displaying a graph in which at least one of the average number of spoken words and the standard deviation of the number of spoken words is associated with the time axis of the utterance data. The information processing method of the present invention further includes a step of assigning at least one of an average of the number of speaker words and a standard deviation of the number of speaker words as an annotation of the utterance data.

本発明は、所定時間で切り出された発話データに基づいて、話者を特定するステップ、前記所定時間で切り出された発話データに基づいて、各話者の発話単語数を集計するステップ、前記各話者の発話単語数に基づいて、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めるステップをコンピュータに実行させるためのプログラムである。本発明によれば、発話データから発話単語数を集計して、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めることで、たとえば、会議参加者それぞれの発話量やその偏りを算出することができるため、ひとりがずっと発話している場合のように大きな偏りがある場合や複数の人々が均等に発話している場合のように小さな偏りがある場合を判定することができる。 The present invention includes a step of identifying a speaker based on utterance data cut out at a predetermined time, a step of counting the number of utterance words of each speaker based on the utterance data cut out at the predetermined time, A program for causing a computer to execute a step of obtaining at least one of an average of spoken words and a standard deviation of spoken words based on the number of spoken words of a speaker. According to the present invention, the number of utterance words is totaled from the utterance data, and at least one of the average number of utterance words and the standard deviation of the number of utterance words is obtained. Since it is possible to calculate, it is possible to determine a case where there is a large bias such as when one person is uttering all the time or a case where there is a small bias such as when a plurality of people are speaking equally.

本発明のプログラムは、前記発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データの時間軸に対応させたグラフを表示するための情報を生成するステップをさらにコンピュータに実行させる。 The program according to the present invention further causes the computer to generate information for displaying a graph in which at least one of the average number of spoken words and the standard deviation of the number of spoken words is associated with the time axis of the speech data Let

本発明によれば、たとえば会議の様態を簡単に知ることができる情報処理装置、情報処理方法およびプログラムを提供できる。 According to the present invention, it is possible to provide, for example, an information processing apparatus, an information processing method, and a program that can easily know the mode of a meeting.

以下、本発明を実施するための最良の形態について説明する。 Hereinafter, the best mode for carrying out the present invention will be described.

図１は、本発明の第１実施例に係る会議様態解析装置（情報処理装置）のブロック図である。図１に示すように、会議様態解析装置１は、発話データベース２、解析部３、アノテーションデータベース４、表示制御部５および表示部６を備える。発話データベース２は、少なくとも会議参加者の音声データ（発話データ）を含む会議記録データを記憶する。 FIG. 1 is a block diagram of a conference mode analysis apparatus (information processing apparatus) according to a first embodiment of the present invention. As shown in FIG. 1, the meeting state analysis apparatus 1 includes an utterance database 2, an analysis unit 3, an annotation database 4, a display control unit 5, and a display unit 6. The utterance database 2 stores conference record data including at least voice data (utterance data) of conference participants.

図２は発話データベース２の内容を示す図である。図２に示すように、発話データベース２は、時刻、話者（Ａ、Ｂ、Ｃ、Ｄ）、発話内容のフィールドを含む。図２では発話内容をテキストで示しているが実際に発話データベース２に格納されているのは、音声データである。解析部３は、発話データベース２に記憶された発話データから各話者の発話単語数を集計して、発話単語数の平均値および発話単語数の標準偏差の少なくとも一方を求め、発話単語数の平均および発話単語数の標準偏差を発話データのアノテーションとして付与する。ここで発話単語数の標準偏差とは、発話単語数の平均からの隔たり（距離）を平均化したものである。 FIG. 2 is a diagram showing the contents of the utterance database 2. As shown in FIG. 2, the utterance database 2 includes fields of time, speaker (A, B, C, D), and utterance content. In FIG. 2, the content of the utterance is shown in text, but what is actually stored in the utterance database 2 is voice data. The analysis unit 3 totals the number of utterance words of each speaker from the utterance data stored in the utterance database 2 to obtain at least one of the average value of the utterance words and the standard deviation of the number of utterance words. The average and the standard deviation of the number of spoken words are given as annotations of the speech data. Here, the standard deviation of the number of spoken words is an average of the distance (distance) from the average of the number of spoken words.

アノテーションデータベース４は、解析部３が求めた発話単語数の平均および発話単語数の標準偏差を時刻を用いて発話データに関連付けて記憶する。表示制御部５は、表示を制御するものであり、各話者の発話単語数の平均および前記発話単語数の標準偏差を発話データの時間軸に対応させたグラフを表示する。表示部６は、たとえばディスプレイ装置により構成される。 The annotation database 4 stores the average of the number of utterance words and the standard deviation of the number of utterance words obtained by the analysis unit 3 in association with the utterance data using the time. The display control unit 5 controls the display, and displays a graph in which the average number of utterance words of each speaker and the standard deviation of the number of utterance words are associated with the time axis of the utterance data. The display unit 6 is configured by a display device, for example.

次に、解析部３の動作について説明する。図３は、解析部３の動作フローチャートである。ステップＳ１１で、解析部３は、発話データベース２から一定時間（たとえば１分間）のデータを切り出す。ステップＳ１２で、解析部３は、たとえば話者Ｂ、話者Ｃ、話者Ｅさんのように登場する話者を特定する。ステップＳ１３で、解析部３は、それぞれの話者のその１分間の発話単語数を集計する。ここでは、たとえば、話者Ｂさんの発話単語数が５０単語、話者Ｃさんの発話単語数が５単語、話者Ｅさんの発話単語数が５単語であるとする。ステップＳ１４で、解析部３は、発話単語数の平均値Ｘを計算する。また、解析部３は、式１を用いて発話単語数の標準偏差σを計算する。 Next, the operation of the analysis unit 3 will be described. FIG. 3 is an operation flowchart of the analysis unit 3. In step S11, the analysis unit 3 cuts out data for a predetermined time (for example, 1 minute) from the utterance database 2. In step S12, the analysis unit 3 identifies speakers appearing like speaker B, speaker C, and speaker E, for example. In step S <b> 13, the analysis unit 3 counts the number of utterance words per minute for each speaker. Here, for example, it is assumed that the number of utterance words of speaker B is 50 words, the number of utterance words of speaker C is 5 words, and the number of utterance words of speaker E is 5 words. In step S14, the analysis unit 3 calculates the average value X of the number of spoken words. In addition, the analysis unit 3 calculates the standard deviation σ of the number of utterance words using Expression 1.

ここで、Ｎは話者数、ｘは各話者の発話単語数を示す。ステップＳ１５で、発話単語数の平均値Ｘと発話単語数の標準偏差σをアノテーションデータベース４に保存する。

Here, N represents the number of speakers, and x represents the number of words spoken by each speaker. In step S15, the average value X of the number of spoken words and the standard deviation σ of the number of spoken words are stored in the annotation database 4.

図４は、表示部６による表示例である。図４において、横軸は会議経過時間で発話データベース２に記憶された会議記録データの会議開始時刻と会議終了時刻に対応する。縦軸は破線が各話者の発話単語数の平均値Ｘ、実線が発話単語数の標準偏差σをそれぞれ示す。発話単語数の平均値Ｘが大きくで、発話単語数の標準偏差σが小さい場合、参加者が均等に話していることがわかる（１）。発話単語数の平均値Ｘが小さく、発話単語数の標準偏差σが小さい場合、皆が話していないことがわかる（２）。発話単語数の平均値Ｘが大きく、発話単語数の標準偏差σが大きい場合、一人が集中して話していることがわかる（３）。時間軸上である１点を選べば、そこの音声を再生することができる。また、点線で示すバー２０を移動またはクリックすることで、その時点の話者データ（音声）を再生することができる。以上のように、参加者の音声を解析することにより、たとえば単一の発言者がしばらく発言を続けているプレゼンテーション的な場面や、多くの参加者が均一に話している議論的な場面など、発言者の偏り方、盛り上がり方などの会議の様態を判定することができる。 FIG. 4 is a display example by the display unit 6. In FIG. 4, the horizontal axis corresponds to the conference start time and conference end time of the conference record data stored in the utterance database 2 as the conference elapsed time. In the vertical axis, the broken line indicates the average value X of the number of spoken words of each speaker, and the solid line indicates the standard deviation σ of the number of spoken words. When the average value X of the number of spoken words is large and the standard deviation σ of the number of spoken words is small, it can be seen that the participants are speaking evenly (1). When the average value X of the number of spoken words is small and the standard deviation σ of the number of spoken words is small, it can be seen that everyone is not speaking (2). When the average value X of the number of uttered words is large and the standard deviation σ of the number of uttered words is large, it can be seen that one person is speaking in a concentrated manner (3). If one point on the time axis is selected, the sound there can be reproduced. Also, by moving or clicking the bar 20 indicated by the dotted line, the speaker data (voice) at that time can be reproduced. As described above, by analyzing the voice of the participant, for example, a presentation scene where a single speaker has been speaking for a while, a discussion scene where many participants are speaking uniformly, etc. It is possible to determine the state of the conference such as how the speakers are biased and excited.

次に、第２実施例について説明する。図５は、本発明の第２実施例に係る会議様態検索装置（情報処理装置）のブロック図である。図５に示すように、会議様態検索装置１００は、発話データベース２、解析部３、アノテーションデータベース４、表示制御部５、表示部６、検索条件設定部７および検索部８を備える。なお、実施例１と同一部分は同一符号を付して説明を省略する。発話データベース２は、少なくとも会議参加者の音声データ（発話データ）を含む会議記録データを複数の会議分だけ記憶する。 Next, a second embodiment will be described. FIG. 5 is a block diagram of a conference mode search apparatus (information processing apparatus) according to the second embodiment of the present invention. As shown in FIG. 5, the meeting state search device 100 includes an utterance database 2, an analysis unit 3, an annotation database 4, a display control unit 5, a display unit 6, a search condition setting unit 7, and a search unit 8. In addition, the same part as Example 1 attaches | subjects the same code | symbol, and abbreviate | omits description. The utterance database 2 stores conference record data including at least voice data (utterance data) of conference participants for a plurality of conferences.

解析部３は、各会議ごとに、発話データベース２に記憶された発話データから各話者の発話単語数を集計して、発話単語数の平均値および発話単語数の標準偏差の少なくとも一方を求める。アノテーションデータベース４は、解析部３が求めた発話単語数の平均および発話単語数の標準偏差を時刻を用いて発話データに関連付けて各会議ごと（会議１〜会議ｎ）に記憶する。アノテーションデータベース４には、ひとつの会議記録データに対して、時刻、発話単語数の平均（発話量の平均）、発話単語数の標準偏差の形式のデータの集合がひとつ格納される。 The analysis unit 3 aggregates the number of utterance words of each speaker from the utterance data stored in the utterance database 2 for each conference, and obtains at least one of the average value of the number of utterance words and the standard deviation of the number of utterance words. . The annotation database 4 stores the average of the number of utterance words and the standard deviation of the number of utterance words obtained by the analysis unit 3 for each conference (conference 1 to conference n) in association with the utterance data using time. The annotation database 4 stores one set of data in the format of time, average number of spoken words (average amount of spoken words), and standard deviation of the number of spoken words for one meeting record data.

検索条件設定部７は、たとえばマウスやキーボード等の入力手段から構成され、「参加者が均等に発言している部分」、「皆が発話していない部分」、「一人が集中して話している部分」などの会議様態を設定することができ、また、発話単語の平均、発話単語の標準偏差を直接指定することもできる。これにより、会議様態をキーとした会議記録データの検索も行うことができる。検索部８は、アノテーションデータベース４の中から入力された検索条件を満たす発話単語数の平均および前記発話単語数の標準偏差を検索する。ここでは、会議３と会議５がヒットしたとする。 The search condition setting unit 7 is composed of input means such as a mouse and a keyboard, for example, “parts where the participants speak evenly”, “parts where no one speaks”, “one person speaks intensively” It is possible to set a meeting mode such as “a part that is present”, and it is also possible to directly specify the average of spoken words and the standard deviation of spoken words. Thereby, it is also possible to search the conference record data using the conference mode as a key. The search unit 8 searches the annotation database 4 for the average number of utterance words that satisfy the input search condition and the standard deviation of the number of utterance words. Here, it is assumed that conference 3 and conference 5 are hit.

表示制御部５は、図６に示すように、会議３と会議５の発話単語数の平均および発話単語数の標準偏差を発話データの時間軸に対応させたグラフを表示部６に表示し、さらに会議３と会議５の発話単語数と発話単語数の標準偏差の該当箇所を識別可能に表示する。 As shown in FIG. 6, the display control unit 5 displays a graph in which the average number of utterance words and the standard deviation of the number of utterance words in the conference 3 and the conference 5 are associated with the time axis of the utterance data on the display unit 6. Furthermore, the number of utterance words and the standard deviation of the number of utterance words of the conference 3 and the conference 5 are displayed in an identifiable manner.

本実施例による会議様態検索装置によれば、上述のようなデータの集合を、多くの会議記録データに対して作成し、蓄積しておくことによって、多くの会議記録データの中から、たとえば「参加者が均等に発言している部分」「皆が発話していない部分」「一人が集中して話している部分」などのような会議様態をキーとして所望の場面を検索することができる。 According to the conference mode retrieval apparatus according to the present embodiment, a set of data as described above is created and stored for a large amount of conference record data. It is possible to search for a desired scene using a conference mode such as a part where participants are speaking equally, a part where everyone is not speaking, or a part where one person is speaking intensively.

なお、本発明による情報処理方法は、例えば、ＣＰＵ（Central Processing Unit）、ＲＯＭ(Read Only Memory)、ＲＡＭ(Random Access Memory)等を用いて実現され、プログラムをハードディスク装置や、ＣＤ−ＲＯＭ、ＤＶＤまたはフレキシブルディスクなどの可搬型記憶媒体等からインストールし、または通信回路からダウンロードし、ＣＰＵがこのプログラムを実行することで、各ステップが実現される。すなわち、プログラムは、所定時間で切り出された発話データに基づいて、話者を特定するステップ、前記所定時間で切り出された発話データに基づいて、各話者の発話単語数を集計するステップ、前記各話者の発話単語数に基づいて、発話単語数の平均および発話単語数の標準偏差の少なくとも一方を求めるステップ、前記発話単語数の平均および前記発話単語数の標準偏差の少なくとも一方を前記発話データの時間軸に対応させたグラフを表示するための情報を生成するステップをＣＰＵ（コンピュータ）に実行させる。 The information processing method according to the present invention is realized using, for example, a CPU (Central Processing Unit), a ROM (Read Only Memory), a RAM (Random Access Memory), and the like, and the program is stored in a hard disk device, a CD-ROM, or a DVD. Alternatively, each step is realized by installing from a portable storage medium such as a flexible disk or downloading from a communication circuit and the CPU executing this program. That is, the program specifies a speaker based on utterance data cut out at a predetermined time, totals the number of utterance words of each speaker based on the utterance data cut out at the predetermined time, Determining at least one of an average of the number of spoken words and a standard deviation of the number of spoken words based on the number of spoken words of each speaker, and calculating at least one of the average of the number of spoken words and the standard deviation of the number of spoken words of the speech A step of generating information for displaying a graph corresponding to the time axis of data is executed by a CPU (computer).

以上本発明の好ましい実施例について詳述したが、本発明は係る特定の実施例に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形、変更が可能である。 Although the preferred embodiments of the present invention have been described in detail above, the present invention is not limited to the specific embodiments, and various modifications, within the scope of the gist of the present invention described in the claims, It can be changed.

本発明の第１実施例に係る会議様態解析装置のブロック図である。It is a block diagram of the meeting state analysis apparatus which concerns on 1st Example of this invention. 発話データベース２の内容を示す図である。It is a figure which shows the content of the speech database. 解析部３の動作フローチャートである。5 is an operation flowchart of the analysis unit 3. 表示部６による表示例である。It is a display example by the display unit 6. 本発明の第２実施例に係る会議様態検索装置のブロック図である。It is a block diagram of the meeting mode search apparatus which concerns on 2nd Example of this invention. 会議３と会議５の発話単語数の平均および発話単語数の標準偏差を発話データの時間軸に対応させたグラフを示す図である。It is a figure which shows the graph which matched the average of the number of utterance words of the meeting 3 and the meeting 5, and the standard deviation of the number of utterance words with the time-axis of utterance data.

Explanation of symbols

１会議様態解析装置
２発話データベース
３解析部
４アノテーションデータベース
５表示制御部
６表示部
１００会議様態検索装置
７検索条件設定部
８検索部
DESCRIPTION OF SYMBOLS 1 Meeting state analysis apparatus 2 Utterance database 3 Analysis part 4 Annotation database 5 Display control part 6 Display part 100 Meeting state search apparatus 7 Search condition setting part 8 Search part

Claims

Storage means for storing utterance data;
An information processing apparatus comprising: an analysis unit that aggregates the number of utterance words from the utterance data stored in the storage unit and obtains at least one of an average of the number of utterance words and a standard deviation of the number of utterance words.

The information processing apparatus according to claim 1, further comprising a storage unit that stores at least one of an average of the number of utterance words obtained by the analysis unit and a standard deviation of the number of utterance words in association with the utterance data. .

Storage means for storing an average of the number of utterance words obtained by the analysis means or a standard deviation of the number of utterance words in association with the utterance data;
The information according to claim 1, further comprising: search means for searching for at least one of an average of the number of spoken words satisfying a search condition input from the storage means and a standard deviation of the number of spoken words. Processing equipment.

4. The display control means for displaying a graph in which at least one of the average number of utterance words and the standard deviation of the number of utterance words is associated with the time axis of the utterance data. The information processing apparatus described in 1.

The information processing apparatus according to claim 1, wherein the analysis unit assigns at least one of an average of the utterance words and a standard deviation of the utterance words as an annotation of the utterance data.

Identifying a speaker based on utterance data cut out at a predetermined time;
Based on the utterance data cut out in the predetermined time, the step of counting the number of utterance words of each speaker;
Obtaining at least one of an average of the number of spoken words and a standard deviation of the number of spoken words based on the number of spoken words of each speaker.

The information processing method according to claim 6, further comprising a step of displaying a graph in which at least one of the average number of spoken words and the standard deviation of the number of spoken words is associated with a time axis of the speech data. .

The information processing method according to claim 6, further comprising a step of assigning at least one of an average of the number of speaker words and a standard deviation of the number of speaker words as an annotation of the utterance data.

Identifying a speaker based on utterance data cut out at a predetermined time;
A step of counting the number of utterance words of each speaker based on the utterance data cut out at the predetermined time;
A program for causing a computer to execute a step of obtaining at least one of an average of the number of spoken words and a standard deviation of the number of spoken words based on the number of spoken words of each speaker.

The computer further executes a step of generating information for displaying a graph in which at least one of the average number of spoken words and the standard deviation of the number of spoken words is associated with a time axis of the speech data. Program.