JP2021081483A

JP2021081483A - Minutes data creating system

Info

Publication number: JP2021081483A
Application number: JP2019206619A
Authority: JP
Inventors: 信一三浦; Shinichi Miura; 永芳宗; Yong Fang Zong; 信一岩岡; Shinichi Iwaoka
Original assignee: Maeda Corp
Current assignee: Maeda Corp
Priority date: 2019-11-15
Filing date: 2019-11-15
Publication date: 2021-05-27

Abstract

To individually acquire speech voice of a conference participant, serially arrange the speech voice, and easily create exact minutes data.SOLUTION: A minutes data creating system comprises: speech voice acquisition means 10 which acquires speech voice of a conference participant for each conference participant; speech voice storage means 20 which serially stores the acquired speech voice; text conversion means 30 which converts the stored speech voice into text data; text data storage means 40 which stores the converted text data in association with speech time data; minutes data creation means 50 which creates minutes data based on the stored text data; and minutes data storage means 60 which stores the created minutes data.SELECTED DRAWING: Figure 1

Description

本発明は議事録データ作成システムに関するものであり、詳しくは、会議に参加した者の発言に基づいて議事録データを作成するためのシステムに関するものである。 The present invention relates to a minutes data creation system, and more particularly to a system for creating minutes data based on the statements of those who participated in the meeting.

一般的な会議では、発言者の発言内容を音声録音しておき、議事録データ作成者がメモをとったり、録音データに基づいて議事録データを書き起こしたりしている。しかし、各発言者の発言内容を正確に議事録データに残すためには時間と手間を要するだけではなく、録音データを聞き取って発言者を特定しなくてはならない。そこで、発言者各自が姓名を名乗り、あるいは、議長の指名に基づいて発言を行う等の工夫が必要となっていた。このような煩雑な議事進行を避けるとともに、正確な議事録データを作成するため、種々の技術が提案されている（例えば、特許文献１〜特許文献５参照）。 In a general meeting, the content of the speaker's remarks is recorded by voice, and the minutes data creator takes notes or transcribes the minutes data based on the recorded data. However, not only does it take time and effort to accurately record the content of each speaker's remarks in the minutes data, but it is also necessary to listen to the recorded data to identify the speaker. Therefore, it is necessary for each speaker to give his / her first and last name, or to make a statement based on the nomination of the chairman. Various techniques have been proposed in order to avoid such complicated proceedings and to create accurate minutes data (see, for example, Patent Documents 1 to 5).

特許文献１に記載された技術は、マイクロホンから音声入力された会話情報に基づいて、テキスト化されたデータベースを自動的に構築するための音声認識装置に関するものである。この音声認識装置は、マイクロホンから入力された音声データを音声データ記憶部に格納し、音素認識処理部で音素データに変換した後に音素データ記憶部に格納する。同時に、音素データを発声辞書学習処理部で個人別発声辞書、共通発声辞書と照合し、照合結果を音素認識処理部に渡す。また、音声データ記憶部に格納された音素データを単語認識処理部において単語認識した後に、認識された単語データを単語データ記憶部に記憶するようになっている。 The technique described in Patent Document 1 relates to a voice recognition device for automatically constructing a text database based on conversation information voice input from a microphone. This voice recognition device stores the voice data input from the microphone in the voice data storage unit, converts it into phoneme data by the phoneme recognition processing unit, and then stores it in the phoneme data storage unit. At the same time, the phoneme data is collated with the individual phonation dictionary and the common phonation dictionary in the phonation dictionary learning processing unit, and the collation result is passed to the phoneme recognition processing unit. Further, after the phoneme data stored in the voice data storage unit is recognized by the word recognition processing unit, the recognized word data is stored in the word data storage unit.

特許文献２に記載された技術は、会議参加者を区別し、その発言における音声データより議事録データを自動的に生成するための議事録データ記録システムに関するものである。この議事録データ記録システムは、会議の参加者の人数に等しい数の複数のコンピュータ端末装置と、データ処理サーバーと、通信ネットワークシステムと、ネットワークサーバーとから構成されている。コンピュータ端末装置は、接続したマイクロホンより入力された音声信号のアナログ波形をデジタル信号に変換するＡ／Ｄ変換装置を備えており、デジタル化された会議参加者のそれぞれの音声データを音声認識技術により文章データに変換して発言者の氏名を示すデータと発言の時刻を示すデータを付加しながら、ネットワークサーバーの記憶領域に記憶、蓄積して議事録データを生成するようになっている。 The technique described in Patent Document 2 relates to a minutes data recording system for distinguishing conference participants and automatically generating minutes data from audio data in their remarks. This minutes data recording system is composed of a plurality of computer terminal devices equal to the number of participants in the conference, a data processing server, a communication network system, and a network server. The computer terminal device is equipped with an A / D conversion device that converts the analog waveform of the voice signal input from the connected microphone into a digital signal, and the voice data of each digitized conference participant is converted into a digital signal by voice recognition technology. The minutes data is generated by storing and accumulating in the storage area of the network server while adding data indicating the speaker's name and data indicating the time of the speaker by converting it into text data.

特許文献３に記載された技術は、会議全体の流れとともに各参加者の発言を全て個別に記憶し、かつ必要な発言のデータを取得できるようにするための議事進行支援システムに関するものである。この議事進行支援システムにおいて、会議の参加者は、マイクロホンとカメラとを備えた発言入力装置を１台ずつ使用して発言を行う。発言入力装置によって入力された音声データと参加者の顔写真の画像データは、発言入力装置ごとに記憶装置の対応するフォルダに個別に記録される。全体発言データ作成部は、各発言入力装置に入力された音声データ及び画像データを時系列に並べた全体発言データを作成する。音声／テキスト変換部は、全体発言データの音声部分をテキストデータに変換する。議事録データ作成部は、テキストデータと画像データとによって議事録データを作成する。これらのデータは、会議中、または会議終了後に記憶装置から読み出すことができるようになっている。 The technique described in Patent Document 3 relates to a proceedings progress support system for individually memorizing all the remarks of each participant along with the flow of the entire meeting and acquiring necessary remark data. In this proceedings support system, the participants of the conference make a statement using one speech input device equipped with a microphone and a camera. The voice data input by the speech input device and the image data of the participant's face photograph are individually recorded in the corresponding folder of the storage device for each speech input device. The overall speech data creation unit creates overall speech data in which the voice data and image data input to each speech input device are arranged in chronological order. The voice / text conversion unit converts the voice portion of the entire speech data into text data. The minutes data creation unit creates minutes data from text data and image data. These data can be read from the storage device during or after the meeting.

特許文献４に記載された技術は、複数人の会話を録音した場合に、会話者を自動的に区別して、特定者の会話内容のみを自動再生するための音声録音再生装置に関するものである。この音声録音再生装置は、複数のマイクロホンからの入力音声を音声メモリに録音（記録）するとともに、音声メモリに録音された録音データを再生制御するコントローラを備えている。コントローラは、マイクロホンから入力される音声の音量を計測する音量計測手段と、音量の計測結果に基づいて発話者を特定する発話者特定手段とを備えている。そして、発話者特定手段により特定された発話者が切り替わったことを検出した場合に、録音データにマークを付与することで、再生時にはそのマークを利用して、選択された発話者の録音データを抽出再生するようになっている。 The technique described in Patent Document 4 relates to a voice recording / playback device for automatically distinguishing conversational persons and automatically reproducing only the conversation content of a specific person when a conversation of a plurality of persons is recorded. This voice recording / playback device includes a controller that records (records) input voices from a plurality of microphones in a voice memory and controls playback of the recorded data recorded in the voice memory. The controller includes a volume measuring means for measuring the volume of the voice input from the microphone and a speaker identifying means for identifying the speaker based on the volume measurement result. Then, when it is detected that the speaker specified by the speaker identification means has been switched, a mark is added to the recorded data, and the mark is used during playback to display the recorded data of the selected speaker. It is designed to be extracted and played.

特許文献５に記載された技術は、複数の音声入力装置で個別に生成された複数の音声データを用いて、議事録データを生成するための議事録データ生成装置に関するものである。この議事録データ生成装置は、同一の音空間に配置された複数の音声入力装置を使用して、音空間において行われた会話の議事録データを生成するための装置であり、複数の音声入力装置から取得された複数の音声データのそれぞれが示す音声波形に基づいて、複数の音声データのそれぞれの時間軸を共通の時間軸に対応付ける時間軸調整部と、複数の音声データのそれぞれに対して、当該音声データが示す音声レベルに基づいて発話区間を検出する処理を行い、検出された発話区間に対して音声認識処理を行う音声認識部と、音声認識処理により得られたテキストデータを共通の時間軸に沿って並べることにより、議事録データを生成する議事録データ生成部とを備えている。 The technique described in Patent Document 5 relates to a minutes data generation device for generating minutes data by using a plurality of voice data individually generated by a plurality of voice input devices. This minutes data generation device is a device for generating the minutes data of a conversation performed in the sound space by using a plurality of voice input devices arranged in the same sound space, and is a device for generating the minutes data of a conversation performed in the sound space, and a plurality of voice inputs. For each of the time axis adjustment unit that associates the time axis of each of the plurality of voice data with the common time axis based on the voice waveform indicated by each of the plurality of voice data acquired from the device, and each of the plurality of voice data. , The text data obtained by the voice recognition process is common to the voice recognition unit that performs the process of detecting the utterance section based on the voice level indicated by the voice data and performs the voice recognition process for the detected utterance section. It is equipped with a minutes data generation unit that generates minutes data by arranging them along the time axis.

特開２００２−２１５１８４号公報Japanese Unexamined Patent Publication No. 2002-215184 特開２００３−１７７７７６号公報Japanese Unexamined Patent Publication No. 2003-1777776 特開２００５−１９７８６７号公報Japanese Unexamined Patent Publication No. 2005-197867 特開２００７−１０８５１８号公報JP-A-2007-108518 特開２０１７−１６７３１８号公報JP-A-2017-167318

しかし、上述した各特許文献に記載された技術を含めて、議事録データを作成するための従来の技術は、大がかりな装置構成となってしまったり、各発言者を区別して識別できなかったり、時系列的に重複した発言データを取得できなかったり等、さらなる改善の余地があった。 However, the conventional techniques for creating minutes data, including the techniques described in the above-mentioned patent documents, have a large-scale device configuration, and each speaker cannot be distinguished and identified. There was room for further improvement, such as the inability to acquire duplicate remark data in chronological order.

本発明は、上述した事情に鑑み提案されたもので、会議参加者の発言音声を個別に取得するとともに、時系列的に整理して、正確な議事録データを容易に作成することが可能な議事録データ作成システムを提供することを目的とする。 The present invention has been proposed in view of the above circumstances, and it is possible to individually acquire the speech voices of the conference participants and organize them in chronological order to easily create accurate minutes data. The purpose is to provide a minutes data creation system.

本発明に係る議事録データ作成システムは、上述した目的を達成するため、以下の特徴点を有している。なお、本発明において、会議とは、企業の社員や団体の構成員が一堂に会して発言を行う一般的な会議だけではなく、データ通信回線を介して音声データや画像データを相互に送受信するような、いわゆるテレビ会議システムや、特定の発言者のみが発言を行うパネルディスカッション、主として発言を行う講演者と質問として発言を行う聴衆とからなる講演会等、複数の人間が個別に発言を行う種々の態様を含んでいる。 The minutes data creation system according to the present invention has the following features in order to achieve the above-mentioned object. In the present invention, the conference is not only a general conference in which employees of a company or members of an organization gather together to make a statement, but also exchange voice data and image data with each other via a data communication line. Multiple people individually speak, such as a so-called video conferencing system, a panel discussion in which only a specific speaker speaks, and a lecture consisting of a speaker who mainly speaks and an audience who speaks as a question. It includes various aspects to be performed.

本発明に係る議事録データ作成システムの主要な構成は、会議参加者の発言音声を会議参加者毎に取得する発言音声取得手段と、取得した発言音声を時系列的に記憶する発言音声記憶手段と、記憶した発言音声をテキストデータに変換するテキスト変換手段と、変換したテキストデータに対して発言時間データを付帯して記憶するテキストデータ記憶手段と、記憶したテキストデータに基づいて議事録データを作成する議事録データ作成手段と、作成した議事録データを記憶する議事録データ記憶手段とからなる。 The main configuration of the minutes data creation system according to the present invention is a speech voice acquisition means for acquiring the speech voices of the conference participants for each conference participant and a speech voice storage means for storing the acquired speech voices in chronological order. A text conversion means for converting the stored speech voice into text data, a text data storage means for storing the converted text data with speech time data, and minutes data based on the stored text data. It consists of a means for creating minutes data to be created and a means for storing minutes data for storing the created minutes data.

また、上述した構成に加えて、議事録データ作成手段は、時系列的に重複したテキストデータが存在する場合に、予め定めた優先順位に従って議事録データを作成することが可能である。 Further, in addition to the above-described configuration, the minutes data creating means can create the minutes data in accordance with a predetermined priority when there are overlapping text data in time series.

また、上述した構成に加えて、優先順位は、会議参加者の個人識別情報に付帯させた発言順位、時系列的に先行する発言、発言音声の音量のいずれか一つに基づいて決定することが可能である。 Further, in addition to the above-described configuration, the priority order shall be determined based on any one of the speech order attached to the personal identification information of the conference participants, the speech preceding in chronological order, and the volume of the speech voice. Is possible.

また、上述した構成に加えて、変換したテキストデータの文脈を判断する文脈判断手段と、判断した文脈に基づいてテキストデータをグループ分けするグループ分け手段とを備えた構成とすることが可能である。 Further, in addition to the above-described configuration, it is possible to have a configuration including a context determining means for determining the context of the converted text data and a grouping means for grouping the text data based on the determined context. ..

また、上述した構成に加えて、議事録データ作成言語として定めた言語以外の言語による発言音声を議事録データ作成言語に翻訳する翻訳手段を備えることが可能である。 Further, in addition to the above-described configuration, it is possible to provide a translation means for translating the speech voice in a language other than the language defined as the minutes data creation language into the minutes data creation language.

また、上述した構成に加えて、作成された議事録データを会議参加者に送信する議事録データ送信手段を備えることが可能である。このような構成の場合、議事録データ送信手段は、発言を行った会議参加者の発言を特定して、当該発言を行った会議参加者に対して当該発言の議事録データのみを送信するとともに、予め定めた議事録データ確認者に対して、議事録データのすべてを送信することが可能である。 Further, in addition to the above-described configuration, it is possible to provide a minutes data transmission means for transmitting the created minutes data to the conference participants. In such a configuration, the minutes data transmission means identifies the remarks of the conference participant who made the remark, and transmits only the minutes data of the remark to the conference participant who made the remark. , It is possible to send all the minutes data to the predetermined minutes data confirmer.

また、上述した構成に加えて、議事録データを受信した会議参加者が修正した修正データを受信する修正データ受信手段と、受信した修正データに基づいて議事録データを修正する議事録データ修正手段とを備えることが可能である。 Further, in addition to the above-described configuration, a modified data receiving means for receiving the modified data modified by the conference participant who received the minutes data, and a minutes data modifying means for modifying the minutes data based on the received modified data. And can be provided.

本発明に係る議事録データ作成システムによれば、会議参加者の発言音声を個別かつ時系列的に取得し、音声データをテキストデータに変換するととともに、テキストデータに対して発言時間データ（発言時刻データ）を付帯して議事録データを作成するので、会議参加者の発言を時系列的に並べて、正確な議事録データを容易に作成することが可能となる。 According to the minutes data creation system according to the present invention, the speech data of the conference participants is individually and timely acquired, the speech data is converted into text data, and the speech time data (speak time) is obtained with respect to the text data. Since the minutes data is created with the data) attached, it is possible to easily create accurate minutes data by arranging the remarks of the conference participants in chronological order.

本発明の実施形態に係る議事録データ作成システムの機能ブロック図。The functional block diagram of the minutes data creation system which concerns on embodiment of this invention. 本発明の実施形態に係る議事録データ作成システムの構成を示す模式図。The schematic diagram which shows the structure of the minutes data creation system which concerns on embodiment of this invention. 本発明の実施形態に係る議事録データ作成システムにおける議事録データ作成手順を示すフローチャート。The flowchart which shows the minutes data creation procedure in the minutes data creation system which concerns on embodiment of this invention.

以下、図面を参照して、本発明の実施形態に係る議事録データ作成システムを説明する。図１〜３は本発明の実施形態に係る議事録データ作成システムを説明するもので、図１は機能ブロック図、図２は構成模式図、図３は議事録データ作成手順のフローチャートである。 Hereinafter, the minutes data creation system according to the embodiment of the present invention will be described with reference to the drawings. 1 to 3 show a minutes data creation system according to an embodiment of the present invention, FIG. 1 is a functional block diagram, FIG. 2 is a schematic configuration diagram, and FIG. 3 is a flowchart of a minutes data creation procedure.

＜議事録データ作成システムの概略構成＞
本発明の実施形態に係る議事録データ作成システム２００は、会議参加者（以下、参加者と略記することがある。）の発言音声に基づいて議事録データを作成するためのシステムに関するものであり、図１に示すように、主要な構成要素として、発言音声取得手段１０、発言音声記憶手段２０、テキスト変換手段３０、テキストデータ記憶手段４０、議事録データ作成手段５０、議事録データ記憶手段６０を備えている。さらに、これらの構成要素に加えて、翻訳手段７０、文脈判断手段８０、グループ分け手段９０、議事録データ送信手段１００、修正データ受信手段１１０、議事録データ修正手段１２０を備えていてもよい。各手段は、それぞれの機能を発揮する機器、コンピュータ及びこれにインストールされたプログラムにより構成される。 <Summary configuration of minutes data creation system>
The minutes data creation system 200 according to the embodiment of the present invention relates to a system for creating minutes data based on the voice of a conference participant (hereinafter, may be abbreviated as a participant). As shown in FIG. 1, as the main components, the speech voice acquisition means 10, the speech voice storage means 20, the text conversion means 30, the text data storage means 40, the minutes data creation means 50, the minutes data storage means 60. It has. Further, in addition to these components, a translation means 70, a context determination means 80, a grouping means 90, a minutes data transmission means 100, a correction data receiving means 110, and a minutes data correction means 120 may be provided. Each means is composed of a device, a computer, and a program installed in the device, which exerts its respective function.

＜発言音声取得手段＞
発言音声取得手段１０は、会議参加者の発言音声を会議参加者毎に取得するための手段である。発言音声取得手段１０は、例えば、各参加者がそれぞれ装着した骨伝導マイクロホンや、各参加者の発言音声をそれぞれ個別に取得可能な指向性マイクロホン等からなる。以下、本実施形態では、発言音声取得手段１０として骨伝導マイクロホンを使用した例について説明する。なお、複数の参加者の中から発言を行っている参加者の音声を個別に取得可能な機器であれば、他の機器を発言音声取得手段１０としてもよい。 <Voice acquisition means>
The speech voice acquisition means 10 is a means for acquiring the speech voice of the conference participants for each conference participant. The speech voice acquisition means 10 includes, for example, a bone conduction microphone worn by each participant, a directional microphone capable of individually acquiring the speech voice of each participant, and the like. Hereinafter, in the present embodiment, an example in which a bone conduction microphone is used as the speech voice acquisition means 10 will be described. In addition, as long as it is a device capable of individually acquiring the voice of a participant who is speaking from a plurality of participants, another device may be used as the speaking voice acquisition means 10.

参加者の音声を個別に識別して取得するには、例えば、各参加者の発言音声を取得するマイクロホンに対して発言者を特定するための個人識別ＩＤを紐付けしたり、発言音声データの声紋を分析して発言者を特定したりすればよい。 To individually identify and acquire the voice of each participant, for example, a personal identification ID for identifying the speaker can be associated with the microphone that acquires the voice of each participant, or the voice data of the participant can be identified and acquired. The voiceprint may be analyzed to identify the speaker.

本実施形態の実施例である骨伝導マイクロホンは、図２に示すように、各参加者がそれぞれ装着する装置であり、各参加者がそれぞれ所持する携帯情報端末１３０（例えば、スマートホン）との間で近距離無線通信が可能となっている。また、携帯情報端末１３０は議事録データ作成システム２００を構成するコンピュータ（サーバー）１５０との間で無線通信が可能となっており、骨伝導マイクロホンで集音した各参加者の発言音声は、無線ＬＡＮや無線電話回線等の無線通信回線を介してコンピュータ（サーバー）１５０に送信される。無線ＬＡＮを用いてデータの送受信を行う場合には、携帯情報端末１３０とサーバーとの間にルーター１４０が介在している。 As shown in FIG. 2, the bone conduction microphone according to the embodiment of the present embodiment is a device worn by each participant, and is associated with a mobile information terminal 130 (for example, a smart phone) possessed by each participant. Short-range wireless communication is possible between them. In addition, the personal digital assistant 130 is capable of wireless communication with the computer (server) 150 constituting the minutes data creation system 200, and the voice of each participant collected by the bone conduction microphone is wireless. It is transmitted to the computer (server) 150 via a wireless communication line such as a LAN or a wireless telephone line. When data is transmitted / received using a wireless LAN, a router 140 is interposed between the mobile information terminal 130 and the server.

＜発言音声記憶手段＞
発言音声記憶手段２０は、取得した発言音声を時系列的に記憶するための手段であり、ＨＤＤ等の大容量記憶装置からなる。すなわち、発言音声記憶手段２０は骨伝導マイクロホンに入力され、電気信号に変換された音声信号データを時系列的に記憶する。時系列的に記憶するとは、各発言者の音声信号データを時間の経過に従って記憶することをいう。 <Speech voice storage means>
The speech voice storage means 20 is a means for storing the acquired speech voice in a time series, and includes a large-capacity storage device such as an HDD. That is, the speech voice storage means 20 stores the voice signal data input to the bone conduction microphone and converted into an electric signal in time series. To memorize in chronological order means to memorize the voice signal data of each speaker according to the passage of time.

＜テキスト変換手段＞
テキスト変換手段３０は、発言音声記憶手段２０に記憶した発言音声をテキストデータに変換するための手段である。発言音声をテキストデータに変換する技術は、種々提案されており、本発明では、既存の音声認識・テキスト変換ソフトウェアを使用する。一般的な音声認識・テキスト変換ソフトウェアは、人の音声をマイクロホンに入力して音声データ（デジタルデータ）として録音し、ノイズ除去を行った後に、音波から音素を特定するとともに、音素の並びを特定して単語に変換する。そして、単語の並びから文章を作成して、テキストとして出力するようになっている。音素の並びを特定して単語に変換するには、音声認識辞書を使用する。現在使用されている音声認識辞書は、ディープラーニングによる機械学習を行うようになっている。 <Text conversion means>
The text conversion means 30 is a means for converting the speech voice stored in the speech voice storage means 20 into text data. Various techniques for converting speech to text data have been proposed, and in the present invention, existing speech recognition / text conversion software is used. General voice recognition / text conversion software inputs human voice into a microphone, records it as voice data (digital data), removes noise, identifies phonemes from sound waves, and identifies the sequence of phonemes. And convert it to a word. Then, a sentence is created from a sequence of words and output as a text. Use a speech recognition dictionary to identify phoneme sequences and convert them into words. Currently used speech recognition dictionaries are designed to perform machine learning by deep learning.

＜翻訳手段＞
翻訳手段７０は、議事録データ作成言語として定めた言語以外の言語による発言音声を議事録データ作成言語に翻訳するための手段である。議事録データ作成言語とは、我が国おいては一般的に日本語であるが、発言者の言語構成に合わせて、英語、フランス語、スペイン語、中国語、韓国語等、他の言語としてもよい。 <Translation means>
The translation means 70 is a means for translating the speech voice in a language other than the language defined as the minutes data creation language into the minutes data creation language. The minutes data creation language is generally Japanese in Japan, but other languages such as English, French, Spanish, Chinese, and Korean may be used according to the language structure of the speaker. ..

一般的に、翻訳手段７０は機械翻訳を行うコンピュータソフトウェア（翻訳ソフト）からなり、基本的には、文法を機械的に解釈して単語を切り分け、切り分けた単語に基づいて辞書から訳語を抽出して、文法に合致するように訳語を並べることにより、入力された音声データを議事録データ作成言語に翻訳する。現在使用されている翻訳ソフトは、ディープラーニングによる機械学習を行って、翻訳の精度を向上させるようになっている。 Generally, the translation means 70 consists of computer software (translation software) that performs machine translation. Basically, the grammar is mechanically interpreted to separate words, and the translated words are extracted from the dictionary based on the separated words. Then, by arranging the translated words so as to match the grammar, the input voice data is translated into the minutes data creation language. The translation software currently in use is designed to improve the accuracy of translation by performing machine learning by deep learning.

＜テキストデータ記憶手段＞
テキストデータ記憶手段４０は、変換したテキストデータに対して発言時間データを付帯し、発言時間付帯テキストデータとして記憶するための手段であり、ＨＤＤ等の大容量記憶装置からなる。また、発言の翻訳を行った場合には、翻訳されたテキストデータに対して発言時間データを付帯し、発言時間付帯テキストデータとして記憶する。 <Text data storage means>
The text data storage means 40 is a means for attaching speech time data to the converted text data and storing it as text data incidental to the speech time, and is composed of a large-capacity storage device such as an HDD. Further, when the remark is translated, the remark time data is attached to the translated text data and stored as the remark time incidental text data.

発言者から取得した音声信号データは、発言音声記憶手段２０において時系列的に記憶されているため、発言音声をテキストデータに変換する際に発言時間データを取得し、テキストデータに発言時間データを付帯させることができる。 Since the voice signal data acquired from the speaker is stored in the speech voice storage means 20 in time series, the speech time data is acquired when the speech voice is converted into text data, and the speech time data is converted into the text data. It can be attached.

＜議事録データ作成手段＞
議事録データ作成手段５０は、記憶したテキストデータ（発言時間付帯テキストデータ）に基づいて議事録データを作成するための手段である。すなわち、議事録データ作成手段５０は、発言時間付帯テキストデータを構成する発言時間データに基づいて、音声信号から変換されたテキストデータを時系列的に並べることにより議事録データを作成する。また、議事録データの作成に際して、発言者を特定するためのデータを付してもよい。例えば、発言者Ａの発言、発言者Ｂの発言、発言者Ｃの発言を明示して、発言内容を時系列的に記載することにより議事録データを作成する。なお、作成された議事録データは、議事録データ記憶手段６０であるＨＤＤ等の大容量記憶装置に記憶される。 <Means for creating minutes data>
The minutes data creating means 50 is a means for creating minutes data based on the stored text data (text data incidental to the speaking time). That is, the minutes data creating means 50 creates the minutes data by arranging the text data converted from the voice signal in chronological order based on the speech time data constituting the speech time incidental text data. In addition, when creating the minutes data, data for identifying the speaker may be attached. For example, the minutes data is created by clearly indicating the remarks of the speaker A, the remarks of the speaker B, and the remarks of the speaker C, and describing the contents of the remarks in chronological order. The created minutes data is stored in a large-capacity storage device such as an HDD, which is the minutes data storage means 60.

また、議事録データ作成手段５０では、時系列的に重複した発言時間付帯テキストデータが存在する場合に、予め定めた優先順位に従って議事録データを作成することが好ましい。時系列的に重複した発言時間付帯テキストデータとは、例えば、発言者Ａが発言している間に発言者Ｂも発言を行った場合や、発言者Ａと発言者Ｂが同時に発言を開始した場合のことをいう。 Further, in the minutes data creating means 50, it is preferable to create the minutes data in a predetermined priority order when there is text data with a speech time that overlaps in time series. The time-series overlapping text data with speaking time is, for example, when speaker B also speaks while speaker A is speaking, or when speaker A and speaker B start speaking at the same time. It refers to the case.

議事録データは、発言を文章化して記載したものであり、二人以上の発言者の発言内容を並行して記載することも可能であるが、議事録データとして読む場合には、発言内容が錯綜して、発言内容を的確に認識することができない場合もある。そこで、本発明では、時系列的に重複した発言時間付帯テキストデータが存在する場合に、予め定めた優先順位に従って議事録データを作成するようになっている。 The minutes data is a written description of the remarks, and it is possible to describe the remarks of two or more speakers in parallel, but when reading as the minutes data, the remarks are described. In some cases, it may not be possible to accurately recognize the content of the statement due to confusion. Therefore, in the present invention, when there is text data with a speech time that overlaps in time series, the minutes data is created according to a predetermined priority order.

優先順位は、会議参加者の個人識別情報に付帯させた順位、時系列的に先行する発言、発言音声の音量のいずれか一つに基づいて決定することが可能である。参加者の個人識別情報に付帯させた発言順位に基づく優先順位とは、例えば、議長を第１順位とし、職能等級が上位の者から順に、第２順位、第３順位、・・・とする態様である。なお、発言順位は、他の序列に従って定めてもよい。この場合、各参加者に個人識別ＩＤを付与し、個人識別ＩＤと発言順位とを紐付けしておけばよい。 The priority can be determined based on any one of the order attached to the personal identification information of the conference participants, the chronologically preceding speech, and the volume of the speech voice. The priority based on the order of remarks attached to the personal identification information of the participants is, for example, the chairperson is the first order, and the person with the highest function grade is the second order, the third order, and so on. It is an aspect. The order of remarks may be determined according to another order. In this case, a personal identification ID may be given to each participant, and the personal identification ID and the order of remarks may be associated with each other.

時系列的に先行する発言に基づく優先順位とは、例えば、発言者Ａが発言を始めた後に、発言者Ａの発言に重複して発言者Ｂも発言を始めた場合に、発言者Ａを第１順位、発言者Ｂを第２順位とすることである。また、発言音声の音量に基づく優先順位とは、例えば、発言者Ａと発言者Ｂが重複して発言を行っている場合に、発言者Ａの音声が発言者Ｂの音声よりも音量が大きかったとすると、発言者Ａを第１順位、発言者Ｂを第２順位とすることである。 The priority based on the preceding remarks in chronological order is, for example, when the speaker A starts speaking and then the speaker B also starts speaking in duplicate with the remark of the speaker A, the speaker A is set. The first rank and the speaker B are the second rank. Further, the priority based on the volume of the speaking voice is, for example, that the voice of the speaker A is louder than the voice of the speaker B when the speaker A and the speaker B are speaking in duplicate. If so, the speaker A is the first rank and the speaker B is the second rank.

時系列的に重複した発言時間付帯テキストデータが存在する場合に、予め定めた優先順位に従って議事録データを作成することにより、各参加者の発言内容の区別が容易となり、各参加者の発言を混同するおそれがなくなる。 When there is text data with a remark time that overlaps in chronological order, by creating the minutes data according to a predetermined priority, it becomes easy to distinguish the remark contents of each participant, and the remarks of each participant can be made. There is no risk of confusion.

＜文脈判断手段＞
文脈判断手段８０は、テキスト変換手段３０により発言音声から変換したテキストデータの文脈を判断するための手段である。文脈判断手段８０は、コンピュータソフトウェア（文脈判断ソフト）からなり、基本的には、各発言者のテキストデータから単語を切り分け、切り分けた単語の集合に基づいてどのような内容の発言がなされているかを判断する。この文脈判断ソフトは、ディープラーニングによる機械学習を行って、文脈判断の精度を向上させることが好ましい。 <Means of contextual judgment>
The context determination means 80 is a means for determining the context of the text data converted from the spoken voice by the text conversion means 30. The context judgment means 80 is composed of computer software (context judgment software), and basically, words are separated from the text data of each speaker, and what kind of content is made based on the set of the separated words. To judge. It is preferable that this context judgment software performs machine learning by deep learning to improve the accuracy of context judgment.

＜グループ分け手段＞
グループ分け手段９０は、判断した文脈（発言内容）に基づいてテキストデータをグループ分けするための手段である。グループ分け手段９０は、コンピュータソフトウェア（グループ分けソフト）からなり、基本的には、判断した文脈の類似度に基づいて、発言（テキストデータ）のグループ分けを行う。すなわち、文脈判断手段８０により判断した各発言者の発言内容を比較して、文脈の類似度を判定する。 <Grouping means>
The grouping means 90 is a means for grouping text data based on a determined context (statement content). The grouping means 90 is composed of computer software (grouping software), and basically groups remarks (text data) based on the degree of similarity of the determined context. That is, the similarity of the context is determined by comparing the remark contents of each speaker determined by the context determination means 80.

例えば、発言者Ａ、発言者Ｂ、発言者Ｃの発言内容が「建築物の構造」に関するもの（発言内容に「構造」、「柱」、「梁」、「壁」、「強度」、「安全係数」等の単語が使われている）場合であって、発言者Ｄ、発言者Ｅの発言内容が「建築物の材料」に関するもの（発言内容に「材料」、「モルタル」、「樹脂」、「不燃」、「耐食」、「軽量」等の単語が使われている）場合に、発言者Ａ、発言者Ｂ、発言者Ｃをグループ１とし、発言者Ｄ、発言者Ｅをグループ２として、グループ分けを行う。なお、上述した単語はあくまで一例であり、必ずしも上述した単語が例示した発言内容に関連するものとなるわけではなく、発言内容の全体に基づいて総合的に発言内容の類似度が判断される。 For example, the content of the statements made by the speaker A, the speaker B, and the speaker C is related to the "structure of the building" (the content of the statement is "structure", "pillar", "beam", "wall", "strength", " In the case where words such as "safety factor" are used), the statements made by speaker D and speaker E are related to "building materials" ("materials", "mortar", and "resin" are used in the statements. , "Incombustible", "corrosion resistance", "lightweight", etc.), speaker A, speaker B, and speaker C are group 1, and speaker D and speaker E are group. As 2, grouping is performed. It should be noted that the above-mentioned words are merely examples, and the above-mentioned words are not necessarily related to the illustrated contents of remarks, and the degree of similarity of the contents of remarks is comprehensively judged based on the whole contents of remarks.

このような場合には、予め定めたグループ分け基準に従って、一つの議事録データの中で、グループ１とグループ２とを識別可能としてもよいし、グループ１とグループ２とで別々の議事録データを作成してもよい。 In such a case, group 1 and group 2 may be distinguishable in one minutes data according to a predetermined grouping standard, or separate minutes data for group 1 and group 2. May be created.

テキストデータに変換された発言内容の文脈を判断し、判断した文脈（発言内容）に基づいてテキストデータをグループ分けすることにより、同一の場所に居る複数の参加者の一部が、他の参加者の発言とは関連性のない発言を行った場合に、各参加者の発言を区別することができる。このため、議事録データの内容に混乱が生じないようにして、適切な議事録データを作成することができる。 By judging the context of the content of the statement converted into text data and grouping the text data based on the determined context (content of the statement), some of the multiple participants in the same place can participate in the other. It is possible to distinguish the remarks of each participant when the remarks that are not related to the remarks of the person are made. Therefore, it is possible to create appropriate minutes data without causing confusion in the contents of the minutes data.

＜議事録データ記憶手段＞
議事録データ記憶手段６０は、議事録データ作成手段５０で作成した議事録データを記憶するための手段であり、ＨＤＤ等の大容量記憶装置からなる。また、議事録データ記憶手段６０は、議事録データが修正された場合に、修正議事録データを記憶する手段として機能させることが可能である。議事録データの修正については、後に詳述する。 <Means for storing minutes data>
The minutes data storage means 60 is a means for storing the minutes data created by the minutes data creation means 50, and is composed of a large-capacity storage device such as an HDD. Further, the minutes data storage means 60 can function as a means for storing the corrected minutes data when the minutes data is modified. The revision of the minutes data will be described in detail later.

＜議事録データ送信手段＞
議事録データ送信手段１００は、作成された議事録データを会議参加者に送信するための手段であり、テキストデータである議事録データを送信するためのソフトウェアと、データ送信を行うためのハードウェアとからなる。この議事録データ送信手段１００は、予め設定した各参加者のメールアドレスに対して、テキストデータである議事録データを送信する。 <Means for transmitting minutes data>
The minutes data transmission means 100 is a means for transmitting the created minutes data to the conference participants, software for transmitting the minutes data which is text data, and hardware for transmitting the data. It consists of. The minutes data transmission means 100 transmits the minutes data, which is text data, to the preset e-mail addresses of the participants.

この場合、発言を行った会議参加者の発言を特定して、当該発言を行った会議参加者に対して当該発言の議事録データのみを送信するとともに、予め定めた議事録データ確認者に対して、議事録データのすべてを送信することが好ましい。すなわち、議事録データの中から各参加者の発言である個人議事録データを特定し、各参加者のメールアドレスに対して、当該参加者の発言である個人議事録データをそれぞれ送信する。さらに、議長や議事録データ作成者に指定された者を議事録データ確認者としておき、議事録データ確認者のメールアドレスに対しては、議事録データのすべてを送信する。 In this case, the remarks of the conference participants who made the remarks are specified, and only the minutes data of the remarks are transmitted to the conference participants who made the remarks, and to the predetermined minutes data confirmer. It is preferable to send all the minutes data. That is, the individual minutes data that is the remark of each participant is specified from the minutes data, and the personal minutes data that is the remark of the participant is transmitted to each participant's e-mail address. Furthermore, the person designated as the chairman or the creator of the minutes data is set as the minutes data confirmer, and all the minutes data is sent to the email address of the minutes data confirmer.

このような態様とすることにより、各参加者は、自らの個人議事録データのみを確認することになり、各参加者の作業負担を軽減することができる。また、議長や議事録データ作成者等の議事録データ確認者は、元来、議事録データの内容のすべてについて確認する必要があるため、議事録データ確認者に対しては議事録データのすべてを送信する必要がある。 By adopting such an aspect, each participant can confirm only his / her own personal minutes data, and the work load of each participant can be reduced. In addition, since it is necessary for the person who confirms the minutes data, such as the chairperson and the creator of the minutes data, to confirm all the contents of the minutes data from the beginning, all the minutes data is for the person who confirms the minutes data. Need to be sent.

＜修正データ受信手段＞
修正データ受信手段１１０は、議事録データを受信した会議参加者が修正した修正データを受信するための手段であり、修正データ（テキストデータ）を受信するためのソフトウェアと、データ受信を行うためのハードウェアからなる。すなわち、個人議事録データあるいはすべての議事録データを受信した参加者は、受信した個人議事録データあるいはすべての議事録データの内容を確認して、修正すべき箇所があれば、修正データを作成して、議事録データ作成システム２００を構成するメインコンピュータに送信する。 <Modified data receiving means>
The modified data receiving means 110 is a means for receiving the modified data by the conference participant who has received the minutes data, and is a software for receiving the modified data (text data) and a software for receiving the data. Consists of hardware. That is, the participant who received the personal minutes data or all the minutes data checks the contents of the received personal minutes data or all the minutes data, and creates the correction data if there is a part to be corrected. Then, it is transmitted to the main computer constituting the minutes data creation system 200.

修正データは、修正箇所を示すデータと修正内容のデータであってもよいし、各参加者がそれぞれ受信した個人議事録データあるいはすべての議事録データの全体に対して修正を行った後のデータであってもよい。 The correction data may be data indicating the correction location and data of the correction content, or data after correction is made to the individual minutes data received by each participant or all the minutes data as a whole. It may be.

＜議事録データ修正手段＞
議事録データ修正手段１２０は、受信した修正データに基づいて議事録データを修正するための手段である。議事録データ修正手段１２０は、コンピュータソフトウェア（議事録データ修正ソフト）からなり、作成議事録データ記憶手段６０に記憶された議事録データに対して修正を行う。修正された議事録データは、修正された議事録データであることを識別可能として、議事録データ記憶手段６０に記憶される。なお、作成議事録データ記憶手段６０とは別個に、修正された議事録データの記憶手段を設けてもよい。 <Means for correcting minutes data>
The minutes data correction means 120 is a means for correcting the minutes data based on the received correction data. The minutes data correction means 120 comprises computer software (minute data correction software), and corrects the minutes data stored in the created minutes data storage means 60. The modified minutes data is stored in the minutes data storage means 60 so that it can be identified as the modified minutes data. In addition to the created minutes data storage means 60, a modified minutes data storage means may be provided.

また、議事録データ送信手段１００の機能により、各参加者に対して、修正された議事録データを再度送信してもよい。この場合、各参加者が、受信した議事録データに対して最終稿である旨のデータ（フラグ）を付して送信することにより、各参加者の個人議事録データあるいはすべての議事録データが最終稿となる。 In addition, the modified minutes data may be transmitted again to each participant by the function of the minutes data transmission means 100. In this case, each participant sends the received minutes data with data (flag) indicating that it is the final draft, so that the individual minutes data of each participant or all the minutes data can be obtained. This is the final draft.

このように、議事録データを各参加者あるいは議事録データ確認者に対して送信して確認を仰ぐことにより、より一層正確な議事録データを作成することができる。すなわち、会議における発言は、各参加者がその場で考えた事項を話し言葉として発言するものである。この場合、言い間違え、不適切な表現、誤解を招く表現が含まれる場合もあり、修正の機会を与えることにより、より一層適切な議事録データを作成することができる。 In this way, by transmitting the minutes data to each participant or the person who confirms the minutes data and asking for confirmation, it is possible to create more accurate minutes data. That is, the remarks at the meeting are those in which each participant speaks the matters considered on the spot as spoken words. In this case, mistakes, inappropriate expressions, and misleading expressions may be included, and by giving an opportunity for correction, even more appropriate minutes data can be created.

＜議事録データ作成の手順＞
図３を参照して、本発明の実施形態に係る議事録データ作成システム２００を用いて議事録データを作成する手順を説明する。議事録データ作成の前提として、各参加者に対して、各参加者の発言音声を個別に取得可能な骨伝導マイクロホン等を装着させており、骨伝導マイクロホンは議事録データ作成システム２００の構成要素であるコンピュータとデータの送受信が可能となっている。また、各参加者には固有の識別情報（個人ＩＤ）を付与するとともに、識別情報（個人ＩＤ）に対して各参加者のメールアドレスが紐付けられている。 <Procedure for creating minutes data>
With reference to FIG. 3, a procedure for creating minutes data using the minutes data creation system 200 according to the embodiment of the present invention will be described. As a prerequisite for creating the minutes data, each participant is equipped with a bone conduction microphone or the like that can individually acquire the speech voice of each participant, and the bone conduction microphone is a component of the minutes data creation system 200. It is possible to send and receive data to and from the computer. In addition, unique identification information (individual ID) is given to each participant, and the e-mail address of each participant is associated with the identification information (individual ID).

なお、図３に示す議事録データ作成手順は、本発明の実施形態に係る議事録データ作成システム２００におけるすべての機能を有している場合を想定しているが、基本的な機能以外に関する手順については省略可能な場合がある。 The minutes data creation procedure shown in FIG. 3 assumes that the minutes data creation system 200 according to the embodiment of the present invention has all the functions, but the procedure is related to other than the basic functions. May be optional.

本発明の実施形態に係る議事録データ作成システム２００を用いて議事録データを作成するには、各参加者にそれぞれ装着した骨伝導マイクロホン（発言音声取得手段１０）の機能により、各参加者の発言音声をそれぞれ取得し（Ｓ１）。発言音声記憶手段２０の機能により、取得した発言音声を時系列的に記憶する（Ｓ２）。 In order to create the minutes data using the minutes data creation system 200 according to the embodiment of the present invention, each participant can use the function of the bone conduction microphone (speech voice acquisition means 10) attached to each participant. Each of the speech voices is acquired (S1). The acquired speech voice is stored in time series by the function of the speech voice storage means 20 (S2).

続いて、テキスト変換手段３０の機能により、発言音声記憶手段２０に記憶した発言音声をテキストデータに変換する（Ｓ３）。ここで、特定の言語（例えば、日本語）を議事録データ作成言語として定めた場合には、議事録データ作成言語以外の言語であるか否か（翻訳が必要か否か）を判断し（Ｓ４）、議事録データ作成言語以外の言語である場合には、翻訳手段７０の機能により、発言音声を議事録データ作成言語に翻訳して（Ｓ５）、テキストデータ記憶手段４０の機能により、テキスト変換したデータに発言時間データを付帯して記憶する（Ｓ６）。一方、翻訳が必要でない場合には、テキスト変換したデータに発言時間データを付帯してそのまま記憶する（Ｓ６）。 Subsequently, the function of the text conversion means 30 converts the speech voice stored in the speech voice storage means 20 into text data (S3). Here, when a specific language (for example, Japanese) is defined as the minutes data creation language, it is determined whether or not the language is other than the minutes data creation language (whether or not translation is required) (whether or not translation is required). S4) If the language is other than the minutes data creation language, the speech data is translated into the minutes data creation language by the function of the translation means 70 (S5), and the text is written by the function of the text data storage means 40. Speaking time data is attached to the converted data and stored (S6). On the other hand, when translation is not necessary, the text-converted data is accompanied by the speech time data and stored as it is (S6).

続いて、時系列的に重複したテキストデータがあるか否かを判断し（Ｓ７）、時系列的に重複したテキストデータがある場合には、予め定めた優先順位に従って議事録記載順序を決定する（Ｓ８）。優先順位は、会議参加者の個人識別情報に付帯させた発言順位、時系列的に先行する発言、発言音声の音量のいずれか一つに基づいて決定することができる。以上の処理により議事録データ原稿が作成される。 Subsequently, it is determined whether or not there is text data that is duplicated in chronological order (S7), and if there is text data that is duplicated in chronological order, the order of minutes entry is determined according to a predetermined priority. (S8). The priority order can be determined based on any one of the speech order attached to the personal identification information of the conference participants, the speech preceding in chronological order, and the volume of the speech voice. The minutes data manuscript is created by the above processing.

続いて、文脈判断手段８０の機能により文脈を判断し（Ｓ９）、グループ分け手段９０の機能により、文脈判断手段８０で判断した文脈に基づいて、テキストデータをグループ分けする（Ｓ１０）。なお、グループ分け処理は必要な場合にのみ実施される。続いて、議事録データ作成手段５０の機能により、テキストデータ（グループ分けされている場合がある）に基づいて議事録データを作成する（Ｓ１１）。作成した議事録データは、議事録データ記憶手段６０の機能により記憶する（Ｓ１２）。 Subsequently, the context is determined by the function of the context determination means 80 (S9), and the text data is grouped based on the context determined by the context determination means 80 by the function of the grouping means 90 (S10). The grouping process is performed only when necessary. Subsequently, the minutes data is created based on the text data (which may be grouped) by the function of the minutes data creating means 50 (S11). The created minutes data is stored by the function of the minutes data storage means 60 (S12).

議事録データが記憶されると、議事録データ送信手段１００の機能により、記憶された議事録データ（議事録原稿）を会議参加者に送信する（Ｓ１３）。この際、議事録データ送信手段１００は、発言を行った会議参加者の発言を特定して、当該発言を行った会議参加者に対して当該発言の議事録データのみを送信するとともに、予め定めた議事録データ確認者に対して、議事録データのすべてを送信することが好ましい。 When the minutes data is stored, the stored minutes data (minutes manuscript) is transmitted to the conference participants by the function of the minutes data transmission means 100 (S13). At this time, the minutes data transmitting means 100 identifies the remarks of the conference participant who made the remark, transmits only the minutes data of the remark to the conference participant who made the remark, and determines in advance. It is preferable to send all the minutes data to the person who confirms the minutes data.

続いて、議事録データを受信した会議参加者が議事録データを修正した場合には、修正データ受信手段１１０の機能により、会議参加者が修正した修正データを受信する（Ｓ１４）。続いて、議事録データ修正手段１２０の機能により、受信した修正データに基づいて議事録データを修正する（Ｓ１５）。修正された議事録データは、議事録データを記憶するための記憶手段（例えば、議事録データ記憶手段６０）に記憶される（Ｓ１２）。議事録データの修正は、必要に応じて数回行われた後、議事録データの修正が完了すると、最終稿となる議事録データが確定する。 Subsequently, when the meeting participant who has received the minutes data corrects the minutes data, the meeting participant receives the corrected data by the function of the corrected data receiving means 110 (S14). Subsequently, the minutes data is corrected based on the received correction data by the function of the minutes data correction means 120 (S15). The modified minutes data is stored in a storage means for storing the minutes data (for example, the minutes data storage means 60) (S12). The minutes data is revised several times as necessary, and when the minutes data revision is completed, the final minutes data is finalized.

１０発言音声取得手段
２０発言音声記憶手段
３０テキスト変換手段
４０テキストデータ記憶手段
５０議事録データ作成手段
６０議事録データ記憶手段
７０翻訳手段
８０文脈判断手段
９０グループ分け手段
１００議事録データ送信手段
１１０修正データ受信手段
１２０議事録データ修正手段
１３０携帯情報端末
１４０ルーター
１５０コンピュータ（サーバー）
２００議事録データ作成システム 10 Speak voice acquisition means 20 Speak voice storage means 30 Text conversion means 40 Text data storage means 50 Minute data creation means 60 Minute data storage means 70 Translation means 80 Context judgment means 90 Grouping means 100 Minute data transmission means 110 Correction Data receiving means 120 Minutes data correction means 130 Mobile information terminal 140 Router 150 Computer (server)
200 Minutes data creation system

Claims

It is a system for creating minutes data based on the voice of conference participants.
Means for acquiring speech voices for each conference participant and means for acquiring speech voices of conference participants
A speech voice storage means that stores the acquired speech speech in chronological order,
A text conversion means that converts the memorized speech voice into text data,
A text data storage means that attaches speech time data to the converted text data and stores it,
Minute data creation means to create minutes data based on the stored text data,
Minutes data storage means for storing the created minutes data,
Minutes data creation system characterized by being equipped with.

The minutes data creation means creates the minutes data according to a predetermined priority when there is text data that overlaps in time series.
The minutes data creation system according to claim 1, wherein the minutes data is created.

The priority is determined based on any one of the order attached to the personal identification information of the conference participants, the preceding speech in chronological order, and the volume of the speech voice.
The minutes data creation system according to claim 2, wherein the minutes data is created.

A contextual judgment means for judging the context of the converted text data, and
A grouping method that groups text data based on the determined context,
The minutes data creation system according to any one of claims 1 to 3, further comprising.

Equipped with a translation means to translate the speech voice in a language other than the language specified as the minutes data creation language into the minutes data creation language.
The minutes data creation system according to any one of claims 1 to 4, wherein the minutes data is created.

Equipped with a means for transmitting minutes data to transmit the created minutes data to meeting participants,
The minutes data creation system according to any one of claims 1 to 5, wherein the minutes data is created.

The minutes data transmission means identifies the remarks of the conference participant who made the remark, transmits only the minutes data of the remark to the conference participant who made the remark, and also transmits the minutes of the remark in advance. Send all minutes data to the data confirmer,
The minutes data creation system according to claim 6, wherein the minutes data is created.

The means for receiving the modified data, which receives the modified data modified by the conference participants who received the minutes data,
Minute data correction means to correct the minutes data based on the received correction data,
The minutes data creation system according to claim 6 or 7, wherein the minutes 6 or 7 is provided.