JP6860472B2

JP6860472B2 - How to create a summary from meeting audio data

Info

Publication number: JP6860472B2
Application number: JP2017254481A
Authority: JP
Inventors: 太介内山; 剛三野
Original assignee: Hitachi Solutions Ltd
Current assignee: Hitachi Solutions Ltd
Priority date: 2017-12-28
Filing date: 2017-12-28
Publication date: 2021-04-14
Anticipated expiration: 2037-12-28
Also published as: JP2019121075A

Description

The present invention relates to a method of creating a summary from audio data of a meeting.

As background art for the present disclosure, Japanese Unexamined Patent Application Publication No. 2011-28638 (Patent Document 1), for example, is known. This publication discloses a summary sentence generation technique. Specifically, it uses a concept word dictionary in which multiple sets are registered, each associating a word with its morpheme part of speech and its word concept vector. Words registered in the concept word dictionary are extracted from the text to be summarized, and a feature value of that text is calculated using the word concept vectors corresponding to the extracted words. For each reference text in a reference text group, which is a set of reference texts prepared in advance for use in creating summaries, the similarity of its feature value to the feature value of the text to be summarized is calculated, and the reference text with the highest similarity is selected. A summary sentence is then created by replacing each word in the selected reference text with a word from the text to be summarized that is highly similar to it according to the word concept vectors (see the abstract of that publication).

Patent Document 1: Japanese Unexamined Patent Application Publication No. 2011-28638

However, in spoken conversation at meetings and similar settings, subjects and objects are often omitted, which makes it difficult to generate a summary from syntactic features. A technique that can appropriately and automatically generate a summary of a meeting conducted orally is therefore desired.

A representative example of the present disclosure is a method in which a computer system creates a summary from audio data of a meeting. The computer system includes a processor and a storage device that stores a dictionary and a program executed by the processor. In the method, the processor acquires the audio data of the meeting, forms a current text sentence from the audio data by speech recognition processing, refers to the current text sentence and the dictionary to determine whether the current text sentence contains a word in the dictionary, and, when it does, adds the current text sentence to the summary stored in the storage device.

According to one aspect of the present disclosure, a summary of a meeting can be appropriately and automatically generated.

An example configuration of the meeting summary generation system.
An example software configuration of the meeting summary generation system.
The software configuration related to the dictionary registration process.
A flowchart of the dictionary registration process.
The software configuration related to the speech recognition process.
The software configuration related to the summary generation process.
A flowchart of the summary generation process performed by the summary generation program.
A flowchart of the summary generation process performed by the summary generation program.
An example of an image displayed on the screen of the display device during summary correction, and of operations on that image.
Another example of an image displayed on the screen of the display device during summary correction, and of operations on that image.
Another example of an image displayed on the screen of the display device during summary correction, and of operations on that image.

Embodiments of the present invention are described below with reference to the accompanying drawings. Note that the embodiments are merely examples for realizing the present invention and do not limit its technical scope. Components common to the figures are given the same reference numerals.

The meeting summary generation system of the present disclosure generates a summary from a meeting conducted orally. The system uses speech recognition to convert each spoken sentence into a sentence made up of characters (a text sentence). It analyzes one sentence at a time and refers to a predetermined dictionary. If the sentence contains a word found in the predetermined dictionary, the system decides to include that sentence in the summary. In this way, the important sentences that should appear in the summary can be selected.

FIG. 1 shows an example configuration of the meeting summary generation system 100. The meeting summary generation system 100 is, for example, a computer system that executes programs for generating a meeting summary, and FIG. 1 shows an example of its hardware configuration. The meeting summary generation system 100 includes a processor 110, a main storage device 120, an auxiliary storage device 130, and an interface (I/F) 140. These are connected to an internal bus and can communicate with one another.

The meeting summary generation system 100 further includes a display device 150, a microphone 161, a touch input device 162, a pen input device 163, and a mouse 164. These are connected to the internal bus via the I/F 140. The display device 150 is an output device; the microphone 161, the touch input device 162, the pen input device 163, and the mouse 164 are input devices.

The processor 110 realizes predetermined functions of the meeting summary generation system 100 by operating according to programs stored in the main storage device 120. The main storage device 120 is, for example, a volatile storage device, and stores the programs executed by the processor 110 and the data they refer to. The auxiliary storage device 130 is, for example, a non-volatile storage device, and stores data to be loaded into the main storage device 120. The main storage device 120, the auxiliary storage device 130, and any combination of them are each a storage device.

FIG. 2 shows an example software configuration of the meeting summary generation system 100. The main storage device 120 stores a speech recognition program 201, a summary generation program 202, a summary correction program 203, and a dictionary registration program 204. These programs are loaded into the main storage device 120 from, for example, the auxiliary storage device 130, or from an external device (including a device on a network) via the I/F 140.

As described above, the processor 110 operates as specific functional units according to these programs. Specifically, by operating according to the programs, the processor 110 functions as a speech recognition unit, a summary generation unit, a summary correction unit, and a dictionary registration unit. At least some of these functions may be implemented by logic circuits other than the processor 110.

The main storage device 120 further stores a meeting material dictionary 221, a known noun dictionary 222, preceding sentence information 223, and a summary 224. Each of these is created by one of the above programs and stored in the main storage device 120, or updated there. For example, the meeting material dictionary 221 is generated by the dictionary registration program 204. The known noun dictionary 222, the preceding sentence information 223, and the summary 224 are generated and updated by the summary generation program 202.

The auxiliary storage device 130 stores meeting material 301, a basic dictionary 302, a 5W1H related word dictionary 303, and a consent word dictionary 304. These are installed in the meeting summary generation system 100 before the start of the meeting for which the summary is to be created. Data stored in the auxiliary storage device 130 is loaded into the main storage device 120 and used by the processor 110. The auxiliary storage device 130 may be omitted. The data storage locations shown in FIG. 2 are chosen for convenience of description; the data may be stored anywhere.

The speech recognition program 201 analyzes the audio data input from the microphone 161 and converts it into text data. In analyzing the audio data, the speech recognition program 201 refers to the basic dictionary 302 and the meeting material dictionary 221. The basic dictionary 302 is a dictionary referred to for speech recognition of arbitrary utterances, whereas the meeting material dictionary 221 is a dictionary dedicated to the meeting being summarized. The meeting material dictionary 221 is generated from the meeting material 301 by the dictionary registration program 204.

During the meeting, the summary generation program 202 displays the summary 224 on the display device 150 while updating it. The summary generation program 202 analyzes the text data generated by the speech recognition program 201 and updates the summary 224 as needed. As described later, the summary generation program 202 determines whether to add each sentence to the summary 224 by referring to the 5W1H related word dictionary 303 and the consent word dictionary 304.

The 5W1H related word dictionary 303 is set in advance by the system designer. It holds words related to 5W1H (who, what, when, where, why, how) that were selected at design time. The 5W1H related word dictionary 303 can include, for example, words that express a question corresponding to 5W1H (interrogatives), words that indicate a date or time, and words that indicate a place.

For example, the 5W1H related word dictionary 303 stores interrogatives that ask about a time, place, subject, object, reason, or method, as well as non-interrogative words that indicate any of these. It can include, for example, 「いつ」 (when), 「どこ」 (where), 「なに」 (what), 「誰」 (who), 「なぜ」 (why), 「どう」 (how), 「今日」 (today), 「明日」 (tomorrow), 「来週」 (next week), 「先週」 (last week), 「＊曜日」 (a day of the week), 「＊月＊日」 (a month and day), 「そう」 (so), and 「ここ」 (here). Here, 「＊」 represents an arbitrary character.
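The wildcard entries above could be matched as in the following minimal sketch. The source only says that 「＊」 represents an arbitrary character; treating it as one or two characters (so that 「12月25日」 matches), using substring search instead of matching on morphemes, and the names `entry_to_regex` and `contains_5w1h_word` are all assumptions made for illustration.

```python
import re

# Example entries taken from the description; the real contents of
# dictionary 303 are chosen by the system designer.
FIVE_W_ONE_H_ENTRIES = [
    "いつ", "どこ", "なに", "誰", "なぜ", "どう",
    "今日", "明日", "来週", "先週", "＊曜日", "＊月＊日", "そう", "ここ",
]


def entry_to_regex(entry: str) -> re.Pattern:
    """Turn a dictionary entry into a regex.

    Assumption: the wildcard "＊" stands for one or two arbitrary characters.
    """
    literal_parts = [re.escape(part) for part in entry.split("＊")]
    return re.compile(".{1,2}".join(literal_parts))


PATTERNS = [entry_to_regex(entry) for entry in FIVE_W_ONE_H_ENTRIES]


def contains_5w1h_word(sentence: str) -> bool:
    """Return True if any dictionary entry occurs in the sentence."""
    return any(pattern.search(sentence) for pattern in PATTERNS)
```

Substring search can produce false positives (for example, 「どう」 inside a longer word); comparing against the morphemes produced by morphological analysis would be stricter.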

The consent word dictionary 304 is set in advance by the system designer. It holds words expressing consent that were selected at design time. For example, the consent word dictionary 304 includes words such as 「はい」 (yes), 「うん」 (yeah), 「了解」 (understood), 「拝承」 (acknowledged), 「承知」 (noted), 「やります」 (I will do it), and 「やってみます」 (I will try).

The summary correction program 203 allows the user to correct, on the display device 150, the summary 224 generated by the summary generation program 202. The summary correction program 203 applies to the summary 224 any corrections entered from the touch input device 162, the pen input device 163, or the mouse 164.

The processing of each program is described below. First, the dictionary registration process performed by the dictionary registration program 204 is described with reference to FIGS. 3 and 4. FIG. 3 shows the software configuration related to the dictionary registration process. The dictionary registration program 204 refers to the meeting material 301 and generates the meeting material dictionary 221. The meeting material 301 is, for example, a document file or a presentation file.

FIG. 4 shows a flowchart of the dictionary registration process. The dictionary registration program 204 acquires the meeting material 301 (S101) and performs morphological analysis on the words it contains (S102). The dictionary registration program 204 then selects the morphemes one by one (S103) and executes steps S104 to S106 for each.

The dictionary registration program 204 determines whether the selected morpheme is a verb or a noun (of a predetermined subtype) (S104). If the selected morpheme is neither a verb nor such a noun (S104: NO), the dictionary registration program 204 does not add it to the meeting material dictionary 221 and selects the next morpheme (S103).

If the selected morpheme is a verb or a noun (S104: YES), the dictionary registration program 204 converts the morpheme into kana (its reading) (S105) and registers the morpheme together with its reading in the meeting material dictionary 221 (S106). The noun subtypes to be registered in the meeting material dictionary 221 are set in advance in the dictionary registration program 204. For example, common nouns, proper nouns, and sa-hen nouns (nouns that form verbs with する) are set as the noun subtypes to register.
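Steps S103 to S106 can be sketched as follows. This is an illustration under stated assumptions, not the patented implementation: the morphological analysis of S102 is assumed to have been done already by some analyzer, and the `Morpheme` type, its field names, the subtype labels, and `build_meeting_material_dictionary` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Morpheme:
    """One analyzed morpheme (hypothetical structure, output of S102)."""
    surface: str   # the word as written in the meeting material
    pos: str       # part of speech, e.g. "verb", "noun", "particle"
    subtype: str   # noun subtype, e.g. "common", "proper", "sahen"; "" otherwise
    reading: str   # kana reading; assumed to be supplied by the analyzer (S105)


# Noun subtypes to register; preset in the program according to the description.
REGISTERED_NOUN_SUBTYPES = {"common", "proper", "sahen"}


def build_meeting_material_dictionary(morphemes):
    """Return a dict mapping each registered word to its reading."""
    dictionary = {}
    for m in morphemes:                                   # S103: select in turn
        is_verb = m.pos == "verb"
        is_target_noun = m.pos == "noun" and m.subtype in REGISTERED_NOUN_SUBTYPES
        if not (is_verb or is_target_noun):               # S104: NO, skip
            continue
        dictionary[m.surface] = m.reading                 # S105-S106: register
    return dictionary
```

For instance, given morphemes for 「要約を作る」, the particle 「を」 would be skipped while 「要約」 and 「作る」 would be registered with their readings.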

Next, the speech recognition process performed by the speech recognition program 201 is described with reference to FIG. 5. FIG. 5 shows the software configuration related to the speech recognition process. The speech recognition program 201 analyzes the audio data input from the microphone 161 using the meeting material dictionary 221 and the basic dictionary 302, and generates sentences made up of characters (text sentences) one after another. Each generated sentence (the current sentence) is stored in the main storage device 120.

Speech recognition is a widely known technique, so a detailed description is omitted. The speech recognition program 201 analyzes the audio data, extracts acoustic features, and selects a word with similar acoustic features from among the words registered in the meeting material dictionary 221 or the basic dictionary 302. The speech recognition program 201 gives the meeting material dictionary 221 priority over the basic dictionary 302.

For example, when homophones exist in both the meeting material dictionary 221 and the basic dictionary 302, the speech recognition program 201 selects the word in the meeting material dictionary 221. The speech recognition program 201 may search the basic dictionary 302 for a matching word only when it cannot find, in the meeting material dictionary 221, a word that corresponds (or is judged to correspond) to the audio data.

As described above, the meeting material dictionary 221 is generated from the meeting material 301 prepared in advance. Because words in the meeting material 301 are likely to be used in the meeting, giving priority to the meeting material dictionary 221 enables more accurate speech recognition. In particular, proper nouns, technical terms, industry jargon, and homophones can be recognized accurately.
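The priority rule can be illustrated with a deliberately simplified sketch. Real speech recognition compares acoustic features; here a recognized kana reading stands in for them, each dictionary is assumed to map a reading to a list of written forms, and `choose_word` is a hypothetical name.

```python
def choose_word(reading, meeting_material_dictionary, basic_dictionary):
    """Return a written form for the reading, preferring the meeting dictionary.

    Both dictionaries are assumed to map a kana reading to a list of
    candidate written forms. Returns None when neither has the reading.
    """
    candidates = meeting_material_dictionary.get(reading)
    if candidates:
        # A homophone found in the meeting material wins.
        return candidates[0]
    # Fall back to the basic dictionary only when nothing was found.
    candidates = basic_dictionary.get(reading)
    return candidates[0] if candidates else None
```

With a basic dictionary listing 「構成」, 「校正」, and 「公正」 under the reading こうせい, and a meeting material dictionary listing only 「校正」, the function returns 「校正」.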

Next, the summary generation process (summary update process) performed by the summary generation program 202 is described with reference to FIGS. 6, 7A, and 7B. FIG. 6 shows the software configuration related to the summary generation process.

The summary generation program 202 creates the summary 224, stores its data in the main storage device 120, and displays it on the display device 150. The summary generation program 202 acquires sentences one after another from the speech recognition program 201 and updates the summary 224 in the main storage device 120 according to the result of analyzing each sentence. Each update is reflected in the image of the summary 224 shown on the display device 150.

The summary generation program 202 acquires the sentence generated by the speech recognition program 201 (the current sentence). As described later, the summary generation program 202 refers to the preceding sentence information 223 and the known noun dictionary 222 to determine whether the current sentence belongs to a new context different from that of the preceding sentence. The summary 224 shows the boundaries between contexts so that the sentences making up each context can be told apart.

In the example of FIG. 6, the summary generation program 202 inserts a blank line between two contexts in the summary 224. Contexts may instead be marked with dividing lines, enclosing boxes, or the like. Because contexts are distinguished in the displayed summary 224, the user can easily identify mutually related sentences on the same topic.

When analyzing a sentence, the summary generation program 202 stores its information in the main storage device 120 as the preceding sentence information 223. The preceding sentence information 223 indicates the preceding sentence (a word sequence) and the time at which its utterance ended. The utterance end time is, for example, the time at which the speech recognition program 201 finished receiving the audio data of that sentence; the summary generation program 202 obtains this value from the speech recognition program 201.

The known noun dictionary 222 is a list of the nouns uttered since the meeting began. When analyzing the current sentence, the summary generation program 202 refers to the known noun dictionary 222, and if the current sentence contains a new noun that is not yet stored there, adds that noun to the known noun dictionary 222.

The noun subtypes to be included in the known noun dictionary 222 are set in advance in the summary generation program 202; they are, for example, common nouns, proper nouns, and sa-hen nouns. Instead of a list of nouns since the start of the meeting, the known noun dictionary 222 may be a list of the nouns in the immediately preceding context.

The summary generation program 202 analyzes the current sentence with reference to the 5W1H related word dictionary 303, the consent word dictionary 304, and the meeting material dictionary 221, and decides which sentences to include in the summary 224. The summary generation process performed by the summary generation program 202 is described below with reference to the flowcharts of FIGS. 7A and 7B. FIG. 7A is a flowchart of the context determination process, and FIG. 7B is a flowchart of the summary update process.

The summary generation program 202 acquires the current sentence (text data) generated by the speech recognition program 201 (S201). It then performs morphological analysis on the current sentence and identifies the part of speech of each word (morpheme) (S202).

Next, the summary generation program 202 determines whether a long time has elapsed between the utterance end time of the preceding sentence and the utterance start time of the current sentence (S203). For example, the summary generation program 202 obtains these two times from the speech recognition program 201 and determines that a long time has elapsed when the interval between them exceeds a predetermined threshold.

If a long time has elapsed since the preceding sentence was uttered (S203: YES), the summary generation program 202 determines that the context of the current sentence differs from the preceding context, and adds a blank line indicating the context change to the summary 224 (S205). A long interval is likely to indicate that the preceding sentence and the current sentence are only weakly related, so this step allows contexts to be distinguished more accurately.

If a long time has not elapsed since the preceding sentence was uttered (S203: NO), the summary generation program 202 determines whether the current sentence contains many new nouns and the preceding sentence contains few of the demonstratives preset in the summary generation program 202 (S204).

Specifically, the summary generation program 202 refers to the known noun dictionary 222 and counts the new nouns in the current sentence. If the number of new nouns exceeds a predetermined threshold, the summary generation program 202 determines that the current sentence contains many new nouns. This threshold is an integer of 0 or more.

In addition, the summary generation program 202 performs morphological analysis on the preceding sentence held in the preceding sentence information 223 and counts the preset demonstratives it contains. If the number of demonstratives is below a predetermined threshold, the summary generation program 202 determines that the preceding sentence contains few demonstratives. This threshold is an integer of 1 or more.

If the current sentence contains many new nouns and the preceding sentence contains few demonstratives (S204: YES), the summary generation program 202 determines that the context has changed and adds a blank line indicating the context change to the summary 224 (S205). If the results of both steps S203 and S204 are NO, the summary generation program 202 determines that the current sentence belongs to the same context as the preceding sentence.

A large number of new nouns suggests that a context different from the preceding one has probably begun. Likewise, a small number of demonstratives suggests that the preceding sentence and the current sentence are probably only weakly related. This step therefore allows contexts to be distinguished more accurately.
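The context determination of steps S203 and S204 can be sketched as below. The threshold values, the demonstrative list, the `Sentence` structure, and the function name are assumptions chosen for illustration; the description only states that the thresholds are predetermined (an integer of 0 or more for new nouns, 1 or more for demonstratives) and that the demonstratives are preset.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    """A recognized sentence after morphological analysis (hypothetical)."""
    words: list        # all morphemes, as surface strings
    nouns: list        # nouns of the registered subtypes
    start_time: float  # utterance start time in seconds
    end_time: float    # utterance end time in seconds


# Hypothetical settings; the source gives no concrete values.
GAP_THRESHOLD_SEC = 10.0
NEW_NOUN_THRESHOLD = 1        # integer >= 0
DEMONSTRATIVE_THRESHOLD = 1   # integer >= 1
DEMONSTRATIVES = {"これ", "それ", "あれ", "この", "その", "あの"}


def is_context_changed(previous, current, known_nouns):
    """Return True when a blank line (S205) should be inserted."""
    # S203: a long silence between the two sentences signals a new context.
    if current.start_time - previous.end_time > GAP_THRESHOLD_SEC:
        return True
    # S204: many new nouns now, and few demonstratives in the previous sentence.
    new_noun_count = sum(1 for n in set(current.nouns) if n not in known_nouns)
    demonstrative_count = sum(1 for w in previous.words if w in DEMONSTRATIVES)
    return (new_noun_count > NEW_NOUN_THRESHOLD
            and demonstrative_count < DEMONSTRATIVE_THRESHOLD)
```

After this check, the caller would add the new nouns of the current sentence to `known_nouns`, mirroring the update of the known noun dictionary 222.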

Next, the summary update process is described with reference to FIG. 7B. The summary generation program 202 determines whether the current sentence contains a 5W1H related word (S206), that is, whether it contains a word registered in the 5W1H related word dictionary 303. If the current sentence contains a 5W1H related word (S206: YES), the summary generation program 202 outputs the current sentence to the summary 224 (S207), for example by appending it as the last line of the summary.

The summary generation program 202 further determines whether the current sentence contains one of the 5W1H related words that express a question. Such words are, for example, 「なぜ」 (why), 「いつ」 (when), 「誰」 (who), 「どこで」 (where), and 「どう」 (how). The 5W1H related word dictionary 303 includes, for example, information that identifies which of its words express a question. If the current sentence contains a question word, the summary generation program 202 decides that the sentence following the current sentence is to be included in the summary 224.

The sentence that follows a sentence containing a question word is likely to be an important sentence containing the answer, so this rule raises the likelihood that important sentences are included in the summary. Including both the question and its answer in the summary 224 also conveys the content of the meeting more accurately. For example, the summary generation program 202 either holds internally the information that the next sentence is to be included in the summary 224, or records that information when it generates the preceding sentence information 223 describing the current sentence.
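One way to carry the "include the next sentence" decision forward is a flag that survives from one sentence to the next, as in this minimal sketch. It isolates the question-and-answer rule only; substring matching, the `QUESTION_WORDS` set (taken from the examples above), and the function name are assumptions.

```python
# Example question words from the description; the real set is identified
# by information held in the 5W1H related word dictionary 303.
QUESTION_WORDS = {"なぜ", "いつ", "誰", "どこ", "どう"}


def select_questions_and_answers(sentences):
    """Return each question sentence together with the sentence that follows it."""
    selected = []
    include_next = False  # set when the previous sentence contained a question word
    for sentence in sentences:
        has_question = any(word in sentence for word in QUESTION_WORDS)
        if has_question or include_next:
            selected.append(sentence)
        # The flag applies to exactly one following sentence.
        include_next = has_question
    return selected
```

Keeping the flag inside the record for the preceding sentence, as the description also allows, would behave the same way as long as it is read before the next sentence is classified.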

If the current sentence does not contain a 5W1H related word (S206: NO), the summary generation program 202 determines whether the current sentence contains a consent word (S208), that is, whether it contains a word registered in the consent word dictionary 304.

If the current sentence contains a consent word (S208: YES), the summary generation program 202 determines whether the preceding sentence has already been output to the summary 224 (S209). It can make this determination by comparing the preceding sentence information 223 with the summary 224. Alternatively, the preceding sentence information 223 may itself indicate whether the preceding sentence is included in the summary 224.

If the immediately preceding sentence is not included in the summary 224 (S209: NO), the summary generation program 202 obtains that sentence from the preceding-sentence information 223 and outputs it to the summary 224 (S212). The sentence immediately before a sentence containing a consent word is likely to state what is being agreed to, so this step brings an important sentence, the one containing the object of the consent, into the summary 224. If the preceding sentence is already included in the summary 224 (S209: YES), processing of the current sentence ends.

If the current sentence contains no consent word (S208: NO), the summary generation program 202 determines whether the current sentence contains a word from the conference materials (S211). Specifically, it compares the current sentence with the conference material dictionary 221 and determines whether the current sentence contains a word registered in that dictionary.

If the current sentence contains a word from the conference materials (S211: YES), the summary generation program 202 outputs the current sentence to the summary 224 (S212). Sentences containing important words from the conference materials 301 are thereby included in the summary 224. If the current sentence contains no word from the conference materials (S211: NO), processing of the current sentence ends.
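The branch structure of steps S206 to S212 can be sketched roughly as follows. This is only an illustrative sketch, not the patented implementation: the class `SummaryBuilder`, its field names, and the English example sentences are hypothetical, sentences are assumed to arrive already split into word sets, and it is assumed that a sentence with a 5W1H-related word is added at S207 and that a question word additionally flags the next sentence for inclusion. The duplicate-removal steps (S213, S214) are left out.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Set


@dataclass
class SummaryBuilder:
    """Hypothetical stand-in for the sentence-selection logic of program 202."""
    w5h1_words: Set[str]       # stand-in for the 5W1H-related word dictionary 303
    question_words: Set[str]   # assumed subset of w5h1_words that marks a question
    consent_words: Set[str]    # stand-in for the consent word dictionary 304
    material_words: Set[str]   # stand-in for the conference material dictionary 221
    summary: List[str] = field(default_factory=list)
    previous: Optional[str] = None   # stand-in for preceding-sentence information 223
    include_next: bool = False       # "include the next sentence" flag

    def _output(self, sentence: str) -> None:
        # Append only if the sentence has not been output yet (covers the S209 check).
        if sentence not in self.summary:
            self.summary.append(sentence)

    def process(self, sentence: str, words: Set[str]) -> None:
        if self.include_next:                # sentence following a question
            self._output(sentence)
            self.include_next = False
        if words & self.w5h1_words:          # S206: YES
            self._output(sentence)           # S207
            self.include_next = bool(words & self.question_words)
        elif words & self.consent_words:     # S208: YES
            if self.previous is not None:    # S209: NO -> output the preceding sentence
                self._output(self.previous)
        elif words & self.material_words:    # S211: YES
            self._output(sentence)           # S212
        self.previous = sentence


builder = SummaryBuilder(
    w5h1_words={"when", "where", "tomorrow"},
    question_words={"when", "where"},
    consent_words={"agreed"},
    material_words={"budget"},
)
utterances = [
    ("When is the patent meeting?", {"when", "patent", "meeting"}),
    ("It is next week.", {"next", "week"}),
    ("Let's order lunch.", {"order", "lunch"}),
    ("Agreed.", {"agreed"}),
    ("The budget is on slide three.", {"budget", "slide"}),
]
for text, words in utterances:
    builder.process(text, words)
print(builder.summary)
```

In this toy run the question and its answer are both kept, the sentence before "Agreed." is pulled in retroactively, and "Agreed." itself is not added.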

When adding a sentence to the summary 224 (S207, S210, S212), the summary generation program 202 deletes from the summary 224 any existing sentence whose information duplicates that of the sentence being added. This keeps the summary 224 concise and easy to read.

Specifically, the summary generation program 202 compares the sentence being added to the summary 224 with each existing sentence in the same context. If the sentence being added and an existing sentence contain the same verb-noun combination (S213: YES), the summary generation program 202 deletes that existing sentence from the summary 224 (S214).

For example, an existing sentence is deleted if it contains a pair consisting of any one verb and any one noun of a predetermined subtype taken from the sentence being added. Alternatively, the condition for deleting an existing sentence may be that all verbs and all nouns of the predetermined subtypes are common to the current sentence and the existing sentence. Common nouns, proper nouns, and sa-hen (verbal) nouns, for example, are preset as the noun subtypes.

If no existing sentence contains the same verb-noun combination as the sentence being added (S213: NO), all existing sentences in the same context are kept. After steps S213 and S214 have been applied to the existing sentences, processing of the current sentence ends.
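The duplicate check of S213 and S214 can be illustrated with a small sketch. It assumes that morphological analysis has already produced, for each sentence, its verbs and its nouns of the preset subtypes; the names `Sentence`, `overlaps`, and `add_sentence` and the English example sentences are hypothetical. The `strict` flag corresponds to the alternative condition, read here as requiring identical verb and noun sets, which is only one possible reading.

```python
from dataclasses import dataclass
from typing import FrozenSet, List


@dataclass(frozen=True)
class Sentence:
    text: str
    verbs: FrozenSet[str]
    nouns: FrozenSet[str]   # only nouns of the preset subtypes


def overlaps(new: Sentence, old: Sentence, strict: bool = False) -> bool:
    """Return True if `old` duplicates the information in `new` (S213)."""
    if strict:
        # Alternative condition: all verbs and all nouns are common to both.
        return (bool(new.verbs) and bool(new.nouns)
                and new.verbs == old.verbs and new.nouns == old.nouns)
    # Default condition: at least one shared verb and one shared noun.
    return bool(new.verbs & old.verbs) and bool(new.nouns & old.nouns)


def add_sentence(context: List[Sentence], new: Sentence,
                 strict: bool = False) -> List[Sentence]:
    """Drop duplicated existing sentences (S214), then append the new one."""
    kept = [old for old in context if not overlaps(new, old, strict)]
    kept.append(new)
    return kept


context = [
    Sentence("We hold the patent meeting tomorrow.",
             frozenset({"hold"}), frozenset({"patent", "meeting"})),
    Sentence("We review the budget.",
             frozenset({"review"}), frozenset({"budget"})),
]
new = Sentence("We hold the meeting on Friday.",
               frozenset({"hold"}), frozenset({"meeting", "Friday"}))

loose = [s.text for s in add_sentence(context, new)]
strict = [s.text for s in add_sentence(context, new, strict=True)]
print(loose)
print(strict)
```

Under the default condition the older "hold ... meeting" sentence is replaced by the newer one; under the strict reading both are kept because their noun sets differ.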

The correction of the summary 224 by the summary correction program 203 is described below. FIG. 8 shows an example of the images displayed on the screen of the display device 150 during summary correction and of the operations performed on them. The user selects the sentence (image) 241 to be corrected in the summary 224 displayed on the screen. In the example of FIG. 8, the pre-correction sentence 241 is selected with a finger via the touch input device 162.

The summary correction program 203 performs morphological analysis on the selected sentence 241 and extracts from it the verbs and the nouns of predetermined subtypes (the specified parts of speech). It makes the extracted word sequence (image) 251 explicit by displaying it in a section separate from the summary 224. Common nouns, proper nouns, and sa-hen (verbal) nouns, for example, are preset as the noun subtypes. Which parts of speech are extracted may be set by design.

In the example of FIG. 8, the words in the word sequence 251 appear in the same order as in the selected sentence 241, and adjacent words in the word sequence 251 are connected by lines. The summary correction program 203 accepts the user's selection of words in the word sequence 251. In the example of FIG. 8, the user selects the single word "patent" (特許) by touching it with a finger.

The user then moves the finger while still touching the word "patent" (特許), encircles the two words "meeting" (打ち合わせ) and "hold" (行う), and lifts the finger from the screen. This selects the words "meeting" and "hold" in addition to the word "patent". In the example of FIG. 8, the user can select a group of consecutive words that includes the word selected first.

In response to input from the touch input device 162, the summary correction program 203 changes the displayed image of the word sequence 251 and detects that three words have been selected in it. In the example of FIG. 8, the summary correction program 203 identifies as the selected words the word touched ("patent") and the words encircled ("meeting", "hold") in a single drag-and-drop operation.

The summary correction program 203 displays the word sequence (image) 252 of the selected words in place of the word sequence 251. The words in the word sequence 252 appear in the same order as in the selected sentence 241, and adjacent words in the word sequence 252 are connected by lines.

The summary correction program 203 generates the corrected sentence from the word sequence 252 of the selected words. Specifically, it refers to the pre-correction sentence 241 and obtains the particle の that stands between the word "patent" (特許) and the next word in the word sequence 252, "meeting" (打ち合わせ). It likewise obtains the particle を that stands between the word "meeting" and the next word in the word sequence 252, "hold" (行う). The resulting corrected sentence 242 is 「特許の打ち合わせを行う。」 ("Hold a patent meeting.").
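The particle restoration described for FIG. 8 might look like the following minimal sketch. The source does not give the full pre-correction sentence, so the tokenized sentence below is a hypothetical example, and the function name `rebuild_sentence` and the token representation are assumptions made for illustration. A real system would obtain the tokens and their parts of speech from a morphological analyzer.

```python
from typing import List, Sequence, Tuple

# (surface form, True if the token is a verb or a noun of a preset subtype)
Token = Tuple[str, bool]


def rebuild_sentence(tokens: Sequence[Token], selected: Sequence[int]) -> str:
    """Rebuild a sentence from the selected content words of one sentence.

    After every selected word except the last, the function words that
    directly follow it in the original sentence are restored.
    """
    parts: List[str] = []
    for pos, idx in enumerate(selected):
        parts.append(tokens[idx][0])
        if pos == len(selected) - 1:
            break
        j = idx + 1
        while j < len(tokens) and not tokens[j][1]:
            parts.append(tokens[j][0])   # e.g. the particles の and を
            j += 1
    return "".join(parts) + "。"


# Hypothetical pre-correction sentence: 明日特許の打ち合わせを会議室で行う
tokens = [
    ("明日", True), ("特許", True), ("の", False), ("打ち合わせ", True),
    ("を", False), ("会議室", True), ("で", False), ("行う", True),
]
# The user selects 特許, 打ち合わせ, 行う (token indices 1, 3, 7).
corrected = rebuild_sentence(tokens, [1, 3, 7])
print(corrected)
```

Restoring only the function words that directly follow each selected word keeps the output grammatical without having to generate new particles.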

The summary correction program 203 rewrites the pre-correction sentence 241 into the corrected sentence 242 in the summary 224 held in the main storage device 120. In the image of the summary 224 shown by the display device 150, the pre-correction sentence 241 is likewise replaced with the corrected sentence 242. The summary correction program 203 may instead create a copy of the summary 224 as it was before correction and correct the copy.

FIG. 9A shows another example of the images displayed on the screen of the display device 150 during summary correction and of the operations performed on them. In this example, one corrected sentence is generated from multiple pre-correction sentences. The user selects the sentences (images) 241 to be corrected in the summary 224 displayed on the screen. In the example of FIG. 9A, two pre-correction sentences 241 are selected with a finger via the touch input device 162.

The summary correction program 203 performs morphological analysis on the selected sentences 241 and extracts from them the verbs and the nouns of predetermined subtypes (the specified parts of speech). It marks the extracted words within the selected sentences 241. In the example of FIG. 9A, each extracted word in the sentence (image) 244 under correction is enclosed in a dashed rectangle. Which parts of speech are extracted may be set by design.

In the example of FIG. 9A, the summary correction program 203 accepts the user's selection of words in the sentence 244 under correction. The user selects words by tracing over them on the screen with a finger. Here the word "tomorrow" (明日) in the first sentence and the words "patent" (特許), "meeting" (打ち合わせ), and "will hold" (行います) in the second sentence are selected. In this example, the summary correction program 203 accepts the user's selection of a non-contiguous group of words from among the extracted words (verbs and nouns).

In response to input from the touch input device 162, the summary correction program 203 detects that four words have been selected in the sentence 244 under correction. In the example of FIG. 9A, it identifies the words touched in a single drag-and-drop operation ("tomorrow", "patent", "meeting", "hold") as the selected words.

The summary correction program 203 generates the corrected sentence 242 from the sequence of selected words. The word order in the corrected sentence 242 matches the order in which the words were selected. Referring to the pre-correction sentences 241, the summary correction program 203 restores the particle の between the word "patent" (特許) and the next extracted word "meeting" (打ち合わせ), and the particle を between the word "meeting" and the next extracted word "will hold" (行います). Because the words "tomorrow" (明日) and "patent" belong to different pre-correction sentences, it joins them with a comma (、). A natural corrected sentence 242 can thus be generated.

FIG. 9B shows a further example of the images displayed on the screen of the display device 150 during summary correction and of the operations performed on them. The description below focuses on the differences from the example of FIG. 9A. In the example of FIG. 9B, the words "tomorrow" (明日), "schedule" (予定), "confirm" (確認), and "will do" (します) in the first sentence and the word "patent" (特許) in the second sentence are selected.

The summary correction program 203 generates the corrected sentence 242 from the sequence of selected words. The word order in the corrected sentence 242 matches the order in which the words were selected. Referring to the pre-correction sentences 241, the summary correction program 203 restores the compound particle について ("about") between the word "schedule" (予定) and the next extracted word "confirm" (確認).

The words "tomorrow" (明日) and "patent" (特許) belong to different pre-correction sentences, and so do the words "patent" and "schedule" (予定). The summary correction program 203 therefore joins "tomorrow" and "patent" with a comma (、), and likewise joins "patent" and "schedule" with a comma. A natural corrected sentence 242 can thus be generated.
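The joining rule used in FIGS. 9A and 9B (a comma between consecutive selected words from different sentences, restored function words between consecutive selected words from the same sentence) can be sketched as below. The two tokenized pre-correction sentences are hypothetical reconstructions consistent with the words named in the text, and `merge_sentences` and the `(sentence index, token index)` selection format are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Token = Tuple[str, bool]   # (surface form, True if verb or noun of a preset subtype)
Pick = Tuple[int, int]     # (sentence index, token index), in the order selected


def merge_sentences(sentences: Sequence[Sequence[Token]],
                    picks: Sequence[Pick]) -> str:
    """Build one corrected sentence from words picked across several sentences."""
    parts: List[str] = []
    for pos, (s, t) in enumerate(picks):
        parts.append(sentences[s][t][0])
        if pos == len(picks) - 1:
            parts.append("。")
            break
        if picks[pos + 1][0] != s:
            parts.append("、")            # next word comes from another sentence
            continue
        j = t + 1                          # same sentence: restore function words
        while j < len(sentences[s]) and not sentences[s][j][1]:
            parts.append(sentences[s][j][0])
            j += 1
    return "".join(parts)


# Hypothetical pre-correction sentences:
#   明日の予定について確認します / 特許の打ち合わせを行います
first = [("明日", True), ("の", False), ("予定", True),
         ("について", False), ("確認", True), ("します", True)]
second = [("特許", True), ("の", False), ("打ち合わせ", True),
          ("を", False), ("行います", True)]

# FIG. 9A style selection: 明日, 特許, 打ち合わせ, 行います
result_9a = merge_sentences([first, second], [(0, 0), (1, 0), (1, 2), (1, 4)])
# FIG. 9B style selection: 明日, 特許, 予定, 確認, します
result_9b = merge_sentences([first, second],
                            [(0, 0), (1, 0), (0, 2), (0, 4), (0, 5)])
print(result_9a)
print(result_9b)
```

Because the output follows the selection order rather than the original order, the same two sentences can yield different corrected sentences depending on how the user traces the words.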

In the example above, the summary is displayed during the meeting and updated as the meeting proceeds. Alternatively, the system may create the summary from the audio data of the meeting after the meeting has ended. The system may also be configured with only some of the functions described above. For example, the context segmentation processing and its reference information may be omitted, and the function for correcting summary sentences may be omitted. Only some of the dictionaries described above may be implemented.

The present invention is not limited to the embodiments described above and includes various modifications. For example, the embodiments above are described in detail to explain the present invention clearly, and the invention is not necessarily limited to configurations that include every element described. Part of the configuration of one embodiment may be replaced with the configuration of another embodiment, and the configuration of another embodiment may be added to the configuration of one embodiment. For part of the configuration of each embodiment, other configurations may be added, deleted, or substituted.

Some or all of the configurations, functions, processing units, and the like described above may be implemented in hardware, for example by designing them as integrated circuits. They may also be implemented in software, with a processor interpreting and executing programs that realize the respective functions. Information such as the programs, tables, and files that realize each function can be stored in a memory, in a recording device such as a hard disk or an SSD (Solid State Drive), or on a recording medium such as an IC card or an SD card. The control lines and information lines shown are those considered necessary for the explanation, and not all control lines and information lines of a product are necessarily shown. In practice, almost all components may be regarded as interconnected.

100 conference summary generation system, 110 processor, 120 main storage device, 130 auxiliary storage device, 150 display device, 161 microphone, 162 touch input device, 163 pen input device, 164 mouse, 201 speech recognition program, 202 summary generation program, 203 summary correction program, 204 dictionary registration program, 221 conference material dictionary, 222 previously-mentioned noun dictionary, 223 preceding-sentence information, 224 summary, 241 pre-correction sentence, 242 corrected sentence, 244 sentence under correction, 251, 252 word sequences, 301 conference materials, 302 basic dictionary, 303 5W1H-related word dictionary, 304 consent word dictionary

Claims

A method by which a computer system creates a summary from audio data of a conference, wherein
the computer system includes
a processor, and
a storage device that stores a program executed by the processor and a dictionary,
the method comprising, by the processor:
acquiring the audio data of the conference;
generating a current text sentence from the audio data of the conference by speech recognition processing;
determining, with reference to the current text sentence and the dictionary, whether the current text sentence contains a word in the dictionary; and
adding the current text sentence to a summary stored in the storage device when the current text sentence contains a word in the dictionary,
wherein the dictionary contains words that indicate a question, and
the method further comprises the processor adding the current text sentence and the text sentence following the current text sentence to the summary when the current text sentence contains a word in the dictionary that indicates a question.

A method by which a computer system creates a summary from audio data of a conference, wherein
the computer system includes
a processor, and
a storage device that stores a program executed by the processor and a dictionary,
the method comprising, by the processor:
acquiring the audio data of the conference;
generating a current text sentence from the audio data of the conference by speech recognition processing;
determining, with reference to the current text sentence and the dictionary, whether the current text sentence contains a word in the dictionary; and
adding the current text sentence to a summary stored in the storage device when the current text sentence contains a word in the dictionary,
wherein the storage device stores a consent word dictionary consisting of words that indicate consent, and
the method further comprises adding the text sentence immediately preceding the current text sentence to the summary when the current text sentence contains a word in the consent word dictionary that indicates consent.

A method by which a computer system creates a summary from audio data of a conference, wherein
the computer system includes
a processor, and
a storage device that stores a program executed by the processor and a dictionary,
the method comprising, by the processor:
acquiring the audio data of the conference;
generating a current text sentence from the audio data of the conference by speech recognition processing;
determining, with reference to the current text sentence and the dictionary, whether the current text sentence contains a word in the dictionary; and
adding the current text sentence to a summary stored in the storage device when the current text sentence contains a word in the dictionary,
wherein the computer system further includes a display device that displays the summary, and
the method further comprises, by the processor:
displaying the summary on the display device;
accepting a user selection of a pre-correction text sentence in the summary;
marking the words of specified parts of speech in the pre-correction text sentence displayed on the display device;
accepting a user selection of words from among the words of the specified parts of speech;
generating a new text sentence that includes the words selected by the user from among the words of the specified parts of speech and excludes the words of the specified parts of speech not selected by the user;
replacing the pre-correction text sentence with the new text sentence; and
supplementing, in the new text sentence, a word of a part of speech other than the specified parts of speech that follows a user-selected word in the pre-correction text sentence.

A method by which a computer system creates a summary from audio data of a conference, wherein
the computer system includes
a processor, and
a storage device that stores a program executed by the processor and a dictionary,
the method comprising, by the processor:
acquiring the audio data of the conference;
generating a current text sentence from the audio data of the conference by speech recognition processing;
determining, with reference to the current text sentence and the dictionary, whether the current text sentence contains a word in the dictionary; and
adding the current text sentence to a summary stored in the storage device when the current text sentence contains a word in the dictionary,
wherein the computer system further includes a display device that displays the summary, and
the method further comprises, by the processor:
displaying the summary on the display device;
accepting a user selection of a first pre-correction text sentence and a second pre-correction text sentence in the summary;
marking the words of specified parts of speech in the first pre-correction text sentence and the second pre-correction text sentence displayed on the display device;
accepting a user selection of words from among the words of the specified parts of speech in the first pre-correction text sentence and the second pre-correction text sentence displayed on the display device;
generating a new text sentence that contains the words selected by the user; and
deleting the first pre-correction text sentence and the second pre-correction text sentence from the summary and adding the new text sentence to the summary,
wherein generating the new text sentence includes:
arranging the words selected by the user in the order of selection to form an array;
inserting, in the array of words selected by the user, a comma between consecutive words consisting of a word contained in the first pre-correction text sentence and a word contained in the second pre-correction text sentence; and
supplementing, in the array of words selected by the user, between consecutive words that are both contained in one of the first pre-correction text sentence and the second pre-correction text sentence, a word of a part of speech other than the specified parts of speech taken from that sentence.

The method according to any one of claims 1 to 4, wherein
the storage device stores conference materials used in the conference, and
the method further comprises the processor, before acquiring the audio data of the conference, performing morphological analysis on the conference materials, selecting words of predetermined parts of speech, and including them in the dictionary.

The method according to any one of claims 1 to 4, wherein
the dictionary contains a word indicating a time and a word indicating a place.