JP6626029B2

JP6626029B2 - Information processing apparatus, information processing method and program

Info

Publication number: JP6626029B2
Application number: JP2017054954A
Authority: JP
Inventors: 浩之田中
Original assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Current assignee: Toshiba Corp; Toshiba Digital Solutions Corp
Priority date: 2017-03-21
Filing date: 2017-03-21
Publication date: 2019-12-25
Anticipated expiration: 2037-03-21
Also published as: JP2018156593A

Description

本発明の実施形態は、情報処理装置、情報処理方法およびプログラムに関する。 An embodiment of the present invention relates to an information processing device, an information processing method, and a program.

近年、文化や経済のグローバル化に伴い、異なる言語を母語とする人同士のコミュニケーションの機会が増加している。そこで、音声認識技術と機械翻訳技術を組み合わせ、互いに異なる母語を用いて会議を行う場合においても、ある言語に統一して議事録を作成するシステムが提案されている。 In recent years, with the globalization of culture and economy, opportunities for communication between people having different languages as their mother tongues have increased. Therefore, a system has been proposed in which, even when a speech recognition technology and a machine translation technology are combined and a conference is held using different native languages, the minutes are created in a certain language.

特許第４４６６６６６号公報Japanese Patent No. 4466666

しかしながら、従来技術では、音声認識結果それ自体が議事録として作成される、または、音声認識結果を直接翻訳して議事録が作成される。このため、議事録としては冗長であり、利用者は必要な情報を素早く把握することが困難である。また、機械翻訳による結果が常に正しいとは限らないため、精度を担保したい場合、人手によるチェックが必要である。 However, in the related art, the speech recognition result itself is created as minutes, or the minutes are created by directly translating the speech recognition result. Therefore, the minutes are redundant, and it is difficult for the user to quickly grasp necessary information. In addition, since the result of machine translation is not always correct, a manual check is required to ensure accuracy.

本発明が解決しようとする課題は、多言語での議事録（要約）の作成をより円滑に行うことができる情報処理装置、情報処理方法およびプログラムを提供することである。 It is an object of the present invention to provide an information processing apparatus, an information processing method, and a program that can smoothly create minutes (summary) in multiple languages.

実施形態の情報処理装置は、検索部と、表示制御部と、生成部と、を備える。検索部は、音声を認識して得られる、１以上の言語で記述された１以上のテキストデータのうち、第１言語で記述された第１文と第２言語で記述された第２文とを対応づけた要約のテンプレートに適合するテキストデータを検索し、適合するテキストデータが検索されたテンプレートを出力する。表示制御部は、出力されたテンプレートを、適合するテキストデータと対応づけて表示させる。生成部は、テンプレートに基づき要約を生成する。 An information processing apparatus according to an embodiment includes a search unit, a display control unit, and a generation unit. The search unit is configured to output a first sentence described in a first language and a second sentence described in a second language among one or more text data described in one or more languages, obtained by recognizing a voice. Is searched for text data that conforms to the summary template associated with, and the template in which the matching text data is retrieved is output. The display control unit displays the output template in association with the matching text data. The generation unit generates an abstract based on the template.

図１は、第１の実施形態にかかる情報処理装置の構成例を示すブロック図である。FIG. 1 is a block diagram illustrating a configuration example of the information processing apparatus according to the first embodiment. 図２は、要約テンプレートの一例を示す図である。FIG. 2 is a diagram illustrating an example of the summary template. 図３は、要約テンプレートの一例を示す図である。FIG. 3 is a diagram illustrating an example of the summary template. 図４は、認識結果記憶部に記憶されるデータの構造の一例を示す図である。FIG. 4 is a diagram illustrating an example of a structure of data stored in the recognition result storage unit. 図５は、翻訳結果の例を示す図である。FIG. 5 is a diagram illustrating an example of the translation result. 図６は、生成画面の一例を示す図である。FIG. 6 is a diagram illustrating an example of the generation screen. 図７は、複数の候補を選択可能に表示する生成画面の一例を示す図である。FIG. 7 is a diagram illustrating an example of a generation screen that displays a plurality of candidates in a selectable manner. 図８は、変数の値を変更した後の生成画面の一例を示す図である。FIG. 8 is a diagram illustrating an example of the generation screen after changing the value of the variable. 図９は、生成された議事録の例を示す図である。FIG. 9 is a diagram illustrating an example of the generated minutes. 図１０は、第１の実施形態における音声認識処理の一例を示すフローチャートである。FIG. 10 is a flowchart illustrating an example of the voice recognition process according to the first embodiment. 図１１は、第１の実施形態における議事録候補生成処理の一例を示すフローチャートである。FIG. 11 is a flowchart illustrating an example of minutes candidate generation processing according to the first embodiment. 図１２は、第１の実施形態における議事録生成処理の一例を示すフローチャートである。FIG. 12 is a flowchart illustrating an example of the minutes generation process according to the first embodiment. 図１３は、第２の実施形態にかかる情報処理装置の構成例を示すブロック図である。FIG. 13 is a block diagram illustrating a configuration example of an information processing apparatus according to the second embodiment. 図１４は、編集作業の一例を示す図である。FIG. 14 is a diagram illustrating an example of the editing operation. 図１５は、第２の実施形態における議事録編集処理の一例を示すフローチャートである。FIG. 15 is a flowchart illustrating an example of the minutes editing process according to the second embodiment. 図１６は、第１または第２の実施形態にかかる情報処理装置のハードウェア構成例を示す説明図である。FIG. 16 is an explanatory diagram illustrating an example of a hardware configuration of the information processing apparatus according to the first or second embodiment.

以下に添付図面を参照して、この発明にかかる情報処理装置の好適な実施形態を詳細に説明する。 Hereinafter, preferred embodiments of an information processing apparatus according to the present invention will be described in detail with reference to the accompanying drawings.

なお、以下では会議システム（会議装置）に対して入力された音声を認識し、認識結果から会議の議事録を生成する場合を例に説明する。適用可能なシステムは会議システムに限られるものではない。例えば音声の認識結果などから、認識結果を要約した情報を生成するシステム（装置）に適用してもよい。 In the following, a case will be described as an example where speech input to a conference system (conference device) is recognized and minutes of the conference are generated from the recognition result. Applicable systems are not limited to conference systems. For example, the present invention may be applied to a system (apparatus) that generates information that summarizes a recognition result from a speech recognition result or the like.

また、以下では、会議参加者の使用する言語を日本語と英語とし、議事録作成者が用いる言語を日本語とした例で説明を進める。処理対象の言語は、これら二言語に限られることなく、さまざまな言語を対象とすることができる。これらの言語情報は予め会議参加時または議事録作成時などに利用者がシステムに入力しておくものとする。 Further, in the following, description will be given by taking an example in which the languages used by the meeting participants are Japanese and English, and the language used by the minutes creator is Japanese. The language to be processed is not limited to these two languages, and various languages can be targeted. It is assumed that the user inputs these pieces of linguistic information into the system in advance at the time of participating in a meeting or preparing minutes.

また以下の各実施形態では、会議中の言語間の意思疎通手段は別に用意されているものとする。例えば、通訳者が別途会議に参加して通訳を行ってもよいし、入力された音声を翻訳して出力する音声翻訳装置を介して行ってもよい。 In the following embodiments, it is assumed that communication means between languages during a conference is separately prepared. For example, the interpreter may separately participate in the meeting and perform the interpreting, or may perform the translation via a voice translating device that translates and outputs the input voice.

（第１の実施形態）
第１の実施形態にかかる情報処理装置は、異なる言語で記述された複数の文を対応づけた議事録のテンプレートから、音声認識結果に適合するテンプレートを検索し、検索されたテンプレートを用いて議事録を生成する。検索されたテンプレートは表示部などに表示され、必要に応じて修正される。このような構成により、議事録作成に必要かつ十分な内容を簡単に作成可能となる。また、複数言語の文を対応づけたテンプレートによる高精度な翻訳によって、多言語での議事録を同時に提供可能となる。 (1st Embodiment)
The information processing apparatus according to the first embodiment searches for a template that matches a speech recognition result from templates of minutes that correspond to a plurality of sentences described in different languages, and uses the searched template to Generate a record. The searched template is displayed on a display unit or the like, and is modified as necessary. With such a configuration, it is possible to easily create the contents necessary and sufficient for creating the minutes. In addition, high-precision translation using templates in which sentences in a plurality of languages are associated makes it possible to simultaneously provide minutes in multiple languages.

図１は、第１の実施形態にかかる情報処理装置１００の構成例を示すブロック図である。図１に示すように、情報処理装置１００は、端末装置２００とネットワークなどを介して接続される。ネットワークは、有線ネットワークおよび無線ネットワークのいずれでもよいし、両者が混在したネットワークでもよい。ネットワークは、例えばインターネットである。 FIG. 1 is a block diagram illustrating a configuration example of the information processing apparatus 100 according to the first embodiment. As shown in FIG. 1, the information processing device 100 is connected to a terminal device 200 via a network or the like. The network may be either a wired network or a wireless network, or a network in which both are mixed. The network is, for example, the Internet.

情報処理装置１００は、物理的に１つの装置（サーバ装置、パーソナルコンピュータなど）として実現してもよいし、複数の物理的な装置により構成される論理的な装置により実現してもよい。情報処理装置１００は、例えば、クラウド環境上に構築された仮想計算機などにより実現される。端末装置２００は、例えば、パーソナルコンピュータおよび携帯端末などのクラウド環境を利用する装置として実現される。 The information processing device 100 may be physically realized as one device (a server device, a personal computer, or the like), or may be realized as a logical device including a plurality of physical devices. The information processing apparatus 100 is realized by, for example, a virtual computer constructed on a cloud environment. The terminal device 200 is realized, for example, as a device using a cloud environment such as a personal computer and a mobile terminal.

端末装置２００は、情報処理装置１００からの指示に応じて議事録を生成するための情報を表示し、議事録を生成するための情報を情報処理装置１００に送信するために主に用いられる。端末装置２００で実行される処理を情報処理装置１００内で実行するように構成してもよい。例えば情報処理装置１００内に表示部を備え、この表示部に対して議事録を生成するための情報を表示してもよい。この場合、端末装置２００を備える必要はない。 The terminal device 200 is mainly used for displaying information for generating minutes in response to an instruction from the information processing device 100 and transmitting information for generating minutes to the information processing device 100. The processing executed by the terminal device 200 may be configured to be executed in the information processing device 100. For example, a display unit may be provided in the information processing apparatus 100, and information for generating minutes may be displayed on the display unit. In this case, there is no need to provide the terminal device 200.

端末装置２００は、表示部２２１と、通信制御部２０１と、表示制御部２０２と、を備えている。 The terminal device 200 includes a display unit 221, a communication control unit 201, and a display control unit 202.

表示部２２１は、画像などの各種情報を表示する装置である。表示部２２１は、液晶ディスプレイ、および、タッチパネルなどにより実現できる。表示部２２１に対する表示内容は、表示制御部２０２により制御される。 The display unit 221 is a device that displays various information such as images. The display unit 221 can be realized by a liquid crystal display, a touch panel, or the like. The display content on the display unit 221 is controlled by the display control unit 202.

通信制御部２０１は、情報処理装置１００などの外部装置との間の通信を制御する。例えば通信制御部２０１は、議事録の生成に用いられる生成画面を表示させるための情報を情報処理装置１００から受信する。また通信制御部２０１は、利用者により指示される情報を情報処理装置１００に送信する。 The communication control unit 201 controls communication with an external device such as the information processing device 100. For example, the communication control unit 201 receives, from the information processing apparatus 100, information for displaying a generation screen used for generating minutes. The communication control unit 201 transmits information specified by the user to the information processing device 100.

表示制御部２０２は、情報処理装置１００の表示制御部１０５（後述）からの指示に従い、表示部２２１に対する各種情報の表示を制御する。なお、情報処理装置１００内に表示部を備える構成の場合は、表示制御部２０２の処理を情報処理装置１００の表示制御部１０５が実行すればよい。 The display control unit 202 controls the display of various types of information on the display unit 221 according to an instruction from a display control unit 105 (described later) of the information processing device 100. In the case of a configuration including a display unit in the information processing device 100, the process of the display control unit 202 may be performed by the display control unit 105 of the information processing device 100.

情報処理装置１００は、テンプレート記憶部１２１と、認識結果記憶部１２２と、議事録記憶部１２３と、認識部１０１と、検索部１０２と、生成部１０３と、翻訳部１０４と、表示制御部１０５と、通信制御部１０６と、を備えている。 The information processing apparatus 100 includes a template storage unit 121, a recognition result storage unit 122, a minutes storage unit 123, a recognition unit 101, a search unit 102, a generation unit 103, a translation unit 104, a display control unit 105 And a communication control unit 106.

テンプレート記憶部１２１は、要約テンプレートを記憶する。図２および図３は、要約テンプレートの一例を示す図である。図２および図３に示すように、要約テンプレートは、日本語（第１言語）で記述された文２１０、３１０（第１文）と英語（第２言語）で記述された文２２０、３２０（第２文）とを含む。また要約テンプレートは、１以上の固定部と変数部とを含む。条件を満たす文字列を変数部に入力することで対訳文章を生成することができる。なお変数部を含まない要約テンプレートを用いてもよい。また対応づけられる言語の数は２に限られるものではなく、３以上の言語を対応づけてもよい。 The template storage unit 121 stores a summary template. 2 and 3 are diagrams illustrating an example of the summary template. As shown in FIGS. 2 and 3, the summary template includes sentences 210 and 310 (first sentence) described in Japanese (first language) and sentences 220 and 320 (320) written in English (second language). 2nd sentence). The summary template includes one or more fixed parts and a variable part. By inputting a character string that satisfies the condition into the variable section, a bilingual sentence can be generated. Note that a summary template that does not include a variable part may be used. Further, the number of languages to be associated is not limited to two, and three or more languages may be associated.

認識結果記憶部１２２は、認識部１０１による音声認識処理の結果を記憶する。図４は、認識結果記憶部１２２に記憶されるデータの構造の一例を示す図である。図４に示すように、認識結果記憶部１２２は、会議ＩＤと、発話者と、発話の開始時間と、言語と、認識結果と、を含む情報を記憶する。会議ＩＤは、会議を識別する識別情報である。発話者は、会議内で発話した利用者を識別する情報（例えば利用者の氏名、利用者ＩＤなど）である。言語は、認識結果の言語を識別する情報である。図４の例では「ＪＡ」が日本語を表し、「ＥＮ」が英語を表す。 The recognition result storage unit 122 stores the result of the voice recognition process performed by the recognition unit 101. FIG. 4 is a diagram illustrating an example of the structure of data stored in the recognition result storage unit 122. As illustrated in FIG. 4, the recognition result storage unit 122 stores information including a conference ID, a speaker, a start time of the utterance, a language, and a recognition result. The conference ID is identification information for identifying the conference. The speaker is information for identifying the user who has spoken in the conference (for example, the name of the user, the user ID, and the like). The language is information for identifying the language of the recognition result. In the example of FIG. 4, "JA" represents Japanese and "EN" represents English.

議事録記憶部１２３は、生成部１０３により生成された議事録（議事録候補）を記憶する。議事録記憶部１２３は、例えば、会議ＩＤと議事録とを対応づけて記憶する。ある会議に対して複数の議事録が生成される場合は、議事録の順序を示す情報などをさらに対応づけて記憶してもよい。 The minutes storage unit 123 stores the minutes (minute candidates) generated by the generation unit 103. The minutes storage unit 123 stores, for example, a meeting ID and a minutes in association with each other. When a plurality of minutes is generated for a certain conference, information indicating the order of the minutes may be further stored in association with the minutes.

なお、各記憶部は、ＨＤＤ（Hard Disk Drive）、光ディスク、メモリカード、ＲＡＭ（Random Access Memory）などの一般的に利用されているあらゆる記憶媒体により構成することができる。記憶部は、物理的に異なる記憶媒体としてもよいし、物理的に同一の記憶媒体の異なる記憶領域として実現してもよい。さらに記憶部のそれぞれは、物理的に異なる複数の記憶媒体により実現してもよい。 In addition, each storage unit can be configured by any storage medium generally used, such as an HDD (Hard Disk Drive), an optical disk, a memory card, and a RAM (Random Access Memory). The storage unit may be a physically different storage medium, or may be implemented as physically different storage areas of the same storage medium. Further, each of the storage units may be realized by a plurality of physically different storage media.

認識部１０１は、音声信号の入力を受付け、音声信号に対して音声認識処理を実行して、認識結果のテキストデータへと変換する。音声信号は、例えばマイクロフォンなどの音声入力装置から入力されてもよいし、事前に収集された音声信号を記憶する他の記憶装置などから入力されてもよい。 The recognition unit 101 receives an input of a speech signal, executes speech recognition processing on the speech signal, and converts the speech signal into text data of a recognition result. The audio signal may be input from an audio input device such as a microphone, for example, or may be input from another storage device that stores an audio signal collected in advance.

認識部１０１は、認識結果であるテキストデータと、発話時の情報とを対にして逐次、認識結果記憶部１２２に記憶する。発話時の情報は、例えば、発話時の言語、発話の開始時間、発話者、および、会議ＩＤなどである。発話時の情報は、少なくとも、いずれの会議であるか、および、認識されたテキストデータの言語を識別でき、かつ、時系列に並べられる情報であればよい。 The recognition unit 101 sequentially stores the text data as the recognition result and the information at the time of utterance in the recognition result storage unit 122 in pairs. The information at the time of the utterance includes, for example, the language at the time of the utterance, the start time of the utterance, the speaker, and the conference ID. The information at the time of the utterance may be at least information that can identify the conference and the language of the recognized text data and are arranged in time series.

検索部１０２は、認識結果記憶部１２２に蓄積されたテキストデータに適合する要約テンプレートを検索する。検索部１０２は、議事録生成要求を受け付けたとき、および、認識部１０１によりテキストデータが認識結果記憶部１２２に記憶されたとき、などの任意のタイミングで処理を開始してよい。 The search unit 102 searches for a summary template that matches the text data stored in the recognition result storage unit 122. The search unit 102 may start the process at an arbitrary timing, such as when a minutes generation request is received, or when the recognition unit 101 stores text data in the recognition result storage unit 122.

検索部１０２は、例えば、認識結果のテキストデータのうち、要約テンプレートに適合するテキストデータを検索し、適合するテキストデータが検索されたテンプレートを出力する。検索部１０２は、例えば、認識結果記憶部１２２から会議ごとに認識結果のテキストデータを取り出し、時系列に並べる。そして検索部１０２は、時系列に並べた情報に適合する要約テンプレートを検索する。 For example, the search unit 102 searches the text data of the recognition result for text data that matches the summary template, and outputs the template in which the matching text data has been searched. For example, the search unit 102 extracts the text data of the recognition result for each meeting from the recognition result storage unit 122 and arranges the text data in chronological order. Then, the search unit 102 searches for a summary template that matches the information arranged in chronological order.

検索部１０２は、適合する要約テンプレートを検索するために、一定以上の類似度を有し、かつ、すべての変数部に対して入力可能な文字列が存在する区間を探索する。検索部１０２は、例えば以下のようにして区間を検索する。まず検索部１０２は、テキストデータを形態素解析して品詞と原形を付与する。検索部１０２は、類似度については単語の一致率が一定以上であるという条件を課し、変数部については品詞の条件を課し、条件に適合する文字列がすべての変数部に対して存在する区間を探索する。検索部１０２は、両方の条件を満たす最小の区間（大きさが最小の区間）を、要約テンプレートに適合する区間とする。 The search unit 102 searches for a section that has a certain degree of similarity or more and has a character string that can be input for all variable parts in order to search for a suitable summary template. The search unit 102 searches for a section as follows, for example. First, the search unit 102 performs a morphological analysis on the text data and gives a part of speech and an original form. The search unit 102 imposes a condition that a word matching rate is equal to or higher than a certain degree for similarity, imposes a part of speech condition on a variable part, and a character string that meets the condition exists for all variable parts. Search for a section to perform. The search unit 102 determines a minimum section (a section having a minimum size) that satisfies both conditions as a section that matches the summary template.

検索部１０２による処理を図２の例で説明する。図２の要約テンプレートでは、［Ｎ］および［ＰＮ］が変数部であり、それぞれ名詞（名詞句）および固有名詞（固有名詞句）を変数に入力可能であることを意味している。このテンプレートを図４で例示されるテキストデータを用いて検索する場合、２行目の認識結果４０１（“その計画書は特に修正も要らないと思うので”）と、３行目の認識結果４０２（“Ｘ部長の承認をもらって進めといて下さい”）の２行に相当するテキストデータが、図２のテンプレートの文２１０に適合している。例えば、文２１０のうち、語句２１１（“［Ｎ］／は”）、語句２１２（“［ＰＮ］／の／承認／を”）、および、語句２１３（“もらう”）の部分が、このテキストデータに適合している。また語句４１１（“その計画書”）が変数部［Ｎ］に入力可能であり、語句４１２（“Ｘ部長”）が変数部［ＰＮ］に入力可能である。従って検索部１０２は、この２行に相当するテキストデータを適合する区間として検出する。 The processing by the search unit 102 will be described with reference to the example of FIG. In the summary template of FIG. 2, [N] and [PN] are variable parts, which means that a noun (noun phrase) and a proper noun (proper noun phrase) can be input to the variable, respectively. When this template is searched using the text data exemplified in FIG. 4, the recognition result 401 on the second line (“I do not think that the plan needs any particular correction”) and the recognition result 402 on the third line The text data corresponding to two lines (“Please proceed with the approval of the director X”) conforms to the template sentence 210 in FIG. For example, in the sentence 210, the words 211 (“[N] / wa”), the words 212 (“[PN] / no / approval /”), and the words 213 (“get”) are included in this text. Fits the data. The phrase 411 (“the plan”) can be input to the variable section [N], and the phrase 412 (“X section length”) can be input to the variable section [PN]. Therefore, the search unit 102 detects the text data corresponding to the two lines as a suitable section.

ここでは、取りうる変数として語句４１３（“修正”）も考えられるが、語句４１１（“その計画書／は”）の方がより長く一致している。このため、検索部１０２は、語句４１１を優先し、語句４１３（“修正”）は別候補として保持しておく。 Here, the word 413 (“correction”) is also conceivable as a possible variable, but the word 411 (“the plan / ha”) matches longer. For this reason, the search unit 102 gives priority to the phrase 411 and holds the phrase 413 (“correction”) as another candidate.

上記の例では、区間の探索は要約テンプレートのうち、日本語の用例で行っていたが、日本語および英語の両方の用例を対象にして探索することもできる。また、探索の際に、後述する翻訳部１０４により認識結果を翻訳し、その翻訳結果も対象に加えて探索することもできる。 In the above example, the section is searched for Japanese examples in the summary template, but it is also possible to search for both Japanese and English examples. Further, at the time of the search, the recognition result can be translated by the translation unit 104 described later, and the translated result can be searched in addition to the target.

例えば、図４で例示されるテキストデータを用いて検索すると、４行目の認識結果４０３と、５行目の認識結果４０４の２行に相当するテキストデータが、図３の要約テンプレートに適合している。例えば、語句３１１（“［ＰＮ］／部門／の”）、語句３１２（“sales”）、および、語句３１３（“last month”）の部分が、このテキストデータに適合している。また語句４２１（“国際営業”）が変数部［ＰＮ］に入力可能であり、語句４２２（“30 percent”）および語句４２３（“11.4 million dollars”）が変数部［ＮＵＭ］に入力可能である。従って検索部１０２は、この２行に相当するテキストデータを適合する区間として検出する。 For example, when a search is performed using the text data exemplified in FIG. 4, the text data corresponding to the two lines of the recognition result 403 on the fourth line and the recognition result 404 on the fifth line match the summary template in FIG. ing. For example, the phrase 311 (“[PN] / department / no”), the phrase 312 (“sales”), and the phrase 313 (“last month”) match this text data. Also, the word 421 (“international sales”) can be input to the variable part [PN], and the word 422 (“30 percent”) and the word 423 (“11.4 million dollars”) can be input to the variable part [NUM]. . Therefore, the search unit 102 detects the text data corresponding to the two lines as a suitable section.

このとき、例えば一致率が十分でない場合（一致率が予め定めた閾値より小さい場合など）には、翻訳部１０４は、それぞれの認識結果を、要約テンプレートの他方の言語（認識結果が日本語であれば英語に、英語であれば日本語、など）に翻訳する。図５は、このようにして翻訳された翻訳結果の例を示す図である。 At this time, for example, when the matching rate is not sufficient (for example, when the matching rate is smaller than a predetermined threshold), the translating unit 104 converts each recognition result into the other language of the summary template (when the recognition result is in Japanese). If it is, translate it into English, if English, translate it into Japanese, etc.). FIG. 5 is a diagram showing an example of the translation result translated in this manner.

検索部１０２は、このような翻訳結果を含めて適合箇所を探すことができる。例えば図５の例では、検索部１０２は、語句５０１（“先月”）、語句５０２（“売上高”）、および、語句５０３（“１１４０万ドル”）がさらに適合すると判定できる。これにより十分な一致率が得られ、検索部１０２は、図４の４行目と５行目に相当するテキストデータを適合する区間として検出する。 The search unit 102 can search for a matching part including such a translation result. For example, in the example of FIG. 5, the search unit 102 can determine that the word 501 (“last month”), the word 502 (“sales”), and the word 503 (“11.4 million dollars”) further match. As a result, a sufficient matching rate is obtained, and the search unit 102 detects text data corresponding to the fourth and fifth lines in FIG. 4 as a suitable section.

なお、変数［ＮＵＭ］に対応する語句として、語句４２２（“30 percent”）、語句４２３（“11.4 million dollars”）、および、語句５０３（“１１４０万ドル”）の３つが検出される。検索部１０２は、例えば最初に見つかった語句４２２を選択し、他の語句は別候補として保持しておく。 It should be noted that three phrases, a phrase 422 (“30 percent”), a phrase 423 (“11.4 million dollars”), and a phrase 503 (“11.4 million dollars”) are detected as the phrases corresponding to the variable [NUM]. The search unit 102 selects, for example, the first phrase 422 found, and holds other phrases as other candidates.

翻訳部１０４は、テキストデータを、異なる言語のテキストデータに翻訳する。翻訳部１０４は、トランスファ方式、用例ベース方式、統計ベース方式、および、中間言語方式などの任意の翻訳方法を適用できる。 The translation unit 104 translates text data into text data in different languages. The translation unit 104 can apply any translation method such as a transfer method, an example-based method, a statistical-based method, and an intermediate language method.

生成部１０３は、テンプレートに基づき議事録を生成する。例えば生成部１０３は、変数部を含まない要約テンプレートが検索された場合、検索された要約テンプレートを議事録として生成する。変数部を含む要約テンプレートが検索された場合、生成部１０３は、変数部に入力可能な語句（文字列）のうちいずれかを変数部に入力して議事録を生成する。 The generation unit 103 generates a minutes based on the template. For example, when a summary template that does not include the variable part is searched, the generation unit 103 generates the searched summary template as minutes. When the summary template including the variable part is searched, the generation unit 103 inputs any of the words (character strings) that can be input to the variable part to the variable part to generate the minutes.

なお、変数部に入力可能な語句の候補が複数存在する場合もあるため、最初に生成される文字列は、最終的に採用する議事録の候補であるとも解釈できる。後述するように、議事録の候補から利用者により選択された候補が、最終的な議事録として採用される。 It should be noted that there may be a plurality of phrase candidates that can be input to the variable part, so that the character string generated first can be interpreted as a candidate for the minutes to be finally adopted. As will be described later, a candidate selected by the user from among minutes candidates is adopted as the final minutes.

生成部１０３による議事録（議事録候補）の生成の具体例について説明する。例えば、図２の要約テンプレートに対して、変数部［Ｎ］に入力可能な語句４１１、および、変数部［ＰＮ］に入力可能な語句４１２が得られたとする。まず翻訳部１０４は、各語句を対応する言語に翻訳する。例えば翻訳部１０４は、日本語の語句４１１および語句４１２を、それぞれ英語の語句（“the plan”、“X director”）に翻訳する。 A specific example of generation of minutes (minute candidates) by the generation unit 103 will be described. For example, it is assumed that a phrase 411 that can be input to the variable part [N] and a word 412 that can be input to the variable part [PN] have been obtained for the summary template of FIG. First, the translation unit 104 translates each phrase into a corresponding language. For example, the translation unit 104 translates the Japanese words 411 and 412 into English words (“the plan” and “X director”).

生成部１０３は、語句４１１、語句４１２、および、翻訳結果の語句を、要約テンプレート内の対応する変数部に入力する。この結果、例えば、図６（後述）の日本語文６０１と、英語文“The plan is subject to X director’s approval.”とを含む対訳文章が議事録（議事録候補）として生成される。 The generation unit 103 inputs the words 411, 412, and the words resulting from the translation to the corresponding variable units in the summary template. As a result, for example, a bilingual sentence including a Japanese sentence 601 in FIG. 6 (described later) and an English sentence “The plan is subject to X director's approval.” Is generated as minutes (minutes candidate).

日本語と英語の両方に変数部が存在する場合も同様である。例えば、図３の要約テンプレートに対して、変数部［ＰＮ］に入力可能な語句４２１、および、変数部［ＮＵＭ］に入力可能な語句４２２が得られたとする。翻訳部１０４は、日本語の語句４２１を英語の語句（“international marketing”）に翻訳し、英語の語句４２２を日本語の語句（“３０％”）に翻訳する。 The same applies to the case where the variable part exists in both Japanese and English. For example, it is assumed that a phrase 421 that can be input to the variable part [PN] and a word 422 that can be input to the variable part [NUM] have been obtained for the summary template of FIG. The translation unit 104 translates the Japanese phrase 421 into an English phrase (“international marketing”), and translates the English phrase 422 into a Japanese phrase (“30%”).

生成部１０３は、得られた各語句を要約テンプレート内の対応する変数部に入力する。この結果、例えば、図６の日本語文６０２と、英語文“Sales of international marketing division in last month were 30%.”とを含む対訳文章が議事録（議事録候補）として生成される。変数部に別候補が存在する場合には、翻訳部１０４は、別候補についても同様に翻訳を行っておく。 The generation unit 103 inputs each obtained phrase to a corresponding variable part in the summary template. As a result, for example, a bilingual sentence including the Japanese sentence 602 in FIG. 6 and the English sentence “Sales of international marketing division in last month were 30%.” Is generated as the minutes (minutes candidate). If another candidate exists in the variable part, the translation unit 104 performs translation for another candidate in the same manner.

表示制御部１０５は、議事録を生成するための情報の表示を制御する。本実施形態では、表示制御部１０５は、端末装置２００の表示部２２１に対する情報の表示を制御する。表示制御部１０５は、通信制御部１０６および通信制御部２０１を介して、表示部２２１に対する表示に関する指示を表示制御部２０２に送信する。情報処理装置１００内に表示部を備える構成の場合は、表示制御部１０５は、この表示部に対して直接、表示に関する指示を送信すればよい。 The display control unit 105 controls display of information for generating minutes. In the present embodiment, the display control unit 105 controls the display of information on the display unit 221 of the terminal device 200. The display control unit 105 transmits an instruction regarding display on the display unit 221 to the display control unit 202 via the communication control unit 106 and the communication control unit 201. In the case of a configuration including a display unit in the information processing device 100, the display control unit 105 may directly transmit an instruction regarding display to the display unit.

通信制御部１０６は、端末装置２００などの外部装置との間の通信を制御する。例えば通信制御部１０６は、生成画面を表示させるための情報を端末装置２００に送信する。また通信制御部１０６は、生成画面で利用者により指示される情報を端末装置２００から受信する。利用者により指示される情報は、例えば、変数部に入力する値として選択された語句を示す情報、および、議事録候補のうち最終的な議事録として登録が指示された議事録候補を示す情報などである。 The communication control unit 106 controls communication with an external device such as the terminal device 200. For example, the communication control unit 106 transmits information for displaying the generation screen to the terminal device 200. Further, the communication control unit 106 receives, from the terminal device 200, information specified by the user on the generation screen. The information specified by the user is, for example, information indicating a phrase selected as a value to be input to the variable portion, and information indicating a minutes candidate whose registration has been instructed as a final minutes among minutes candidates. And so on.

上記各部（認識部１０１、検索部１０２、生成部１０３、翻訳部１０４、表示制御部１０５、および、通信制御部１０６など）は、例えば、１または複数のプロセッサにより実現される。例えば上記各部は、ＣＰＵ（Central Processing Unit）などのプロセッサにプログラムを実行させること、すなわちソフトウェアにより実現してもよい。上記各部は、専用のＩＣ（Integrated Circuit）などのプロセッサ、すなわちハードウェアにより実現してもよい。上記各部は、ソフトウェアおよびハードウェアを併用して実現してもよい。複数のプロセッサを用いる場合、各プロセッサは、各部のうち１つを実現してもよいし、各部のうち２以上を実現してもよい。 Each of the above units (the recognition unit 101, the search unit 102, the generation unit 103, the translation unit 104, the display control unit 105, the communication control unit 106, and the like) is realized by, for example, one or a plurality of processors. For example, each of the above units may be realized by causing a processor such as a CPU (Central Processing Unit) to execute a program, that is, by software. Each of the above units may be realized by a processor such as a dedicated IC (Integrated Circuit), that is, hardware. Each of the above units may be realized by using software and hardware together. When a plurality of processors are used, each processor may realize one of the units, or may realize two or more of the units.

次に、表示制御部１０５による表示処理の詳細について説明する。表示制御部１０５は、例えば、議事録を生成するための生成画面を表示させる。図６は、生成画面の一例を示す図である。図６に示すように、表示制御部１０５は、会議ログと、議事録候補と、を対応づけた生成画面６００を表示する。会議ログは、会議での発話を表す。例えば、発話の認識結果であるテキストデータを時系列に並べた情報が、会議ログとして表示される。図６に示すように、発話者とテキストデータとが対応づけて表示されてもよい。 Next, details of the display processing by the display control unit 105 will be described. The display control unit 105 displays, for example, a generation screen for generating minutes. FIG. 6 is a diagram illustrating an example of the generation screen. As shown in FIG. 6, the display control unit 105 displays a generation screen 600 in which a meeting log and minutes candidates are associated with each other. The conference log represents speech in the conference. For example, information in which text data, which is the recognition result of an utterance, is arranged in time series is displayed as a conference log. As shown in FIG. 6, the speaker and the text data may be displayed in association with each other.

議事録を作成する利用者の使用言語が日本語であることが予め分かっている場合は、議事録候補として生成された対訳文章のうち、日本語文のみが表示されてもよい。複数言語の文を含む対訳文書を表示するように構成してもよい。 If it is known in advance that the language used by the user who creates the minutes is Japanese, only the Japanese sentence among the bilingual sentences generated as minutes candidates may be displayed. A bilingual document including sentences in multiple languages may be displayed.

表示制御部１０５は、対訳文章の変数部に複数の候補が存在する場合、各候補を選択可能に表示し、候補の選択を受け付ける。図７は、複数の候補を選択可能に表示する生成画面の一例を示す図である。例えば利用者が別候補７０１（“１１４０万ドル”）を選択したとすると、議事録候補の変数が変更され、対応する対訳文章も同時に変更される。図８は、変数の値を変更した後の生成画面の一例を示す図である。図８に示すように、変更された候補８０１を含む生成画面が表示される。 When a plurality of candidates exist in the variable portion of the bilingual sentence, the display control unit 105 displays each candidate in a selectable manner, and accepts the selection of the candidate. FIG. 7 is a diagram illustrating an example of a generation screen that displays a plurality of candidates in a selectable manner. For example, if the user selects another candidate 701 ("$ 11.4 million"), the variables of the minutes candidate are changed, and the corresponding translated text is also changed at the same time. FIG. 8 is a diagram illustrating an example of the generation screen after changing the value of the variable. As shown in FIG. 8, a generation screen including the changed candidate 801 is displayed.

生成画面は、議事録候補を編集する機能、議事録候補のうち採用する候補を指定する機能、および、指定された候補を最終的な議事録として登録することを指定する機能などをさらに備えていてもよい。 The generation screen further includes a function for editing minutes candidates, a function for specifying candidates to be adopted among minutes candidates, a function for specifying that the specified candidates are registered as final minutes, and the like. You may.

生成部１０３は、例えば生成画面で登録が指示された議事録候補を、最終的な議事録として生成し、議事録記憶部１２３に記憶してもよい。例えば生成部１０３は、利用者による対訳文章の選択を、表示制御部１０５を介して受け付ける。生成部１０３は、受け付けた対訳文章を順に議事録記憶部１２３に記憶する。生成部１０３は、記憶した対訳文章のリストから議事録を生成する。図９は、生成された議事録の例を示す図である。 For example, the generation unit 103 may generate a minutes candidate whose registration has been instructed on the generation screen as a final minutes, and store the minutes in the minutes storage unit 123. For example, the generation unit 103 receives a selection of a translated text by the user via the display control unit 105. The generation unit 103 stores the received bilingual sentences in the minutes storage unit 123 in order. The generation unit 103 generates minutes from the stored bilingual sentence list. FIG. 9 is a diagram illustrating an example of the generated minutes.

次に、このように構成された第１の実施形態にかかる情報処理装置１００による各処理について説明する。全体の処理は、音声認識処理、議事録候補生成処理、および、議事録生成処理の３つの部分処理に分けられる。以下、各処理について説明する。 Next, each process performed by the information processing apparatus 100 according to the first embodiment configured as described above will be described. The whole process is divided into three partial processes: a speech recognition process, a minutes candidate generation process, and a minutes generation process. Hereinafter, each process will be described.

図１０は、第１の実施形態における音声認識処理の一例を示すフローチャートである。音声認識処理は、例えば会議が開始されたとき、および、利用者により指示されたときなどに開始される。 FIG. 10 is a flowchart illustrating an example of the voice recognition process according to the first embodiment. The voice recognition processing is started, for example, when a conference is started and when a user instructs.

認識部１０１は、音声（音声信号）が入力されたか否かを判定する（ステップＳ１０１）。入力された場合（ステップＳ１０１：Ｙｅｓ）、認識部１０１は、音声に対する認識処理を実行する（ステップＳ１０２）。認識部１０１は、認識結果と発話時の情報とを対応づけて認識結果記憶部１２２に記憶する（ステップＳ１０３）。認識部１０１は、ステップＳ１０１に戻り、次の音声入力を待つ。 The recognizing unit 101 determines whether a voice (voice signal) has been input (step S101). When the input has been made (step S101: Yes), the recognition unit 101 performs a recognition process on the voice (step S102). The recognition unit 101 stores the recognition result and the information at the time of speech in the recognition result storage unit 122 in association with each other (step S103). The recognition unit 101 returns to step S101 and waits for the next voice input.

音声が入力されない場合（ステップＳ１０１：Ｎｏ）、認識部１０１は、例えば利用者により音声認識処理の終了が指示されたか否かを判定する（ステップＳ１０４）。終了が指示されていない場合（ステップＳ１０４：Ｎｏ）、認識部１０１は、ステップＳ１０１に戻り処理を繰り返す。終了が指示された場合（ステップＳ１０４：Ｙｅｓ）、認識部１０１は、音声認識処理を終了する。 When no voice is input (Step S101: No), the recognition unit 101 determines whether or not the user has instructed to end the voice recognition process (Step S104). When the end is not instructed (Step S104: No), the recognition unit 101 returns to Step S101 and repeats the processing. When the end is instructed (Step S104: Yes), the recognition unit 101 ends the voice recognition processing.

図１１は、第１の実施形態における議事録候補生成処理の一例を示すフローチャートである。議事録候補生成処理は、例えば会議の進行と並行して、すなわち、会議と同期して実行されてもよいし、非同期に実行されてもよい。例えば議事録候補生成処理は、会議終了直後、および、利用者から議事録生成要求を受け取ったときなどに実行されてもよい。議事録生成要求は、いずれの会議の議事録を生成するか判定するために会議ＩＤを含んでもよい。 FIG. 11 is a flowchart illustrating an example of minutes candidate generation processing according to the first embodiment. The minutes candidate generation process may be performed, for example, in parallel with the progress of the conference, that is, in synchronization with the conference, or may be performed asynchronously. For example, the minutes candidate generation process may be executed immediately after the end of the meeting or when a minutes generation request is received from the user. The minutes generation request may include the conference ID to determine which minutes of the conference to generate.

検索部１０２は、要求された会議の会議ＩＤを用いて、認識結果記憶部１２２から、対応する会議の認識結果と発話時の情報のリスト（以下、リストＴとする）を読み出す（ステップＳ２０１）。検索部１０２は、リストＴを時系列順に並べた情報（以下、情報Ｔ’とする）を生成する（ステップＳ２０２）。 Using the conference ID of the requested conference, the search unit 102 reads a list of recognition results of the corresponding conference and information at the time of speech (hereinafter, referred to as a list T) from the recognition result storage unit 122 (step S201). . The search unit 102 generates information in which the list T is arranged in chronological order (hereinafter, referred to as information T ') (step S202).

検索部１０２は、未検索の要約テンプレートを、テンプレート記憶部１２１から１つ取得する（ステップＳ２０３）。また検索部１０２は、検索結果を格納するための検出リストＹを用意する。 The search unit 102 acquires one unsearched summary template from the template storage unit 121 (step S203). Further, the search unit 102 prepares a detection list Y for storing search results.

検索部１０２は、取得した要約テンプレートに適合する区間を、情報Ｔ’から検索する（ステップＳ２０４）。適合する区間（以下、区間Ｒとする）が存在する場合（ステップＳ２０４：Ｙｅｓ）、検索部１０２は、区間Ｒと、適合したテンプレート（以下、テンプレートＵとする）と、変数部がある場合は変数部に入力可能な文字列と、の組を検出リストＹに追加する（ステップＳ２０５）。 The search unit 102 searches the information T 'for a section that matches the acquired summary template (step S204). If there is a matching section (hereinafter, referred to as section R) (step S204: Yes), the search unit 102 determines whether there is a section R, a matching template (hereinafter, referred to as template U), and a variable section. A set of a character string that can be input to the variable part is added to the detection list Y (step S205).

検出リストＹに追加後、および、適合する区間が存在しない場合（ステップＳ２０４：Ｎｏ）、検索部１０２は、すべての要約テンプレートを処理したか否かを判定する（ステップＳ２０６）。すべての要約テンプレートを処理していない場合（ステップＳ２０６：Ｎｏ）、検索部１０２は、ステップＳ２０３に戻り、次の未処理の要約テンプレートに対して処理を繰り返す。 After the addition to the detection list Y and when there is no suitable section (step S204: No), the search unit 102 determines whether or not all the summary templates have been processed (step S206). If all the summary templates have not been processed (step S206: No), the search unit 102 returns to step S203 and repeats the process for the next unprocessed summary template.

すべての要約テンプレートを処理した場合（ステップＳ２０６：Ｙｅｓ）、生成部１０３は、得られた検出リストＹに含まれる要約テンプレートから議事録候補を生成する（ステップＳ２０７）。例えば生成部１０３は、変数部がある場合は、変数部に入力可能な候補のうちいずれかを変数部に入力し、対訳文章を生成する。生成部１０３は、各要約テンプレートに対応する対訳文章のリストである対訳文章リストＰを議事録候補として生成する。 When all the summary templates have been processed (Step S206: Yes), the generation unit 103 generates minutes candidates from the summary templates included in the obtained detection list Y (Step S207). For example, when there is a variable part, the generation unit 103 inputs any of the candidates that can be input to the variable part to the variable part, and generates a bilingual sentence. The generating unit 103 generates a bilingual sentence list P, which is a list of bilingual sentences corresponding to each of the summary templates, as minutes candidates.

次に、表示制御部１０５は、情報Ｔ’と対訳文章リストＰとを対応づけて表示させる（ステップＳ２０８）。例えば表示制御部１０５は、図６で示すような生成画面６００を用いて情報Ｔ’（会議ログ）と対訳文章リストＰ（議事録候補）とを表示する。 Next, the display control unit 105 displays the information T 'and the bilingual sentence list P in association with each other (step S208). For example, the display control unit 105 displays information T ′ (meeting log) and a bilingual sentence list P (minute candidates) using a generation screen 600 as shown in FIG.

図１２は、第１の実施形態における議事録生成処理の一例を示すフローチャートである。議事録生成処理は、議事録候補生成処理の後に行われる。例えば議事録生成処理は、議事録候補生成処理の直後、または、利用者の明示的に要求されたときなどに適宜実行される。以下では、例えば図６で示すような生成画面６００が表示された後に引き続き、議事録生成処理が実行される例を説明する。 FIG. 12 is a flowchart illustrating an example of the minutes generation process according to the first embodiment. The minutes generation process is performed after the minutes candidate generation process. For example, the minutes generation process is appropriately executed immediately after the minutes candidate generation process, or when explicitly requested by the user. In the following, an example will be described in which the minutes generation processing is executed after the generation screen 600 shown in FIG. 6 is displayed, for example.

生成部１０３は、利用者から変数部の変更が指示されたか否かを判定する（ステップＳ３０１）。変更が指示された場合（ステップＳ３０１：Ｙｅｓ）、生成部１０３は、変数部を指示された値（語句）に変更する（ステップＳ３０２）。表示制御部１０５は、変更が反映された対訳文章を含むように生成画面を更新して表示する。 The generation unit 103 determines whether the user has instructed to change the variable unit (Step S301). When the change is instructed (Step S301: Yes), the generation unit 103 changes the variable part to the instructed value (phrase) (Step S302). The display control unit 105 updates and displays the generation screen so as to include the translated text in which the change is reflected.

値を変更後、または、変更が指示されていない場合（ステップＳ３０１：Ｎｏ）、生成部１０３は、議事録の追加が指示されたか否かを判定する（ステップＳ３０３）。追加が指示された場合（ステップＳ３０３：Ｙｅｓ）、生成部１０３は、追加された議事録に対応する対訳文章を生成し、議事録記憶部１２３中の該当する会議の議事録に追加する（ステップＳ３０４）。 After changing the value, or when the change is not instructed (step S301: No), the generation unit 103 determines whether or not the addition of the minutes is instructed (step S303). When addition is instructed (step S303: Yes), the generation unit 103 generates a bilingual sentence corresponding to the added minutes and adds the bilingual sentence to the minutes of the corresponding meeting in the minutes storage unit 123 (step S303). S304).

議事録を追加後、または、追加が指示されていない場合（ステップＳ３０３：Ｎｏ）、生成部１０３は、処理の終了が指示されたか否かを判定する（ステップＳ３０５）。終了が指示されていない場合（ステップＳ３０５：Ｎｏ）、生成部１０３は、ステップＳ３０１に戻り処理を繰り返す。終了が指示された場合（ステップＳ３０５：Ｙｅｓ）、生成部１０３は、議事録生成処理を終了する。 After adding the minutes, or when the addition is not instructed (step S303: No), the generation unit 103 determines whether or not the end of the process is instructed (step S305). When the end is not instructed (step S305: No), the generation unit 103 returns to step S301 and repeats the processing. When the end is instructed (Step S305: Yes), the generation unit 103 ends the minutes generation process.

このように、第１の実施形態にかかる情報処理装置は、要約（議事録）のテンプレートから、音声認識結果に適合するテンプレートを検索し、検索されたテンプレートを用いて議事録を生成する。これにより、要約生成に必要かつ十分な内容を簡単に作成可能となる。また、テンプレートは、複数言語の文を対応づけているため、多言語での要約を同時に提供可能となる。 As described above, the information processing apparatus according to the first embodiment searches the template of the summary (minutes) for a template that matches the speech recognition result, and generates the minutes using the searched template. As a result, it is possible to easily create necessary and sufficient contents for generating the summary. In addition, since the template is associated with sentences in a plurality of languages, it is possible to simultaneously provide summaries in multiple languages.

（第２の実施形態）
第２の実施形態にかかる情報処理装置は、第１の実施形態の処理に加えて、議事録編集処理が追加される。音声認識処理、議事録候補生成処理、および、議事録生成処理は同一であるため説明を省略する。 (Second embodiment)
In the information processing apparatus according to the second embodiment, minutes editing processing is added to the processing of the first embodiment. Since the speech recognition process, the minutes candidate generation process, and the minutes generation process are the same, the description is omitted.

図１３は、第２の実施形態にかかる情報処理装置１００−２の構成例を示すブロック図である。図１３に示すように、情報処理装置１００−２は、テンプレート記憶部１２１と、認識結果記憶部１２２と、議事録記憶部１２３と、認識部１０１と、検索部１０２と、生成部１０３と、翻訳部１０４と、表示制御部１０５と、通信制御部１０６と、編集部１０７−２と、を備えている。 FIG. 13 is a block diagram illustrating a configuration example of an information processing device 100-2 according to the second embodiment. As shown in FIG. 13, the information processing apparatus 100-2 includes a template storage unit 121, a recognition result storage unit 122, a minutes storage unit 123, a recognition unit 101, a search unit 102, a generation unit 103, It includes a translation unit 104, a display control unit 105, a communication control unit 106, and an editing unit 107-2.

第２の実施形態では、情報処理装置１００−２に編集部１０７−２を追加したことが第１の実施形態と異なっている。その他の構成および機能は、第１の実施形態にかかる情報処理装置１００のブロック図である図１と同様であるので、同一符号を付し、ここでの説明は省略する。 The second embodiment is different from the first embodiment in that an editing unit 107-2 is added to the information processing apparatus 100-2. Other configurations and functions are the same as those in FIG. 1 which is a block diagram of the information processing apparatus 100 according to the first embodiment, and therefore, are denoted by the same reference numerals and description thereof will be omitted.

編集部１０７−２は、生成された要約（議事録）を編集する。例えば編集部１０７−２は、議事録記憶部１２３から議事録を取得し、利用者による編集作業に従い議事録を編集する。編集部１０７−２は、対訳文章の順番の入れ替え、変数部の修正、任意の文章の追加、および、議事録の削除などを行う。 The editing unit 107-2 edits the generated summary (minutes). For example, the editing unit 107-2 acquires the minutes from the minutes storage unit 123, and edits the minutes according to the editing operation by the user. The editing unit 107-2 changes the order of the translated sentences, corrects the variable part, adds an arbitrary sentence, and deletes the minutes.

図１４は、編集作業の一例を示す図である。図１４は、図９の対訳文章のうち日本語の１行目の文（“その計画書はＸ部長の承認をもらうこと”）の一部を語句１４０１（“●●プロジェクトの計画書”）に変更し、新たな日本語文１４０２（“今月は更に１０％の売上増加を見込む。”）を追加した例を示す。 FIG. 14 is a diagram illustrating an example of the editing operation. FIG. 14 shows a part of the sentence of the first line in Japanese (“the plan must be approved by the director of X”) in the bilingual sentence of FIG. 9 as a phrase 1401 (“●● project plan”). An example in which a new Japanese sentence 1402 (“a 10% increase in sales is expected this month.”) Is added.

これら編集作業は、議事録作成者の使用言語（この場合は日本語）で行われる。その後、編集部１０７−２は、変更された部分を、翻訳部１０４を用いて翻訳し、議事録の対応する部分にそれぞれ入力する。この結果、英語文についても語句１４１１（“The plan for ●● project”）の部分が変更され、日本語文１４０２の翻訳結果である英語文１４１２（“This month we further expected to increase in sales of 10%”）が追加される。 These editing operations are performed in the language used by the minutes creator (in this case, Japanese). After that, the editing unit 107-2 translates the changed portion using the translation unit 104, and inputs the translated portion into the corresponding portion of the minutes. As a result, the phrase 1411 (“The plan for ●● project”) in the English sentence is also changed, and the English sentence 1412 (“This month we further expected to increase in sales of 10% ") Is added.

次に、このように構成された第２の実施形態にかかる情報処理装置１００−２による議事録編集処理について図１５を用いて説明する。図１５は、第２の実施形態における議事録編集処理の一例を示すフローチャートである。議事録編集処理は、例えば利用者からの明示的な要求、または、議事録生成処理の終了後に開始される。 Next, a minutes editing process performed by the information processing apparatus 100-2 according to the second embodiment thus configured will be described with reference to FIG. FIG. 15 is a flowchart illustrating an example of the minutes editing process according to the second embodiment. The minutes editing process is started, for example, after an explicit request from a user or the end of the minutes generating process.

編集部１０７−２は、議事録を編集する会議の会議ＩＤを受け取り、対応する会議の議事録を取得し、表示制御部１０５により表示する（ステップＳ４０１）。その後、編集部１０７−２は、利用者による指示を待つ。 The editing unit 107-2 receives the meeting ID of the meeting whose minutes are to be edited, acquires the minutes of the corresponding meeting, and displays it by the display control unit 105 (step S401). Thereafter, the editing unit 107-2 waits for an instruction from the user.

編集部１０７−２は、編集が指示されたか否かを判定する（ステップＳ４０２）。編集が指示された場合（ステップＳ４０２：Ｙｅｓ）、編集部１０７−２は、指示に従い議事録を編集する（ステップＳ４０３）。 The editing unit 107-2 determines whether editing has been instructed (step S402). When the editing is instructed (Step S402: Yes), the editing unit 107-2 edits the minutes according to the instruction (Step S403).

編集部１０７−２は、例えば対訳文章の変数部の編集が指示された場合、指示に応じて変数部を更新する。編集部１０７−２は、任意の文章の追加が指示された場合、編集部１０７−２は、翻訳部１０４に文章を渡して追加された文章を翻訳させ、翻訳された文章を追加された文章とともに議事録に追加する。編集部１０７−２は、文章の順序入れ替え、または、削除が指示された場合、指示に従い文章の入れ替えまたは削除を実行する。 For example, when the editing of the variable portion of the bilingual sentence is instructed, the editing unit 107-2 updates the variable portion according to the instruction. When an instruction to add an arbitrary sentence is instructed, the editing unit 107-2 passes the sentence to the translating unit 104, causes the added sentence to be translated, and adds the translated sentence to the added sentence. To be added to the minutes. When an instruction to change the order of the text or to delete the text is issued, the editing unit 107-2 executes the text replacement or deletion according to the instruction.

編集が終了後、または、編集が指示されていない場合（ステップＳ４０２：Ｎｏ）、編集部１０７−２は、処理の終了が指示されたか否かを判定する（ステップＳ４０４）。終了が指示されていない場合（ステップＳ４０４：Ｎｏ）、編集部１０７−２は、ステップＳ４０２に戻り処理を繰り返す。 After the editing is completed, or when the editing is not instructed (step S402: No), the editing unit 107-2 determines whether the end of the process is instructed (step S404). If termination has not been instructed (step S404: No), the editing unit 107-2 returns to step S402 and repeats the processing.

終了が指示された場合（ステップＳ４０４：Ｙｅｓ）、編集部１０７−２は、それまでに編集された議事録を議事録記憶部１２３に登録（記憶）し（ステップＳ４０５）、議事録編集処理を終了する。 When the end is instructed (step S404: Yes), the editing unit 107-2 registers (stores) the minutes edited so far in the minutes storage unit 123 (step S405), and executes the minutes editing process. finish.

このように、第２の実施形態にかかる情報処理装置では、生成された議事録をさらに編集することが可能となる。 As described above, in the information processing apparatus according to the second embodiment, the generated minutes can be further edited.

以上説明したとおり、第１および第２の実施形態によれば、多言語での議事録（要約）の作成をより円滑に行うことが可能となる。第１および第２の実施形態にかかる情報処理装置、情報処理方法およびプログラムは、例えば話し言葉を翻訳して意味の伝達を行うことに適している。 As described above, according to the first and second embodiments, it is possible to create minutes (summary) in multiple languages more smoothly. The information processing apparatus, the information processing method, and the program according to the first and second embodiments are suitable for, for example, translating spoken words and transmitting meaning.

次に、第１または第２の実施形態にかかる情報処理装置のハードウェア構成について図１６を用いて説明する。図１６は、第１または第２の実施形態にかかる情報処理装置のハードウェア構成例を示す説明図である。 Next, a hardware configuration of the information processing apparatus according to the first or second embodiment will be described with reference to FIG. FIG. 16 is an explanatory diagram illustrating an example of a hardware configuration of the information processing apparatus according to the first or second embodiment.

第１または第２の実施形態にかかる情報処理装置は、ＣＰＵ（Central Processing Unit）５１などの制御装置と、ＲＯＭ（Read Only Memory）５２やＲＡＭ（Random Access Memory）５３などの記憶装置と、ネットワークに接続して通信を行う通信Ｉ／Ｆ５４と、各部を接続するバス６１を備えている。 An information processing apparatus according to the first or second embodiment includes a control device such as a CPU (Central Processing Unit) 51, a storage device such as a ROM (Read Only Memory) 52 and a RAM (Random Access Memory) 53, a network, And a communication I / F 54 for performing communication by connecting to each other, and a bus 61 for connecting each unit.

第１または第２の実施形態にかかる情報処理装置で実行されるプログラムは、ＲＯＭ５２等に予め組み込まれて提供される。 A program executed by the information processing apparatus according to the first or second embodiment is provided by being incorporated in the ROM 52 or the like in advance.

第１または第２の実施形態にかかる情報処理装置で実行されるプログラムは、インストール可能な形式又は実行可能な形式のファイルでＣＤ−ＲＯＭ（Compact Disk Read Only Memory）、フレキシブルディスク（ＦＤ）、ＣＤ−Ｒ（Compact Disk Recordable）、ＤＶＤ（Digital Versatile Disk）等のコンピュータで読み取り可能な記録媒体に記録してコンピュータプログラムプロダクトとして提供されるように構成してもよい。 The program executed by the information processing apparatus according to the first or second embodiment is a file in an installable format or an executable format, which is a CD-ROM (Compact Disk Read Only Memory), a flexible disk (FD), or a CD. -It may be configured such that it is recorded on a computer-readable recording medium such as R (Compact Disk Recordable) or DVD (Digital Versatile Disk) and provided as a computer program product.

さらに、第１または第２の実施形態にかかる情報処理装置で実行されるプログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、ネットワーク経由でダウンロードさせることにより提供するように構成してもよい。また、第１または第２の実施形態にかかる情報処理装置で実行されるプログラムをインターネット等のネットワーク経由で提供または配布するように構成してもよい。 Further, a program executed by the information processing apparatus according to the first or second embodiment is stored on a computer connected to a network such as the Internet, and provided by being downloaded via the network. Is also good. Further, the program executed by the information processing apparatus according to the first or second embodiment may be provided or distributed via a network such as the Internet.

第１または第２の実施形態にかかる情報処理装置で実行されるプログラムは、コンピュータを上述した情報処理装置の各部として機能させうる。このコンピュータは、ＣＰＵ５１がコンピュータ読取可能な記憶媒体からプログラムを主記憶装置上に読み出して実行することができる。 The program executed by the information processing apparatus according to the first or second embodiment can cause a computer to function as each unit of the information processing apparatus described above. In this computer, the CPU 51 can read out a program from a computer-readable storage medium onto a main storage device and execute the program.

本発明のいくつかの実施形態を説明したが、これらの実施形態は、例として提示したものであり、発明の範囲を限定することは意図していない。これら新規な実施形態は、その他の様々な形態で実施されることが可能であり、発明の要旨を逸脱しない範囲で、種々の省略、置き換え、変更を行うことができる。これら実施形態やその変形は、発明の範囲や要旨に含まれるとともに、特許請求の範囲に記載された発明とその均等の範囲に含まれる。 While some embodiments of the invention have been described, these embodiments have been presented by way of example only, and are not intended to limit the scope of the inventions. These new embodiments can be implemented in other various forms, and various omissions, replacements, and changes can be made without departing from the spirit of the invention. These embodiments and their modifications are included in the scope and gist of the invention, and are also included in the invention described in the claims and their equivalents.

１００、１００−２情報処理装置
１０１認識部
１０２検索部
１０３生成部
１０４翻訳部
１０５表示制御部
１０６通信制御部
１０７−２編集部
１２１テンプレート記憶部
１２２認識結果記憶部
１２３議事録記憶部
２００端末装置
２０１通信制御部
２０２表示制御部
２２１表示部 REFERENCE SIGNS LIST 100, 100-2 information processing device 101 recognition unit 102 search unit 103 generation unit 104 translation unit 105 display control unit 106 communication control unit 107-2 editing unit 121 template storage unit 122 recognition result storage unit 123 minutes storage unit 200 terminal device 201 communication control unit 202 display control unit 221 display unit

Claims

A first sentence described in a first language and a second sentence described in a second language are associated with one or more text data described in one or more languages obtained by recognizing a voice. A search unit that searches for text data that matches the template of the abstract, and outputs the template where the matching text data is searched;
A display control unit that displays the output template in association with the text data that matches,
A generation unit that generates a summary based on the template ,
The first sentence and the second sentence include a variable into which a value can be input,
The generating unit generates a summary by inputting a phrase included in the text data that is adapted to a variable included in the first sentence and the second sentence,
The display control unit, when a plurality of words that can be input to the variable is included in the text data, display a plurality of words in a selectable manner,
The generating unit generates a summary by inputting a selected phrase among the plurality of displayed phrases to the variable,
Information processing apparatus.

The apparatus further comprises a translation unit that translates the text data into text data in different languages,
The search unit searches the text data to which the translated text data has been added, for text data that matches the summary template.
The information processing device according to claim 1.

Further comprising an editing unit for editing the generated summary,
The information processing device according to claim 1.

It further includes a recognition unit that recognizes voice and outputs text data,
The search unit searches for text data matching the template from among the one or more text data output by the recognition unit, and outputs the template in which the matching text data has been searched.
The information processing device according to claim 1.

A first sentence described in a first language and a second sentence described in a second language are associated with one or more text data described in one or more languages obtained by recognizing a voice. A search step of searching for text data that matches the template of the abstract, and outputting the template in which the matching text data has been searched;
A display control step of displaying the output template in association with the matching text data,
Generating a summary based on the template ,
The first sentence and the second sentence include a variable into which a value can be input,
The generating step generates a summary by inputting a phrase included in the text data that is adapted to a variable included in the first sentence and the second sentence,
The display control step, when a plurality of words that can be input to the variable is included in the text data, display a plurality of words in a selectable manner,
The generating step generates a summary by inputting a selected phrase among the plurality of displayed phrases to the variable,
Information processing method.

Computer
A first sentence described in a first language and a second sentence described in a second language are associated with one or more text data described in one or more languages obtained by recognizing a voice. A search unit that searches for text data that matches the template of the abstract, and outputs the template in which the matching text data is searched;
A display control unit that displays the output template in association with the text data that matches,
A generating unit that generates a summary based on the template ,
The first sentence and the second sentence include a variable into which a value can be input,
The generating unit generates a summary by inputting a phrase included in the text data that is adapted to a variable included in the first sentence and the second sentence,
The display control unit, when a plurality of words that can be input to the variable is included in the text data, display a plurality of words in a selectable manner,
The generating unit generates a summary by inputting a selected phrase among the plurality of displayed phrases to the variable,
Program.