JP2022131963A

JP2022131963A - Summary generation device and summary generation method

Info

Publication number: JP2022131963A
Application number: JP2021031239A
Authority: JP
Inventors: 良典山西; Yoshinori Yamanishi; 陽子西原; Yoko Nishihara
Original assignee: Ritsumeikan Trust; Kansai University
Current assignee: Ritsumeikan Trust; Kansai University
Priority date: 2021-02-26
Filing date: 2021-02-26
Publication date: 2022-09-07

Abstract

To provide a summary generation device capable of generating an easy-to-understand summary according to the interruption position. SOLUTION: A summary generation device 10 that generates a summary of target content includes a processor 11. The processor determines a first range in the target content from a playback interruption position of the target content (first determination processing 111); calculates, from the first range, index values of sentences included in a second range, which is the range of the target content from which the constituent sentences of the summary are extracted and which differs at least partially from the first range (calculation processing 113); and extracts the constituent sentences of the summary from the second range on the basis of the index values (extraction processing 114). SELECTED DRAWING: Figure 1

Description

Application filed for the application of Article 30, Paragraph 2 of the Patent Act. Disclosure 1: Published on June 8, 2020 on the website of the papers of the 34th Annual Conference of the Japanese Society for Artificial Intelligence (2020) (https://drive.google.com/file/d/1GF6q7bimI8W7tTXw2DtJWb50gu1l_q1v/view?usp=sharing). Disclosure 2: Published on September 1, 2020 on the website of the proceedings of the IEICE Technical Committee on Media Experience and Virtual Environment (MVE) (https://www.ieice.org/ken/user/index.php?cmd=login&back_url=https%3A%2F%2Fwww.ieice.org%2Fken%2Fpaper%2F20200908w1zT%2F).

The present disclosure relates to a summary generation device and a summary generation method.

When playback of story-driven content such as dramas, novels, and comics is resumed after an interruption, the user may have forgotten what was played earlier. For this reason, content that is composed of multiple episodes in chronological order sometimes plays a synopsis of the story up to the previous episode, or a preview of the next episode, at the beginning of an episode.

In recent years, playback of story-driven content such as dramas has been shifting from passive forms such as television to active forms such as online viewing, in which the user is charged for a playback period, as in so-called subscription services.

When content is played back in this way, the conventional style of presenting a summary only at the beginning of an episode presents no summary when playback resumes from an interruption position partway through the episode. The summary may therefore be unsuitable for resuming playback.

In this regard, for example, Japanese Patent Application Laid-Open No. 2007-336085 (hereinafter, Patent Document 1) discloses a method of generating a preview of the content that follows the position at which playback was interrupted.

Japanese Patent Application Laid-Open No. 2007-336085

However, when playback is resumed from an arbitrary interruption position, sentences extracted only from the previously played range, or only from the subsequent range, may not yield an easy-to-understand summary, depending on the interruption position. A summary generation device and a summary generation method that can generate an easy-to-understand summary according to the interruption position are therefore desired.

The summary generation device generates a summary of target content and includes a processor. The processor is configured to: determine a first range from the interruption position of the target content; calculate, from the first range, index values of sentences included in a second range, which is determined from the playback interruption position and is the range from which the constituent sentences of the summary are extracted; and extract the constituent sentences of the summary from the second range based on the index values.

The summary generation method is a method of generating a summary of target content and includes: determining a first range from the interruption position of the target content; calculating, from the first range, index values of sentences included in a second range, which is the range from which the constituent sentences of the summary are extracted; and extracting the constituent sentences of the summary from the second range based on the index values.

Further details are described in the embodiments below.

FIG. 1 is a schematic diagram showing an example of the configuration of a summary generation device according to the first embodiment and of the processing executed by the summary generation method according to the first embodiment.
FIG. 2 is a diagram for explaining the structure of content for which the summary generation device generates a summary.
FIG. 3 is a diagram for explaining a method of determining the range of content for which a summary is generated.
FIG. 4 is a diagram for explaining the process of extracting summary constituent sentences.
FIG. 5 is a flowchart showing an example of the summary generation method according to the first embodiment.
FIG. 6 is a schematic diagram showing an example of the configuration of a summary generation device according to the second embodiment and of the processing executed by the summary generation method according to the second embodiment.
FIG. 7 is a flowchart showing an example of the summary generation method according to the second embodiment.
FIG. 8 is a diagram for explaining a specific example of a method of generating a summary of a target range using a partial summary prepared for the content as a gold standard.
FIG. 9 is a diagram for explaining a specific example of a method of generating a summary of a target range using a partial summary prepared for the content as a gold standard.
FIG. 10 is a diagram for explaining a specific example of a method of generating a summary of a target range using a partial summary prepared for the content as a gold standard.
FIG. 11 is a schematic diagram showing an example of the configuration of a summary generation device according to the third embodiment and of the processing executed by the summary generation method according to the third embodiment.
FIG. 12 is a diagram showing an example of the distribution, within the content, of the constituent sentences of a partial summary generated for certain content.
FIG. 13 is a diagram showing a specific example of extraction data.
FIG. 14 is a diagram showing a specific example of placement data.

<1. Overview of the summary generation device and summary generation method>

(1) A summary generation device according to an embodiment generates a summary of target content and includes a processor. The processor is configured to: determine a first range in the target content from a playback interruption position of the target content; calculate, from the first range, index values of sentences included in a second range, which is determined from the playback interruption position, is the range of the target content from which the constituent sentences of the summary are extracted, and differs at least partially from the first range; and extract the constituent sentences of the summary from the second range based on the index values.

The target content has a story and is played back in chronological order. Examples include videos such as movies, dramas, and animations; literary works such as novels; and performances such as lectures and classes. The elements that are played back include text, music, images, and the like, but the focus here is on text. The text may be lines of dialogue, prose, or nouns. Music or images that are played back at the same time may be attached to the text.

A summary is a short edited version of the story within a certain range of the content (hereinafter, the summary target range). A summary is either a first summary or a second summary. The first summary is a so-called synopsis, and the second summary is a so-called preview. The first summary introduces the range before the point at which playback starts (hereinafter, the start position). The second summary introduces the range after the start position. The second summary may also include content from before the start position.

The first range is a range within the target content that is used to calculate the index values of the sentences included in the second range. The first range is, for example, the expected playback range. The expected playback range lies after the playback interruption position and extends over a predetermined range from that position. The predetermined range may be set in advance. It may also be determined from the user's attributes, the attributes of the target content, the user's playback tendency, and the like. The user's playback tendency is, for example, the user's playback behavior. Alternatively, the average of typical playback amounts may be used.
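The determination of the expected playback range can be illustrated with a minimal Python sketch. It assumes that positions are measured in seconds, that the user's playback tendency is summarized as a list of past session lengths, and that a preset length is used when no history exists. The function name, parameters, and the default of 600 seconds are hypothetical and do not appear in the disclosure.

```python
from statistics import mean

def expected_playback_range(interruption_pos, content_end,
                            past_session_lengths=None, default_length=600.0):
    """Return (start, end) of the expected playback range, in seconds.

    Assumption: the range length is the user's average past session length
    when a history is available, and a preset length otherwise.
    """
    length = mean(past_session_lengths) if past_session_lengths else default_length
    # The range starts at the interruption position and is clamped to the content end.
    return interruption_pos, min(interruption_pos + length, content_end)
```

Other signals named in the text, such as user attributes or content attributes, could replace the averaging step without changing the shape of the result.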

The playback interruption position is the position at which playback was interrupted. As an example, the playback interruption position coincides with the position at which playback next starts. In that case, the interruption position serves as the reference position for generating the summary.

The second range is the summary target range and is determined based on the playback interruption position. The second range may differ from the first range, may coincide with it, or may at least partially overlap it.

The summary target range is, for example, the range from the beginning of the story to the end of the expected playback range, which is determined with reference to the playback interruption position. The expected playback range is determined by the user's viewing or reading behavior, the average amount of typical viewing or reading, a rule set by the provider, or the like. Setting the summary target range appropriately makes it possible to generate a summary that is easier to understand.

The index value is a value calculated from the first range for each sentence included in the second range and includes, for example, a degree of importance. Because the constituent sentences of the summary are extracted from the second range based on index values calculated from the first range, the generated summary contains sentences that take into account the first range determined from the playback interruption position. This makes it possible to generate a summary suited to resuming playback.

(2) Preferably, the index value includes a value obtained based on the words and phrases included in the first range. The value obtained based on the words and phrases included in the first range is, for example, a degree of importance. As a result, the generated summary contains sentences that take into account the value obtained based on the words and phrases included in the first range.
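One way such an importance value could be computed is sketched below in Python. The scoring rule (average frequency, in the first range, of the words in each sentence of the second range), the whitespace tokenizer, and all function names are assumptions made for illustration; the text above does not fix a formula, and Japanese text would need a morphological analyzer instead of whitespace splitting.

```python
from collections import Counter

def tokenize(sentence):
    # Assumption: whitespace tokenization with light punctuation stripping.
    tokens = (w.strip(".,!?").lower() for w in sentence.split())
    return [t for t in tokens if t]

def importance_scores(first_range, second_range):
    """Score each second-range sentence by the words it shares with the first range."""
    freq = Counter(w for s in first_range for w in tokenize(s))
    scores = []
    for sentence in second_range:
        words = tokenize(sentence)
        # Average first-range frequency of the sentence's words (0.0 if empty).
        scores.append(sum(freq[w] for w in words) / len(words) if words else 0.0)
    return scores

def extract_top(second_range, scores, k):
    """Pick the k highest-scoring sentences, then restore story order."""
    top = sorted(range(len(second_range)), key=lambda i: scores[i], reverse=True)[:k]
    return [second_range[i] for i in sorted(top)]
```

In practice a stop-word list or a TF-IDF style weight would likely be needed so that function words do not dominate the score; that refinement is omitted here.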

(3) Preferably, the first range lies after the playback interruption position and is the expected playback range of the target content determined from the playback interruption position. This makes it possible to generate a summary that takes into account the content that will be played back when playback resumes after the interruption.

(4) Preferably, the second range is determined according to whether the summary is a first summary or a second summary. The first summary is a summary generated with, as the second range, a range that does not extend past the playback interruption position. The second summary is a summary generated with, as the second range, a range that extends past the playback interruption position. The first summary is a synopsis, and the second summary is a preview. The first summary thus contains sentences extracted from the range up to the playback interruption position, so the generated summary reminds the user of the content up to that position and increases the user's desire to resume playback. The second summary contains sentences from after the playback interruption position, so the generated summary likewise increases the user's desire to resume playback.

(5) Preferably, extracting the constituent sentences of the summary includes generating a subset made up of sequentially extracted sentences by sequentially extracting, from the second range and based on the index value, the plurality of sentences that make up the constituent sentences, and the index value includes the similarity between a sentence included in the second range and the subset. If extraction based on the index value means extracting sentences with high similarity, the constituent sentences form a set with consistent content, and the content of the summary tends to be clear. If it means extracting sentences with low similarity, the constituent sentences form a diverse set, and the summary tends to be well balanced.
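The sequential extraction in (5) can be sketched as a greedy loop in Python. The sketch assumes Jaccard similarity between a candidate's word set and the union of the words already in the subset, and assumes the subset starts from a non-empty seed (for example, a sentence chosen by another index value). The similarity measure, the seed, and every name below are illustrative assumptions rather than details given in the text.

```python
def word_set(sentence):
    # Assumption: whitespace tokenization; Japanese text would need a tokenizer.
    return {w.strip(".,!?").lower() for w in sentence.split()} - {""}

def jaccard(a, b):
    """Jaccard similarity between two word sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def extract_sequentially(candidates, seed, k, prefer_similar=True):
    """Grow the subset one sentence at a time until it holds k sentences."""
    subset = list(seed)
    remaining = [s for s in candidates if s not in subset]
    pick = max if prefer_similar else min
    while remaining and len(subset) < k:
        subset_words = set().union(*(word_set(s) for s in subset))
        # The index value here is the similarity between a candidate and the subset.
        chosen = pick(remaining, key=lambda s: jaccard(word_set(s), subset_words))
        subset.append(chosen)
        remaining.remove(chosen)
    return subset
```

Setting `prefer_similar=True` corresponds to the consistent-content case described above, and `prefer_similar=False` to the diverse case.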

(6) Preferably, the subset includes sentences extracted from a summary prepared in advance for the target content. The summary is then generated using the constituent sentences of the summary prepared in advance for the target content, which makes processing easier than extracting every constituent sentence based on the index value.

(7) Preferably, the processor is further configured to select reference content based on the target content, and extracting the constituent sentences of the summary from the second range includes extracting them with reference to extraction data associated with the reference content. The reference content is content that differs from the target content and for which a summary has been prepared. The extraction data represents the tendency of the positions, within the reference content, of the constituent sentences of the summary prepared for the reference content. Extracting the constituent sentences of the summary from the second range using the extraction data makes it easy to generate a summary of the target content, and the resulting summary can be as easy to understand as that of the reference content.
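As a rough illustration of what such extraction data might look like, the Python sketch below represents the positional tendency as a normalized histogram over equal-width bins of the reference content, and then maps that histogram onto the sentences of the target range as per-sentence weights. The histogram representation, the bin count, and the function names are assumptions; the actual form of the extraction data is described with FIG. 13 and may differ.

```python
def position_histogram(summary_positions, content_length, bins=5):
    """Fraction of reference-summary sentences falling in each equal-width bin.

    summary_positions: 0-based sentence indices, within the reference content,
    of the sentences used in its prepared summary (hypothetical input format).
    """
    counts = [0] * bins
    for p in summary_positions:
        counts[min(p * bins // content_length, bins - 1)] += 1
    total = len(summary_positions)
    return [c / total for c in counts] if total else [0.0] * bins

def position_weights(histogram, n_target_sentences):
    """Weight for each sentence of the target range, taken from its bin."""
    bins = len(histogram)
    return [histogram[min(i * bins // n_target_sentences, bins - 1)]
            for i in range(n_target_sentences)]
```

The weights could then be combined with other index values when selecting constituent sentences from the second range.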

(8) A summary generation method according to an embodiment is a method of generating a summary of target content in the summary generation device described in any of (1) to (7). This yields the summary generated by the summary generation device described in (1) to (7).

<2. Examples of the summary generation method and summary generation device>

[First embodiment]

The summary generation device 10 according to the present embodiment generates a summary of content. The content handled in the present embodiment has a story and is played back in chronological order. Examples include videos such as movies, dramas, and animations; literary works such as novels; and performances such as lectures and classes. The elements that are played back include text, music, images, and the like, but the focus here is on text. The text may be lines of dialogue, prose, or nouns. Music or images that are played back at the same time may be attached to the text.

A summary is a short edited version of the story within a certain range of the content (hereinafter, the summary target range). A summary is either a synopsis (first summary) or a preview (second summary). A synopsis introduces the range before the point at which playback starts (hereinafter, the start position). A preview introduces the range after the start position. A preview may also include content from before the start position.

Referring to FIG. 1, the summary generation device 10 is a computer having a processor 11 and a memory 12. The processor 11 is, for example, a CPU. The memory 12 includes flash memory, EEPROM, ROM, RAM, and the like. The memory 12 may be a primary storage device or a secondary storage device.

The memory 12 stores a generation program 121 executed by the processor 11. The processor 11 executes the summary generation process by executing the generation program 121. The summary generation process is the process of generating a summary of the content to be summarized (hereinafter, the target content).

The memory 12 further stores one or more pieces of content information 122. The content information 122 is information about the target content and includes information about the reference position for summary generation. As an example, the reference position for summary generation is the interruption position, in which case the content information 122 includes interruption position information 21. In the following description, the interruption position is assumed to coincide with the start position.

The content information 122 may include one or more pieces of summary information 22. The summary information 22 is a summary prepared in advance for the target content; details are described later.

All or at least part of the content information 122 may be stored in a device external to the summary generation device 10, such as the server 30 in FIG. 11. In that case, the summary generation device 10 accesses the external device as necessary to read and use the content information 122. Alternatively, the summary generation device 10 may include the server 30.

FIG. 1 shows an example in which the summary generation device 10 according to the embodiment also serves as a content playback device 15. The summary generation device 10 does not have to serve as the playback device 15. The summary generation device 10 may be mounted in the playback device 15, may acquire the necessary information from the playback device 15 directly or indirectly, or may be an independent device.

In the case of FIG. 1, the summary generation device 10 has an operation unit 17 that accepts user operations related to playback. The summary generation device 10 also has a playback device 15 that plays back content.

In accordance with an operation signal input from the operation unit 17, the processor 11 executes a playback process 116 that causes the playback device 15 to play back the specified content. The playback device 15 plays back the specified content in accordance with a control signal from the processor 11.

The playback process 116 includes storing, in the memory 12, interruption position information 21 indicating the position at which playback of the content was interrupted, as part of the content information 122 of that content. Thus, when playback of content is interrupted on the playback device 15, the interruption position information 21 for that content is stored in the memory 12.
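The bookkeeping described here can be pictured with a small Python sketch. The class and field names (`ContentInfo`, `ContentStore`, `record_interruption`) are hypothetical, an in-memory dictionary stands in for the memory 12, and positions are assumed to be seconds from the start of the content.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

@dataclass
class ContentInfo:
    """Stand-in for content information 122 (field names are illustrative)."""
    content_id: str
    interruption_position: Optional[float] = None      # interruption position information 21
    summaries: List[str] = field(default_factory=list)  # summary information 22, if any

class ContentStore:
    """Stand-in for the memory 12, keyed by a hypothetical content identifier."""

    def __init__(self) -> None:
        self._store: Dict[str, ContentInfo] = {}

    def record_interruption(self, content_id: str, position: float) -> ContentInfo:
        # Create the entry on first use, then overwrite the last interruption position.
        info = self._store.setdefault(content_id, ContentInfo(content_id))
        info.interruption_position = position
        return info

    def get(self, content_id: str) -> Optional[ContentInfo]:
        return self._store.get(content_id)
```

A playback handler would call `record_interruption` whenever the user stops playback, and the summary generation process would later read the stored position back with `get`.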

The summary generation device 10 may have a display 14. The display 14 is an example of an output device that outputs the generated summary. Instead of or in addition to the display 14, the output device may be one that produces another form of output, such as a speaker. When the summary generation device 10 also serves as a content playback device, the display 14 is also an example of an output device for the played-back content.

The summary generation device 10 may have a communication device 13 capable of communicating with other devices via a network such as the Internet. As an example, the generated summary may be output to another device through the communication device 13. In that case, the communication device 13 is also an example of an output device that outputs the generated summary. The content information 122 may also be stored in another device, in which case the summary generation device 10 may read the content information 122 from that device by having the communication device 13 access it.

As an example, the target content may consist of one or more seasons, as shown in FIG. 2. Each season constitutes one story, and one or more seasons together may constitute a larger overall story.

As an example, each season may be divided into a plurality of episodes. An episode is one complete story, and the target content is assumed to be played back one story at a time. Specifically, season 1 includes a plurality of episodes EP11, EP12, ... EP1n. Season 2 includes a plurality of episodes EP21, EP22, ... EP2n. Each episode contains one or more sentences Q, each of which is speech or text.

A partial summary may be prepared for each episode of the target content. A partial summary is a summary whose summary target range is the preceding episodes. Specifically, episode EP12 includes a partial summary AB12 whose summary target range is episode EP11, and episode EP1n includes a partial summary AB1n whose summary target range is episodes EP11 to EP1(n-1).

Examples in which partial summaries are prepared for the target content are used in the second and subsequent embodiments. In the first embodiment, it is assumed that no partial summary is prepared for the target content, or that any prepared partial summary is not used.

When the target content is played back episode by episode as intended, the partial summary is played back before the episode. The user can therefore check the synopsis of the story up to the previous episode before the episode is played back.

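In detail, the constituent sentences CS1, CS2, CS3, ... of a partial summary are the units that make up the partial summary and are extracted from the sentences Q included in the summary target range. Specifically, the constituent sentences CS1, CS2, CS3, ... of partial summary AB12 are a set of one or more sentences q extracted from episode EP11 (hereinafter also referred to as a subset).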
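The season, episode, and partial-summary structure described above can be modeled with a few Python dataclasses. This is only a sketch of the relationships in FIG. 2; the class names, the 0-based indexing, and the representation of sentences as plain strings are assumptions.

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class Episode:
    name: str
    sentences: List[str]                                       # sentences Q of the episode
    partial_summary: List[str] = field(default_factory=list)   # constituent sentences CS1, CS2, ...

@dataclass
class Season:
    episodes: List[Episode]

def partial_summary_scope(season: Season, index: int) -> List[Episode]:
    """Episodes covered by the partial summary of season.episodes[index] (0-based):
    all preceding episodes of the same season."""
    return season.episodes[:index]

def is_extractive(season: Season, index: int) -> bool:
    """Check that every constituent sentence was taken from the summary target range."""
    scope = {s for ep in partial_summary_scope(season, index) for s in ep.sentences}
    return all(cs in scope for cs in season.episodes[index].partial_summary)
```

For example, the partial summary of the second episode is checked against the sentences of the first episode only, mirroring AB12 and EP11 above.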

Referring to FIG. 1, the summary generation process executed by the processor 11 of the summary generation device 10 includes a first determination process 111. The first determination process 111 includes determining the expected playback range (first range) in the target content from the interruption position. The expected playback range lies after the interruption position and extends over a predetermined range from that position. The predetermined range may be set in advance. As another example, it may be determined from the user's attributes, the attributes of the target content, the user's playback tendency, and the like. The user's playback tendency is, for example, the user's playback behavior. Alternatively, the average of typical playback amounts may be used.

The summary generation process includes a second determination process 112. The second determination process 112 includes determining the summary target range (second range) for the target content. The summary target range may differ at least partially from the expected playback range. "At least partially different" covers both the case where the two ranges are entirely different and the case where they partially overlap.

The summary target range is determined based at least on the interruption position. Preferably, it is determined based on both the interruption position and whether the summary to be generated is a synopsis or a preview. The method of determining the summary target range is described with reference to FIG. 3.

In FIG. 3, the arrow represents content C, which is the target content, and indicates that the content is played back in the direction of the arrow, that is, chronologically from left to right. Position P0, the starting point of the arrow in FIG. 3, corresponds to the start position of a season of content C. In other words, FIG. 3 shows playback of a season of content C from its beginning. The partial summaries ABn and ABn+1 shown in FIG. 3 need not be prepared for content C. In the first embodiment, these partial summaries are assumed not to be included in content C.

Position P1 is the start position of episode n, and position P4 is the end position of episode n. Position P2 corresponds to the interruption position indicated by the interruption position information 21 included in the content information 122 of content C, and also to the next start position. The range H3 of content C from the interruption position P2 to position P3 is taken to be the expected playback range determined by the first determination process 111.

第２の決定処理１１２において、プロセッサ１１は、中断位置Ｐ２に基づいて要約対象範囲を決定する。この例では、プロセッサ１１は、中断位置と、生成する要約があらすじであるのか予告であるのかと、の両方に基づいて要約対象範囲を決定する。 In a second determination process 112, the processor 11 determines a summary target range based on the interruption position P2. In this example, the processor 11 determines the scope of the summary based on both the position of the interruption and whether the summary to be generated is a synopsis or an advance notice.

生成する要約が予告の場合、一例として、プロセッサ１１は、シーズンの開始位置Ｐ０から、中断位置Ｐ２から後の位置Ｐ３までの範囲Ｈ４を要約対象範囲とする。予告の他の例として、プロセッサ１１は、予想再生範囲と一致した、位置Ｐ２から位置Ｐ３までの範囲Ｈ３を要約対象範囲としてもよい。すなわち、要約が予告の場合、要約対象範囲に中断位置Ｐ２より前の範囲を含んでもよいし、後の範囲のみであってもよい。これにより、生成される予告には、予想再生範囲から抽出された文も含まれる。そのため、ユーザは、予想再生範囲の内容が想像され、再生意欲が高められる。 When the summary to be generated is an advance notice, as an example, the processor 11 sets the range H4 from the start position P0 of the season to the position P3 after the interruption position P2 as the summary target range. As another example of the advance notice, the processor 11 may set the range H3 from the position P2 to the position P3, which matches the expected reproduction range, as the summary target range. That is, when the summary is an advance notice, the range to be summarized may include the range before the interruption position P2, or only the range after the interruption position P2. As a result, the generated announcement includes sentences extracted from the expected reproduction range. Therefore, the user can imagine the contents of the expected reproduction range, and the desire to reproduce is heightened.

なお、位置Ｐ３が位置Ｐ４に近づきすぎると、つまり、エピソード結末までを要約対象範囲とすると、そのエピソードの結末が予告に含まれる可能性が高まる。すなわち、いわゆるネタバレになってしまう可能性がある。そのため、好ましくは、要約が予告の場合の要約対象範囲は、エピソードの終了の位置Ｐ４より前の位置までの範囲とする。 Note that if the position P3 is too close to the position P4, that is, if the summary target range is up to the end of the episode, the possibility that the end of the episode will be included in the preview increases. That is, there is a possibility that it will become a so-called spoiler. Therefore, preferably, the summary target range when the summary is an advance notice is the range up to the position before the end position P4 of the episode.

生成する要約があらすじの場合、一例として、プロセッサ１１は、シーズンの開始位置Ｐ０から中断位置Ｐ２までの範囲Ｈ５を要約対象範囲とする。これにより、生成されるあらすじには、中断位置Ｐ２までの範囲から抽出された文も含まれる。そのため、ユーザは、中断位置Ｐ２までの内容が思い出され、再生意欲が高められる。 When the summary to be generated is a synopsis, as an example, the processor 11 sets the range H5 from the start position P0 of the season to the discontinuation position P2 as the summary target range. As a result, the generated synopsis includes sentences extracted from the range up to the interruption position P2. Therefore, the user is reminded of the content up to the interruption position P2, and is motivated to reproduce.
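
The range selection described above can be sketched as follows. This is a minimal illustration, not the implementation disclosed in the application: positions are assumed to be integer sentence numbers, and the function name `summary_target_range`, the labels `"synopsis"` and `"advance_notice"`, and the rule of stopping one position before the episode end P4 are assumptions introduced for this example.

```python
from typing import Tuple


def summary_target_range(p0: int, p2: int, p3: int, p4: int, kind: str) -> Tuple[int, int]:
    """Return (start, end) of the summary target range.

    p0: season start, p2: interruption position,
    p3: end of the expected playback range, p4: end of the episode.
    All positions are hypothetical integer sentence numbers.
    """
    if kind == "synopsis":
        # Range H5: from the season start up to the interruption position.
        return (p0, p2)
    if kind == "advance_notice":
        # Range H4: from the season start to P3, kept before the episode
        # end P4 so that the ending is less likely to be revealed.
        return (p0, min(p3, p4 - 1))
    raise ValueError(f"unknown summary kind: {kind!r}")


print(summary_target_range(1, 130, 160, 191, "synopsis"))        # (1, 130)
print(summary_target_range(1, 130, 160, 191, "advance_notice"))  # (1, 160)
```

The text also allows an advance notice to use only range H3 (P2 to P3); that variant would return `(p2, min(p3, p4 - 1))` instead.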

プロセッサ１１の実行する要約生成処理は、抽出処理１１４を含む。抽出処理１１４は、要約対象範囲から要約の構成文として文を抽出することを含む。抽出処理１１４において、プロセッサ１１は、選択処理１１１によって選択した参照用コンテンツの抽出用データ３４を用いる。 The abstract generation process executed by processor 11 includes extraction process 114 . The extraction process 114 includes extracting sentences from the scope of the summary as constituent sentences of the summary. In the extraction process 114 , the processor 11 uses the extraction data 34 of the reference content selected by the selection process 111 .

要約生成処理は、算出処理１１３を含む。算出処理１１３は、要約対象範囲に含まれる各文の指標値を、予想再生範囲から算出することを含む。指標値は、予想再生範囲に含まれる語句に基づいて得られる値を含む。予想再生範囲に含まれる語句に基づいて得られる値は、例えば、重要度である。具体的に、プロセッサ１１は、予想再生範囲に含まれる語句を用いて、要約対象範囲に含まれる各文の指標値を算出する。 The summary generation process includes calculation process 113 . Calculation processing 113 includes calculating the index value of each sentence included in the summary range from the expected reproduction range. The index value includes a value obtained based on words included in the expected playback range. Values obtained based on words included in the expected playback range are, for example, importance. Specifically, the processor 11 calculates the index value of each sentence included in the summary target range using the words included in the expected reproduction range.

ここでの重要度は、要約対象範囲に含まれる各文に含まれる語句について、予想再生範囲における重要性を表す値である。重要度の具体的な算出方法は限定されない。重要度は、一例として、語句の出現頻度であってもよい。例えば、要約対象範囲に含まれる各文に含まれる語句について、予想再生範囲における出現頻度が高いほど、その文の重要度が高いと算出されてもよい。 Here, the degree of importance is a value representing the importance in the expected playback range of words included in each sentence included in the range to be summarized. A specific calculation method of importance is not limited. The degree of importance may be, for example, the appearance frequency of words. For example, it may be calculated that the higher the appearance frequency in the expected playback range of a phrase included in each sentence included in the summary target range, the higher the importance of that sentence.
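
As one possible reading of the frequency-based importance described above, the sketch below scores a sentence by how often its words occur in the expected playback range. The text leaves the calculation method open, so the function name `importance`, the pre-tokenized word lists, and the choice to sum the frequencies of the distinct words in a sentence are all assumptions for illustration.

```python
from collections import Counter
from typing import Iterable


def importance(sentence_words: Iterable[str], expected_range_words: Iterable[str]) -> int:
    """Sum, over the distinct words of a sentence, their frequency
    in the expected playback range (hypothetical scoring rule)."""
    freq = Counter(expected_range_words)
    # Counter returns 0 for words that never occur in the expected range.
    return sum(freq[w] for w in set(sentence_words))


expected = ["train", "station", "train", "star"]
print(importance(["the", "train", "left"], expected))    # 2
print(importance(["a", "star", "station"], expected))    # 2
print(importance(["nothing", "shared"], expected))       # 0
```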

又、出現頻度に替えて、あるいは加えて、語句の再生状況で重要度が算出されてもよい。語句の再生状況は、要約対象範囲に含まれる各文に含まれる語句について、予想再生範囲における再生での音声や背景音の盛り上がり（音量や音域など）、映像の明暗や色合いの変化、映像内の特定のオブジェクトなどを指す。 Instead of or in addition to the frequency of appearance, the importance may be calculated from the reproduction status of the phrase. The reproduction status of a phrase refers, for the words included in each sentence in the summary target range, to features of playback within the expected playback range such as the swell of voice and background sound (volume, pitch range, and the like), changes in the brightness and color of the video, and specific objects appearing in the video.

他の例として、重要度の算出にいわゆるページランクの考え方が用いられてもよい。すなわち、要約対象範囲に含まれる各文に含まれる語句について、予想再生範囲においてより参照される語句ほど、その文の重要度が高いと算出する。算出には、予め記憶している関数が用いられてもよい。また、他の例として、重要度の算出に、予想再生範囲における他の語句への関連性の高さを用いてもよいし、それらを組み合わせて用いてもよい。 As another example, the concept of so-called page rank may be used to calculate the degree of importance. That is, with regard to the words included in each sentence included in the range to be summarized, the more often the word is referred to in the expected reproduction range, the higher the importance of the sentence is calculated. A function stored in advance may be used for the calculation. As another example, the degree of importance may be calculated using the degree of relevance to other words in the expected playback range, or a combination thereof.

好ましくは、指標値は、部分集合との類似度を含む。部分集合は、一例として、後述する抽出処理１１４によって構成文とする複数の文を要約対象範囲から逐次抽出する際に、逐次抽出された文の集合を指す。類似度は、一例として、部分集合に対する類似度である。この場合、指標値は、一例として、重要度と類似度とを用いて算出されるＭＭＲ（Maximal Marginal Relevance：周辺関連性最大化）スコアである。 Preferably, the index value includes similarity with the subset. For example, a subset refers to a set of sentences sequentially extracted when a plurality of sentences to be constituent sentences are sequentially extracted from the summary target range by the extraction process 114 described later. The degree of similarity is, for example, the degree of similarity with respect to a subset. In this case, the index value is, for example, an MMR (Maximal Marginal Relevance) score calculated using importance and similarity.

プロセッサ１１は、要約対象範囲に含まれる各文ｑについての重要度Ｉ（ｑ）と、部分集合ｋに対する類似度Ｓｉｍ（ｑ，ｋ）と、を用いて、下の式（１）で各文ｑのＭＭＲスコアＭＭＲ（ｑ）を算出する。なお、係数λは、０以上、１以下の値である。一例として、係数λは０．５とする。 Using the importance I(q) of each sentence q included in the summary target range and the similarity Sim(q, k) to the subset k, the processor 11 calculates the MMR score MMR(q) of each sentence q by equation (1) below. The coefficient λ is a value of 0 or more and 1 or less. As an example, the coefficient λ is set to 0.5.
MMR(q) = λI(q) − (1 − λ)Sim(q, k) …(1)

式（１）に示されるように、ＭＭＲスコアは、係数λが１に近い程、重要度を重視する値になり、係数λが０に近い程、類似度を重視する値になる。重要度を重視する場合、予想再生範囲における重要度の高い語句を含む文ほどＭＭＲスコアが大きくなる。一方、類似度を重視する場合、部分集合に対する類似度が低い文ほどＭＭＲスコアが大きくなる。 As shown in equation (1), the closer the coefficient λ is to 1, the more weight the MMR score places on importance, and the closer λ is to 0, the more weight it places on similarity. When importance is emphasized, sentences containing words of higher importance in the expected reproduction range receive larger MMR scores. When similarity is emphasized, sentences with lower similarity to the subset receive larger MMR scores.
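
Equation (1) and the role of λ can be illustrated with a short sketch. The function name `mmr_score` and the sample values are hypothetical; only the formula itself and the example value λ = 0.5 come from the text.

```python
def mmr_score(importance: float, similarity: float, lam: float = 0.5) -> float:
    """MMR(q) = lam * I(q) - (1 - lam) * Sim(q, k), with 0 <= lam <= 1."""
    if not 0.0 <= lam <= 1.0:
        raise ValueError("lam must be between 0 and 1 inclusive")
    return lam * importance - (1.0 - lam) * similarity


# With lam = 1 only importance counts; with lam = 0 only (negated) similarity counts.
print(mmr_score(0.8, 0.6, lam=1.0))  # 0.8
print(mmr_score(0.8, 0.6, lam=0.0))  # -0.6
```

With the default λ = 0.5 the two terms are weighted equally, so a sentence that is important but nearly identical to the already extracted subset is penalized.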

後述する抽出処理１１４にてＭＭＲスコアの高い文を抽出する場合、予想再生範囲における重要度の高い語句を含み、部分集合とは類似しない文が抽出されやすくなる。その結果、構成文が、予想再生範囲と関連し、かつ、多様性ある文の集合となり得る。 When extracting sentences with a high MMR score in extraction processing 114, which will be described later, sentences that include words and phrases with a high degree of importance in the expected playback range and that are not similar to the subset are more likely to be extracted. As a result, the constituent sentences can be related to the expected playback range and can be a diverse set of sentences.

要約生成処理は、抽出処理１１４を含む。抽出処理１１４は、要約対象範囲に含まれる各文の指標値に基づいて、要約対象範囲から文を要約の構成文として抽出することを含む。構成文として用いる文の数は予め規定されているものとする。その場合、一例として、プロセッサ１１は、要約対象範囲に含まれる文のうちの指標値の高い文から順に、規定数までの文を構成文として抽出する。 The abstract generation process includes extraction process 114 . The extraction process 114 includes extracting sentences from the summary range as constituent sentences of the summary based on the index value of each sentence included in the summary range. It is assumed that the number of sentences used as constituent sentences is defined in advance. In this case, as an example, the processor 11 extracts, as constituent sentences, up to a specified number of sentences in descending order of the index value among the sentences included in the range to be summarized.

図４を用いて抽出処理１１４の具体例を説明する。図４の例では、指標値としてＭＭＲスコアを用いるものとする。要約対象範囲に文ｑ１～文ｑ４が含まれているとする。算出処理１１３では、予想再生範囲に含まれる語句に基づいて文ｑ１～文ｑ４それぞれの重要度が算出される。 A specific example of the extraction processing 114 will be described with reference to FIG. In the example of FIG. 4, the MMR score is used as the index value. Assume that sentences q1 to q4 are included in the summary target range. In calculation processing 113, the importance of each of sentences q1 to q4 is calculated based on the words included in the expected reproduction range.

図４を参照して、構成文として１つの文も抽出されていない抽出処理１１４の開始時には、部分集合は０であるため、部分集合に対する文ｑ１～文ｑ４それぞれの類似度は０と算出される。従って、このとき文ｑ１～文ｑ４それぞれのＭＭＲスコアは重要度と一致する。各ＭＭＲスコアの大小関係がｑ２＞ｑ３＞ｑ１＞ｑ４である場合、抽出処理１１４においては、ＭＭＲスコアの最も高い文ｑ２が構成文として抽出される（ステップＳ１）。 Referring to FIG. 4, at the start of the extraction process 114, when no sentence has yet been extracted as a constituent sentence, the subset is empty, so the similarity of each of sentences q1 to q4 to the subset is calculated as 0. Therefore, at this point the MMR scores of sentences q1 to q4 match their importance. If the MMR scores are ordered q2>q3>q1>q4, the extraction process 114 extracts sentence q2, which has the highest MMR score, as a constituent sentence (step S1).

文ｑ２が抽出されると、抽出後の要約対象範囲に含まれる文ｑ１，ｑ３，ｑ４それぞれについて、部分集合との類似度が算出される。ここでの部分集合は、文ｑ２となる。算出された類似度を用いて文ｑ１，ｑ３，ｑ４それぞれのＭＭＲスコアが算出される（ステップＳ２）。各ＭＭＲスコアの大小関係がｑ４＞ｑ３＞ｑ１である場合、抽出処理１１４においては、ＭＭＲスコアの最も高い文ｑ４が構成文として抽出される（ステップＳ３）。 When the sentence q2 is extracted, the degree of similarity with the subset is calculated for each of the sentences q1, q3, and q4 included in the summary target range after extraction. The subset here is sentence q2. MMR scores of sentences q1, q3, and q4 are calculated using the calculated similarities (step S2). If the magnitude relation of each MMR score is q4>q3>q1, the sentence q4 with the highest MMR score is extracted as a constituent sentence in the extraction process 114 (step S3).

抽出処理１１４においてプロセッサ１１は上の処理を規定数の文が抽出されるまで繰り返す。これにより、予想再生範囲に関連した文が構成文として抽出されるとともに、要約全体の複数の文の関連も考慮して抽出される。ＭＭＲスコアを指標値として用いる場合、類似度については先に抽出された文からなる部分集合に対して低い文が抽出されやすいため、バランスのよい内容の要約が生成される可能性が高い。 In extraction processing 114, processor 11 repeats the above processing until a specified number of sentences are extracted. As a result, sentences related to the expected playback range are extracted as constituent sentences, and the relationships between multiple sentences in the entire summary are also taken into consideration. When the MMR score is used as an index value, sentences with low similarity are likely to be extracted with respect to a subset of previously extracted sentences, so there is a high possibility of generating a well-balanced summary.
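
The iterative extraction of FIG. 4 can be sketched as a greedy loop. This is a sketch under stated assumptions: the importance values and pairwise similarities below are invented so that the result reproduces the q2-then-q4 order of the example, and taking the similarity to the subset as the maximum pairwise similarity to any already extracted sentence is an assumption, since the text does not fix how Sim(q, k) is computed.

```python
from typing import Dict, FrozenSet, List


def greedy_mmr(importance: Dict[str, float],
               pair_sim: Dict[FrozenSet[str], float],
               n: int, lam: float = 0.5) -> List[str]:
    """Extract up to n sentences, re-scoring the rest after each pick."""
    selected: List[str] = []
    remaining = set(importance)
    while remaining and len(selected) < n:
        def score(q: str) -> float:
            # Similarity to the subset: 0 while the subset is empty.
            sim = max((pair_sim.get(frozenset((q, k)), 0.0) for k in selected),
                      default=0.0)
            return lam * importance[q] - (1.0 - lam) * sim
        best = max(sorted(remaining), key=score)  # sorted() makes ties deterministic
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical values: importance ordered q2 > q3 > q1 > q4.
I = {"q1": 0.4, "q2": 0.9, "q3": 0.8, "q4": 0.3}
S = {frozenset(("q1", "q2")): 0.9, frozenset(("q2", "q3")): 0.8,
     frozenset(("q2", "q4")): 0.0, frozenset(("q1", "q3")): 0.5,
     frozenset(("q1", "q4")): 0.1, frozenset(("q3", "q4")): 0.2}

print(greedy_mmr(I, S, n=2))  # ['q2', 'q4']
```

In the first round every similarity is 0, so the score reduces to λ·I(q) and q2 wins. In the second round q3 and q1 are penalized for resembling q2, which lets the less important but dissimilar q4 come first.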

なお、ＭＭＲスコアは、重要度と類似度とを用いた指標値の一例である。他の例として、下の式（２）のように、類似度を重要度に加えて得られる指標値Ｉｖを用いてもよい。 Note that the MMR score is one example of an index value that uses importance and similarity. As another example, an index value Iv obtained by adding the similarity to the importance may be used, as in equation (2) below.
Iv(q) = λI(q) + (1 − λ)Sim(q, k) …(2)

後述する抽出処理１１４にて指標値Ｉｖの高い文を抽出する場合、予想再生範囲における重要度の高い語句を含み、部分集合とは類似する文が抽出されやすくなる。その結果、構成文が、予想再生範囲と関連し、かつ、内容に統一性ある文の集合となり得る。 When extracting sentences with a high index value Iv in extraction processing 114, which will be described later, sentences that include words and phrases with a high degree of importance in the expected playback range and that are similar to the subset are more likely to be extracted. As a result, the constituent sentences can be a set of sentences that are related to the expected playback range and have unity in content.
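
The effect of switching from equation (1) to equation (2) can be seen by scoring the same candidates both ways. The function names and the sample values are hypothetical and chosen only to show the contrast.

```python
def mmr_score(importance: float, similarity: float, lam: float = 0.5) -> float:
    """Equation (1): similarity to the subset is subtracted."""
    return lam * importance - (1.0 - lam) * similarity


def index_iv(importance: float, similarity: float, lam: float = 0.5) -> float:
    """Equation (2): similarity to the subset is added."""
    return lam * importance + (1.0 - lam) * similarity


# (importance, similarity to the already extracted subset), hypothetical values.
candidates = {"q3": (0.8, 0.8), "q4": (0.3, 0.0)}

best_by_mmr = max(candidates, key=lambda q: mmr_score(*candidates[q]))
best_by_iv = max(candidates, key=lambda q: index_iv(*candidates[q]))
print(best_by_mmr)  # q4
print(best_by_iv)   # q3
```

Under equation (1) the dissimilar sentence q4 is preferred, which favors diversity; under equation (2) the similar sentence q3 is preferred, which favors a unified summary.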

要約生成処理は、生成処理１１５を含む。生成処理１１５は、抽出処理１１４で構成文として抽出された文を配置することを含む。ここでは、配置の方法は特定の方法に限定されない。一例として、要約対象範囲での出現順に応じて配置する方法であってよい。他の例として、算出された指標値の大きさによって配置する方法であってよい。 The summary generation process includes generation process 115 . Generation processing 115 includes arranging sentences extracted as constituent sentences in extraction processing 114 . Here, the arrangement method is not limited to a specific method. As an example, it may be arranged according to the order of appearance in the summary target range. Another example may be a method of arranging according to the magnitude of the calculated index value.
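
The two arrangement options named above (order of appearance, or magnitude of the index value) might look like the following. The tuple layout `(sentence_number, index_value, text)` and the function name `arrange` are assumptions for this sketch.

```python
from typing import List, Tuple

Extracted = Tuple[int, float, str]  # (sentence number, index value, text)


def arrange(extracted: List[Extracted], by: str = "position") -> List[str]:
    """Order the constituent sentences to form the summary text."""
    if by == "position":
        # Order of appearance in the summary target range.
        ordered = sorted(extracted, key=lambda e: e[0])
    elif by == "score":
        # Largest index value first.
        ordered = sorted(extracted, key=lambda e: e[1], reverse=True)
    else:
        raise ValueError(f"unknown arrangement: {by!r}")
    return [text for _, _, text in ordered]


picked = [(170, 0.2, "C"), (131, 0.9, "A"), (163, 0.5, "B")]
print(arrange(picked, by="position"))  # ['A', 'B', 'C']
print(arrange(picked, by="score"))     # ['A', 'B', 'C']
```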

なお、このとき、プロセッサ１１は、複数の文の間の類似性や対比などを用いて、複数の文をグループ化し、グループ単位で配置するようにしてもよい。これにより、対話のような複数の文がグループ化されている場合に、それらを用いて自然な要約が生成されるようになる。 At this time, the processor 11 may group the plurality of sentences by using the similarity or contrast between the plurality of sentences and arrange them in units of groups. This allows a natural summary to be generated using multiple sentences, such as dialogues, when they are grouped.

図５を用いて、本実施の形態に係る要約生成方法について説明する。図５のフローチャートに表された処理は、本実施の形態に係る要約生成方法に従った要約生成処理であって、プロセッサ１１が生成プログラム１２１を実行することに実現される。図５の処理は、再生装置１５が対象コンテンツを再生する際や、要約の提示を指示するユーザ操作を受け付けたときなどに開始される。 A summary generation method according to the present embodiment will be described with reference to FIG. The process shown in the flowchart of FIG. 5 is a summary generation process according to the summary generation method according to the present embodiment, and is implemented by processor 11 executing generation program 121 . The processing in FIG. 5 is started when the reproducing device 15 reproduces the target content or receives a user operation instructing presentation of a summary.

図５を参照して、プロセッサ１１は、対象コンテンツのコンテンツ情報１２２から中断位置を読み取り、中断位置に基づいて予想再生範囲を決定する（ステップＳ１０１）。ステップＳ１０１では、一例として、プロセッサ１１は、中断位置から予め設定された範囲を予想再生範囲とする。 Referring to FIG. 5, processor 11 reads the interruption position from content information 122 of the target content, and determines the expected reproduction range based on the interruption position (step S101). In step S101, as an example, the processor 11 sets a preset range from the interruption position as the expected reproduction range.

また、プロセッサ１１は、中断位置と、生成する要約があらすじであるのか予告であるのかと、の両方に基づいて要約対象範囲を決定する（ステップＳ１０３）。ステップＳ１０１とステップＳ１０３とは処理順はいずれが先であってもよい。 Also, the processor 11 determines a summary target range based on both the interruption position and whether the summary to be generated is a synopsis or an advance notice (step S103). Either step S101 or step S103 may be processed first.

プロセッサ１１は、ステップＳ１０１で決定した予想再生範囲に含まれる語句に基づいて、ステップＳ１０３で決定した要約対象範囲に含まれる各文の重要度を算出する（ステップＳ１０５）。また、プロセッサ１１は、要約構成文としてすでに抽出した文を部分集合として、要約対象範囲に含まれる文のうちの要約構成文として抽出されていない各文について、部分集合に対する類似度を算出する（ステップＳ１０７）。 Processor 11 calculates the importance of each sentence included in the summary range determined in step S103 based on the words included in the expected reproduction range determined in step S101 (step S105). In addition, the processor 11 uses the sentences already extracted as a summary constituent sentence as a subset, and calculates the similarity to the subset for each sentence that is not extracted as a summary constituent sentence among the sentences included in the summary target range ( step S107).

プロセッサ１１は、ステップＳ１０５で算出された重要度とステップＳ１０７で算出された類似度とを上記の式（１）に代入することで、要約対象範囲に含まれる文のうちの要約構成文として抽出されていない各文について、指標値の一例としてＭＭＲスコアを算出する（ステップＳ１０９）。そして、プロセッサ１１は、ＭＭＲスコアの最も高い文を構成文として抽出する（ステップＳ１１１）。 The processor 11 substitutes the importance calculated in step S105 and the similarity calculated in step S107 into equation (1) above, thereby calculating an MMR score, as an example of the index value, for each sentence in the summary target range that has not yet been extracted as a summary constituent sentence (step S109). The processor 11 then extracts the sentence with the highest MMR score as a constituent sentence (step S111).

構成文として抽出された文が規定数に達していない場合（ステップＳ１１３でＮＯ）、プロセッサ１１は、上記のステップＳ１０７～Ｓ１１１を繰り返す。これにより、構成文とする文が逐次抽出される。構成文として１文が抽出される度に部分集合となる文が増加し、それに伴って未抽出の文の類似度が算出し直される。そのため、構成文として１文が抽出される度にＭＭＲスコアが変化する。 If the number of sentences extracted as constituent sentences does not reach the specified number (NO in step S113), processor 11 repeats steps S107 to S111. As a result, sentences to be constituent sentences are sequentially extracted. Each time one sentence is extracted as a constituent sentence, the number of sentences to be a subset increases, and accordingly the similarity of unextracted sentences is recalculated. Therefore, the MMR score changes every time one sentence is extracted as a constituent sentence.

構成文として抽出された文が規定数に達すると（ステップＳ１１３でＹＥＳ）、プロセッサ１１は、抽出された文を配置することで要約を生成する（ステップＳ１１５）。 When the number of sentences extracted as constituent sentences reaches the prescribed number (YES in step S113), processor 11 arranges the extracted sentences to generate a summary (step S115).

［第２の実施の形態］ [Second embodiment]

第２の実施の形態に係る要約生成装置１０は、要約生成処理において、対象コンテンツに用意されている部分要約を利用して要約を生成する。第２の実施の形態において、プロセッサ１１は、部分要約にシルバースタンダードサマリーアルゴリズム（ＳＳＳＡ）を応用する。ＳＳＳＡの適用に関しては、山西良典、西原陽子、及び金田大地，”部分要約とSilver Standard Summary Algorithm の応用による小説の次回予告生成”，［online］，令和２年６月９日，人工知能学会，［令和２年６月９日検索］，インターネット＜URL：https://doi.org/10.11517/pjsai.JSAI2020.0_3K5OS5b01＞に開示されている。 The summary generation device 10 according to the second embodiment generates a summary using a partial summary prepared for the target content in the summary generation process. In a second embodiment, processor 11 applies the Silver Standard Summary Algorithm (SSSA) for partial summarization. Regarding the application of SSSA, Yoshinori Yamanishi, Yoko Nishihara, and Daichi Kaneda, ``Generation of next notice of novel by application of partial summary and Silver Standard Summary Algorithm'', [online], June 9, 2020, The Japanese Society for Artificial Intelligence , [Retrieved on June 9, 2020], disclosed on the Internet <URL: https://doi.org/10.11517/pjsai.JSAI2020.0_3K5OS5b01>.

ＳＳＳＡを適用した処理において、プロセッサ１１は、対象コンテンツに用意されている部分要約をゴールドスタンダードとして利用する。この場合、図６に表されたように、第２の実施の形態において、要約生成処理は、さらに、第３の決定処理１１７を含む。第３の決定処理１１７は、中断位置に基づいて、対象コンテンツに用意されている複数の部分要約のうちの、要約の生成に用いる部分要約をゴールドスタンダードと決定することを含む。 In SSSA-applied processing, the processor 11 uses a partial summary prepared for the target content as a gold standard. In this case, in the second embodiment, the summary generation process further includes a third decision process 117, as represented in FIG. A third determination process 117 includes determining a partial summary to be used for generating a summary among a plurality of partial summaries prepared for the target content as the gold standard, based on the interruption position.

第３の決定処理１１７において、プロセッサ１１は、中断位置と、生成する要約があらすじであるか予告であるかと、に応じたゴールドスタンダード決定範囲内にある位置に対応付けられた部分要約を、ゴールドスタンダードと決定する。これにより、開始位置が中断位置に近いエピソードに対応した部分要約がゴールドスタンダードと決定される。具体例として、中断位置の属するエピソードの次のエピソードに対応した部分要約が挙げられる。生成する要約が予告である場合、一例として、ゴールドスタンダード決定範囲は中断位置から先の範囲とする。 In the third determination process 117, the processor 11 determines as the gold standard the partial summary associated with a position within a gold standard determination range that depends on the interruption position and on whether the summary to be generated is a synopsis or an advance notice. As a result, a partial summary corresponding to an episode whose start position is close to the interruption position is determined as the gold standard. A specific example is the partial summary corresponding to the episode that follows the episode to which the interruption position belongs. If the summary to be generated is an advance notice, as an example, the gold standard determination range is the range beyond the interruption position.

第３の決定処理１１７は、中断位置と、生成する要約があらすじであるか予告であるかと、に応じて、ゴールドスタンダードからシルバースタンダード要約を決定することを含む。シルバースタンダード要約は、ゴールドスタンダードの構成文のうちの、要約の生成に用いる文を指す。一例として、プロセッサ１１は、ゴールドスタンダードの構成文のうちの、対象コンテンツにおける出現位置が中断位置からシルバースタンダード要約決定範囲内にある文をシルバースタンダード要約に決定する。生成する要約が予告である場合、一例として、シルバースタンダード要約決定範囲は中断位置から先の範囲とする。 The third determination process 117 includes determining a silver standard summary from the gold standard according to the interruption position and whether the summary to be generated is a synopsis or an advance notice. The silver standard summary refers to those constituent sentences of the gold standard that are used to generate the summary. As an example, the processor 11 determines as the silver standard summary those constituent sentences of the gold standard whose appearance position in the target content is within a silver standard summary determination range from the interruption position. If the summary to be generated is an advance notice, as an example, the silver standard summary determination range is the range beyond the interruption position.

第２の実施の形態に係る制御方法を、図７を用いて説明する。また、第２の実施の形態に係る制御方法の具体例について、図３、図８～図１０を用いて説明する。図８～図１０は、「銀河鉄道の夜」（宮沢賢治「銀河鉄道の夜」青空文庫より引用）を対象コンテンツＣとして要約を生成する例を表している。最左列の文番号は、冒頭からすべての文に順に割り当てた番号である。ここでは、要約対象範囲が中断位置より後の範囲を含む予告を生成する場合を説明する。 A control method according to the second embodiment will be described with reference to FIG. 7. A specific example of the control method according to the second embodiment will be described with reference to FIGS. 3 and 8 to 10. FIGS. 8 to 10 show an example of generating a summary with "Night on the Galactic Railroad" (quoted from Kenji Miyazawa, "Night on the Galactic Railroad", Aozora Bunko) as the target content C. The sentence numbers in the leftmost column are numbers assigned in order to all sentences from the beginning. Here, the case of generating an advance notice whose summary target range includes the range after the interruption position is described.

図７のフローチャートは、第１の実施の形態に係る要約生成方法の具体例を表した図５のフローチャートに加えて、ステップＳ２０１，Ｓ２０３の処理が異なっている。すなわち、第２の実施の形態に係る要約生成方法では、プロセッサ１１は、対象コンテンツに用意されている部分要約のうちの、要約の生成に用いる部分要約をゴールドスタンダードとして決定し（ステップＳ２０１）、その構成文の中から、要約の生成に用いる文をシルバースタンダード要約として決定する（ステップＳ２０３）。 The flowchart of FIG. 7 differs from the flowchart of FIG. 5 showing a specific example of the abstract generation method according to the first embodiment in the processing of steps S201 and S203. That is, in the summary generation method according to the second embodiment, the processor 11 determines, as a gold standard, a partial summary to be used for generating the summary, out of the partial summaries prepared for the target content (step S201), Among the constituent sentences, the sentences used for generating the abstract are determined as the silver standard abstract (step S203).

図３の例の場合、コンテンツＣの位置Ｐ１には、エピソードｎに対応した部分要約ＡＢｎが配置されている。位置Ｐ４には、次のエピソードｎ＋１に対応した部分要約ＡＢｎ＋１が配置されている。これら部分要約ＡＢｎ，ＡＢｎ＋１は、コンテンツＣのコンテンツ情報１２２に含まれる要約情報２２に示されている。 In the example of FIG. 3, a partial summary ABn corresponding to episode n is placed at position P1 of content C. A partial summary ABn+1 corresponding to the next episode n+1 is placed at position P4. These partial summaries ABn and ABn+1 are indicated in the summary information 22 included in the content information 122 of content C.

部分要約ＡＢｎの要約対象範囲は範囲Ｈ１である。すなわち、この例の部分要約ＡＢｎは、位置Ｐ０から位置Ｐ１までを要約対象範囲とした、エピソードｎのあらすじである。部分要約ＡＢｎ＋１の要約対象範囲は範囲Ｈ２である。すなわち、この例の部分要約ＡＢｎ＋１は、位置Ｐ０から位置Ｐ４までを要約対象範囲とした、エピソードｎ＋１のあらすじである。 The summary target range of the partial summary ABn is the range H1. That is, the partial summary ABn in this example is a synopsis of the episode n with the range from position P0 to position P1 to be summarized. The summary target range of partial summary ABn+1 is range H2. That is, the partial summary ABn+1 in this example is a synopsis of the episode n+1 with the range from position P0 to position P4 as the summary target range.

生成する要約が予告である場合、一例として、ゴールドスタンダード決定範囲は中断位置から先の範囲Ｈ６とする。この場合、第３の決定処理１１７において、中断位置Ｐ２から範囲Ｈ６にある位置に対応付けられた部分要約をゴールドスタンダードと決定するとすると、図３の例の場合、プロセッサ１１は、エピソードｎが終了する位置Ｐ４に対応付けられた部分要約をゴールドスタンダードと決定する。 If the summary to be generated is an advance notice, as an example, the gold standard determination range is set to the range H6 from the interrupt position. In this case, in the third determination process 117, if the partial summary associated with the position in the range H6 from the interruption position P2 is determined as the gold standard, in the example of FIG. The partial summary associated with position P4 is determined as the gold standard.

生成する要約が予告である場合、一例として、シルバースタンダード要約決定範囲は中断位置から先の範囲Ｈ６とする。この場合、プロセッサ１１は、ゴールドスタンダードの構成文のうちの、対象コンテンツにおける出現位置が中断位置から先の範囲Ｈ６内にある文をシルバースタンダード要約に決定する。なお、ここでは一例として、ゴールドスタンダードとする部分要約を決定する範囲と、ゴールドスタンダードの構成文のうちのシルバースタンダード要約に決定する範囲と、が同じ範囲Ｈ６としているが、これら範囲は異なってもよい。 If the summary to be generated is an advance notice, as an example, the silver standard summary determination range is the range H6 beyond the interruption position. In this case, the processor 11 determines as the silver standard summary those constituent sentences of the gold standard whose appearance position in the target content is within the range H6 beyond the interruption position. Here, as an example, the range used to determine the partial summary serving as the gold standard and the range used to determine which constituent sentences of the gold standard form the silver standard summary are the same range H6, but these ranges may differ.

図８を参照して、「銀河鉄道の夜」（宮沢賢治「銀河鉄道の夜」青空文庫より引用）が文番号１３０まで読了されているとする。この場合、コンテンツ情報１２２に、文番号１３０が中断位置Ｐ２として記憶される。 Referring to FIG. 8, it is assumed that "Night on the Galactic Railroad" (quoted from Kenji Miyazawa's "Night on the Galactic Railroad" Aozora Bunko) has been read up to sentence number 130. In this case, sentence number 130 is stored in content information 122 as interruption position P2.

文番号１３０の属するエピソードｎの終了が文番号１９１である場合、文番号１９２が次のエピソードｎ＋１の開始位置である位置Ｐ４となる。この場合、例えば、図８に示されたように、文番号１９２の直前に、エピソードｎ＋１に対応した部分要約ＡＢｎ＋１が配置されている。 If the end of episode n to which sentence number 130 belongs is sentence number 191, sentence number 192 is position P4, which is the start position of the next episode n+1. In this case, for example, as shown in FIG. 8, the partial summary ABn+1 corresponding to the episode n+1 is arranged immediately before the sentence number 192 .

部分要約ＡＢｎ＋１の要約対象範囲である範囲Ｈ２は文番号１から文番号１９１である。部分要約ＡＢｎ＋１に対応した位置Ｐ４は、中断位置Ｐ２から後の範囲Ｈ６に含まれている。そのため、プロセッサ１１は第３の決定処理１１７において、部分要約ＡＢｎ＋１をゴールドスタンダートと決定する。 The range H2, which is the summary target range of the partial summary ABn+1, is from sentence number 1 to sentence number 191. Position P4 corresponding to partial summary ABn+1 is included in range H6 after interruption position P2. Therefore, processor 11 determines in a third decision process 117 that partial abstract ABn+1 is the gold standard.

図９は、部分要約ＡＢｎ＋１の構成文の具体例を表している。図９を参照して、部分要約ＡＢｎ＋１は、一例として、範囲Ｈ２（文番号１～１９１）から抽出された、文番号５，２１，３１，５５，１２０，１６３，１７０，１９１の８つの文を構成文としている。 FIG. 9 shows a specific example of a constituent sentence of a partial summary ABn+1. Referring to FIG. 9, the partial summary ABn+1 is, for example, eight sentences with sentence numbers 5, 21, 31, 55, 120, 163, 170, and 191 extracted from range H2 (sentence numbers 1 to 191). is a constituent sentence.

このとき、文番号５，２１，３１，５５，１２０であるグループＫ１は中断位置Ｐ２以前から抽出され、文番号１６３，１７０，１９１であるグループＫ２は位置Ｐ２より後から抽出されている。すなわち、グループＫ２は中断位置Ｐ２から後の範囲Ｈ６に含まれた文のグループであり、グループＫ１は含まれていない文のグループである。そのため、プロセッサ１１は第３の決定処理１１７において、グループＫ２をシルバースタンダード要約と決定する。 At this time, the group K1 with sentence numbers 5, 21, 31, 55 and 120 is extracted from before the interruption position P2, and the group K2 with sentence numbers 163, 170 and 191 is extracted after the position P2. That is, the group K2 is a group of sentences included in the range H6 after the interruption position P2, and the group K1 is a group of sentences not included. Therefore, processor 11 determines group K2 as a silver standard summary in a third decision process 117 .
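
The split into groups K1 and K2 can be reproduced with the sentence numbers given in the example. The function name `split_gold_standard` is hypothetical, and treating range H6 as "every sentence number greater than the interruption position" is an assumption that matches this example.

```python
from typing import List, Tuple


def split_gold_standard(gold: List[int], interruption: int) -> Tuple[List[int], List[int]]:
    """Split gold standard sentence numbers into those at or before the
    interruption position (K1) and those after it (K2, the silver standard)."""
    k1 = [n for n in gold if n <= interruption]
    k2 = [n for n in gold if n > interruption]
    return k1, k2


gold_standard = [5, 21, 31, 55, 120, 163, 170, 191]  # partial summary ABn+1
k1, k2 = split_gold_standard(gold_standard, interruption=130)
print(k1)  # [5, 21, 31, 55, 120]
print(k2)  # [163, 170, 191]
```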

第２の実施の形態に係る要約生成方法においても、以降は、第１の実施の形態に係る要約生成方法と同様にしてプロセッサ１１は要約を生成する。すなわち、図７を参照して、プロセッサ１１は、ステップＳ２０３で決定されたシルバースタンダード要約（グループＫ２：文番号１６３，１７０，１９１）を部分集合とし、要約対象範囲に含まれる文のうちの要約構成文として抽出されていない各文について、部分集合に対する類似度を算出する（ステップＳ１０７）。 Also in the abstract generation method according to the second embodiment, processor 11 generates a summary in the same manner as in the abstract generation method according to the first embodiment. That is, referring to FIG. 7, processor 11 selects the silver standard summaries (group K2: sentence numbers 163, 170, 191) determined in step S203 as a subset, and extracts the summaries of the sentences included in the range to be summarized. For each sentence that has not been extracted as a constituent sentence, the degree of similarity with respect to the subset is calculated (step S107).
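
Using the silver standard summary as the initial subset can be sketched by seeding the greedy loop before any sentence is extracted. Everything concrete here is an assumption for illustration: sentences are represented as sets of words, the similarity to the subset is the maximum Jaccard similarity to any subset member, and the names `jaccard` and `extract_with_seed` are hypothetical.

```python
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity of two word sets (0.0 if both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def extract_with_seed(candidates: Dict[int, Set[str]], seed: List[Set[str]],
                      importance: Dict[int, float], n_new: int,
                      lam: float = 0.5) -> List[int]:
    """Greedy MMR extraction whose subset starts as the silver standard."""
    subset = list(seed)           # silver standard sentences (word sets)
    remaining = dict(candidates)  # sentence number -> word set
    picked: List[int] = []
    while remaining and len(picked) < n_new:
        def score(sid: int) -> float:
            sim = max((jaccard(remaining[sid], s) for s in subset), default=0.0)
            return lam * importance[sid] - (1.0 - lam) * sim
        best = max(sorted(remaining), key=score)
        picked.append(best)
        subset.append(remaining.pop(best))  # the subset grows with each pick
    return picked


seed = [{"a", "b"}]                      # stands in for group K2
cands = {1: {"a", "b"}, 2: {"c", "d"}}   # unextracted sentences
imp = {1: 0.6, 2: 0.5}
print(extract_with_seed(cands, seed, imp, n_new=1))  # [2]
```

Sentence 1 is more important but duplicates the seed, so the dissimilar sentence 2 is extracted first; fewer new sentences need to be extracted because the seed already supplies part of the summary.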

なお、ここでの他の例として、プロセッサ１１は、シルバースタンダード要約とされなかった１又は複数の文（例えばグループＫ１：文番号５，２１，３１，５５，１２０）や、ゴールドスタンダードである部分要約の構成文すべて（例えば、グループＫ１＋Ｋ２：文番号５，２１，３１，５５，１２０，１６３，１７０，１９１）を部分集合として、要約対象範囲に含まれる文のうちの要約構成文として抽出されていない各文について、この部分集合に対する類似度を算出してもよい。 As another example here, the processor 11 may use as the subset one or more sentences that were not selected as the silver standard summary (for example, group K1: sentence numbers 5, 21, 31, 55, and 120), or all constituent sentences of the partial summary serving as the gold standard (for example, groups K1+K2: sentence numbers 5, 21, 31, 55, 120, 163, 170, and 191), and calculate the similarity to this subset for each sentence in the summary target range that has not yet been extracted as a summary constituent sentence.

プロセッサ１１は、重要度と類似度とを用いて、要約対象範囲に含まれる文のうちの要約構成文として抽出されていない各文についてＭＭＲスコアを算出する（ステップＳ１０９）。そして、プロセッサ１１は、ＭＭＲスコアの最も高い文を構成文として抽出する（ステップＳ１１１）。 Processor 11 uses the degree of importance and the degree of similarity to calculate an MMR score for each sentence that is not extracted as a summary constituent sentence among the sentences included in the summary target range (step S109). Processor 11 then extracts a sentence with the highest MMR score as a constituent sentence (step S111).

構成文として抽出された文が規定数に達していない場合（ステップＳ１１３でＮＯ）、プロセッサ１１は、上記のステップＳ１０７～Ｓ１１１を繰り返す。 If the number of sentences extracted as constituent sentences does not reach the specified number (NO in step S113), processor 11 repeats steps S107 to S111.

一例として、要約対象範囲から、読み終わりの位置である文番号１３０より後の文番号１３１，１３２，１３３，１３４，１３８を抽出したとする。予告を生成する場合、図１０に表されたように、プロセッサ１１は、シルバースタンダード要約とした文番号１６３，１７０，１９１（グループＫ２）に、新たに抽出された文番号１３１，１３２，１３３，１３４，１３８（グループＫ３）を加えて要約の構成文とする。 As an example, assume that sentence numbers 131, 132, 133, 134, and 138, which follow sentence number 130 (the position where reading ended), are extracted from the summary target range. When generating an advance notice, as shown in FIG. 10, the processor 11 adds the newly extracted sentence numbers 131, 132, 133, 134, and 138 (group K3) to sentence numbers 163, 170, and 191 (group K2), which form the silver standard summary, to obtain the constituent sentences of the summary.

なお、図１０において、予告としての要約の構成文には、中断位置Ｐ２より前の文と後の文との両方が含まれてもよい。言い換えると、予告としての要約の構成文は中断位置Ｐ２より後の文ばかりに限定されない。例えば、中断位置Ｐ２より後の文が１つでも構成文に含まれる場合には、その要約を予告として取り扱ってもよい。 In FIG. 10, the composition sentences of the summary as an advance notice may include both the sentences before and after the interruption position P2. In other words, the constituent sentences of the summary as an advance notice are not limited to the sentences after the interruption position P2. For example, if even one sentence after the interruption position P2 is included in the constituent sentences, the summary may be treated as an advance notice.

このようにゴールドスタンダードを利用することにより、構成文のすべての文を抽出するよりも抽出する文の数が少なくなり、処理が容易になる。すなわち、ステップＳ１０７～Ｓ１１３を繰り返す回数を低減できる。また、ゴールドスタンダードからシルバースタンダード要約を決定して部分集合として用いることで、より適した内容の要約を生成することができる。 By using the gold standard in this way, the number of sentences to be extracted is smaller than that of extracting all sentences of the constituent sentences, and processing is facilitated. That is, the number of times steps S107 to S113 are repeated can be reduced. Also, by determining the silver standard abstract from the gold standard and using it as a subset, it is possible to generate a more suitable abstract.

[Third embodiment]

The summary generation device 10 according to the third embodiment uses reference content in the summary generation process. Reference content is content other than the target content, for which a summary has already been prepared, and which is referred to when extracting the constituent sentences of the summary of the target content.

As an example, the summary generation device 10 acquires information about the content used as reference content from another device. FIG. 11 is a diagram showing the summary generation device 10 according to the third embodiment. As shown in FIG. 11, the summary generation device 10 according to the third embodiment can communicate with a server 30, as the other device, through the communication device 13.

The server 30 stores content data 31A, 31B, ... 31, each associated with one of a plurality of contents. The content data 31A, 31B, ... 31 are used in the summary generation process executed by the processor 11 of the summary generation device 10. Saying that content data 31 is associated with a content means that the content data 31 includes information, such as a name or identifier, that refers to the content; it need not include the content itself.

In the example of FIG. 11, the server 30 is a device external to the summary generation device 10 that the communication device 13 accesses via the network 70. As another example, however, the server 30 may be a storage device installed in the summary generation device 10.

Each content data 31 includes an attribute 32. The attribute 32 is information representing the characteristics of the story of the content and is, as an example, a value obtained by converting the sentences of the entire story into word vectors. Instead of, or in addition to, that word-vector value, the attribute 32 may be meta-information such as the genre, the scriptwriter, the season number, or the characteristics of the intended audience.

Content data 31 that can be used as reference content includes partial summaries 33. FIG. 11 shows a case where the content data 31 includes a plurality of partial summaries 33A, 33B, 33C, and so on. The partial summaries 33A, 33B, 33C, ... are prepared for each episode of the content, and each is a summary whose summary target range is the preceding episodes.

Each content data 31 includes extraction data 34. The extraction data 34 is data representing the tendency of where, within the content, the constituent sentences of each of one or more partial summaries 33A, 33B, 33C, ... are located. The extraction data 34 is described concretely with reference to FIG. 12.

FIG. 12 is a diagram showing, for actual contents A and B, the distribution within each content of the sentences used as constituent sentences of the partial summaries. The horizontal axis of FIG. 12 indicates the episode number EP, and the vertical axis indicates the extraction position PP, within the summary target range, of each constituent sentence. For each episode, FIG. 12 plots the extraction position PP of each sentence used as a constituent sentence of the corresponding partial summary.

Contents A and B are globally popular serial dramas that have each aired for three or more seasons. FIG. 12 uses the first three seasons of contents A and B and shows, for every episode in those three seasons, the extraction position PP within the summary target range of each sentence used as a constituent sentence of that episode's partial summary. The partial summaries use sentences within the first three minutes of each episode.

The extraction position PP(q) of a sentence q extracted as a constituent sentence is obtained by the following procedure. First, equation (3) below gives the value EP_e(q), which normalizes the appearance position P_e(q) of the sentence q among all N_e sentences included in the episode e from which it was extracted.
EP_e(q) = P_e(q) / N_e ... (3)

Next, the value EP_e(q) is used to obtain the absolute position AP(q) defined by equation (4) below.
AP(q) = EP_e(q) + (e − 1) ... (4)

Then, the extraction position PP(q), which is a past relative position, is obtained as defined by equation (5) below. The past relative position expresses the position of the sentence q relative to the episode number e' of the episode whose partial summary contains the sentence q, with the position of that episode taken as 1.
PP(q) = AP(q) / (e' − 1) ... (5)

As an example, if the absolute position AP of a constituent sentence of the partial summary corresponding to episode number 3 is 1.4591, equation (5) gives the extraction position PP = 1.4591 / (3 − 1) ≈ 0.7295. Expressing the extraction position PP as a past relative position makes it possible to take into account the relative positional relationship within the range from the start of the season to the position of the partial summary. This allows the extraction positions of sentences to be compared across different contents.
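Equations (3) to (5) can be combined into a single small function, sketched below. The function and parameter names are illustrative assumptions, and the input values in the usage line (sentence 4591 of 10000 in episode 2) are hypothetical numbers chosen only so that the absolute position equals the 1.4591 of the example above.

```python
def past_relative_position(pos_in_episode, n_sentences, episode, summary_episode):
    """Return the extraction position PP(q) following equations (3) to (5).

    pos_in_episode  : appearance position P_e(q) of the sentence in episode e
    n_sentences     : total number of sentences N_e in episode e
    episode         : episode number e from which the sentence was extracted
    summary_episode : episode number e' whose partial summary uses the sentence
    """
    if n_sentences <= 0:
        raise ValueError("n_sentences must be positive")
    if summary_episode <= 1:
        raise ValueError("episode 1 has no preceding range to summarize")
    ep = pos_in_episode / n_sentences      # equation (3)
    ap = ep + (episode - 1)                # equation (4)
    return ap / (summary_episode - 1)      # equation (5)


# Hypothetical input giving AP = 1.4591 for a summary of episode 3.
pp = past_relative_position(4591, 10000, episode=2, summary_episode=3)
```

The result is AP / 2, which corresponds to the value of roughly 0.7295 quoted in the text.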

For a plurality of contents, including contents A and B, each of which is a globally popular serial drama that has aired for three or more seasons, the inventors examined the distribution within the content of the sentences used as constituent sentences of the partial summaries, in the same way as for contents A and B.

As a result, as shown in FIG. 12, the inventors noticed that, across multiple contents, the extraction positions PP for season 1 tend to concentrate near 0.0 and 1.0. The smaller the value of the extraction position PP (the closer to 0), the closer the extraction position is to the beginning of the summary target range, that is, to the beginning of the season. The larger the value (the closer to 1), the closer the extraction position is to the end of the summary target range, that is, to the end of the episode immediately preceding the corresponding episode. It was therefore considered that, across multiple contents, the sentences used as constituent sentences of the partial summaries for season 1 tend to be biased toward the beginning and the end of the summary target range (Consideration 1).

The inventors also noticed that, across multiple contents, plots of the extraction position PP run continuously downward to the right in the region around the middle between 0.0 and 1.0. This was taken to indicate that the partial summaries tend to use sentences at the same position in the relevant season as constituent sentences (Consideration 2). In other words, it was considered that, across multiple contents, there is a tendency for particular sentences of high importance to be used in each partial summary.

In addition, as shown in FIG. 12, the inventors noticed that the distribution of the extraction positions PP in each of seasons 1 to 3 can differ between contents A and B. This was taken to indicate that the tendency in extracting sentences for the partial summaries may differ from content to content, from season to season, or both (Consideration 3).

Based on Considerations 1 and 2, the inventors divided the summary target range into three segments (beginning, middle, and end) and, for each partial summary, used the proportion of its constituent sentences extracted from each segment as the extraction data 34. As an example, the beginning s, the middle c, and the end e are, counted from the start of the season, the first 20%, the next 60%, and the final 20% of the range, respectively.

Focusing on episode 50 of content A, indicated by the dotted line in FIG. 12, it can be read that the sentences used as constituent sentences of the partial summary corresponding to episode 50 lie 50% in the beginning s, 30% in the middle c, and 20% in the end e of the summary target range. That is, the partial summary corresponding to episode 50 of content A was generated by extracting 50% of its constituent sentences from the beginning s, 30% from the middle c, and 20% from the end e of the range from the start of season 3 to the end of episode 49.

The extraction data 34 represents, for the partial summary corresponding to each episode, the proportion of its constituent sentences extracted from each segment. In other words, the extraction data 34 can be regarded as data representing the tendency of the relative positions, within the content, of the sentences used as constituent sentences of the partial summaries of the content.

As an example, the extraction data 34 can be expressed in a table format as shown in FIG. 13. Referring to FIG. 13, the extraction data 34 of content A indicates, for every episode 1 to 69 of content A up to season 3, the extraction ratio from each of the beginning s, the middle c, and the end e of the summary target range for the sentences used as constituent sentences of the partial summary. For example, for episode 50, 50%, 30%, and 20% are specified for the beginning s, the middle c, and the end e, respectively.
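One way the per-episode extraction ratios could be derived from extraction positions PP is sketched below. The 20% / 60% / 20% split follows the example given in the description; the function name, the dictionary keys, the treatment of boundary values, and the sample positions are assumptions made for illustration.

```python
def extraction_ratios(positions, early_end=0.2, late_start=0.8):
    """Return the share of constituent sentences in each segment.

    positions : extraction positions PP (0.0 to 1.0) of the constituent
                sentences of one partial summary
    The beginning "s" covers PP < early_end, the end "e" covers
    PP >= late_start, and the middle "c" covers the rest. How boundary
    values are assigned is an assumption of this sketch.
    """
    if not positions:
        raise ValueError("at least one position is required")
    counts = {"s": 0, "c": 0, "e": 0}
    for pp in positions:
        if pp < early_end:
            counts["s"] += 1
        elif pp < late_start:
            counts["c"] += 1
        else:
            counts["e"] += 1
    total = len(positions)
    return {segment: n / total for segment, n in counts.items()}


# Hypothetical positions reproducing the 50% / 30% / 20% split of episode 50.
sample = [0.01, 0.05, 0.10, 0.15, 0.19, 0.30, 0.50, 0.70, 0.90, 1.00]
ratios = extraction_ratios(sample)
```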

Furthermore, based on Consideration 3, the inventors decided to use content suited to the target content as the reference content in the summary generation process. For this reason, as shown in FIG. 11, the server 30 stores a plurality of content data 31A, 31B, ... 31, each of which includes extraction data 34.

Referring to FIG. 11, in the third embodiment the summary generation process further includes a selection process 118. The selection process 118 includes selecting the reference content. As an example, the reference content is selected from among the content data 31A, 31B, ... 31 stored in the server 30.

In the selection process 118, content data 31 whose story characteristics are related to those of the target content is selected based on the attribute 32 included in the content data 31. As an example, content whose word-vector value for the sentences of the entire story lies within a predetermined range of the corresponding value of the target content is extracted as the reference content. As another example, content that matches or approximates the target content in at least one of the genre, the scriptwriter, the season number, and the characteristics of the intended audience may be extracted as the reference content.
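The first example of the selection process 118 can be sketched as below. The description only requires that the word-vector value lie within a predetermined range of that of the target content; using cosine similarity with a threshold, and picking the single most similar candidate, are assumptions of this sketch, as are all names and the sample vectors.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length numeric vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def select_reference(target_vector, candidates, min_similarity=0.8):
    """Return the id of the most similar candidate, or None.

    candidates     : mapping from content id to its attribute vector
    min_similarity : hypothetical stand-in for the "predetermined range"
    """
    best_id, best_sim = None, min_similarity
    for content_id, vector in candidates.items():
        sim = cosine_similarity(target_vector, vector)
        if sim >= best_sim:
            best_id, best_sim = content_id, sim
    return best_id


# Hypothetical attribute vectors for two stored contents.
stored = {"A": [1.0, 0.0, 0.0], "B": [0.9, 0.1, 0.0]}
chosen = select_reference([1.0, 0.1, 0.0], stored)
```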

In the third embodiment, in the extraction process 114, the processor 11 uses the extraction data 34 associated with the reference content to extract, from the target content, the sentences that will constitute the summary. Specifically, the processor 11 refers to the extraction ratios for the beginning s, the middle c, and the end e of the summary target range that the extraction data 34 associated with the reference content gives for the summary of the applicable episode. The applicable episode is an episode whose start position is near the position P2; in the example of FIG. 6, it is, for example, episode n+1. In this example, it may also be episode n.

Assume that the extraction ratios for the beginning s, the middle c, and the end e of the summary target range for the summary ABn+1 of the applicable episode n+1 are the 50%, 30%, and 20% shown in FIG. 13. In this case, the processor 11 applies these ratios to the target content and extracts sentences based on the MMR score. In this example, if the number of constituent sentences is 10, the processor 11 extracts 5, 3, and 2 sentences from the beginning s, the middle c, and the end e of the summary target range, respectively, based on the MMR score.
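The application of the extraction ratios can be sketched as follows. This is a simplification under stated assumptions: each sentence is given a fixed score, whereas an MMR score is normally recomputed as sentences are added to the subset. Distributing any remainder by the largest-remainder rule is also an assumption, as are the function names and the tuple layout.

```python
def allocate_quota(percent, total):
    """Split `total` sentences across segments according to integer percentages.

    Any sentences left over after rounding down go to the segments with the
    largest remainders (an assumption; the description does not address it).
    """
    quota = {seg: total * p // 100 for seg, p in percent.items()}
    leftover = total - sum(quota.values())
    by_remainder = sorted(percent, key=lambda seg: (total * percent[seg]) % 100,
                          reverse=True)
    for seg in by_remainder[:leftover]:
        quota[seg] += 1
    return quota


def extract_by_segment(scored, percent, total):
    """Pick the top-scoring sentences of each segment up to its quota.

    scored : list of (sentence_id, segment, score) tuples, where the score
             stands in for the MMR score and is treated as fixed
    """
    quota = allocate_quota(percent, total)
    chosen = []
    for seg, n in quota.items():
        in_segment = sorted((t for t in scored if t[1] == seg),
                            key=lambda t: t[2], reverse=True)
        chosen.extend(t[0] for t in in_segment[:n])
    return sorted(chosen)


# The 50% / 30% / 20% example with 10 constituent sentences.
quota = allocate_quota({"s": 50, "c": 30, "e": 20}, 10)
```

With the ratios of the example, `quota` assigns 5, 3, and 2 sentences to the beginning, middle, and end.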

As a result, a summary of the target content is generated by extracting sentences from the summary target range with the same tendency as the partial summaries prepared for reference content whose story characteristics are related. The summary can therefore be generated easily.

The summary generation device 10 according to the third embodiment may further use placement data in the summary generation process. The placement data is an evaluation value of the mismatch, in content for which summaries have been prepared, between the order in which the constituent sentences appear in the content and the order in which they are arranged in the partial summary. The placement data 35 is, for example, the Jaro-Winkler distance.

In this case, as shown in FIG. 11, each content data 31 includes placement data 35. The placement data 35 of content A indicates, for example, for every episode 1 to 69 of content A up to season 3, the extraction ratio from each of the beginning s, the middle c, and the end e of the summary target range for the sentences used as constituent sentences of the partial summary. For example, for episode 50, 50%, 30%, and 20% are specified for the beginning s, the middle c, and the end e, respectively.

The inventors noticed that the order in which the constituent sentences are arranged in a partial summary sometimes differs from the order of their positions in the content, that is, the two orders may not match. They therefore calculated and examined, for a plurality of contents including content A, the Jaro-Winkler distance as an index value representing the mismatch between the arrangement order in the partial summary and the order of appearance in the content. A Jaro-Winkler distance closer to 1 indicates that the arrangement order in the partial summary is closer to the order of positions in the content, that is, that the mismatch is smaller.

When the inventors surveyed a large number of works and calculated the average Jaro-Winkler distance for each season, they found that most works fell in the range of 0.65 to 0.85. This indicates that partial summaries often arrange sentences in an order different from that of the content.

The greater the mismatch between the arrangement order in the partial summary and the order of appearance in the content, the harder it is to fully understand the content from the summary. Conversely, the smaller the mismatch, the easier the content is to understand. It is therefore conceivable that the degree of mismatch is chosen deliberately according to the attributes of the content.

That is, for content categories such as suspense, where it is preferable that the summary not fully reveal the content, it was considered that setting an appropriate degree of mismatch according to, for example, the summary target range balances understanding of the content against the motivation to watch it. Accordingly, the summary generation device 10 according to the third embodiment may use the placement data 35 prepared for each content to generate the summary of the target content.

As an example, the placement data 35 can be expressed in a table format as shown in FIG. 14. The placement data 35 in FIG. 14 shows, as an example, the placement data (Jaro-Winkler distance) for each of seasons 1, 2, and 3 and for all seasons of contents A, B, and C. The Jaro-Winkler distance is 1 when the order of the sentences' positions in the content matches the arrangement order in the partial summary exactly, and 0 when the two are not similar at all. In the example of FIG. 14 the value of the placement data is shown for each season, but a single value may instead be given for each content.

The summary generation device 10 according to the third embodiment uses the placement data of content related to the target content to arrange the sentences extracted as constituent sentences. As an example, the placement data of the reference content is used. Alternatively, in the summary generation method according to the third embodiment, it may be possible to select whether the extracted sentences are arranged in the summary in the same order as they appear in the target content or are arranged using the placement data of the related content. The selection may be made, for example, by the person generating the summary.

Specifically, in the generation process 115, the processor 11 arranges the plurality of sentences extracted as constituent sentences based on the placement data 35. As an example, the processor 11 rearranges the extracted sentences so that their Jaro-Winkler distance equals the Jaro-Winkler distance indicated in the placement data 35 of the reference content, or falls within a predetermined range of it. This makes the content of the target content about as easy to grasp from the summary as that of the reference content.
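The rearrangement step can be sketched as below. The description uses "Jaro-Winkler distance" for a value that is 1 when the two orders match, so the sketch computes the standard Jaro-Winkler similarity over sequences of sentence numbers. The exhaustive search over permutations, the tolerance value, and all names are assumptions made for illustration; the search is practical only for a small number of sentences.

```python
from itertools import permutations


def jaro(a, b):
    """Jaro similarity of two sequences (1.0 means identical)."""
    if a == b:
        return 1.0
    la, lb = len(a), len(b)
    if la == 0 or lb == 0:
        return 0.0
    window = max(max(la, lb) // 2 - 1, 0)
    used_a, used_b = [False] * la, [False] * lb
    matches = 0
    for i, x in enumerate(a):
        for j in range(max(0, i - window), min(lb, i + window + 1)):
            if not used_b[j] and b[j] == x:
                used_a[i] = used_b[j] = True
                matches += 1
                break
    if matches == 0:
        return 0.0
    seq_a = [x for x, u in zip(a, used_a) if u]
    seq_b = [y for y, u in zip(b, used_b) if u]
    transpositions = sum(x != y for x, y in zip(seq_a, seq_b)) // 2
    return (matches / la + matches / lb
            + (matches - transpositions) / matches) / 3


def jaro_winkler(a, b, scaling=0.1):
    """Jaro-Winkler similarity with the usual prefix bonus (up to 4 items)."""
    sim = jaro(a, b)
    prefix = 0
    for x, y in zip(a[:4], b[:4]):
        if x != y:
            break
        prefix += 1
    return sim + prefix * scaling * (1 - sim)


def reorder_to_target(sentence_ids, target, tolerance=0.05):
    """Return an ordering whose similarity to content order is near `target`."""
    in_order = sorted(sentence_ids)
    best, best_gap = in_order, abs(1.0 - target)
    for perm in permutations(in_order):
        gap = abs(jaro_winkler(list(perm), in_order) - target)
        if gap < best_gap:
            best, best_gap = list(perm), gap
        if best_gap <= tolerance:
            break
    return best
```

For example, `reorder_to_target([131, 132, 133, 134, 138], 0.75)` would search for an ordering of the five sentences whose similarity to the content order is as close to 0.75 as the search finds.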

<3. Notes>
The present invention is not limited to the above embodiments, and various modifications are possible. For example, at least two of the first to third embodiments may be combined.

10: Summary generation device
11: Processor
12: Memory
13: Communication device
14: Display
15: Playback device
17: Operation unit
21: Interruption position information
22: Summary information
30: Server
31: Content data
31A: Content data
31B: Content data
32: Attribute
33: Partial summary
33B: Partial summary
33C: Partial summary
34: Extraction data
35: Placement data
70: Network
111: First determination process
112: Second determination process
113: Calculation process
114: Extraction process
115: Generation process
116: Playback process
117: Third determination process
118: Selection process
121: Generation program
122: Content information
AB12: Partial summary
AB1n: Partial summary
ABn: Partial summary
C: Content
CS1: Constituent sentence
CS2: Constituent sentence
CS3: Constituent sentence
EP10: Episode
EP11: Episode
EP12: Episode
EP1n: Episode
EP21: Episode
EP22: Episode
H1: Range
H2: Range
H3: Range
H4: Range
H5: Range
H6: Range
K1: Group
K2: Group
K3: Group
P0: Start position
P2: Interruption position
PP: Extraction position
q: Sentence
q1: Sentence
q2: Sentence
q3: Sentence
q4: Sentence

Claims

1. A summary generation device for generating a summary of target content, comprising a processor,
wherein the processor is configured to:
determine a first range in the target content from a playback interruption position of the target content;
calculate, from the first range, an index value of a sentence included in a second range, the second range being determined from the playback interruption position, being a range in the target content from which constituent sentences of the summary are extracted, and being at least partially different from the first range; and
extract the constituent sentences of the summary from the second range based on the index value.

2. The summary generation device according to claim 1, wherein the index value includes a value obtained based on words included in the first range.

3. The summary generation device according to claim 1, wherein the first range is a range after the playback interruption position and is an expected playback range of the target content determined from the playback interruption position.

4. The summary generation device according to any one of claims 1 to 3, wherein
the second range is determined by whether the summary is a first summary or a second summary,
the first summary is a summary generated with, as the second range, a range that does not include the portion after the playback interruption position, and
the second summary is a summary generated with, as the second range, a range that includes the portion after the playback interruption position.

5. The summary generation device according to any one of claims 1 to 4, wherein
extracting the constituent sentences of the summary includes sequentially extracting, from the second range and based on the index value, a plurality of sentences to be included in the constituent sentences, thereby generating a subset made up of the sequentially extracted sentences, and
the index value includes a degree of similarity between a sentence included in the second range and the subset.

6. The summary generation device according to claim 5, wherein the subset includes sentences extracted from a summary prepared in advance for the target content.

7. The summary generation device according to any one of claims 1 to 6, wherein
the processor is further configured to select reference content based on the target content, and
extracting the constituent sentences of the summary from the second range includes extracting the constituent sentences of the summary from the second range by referring to extraction data associated with the reference content.

8. A summary generation method for generating a summary of target content, comprising:
determining a first range in the target content from a playback interruption position of the target content;
calculating, from the first range, an index value of a sentence included in a second range, the second range being a range in the target content from which constituent sentences of the summary are extracted and being at least partially different from the first range; and
extracting the constituent sentences of the summary from the second range based on the index value.