JP6179027B2

JP6179027B2 - Slide show creation server, user terminal, and slide show creation method

Info

Publication number: JP6179027B2
Application number: JP2012221293A
Authority: JP
Inventors: 石先 広海; 服部 元; 小野 智弘
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2012-10-03
Filing date: 2012-10-03
Publication date: 2017-08-16
Anticipated expiration: 2032-10-03
Also published as: JP2014075662A

Description

The present invention relates to a slide show creation server, a user terminal, and a slide show creation method for creating a slide show that displays images in accordance with the playback of music data.

Techniques for generating a music slide show from images on the web to accompany a piece of music are already known. For example, the portable information terminal described in Patent Document 1 extracts lyrics by speech recognition during music playback, extracts keywords from those lyrics according to predetermined rules, searches local storage or web pages on the Internet for images based on the keywords, and displays the images during music playback.

The query extraction device described in Patent Document 2 divides lyrics into a plurality of sections, determines a priority section from which keyword groups are preferentially selected, and searches for images that match the lyrics both globally and locally. The image display device described in Patent Document 3 automatically determines a suitable display time when lyrics and images are played back in synchronization, and switches the displayed image while preventing any image from being shown for an extremely short or extremely long time.

Patent Document 1: JP 2008-8954 A
Patent Document 2: JP 2011-048729 A
Patent Document 3: JP 2011-166386 A

Shintaro Funasawa, Hiromi Ishizaki, Keiichiro Hoashi, Yasuhiro Takishima, Jiro Katto: Proposal of a Web Image and Music Synchronized Playback System Considering Lyric Features, 8th Forum on Information Technology, E-034 (2009).
D. A. Shamma, B. Pardo, and K. J. Hammond: MusicStory: a Personalized Music Video Creator, Proceedings of the 13th Annual ACM International Conference on Multimedia, pp. 563-566 (2005).
R. Cai, L. Zhang, F. Jing, W. Lai, and W.-Y. Ma: Automated Music Video Generation Using Web Image Resource, Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing, 2: pp. 737-740 (2007).
Shintaro Funasawa, Hiromi Ishizaki, Keiichiro Hoashi, Yasuhiro Takishima, Jiro Katto: A Study on Automatic Music Classification for Music Retrieval Based on the Impression of Lyrics, 71st National Convention of the Information Processing Society of Japan, 5R-2 (2009).

However, the existing techniques described above generate a slide show for a piece of music mainly from the large pool of images on the web, and cannot sufficiently reflect the individual user's own choices. On the other hand, if users try to create a slide show by selecting images to match the music from scratch, they face an unfamiliar and difficult task.

The present invention has been made in view of these circumstances, and its object is to provide a slide show creation server, a user terminal, and a slide show creation method with which even an inexperienced user can easily obtain a unique slide show that suits the music.

(1) To achieve the above object, the slide show creation server of the present invention is a slide show creation server that creates a slide show displaying images in accordance with the playback of music data, and comprises: a template management unit that manages a first image feature amount, extracted as the image feature amount of a series of image data, in association with an original slide show that specifies the series of image data; a second feature amount extraction unit that extracts, as a second image feature amount, the image feature amount of image data obtained from an image data source; and a slide show generation unit that determines similarity using the extracted first and second image feature amounts and generates a slide show in which the image data specified by the original slide show is replaced with similar image data from the image data source.

Because the slide show creation server of the present invention replaces the image data specified by the original slide show with similar image data from the image data source, even an inexperienced user can easily obtain a unique slide show that suits the music.

(2) In the slide show creation server of the present invention, the second feature amount extraction unit extracts, as the second image feature amount, the image feature amount of image data stored in a user DB serving as the image data source. This allows the user to use a slide show created from the user's own image data.

(3) In the slide show creation server of the present invention, only a user identified as a specific user is permitted to upload image data to the user DB. This allows the user to create a slide show using image data that the user has personally uploaded.

(4) In the slide show creation server of the present invention, the slide show generation unit selects the replacement image data from image data that is stored in the user DB and associated with the identification information of a specific user. This allows the user to create a unique slide show whose images have been replaced with, for example, photographs the user has taken.

(5) In the slide show creation server of the present invention, the slide show generation unit generates a slide show in which the image data specified by the original slide show is replaced when the similarity obtained by comparing the first and second image feature amounts is greater than a predetermined threshold. This makes it easy to create a unique slide show from an original slide show whose images were prepared to match the characteristics of the music.

(6) In the slide show creation server of the present invention, the slide show generation unit checks a replaceability parameter set for the image data specified by the original slide show. For image data that the replaceability parameter designates as replaceable, it generates a slide show in which that image data is replaced; for image data that the parameter designates as non-replaceable, it does not replace that image data.

Replaceability is thus set for specific images in the slide show, and replacement of an image can be disabled when the replaceability parameter designates it as non-replaceable. Images that should not be replaced therefore remain in the slide show as they are.

(7) In the slide show creation server of the present invention, the second feature amount extraction unit extracts, as the second image feature amount, the image feature amount of image data available online serving as the image data source. This makes it possible to prepare an original slide show from image data available online.

(8) The slide show creation server of the present invention selects, based on the similarity determination using the extracted first and second image feature amounts, music data associated with a slide show that specifies image data similar to the image data of the image data source. This allows the user to easily find music that suits the image data.

(9) In the slide show creation server of the present invention, the template management unit treats a slide show created by another user as the original slide show. This makes it possible to create a slide show from a template derived from a slide show provided by another user.

(10) The slide show creation server of the present invention further comprises a slide show distribution unit that transmits a slide show derived from the original slide show to a user terminal and, together with the transmitted slide show, transmits replacement candidate image data or data corresponding to the replacement candidate image data. The terminal of the specific user can thus display the replacement candidate images in a selectable manner, and the specific user can choose the image to substitute from among them.

(11) In the slide show creation server of the present invention, the slide show generation unit generates a slide show in which the image data specified by the transmitted slide show is replaced with the replacement candidate image data indicated by the user terminal. The server can thus replace images in the slide show with the images designated by the specific user and create a slide show unique to that user.

(12) The slide show creation server of the present invention further comprises a slide show distribution unit that streams the music and the slide show in response to a playback instruction from the user terminal. The server streams the created slide show when the user requests playback, so the slide show can be played without placing a load on the terminal.

(13) The user terminal of the present invention uses the slide show received from the above slide show creation server, together with the replacement candidate image data or the data corresponding to it, to display the images corresponding to each section of the music in a selectable manner. This allows the user to create a slide show suited to the music by operating the terminal.

(14) The slide show creation method of the present invention is a method for creating a slide show that displays images in accordance with the playback of music data, and includes the steps of: preparing a first image feature amount, extracted as the image feature amount of a series of image data, in association with an original slide show that specifies the series of image data; extracting, as a second image feature amount, the image feature amount of image data obtained from an image data source; and determining similarity using the extracted first and second image feature amounts and generating a slide show in which the image data specified by the original slide show is replaced with similar image data from the image data source. With this method, even an inexperienced user can easily obtain a unique slide show that suits the music.

According to the present invention, the image data specified by the original slide show is replaced with similar image data from the image data source, so even an inexperienced user can easily obtain a unique slide show that suits the music.

Block diagram showing the slide show creation system of the present invention.
Block diagram showing the template management unit.
Diagram showing the data structure of the music DB.
Table showing a template.
Flowchart showing the overall impression generation process.
Flowchart showing the image selection process using lyrics and estimated keywords.
Flowchart showing the slide show creation process.
Diagram showing an example of the photo sharing screen on the user terminal.
Diagram showing an example of the image upload screen.
Diagram showing an example of the image upload screen.
Diagram showing an example of the top screen for music selection.
Diagram showing an example of the screen listing playable songs.
Diagram showing an example of the slide show playback preparation screen.
Diagram showing an example of the slide show editing screen.
Diagram showing an example of the candidate image data list screen.
Diagram showing an example of the candidate image data list screen.
Diagram showing an example of the uploaded image selection screen.
Diagram showing an example of the uploaded image selection screen.
Diagram showing an example of the change confirmation screen.
Diagram showing an example of the gallery image selection screen.
Diagram showing an example of the saved slide show list screen.

Next, embodiments of the present invention will be described with reference to the drawings. To make the description easier to follow, the same reference numerals are given to the same components in each drawing, and duplicate descriptions are omitted.

[Slide show creation system]
(System configuration)
FIG. 1 is a block diagram showing a slide show creation system 10. The slide show creation system 10 includes a slide show creation server 100 and a user terminal 200. The slide show creation server 100 creates a slide show that displays images in accordance with the playback of music data. The user terminal 200 allows the slide show to be edited and played back.

(Configuration of the slide show creation server)
As shown in FIG. 1, the slide show creation server 100 includes a template management unit 110, a user DB 120, a second feature amount extraction unit 130, a slide show generation unit 140, and a slide show distribution unit 150.

The template management unit 110 manages a first image feature amount, extracted as the image feature amount of a series of image data, in association with the original slide show that specifies the series of image data. Details of the template management unit 110 are described later.

The user DB 120 stores image data captured by the user and information on the user terminal in association with user information. The user DB 120 may permit only a user identified as a specific user to upload image data. This allows the user to create a slide show using image data that the user has personally uploaded.

The second feature amount extraction unit 130 extracts, as the second image feature amount, the image feature amount of image data obtained from the image data source by the image selection process. Image feature amounts extracted in advance from the stored image data may be kept in the user DB 120 and reused, or the image feature amounts may be extracted in real time.

For example, OpenCV can be used to extract image feature amounts such as the presence or absence of a person (face detection), color information (color moments: the mean and standard deviation of each color in each local region), edge information (edge direction histogram: a histogram of luminance gradients at the edge points extracted by a Canny filter), texture information (Local Binary Pattern), and BoVW (bag of visual words: a histogram of feature-point shapes).
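As a rough illustration of one of the features listed above, the following sketch computes color moments (per-region mean and standard deviation of each color channel). It is written in plain Python rather than with OpenCV calls, and the function name `color_moments`, the `grid` parameter, and the nested-list image representation are assumptions made for this example, not details taken from the specification.

```python
from statistics import mean, pstdev


def color_moments(pixels, grid=2):
    """Return the mean and standard deviation of R, G, B for each local region.

    pixels: list of rows, each row a list of (r, g, b) tuples.
    grid:   the image is split into grid x grid regions (hypothetical choice).
    The image must be at least `grid` pixels wide and high.
    """
    height, width = len(pixels), len(pixels[0])
    features = []
    for gy in range(grid):
        for gx in range(grid):
            # Pixel bounds of the current local region.
            y0, y1 = gy * height // grid, (gy + 1) * height // grid
            x0, x1 = gx * width // grid, (gx + 1) * width // grid
            cell = [pixels[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            for channel in range(3):
                values = [p[channel] for p in cell]
                features.append(mean(values))    # first moment
                features.append(pstdev(values))  # second moment
    return features
```

The resulting list can be used directly as one segment of an image feature vector; the other features named in the text would be concatenated to it in a fuller implementation.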

The second feature amount extraction unit 130 extracts, as the second image feature amount, the image feature amount of image data stored in the user DB 120 for a specific user, which serves as the image data source. This allows the user to use a slide show created from the user's own image data. The image data source is preferably specific to the user, but it may also be on the web.

The second feature amount extraction unit 130 may also extract, as the second image feature amount, the image feature amount of image data available online serving as the image data source. This makes it possible to prepare an original slide show from image data available online.

The slide show generation unit 140 determines similarity using the extracted first and second image feature amounts, and generates a slide show in which the image data specified by the original slide show is replaced with similar image data from the image data source. As a result, even an inexperienced user can easily obtain a unique slide show that suits the music.

Similarity can be determined by comparing the image features stored in the image template with the image features of the user's personal photographs and calculating a similarity score. For example, the vector similarity between an image feature vector stored in the user DB 120 and an image feature vector held by the template management unit 110 is calculated. A slide show can then be generated in which the image data specified by the original slide show is replaced with image data whose similarity, obtained by comparing the first and second image feature amounts, exceeds a predetermined threshold. This makes it easy to create a unique slide show from an original slide show whose images were prepared to match the characteristics of the music.

Cosine similarity, for example, can be used as the vector similarity. The similarity threshold can be set to a value at which two images can be objectively judged similar, for example 0.70 or thereabouts, and images exceeding it can be used. It is preferable, however, that the user can set this value freely.
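The similarity test described above can be sketched as follows. This is a minimal example under stated assumptions: the function names `cosine_similarity` and `best_replacement`, and the dictionary mapping image IDs to feature vectors, are hypothetical. Only the use of cosine similarity and the example threshold of 0.70 come from the text.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def best_replacement(template_vec, user_images, threshold=0.70):
    """Pick the user image most similar to one template image.

    user_images: dict mapping image ID -> feature vector (hypothetical layout).
    Returns (image_id, similarity), or None when no image exceeds the threshold,
    in which case the template image would simply be kept.
    """
    best = None
    for image_id, vec in user_images.items():
        sim = cosine_similarity(template_vec, vec)
        if sim > threshold and (best is None or sim > best[1]):
            best = (image_id, sim)
    return best
```

Choosing the single highest-scoring image is one possible policy; the text only requires that the replacement exceed the threshold, so returning several candidates for the user to pick from would be equally consistent with it.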

The slide show generation unit 140 may select the replacement image data from image data that is stored in the user DB 120 and associated with the identification information of a specific user. This allows the user to create a unique slide show whose images have been replaced with, for example, photographs the user has taken.

The slide show generation unit 140 also checks the replaceability parameter set for the image data specified by the original slide show. The replaceability parameter indicates whether the image data may be replaced. For image data that the parameter designates as replaceable, the slide show generation unit 140 generates a slide show in which that image data is replaced; for image data that the parameter designates as non-replaceable, it does not replace that image data. Images can thus be replaced while those that should not be replaced remain in the slide show as they are. It is preferable that the user can set the value of the replaceability parameter.
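One way to picture the replaceability check is the sketch below. The `Slide` class, its field names, and the `candidates` mapping (assumed to hold replacement images already chosen by the similarity test) are all hypothetical and serve only to illustrate the rule that non-replaceable images are left untouched.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Slide:
    line_no: int              # lyric line (section) this image belongs to
    image_id: str             # image shown for that line
    replaceable: bool = True  # replaceability parameter


def apply_replacements(slides, candidates):
    """Return a new slide list with replaceable images swapped.

    candidates: dict mapping line_no -> replacement image ID, assumed to
    have been selected beforehand by the similarity determination.
    """
    result = []
    for slide in slides:
        new_id = candidates.get(slide.line_no)
        if slide.replaceable and new_id is not None:
            result.append(replace(slide, image_id=new_id))
        else:
            result.append(slide)  # keep the original image as it is
    return result
```

Because the function returns a new list, the original slide show remains available as a template for other users.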

When the user terminal 200 indicates a specific candidate image, the slide show generation unit 140 generates a slide show in which the image data specified by the slide show transmitted to the user terminal 200 is replaced with the indicated replacement candidate image data. Images in the slide show are thus replaced with images the user has deliberately chosen, producing a slide show unique to the user.

The slide show generation unit 140 may also determine the similarity between the feature amounts extracted from the group of image data stored in the user DB 120 and the feature amount data linked to music data, so that music data can be selected from the stored image data. This allows the user to select music based on a group of photographs the user has taken. The slide show generation unit 140 can also create a slide show using another user's template.
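The text does not specify how similarity between a photo collection and a song's template is aggregated, so the following is only one conceivable sketch. It assumes a hypothetical scoring rule (for each template image, take the best-matching user photo, then average over the template) and hypothetical names `rank_songs` and `templates`.

```python
import math


def _cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def rank_songs(user_vectors, templates):
    """Order song IDs by how well the user's photos match each template.

    user_vectors: list of feature vectors from the user's photos.
    templates:    dict mapping song ID -> list of template feature vectors.
    The averaging rule below is an assumption made for this example.
    """
    scores = {}
    for song_id, template_vectors in templates.items():
        if not template_vectors or not user_vectors:
            scores[song_id] = 0.0
            continue
        best_per_slide = [
            max(_cosine(t, u) for u in user_vectors) for t in template_vectors
        ]
        scores[song_id] = sum(best_per_slide) / len(best_per_slide)
    return sorted(scores, key=scores.get, reverse=True)
```

The first entry of the returned list would be the song whose template is best covered by the user's photos.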

The slide show distribution unit 150 transmits a slide show derived from the original slide show to the user terminal 200. Together with the transmitted slide show, it also transmits replacement candidate image data or data corresponding to the replacement candidate image data. The user terminal 200 can thus display the replacement candidate images in a selectable manner, and the user can choose the image to substitute from among them. A slide show derived from the original slide show includes not only the original slide show itself but also slide shows created from it and slide shows derived from those.

The slide show distribution unit 150 streams the music and the slide show in response to a playback instruction from the user terminal 200. The server streams the created slide show when the user requests playback, so the slide show can be played without placing a load on the terminal.

(Template management unit)
FIG. 2 is a block diagram showing the template management unit 110. As shown in FIG. 2, the template management unit 110 includes a music DB 111, an overall impression generation unit 112, an original slide show generation unit 113, a first feature amount extraction unit 114, and a template creation unit 115.

The music DB 111 stores not only information on music data but also slide show data and template data. FIG. 3 shows the data structure of the music DB 111. The music DB 111 can store, for example, a music ID (a unique ID assigned to each song), a music type indicating whether the song has lyrics, the lyrics written line by line, synchronization information giving the start and end time of each lyric line, and slide show images describing the template image displayed for each line. Of these, a slide show consists of the music ID, the synchronization information, and the image specifying information.
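The record layout described above could be modeled roughly as follows. The class and field names (`SongRecord`, `LyricLine`, `start_sec`, and so on) are assumptions chosen for illustration. The specification only lists the kinds of information stored, not a concrete schema.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class LyricLine:
    text: str                             # lyrics of this line
    start_sec: float                      # synchronization info: start time
    end_sec: float                        # synchronization info: end time
    template_image: Optional[str] = None  # image shown while the line plays


@dataclass
class SongRecord:
    song_id: str       # unique music ID
    has_lyrics: bool   # music type: whether lyrics exist
    lines: List[LyricLine] = field(default_factory=list)

    def image_at(self, t: float) -> Optional[str]:
        """Return the image to display at playback time t, if any."""
        for line in self.lines:
            if line.start_sec <= t < line.end_sec:
                return line.template_image
        return None
```

A player would call `image_at` with the current playback position to decide which image to show, which is how the synchronization information ties images to the music.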

The music DB 111 also stores slide shows received from other users' terminals. It treats such a slide show as an original slide show and stores the template created from it as an available template. In this case, a template created from a slide show received from another user's terminal is shared among users as that other user's template.

The overall impression generation unit 112 uses songs that have lyrics as training data and assigns the overall impression conveyed by the lyrics as an overall impression label. This is realized, for example, by the following process. First, training data in which overall impression labels have been assigned to lyrics in advance is prepared. Morphological analysis is applied to the lyrics of each song to break them down into parts of speech. Representative words are extracted from the result as the elements of a lyric feature vector. Using the extracted lyric feature vectors, a two-class (positive/negative) classifier is built for each overall impression word, and for newly input lyrics each overall impression word is judged as applicable or not. All overall impression words that the classifiers judge positive are taken as the overall impression of the song.

Every classifier can use, for example, a Support Vector Machine (SVM). The training data for building the parameters can be created, for example, as follows. First, about 240 songs are prepared as training data, and their lyrics are obtained from the Music lyrics database. A questionnaire is then conducted, with the songs allocated so that about five people respond for each song. The overall impression words to be used are decided from the questionnaire results, and the overall impression words chosen by a majority of respondents are taken as the overall impression of the song. The SVM features are TF-IDF values based on the occurrence probability of the words obtained from all songs.
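The two data-preparation steps above (majority labeling from questionnaire answers, and TF-IDF weighting of lyric words) can be sketched in plain Python. The function names and the specific TF-IDF variant (relative term frequency multiplied by the natural log of inverse document frequency) are assumptions; the text names TF-IDF but does not fix a formula, and training the SVM itself is outside this sketch.

```python
import math
from collections import Counter


def majority_labels(answers):
    """Keep the impression words chosen by more than half of the respondents.

    answers: list of sets of impression words, one set per respondent.
    """
    counts = Counter(word for answer in answers for word in set(answer))
    return {word for word, c in counts.items() if c > len(answers) / 2}


def tf_idf(documents):
    """Compute TF-IDF weights for tokenized lyrics.

    documents: list of token lists, one per song.
    Returns one {word: weight} dict per song.
    """
    n = len(documents)
    # Document frequency: in how many songs each word appears.
    df = Counter(word for doc in documents for word in set(doc))
    vectors = []
    for doc in documents:
        tf = Counter(doc)
        vectors.append(
            {w: (c / len(doc)) * math.log(n / df[w]) for w, c in tf.items()}
        )
    return vectors
```

With this variant, a word that appears in every song receives a weight of zero, so only words that distinguish songs from one another contribute to the feature vector fed to each per-word classifier.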

Words can be obtained with a morphological analyzer (such as a POS tagger or MeCab). The classifiers trained on the teacher data are then used to assign overall impression words to the lyrics of new songs. The overall flow of overall impression generation is described later.

The original slide show generation unit 113 uses the lyrics of the input music data, or estimated keywords, to select image data related to each segment (for example, each line). It then generates a slide show that displays the selected images in synchronization with the music during playback. The image selection process is described later.

The first feature amount extraction unit 114 extracts, from the series of image data specified by the original slide show, the image feature amount corresponding to each piece of image data. Its function of extracting image feature amounts from image data is the same as that of the second feature amount extraction unit 130.

The template creation unit 115 applies the first feature amount extraction unit 114 to the group of image data obtained by the original slide show generation unit 113 and stores the resulting feature amounts in the music DB 111 as a template. The template is created for comparison with the feature amounts of photographs taken by the user (the second feature amounts), so that image data similar to the feature amounts in the template can be extracted automatically. The template creation unit 115 can create templates not only from the original slide show but also from slide shows accumulated in the music DB 111.

FIG. 4 is a table showing a template. The line number column stores the line number of the lyrics, the lyrics column the lyric text, the display time column the display start and end times, the image ID column the ID of the extracted image, the image feature amount column the image feature amount, and the replaceability column a parameter indicating whether the image may be replaced with a personal photograph. The replaceability parameter can be set arbitrarily. The template thus comprises the song ID, the synchronization information, and the image feature amounts.
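One way to picture the template table of FIG. 4 is as a record per lyric line. The sketch below is illustrative only: the class and field names, the use of seconds for display times, and the list-of-floats feature representation are all assumptions, since the specification does not fix a storage format.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class TemplateRow:
    line_no: int                 # line number of the lyrics
    lyrics: str                  # lyric text of that line
    start_sec: float             # display start time (unit assumed)
    end_sec: float               # display end time (unit assumed)
    image_id: str                # ID of the extracted image
    image_feature: List[float]   # first image feature amount (format assumed)
    replaceable: bool = True     # may be replaced with a personal photograph


@dataclass
class Template:
    song_id: str
    rows: List[TemplateRow] = field(default_factory=list)

    def replaceable_rows(self) -> List[TemplateRow]:
        """Rows whose image may be swapped for a user photograph."""
        return [row for row in self.rows if row.replaceable]


template = Template(song_id="song-001", rows=[
    TemplateRow(1, "first line", 0.0, 4.5, "web-1", [0.2, 0.8]),
    TemplateRow(2, "second line", 4.5, 9.0, "web-2", [0.7, 0.1], replaceable=False),
])
print([row.image_id for row in template.replaceable_rows()])
```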

(Configuration of the user terminal)
The user terminal 200 uses the slide show received from the slide show creation server 100, together with the replacement candidate image data or data corresponding to the replacement candidate image data, to display the images corresponding to each segment of the music in a selectable manner. This allows the user to operate the user terminal 200 and create a slide show suited to the music.

The user terminal 200 is, for example, a mobile phone or a PC. As shown in FIG. 1, the user terminal 200 includes a photographing unit 210, an uploader 220, a slide show editing unit 230, and a playback unit 240. The photographing unit 210 takes photographs with a camera and saves them as image data. It can be realized, for example, by a camera application that provides a GUI to the user.

The uploader 220 has a function of selecting a single photograph from the captured photographs or from the gallery (the list of previously taken photographs) and a function of transmitting image data to the slide show creation server 100. The image data to be uploaded to the slide show creation server 100 may be newly captured with the camera or chosen from photographs already registered in the gallery. The uploader 220 also has a sharing function for uploading the selected image data, and functions for attaching tag information and transmitting terminal-specific information when an image is uploaded.

The slide show editing unit 230 has a function of editing a slide show on the user terminal 200. For example, through the GUI, the user can replace image data specified in the slide show with image data of photographs the user has taken, or change the order of the image data in the slide show. A slide show created in this way can serve as an original slide show for other users: image feature amounts are extracted from it and used as a template for those users.

The playback unit 240 has a function of playing back a distributed slide show. The slide show is played together with the distributed music. For example, user operations are accepted through the GUI to select, play, and stop music and slide shows. The slide show editing and playback operations are described later.

[Operation of the slide show creation server]
Next, the operation of the slide show creation server 100 configured as described above is explained.

(Overall impression generation process)
FIG. 5 is a flowchart showing the overall impression generation process. The slide show creation server 100 acquires WEB images based on a song and creates an original slide show. First, when a song ID is input (step S1), lyrics data is extracted from the music DB 111 (step S2), and a lyric feature vector is created by morphological analysis and important-word extraction (step S3). The classifiers created from the teacher data then output, based on the lyric feature vector, whether each overall impression word applies, and the words judged positive are finally assigned as overall impression labels (step S4). The overall impression words are not limited to Japanese.

(Image selection process)
Next, using the overall impression words obtained in this way together with the lyrics of the input song or the estimated keywords, image data related to each line is selected on the WEB. The selected image data is specified in the slide show so that it is displayed in synchronization with the music during playback. Image selection using the lyrics of the input song or the estimated keywords can be performed as follows.

FIG. 6 is a flowchart showing the image selection process using lyrics or estimated keywords. First, in the keyword extraction process, keywords or important words are extracted for each line of lyrics when the song has lyrics as synchronization data, or for each division unit when the song has no lyrics (step T1). Important words can be extracted, for example, as the words with high TF-IDF values. Alternatively, a morphological analyzer can be applied to tag parts of speech, and words can be filtered by part of speech for use as image search keywords.

Images are then searched for using the image search keywords extracted in this way (step T2). The search targets can be WEB services such as FLICKR (registered trademark), personal photo collections, and the like. From the group of images returned by the search, one image is extracted for display, which yields the image to display for each line or division unit (step T3). In this way, the original slide show and the template can be generated.
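Steps T1 to T3 can be sketched roughly as below. Everything concrete here is assumed for illustration: the stop-word filter stands in for TF-IDF or part-of-speech selection, the in-memory tag catalog stands in for a WEB image service, and taking the top-ranked hit is just one possible way of extracting the single image to display.

```python
from typing import Callable, Dict, List, Optional, Set

# Hypothetical stop-word list used in place of TF-IDF / part-of-speech filtering.
STOP_WORDS = {"the", "a", "in", "of", "and", "on"}


def extract_keywords(line: str) -> List[str]:
    """Step T1: pick search keywords from one lyric line."""
    return [w for w in line.lower().split() if w not in STOP_WORDS]


def make_catalog_search(catalog: Dict[str, Set[str]]) -> Callable[[List[str]], List[str]]:
    """Build a toy search function over images described by tag sets."""
    def search(keywords: List[str]) -> List[str]:
        scored = [(len(tags & set(keywords)), image_id)
                  for image_id, tags in catalog.items()]
        ranked = sorted(scored, key=lambda s: (-s[0], s[1]))
        return [image_id for hits, image_id in ranked if hits > 0]
    return search


def select_images(lines: List[str],
                  search: Callable[[List[str]], List[str]]) -> List[Optional[str]]:
    """Steps T2 and T3: search per line and keep one image per line."""
    selected = []
    for line in lines:
        candidates = search(extract_keywords(line))                # step T2
        selected.append(candidates[0] if candidates else None)    # step T3
    return selected


catalog = {"img1": {"rain", "city"}, "img2": {"sun", "beach"}}
lyrics = ["Rain in the city", "sun on the beach", "silence"]
print(select_images(lyrics, make_catalog_search(catalog)))
```

Lines for which the search returns nothing are left as `None` here; how such gaps are filled is not covered by this sketch.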

(Slide show creation process)
Next, a slide show in which images have been replaced is created using the original slide show and the template. FIG. 7 is a flowchart showing the slide show creation process. First, the first image feature amounts, extracted as the image feature amounts of the image data, are prepared as the template corresponding to the original slide show that specifies a series of image data (step P1).

Meanwhile, the image feature amounts of image data obtained from the image data source are extracted as second image feature amounts (step P2). The similarity between the prepared first image feature amounts and the extracted second image feature amounts is determined (step P3), and a slide show is generated in which image data specified by the original slide show is replaced with similar image data from the image data source (step P4). In this way, a slide show unique to the user can be created.
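The comparison and replacement in steps P3 and P4 might look like the following sketch. Cosine similarity, the threshold value of 0.9, and the tuple layout of the template are assumptions chosen for illustration; the specification only requires that a similarity between first and second image feature amounts be determined, and the claims mention comparison against a predetermined threshold and a replaceability parameter.

```python
import math
from typing import Dict, List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity, used here as an assumed similarity measure."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_slideshow(template: List[Tuple[str, Sequence[float], bool]],
                    user_images: Dict[str, Sequence[float]],
                    threshold: float = 0.9) -> List[str]:
    """Replace each replaceable template image with the most similar user image,
    but only when the similarity exceeds the threshold (steps P3 and P4)."""
    result = []
    for image_id, first_feature, replaceable in template:
        best_id, best_sim = image_id, threshold
        if replaceable:
            for user_id, second_feature in user_images.items():
                sim = cosine(first_feature, second_feature)
                if sim > best_sim:
                    best_id, best_sim = user_id, sim
        result.append(best_id)
    return result


# (image ID, first image feature amount, replaceable flag) per lyric line.
template = [("web1", [1.0, 0.0], True),
            ("web2", [0.0, 1.0], True),
            ("web3", [1.0, 1.0], False)]
user_images = {"u1": [0.9, 0.1], "u2": [1.0, 1.0]}
print(build_slideshow(template, user_images))
```

In this example only the first image is swapped: the second has no sufficiently similar user photograph, and the third is marked as not replaceable even though `u2` matches it exactly.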

[Screen display examples]
The operation of the slide show creation system 10 described above is explained with reference to display examples of the screen 300. FIG. 8 shows an example of the photo sharing screen on the user terminal 200. On the screen 300 of the user terminal 200, the "Gallery" button 310 selects the gallery and the "Shoot" button 315 selects the photographs that have been taken, and image data M10 can be selected from among them.

FIGS. 9 and 10 both show examples of the image upload screen. On the screen 300, the uploader application can be launched with the "Share" button 320 in order to upload the image data M10 of the one selected photograph to the slide show creation server 100. When the uploader application is launched on the user terminal 200, the selected image data M10 is displayed, and pressing the upload button 330 uploads the image data to the slide show creation server 100. An input box 335 for attaching tag information at upload time can also be provided, and the user can enter text in it. When multiple tags are set, they can be separated with, for example, a half-width comma as the delimiter.
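Splitting the text of the input box 335 into individual tags could be done along the following lines. The function name and the choice to trim whitespace and drop empty entries are assumptions; the specification only mentions the half-width comma as an example delimiter.

```python
from typing import List


def parse_tags(text: str) -> List[str]:
    """Split comma-delimited tag text, trimming spaces and skipping empty entries."""
    return [tag.strip() for tag in text.split(",") if tag.strip()]


print(parse_tags("sea, summer ,,trip"))
```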

FIG. 11 shows an example of the top screen for music selection. The screen 300 shown in FIG. 11 appears immediately after the music slide show playback application is launched. It displays a "START" button 345 together with a logo, a background, or a cover 341 of the music slide show. For example, pressing the "START" button 345 transitions to the "music selection screen".

FIGS. 12 and 13 each show an example of the music selection screen. In FIG. 12, a list of playable songs is displayed on the "music selection screen". For example, a song list acquired from the slide show creation server 100 or a song list stored on the user terminal 200 is displayed.

On the "music selection screen", a song can be selected by tapping the area showing its title or artist name. Search and sort functions can also be added, with new buttons for them. Pressing the saved slide show button 348 transitions to the "saved slide show list screen". Music genres can also be displayed, as in FIG. 13. By selecting a genre, the user can choose a song from within that genre category or play all songs belonging to the genre.

FIG. 14 shows an example of the slide show editing screen. The screen shown in FIG. 14 displays the slide show of the selected song. Once a song (or genre) has been selected, the selected song can be played. Pressing the "Play" button 350 plays the slide show together with the music. Pressing the "Edit images" button 355 transitions to the "image editing screen".

FIG. 15 shows an example of the image editing screen for a slide show. As shown in FIG. 15, representative image data A to C are displayed, for example, one per scene (row of synchronization data). The user can edit the slide show on the screen 300 of the user terminal 200.

For example, through the GUI, the user can swap image data specified in the slide show for designated image data, or change the order of the photographs. Selecting any image may transition to the image selection screen, where the target image data can be changed. Pressing the edit end button 360 transitions to the slide show playback screen.

When a specific image is selected on the image editing screen, the screen transitions to a list of candidate images selected by the slide show creation server 100. The candidate images B2 to B10 are those recommended by FLICKR. FIG. 16 shows an example of the candidate image list screen. The slide show creation server 100 can also convert each piece of image data to a thumbnail, so that resized versions B2 to B10 (data corresponding to the replacement candidate image data) are displayed. When one of the images is selected, the representative image B (the image currently selected) is changed and the screen transitions to the image editing screen.

Transitions to the selection screens for uploaded images and for gallery images are possible. Selecting the previous button 384 or the next button 385 moves between scenes. By default, images selected from a WEB service such as FLICKR are recommended, but pressing the "Select from upload images" button 370 lets the user also choose from images the user has uploaded in the past.

FIGS. 17 and 18 both show examples of the selection screen for uploaded images. In the screen example of FIG. 18, it is possible to transition to the selection screen from the gallery or from the received candidate images. Pressing the "Select from FLICKR images" button 380 makes it possible to choose from images selected from a WEB service such as FLICKR. In addition, selecting the previous button 384 or the next button 385 moves between scenes.

FIG. 19 shows an example of the change confirmation screen. When one image is selected on the screen shown in FIG. 17 or FIG. 18, a confirmation screen such as that in FIG. 19 is displayed and the image can be replaced. Selecting the "Change to this image" button 391 changes the image and transitions to the image editing screen. Selecting the "Delete" button 392 deletes the image from the server (its thumbnail is deleted at the same time). Selecting the "Back" button 393 returns to the original image selection screen.

FIG. 20 shows an example of the gallery image selection screen. In the example of FIG. 20, a list of photographs taken with the terminal is displayed. The image data M40 of the captured photographs and the image data M50 of the gallery are displayed side by side. When one of these images is selected, the currently selected image is changed and the screen transitions to the image editing screen.

A list of the slide shows saved after the playback application ends can also be displayed. FIG. 21 shows an example of the saved slide show list screen. Any of the saved slide shows 401 to 403 can be played by selecting it and pressing the play button 395. Slide shows that are no longer needed can be deleted with the delete button 396. Tapping the title of a slide show makes it possible to edit the title.

10 Slide show creation system
100 Slide show creation server
110 Template management unit
111 Music DB
112 Overall impression generation unit
113 Original slide show generation unit
114 First feature amount extraction unit
115 Template creation unit
120 User DB
130 Second feature amount extraction unit
140 Slide show generation unit
150 Slide show distribution unit
200 User terminal
210 Photographing unit
220 Uploader
230 Slide show editing unit
240 Playback unit
300 Screen

Claims

A slide show creation server that creates a slide show for displaying images in accordance with the reproduction of music data, comprising:
a template management unit that automatically generates, from images on the Web, an original slide show specifying a series of image data matching the characteristics of a specific song, and stores as a template a table that associates with the original slide show the song ID and synchronization information of the specific song, the image IDs of the series of image data, and a first image feature quantity extracted as the image feature quantity of the series of image data;
a second feature amount extraction unit that permits uploading of image data only by a user identified as a specific user, uses the image data stored in a user DB as an image data source, and extracts the image feature quantity of image data obtained from the image data source as a second image feature quantity; and
a slide show generation unit that uses the first image feature quantity obtained from the stored template and the extracted second image feature quantity to determine the similarity between image data specified by the original slide show and image data obtained from the image data source, and generates a slide show in which the image data specified by the original slide show is replaced with similar image data from the image data source.

The slide show creation server according to claim 1, wherein the slide show generation unit selects, from the stored user DB, image data associated with the identification information of the specific user as the image data used for replacement.

The slide show creation server according to claim 1 or 2, wherein the slide show generation unit generates a slide show in which the image data specified in the original slide show is replaced when the similarity obtained by comparing the first and second image feature quantities is larger than a predetermined threshold.

The slide show creation server according to any one of claims 1 to 3, wherein the slide show generation unit checks the replaceability parameter set for the image data specified in the original slide show, generates a slide show with the image data replaced for image data designated as replaceable by the replaceability parameter, and does not generate a slide show with the image data replaced for image data designated as not replaceable by the replaceability parameter.

The slide show creation server according to any one of claims 1 to 4, wherein the second feature amount extraction unit uses image data available online as the image data source and extracts its image feature quantity as the second image feature quantity.

The slide show creation server according to any one of claims 1 to 5, wherein the slide show generation unit selects, based on the similarity determination using the first and second image feature quantities, music data associated with a slide show that specifies image data similar to the image data of the image data source.

The slide show creation server according to any one of claims 1 to 6, wherein the template management unit handles a slide show created by another user as the original slide show.

The slide show creation server according to any one of claims 1 to 7, further comprising a slide show distribution unit that transmits a slide show derived from the original slide show to a user terminal and transmits, together with the transmitted slide show, replacement candidate image data or data corresponding to the replacement candidate image data.

The slide show creation server according to claim 8, wherein the slide show generation unit generates a slide show in which the image data specified in the transmitted slide show is replaced with the replacement candidate image data designated from the user terminal.

The slide show creation server according to claim 8 or 9, further comprising a slide show distribution unit that streams the music and the slide show in response to a reproduction instruction from the user terminal.

10. The slide show received from the slide show creation server according to claim 8 or 9, and the image data of the replacement candidate or the data corresponding to the image data of the replacement candidate are used to selectably display an image corresponding to a music segment. A user terminal characterized by that.

A slide show creation method for creating a slide show that displays images in accordance with the reproduction of music data, comprising the steps of:
automatically generating, from images on the Web, an original slide show specifying a series of image data matching the characteristics of a specific song, and storing as a template a table that associates with the original slide show the song ID and synchronization information of the specific song, the image IDs of the series of image data, and a first image feature quantity extracted as the image feature quantity of the series of image data;
permitting uploading of image data only by a user identified as a specific user, using the image data stored in a user DB as an image data source, and extracting the image feature quantity of image data obtained from the image data source as a second image feature quantity; and
using the first image feature quantity obtained from the stored template and the extracted second image feature quantity to determine the similarity between image data specified by the original slide show and image data obtained from the image data source, and generating a slide show in which the image data specified by the original slide show is replaced with similar image data from the image data source.