JP5474591B2

JP5474591B2 - Image selection apparatus, image selection method, and image selection program

Info

Publication number: JP5474591B2
Application number: JP2010025933A
Authority: JP
Inventors: 広海石先; 啓一郎帆足; 康弘滝嶋
Original assignee: KDDI Corp
Current assignee: KDDI Corp
Priority date: 2010-02-08
Filing date: 2010-02-08
Publication date: 2014-04-16
Anticipated expiration: 2030-02-08
Also published as: JP2011164865A

Description

本発明は、コンテンツに関連する画像を選定する画像選定装置、画像選定方法および画像選定プログラムに関する。 The present invention relates to an image selection device, an image selection method, and an image selection program for selecting an image related to content.

従来、画像表示に合せてＢＧＭ等の楽曲を再生する装置が知られている。たとえば特許文献１記載の情報再生装置は、デジタルカメラで撮影した画像のスライドショー再生時に、その再生する画像の撮影時期に見合うＢＧＭを再生することで映像鑑賞効果を高めている。画像を主体として、音楽を検索して自動でＢＧＭを付与させる方法を採用し、曲の発売日、または、曲の人気ランキングの最高となった期間などの日付と、撮影時期をマッチングさせて音楽を検索している。 2. Description of the Related Art Conventionally, an apparatus that reproduces music such as BGM in accordance with image display is known. For example, the information playback apparatus described in Patent Document 1 enhances the video viewing effect by playing back a BGM that matches the shooting time of an image to be played back when playing back a slide show of images taken with a digital camera. Employs a method of searching for music and assigning BGM automatically, focusing on images, and music by matching the date of release of the song or the period when the song's popularity ranking is the highest with the shooting time. Searching for.

特許文献２記載の音楽配信サーバは、楽曲を再生させる際に、データベースに記憶されている楽曲を選択し、該当楽曲に同期させて表示させるべき歌詞・解説・画像などの項目を指定する。そして、該当楽曲と、該当楽曲のヘッダに同期再生のための時間間隔を書き込んで端末に送信し、楽曲とその他情報を同期させて再生させている。 The music distribution server described in Patent Literature 2 selects a song stored in the database when reproducing a song, and specifies items such as lyrics, commentary, and images to be displayed in synchronization with the corresponding song. And the time interval for synchronous reproduction is written in the corresponding music and the header of the corresponding music and transmitted to the terminal, and the music and other information are synchronized and reproduced.

特許文献３記載の携帯情報端末は、楽曲再生時に、音声認識によって歌詞を抽出し、その抽出された歌詞から所定のルールに基づいてキーワードを抽出し、キーワードに基づいて、ローカルストレージやインターネット上のＷｅｂページから画像を検索し、楽曲再生時に画像を表示させている。 The portable information terminal described in Patent Literature 3 extracts lyrics by voice recognition during music reproduction, extracts keywords from the extracted lyrics based on a predetermined rule, and uses local storage or the Internet on the basis of the keywords. Images are retrieved from a web page and displayed when music is played.

所定のルールとは、たとえば、ある設定期間内の歌詞単語の中で最も多く使用された単語を抽出キーワードとするようなルールや、ある設定期間内の歌詞単語の中で最も大きく発声された単語を抽出キーワードとするようなルール、ある設定期間内の複数の単語の中からランダムに選択した単語をキーワードとするようなルールである。 The predetermined rule is, for example, a rule that uses the most frequently used words in a lyric word within a certain setting period as an extraction keyword, or a word that is uttered most among lyric words within a certain setting period. Are extracted keywords, and a word randomly selected from a plurality of words within a certain setting period is used as a keyword.

インターネット上のウェブページを検索先として画像検索を行う場合、キーワードを用いてインターネット上の検索エンジンなどを用いて画像を検索する。また、上記インターネット上のウェブページを検索先とする場合、好ましくない画像が検索されるのを防ぐためのフィルタリング処理、あるいは、たとえば所望のジャンル等の画像のみが検索されるようなフィルタリング処理を行うことも可能にしている。 When an image search is performed using a web page on the Internet as a search destination, an image is searched using a search engine on the Internet using a keyword. In addition, when the web page on the Internet is a search destination, a filtering process for preventing an undesired image from being searched or a filtering process for searching only an image of a desired genre, for example, is performed. It also makes it possible.

なお、フィルタリング処理は、上記インターネットを経由して画像検索を行う際だけでなく、たとえば、上記キーワード抽出部にてキーワードを抽出する際に行っても良いとしている。その結果、上記キーワード抽出部では、好ましくない画像が検索されてしまう可能性が高いキーワードを抽出しないようなフィルタリング処理や、逆に、たとえば所望のジャンル等の画像が検索される可能性が高いキーワードのみを抽出するようなフィルタリング処理も可能にしている。 Note that the filtering process may be performed not only when the image search is performed via the Internet but also when the keyword extraction unit extracts a keyword, for example. As a result, in the keyword extraction unit, a filtering process that does not extract a keyword that is likely to cause an undesired image to be searched, and conversely, a keyword that is highly likely to search for an image of a desired genre, for example. Filtering processing that extracts only this is also possible.

特許文献４記載の画像提示装置は、楽曲から歌詞を抽出し、抽出された歌詞の出現時間を特定し、あらかじめ複数の視覚効果と視覚効果の表示に要する時間情報を対応させておき、抽出された歌詞に基づいて視覚効果を決定し、楽曲の歌詞の内容に合わせて、動きを伴う視覚効果を付与している。 The image presentation device described in Patent Document 4 extracts lyrics from music, specifies the appearance time of the extracted lyrics, associates a plurality of visual effects with time information required to display the visual effects in advance, and extracts them. The visual effect is determined based on the lyrics, and the visual effect with movement is given according to the contents of the lyrics of the music.

視覚効果データベースは、単語と視覚効果とを対応づけて保持している。視覚効果は、抽出された歌詞中の単語が視覚効果データベースに存在するか探索し、もし単語が存在した場合には、その単語に対応する視覚効果とその視覚効果の所要時間とを視覚効果データベースから取得している。 The visual effect database stores words and visual effects in association with each other. The visual effect is searched for whether or not the word in the extracted lyrics exists in the visual effect database. If the word exists, the visual effect corresponding to the word and the time required for the visual effect are displayed in the visual effect database. Is getting from.

また、楽曲の再生に同期させてＷｅｂ画像を表示させるシステムや歌詞の印象に基づく楽曲検索のための楽曲自動分類技術について研究開発がなされている（たとえば非特許文献１〜５）。 In addition, research and development have been conducted on a system for displaying a Web image in synchronization with the reproduction of music and an automatic music classification technique for searching music based on the impression of lyrics (for example, Non-Patent Documents 1 to 5).

特開２００６−１６４２２９号公報JP 2006-164229 A 特開２００２−７３０４９号公報JP 2002-73049 A 特開２００８−８９５４号公報JP 2008-8954 A 特開２００６−１５４６２６号公報JP 2006-154626 A

舟澤慎太郎、石先広海、帆足啓一郎、滝嶋康弘、甲藤二郎、「歌詞特徴を考慮したＷｅｂ画像と楽曲同期再生システムの提案」、第8回情報科学技術フォーラム、2009、E-034Shintaro Funasawa, Hiromi Ishibe, Keiichiro Hoashi, Yasuhiro Takishima, Jiro Katto, “Proposal of Web Image and Music Synchronized Playback System Considering Lyric Features”, 8th Information Technology Forum, 2009, E-034 D. A. Shamma, B. Pardo, and K. J. Hammond, “Music Story: a Personalized Music Video Creator, Proceedings of the 13th Annual ACM International Conference on Multimedia”, 2005, pp.563-566D. A. Shamma, B. Pardo, and K. J. Hammond, “Music Story: a Personalized Music Video Creator, Proceedings of the 13th Annual ACM International Conference on Multimedia”, 2005, pp.563-566 R. Cai, L. Zhang, F. Jing, W. Lai, and W. -Y. Ma, “Automated Music Video Generation Using Web”R. Cai, L. Zhang, F. Jing, W. Lai, and W. -Y. Ma, “Automated Music Video Generation Using Web” “Image Resource, Proceedings of IEEE International Conference on Acoustic, Speech, and Signal Processing”, 2007, 2, pp.737-740.“Image Resource, Proceedings of IEEE International Conference on Acoustic, Speech, and Signal Processing”, 2007, 2, pp.737-740. 舟澤慎太郎、石先広海、帆足啓一郎、滝嶋康弘、甲藤二郎、歌詞の印象に基づく楽曲検索のための楽曲自動分類に関する検討、第71回情報処理学会全国大会、2009、5R-2.Shintaro Funasawa, Hiromi Ishibe, Keiichiro Hoashi, Yasuhiro Takishima, Jiro Katto, Study on automatic music classification for music search based on the impression of lyrics, 71st National Convention on Information Processing, 2009, 5R-2.

上記のように、楽曲等に合せてＷｅｂ上の画像を利用する技術は存在するが、基本的に利用する画像はユーザが選択しなければならない。そして、ユーザ自身がＷｅｂ画像から画像を選び、コンテンツに付与しようとすれば、その作業に膨大な労力を要し、たとえばスライドショーなどを生成する際の画像選択の労力には膨大な作業が必要となる。また、従来技術を利用した場合には共有サイトのランキングに基づいて画像を選定する場合には、必ずしもコンテンツに適した画像を選定できていない。 As described above, there is a technique for using an image on the Web in accordance with a music piece or the like, but the user must basically select an image to be used. If the user himself selects an image from a Web image and tries to add it to the content, the work requires an enormous amount of work. For example, the work of selecting an image when generating a slide show or the like requires an enormous amount of work. Become. In addition, when the conventional technology is used, when an image is selected based on the ranking of the shared site, an image suitable for the content cannot always be selected.

本発明は、このような事情に鑑みてなされたものであり、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷｅｂで画像を選ぶ労力を無くすことができる画像選定装置、画像選定方法および画像選定プログラムを提供することを目的とする。 The present invention has been made in view of such circumstances, and an image selection apparatus, an image selection method, and an image selection apparatus that can automatically select an image suitable for content and eliminate the effort of the user himself to select an image on the Web. The purpose is to provide an image selection program.

（１）上記の目的を達成するため、本発明の画像選定装置は、コンテンツに関連する画像を選定する画像選定装置であって、特定のコンテンツから１または複数のキーワードを抽出するキーワード抽出部と、ネットワーク上で取得可能な画像に付与された１または複数のタグを抽出するタグ抽出部と、前記抽出されたキーワードと前記取得されたタグとの語句の関連度を算出する語句関連度算出部と、前記算出された語句の関連度を用いて前記コンテンツと適合度の高い画像を選定する選定部とを備えることを特徴としている。 (1) In order to achieve the above object, an image selection device according to the present invention is an image selection device that selects an image related to content, and includes a keyword extraction unit that extracts one or a plurality of keywords from specific content; A tag extraction unit that extracts one or a plurality of tags attached to an image that can be acquired on the network, and a phrase relevance calculation unit that calculates the relevance of the phrase between the extracted keyword and the acquired tag And a selection unit that selects an image having a high degree of matching with the content using the calculated degree of association of the phrases.

このように、コンテンツに関するキーワードと画像に付与されたタグとの語句の関連度を用いて、画像を選定するため、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷＥＢで画像を選ぶ労力を無くすことができる。 In this way, since the image is selected using the degree of association between the keyword related to the content and the tag attached to the image, an image suitable for the content can be automatically selected, and the user himself has the effort to select an image on the WEB. Can be eliminated.

（２）また、本発明の画像選定装置は、前記キーワード抽出部は、特定の楽曲から１または複数のキーワードを抽出することを特徴としている。これにより、楽曲に適した画像を選定することができ、楽曲再生時の画像表示に利用することができる。 (2) Further, the image selection device of the present invention is characterized in that the keyword extraction unit extracts one or a plurality of keywords from a specific music piece. Thereby, an image suitable for music can be selected, and can be used for image display during music reproduction.

（３）また、本発明の画像選定装置は、前記語句関連度算出部は、前記抽出されたキーワードのうちの１語句と前記取得されたタグのうちの１語句との共起確率を用いて前記語句の関連度を算出することを特徴としている。このように共起確率を利用することで、画像選定に適した語句の関連度を算出できる。 (3) Further, in the image selection device according to the present invention, the phrase relevance calculation unit uses a co-occurrence probability between one phrase of the extracted keywords and one phrase of the acquired tags. The degree of relevance of the word is calculated. In this way, by using the co-occurrence probability, it is possible to calculate the degree of association of words suitable for image selection.

（４）また、本発明の画像選定装置は、前記語句関連度算出部は、前記抽出されたキーワードのうちの１語句と同じカテゴリに属する他の語句に対して、前記取得されたタグのうちの語句との共起確率分の寄与を差し引いて前記語句の関連度を算出することを特徴徴としている。このように同じカテゴリの別の語句の寄与を差し引くことで、キーワードとタグの関連度を正確に評価することができ、画像選定に適した語句の関連度を算出できる。 (4) Further, in the image selection device according to the present invention, the phrase relevance calculation unit is configured so that, for other phrases belonging to the same category as the one phrase among the extracted keywords, The relevance of the phrase is calculated by subtracting the contribution of the co-occurrence probability with the phrase. In this way, by subtracting the contribution of another phrase in the same category, the degree of association between the keyword and the tag can be accurately evaluated, and the degree of association of the phrase suitable for image selection can be calculated.

（５）また、本発明の画像選定方法は、コンテンツに関連する画像を選定する画像選定方法であって、特定のコンテンツから１または複数のキーワードを抽出するステップと、ネットワーク上で取得可能な画像に付与された１または複数のタグを取得するステップと、前記抽出されたキーワードと前記取得されたタグとの語句の関連度を算出するステップと、前記算出された語句の関連度を用いて前記コンテンツと関連の高い画像を選定するステップとを含むことを特徴としている。これにより、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷＥＢで画像を選ぶ労力を無くすことができる。 (5) The image selection method of the present invention is an image selection method for selecting an image related to content, the step of extracting one or a plurality of keywords from specific content, and an image obtainable on a network. Using the step of acquiring one or more tags attached to the tag, calculating the relevance level of the phrase between the extracted keyword and the acquired tag, and using the calculated relevance level of the phrase And a step of selecting an image highly related to the content. As a result, an image suitable for the content can be automatically selected, and the user can eliminate the labor of selecting an image by WEB.

（６）また、本発明の画像選定プログラムは、コンテンツに関連する画像を選定させる画像選定プログラムであって、特定のコンテンツから１または複数のキーワードを抽出する処理と、ネットワーク上で取得可能な画像に付与された１または複数のタグを取得する処理と、前記抽出されたキーワードと前記取得されたタグとの語句の関連度を算出する処理と、前記算出された語句の関連度を用いて前記コンテンツと関連の高い画像を選定する処理とをコンピュータに実行させることを特徴としている。これにより、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷＥＢで画像を選ぶ労力を無くすことができる。 (6) The image selection program according to the present invention is an image selection program for selecting an image related to content, and includes a process for extracting one or a plurality of keywords from specific content and an image that can be acquired on a network. Using the processing for acquiring one or more tags assigned to, the processing for calculating the degree of association between the extracted keyword and the acquired tag, and the degree of association between the calculated words It is characterized by having a computer execute processing for selecting an image highly related to content. As a result, an image suitable for the content can be automatically selected, and the user can eliminate the labor of selecting an image by WEB.

本発明によれば、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷｅｂで画像を選ぶ労力を無くすことができる。 According to the present invention, it is possible to automatically select an image suitable for content, and it is possible to eliminate the effort of the user himself to select an image on the Web.

第１の実施形態に係る画像選定装置の構成を示すブロック図である。It is a block diagram which shows the structure of the image selection apparatus which concerns on 1st Embodiment. モジュールの構成を示すブロック図である。It is a block diagram which shows the structure of a module. 第１の実施形態に係る画像選定装置の動作の一例を示すフローチャートである。It is a flowchart which shows an example of operation | movement of the image selection apparatus which concerns on 1st Embodiment. 語句関連度算出の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of word relevance calculation. 画像選定の動作を示すフローチャートである。It is a flowchart which shows the operation | movement of image selection. 語句関連度辞書の一例を示すテーブルである。It is a table which shows an example of a phrase related degree dictionary. 関連タグと語句関連度の抽出結果の一例を示すテーブルである。It is a table which shows an example of the extraction result of a related tag and a phrase relevance degree. 第２の実施形態に係る画像選定装置の構成を示すブロック図である。It is a block diagram which shows the structure of the image selection apparatus which concerns on 2nd Embodiment. モジュールの構成を示すブロック図である。It is a block diagram which shows the structure of a module.

以下に、本発明の実施の形態について、図面を参照しながら説明する。説明の理解を容易にするため、各図面において同一の構成要素に対しては同一の参照番号を付し、重複する説明は省略する。 Embodiments of the present invention will be described below with reference to the drawings. In order to facilitate understanding of the description, the same reference numerals are given to the same components in the respective drawings, and duplicate descriptions are omitted.

［第１の実施形態］
（画像選定装置の構成）
図１は、画像選定装置の構成を示すブロック図である。画像選定装置１００は、ユーザＵが指定する楽曲等のコンテンツに関連する画像を選定する。コンテンツには、楽曲以外にもブログや、映像が含まれる。画像選定装置１００には、たとえばＰＣまたは携帯電話機が用いられる。図１に示すように、画像選定装置１００は、リクエスト受付部１１１、データベース１１２、画像検索モジュール１２０、語句関連度辞書生成モジュール１３０、画像選定モジュール１４０、画像管理部１５１、表示時間管理モジュール１６０、再生制御部１７１、再生部１７２、歌詞表示部１７３および画像表示部１７４を備えている。 [First Embodiment]
(Configuration of image selection device)
FIG. 1 is a block diagram showing the configuration of the image selection apparatus. The image selection device 100 selects an image related to content such as music specified by the user U. In addition to music, the content includes blogs and videos. For the image selection device 100, for example, a PC or a mobile phone is used. As shown in FIG. 1, the image selection apparatus 100 includes a request reception unit 111, a database 112, an image search module 120, a phrase relevance dictionary generation module 130, an image selection module 140, an image management unit 151, a display time management module 160, A playback control unit 171, a playback unit 172, a lyrics display unit 173, and an image display unit 174 are provided.

リクエスト受付部１１１は、ユーザＵによる楽曲や画像のリクエストの入力を受け付ける。データベース１１２は、コンテンツ及び、コンテンツに対するキーワードを格納する。たとえば、コンテンツを楽曲とすると、楽曲の音源、歌詞情報、楽曲と歌詞の同期情報などを格納する。また、該当コンテンツに対するキーワードを格納し、たとえば、コンテンツを表現するキーワードを印象情報として格納する。 The request receiving unit 111 receives an input of a music or image request from the user U. The database 112 stores content and keywords for the content. For example, if the content is a music, the sound source of the music, the lyrics information, the synchronization information of the music and the lyrics, and the like are stored. In addition, a keyword for the corresponding content is stored, for example, a keyword expressing the content is stored as impression information.

たとえば、「季節」のカテゴリに、「春」、「夏」、「秋」、「冬」の印象情報を格納する。「時間帯」のカテゴリに、「朝」、「昼」、「夕方」、「夜」の印象情報を格納してもよい。また、「天候」のカテゴリに、「晴れ」、「曇り」、「雨」、「雪」、「虹」の印象情報を格納してもよい。 For example, impression information of “spring”, “summer”, “autumn”, and “winter” is stored in the “season” category. Impression information of “morning”, “daytime”, “evening”, and “night” may be stored in the “time zone” category. Further, impression information of “sunny”, “cloudy”, “rain”, “snow”, and “rainbow” may be stored in the “weather” category.

画像検索モジュール１２０は、たとえばコンテンツを楽曲とすると、ユーザにより入力された楽曲の歌詞情報をもとに、スライドショーを構成する候補となる画像（以下、候補画像）をＷｅｂから検索する。具体的には、歌詞に出現する名詞と、キーワードとして歌詞全体の印象を表現する単語を用いて最適なクエリを選定する。そして、そのクエリでＷｅｂ画像検索を行い、歌詞の各行に対して最大1500枚程度の候補画像を取得する。なお、Ｗｅｂ上で公開されている日本語形態素解析エンジンを用いて歌詞情報から名詞抽出を行うことができる。また、検索対象となる画像データベースとして、Flickr（登録商標）等の画像共有サイト４００を用い、画像データに付与されているソーシャルタグをもとに画像検索を行うことができる。 For example, if the content is music, the image search module 120 searches the Web for images (hereinafter referred to as candidate images) that are candidates for the slide show based on the lyrics information of the music input by the user. Specifically, an optimal query is selected using nouns appearing in the lyrics and words expressing the impression of the entire lyrics as keywords. Then, a Web image search is performed using the query, and a maximum of about 1500 candidate images are acquired for each line of lyrics. It is possible to extract nouns from the lyric information using a Japanese morphological analysis engine published on the Web. Further, as an image database to be searched, an image sharing site 400 such as Flickr (registered trademark) can be used, and an image search can be performed based on a social tag attached to the image data.

語句関連度辞書生成モジュール１３０は、コンテンツに付与されたキーワードに基づいて、キーワードと候補画像のタグとの語句の関連度を計算する。また、計算された語句の関連度を辞書として保管する。 The word / phrase relevance dictionary generation module 130 calculates the word / phrase relevance between the keyword and the tag of the candidate image based on the keyword assigned to the content. In addition, the calculated degree of association is stored as a dictionary.

画像選定モジュール１４０は、画像検索モジュール１２０によって得られた画像群から、コンテンツに付与されたキーワードに適した画像を選定する。各候補画像に対して、コンテンツの全体印象語との適合度を表現するscoreを算出し、その値が最も高い画像を選定することで、候補画像の中から最終的に用いる画像を選択する。 The image selection module 140 selects an image suitable for the keyword assigned to the content from the image group obtained by the image search module 120. For each candidate image, a score expressing the degree of matching with the overall impression word of the content is calculated, and an image having the highest value is selected to select an image to be finally used from the candidate images.

たとえば、適合度のscoreは、画像に付与されているすべてのソーシャルタグと全体印象語との関連強さをもとに算出する。これにより、全体印象語が直接タグとして付与されていない画像でも、”全体印象語らしさ”を表現することができる。そして、それをもとに画像を選定することで、スライドショー全体の統一感を生み出すことができる。また、楽曲に付与されている全体印象語の組み合わせによりscore算出の指標が変化するため、別々の楽曲に同じ画像が選定されるケースが少なくなり、より多様性のあるスライドショーを作成できる。 For example, the score of fitness is calculated based on the strength of association between all social tags attached to the image and the overall impression word. This makes it possible to express “likeness of the overall impression word” even in an image in which the overall impression word is not directly assigned as a tag. Then, by selecting images based on that, it is possible to create a unified feeling for the entire slide show. In addition, since the score calculation index changes depending on the combination of the overall impression words assigned to the music, the same image is not selected for different music, and a more diverse slide show can be created.

語句関連度辞書生成モジュール１３０および画像選定モジュール１４０の詳細は、後述する。画像管理部１５１は、楽曲の各管理区分と選定された画像とを対応付けて管理する。画像管理部１５１は、再生時に楽曲の各区分と画像との対応により、各区分の歌詞表示に対応させて画像を表示することができる。 Details of the phrase relevance dictionary generation module 130 and the image selection module 140 will be described later. The image management unit 151 manages each music management category in association with the selected image. The image management unit 151 can display an image corresponding to the lyrics display of each section by the correspondence between each section of the music and the image during reproduction.

表示時間管理モジュール１６０は、スライドショーにおいて、各画像を表示する時間を適切にするために、画像切り替えタイミングの再構成を行う。その際には、ネットワーク上の楽曲データベース５００や歌詞データベース６００を利用する。 The display time management module 160 reconfigures the image switching timing in order to make the time for displaying each image appropriate in the slide show. At that time, the music database 500 and the lyrics database 600 on the network are used.

単純に歌詞の行単位で歌詞とともに表示する画像を切り替えると、その歌詞の行の長さに画像の表示時間が依存し、適切な表示時間とならない事態が生じる。これを防止するため、画像の内容を十分に把握できるように表示時間を十分にとり、退屈感を生じさせない程度に表示時間を短くする。すなわち、表示時間の短い行は周辺の行と結合し、結合した区間では１枚の画像を表示し続け、一方表示時間の長い行は分割を行い、１行の中でも複数の画像を表示する。さらに、楽曲の画像表示時間の最頻値を基準に、間奏区間における画像切り替えタイミングも設定する。 If the image to be displayed together with the lyrics is simply switched in units of lyrics, the display time of the image depends on the length of the line of the lyrics, and an appropriate display time does not occur. In order to prevent this, a sufficient display time is taken so that the contents of the image can be sufficiently grasped, and the display time is shortened to such an extent that a feeling of boredom does not occur. That is, a line with a short display time is combined with surrounding lines, and one image is continuously displayed in the combined section, while a line with a long display time is divided, and a plurality of images are displayed in one line. Furthermore, the image switching timing in the interlude section is also set based on the mode value of the music image display time.

再生制御部１７１は、歌詞データベース６００と通信し、楽曲の再生とそれに同期させた歌詞表示および画像表示を制御する。再生部１７２は、スピーカやヘッドホン等の出力装置を介して楽曲の音声を再生する。歌詞表示部１７３は、楽曲の再生に合せて画面上に歌詞を表示する。画像表示部１７４は、楽曲の再生に合せて、算出された画像表示時間にわたり選定された画像を表示する。画像表示部１７４は、連続する行に対応する歌詞表示時間同士の間隔が所定時間を超える場合には、間隔に対応する画像を表示してもよい。 The reproduction control unit 171 communicates with the lyrics database 600 and controls the reproduction of music and the display of lyrics and images synchronized with the reproduction of the music. The playback unit 172 plays back the sound of the music via an output device such as a speaker or headphones. The lyrics display unit 173 displays the lyrics on the screen in accordance with the reproduction of the music. The image display unit 174 displays an image selected over the calculated image display time in accordance with the reproduction of the music. The image display unit 174 may display an image corresponding to the interval when the interval between the lyrics display times corresponding to successive lines exceeds a predetermined time.

図２は、モジュールの構成を示すブロック図である。図２に示すように、語句関連度辞書生成モジュール１３０は、カテゴリ抽出部１３１、キーワード抽出部１３２、タグ抽出部１３３、語句関連度算出部１３４を備えている。 FIG. 2 is a block diagram showing the configuration of the module. As shown in FIG. 2, the phrase relevance level dictionary generation module 130 includes a category extraction unit 131, a keyword extraction unit 132, a tag extraction unit 133, and a phrase relevance level calculation unit 134.

カテゴリ抽出部１３１は、ユーザＵにより指定されたコンテンツ（特定のコンテンツ）から関連するカテゴリを抽出する。キーワード抽出部１３２は、カテゴリから１または複数のキーワードを抽出する。すなわち、コンテンツが楽曲である場合には、その楽曲の歌詞や全体の印象語から１または複数のキーワードを抽出する。このようにして、楽曲のキーワードを画像のタグとの語句関連度の算出に利用し、画像選定に利用する。なお、カテゴリはユーザにより手動で付与してもよいし、上記のように何らかのプログラムによりコンテンツから抽出してもよい。 The category extraction unit 131 extracts related categories from the content specified by the user U (specific content). The keyword extraction unit 132 extracts one or more keywords from the category. That is, when the content is a song, one or more keywords are extracted from the lyrics of the song and the overall impression word. In this way, the keyword of the music is used for calculating the degree of word association with the image tag and used for image selection. The category may be given manually by the user, or may be extracted from the content by some program as described above.

タグ抽出部１３３は、ネットワーク上で取得可能な画像に付与された１または複数のタグを抽出する。画像共有サイト４００を利用することで、多数の画像および画像に付与されたタグを抽出できる。 The tag extraction unit 133 extracts one or a plurality of tags attached to images that can be acquired on the network. By using the image sharing site 400, a large number of images and tags attached to the images can be extracted.

語句関連度算出部１３４は、コンテンツから抽出されたキーワードと画像から抽出されたタグとの語句の関連度を算出する。このように、コンテンツに付与されたカテゴリ情報を利用し、コンテンツに関するキーワードと画像に付与されたタグとの語句の関連度を用いて、コンテンツに適合した画像を選定する。その結果、自動でコンテンツに適した画像を選定でき、ユーザ自身がＷｅｂで画像を選ぶ労力を無くすことができる。なお、スライドショーや、ブログ、などマルチメディアコンテンツに対して画像を付与する場合にも利用できる。 The phrase relevance calculation unit 134 calculates the relevance of the phrase between the keyword extracted from the content and the tag extracted from the image. In this way, using the category information assigned to the content, an image suitable for the content is selected using the degree of association between the keyword related to the content and the tag attached to the image. As a result, it is possible to automatically select an image suitable for the content, and it is possible to eliminate the labor for the user himself to select an image on the Web. It can also be used to add images to multimedia contents such as slide shows and blogs.

語句関連度算出部１３４は、抽出されたキーワードのうちの１語句と取得されたタグのうちの１語句との共起確率を用いて語句の関連度を算出する。このように共起確率を利用することで、画像選定に適した語句の関連度を適正かつ容易に算出できる。 The phrase relevance calculating unit 134 calculates the relevance of the phrase using the co-occurrence probability of one word of the extracted keywords and one of the acquired tags. By using the co-occurrence probability in this way, it is possible to appropriately and easily calculate the degree of association of words suitable for image selection.

その際には、抽出されたキーワードのうちの１語句と同じカテゴリに属する他の語句に対して、取得されたタグのうちの語句との共起確率分の寄与を差し引いて語句の関連度を算出してもよい。このように同じカテゴリの別の語句の寄与を差し引くことで、キーワードとタグの語句関連度を正確に評価することができ、画像選定に適した語句の関連度を算出できる。 In that case, the degree of relevance of the phrase is obtained by subtracting the contribution of the co-occurrence probability with the phrase in the acquired tag from another phrase belonging to the same category as the one of the extracted keywords. It may be calculated. Thus, by subtracting the contribution of another phrase in the same category, the phrase relevance of the keyword and the tag can be accurately evaluated, and the relevance of the phrase suitable for image selection can be calculated.

また、画像選定モジュール１４０は、タグ取得部１４１、画像適合度算出部１４２および選定部１４３を備えている。タグ取得部１４１は、候補として抽出された画像から画像に付与されたタグを取得する。 The image selection module 140 includes a tag acquisition unit 141, an image suitability calculation unit 142, and a selection unit 143. The tag acquisition unit 141 acquires a tag attached to the image from the images extracted as candidates.

画像適合度算出部１４２は、ユーザＵにより指定されたコンテンツ（特定のコンテンツ）と候補画像との適合度を算出する。このとき、画像に付与されたタグとコンテンツに関連するキーワードとの関連度を累積的に加算することが好ましく、これにより妥当な関連度を算出することができる。具体例は後述する。 The image suitability calculation unit 142 calculates the suitability between the content designated by the user U (specific content) and the candidate image. At this time, it is preferable to cumulatively add the degree of association between the tag attached to the image and the keyword related to the content, whereby a reasonable degree of association can be calculated. Specific examples will be described later.

選定部１４３は、算出された語句の適合度の高い画像を選定する。コンテンツのキーワードと画像のタグとの語句の適合度を用いて画像を選定する。これにより、楽曲に適した画像を選定することができ、楽曲再生時の画像表示に利用することができる。 The selection unit 143 selects an image having a high degree of fitness for the calculated word / phrase. An image is selected using the degree of matching between the content keyword and the image tag. Thereby, an image suitable for music can be selected, and can be used for image display during music reproduction.

（画像選定装置の動作）
次に、画像選定装置１００の動作を説明する。図３は、画像選定装置の動作の一例を示すフローチャートである。まず、ユーザＵによるコンテンツの指定を受け付け（ステップＳ１）、キーワードの指定を受け付ける（ステップＳ２）。 (Operation of image selection device)
Next, the operation of the image selection device 100 will be described. FIG. 3 is a flowchart illustrating an example of the operation of the image selection device. First, content specification by the user U is accepted (step S1), and keyword specification is accepted (step S2).

次に、ネットワーク上で画像検索を行う（ステップＳ３）。検索結果に応じてキーワードとタグとの語句関連度を算出し（ステップＳ４）、算出された語句の関連度に応じて画像を選定する（ステップＳ５）。そして、選定された画像や歌詞の表示時間を決定し（ステップＳ６）、楽曲を再生し、再生に伴い歌詞および画像を表示して（ステップＳ７）終了する。以下に、各処理の詳細を説明する。 Next, an image search is performed on the network (step S3). The word / phrase relevance between the keyword and the tag is calculated according to the search result (step S4), and an image is selected according to the calculated relevance of the word / phrase (step S5). Then, the display time of the selected image or lyrics is determined (step S6), the music is reproduced, the lyrics and the image are displayed along with the reproduction (step S7), and the process ends. Details of each process will be described below.

（語句関連度算出）
語句関連度算出の動作を説明する。図４は、語句関連度算出の動作を示すフローチャートである。図４に示すように、まず、指定した楽曲からカテゴリ情報を抽出する（ステップＴ１）。楽曲から得られたカテゴリに関連するキーワードを抽出する（ステップＴ２）。 (Phrase relevance calculation)
The operation of the word relevance calculation will be described. FIG. 4 is a flowchart showing the operation of calculating the word association degree. As shown in FIG. 4, first, category information is extracted from the designated music (step T1). A keyword related to the category obtained from the music is extracted (step T2).

次に、Ｗｅｂ上で検索された候補画像に付与されているタグ群を抽出し（ステップＴ３）、抽出されたタグ群からタグを抽出する（ステップＴ４）。そして、抽出された各タグについてユーザ数が閾値以上か否かを判定する（ステップＴ５）。ユーザ数が閾値未満である場合には、ステップＴ４に戻る。ユーザ数が閾値以上である場合には、そのタグと楽曲のキーワードとの語句の関連度を算出する（ステップＴ６）。 Next, a tag group attached to the candidate image searched on the Web is extracted (step T3), and a tag is extracted from the extracted tag group (step T4). And it is determined whether the number of users is more than a threshold value about each extracted tag (step T5). If the number of users is less than the threshold, the process returns to step T4. If the number of users is greater than or equal to the threshold, the degree of association between the tag and the keyword of the music is calculated (step T6).

関連度の算出後、すべてのタグについて判定が終了したか否かを判定する（ステップＴ７）。すべてのタグについて判定が終了していない場合には、ステップＴ４に戻る。すべてのタグについて判定が終了した場合には、すべてのキーワードについて処理が終了したか否かを判定する（ステップＴ８）。すべてのキーワードについて処理が終了していない場合にはステップＴ２に戻る。 After the relevance is calculated, it is determined whether or not the determination has been completed for all tags (step T7). If the determination is not completed for all tags, the process returns to step T4. If the determination is completed for all tags, it is determined whether the processing is completed for all keywords (step T8). If processing has not been completed for all keywords, the process returns to step T2.

すべてのキーワードについて処理が終了した場合には、すべてのカテゴリについて処理が終了したか否かを判定する（ステップＴ９）。すべてのカテゴリについて処理が終了していない場合にはステップＴ１に戻る。すべてのカテゴリについて処理が終了した場合には、カテゴリ毎の語句関連度辞書を作成する（ステップＴ１０）。カテゴリ毎の語句関連度辞書の例については後述する。 When the process is completed for all keywords, it is determined whether the process is completed for all categories (step T9). If the processing has not been completed for all categories, the process returns to step T1. When the processing is completed for all categories, a phrase relevance dictionary for each category is created (step T10). An example of the phrase relevance dictionary for each category will be described later.

（画像選定の動作）
次に、画像選定の動作を説明する。図５は、画像選定の動作を示すフローチャートである。まず、ユーザによる入力画像を取得する（ステップＰ１）。そして、画像におけるタグ集合を取得する（ステップＰ２）。 (Image selection operation)
Next, an image selection operation will be described. FIG. 5 is a flowchart showing an image selection operation. First, an input image by the user is acquired (step P1). Then, a tag set in the image is acquired (step P2).

次に、入力画像について画像の適合度としてscoreを算出する（ステップＰ３）。そして、すべての画像について処理が終了したか否かを判定する（ステップＰ４）。すべての画像について処理が終了していないときにはステップＰ１に戻る。すべての画像について処理が終了したときには、画像の適合度のランキングを算出する（ステップＰ５）。そして、ランキングに基づいて画像を選定する（ステップＰ６）。 Next, a score is calculated as the image suitability for the input image (step P3). Then, it is determined whether or not the processing has been completed for all images (step P4). When the processing has not been completed for all the images, the process returns to Step P1. When the processing has been completed for all the images, the ranking of the suitability of the images is calculated (step P5). Then, an image is selected based on the ranking (step P6).

（画像選定の具体例）
以下に、画像選定処理の具体例を説明する。まず、カテゴリやキーワード等の数式による定義について説明する。コンテンツに付与されるキーワードのカテゴリCxの集合Wは次の数式で表すことができる。Ncはカテゴリの数である。

(Specific examples of image selection)
A specific example of the image selection process will be described below. First, definitions using mathematical formulas such as categories and keywords will be described. A set W of keyword categories Cx given to content can be expressed by the following mathematical formula. Nc is the number of categories.

また、カテゴリCxは次の数式で表すことができる。Nxはカテゴリに属するキーワードの数である。

The category Cx can be expressed by the following formula. Nx is the number of keywords belonging to the category.

一方、画像に付与されるタグ集合は、次の数式で表すことができる。NTwyはキーワードwyによって検索されたタグの数であり、NTiはコンテンツiに付与されているタグの総数である。

On the other hand, a tag set given to an image can be expressed by the following mathematical formula. NTwy is the number of tags searched by the keyword wy, and NTi is the total number of tags assigned to the content i.

また、ユーザ集合は、以下の数式で表すことができる。NUwyはキーワードwyによって検索された画像の投稿ユーザ数である。

The user set can be expressed by the following mathematical formula. NUwy is the number of posting users of images searched by the keyword wy.

このような定義のもとで画像選定の処理を行う。まず、あるキーワードwyを抽出し、キーワードに基づいて、Ｗｅｂ検索などの結果からタグ集合Twy及び、ユーザ集合Uwyを計算する。次に、タグ群Twyに存在する全てのタグに足して、該当タグを付与したユーザ数UFを計算する。UFに対しては、たとえば閾値THを設定することでノイズを削減することもできる。たとえばTH＝５とすることができる。 Image selection processing is performed under such a definition. First, a certain keyword wy is extracted, and based on the keyword, a tag set Twy and a user set Uwy are calculated from the result of Web search or the like. Next, the number of users UF given the corresponding tag is calculated by adding all the tags existing in the tag group Twy. For UF, for example, noise can be reduced by setting a threshold value TH. For example, TH = 5 can be set.

その後、タグ集合Twyの各要素に対してwyとの語句の関連度Rを計算する。語句の関連度Rは以下のような式で計算することができる。Pは共起確率である。共起確率とは、キーワードとタグが一つの同じ画像に付与されている確率であり、キーワードとタグの関連の強さを表す指標として利用することができる。UF(t|wy)は、wyによって検索された画像のうちtというタグを持つ画像を投稿したユーザ数である。また、UF(wy)は、wyによって検索された画像を投稿したユーザ数である。Weightは、重み係数であり、任意に設定することができる。 Thereafter, the degree of association R of the word with wy is calculated for each element of the tag set Twy. The relevance level R of a phrase can be calculated by the following formula. P is the co-occurrence probability. The co-occurrence probability is a probability that a keyword and a tag are assigned to one and the same image, and can be used as an index representing the strength of association between the keyword and the tag. UF (t | wy) is the number of users who have posted images with the tag t among images searched by wy. Further, UF (wy) is the number of users who have posted images searched by wy. Weight is a weight coefficient and can be set arbitrarily.

式（６）はキーワードとタグとの関連の強さを示す語句の関連度Rを表現しており、式（５）などの共起確率をもとに算出することができる。最終的にキーワード毎に語句の関連度Rの高いタグを関連タグ集合と定義し、それをもとに各キーワードに対する語句の関連度Rおよび重要語を辞書として保持することができる。たとえば、キーワードが“春”の場合、“春”は“季節”という概念の印象語であるため、Cx={“春”、“夏”、“秋”、“冬”}を保持できる。 Equation (6) expresses the degree of association R of the phrase indicating the strength of the association between the keyword and the tag, and can be calculated based on the co-occurrence probability such as equation (5). Finally, a tag having a high degree of relevance R of words for each keyword is defined as a related tag set, and based on this, the relevance degree R of words and important words for each keyword can be held as a dictionary. For example, when the keyword is “spring”, since “spring” is an impression word of the concept of “season”, Cx = {“spring”, “summer”, “autumn”, “winter”} can be held.

図６は、語句関連度辞書の一例を示すテーブルである。図６では、各カテゴリCx={w1、w2、w3、w4}の語句と、各カテゴリに関連する各画像のタグとの語句の関連度Rが対応している。t_w1_1は、画像1に付与され、w1のカテゴリに属するタグである。R(t_w1_1|w1)は、キーワードw1と画像1に付与され、w1のカテゴリに属するタグt_w1_1との語句の関連度Rである。 FIG. 6 is a table showing an example of the phrase relevance dictionary. In FIG. 6, the degree of association R between the words of each category Cx = {w1, w2, w3, w4} and the tag of each image related to each category corresponds. t_w1_1 is a tag attached to the image 1 and belonging to the category of w1. R (t_w1_1 | w1) is the degree of association R of the phrase between the keyword w1 and the tag t_w1_1 belonging to the category of the image 1 and w1.

このように定義した語句の関連度Rを、それぞれの全体印象語と共起するUF(t)≧THを満たすタグtを対象に算出し、たとえば語句の関連度Rが0．024以上のタグを関連タグとして判定することができる。 The relevance level R of the words and phrases defined in this way is calculated for the tag t that satisfies UF (t) ≧ TH that co-occurs with each overall impression word, for example, a tag whose relevance level R is 0.024 or more. Can be determined as related tags.

図７は、関連タグと語句の関連度の抽出結果の一例を示すテーブルである。図７では、“春”、“夏”、“朝”、“雨”に対しての関連語(tag)と語句関連度(value)を示しており、語句関連度の高い順にソートしてある。 FIG. 7 is a table showing an example of a result of extracting the degree of association between a related tag and a phrase. FIG. 7 shows related words (tags) and word relevance levels (value) for “spring”, “summer”, “morning”, and “rain”, sorted in descending order of word relevance levels. .

共起確率はFlickr（登録商標）におけるUF(User Frequency)をもとに算出することができ、UF(W)とは、単語集合Wの要素全てをクエリとしてAND検索した際の、検索結果におけるユニークな画像投稿者数を示す。単純に画像ヒット数ではなくUFを用いるのは、Flickr（登録商標）では一定のユーザが同一のタグを付与した画像を大量にアップロードすることで、画像ヒット数が不当に高い値になることがあるため、その影響を緩和するためである。 The co-occurrence probability can be calculated based on UF (User Frequency) in Flickr (registered trademark), and UF (W) is the result of AND search using all the elements of word set W as a query. Indicates the number of unique image contributors. The reason for simply using UF instead of image hits is that in Flickr (registered trademark), if a certain user uploads a large number of images with the same tag, the number of image hits can be unreasonably high. This is to mitigate the effect.

そして、キーワードwyに対するタグtの語句の関連度Rは、それらの共起確率Pから、wyと同じカテゴリに属する他のキーワードにおけるタグtとの共起確率Pの差分により算出することができる。キーワードとタグ間の共起確率だけでなく、同じカテゴリに属する他のキーワードPにおける共起確率Pも考慮することで、１つのキーワードに対してのみ共起確率の高いタグを重要視することができる。 Then, the degree of association R of the phrase of tag t with keyword wy can be calculated from the co-occurrence probability P based on the difference in co-occurrence probability P with tag t in another keyword belonging to the same category as wy. Considering not only the co-occurrence probability between keywords and tags, but also the co-occurrence probability P of other keywords P belonging to the same category, the tag with a high co-occurrence probability only for one keyword can be regarded as important. it can.

次に、画像検索モジュール１２０によって得られた画像群の中から画像Iiを抽出する。次に、Iiにおけるタグ集合Tiを取得し、式（７）などにより、コンテンツに付与されたキーワードと、画像のタグ情報からスコアを計算することができる。式（７）において関連辞書生成モジュールによって得られた関連辞書を利用することや、利用しなくとも画像の適合度としてscoreを計算することができる。

Next, the image Ii is extracted from the image group obtained by the image search module 120. Next, the tag set Ti in Ii is acquired, and the score can be calculated from the keyword assigned to the content and the tag information of the image by the equation (7) or the like. Using the related dictionary obtained by the related dictionary generation module in the equation (7), the score can be calculated as the degree of matching of the image without using the related dictionary.

たとえば、画像の適合度のscoreとして、式（７）を利用することで、キーワードとの語句関連度が高く、かつ、ノイズタグが少ない画像を選定することができる。たとえば、コンテンツを楽曲とした場合、楽曲におけるキーワードとの関連の強いタグが多く付与されている画像ほど、大きい値をとるように設定することができる。このscoreに基づいて画像をランキング表示することや、ランキング上位Nまでの画像を自動で選定することができる。 For example, by using Equation (7) as the score of the degree of matching of an image, an image having a high degree of word relevance with a keyword and a small number of noise tags can be selected. For example, when the content is music, it can be set to take a larger value for an image to which more tags that are strongly related to the keyword in the music are assigned. Based on this score, images can be displayed in a ranking, and images up to the top N can be selected automatically.

［第２の実施形態］
上記の実施形態では、語句関連度辞書を画像選定が必要なときに動的に作成するが、事前に作成しておき、画像選定時に利用してもよい。図８は、語句関連度辞書を事前に作成する画像選定装置２００の構成を示すブロック図である。図９は、モジュールの構成を示すブロック図である。画像選定装置２００では、語句関連度辞書生成モジュール１３０を事前に機能させ、その後、画像選定モジュール１４０により、画像選定を行う。ただし、各部の機能は、実施形態１と同様なので省略する。 [Second Embodiment]
In the above embodiment, the phrase relevance dictionary is dynamically created when image selection is necessary. However, it may be created in advance and used at the time of image selection. FIG. 8 is a block diagram illustrating a configuration of the image selection device 200 that creates a word-related degree dictionary in advance. FIG. 9 is a block diagram showing the configuration of the module. In the image selection device 200, the word / phrase relevance dictionary generation module 130 functions in advance, and then the image selection module 140 performs image selection. However, since the function of each part is the same as that of Embodiment 1, it abbreviate | omits.

１００画像選定装置
１１１リクエスト受付部
１１２データベース
１２０画像検索モジュール
１３０語句関連度辞書生成モジュール
１３１カテゴリ抽出部
１３２キーワード抽出部
１３３タグ抽出部
１３４語句関連度算出部
１４０画像選定モジュール
１４１タグ取得部
１４２画像適合度算出部
１４３選定部
１５１画像管理部
１６０表示時間管理モジュール
１７１再生制御部
１７２再生部
１７３歌詞表示部
１７４画像表示部
２００画像選定装置
４００画像共有サイト
５００楽曲データベース
６００歌詞データベース
Ｕユーザ DESCRIPTION OF SYMBOLS 100 Image selection apparatus 111 Request reception part 112 Database 120 Image search module 130 Phrase relevance dictionary generation module 131 Category extraction part 132 Keyword extraction part 133 Tag extraction part 134 Phrase relevance calculation part 140 Image selection module 141 Tag acquisition part 142 Image adaptation Degree calculation unit 143 Selection unit 151 Image management unit 160 Display time management module 171 Playback control unit 172 Playback unit 173 Lyric display unit 174 Image display unit 200 Image selection device 400 Image sharing site 500 Music database 600 Lyrics database U User

Claims

An image selection device for selecting an image related to content,
A keyword extractor that extracts one or more keywords from specific content;
A tag extraction unit that extracts one or more tags attached to an image obtainable on the network;
The extracted keyword and the acquisition by subtracting the contribution of the co-occurrence probability with the phrase in the acquired tag from another phrase belonging to the same category as the one of the extracted keywords A phrase relevance calculator that calculates the relevance of the phrase with the tagged tag,
An image selection apparatus comprising: a selection unit that selects an image having a high degree of matching with the content by using the calculated degree of association of words and phrases.

The image selection apparatus according to claim 1, wherein the keyword extraction unit extracts one or a plurality of keywords from a specific music piece.

The phrase relevance calculation unit calculates the relevance of the phrase using a co-occurrence probability of one of the extracted keywords and one of the acquired tags. The image selection device according to claim 1 or 2.

An image selection method for selecting an image related to content,
Extracting one or more keywords from specific content;
Obtaining one or more tags attached to an image obtainable on a network;
The extracted keyword and the acquisition by subtracting the contribution of the co-occurrence probability with the phrase in the acquired tag from another phrase belonging to the same category as the one of the extracted keywords Calculating the relevance of the term to the tagged tag;
Selecting an image highly related to the content using the calculated degree of association of words.

An image selection program for selecting images related to content,
Extracting one or more keywords from specific content;
Processing to acquire one or more tags attached to images that can be acquired on the network;
The extracted keyword and the acquisition by subtracting the contribution of the co-occurrence probability with the phrase in the acquired tag from another phrase belonging to the same category as the one of the extracted keywords Calculating the degree of relevance of the word to the tagged tag,
An image selection program that causes a computer to execute processing for selecting an image highly related to the content by using the calculated degree of association of words.