JP6672707B2

JP6672707B2 - Playlist generation method, playlist generation device, program, and list generation method

Info

Publication number: JP6672707B2
Application number: JP2015215290A
Authority: JP
Inventors: 淳哉斎藤
Original assignee: Fujitsu Ltd
Current assignee: Fujitsu Ltd
Priority date: 2015-10-30
Filing date: 2015-10-30
Publication date: 2020-03-25
Anticipated expiration: 2035-10-30
Also published as: JP2017084313A

Description

本発明は、プレイリスト生成方法、プレイリスト生成装置、及びプログラム、並びに再生リスト生成方法に関する。 The present invention relates to a playlist generation method, a playlist generation device, a program, and a playlist generation method.

近年、通信ネットワークに接続可能な電子機器の種類が増加する傾向にあり、各種電子機器に対し通信ネットワークを利用したサービスが提供されるようになってきている。特に、インターネット等の通信ネットワークを利用したサービスの１つとして、各種電子機器に対する音楽配信サービスの普及が進んでいる。 In recent years, types of electronic devices connectable to a communication network have been increasing, and services using the communication network have been provided to various electronic devices. In particular, as one of services using a communication network such as the Internet, music distribution services for various electronic devices have been widely spread.

音楽配信サービスの１つとして、電子機器の利用者（オペレータ）が聞きたい曲のイメージを表すキーワードを音声で指定することにより、配信サーバがイメージに合った曲を抽出したプレイリストを生成して電子機器に配信するサービスが知られている。プレイリストの生成方法としては、プレイリスト及び曲のそれぞれにイメージを表す特徴量を設定しておき、プレイリストの特徴量との合致度の高い特徴量を持つ曲をプレイリストに登録する方法が知られている（例えば、特許文献１を参照）。 As one of the music distribution services, a user (operator) of an electronic device specifies a keyword representing an image of a song to be heard by voice, and the distribution server generates a playlist in which songs matching the image are extracted. Services for delivering to electronic devices are known. As a method of generating a playlist, there is a method in which a feature amount representing an image is set for each of the playlist and the song, and a song having a feature amount having a high degree of matching with the feature amount of the playlist is registered in the playlist. It is known (see, for example, Patent Document 1).

特開２００３−１３２０８５号公報JP-A-2003-132850

イメージの特徴量に基づいて音楽データのプレイリストを作成する場合、イメージ特徴空間におけるプレイリストの特徴量を中心とした所定の範囲内の特徴量を持つ曲がプレイリストに登録される。そのため、イメージの特徴量の類似したプレイリストがある場合、ある１曲を登録対象とするプレイリストが複数存在する場合がある。 When creating a playlist of music data based on the feature amount of an image, a song having a feature amount within a predetermined range centered on the feature amount of the playlist in the image feature space is registered in the playlist. Therefore, when there are playlists having similar image feature amounts, there may be a plurality of playlists in which a certain song is to be registered.

また、プレイリストを生成する際には、曲のイメージとは別の条件（属性）を表すキーワードを音声で指定することにより、プレイリストへの登録対象となる曲を絞り込んでプレイリストを生成することも可能となっている。そのため、曲を絞り込んでプレイリストを生成した場合、プレイリストに登録される曲数が少なく、かつイメージの特徴量に偏りが生じる可能性がある。 When a playlist is generated, a keyword indicating a condition (attribute) different from the image of the song is specified by voice, thereby narrowing down the songs to be registered in the playlist and generating the playlist. It is also possible. Therefore, when the playlist is generated by narrowing down the songs, the number of songs registered in the playlist may be small and the feature amount of the image may be biased.

このように、プレイリストに登録される曲数が少なく、かつイメージの特徴量に偏りが生じた場合、プレイリストのイメージと登録された曲のみから利用者が抱くイメージとに齟齬が生じ、違和感を抱く可能性がある。また、曲（音楽）に限らず、小説等の朗読再生用の文書データや画像データ等、イメージを表す特徴量により分類可能なデータの再生リストをイメージに基づいて作成する場合にも、同様のことが言える。 As described above, when the number of songs registered in the playlist is small and the feature amount of the image is biased, a discrepancy occurs between the image of the playlist and the image held by the user only from the registered songs, and a sense of incongruity occurs. May be embraced. The same applies to a case where a reproduction list of data that can be classified based on a feature amount representing an image, such as document data for reading and reproducing a novel or the like, image data, etc., based on an image, is not limited to a song (music). I can say that.

１つの側面において、本発明は、提示したプレイリストのイメージとプレイリストに含まれる曲のイメージとの齟齬により利用者に与える違和感を低減させることを目的とする。 In one aspect, an object of the present invention is to reduce a sense of discomfort given to a user due to a discrepancy between an image of a presented playlist and an image of a song included in the playlist.

１つの態様のプレイリスト生成方法では、コンピュータが以下の処理を実行する。コンピュータは、まず、利用者からのプレイリストの生成要求に含まれる当該プレイリストへの登録対象とする曲の属性を表す属性情報に基づいて曲の絞り込みを行う。次に、コンピュータは、絞り込んだ曲についてのイメージを表す特徴量と、データベース上で各プレイリストに対応付けられている範囲であってプレイリストへの登録対象とする曲についての特徴量が分布する範囲を規定しているプレイリストの範囲とに基づいて、イメージを表す特徴量が当該範囲に含まれる曲を登録したプレイリストを、推薦プレイリスト候補として生成する。そして、生成した推薦プレイリスト候補に登録された曲についての特徴量の分布が、生成要求によって生成が要求されているプレイリストである要求プレイリストについて想定している当該範囲に対して偏っている場合に、推薦プレイリスト候補に含まれる曲についての当該分布と、推薦プレイリスト候補について想定している当該範囲と、要求プレイリストについて想定している当該範囲とに基づいて、推薦プレイリスト候補から、利用者に提示するプレイリストである推薦プレイリストを決定する。
In one embodiment, the computer executes the following processing. First, the computer narrows down songs based on attribute information indicating attributes of songs to be registered in the playlist included in the playlist generation request from the user. Then, the computer, the feature of the song for the feature value representing the image of the song narrowed, and registered in a range that has been correlated to each play list on the database to a playlist is distributed Based on the range of the play list defining the range , a play list in which the tunes whose features representing the images are included in the range is generated as a recommended play list candidate . Then, the feature quantity distribution of the songs registered in the generated recommendation playlist candidates, biased against the corresponding range is assumed about the requested playlist is a playlist generation is requested by the generation request If you are, based on the corresponding distributions for the song to be included in the recommendation playlist candidate, and the range assumed for recommended playlist candidate, and the range assumed with the requirements playlist recommendation playlist From the candidates , a recommended playlist that is a playlist to be presented to the user is determined .

上述の態様によれば、提示したプレイリストのイメージとプレイリストに含まれる曲のイメージとの齟齬により利用者に与える違和感を低減させることが可能となる。 According to the above aspect, it is possible to reduce the discomfort given to the user due to the discrepancy between the image of the presented playlist and the image of the music included in the playlist.

第１の実施形態に係るプレイリスト生成装置の機能的構成を示す図である。FIG. 2 is a diagram illustrating a functional configuration of a playlist generation device according to the first embodiment. 優先度算出部及びナレーション作成部の構成例を示す図The figure which shows the example of a structure of a priority calculation part and a narration preparation part キーワードＤＢの例を示す図である。It is a figure showing an example of a keyword DB. 絞込属性ＤＢの例を示す図である。It is a figure showing an example of a narrowing down attribute DB. 曲ＤＢの例を示す図である。It is a figure showing the example of music DB. プレイリストＤＢの例を示す図である。It is a figure showing an example of a play list DB. イメージ特徴量の例を示す図である。FIG. 6 is a diagram illustrating an example of an image feature amount. イメージ特徴空間を説明する図である。FIG. 3 is a diagram illustrating an image feature space. 第１の実施形態に係るプレイリスト生成処理を説明するフローチャート（その１）である。9 is a flowchart (part 1) illustrating a playlist generation process according to the first embodiment. 第１の実施形態に係るプレイリスト生成処理を説明するフローチャート（その２）である。10 is a flowchart (part 2) illustrating a playlist generation process according to the first embodiment. プレイリストの変更方法の例を説明する図である。FIG. 9 is a diagram for describing an example of a playlist change method. 検索キーワード及び絞り込み属性を抽出する処理の内容を説明するフローチャートである。It is a flowchart explaining the content of the process of extracting a search keyword and a refinement attribute. 推薦優先度を算出する処理の内容を説明するフローチャートである。It is a flowchart explaining the content of the process which calculates recommendation priority. 要求ずれ量を説明する図である。FIG. 7 is a diagram illustrating a required shift amount. 認識ずれ量を説明する図である。FIG. 4 is a diagram illustrating a recognition shift amount. プレイリストの範囲の別の例を示す図である。It is a figure showing another example of the range of a play list. プレイリストの範囲と確率密度分布との関係を説明する図である。FIG. 4 is a diagram illustrating a relationship between a range of a playlist and a probability density distribution. 第２の実施形態に係るプレイリスト生成装置の機能的構成を示す図である。It is a figure showing the functional composition of the play list generation device concerning a 2nd embodiment. 適応処理部の構成例を示す図である。It is a figure showing the example of composition of an adaptive processing part. 適応処理部が行う処理を説明するフローチャートである。9 is a flowchart illustrating a process performed by an adaptive processing unit. プレイリストの範囲の修正方法の例を説明する図である。FIG. 9 is a diagram for describing an example of a method of modifying a range of a playlist. プレイリストの範囲の拡大方法の例を説明する図である。FIG. 9 is a diagram illustrating an example of a method of expanding a range of a playlist. プレイリストの範囲の拡大することによる効果の１つを説明する図である。FIG. 9 is a diagram for explaining one of effects obtained by expanding the range of the playlist. 第３の実施形態に係るプレイリスト再生システムの構成を示す図である。It is a figure showing the composition of the play list reproduction system concerning a 3rd embodiment. 第３の実施形態に係るプレイリスト再生システムにおけるサーバの機能的構成を示す図である。It is a figure showing the functional composition of the server in the play list reproduction system concerning a 3rd embodiment. 第４の実施形態に係るプレイリスト再生システムの構成を示す図である。It is a figure showing the composition of the play list reproduction system concerning a 4th embodiment. 第４の実施形態に係るプレイリスト再生システムにおけるサーバの機能的構成を示す図である。It is a figure showing the functional composition of the server in the playlist reproduction system concerning a 4th embodiment. 第５の実施形態に係るプレイリスト再生システムの構成を示す図である。It is a figure showing the composition of the play list reproduction system concerning a 5th embodiment. コンピュータのハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration of a computer.

［第１の実施形態］
図１は、第１の実施形態に係るプレイリスト生成装置の機能的構成を示す図である。 [First Embodiment]
FIG. 1 is a diagram illustrating a functional configuration of the playlist generation device according to the first embodiment.

図１に示すように、本実施形態に係るプレイリスト生成装置１は、要求プレイリスト入力部１０１と、キーワード／属性抽出部１０２と、曲絞込部１０３と、を備える。また、プレイリスト生成装置１は、推薦プレイリスト候補生成部１０４と、優先度算出部１０５と、プレイリスト決定部１０６と、を備える。また、プレイリスト生成装置１は、ナレーション作成部１０７と、プレイリスト再生部１０８と、音声出力部１０９と、を備える。また、プレイリスト生成装置１は、キーワードＤＢ１２１と、絞込属性ＤＢ１２２と、曲ＤＢ１２３と、プレイリストＤＢ１２４と、を備える。 As shown in FIG. 1, the playlist generation device 1 according to the present embodiment includes a required playlist input unit 101, a keyword / attribute extraction unit 102, and a music narrowing unit 103. Further, the playlist generation device 1 includes a recommended playlist candidate generation unit 104, a priority calculation unit 105, and a playlist determination unit 106. Further, the playlist generation device 1 includes a narration creation unit 107, a playlist reproduction unit 108, and an audio output unit 109. Further, the playlist generation device 1 includes a keyword DB 121, a narrow-down attribute DB 122, a song DB 123, and a playlist DB 124.

要求プレイリスト入力部１０１は、利用者からのプレイリストの生成要求の入力を音声又はテキストにより受け付ける。生成要求の入力を音声により受け付ける場合、要求プレイリスト入力部１０１は、マイク等の音声入力部と、音声認識部とを含む。音声による入力を受け付ける場合、要求プレイリスト入力部１０１は、マイクで収音した周囲の音を音声信号にした後、音声認識によりテキストに変換して出力する。また、テキストによる入力を受け付ける場合、要求プレイリスト入力部１０１は、キーボード（ソフトウェアキーボードを含む）から文字入力を検知し、テキストにして出力する。 The request playlist input unit 101 receives an input of a playlist generation request from a user by voice or text. When the input of the generation request is received by voice, the requested playlist input unit 101 includes a voice input unit such as a microphone and a voice recognition unit. When accepting an input by voice, the request playlist input unit 101 converts a surrounding sound picked up by the microphone into a voice signal, converts the voice into a text by voice recognition, and outputs the text. When accepting text input, the request playlist input unit 101 detects a character input from a keyboard (including a software keyboard) and outputs the text.

キーワード／属性抽出部１０２は、音声認識部で得たテキストに基づいて、キーワードＤＢ１２１から検索キーワードを抽出するとともに、絞込属性ＤＢ１２２から曲の絞り込み属性を抽出する。キーワードＤＢ１２１は、テキストと検索キーワードとの対応関係が複数登録されたデータベースである。検索キーワードは、プレイリストの検索に用いるキーワードである。絞込属性ＤＢ１２２は、テキストと曲の絞り込み属性との対応関係が複数登録されたデータベースである。曲の絞り込み属性は、プレイリストに登録する曲の絞り込みに用いる属性情報である。 The keyword / attribute extraction unit 102 extracts a search keyword from the keyword DB 121 based on the text obtained by the voice recognition unit, and extracts a song narrowing attribute from the narrowing attribute DB 122. The keyword DB 121 is a database in which a plurality of correspondences between texts and search keywords are registered. The search keyword is a keyword used for searching for a playlist. The narrowing down attribute DB 122 is a database in which a plurality of correspondences between texts and song narrowing down attributes are registered. The song narrowing down attribute is attribute information used to narrow down songs to be registered in the playlist.

曲絞込部１０３は、抽出した絞り込み属性に基づいて、プレイリストへの登録対象の候補となる曲を、曲ＤＢ１２３に登録された全ての曲から絞り込み属性に合致する属性を持つ曲に絞り込む。曲ＤＢ１２３は、曲音声データと、曲属性と、曲のイメージ特徴量との対応関係が複数登録されたデータベースである。曲属性は、曲音声データの絞り込みに用いる属性情報である。曲のイメージ特徴量は、曲のイメージを数値化した情報である。 The song narrowing unit 103 narrows down the candidate songs to be registered in the playlist to songs having attributes matching the narrowing attribute from all the songs registered in the song DB 123 based on the extracted narrowing attributes. . The song DB 123 is a database in which a plurality of correspondences between song audio data, song attributes, and image feature amounts of the song are registered. The song attribute is attribute information used to narrow down song audio data. The music image feature amount is information obtained by digitizing the music image.

推薦プレイリスト候補生成部１０４は、プレイリストＤＢ１２４から曲絞込部１０３で絞り込んだ曲を含むプレイリストを抽出して推薦プレイリスト候補を作成する。推薦プレイリスト候補は、プレイリスト生成装置１の利用者（オペレータ）に推薦する（提示する）プレイリストの候補である。推薦プレイリスト候補は、プレイリストの名称と、曲ＩＤと、想定分布とを含む。プレイリストＤＢ１２４は、プレイリスト毎にプレイリストの名称と、想定分布と、プレイリストの範囲とを対応付けて登録したデータベースである。プレイリストの想定分布は、曲のイメージ特徴量に基づいて曲の分布を表す際に用いるイメージ特徴空間におけるプレイリストの確率密度分布であり、例えば、確率密度分布を正規分布とする場合は、平均（中心）と分散で表現する。また、プレイリストの範囲は、イメージ特徴空間においてプレイリストへの登録対象とする曲の特徴量を規定する範囲である。すなわち、推薦プレイリスト候補生成部１０４は、プレイリストＤＢ１２４に登録されたプレイリストからプレイリストの範囲内に曲絞込部１０３で絞り込んだ曲が含まれるプレイリストを抽出して推薦プレイリスト候補を作成する。 The recommended playlist candidate generation unit 104 extracts a playlist including the songs narrowed down by the song narrowing unit 103 from the playlist DB 124 and creates a recommended playlist candidate. The recommended playlist candidate is a playlist candidate that is recommended (presented) to a user (operator) of the playlist generation device 1. The recommended playlist candidate includes a playlist name, a song ID, and an assumed distribution. The playlist DB 124 is a database in which playlist names, assumed distributions, and playlist ranges are registered in association with each playlist. The assumed distribution of the playlist is the probability density distribution of the playlist in the image feature space used when expressing the distribution of the song based on the image feature amount of the song.For example, when the probability density distribution is a normal distribution, the average (Center) and variance. The range of the playlist is a range that defines the feature amount of a music piece to be registered in the playlist in the image feature space. That is, the recommended playlist candidate generating unit 104 extracts a playlist including the songs narrowed down by the song narrowing unit 103 within the range of the playlist from the playlists registered in the playlist DB 124, and Create

優先度算出部１０５は、推薦プレイリスト候補生成部１０４が複数の推薦プレイリスト候補を作成した場合に各推薦プレイリスト候補の優先度を算出する。優先度算出部１０５は、推薦プレイリスト候補の想定分布及び範囲、並びに検索キーワードに基づいて、各推薦プレイリスト候補の優先度を算出する。 The priority calculation unit 105 calculates the priority of each recommended playlist candidate when the recommended playlist candidate generation unit 104 creates a plurality of recommended playlist candidates. The priority calculation unit 105 calculates the priority of each recommended playlist candidate based on the assumed distribution and range of the recommended playlist candidate and the search keyword.

プレイリスト決定部１０６は、優先度算出部１０５が算出した優先度に基づいて、複数の推薦プレイリスト候補のなかから採用するプレイリスト、すなわち利用者に提示するプレイリストを決定する。 The playlist determination unit 106 determines a playlist to be adopted from among a plurality of recommended playlist candidates, that is, a playlist to be presented to the user, based on the priority calculated by the priority calculation unit 105.

ナレーション作成部１０７は、プレイリスト決定部１０６が決定したプレイリストのイメージ等を表すナレーション音声データを作成する。 The narration creating unit 107 creates narration audio data representing an image or the like of the playlist determined by the playlist determining unit 106.

プレイリスト再生部１０８は、ナレーション作成部１０７が作成したナレーション音声データを再生するとともに、プレイリスト決定部１０６が決定したプレイリストに含まれる曲音声データを再生する。 The playlist playback unit 108 plays back the narration audio data created by the narration creation unit 107 and plays back the song audio data included in the playlist determined by the playlist determination unit 106.

音声出力部１０９は、例えばスピーカであり、プレイリスト再生部１０８が出力するナレーション音声データ及び曲音声データを曲音声としてプレイリスト生成装置１の外部の空間に向けて出力する。 The audio output unit 109 is, for example, a speaker, and outputs the narration audio data and the music audio data output by the playlist reproduction unit 108 as music audio to a space outside the playlist generation device 1.

このように、本実施形態に係るプレイリスト生成装置１は、生成したプレイリストを再生して出力するプレイリスト再生装置を兼ねている。 As described above, the playlist generation device 1 according to the present embodiment also functions as a playlist reproduction device that reproduces and outputs the generated playlist.

図２は、優先度算出部及びナレーション作成部の構成例を示す図である。
図２に示すように、優先度算出部１０５は、認識ずれ量算出部１０５ａと、要求ずれ量算出部１０５ｂと、プレイリスト検索部１０５ｃと、ずれ量統合部１０５ｄと、を含む。 FIG. 2 is a diagram illustrating a configuration example of a priority calculation unit and a narration creation unit.
As illustrated in FIG. 2, the priority calculation unit 105 includes a recognition shift amount calculation unit 105a, a request shift amount calculation unit 105b, a playlist search unit 105c, and a shift amount integration unit 105d.

認識ずれ量算出部１０５ａは、推薦プレイリスト候補に含まれる曲ＩＤと対応する曲のイメージ特徴量から求めたプレイリストの曲の特徴量の分布と、当該推薦プレイリスト候補の想定している曲の特徴量の分布との差異（認識ずれ量）を算出する。ここで、推薦プレイリスト候補（プレイリスト）の想定している曲の特徴量の分布とは、例えば、曲ＤＢ１２３に登録された全ての曲を登録対象としてプレイリストを作成した場合に当該プレイリストに含まれる全ての曲の特徴量の分布である。以下の説明では、曲の特徴量の分布のことを「曲の分布」又は単に「分布」ともいう。 The recognition shift amount calculation unit 105a calculates the distribution of the feature amount of the song in the playlist obtained from the image feature amount of the song corresponding to the song ID included in the recommended playlist candidate, and the song assumed by the recommended playlist candidate. The difference (recognition shift amount) from the distribution of the feature amount is calculated. Here, the distribution of the feature amount of the music assumed as the recommended playlist candidate (playlist) is, for example, the case where a playlist is created with all the music registered in the music DB 123 as a registration target. This is the distribution of the feature amounts of all the songs included in the list. In the following description, the distribution of the feature amount of a music piece is also referred to as “music piece distribution” or simply “distribution”.

要求ずれ量算出部１０５ｂは、推薦プレイリスト候補の想定している曲の分布と、キーワード／属性抽出部１０２が抽出した検索キーワードに対応したプレイリストの想定している曲の分布との差異（要求ずれ量）を算出する。要求ずれ量算出部１０５ｂは、検索キーワードに対応したプレイリストをプレイリスト検索部１０５ｃから取得する。以下の説明では、検索キーワードに対応したプレイリスト、すなわち利用者が要求するプレイリストのことを「要求プレイリスト」ともいう。 The request deviation amount calculation unit 105b calculates the difference between the distribution of music assumed as a recommended playlist candidate and the distribution of music assumed in a playlist corresponding to the search keyword extracted by the keyword / attribute extraction unit 102 ( Required deviation). The request deviation amount calculation unit 105b acquires a playlist corresponding to the search keyword from the playlist search unit 105c. In the following description, a playlist corresponding to a search keyword, that is, a playlist requested by a user is also referred to as a “requested playlist”.

プレイリスト検索部１０５ｃは、キーワード／属性抽出部１０２が抽出した検索キーワードをキーとしてプレイリストＤＢ１２４を検索し、当該検索キーワードと合致するプレイリスト（要求プレイリスト）を取得する。 The playlist search unit 105c searches the playlist DB 124 using the search keyword extracted by the keyword / attribute extraction unit 102 as a key, and acquires a playlist (requested playlist) that matches the search keyword.

ずれ量統合部１０５ｄは、各推薦プレイリスト候補に対する認識ずれ量と要求ずれ量とに基づいて、各推薦プレイリスト候補の優先度を算出する。 The deviation amount integration unit 105d calculates the priority of each recommended playlist candidate based on the recognition deviation amount and the required deviation amount for each recommended playlist candidate.

また、図２に示すように、ナレーション作成部１０７は、ナレーション文章作成部１０７ａと、ナレーション音声作成部１０７ｂと、を含む。ナレーション文章作成部１０７ａは、検索キーワードと、プレイリスト決定部１０６が決定したプレイリストとに基づいて、決定したプレイリストについての情報を含む文章（テキスト）を作成する。ナレーション音声作成部１０７ｂは、ナレーション文章作成部１０７ａが作成した文章に基づいて、決定したプレイリストについての情報を含む音声データを作成する。 Further, as shown in FIG. 2, the narration creating unit 107 includes a narration text creating unit 107a and a narration voice creating unit 107b. The narration sentence creation unit 107a creates a sentence (text) including information on the determined playlist based on the search keyword and the playlist determined by the playlist determination unit 106. The narration sound creation unit 107b creates sound data including information on the determined playlist based on the sentence created by the narration sentence creation unit 107a.

図３は、キーワードＤＢの例を示す図である。キーワードＤＢ１２１は、上記のように、テキストと検索キーワードとの対応関係テキストと検索キーワードとの対応関係が複数登録されたデータベースである。キーワードＤＢ１２１におけるテキスト及び検索キーワードは、それぞれ、図３に示すように、プレイリスト（曲）のイメージを表す文字列である。また、キーワードＤＢ１２１は、図３に示すように、１つの検索キーワードに対し複数のテキストを対応付けて登録することも可能である。 FIG. 3 is a diagram illustrating an example of the keyword DB. The keyword DB 121 is a database in which a plurality of correspondences between texts and search keywords are registered as described above. Each of the text and the search keyword in the keyword DB 121 is a character string representing an image of a playlist (song), as shown in FIG. Further, as shown in FIG. 3, the keyword DB 121 can register a plurality of texts in association with one search keyword.

図４は、絞込属性ＤＢの例を示す図である。絞込属性ＤＢ１２２は、上記のように、テキストと曲の絞り込み属性との対応関係が複数登録されたデータベースである。曲の絞り込み属性は、イメージとは異なる曲の分類情報であり、図４に示すように、アーティスト名、性別、及び年代等を含む。例えば、テキストに「男性」を表す文字列が含まれている場合、キーワード／属性抽出部１０２は、絞り込み属性の１つとして「性別：男性」という属性情報を抽出して曲絞込部１０３に出力する。この場合、曲絞込部１０３は、プレイリストへの登録対象となる曲を、曲ＤＢ１２３に登録された全ての曲から男性アーティストの曲に絞り込む。 FIG. 4 is a diagram illustrating an example of the narrow-down attribute DB. As described above, the narrowing down attribute DB 122 is a database in which a plurality of correspondences between texts and song narrowing down attributes are registered. The song narrowing-down attribute is classification information of a song different from an image, and includes an artist name, gender, age, and the like as shown in FIG. For example, when the text includes a character string representing “male”, the keyword / attribute extracting unit 102 extracts attribute information “sex: male” as one of the narrowing-down attributes, and sends the attribute information to the song narrowing-down unit 103. Output. In this case, the song narrowing unit 103 narrows down the songs to be registered in the playlist from all the songs registered in the song DB 123 to songs of a male artist.

また、テキストに、アーティスト名を表す文字列、発表された年代を表す文字列が含まれている場合、それらの文字列と対応付けされた属性情報を抽出して曲絞込部１０３に通知する。 If the text includes a character string representing the artist name and a character string representing the date of publication, the attribute information associated with those character strings is extracted and notified to the song narrowing unit 103. .

図５は、曲ＤＢの例を示す図である。曲ＤＢ１２３は、上記のように、曲音声データと、曲属性と、曲のイメージ特徴量との対応関係が複数登録されたデータベースである。曲ＤＢ１２３では、曲ＩＤにより複数の曲音声データを識別する。１個の曲ＩＤには、１曲分の曲音声データと、当該曲音声データの曲属性と、当該曲音声データのイメージ特徴量とが対応付けられる。 FIG. 5 is a diagram illustrating an example of the music DB. The song DB 123 is a database in which a plurality of correspondences between song audio data, song attributes, and song image feature amounts are registered as described above. In the song DB 123, a plurality of song audio data is identified by the song ID. One song ID is associated with one piece of song audio data, a song attribute of the song audio data, and an image feature amount of the song audio data.

曲ＤＢ１２３の曲属性は、イメージとは異なる曲の分類情報であり、絞込属性ＤＢ１２２における絞り込み属性と対応する情報を含む。図５に示した曲ＤＢ１２３では、曲属性にタイトル、アーティスト名、性別、及び年代が含まれている。 The song attribute of the song DB 123 is classification information of a song different from the image, and includes information corresponding to the narrowing down attribute in the narrowing down attribute DB 122. In the song DB 123 shown in FIG. 5, the song attributes include the title, artist name, gender, and age.

曲ＤＢ１２３の曲のイメージ特徴量は、曲のイメージを数値化した情報である。この曲のイメージ特徴量は、例えば、ある１曲に対し１以上のイメージについてそれぞれどの程度当てはまるかを多段階評価により数値化した情報である。 The song image feature amount in the song DB 123 is information obtained by digitizing the song image. The image feature amount of the music is, for example, information obtained by quantifying the extent to which one or more images are applied to one music by multi-stage evaluation.

図６は、プレイリストＤＢの例を示す図である。プレイリストＤＢ１２４は、図６に示すように、プレイリスト毎にプレイリストの名称と、想定分布と、プレイリストの範囲とを対応付けて登録したデータベースである。プレイリストの想定分布は、上記のように、曲のイメージ特徴量に基づいて曲の分布を表す際のイメージ特徴空間におけるプレイリストの平均（中心）及び分散である。また、プレイリストの範囲は、イメージ特徴空間においてプレイリストへの登録対象となる曲のイメージ特徴量を規定する範囲である。プレイリストの範囲が円又は球（超球）とする場合、プレイリストＤＢ１２４におけるプレイリストの範囲は、図６に示したように中心からの半径で指定する。 FIG. 6 is a diagram illustrating an example of the playlist DB. As shown in FIG. 6, the playlist DB 124 is a database in which a playlist name, an assumed distribution, and a range of the playlist are registered in association with each playlist. As described above, the assumed distribution of the playlist is the average (center) and variance of the playlist in the image feature space when expressing the distribution of the song based on the image feature amount of the song. Further, the range of the playlist is a range that defines the image feature amount of the music to be registered in the playlist in the image feature space. When the range of the playlist is a circle or a sphere (super sphere), the range of the playlist in the playlist DB 124 is designated by a radius from the center as shown in FIG.

図７は、イメージ特徴量の例を示す図である。図８は、イメージ特徴空間を説明する図である。 FIG. 7 is a diagram illustrating an example of the image feature amount. FIG. 8 is a diagram illustrating an image feature space.

曲のイメージ特徴量は、図７に示すように、１以上のイメージの度合いを数値化した値で表現される。このイメージ特徴量を用いることにより、曲ＤＢ１２３に登録された各曲は、イメージ特徴空間内の１点で表される。また、各プレイリストの範囲は、例えばイメージ特徴空間内に円又は球（超球）で指定される。そして、ある１つのプレイリストを指定した場合、イメージ特徴空間内においてそのプレイリストの範囲内に含まれる曲が、そのプレイリストへの登録対象の曲となる。 As shown in FIG. 7, the image feature amount of a music piece is represented by a value obtained by digitizing the degree of one or more images. By using this image feature amount, each song registered in the song DB 123 is represented by one point in the image feature space. The range of each playlist is specified by a circle or a sphere (super sphere) in the image feature space, for example. Then, when a certain playlist is designated, the songs included in the range of the playlist in the image feature space are the songs to be registered in the playlist.

例えば、イメージ特徴量が第１のイメージ（イメージＨ）と、第２のイメージ（イメージＫ）の２個の場合、イメージ特徴空間は、図８に示したような二次元の空間（すなわち平面）で表される。 For example, when there are two image feature amounts, a first image (image H) and a second image (image K), the image feature space is a two-dimensional space (that is, a plane) as shown in FIG. It is represented by

図８に示したイメージ特徴空間は、横軸がイメージＨの特徴量を表す軸であり、縦軸がイメージＫの特徴量を表す軸である。また、図８に示したイメージ特徴空間では、各イメージの特徴量を−１．０から＋１．０までの範囲で示している。このイメージ特徴空間は、例えば、横軸（イメージＨ）を「快／不快」のイメージとし、縦軸（イメージＫ）を「興奮／鎮静」のイメージとする。このとき、横軸は、例えば、＋１．０を「快」とし、−１．０を「不快」として、快のイメージが強い曲ほどイメージ特徴空間の右方にマッピングされるようにする。また、縦軸は、例えば、＋１．０を「興奮」とし、−１．０を「鎮静」として、興奮のイメージが強い曲ほどイメージ特徴空間の上方にマッピングされるようにする。なお、イメージ特徴空間は、例えば、横軸（イメージＨ）を「喜／哀」のイメージとし、縦軸（イメージＫ）を「怒／楽」のイメージとしてもよい。 In the image feature space shown in FIG. 8, the horizontal axis is the axis representing the feature amount of the image H, and the vertical axis is the axis representing the feature amount of the image K. Further, in the image feature space shown in FIG. 8, the feature amount of each image is shown in a range from -1.0 to +1.0. In this image feature space, for example, the horizontal axis (image H) is an image of "pleasant / unpleasant", and the vertical axis (image K) is an image of "excitation / sedation". At this time, the abscissa indicates, for example, +1.0 as “pleasant” and −1.0 as “displeased”, so that a song having a stronger pleasant image is mapped to the right of the image feature space. On the vertical axis, for example, +1.0 is set to "excitation" and -1.0 is set to "sedation", so that a song having a stronger excitement image is mapped to an upper portion of the image feature space. In the image feature space, for example, the horizontal axis (image H) may be an image of "joy / sorrow" and the vertical axis (image K) may be an image of "anger / ease".

曲ＤＢ１２３に登録された全ての曲は、図８のイメージ特徴空間の１点（１個の黒丸）で表すことができる。例えば、イメージＨの特徴量が０．２であり、イメージＫの特徴量が０．７である曲は、点２０１で表される。また、イメージＨの特徴量が−０．４であり、イメージＫの特徴量が０．２である曲は、点２０２で表される。 All the songs registered in the song DB 123 can be represented by one point (one black circle) in the image feature space in FIG. For example, a song in which the feature amount of the image H is 0.2 and the feature amount of the image K is 0.7 is represented by a point 201. Also, a tune in which the feature amount of the image H is -0.4 and the feature amount of the image K is 0.2 is represented by a point 202.

また、プレイリストの範囲は、イメージ特徴空間内の１点を中心とする円で表される。そのため、図８に示すように、あるプレイリストの範囲３０１が円で表され、かつ絞り込み属性に基づいて絞り込まれた曲が黒丸で表されている場合、範囲３０１の円周で囲まれた領域に含まれる５個の点と対応する５曲が、そのプレイリストへの登録対象となる。また、範囲３０２で表されるプレイリストへの登録対象となる曲は、範囲３０２の円周で囲まれた領域に含まれる３個の点と対応する３曲となる。なお、音声認識部で得たテキストにプレイリストのイメージとともに曲の絞り込み属性が含まれる場合、プレイリストへの登録対象は、プレイリストの範囲内に特徴量を持ち、かつ絞り込み属性と合致する曲属性を持つ曲のみが登録対象となる。例えば、テキストに特徴量の範囲が範囲３０１の円で表されるプレイリストのイメージとともに絞り込み属性が含まれる場合、範囲３０１内の５曲のうち絞り込み属性と合致する曲属性を持つ曲のみが当該プレイリストへの登録対象となる。 The range of the playlist is represented by a circle centered on one point in the image feature space. Therefore, as shown in FIG. 8, when a range 301 of a certain playlist is represented by a circle and a song narrowed down based on the narrowing down attribute is represented by a black circle, an area surrounded by the circumference of the range 301 Are to be registered in the play list. The songs to be registered in the playlist represented by the range 302 are three songs corresponding to the three points included in the area surrounded by the circumference of the range 302. When the text obtained by the voice recognition unit includes a song narrowing attribute together with the image of the playlist, the registration target for the playlist is a song having a feature amount within the range of the playlist and matching the narrowing attribute. Only songs with attributes are registered. For example, if the text includes a narrowing attribute together with a playlist image in which the range of the feature amount is represented by a circle in the range 301, only songs having a song attribute that matches the narrowing attribute among the five songs in the range 301 are included in the text. It becomes a registration target for the playlist.

図９Ａは、第１の実施形態に係るプレイリスト作成処理を説明するフローチャート（その１）である。図９Ｂは、第１の実施形態に係るプレイリスト作成処理を説明するフローチャート（その２）である。 FIG. 9A is a flowchart (part 1) illustrating a playlist creation process according to the first embodiment. FIG. 9B is a flowchart (part 2) illustrating the playlist creation process according to the first embodiment.

本実施形態のプレイリスト作成装置１は、図９Ａ及び図９Ｂに示したような、プレイリスト再生処理を含むプレイリスト作成処理を行う。なお、図９Ａ及び図９Ｂには、音声によるプレイリストの生成要求の入力を受け付ける場合のプレイリスト作成処理を示している。 The playlist creation device 1 of the present embodiment performs a playlist creation process including a playlist playback process as shown in FIGS. 9A and 9B. 9A and 9B show a playlist creation process in the case of receiving an input of a playlist creation request by voice.

プレイリスト作成処理において、プレイリスト作成装置１は、まず、音声信号を取得しテキストに変換する処理を行う（ステップＳ１）。ステップＳ１の処理は、要求プレイリスト入力部１０１のマイク及び音声認識部が行う。マイクは、プレイリスト作成装置１の周囲を伝播する音を収音し、音声信号に変換して音声認識部に出力する。音声認識部は、音声信号に対し既知の音声認識処理を行う。例えば、音声認識部は、まず音声信号に有意な音声が含まれるか否かを判定する。そして、音声信号に有意な音声が含まれる場合、音声認識部は、当該有意な音声の部分をテキストに変換する。 In the playlist creation process, the playlist creation device 1 first performs a process of acquiring an audio signal and converting it into text (step S1). The process of step S1 is performed by the microphone and the voice recognition unit of the request playlist input unit 101. The microphone collects sound propagating around the playlist creation device 1, converts the sound into a sound signal, and outputs the sound signal to the sound recognition unit. The voice recognition unit performs a known voice recognition process on the voice signal. For example, the voice recognition unit first determines whether a significant voice is included in the voice signal. Then, when a significant voice is included in the voice signal, the voice recognition unit converts the significant voice portion into text.

要求プレイリスト入力部１０１は、上記の音声認識処理の後、テキストが得られたか否かを判定する（ステップＳ２）。テキストが得られなかった場合（ステップＳ２；Ｎｏ）、要求プレイリスト入力部１０１は、テキストが得られるまでステップＳ１の処理を繰り返す。 After the above-described speech recognition processing, the request playlist input unit 101 determines whether or not a text has been obtained (step S2). If the text has not been obtained (Step S2; No), the request playlist input unit 101 repeats the process of Step S1 until the text is obtained.

一方、テキストが得られた場合（ステップＳ２；Ｙｅｓ）、要求プレイリスト入力部１０１は、得られたテキストを、利用者からのプレイリストの生成要求としてキーワード／属性抽出部１０２に出力する。要求プレイリスト入力部１０１からテキストを受け取ると、キーワード／属性抽出部１０２は、受け取ったテキストに基づいて検索キーワード及び絞り込み属性を抽出する処理を行う（ステップＳ３）。ステップＳ３において、キーワード／属性抽出部１０２は、まず、図３に示したキーワードＤＢ１２１を検索し、受け取ったテキストと合致するテキストの有無を調べる。そして、受け取ったテキストと合致するテキストがキーワードＤＢ１２１に登録されている場合、キーワード／属性抽出部１０２は、合致したテキストと対応付けられている検索キーワードをキーワードＤＢ１２１から抽出する。キーワード／属性抽出部１０２は、抽出した検索キーワードを優先度算出部１０５及びナレーション作成部１０７に出力する。 On the other hand, when the text is obtained (Step S2; Yes), the requested playlist input unit 101 outputs the obtained text to the keyword / attribute extraction unit 102 as a playlist generation request from the user. Upon receiving the text from the request playlist input unit 101, the keyword / attribute extraction unit 102 performs a process of extracting a search keyword and a narrowing attribute based on the received text (step S3). In step S3, the keyword / attribute extraction unit 102 first searches the keyword DB 121 shown in FIG. 3 to check for a text that matches the received text. Then, when a text that matches the received text is registered in the keyword DB 121, the keyword / attribute extraction unit 102 extracts a search keyword associated with the matched text from the keyword DB 121. The keyword / attribute extraction unit 102 outputs the extracted search keywords to the priority calculation unit 105 and the narration creation unit 107.

また、ステップＳ３において、キーワード／属性抽出部１０２は、図４に示した絞込属性ＤＢ１２２を検索し、受け取ったテキストと合致する絞り込み属性の有無を調べる。そして、受け取ったテキストと合致するテキストが絞込属性ＤＢ１２２に登録されている場合、キーワード／属性抽出部１０２は、合致したテキストと対応付けられている絞り込み属性を絞込ＤＢ１２２から抽出する。キーワード／属性抽出部１０２は、抽出した絞り込み属性を曲絞込部１０３に出力する。 Also, in step S3, the keyword / attribute extraction unit 102 searches the narrowed-down attribute DB 122 shown in FIG. 4 and checks whether there is a narrowed-down attribute that matches the received text. Then, when a text that matches the received text is registered in the narrow-down attribute DB 122, the keyword / attribute extraction unit 102 extracts a narrow-down attribute associated with the matched text from the narrow-down DB 122. The keyword / attribute extraction unit 102 outputs the extracted refinement attributes to the music refinement unit 103.

ステップＳ３の処理を終えると、プレイリスト生成装置１は、次に、プレイリストへの登録対象となる曲の曲ＩＤを、全ての曲ＩＤから絞り込み属性と合致する曲属性を持つ曲ＩＤに絞り込む処理を行う（ステップＳ４）。ステップＳ４の処理は、曲絞込部１０３が行う。曲絞込部１０３は、キーワード／属性抽出部１０２から絞り込み属性を受け取ると、曲ＤＢ１２３を参照して絞り込み属性と合致する曲属性を持つ曲ＩＤを抽出し、曲ＩＤを絞り込む。曲絞込部１０３は、絞り込んだ曲ＩＤと、その曲ＩＤに対応付けられた曲のイメージ特徴量を推薦プレイリスト候補生成部１０４に出力する。 When the processing of step S3 is completed, the playlist generation device 1 then narrows down the song IDs of the songs to be registered in the playlist to song IDs having song attributes matching the narrowing attributes from all song IDs. Processing is performed (step S4). The process of step S4 is performed by the music narrowing unit 103. Upon receiving the narrowing down attribute from the keyword / attribute extracting unit 102, the song narrowing down unit 103 extracts a song ID having a song attribute matching the narrowing down attribute by referring to the song DB 123, and narrows down the song ID. The song narrowing unit 103 outputs the narrowed song ID and the image feature amount of the song associated with the song ID to the recommended playlist candidate generation unit 104.

曲絞込部１０３から曲ＩＤ及びイメージ特徴量を受け取ると、推薦プレイリスト候補生成部１０４は、当該曲ＩＤ及びイメージ特徴量、並びにプレイリストＤＢ１２４に基づいて、推薦プレイリスト候補を生成する（ステップＳ５）。ステップＳ５の処理において、推薦プレイリスト候補生成部１０４は、プレイリストＤＢ１２４に登録された各プレイリストの範囲と、イメージ特徴量とを比較し、範囲内にイメージ特徴量が含まれるプレイリストを抽出する。すなわち、推薦プレイリスト候補生成部１０４は、図８に示したようなイメージ特徴空間に各曲のイメージ特徴量を表す点とプレイリストの範囲とを表した場合に範囲内にイメージ特徴量を表す点が含まれるプレイリストを、推薦プレイリスト候補とする。推薦プレイリスト候補生成部１０４は、生成した推薦プレイリスト候補を優先度算出部１０５に出力する。 Upon receiving the song ID and the image feature amount from the song narrowing unit 103, the recommended playlist candidate generation unit 104 generates a recommended playlist candidate based on the song ID and the image feature amount and the playlist DB 124 ( Step S5). In the process of step S5, the recommended playlist candidate generation unit 104 compares the range of each playlist registered in the playlist DB 124 with the image feature amount, and determines a playlist including the image feature amount in the range. Extract. That is, the recommended playlist candidate generation unit 104 indicates the image feature amount of each song and the range of the playlist in the image feature space as illustrated in FIG. A playlist including points is set as a recommended playlist candidate. The recommended playlist candidate generation unit 104 outputs the generated recommended playlist candidates to the priority calculation unit 105.

なお、図示は省略しているが、生成した推薦プレイリスト候補が１個である場合、推薦プレイリスト候補生成部１０４は、生成した推薦プレイリスト候補をプレイリスト決定部１０６に出力する。 Although not shown, when the number of generated recommended playlist candidates is one, the recommended playlist candidate generating unit 104 outputs the generated recommended playlist candidates to the playlist determining unit 106.

優先度算出部１０５は、キーワード／属性抽出部１０２から検索キーワードを受け取るとともに、推薦プレイリスト候補部１０５から推薦プレイリスト候補を受け取ると、推薦プレイリスト候補についての推薦優先度を算出する（ステップＳ６）。ステップＳ６の処理において、優先度算出部１０５は、認識ずれ量算出部１０５ａで認識ずれ量を算出するとともに、要求ずれ量算出部１０５ｂで要求ずれ量を算出する。そして、優先度算出部１０５は、ずれ量統合部１０５ｄで認識ずれ量と要求ずれ量とを統合して各推薦プレイリスト候補の推薦優先度を算出する。優先度算出部１０５は、算出した推薦優先度をプレイリスト決定部１０６に出力する。 Upon receiving the search keyword from the keyword / attribute extraction unit 102 and receiving the recommended playlist candidate from the recommended playlist candidate unit 105, the priority calculation unit 105 calculates a recommendation priority for the recommended playlist candidate (step S6). ). In the process of step S6, the priority calculation unit 105 calculates the recognition error amount by the recognition error amount calculation unit 105a, and calculates the required error amount by the required error amount calculation unit 105b. Then, the priority calculation unit 105 calculates the recommendation priority of each recommended playlist candidate by integrating the recognition shift amount and the requested shift amount by the shift amount integrating unit 105d. The priority calculation unit 105 outputs the calculated recommendation priority to the playlist determination unit 106.

優先度算出部１０５が算出した推薦優先度を受け取ると、プレイリスト決定部１０６は、推薦優先度に基づいて推薦するプレイリストを決定する（ステップＳ７）。ステップＳ７の処理において、プレイリスト決定部１０６は、推薦優先度の最も高いプレイリストを推薦するプレイリストに決定する。この際、プレイリスト決定部１０６は、プレイリストの名称のみを変更してもよいし、プレイリスト自体を推薦するプレイリストに変更して登録する曲を変更してもよい。プレイリスト決定部１０６は、決定した推薦プレイリストの名称をナレーション作成部１０７（ナレーション文章作成部１０７ａ）に出力するとともに、当該推薦プレイリストに含まれる曲ＩＤをプレイリスト再生部１０８に出力する。なお、推薦プレイリスト候補に要求プレイリストが含まれており、かつ要求プレイリストの推薦優先度が最も高い場合、推薦するプレイリストは要求プレイリスト（すなわち利用者が要求したプレイリスト）となる。 Upon receiving the recommendation priority calculated by the priority calculation unit 105, the playlist determination unit 106 determines a playlist to be recommended based on the recommendation priority (Step S7). In the processing of step S7, the playlist determination unit 106 determines the playlist having the highest recommendation priority as the recommended playlist. At this time, the play list determination unit 106 may change only the name of the play list, or may change the play list itself to a recommended play list and change the tune to be registered. The playlist determination unit 106 outputs the name of the determined recommended playlist to the narration creation unit 107 (narration text creation unit 107a), and outputs the song ID included in the recommended playlist to the playlist playback unit 108. If the recommended playlist candidate includes the required playlist and the recommended priority of the required playlist is the highest, the recommended playlist is the required playlist (that is, the playlist requested by the user).

次に、プレイリスト生成装置１では、図９Ｂに示すように、推薦プレイリストについてのナレーション音声を作成する（ステップＳ８）。ステップＳ８の処理は、ナレーション作成部１０７が行う。ナレーション作成部１０７は、まず、ナレーション文章作成部１０７ａにおいてナレーション文章を作成する。ナレーション文章作成部１０７ａは、推薦プレイリストの名称が先に取得した検索キーワード（又は検索キーワードにより特定されるプレイリストの名称）と合致するか否かを判定する。その判定結果に基づいて、ナレーション文章作成部１０７ａは、利用者に通知するナレーション文章のテキストを作成する。推薦プレイリストの名称が検索キーワードと合致する場合、ナレーション文章作成部１０７ａは、検索キーワードのプレイリストを再生することを通知するテキストを作成する。一方、推薦プレイリストの名称が検索キーワードと合致しない場合、ナレーション文章作成部１０７ａは、例えば、検索キーワードのプレイリストを作成することができなかったため推薦プレイリストを再生することを通知するテキストを作成する。ナレーション文章作成部１０７ａは、作成したナレーション文章のテキストをナレーション音声作成部１０７ｂに出力する。ナレーション音声作成部１０７ｂは、既知の音声合成技術を用い、受け取ったナレーション文章のテキストからナレーション音声データを作成する。ナレーション音声作成部１０７ｂは、作成したナレーション音声データをプレイリスト再生部１０８に出力する。なお、ナレーション文章作成部１０７ａで作成したナレーション文章を音声データに変換せず、液晶ディスプレイ等の表示装置に表示させてもよい。 Next, as shown in FIG. 9B, the playlist generation device 1 creates a narration voice for the recommended playlist (step S8). The process of step S8 is performed by the narration creation unit 107. The narration creation unit 107 first creates a narration sentence in the narration sentence creation unit 107a. The narration sentence creating unit 107a determines whether or not the name of the recommended playlist matches the previously acquired search keyword (or the name of the playlist specified by the search keyword). Based on the determination result, the narration text creating unit 107a creates a narration text to be notified to the user. If the name of the recommended playlist matches the search keyword, the narration sentence creation unit 107a creates a text notifying that the playlist of the search keyword is to be played. On the other hand, if the name of the recommended playlist does not match the search keyword, the narration sentence creating unit 107a creates a text notifying that the recommended playlist is to be reproduced because, for example, the playlist of the search keyword could not be created. I do. The narration text creation unit 107a outputs the text of the created narration text to the narration voice creation unit 107b. The narration speech creation unit 107b creates narration speech data from the text of the received narration sentence using a known speech synthesis technique. The narration sound creation unit 107b outputs the created narration sound data to the playlist reproduction unit 108. Note that the narration text created by the narration text creation unit 107a may be displayed on a display device such as a liquid crystal display without being converted into voice data.

その後、プレイリスト生成装置１では、推薦プレイリストに含まれる曲ＩＤ及びナレーション音声を受け取ったプレイリスト再生部１０８が、ナレーション音声データ及び推薦プレイリストに含まれる曲ＩＤの曲音声データを再生する（ステップＳ９）。ステップＳ９の処理において、プレイリスト再生部１０８は、まず、ナレーション音声データを再生する。その後、プレイリスト再生部１０８は、推薦プレイリストに含まれる曲ＩＤの曲音声データを所定の順序で順次再生する。プレイリスト再生部１０８が再生して得られたナレーション音声の音声信号及び曲音声の音声信号は、音声出力部（スピーカ）１０９によりプレイリスト生成装置１の外部に向けて音声として出力される。 Thereafter, in the playlist generation device 1, the playlist playback unit 108 that has received the song ID and the narration voice included in the recommended playlist plays back the narration voice data and the song voice data with the song ID included in the recommended playlist ( Step S9). In the process of step S9, the playlist reproduction unit 108 first reproduces narration audio data. Thereafter, the play list reproducing unit 108 sequentially reproduces the music sound data of the music IDs included in the recommended play list in a predetermined order. The voice signal of the narration voice and the voice signal of the tune voice obtained by the playback of the playlist playback unit 108 are output as audio to the outside of the playlist generation device 1 by the audio output unit (speaker) 109.

図１０は、プレイリストの変更方法の例を説明する図である。
ステップＳ７において決定した推薦プレイリストと要求プレイリストとの包含関係は、例えば、図１０の（ａ）から（ｃ）までに示したような関係になる。図１０の（ａ）の例では、推薦プレイリストＬ_２の範囲３１２が要求プレイリストＬ_１の範囲３１１に内包されている。このような場合、黒丸で示した推薦プレイリストＬ_２の範囲３１２内に含まれる曲は、全て要求プレイリストＬ１の範囲３１１内に含まれる。そのため、推薦プレイリストＬ_２の範囲３１２が要求プレイリストＬ_１の範囲３１１に内包されている場合、プレイリスト決定部１０６は、利用者に提示するプレイリストの名称のみを変更してもよいし、プレイリスト自体を推薦プレイリストＬ_２に変更してもよい。 FIG. 10 is a diagram illustrating an example of a method of changing a playlist.
The inclusion relationship between the recommended playlist and the requested playlist determined in step S7 is, for example, the relationship shown in (a) to (c) of FIG. In the example of FIG. 10 (a), the range 312 recommendation playlist _{L 2} is encapsulated in the range 311 requests the playlist _{L 1.} In such cases, the recommendation songs included in the playlist L ₂ of the range 312 shown by black circles are included within the scope 311 of all requests playlist L1. Therefore, if the range 312 recommendation playlist L ₂ is encapsulated in the range 311 requests the playlist L _1, playlist determination unit 106 may be changed only names of the playlist to be presented to the user , it may change the play list itself on the recommendation playlist L _2.

一方、図１０の（ｂ）に示すように、推薦プレイリストＬ_２の範囲３１２の一部のみが要求プレイリストＬ_１の範囲３１１と重なっている場合、黒丸で示した重なり領域内の曲のみが登録されたプレイリストを作成することも可能である。しかしながら、この場合、推薦プレイリストＬ_２の範囲３１２内においてもプレイリストに登録される曲の分布に偏りがあるため、プレイリストの名称のみを変更すると、推薦プレイリストＬ_２のイメージと再生される曲のイメージとに齟齬が生じる可能性がある。そのため、推薦プレイリストＬ_２の範囲３１２の一部のみが要求プレイリストＬ_１の範囲３１１と重なる場合、プレイリスト自体を、図１０の（ｃ）に示すように、推薦プレイリストＬ_２の範囲３１２内に含まれる曲を全て登録したプレイリストに変更してもよい。図１０の（ｃ）に示した例では、推薦プレイリストＬ_２の範囲３１２内における曲の偏りの度合いが小さい。そのため、推薦プレイリストＬ_２のイメージと再生される曲のイメージとの齟齬が解消され、利用者に与える違和感を低減することができる。 On the other hand, as shown in (b) of FIG. 10, when only a part of the recommended playlist L ₂ in the range 312 overlaps the range 311 requests the playlist L _1, only songs overlap region indicated by black circles It is also possible to create a playlist in which is registered. However, in this case, because there is a bias in the distribution of music, which is registered in the play list is also within the scope 312 of the recommendation playlist L _2, when changing only the name of the playlist, the playback of the recommended playlist L ₂ Image May be inconsistent with the image of the song. Therefore, when only a part of the recommended playlist L ₂ in the range 312 overlaps the range 311 requests the playlist L _1, the play list itself, as shown in (c) of FIG. 10, the range of the recommended playlist L ₂ All the songs included in 312 may be changed to a registered playlist. In the example shown in (c) of FIG. 10, a small degree of deviation of the songs in the range 312 recommendation playlist L _2. For this reason, and eliminates the discrepancy between the image of the song to be played back with the recommendation playlist L ₂ image, it is possible to reduce the uncomfortable feeling to the user.

このように、本実施形態に係るプレイリスト生成処理では、利用者（オペレータ）が要求した曲の絞り込み属性に基づいて利用者に推薦するプレイリストの候補を生成し、利用者からの要求に応じて生成される要求プレイリストと比較する。そして、推薦するプレイリストの候補のなかに要求プレイリストよりも優先度の高いプレイリストが存在する場合、優先度の高いプレイリストの名称を利用者に提示する。これにより、利用者は、プレイリストの名称を把握した上で再生される曲を聞くことができる。また、優先度の高いプレイリストがもつイメージは、利用者が要求したプレイリストがもつイメージと比べて、再生される曲のイメージにより近い。そのため、本実施形態に係るプレイリスト生成処理によれば、プレイリストがもつイメージとプレイリストに従って再生される曲のイメージとの齟齬により利用者に与える違和感を低減させることが可能となる。更に、プレイリストの名称だけでなく、プレイリスト自体も変更することで、図１０に示したように、名称の変更だけではイメージの齟齬が生じるような状況でも、イメージの齟齬を解消し、利用者に与える違和感を低減させることが可能となる。また、本実施形態に係るプレイリスト生成処理では、再生される曲のイメージにより近いプレイリストの名称を利用者に提示するため、利用者がイメージと合致した曲を再生させるためのキーワードを試行錯誤する手間を省くことが可能となる。 As described above, in the playlist generation processing according to the present embodiment, a playlist candidate to be recommended to the user is generated based on the song narrowing down attribute requested by the user (operator), and the playlist is generated in response to the request from the user. And compare it with the requested playlist generated. Then, when a playlist having a higher priority than the request playlist exists among the candidates for the playlist to be recommended, the name of the playlist having a higher priority is presented to the user. As a result, the user can listen to the tune to be reproduced after grasping the name of the playlist. Also, the image of the playlist with a higher priority is closer to the image of the music to be reproduced than the image of the playlist requested by the user. Therefore, according to the playlist generation processing according to the present embodiment, it is possible to reduce a sense of discomfort given to the user due to a discrepancy between the image of the playlist and the image of the music reproduced according to the playlist. Further, by changing not only the name of the playlist but also the playlist itself, as shown in FIG. It is possible to reduce the discomfort given to the person. Also, in the playlist generation process according to the present embodiment, in order to present to the user the name of the playlist closer to the image of the song to be reproduced, the user must perform trial and error on the keyword for reproducing the song that matches the image. It is possible to save the trouble of performing.

また、本実施形態に係るプレイリスト生成処理によれば、利用者は、自身が要求したプレイリストの名称に対する、より適切な名称を知ることが可能となる。そのため、利用者は、自身の抱くイメージに合ったプレイリストを得るためのキーワードを容易に把握することが可能となる。例えば、「明るい曲」のプレイリストの要求に対しイメージの近い「涼やかな曲」のプレイリストを提示した場合、利用者は、「涼やかな曲」のプレイリストを要求した場合に再生される曲のイメージを知ることができる。そのため、利用者は、「明るい曲」及び「涼やかな曲」のいずれのプレイリストを要求すると自身の望むイメージにより近い曲が再生されるかを知ることができる。よって、次回以降、自身の望むイメージにより近いプレイリストの生成を要求することが容易となる。 Further, according to the playlist generation processing according to the present embodiment, the user can know a more appropriate name for the playlist name requested by the user. Therefore, the user can easily grasp the keyword for obtaining the playlist suitable for the image held by the user. For example, when a playlist of “cool songs” with a similar image is presented in response to a request for a playlist of “bright songs”, the user can select a song to be played when the playlist of “cool songs” is requested. You can know the image of. For this reason, the user can know which of the playlists “bright song” and “cool song” is played, and which song is closer to the image desired by the user. Therefore, it becomes easy to request generation of a playlist closer to the image desired by the user from the next time.

図１１は、検索キーワード及び絞り込み属性を抽出する処理の内容を説明するフローチャートである。 FIG. 11 is a flowchart illustrating the content of the process of extracting the search keyword and the narrowing down attribute.

本実施形態に係るプレイリスト生成処理における検索キーワード及び絞り込み属性を抽出する処理（ステップＳ３）は、キーワード／属性抽出部１０２が行う。キーワード／属性抽出部１０２は、図１１に示すように、まず、要求プレイリスト入力部１０１からテキストを取得する（ステップＳ３０１）。 The keyword / attribute extraction unit 102 performs the process (step S3) of extracting a search keyword and a narrowing attribute in the playlist generation process according to the present embodiment. As shown in FIG. 11, the keyword / attribute extraction unit 102 first obtains a text from the request playlist input unit 101 (step S301).

キーワード／属性抽出部１０２は、次に、取得したテキストをキー情報としてキーワードＤＢ１２１を検索し、検索キーワードを抽出する（ステップＳ３０２）。キーワードＤＢ１２１には、図３に示したように、テキストと検索キーワードとの対応関係が複数登録されている。ステップＳ３０２の処理において、キーワード／属性抽出部１０２は、取得したテキストがキーワードＤＢ１２１に登録されているか否かを調べる。この際、キーワード／属性抽出部１０２は、例えば、キーワードＤＢ１２１に登録された各キーワードに対して取得したキーワードによる正規表現検索を行い、取得したキーワードと一致するキーワードの有無を調べる。そして、取得したテキストがキーワードＤＢ１２１に登録されている場合、当該テキストと対応付けられたキーワードを抽出して検索キーワードとする。なお、音声認識部から複数のテキストを取得した場合、キーワード／属性抽出部１０２は、取得した複数のテキストのそれぞれについてキーワードＤＢ１２１に登録されているか否かを調べる。 Next, the keyword / attribute extraction unit 102 searches the keyword DB 121 using the obtained text as key information, and extracts a search keyword (step S302). As shown in FIG. 3, a plurality of correspondences between texts and search keywords are registered in the keyword DB 121. In the process of step S302, the keyword / attribute extraction unit 102 checks whether or not the acquired text is registered in the keyword DB 121. At this time, for example, the keyword / attribute extraction unit 102 performs a regular expression search using the acquired keyword for each keyword registered in the keyword DB 121, and checks whether there is a keyword that matches the acquired keyword. If the acquired text is registered in the keyword DB 121, a keyword associated with the text is extracted and used as a search keyword. When a plurality of texts are obtained from the voice recognition unit, the keyword / attribute extraction unit 102 checks whether or not each of the obtained texts is registered in the keyword DB 121.

キーワード／属性抽出部１０２は、次に、取得したテキストをキー情報として絞込属性ＤＢ１２２を検索し、曲の絞り込み属性を抽出する（ステップＳ３０３）。絞込属性ＤＢ１２２には、図４に示したように、テキストと曲の絞込属性との対応関係が複数登録されている。ステップＳ３０３の処理において、キーワード／属性抽出部１０２は、取得したテキストが絞込属性ＤＢ１２２に登録されているか否かを調べる。この際、キーワード／属性抽出部１０２は、例えば、絞込属性ＤＢ１２２に登録された各キーワードに対して取得したキーワードによる正規表現検索を行い、取得したキーワードと一致するキーワードの有無を調べる。そして、取得したテキストが絞込属性ＤＢ１２２に登録されている場合、当該テキストと対応付けられた曲属性を抽出する。なお、音声認識部から複数のテキストを取得した場合、キーワード／属性抽出部１０２は、取得した複数のテキストのそれぞれについて絞込属性ＤＢ１２２に登録されているか否かを調べる。 Next, the keyword / attribute extraction unit 102 searches the narrowing-down attribute DB 122 using the obtained text as key information, and extracts the narrowing-down attribute of the music (step S303). As shown in FIG. 4, a plurality of correspondences between texts and song narrowing-down attributes are registered in the narrowing-down attribute DB 122. In the process of step S303, the keyword / attribute extraction unit 102 checks whether or not the acquired text is registered in the narrow-down attribute DB 122. At this time, for example, the keyword / attribute extraction unit 102 performs a regular expression search using the acquired keyword for each keyword registered in the narrowed-down attribute DB 122, and checks whether there is a keyword that matches the acquired keyword. Then, when the acquired text is registered in the narrow-down attribute DB 122, a song attribute associated with the text is extracted. When a plurality of texts are obtained from the voice recognition unit, the keyword / attribute extraction unit 102 checks whether or not each of the obtained texts is registered in the narrow-down attribute DB 122.

図１２は、推薦優先度を算出する処理の内容を説明するフローチャートである。
本実施形態に係るプレイリスト生成処理における推薦優先度を算出する処理（ステップＳ６）は、優先度算出部１０５が行う。優先度算出部１０５は、図１２に示すように、まず、検索キーワード及び推薦プレイリスト候補を取得する（ステップＳ６０１）。優先度算出部１０５は、キーワード／属性抽出部１０２から検索キーワードを取得し、推薦プレイリスト候補生成部１０４から推薦プレイリスト候補を取得する。取得した検索キーワードはプレイリスト検索部１０５ｃに入力される。また、取得した推薦プレイリスト候補は、認識ずれ量算出部１０５ａ及び要求ずれ量算出部１０５ｂに入力される。 FIG. 12 is a flowchart illustrating the content of the process of calculating the recommendation priority.
The process of calculating the recommendation priority in the playlist generation process according to the present embodiment (step S6) is performed by the priority calculation unit 105. As shown in FIG. 12, the priority calculation unit 105 first obtains a search keyword and a recommended playlist candidate (step S601). The priority calculation unit 105 acquires a search keyword from the keyword / attribute extraction unit 102, and acquires a recommended playlist candidate from the recommended playlist candidate generation unit 104. The acquired search keyword is input to the playlist search unit 105c. The acquired recommended playlist candidate is input to the recognition shift amount calculation unit 105a and the request shift amount calculation unit 105b.

優先度算出部１０５は、次に、認識ずれ量を算出する処理（ステップＳ６０２）と、要求ずれ量を算出する処理（ステップＳ６０３，Ｓ６０４）とを行う。 Next, the priority calculation unit 105 performs a process of calculating the recognition shift amount (Step S602) and a process of calculating the required shift amount (Steps S603 and S604).

認識ずれ量を算出する処理（ステップＳ６０２）は、認識ずれ量算出部１０５ａが行う。認識ずれ量算出部１０５ａは、認識ずれ量として、推薦プレイリスト候補における曲ＩＤに対応したイメージ特徴量に基づく正規分布と、当該推薦プレイリスト候補の想定している曲の正規分布とのずれ量を算出する。例えば、認識ずれ量算出部１０５ａは、ｉ番目の推薦プレイリスト候補における曲ＩＤに対応した曲のイメージ特徴量に基づく正規分布Ｐ_ｉと、当該推薦プレイリスト候補の想定している曲の正規分布Ｑ_ｉとのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）を算出する。ＫＬダイバージェンスは、２つの分布の差異を測る尺度であり、２つの正規分布Ｐ，ＱについてのＫＬダイバージェンスＤ_ＫＬ（Ｐ||Ｑ）は、下記式（１）で表される。 The process of calculating the recognition shift amount (step S602) is performed by the recognition shift amount calculation unit 105a. The recognition shift amount calculation unit 105a calculates, as the recognition shift amount, the shift amount between the normal distribution based on the image feature amount corresponding to the song ID in the recommended playlist candidate and the normal distribution of the song assumed by the recommended playlist candidate. Is calculated. For example, the recognition shift amount calculation unit 105a calculates the normal distribution P _i based on the image feature amount of the song corresponding to the song ID in the i-th recommended playlist candidate and the normal distribution P _i of the song assumed by the recommended playlist candidate. calculating a Q _i and the KL divergence _{_{_{D KL (P i || Q i}}} ). KL divergence is a measure for measuring the difference between two distributions, and KL divergence D _KL (P || Q) for two normal distributions P and Q is represented by the following equation (1).

式（１）におけるμ_Ｐ及びΣ_Ｐは、それぞれ、ｄ次元正規分布Ｐの平均及び分散である。また、式（１）におけるμ_Ｑ及びΣ_Ｑは、それぞれ、ｄ次元正規分布Ｑの平均及び分散である。 Μ _P and Σ _P in Equation (1) are the mean and variance of the d-dimensional normal distribution P, respectively. Further, μ _Q and Σ _Q in the equation (1) are the average and the variance of the d-dimensional normal distribution Q, respectively.

なお、認識ずれ量として上記のＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）を算出する場合、認識ずれ量算出部１０５ａは、ｉ番目の推薦プレイリスト候補における曲ＩＤに基づいて、曲ＤＢ１２３から各曲ＩＤと対応付けられた曲のイメージ特徴量を取得する。 When calculating the KL divergence D _KL (P _i || Q _i ) as the recognition shift amount, the recognition shift amount calculation unit 105a uses the song DB 123 based on the song ID in the i-th recommended playlist candidate. , The image feature amount of the song associated with each song ID is obtained.

認識ずれ量算出部１０５ａは、算出した認識ずれ量をずれ量統合部１０５ｄに出力する。 The recognition shift amount calculation unit 105a outputs the calculated recognition shift amount to the shift amount integration unit 105d.

一方、要求ずれ量を算出する処理（ステップＳ６０３，Ｓ６０４）は、プレイリスト検索部１０５ｃと要求ずれ量算出部１０５ｂとが行う。 On the other hand, the process of calculating the required shift amount (steps S603 and S604) is performed by the playlist search unit 105c and the requested shift amount calculation unit 105b.

プレイリスト検索部１０５ｃは、プレイリストＤＢ１２４を検索し、取得した検索キーワードと対応するプレイリストを抽出する（ステップＳ６０３）。プレイリスト検索部１０５ｃは、検索キーワードと対応するプレイリストを抽出すると、要求ずれ量算出部１０５ｂは、この際、プレイリスト検索部１０５ｃは、例えば、プレイリストＤＢ１２４に登録された名称に対して取得した検索キーワードによる正規表現検索を行い、名称が検索キーワードと一致するプレイリストの想定分布及び範囲を抽出する。プレイリスト検索部１０５ｃは、抽出したプレイリストの名称（検索キーワード）、想定分布、及び範囲を要求ずれ量算出部１０５ｂに出力する。 The playlist search unit 105c searches the playlist DB 124 and extracts a playlist corresponding to the acquired search keyword (step S603). When the playlist search unit 105c extracts the playlist corresponding to the search keyword, the request deviation amount calculation unit 105b, at this time, the playlist search unit 105c, for example, searches for the name registered in the playlist DB 124. A regular expression search is performed using the obtained search keyword, and an assumed distribution and range of a playlist whose name matches the search keyword are extracted. The playlist search unit 105c outputs the name (search keyword), the assumed distribution, and the range of the extracted playlist to the request deviation amount calculation unit 105b.

要求ずれ量算出部１０５ｂは、要求ずれ量として、推薦プレイリスト候補の想定している正規分布と、プレイリスト検索部１０５ｃから受け取ったプレイリストの想定している正規分布（想定分布）とのずれ量を算出する。例えば、要求ずれ量算出部１０５ｂは、ｉ番目の推薦プレイリスト候補の想定している曲の正規分布Ｐ_ｉと、検索キーワードに基づくプレイリストの想定している曲の正規分布ＲとのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｒ）を算出する。正規分布Ｐ_ｉ，ＲについてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｒ）は、式（１）を用いて算出する。 The request deviation amount calculation unit 105b calculates, as the request deviation amount, a deviation between the normal distribution assumed for the recommended playlist candidate and the normal distribution assumed for the playlist received from the playlist search unit 105c (assumed distribution). Calculate the amount. For example, the request deviation amount calculation unit 105b calculates the KL divergence of the normal distribution P _{i of the} music assumed in the i-th recommended playlist candidate and the normal distribution R of the music assumed in the playlist based on the search keyword. _DKL (P _i || R) is calculated. The KL divergence D _KL (P _i || R) for the normal distribution P _i , R is calculated using equation (1).

要求ずれ量算出部１０５ｂは、算出した要求ずれ量をずれ量統合部１０５ｄに出力する。 The request deviation amount calculation unit 105b outputs the calculated request deviation amount to the deviation amount integration unit 105d.

こうして認識ずれ量及び要求ずれ量を算出した後、優先度算出部１０５では、ずれ量統合部１０５ｄが認識ずれ量及び要求ずれ量に基づいて推薦優先度を算出する（ステップＳ６０５）。ステップＳ６０５において、ずれ量統合部１０５ｄは、認識ずれ量と要求ずれ量とを統合し、２つの値の重み付き和で表される推薦優先度を算出する。ずれ量統合部１０５ｄは、例えば、下記式（２）によりｉ番目の推薦プレイリスト候補の推薦優先度ＰＰ_ｉを算出する。 After calculating the recognition shift amount and the required shift amount in this way, in the priority calculating unit 105, the shift amount integrating unit 105d calculates the recommendation priority based on the recognition shift amount and the required shift amount (step S605). In step S605, the deviation amount integration unit 105d integrates the recognition deviation amount and the required deviation amount, and calculates a recommendation priority represented by a weighted sum of the two values. Shift amount integration unit 105d calculates, for example, the recommendation priority PP _i of the i-th recommendation playlist candidate by the following equation (2).

ＰＰ_ｉ＝−ｗ１・Ｄ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）−ｗ２・Ｄ_ＫＬ（Ｐ_ｉ||Ｒ）・・・（２） PP _i = −w1 · D _KL (P _i || Q _i ) −w 2 · D _KL (P _i || R) (2)

式（２）におけるｗ１，ｗ２は、それぞれのずれ量（ＫＬダイバージェンス）に対する重み付け値である。重み付け値ｗ１，ｗ２の組み合わせは任意である。ただし、利用者の要求に応えないとユーザビリティが低下する可能性があるため、例えば、ｗ１＝０．４，ｗ２＝０．６等、ｗ１を小さくしｗ２を大きくする。また、本実施形態のプレイリスト生成処理の適用前後における動作の大きな変化を抑制したい場合には、例えば、ｗ１＝０．１，ｗ２＝０．９等、ｗ１の値を更に小さくしｗ２の値を大きくしてもよい。また、重み付け値ｗ１，ｗ２の組み合わせは、例えば、前回のプレイリスト生成処理からの経過時間（日数）に応じて変えるようにしてもよい。更に、重み付け値ｗ１，ｗ２の組み合わせは、利用者自身が適宜調整できるようにしてもよい。 W1 and w2 in Expression (2) are weighting values for the respective shift amounts (KL divergence). The combination of the weight values w1 and w2 is arbitrary. However, there is a possibility that usability will deteriorate if the user's request is not met. For example, w1 is reduced and w2 is increased, for example, w1 = 0.4, w2 = 0.6. When it is desired to suppress a large change in the operation before and after the application of the playlist generation processing of the present embodiment, for example, the value of w1 is further reduced, such as w1 = 0.1, w2 = 0.9, and the value of w2. May be increased. Further, the combination of the weight values w1 and w2 may be changed according to, for example, the elapsed time (number of days) from the previous playlist generation processing. Further, the combination of the weight values w1 and w2 may be appropriately adjusted by the user himself.

ここで、上記の要求ずれ量及び認識ずれ量について、図１３Ａ及び図１３Ｂを参照して改めて説明する。図１３Ａは、要求ずれ量を説明する図である。図１３Ｂは、認識ずれ量を説明する図である。 Here, the required shift amount and the recognition shift amount will be described again with reference to FIGS. 13A and 13B. FIG. 13A is a diagram illustrating the required shift amount. FIG. 13B is a diagram illustrating the recognition shift amount.

要求ずれ量は、上記のように、推薦プレイリスト候補の想定している曲の正規分布と、検索キーワードにより特定されるプレイリストの想定している曲の正規分布とのずれ量である。プレイリストの想定している曲の正規分布は、プレイリストＤＢ１２４における想定分布である。 As described above, the required shift amount is a shift amount between the normal distribution of the music assumed as the recommended playlist candidate and the normal distribution of the music assumed in the playlist specified by the search keyword. The normal distribution of the music assumed by the playlist is the assumed distribution in the playlist DB 124.

例えば、図１３Ａの（ａ）に示すように、推薦プレイリスト候補が５個であり、イメージ特徴空間（Ｈ−Ｋ平面）における５個のプレイリストＬ_１，Ｌ_２，Ｌ_３，Ｌ_４，及びＬ_５の範囲３１１，３１２，３１３，３１４，及び３１５がそれぞれ円で表されるとする。また、絞り込み属性により絞り込まれた曲（２１１，２１２，２１３，・・・）のイメージ特徴空間における位置が、それぞれ図１３Ａの（ａ）に示した黒丸の位置で表されるとする。更に、５個のプレイリストＬ_１，Ｌ_２，Ｌ_３，Ｌ_４，及びＬ_５のうちプレイリストＬ_１が検索キーワードにより特定されたプレイリスト（要求プレイリスト）であったとする。この場合、要求ずれ量算出部１０５ｂは、プレイリストＬ_ｉ（ｉ＝１，２，３，４，５）のそれぞれに対し、プレイリストＬ_ｉの想定している正規分布Ｐ_ｉと、プレイリストＬ_１の想定している正規分布ＲとのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｒ）を算出する。 For example, as shown in (a) of FIG. 13A, there are five recommended playlist candidates, and five playlists L ₁ , L ₂ , L ₃ , L ₄ , and L ₄ in the image feature space (HK plane). and the _{L 5} ranges 311, 312, 313, and 315 are represented by circles, respectively. It is also assumed that the positions of the songs (211, 212, 213,...) Narrowed down by the narrowing down attribute in the image feature space are represented by the positions of the black circles shown in FIG. Furthermore, five play lists _{_{_{L 1, L 2, L 3}}} , L 4, and the play list _{L 1} of _{which L 5} and were playlist identified (requested playlist) by the search keyword. In this case, the required shift amount calculating unit 105b for each playlist _L i of (i = 1,2,3,4,5), a normal distribution _{P i} which assumes the playlist _{L i,} playlist The KL divergence D _KL (P _i || R) with the assumed normal distribution R of L ₁ is calculated.

図１３Ａの（ａ）に示した５個のプレイリストＬ_ｉ（_ｉ＝１，２，３，４，５）の想定している正規分布（想定分布）Ｐ_ｉは、例えば、図１３Ａの（ｂ）のように表すことができる。 The assumed normal distribution (assumed distribution) P _i of the five playlists L _i ( _i = 1, 2, 3, 4, 5) shown in (a) of FIG. b).

ＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｒ）は、上記のように２つの正規分布Ｐ_ｉ，Ｒの差異を測る尺度であり、Ｐ_ｉ＝Ｒの場合にＤ_ＫＬ（Ｐ_ｉ||Ｒ）＝０となる。また、ＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｒ）は、２つの正規分布Ｐ_ｉ，Ｒの差異が大きくなるほど大きな値となる。よって、図１３Ａの（ｂ）に示した例においては、要求プレイリストでもあるプレイリストＬ_１についてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_１||Ｒ）が０となり、プレイリストＬ_１から最も離れた位置にあるプレイリストＬ_５についてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_５||Ｒ）が最も大きな値となる。 The KL divergence D _KL (P _i || R) is a measure for measuring the difference between the two normal distributions P _i , R as described above. When P _i = R, D _KL (P _i || R) = It becomes 0. The KL divergence D _KL (P _i || R) increases as the difference between the two normal distributions P _i , R increases. Therefore, in the example shown in FIG. 13A (b), the KL divergence D _KL (P ₁ || R) for the playlist L ₁ , which is also the requested playlist, becomes 0, and the position farthest from the playlist L ₁ KL divergence D _KL (P ₅ || R) for the playlist L _{5 in} the first item has the largest value.

すなわち、要求ずれ量の小さいプレイリストに含まれる曲は、イメージ特徴空間において検索キーワードにより特定されるプレイリスト（要求プレイリスト）の範囲内或いはプレイリストの範囲に近い位置になる。そのため、推薦プレイリスト候補から推薦プレイリストを決定する際の推薦優先度は、要求ずれ量の小さいプレイリストほど推薦優先度が高くなるようにすることが好ましい。 In other words, a song included in a playlist with a small required deviation amount is located at a position within or close to the range of the playlist (requested playlist) specified by the search keyword in the image feature space. For this reason, it is preferable that the recommendation priority at the time of determining the recommended playlist from the recommended playlist candidates be such that the playlist with the smaller request deviation amount has the higher recommended priority.

ただし、図１３Ａの（ａ）に示した例では、プレイリストＬ_１の範囲３１１に含まれる曲は、プレイリストＬ_２の範囲３１２内にも含まれる曲のみであり、かつプレイリストＬ_１の範囲３１１内での偏りが大きい。そのため、要求ずれ量のみに基づいて推薦優先度を算出して推薦するプレイリストを決定した場合、利用者が要求したプレイリストのイメージと、決定したプレイリストに従って再生した曲のイメージとに齟齬が生じ、利用者に対し違和感を与える可能性がある。よって、本実施形態に係るプレイリスト生成処理では、要求ずれ量とともに認識ずれ量を算出し、これらを統合した推薦優先度を算出する。認識ずれ量は、上記のように、推薦プレイリスト候補における曲ＩＤに対応した曲のイメージ特徴量に基づく正規分布と、当該推薦プレイリスト候補の想定している正規分布とのずれ量を算出する。 However, in the example shown in (a) of FIG. 13A, the song to be included within the scope 311 of the play list _{L 1} is only tracks are also included within the scope 312 of the play list _{L 2,} and the play list _{L 1} The bias in the range 311 is large. Therefore, if a playlist to be recommended is determined by calculating the recommendation priority based on only the requested deviation amount, there is a discrepancy between the image of the playlist requested by the user and the image of the song played according to the determined playlist. This may cause the user to feel uncomfortable. Therefore, in the playlist generation processing according to the present embodiment, the recognition shift amount is calculated together with the request shift amount, and the recommendation priority obtained by integrating these is calculated. As described above, the recognition shift amount calculates the shift amount between the normal distribution based on the image feature amount of the song corresponding to the song ID in the recommended playlist candidate and the assumed normal distribution of the recommended playlist candidate. .

推薦プレイリスト候補における曲ＩＤに対応した曲のイメージ特徴量に基づく正規分布は、絞り込んだ曲のうちプレイリストの範囲内に含まれる曲に対するイメージ特徴量のみに基づいて算出した正規分布（平均及び分散）である。例えば、図１３Ａの（ａ）に示したプレイリストＬ_１の範囲３１１内には曲２１１、曲２１２、及び曲２１３の３曲が含まれている。そのため、ｉ＝１のプレイリスト（推薦プレイリスト候補）Ｌ_１における曲のイメージ特徴量に基づく正規分布Ｐ_１は、この３曲のイメージ特徴量に基づいて算出される。一方、ｉ＝１のプレイリストＬ_１の想定している正規分布Ｑ_１は、プレイリストＤＢ１２４に登録された想定分布である。よって、プレイリストＬ_１についての２つの正規分布Ｐ_１，Ｑ_１をイメージ特徴空間に表現すると、これら２つの正規分布Ｐ_１，Ｑ_１は、図１３Ｂの（ａ）に示したような関係で表される。 The normal distribution based on the image feature amount of the song corresponding to the song ID in the recommended playlist candidate is the normal distribution (average and average) calculated based only on the image feature amount of the songs included in the range of the playlist among the narrowed-down songs. (Dispersion). For example, it contains three pieces of music 211, music 212, and tune 213 to a playlist _{L 1} in the range 311 shown in (a) of FIG. 13A. Therefore, the normal distribution P ₁ based on the image feature amount of music in i = 1 playlist (recommended playlist candidate) L ₁ is calculated on the basis of the image feature amount of the three songs. On the other hand, the normal distribution _{Q 1} which assumes playlist _{L 1} of i = 1 is a registered envisaged distributed playlist DB 124. Therefore, when the two normal distributions P ₁ and Q ₁ for the playlist L ₁ are expressed in the image feature space, the two normal distributions P ₁ and Q ₁ have a relationship as shown in FIG. expressed.

同様に、他の４個のプレイリストＬ_ｉ（ｉ＝２，３，４，５）について、２つの正規分布Ｐ_ｉ，Ｑ_ｉをイメージ特徴空間に表現すると、それぞれ、図１３Ｂの（ｂ）から（ｅ）までに示したような関係で表される。 Similarly, when the two normal distributions P _i and Q _i are expressed in the image feature space for the other four playlists L _i (i = 2, 3, 4, 5), (b) in FIG. To (e).

各プレイリストＬ_ｉ（ｉ＝１，２，３，４，５）における２つの正規分布Ｐ_ｉ，Ｑ_ｉの関係（差異）は、ＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）により数値化することができる。２つの正規分布Ｐ_ｉ，Ｑ_ｉの差異が小さい場合、ＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）は０に近い小さな値となる。一方、２つの正規分布Ｐ_ｉ，Ｑ_ｉの差異が大きい場合、ＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）は大きい値となる。すなわち、図１３の（ａ）に示したプレイリストＬ_１についてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）は、他のプレイリストについてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_ｉ||Ｑ_ｉ）に比べて大きな値となる。そのため、プレイリストＬ_１の曲としてプレイリストＬ_１の範囲３１１に含まれる曲２１１，２１２，２１３を再生した場合、利用者は、再生された曲のイメージがプレイリストＬ_１のイメージと合わず違和感を抱く可能性が高い。 The relationship (difference) between the two normal distributions P _i and Q _i in each playlist L _i (i = 1, 2, 3, 4, 5) is quantified by KL divergence D _KL (P _i || Q _i ). can do. When the difference between the two normal distributions P _i and Q _i is small, the KL divergence D _KL (P _i || Q _i ) has a small value close to zero. On the other hand, when the difference between the two normal distributions P _i and Q _i is large, KL divergence D _KL (P _i || Q _i ) has a large value. That, KL divergence _{_{_{D KL (P i || Q i}}} ) for a playlist _{L 1} shown in FIG. 13 (a), the KL-divergence _{_{_{D KL (P i || Q i}}} ) for other playlist This is a large value. Therefore, if you play a song 211, 212 and 213 included in the scope 311 of the play list _{L 1} as songs in a playlist _{L 1,} the user, the image of the reproduced music does not match the image playlist _{L 1} It is highly likely that you will feel uncomfortable.

図１３Ｂの（ａ）に示したプレイリストＬ_１（要求プレイリスト）に含まれる３曲は、いずれも（ｂ）に示したプレイリストＬ_２に含まれる。しかも、図１３Ｂの（ｂ）に示したプレイリストＬ_２についてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_２||Ｑ_２）は、（ａ）のプレイリストＬ_１についてのＫＬダイバージェンスＤ_ＫＬ（Ｐ_１||Ｑ_１）よりも小さくなる。そのため、プレイリストＬ_１に含まれる曲２１１，２１２，２１３のみを再生する場合、プレイリストＬ_１の曲としてよりも、プレイリストＬ_２の曲として再生したほうが、曲のイメージに対する違和感が生じにくい。 3 songs included in the playlist _{L 1} (request playlist) shown in (a) of FIG. 13B it is all included in the playlist _{L 2} shown in (b). Moreover, the KL divergence D _KL (P ₂ || Q ₂ ) for the playlist L ₂ shown in (b) of FIG. 13B is the KL divergence D _KL (P ₁ || for the playlist L ₁ of (a). Q ₁ ). Therefore, if you want to play the only song 211, 212, and 213 that are included in the play list _{L 1,} rather than as a music play list _{L 1,} better to play as a song playlist _{L 2} it is less likely to occur discomfort for the song of the image .

よって、本実施形態に係るプレイリスト生成処理では、上記のように、要求ずれ量と認識ずれ量とを統合した推薦優先度を算出し、推薦優先度の最も高い推薦プレイリスト候補を推薦プレイリストとして利用者に提示する。また、本実施形態に係るプレイリスト生成処理では、上記のように、利用者に提示するプレイリストの名称を含むナレーション音声を作成して再生する。そのため、再生する曲のイメージをより適切に表現しており、かつ利用者の要求したイメージに近いプレイリストを利用者に提示することが可能となる。 Therefore, in the playlist generation processing according to the present embodiment, as described above, the recommendation priority obtained by integrating the request deviation amount and the recognition deviation amount is calculated, and the recommended playlist candidate having the highest recommendation priority is selected as the recommended playlist. To the user. Further, in the playlist generation processing according to the present embodiment, as described above, a narration sound including the name of the playlist to be presented to the user is created and reproduced. Therefore, it is possible to more appropriately express the image of the music to be reproduced and to present a playlist close to the image requested by the user to the user.

図１３Ａの（ａ）に示した例において利用者の要求するプレイリストＬ_１とは異なるプレイリストＬ_２を推薦プレイリストに決定した場合、プレイリスト生成装置１は、プレイリストＬ_２の名称を含むナレーション音声を作成して再生する。例えば、プレイリストＬ_１及びＬ_２の名称がそれぞれ「明るい」及び「涼やか」であり、利用者の要求したプレイリストの名称（検索キーワード）が「明るい」であったとする。この場合、プレイリスト生成装置１は、例えば「“明るい”のプレイリストは曲数不足のため作成できませんでした。そのため、“明るい”に近い“涼やか”のプレイリストを再生します。」というナレーション音声を再生した後、プレイリストの曲を再生する。したがって、利用者は、「涼やか」な曲が再生されると把握した上でプレイリストの曲を聞くことができ、再生された曲のイメージに対する違和感を抱きにくくなる。 If you decide to recommended playlist different playlist L ₂ is a playlist L ₁ to request the user in the example shown in (a) of FIG. 13A, the playlist generating apparatus 1, the name of the playlist L ₂ Create and play back narration audio. For example, the name of the play list L ₁ and L ₂ are each "bright" and "cool", it requested the name of the play list of users (search keyword) is assumed to be a "bright". In this case, the playlist creation device 1 narrates, for example, "The playlist of" bright "could not be created due to the lack of the number of songs. Therefore, the playlist of" cool "close to" bright "is reproduced." After playing the audio, play the songs in the playlist. Therefore, the user can listen to the music in the playlist after grasping that the “cool” music is played, and it is difficult for the user to feel uncomfortable with the image of the played music.

なお、利用者が要求したプレイリストの代わりに他のプレイリストを提示する場合、再生する曲は、提示するプレイリストの範囲内の全ての曲でもよいし、提示するプレイリストの範囲内の曲のうち利用者が要求したプレイリストに含まれる曲のみでもよい。その場合は、例えば、「“明るい”かつ“涼やか”のプレイリストを再生します。」というナレーション音声を作成して再生してもよい。また、提示するプレイリストの範囲内の曲もしくは利用者が要求したプレイリストに含まれる曲を、再生してもよい。その場合は、例えば、「“明るい”もしくは“涼やか”のプレイリストを再生します。」というナレーション音声を作成して再生してもよい。また、プレイリストの範囲は、図８に示した円に限らず、任意の空間形状でよい。 When presenting another playlist instead of the playlist requested by the user, the songs to be played may be all songs within the range of the presented playlist, or songs within the range of the presented playlist. Out of the songs included in the playlist requested by the user. In that case, for example, a narration sound “Play a playlist“ bright ”and“ cool ”” may be created and played. Also, songs within the range of the presented playlist or songs included in the playlist requested by the user may be reproduced. In this case, for example, a narration sound “Play a“ bright ”or“ cool ”playlist” may be created and played. Further, the range of the playlist is not limited to the circle shown in FIG. 8, but may be any space shape.

図１４は、プレイリストの範囲の別の例を示す図である。図１４には、図１３Ａの（ａ）と同様の二次元のイメージ特徴空間（Ｈ−Ｋ平面）を示している。 FIG. 14 is a diagram illustrating another example of the range of the playlist. FIG. 14 shows a two-dimensional image feature space (HK plane) similar to FIG. 13A (a).

図１４に示すように、二次元イメージ特徴空間における５個のプレイリストＬ_１，Ｌ_２，Ｌ_３，Ｌ_４，及びＬ_５の範囲３１１，３１２，３１３，３１４，及び３１５は、それぞれ長方形であってもよい。この場合、プレイリストＤＢ１２４における各プレイリストの範囲は、例えば、イメージ特徴空間において左上角部となる点の座標と、Ｈ軸方向の範囲と、Ｋ軸方向の範囲とにより指定する。図１４に示した例では、プレイリストＬ_１の範囲３１１は、左上角部の点の座標（ｈ１，ｋ１）と、Ｈ軸方向の範囲ΔＨ１と、Ｋ軸方向の範囲ΔＫ１とで指定される。 As shown in FIG. 14, the ranges 311, 312, 313, 314, and 315 of the five playlists L ₁ , L ₂ , L ₃ , L ₄ , and L ₅ in the two-dimensional image feature space are each rectangular. There may be. In this case, the range of each playlist in the playlist DB 124 is specified by, for example, the coordinates of a point at the upper left corner in the image feature space, the range in the H-axis direction, and the range in the K-axis direction. In the example shown in FIG. 14, the range 311 of the playlist _{L 1} is a point in the upper left corner coordinates (h1, k1), are specified in the H-axis direction of the range .DELTA.H1, the K axis range ΔK1 .

更に、イメージ特徴空間が三次元以上の空間である場合、各プレイリストの範囲は、球又は超球に限らず、直方体又は超直方体であってもよい。 Furthermore, when the image feature space is a space of three or more dimensions, the range of each playlist is not limited to a sphere or a hypersphere, but may be a rectangular parallelepiped or a hypercube.

また、認識ずれ量及び要求ずれ量を算出する際には、プレイリストについての正規分布に限らず、一様な確率密度分布を用いてもよい。 Further, when calculating the recognition deviation amount and the required deviation amount, not only the normal distribution of the play list but also a uniform probability density distribution may be used.

図１５は、プレイリストの範囲と確率密度分布との関係を説明する図である。図１５の（ａ）には、図１４と同様、二次元のイメージ特徴空間（Ｈ−Ｋ平面）におけるプレイリストの範囲が長方形である場合を示している。また図１５の（ｂ）には、図１５の（ａ）のプレイリストＬ_５の範囲３１５及びその周囲におけるＨ軸方向の確率密度分布ｐ（Ｈ）を示している。 FIG. 15 is a diagram illustrating the relationship between the range of the playlist and the probability density distribution. FIG. 15A shows a case where the range of the playlist in the two-dimensional image feature space (HK plane) is rectangular, as in FIG. Also in FIG. 15 (b) shows the probability density distribution p of H-axis direction (H) in the range 315 and around the playlist _{L 5} in (a) of FIG. 15.

図１５の（ａ）に示すように、二次元イメージ特徴空間におけるプレイリストＬ_１及びＬ_５の範囲３１１及び３１５は、それぞれ長方形でもよい。この場合、各プレイリストＬ_１及びＬ_５の想定している分布は、上記の正規分布に限らず、図１５の（ｂ）に示すようにプレイリストＬ_５の範囲３１５内で一様な確率密度分布ｐ（Ｈ）であってもよい。プレイリストＬ_５の想定している分布を確率密度分布ｐ（Ｈ）とする場合、ｈ５≦Ｈ≦ｈ５＋ΔＨ５の場合のみｐ（Ｈ）＝１／ＳＡ５とし、Ｈ＜ｈ５及びｈ５＋ΔＨ５＜Ｈの場合はｐ（Ｈ）＝０とする。ここで、ＳＡ５は、イメージ特徴空間においてプレイリストＬ_５の範囲３１５を表す長方形の面積である。 As shown in (a) of FIG. 15, the range 311 and 315 of the play list _{L 1} and _{L 5} in the two-dimensional image feature space may each rectangle. In this case, assumed to have the distribution of each play list L ₁ and L ₅ is not limited to the normal distribution of the above uniform probability in the range 315 of the playlist L ₅ as shown in FIG. 15 (b) The density distribution may be p (H). If the assumption to have distributed playlist _{L 5} and a probability density distribution p (H), and h5 ≦ H ≦ h5 + ΔH5 if only p (H) = 1 / SA5 , the case of H <h5 and h5 + ΔH5 <H is Let p (H) = 0. Here, SA5 is the area of a rectangle representing the range 315 of the playlist _{L 5} in the image feature space.

また、プレイリストの想定している分布を確率密度分布ｐ（Ｈ）とした場合、図１５の（ｂ）の下側のグラフのように曲を表す点２１４，２１５，及び２１６の位置の値のみが１／ＳＡ５となるグラフにより範囲内の曲の分布を表すことができる。そのため、グラフにおける１／ＳＡ５となる点の偏りの度合いに基づいて認識ずれ量を算出してもよい。 When the distribution assumed by the playlist is the probability density distribution p (H), the values of the positions of the points 214, 215, and 216 representing the music as shown in the lower graph of FIG. The distribution of music within the range can be represented by a graph in which only 1 / SA5 is set. Therefore, the recognition error amount may be calculated based on the degree of deviation of the point at 1 / SA5 in the graph.

［第２の実施形態］
図１６は、第２の実施形態に係るプレイリスト生成装置の機能的構成を示す図である。 [Second embodiment]
FIG. 16 is a diagram illustrating a functional configuration of a playlist generation device according to the second embodiment.

図１６に示すように、プレイリスト生成装置１は、要求プレイリスト入力部１０１と、キーワード／属性抽出部１０２と、曲絞込部１０３と、推薦プレイリスト候補生成部１０４と、優先度算出部１０５と、プレイリスト決定部１０６と、を備える。また、プレイリスト生成装置１は、ナレーション作成部１０７と、プレイリスト再生部１０８と、音声出力部１０９と、を備える。また、プレイリスト生成装置１は、キーワードＤＢ１２１と、絞込属性ＤＢ１２２と、曲ＤＢ１２３と、プレイリストＤＢ１２４と、を備える。 As shown in FIG. 16, the playlist generation device 1 includes a request playlist input unit 101, a keyword / attribute extraction unit 102, a music narrowing unit 103, a recommended playlist candidate generation unit 104, a priority calculation unit 105 and a play list determination unit 106. Further, the playlist generation device 1 includes a narration creation unit 107, a playlist reproduction unit 108, and an audio output unit 109. Further, the playlist generation device 1 includes a keyword DB 121, a narrow-down attribute DB 122, a song DB 123, and a playlist DB 124.

また、プレイリスト生成装置１は、スキップ操作入力部１１５と、適応処理部１１６と、を更に備える。すなわち、本実施形態のプレイリスト生成装置１は、第１の実施形態のプレイリスト生成装置１にスキップ操作入力部１１５と、適応処理部１１６とを付加した構成となっている。したがって、本実施形態のプレイリスト生成装置１は、第１の実施形態のプレイリスト生成装置１と同様、プレイリストを再生する再生装置を兼ねている。 Further, the playlist generation device 1 further includes a skip operation input unit 115 and an adaptive processing unit 116. That is, the playlist generation device 1 of the present embodiment has a configuration in which the skip operation input unit 115 and the adaptive processing unit 116 are added to the playlist generation device 1 of the first embodiment. Therefore, the playlist generation device 1 of the present embodiment also functions as a playback device that reproduces a playlist, like the playlist generation device 1 of the first embodiment.

本実施形態のプレイリスト生成装置１におけるキーワードＤＢ１２１、絞込属性ＤＢ１２２、曲ＤＢ１２３、及びプレイリストＤＢ１２４は、それぞれ、第１の実施形態で述べた情報が登録されたデータベースである。また、本実施形態のプレイリスト生成装置１における要求プレイリスト入力部１０１、及びキーワード／属性抽出部１０２は、それぞれ、第１の実施形態で述べた機能を有する。また、本実施形態のプレイリスト生成装置１における曲絞込部１０３、推薦プレイリスト候補生成部１０４、優先度算出部１０５、及びプレイリスト決定部１０６は、それぞれ、第１の実施形態で述べた機能を有する。また、本実施形態のプレイリスト生成装置１におけるナレーション作成部１０７、プレイリスト再生部１０８、及び音声出力部１０９は、それぞれ、第１の実施形態で述べた機能を有する。なお、本実施形態のプレイリスト生成装置１におけるプレイリスト再生部１０８は、プレイリストの再生中にスキップ操作入力部１１５に対する操作がなされると、曲をスキップして（言い換えると再生中の曲に対する再生処理を終了して）次の曲の再生処理を開始する。また、プレイリスト再生部１０８は、曲をスキップした場合に、再生中のプレイリスト及びスキップした曲の曲ＩＤを適応処理部１１６に出力する。 The keyword DB 121, the narrow-down attribute DB 122, the song DB 123, and the playlist DB 124 in the playlist generation device 1 of the present embodiment are databases in which the information described in the first embodiment is registered. Further, the required playlist input unit 101 and the keyword / attribute extraction unit 102 in the playlist generation device 1 of the present embodiment have the functions described in the first embodiment. Also, the song narrowing unit 103, the recommended playlist candidate generating unit 104, the priority calculating unit 105, and the playlist determining unit 106 in the playlist generating device 1 of the present embodiment have been described in the first embodiment. Has functions. Further, the narration creation unit 107, the playlist reproduction unit 108, and the audio output unit 109 in the playlist generation device 1 of the present embodiment have the functions described in the first embodiment, respectively. Note that, when an operation on the skip operation input unit 115 is performed during the reproduction of the playlist, the playlist reproduction unit 108 in the playlist generation device 1 of the present embodiment skips the music (in other words, the reproduction of the music being reproduced). The reproduction process of the next song is started (after the reproduction process is completed). In addition, when a song is skipped, the playlist playback unit 108 outputs the playlist being played back and the song ID of the skipped song to the adaptive processing unit 116.

本実施形態のプレイリスト生成装置１におけるスキップ操作入力部１１５は、例えば押し釦スイッチである。利用者がスキップ操作入力部１１５を操作すると、スキップ操作入力部１１５は、スキップ操作を指示する信号をプレイリスト再生部１０８に出力する。スキップ操作を指示する信号を受け取ると、プレイリスト再生部１０８は、曲をスキップして次の曲の再生処理を開始する。この際、プレイリスト再生部１０８は、再生中のプレイリストと、スキップした曲の曲ＩＤとを適応処理部１１６に出力する。 The skip operation input unit 115 in the playlist generation device 1 of the present embodiment is, for example, a push button switch. When the user operates the skip operation input unit 115, the skip operation input unit 115 outputs a signal instructing the skip operation to the playlist reproduction unit 108. When receiving the signal instructing the skip operation, the playlist reproduction unit 108 skips the music and starts the reproduction processing of the next music. At this time, the playlist playback unit 108 outputs the playlist being played back and the song ID of the skipped song to the adaptive processing unit 116.

また、本実施形態のプレイリスト生成装置１における適応処理部１１６は、プレイリスト再生部１０８から取得した再生中のプレイリストと、スキップした曲の曲ＩＤとに基づいてプレイリストの想定分布及び範囲を再学習する。 In addition, the adaptive processing unit 116 in the playlist generation device 1 according to the present embodiment performs the assumed distribution and range of the playlist based on the playlist being reproduced acquired from the playlist reproduction unit 108 and the song ID of the skipped song. Re-learn.

図１７は、適応処理部の構成例を示す図である。図１７に示すように、適応処理部１１６は、プレイリスト範囲修正部１１６ａと、プレイリストＤＢ再学習部１１６ｂと、を備える。また、適応処理部１１６は、修正データを蓄積したデータベース１１６ｃと、プレイリスト学習用データが登録されたデータベース１１６ｂと、を備える。 FIG. 17 is a diagram illustrating a configuration example of the adaptive processing unit. As shown in FIG. 17, the adaptive processing unit 116 includes a playlist range correction unit 116a and a playlist DB re-learning unit 116b. In addition, the adaptive processing unit 116 includes a database 116c in which correction data is stored, and a database 116b in which playlist learning data is registered.

プレイリスト範囲修正部１１６ａは、再生中のプレイリストと、スキップした曲の曲ＩＤとに基づいて、再生中のプレイリストにおける想定分布及び範囲を修正する。プレイリスト範囲修正部１１６ａは、スキップした曲の曲ＩＤを用いて、曲ＤＢ１２３からスキップした曲のイメージ特徴量を取得する。また、プレイリスト範囲修正部１１６ａは、曲ＤＢ１２３から取得した曲のイメージ特徴量がプレイリストの範囲外となるよう、再生中のプレイリストの想定分布及び範囲を修正する。プレイリスト範囲修正部１１６ａは、想定分布及び範囲を修正したプレイリストの修正データをデータベース１１６ｃに格納する。 The playlist range correction unit 116a corrects the assumed distribution and range in the currently reproduced playlist based on the currently reproduced playlist and the song ID of the skipped song. The playlist range modification unit 116a acquires the image feature amount of the skipped song from the song DB 123 using the song ID of the skipped song. Further, the playlist range correction unit 116a corrects the assumed distribution and range of the playlist being reproduced so that the image feature amount of the song acquired from the song DB 123 falls outside the range of the playlist. The playlist range modification unit 116a stores, in the database 116c, modification data of the playlist in which the assumed distribution and the range have been modified.

プレイリストＤＢ再学習部１１６ｂは、データベース１１６ｃの修正データと、データベース１１６ｄのプレイリスト学習用データとに基づいて、プレイリストの想定分布及び範囲を再学習する。プレイリスト学習用データは、プレイリストの想定分布及び範囲を学習する際に使用したデータである。プレイリストの想定分布及び範囲の学習は、プレイリストＤＢ１２４を作成する際に任意の学習方法を用いて行う。例えば、まず、評価者がプレイリストの名称及び曲に対する主観的なイメージに基づいて、イメージ特徴空間におけるプレイリスト及び曲の座標（イメージ特徴量）を指定した初期データを作成する。その後、初期データを学習用データとして、単語特徴量からイメージ特徴空間への変換関数、及び楽曲特徴量からイメージ特徴量への変換関数を、例えば、カーネル回帰で学習する。単語特徴量は、プレイリストの名称（検索キーワード）、すなわち、単語を、意味特徴量に変換する既知の変換技術を用いることで得られる。また、楽曲特徴量は、例えば、音声認識等で用いられる音響特徴量（例えばメル周波数ケプストラム係数（ＭＦＣＣ））を利用することができる。 The playlist DB re-learning unit 116b re-learns the assumed distribution and range of the playlist based on the correction data in the database 116c and the playlist learning data in the database 116d. The playlist learning data is data used when learning the assumed distribution and range of the playlist. The learning of the assumed distribution and range of the playlist is performed using an arbitrary learning method when creating the playlist DB 124. For example, first, the evaluator creates initial data specifying the playlist and the coordinates of the song (image feature amount) in the image feature space based on the name of the playlist and the subjective image of the song. Then, using the initial data as learning data, a conversion function from a word feature to an image feature space and a conversion function from a music feature to an image feature are learned by, for example, kernel regression. The word feature amount is obtained by using a known conversion technique for converting a playlist name (search keyword), that is, a word into a semantic feature amount. As the music feature amount, for example, an acoustic feature amount (for example, a mel frequency cepstrum coefficient (MFCC)) used in voice recognition or the like can be used.

プレイリストＤＢ再学習部１１６ｂは、プレイリストの想定分布及び範囲を再学習した後、プレイリストＤＢにおける再生中のプレイリストの想定分布及び範囲を再学習した想定分布及び範囲に更新する。 After re-learning the assumed distribution and range of the playlist, the playlist DB re-learning unit 116b updates the assumed distribution and range of the playlist being reproduced in the playlist DB to the re-learned assumed distribution and range.

本実施形態のプレイリスト生成装置１が行うプレイリスト生成処理は、第１の実施形態で説明した処理と同じである。すなわち、本実施形態のプレイリスト生成装置１は、プレイリスト生成処理として、図９Ａ、図９Ｂ、図１１、及び図１２に示したような処理を行う。 The playlist generation process performed by the playlist generation device 1 of the present embodiment is the same as the process described in the first embodiment. That is, the play list generation device 1 of the present embodiment performs the processing shown in FIGS. 9A, 9B, 11, and 12 as the play list generation processing.

また、本実施形態のプレイリスト生成装置１は、プレイリスト再生中、スキップ操作入力部１１５において利用者によるスキップ操作を受け付けると、適応処理部１１６においてプレイリストＤＢを更新する処理を行う。 Further, when the skip operation input unit 115 receives a skip operation by the user during the reproduction of the play list, the play list generation device 1 of the present embodiment performs a process of updating the play list DB in the adaptive processing unit 116.

図１８は、適応処理部が行う処理を説明するフローチャートである。
本実施形態のプレイリスト生成装置１におけるスキップ操作部１１５は、プレイリスト再生中に利用者によるスキップ操作を受け付けると、スキップ操作入力部１１５は、プレイリスト再生部１０８に対し再生中の曲をスキップさせる指示を出力する。再生中の曲をスキップさせる指示を受け取ったプレイリスト再生部１０８は、再生中の曲音声データの再生を終了し、次の曲音声データの再生を開始する。この際、プレイリスト再生部１０８は、スキップした曲の曲ＩＤと、再生中のプレイリストとを適応処理部１１６に出力する。 FIG. 18 is a flowchart illustrating a process performed by the adaptive processing unit.
When the skip operation unit 115 in the playlist generation device 1 of the present embodiment receives a skip operation by the user during the reproduction of the playlist, the skip operation input unit 115 causes the playlist reproduction unit 108 to skip the song being reproduced. Output instructions The playlist playback unit 108 that has received the instruction to skip the song being played ends the playback of the song audio data that is being played, and starts playing the next song audio data. At this time, the playlist playback unit 108 outputs the song ID of the skipped song and the playlist being played back to the adaptive processing unit 116.

このとき、適応処理部１１６では、図１８に示すように、スキップした曲の曲ＩＤと再生中のプレイリストとを受付（ステップＳ２１）、スキップした曲の曲ＩＤをキー情報としてスキップした曲のイメージ特徴量を取得する（ステップＳ２２）。ステップＳ２１及びＳ２２の処理は、プレイリスト範囲修正部１１６ａが行う。ステップＳ２２の処理において、プレイリスト範囲修正部１１６ａは、曲ＤＢ１２３を検索し、スキップした曲の曲ＩＤと対応付けられた曲のイメージ特徴量を取得する。 At this time, as shown in FIG. 18, the adaptive processing unit 116 receives the song ID of the skipped song and the playlist being reproduced (step S21), and uses the song ID of the skipped song as key information for the skipped song. An image feature amount is obtained (step S22). The processing of steps S21 and S22 is performed by the playlist range correction unit 116a. In the process of step S22, the playlist range correction unit 116a searches the song DB 123, and acquires the image feature amount of the song associated with the song ID of the skipped song.

スキップした曲のイメージ特徴量を取得すると、プレイリスト範囲修正部１１６ａは、次に、取得したイメージ特徴量に基づいて再生中のプレイリストの範囲を縮小する（ステップＳ２３）。ステップＳ２３の処理において、プレイリスト範囲修正部１１６ａは、スキップした曲のイメージ特徴量がプレイリストの範囲外となるよう、プレイリストの範囲を縮小する。 When the image feature amount of the skipped song is acquired, the playlist range modification unit 116a then reduces the range of the playlist being reproduced based on the acquired image feature amount (step S23). In the process of step S23, the playlist range modification unit 116a reduces the range of the playlist so that the image feature amount of the skipped song is outside the range of the playlist.

プレイリストの範囲の縮小を終えると、プレイリスト範囲修正部１１６ａは、次に、プレイリストの範囲を調整するか否かを判定する（ステップＳ２４）。ステップＳ２４の判定では、例えば、プレイリストの範囲の縮小量に基づいて、プレイリストの範囲を拡大調整するか否かを判定する。 After reducing the playlist range, the playlist range modification unit 116a next determines whether to adjust the playlist range (step S24). In the determination of step S24, for example, it is determined whether or not to expand and adjust the range of the playlist based on the reduction amount of the range of the playlist.

プレイリストの範囲を調整する場合（ステップＳ２４；Ｙｅｓ）、プレイリスト範囲修正部１１６ａは、スキップした曲の特徴量が含まれない方向にプレイリストの範囲を拡大する（ステップＳ２５）。その後、プレイリスト範囲修正部１１６ａは、範囲を変更したプレイリストの変更後の範囲を修正データのデータベース１１６ｃに登録する（ステップＳ２６）。一方、プレイリストの範囲を調整しない場合（ステップＳ２４；Ｎｏ）、プレイリスト範囲修正部１１６ａは、ステップＳ２５の処理を行わずに、範囲を変更したプレイリストの変更後の範囲を修正データのデータベース１１６ｃに登録する（ステップＳ２６）。すなわち、プレイリストの範囲を調整する場合（ステップＳ２４；Ｙｅｓ）、プレイリスト範囲修正部１１６ａは、ステップＳ２３で縮小するとともにステップＳ２５で拡大調整したプレイリストの範囲を修正データのデータベース１１６ｃに登録する。一方、プレイリストの範囲を調整しない場合（ステップＳ２４；Ｎｏ）、プレイリスト範囲修正部１１６ａは、ステップＳ２３で縮小したプレイリストの範囲を修正データのデータベース１１６ｃに登録する。 When adjusting the range of the playlist (Step S24; Yes), the playlist range correcting unit 116a expands the range of the playlist in a direction not including the feature amount of the skipped song (Step S25). Thereafter, the playlist range correction unit 116a registers the changed range of the playlist whose range has been changed in the correction data database 116c (step S26). On the other hand, when the range of the playlist is not adjusted (step S24; No), the playlist range correction unit 116a does not perform the process of step S25, and stores the changed range of the changed playlist in the database of the correction data. 116c (step S26). That is, when adjusting the range of the playlist (Step S24; Yes), the playlist range correction unit 116a registers the range of the playlist reduced and enlarged in Step S23 in the correction data database 116c in Step S23. . On the other hand, when the range of the playlist is not adjusted (Step S24; No), the playlist range correction unit 116a registers the range of the playlist reduced in Step S23 in the correction data database 116c.

修正したプレイリストの範囲を修正データのデータベース１１６ｃに登録した後、適応処理部１１６では、修正データ及びプレイリスト学習用データに基づいて、プレイリストの想定分布及び範囲を再学習する（ステップＳ２７）。ステップＳ２７の処理は、プレイリストＤＢ再学習部１１６ｂが行う。プレイリストＤＢ再学習部１１６ｂは、例えば、修正データに登録された修正後のプレイリストの範囲をプレイリスト学習用データに含め、それらのプレイリストの範囲に対するカーネル回帰でプレイリストの範囲を再学習する。また、プレイリストＤＢ再学習部１１６ｂは、例えば、再学習後のプレイリストの範囲と各曲のイメージ特徴量とに基づいて、プレイリストの想定分布を再学習する。 After registering the corrected playlist range in the correction data database 116c, the adaptive processing unit 116 re-learns the assumed distribution and range of the playlist based on the correction data and the playlist learning data (step S27). . The process of step S27 is performed by the playlist DB re-learning unit 116b. The playlist DB re-learning unit 116b includes, for example, the playlist range after the modification registered in the modification data in the playlist learning data, and re-learns the range of the playlist by kernel regression for the playlist range. I do. The playlist DB re-learning unit 116b re-learns the assumed distribution of the playlist based on, for example, the range of the playlist after the re-learning and the image feature amount of each song.

ステップＳ２７におけるプレイリストの想定分布及び範囲の再学習を終えると、プレイリストＤＢ再学習部１１６ｂは、プレイリストＤＢ１２４の該当プレイリストの想定分布及び範囲を再学習により得られたプレイリストの想定分布及び範囲に更新する（ステップＳ２８）。 After completing the re-learning of the assumed distribution and range of the playlist in step S27, the playlist DB re-learning unit 116b sets the assumed distribution and range of the corresponding playlist in the playlist DB 124 by assuming the playlist obtained by re-learning. The distribution and the range are updated (step S28).

このように、本実施形態に係るプレイリスト生成装置１では、プレイリスト再生中に曲をスキップする操作が行われると、スキップした曲のイメージ特徴量がプレイリストの範囲外となるようプレイリストの範囲を修正する。そのため、次回以降、同じプレイリストを再生する際には、過去にスキップした曲がプレイリストから除外される。したがって、利用者にとってプレイリストのイメージに合わない曲をプレイリストから除外することができ、再生するプレイリストのイメージを利用者のイメージにより近づけることが可能となる。 As described above, in the playlist generation device 1 according to the present embodiment, when the operation of skipping a song is performed during the reproduction of the playlist, the playlist is generated such that the image feature amount of the skipped song is out of the range of the playlist. Modify the range. Therefore, when the same playlist is reproduced from the next time onward, songs skipped in the past are excluded from the playlist. Therefore, a song that does not match the image of the playlist for the user can be excluded from the playlist, and the image of the playlist to be reproduced can be closer to the image of the user.

図１９は、プレイリストの範囲の修正方法の例を説明する図である。図１９の（ａ）及び（ｂ）には、それぞれ、二次元のイメージ特徴空間（Ｈ−Ｋ平面）におけるプレイリストの範囲と曲のイメージ特徴量を表す黒丸とを示している。 FIG. 19 is a diagram illustrating an example of a method of correcting the range of a playlist. FIGS. 19A and 19B show a range of a playlist in a two-dimensional image feature space (HK plane) and a black circle representing an image feature amount of a song, respectively.

プレイリスト生成装置１において第１の実施形態で説明したプレイリスト生成処理を行う際に、例えば、図１９の（ａ）に示すように、推薦プレイリスト候補が５個であり、イメージ特徴空間（Ｈ−Ｋ平面）における５個のプレイリストＬ_１，Ｌ_２，Ｌ_３，Ｌ_４，及びＬ_５の範囲３１１，３１２，３１３，３１４，及び３１５がそれぞれ長方形で表されるとする。また、プレイリスト決定部１０６においてプレイリストＬ_２が推薦プレイリストに決定されたとする。このとき、プレイリスト再生部１０８は、プレイリストＬ_２の範囲３１２内に含まれる５曲の曲２１１，２１２，２１３，２１７，２１８を順次再生する。 When the playlist generation device 1 performs the playlist generation processing described in the first embodiment, for example, as shown in FIG. 19A, there are five recommended playlist candidates and the image feature space ( It is assumed that the ranges 311, 312, 313, 314, and 315 of the five playlists L ₁ , L ₂ , L ₃ , L ₄ , and L _{5 on} the (HK plane) are each represented by a rectangle. Moreover, the play list L ₂ is determined to be the recommended playlist in the playlist decision unit 106. In this case, the play list reproduction unit 108 sequentially reproduces the music 211,212,213,217,218 five songs contained within 312 of the playlist _{L 2.}

ここで、例えば、プレイリストＬ_２に含まれる曲２１７の再生中に利用者がスキップ操作を行ったとする。この場合、適応処理部１１６は、図１９の（ｂ）に示すように、スキップした曲２１７がプレイリストＬ２の範囲外となるよう、プレイリストＬ_２の範囲を縮小する。図１９の（ａ）に示したように、プレイリストＬ_２の範囲３１２内における最も右側、すなわちイメージ特徴量におけるＨの値が最も大きい曲２１７をプレイリストＬ_２の範囲外にする場合、曲２１７の右方に位置する辺を−Ｈ方向に移動させる。これにより、他の４曲２１１，２１２，２１３，及び２１８のイメージ特徴量がプレイリストＬ_２の範囲３１２内に含まれた状態のまま、曲２１７のみをプレイリストＬ_２の範囲外にすることができる。この際、−Ｈ方向に移動させる辺の移動量は、例えば、移動後の辺と曲２１７のイメージ特徴量との距離ｄが所定の距離になる移動量とする。 Here, for example, the user during playback of the song 217 is included in the play list L ₂ has performed the skip operation. In this case, adaptive processing unit 116, as shown in (b) of FIG. 19, as the song 217 skipping is outside the range of the play list L2, to reduce the range of the play list L _2. As shown in (a) of FIG. 19, the rightmost in the range 312 of the playlist L _2, that is, when the song 217 the value of H is the largest in the image feature amount outside the range of the play list L _2, songs 217 is moved in the −H direction. Thereby, the image feature amounts of the other four songs 211, 212, and 218 remain in the state they were contained within 312 playlist _{L 2,} to outside only songs 217 playlist _{L 2} Can be. At this time, the movement amount of the side to be moved in the −H direction is, for example, a movement amount at which the distance d between the moved side and the image feature amount of the music piece 217 becomes a predetermined distance.

なお、曲をスキップする毎にプレイリストの範囲を縮小すると、やがてプレイリストの範囲が０となり無効なプレイリストになってしまう可能性がある。そのため、プレイリストの範囲を縮小する場合、ステップＳ２５の処理のように、縮小した分、他の方向に範囲を拡大してプレイリストの範囲を調整してもよい。 Note that if the range of the playlist is reduced every time a song is skipped, the range of the playlist may eventually become 0 and become an invalid playlist. Therefore, when the range of the playlist is reduced, the range of the playlist may be adjusted by expanding the range in another direction by the reduced amount, as in the process of step S25.

図２０は、プレイリストの範囲の拡大方法の例を説明する図である。図２０の（ａ）、（ｂ）、及び（ｃ）には、それぞれ、二次元のイメージ特徴空間（Ｈ−Ｋ平面）におけるプレイリストＬ_２の範囲３１２と曲のイメージ特徴量を表す黒丸とを示している。また、図２０の（ａ）、（ｂ）、及び（ｃ）には、それぞれ、図１９に示したように曲２１７をスキップしたことにより範囲３１２を縮小した場合の縮小後の範囲３１２の拡大方法の例を示している。 FIG. 20 is a diagram illustrating an example of a method of expanding the range of a playlist. FIGS. 20A, 20B, and 20C respectively show a range 312 of the playlist L2 in the _two -dimensional image feature space (HK plane) and a black circle representing the image feature amount of the song. Is shown. Also, FIGS. 20A, 20B, and 20C respectively show the enlargement of the reduced range 312 when the range 312 is reduced by skipping the music 217 as shown in FIG. 4 shows an example of a method.

例えば、プレイリストＬ_２の範囲３１２における右辺を−Ｈ方向に距離ｈｄだけ移動させて範囲３１２を縮小した場合、範囲３１２を拡大する方法の１つとして、図２０の（ａ）に示すように、範囲３１２の左辺を−Ｈ方向に距離ｈｄだけ移動させる方法が挙げられる。すなわち、図２０の（ａ）に示した例では、プレイリストＬ_２の範囲３１２を−Ｈ方向に距離ｈｄだけ平行移動させたことになり、イメージ特徴空間における範囲３１２の面積を変えずに曲２１７が範囲外となるようにしている。 For example, if obtained by reducing the range 312 to move the right side in the -H direction distance hd in the range 312 of the playlist _{L 2,} as a way to expand the scope 312, as shown in (a) of FIG. 20 , The left side of the range 312 is moved by the distance hd in the −H direction. That is, in the example shown in (a) of FIG. 20, will be the range 312 of the playlist L ₂ is only moved parallel distance hd with -H direction, music without changing the area in the range 312 in the image feature space 217 is outside the range.

また、プレイリストＬ_２の範囲３１２における右辺を−Ｈ方向に距離ｈｄだけ移動させて範囲３１２を縮小した場合、図２０の（ｂ）に示すように、範囲３１２の下辺を−Ｋ方向に距離ｈｅだけ移動させて範囲３１２を拡大してもよい。なお、図示は省略するが、範囲３１２の下辺を−Ｋ方向に移動させる代わりに、上辺を＋Ｋ方向に移動させて範囲３１２を拡大してもよい。更に、図示は省略するが、範囲３１２の下辺をーＫ方向に移動させるとともに、上辺を＋Ｋ方向に移動させて範囲３１２を拡大させてもよい。 Also, if obtained by reducing the range 312 to move the right side in the -H direction distance hd in the range 312 of the playlist _{L 2,} as shown in (b) of FIG. 20, the distance the lower side of the range 312 -K direction The range 312 may be enlarged by moving the area by he. Although not shown, instead of moving the lower side of the range 312 in the −K direction, the range 312 may be enlarged by moving the upper side in the + K direction. Further, although not shown, the range 312 may be enlarged by moving the lower side of the range 312 in the −K direction and moving the upper side in the + K direction.

また、プレイリストＬ_２の範囲３１２における右辺を−Ｈ方向に距離ｈｄだけ移動させて範囲３１２を縮小した場合、図２０の（ｃ）に示すように、範囲３１２の左辺を−Ｈ方向に移動させるとともに下辺を−Ｋ方向に移動させて範囲３１２を拡大してもよい。なお、図示は省略するが、範囲３１２の左辺、上辺、及び下辺を、それぞれ範囲３１２が拡大する方向に移動させてもよい。 The movement, if obtained by reducing the range 312 to move the right side in the -H direction distance hd in the range 312 of the playlist _{L 2,} as shown in (c) of FIG. 20, the left side of the range 312 -H direction Alternatively, the range 312 may be enlarged by moving the lower side in the −K direction. Although not shown, the left side, the upper side, and the lower side of the range 312 may be moved in the direction in which the range 312 expands.

図２１は、プレイリストの範囲の拡大することによる効果の１つを説明する図である。図２１の（ａ）及び（ｂ）には、それぞれ、二次元のイメージ特徴空間（Ｈ−Ｋ平面）におけるプレイリストＬ_２の範囲３１２と曲のイメージ特徴量を表す黒丸とを示している。また、図２１の（ａ）には、図１９に示したように曲２１７をスキップしたことにより範囲３１２を縮小した場合の縮小後の範囲３１２を示している。 FIG. 21 is a diagram illustrating one of the effects obtained by expanding the range of the playlist. In (a) and (b) in FIG. 21 respectively show a black circle representing an image feature amount in the range 312 and songs in a playlist L ₂ in the two-dimensional image feature space (H-K plane). FIG. 21A shows a reduced range 312 when the range 312 is reduced by skipping the music piece 217 as shown in FIG.

プレイリストＬ_２の範囲３１２に含まれていた曲２１７を利用者がスキップする理由の１つとして、利用者にとって曲２１７のイメージがプレイリストＬ_２のイメージと合致していないことが挙げられる。この場合、曲２１７のイメージとプレイリストＬ_２のイメージとの差異（齟齬）の原因としては、例えば、プレイリストＬ_２の範囲３１２が広すぎることや、範囲３１２の中心がずれていること等が挙げられる。範囲３１２の中心がずれている場合、例えば、範囲３１２外でありかつ範囲３１２の近傍に位置する曲２１９のイメージがプレイリストＬ_２のイメージと合致することも考えられる。縮小後の範囲３１２を調整（拡大修正）すれば、図２１の（ｂ）に示すように、プレイリストＬ_２の範囲外に存在するプレイリストＬ_２のイメージに近い曲２１９をプレイリストＬ_２の範囲３１２に含めることができる。そのため、プレイリスト生成時に当該プレイリストＬ_２のイメージに合致した曲をより多く利用者に提示することが可能となる。 One reason to skip user songs 217 contained in the range 312 of the playlist L _2, the image of the song 217 and the like that does not match the playlist L ₂ image for the user. In this case, the cause of the difference between the image image playlists L ₂ songs 217 (conflict), for example, range 312 of the playlist L ₂ is too wide and, like the center of the range 312 is displaced Is mentioned. If the center of the range 312 is shifted, for example, an image of the song 219 located in the vicinity of the range 312 outside a is and the range 312 it is also conceivable to match the image playlist L _2. By adjusting the range 312 after reduction (enlargement modifications), as shown in (b) of FIG. 21, the playlist songs 219 close to the image of the playlist L ₂ existing outside the range of the play list L ₂ L ₂ Range 312. Therefore, it is possible to provide a song that matches the of the play list L ₂ during playlist generation image to more users.

また、範囲３１２を拡大修正することによりプレイリストＬ_２に含まれることとなった曲２１９の再生中に利用者によるスキップ操作が行われれば、プレイリストＬ_２の範囲３１２は曲２１９が含まれない範囲に再学習（再修正）される。そのため、プレイリストＬ_２の範囲３１２をプレイリストＬ_２のイメージに合ったより適切な範囲にすることが可能となる。 Further, if the skip operation by the user during reproduction of music 219 was to be included in the playlist L ₂ is performed by expanding modify the range 312, range 312 of the playlist L ₂ is included songs 219 It is re-learned (re-corrected) to a range that does not exist. Therefore, it is possible to a proper range more suited to the range 312 of the playlist L ₂ to playlist L ₂ images.

このように、本実施形態に係るプレイリスト生成処理では、プレイリスト再生中に利用者が曲をスキップする操作を行った場合、スキップした曲の曲特徴量に基づいて再生中のプレイリストの範囲を修正し、想定分布及び範囲を再学習する。そのため、各プレイリストの想定分布及び範囲を利用者の抱くイメージに適応的に近づけることができ、プレイリストのイメージと再生された曲のイメージとの齟齬により利用者に与える違和感をより低減することが可能となる。 As described above, in the playlist generation processing according to the present embodiment, when the user performs an operation of skipping a song during the playlist playback, the range of the playlist being played based on the song feature amount of the skipped song is Is corrected, and the assumed distribution and range are re-learned. Therefore, the assumed distribution and range of each playlist can be adaptively approximated to the image held by the user, and the discomfort given to the user due to the discrepancy between the image of the playlist and the image of the reproduced music is further reduced. Becomes possible.

［第３の実施形態］
図２２は、第３の実施形態に係るプレイリスト再生システムの構成を示す図である。 [Third Embodiment]
FIG. 22 is a diagram illustrating a configuration of a playlist playback system according to the third embodiment.

図２２に示すように、本実施形態に係るプレイリスト再生システム４は、音楽再生装置５と、サーバ６とを備える。音楽生成装置５とサーバ６とは、インターネット等の通信ネットワーク７を介して通信可能に接続される。 As shown in FIG. 22, the playlist playback system 4 according to the present embodiment includes a music playback device 5 and a server 6. The music generation device 5 and the server 6 are communicably connected via a communication network 7 such as the Internet.

音楽再生装置５は、サーバ６から音楽データ（曲音声データ）を取得して再生する装置である。音楽再生装置５は、音声入力部（マイク）５０１と、音声出力部５０２と、信号処理部５０３と、通信部５０４と、を備える。 The music playback device 5 is a device that acquires music data (song audio data) from the server 6 and plays back the music data. The music reproducing device 5 includes an audio input unit (microphone) 501, an audio output unit 502, a signal processing unit 503, and a communication unit 504.

音声入力部５０１は、音楽再生装置５の周囲を伝播する音声を収音し音声信号として入力する。音声出力部５０２は、例えばスピーカであり、音楽（曲音声）を含む音声信号を音声として音楽再生装置５の外部の空間に向けて出力する。なお、音声出力部５０２は、スピーカに限らず、例えばイヤフォンやアンプ等の音響機器を接続する出力端子であってもよい。 The sound input unit 501 collects sound propagating around the music reproducing device 5 and inputs the sound as a sound signal. The audio output unit 502 is, for example, a speaker, and outputs an audio signal including music (song audio) as audio to a space outside the music playback device 5. Note that the audio output unit 502 is not limited to a speaker, and may be an output terminal for connecting an audio device such as an earphone or an amplifier.

信号処理部５０３は、音声入力部５０１から入力された音声信号の変換処理、及びサーバから取得した曲音声データの再生処理等を行う。通信部５０４は、サーバ６が備える図示しない通信部との間でのデータ通信等を行う。 The signal processing unit 503 performs a conversion process of the audio signal input from the audio input unit 501, a reproduction process of the music audio data acquired from the server, and the like. The communication unit 504 performs data communication with a communication unit (not shown) of the server 6.

図２３は、第３の実施形態に係るプレイリスト再生システムにおけるサーバの機能的構成を示す図である。 FIG. 23 is a diagram illustrating a functional configuration of a server in the playlist playback system according to the third embodiment.

図２３に示すように、本実施形態のプレイリスト再生システム４におけるサーバ６は、通信部６０１と、音声認識部６０２と、キーワード／属性抽出部６０３と、曲絞込部６０４と、を備える。また、サーバ６は、推薦プレイリスト候補生成部６０５と、優先度算出部６０６と、プレイリスト決定部６０７と、ナレーション作成部６０８と、プレイリスト再生部６０９と、を備える。また、サーバ６は、キーワードＤＢ６２１と、絞込属性ＤＢ６２２と、曲ＤＢ６２３と、プレイリストＤＢ６２４と、を備える。 As shown in FIG. 23, the server 6 in the playlist playback system 4 of the present embodiment includes a communication unit 601, a voice recognition unit 602, a keyword / attribute extraction unit 603, and a music narrowing unit 604. In addition, the server 6 includes a recommended playlist candidate generation unit 605, a priority calculation unit 606, a playlist determination unit 607, a narration creation unit 608, and a playlist reproduction unit 609. Further, the server 6 includes a keyword DB 621, a narrowing down attribute DB 622, a music DB 623, and a playlist DB 624.

通信部６０１は、音楽再生装置５の通信部５０４との間でのデータ通信等を行う。通信部６０１は、音楽再生装置５の音声入力部５０１が収音した音声信号を受信する。また、通信部６０１は、プレイリスト再生部６０９が再生した音声データを音楽再生装置５に送信する。 The communication unit 601 performs data communication with the communication unit 504 of the music playback device 5. The communication unit 601 receives the audio signal collected by the audio input unit 501 of the music playback device 5. In addition, the communication unit 601 transmits the audio data reproduced by the play list reproduction unit 609 to the music reproduction device 5.

音声認識部６０２は、通信部６０１を介して入力された音楽再生装置５からの音声信号を解析し、音声信号に含まれる有意な音声をテキストに変換する。 The voice recognition unit 602 analyzes a voice signal from the music playback device 5 input via the communication unit 601 and converts significant voice included in the voice signal into text.

キーワード／属性抽出部６０３は、音声認識部６０２で得たテキストに基づいて、キーワードＤＢ６２１から検索キーワードを抽出するとともに、絞込属性ＤＢ６２２から曲の絞り込み属性を抽出する。キーワードＤＢ６２１は、第１の実施形態で説明したキーワードＤＢ１２１と同様、テキストと検索キーワードとの対応関係が複数登録されたデータベースである。検索キーワードは、プレイリストの検索に用いるキーワードである。絞込属性ＤＢ６２２は、第１の実施形態で説明した絞込属性ＤＢ１２２と同様、テキストと曲の絞り込み属性との対応関係が複数登録されたデータベースである。曲の絞り込み属性は、プレイリストに登録する曲の絞り込みに用いる属性情報である。 The keyword / attribute extraction unit 603 extracts a search keyword from the keyword DB 621 based on the text obtained by the speech recognition unit 602, and extracts a song narrowing attribute from the narrowing attribute DB 622. The keyword DB 621 is a database in which a plurality of correspondences between texts and search keywords are registered, similarly to the keyword DB 121 described in the first embodiment. The search keyword is a keyword used for searching for a playlist. The narrow-down attribute DB 622 is a database in which a plurality of correspondences between texts and narrow-down attributes of songs are registered, similarly to the narrow-down attribute DB 122 described in the first embodiment. The song narrowing down attribute is attribute information used to narrow down songs to be registered in the playlist.

曲絞込部６０４は、抽出した絞り込み属性に基づいて、プレイリストへの登録対象の候補となる曲を、曲ＤＢ６２３に登録された全ての曲から絞り込み属性に合致する属性を持つ曲に絞り込む。曲ＤＢ６２３は、第１の実施形態で説明した曲ＤＢ１２３と同様、曲音声データと、曲属性と、曲のイメージ特徴量との対応関係が複数登録されたデータベースである。曲属性は、曲音声データの絞り込みに用いる属性情報である。曲のイメージ特徴量は、曲のイメージを数値化した情報である。 The song narrowing unit 604 narrows down the candidate songs to be registered in the playlist to the songs having attributes matching the narrowing attribute from all the songs registered in the song DB 623 based on the extracted narrowing attributes. . The music DB 623 is a database in which a plurality of correspondences between music audio data, music attributes, and image feature amounts of music are registered, similarly to the music DB 123 described in the first embodiment. The song attribute is attribute information used to narrow down song audio data. The music image feature amount is information obtained by digitizing the music image.

推薦プレイリスト候補作成部６０５は、プレイリストＤＢ６２４から曲絞込部６０４で絞り込んだ曲を含むプレイリストを抽出して推薦プレイリスト候補を作成する。推薦プレイリスト候補は、音楽再生装置５の利用者（オペレータ）に推薦する（提示する）プレイリストの候補である。推薦プレイリスト候補は、プレイリストの名称と、曲ＩＤと、想定分布とを含む。プレイリストＤＢ６２４は、第１の実施形態で説明したプレイリストＤＢ１２４と同様、プレイリスト毎にプレイリストの名称と、想定分布と、プレイリストの範囲とを対応付けて登録したデータベースである。プレイリストの想定分布は、曲のイメージ特徴量に基づいて曲の分布を表す際のイメージ特徴空間におけるプレイリストの平均（中心）及び分散である。また、プレイリストの範囲は、イメージ特徴空間におけるプレイリストへの登録対象となる曲の存在する範囲である。 The recommended playlist candidate creation unit 605 extracts a playlist including the songs narrowed down by the song narrowing unit 604 from the playlist DB 624 and creates a recommended playlist candidate. The recommended playlist candidates are playlist candidates that are recommended (presented) to the user (operator) of the music playback device 5. The recommended playlist candidate includes a playlist name, a song ID, and an assumed distribution. Like the playlist DB 124 described in the first embodiment, the playlist DB 624 is a database in which playlist names, assumed distributions, and playlist ranges are registered in association with each playlist. The assumed distribution of the playlist is the average (center) and variance of the playlist in the image feature space when expressing the distribution of the song based on the image feature amount of the song. The range of the playlist is a range in which the music to be registered in the playlist in the image feature space exists.

優先度算出部６０６は、推薦プレイリスト候補作成部６０５が複数の推薦プレイリスト候補を作成した場合の各推薦プレイリスト候補の優先度（推薦優先度）を算出する。優先度算出部６０６は、推薦プレイリスト候補の想定分布及び範囲、並びに検索キーワードに基づいて、各推薦プレイリスト候補の優先度を算出する。 The priority calculating unit 606 calculates the priority (recommended priority) of each recommended playlist candidate when the recommended playlist candidate creating unit 605 creates a plurality of recommended playlist candidates. The priority calculation unit 606 calculates the priority of each recommended playlist candidate based on the assumed distribution and range of the recommended playlist candidate and the search keyword.

プレイリスト決定部６０７は、優先度算出部６０６が算出した推薦優先度に基づいて、複数の推薦プレイリスト候補のなかから採用するプレイリストを決定する。 The playlist determination unit 607 determines a playlist to be adopted from among a plurality of recommended playlist candidates based on the recommendation priority calculated by the priority calculation unit 606.

ナレーション作成部６０８は、プレイリスト決定部６０７が決定したプレイリストの名称を含むナレーション音声データを作成する。 The narration creation unit 608 creates narration audio data including the name of the playlist determined by the playlist determination unit 607.

プレイリスト再生部６０９は、ナレーション作成部６０８が作成したナレーション音声を再生するとともに、プレイリスト決定部６０７が決定したプレイリストに含まれる曲音声データを曲ＤＢ６２３から読み出して再生する。 The playlist playback unit 609 plays back the narration voice created by the narration creation unit 608, and reads and plays back song audio data included in the playlist determined by the playlist determination unit 607 from the song DB 623.

このように、本実施形態のプレイリスト再生システム４におけるサーバ６は、第１の実施形態に係るプレイリスト再生装置１における要求プレイリスト入力部１０１及び音声出力部１０９を通信部６０１及び音声認識部１０２に置き換えた構成となっている。すなわち、本実施形態のプレイリスト再生システム４におけるサーバ６は、プレイリスト生成装置として機能する。サーバ６は、音楽再生装置５からのプレイリストの再生を要求する音声信号を受信すると、図９Ａ、図９Ｂ、図１１、及び図１２に示したようなプレイリスト生成処理を行う。その際、プレイリスト再生部６０９で再生したナレーション音声の音声データや曲音声データは、通信部６０１を介して音楽再生装置５に送信される。音楽再生装置５は、信号処理部５０３においてサーバ６から取得したナレーション音声の音声データ及び曲音声データに対する所定の処理を行った後、音声出力部５０２から音声及び曲音声を出力する。 As described above, the server 6 in the playlist playback system 4 according to the present embodiment includes the request playlist input unit 101 and the audio output unit 109 in the playlist playback device 1 according to the first embodiment as the communication unit 601 and the audio recognition unit. 102. That is, the server 6 in the playlist reproduction system 4 of the present embodiment functions as a playlist generation device. When receiving the audio signal requesting the reproduction of the playlist from the music reproducing device 5, the server 6 performs a playlist generation process as shown in FIGS. 9A, 9B, 11, and 12. At this time, the voice data and the tune voice data of the narration voice reproduced by the playlist reproduction unit 609 are transmitted to the music reproduction device 5 via the communication unit 601. The music reproducing device 5 performs predetermined processing on the voice data of the narration voice and the music voice data acquired from the server 6 in the signal processing unit 503, and then outputs the voice and the music voice from the voice output unit 502.

本実施形態のプレイリスト再生システム４では、図９Ａ、図９Ｂ、図１１、及び図１２に示したようなプレイリスト生成処理はサーバ６が行う。そのため、プレイリスト再生システム４においては、例えばカーナビゲーションシステム等の、通信可能であり、かつ音声信号を再生可能な情報端末を音楽再生装置５として用いることができる。また、本実施形態に係るプレイリスト再生システムのように曲ＤＢ６２３をサーバ６に設ける場合、第１の実施形態に係るプレイリスト生成装１に比べ、曲ＤＢ６２３により多くの曲音声データを蓄積することが可能となる。 In the playlist reproduction system 4 of the present embodiment, the server 6 performs the playlist generation processing as shown in FIGS. 9A, 9B, 11, and 12. Therefore, in the playlist playback system 4, an information terminal that can communicate and can reproduce an audio signal, such as a car navigation system, can be used as the music playback device 5. When the song DB 623 is provided in the server 6 as in the playlist playback system according to the present embodiment, more song audio data is stored in the song DB 623 than in the playlist generation device 1 according to the first embodiment. It is possible to do.

なお、プレイリスト再生システム４は、本実施形態で説明したような音声によりプレイリストの生成要求を入力する構成に限らず、プレイリストの生成要求をテキストで入力する構成であってもよい。更に、プレイリスト再生システム４は、音声による入力及びテキストによる入力が可能な構成であってもよい。 Note that the playlist reproduction system 4 is not limited to a configuration in which a playlist generation request is input by voice as described in the present embodiment, but may be a configuration in which a playlist generation request is input as text. Further, the playlist playback system 4 may have a configuration that allows input by voice and input by text.

［第４の実施形態］
図２４は、第４の実施形態に係るプレイリスト再生システムの構成を示す図である。 [Fourth embodiment]
FIG. 24 is a diagram illustrating a configuration of a playlist playback system according to the fourth embodiment.

図２４に示すように、本実施形態に係るプレイリスト再生システム４は、音楽再生装置５と、サーバ６とを備える。音楽生成装置５とサーバ６とは、インターネット等の通信ネットワーク７を介して通信可能に接続される。 As shown in FIG. 24, the playlist playback system 4 according to the present embodiment includes a music playback device 5 and a server 6. The music generation device 5 and the server 6 are communicably connected via a communication network 7 such as the Internet.

音楽再生装置５は、音楽データ（曲音声データ）を再生する装置である。音楽再生装置５は、音声入力部（マイク）５０１と、音声出力部５０２と、通信部５０４と、音声認識部５０５と、を備える。また、音楽再生装置５は、データ入力部５０６と、曲音声データ登録部５０７と、ナレーション音声作成部５０８と、プレイリスト再生部５０９と、を備える。また、音楽再生装置５は、曲ＩＤ及び曲音声データを格納するデータベース５２１を備える。 The music reproducing device 5 is a device for reproducing music data (song audio data). The music reproducing device 5 includes a voice input unit (microphone) 501, a voice output unit 502, a communication unit 504, and a voice recognition unit 505. Further, the music playback device 5 includes a data input unit 506, a music / sound data registration unit 507, a narration sound creation unit 508, and a playlist playback unit 509. Further, the music reproducing device 5 includes a database 521 for storing music IDs and music sound data.

通信部５０４は、サーバ６が備える図示しない通信部との間でのデータ通信等を行う。
音声認識部５０５は、音声入力部５０１により入力された音声信号を解析し、音声信号に含まれる有意な音声をテキストに変換する。 The communication unit 504 performs data communication with a communication unit (not shown) of the server 6.
The voice recognition unit 505 analyzes the voice signal input by the voice input unit 501, and converts significant voice included in the voice signal into text.

データ入力部５０６は、音楽ＣＤ等の音声データから作成した曲音声データの入力を受け付ける。 The data input unit 506 receives an input of music audio data created from audio data such as a music CD.

曲音声データ登録部５０７は、データ入力部５０６を介して入力された曲音声データや、通信ネットワーク７を介して取得した曲音声データをデータベース５２１に登録する。また、曲音声データ登録部５０７は、データ入力部５０６を介して入力された曲音声データについての曲ＩＤをサーバ６の図示しない曲ＤＢから取得する処理を行う。また、曲音声データ登録部５０７は、データベース５２１に登録した曲音声データについての曲ＩＤをサーバ６の図示しない曲ＩＤＤＢに登録する処理を行う。 The song audio data registration unit 507 registers song audio data input via the data input unit 506 and song audio data obtained via the communication network 7 in the database 521. The music sound data registration unit 507 performs a process of acquiring a music ID of the music sound data input via the data input unit 506 from a music DB (not shown) of the server 6. The song sound data registration unit 507 performs a process of registering the song ID of the song sound data registered in the database 521 in a song ID DB (not shown) of the server 6.

ナレーション音声作成部５０８は、サーバ６が作成したナレーション文章のテキストを音声データに変換する。 The narration speech creation unit 508 converts the text of the narration sentence created by the server 6 into speech data.

プレイリスト再生部５０９は、ナレーション音声作成部５０８で作成したナレーション音声の再生、及びサーバ６が作成したプレイリストの再生を行う。サーバ６は、音楽再生装置５のデータベース５２１に収納された曲音声データに対するプレイリストを作成する。そのため、プレイリスト再生部５０９は、プレイリストに従ってデータベース５２１から曲音声データを読み出し再生する。 The playlist playback unit 509 plays back the narration voice created by the narration voice creation unit 508 and plays back the playlist created by the server 6. The server 6 creates a playlist for the music and sound data stored in the database 521 of the music playback device 5. Therefore, the play list reproducing unit 509 reads out and reproduces the tune audio data from the database 521 according to the play list.

図２５は、第４の実施形態に係るプレイリスト再生システムにおけるサーバの機能的構成を示す図である。 FIG. 25 is a diagram illustrating a functional configuration of a server in the playlist playback system according to the fourth embodiment.

図２５に示すように、本実施形態のプレイリスト再生システム４におけるサーバ６は、通信部６０１と、キーワード／属性抽出部６０３と、曲絞込部６０４と、を備える。また、サーバ６は、推薦プレイリスト候補生成部６０５と、優先度算出部６０６と、プレイリスト決定部６０７と、ナレーション文章作成部６０８ａと、を備える。また、サーバ６は、キーワードＤＢ６２１と、絞込属性ＤＢ６２２と、曲ＤＢ６２３と、プレイリストＤＢ６２４と、曲ＩＤＤＢ６２３と、を備える。 As shown in FIG. 25, the server 6 in the playlist reproduction system 4 of the present embodiment includes a communication unit 601, a keyword / attribute extraction unit 603, and a music narrowing unit 604. Further, the server 6 includes a recommended playlist candidate generation unit 605, a priority calculation unit 606, a playlist determination unit 607, and a narration text creation unit 608a. In addition, the server 6 includes a keyword DB 621, a refinement attribute DB 622, a song DB 623, a playlist DB 624, and a song ID DB 623.

通信部６０１は、音楽再生装置５の通信部５０４との間でのデータ通信等を行う。通信部６０１は、音楽再生装置５が送信したテキスト（音声認識結果）や曲音声データの曲ＩＤ等を受信する。また、通信部６０１は、プレイリスト決定部６０７が決定したプレイリスト、ナレーション文章作成部６０８ａが作成したナレーション文章のテキスト等を音楽再生装置５に送信する。 The communication unit 601 performs data communication with the communication unit 504 of the music playback device 5. The communication unit 601 receives the text (speech recognition result) transmitted by the music reproducing device 5, the song ID of the song audio data, and the like. In addition, the communication unit 601 transmits the playlist determined by the playlist determination unit 607, the text of the narration text created by the narration text creation unit 608a, and the like to the music playback device 5.

キーワード／属性抽出部６０３は、音楽再生装置５から取得したテキスト（音声認識結果）に基づいて、キーワードＤＢ６２１から検索キーワードを抽出するとともに、絞込属性ＤＢ６２２から曲の絞り込み属性を抽出する。キーワードＤＢ６２１は、第１の実施形態で説明したキーワードＤＢ１２１と同様、テキストと検索キーワードとの対応関係が複数登録されたデータベースである。検索キーワードは、プレイリストの検索に用いるキーワードである。絞込属性ＤＢ６２２は、第１の実施形態で説明した絞込属性ＤＢ１２２と同様、テキストと曲の絞り込み属性との対応関係が複数登録されたデータベースである。曲の絞り込み属性は、プレイリストに登録する曲の絞り込みに用いる属性情報である。 The keyword / attribute extraction unit 603 extracts a search keyword from the keyword DB 621 based on the text (speech recognition result) acquired from the music reproducing device 5 and extracts a song narrowing attribute from the narrowing attribute DB 622. The keyword DB 621 is a database in which a plurality of correspondences between texts and search keywords are registered, similarly to the keyword DB 121 described in the first embodiment. The search keyword is a keyword used for searching for a playlist. The narrow-down attribute DB 622 is a database in which a plurality of correspondences between texts and narrow-down attributes of songs are registered, similarly to the narrow-down attribute DB 122 described in the first embodiment. The song narrowing down attribute is attribute information used to narrow down songs to be registered in the playlist.

曲絞込部６０４は、抽出した絞り込み属性に基づいて、プレイリストへの登録対象の候補となる曲を、曲ＤＢ６２３に登録された全ての曲のうち曲ＩＤＤＢ６２５に曲ＩＤが登録されている曲のなかから絞り込み属性に合致する属性を持つ曲に絞り込む。曲ＤＢ６２３は、第１の実施形態で説明した曲ＤＢ１２３と同様、曲音声データと、曲属性と、曲のイメージ特徴量との対応関係が複数登録されたデータベースである。曲属性は、曲音声データの絞り込みに用いる属性情報である。曲のイメージ特徴量は、曲のイメージを数値化した情報である。曲ＩＤＤＢ６２５は、音楽再生装置５のデータベース５２１に収納された曲音声データについての曲ＩＤが登録されたデータベースである。 The song narrowing unit 604 selects a song that is a candidate for registration in the playlist based on the extracted narrowing attribute, and registers a song ID in the song ID DB 625 among all songs registered in the song DB 623. The songs that have the attribute that matches the refinement attribute are narrowed down from the songs that exist. The music DB 623 is a database in which a plurality of correspondences between music audio data, music attributes, and image feature amounts of music are registered, similarly to the music DB 123 described in the first embodiment. The song attribute is attribute information used to narrow down song audio data. The music image feature amount is information obtained by digitizing the music image. The music ID DB 625 is a database in which music IDs of music sound data stored in the database 521 of the music playback device 5 are registered.

ナレーション文章作成部６０８は、プレイリスト決定部６０７が決定したプレイリストの名称を含むナレーション文章のテキストを作成する。 The narration sentence creation unit 608 creates a narration sentence text including the name of the playlist determined by the playlist determination unit 607.

本実施形態のプレイリスト再生システム４では、第１の実施形態に係るプレイリスト再生装置１の要求プレイリスト入力部１０１、ナレーション音声作成部１０７ｂ、プレイリスト再生部１０８、及び音声出力部１０９が音楽再生装置５に設けられている。すなわち、本実施形態のプレイリスト再生システム４におけるサーバ６は、プレイリスト生成装置として機能する。また、プレイリスト再生システム４が行うプレイリスト生成処理では、図９Ａ及び図９Ｂに示したステップＳ１〜Ｓ９の処理のうち、ステップＳ１，Ｓ２，Ｓ８の一部，及びＳ９の処理を音楽再生装置５が行う。すなわち、プレイリスト再生システム４が行うプレイリスト生成処理では、図９Ａに示したステップＳ３〜Ｓ７の処理をサーバ６が行う。 In the playlist playback system 4 of the present embodiment, the requested playlist input unit 101, the narration sound creation unit 107b, the playlist playback unit 108, and the audio output unit 109 of the playlist playback device 1 according to the first embodiment play music. The playback device 5 is provided. That is, the server 6 in the playlist reproduction system 4 of the present embodiment functions as a playlist generation device. In the playlist generation process performed by the playlist reproduction system 4, among the processes of steps S1 to S9 shown in FIGS. 9A and 9B, a part of steps S1, S2, and S8, and the process of S9 are performed by the music reproducing apparatus. 5 does. That is, in the playlist generation process performed by the playlist reproduction system 4, the server 6 performs the processes of steps S3 to S7 illustrated in FIG. 9A.

なお、本実施形態のプレイリスト再生システム４におけるサーバ６は、上記のように、音楽再生装置５のデータベース５２１に登録された曲音声データのみを対象としてプレイリストを生成する。そのため、音楽再生装置５は、データベース５２１に登録した曲音声データの曲ＩＤをサーバ６の曲ＩＤＤＢ６２５に登録する処理を行う。この登録処理は、例えば、以下のような手順で行う。 Note that, as described above, the server 6 in the playlist playback system 4 of the present embodiment generates a playlist only for the music and sound data registered in the database 521 of the music playback device 5. Therefore, the music playback device 5 performs a process of registering the song ID of the song audio data registered in the database 521 in the song ID DB 625 of the server 6. This registration process is performed in the following procedure, for example.

音楽再生装置５は、例えば、データ入力部５０６から曲音声データが入力されると、曲音声データ登録部５０７により入力された曲音声データに対する曲ＩＤを取得する処理を行う。曲音声データ登録部５０７は、入力された曲音声データを用いてサーバ６の曲ＤＢ６２３を検索し、当該曲音声データの曲ＩＤを取得する。曲音声データ登録部５０７は、取得した曲ＩＤをサーバ６の曲ＩＤＤＢ６２５に登録するとともに、曲音声データ及び曲ＩＤを音楽再生装置５のデータベース５２１に登録する。 For example, when music sound data is input from the data input unit 506, the music reproducing device 5 performs a process of acquiring a music ID for the music sound data input by the music sound data registration unit 507. The song audio data registration unit 507 searches the song DB 623 of the server 6 using the input song audio data, and acquires the song ID of the song audio data. The song audio data registration unit 507 registers the acquired song ID in the song ID DB 625 of the server 6 and also registers the song audio data and the song ID in the database 521 of the music playback device 5.

一方、サーバ６が行う処理のうち曲絞込部６０４が絞込属性に基づいて曲を絞り込む処理（ステップＳ４）では、曲ＩＤＤＢ６２５に登録された曲ＩＤの曲のみを対象として絞込属性に合致する曲属性を持つ曲に絞り込む。 On the other hand, among the processes performed by the server 6, the song narrowing unit 604 narrows down songs based on the narrowing attribute (step S4), and only the songs with the song IDs registered in the song ID DB 625 become the narrowing attributes. Narrow down to songs with matching song attributes.

このように、本実施形態に係るプレイリスト再生システム４では、音楽再生装置５に登録された曲音声データについてのプレイリストの生成処理の大部分を、音楽再生装置５とは別のサーバ６で行う。そのため、音楽再生装置５の処理負荷を軽減することが可能となる。 As described above, in the playlist playback system 4 according to the present embodiment, most of the playlist generation processing for the music sound data registered in the music playback device 5 is performed by the server 6 different from the music playback device 5. Do. Therefore, the processing load on the music playback device 5 can be reduced.

また、本実施形態に係るプレイリスト再生システム４では、音楽再生装置５に登録された曲音声データのみをプレイリストへの登録対象としているので、通信ネットワーク７を介したサーバ６から音楽再生装置５に曲音声データの転送は行わない。そのため、移動しながら無線通信を行っている場合等、通信品質が不安定な状況下においても、曲音声データを安定して再生することができる。 Further, in the playlist playback system 4 according to the present embodiment, since only the music sound data registered in the music playback device 5 is to be registered in the playlist, the music playback device 5 is transmitted from the server 6 via the communication network 7. The transfer of the song audio data is not performed. Therefore, even when the communication quality is unstable, such as when wireless communication is performed while moving, the music sound data can be stably reproduced.

［第５の実施形態］
図２６は、第５の実施形態に係るプレイリスト再生システムの構成を示す図である。 [Fifth Embodiment]
FIG. 26 is a diagram illustrating a configuration of a playlist playback system according to the fifth embodiment.

図２６に示すように、本実施形態に係るプレイリスト再生システム４は、音楽再生装置５と、サーバ６と、複数の携帯型電子機器８（８Ａ，８Ｂ）を備える。音楽生成装置５とサーバ６とは、インターネット等の通信ネットワーク７を介して通信可能に接続される。また、音楽再生装置５と携帯型電子機器８とは、ブルートゥース（登録商標）等の近距離無線通信により通信可能に接続される。本実施形態のプレイリスト再生システム４におけるサーバ６は、第４の実施形態のサーバ６（図２５を参照）と同じ構成でよい。 As shown in FIG. 26, the playlist playback system 4 according to the present embodiment includes a music playback device 5, a server 6, and a plurality of portable electronic devices 8 (8A, 8B). The music generation device 5 and the server 6 are communicably connected via a communication network 7 such as the Internet. Further, the music reproducing device 5 and the portable electronic device 8 are communicably connected by short-range wireless communication such as Bluetooth (registered trademark). The server 6 in the playlist playback system 4 of the present embodiment may have the same configuration as the server 6 of the fourth embodiment (see FIG. 25).

音楽再生装置５は、音楽データ（曲音声データ）を再生する装置である。音楽再生装置５は、音声入力部（マイク）５０１と、音声出力部５０２と、第１の通信部５０４と、第２の通信部５１０と、音声認識部５０５と、を備える。また、音楽再生装置５は、データ入力部５０６と、曲音声データ登録部５０７と、ナレーション音声作成部５０８と、プレイリスト再生部５０９と、を備える。また、音楽再生装置５は、曲ＩＤ及び曲音声データを格納するデータベース５２１を備える。 The music reproducing device 5 is a device for reproducing music data (song audio data). The music reproducing device 5 includes a voice input unit (microphone) 501, a voice output unit 502, a first communication unit 504, a second communication unit 510, and a voice recognition unit 505. Further, the music playback device 5 includes a data input unit 506, a music / sound data registration unit 507, a narration sound creation unit 508, and a playlist playback unit 509. Further, the music reproducing device 5 includes a database 521 for storing music IDs and music sound data.

第１の通信部５０４は、サーバ６が備える図示しない通信部との間でのデータ通信等を行う。第２の通信部５１０は、携帯型電子機器８が備える通信部８０１との間でデータ通信を行う。 The first communication unit 504 performs data communication with a communication unit (not shown) of the server 6. The second communication unit 510 performs data communication with the communication unit 801 included in the portable electronic device 8.

音声認識部５０５は、音声入力部５０１により入力された音声信号を解析し、音声信号に含まれる有意な音声をテキストに変換する。 The voice recognition unit 505 analyzes the voice signal input by the voice input unit 501, and converts significant voice included in the voice signal into text.

携帯型電子機器８（８Ａ，８Ｂ）は、スマートフォンやタブレット型コンピュータ等の無線通信又は有線通信が可能な電子機器である。携帯型電子機器８は、通信部８０１と、曲音声データを登録したデータベース８２１と、図示しない入力操作部と、備える。通信部８０１は、音楽再生装置５の第２の通信部５１０との間でデータ通信を行う。携帯型電子機器８の通信部８０１と音楽再生装置５の第２の通信部５１０とは、所定の無線通信規格に従った無線通信によりデータ通信を行う。 The portable electronic device 8 (8A, 8B) is an electronic device capable of wireless communication or wired communication such as a smartphone or a tablet computer. The portable electronic device 8 includes a communication unit 801, a database 821 in which music data is registered, and an input operation unit (not shown). The communication unit 801 performs data communication with the second communication unit 510 of the music playback device 5. The communication unit 801 of the portable electronic device 8 and the second communication unit 510 of the music playback device 5 perform data communication by wireless communication according to a predetermined wireless communication standard.

本実施形態のプレイリスト再生システム４では、第１の実施形態に係るプレイリスト再生装置１の要求プレイリスト入力部１０１、ナレーション音声作成部１０７ｂ、プレイリスト再生部１０８、及び音声出力部１０９が音楽再生装置５に設けられている。そのため、プレイリスト再生システム４が行うプレイリスト生成処理では、図９Ａ及び図９Ｂに示したステップＳ１〜Ｓ９の処理のうち、ステップＳ１，Ｓ２，Ｓ８の一部，及びＳ９の処理を音楽再生装置５が行う。すなわち、プレイリスト再生システム４が行うプレイリスト生成処理では、図９Ａに示したステップＳ３〜Ｓ７の処理をサーバ６が行う。したがって、本実施形態のプレイリスト再生システム４においても、サーバ６は、プレイリスト生成装置として機能する。 In the playlist playback system 4 according to the present embodiment, the requested playlist input unit 101, the narration sound creation unit 107b, the playlist playback unit 108, and the audio output unit 109 of the playlist playback device 1 according to the first embodiment play music. The playback device 5 is provided. Therefore, in the playlist generation processing performed by the playlist reproduction system 4, among the processing of steps S1 to S9 shown in FIGS. 9A and 9B, a part of steps S1, S2, and S8, and the processing of S9 are performed by the music reproducing apparatus. 5 does. That is, in the playlist generation process performed by the playlist reproduction system 4, the server 6 performs the processes of steps S3 to S7 illustrated in FIG. 9A. Therefore, also in the playlist reproduction system 4 of the present embodiment, the server 6 functions as a playlist generation device.

また、本実施形態のプレイリスト再生システム４では、携帯型電子機器８のデータベース８２１に登録された曲音声データを音楽再生装置５に転送することが可能である。携帯型電子機器８の曲音声データを音楽再生装置５に転送した場合も、音楽再生装置５は、同様の方法で曲ＩＤを取得して曲ＩＤＤＢ６２５に登録するとともに、曲音声データ及び曲ＩＤを音楽再生装置５のデータベース５２１に登録する。 Further, in the playlist playback system 4 of the present embodiment, it is possible to transfer the music and sound data registered in the database 821 of the portable electronic device 8 to the music playback device 5. Also in the case where the music sound data of the portable electronic device 8 is transferred to the music reproduction device 5, the music reproduction device 5 acquires the music ID by the same method, registers the music ID in the music ID DB 625, and stores the music sound data and the music ID. Is registered in the database 521 of the music playback device 5.

また、本実施形態に係るプレイリスト再生システム４では、音楽再生装置５のデータベース５２１に曲音声データが登録されていない状態であっても、携帯型電子機器８から曲音声データを取得して再生することができる。そのため、音楽ＣＤ等から生成した曲音声データを予め音楽再生装置５のデータベース５２１に登録する作業等の手間が省ける。 Further, in the playlist playback system 4 according to the present embodiment, even when the song audio data is not registered in the database 521 of the music playback device 5, the song audio data is acquired from the portable electronic device 8 and played. can do. Therefore, the work of registering the music sound data generated from the music CD or the like in the database 521 of the music reproducing device 5 in advance can be omitted.

更に、本実施形態に係るプレイリスト再生システム４では、図２６に示したように、複数の携帯型電子機器８から曲音声データを収集して音楽再生装置５のデータベース５２１に登録させることができる。そのため、プレイリストを再生した際に、各利用者が自身の所持する携帯型電子機器８に登録されていない曲を聞くことも可能となる。 Furthermore, in the playlist playback system 4 according to the present embodiment, as shown in FIG. 26, song audio data can be collected from a plurality of portable electronic devices 8 and registered in the database 521 of the music playback device 5. . Therefore, when the play list is reproduced, each user can listen to music that is not registered in the portable electronic device 8 owned by the user.

なお、上記のプレイリスト生成装置１や音楽再生装置５は、要求プレイリスト入力部１０１（マイク５０１）を用いた音声入力によりプレイリストの生成要求及び再生要求を入力する構成になっている。このような構成は、特に、車載向けの装置に適用することが好ましい。車両のドライバーが運転中にプレイリスト生成装置１や音楽再生装置５を使用する場合、装置本体や装置に接続された表示装置を見ながらプレイリストを選択することが難しい。そのため、上記の実施形態で説明したように音声入力によりプレイリストを選択することが好ましい。ただし、本実施形態のプレイリスト再生システム４においても、プレイリストの生成要求及び再生要求は、音声入力に限らず、タッチパネルやスイッチ等の入力装置を操作してテキストで入力するようにしてもよい。 The playlist generation device 1 and the music reproduction device 5 are configured to input a playlist generation request and a reproduction request by voice input using the request playlist input unit 101 (microphone 501). Such a configuration is particularly preferably applied to an in-vehicle device. When the driver of the vehicle uses the playlist generation device 1 or the music playback device 5 while driving, it is difficult to select a playlist while looking at the device body or a display device connected to the device. Therefore, it is preferable to select a playlist by voice input as described in the above embodiment. However, also in the playlist reproduction system 4 of the present embodiment, the playlist generation request and the reproduction request are not limited to voice input, and may be input as text by operating an input device such as a touch panel or a switch. .

また、上記のプレイリスト生成装置１、音楽再生装置５、及びサーバ６は、コンピュータと、当該コンピュータに実行させるプログラムとを用いて実現することが可能である。以下、コンピュータとプログラムとを用いて実現されるプレイリスト生成装置１等について、図２７を参照して説明する。 Further, the playlist generation device 1, the music playback device 5, and the server 6 can be realized using a computer and a program executed by the computer. Hereinafter, the playlist generation device 1 and the like realized using a computer and a program will be described with reference to FIG.

図２７は、コンピュータのハードウェア構成を示す図である。図２７に示すように、プレイリスト生成装置１として動作させることが可能なコンピュータ１０は、Central Processing Unit（ＣＰＵ）１００１と、主記憶装置１００２と、補助記憶装置１００３と、入力装置１００４と、表示装置１００５と、を備える。また、コンピュータ１０は、インタフェース装置１００６と、記憶媒体駆動装置１００７と、通信装置１００８と、を更に備える。コンピュータ１０におけるこれらの要素１００１から１００８までは、バス１０１０により相互に接続されており、要素間でのデータの受け渡しが可能になっている。 FIG. 27 is a diagram illustrating a hardware configuration of a computer. As shown in FIG. 27, a computer 10 that can be operated as the playlist generation device 1 includes a Central Processing Unit (CPU) 1001, a main storage device 1002, an auxiliary storage device 1003, an input device 1004, a display Device 1005. In addition, the computer 10 further includes an interface device 1006, a storage medium driving device 1007, and a communication device 1008. These elements 1001 to 1008 in the computer 10 are interconnected by a bus 1010 so that data can be transferred between the elements.

ＣＰＵ１００１は、オペレーティングシステムを含む各種のプログラムを実行することによりコンピュータ１０の全体の動作を制御する演算処理装置である。 The CPU 1001 is an arithmetic processing device that controls various operations of the computer 10 by executing various programs including an operating system.

主記憶装置１００２は、図示しないRead Only Memory（ＲＯＭ）及びRandom Access Memory（ＲＡＭ）を含む。ＲＯＭには、例えばコンピュータ１０の起動時にＣＰＵ１００１が読み出す所定の基本制御プログラム等が予め記録されている。また、ＲＡＭは、ＣＰＵ１００１が各種のプログラムを実行する際に、必要に応じて作業用記憶領域として使用する。主記憶装置１００２のＲＡＭは、例えば、抽出した検索キーワードや絞り込み属性、推薦プレイリスト候補等の一時的な保持に利用可能である。 The main storage device 1002 includes a read only memory (ROM) and a random access memory (RAM) (not shown). For example, a predetermined basic control program read by the CPU 1001 when the computer 10 is started is recorded in the ROM in advance. The RAM is used as a work storage area when necessary when the CPU 1001 executes various programs. The RAM of the main storage device 1002 can be used for temporarily storing extracted search keywords, narrowing attributes, recommended playlist candidates, and the like.

補助記憶装置１００３は、Hard Disk Drive（ＨＤＤ）やSolid State Drive（ＳＳＤ）等の主記憶装置１００２に比べて容量の大きい記憶装置である。補助記憶装置１００３には、ＣＰＵ１００１によって実行される各種のプログラムや各種のデータ等を記憶させることができる。補助記憶装置１００３には、例えば、記憶させるプログラムとしては、例えば、上記のプレイリスト生成処理を含む曲音声データの再生プログラム等が挙げられる。また、補助記憶装置１００３に記憶させるデータとしては、例えば、キーワードＤＢ１２１、絞込属性ＤＢ１２２、曲ＤＢ１２３、及びプレイリストＤＢ１２４等が挙げられる。 The auxiliary storage device 1003 is a storage device such as a hard disk drive (HDD) or a solid state drive (SSD) having a larger capacity than the main storage device 1002. The auxiliary storage device 1003 can store various programs executed by the CPU 1001, various data, and the like. In the auxiliary storage device 1003, for example, as a program to be stored, for example, there is a reproduction program of music and sound data including the above-described playlist generation processing. The data to be stored in the auxiliary storage device 1003 includes, for example, a keyword DB 121, a narrow-down attribute DB 122, a music DB 123, and a playlist DB 124.

入力装置１００４は、例えば押し釦スイッチ、キーボード装置、マウス装置等であり、利用者（オペレータ）が所定の操作を行うことにより当該操作に応じた情報がＣＰＵ１００１等に入力される。また、入力装置１００４には、例えば、表示装置１００５の表示面に重ねて配置したタッチパネル装置も利用可能である。 The input device 1004 is, for example, a push button switch, a keyboard device, a mouse device, or the like. When a user (operator) performs a predetermined operation, information corresponding to the operation is input to the CPU 1001 or the like. Further, as the input device 1004, for example, a touch panel device that is disposed so as to overlap the display surface of the display device 1005 can be used.

表示装置１００５は、例えば液晶ディスプレイ等の表示装置である。表示装置１００５は、ＣＰＵ１００１等から送信される表示データに従って各種のテキスト、画像等を表示する。 The display device 1005 is a display device such as a liquid crystal display. The display device 1005 displays various texts, images, and the like according to display data transmitted from the CPU 1001 or the like.

インタフェース装置１００６は、コンピュータ１０と他の電子機器とを接続し、コンピュータ１０と他の電子機器との間でのデータの送受信を可能にする入出力装置である。インタフェース装置１００６は、例えば、Universal Serial Bus（ＵＳＢ）規格のコネクタを備えたケーブルを接続可能な端子等を備える。インタフェース装置１００６によりコンピュータ１０と接続可能な電子機器としては、マイク１１やスピーカ１２等が挙げられる。 The interface device 1006 is an input / output device that connects the computer 10 to another electronic device and enables data transmission and reception between the computer 10 and the other electronic device. The interface device 1006 includes, for example, a terminal to which a cable having a connector of the Universal Serial Bus (USB) standard can be connected. Examples of electronic devices that can be connected to the computer 10 by the interface device 1006 include a microphone 11 and a speaker 12.

記憶媒体駆動装置１００７は、図示しない可搬型記憶媒体に記録されているプログラムやデータの読み出し、補助記憶装置１００３に記憶されたデータ等の可搬型記憶媒体への書き込みを行う。可搬型記憶媒体としては、例えば、ＵＳＢ規格のコネクタが備えられているフラッシュメモリが利用可能である。また、可搬型記憶媒体としては、Compact Disk（ＣＤ）、Digital Versatile Disc（ＤＶＤ）、Blu-ray Disc（Blu-rayは登録商標）等の光ディスクも利用可能である。 The storage medium driving device 1007 reads out programs and data recorded on a portable storage medium (not shown) and writes data and the like stored in the auxiliary storage device 1003 on the portable storage medium. As a portable storage medium, for example, a flash memory provided with a USB standard connector can be used. Further, as a portable storage medium, an optical disk such as a Compact Disk (CD), a Digital Versatile Disc (DVD), and a Blu-ray Disc (Blu-ray is a registered trademark) can be used.

通信装置１００８は、コンピュータ１０とインターネット或いはLocal Area Network（ＬＡＮ）等の通信ネットワークとを通信可能に接続し、通信ネットワークを介した他の通信端末（コンピュータ）との通信を制御する装置である。 The communication device 1008 is a device that communicably connects the computer 10 and a communication network such as the Internet or a Local Area Network (LAN), and controls communication with another communication terminal (computer) via the communication network.

このコンピュータ１０をプレイリスト生成装置１として動作させる場合、ＣＰＵ１００１は、補助記憶装置１００３等から図９Ａ及び図９Ｂに示したようなプレイリスト生成処理を含むプログラムを読み出して実行する。この際、ＣＰＵ１００１は、補助記憶装置１００３等に記憶させたキーワードＤＢ１２１等の各データベースを検索して推薦プレイリスト候補を生成し、各候補の優先度を算出する。そして、算出した優先度に基づいて利用者（オペレータ）に推薦するプレイリストを決定する。また、ＣＰＵ１００１は、決定したプレイリストの名称を含むナレーション音声を生成して再生するとともに、決定したプレイリストに含まれる曲の曲音声データを再生する。 When operating the computer 10 as the playlist generation device 1, the CPU 1001 reads a program including a playlist generation process as shown in FIGS. 9A and 9B from the auxiliary storage device 1003 or the like and executes the program. At this time, the CPU 1001 searches each database such as the keyword DB 121 stored in the auxiliary storage device 1003 or the like, generates a recommended playlist candidate, and calculates the priority of each candidate. Then, a playlist to be recommended to the user (operator) is determined based on the calculated priority. Further, the CPU 1001 generates and reproduces a narration sound including the name of the determined playlist, and also reproduces song audio data of a song included in the determined playlist.

また、コンピュータ１０を音楽再生装置５として動作させる場合、ＣＰＵ１００１は、補助記憶装置１００３等から第４の実施形態で説明した曲音声データの登録処理やプレイリストの再生処理を含むプログラムを読み出して実行する。この際、コンピュータ１０は、例えば、記憶媒体駆動装置１００７を用いて音楽ＣＤ等から曲音声データを取得する。その後、コンピュータ１０では、ＣＰＵ１００１が通信装置１００８を介してサーバ６から曲ＩＤを取得し、曲音声データ及び曲ＩＤを補助記憶装置１００３に記憶させる。また、ＣＰＵ１００１は、マイク１１から入力された音声信号に対して音声認識処理を行い音声信号に含まれる有意な音声をテキストに変換する。更に、ＣＰＵ１００１は、再生するプレイリストの名称を含むナレーション音声を作成して再生する処理、プレイリストに含まれる曲音声データを補助記憶装置１００３から読み出して再生する処理等を行う。 When the computer 10 is operated as the music reproducing device 5, the CPU 1001 reads a program including the process of registering the music and sound data and the process of reproducing the playlist described in the fourth embodiment from the auxiliary storage device 1003 and the like. Execute. At this time, the computer 10 obtains music sound data from a music CD or the like using the storage medium driving device 1007, for example. Thereafter, in the computer 10, the CPU 1001 acquires the song ID from the server 6 via the communication device 1008, and stores the song audio data and the song ID in the auxiliary storage device 1003. Further, the CPU 1001 performs a voice recognition process on the voice signal input from the microphone 11 and converts significant voice included in the voice signal into text. Further, the CPU 1001 performs a process of creating and reproducing narration audio including the name of the playlist to be reproduced, a process of reading music audio data included in the playlist from the auxiliary storage device 1003, and reproducing the same.

なお、コンピュータ１０は、図２７に示した全ての構成要素を含む必要はなく、用途や条件に応じて一部の構成要素を省略することも可能である。例えば、車両に搭載して使用することを想定したコンピュータ１０の場合、記憶媒体駆動装置１００７を省略してもよい。 Note that the computer 10 does not need to include all the components shown in FIG. 27, and some of the components can be omitted depending on the application and conditions. For example, in the case of the computer 10 which is assumed to be mounted on a vehicle and used, the storage medium driving device 1007 may be omitted.

また、コンピュータ１０は、種々のプログラムを実行することにより複数の機能を実現する汎用型のものに限らず、例えば、プレイリスト生成装置１として動作させるための処理に特化した専用の情報処理装置でもよい。 The computer 10 is not limited to a general-purpose computer that realizes a plurality of functions by executing various programs. For example, the computer 10 is a dedicated information processing device specialized in processing for operating the playlist generation device 1. May be.

上記のように、本明細書では、曲音声データ（音楽データ）のイメージを表す特徴量に基づいてプレイリストを生成するプレイリスト生成装置１及びプレイリスト再生システム４について説明した。しかしながら、プレイリストに登録するデータは、曲音声データに限らず、イメージを表す特徴量により分類可能なデータであればよい。例えば、プレイリストに登録するデータは、小説等の朗読再生用の文書であってもよい。更に、プレイリストに登録するデータは、画像データや動画像データ等であってもよい。画像データをプレイリストに登録する場合、プレイリストに基づいて画像データをスライドショーで再生することにより、プレイリストのイメージと再生される画像のイメージとの齟齬を解消することが可能となる。 As described above, in the present specification, the playlist generation device 1 and the playlist reproduction system 4 that generate a playlist based on the feature amount representing the image of the music sound data (music data) have been described. However, the data to be registered in the playlist is not limited to the music sound data, but may be any data that can be classified by the feature amount representing the image. For example, the data registered in the playlist may be a document for reading and reproducing a novel or the like. Further, the data registered in the playlist may be image data, moving image data, or the like. When registering image data in a playlist, by playing back the image data in a slide show based on the playlist, it is possible to resolve the discrepancy between the image of the playlist and the image of the image to be played.

以上記載した各実施例を含む実施形態に関し、更に以下の付記を開示する。
（付記１）
コンピュータが、
利用者からのプレイリストの生成要求に含まれる当該プレイリストへの登録対象とする曲の属性を表す属性情報に基づいて曲の絞込を行い、
絞り込んだ前記曲のイメージを表す特徴量、及び前記生成要求に含まれる前記プレイリストのイメージと対応した前記特徴量の範囲に基づいて要求プレイリストを生成し、
生成した前記要求プレイリストの前記特徴量の範囲内において当該要求プレイリストに登録された前記曲の特徴量に偏りがある場合に、当該要求プレイリストに登録された前記曲の特徴量に基づいて前記利用者に提示するプレイリストの名称を変更する、
処理を実行することを特徴とするプレイリスト生成方法。
（付記２）
前記要求プレイリストの前記特徴量の範囲内において当該要求プレイリストに登録された前記曲の特徴量に偏りがある場合に、前記コンピュータは、前記要求プレイリストに登録された前記曲の特徴量に基づいて前記利用者に提示する前記プレイリストの名称を変更するとともに、変更後の前記イメージに基づいて前記利用者に推薦する推薦プレイリストを作成する、処理を実行する、
ことを特徴とする付記１に記載のプレイリスト生成方法。
（付記３）
前記推薦プレイリストを作成する処理では、前記コンピュータは、絞り込まれた前記曲のうち、変更後の前記プレイリストのイメージと対応した前記特徴量の範囲内となり、かつ前記要求プレイリストの前記特徴量の範囲内となる特徴量を持つ曲のみを登録した前記推薦プレイリストを作成する、
ことを特徴とする付記２に記載のプレイリスト生成方法。
（付記４）
前記推薦プレイリストを作成する処理では、前記コンピュータは、絞り込まれた前記曲のうち、変更後の前記プレイリストのイメージと対応した前記特徴量の範囲内となる特徴量を持つ曲を全て登録した前記推薦プレイリストを作成する、
ことを特徴とする付記２に記載のプレイリスト生成方法。
（付記５）
前記コンピュータが、絞り込んだ前記曲の特徴量に基づいて前記利用者に推薦するプレイリストの候補を生成する処理を更に実行し、
前記利用者に提示する前記プレイリストの名称を変更する処理では、前記コンピュータは、前記推薦するプレイリストの候補に登録された前記曲の特徴量の分布と、前記要求プレイリストに登録された前記曲の特徴量の分布に基づいて前記利用者に推薦するプレイリストを決定する、
ことを特徴とする付記１に記載のプレイリスト生成方法。
（付記６）
前記推薦するプレイリストの候補に登録された前記曲の特徴量の分布と、前記要求プレイリストに登録された前記曲の特徴量の分布に基づいて前記利用者に推薦するプレイリストを決定する処理では、前記コンピュータは、
前記推薦するプレイリストの候補に含まれる前記曲の特徴量の分布と当該推薦するプレイリストの候補の想定している前記曲の特徴量の分布とを用いて認識ずれ量を算出するとともに、前記推薦するプレイリストの候補の想定している前記曲の特徴量の分布と前記要求プレイリストの想定している前記曲の特徴量の分布とを用いて要求ずれ量を算出し、
前記認識ずれ量及び前記要求ずれ量に基づいて前記推薦するプレイリストの候補に対する推薦優先度を算出し、
前記推薦優先度の最も高い前記候補を前記利用者に推薦するプレイリストに決定する、
ことを特徴とする付記５に記載のプレイリスト生成方法。
（付記７）
前記コンピュータは、前記曲の特徴量の分布を正規分布として前記認識ずれ量及び前記要求ずれ量を算出する、
ことを特徴とする付記６に記載のプレイリスト生成方法。
（付記８）
前記コンピュータは、前記曲の特徴量の分布を一様分布として前記認識ずれ量及び前記要求ずれ量を算出する、
ことを特徴とする付記６に記載のプレイリスト生成方法。
（付記９）
前記コンピュータが、
前記利用者に提示したプレイリストに含まれる曲を再生し、
前記曲の再生中に再生をスキップする操作情報が入力された場合に、スキップした前記曲の特徴量が前記プレイリストへの登録対象となる曲の特徴量の範囲外となるよう、前記プレイリストの前記特徴量の範囲を修正する、処理を更に実行する、
ことを特徴とする付記１に記載のプレイリスト生成方法。
（付記１０）
前記プレイリストの前記特徴量の範囲を修正する処理では、前記コンピュータは、前記プレイリストの前記特徴量の範囲を縮小する、
ことを特徴とする付記９に記載のプレイリスト生成方法。
（付記１１）
前記プレイリストの前記特徴量の範囲を修正する処理では、前記コンピュータは、
前記プレイリストの前記特徴量の範囲を縮小した後、
スキップした前記曲の特徴量が含まれない態様で前記プレイリストの前記特徴量の範囲を拡大する、
ことを特徴とする付記９に記載のプレイリスト生成方法。
（付記１２）
前記コンピュータが、
前記利用者に提示する前記プレイリストの名称を含む音声データを生成し、当該音声データを再生して前記プレイリストの名称を前記利用者に提示する、処理を更に実行する、
ことを特徴とする付記１に記載のプレイリスト生成方法。
（付記１３）
利用者からのプレイリストの生成要求に基づいて、予め用意されたデータベースから、前記プレイリストのイメージを表すキーワード、及び前記プレイリストへの登録対象とする曲の属性を表す属性情報を抽出する抽出部と、
前記属性情報に基づいて前記プレイリストへの登録対象とする前記曲の絞り込みを行う曲絞込部と、
絞り込んだ前記曲のイメージを表す特徴量に基づいて前記利用者に推薦するプレイリストの候補を生成する推薦プレイリスト候補生成部と、
前記キーワードと対応する要求プレイリストを生成し、当該要求プレイリスト及び前記利用者に推薦するプレイリストの候補に基づいて前記推薦するプレイリストの候補についての優先度を算出する優先度算出部と、
算出した前記優先度に基づいて前記利用者に提示するプレイリストを決定するプレイリスト決定部と、
決定した前記プレイリストの名称を利用者に提示する情報を生成する情報生成部と、を備える、
ことを特徴とするプレイリスト生成装置。
（付記１４）
前記プレイリスト決定部は、決定した前記プレイリストのイメージと対応した前記特徴量の範囲内となり、かつ前記要求プレイリストのイメージと対応した前記特徴量の範囲内となる特徴量を持つ曲のみを登録した前記プレイリストを作成する、
ことを特徴とする付記１３に記載のプレイリスト生成装置。
（付記１５）
前記優先度算出部は、
前記推薦するプレイリストの候補に登録された前記曲の特徴量の分布と当該推薦するプレイリストの候補の想定している前記曲の特徴量の分布とを用いて認識ずれ量を算出する認識ずれ量算出部と、
前記推薦するプレイリストの候補の想定している前記曲の特徴量の分布と前記要求プレイリストの想定している前記曲の特徴量の分布とを用いて要求ずれ量を算出する要求ずれ量算出部と、
前記認識ずれ量及び前記要求ずれ量を統合して前記推薦するプレイリストの候補に対する推薦優先度を算出するずれ量統合部と、を含む、
ことを特徴とする付記１３に記載のプレイリスト生成装置。
（付記１６）
当該プレイリスト生成装置は、外部装置が記憶している前記曲の識別情報を登録する識別情報データベースを更に備え、
前記曲絞込部は、前記識別情報データベースに登録された前記曲から前記属性情報を満たす曲のみに絞り込む、
ことを特徴とする付記１３に記載のプレイリスト生成装置。
（付記１７）
当該プレイリスト生成装置は、前記利用者に提示したプレイリストの再生中に前記利用者により再生している曲をスキップする操作が行われた場合に、スキップした前記曲が前記プレイリストの前記特徴量の範囲外となるよう前記プレイリストを修正する適応処理部、を更に備える、
ことを特徴とする付記１３に記載のプレイリスト生成装置。
（付記１８）
利用者からのプレイリストの生成要求に含まれる当該プレイリストへの登録対象とする曲の属性を表す属性情報に基づいて曲の絞込を行い、
絞り込んだ前記曲のイメージを表す特徴量、及び前記生成要求に含まれる前記プレイリストのイメージと対応した前記特徴量の範囲に基づいて要求プレイリストを生成し、
生成した前記要求プレイリストの前記特徴量の範囲内において当該要求プレイリストに登録された前記曲の特徴量に偏りがある場合に、当該要求プレイリストに登録された前記曲の特徴量に基づいて前記利用者に提示するプレイリストの名称を変更する、
処理をコンピュータに実行させるためのプログラム。
（付記１９）
前記要求プレイリストの前記特徴量の範囲内において当該要求プレイリストに登録された前記曲の特徴量に偏りがある場合に、前記コンピュータに、前記要求プレイリストに登録された前記曲の特徴量に基づいて前記利用者に提示する前記プレイリストの名称を変更するとともに、変更後の前記イメージに基づいて前記利用者に推薦する推薦プレイリストを作成する、処理を実行させる、
ことを特徴とする付記１８に記載のプログラム。
（付記２０）
前記推薦プレイリストを作成する処理は、絞り込まれた前記曲のうち、変更後の前記プレイリストのイメージと対応した前記特徴量の範囲内となり、かつ前記要求プレイリストの前記特徴量の範囲内となる特徴量を持つ曲のみを登録した前記推薦プレイリストを作成する処理を含む、
ことを特徴とする付記１９に記載のプログラム。
（付記２１）
前記推薦プレイリストを作成する処理は、絞り込まれた前記曲のうち、変更後の前記プレイリストのイメージと対応した前記特徴量の範囲内となる特徴量を持つ曲を全て登録した前記推薦プレイリストを作成する処理を含む、
ことを特徴とする付記１９に記載のプログラム。
（付記２２）
前記コンピュータに、絞り込んだ前記曲の特徴量に基づいて前記利用者に推薦するプレイリストの候補を生成する処理を更に実行させ、
前記利用者に提示する前記プレイリストの名称を変更する処理は、前記コンピュータに、前記推薦するプレイリストの候補に登録された前記曲の特徴量の分布と、前記要求プレイリストに登録された前記曲の特徴量の分布との差異に基づいて前記利用者に推薦するプレイリストを決定し、当該推薦するプレイリストの名称に前記プレイリストの名称を変更する処理を含む、
ことを特徴とする付記１８に記載のプログラム。
（付記２３）
前記推薦するプレイリストの候補に登録された前記曲の特徴量の分布と、前記要求プレイリストに登録された前記曲の特徴量の分布との差異に基づいて前記利用者に推薦するプレイリストを決定する処理は、
前記推薦するプレイリストの候補に含まれる前記曲の特徴量の分布と当該推薦するプレイリストの候補の想定している前記曲の特徴量の分布とを用いて認識ずれ量を算出するとともに、前記推薦するプレイリストの候補の想定している前記曲の特徴量の分布と前記要求プレイリストの想定している前記曲の特徴量の分布とを用いて要求ずれ量を算出し、
前記認識ずれ量及び前記要求ずれ量に基づいて前記推薦するプレイリストの候補に対する推薦優先度を算出し、
前記推薦優先度の最も高い前記候補を前記利用者に推薦するプレイリストに決定する、処理を含む、
ことを特徴とする付記２２に記載のプログラム。
（付記２４）
前記コンピュータに、
前記利用者に提示したプレイリストに含まれる曲を再生し、
前記曲の再生中に再生をスキップする操作情報が入力された場合に、スキップした前記曲の特徴量が前記プレイリストへの登録対象となる曲の特徴量の範囲外となるよう、前記プレイリストの前記特徴量の範囲を修正する、処理を更に実行させる、
ことを特徴とする付記１８に記載のプログラム。
（付記２５）
前記プレイリストの前記特徴量の範囲を修正する処理は、前記プレイリストの前記特徴量の範囲を縮小する処理を含む、
ことを特徴とする付記２４に記載のプログラム。
（付記２６）
前記プレイリストの前記特徴量の範囲を修正する処理は、
前記プレイリストの前記特徴量の範囲を縮小した後、スキップした前記曲の特徴量が含まれない態様で前記プレイリストの前記特徴量の範囲を拡大する処理を含む、
ことを特徴とする付記２４に記載のプログラム。
（付記２７）
前記コンピュータに、
前記利用者に提示する前記プレイリストの名称を含む音声データを生成し、当該音声データを再生して前記プレイリストの名称を前記利用者に提示する、処理を更に実行させる、
ことを特徴とする付記１８に記載のプログラム。
（付記２８）
コンピュータが、
利用者からの再生リストの生成要求に含まれる当該再生リストへの登録対象とするデータの属性を表す属性情報に基づいてデータの絞込を行い、
絞り込んだ前記データのイメージを表す特徴量、及び前記生成要求に含まれる前記再生リストのイメージと対応した前記特徴量の範囲に基づいて要求再生リストを生成し、
生成した前記要求再生リストの前記特徴量の範囲内において当該要求再生リストに登録された前記データの特徴量に偏りがある場合に、当該要求再生リストに登録された前記データの特徴量に基づいて前記利用者に提示する再生リストの名称を生成する、
処理を実行することを特徴とする再生リスト生成方法。 Regarding the embodiment including each of the examples described above, the following supplementary notes are further disclosed.
(Appendix 1)
Computer
The music is narrowed down based on the attribute information indicating the attribute of the music to be registered in the playlist included in the playlist generation request from the user,
Generating a requested playlist based on the feature amount representing the image of the narrowed down song and the range of the feature amount corresponding to the image of the playlist included in the generation request;
When there is a bias in the feature amount of the song registered in the request playlist within the range of the feature amount of the generated request playlist, based on the feature amount of the song registered in the request playlist. Changing the name of the playlist presented to the user,
A playlist generation method, characterized by performing a process.
(Appendix 2)
When there is a bias in the feature amount of the song registered in the request playlist within the range of the feature amount of the request playlist, the computer determines the feature amount of the song registered in the request playlist. Executing a process of changing a name of the playlist to be presented to the user based on the changed image and creating a recommended playlist recommended to the user based on the changed image.
3. The playlist generation method according to claim 1, wherein
(Appendix 3)
In the process of creating the recommended playlist, the computer includes, within the narrowed-down songs, a range of the feature amount corresponding to the image of the playlist after the change, and the feature amount of the requested playlist. Creating the recommended playlist in which only songs having feature amounts within the range are registered;
3. The playlist generation method according to claim 2, wherein
(Appendix 4)
In the process of creating the recommended playlist, the computer has registered all songs having feature amounts within the feature amount range corresponding to the image of the playlist after the change among the narrowed songs. Creating the recommended playlist,
3. The playlist generation method according to claim 2, wherein
(Appendix 5)
The computer further executes a process of generating a playlist candidate recommended to the user based on the narrowed down feature amount of the song,
In the process of changing the name of the playlist to be presented to the user, the computer includes a distribution of a feature amount of the song registered as the recommended playlist candidate and the distribution of the feature amount registered in the request playlist. Determining a playlist to be recommended to the user based on a distribution of a feature amount of a song;
3. The playlist generation method according to claim 1, wherein
(Appendix 6)
A process of determining a playlist to be recommended to the user based on the distribution of the feature amounts of the songs registered in the recommended playlist candidates and the distribution of the feature amounts of the songs registered in the request playlist; Then, the computer is
Using the distribution of the feature amounts of the songs included in the recommended playlist candidates and the distribution of the feature amounts of the songs assumed to be the recommended playlist candidates, the recognition misregistration amount is calculated. Calculating the required deviation amount using the distribution of the feature amount of the music assumed as the candidate of the playlist to be recommended and the distribution of the feature amount of the music assumed in the requested playlist,
Calculating a recommendation priority for the recommended playlist candidate based on the recognition shift amount and the request shift amount;
Determine the candidate with the highest recommendation priority as a playlist to recommend to the user,
A playlist generation method according to claim 5, wherein
(Appendix 7)
The computer calculates the recognition deviation amount and the required deviation amount with the distribution of the feature amount of the music as a normal distribution,
6. The playlist generation method according to claim 6, wherein
(Appendix 8)
The computer calculates the recognition shift amount and the required shift amount as a uniform distribution of the feature amount of the music,
6. The playlist generation method according to claim 6, wherein
(Appendix 9)
Said computer,
Play songs included in the playlist presented to the user,
When the operation information for skipping the reproduction is input during the reproduction of the song, the playlist is set such that the feature amount of the skipped song is out of the range of the feature amount of the song to be registered in the playlist. Correcting the range of the feature amount, and further executing a process,
3. The playlist generation method according to claim 1, wherein
(Appendix 10)
In the process of correcting the range of the feature amount of the playlist, the computer reduces the range of the feature amount of the playlist,
A playlist generation method according to claim 9, wherein the playlist is generated.
(Appendix 11)
In the process of correcting the range of the feature amount of the playlist, the computer includes:
After reducing the range of the feature amount of the playlist,
Expanding the range of the feature amount of the playlist in a manner that does not include the feature amount of the skipped song;
A playlist generation method according to claim 9, wherein the playlist is generated.
(Appendix 12)
Said computer,
Generating audio data including the name of the playlist to be presented to the user, reproducing the audio data and presenting the name of the playlist to the user, further executing a process;
3. The playlist generation method according to claim 1, wherein
(Appendix 13)
Extraction for extracting a keyword representing an image of the playlist and attribute information representing an attribute of a song to be registered in the playlist from a database prepared in advance based on a playlist generation request from a user. Department and
A song narrowing-down unit that narrows down the songs to be registered in the playlist based on the attribute information;
A recommended playlist candidate generation unit that generates a playlist candidate to be recommended to the user based on the feature amount representing the image of the tuned song;
A priority calculation unit that generates a requested playlist corresponding to the keyword, and calculates a priority for the recommended playlist candidate based on the requested playlist and a candidate playlist recommended to the user;
A playlist determination unit that determines a playlist to be presented to the user based on the calculated priority,
An information generation unit that generates information that presents the determined name of the playlist to the user.
A playlist generation device, characterized in that:
(Appendix 14)
The playlist determining unit determines only songs having feature amounts that fall within the range of the feature amount corresponding to the determined image of the playlist and fall within the range of the feature amount corresponding to the image of the requested playlist. Create the registered playlist,
14. The playlist generation device according to supplementary note 13, wherein
(Appendix 15)
The priority calculation unit,
Recognition misregistration that calculates a recognition misregistration amount using the distribution of the feature amount of the song registered in the recommended playlist candidate and the distribution of the feature amount of the song assumed as the recommended playlist candidate. An amount calculator,
Request deviation amount calculation using the distribution of the feature amount of the music assumed as the candidate of the recommended playlist and the distribution of the characteristic amount of the music assumed in the required playlist. Department and
A shift amount integration unit that integrates the recognition shift amount and the request shift amount and calculates a recommendation priority for the recommended playlist candidate.
14. The playlist generation device according to supplementary note 13, wherein
(Appendix 16)
The playlist generation device further includes an identification information database for registering identification information of the music stored in the external device,
The song narrowing unit narrows only songs that satisfy the attribute information from the songs registered in the identification information database,
14. The playlist generation device according to supplementary note 13, wherein
(Appendix 17)
The playlist generation device may be configured such that, when an operation of skipping a song being played by the user is performed during playback of the playlist presented to the user, the skipped song is the feature of the playlist. An adaptive processing unit that corrects the playlist so as to be out of the range of the amount.
14. The playlist generation device according to supplementary note 13, wherein
(Appendix 18)
The music is narrowed down based on the attribute information indicating the attribute of the music to be registered in the playlist included in the playlist generation request from the user,
Generating a requested playlist based on the feature amount representing the image of the narrowed down song and the range of the feature amount corresponding to the image of the playlist included in the generation request;
When there is a bias in the feature amount of the song registered in the request playlist within the range of the feature amount of the generated request playlist, based on the feature amount of the song registered in the request playlist. Changing the name of the playlist presented to the user,
A program that causes a computer to execute processing.
(Appendix 19)
When there is a bias in the feature amount of the song registered in the request playlist within the range of the feature amount of the request playlist, the computer sends the feature amount of the song registered in the request playlist to the computer. Changing the name of the playlist to be presented to the user based on the changed image, creating a recommended playlist recommended to the user based on the changed image, and executing a process.
18. The program according to supplementary note 18, wherein:
(Appendix 20)
The process of creating the recommended playlist includes, within the narrowed-down songs, the range of the feature amount corresponding to the image of the playlist after the change, and the range of the feature amount of the requested playlist. Including a process of creating the recommended playlist in which only songs having the following feature amounts are registered:
20. The program according to claim 19, wherein:
(Appendix 21)
The process of creating the recommended playlist includes, among the narrowed-down songs, the recommended playlist in which all songs having feature amounts within a range of the feature amount corresponding to the image of the playlist after the change are registered. Including the process of creating
20. The program according to claim 19, wherein:
(Appendix 22)
The computer further executes a process of generating a playlist candidate recommended to the user based on the narrowed down feature amount of the music,
The process of changing the name of the playlist to be presented to the user includes, in the computer, a distribution of the feature amounts of the songs registered in the recommended playlist candidates and the Determining a playlist to be recommended to the user based on the difference from the distribution of the feature amount of the song, and changing the name of the playlist to the name of the recommended playlist;
18. The program according to supplementary note 18, wherein:
(Appendix 23)
A playlist recommended to the user based on a difference between the distribution of the feature amounts of the songs registered in the recommended playlist candidates and the distribution of the feature amounts of the songs registered in the requested playlist. The process to determine is
Using the distribution of the feature amounts of the songs included in the recommended playlist candidates and the distribution of the feature amounts of the songs assumed to be the recommended playlist candidates, the recognition misregistration amount is calculated. Calculating the required deviation amount using the distribution of the feature amount of the music assumed as the candidate of the playlist to be recommended and the distribution of the feature amount of the music assumed in the requested playlist,
Calculating a recommendation priority for the recommended playlist candidate based on the recognition shift amount and the request shift amount;
Determining the candidate with the highest recommendation priority as a playlist recommended to the user, including:
23. The program according to supplementary note 22, wherein:
(Appendix 24)
On the computer,
Play songs included in the playlist presented to the user,
When the operation information for skipping the reproduction is input during the reproduction of the song, the playlist is set such that the feature amount of the skipped song is out of the range of the feature amount of the song to be registered in the playlist. Correcting the range of the feature amount, causing the process to be further executed,
18. The program according to supplementary note 18, wherein:
(Appendix 25)
The process of modifying the range of the feature amount of the playlist includes a process of reducing the range of the feature amount of the playlist.
25. The program according to supplementary note 24, wherein:
(Supplementary Note 26)
The process of modifying the range of the feature amount of the playlist includes:
After reducing the range of the feature amount of the playlist, expanding the range of the feature amount of the playlist in a manner that does not include the feature amount of the skipped song.
25. The program according to supplementary note 24, wherein:
(Appendix 27)
On the computer,
Generating audio data including the name of the playlist to be presented to the user, reproducing the audio data and presenting the name of the playlist to the user, and further executing a process.
18. The program according to supplementary note 18, wherein:
(Appendix 28)
Computer
Data narrowing down is performed based on attribute information indicating attributes of data to be registered in the playlist included in the playlist generation request from the user,
Generating a requested playlist based on a feature amount representing an image of the narrowed-down data, and a range of the feature amount corresponding to the image of the playlist included in the generation request;
When the feature amount of the data registered in the requested playlist is biased within the range of the feature amount of the generated requested playlist, based on the feature amount of the data registered in the requested playlist. Generating a playlist name to be presented to the user;
A play list generation method, characterized by performing a process.

１プレイリスト生成装置
４プレイリスト再生システム
５音楽再生装置
６サーバ
７通信ネットワーク
８，８Ａ，８Ｂ携帯型電子機器
１０コンピュータ
１１マイク
１２スピーカ
１０１要求プレイリスト入力部
１０２，６０３キーワード／属性抽出部
１０３，６０４曲絞込部
１０４，６０５推薦プレイリスト候補生成部
１０５，６０６優先度算出部
１０５ａ認識ずれ量算出部
１０５ｂ要求ずれ量算出部
１０５ｃプレイリスト検索部
１０５ｄずれ量統合部
１０６，６０７プレイリスト決定部
１０７，６０８ナレーション作成部
１０７ａ，６０８ａナレーション文章作成部
１０７ｂ，５０８ナレーション音声作成部
１０８，５０９，６０９プレイリスト再生部
１０９，５０２音声出力部
１１５スキップ操作入力部
１１６適応処理部
１１６ａプレイリスト範囲修正部
１１６ｂプレイリストＤＢ再学習部
１１６ｃ（修正データの）データベース
１１６ｄ（プレイリスト学習用データの）データベース
１２１，６２１キーワードＤＢ
１２２，６２２絞込属性ＤＢ
１２３，６２３曲ＤＢ
１２４，６２４プレイリストＤＢ
５０１音声入力部
５０３信号処理部
５０４，５１０，６０１，８０１通信部
５０６データ入力部
５０７曲音声データ登録部
６０２音声認識部
５２１（曲ＩＤ及び曲音声データの）データベース
８２１（曲音声データの）データベース
１００１ＣＰＵ
１００２主記憶装置
１００３補助記憶装置
１００４入力装置
１００５表示装置
１００６インタフェース装置
１００７記憶媒体駆動装置
１００８通信装置 Reference Signs List 1 playlist generation apparatus 4 playlist reproduction system 5 music reproduction apparatus 6 server 7 communication network 8, 8A, 8B portable electronic device 10 computer 11 microphone 12 speaker 101 request playlist input section 102, 603 keyword / attribute extraction section 103, 604 music narrowing down unit 104, 605 recommended playlist candidate generation unit 105, 606 priority calculation unit 105a recognition shift amount calculation unit 105b request shift amount calculation unit 105c playlist search unit 105d shift amount integration unit 106, 607 playlist determination unit 107, 608 Narration creator 107a, 608a Narration text creator 107b, 508 Narration audio creator 108, 509, 609 Playlist playback unit 109, 502 Audio output unit 115 Skip operation input unit 116 Adaptation processing unit 1 6a playlist range modification unit 116b playlist DB re-learning section 116c (of correction data) (data for the playlist learning) database 116d database 121,621 keyword DB
122,622 Refinement attribute DB
123,623 Song DB
124,624 Playlist DB
501 Voice input unit 503 Signal processing unit 504, 510, 601, 801 Communication unit 506 Data input unit 507 Music voice data registration unit 602 Voice recognition unit 521 Database (of music ID and music voice data) 821 Database (of music voice data) 1001 CPU
1002 main storage device 1003 auxiliary storage device 1004 input device 1005 display device 1006 interface device 1007 storage medium drive device 1008 communication device

Claims

Computer
The music is narrowed down based on the attribute information indicating the attribute of the music to be registered in the playlist included in the playlist generation request from the user,
A feature amount representing an image of the narrowed down song and a range in which the feature amount of the song to be registered in the playlist, which is a range associated with each playlist on the database, are defined. Based on the range of the playlist that is, a playlist in which the feature amount representing the image is registered as a song included in the range is generated as a recommended playlist candidate,
The distribution of the feature amount of the song registered in the generated recommended playlist candidate is biased with respect to the range assumed for a requested playlist that is a playlist requested to be generated by the generation request. The distribution of the songs included in the recommended playlist candidate, the range assumed for the recommended playlist candidate, and the range assumed for the requested playlist. Determining a recommended playlist that is a playlist to be presented to the user from the recommended playlist candidates;
A playlist generation method, characterized by performing a process.

In the process of determining the recommended playlist, the computer,
A difference between the distribution of the music included in the recommended playlist candidate and the distribution assumed for the recommended playlist candidate is calculated as a recognition error amount, and the difference is assumed for the recommended playlist candidate. the difference between the distributions assumed for the request playlist and the distribution is calculated as the required shift amount,
A weighted sum of the recognition shift amount and the request shift amount for each of the recommended playlist candidates is calculated as a recommendation priority for each of the recommended playlist candidates,
The recommended playlist candidate having the highest recommendation priority is determined as the recommended playlist.
The playlist generation method according to claim 1, wherein:

The computer calculates the recognition shift amount and the required shift amount with the distribution of the feature amount of the song as a normal distribution,
3. The playlist generation method according to claim 2, wherein:

The computer calculates the recognition shift amount and the required shift amount as a uniform distribution of the feature amount of the music,
3. The playlist generation method according to claim 2, wherein:

Said computer,
Play songs included in the recommended playlist,
When operation information for skipping playback is input during the playback of the song, the feature amount of the skipped song is outside the range of the feature amount of the song to be registered in the recommended playlist. Modifying the range assumed for the recommended playlist, further executing a process,
The playlist generation method according to claim 1, wherein:

In the process of modifying the range of the recommended playlist, the computer reduces the range of the recommended playlist,
The playlist generation method according to claim 5, wherein:

In the process of correcting the range for the recommended playlist, the computer may include:
After reducing the range for the recommended playlist,
Expanding the range of the recommended playlist in a manner that does not include the feature amount of the skipped song;
The playlist generation method according to claim 5, wherein:

Said computer,
Generating audio data including the name of the recommended playlist, reproducing the audio data and presenting the name of the recommended playlist to the user, further executing a process;
The playlist generation method according to claim 1, wherein:

A song narrowing unit that narrows down songs based on attributes of songs to be registered in the playlist included in the playlist generation request from the user;
A feature amount representing an image of the narrowed down song and a range in which the feature amount of the song to be registered in the playlist, which is a range associated with each playlist on the database, are defined. A recommended playlist candidate generating unit that generates a playlist in which the feature amount representing the image is registered as a recommended playlist candidate based on the range of the playlist that is included,
The distribution of the feature amount of the song registered in the generated recommended playlist candidate is biased with respect to the range assumed for a requested playlist that is a playlist requested to be generated by the generation request. The distribution of the songs included in the recommended playlist candidate, the range assumed for the recommended playlist candidate, and the range assumed for the requested playlist. A recommended playlist determining unit that determines a recommended playlist that is a playlist to be presented to the user from the recommended playlist candidates.
A playlist generation device, characterized in that:

The recommended playlist determination unit,
A difference between the distribution of the song included in the recommended playlist candidate and the distribution assumed for the recommended playlist candidate, a recognition shift amount calculation unit that calculates the recognition shift amount,
A difference between the distribution assumed for the recommended playlist candidate and the distribution assumed for the requested playlist, a request shift amount calculation unit that calculates the request shift amount,
A shift amount integration unit that calculates a weighted sum of the recognition shift amount and the request shift amount for each of the recommended playlist candidates, as a recommendation priority for each of the recommended playlist candidates,
The recommended playlist candidate having the highest recommendation priority is determined as the recommended playlist.
The playlist generation device according to claim 9, wherein

The music is narrowed down based on the attribute information indicating the attribute of the music to be registered in the playlist included in the playlist generation request from the user,
A feature amount representing an image of the narrowed down song and a range in which the feature amount of the song to be registered in the playlist, which is a range associated with each playlist on the database, are defined. Based on the range of the playlist that is, a playlist in which the feature amount representing the image is registered as a song included in the range is generated as a recommended playlist candidate,
The distribution of the feature amount of the song registered in the generated recommended playlist candidate is biased with respect to the range assumed for a requested playlist that is a playlist requested to be generated by the generation request. The distribution of the songs included in the recommended playlist candidate, the range assumed for the recommended playlist candidate, and the range assumed for the requested playlist. Determining a recommended playlist that is a playlist to be presented to the user from the recommended playlist candidates;
A program that causes a computer to execute processing.

Computer
The data is narrowed down based on the attribute information indicating the attribute of the data to be registered in the list included in the list generation request from the user,
It defines a feature amount representing an image of the narrowed-down data and a range that is associated with each list on the database and in which the feature amount of data to be registered in the list is distributed. Based on the range of the list, a list in which the feature amount representing the image is registered with data included in the range is generated as a recommendation list candidate,
When the distribution of the feature amount of the data registered in the generated recommendation list candidate is biased with respect to the range assumed for a request list that is a list requested to be generated by the generation request. On the basis of the distribution of the data included in the recommendation list candidate, the range assumed for the recommendation list candidate, and the range assumed for the request list, Determining a recommendation list that is a list to be presented to the user;
A list generation method, characterized by performing a process.