JP7061679B2

JP7061679B2 - A method and system for predicting the playing length of a song based on the composition of the playlist

Info

Publication number: JP7061679B2
Application number: JP2020544826A
Authority: JP
Inventors: チェー，ジンウォン; ジョン，ジェホン; チョウ，ミンギョン; チョウ，ソンヨン; リュ，デウォン
Original assignee: Line Plus Corp
Current assignee: Line Plus Corp
Priority date: 2018-02-26
Filing date: 2018-02-26
Publication date: 2022-04-28
Anticipated expiration: 2038-02-26
Also published as: WO2019164041A1; KR20200121787A; JP2021519944A; KR102496542B1

Description

以下の説明は、ストリーミング（ｓｔｒｅａｍｉｎｇ）基盤の音楽サービスを提供する技術に関する。 The following description relates to a technique for providing a streaming-based music service.

デジタル技術の発達により、音源をデジタル形態で生成して保存および管理することができるようになった。これにより、音源の作成、配布、管理が容易となり、ＰＣ、ノート型ＰＣ、タブレット、スマートフォンのような多様な端末機を利用して容易に鑑賞することができるようになった。 With the development of digital technology, it has become possible to generate, store and manage sound sources in digital form. This makes it easier to create, distribute, and manage sound sources, and it has become possible to easily view them using various terminals such as PCs, notebook PCs, tablets, and smartphones.

音楽を鑑賞するための方法が多様化している中、例えば、通信網を介して音楽をダウンロードして利用したり、インターネットによるストリーミングサービスでリアルタイム受信したりすることで、音楽を鑑賞することができるようになった。 While the methods for listening to music are diversifying, for example, music can be listened to by downloading and using it via a communication network or receiving it in real time by a streaming service via the Internet. It became so.

一例として、特許文献１（公開日２０１４年２月１４日）には、ユーザのプレイリストに基づいてユーザカスタムされた音楽放送サービスを提供する、プレイリスト基盤の音楽放送サービス方法およびこの装置が開示されている。 As an example, Patent Document 1 (publication date: February 14, 2014) discloses a playlist-based music broadcasting service method and the apparatus for providing a user-customized music broadcasting service based on a user's playlist. Has been done.

韓国公開特許第１０－２０１４－００１９１２３号公報Korean Publication No. 10-2014-0019123 Gazette

プレイリスト（ｐｌａｙｌｉｓｔ）の曲の構成に基づき、曲ごとに該当の曲が最後まで再生される確率である再生長さを予測することを目的とする。 The purpose is to predict the playback length, which is the probability that the corresponding song will be played to the end for each song, based on the composition of the songs in the playlist (playlist).

プレイリストに含まれる各曲の再生長さに基づいて曲ごとにプリフェッチ（ｐｒｅ－ｆｅｔｃｈｉｎｇ）を効果的に実行することで、滑らかな（ｓｅａｍｌｅｓｓ）再生環境を提供することを他の目的とする。 Another object is to provide a smooth playback environment by effectively performing prefetching for each song based on the playback length of each song included in the playlist.

プレイリストの曲ごとの再生長さに基づくことで不必要なデータフェッチを減らすことができ、サービス全般にわたって消耗されるリソースを効率的に利用することをさらに他の目的とする。 It is possible to reduce unnecessary data fetches based on the playback length of each song in the playlist, and the other purpose is to efficiently use the resources consumed throughout the service.

コンピュータシステムが実行する方法であって、前記コンピュータシステムは、少なくとも１つのプロセッサを含み、当該方法は、前記少なくとも１つのプロセッサが、各プレイリストに対して曲構成情報と再生履歴情報を管理する段階と、前記少なくとも１つのプロセッサが、電子機器から特定のプレイリストに対する再生要求を受信すると、前記曲構成情報と前記再生履歴情報を利用して前記特定のプレイリストの曲ごとに該当の曲が再生される確率である再生長さを予測する段階とを含む方法を提供する。 A method performed by a computer system, wherein the computer system includes at least one processor, wherein the at least one processor manages song composition information and playback history information for each playlist. When the at least one processor receives a playback request for a specific playlist from an electronic device, the corresponding song is played for each song in the specific playlist using the song composition information and the playback history information. Provided is a method including a step of predicting a reproduction length which is a probability of being performed.

一側面によると、前記管理する段階は、前記プレイリストそれぞれに対し、曲の再生順序と曲ごとの全体の再生長さを含む曲構成情報、および曲ごとの前記全体の再生長さに対してユーザそれぞれが実際に再生した割合を示す実際の再生長さを含む再生履歴情報を管理する段階を含んでよい。 According to one aspect, the management step is for each of the playlists with respect to the song composition information including the playback order of the songs and the total playback length of each song, and the total playback length of each song. It may include a step of managing the playback history information including the actual playback length indicating the ratio of the actual playback by each user.

他の側面によると、前記予測する段階は、前記曲構成情報を利用して前記特定のプレイリストの曲構成と対応するプレイリストを取得する段階と、前記取得したプレイリストの再生履歴情報を利用して前記特定のプレイリストの曲ごとに予想再生長さを算出する段階とを含んでよい。 According to another aspect, the predictive stage uses the song composition information to acquire a playlist corresponding to the song composition of the specific playlist, and uses the playback history information of the acquired playlist. The step of calculating the expected playback length for each song in the specific playlist may be included.

また他の側面によると、前記取得する段階は、曲どうしの類似度に基づき、前記特定のプレイリストに含まれる曲と同一あるいは類似の曲で構成されたプレイリストを取得してよい。 Further, according to another aspect, the acquisition step may acquire a playlist composed of songs that are the same as or similar to the songs included in the specific playlist, based on the similarity between the songs.

また他の側面によると、前記取得する段階は、各曲に対し、該当の曲の音源データをディープラーニング学習モデルによって学習して固有の音源特徴を生成する段階、前記音源特徴を利用して曲どうしの類似度を算出する段階、および前記曲どうしの類似度に基づき、前記特定のプレイリストに含まれる曲と同一あるいは類似の曲で構成されたプレイリストを取得する段階を含んでよい。 According to another aspect, the acquisition stage is a stage in which the sound source data of the corresponding song is learned by the deep learning learning model to generate a unique sound source feature for each song, and the song using the sound source feature. It may include a step of calculating the similarity between the songs and a step of acquiring a playlist composed of songs that are the same as or similar to the songs included in the specific playlist based on the similarity between the songs.

また他の側面によると、前記予測する段階は、前記曲構成情報を利用して前記特定のプレイリストの曲の構成と対応するプレイリストを取得する段階と、前記取得したプレイリストで前記特定のプレイリストの各曲と対応する曲に対する全体ユーザの実際の再生長さを平均する方式により、前記特定のプレイリストの曲ごとに予想再生長さを算出する段階とを含んでよい。 According to another aspect, the prediction stage includes a stage of acquiring a playlist corresponding to the composition of a song in the specific playlist by using the song composition information, and the specific stage of the acquired playlist. It may include a step of calculating an expected playback length for each song in the specific playlist by a method of averaging the actual playback lengths of all users for each song in the playlist and the corresponding song.

また他の側面によると、前記予測する段階は、前記曲構成情報を利用して前記特定のプレイリストの曲構成と対応するプレイリストを取得する段階と、前記取得したプレイリストで前記特定のプレイリストの各曲と対応する曲に対する前記電子機器のユーザの実際の再生長さを平均する方式により、前記特定のプレイリストの曲ごとに予想再生長さを算出する段階とを含んでよい。 According to another aspect, the prediction stage includes a stage of acquiring a playlist corresponding to the song composition of the specific playlist by using the song composition information, and the specific play of the acquired playlist. A step of calculating an expected playback length for each song in the specific playlist by a method of averaging the actual playback lengths of the users of the electronic device for each song in the list and the corresponding song may be included.

また他の側面によると、前記管理する段階は、前記プレイリストの各曲に対し、該当の曲が再生された時点の時間、天気、位置のうちの少なくとも１つを含む追加情報を前記再生履歴情報とともに管理する段階をさらに含み、前記予測する段階は、前記曲構成情報を利用して前記特定のプレイリストの曲構成と対応するプレイリストを取得する段階と、前記取得したプレイリストのうちで前記特定のプレイリストを要求した時点の追加情報とマッチングするプレイリストの再生履歴情報を利用して、前記特定のプレイリストの曲ごとに予想再生長さを算出する段階とを含んでよい。 According to another aspect, the management stage adds additional information to each song in the playlist, including at least one of the time, weather, and position at the time the song was played. The step of further including the step of managing together with the information, the step of predicting is the step of acquiring the playlist corresponding to the song composition of the specific playlist by using the song composition information, and the stage of acquiring the playlist. It may include a step of calculating an expected playback length for each song of the specific playlist by using the playlist playback history information that matches the additional information at the time when the specific playlist is requested.

また他の側面によると、当該方法は、前記少なくとも１つのプロセッサが、前記特定のプレイリストの曲ごとの再生長さに従って曲再生に必要なリソースを分配し、前記特定のプレイリストの再生のための音源データを前記電子機器に提供する段階をさらに含んでよい。 According to another aspect, in the method, the at least one processor distributes the resources required for song playback according to the playback length of each song of the specific playlist, and the method is for playing the specific playlist. It may further include a step of providing the sound source data of the above to the electronic device.

さらに他の側面によると、前記提供する段階は、前記特定のプレイリストの曲ごとに、該当の曲の予測された再生長さに対応する単位でプリフェッチ（ｐｒｅ－ｆｅｔｃｈｉｎｇ）を実行してよい。 According to yet another aspect, the provided step may perform prefetching for each song in the particular playlist in units corresponding to the predicted playback length of the song.

前記方法をコンピュータに実行させるためのプログラムが記録されていることを特徴とする、非一時的なコンピュータ読み取り可能な記録媒体を提供する。 Provided is a non-temporary computer-readable recording medium, characterized in that a program for causing a computer to execute the method is recorded.

コンピュータシステムであって、メモリ、および前記メモリに接続され、前記メモリに含まれるコンピュータ読み取り可能な命令を実行するように構成された少なくとも１つのプロセッサを含み、前記少なくとも１つのプロセッサは、各プレイリストに対して曲構成情報と再生履歴情報を管理する情報管理部と、電子機器から特定のプレイリストに対する再生要求が受信されれば、前記曲構成情報と前記再生履歴情報を利用して前記特定のプレイリストの曲ごとに該当の曲が再生される確率である再生長さを予測する再生長さ予測部とを含む、コンピュータシステムを提供する。 A computer system comprising a memory and at least one processor connected to the memory and configured to execute computer-readable instructions contained in the memory, wherein the at least one processor is each playlist. When a playback request for a specific playlist is received from the information management unit that manages the song composition information and the playback history information, the specific playlist is used by using the song composition information and the playback history information. Provided is a computer system including a playback length predicting unit that predicts a playback length, which is the probability that the corresponding song is played for each song in the playlist.

本発明の一実施形態における、ネットワーク環境の例を示した図である。It is a figure which showed the example of the network environment in one Embodiment of this invention. 本発明の一実施形態における、電子機器およびサーバの内部構成を説明するためのブロック図である。It is a block diagram for demonstrating the internal structure of the electronic device and the server in one Embodiment of this invention. 本発明の一実施形態における、サーバのプロセッサが含むことのできる構成要素の例を示したブロック図である。It is a block diagram which showed the example of the component which the processor of a server can include in one Embodiment of this invention. 本発明の一実施形態における、サーバが実行することのできる方法の例を示したフローチャートである。It is a flowchart which showed the example of the method which a server can execute in one Embodiment of this invention. 本発明の一実施形態における、プレイリストの曲構成情報を含む管理テーブルの例を示した図である。It is a figure which showed the example of the management table which contains the music composition information of a playlist in one Embodiment of this invention. 本発明の一実施形態における、プレイリストの再生履歴情報を含む管理テーブルの例を示した図である。It is a figure which showed the example of the management table which contains the play history information of a playlist in one Embodiment of this invention. 本発明の一実施形態における、曲どうしの類似度に基づいてプレイリストの曲ごとに予想再生長さを算出する過程の一例を示したフローチャートである。It is a flowchart which showed an example of the process of calculating the expected play length for each song of a playlist based on the degree of similarity between songs in one Embodiment of this invention. 本発明の一実施形態における、特定のプレイリストの曲構成と対応するプレイリストを取得する過程を説明するための例示図である。It is an exemplary diagram for demonstrating the process of acquiring the playlist corresponding to the song composition of a specific playlist in one embodiment of the present invention. 本発明の一実施形態における、特定のプレイリストの曲ごとに予想再生長さを算出する過程を説明するための例示図である。It is an exemplary diagram for demonstrating the process of calculating the expected reproduction length for each music of a specific playlist in one Embodiment of this invention. 本発明の一実施形態における、特定のプレイリストの曲ごとの予想再生長さによるプリフェッチ過程を説明するための例示図である。It is an exemplary figure for demonstrating the prefetch process by the expected play length for each song of a specific playlist in one Embodiment of this invention.

以下、本発明の実施形態について、添付の図面を参照しながら詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

本発明の実施形態は、ストリーミング基盤の音楽サービスを提供する技術に関する。 An embodiment of the present invention relates to a technique for providing a streaming-based music service.

本明細書において具体的に開示される事項を含む実施形態は、プレイリスト構成に基づいて曲の再生長さを予測することができ、これによって効率性、リソース節減、費用節減、サービス品質、利便性などの側面において相当な長所を達成することができる。 Embodiments, including the matters specifically disclosed herein, can predict the playing length of a song based on a playlist structure, thereby efficiency, resource savings, cost savings, quality of service, convenience. It is possible to achieve considerable advantages in terms of sex and the like.

図１は、本発明の一実施形態における、ネットワーク環境の例を示した図である。図１のネットワーク環境は、複数の電子機器１１０、１２０、１３０、１４０、複数のサーバ１５０、１６０、およびネットワーク１７０を含む例を示している。このような図１は、発明の説明のための一例に過ぎず、電子機器の数やサーバの数が図１のように限定されることはない。 FIG. 1 is a diagram showing an example of a network environment according to an embodiment of the present invention. The network environment of FIG. 1 shows an example including a plurality of electronic devices 110, 120, 130, 140, a plurality of servers 150, 160, and a network 170. Such FIG. 1 is merely an example for explaining the invention, and the number of electronic devices and the number of servers are not limited as in FIG.

複数の電子機器１１０、１２０、１３０、１４０は、コンピュータシステムによって実現される固定端末や移動端末であってよい。複数の電子機器１１０、１２０、１３０、１４０の例としては、スマートフォン、携帯電話、ナビゲーション、ＰＣ（ｐｅｒｓｏｎａｌｃｏｍｐｕｔｅｒ）、ノート型ＰＣ、デジタル放送用端末、ＰＤＡ（ＰｅｒｓｏｎａｌＤｉｇｉｔａｌＡｓｓｉｓｔａｎｔ）、ＰＭＰ（ＰｏｒｔａｂｌｅＭｕｌｔｉｍｅｄｉａＰｌａｙｅｒ）、タブレット、ゲームコンソール、ウェアラブルデバイス、ＩｏＴ（ｉｎｔｅｒｎｅｔｏｆｔｈｉｎｇｓ）デバイス、ＶＲ（ｖｉｒｔｕａｌｒｅａｌｉｔｙ）デバイス、ＡＲ（ａｕｇｍｅｎｔｅｄｒｅａｌｉｔｙ）デバイスなどがある。一例として、図１では、電子機器１１０の例としてスマートフォンを示しているが、本発明の実施形態において、電子機器１１０は、実質的に無線または有線通信方式を利用し、ネットワーク１７０を介して他の電子機器１２０、１３０、１４０および／またはサーバ１５０、１６０と通信することのできる多様な物理的なコンピュータシステムのうちの１つを意味してよい。 The plurality of electronic devices 110, 120, 130, 140 may be fixed terminals or mobile terminals realized by a computer system. Examples of a plurality of electronic devices 110, 120, 130, 140 include smartphones, mobile phones, navigation systems, PCs (personal computers), notebook PCs, digital broadcasting terminals, PDAs (Personal Digital Assistants), and PMPs (Tablet Multimedia Players). ), Tablets, game consoles, wearable devices, IoT (internet of things) devices, VR (visual reality) devices, AR (agmented reality) devices, and the like. As an example, FIG. 1 shows a smartphone as an example of the electronic device 110, but in the embodiment of the present invention, the electronic device 110 substantially utilizes a wireless or wired communication method, and another via the network 170. It may mean one of a variety of physical computer systems capable of communicating with the electronic devices 120, 130, 140 and / or the servers 150, 160.

通信方式が限定されることはなく、ネットワーク１７０が含むことのできる通信網（一例として、移動通信網、有線インターネット、無線インターネット、放送網、衛星網など）を利用する通信方式だけではなく、機器間の近距離無線通信が含まれてよい。例えば、ネットワーク１７０は、ＰＡＮ（ｐｅｒｓｏｎａｌａｒｅａｎｅｔｗｏｒｋ）、ＬＡＮ（ｌｏｃａｌａｒｅａｎｅｔｗｏｒｋ）、ＣＡＮ（ｃａｍｐｕｓａｒｅａｎｅｔｗｏｒｋ）、ＭＡＮ（ｍｅｔｒｏｐｏｌｉｔａｎａｒｅａｎｅｔｗｏｒｋ）、ＷＡＮ（ｗｉｄｅａｒｅａｎｅｔｗｏｒｋ）、ＢＢＮ（ｂｒｏａｄｂａｎｄｎｅｔｗｏｒｋ）、インターネットなどのネットワークのうちの１つ以上の任意のネットワークを含んでよい。さらに、ネットワーク１７０は、バスネットワーク、スターネットワーク、リングネットワーク、メッシュネットワーク、スター－バスネットワーク、ツリーまたは階層的ネットワークなどを含むネットワークトポロジのうちの任意の１つ以上を含んでもよいが、これらに限定されることはない。 The communication method is not limited, and not only a communication method using a communication network that can be included in the network 170 (for example, a mobile communication network, a wired Internet, a wireless Internet, a broadcasting network, a satellite network, etc.), but also a device. Short-range wireless communication between may be included. For example, the network 170 includes a PAN (personal area network), a LAN (local area network), a CAN (campus area network), a MAN (metropolitan area network), a WAN (wise Internet) network, etc. It may include any one or more of the networks. Further, network 170 may include, but is limited to, any one or more of network topologies, including bus networks, star networks, ring networks, mesh networks, star-bus networks, tree or hierarchical networks, and the like. Will not be done.

サーバ１５０、１６０それぞれは、複数の電子機器１１０、１２０、１３０、１４０とネットワーク１７０を介して通信して命令、コード、ファイル、コンテンツ、サービスなどを提供する１つ以上のコンピュータ装置によって実現されてよい。例えば、サーバ１５０は、ネットワーク１７０を介して接続した複数の電子機器１１０、１２０、１３０、１４０に第１サービスを提供するシステムであってよく、サーバ１６０も、ネットワーク１７０を介して接続した複数の電子機器１１０、１２０、１３０、１４０に第２サービスを提供するシステムであってよい。より具体的な例として、サーバ１５０は、複数の電子機器１１０、１２０、１３０、１４０においてインストールされて実行されるコンピュータプログラムであるアプリケーションを通じ、該当のアプリケーションが目的とするサービス（一例として、音楽サービスなど）を第１サービスとして複数の電子機器１１０、１２０、１３０、１４０に提供してよい。他の例として、サーバ１６０は、上述したアプリケーションのインストールおよび実行のためのファイルを複数の電子機器１１０、１２０、１３０、１４０に配布するサービスを第２サービスとして提供してよい。 Each of the servers 150, 160 is realized by one or more computer devices that communicate with a plurality of electronic devices 110, 120, 130, 140 via a network 170 to provide instructions, codes, files, contents, services, and the like. good. For example, the server 150 may be a system that provides the first service to a plurality of electronic devices 110, 120, 130, 140 connected via the network 170, and the server 160 may also be a plurality of systems connected via the network 170. It may be a system that provides a second service to electronic devices 110, 120, 130, 140. As a more specific example, the server 150 is a service (as an example, a music service) aimed at by the application through an application which is a computer program installed and executed in a plurality of electronic devices 110, 120, 130, 140. Etc.) may be provided to a plurality of electronic devices 110, 120, 130, 140 as a first service. As another example, the server 160 may provide a service for distributing the above-mentioned application installation and execution files to a plurality of electronic devices 110, 120, 130, 140 as a second service.

図２は、本発明の一実施形態における、電子機器およびサーバの内部構成を説明するためのブロック図である。図２では、電子機器に対する例として電子機器１１０の内部構成およびサーバ１５０の内部構成について説明する。また、他の電子機器１２０、１３０、１４０やサーバ１６０も、上述した電子機器１１０またはサーバ１５０と同一または類似の内部構成を有してよい。 FIG. 2 is a block diagram for explaining an internal configuration of an electronic device and a server according to an embodiment of the present invention. FIG. 2 describes the internal configuration of the electronic device 110 and the internal configuration of the server 150 as examples for the electronic device. Further, the other electronic devices 120, 130, 140 and the server 160 may have the same or similar internal configuration as the above-mentioned electronic device 110 or the server 150.

電子機器１１０およびサーバ１５０は、メモリ２１１、２２１、プロセッサ２１２、２２２、通信モジュール２１３、２２３、および入力／出力インタフェース２１４、２２４を含んでよい。メモリ２１１、２２１は、非一時的なコンピュータ読み取り可能な記録媒体であって、ＲＡＭ（ｒａｎｄｏｍａｃｃｅｓｓｍｅｍｏｒｙ）、ＲＯＭ（ｒｅａｄｏｎｌｙｍｅｍｏｒｙ）、ディスクドライブ、ＳＳＤ（ｓｏｌｉｄｓｔａｔｅｄｒｉｖｅ）、フラッシュメモリ（ｆｌａｓｈｍｅｍｏｒｙ）などのような永続的大容量記録装置を含んでよい。ここで、ＲＯＭ、ＳＳＤ、フラッシュメモリ、ディスクドライブのような永続的大容量記録装置は、メモリ２１１、２２１とは区分される別の永続的記録装置として電子機器１１０やサーバ１５０に含まれてもよい。また、メモリ２１１、２２１には、オペレーティングシステムと、少なくとも１つのプログラムコード（一例として、電子機器１１０においてインストールされて実行されるブラウザや特定のサービスの提供のために電子機器１１０にインストールされるアプリケーションなどのためのコード）が記録されてよい。このようなソフトウェア構成要素は、メモリ２１１、２２１とは別のコンピュータ読み取り可能な記録媒体からロードされてよい。このような別のコンピュータ読み取り可能な記録媒体は、フロッピー（登録商標）ドライブ、ディスク、テープ、ＤＶＤ／ＣＤ－ＲＯＭドライブ、メモリカードなどのコンピュータ読み取り可能な記録媒体を含んでよい。他の実施形態において、ソフトウェア構成要素は、コンピュータ読み取り可能な記録媒体ではない通信モジュール２１３、２２３を通じてメモリ２１１、２２１にロードされてもよい。例えば、少なくとも１つのプログラムは、開発者またはアプリケーションのインストールファイルを配布するファイル配布システム（一例として、上述したサーバ１６０）がネットワーク１７０を介して提供するファイルによってインストールされるコンピュータプログラム（一例として、上述したアプリケーション）に基づいてメモリ２１１、２２１にロードされてよい。 The electronic device 110 and the server 150 may include memories 211, 221s, processors 212, 222, communication modules 213, 223, and input / output interfaces 214, 224. The memory 211 and 221 are non-temporary computer-readable recording media, and are a RAM (random access memory), a ROM (read only memory), a disk drive, an SSD (sold state drive), and a flash memory (flash memory). Permanent mass recording devices such as, etc. may be included. Here, even if a permanent large-capacity recording device such as a ROM, SSD, flash memory, or disk drive is included in the electronic device 110 or the server 150 as another permanent recording device that is separated from the memories 211 and 221. good. In addition, the memory 211 and 221 contain an operating system and at least one program code (for example, a browser installed and executed in the electronic device 110 or an application installed in the electronic device 110 to provide a specific service. Code for etc.) may be recorded. Such software components may be loaded from a computer-readable recording medium separate from the memories 211 and 221. Such other computer-readable recording media may include computer-readable recording media such as floppy (registered trademark) drives, discs, tapes, DVD / CD-ROM drives, and memory cards. In other embodiments, software components may be loaded into memory 211, 221 through communication modules 213, 223, which are not computer readable recording media. For example, at least one program is a computer program installed by a file provided by a file distribution system (eg, server 160 described above) that distributes a developer or application installation file via a network 170 (eg, described above). It may be loaded into the memory 211 or 221 based on the application.

プロセッサ２１２、２２２は、基本的な算術、ロジック、および入出力演算を実行することにより、コンピュータプログラムの命令を処理するように構成されてよい。命令は、メモリ２１１、２２１または通信モジュール２１３、２２３によって、プロセッサ２１２、２２２に提供されてよい。例えば、プロセッサ２１２、２２２は、メモリ２１１、２２１のような記録装置に記録されたプログラムコードに従って受信される命令を実行するように構成されてよい。 Processors 212 and 222 may be configured to process instructions in a computer program by performing basic arithmetic, logic, and input / output operations. Instructions may be provided to processor 212, 222 by memory 211, 221 or communication modules 213, 223. For example, the processors 212 and 222 may be configured to execute instructions received according to a program code recorded in a recording device such as memories 211 and 221.

通信モジュール２１３、２２３は、ネットワーク１７０を介して電子機器１１０とサーバ１５０とが互いに通信するための機能を提供してもよいし、電子機器１１０および／またはサーバ１５０が他の電子機器（一例として、電子機器１２０）または他のサーバ（一例として、サーバ１６０）と通信するための機能を提供してもよい。一例として、電子機器１１０のプロセッサ２１２がメモリ２１１のような記録装置に記録されたプログラムコードに従って生成した要求が、通信モジュール２１３の制御に従ってネットワーク１７０を介してサーバ１５０に伝達されてよい。これとは逆に、サーバ１５０のプロセッサ２２２の制御に従って提供される制御信号や命令、コンテンツ、ファイルなどが、通信モジュール２２３とネットワーク１７０を経て電子機器１１０の通信モジュール２１３を通じて電子機器１１０に受信されてよい。例えば、通信モジュール２１３を通じて受信されたサーバ１５０の制御信号や命令、コンテンツ、ファイルなどは、プロセッサ２１２やメモリ２１１に伝達されてよく、コンテンツやファイルなどは、電子機器１１０がさらに含むことのできる記録媒体（上述した永続的記録装置）に記録されてよい。 The communication modules 213 and 223 may provide a function for the electronic device 110 and the server 150 to communicate with each other via the network 170, and the electronic device 110 and / or the server 150 may provide another electronic device (as an example). , Electronic device 120) or another server (eg, server 160). As an example, a request generated by a processor 212 of an electronic device 110 according to a program code recorded in a recording device such as a memory 211 may be transmitted to a server 150 via a network 170 under the control of a communication module 213. On the contrary, control signals, instructions, contents, files and the like provided under the control of the processor 222 of the server 150 are received by the electronic device 110 through the communication module 213 of the electronic device 110 via the communication module 223 and the network 170. It's okay. For example, control signals, instructions, contents, files, etc. of the server 150 received through the communication module 213 may be transmitted to the processor 212 and the memory 211, and the contents, files, etc. may be further recorded by the electronic device 110. It may be recorded on a medium (permanent recording device described above).

入力／出力インタフェース２１４は、入力／出力装置２１５とのインタフェースのための手段であってよい。例えば、入力装置は、キーボード、マウス、マイクロフォン、カメラなどの装置を、出力装置は、ディスプレイ、スピーカ、触覚フィードバックデバイスなどのような装置を含んでよい。他の例として、入力／出力インタフェース２１４は、タッチスクリーンのように入力と出力のための機能が１つに統合された装置とのインタフェースのための手段であってもよい。入力／出力装置２１５は、電子機器１１０と１つの装置で構成されてもよい。また、サーバ１５０の入力／出力インタフェース２２４は、サーバ１５０に接続するかサーバ１５０が含むことのできる入力または出力のための装置（図示せず）とのインタフェースのための手段であってよい。より具体的な例として、電子機器１１０のプロセッサ２１２がメモリ２１１にロードされたコンピュータプログラムの命令を処理するにあたり、サーバ１５０や電子機器１２０が提供するデータを利用して構成されるサービス画面やコンテンツが、入力／出力インタフェース２１４を通じてディスプレイに表示されてよい。 The input / output interface 214 may be a means for an interface with the input / output device 215. For example, the input device may include a device such as a keyboard, mouse, microphone, camera, and the output device may include a device such as a display, speaker, haptic feedback device, and the like. As another example, the input / output interface 214 may be a means for an interface with a device that integrates functions for input and output, such as a touch screen. The input / output device 215 may be composed of an electronic device 110 and one device. Also, the input / output interface 224 of the server 150 may be a means for connecting to the server 150 or for interfacing with a device (not shown) for input or output that the server 150 can include. As a more specific example, a service screen or content configured by using data provided by the server 150 or the electronic device 120 when the processor 212 of the electronic device 110 processes an instruction of a computer program loaded in the memory 211. May be displayed on the display through the input / output interface 214.

また、他の実施形態において、電子機器１１０およびサーバ１５０は、図２の構成要素よりも多くの構成要素を含んでもよい。しかし、大部分の従来技術的構成要素を明確に図に示す必要はない。例えば、電子機器１１０は、上述した入力／出力装置２１５のうちの少なくとも一部を含むように実現されてもよいし、トランシーバ、ＧＰＳ（ＧｌｏｂａｌＰｏｓｉｔｉｏｎｉｎｇＳｙｓｔｅｍ）モジュール、カメラ、各種センサ、データベースなどのような他の構成要素をさらに含んでもよい。より具体的な例として、電子機器１１０がスマートフォンである場合、一般的にスマートフォンが含んでいる加速度センサやジャイロセンサ、カメラモジュール、物理的な各種ボタン、タッチパネルを利用したボタン、入力／出力ポート、振動のための振動器などのような多様な構成要素が、電子機器１１０にさらに含まれるように実現されてよい。 Also, in other embodiments, the electronic device 110 and the server 150 may include more components than the components of FIG. However, most prior art components need not be clearly shown in the figure. For example, the electronic device 110 may be realized to include at least a part of the above-mentioned input / output device 215, such as a transceiver, a GPS (Global Positioning System) module, a camera, various sensors, a database, and the like. Other components may be further included. As a more specific example, when the electronic device 110 is a smartphone, an acceleration sensor or gyro sensor, a camera module, various physical buttons, a button using a touch panel, an input / output port, which are generally included in the smartphone, are used. Various components, such as accelerometers for vibration, may be realized to be further included in the electronic device 110.

以下では、プレイリスト構成に基づいて曲の再生長さを予測する方法およびシステムの具体的な実施形態について説明する。 Hereinafter, a method of predicting the playback length of a song based on a playlist structure and a specific embodiment of the system will be described.

一般的なプレイリストは、単純な規則によって曲が順に再生される。ユーザが最後まで再生せずに途中でスキップ（ｓｋｉｐ）する確率が高い曲に対しても再生のためのリソースが消耗されるようになるため、このような不必要なリソースの消耗により、全体的なサービスが円滑になされないことがある。 In a typical playlist, songs are played in sequence according to simple rules. Since the resources for playback are consumed even for songs that are likely to be skipped in the middle without being played to the end by the user, such unnecessary resource consumption causes the overall consumption of resources. Service may not be smooth.

本発明は、プレイリストの再生の流れを確率的モデルによって予測し、これに合わせて曲の再生に必要なリソースを効果的に配分することを目的とする。 It is an object of the present invention to predict the flow of playlist reproduction by a probabilistic model and to effectively allocate the resources required for song reproduction according to the prediction.

図３は、本発明の一実施形態における、サーバのプロセッサが含むことのできる構成要素の例を示したブロック図であり、図４は、本発明の一実施形態における、サーバが実行することのできる方法の例を示したフローチャートである。 FIG. 3 is a block diagram showing an example of components that can be included in the processor of the server in one embodiment of the present invention, and FIG. 4 is a block diagram of what the server executes in one embodiment of the present invention. It is a flowchart which showed the example of the possible method.

本実施形態に係るサーバ１５０は、クライアント（ｃｌｉｅｎｔ）である複数の電子機器１１０、１２０、１３０、１４０を対象にストリーミング基盤の音楽サービスを提供するプラットフォームの役割をする。サーバ１５０は、電子機器１１０、１２０、１３０、１４０上にインストールされるアプリケーションと連動して音楽サービスを提供してよい。 The server 150 according to the present embodiment serves as a platform for providing a streaming-based music service to a plurality of electronic devices 110, 120, 130, 140 which are clients (clients). The server 150 may provide music services in conjunction with applications installed on the electronic devices 110, 120, 130, 140.

サーバ１５０のプロセッサ２２２は、図４に係る音楽提供方法を実行するための構成要素として、図３に示すように、情報管理部３１０、再生長さ予測部３２０、およびデータ提供部３３０を含んでよい。実施形態によって、プロセッサ２２２の構成要素は、選択的にプロセッサ２２２に含まれても除外されてもよい。また、実施形態によって、プロセッサ２２２の構成要素は、プロセッサ２２２の機能の表現のために分離されても併合されてもよい。 As shown in FIG. 3, the processor 222 of the server 150 includes an information management unit 310, a reproduction length prediction unit 320, and a data provision unit 330 as components for executing the music providing method according to FIG. good. Depending on the embodiment, the components of processor 222 may be selectively included or excluded from processor 222. Also, depending on the embodiment, the components of processor 222 may be separated or merged to represent the functionality of processor 222.

このようなプロセッサ２２２およびプロセッサ２２２の構成要素は、図４の音楽提供方法が含む段階４１０～段階４４０を実行するようにサーバ１５０を制御してよい。例えば、プロセッサ２２２およびプロセッサ２２２の構成要素は、メモリ２２１が含むオペレーティングシステムのコードと少なくとも１つのプログラムのコードとによる命令（ｉｎｓｔｒｕｃｔｉｏｎ）を実行するように実現されてよい。 Such a processor 222 and the components of the processor 222 may control the server 150 to perform steps 410 to 440 included in the music providing method of FIG. For example, the processor 222 and the components of the processor 222 may be implemented to execute an instruction by the code of the operating system included in the memory 221 and the code of at least one program.

ここで、プロセッサ２２２の構成要素は、サーバ１５０に記録されたプログラムコードが提供する命令に従ってプロセッサ２２２によって実行される、プロセッサ２２２の互いに異なる機能（ｄｉｆｆｅｒｅｎｔｆｕｎｃｔｉｏｎｓ）の表現であってよい。例えば、サーバ１５０がプレイリストと関連する情報を管理するように上述した命令に従ってサーバ１５０を制御するプロセッサ２２２の機能的表現として、情報管理部３１０が利用されてよい。 Here, the components of the processor 222 may be representations of different functions of the processor 222 that are executed by the processor 222 according to the instructions provided by the program code recorded in the server 150. For example, the information management unit 310 may be used as a functional representation of the processor 222 that controls the server 150 according to the instructions described above so that the server 150 manages the playlist and related information.

段階４１０において、プロセッサ２２２は、サーバ１５０の制御と関連する命令がロードされたメモリ２２１から必要な命令を読み取ってよい。この場合、読み取られた命令は、以下で説明する段階４２０～段階４４０をプロセッサ２２２が実行するように制御するための命令を含んでよい。 At step 410, processor 222 may read the required instructions from memory 221 loaded with instructions associated with control of server 150. In this case, the read instructions may include instructions for controlling the processor 222 to perform steps 420 to 440 described below.

段階４２０において、情報管理部３１０は、プレイリストそれぞれに対して曲構成情報と再生履歴情報を調査および集計して、記録および管理してよい。プレイリストとは、少なくとも１つの音楽コンテンツ（曲）で構成されたトラック集であって、ユーザが直接構成するか他人から共有された再生リスト、あるいはサービス上で自動推薦された再生リストなどを含んでよい。情報管理部３１０は、音楽サービスでユーザが消費した履歴のあるプレイリストを対象に、各プレイリストの曲構成に関する全般的な情報、またはユーザによる再生ログデータを記録および管理してよい。図５は、プレイリスト＿Ａの曲構成情報５００を含む管理テーブルの例を示した図である。図５に示すように、ｓｏｎｇ＿１－＞ｓｏｎｇ＿２－＞ｓｏｎｇ＿３－＞ｓｏｎｇ＿４の順に構成されたプレイリスト＿Ａに対し、曲構成情報５００は、曲再生順序５０１、曲構成のテーマやタイトルなどのような主題５０２、全体の再生長さ５０３、平均の再生長さ５０４、曲ごとの再生長さ５０５、ジャンル５０６、ＢＰＭ（ｂｅａｔｐｅｒｍｉｎｕｔｅ）５０７などを含んでよい。図６は、プレイリスト＿Ａの再生履歴情報６００を含む管理テーブルの例を示した図である。プレイリスト＿Ａに対してすべてのユーザが同じ順序で曲を再生すると仮定する。図６に示すように、ｓｏｎｇ＿１－＞ｓｏｎｇ＿２－＞ｓｏｎｇ＿３－＞ｓｏｎｇ＿４の順に構成されたプレイリスト＿Ａに対し、再生履歴情報６００は、曲再生順序６０１と、ユーザそれぞれが曲ごとの全体の再生長さに対して実際にどのくらい再生したかを示す再生の割合である実際の再生長さ６０２などを含んでよい。さらに、情報管理部３１０は、プレイリスト＿Ａの各曲に対し、該当の曲が再生された時点に関する追加情報として、曲再生時点の時間帯、天気、位置、社会的イシューなどを追加で収集して各ユーザの実際の再生長さ６０２とともに管理してよい。このような追加情報は、音楽サービスを利用するユーザの電子機器１１０、１２０、１３０、１４０から取得した情報、または電子機器１１０、１２０、１３０、１４０から取得した情報を利用してインターネット上で取得した情報を包括したものを意味してよい。 In the stage 420, the information management unit 310 may investigate and aggregate the song composition information and the reproduction history information for each playlist, and record and manage them. A playlist is a collection of tracks composed of at least one piece of music content (songs), and includes a playlist directly composed by the user or shared by others, or a playlist automatically recommended on the service. It's fine. The information management unit 310 may record and manage general information about the song composition of each playlist or playback log data by the user for playlists having a history of consumption by the user in the music service. FIG. 5 is a diagram showing an example of a management table including the song composition information 500 of the playlist_A. As shown in FIG. 5, with respect to the playlist_A configured in the order of song_1-> song_2-> song_3-> song_4, the song composition information 500 is a subject such as a song playback order 501, a theme or title of the song composition. 502, the total playback length 503, the average playback length 504, the playback length 505 for each song, the genre 506, the BPM (beat per minute) 507, and the like may be included. FIG. 6 is a diagram showing an example of a management table including the playback history information 600 of the playlist_A. Suppose all users play songs in the same order for playlist_A. As shown in FIG. 6, for playlist_A configured in the order of song_1-> song_2-> song_3-> song_4, the playback history information 600 includes the song playback order 601 and the overall playback length of each song by each user. It may include an actual reproduction length 602 or the like, which is a reproduction ratio indicating how much the reproduction is actually performed. Further, the information management unit 310 additionally collects the time zone, weather, position, social issue, etc. at the time of song playback as additional information regarding the time when the song is played for each song in playlist_A. It may be managed together with the actual playback length 602 of each user. Such additional information is acquired on the Internet by using the information acquired from the electronic devices 110, 120, 130, 140 of the user who uses the music service, or the information acquired from the electronic devices 110, 120, 130, 140. It may mean a comprehensive set of information.

再び図４を参照すると、段階４３０において、再生長さ予測部３２０は、電子機器１１０から特定のプレイリストに対する再生要求が受信されれば、段階４２０で管理されているプレイリストの曲構成情報と再生履歴情報を利用して、特定のプレイリストの曲構成による曲ごとの再生長さを予測してよい。再生長さ予測部３２０は、電子機器１１０のユーザが要求した特定のプレイリストの再生の流れを確率的モデルによって予測してよい。本発明では、各プレイリストの曲構成情報と再生履歴情報とを関連付けることのできるモデルを設計し、このようなモデルを利用して予め計算されたデータを確保してよい。例えば、Ｔｒａｃｋ１の後にＴｒａｃｋ２が配置される曲構成を含むプレイリストのうち、主題Ａのプレイリストでは平均的にＴｒａｃｋ２が全体再生時間１２０秒のうちの９０秒が再生され、主題Ｂのプレイリストでは平均的にＴｒａｃｋ２が全体再生時間１２０秒のうちの８０秒が再生されることがある。再生長さ予測部３２０は、ユーザが特定のプレイリストの再生を要求すれば、該当のプレイリストの曲構成に対応する既存のプレイリストのログデータを取得して電子機器１１０と共有してよい。一例として、再生長さ予測部３２０は、曲どうしの類似度に基づき、ユーザによる再生履歴があるプレイリストのうちから電子機器１１０が要求した特定のプレイリストの曲構成と対応する既存のプレイリストを探索し、該当のプレイリストの再生履歴情報を利用して特定のプレイリストの曲ごとに予想再生長さを算出してよい。 Referring to FIG. 4 again, in the stage 430, when the playback request for the specific playlist is received from the electronic device 110, the playback length prediction unit 320 together with the song composition information of the playlist managed in the stage 420. The playback history information may be used to predict the playback length of each song according to the song composition of a specific playlist. The reproduction length prediction unit 320 may predict the reproduction flow of a specific playlist requested by the user of the electronic device 110 by a probabilistic model. In the present invention, a model capable of associating the song composition information of each playlist with the reproduction history information may be designed, and pre-calculated data may be secured by using such a model. For example, among the playlists including the song composition in which Track2 is arranged after Track1, 90 seconds of the total playback time of 120 seconds is played by Track2 on average in the playlist of subject A, and 90 seconds in the playlist of subject B is played. On average, Track 2 may play 80 seconds out of a total playback time of 120 seconds. If the user requests the reproduction of a specific playlist, the reproduction length prediction unit 320 may acquire the log data of the existing playlist corresponding to the song composition of the corresponding playlist and share it with the electronic device 110. .. As an example, the playback length prediction unit 320 may use an existing playlist corresponding to the song composition of a specific playlist requested by the electronic device 110 from among playlists having a playback history by the user, based on the similarity between songs. May be searched and the expected playback length may be calculated for each song in a specific playlist by using the playback history information of the corresponding playlist.

図７は、本発明の一実施形態における、曲どうしの類似度に基づいてプレイリストの曲ごとに予想再生長さを算出する過程の一例を示したフローチャートである。 FIG. 7 is a flowchart showing an example of a process of calculating an expected reproduction length for each song in a playlist based on the similarity between songs in one embodiment of the present invention.

段階７０１において、再生長さ予測部３２０は、各曲に対して該当の曲の音源データが入力されると、入力された音源データを学習データ形態で処理してよい。このとき、音楽コンテンツである曲は、オーディオファイルフォーマットを有するすべてのデジタルデータを意味してよく、例えば、ＭＰ３（ＭＰＥＧＡｕｄｉｏＬａｙｅｒ－３）、ＷＡＶＥ（ＷａｖｅｆｏｒｍＡｕｄｉｏＦｏｒｍａｔ）、ＦＬＡＣ（ＦｒｅｅＬｏｓｓｌｅｓｓＡｕｄｉｏＣｏｄｅｃ）などを含んでよい。また、再生長さ予測部３２０には、曲に関するテキスト情報がともに入力されてよいが、このとき、テキスト情報は、歌詞を含むか、あるいは歌手、ジャンル、曲名、アルバム名などのようなメタ情報、さらに曲の分類や検索などと関連して入力されたハッシュタッグ（ｈａｓｈｔａｇ）、クエリ（ｑｕｅｒｙ）などの情報を含んでよい。続いて、再生長さ予測部３２０は、前処理によって音源データを時間－周波数で表現してよい。例えば、再生長さ予測部３２０は、音源データをメルスペクトログラム（Ｍｅｌ－ｓｐｅｃｔｒｏｇｒａｍ）やＭＦＣＣ（ＭｅｌＦｒｅｑｕｅｎｃｙＣｅｐｓｔｒａｌＣｏｅｆｆｉｃｉｅｎｔ）のような時間－周波数－大きさ形態のデータに変換してよい。再生長さ予測部３２０に各曲に対するテキスト情報がともに入力された場合には、テキスト情報を前処理してよい。一例として、再生長さ予測部３２０は、形態素分析器、索引語抽出器などのような言語前処理器により、入力されたテキスト情報から無意味なテキストをフィルタリングしてよい。言い換えれば、再生長さ予測部３２０は、テキスト情報に含まれる助詞や助動詞などのような品詞の単語、特殊記号（例えば、！、？、／など）などを除去し、体言や語根に該当する単語を抽出してよい。 In step 701, when the sound source data of the corresponding song is input to each song, the reproduction length prediction unit 320 may process the input sound source data in the form of learning data. At this time, the music content may mean all digital data having an audio file format, and includes, for example, MP3 (MPEG AudioLayer-3), WAVE (Waveform AudioFormat), FLAC (Free Lossless AudioCodec), and the like. It's fine. Further, text information about the song may be input to the playback length prediction unit 320 together, but at this time, the text information includes lyrics or meta information such as a singer, a genre, a song name, an album name, and the like. , Further, information such as a hashtag and a query input in connection with song classification and search may be included. Subsequently, the reproduction length prediction unit 320 may express the sound source data in time-frequency by preprocessing. For example, the reproduction length prediction unit 320 may convert the sound source data into time-frequency-magnitude form data such as a mel spectrogram or MFCC (Mel Frequency Cepstral Cofficient). When the text information for each song is input to the reproduction length prediction unit 320, the text information may be preprocessed. As an example, the reproduction length prediction unit 320 may filter meaningless text from the input text information by a language preprocessing device such as a morphological analyzer, an index term extractor, and the like. In other words, the reproduction length prediction unit 320 removes part-speech words such as particles and auxiliary verbs and special symbols (for example,!,?, / Etc.) contained in the text information, and corresponds to an uninflected word or a root. Words may be extracted.

段階７０２において、再生長さ予測部３２０は、音源データに対して前処理された学習データを学習モデルによって学習して固有の特徴を生成した後、生成された固有の特徴をデータベース（図示せず）に記録してよい。再生長さ予測部３２０は、ディープラーニングを利用して音源データ自体の固有の音響特徴（ａｃｏｕｓｔｉｃｆｅａｔｕｒｅｓ）を生成してよい。一例として、再生長さ予測部３２０は、畳み込みニューラルネットワーク（ＣＮＮ：ＣｏｎｖｏｌｕｔｉｏｎａｌＮｅｕｒａｌＮｅｔｗｏｒｋ）基盤の学習モデルを利用してよい。再生長さ予測部３２０は、ＣＮＮ学習モデルを利用して音源データを多次元実数ベクトルで表現してよい。ＣＮＮ学習モデルは音源データ学習層を含んでよく、以下の過程１～３は、音源データ学習層で音源データに対応する実数ベクタルを生成する過程の例であってよい。 In step 702, the reproduction length prediction unit 320 learns the training data preprocessed for the sound source data by the learning model to generate unique features, and then stores the generated unique features in a database (not shown). ) May be recorded. The reproduction length prediction unit 320 may generate the unique acoustic features of the sound source data itself by using deep learning. As an example, the reproduction length prediction unit 320 may utilize a learning model based on a convolutional neural network (CNN). The reproduction length prediction unit 320 may express the sound source data as a multidimensional real number vector by using the CNN learning model. The CNN learning model may include a sound source data learning layer, and the following steps 1 to 3 may be an example of a process of generating a real number vector corresponding to the sound source data in the sound source data learning layer.

過程１において、曲の音源データ（一例として、ｍｐ３ファイル）は、前処理によってメルスペクトログラムやＭＦＣＣのような時間－周波数－大きさ形態のデータに変換されてよい。 In process 1, the sound source data of the song (for example, an mp3 file) may be converted into time-frequency-magnitude form data such as mel spectrogram or MFCC by preprocessing.

過程２において、変換された音源データから１つ以上の短い時間区間（１～１０秒）の複数の周波数フレームがサンプリングされて音源データの学習モデルに対する入力データとして使用されてよい。例えば、再生長さ予測部３２０は、複数のフレームをサンプリングし、音源データ学習層で音楽モデルの例として提示されるＣＮＮモデルの入力として活用してよい。これにより、音源データの学習のためのＣＮＮモデルは、サンプリングされたフレームと同じ数のチャネルを有するモデルとなる。あるいは、生成されたフレームそれぞれを単一チャンネルとして使用し、各フレームに対して固有の畳み込み／プーリング過程を経た後、生成された各フレームの特徴ベクトルを接合することで全結合層の入力として使用可能となる。 In step 2, a plurality of frequency frames in one or more short time intervals (1 to 10 seconds) may be sampled from the converted sound source data and used as input data for the learning model of the sound source data. For example, the reproduction length prediction unit 320 may sample a plurality of frames and utilize it as an input of a CNN model presented as an example of a music model in the sound source data learning layer. As a result, the CNN model for learning the sound source data becomes a model having the same number of channels as the sampled frames. Alternatively, use each generated frame as a single channel, go through a unique convolution / pooling process for each frame, and then join the feature vectors of each generated frame to use as an input for the fully connected layer. It will be possible.

過程３において、音源データ学習層が含むことのできる複数の畳み込み（ｃｏｎｖｏｌｕｔｉｏｎ）およびプーリング（ｐｏｏｌｉｎｇ）層を繰り返し構成することにより、音楽フレームから抽象化された特徴（ｆｅａｔｕｒｅ）を生成してよい。畳み込みではパッチの大きさが多様に構成されてよく、プーリングも最大値（ｍａｘ）を利用したプーリング技法、平均値（ａｖｅｒａｇｅ）を利用したプーリング技法、および前記２つのプーリング技法を接合したハイブリッドプーリング技法などのような多様なプーリング技法のうちの少なくとも１つが使用されてよい。 In step 3, a plurality of convolution and pooling layers that can be included in the sound source data learning layer may be iteratively configured to generate an abstracted feature from the music frame. In convolution, the size of the patch may be variously configured, and the pooling technique is a pooling technique that uses the maximum value (max), a pooling technique that uses the average value (average), and a hybrid pooling technique that combines the two pooling techniques. At least one of a variety of pooling techniques, such as, may be used.

複数の畳み込みおよびプーリング層上には音響特徴を生成するための全結合層（ｆｕｌｌｙ－ｃｏｎｎｅｃｔｅｄｌａｙｅｒ）があり、各層別の関数は、シグモイド（ｓｉｇｍｏｉｄ）関数、双曲線（ＨｙｐｅｒｂｏｌｉｃＴａｎｇｅｎｔ：ｔａｎｈ）関数、ＲｅＬＵ（ＲｅｃｔｉｆｉｅｄＬｉｎｅａｒＵｎｉｔ）関数などのような多様な関数が使用されてよい。結局、音源データに対して１つの多次元実数ベクトルが生成されてよい。例えば、与えられた最初の音源データｍ０は、音楽学習モデルの出力層において、ｘ０＝｛０．２、－０．１、０．３、・・・}の形態のように１つの多次元実数ベクトルで表現されてよい。 On multiple convolution and pooling layers, there is a full-connected layer for generating acoustic features, and the functions for each layer are sigmoid function, hyperbolic tangent (tanh) function, and ReLU. Various functions such as the (Rectifier Unit) function may be used. After all, one multidimensional real number vector may be generated for the sound source data. For example, the given first sound source data m0 is one multidimensional real number in the output layer of the music learning model, such as the form of x0 = {0.2, -0.1, 0.3, ...}. It may be expressed as a vector.

また、再生長さ予測部３２０は、各曲に対してテキスト情報がともに入力された場合、テキスト情報も多次元実数ベクトルで表現してよい。一例として、再生長さ予測部３２０は、テキスト情報に対する前処理によってフィルタリングされたテキストを事前学習された学習モデルによってワードベクトル（ｗｏｒｄｖｅｃｔｏｒ）として生成してよい。例えば、ワードベクトルは、数値型（ｎｕｍｅｒｉｃａｌ）多次元ベクトル形態で表現されてよい。ワードベクトル生成のために、単語出現頻度ヒストグラム、ＴＦ（ｔｅｒｍｆｒｅｑｕｅｎｃｙ）／ＩＤＦ（ｉｎｖｅｒｓｅｄｏｃｕｍｅｎｔｆｒｅｑｕｅｎｃｙ）、言語学習モデル（例えば、ｗｏｒｄ２ｖｅｃ、ｐｈｒａｓｅ２ｖｅｃ、ｄｏｃｕｍｅｎｔ２ｖｅｃなど）などが使用されてよい。例えば、「イ・スンファン良い日バラード１９９２」などのような歌手／ジャンル／曲名／年度は、１つのｎ次元実数ベクトルｖ＝｛０．３、－１．２、１．２、・・・}のように表現可能である。このとき、言語学習モデルのためのテキスト情報フィールド（歌手、ジャンル、曲名、年度など）の順に固定する必要はなく、目的に合うように変更可能である。 Further, when the text information is input together for each song, the reproduction length prediction unit 320 may also express the text information as a multidimensional real number vector. As an example, the reproduction length prediction unit 320 may generate a text filtered by preprocessing for text information as a word vector (word vector) by a pre-learned learning model. For example, a word vector may be represented in a numerical multidimensional vector form. For word vector generation, a word appearance frequency histogram, TF (term frequency) / IDF (inverse document fraction), language learning model (for example, word2vec, phase2vec, document2vec, etc.) may be used. For example, for a singer / genre / song title / year such as "Lee Seung Hwan Good Day Ballad 1992", one n-dimensional real number vector v = {0.3, -1.2, 1.2, ...} It can be expressed as. At this time, it is not necessary to fix the text information fields (singer, genre, song title, year, etc.) for the language learning model in this order, and it can be changed to suit the purpose.

上述した再生長さ予測部３２０は、音源データの特徴ベクトルとテキスト情報のワードベクトルを曲ごとにデータベースに記録および維持してよい。音源データの特徴ベクトルとテキスト情報のワードベクトルはそれぞれ、個別のデータベースで構築されてもよいし、あるいは１つのデータベースで構築されてもよい。このようなデータベースは、サーバ１５０に含まれる構成要素によって実現されてもよいし、あるいはサーバ１５０と連動可能な個別のシステム上に構築された外部データベースとして存在してもよい。 The reproduction length prediction unit 320 described above may record and maintain a feature vector of sound source data and a word vector of text information in a database for each song. The feature vector of the sound source data and the word vector of the text information may be constructed in individual databases or may be constructed in one database, respectively. Such a database may be realized by the components included in the server 150, or may exist as an external database built on a separate system that can work with the server 150.

段階７０３において、再生長さ予測部３２０は、曲それぞれに対してデータベースに記録された音源データに対する固有の特徴を利用して曲どうしの類似度を算出してよく、曲どうしの類似度に基づき、ユーザによる再生履歴があるプレイリストのうちから電子機器１１０が要求した特定のプレイリストの曲構成と対応するプレイリストを取得してよい。他の例として、再生長さ予測部３２０は、音源データの特徴ベクトルとともに、テキスト情報のワードベクトルを複合的に利用して曲どうしの類似度を算出することも可能である。言い換えれば、再生長さ予測部３２０は、曲どうしの類似度に基づき、電子機器１１０が要求した特定のプレイリストの曲と同一あるいは類似の曲で構成されたプレイリストを取得してよい。 In step 703, the playlist length prediction unit 320 may calculate the similarity between songs by using the unique characteristics of the sound source data recorded in the database for each song, and based on the similarity between songs. , The playlist corresponding to the song composition of the specific playlist requested by the electronic device 110 may be acquired from the playlists having the playback history by the user. As another example, the reproduction length prediction unit 320 can calculate the similarity between songs by using the word vector of the text information in combination with the feature vector of the sound source data. In other words, the reproduction length prediction unit 320 may acquire a playlist composed of songs that are the same as or similar to the songs of the specific playlist requested by the electronic device 110, based on the similarity between the songs.

段階７０４において、再生長さ予測部３２０は、段階７０３で取得したプレイリストの再生履歴情報を利用して特定のプレイリストを構成している各曲と対応する曲の多数のユーザによる実際の再生長さを平均する方式により、特定のプレイリストの曲ごとの予想再生長さを算出してよい。 In the stage 704, the playback length prediction unit 320 uses the playback history information of the playlist acquired in the stage 703 to actually replay the songs constituting the specific playlist and the corresponding songs by a large number of users. The expected playback length for each song in a specific playlist may be calculated by a method of averaging the growth length.

図８に示すように、再生長さ予測部３２０は、電子機器１１０が要求した特定のプレイリストが、ｓｏｎｇ＿ｉ－＞ｓｏｎｇ＿ｉｉ－＞ｓｏｎｇ＿ｉｉｉ－＞ｓｏｎｇ＿ｉｖの順に構成されたＰｌａｙｌｉｓｔ＿I ８００であるとするとき、曲の構成がｓｏｎｇ＿ｉ、ｓｏｎｇ＿ｉｉ、ｓｏｎｇ＿ｉｉｉ、ｓｏｎｇ＿ｉｖと同一あるいは類似の曲（ｓｏｎｇ＿ｉ’、ｓｏｎｇ＿ｉ’’、・・・、ｓｏｎｇ＿ｉｉ’、ｓｏｎｇ＿ｉｉ’’、・・・、ｓｏｎｇ＿ｉｉｉ’、ｓｏｎｇ＿ｉｉｉ’’、・・ｓｏｎｇ＿ｉｖ’、ｓｏｎｇ＿ｉｖ’’、・・・）を含むプレイリスト（Ｐｌａｙｌｉｓｔ＿１～Ｎ）８１０を探索する。プレイリスト（Ｐｌａｙｌｉｓｔ＿１～Ｎ）８１０にはＰｌａｙｌｉｓｔ＿I８００の曲構成ｓｏｎｇ＿ｉ－＞ｓｏｎｇ＿ｉｉ－＞ｓｏｎｇ＿ｉｉｉ－＞ｓｏｎｇ＿ｉｖのうちの少なくとも２曲以上が連続する曲構成（例えば、ｓｏｎｇ＿ｉ－＞ｓｏｎｇ＿ｉｉ、ｓｏｎｇ＿ｉｉ－＞ｓｏｎｇ＿ｉｉｉ、ｓｏｎｇ＿ｉｉｉ－＞ｓｏｎｇ＿ｉｖ）と対応するプレイリストが含まれてよい。 As shown in FIG. 8, when the specific playlist requested by the electronic device 110 is the Playlist_I 800 configured in the order of song_i-> song_iii-> song_iii-> song_iv, the reproduction length predicting unit 320 determines. Songs whose composition is the same as or similar to song_i, song_ii, song_iii, song_iv (song_i', song_i'', ..., song_ii', song_ii'', ..., song_iii', song_iii'', ... Search for playlists (Playlist_1 to N) 810 including', song_iv'', ...). In the playlist (Playlist_1 to N) 810, at least two or more songs in the song structure song_i-> song_iii-> song_iv of Playlist_I800 are continuous (for example, song_i-> song_ii, song_ii-> sion. -> Song_iv) and the corresponding playlist may be included.

図９を参照すると、再生長さ予測部３２０は、電子機器１１０が要求した特定のプレイリスト８００に対応するプレイリスト８１０を探索し、該当のプレイリスト８１０の曲ごとに再生長さを平均してよい。例えば、特定のプレイリストの最初の曲であるｓｏｎｇ＿ｉと同一あるいは類似の曲（ｓｏｎｇ＿ｉ、ｓｏｎｇ＿ｉ’、ｓｏｎｇ＿ｉ’’）の実際の再生長さを平均し、特定のプレイリストの再生を要求したユーザによってｓｏｎｇ＿ｉがどのくらい再生されるかを予測してよい。特定のプレイリスト８００の曲構成と同一あるいは類似のプレイリスト８１０の再生履歴情報に基づくユーザによる再生長さが平均的に９０％、１０％、２０％、７５％である場合、ｓｏｎｇ＿ｉ－＞ｓｏｎｇ＿ｉｉ－＞ｓｏｎｇ＿ｉｉｉ－＞ｓｏｎｇ＿ｉｖの順に構成されたＰｌａｙｌｉｓｔ＿I８００に対する曲ごとの再生長さを９０％、１０％、２０％、７５％と予測してよい。 Referring to FIG. 9, the reproduction length prediction unit 320 searches for a playlist 810 corresponding to a specific playlist 800 requested by the electronic device 110, and averages the reproduction length for each song of the corresponding playlist 810. You can do it. For example, by a user who requests playback of a particular playlist by averaging the actual play lengths of songs that are the same as or similar to the first song in a particular playlist (song_i, song_i', song_i''). You may predict how much song_i will be played. If the user's playback length based on the playback history information of the playlist 810 that is the same as or similar to the song composition of the specific playlist 800 is 90%, 10%, 20%, or 75% on average, song_i-> song_ii The playback length of each song may be predicted to be 90%, 10%, 20%, and 75% for the Playlist_I800 configured in the order of-> song_iii-> song_iv.

したがって、再生長さ予測部３２０は、ユーザが要求した特定のプレイリストに対して曲構成が同一あるいは類似のプレイリストを消費した他のユーザの再生ログを平均することにより、特定のプレイリストの曲ごとに再生長さを予測することができる。このとき、再生長さ予測部３２０は、プレイリストに対するユーザの再生ログが十分に蓄積されている場合には、他のユーザの再生ログではなくユーザの再生ログだけを利用して特定のプレイリストの曲ごとの再生長さを個人化して予測することも可能である。また、再生長さ予測部３２０は、電子機器１１０が特定のプレイリストを要求した時点の時間帯、天気、位置、社会的イシューなどとマッチングする追加情報を有する再生ログを利用することで、ユーザ状況による特定のプレイリストの曲ごとに再生長さを予測することも可能である。 Therefore, the playback length prediction unit 320 averages the playback logs of other users who have consumed playlists having the same or similar song composition for the specific playlist requested by the user, so that the playback length predicting unit 320 can be used for the specific playlist. The playback length can be predicted for each song. At this time, when the user's playback log for the playlist is sufficiently accumulated, the playback length prediction unit 320 uses only the user's playback log instead of the other user's playback log to obtain a specific playlist. It is also possible to personalize and predict the playback length of each song. In addition, the playback length prediction unit 320 uses a playback log having additional information that matches the time zone, weather, position, social issue, etc. at the time when the electronic device 110 requests a specific playlist, so that the user can use the playback length prediction unit 320. It is also possible to predict the playback length for each song in a particular playlist depending on the situation.

再び図４を参照すると、段階４４０において、データ提供部３３０は、電子機器１１０が要求した特定のプレイリストに対して段階４３０で予測した曲ごとの再生長さに対応するリソースを分配し、特定のプレイリストの再生のための音源データを電子機器１１０に提供してよい。データ提供部３３０は、特定のプレイリストに対して予測された曲ごとの再生長さに合わせて曲の再生に必要なリソースを効果的に分配してプリフェッチを実施してよい。例えば、図１０に示すように、曲の構成が同一あるいは類似のプレイリストを消費した他のユーザの再生ログに基づき、ｓｏｎｇ＿ｉ－＞ｓｏｎｇ＿ｉｉ－＞ｓｏｎｇ＿ｉｉｉ－＞ｓｏｎｇ＿ｉｖの順に構成されたＰｌａｙｌｉｓｔ＿I８００の再生長さが曲ごとに９０％、１０％、２０％、７５％と予測された場合、電子機器１１０ではｓｏｎｇ＿ｉの音源データを９．０秒単位のチャンク（ｃｈｕｎｋ）で受信して再生すると同時に、ｓｏｎｇ＿ｉｉは１．０秒、ｓｏｎｇ＿ｉｉｉは２．０秒、ｓｏｎｇ＿ｉｖは７．５秒単位で該当の音源データを予めダウンロードして次の再生を準備するようにする。言い換えれば、データ提供部３３０は、電子機器１１０のユーザが要求した特定のプレイリスト８００に対し、最後まで再生される確率が高い曲の場合にはより多くのデータをより早くプリフェッチし、最後まで再生される確率が低い曲の場合にはより少ないデータをよりゆっくりプリフェッチする。 Referring to FIG. 4 again, in the stage 440, the data providing unit 330 distributes and specifies the resource corresponding to the playback length for each song predicted in the stage 430 to the specific playlist requested by the electronic device 110. The sound source data for reproducing the playlist may be provided to the electronic device 110. The data providing unit 330 may effectively distribute the resources required for playing a song according to the predicted playing length of each song for a specific playlist, and perform prefetching. For example, as shown in FIG. 10, the playback length of Playlist_I800 configured in the order of song_i-> song_ii-> second_iii-> song_iv based on the playback logs of other users who have consumed playlists having the same or similar composition of songs. When it is predicted that the value is 90%, 10%, 20%, and 75% for each song, the electronic device 110 receives and reproduces the sound source data of song_i in chunks of 9.0 seconds at the same time, and at the same time, song_ii. Is 1.0 second, song_iii is 2.0 seconds, and song_iv is 7.5 seconds. The corresponding sound source data is downloaded in advance to prepare for the next playback. In other words, the data providing unit 330 prefetches more data earlier for a specific playlist 800 requested by the user of the electronic device 110 in the case of a song having a high probability of being played to the end, and to the end. For songs with a low probability of being played, less data is prefetched more slowly.

このように、本発明の実施形態によると、プレイリストの曲構成に基づき、曲ごとに該当の曲が最後まで再生される確率である再生長さを予測することができ、これに基づいて曲ごとにプリフェッチを効率的に実行することで、滑らかな再生環境を提供することができる。さらに、本発明の実施形態によると、プレイリストに対し、曲ごとの再生長さに基づいて不必要なデータフェッチを減らすことができ、これによってサービス全般にわたって消耗されるリソースを効率的に利用することができる。 As described above, according to the embodiment of the present invention, it is possible to predict the playback length, which is the probability that the corresponding song is played to the end for each song, based on the song structure of the playlist, and the song is based on this. By efficiently executing prefetch for each, a smooth playback environment can be provided. Further, according to an embodiment of the present invention, it is possible to reduce unnecessary data fetches for playlists based on the playback length of each song, thereby efficiently utilizing the resources consumed throughout the service. be able to.

上述した装置は、ハードウェア構成要素、ソフトウェア構成要素、および／またはハードウェア構成要素とソフトウェア構成要素との組み合わせによって実現されてよい。例えば、実施形態で説明された装置および構成要素は、プロセッサ、コントローラ、ＡＬＵ（ａｒｉｔｈｍｅｔｉｃｌｏｇｉｃｕｎｉｔ）、デジタル信号プロセッサ、マイクロコンピュータ、ＦＰＧＡ（ｆｉｅｌｄｐｒｏｇｒａｍｍａｂｌｅｇａｔｅａｒｒａｙ）、ＰＬＵ（ｐｒｏｇｒａｍｍａｂｌｅｌｏｇｉｃｕｎｉｔ）、マイクロプロセッサ、または命令を実行して応答することができる様々な装置のように、１つ以上の汎用コンピュータまたは特殊目的コンピュータを利用して実現されてよい。処理装置は、オペレーティングシステム（ＯＳ）およびＯＳ上で実行される１つ以上のソフトウェアアプリケーションを実行してよい。また、処理装置は、ソフトウェアの実行に応答し、データにアクセスし、データを記録、操作、処理、および生成してもよい。理解の便宜のために、１つの処理装置が使用されるとして説明される場合もあるが、当業者は、処理装置が複数個の処理要素および／または複数種類の処理要素を含んでもよいことが理解できるであろう。例えば、処理装置は、複数個のプロセッサまたは１つのプロセッサおよび１つのコントローラを含んでよい。また、並列プロセッサのような、他の処理構成も可能である。 The devices described above may be implemented by hardware components, software components, and / or combinations of hardware components and software components. For example, the apparatus and components described in the embodiments include a processor, a controller, an ALU (arithmetic logic unit), a digital signal processor, a microcomputer, an FPGA (field programgable gate array), a PLU (programmable log unit), a microprocessor, and the like. Alternatively, it may be implemented using one or more general purpose computers or special purpose computers, such as various devices capable of executing and responding to instructions. The processing device may execute an operating system (OS) and one or more software applications running on the OS. The processing device may also respond to the execution of the software, access the data, and record, manipulate, process, and generate the data. For convenience of understanding, one processing device may be described as being used, but one of ordinary skill in the art may include a plurality of processing elements and / or a plurality of types of processing elements. You can understand. For example, the processing device may include multiple processors or one processor and one controller. Also, other processing configurations such as parallel processors are possible.

ソフトウェアは、コンピュータプログラム、コード、命令、またはこれらのうちの１つ以上の組み合わせを含んでもよく、思うままに動作するように処理装置を構成したり、独立的または集合的に処理装置に命令したりしてよい。ソフトウェアおよび／またはデータは、処理装置に基づいて解釈されたり、処理装置に命令またはデータを提供したりするために、いかなる種類の機械、コンポーネント、物理装置、コンピュータ記録媒体または装置に具現化されてよい。ソフトウェアは、ネットワークによって接続されたコンピュータシステム上に分散され、分散された状態で記録されても実行されてもよい。ソフトウェアおよびデータは、１つ以上のコンピュータ読み取り可能な記録媒体に記録されてよい。 The software may include computer programs, codes, instructions, or a combination of one or more of these, configuring the processing equipment to operate at will, or instructing the processing equipment independently or collectively. You may do it. The software and / or data is embodied in any type of machine, component, physical device, computer recording medium or device to be interpreted based on the processing device or to provide instructions or data to the processing device. good. The software is distributed on a computer system connected by a network and may be recorded or executed in a distributed state. The software and data may be recorded on one or more computer-readable recording media.

実施形態に係る方法は、多様なコンピュータ手段によって実行可能なプログラム命令の形態で実現されてコンピュータ読み取り可能な媒体に記録されてよい。ここで、媒体は、コンピュータ実行可能なプログラムを継続して記録するものであっても、実行またはダウンロードのために一時記録するものであってもよい。また、媒体は、単一または複数のハードウェアが結合した形態の多様な記録手段または格納手段であってよく、あるコンピュータシステムに直接接続する媒体に限定されることはなく、ネットワーク上に分散して存在するものであってもよい。媒体の例としては、ハードディスク、フロッピー（登録商標）ディスク、および磁気テープのような磁気媒体、ＣＤ－ＲＯＭおよびＤＶＤのような光媒体、フロプティカルディスク（ｆｌｏｐｔｉｃａｌｄｉｓｋ）のような光磁気媒体、およびＲＯＭ、ＲＡＭ、フラッシュメモリなどを含み、プログラム命令が記録されるように構成されたものであってよい。また、媒体の他の例として、アプリケーションを配布するアプリケーションストアやその他の多様なソフトウェアを供給または配布するサイト、サーバなどで管理する記録媒体または格納媒体が挙げられる。 The method according to the embodiment may be realized in the form of program instructions that can be executed by various computer means and recorded on a computer-readable medium. Here, the medium may be a continuous recording of a computer-executable program or a temporary recording for execution or download. Further, the medium may be various recording means or storage means in the form of a combination of a single piece of hardware or a plurality of pieces of hardware, and is not limited to a medium directly connected to a certain computer system, but is distributed over a network. It may exist. Examples of media include hard disks, floppy (registered trademark) disks, magnetic media such as magnetic tapes, optical media such as CD-ROMs and DVDs, optical magnetic media such as floptic discs, and the like. And may include ROM, RAM, flash memory, etc., and may be configured to record program instructions. Other examples of media include recording media or storage media managed by application stores that distribute applications, sites that supply or distribute various other software, servers, and the like.

以上のように、実施形態を、限定された実施形態および図面に基づいて説明したが、当業者であれば、上述した記載から多様な修正および変形が可能であろう。例えば、説明された技術が、説明された方法とは異なる順序で実行されたり、かつ／あるいは、説明されたシステム、構造、装置、回路などの構成要素が、説明された方法とは異なる形態で結合されたりまたは組み合わされたり、他の構成要素または均等物によって対置されたり置換されたとしても、適切な結果を達成することができる。 As described above, the embodiments have been described based on the limited embodiments and drawings, but those skilled in the art will be able to make various modifications and modifications from the above description. For example, the techniques described may be performed in a different order than the methods described, and / or components such as the systems, structures, devices, circuits described may be in a different form than the methods described. Appropriate results can be achieved even if they are combined or combined, and confronted or replaced by other components or equivalents.

したがって、異なる実施形態であっても、特許請求の範囲と均等なものであれば、添付される特許請求の範囲に属する。 Therefore, even if it is a different embodiment, if it is equivalent to the claims, it belongs to the attached claims.

Claims

The way a computer system does
The computer system includes at least one processor.
The method is
A stage in which the at least one processor manages song composition information and playback history information for each playlist.
When the at least one processor receives a playback request for a specific playlist from an electronic device, the corresponding song is played for each song in the specific playlist using the song composition information and the playback history information. A method that includes a step of predicting the play length, which is a probability.

The management stage is
For each of the playlists, the song composition information including the playback order of the songs and the total playback length of each song, and the actual ratio of each user to the total playback length of each song. The method according to claim 1, comprising a step of managing playlist information including the playback length of the song.

The prediction stage is
The stage of acquiring the playlist corresponding to the song composition of the specific playlist by using the song composition information, and
The method according to claim 1 or 2, comprising the step of calculating the expected playback length for each song of the specific playlist using the playback history information of the acquired playlist.

The acquisition stage is
The method according to claim 3, wherein a playlist composed of songs that are the same as or similar to the songs included in the specific playlist is acquired based on the degree of similarity between the songs.

The acquisition stage is
For each song, the stage of learning the sound source data of the corresponding song by the deep learning learning model to generate unique sound source features, and
At the stage of calculating the similarity between songs using the sound source characteristics,
The method according to claim 3, comprising the step of acquiring a playlist composed of songs that are the same as or similar to the songs included in the specific playlist based on the similarity between the songs.

The prediction stage is
The stage of acquiring the playlist corresponding to the song composition of the specific playlist by using the song composition information, and
A step of calculating an expected playback length for each song of the specific playlist by a method of averaging the actual playback lengths of all users for each song of the specific playlist in the acquired playlist. The method according to claim 2, wherein the method comprises.

The prediction stage is
The stage of acquiring the playlist corresponding to the song composition of the specific playlist by using the song composition information, and
The expected playback length of each song of the specific playlist is calculated by a method of averaging the actual playback lengths of the users of the electronic device for the songs corresponding to each song of the specific playlist in the acquired playlist. The method of claim 2, comprising the step of calculation.

The management stage is
Further including the step of managing additional information including at least one of the time, weather, and position at the time when the corresponding song is played for each song in the playlist together with the playback history information.
The prediction stage is
The stage of acquiring the playlist corresponding to the song composition of the specific playlist by using the song composition information, and
Among the acquired playlists, the expected playback length for each song of the specific playlist is calculated by using the playlist playback history information that matches the additional information at the time when the specific playlist is requested. The method of claim 2, comprising the steps.

The method is
The at least one processor distributes resources necessary for playing a song according to the playing length of each song in the specific playlist, and provides sound source data for playing the specific playlist to the electronic device. The method of any one of claims 1-8, further comprising a step.

The above-mentioned stage of provision is
The method of claim 9, wherein for each song in the particular playlist, prefetching is performed in units corresponding to the predicted playback length of the song.

A program for causing a computer to execute the method according to any one of claims 1 to 10.

It ’s a computer system,
Includes memory and at least one processor connected to said memory and configured to execute computer-readable instructions contained in said memory.
The at least one processor
An information management unit that manages song composition information and playback history information for each playlist,
When a playback request for a specific playlist is received from an electronic device, the playback length is the probability that the song is played for each song in the specific playlist using the song composition information and the playback history information. A computer system that includes a playlist length predictor that predicts the song.

The information management department
For each of the playlists, the song composition information including the song playback order and the total playback length of each song, and the actual ratio of each song actually played by each user to the total playback length of each song. The computer system according to claim 12, wherein the playback history information including the playback length is managed.

The reproduction length prediction unit is
Using the song composition information, the playlist corresponding to the song composition of the specific playlist is acquired, and the playlist is obtained.
The computer system according to claim 12 or 13, wherein the expected playback length is calculated for each song of the specific playlist by using the playback history information of the acquired playlist.

The reproduction length prediction unit is
The computer system according to claim 14, wherein a playlist composed of songs that are the same as or similar to the songs included in the specific playlist is acquired based on the similarity between the songs.

The reproduction length prediction unit is
For each song, the sound source data of the corresponding song is learned by a deep learning learning model to generate unique sound source features.
Using the sound source characteristics, the similarity between songs is calculated, and
The computer system according to claim 14, wherein a playlist composed of songs that are the same as or similar to the songs included in the specific playlist is acquired based on the similarity between the songs.

The reproduction length prediction unit is
Using the song composition information, the playlist corresponding to the song composition of the specific playlist is acquired, and the playlist is obtained.
To calculate the expected playback length for each song in the specific playlist by a method of averaging the actual playback lengths of all users for the songs corresponding to each song in the specific playlist in the acquired playlist. 13. The computer system according to claim 13.

The reproduction length prediction unit is
Using the song composition information, the playlist corresponding to the song composition of the specific playlist is acquired, and the playlist is obtained.
The expected playback length for each song in the specific playlist is calculated by a method of averaging the actual playback lengths of the users of the electronic device for the songs corresponding to each song in the specific playlist in the acquired playlist. 13. The computer system according to claim 13, wherein the calculation is performed.

The information management department
For each song in the playlist, additional information including at least one of the time, weather, and position at the time when the song is played is managed together with the playback history information.
The reproduction length prediction unit is
Using the song composition information, the playlist corresponding to the song composition of the specific playlist is acquired, and the playlist is obtained.
The expected playback length is calculated for each song of the specific playlist by using the playlist playback history information that matches the additional information at the time when the specific playlist is requested among the acquired playlists. 13. The computer system according to claim 13.

The at least one processor
Further includes a data providing unit that distributes resources necessary for playing a song according to the playing length of each song of the specific playlist and provides sound source data for playing the specific playlist to the electronic device. , The computer system according to any one of claims 12 to 19.