WO2014002064A1 - Système et procédé de navigation et de recommandation de médiathèque - Google Patents

Système et procédé de navigation et de recommandation de médiathèque Download PDF

Info

Publication number
WO2014002064A1
WO2014002064A1 PCT/IB2013/055315 IB2013055315W WO2014002064A1 WO 2014002064 A1 WO2014002064 A1 WO 2014002064A1 IB 2013055315 W IB2013055315 W IB 2013055315W WO 2014002064 A1 WO2014002064 A1 WO 2014002064A1
Authority
WO
WIPO (PCT)
Prior art keywords
media
media item
proximity
graph
items
Prior art date
Application number
PCT/IB2013/055315
Other languages
English (en)
Inventor
Alexandre Alahi
Pierre Vandergheynst
Kirell BENZI
Original Assignee
Ecole Polytechnique Federale De Lausanne (Epfl)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ecole Polytechnique Federale De Lausanne (Epfl) filed Critical Ecole Polytechnique Federale De Lausanne (Epfl)
Publication of WO2014002064A1 publication Critical patent/WO2014002064A1/fr

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/48Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/45Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/901Indexing; Data structures therefor; Storage structures
    • G06F16/9024Graphs; Linked lists

Definitions

  • the invention relates to a system and method to optimize navigation and recommendation in a distributed network of media libraries.
  • the conventional approach to ease navigation within a digital media content library consists in classifying the items according to their metadata description, such as genre, author, title, keywords. This is however very limited as the genres are typically defined in a very broad manner, such as rock, pop or classical in the case of a music library, so there are still hundreds of thousands of titles within a single category. Moreover, music genre classification usually applies to an artist or album globally rather than at the song level, so the famous slow song "Still loving you" from the hard-rock group Scorpions will end up misclassified into the "metal" category.
  • a popular approach popularized by Apple iTunes for instance, consists in enabling the end user to generate his/her own content classification, such as user playlists in the case of music or photo albums in the case of photos.
  • his/her own content classification such as user playlists in the case of music or photo albums in the case of photos.
  • this is a cumbersome process for the end user as it requires manual browsing of the whole library in search for relevant items, ideally in a way that optimizes a smooth transition from one item to another; furthermore, it needs to be manually repeated for each playlist or photo album, possibly several times for a given item, for instance in the case of music when the end user wants to include it into a first playlist to listen while professionally working but also into a second playlist dedicated to fitness practice.
  • This approach still suffers from major limitations; first, it tends to overweight top chart content that is widely purchased and reviewed, to the detriment of less popular artists; second, it assumes that an end user has a monolithic taste without capturing the various needs and contexts into which one looks for specific media content, such as moods or emotional contexts (energizing activities, romance time).
  • US patent application 2012/0023403 by Herberger and Tost proposes a dynamic playlist generation method where an audio signature is first generated for each audio item in the end user database to create a model of the audio items, describing their unique features and matching them in a conventional way; the user can then select a starting song for playlist generation, and modify the respective weights of the various song features (such as tempo, singing, male or female voice, instruments, melody%) according to his/her personal taste to adjust the matching algorithm computation and consequently re-generate a user-personalized playlist. For instance the end user may request that more songs with a similar tempo are included into the playlist by overweighting the tempo feature measurement.
  • the various song features such as tempo, singing, male or female voice, instruments, melody
  • the solution requires significant computation to match all media items in a roll up table through an exhaustive search space; even if a heuristic is proposed to prune the search around a reference song by limiting the search to its sub-genre, this approach ultimately depends on the sub- genre definitions by musical experts and does not benefit from extended crowd feedback analysis.
  • all the media library navigation and recommendation solutions identified so far directly manage libraries of digital media, either at the end user client side or at the media provider server side, or both. This binds the end user to a specific media provider (such as iTunes for instance) and requires the navigation and recommendation solution provider to deal with complex digital rights management issues.
  • the present invention proposes a method to assist the selection of media items comprising the steps of:
  • a distributed network of media libraries is represented by a graph of media proximity edges connecting media item nodes.
  • Each end user client initially computes a media proximity subgraph. Starting from a given media item node, the end user client can then derive a media player navigation list in the end user library by searching and navigating along the shortest proximity path between media items in the undirected graph.
  • the end user client When the end user client is connected to the network server, it uploads its subgraph to the server which consolidates it into a global graph by merging it with the various subgraphs already uploaded by other end user clients.
  • the server further provides media player recommendations to enrich the end user library around a given node item by searching for the optimal proximity path in the undirected consolidated graph that is not yet represented into the end user subgraph.
  • the media proximity edges are computed by the end user client as a function of a number of features such as:
  • the media library items similarity computation (automated proximity computation);
  • the end user interactions with the local media library similarity navigation paths e.g. like/dislike on playlists paths, classification of media items in different orthogonal categories, according to the user listening preferences and/or moods).
  • the media proximity edges are further dynamically updated in a personalized way by analyzing:
  • the user taste and behavior based on the end user interactions with the local media library similarity navigation paths and with the distributed media library recommendations;
  • the server can compute on the fly a personalized navigation list as well as further recommendations at the request of any end user, so that the latter benefits from a navigation and recommendation system adapted to his/her media library as closely as possible to his/her profile and media preferences.
  • FIG. 1 shows a client device.
  • FIG. 2 shows a distributed network of client devices and server.
  • FIG. 3 shows the high-level architecture of the client device mass memory according to the present invention.
  • FIG. 4 represents a media proximity graph in which the nodes represent the media items and the edges represent the proximity between them - the shorter the edge the closer the media items.
  • FIG. 5 represents an exemplary user interface for proximity-based navigation into a media library.
  • FIG. 6 represents three different end user media proximity graphs with a subset of shared nodes and edges.
  • FIG. 7 represents a merged media proximity graph as consolidated by the server.
  • FIG. 8 represents the high-level architecture of the media proximity navigation and recommendation server.
  • FIG. 9 represents in bold the nodes and edges that can be recommended out of a given end user subgraph in the consolidated server graph, based on proximity.
  • FIG. 10 shows the high-level architecture of a media proximity navigation and recommendation server enhanced with an end user taste analyzer functionality.
  • FIG. 1 illustrates an exemplary client device 100 comprising computing means 110, memory means 120, communication means 130, and user interaction means 140.
  • client device 100 comprising computing means 110, memory means 120, communication means 130, and user interaction means 140.
  • Examples of such devices are for instance conventional personal computers, handheld devices such as mobile phones or tablets, photo and video cameras.
  • Communication means may be wired or wireless connections, enabling for instance access to the Internet and remote servers.
  • Memory means may be internal or external mass storage, such as a computer hard disk or flash memory.
  • User interaction means may be embedded such as a tactile screen in the case of a tablet, or external such as for instance a keyboard, a mouse, a remote control, and a display screen.
  • FIG. 2 represents a network environment where a server 200, is connected with a number of client devices 100a, 100b, 100 through a communication network 250.
  • Communication network 250 may be LAN, WAN, intranet, the Internet, etc.
  • Client devices and network servers may communicate by means of various communication protocols, such as for instance HTTP in the case of a web server.
  • a client device 100a, 100b, 100c may connect only from time to time to one server 200 through communication network 250 to access services such as media discovery, streaming or download.
  • Client devices may also connect from time to time to one server 200 to report local media usage, user profile or user preferences.
  • Client devices may also connect to each other from time to time in a peer-to-peer mode to exchange information and data in relation with their local media library.
  • a digital media player software enabling user friendly navigation runs on client device 100 using its user interface UI 140 as represented by FIG. 3.
  • the client device 100 comprises a computer readable medium 310 in which a number of digital media items are stored, such as music files, in its memory or storage unit 120. Those items may be organized in a single library such as iTunes or in separate libraries.
  • the end user initializes the digital media player software by selecting the digital media items he/she wishes the digital media player 300 to handle.
  • the digital media player 300 processes each media item file to extract a set of item features, comprising:
  • explicit related metadata from the digital item file format, such as author, title, album and duration information in the case of music.
  • This information can be used as the basis for content identification, intrinsic content features, such as rhythm, chroma, tempo, instruments, that can be extracted from digital signal analysis of the content item and represented in a condensed form as a content item fingerprint.
  • various methods may be used at this stage, such as fingerprinting methods based on Mel Frequency Cepstral Coefficients analysis, or the thumbnail fingerprint proposed by Bartsch in "Audio thumbnailing of popular music using chroma-based representations", IEEE Transactions on Multimedia, vol.7, no. l , Feb 2005.
  • fingerprinting methods based on Mel Frequency Cepstral Coefficients analysis
  • thumbnail fingerprint proposed by Bartsch in "Audio thumbnailing of popular music using chroma-based representations”
  • they are preferably represented by a compact fingerprint value that uniquely characterizes the content item, such as the efficient binary audio fingerprint proposed by Jang et al in "Pairwise boosted audio fingerprint", IEEE Transactions of information forensics and security, Vol. 4, no,4, Dec 2009.
  • various fingerprinting methods may be used; we refer to the fingerprinting method result as the item fingerprint in the remainder of this disclosure.
  • the extracted content features such as the media metadata, intrinsic features and fingerprint, are stored as the properties of media item nodes in a computer readable medium 320, in the memory or storage unit 120 of client 100.
  • a similarity value can be also calculated by applying similarity feature weights in order to adapt the influence of each individual similarity feature in the final measurement.
  • the similarity measurement the closest the two media items; however the reverse convention is also possible.
  • the resulting media item similarity measurements are stored as content proximity edges in a computer readable medium 320, in the memory or storage unit 120 of client 100.
  • similarity value similarity measure or similarity measurement, however this shall not be considered as limitative; as known to those skilled in the art, the similarity measurement may also be represented in the form of a vector or matrix.
  • the media items and their relative proximity are represented into an undirected graph where the nodes represent the media items and the edges connecting two nodes represent the proximity between them.
  • a threshold can be defined to only store the edges whose proximity is above a given threshold. For instance in the graph of FIG. 4 media item nodes A and M are close enough to media item node S for the edges S-A and S-M to be recorded in the media proximity database 320, but that is not the case for media item nodes N, C, T and E so they are not directly linked to media item node S.
  • a navigation list such as a playlist in the case of music or a photo album in the case of photo, by searching the shortest transition between media items from any starting media item node or between any two media item nodes, according to the initial digital signal processing analysis and proximity measurement.
  • the end user can select item S as the starting media item node and a navigation list will be automatically proposed as S-M-N-T corresponding to the shortest path from node S to node T.
  • the client device 100 stores a list of content items S, A, M, N, C, T, E in its media item nodes database 310 and a list of content item proximity paths S-A, S-M, A-N, M-N, M-T, N-T, T-E, C-E, N-C, N-E, C-A in its media proximity edges database 320.
  • the media item nodes and the media proximity edges may be stored as different records in the same database.
  • navigation it is also possible to constrain the navigation by user-defined parameters such as the choice of both a start and end media items, or even a list of media items that the end user wishes to have into the navigation list.
  • user-defined parameters such as the choice of both a start and end media items, or even a list of media items that the end user wishes to have into the navigation list.
  • music it is also possible to hard-constrain the total navigation time for the navigation list, for instance as a maximum duration, or even as a series of desired transition times between selected media item nodes in a list of media items to be navigated through.
  • multiple navigation lists, corresponding to different end points from a given starting node or different path variants along a series of forced navigation path nodes may also be proposed by the navigation method and system.
  • the navigation and recommendation method may also include a further on-the-fly, dynamic processing step to underweight or overweight the edges paths based on the recent navigation history.
  • FIG.5 shows an exemplary smart player user interface operating according the proposed method and system.
  • the user can select a song out of a library of music items 500 so the player automatically proposes a navigation playlist 510 as the shortest path in the graph that is represented by a chain of music items as nodes 520, 521, 522 and proximity edges 530, 531, starting from the initial song 520.
  • the end user can experiment the playlist and vote on the relevance of the proximity connections 530, 531 by simply valuing them in a positive (+) or negative (-) way.
  • Respective positive and/or negative valuation numbers are recorded accordingly for each proximity edge in the media proximity edge database 320. For instance, + 1 for a positive valuation, -1 for a negative valuation, 0 by default.
  • Other valuation measurements are possible.
  • a slider may be used in the user interface to represent an extended valuation scale and the valuation numbers are then chosen within the extended valuation range.
  • the end user interface is kept as simple as possible; in particular no musical or technical expertise on the underlying media item features characterization and/or similarity measurement algorithm is required to adjust the underlying graph representation.
  • the player can also suggest one or more alternative navigation playlists as the next shortest paths in the graph, in particular when the user undervalues a proximity connection - an alternative can then be proposed.
  • the network connectivity can be used to provide a distributed media library navigation and recommendation system and method in which multiple client 100a, 100b, 100c connect, share and enrich their respective undirected graph data through a server 200 in charge with dynamically collecting and consolidating a global undirected graph encompassing the multiple graphs of connected clients 100a, 100b, 100c.
  • FIG. 6 represents an example of three graphs from clients 100a, 100b, 100c respectively where content item nodes A, B, C, E are shared by at least two graphs as well as proximity edge C-E.
  • the server 200 can build and consolidate a global graph, as shown on FIG. 7 with reference to the examples illustrated in FIG.6.
  • the graph is typically represented in server 200 by a database of media item nodes recording all local media item nodes as uploaded by the end users clients 100a, 100b, 100c, as well as a database of media proximity edges that may have been overvalued and/or undervalued by the end users.
  • the media item nodes and the media proximity edges databases may be merged, or each database may be further split; for instance in order to more efficiently manage a very large library of music, it is possible to identify "database bridging" edge candidates that connect two otherwise isolated subgraphs, and store each subgraph in a separate database together with its "database bridging" information to connect to the other database as relevant.
  • the databases may be hosted in the memory or storage unit of server 200.
  • the memory or storage unit may be internal or external, local or remote to the server 200.
  • the databases may also be attached to the same server 200 or distributed over a set of servers. For instance different servers may be associated with a service offering additional music purchase or streaming from specific catalogues and genres, and it may be relevant to specialize the corresponding database upload to the most relevant part of the end users graphs.
  • FIG. 8 represents the components of the server 200 system.
  • a proxy component 810 comprises communication means to manage the communications with the clients through communication network 250.
  • Example of communication means are an Ethernet port with TCP/IP and HTTP/HTTPS protocols; other communication means are also possible.
  • the proxy communication receives every client request and then dispatches them to the appropriate component of server 200.
  • the proxy component 810 forwards it to the client that initiated the request.
  • the proxy component 810 is also in charge with securing the communications, by means of authentication and/or encryption methods, as well as identifying the client version and dispatching the request accordingly, as known by one skilled in the art.
  • each client 100a, 100b, 100c uploads separately the information data characterizing each node and each edge from its graph to a merging component 820 in the server 200 through the proxy component 810.
  • the merger component 820 merges the uploaded client graphs into a consolidated overall graph, represented by a media item nodes database 824 and a proximity edge database 822.
  • the media item nodes and the media proximity edges may be stored as different records in the same database by the merger component 820.
  • each media item is associated with a media node unique identifier and a set of features associated with the media item, such as for instance in the case of music, the metadata information (title, genre, artist name, album name, year, duration, ...), the music intrinsic features and its the corresponding fingerprint value.
  • each edge is associated with an edge unique identifier, the media node unique identifiers of the two end nodes connected by the edge, their initial computed proximity value, the positive valuation number cumulated from end users valuation, the negative valuation number cumulated from end users valuation, and the adjusted proximity value dynamically computed by server 200 from the end users data (by default, equal to the initial computed proximity value).
  • the client Before sending the content item and its whole set of features, the client sends the fingerprint of the song to the proxy component 810 of server 200.
  • the proxy component looks for a match of the fingerprint in the media nodes database to determine if the content item node is already recorded.
  • the proxy component 810 is responsible for associating a unique content item node identifier to each content item fingerprint by means of a lookup table. For each uploaded media fingerprint corresponds exactly one and only one media node identifier.
  • This lookup table can be implemented as a key-value store.
  • Other embodiments are also possible, as will be recognized by one skilled in the art. For instance, in order to associate multiple media items that are directly correlated but have a different fingerprint value, for instance different resolutions of the same picture as different items or the long, short and remix versions of the same song as different items, a matching tree may be employed.
  • the media item fingerprint is not found in the server 200 records, a unique media node identifier is created for it by the proxy module, a new entry is created in the fingerprint proxy lookup table, and/or a new record is created in the media node database 824 to store the uploaded media item information from the client. If it is not possible to add it immediately, the request is put in a node queue waiting to be processed. A background process reads this queue and processes the media item nodes one after the other to add them into the proxy lookup table and/or the media node database 824.
  • the media node information is then uploaded by the client and added to the media node database.
  • it can then be linked to an existing artist and/or album record in the media node database 824, or used to create the artist and/or album records if not registered yet.
  • the media item is also linked with the user identifier of the uploading user. If the media item is found in the media node database 824, only the uploading user identifier is further recorded in association with the existing content item record.
  • the merging component 820 may also check that the uploaded metadata information corresponds to the one recorded (in the case of music, correct artist and album information, year, meta genre, etc ..) and provide a corrective proposition feedback to the client through the proximity component 810 as relevant.
  • a special case may occur when the client is disconnected from the server before the media node upload can be processed out of the node queue by the merging component 820.
  • the media node is added to the graph as an empty placeholder, without user information, to be updated later on when the client connects back.
  • the merging component 820 also handles every edge upload request. If the edge is not found in the media proximity edges database 822, it is added to the database. If it is not possible to add it immediately, it is put in a queue waiting to be processed. This queue is preferably separate from the node queue, but other embodiments are also possible.
  • a background process reads this queue and processes the edge items one after the other to add them into the media proximity edges database 822. When the background process processes the edge, it first verifies that both ends exist as node records in the media nodes database. It then verifies that the edge does not already exist, and if not, creates it. As long as none of the end nodes already exist in the media nodes database, the edge is not popped out from the queue.
  • the background process tries to insert it regularly until it is inserted successfully. This particular case occurs when an edge insertion request arrives before the two end nodes (media items) are created in the media item database.
  • the proximity edge information as uploaded by the client is then added to the server media proximity edges database 822.
  • the edge positive or negative valuation information is recorded.
  • the proximity edge is also linked with the user identifier of the uploading user.
  • the edge positive and/or negative valuation information from the uploading user are used to update the database records.
  • the value may simply be added to the existing recorded value. In an alternate embodiment, the value may be weighted by a trust ratio associated with the uploading user.
  • the merger component 820 updates the adjusted proximity value record accordingly, for instance by increasing the proximity distance as a function of the recorded negative valuation numbers and/or decreasing the proximity distance as a function of the recorded positive valuation numbers and/or adjusting the proximity distance as a function of the positive and the negative valuation numbers.
  • One significant advantage of the proposed method and system is that the computational load in building and adjusting the overall proximity graph is distributed over a number of clients, which makes it cost effective and scalable without requiring significant processing power on the server side. Moreover, thanks to the undirected graph representation, only one edge needs to be processed for each pair of content items, and thanks to the proximity threshold heuristics, it is not necessary to build complete graphs.
  • the server 200 From the merged graph, it is possible for the server 200 to suggest further content recommendations to connected client 100a, 100b, 100c by means of a further recommendation manager component 830, as depicted on FIG. 8. For instance with reference to FIG. 6 and FIG. 7, content item G next to content item C in client 100c subgraph may be recommended to client 100a whose subgraph also includes content item C, but not content item G.
  • FIG. 9 shows in bold the candidate nodes and edges to be recommended accordingly to client 100a.
  • the proposed method is adaptive: it is possible for the end user to provide his/her valuation feedback to adjust the graph beyond the initial automatic computation to capture more accurately the subjective perception of the content navigation relevance.
  • the feedback of other users can also be taken into account, so that each user has an adjusted content item proximity measurement constantly modified as the number of feedback on the proximity edges by other end users increases.
  • the server recommendation component 830 recommends further connections to client 100a by looking at the proximity edges that are connected to content items of the client 100a subgraph by only one end (i.e. the other end node does not belong to the client 100a subgraph) and sorting them according to their adjusted proximity value, so that the closer media items are recommended first.
  • the server recommendation component 830 operates under request by the client through the proxy component 810 to propose recommendations specifically associated with a media item. Proximity edges associated with this media item and not yet part of the client graph can then be proposed as recommendation paths and enrich the diversity of playlists that can be generated on the client side.
  • the server 200 further comprises a taste analyzer component 1000, as depicted by FIG. 10, to analyze the user taste based on a number of user specific features.
  • Some of the latter features can be pre-analyzed in the client side and transmitted by the client to the server analyzer tool 1000 through the server proxy component 810, such as the set of media nodes in the user client database, and the set of media proximity edges the end user has updated by positive or negative valuations.
  • Features related to the interaction of the end user with the system may also be used, such as the number of recommendations requested, the number of recommendations further liked (positive valuation of the corresponding edge), the number of bug reports, the number of invited friends, etc.
  • Those features are represented by a taste matrix of different features for each end user.
  • the taste analyzer component 1000 dynamically identifies a cluster of users that are "music mates" for each user, for instance by selecting a set of the 5, 10 or 20 closest users.
  • the analyzer tool looks at the media proximity edges and/or at the media nodes uploaded by a client to look for other users sharing the same media proximity edge and/or media node in their own graph.
  • the latter information is particularly useful to provide more relevant recommendations to the end users by focusing the recommendation search to users from the same cluster. For instance with reference to FIG. 6 and FIG. 7, if client 100b is part of the client 100a closest users cluster but not client 100c, it is even more relevant for the server to recommend to client 100a the connection of content item B and content item C, from subgraph of client 100b, rather than the connection of content item G and content item C, from subgraph of client 100c.
  • the server 200 recommends further connections to client 100a by looking in the proximity edge database 822 at the proximity edges that are connected to media nodes of the client 100a subgraph by only one end (i.e. the other end node does not belong to the client 100a subgraph) and sorting them according to their user identifier (i.e. the other end node belongs to the subgraph of a client identified in client 100a user proximity cluster) and their adjusted proximity value, so that the closer media items in connected graphs of users from client 100a user cluster are recommended first.
  • the proximity relevance also depends to the time of the day, mood or activity to another extent. Therefore, one way to further adapt the media library to the end user needs is to categorize the media item nodes in a set of non-overlapping, orthogonal categories, for instance in terms of mood (energy, romance, relaxation, for instance) or activity (work, fitness, lounge, party, for instance).
  • a media item must belong to only one category, exclusively, into a given set of categories.
  • several different sets may be defined by the proposed system, for instance the mood set and the activity set, and the end user can then classify the content item in both.
  • the resulting classification in addition to the undervaluation or overvaluation of the proximity edges, provides further information on the user profile that can be analyzed by the taste analyzer component 1000 to match similar end user profiles. Accordingly, the recommendation component 830 can then provide more relevant recommendations, matching more accurately the user preferences.
  • the end user is associated with a reputation ratio.
  • the reputation for an end user is computed based on a number of factors, such as the size of the user client database, the heterogeneity of the user client database, the number of uploaded content items further connected to other end users graphs, the computed connectivity with other users (number of connected graphs by at least one media node), the popularity measured as a the number of invited friends, the number of media classifications in the category sets, the number of positive proximity valuations, the number of negative proximity valuations, the number of contributions to the network improvement (bug reports, metadata tagging, etc), the number of requested recommendations, the number of liked recommendations... These factors can be weighted and summed up to measure a reputation score for each end user.
  • the reputation score can then be used by the merging component 820 to underweight or overweight the end user valuations by a trust factor as a function of the reputation score. It can also be used, as well as by the recommendation component and/or the taste analyzer component in their respective search heuristics. It can also be used to reward the user in association with the underlying business model, for instance by using it as a virtual money currency to buy e-goodies, special features, new content, themes, etc.
  • the proposed system and method offers a number of advantages over the prior art solutions for media library navigation and recommendation.
  • the preferences of the end users are taken into account both at a local (client) and global (server) level to dynamically adapt the media library navigation and recommendation to the preferences, behavior and profile of the end user.
  • the proposed system and method does not operate on a specific media library, but rather on the connections between media library items, independently from the underlying media library source, represented into an undirected graph whose nodes correspond to the media items and edges to the connections between them.
  • the proximity connections between two different media library items depend on the computed static similarity between their intrinsic media features, but also adapt to the end user taste as dynamically analyzed from his/her interaction with the navigation and recommendation lists in the proposed method and system.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

La présente invention concerne un système et un procédé destinés à optimiser la navigation et la recommandation dans un réseau distribué de médiathèques. L'invention propose ainsi un système et un procédé destinés à aider à la sélection d'articles multimédias, ledit procédé comprenant les étapes consistant à : - extraire des caractéristiques intrinsèques des articles multimédias comportant une référence multimédia et mémoriser lesdites caractéristiques intrinsèques d'articles multimédias en association avec lesdites références multimédias en tant que nœuds d'articles multimédias graphiques dans une base de données, - calculer, pour une paire d'articles multimédias extraits, une valeur de similitude représentant un niveau de similitude entre les caractéristiques intrinsèques extraites de la paire d'articles multimédias, - déterminer une mesure de proximité sur la base d'au moins ladite valeur de similitude et d'au moins un paramètre de préférence d'utilisateur, - mémoriser la mesure de proximité en association avec la paire respective de références multimédias desdits articles multimédias en tant que marges de proximité graphique dans une base de données si la valeur de similitude dépasse un certain seuil, - établir un trajet de sélection, à partir d'un nœud donné d'articles multimédias, en reliant ledit article multimédia à un autre article multimédia par le biais de la marge présentant le degré le plus élevé de mesure de proximité.
PCT/IB2013/055315 2012-06-29 2013-06-28 Système et procédé de navigation et de recommandation de médiathèque WO2014002064A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US201261665908P 2012-06-29 2012-06-29
US61/665,908 2012-06-29

Publications (1)

Publication Number Publication Date
WO2014002064A1 true WO2014002064A1 (fr) 2014-01-03

Family

ID=49212999

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2013/055315 WO2014002064A1 (fr) 2012-06-29 2013-06-28 Système et procédé de navigation et de recommandation de médiathèque

Country Status (1)

Country Link
WO (1) WO2014002064A1 (fr)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103974097A (zh) * 2014-05-22 2014-08-06 南京大学镇江高新技术研究院 基于流行度和社交网络的个性化用户原创视频预取方法及系统
US20140372413A1 (en) * 2013-06-17 2014-12-18 Hewlett-Packard Development Company, L.P. Reading object queries
US20150149484A1 (en) * 2013-11-22 2015-05-28 Here Global B.V. Graph-based recommendations service systems and methods
WO2015170126A1 (fr) * 2014-05-09 2015-11-12 Omnifone Ltd Procédés, systèmes et produits programme d'ordinateur pour identifier des points communs rythmiques entre des pistes musicales disparates et utiliser ces informations pour réaliser des recommandations de musique
US10255503B2 (en) * 2016-09-27 2019-04-09 Politecnico Di Milano Enhanced content-based multimedia recommendation method

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6545209B1 (en) 2000-07-05 2003-04-08 Microsoft Corporation Music content characteristic identification and matching
US20030221541A1 (en) * 2002-05-30 2003-12-04 Platt John C. Auto playlist generation with multiple seed songs
WO2005031517A2 (fr) 2003-09-23 2005-04-07 Parasoft Corporation Systeme et procede d'empreintes digitales audio
US20060080356A1 (en) * 2004-10-13 2006-04-13 Microsoft Corporation System and method for inferring similarities between media objects
US20060107823A1 (en) * 2004-11-19 2006-05-25 Microsoft Corporation Constructing a table of music similarity vectors from a music similarity graph
US20060179414A1 (en) * 2005-02-04 2006-08-10 Musicstrands, Inc. System for browsing through a music catalog using correlation metrics of a knowledge base of mediasets
US7627605B1 (en) * 2005-07-15 2009-12-01 Sun Microsystems, Inc. Method and apparatus for generating media playlists by defining paths through media similarity space
US20120023403A1 (en) 2010-07-21 2012-01-26 Tilman Herberger System and method for dynamic generation of individualized playlists according to user selection of musical features
US8185558B1 (en) * 2010-04-19 2012-05-22 Facebook, Inc. Automatically generating nodes and edges in an integrated social graph

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6545209B1 (en) 2000-07-05 2003-04-08 Microsoft Corporation Music content characteristic identification and matching
US20030221541A1 (en) * 2002-05-30 2003-12-04 Platt John C. Auto playlist generation with multiple seed songs
WO2005031517A2 (fr) 2003-09-23 2005-04-07 Parasoft Corporation Systeme et procede d'empreintes digitales audio
US20060080356A1 (en) * 2004-10-13 2006-04-13 Microsoft Corporation System and method for inferring similarities between media objects
US20060107823A1 (en) * 2004-11-19 2006-05-25 Microsoft Corporation Constructing a table of music similarity vectors from a music similarity graph
US20060179414A1 (en) * 2005-02-04 2006-08-10 Musicstrands, Inc. System for browsing through a music catalog using correlation metrics of a knowledge base of mediasets
US7627605B1 (en) * 2005-07-15 2009-12-01 Sun Microsystems, Inc. Method and apparatus for generating media playlists by defining paths through media similarity space
US8185558B1 (en) * 2010-04-19 2012-05-22 Facebook, Inc. Automatically generating nodes and edges in an integrated social graph
US20120023403A1 (en) 2010-07-21 2012-01-26 Tilman Herberger System and method for dynamic generation of individualized playlists according to user selection of musical features

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
BARTSCH: "Audio thumbnailing of popular music using chroma-based representations", IEEE TRANSACTIONS ON MULTIMEDIA, vol. 7, no. L, February 2005 (2005-02-01)
JANG ET AL.: "Pairwise boosted audio fingerprint", IEEE TRANSACTIONS OF INFORMATION FORENSICS AND SECURITY, vol. 4, no. 4, December 2009 (2009-12-01)
RENZO ANGELS ET AL: "Survey of graph database models", 1 February 2008 (2008-02-01), USA, pages 1 - 39, XP055086350, Retrieved from the Internet <URL:http://dl.acm.org/citation.cfm?doid=1322432.1322433> [retrieved on 20131031], DOI: 10.1145/1322432.1322433 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140372413A1 (en) * 2013-06-17 2014-12-18 Hewlett-Packard Development Company, L.P. Reading object queries
US9405853B2 (en) * 2013-06-17 2016-08-02 Hewlett Packard Enterprise Development Lp Reading object queries
US20150149484A1 (en) * 2013-11-22 2015-05-28 Here Global B.V. Graph-based recommendations service systems and methods
US9760609B2 (en) * 2013-11-22 2017-09-12 Here Global B.V. Graph-based recommendations service systems and methods
WO2015170126A1 (fr) * 2014-05-09 2015-11-12 Omnifone Ltd Procédés, systèmes et produits programme d'ordinateur pour identifier des points communs rythmiques entre des pistes musicales disparates et utiliser ces informations pour réaliser des recommandations de musique
CN103974097A (zh) * 2014-05-22 2014-08-06 南京大学镇江高新技术研究院 基于流行度和社交网络的个性化用户原创视频预取方法及系统
CN103974097B (zh) * 2014-05-22 2017-03-01 南京大学镇江高新技术研究院 基于流行度和社交网络的个性化用户原创视频预取方法及系统
US10255503B2 (en) * 2016-09-27 2019-04-09 Politecnico Di Milano Enhanced content-based multimedia recommendation method

Similar Documents

Publication Publication Date Title
US10579667B2 (en) Software preference affinity recommendation systems and methods
US20090138457A1 (en) Grouping and weighting media categories with time periods
US8438168B2 (en) Scalable music recommendation by search
US20170300567A1 (en) Media content items sequencing
US20180053261A1 (en) Automated Compatibility Matching Based on Music Preferences of Individuals
CN104182413B (zh) 多媒体内容的推荐方法与系统
US20090144273A1 (en) System and method for music and compatibility matching
CA2764359A1 (fr) Dispositif et procede de selection d&#39;au moins un support a recommander a un utilisateur
WO2021135701A1 (fr) Procédé et appareil de recommandation d&#39;informations, dispositif électronique et support de stockage
WO2014002064A1 (fr) Système et procédé de navigation et de recommandation de médiathèque
WO2007077991A1 (fr) Dispositif et procede de traitement d’informations, et programme
CN102567447A (zh) 信息处理装置和方法、信息处理系统及程序
Darshna Music recommendation based on content and collaborative approach & reducing cold start problem
US20220222294A1 (en) Densification in Music Search and Recommendation
JP2007207218A (ja) 情報処理装置および方法、並びにプログラム
US20190294690A1 (en) Media content item recommendation system
Pichl et al. User models for multi-context-aware music recommendation
Kathavate Music recommendation system using content and collaborative filtering methods
Tacchini Serendipitous mentorship in music recommender systems
Chen et al. A mood‐and situation‐based model for developing intuitive pop music recommendation systems
Ospitia-Medina et al. Music Recommender Systems: A Review Centered on Biases
Jannach et al. Music Recommendations
US7254618B1 (en) System and methods for automatic DSP processing
Rimaz et al. Audiolens: audio-aware video recommendation for mitigating new item problem
Bhardwaj et al. Recommendation System for Music based on Content and Popularity Ratings

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13763298

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13763298

Country of ref document: EP

Kind code of ref document: A1