AU2014200583A1 - Apparatus and method for generating a collection profile and for communicating based on the collection profile - Google Patents

Apparatus and method for generating a collection profile and for communicating based on the collection profile Download PDF

Info

Publication number
AU2014200583A1
AU2014200583A1 AU2014200583A AU2014200583A AU2014200583A1 AU 2014200583 A1 AU2014200583 A1 AU 2014200583A1 AU 2014200583 A AU2014200583 A AU 2014200583A AU 2014200583 A AU2014200583 A AU 2014200583A AU 2014200583 A1 AU2014200583 A1 AU 2014200583A1
Authority
AU
Australia
Prior art keywords
collection
profile
information
media data
entity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU2014200583A
Inventor
Dagfinn Bach
Sebastian Schmidt
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Bach Tech AS
Original Assignee
Bach Tech AS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from AU2009287066A external-priority patent/AU2009287066A1/en
Application filed by Bach Tech AS filed Critical Bach Tech AS
Priority to AU2014200583A priority Critical patent/AU2014200583A1/en
Publication of AU2014200583A1 publication Critical patent/AU2014200583A1/en
Abandoned legal-status Critical Current

Links

Abstract

Abstract Apparatus for generating a collection profile of a collection of different media data items, the media items 5 being audio files, comprising: a feature extractor for extracting at least two different features describing a content of a media data item from a plurality of media data items of the collection; and a profile creator for creating the collection profile by combining the extracted features 10 or weighted extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the collection, wherein the apparatus further comprises an input for receiving information on a music taste of a user of the 15 collection of different audio files, and wherein the profile creator is operative to create a raw collection profile without information on a user behavior logged by the profile creator or information on the music taste, and to weight the raw collection profile using weights derived 20 from the information on the music taste or the user behavior to obtain the collection profile. Fig. 1 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 storage in mobile device (phone, player, PDA, notebook, navigation device,.). NO. METADATA MEDIA DATA ITEM 45a 1 F1, F2... audiofile, videofile,... 45b 2 Fl, F2... audiofile 45c 3 F1, F2... audiofile 45d 4 Fl. F2... audiofile feature extractor (calculation or meta data reading) F1, F2 per media data item profile creator for creating the collection profile such as a user music DNA collection profile of user device 50 - (fingerprint of content of the " collection) 60 information handler 65 action based on the collection profile

Description

1 Apparatus and method for generating a collection profile and for communicating based on the collection profile Related Application 5 This application is a divisional application of Australian application no. 2009287066 the disclosure of which is incorporated herein by reference. Most of the disclosure of that application is also included herein, however, 10 reference may be made to the specification of application no. 2009287066 as filed or proposed to be amended to gain further understanding of the invention claimed herein. Description 15 The present invention is related to media data processing and, particularly, to media data characterization and usage thereof. 20 Mobile media data players, such as MP3 players or MP4 players, are becoming more and more popular. Furthermore, due to the high data compression rates obtained by modern audio coding tools, the storage requirements for the individual audio or video files are decreasing. 25 Concurrently, the prices for memories are decreasing as well. These memories can be chip memories or hard disks, which are used in mobile video or audio players. Since the prices for these memories drop more and more, users can store more and more different media data items on their 30 players. Furthermore, audio and/or video playing functionalities are not restricted to dedicated audio or video players, but are available in even more mobile devices, such as mobile 35 phones, PDAs, navigation devices, etc. Naturally, notebook computers also have full audio/video-playing functionalities and, of course, almost unlimited storage resources due to huge hard disk resources. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 2 In view of that, users store more and more media data items, such as audio files or video files, on their electronic data carriers, and the task for managing these 5 increasing databases is become more and more difficult. Many mobile devices support the ID3 tag, which indicates, for an audio file, the title of the audio file, the author or a band performing the audio file, an album to which the audio file belongs to and, for example, the playing time of 10 the audio file. These information items can be read by a mobile player, can be displayed by the mobile player and can be used for assisting the user in editing playlists, etc. 15 Furthermore, the MPEG-7 standard has defined the inclusion of additional metadata to audio files. These audio file metadata include additional features related to the audio content of a media data item, such as tempo, beats per minute, etc. These features can be used for characterizing 20 a media data item in a much more content-related way and these features form a basis for finally reaching a search capability among audio files comparable to search functionalities in text files. 25 The specific selection of a media data item on an electronic storage device represents a very user-specific individual collection, since each user will load different media data items on her or his specific personal electronic storage device. Therefore, a collection of media data items 30 will be a very personal issue characterizing the user of this electronic media data storage device. The invention provides apparatus for generating a collection profile of a collection of different media data 35 items, the media items being audio files, comprising: a feature extractor for extracting at least two different features describing a content of a media 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 3 data item from a plurality of media data items of the collection; and a profile creator for creating the collection profile 5 by combining the extracted features or weighted extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the collection, 10 wherein the apparatus further comprises an input for receiving information on a music taste of a user of the collection of different audio files, and 15 wherein the profile creator is operative to create a raw collection profile without information on a user behavior logged by the profile creator or information on the music taste, and to weight the raw collection profile using weights derived from the information on 20 the music taste or the user behavior to obtain the collection profile. The invention also provides apparatus for generating a collection profile of a collection of different media data 25 items, the media items being audio files, comprising: a feature extractor for extracting at least two different features describing a content of a media data item from a plurality of media data items of the 30 collection; and a profile creator for creating the collection profile by combining the extracted features or weighted extracted features for the plurality of media data 35 items so that the collection profile represents a quantitative fingerprint of a content of the collection, 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 4 wherein the profile creator is operative to weight the extracted features using weights derived from the information on a music taste or a user behavior logged by the profile creator, and to combine the weighted 5 extracted features to obtain the collection profile. The invention also provides a method of generating a collection profile of a collection of different media data items, the media items being audio files, comprising: 10 extracting at least two different features describing a content of a media data item from a plurality of media data items of the collection; and 15 creating, by a profile creator, the collection profile by combining the extracted features or weighted extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the 20 collection, wherein the method further comprises the step of receiving, by an input, information on a music taste of a user of the collection of different audio files, 25 and wherein the step of creating comprises: creating a raw collection profile without information 30 on a user behavior logged by the profile creator or information on the music taste, and weighting the raw collection profile using weights derived from the information on the music taste or the user behavior to obtain the collection profile. 35 The invention also provides a method of generating a collection profile of a collection of different media data items, the media items being audio files, comprising: 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 5 extracting at least two different features describing a content of a media data item from a plurality of media data items of the collection; and 5 creating, by a profile creator, the collection profile by combining the extracted features or weighted extracted features for the plurality of media data items so that the collection profile represents a 10 quantitative fingerprint of a content of the collection, wherein the step of creating comprises weighting the extracted features using weights derived from the 15 information on a music taste or a user behavior logged by the profile creator, and combining the weighted extracted features to obtain the collection profile. A quantitative fingerprint, when generated based on audio 20 data, can also be called a "music DNA", since this music DNA individually characterizes the collection of different media data items. Since a collection of different media data items will be different for each person having generated a collection of different media data items, the 25 quantitative fingerprint for each media data collection will be different from quantitative fingerprints from other media data items or other media item collections. Generally, a quantitative fingerprint, fulfills two 30 different and contradicting requirements for a fingerprint. On the one hand, the data amount required by the fingerprint should be as small as possible so that the storage amount required for storing a fingerprint is not too high and, even more important, the processing 35 requirements for a usage of this fingerprint within a database, where this fingerprint is to be matched with other corresponding fingerprints to find similar profiles 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 6 or fingerprints, are the smaller the smaller the fingerprint is. On the other hand, the fingerprint has to be as 5 characteristic as possible for the item indicated by the fingerprint. To make sure that the fingerprint is characteristic for a user collection, the fingerprint is not derived, for example, by a pure hashing or any other highly lossy compression, but the fingerprint is derived 10 from features representing a content of the media data so that the fingerprint does not represent just the media data files without any further information, but the fingerprint actually represents the content of the media data items rather than, for example, a waveform or a bit structure of 15 media data items. The inventive collection profile, therefore, provides the ability to be processed by electronic data equipment due to the fact that it is a quantitative fingerprint which, due 20 to its quantitativeness, can be processed within database matching operations etc. On the other hand, the fingerprint is derived from useful content features so that the fingerprint represents the content of a media data collection which, in an embodiment, will be a kind of an 25 average over each separate feature so that the fingerprint consists of a collection or a vector of several averaged different features. Importantly, the fingerprint does not indicate a certain 30 waveform or a certain bit structure, but indicates the media taste, such as the audio taste or the video taste of the user of the collection of different media data items. In accordance with the present invention, this collection 35 profile which does not represent a metadata for a single file, but which represents a metadata for a collection of many different media data items, can be used for many different purposes which all have in common that, based on 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 7 a collection profile and a certain usage of a collection profile within a database application, a certain communication operation of an electronic data processing equipment is conducted so that the user performs a certain 5 action or receives a certain service optimally suited for her or his media taste represented by the collection profile, which can, for example, also be called a music DNA, when the media data items are audio files. 10 Subsequently, preferred embodiments of the present invention are explained in detail by referencing the enclosed figures, in which: Fig. 1 illustrates a schematic overview of an apparatus 15 for generating a collection profile in accordance with an embodiment of the present invention; Fig. 2 illustrates a block diagram of an apparatus for communicating information from a first entity to 20 a second entity in a schematic view; Fig. 3 illustrates a preferred process for generating an output collection profile having information on the media collection and, additionally, 25 information on the storage device storing the media data collection; Fig. 4a illustrates an exemplary processing of different feature vectors in order to generate a music DNA, 30 and the further processing of the music DNA for the purpose of matching; Fig. 4b illustrates a result list of the matching operation as obtained from Fig. 4a; 35 Fig. 4c illustrates a diagram of the distance versus the media item number of a sorted result list for 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 8 illustrating the mapping of the confidence measure; Fig. 4d illustrates a flow chart for illustrating a 5 preferred embodiment of an inventive matching operation, in which confidence measures are used; Fig. 5a illustrates a certain usage scenario of the collection profile for the purpose of playing 10 selected audio pieces for a group or a room; Fig. 5b illustrates an alternative implementation scenario, in which a user can join a user group based on his collection profile; 15 Fig. 5c illustrates an alternative implementation, in which the music DNA of a user is used for providing him with matching media items or with advertisements for products or services related 20 to her or his media data taste; Fig. 5d illustrates an alternative implementation scenario, in which the music DNA of a user is modified in accordance with the current user 25 taste or an external situation, so that the user can find a matching media item from his own profile using the modified collection profile; Fig. 6 illustrates a schematic representation of a user 30 DNA; Fig. 7 illustrates a feature overview of qualitative and quantitative features, which can be combined into a user collection profile; 35 Fig. 8 illustrates a schematic representation of an audio file having associated metadata; 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 9 Fig. 9 illustrates an explanation of the application scenario, in which products related to certain collection profiles are selected and advertised; 5 Fig. 10 illustrates a schematic diagram of the apparatus for generating a collection of different media data items; and Fig. 11 illustrates the data format of a media metadata 10 portion for a media data item. Fig. 1 illustrates in a schematic way a preferred embodiment of an apparatus for generating a collection profile 50 of a collection 45 of different media data items 15 45a, 45b, 45c, 45d. Preferably, these media data items are stored within a storage 40 of a mobile device, wherein this mobile device may comprise a mobile phone, an audio data player, a video data player, a personal digital assistant (PDA), a notebook, a navigation device or a similar device 20 having the capability of storing media data files, such as a memory stick, which may or may not have any audio data or video data playing capabilities. Furthermore, the collection may comprise all media data items on a storage device or may comprise only a certain portion of media data 25 items on a storage device or may even comprise the storage elements of several different devices owned by a user, such as a personal computer, a notebook, a mobile player, all owned by one and the same user. When, however, electronic storage devices storing media data items of a user are all 30 synchronized to each other, then it will be sufficient to have only a single device for deriving the collection profile in order to have a very sophisticated user DNA. The apparatus comprises a feature extractor 10 for 35 extracting at least two different features 11 describing a content of a media data item, for a plurality of media data items of the collection. Therefore, the feature extractor 10 will process one media data item after the other in 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 10 order to derive, for each media data item, the different features Fl, F2, Fi. In one embodiment, the feature extractor 10 is operative to actually analyze the media data item to derive the at least two different features 5 describing the content of the media data item. In other more preferred embodiments, the media data items have already associated metadata as indicated at 45 in Fig. 1 so that the feature extractor only has to parse and evaluate the metadata portion of a media data item in order to 10 extract the at least two different features Fl, F2 of a single media data item. All the features extracted by the feature extractor 10 are supplied to a profile creator 20 for creating the collection profile. The profile creator combines the extracted features or weighted extracted 15 features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the media data collection. When the media data files are audio files, then the collection profile is the user music DNA, wherein the term "DNA", which normally 20 means deoxyribonucleic acid in biology, stands for a very individual user-specific characterization of the user's music taste. The collection profile 50 of the user can be stored for 25 later use or can be transmitted to a different entity via an information handler 60, or can also be used for performing matching operations in a database, where this database may comprise collection profiles of other users or fingerprints for audio data items, where a matching audio 30 data item which best matches the collection profile can be located, etc. Generally, the information handler 60 will perform an action 65 which, in a way, is based on the collection profile in order to provide the user of the collection profile or a different entity, with which the 35 user of the collection profile may communicate, with a certain message/service, etc. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 11 In a preferred embodiment, the profile creator 20 is operative to generate the quantitative collection profile using an addition, a subtraction, a multiplication or a division of the extracted features or of the weighted 5 extracted features. Specifically, features belonging to the same content are combined among themselves, so that the collection profile which is obtained in the end by the profile creator has a number of collection profile components which is identical to the number of features 10 extracted by the feature extractor. Fig. 2 illustrates an apparatus for communicating information from a first entity to a second entity in accordance with a preferred embodiment of the present 15 invention. The apparatus for communicating comprises a collection profile information generator 70 for generating information on a first collection profile of a first collection at the first entity or for generating information on a second collection profile of a second 20 collection at the second entity. Specifically, the collection profile represents a quantitative fingerprint of a content of the collection. As discussed in connection with claim 1, the collection profile information generator 70 can comprise the same components as the device in Fig. 25 1. Alternatively, the collection profile information generator 70 can retrieve an earlier generated collection profile from a collection profile storage, where the stored collection profile can be generated by a device in accordance with Fig. 1, which is separate from the 30 collection profile information generator 70 of Fig. 2. The apparatus illustrated in Fig. 2 furthermore comprises an information handler, which corresponds to the information handler 60 of Fig. 1, since the device 60 in 35 Fig. 2 receives similar input as the corresponding element 60 of Fig. 1. The information handler is operative for transmitting information based on the first collection profile or the second collection profile to the first 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 12 entity or the second entity. Alternatively, the information handler 60 is operative to receive a message based on the first collection profile or the second collection profile from the first entity or the second entity depending on 5 whether the apparatus for communicating is residing in the first entity or the second entity. Specifically, when the apparatus for communicating illustrated in Fig. 2 is residing in the first entity, a transmission takes place to the second entity and a reception takes place from the 10 second entity. When, however, the apparatus illustrated in Fig. 2 is residing within the second entity, then a transmission to the first entity or a reception from the first entity takes place. 15 Subsequently, Fig. 3 will be described in more detail in order to show a preferred sequence of operations to be performed in this or a similar order for outputting a collection profile such as a music DNA. 20 In a step 81, features for audio files are extracted. The operation performed in step 81 corresponds to the procedure performed by item 10 of Fig. 1. In a step 82, a predefined number of extreme values are deleted. Therefore, in order to make sure that any reading or extraction errors do no 25 influence the final collection profile too much, a number of largest or extreme values from the extracted features of all audio files are deleted. Subsequent to the step of deleting indicated at 82 in Fig. 30 3, a raw collection profile is calculated as indicated at step 83. To this end, the corresponding features of the different audio files are combined to obtain a quantitative raw collection profile subsequent to step 83. Then, in a step 84, the raw collection profile or individual features 35 are weighted in accordance with a user taste input 85. In this embodiment, a user can input some taste specifics, and this is a way that the user can influence his collection profile/music DNA. This weighting can take place with the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 13 raw collection profile in a specific way so that a certain taste corresponds to certain weighting factors for different features. Alternatively, the individual features can be weighted before being combined by step 83. Thus, a 5 user can increase the influence of a certain media data item on the user profile by applying a weighting factor larger than 1 to a preferred media data item and, probably, by applying weighting factors smaller than 1 to all other media data items. 10 In a further preferred embodiment, the output of step 84 is refined in a step 86 based on statistics of a usage behavior which is input into block 86 via usage behavior input 87. Step 86 makes sure that the collection profile is 15 "living" in that it changes with the changing user habits. Again, the profile refinement based on the statistics of the usage behavior can take place using the raw collection profile at the output of step 83 or the weighted raw collection profile at the output of step 84. Alternatively, 20 as indicated in Fig. 3, the statistics of the usage behavior can be accounted for by weighting extracted features so that a feature vector of a certain media data item which is often used is emphasized with respect to a feature vector of a media data item which is used less 25 times. Additionally, the time from the last usage to the actual time can be accounted for so that the fact that a media data item has been used more recently leads to an emphasizing of this media data item with respect to media data items which have been used not so recently. 30 The output of step 86 is a specific up-to-date and user taste-adapted collection profile, which is preferably improved by adding information on a mobile device and/or a communication service provider as indicated in step 87. 35 Specifically, information on the manufacturer of the device to which the electronic storage device is operatively connected, such as the manufacturer of the mobile phone or the manufacturer of the notebook is added to the user 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 14 collection and, additionally, information on the communication provider is preferably added as well in order to provide an output collection profile 89 which is useful for many applications which will be described with respect 5 to Figs. 5a to 5c. In a different embodiment, the output of step 83 can be used as it is without any of the additional steps 84, 86, 88. Furthermore, only one or two of the steps 84, 86, 88 10 can be applied to the output of step 83 in order to provide an output collection profile. In a further preferred embodiment, the generation of the output collection profile/user DNA is conducted in an 15 automatic way. To this end, an extraction of suitable DNA tags (metadata) from all songs in the collection is performed. Then, the tags are analyzed for similar features and outliers are excluded. Then, the high-level tags are weighted based on the user taste, which preferably 20 corresponds to the alternative in Fig. 3, in which the individual feature vectors are weighted before being combined. Additionally, the profile is refined by using statistics of the usage behavior. Apart from a time usage, this feature also allows to prefer certain moods to other 25 less preferred moods. Subsequently, an exemplary determination of the collection profile is discussed in connection with Fig. 4a. Item 100 illustrates a collection of features extracted for 30 different media data items as indicated at 45 in Fig. 1. Specifically, line 100a illustrates the metadata for five different features Fl to F5 for the first audio file indicated at 45a in Fig. 1. Analogously, item 100b illustrates the extracted features for the second media 35 data item of block 45 in Fig. 1. The set of media data items, i.e., the collection for which a certain profile is calculated, is represented by the five 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 15 exemplary media data items in Fig. 4a at 100. It is visible from Fig. 4a that all five features are quantitative features, which are given in a certain value range, which extends between 0 and 10. In the embodiment of Fig. 4a, 5 each quantitative feature has the same value range extending between 0 and 10. In other embodiments, however, the value range for a feature can include negative values as well. Furthermore, the value ranges for different features can be different from each other so that, for 10 example, a first features has a value range between 0 and 100 and a second feature has a value range between -10 and +10, and so on. For calculating the raw music DNA, i.e., a raw collection 15 profile for the collection 100, the corresponding features Fl for all items are added and an average is calculated. In the exemplary case, where an arithmetic average has been calculated, this operation incurs additions of the individual features Fl and a subsequent division by the 20 number of participants. The result for the first feature is 4.4. Analogously, the averages for the corresponding features F2 to F5 are calculated, and they are, in this embodiment, 4.2, 3.0, 0.2 and 8.2. In one embodiment, the vector of these averages, i.e., the vectors (4.4; 4.2; 3.0; 25 0.2; 8.2) is the collection profile or music DNA, when the media data items are audio files. The components of the collection profile correspond to the individual features Fl to F5. Therefore, the collection 30 profile has five components, since five features have contributed to the collection profile. Depending on certain applications, the number of components of the collection profile can be smaller. In this situation, individual components could be averaged so that, for example, the 35 collection profile only has three components, wherein the first component corresponds to an average between the first feature Fl and the second feature F2, the second component corresponds to an average of the third feature F3 and the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 16 fourth feature F4, and the third component would correspond to the fifth feature F5. Furthermore, it becomes clear from the example in Fig. 4a 5 that each DNA component has the same number range as the underlying feature. Since, however, DNA components represent averages between features, the number range of a DNA component could also be smaller than the number range of an individual feature. 10 In the example of Fig. 4a, where the number of components of the music DNA is identical to the number of individual features used to extract the music DNA, a distance between the music DNA and a vector of features can easily be 15 calculated as indicated at 110. In the embodiment, the distance measure D corresponds to the geometric distance between a certain music DNA and a certain feature vector. Since the feature vectors, i.e., the metadata for a media data item, are quantitative vectors, and since the music 20 DNA is a quantitative vector as well, a quantitative distance measure D can be calculated between each feature vector and a music DNA. An exemplary equation is given at 110, but there can also be applied other distance measures, such as non-quadratic distances, higher order distances, 25 logarithmic distances, etc. A matching or non-matching decision can be taken when the distance measure D, which is exemplarily calculated at 110, is smaller than a predefined distance. This predefined 30 distance can be set by the user and determines the number of matching hits for a certain search. Additionally, one can also determine a match between the music DNA and a media data item having a feature vector which results in the smallest distance D among all other feature vectors in 35 the set. The music DNA is specifically useful, since it has a similar appearance as a feature vector. Therefore, the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 17 music DNA can be easily compared to a plurality of media data items to find a matching media data item and, alternatively or additionally, the music DNA can be compared to a plurality of different music DNAs. Thus, the 5 matching procedure, when a music DNA is matched with other music DNAs, is identical to the matching procedure, when a music DNA is matched with feature vectors for media data items. 10 Subsequently, reference is made to column 112, in which the distance measure D between the music DNA of the user and all music items in the user's collection 100 is indicated. When, for example, the predefined distance would be set to 1.0, then media data items 1, 2, 3 would be located as 15 matching items. When, however, the selection would be done in that only the best matching media file will be extracted, then the media data item having the ID number 2 would be selected as the result of the database matching operation. 20 As discussed in connection with step 84 or 86 in Fig. 3, a usage behavior or a user taste can be used to influence the values, i.e., the components of the music DNA. One way for introducing the usage behavior would be to weight the 25 feature vectors 100a, 100b before averaging to calculate the raw music DNA in order to increase the weight of a recently played or often played piece compared to a non recently played or seldom played piece. 30 Alternatively, the different components of the calculated music DNA can be weighted in order to account for a user taste. When, for example, one feature, such as feature F2, represents the percussiveness of an audio piece, and when the user wishes to have pieces having a high percussiveness 35 but, regarding the other features, matching with his music DNA, the user could implement his specific music taste in several ways. One way would be to synthetically increase the percussiveness in each feature vector so that, in the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 18 end, the features F2 of the different media data items have a higher value due to the additional weighting/refinement. Alternatively, the user could increase the percussiveness 5 average, i.e., the value 4.2 at 100 in Fig. 4a to, for example, 8.4. Then, automatically, the distance measure calculation will result in hits having a higher percussiveness. When, for example, the user wishes to have audio pieces selected, which have a percussiveness close to 10 4.2, the user could influence the distance measure calculation D at 110 in order to weight the second term by a weighting factor higher than 1. Then, the second feature is favored with respect to the other features in the distance calculation and a result of a database matching 15 operation will favor pieces having a good matching second features F2. In order to favor certain moods of music, several features can be collectively influenced by certain weighting 20 factors. To this end, a table is preferred in which a certain mood is mapped to certain weighting factors for specific features. Then, the discussed modifications of the feature vectors or the music DNA or the distance calculation can be performed using not only a single factor 25 as discussed, but a collection of preferably different factors. Fig. 4b illustrates the result list of a matching operation which has been performed by using, for example, the 30 matching rule 110 with or without influencing the DNA components or with or without having certain weighting factors in the distance calculation. When a matching operation is performed in the data base 100, a distance is obtained for each media item, and, typically, some media 35 items will match well, and other media items will not match well. Thus, the result list of a matching operation can be given as a sorted list where the media item having the smallest distance will be the first media item and where 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 19 the media item having the second to smallest distance will be number 2 etc. Fig. 4c illustrates a plot of the distance versus the 5 number in the sorted result list, and it becomes clear either from the result list in table representation of Fig. 4b or from the result list in the plot representation in Fig. 4c that there is a certain gap between media item 5 and media item 6 with respect to the distance. 10 Specifically, media items between No. 1 and No. 5 have distances between 0.6 and 2.2, which are quite close to each other, while the distance of media item No. 6 is much higher than the distance of media No. 5. 15 In accordance with the preferred embodiment, a confidence threshold is determined. Generally, for determining this confidence threshold, the change of the distance from one media item in the sorted list to the next media item in the sorted list is determined and, as soon as this change or 20 difference between two adjacent distances is larger than a threshold, a confidence threshold is determined as illustrated in Fig. 4c. Then, in accordance with this preferred embodiment, a 25 confidence measure is mapped to all media items which have a distance below the confidence threshold, and the confidence measure for all media items, which have a distance above the threshold, is set to 0. This results in a mapping as indicated in the last column of Fig. 4b, and 30 the mapping rule for mapping the confidence measures in this example is indicated as a linear mapping rule. In other implementations, however, there can also be a compressed or a logarithmic or generally non-linear mapping 35 rule for mapping the confidence measures to media items. Furthermore, the determination of the confidence threshold can be performed based on different criteria such as a mix between a distance increase between two adjacent media 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 20 items and the number of media items having a distance below the distance, in which the distance increase occurs. Other criteria can include a mix between a distance increase, a number of media items below a suggested threshold and an 5 absolute value of the distance. For example, it can be preset that all media items having a distance below a distance threshold of two receive a confidence measure above 0%. Generally, the confidence measure operation is to free the user from any statistically irrelevant data which 10 are obtained by using the quantitative collection profile. In other words, the user is not interested in any distance measures or in any "hits" in the sorted result list, which have such a high distance that it can be said for sure that the user is not interested in such a media item in the 15 current search. On the other hand, mapping a confidence measure between 0 and 100% will provide the user with a tool which is familiar to the user and which allows the user to quickly determine the usefulness of the results of a matching operation. 20 Fig. 4d illustrates a flow chart of the procedure which can be implemented as a method or as an apparatus in order to produce the examples of Figs. 4a, 4b, 4c in a general form. 25 In step 420, the matching operation comprises the calculation of a distance D(i) for each media item in the collection. In a subsequent step 421, the result list is sorted so that 30 D(i) increases from low to high as exemplarily indicated in Fig. 4b. In a step 422, a confidence threshold TH is determined. Furthermore, the distance D(TH) of the confidence threshold 35 TH is determined. In the Fig. 4c embodiment, the confidence threshold TH is equal to 5 which is the current number of the media item, which is equal to or below the confidence 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 21 threshold. The distance at this threshold is equal to 2.2 in the Fig. 4c embodiment. In a step 423, a confidence measure is mapped to each media 5 item having a distance below the threshold distance. The linear mapping rule is indicated at 419 in Fig. 4c in order to obtain the confidence measure value for each item i having a number in the result list which is below or equal to a threshold. 10 In a step 424, the final result list which only includes the media items having a number below or equal to the threshold, where each media item has an associated confidence measure is used. Different usage scenarios for 15 this final result list exist. In a first implementation, the result list can be stored for later use. In a different usage scenario, the result list can be played as it is so that, in a play list, number 1 is played. Subsequently number 2 is played, subsequently number 3 is played etc. 20 This will be done until the first media item having a confidence measure of 0% is played, which is media item Number 5 in the Fig. 4b embodiment. An alternative usage scenario is to perform a random play operation within this final result list so that only media items of this final 25 result list are played, but in a random order which is not influenced by the confidence measure order. A further usage scenario is to use this final result list for any kind of play list generation different from a random play list. 30 Fig. 6 illustrates an exemplary user DNA, which is a sophisticated user profile which is preferably automatically generated from the user's music collection. Specifically, the user DNA in Fig. 6 comprises qualitative features 120a, which, in this embodiment, indicate the 35 genre of the audio pieces and, additionally, comprises quantitative features 120b. Therefore, in a preferred embodiment, the quantitative features 102d are used and, additionally, qualitative features 120a are added. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 22 Depending on the implementation, specific maximum values 122 can be excluded from a matching operation or can be included. Therefore, a matching operation in a database can be performed without the energy strong feature und the 5 color bright feature or, depending on the user instruction, these extreme values can be used as well. This feature makes sure that, on the one hand, extraction errors are minimized and, on the other hand, a selection is 10 not completely dominated by a certain dominant feature unless the user wishes to include this dominant feature to a database operation. Subsequently, preferred metadata implementations are 15 discussed. MPEG-7 is providing XML formatted information about content, but in MPEG-7, this data is not directly linked or embedded into the audio content itself. The MP7 file format 20 is closing this gap by combining and embedding both - audio content and describing metadata in order to obtain MPEG-7 enriched files. Fig. 8 illustrates an embedding of MPEG-7 data into an MP3 25 file, an MP4 file (AAC file), a WMA file, an AC3 file, etc. To this end, an audio content portion 190 is provided as usual and a metadata portion 200 is added. This metadata portion 200 can be an ID3 V2 portion which has a public 30 MPEG-7 part and a premium MPEG-7 part 200a, 200b. The public portion 200a includes a selection of basic tags for free access, such as genre, music color, speech music discrimination. The premium MPEG-7 portion 200b includes additional tags for premium users, such as information on a 35 subgenre and features for tempo/BPM (beats per minute), percussiveness, vocal detection, aggressiveness, key, mood, segmentation, solo instrument, etc. These content-related features have been pre-calculated and entered into the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 23 second portion 200b. Preferably, this second portion 200b is encrypted so that only a premium user, i.e., a user having a certain right to use this portion or having paid for the usage of this portion, can use this portion while a 5 non-premium user can only use the first portion 200a. The public portion, however, includes at least describing metadata, such as artist, title, album, etc. Generally, a file format in accordance with the present 10 invention comprises an audio recording portion 190, which is preferably an MPEG audio portion, such as MP3, AAC or WMA. The metadata portion is preferably in the ID3 V2 (version 2) format, which includes standard metadata, such as artist, title, album, etc. 15 The public metadata portion 200a is in accordance with MPEG-7 and includes openly accessible metadata stored in MPEG-7 XML for recommendation and advanced navigation, such as genre, music color, speech music discrimination. 20 The additional metadata portion 200b includes additional metadata going beyond public MPEG-7 metadata. These metadata are stored in public-key encrypted format and can only be accessed by premium subscribers and operators or 25 distributors who have acquired a decryption key. Depending on the level of access rights, advanced tags, such as subgenre, tempo/BPM, percussiveness, vocal detection, aggressiveness, key, mood, segmentation, solo 30 instrument, etc., and P2P (peer-to-peer) license information are provided, or features which are also enabling tracing of works. Fig. 7 illustrates a feature overview including qualitative 35 and quantitative features. Qualitative features are genre, subgenre, speech/music discrimination, ID3 data. Quantitative features are tempo/beat determination, percussiveness, music color, vocal detection, 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 24 aggressiveness, segmentation, key, mood, solo instrument and soundslike fingerprint. The public MPEG-7 tags indicated at 200a in Fig. 8 are, as 5 stated before, openly accessible metadata stored in MPEG-7 XML format. This portion includes a selection of high-level tags describing a characteristic of a song for advanced search, such as genre, music color, speech music discrimination. This portion 200a furthermore provides the 10 base for a basic tag-based recommendation and advanced navigation. Additionally, information in the portion 200a provide the base for a basic automated playlist generation. The information included in the premium/second portion 200b 15 includes advanced high-level tags and low-level audio fingerprint information, such as melody, subgenre, tempo/BPM, percussiveness, vocal detection, aggressiveness, key, mood segmentation, solo instrument, etc. 20 The information in this portion allows a more exact search and navigation than the information in the first portion 200a. Furthermore, the information in portion 200b provides a song segmentation and segment-based search and comparison. It also allows a base for exact recommendation 25 and intelligent playlist generation. Preferably, the information in portion 200b is stored in a public-key encrypted format and, thus, can only be accessed by premium subscribers. 30 Furthermore, different levels of access restrictions can be provided, such as for premium consumers, rights holders, operators, etc. The portion 200b furthermore contains license and sales information, and this portion furthermore enables a peer-to-peer distribution and tracing of works. 35 The sophisticated high-level tags in the second portion 200b can furthermore be used to find cover versions and plagiarism. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 25 Generally, MPEG-7 is a multimedia content description interface, which is a standard for describing the multimedia content data that supports some degree of interpretation of the information meaning. MPEG-i is for 5 storage and retrieval of video and audio on storage media. MPEG-2 is for storage and retrieval in digital television. MPEG-4 codes content as objects and enables those objects to be manipulated individually or collectively. MPEG-7 is a standard for describing features of a content, and MPEG-21 10 is for digital rights management. Preferably, the extraction and generation of an MPEG-7 enriched song is performed by the following main steps. 15 In a first step, the audio material is decoded and the available ID3 information is imported. In a second step, the audio material is split into frequency bands and short-time frames. 20 In a third step, the calculation of low-level features for each frame, such as modulation spectrum, noise likeness, etc., is performed. 25 In a fourth step, a soundslike fingerprint for low-level recommendation is generated. In a further step, the song segmentation is calculated via self-similarity information to distinguish chorus, verse, 30 intro and solos. In a further step, high-level features are calculated based on low-level features either for the whole song or for each segment. 35 Then, in a final step, the extracted data is embedded into the audio file in order to create an MPEG-7 enriched song. Specifically, the new feature information not already 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 26 available in the first portion 200a is inserted into portion 200b of Fig. 8. Generally, an apparatus for generating a collection of 5 different media data items in accordance with a preferred embodiment of the present invention is illustrated in Fig. 10. The apparatus comprises a feature extractor 300 for extracting, for a plurality of media data items, at least two different features described in a content of a media 10 data item using the media data items. Specifically, the feature extractor can be implemented in order to accurately calculate these features, such as the high-level features as discussed before. Due to the flexibility and variety of these features, the features require a variable amount of 15 bits to be stored in the metadata portion 200. In order to provide flexibility and future adaptability, a metadata generator 310 is provided. The metadata generator generates a metadata portion 200 for each media data item of the plurality of media data items, where the metadata generator 20 is adapted to generate a metadata header 320 as illustrated in Fig. 11, to generate an information on a content characteristic represented by each feature, and to generate a metadata payload portion 330 associated with the metadata header, the metadata payload portion having the at least 25 two different features for the media data item. In the Fig. 11 embodiment, the header 320 therefore comprises the length information for the complete payload portion or comprises, for each feature separately, a separate length information. Other ways to indicate where the information 30 for a certain feature ends and the information for a further feature starts are useful in addition, such as the implementation of synchronization marks, etc. Furthermore, in order to allow future extension 35 capabilities for adding additional features, information on the meaning of each feature, i.e., feature Fl is, for example, a mood feature and F2 is, for example, an aggressiveness feature, etc., is included as well. This 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 27 information can be included in a standardized order or can be explicitly included in the metadata header 320. When Fig. 11 and Fig. 8 are compared, it is to be noted 5 that metadata header 320 and the metadata payload portion 330 can, together, constitute the second portion 200b. Alternatively, the metadata header 320 can be a header relating to the first portion 200a as well as to the second portion 200b. In this case, the second portion 200b would 10 include the metadata header 320, and the information included in the first portion 200a of Fig. 8, i.e., the public tags, would form an additional metadata payload portion which might precede the metadata payload portion 330 or which might follow the metadata payload portion 330 15 in a certain format. Preferably, the metadata header furthermore comprises information on additional fields of the metadata payload portion, which are not illustrated in Fig. 11, i.e., access 20 information, license and sales information or any additional information related to the way how the media data file associated with the metadata information can be used, distributed, etc. In this scenario, the metadata header 320 would include additional length information and 25 description information of such additional metadata payload portion fields as well. In a preferred embodiment, the feature extractor is operative to detect a newly stored media data item in the 30 collection. Then, the profile creator 20 is operative to retrieve an earlier collected profile and to manipulate the earlier collected profile based on the features of the newly stored media data item to create an updated collection profile. This feature allows to reduce the 35 amount of time and resources for fully calculating a new profile when the user has added a single new item only. In an embodiment, the calculation of the profile takes place in the background, i.e., when the user does not use the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 28 device and the device is running in an idle state. This makes sure that as soon as the user wishes to use her or his device, the user can enjoy the full computing capabilities of the device, since the profile calculation 5 is stopped in this case. Subsequently, reference will be made to Fig. 2 and Figs. 5a to 5c in order to illustrate a plurality of ways of how the inventive collection profile can be advantageously used for 10 performing a communication between two entities. Generally, as illustrated in Fig. 2, the inventive apparatus for communicating information from a first entity to a second entity comprises the collection profile information generator 70, which actually calculates the collection 15 profile from a collection of metadata items or which receives a readily calculated profile information via a wireless or a cable interface. In the first possibility, in which the collection profile information generator actually calculates the collection profile, the collection profile 20 is from the first entity, when the collection profile information generator is located in the first entity. In the alternative, in which a readily calculated collection profile is received by item 70 via a wireless or cable interface, the collection profile refers to a collection of 25 media data items within a first entity, wherein item 70 is located in the second entity. The information handler 60 uses the collection profile in order to find a media data item having a matching feature 30 vector or to find a user or a user group having a matching collection profile. In the former case, a collection profile is compared to a plurality of different feature vectors for media data items, where in the second case, a collection profile obtained from item 70 is compared to the 35 plurality of collection profiles of other entities. Based on this matching operation, information is sent or received, wherein this information not necessarily has to 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 29 include the matching result or the collection profile but can relate to any product/service or any information related to a product/service determined in response to the collection profile matching operation. 5 In a certain embodiment, which is illustrated in Fig. 5a, the collection profile information generator resides in a sound processor/generator of an audio listening room, such as a bar, a lobby or anything else, or provides streaming 10 data for a certain Internet user group. Therefore, the collection profile information generator 70 represents a second entity and collects profiles from users in a user group or a room, as indicated at 500. The users in a group or a room represent the first entities. 15 In a step 502, the device at the second entity represented by the information handler 60 of Fig. 2 calculates an average profile among the profiles from the users received in step 50. Then, based on this average profile, audio 20 pieces are searched in a database, which match with this average profile, as indicated at step 504. Particularly, a database matching operation as discussed in connection with Fig. 4a is performed by using the average profile on the one hand and a collection of feature vectors or an audio 25 database on the other hand in order to find matching audio pieces. Then, in a step 506, the selected audio pieces for the user group or the room are played or streamed, when an Internet user group scenario is considered. 30 Fig. 5b illustrates an alternative embodiment for an advantageous usage of the collection profile, where all steps take place in the first entity, i.e., in the user premises. Specifically, the user determines his user profile by analyzing his collection of media data items 35 indicated at 510 in Fig. 5b. Based on his calculated collection profile, the user finds a suitable user group by matching different group profiles with his own profile, as indicated at 512. In this embodiment, the result of the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 30 matching operation in step 512 performed by the information handler 60 in Fig. 2 will only output the best matching result, since this is what the user wants to have. In other words, the user wants to join a user group where the other 5 members of the user group have the same or a very similar music taste. Then, in a step 514, the information handler 60 of Fig. 2, which already performed the operation 512, now sends a join 10 request message to the selected user group, as indicated in step 514. Alternatively, however, the user could send his own user profile determined in step 510 to a remote user group matching service. Then, the determination of the user profile will be performed in the first entity 510 and the 15 user group matching service performed in step 512 will be in the second entity, and the step 514 would be performed in a different way, i.e., that the (with respect to the user) remote second entity will send information on the matching user group to the user. 20 A further alternative of an advantageous usage of the collection profile is illustrated in Fig. 5c. The steps in Fig. 5c will take place in an entity which is different from the entity from which the user profile comes from. In 25 step 520, a user profile of a different entity is determined such as by wireless or cable transmission. Then, a matching audio/video data item is searched and found in the own database, as indicated in step 530. This step is performed in the same entity in which the user profile of 30 the other entity is received. In a step 540, the matching item found in step 530 is sent to the other entity. This mechanism consisting of steps 520 to 540 may be used for a peer-to-peer distribution of media data items, where the other entity can be sure to receive only a metadata item 35 which fulfils the taste of the other entity due to the fact that the selection was done based on the user DNA of the other entity. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 31 An analogous procedure is illustrated by steps 550 and 560. In step 550, a certain entity may use a user profile of a remote entity for finding a matching product/service for such a user profile. To this end, the information handler 5 60 of Fig. 2 will perform a matching operation, which may be similar to the matching operation regarding the distance in Fig. 4a, when each available product/service has a certain feature vector. Alternatively, a user profile may be input in a certain function, such as a table lookup in 10 order to find a best-matching product/service. Then, in step 560, an advertisement for a matching product or a matching service is delivered to the other entity by the information handler 60. Thus, the advertisement 15 pointing to a certain product or service has been determined and forwarded to the other entity based on the user DNA of the other entity. In accordance with preferred embodiments of the present 20 invention, the inventive collection profile enables new dimensions with community building. The inventive user profile will enable various tailored services, like self pushing files in a peer-to-peer (super distribution) network, a tailored background for visitors in music 25 bars/clubs and community Internet radio (streaming/prodcasting). Such Bach DNA communities can become interesting targets for advertisers having products matching the profile of a 30 Bach DNA, as discussed with respect to items 550 and 560 of Fig. 5. In the preferred usage scenarios described above, the music DMA has always been used to influence/trigger/determine a 35 communication message from a first entity to a second entity, where these two entities were users/persons/devices at different geographic locations. Fig. 5d now illustrates an embodiment, in which the user DNA is used by the user 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 32 herself or himself for generating her or his own play list or, generally stated, by locating a media item which is to be played/rendered for her or his own enjoyment. 5 To this end, the user retrieves the collection profile of her or his own collection as indicated in step 581. In one implementation, this collection profile is used in a matching operation so that the user finds, from her or his own collection, a media item which has the smallest 10 distance. Alternatively, the user can also determine, from her or his own collection, a final result list as discussed in connection with Fig. 4c where the confidence measures are indicated for this own collection. In this implementation, the information handle 60 of Fig. 1 or the 15 information handle 60 of Fig. 2 will use a music DNA for matching the data base of the user, and the result will be the play list generation or the rendering of a media item from the user's own collection for the enjoyment of the user, i.e., by the user device in which the matching 20 operation has been performed. In the preferred embodiment illustrated in Fig. 5d, however, the user will modify her or his collection profile of her or his own collection to obtain a modified profile 25 as indicated in step 582. The modification can be performed by the user so that the user can set his current taste. When, for example, the user is in an aggressive mood, then the user can modify her or his raw collection profile so that a more aggressive profile is generated which, however, 30 nevertheless resembles the user's profile. In the end, when a matching operation is performed using this modified profile as indicated in step 583, the user will be in the position to enjoy a media item which is, of course, more aggressive, but which nevertheless conforms with the user's 35 own music taste. Therefore, the user does not just hear an aggressive audio track, for example, but is in the position to hear a very specific audio track which, on the one hand, is more aggressive and which, on the other hand, however, 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 33 nevertheless has characteristics which conform with the user's general taste. Other alternatives for modifying the collection profile in 5 step 582 are external situations. An external situation can, for example, be a time of the day so that the music player of the user calculates a different music DNA based on the raw collection profile of step 581, for example for the morning or for example for the evening of a day. 10 A different external situation can be the day in a week so that, for example, a different modified profile is calculated for a Sunday or a Saturday compared to, for example, a Tuesday or Wednesday. 15 A further external situation can be a geographic location so that the user's modified DNA is different when the user is, for example, in Europe or in Asia or in the US. 20 Based on this modified profile, a matching operation in the own media collection is performed as indicated in step 583, and the result of this matching operation is used in the step 584 in order to play the matching result for the user's own enjoyment or for generating a play list in the 25 user media device for playing certain media items in a certain order. The Bach DNA will be used in a communication scenario connecting consumers with consumers, artists with artists, 30 artists with consumers in order to calculate/create unique communities. This service can be based on a fee paid by the end user, the Internet service provider or the advertiser for using the advantageous Bach DNA tags included in section 200b of Fig. 8 for the connecting consumers and 35 artists. Subscription models such as download and streaming, which are based on a flat fee (compulsory or voluntary) or 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 34 advertisement-based services are allowing end users to access unlimited music and tailor it for the user's personal DNA. 5 A super distribution extension to the subscription system allowing the consumer to redistribute (share) music in a community with other consumers and artists with similar music DNAs is made possible by the inventive user DNA. 10 Bach DNA communities will become interesting targets for advertisers having products matching the profile of a Bach DNA and give more value per click. Such a situation is illustrated in Fig. 9, where, for certain moods, such as relax, exciting, happy, angry, fear, sorrow, sad, certain 15 products illustrated in Fig. 9 are associated. The mood can be a certain explicit feature or can be calculated by a combination of different features. In this case, a rule for combining certain components of a collection profile vector in a certain way results in a certain number, where this 20 number is then mapped to a certain mood. The association of a certain product to a certain mood can be done via a table and the selection of a certain advertisement as soon as a certain product has been identified is straight forward. 25 Thus, the Bach user DNA together with the Bach DNA tags is the key to perfectly personalized music recommendation. Shop systems and communities can compare user DNA and thus link to songs, artists and even other users with fitting 30 music taste. Different to existing statistical solutions, such as "people who have bought this also bought that...", the DNA approach provides a measure which really corresponds to the 35 user and which even works outside of the mainstream and allows always up-to-date suggestions. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 35 The Bach user DNA can be applied to user groups to provide several unique services, such as automated music recommendation or self-pushing files in a peer-to-peer (super distribution) scenario. Another service is the 5 analysis of the music taste and of streams within the community, and a further service is the automated generation of live radio streams fitting that taste for a (community) Internet radio and even for background music in bars or clubs depending on their customers profiles, as 10 discussed before. As already discussed in connection with Fig. 3, item 88, it is a feature of the present invention to enrich a music DNA profile of a sound collection stored in a customer's 15 handset by additional useful information. Such an additional information is, for example, the brand of the handset and the model of the handset as well as the mobile operator connecting the handset to a communication system or to the inventive match-making service. Therefore, the 20 handset brand and model as well as the mobile operator are identified preferably in real-time. In the new online music world, the handset is the new carrier taking over from the CD. Therefore, the characteristics of the handset and connecting operator are preferably included in the customer 25 music DNA profile. The advantage is to discover any correlation between type of handset and type of music profile. This provides an added value even more, when the MPEG-21 standard related to digital rights management becomes reality. Like the ID3 V2 tags complement the 30 inventive DNA with identification title and producer/rights holders of a soundtrack, the handset characteristics tags will identify the type of carrier and the name of carrier manufacturer. 35 The music DNA/collection profile of the user allows exciting application scenarios for a user which are not only related to finding matching user groups, finding matching advertisements or finding songs corresponding to 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 36 the user taste and, additionally, corresponding to other issues related to the user. The music DNA additionally allows to open completely new scenarios when, for example, two individuals swap their music DNAs. In this situation, a 5 first entity is in the position to experience the world of a second different entity, which can result in astonishing experiences for both users specifically when both users want to learn more about each other. Therefore, a music DNA can be a key feature in building and experiencing an alter 10 ego situation. The described embodiments are merely illustrative for the principles of the present invention. It is understood that modifications and variations of the arrangements and the 15 details described herein will be apparent to others skilled in the art. It is the intent, therefore, to be limited only by the scope of the impending patent claims and not by the specific details presented by way of description and explanation of the embodiments herein. 20 The inventive user collection profile can be stored on a digital storage medium or a machine-readable carrier, in particular, a disc, a DVD or a CD having electronically readable control signals stored thereon, which co-operate 25 with programmable computer systems such that the user profile is displayed or used for database matching or used for a communication message. Depending on certain implementation requirements of the 30 inventive methods, the inventive methods can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, in particular, a disc, a DVD or a CD having electronically-readable control signals stored thereon, which co-operate with programmable 35 computer systems such that the inventive methods are performed. Generally, the present invention is therefore a computer program product with a program code stored on a machine-readable carrier, the program code being operated 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 37 for performing the inventive methods when the computer program product runs on a computer. In other words, the inventive methods are, therefore, a computer program having a program code for performing at least one of the inventive 5 methods when the computer program runs on a computer. In the claims which follow and in the preceding description of the invention, except where the context requires otherwise due to express language or necessary implication, 10 the word "comprise" or variations such as "comprises" or "comprising" is used in an inclusive sense, i.e. to specify the presence of the stated features but not to preclude the presence or addition of further features in various embodiments of the invention. 15 5080792 1 (GHMatters) P86532.AU.1 4/02/2014

Claims (21)

1. Apparatus for generating a collection profile of a 5 collection of different media data items, the media items being audio files, comprising: a feature extractor for extracting at least two different features describing a content of a media 10 data item from a plurality of media data items of the collection; and a profile creator for creating the collection profile by combining the extracted features or weighted 15 extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the collection, 20 wherein the apparatus further comprises an input for receiving information on a music taste of a user of the collection of different audio files, and wherein the profile creator is operative to create a 25 raw collection profile without information on a user behavior logged by the profile creator or information on the music taste, and to weight the raw collection profile using weights derived from the information on the music taste or the user behavior to obtain the 30 collection profile.
2. Apparatus in accordance with claim 1, in which the profile creator is adapted to generate the quantitative collection profile using an addition, a 35 subtraction, a multiplication or a division of the extracted features or the weighted extracted features. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 39
3. Apparatus in accordance with claim 1 or 2, in which the collection is stored on a mobile device having a storage for media data items; and 5 in which the profile creator is operative to add information on a manufacturer of the mobile device or information on a specific model of the mobile device or information on a communication service provider to the collection profile. 10
4. Apparatus in accordance with any one of the preceding claims, in which each media data item of the plurality of media data items has a data portion and an associated metadata portion, the metadata portion 15 having the at least two different features, and in which the feature extractor is operative to parse the metadata portion and to read the features from the metadata portion. 20
5. Apparatus in accordance with claim 4, in which the metadata portion has a first non-encrypted part and a second encrypted part, 25 in which the feature extractor is operative to retrieve a decryption key and to decrypt the second encrypted part, in which the profile creator is operative to use more 30 extracted features from the encrypted part than from the non-encrypted part for creating the collection profile, and in which the second encrypted part comprises features 35 describing the content of a media data item more detailed than a feature included in the first encrypted part. 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 40
6. Apparatus in accordance with any one of the preceding claims, in which the features describing the content comprise information on energy-strong, energy-mid, energy-low, color-bright, color-dark, tempo/beat, 5 percussiveness, music color, vocal detection, aggressiveness, segmentation, key, mood, solo instrument or fingerprint.
7. Apparatus in accordance with any one of the preceding 10 claims, in which the profile creator is operative to analyze the extracted features for similar features or to exclude features having extreme values.
8. Apparatus in accordance with any one of the preceding 15 claims, in which the feature extractor is operative to detect a newly stored media data item in the collection, and in which the profile creator is operative to retrieve an earlier collected profile and to manipulate the earlier collected profile based on 20 the features of the newly stored media data item to create an updated collection profile.
9. Apparatus in accordance with any one of the preceding claims, in which the first feature describes a first 25 content characteristic by a first number in a first number range, in which the second feature describes a second different content characteristic by a second number in 30 a second number range, and wherein the profile creator is operative to combine the first numbers of the plurality of media data files to obtain a combined first number, 35 wherein the profile creator is operative to combine the second numbers of the plurality of media data files to obtain a combined second number, and 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 41 wherein a vector having the combined first number or a derived first number derived from the combined first number, and having the combined second number or a 5 derived second number derived from the combined second number comprises the content profile.
10. Apparatus in accordance with claim 9, in which the combined number is a number within a number range 10 being the same as a number range of the corresponding feature or being smaller than the number range of the corresponding feature.
11. Apparatus for generating a collection profile of a 15 collection of different media data items, the media items being audio files, comprising: a feature extractor for extracting at least two different features describing a content of a media 20 data item from a plurality of media data items of the collection; and a profile creator for creating the collection profile by combining the extracted features or weighted 25 extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the collection, 30 wherein the profile creator is operative to weight the extracted features using weights derived from the information on a music taste or a user behavior logged by the profile creator, and to combine the weighted extracted features to obtain the collection profile. 35
12. Apparatus for communicating information from a first entity to a second entity, comprising: 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 42 a collection profile information generator for generating information on a first collection profile of a first collection at the first entity or for generating a second collection profile of a second 5 collection at the second entity, the first or second collection profile representing a quantitative fingerprint of a content of the corresponding collection of metadata files at the first entity or at the second entity, the first or second collection 10 profile being generated by an apparatus for generating a collection profile in accordance with claim 1 or 11; and an information handler for using the first collection 15 profile or the second collection profile within a matching operation and for transmitting information based on the matching operation or a collection profile or for receiving a message based on a matching operation or the collection profile. 20
13. Apparatus in accordance with claim 12, in which the apparatus is a mobile device representing the first entity, 25 in which the apparatus is operative to generate the first collection profile, and in which the information handler is operative to send a message comprising information on the first 30 collection profile to the second entity.
14. Apparatus in accordance with claim 12, where the apparatus is a device representing the first entity, and where the apparatus is operative to determine 35 information on the collection profile, in which the message handler is operative to retrieve information on a user group having users with a 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 43 similar collection profile as the first collection profile and to send a message to the second entity requesting a joining to a user group, or 5 to retrieve information on a user having a similar profile and to send a message to the second user, the message comprising information on the first entity or a media data item from the collection, or 10 to retrieve information on an advertisement tailored to a product or service related to a collection profile and to send a message to the advertiser requesting advertisements tailored to the collection profile or to deliver an advertisement for a product 15 to a different entity, or to remotely analyze stored media data items for a matching profile and to selectively request a download of a media data item having the matching profile. 20
15. Apparatus in accordance with any one of claims 12 to 14, in which the collection profile information generator is operative to receive, via cable or via a wireless communication device, information on profiles 25 of a plurality of entities located in a limited space around the second entity or being in a network user group, in which the information handler is operative to 30 calculate a combined profile for the plurality of first entities, in which the information handler is operative to select a media data item matching with the combined 35 profile, and in which the information handler is operative to cause a rendering of the selected media data item in the 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 44 limited space or to cause a data streaming of the selected media data item to the network user group.
16. Apparatus in accordance with any one of claims 12 to 5 15, in which the information handler is operative to calculate a result list from the matching operation, to map confidence measures for a selected group of 10 media items from the result list, and to use a final result list having the mapped confidence measures for transmitting data to the user.
17. Apparatus in accordance with claim 12, 15 wherein the collection profile information generator is operative to retrieve the collection profile of the first entity, 20 wherein the information handler is operative to modify the collection profile based on a current taste of a user of the first entity or based on an external situation, 25 wherein the information handler is operative to perform the matching operation with the modified profile in the media collection of the first entity, and 30 wherein the information handler is operative to play a result of the matching operation at the first entity or to generate a play list at the first entity based on the result of the matching operation. 35
18. A method of generating a collection profile of a collection of different media data items, the media items being audio files, comprising: 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 45 extracting at least two different features describing a content of a media data item from a plurality of media data items of the collection; and 5 creating, by a profile creator, the collection profile by combining the extracted features or weighted extracted features for the plurality of media data items so that the collection profile represents a quantitative fingerprint of a content of the 10 collection, wherein the method further comprises the step of receiving, by an input, information on a music taste of a user of the collection of different audio files, 15 and wherein the step of creating comprises: creating a raw collection profile without information 20 on a user behavior logged by the profile creator or information on the music taste, and weighting the raw collection profile using weights derived from the information on the music taste or the user behavior to obtain the collection profile. 25
19. A method of generating a collection profile of a collection of different media data items, the media items being audio files, comprising: 30 extracting at least two different features describing a content of a media data item from a plurality of media data items of the collection; and creating, by a profile creator, the collection profile 35 by combining the extracted features or weighted extracted features for the plurality of media data items so that the collection profile represents a 5080792 1 (GHMatters) P86532.AU.1 4/02/2014 46 quantitative fingerprint of a content of the collection, wherein the step of creating comprises weighting the 5 extracted features using weights derived from the information on a music taste or a user behavior logged by the profile creator, and combining the weighted extracted features to obtain the collection profile. 10
20. A method of communicating information from a first entity to a second entity, comprising: generating information on a first collection profile of a first collection at the first entity or 15 generating a second collection profile of a second collection at the second entity, the collection profile representing a quantitative fingerprint of a content of the corresponding collection of metadata files at the first entity or at the second entity, the 20 first or second collection profile being generated by a method for generating a collection profile in accordance with claim 18 or 19; using the first collection profile or the second 25 collection profile within a matching operation; and transmitting information based on the matching operation or a collection profile or receiving a message based on a matching operation or the 30 collection profile.
21. A computer program having a program code for performing a method in accordance with any one of claims 18, 19 or 20, when running on a computer. 35 5080792 1 (GHMatters) P86532.AU.1 4/02/2014
AU2014200583A 2008-08-28 2014-02-04 Apparatus and method for generating a collection profile and for communicating based on the collection profile Abandoned AU2014200583A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU2014200583A AU2014200583A1 (en) 2008-08-28 2014-02-04 Apparatus and method for generating a collection profile and for communicating based on the collection profile

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US61/092,528 2008-08-28
EP08015229.1 2008-08-28
EP08022058.5 2008-12-18
AU2009287066A AU2009287066A1 (en) 2008-08-28 2009-08-20 Apparatus and method for generating a collection profile and for communicating based on the collection profile
AU2014200583A AU2014200583A1 (en) 2008-08-28 2014-02-04 Apparatus and method for generating a collection profile and for communicating based on the collection profile

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
AU2009287066A Division AU2009287066A1 (en) 2008-08-28 2009-08-20 Apparatus and method for generating a collection profile and for communicating based on the collection profile

Publications (1)

Publication Number Publication Date
AU2014200583A1 true AU2014200583A1 (en) 2014-02-20

Family

ID=50114221

Family Applications (1)

Application Number Title Priority Date Filing Date
AU2014200583A Abandoned AU2014200583A1 (en) 2008-08-28 2014-02-04 Apparatus and method for generating a collection profile and for communicating based on the collection profile

Country Status (1)

Country Link
AU (1) AU2014200583A1 (en)

Similar Documents

Publication Publication Date Title
US8407224B2 (en) Apparatus and method for generating a collection profile and for communicating based on the collection profile
KR101538873B1 (en) Syndication including melody recognition and opt out
KR101242664B1 (en) Method and device for generating a user profile on the basis of playlists
US7613736B2 (en) Sharing music essence in a recommendation system
JP5259212B2 (en) Music-linked advertisement distribution method, apparatus and system
US8914384B2 (en) System and method for playlist generation based on similarity data
US20100063975A1 (en) Scalable system and method for predicting hit music preferences for an individual
US20100088327A1 (en) Method, Apparatus, and Computer Program Product for Identifying Media Item Similarities
JP5143620B2 (en) Audition content distribution system and terminal device
US20090138457A1 (en) Grouping and weighting media categories with time periods
JP2008165759A (en) Information processing unit, method and program
US20090171995A1 (en) Associating and presenting alternate media with a media file
US11169767B2 (en) Automatically generated media preview
CN105144723A (en) Associating audio tracks of album with video content
WO2009090358A1 (en) Playlist manager
KR20110075259A (en) Online music service apparatus for generating music playlist considering user's preferences and ratings and method thereof
JP4599141B2 (en) Information providing system, information providing server, and computer program
CN113747233B (en) Music replacement method and device, electronic equipment and storage medium
JP2005018205A (en) Musical piece retrieving system and method, and musical piece retrieving device
AU2014200583A1 (en) Apparatus and method for generating a collection profile and for communicating based on the collection profile
JP5158450B2 (en) Information processing apparatus and method, and program
WO2007102109A2 (en) System and method of determinng personal music preferences
WO2009130458A1 (en) Complete my artist

Legal Events

Date Code Title Description
MK4 Application lapsed section 142(2)(d) - no continuation fee paid for the application