WO2007102109A2 - Système et procédé de détermination de préférences musicales personnelles - Google Patents

Système et procédé de détermination de préférences musicales personnelles Download PDF

Info

Publication number
WO2007102109A2
WO2007102109A2 PCT/IB2007/050692 IB2007050692W WO2007102109A2 WO 2007102109 A2 WO2007102109 A2 WO 2007102109A2 IB 2007050692 W IB2007050692 W IB 2007050692W WO 2007102109 A2 WO2007102109 A2 WO 2007102109A2
Authority
WO
WIPO (PCT)
Prior art keywords
audio
user
low level
music
played
Prior art date
Application number
PCT/IB2007/050692
Other languages
English (en)
Other versions
WO2007102109A3 (fr
Inventor
Alexander Petrus Paulus Vrijsen
Original Assignee
Koninklijke Philips Electronics N.V.
U.S. Philips Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninklijke Philips Electronics N.V., U.S. Philips Corporation filed Critical Koninklijke Philips Electronics N.V.
Publication of WO2007102109A2 publication Critical patent/WO2007102109A2/fr
Publication of WO2007102109A3 publication Critical patent/WO2007102109A3/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • H04N21/44222Analytics of user selections, e.g. selection of programs or purchase activity
    • H04N21/44224Monitoring of user activity on external systems, e.g. Internet browsing
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/102Programmed access in sequence to addressed parts of tracks of operating record carriers
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/11Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/27Arrangements for recording or accumulating broadcast information or broadcast-related information
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/37Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/46Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for recognising users' preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/56Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H60/58Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41Structure of client; Structure of client peripherals
    • H04N21/414Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
    • H04N21/41407Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/432Content retrieval operation from a local storage medium, e.g. hard-disk
    • H04N21/4325Content retrieval operation from a local storage medium, e.g. hard-disk by playing back content from the storage medium
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/439Processing of audio elementary streams
    • H04N21/4394Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8106Monomedia components thereof involving special audio data, e.g. different tracks for different languages
    • H04N21/8113Monomedia components thereof involving special audio data, e.g. different tracks for different languages comprising music, e.g. song in MP3 format
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/162Authorising the user terminal, e.g. by paying; Registering the use of a subscription channel, e.g. billing
    • H04N7/163Authorising the user terminal, e.g. by paying; Registering the use of a subscription channel, e.g. billing by receiver means only
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/095Identification code, e.g. ISWC for musical works; Identification dataset
    • G10H2240/101User identification
    • G10H2240/105User profile, i.e. data about the user, e.g. for user settings or user preferences
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B2220/00Record carriers by type
    • G11B2220/60Solid state media
    • G11B2220/61Solid state media wherein solid state memory is used for storing A/V content

Definitions

  • This invention pertains to a music playback system and method of playing back music, and more particularly, to a method of determining personal musical preferences of a user, and an audio player that determines personal musical preferences of a user.
  • a user's musical preferences can be characterized based on the music to which he/she listens. It would also be desirable to provide an audio player that includes such a feature. It would be further desirable to provide a method which recommends songs to a user based on their characterized musical preferences.
  • the present invention is directed to addressing one or more of the preceding concerns.
  • a method of characterizing an individual's musical preferences comprises: receiving audio samples for each portion of music that is played to a user through an audio playback system; extracting from the audio samples a plurality of low level features of each portion of music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for each portion of music that has been played to the user through the audio playback system; averaging each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over a time period; and storing in memory the averaged low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over the time period.
  • a system for playing music comprises: memory; an audio playback system; a user interface adapted to receive a request from a user to play music through the audio playback system; and a processor configured to execute a method comprising the steps: receiving audio samples for all music that is played to a user through the audio playback system; extracting from the audio samples a plurality of low level features for all music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for all music that is played to the user through the audio playback system over a time period; and storing in the memory the computed low level audio descriptor values for all music that has been played to the user through the audio playback system over the time period.
  • a method of characterizing an individual's musical preferences comprises: receiving audio samples for all music that is played to a user through an audio playback system; extracting from the audio samples a plurality of low level features for all music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for all music that is played to the user through the audio playback system over a time period; and storing in memory the computed low level audio descriptor values for all music that has been played to the user through the audio playback system over the time period.
  • FIG. 1 is a functional block diagram of an audio player
  • FIG. 2 is a flowchart illustrating on embodiment of a method of characterizing musical preferences of a user of an audio playback system.
  • FIG. 1 is a functional block diagram of an audio player 100.
  • the various "parts" shown in FIG. 1 may be physically implemented using a software-controlled microprocessor, hard- wired logic circuits, or a combination thereof. Also, while the parts are functionally segregated in FIG. 1 for explanation purposes, they may be combined in any physical implementation.
  • Audio player 100 includes a processor 120, memory 140, one or more input-output ports 160, audio subsystem 170, and user interface 190.
  • user interface 190 includes a graphical user interface 195.
  • audio player 100 also includes video subsystem 180. It should be understood that when audio player 100 does not include graphical user interface 195, then video subsystem 180 may be omitted.
  • audio player 100 may play back both audio files and audio-video files, such as music videos.
  • Audio player 100 may be integrated into a portable device that includes one or more other functions, such as a portable computing device, a personal digital assistant (PDA), a mobile telephone, a digital camera, a portable e-mail terminal, etc.
  • a portable computing device such as a portable computing device, a personal digital assistant (PDA), a mobile telephone, a digital camera, a portable e-mail terminal, etc.
  • PDA personal digital assistant
  • the processor 120, memory 140, and user interface 190 may be used to perform functions of audio player 100, and also to perform one or more additional functions such as a portable computing, personal information management, mobile telephony, a digital photography, e-mail transmission and reception, etc.
  • the audio player may be provided as a software program executed by a processor of a general purpose personal computer (PC), such that there are no dedicated hardware components to the audio player.
  • processor 120 described below may be performed by the microprocessor of the PC
  • the functions of memory 140 may be provided by the memory of the PC
  • the functions provided by audio subsystem 170 may be provided by the audio systems of the PC
  • the functions of user interface 190 may be provided by a graphical user interface 195 of the PC, etc.
  • Audio player 100 plays music.
  • Music may include digital music files, which may be stored in memory 140.
  • Such music files may include digital audio files, and may also include the audio component of digital audio-video files, such as music videos.
  • Audio files played by audio player 100 may have any convenient file format, such as mp3, wma, wav, MPEG, etc.
  • audio player 100 may play music that is not stored locally, and instead is streamed to audio player 100 from an external source (e.g., a file on a local area network; an Internet site; etc.) via input/output port 160.
  • audio player 100 may also play some music that originates in analog format. Such music may be received via input/output port 160 (e.g., an RCA jack), or another device with which audio player 100 is integrated, such as an FM radio.
  • processor 120 is configured to execute one or more software algorithms in conjunction with memory 140 and user interface 190 to provide the functionality of audio player 100.
  • processor 120 includes its own memory (e.g., nonvolatile memory) for storing executable software code that allows it to perform the various functions of audio player 100.
  • the executable code may be stored in designated memory locations within memory 140.
  • audio player 100 may include a variety of memory components to perform a variety of functions.
  • it may include: processor cache memory; non- volatile memory (e.g., ROM or Flash memory) for storing executable code of software algorithms executed by processor 120; volatile memory (e.g., SRAM/DRAM, etc.) necessary to execute the software algorithms; and additional non- volatile memory (e.g., Flash memory, hard disk drive, or optical or magnetic disc, etc.) for storing the digital music files and, beneficially, associated metadata.
  • Memory 140 shown in FIG. 1 is intended to represent memory performing all of the functions described above. Memory 140 stores data in response to the processor 120. Such data may include user preferences for operation of audio player 120 by one or more users.
  • Such data may also include data for organizing music files, such a playlists, directories, and other mechanisms for grouping, separating, and otherwise generally organizing music files stored in audio player 100.
  • memory 140 also stores music files for playback by audio player 100.
  • music files may be stored in a separate memory other than memory 140.
  • Input/output port(s) 160 facilitates the transfer of music into and out of audio player 100.
  • Input/output port(s) 160 may include one or more USB ports, Firewire ports, Bluetooth ports, wireless Ethernet ports, RCA jacks, etc.
  • Audio subsystem 170 includes audio circuitry 172, audio speaker 176, and audio output jack(s) 178 (e.g., a headphone jack). In some cases, some or all of audio circuitry 172 may be integrated with processor 120. Also, either audio speaker 176 or audio output jack(s) 178 may be omitted.
  • Video subsystem 180 includes audio circuitry 182 and video display device 184. In some cases, some or all of video circuitry 182 may be integrated with processor 120.
  • User interface 190 includes objects that allow a user to control the operation of audio player 100.
  • user interface 190 includes a graphical user interface 195.
  • graphical user interface 195 may be omitted.
  • the objects can include one or more manual buttons 192, knobs 194, click- wheels 196, etc. mounted on a case that houses audio player 100.
  • objects may also include buttons, check boxes, data entry (e.g., text) boxes, pull down menus, etc, that are displayed via display screen 184.
  • Such objects may be selected via click-wheel 196, or another scrolling device, such as a mouse, a trackball, a track-pointer, or by arrow buttons, etc.
  • display screen 184 may be a touchscreen displaying buttons, check boxes, data entry (e.g., text) boxes, pull down menus, etc which a user selects on display screen 184 with a pointer.
  • audio player 100 that characterizes one or more users' musical preferences will be described. In order to not obscure the explanation of this feature, the description of other audio playback operations and features of audio player 100 will be omitted.
  • Audio player 100 plays back music in response to one or more user instructions received user interface 190.
  • Music files may be retrieved from memory (e.g., memory 140), or may be provided from an external source, such a streaming music file from an external source (e.g., a file accessible through a local area network; the Internet; an external audio system; etc.) via input/output port 160.
  • Music files may be played back in a variety of different playback modes. Such modes include playing a preset playlist, playing an individual music file, random playback, etc.
  • audio player 100 may also play some music that originates in analog format.
  • audio subsystem 170 includes an analog-to-digital converter (AJO) that digitizes the analog music to produce the series of audio samples that are to be used to characterize a user's musical preferences, as described in further detail below.
  • AJO analog-to-digital converter
  • processor 120 executes an algorithm to extract a plurality of low level features from the audio samples of the music that is played to the user through the audio player 100.
  • Processor 120 determines, from the extracted low level features, a value for each of a plurality of low level audio descriptors for the music that is played to the user through the audio player 100.
  • this set of descriptor values provides a useful characterization of the user's musical preferences.
  • a feature is a distinctive characteristic of data that has some meaning.
  • a descriptor is a representation of the feature.
  • one descriptor for color might be a frequency spectrum plot.
  • Another descriptor might be a color space diagram.
  • a descriptor value is an instantiation of the descriptor for a given data set.
  • Low-level audio features are those related to the signal itself and have little or no meaning to the end-user. In other words, and thinking in terms of the audio domain, these features cannot be heard.
  • high-level audio features are meaningful and might be related to semantic or syntactic features of the sound. They can be used to classify sound objects into the class they belong.
  • the MPEG-7 Audio Standard defines seventeen temporal and spectral low level audio descriptors that can be divided into following groups:
  • An LLD can be instantiated as a single value for an audio segment or as series.
  • MPEG-7 Audio two different low level descriptor (LLD) Types exist. AudioLLDScalarType is useful for scalar values, such as power or fundamental frequency. AndioLLDVectorType can be used for vector types, such as spectra. Any descriptor that is inherited from one of the two types can be instantiated. More information of MPEG-7 Audio descriptors can be found in ISO document ISO/IEC JTC 1/SC 29/WG 11 N7708 (Oct. 2005), which is incorporated herein by reference.
  • the audio player of FIG. 1 can use any conveniently defined set of low level audio descriptors to characterize a user's musical preferences.
  • audio player 100 characterizes the user's musical preferences based only on the actual music that is played for the user by audio player 100. That is, instead of averaging descriptor values from songs on a playlist or in a song database, audio player 100 uses the music to which the user actually listens to determine the low level audio descriptor values that are employed to characterize a user's musical preferences. So, for example, if a user only listens to part of a song or other musical selection, only that part of the musical selection that was actually played goes into the determination of the low level audio descriptor values used to characterize the user's musical preferences. Furthermore, if a user listens to a favorite portion of a musical selection repeatedly, then that portion will be weighed accordingly in determining the low level audio descriptor values used to characterize the user's musical preferences.
  • processor 120 determines the low level audio descriptor values over a particular time period. That is, the low level audio descriptor values are updated when the user listens to additional music. Furthermore, either automatically or under user control, optionally processor 120 may periodically reset the low level audio descriptor values to insure that the values are not unnecessarily influenced by old, stale, data that no longer reflects the user's current musical preferences. Additionally, one or more time intervals may be excluded from the time period used to determine the low level audio descriptor values. For example, a user may provide an instruction through user interface 190 to turn-on or turn-off the musical preference characterization feature for one or more time intervals.
  • audio player 100 may automatically exclude one or more time intervals from the time period used to determine the low level audio descriptor values. This may be the case when an embodiment of audio player 100 is equipped to receive and play analog music, but is not equipped to convert the analog music to a digital format to produce the audio samples which are needed for extraction of the low level audio features.
  • Audio player 100 can determine the low level audio descriptor values from the audio samples in a variety of ways.
  • processor 120 may compute low level audio descriptor values for each portion of music (e.g., an entire song, or only the portion of a song that is actually played) that audio player 100 plays for the user. Then, processor 120 may average each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through audio player 100 over the particular time period.
  • portion of music e.g., an entire song, or only the portion of a song that is actually played
  • processor 120 may average each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through audio player 100 over the particular time period.
  • processor 120 may maintain a running computation of each of the plurality of low level audio descriptors over all music that is played to the user through audio player 100 over the particular time period.
  • processor 120 may compute the low level descriptor values "off-line” by accessing a database which contains low level audio descriptor values for each portion of music that is played to a user. For example, in this embodiment, it is assumed that low level audio descriptor values have already been computed for each portion of music that may be played to a user, and these low level audio descriptor values are stored in a database accessible by processor 120. In that case, processor 120 may compute "off-line" the low level audio descriptor values for all portions of music that are played to a user by accessing the database.
  • processor may make a "playback record" listing each portion of music that is played to a user. Then, periodically, processor 120 may use this playback record to access the corresponding low level audio descriptor values from the database, and update its computation of the low level descriptor values for all music played to the user.
  • the data may be retrieved automatically from the database and the low level audio descriptor values may be updated automatically at the conclusion of each portion of music being played to the user.
  • a weighting factor may be applied to the computation of the low level audio descriptors based on one or more external factors.
  • the audio descriptor values for that portion of the music may be given a small or reduced weighting factor, compared to low level audio descriptor values that apply when the audio output level is very high.
  • Other external factors may also be used to applying weighting to the audio low level descriptors.
  • processor 120 stores the low level audio descriptor values in memory 140.
  • the low level audio descriptor values stored in memory 140 can then be used to identify additional music that matches the user's musical preferences, and recommend such music to the user. Such recommendations may be provided via user interface 190. Alternatively, the recommendation may be made by simply playing the musical selection to the user and letting the user decide whether to continue listening, or to stop playing the musical selection. If a user likes the recommended musical selection, then audio player 100 may provide the user with an option to purchase and permanently download the corresponding music file to memory in audio player 100.
  • audio player 100 may have access to an external or remote song database (e.g., via a local area network; via the Internet; etc.).
  • processor 120 may compare values for the plurality of low level audio descriptors for a song in the song database, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140, and recommend the song to the user when the comparison result indicates that the song is a good match.
  • any desired threshold(s) may be used to determine when the comparison results indicate that the song is a "good match.”
  • the plurality of low level audio descriptor values for the song may have been predetermined and already stored in the song database or, alternatively, processor 120 may compute the plurality of low level audio descriptor values for the song.
  • audio player 100 may be configured to periodically play advertisements to a user.
  • audio player streams musical files form a remote server (e.g., the Internet), then as part of that streaming music service, the remote server may periodically provide an advertisement to be played to the user.
  • the low level audio descriptor values stored in memory 140 can be used to identify advertisements that are targeted to user's having particular musical preferences.
  • the advertisements are assigned low level audio descriptor values that describe musical preferences of a targeted listener.
  • processor 120 may compare values for the plurality of low level audio descriptors assigned to an advertisement, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140, and deliver the advertisement to the user when the comparison result indicates that the advertisement is a good match.
  • Audio player 100 may be configured to allow a plurality of users to log-in and identify themselves so that, for example, pre-stored individual playback preferences (e.g. equalizer setting; playback mode; volume) may be employed for each user.
  • audio player may individually characterize the musical preferences for each user by separately determining the low level audio descriptor values for the music that audio player 100 plays for each user.
  • processor 120 stores separately in memory 140 the plurality of low level audio descriptor values for each user, together with an indication of the user (e.g., a user ID) to which the low level audio descriptor values correspond.
  • audio player 100 may be configured to import, export, merge, or otherwise share low level audio descriptor values for a user with another device. For example, if a user purchases a new audio player 100, the low level audio descriptor values for the user from the user's old audio player 100 may be imported into the new audio player 100 so that the accumulated history of the user's music preferences will not be lost. In another example, if a user has two audio players 100, low level audio descriptor data may be synchronized or merged between the two devices. Such updating and synchronization could be done by directly connecting audio players 100, by docking audio players 100 to a personal computer and using a computer program executed on the personal computer, etc.
  • FIG. 2 shows a flowchart for a method of characterizing the musical preferences of one or more users which may be executed by an audio playback system such as audio player 100.
  • a user may log-in or otherwise identify herself/himself to the audio playback system.
  • the audio playback system may retrieve a set of user preferences corresponding to that particular user, including a set of low level audio descriptor values corresponding to the particular user.
  • the audio playback system plays music to a user.
  • this is in response to one or more instructions received from the user (e.g., selecting a playlist or a particular music file for playback.)
  • a processor determines whether the current time interval should be excluded from the characterization of the user's musical preferences. If so, then the process returns to step 210 and continues playing the music. Otherwise, the process proceeds to step 215. In a step 215, audio samples for each portion of music that is played to a user through the audio playback system are received by a processor.
  • the processor extracts from the audio samples a plurality of low level features of each portion of music that is played to the user through the audio playback system. Then, in a step 225, the processor computes, from the extracted low level features, a value for each of a plurality of low level audio descriptors for the current portion of music that has been played to the user through the audio playback system.
  • the processor averages each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over a particular time period.
  • step 235 the processor stores in memory the low level audio descriptor values that were determined in the preceding step.
  • a step 240 the processor compares values for the plurality of low level audio descriptors for a song in the song database, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140.
  • step 245 the comparison result is compared to one or more thresholds to determine whether the match is good.
  • step 250 when the match is good, then the song is recommended to the user. Then the process may return to step 210.
  • step 255 the processor compares values for the plurality of low level audio descriptors for an advertisement, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140.
  • step 260 the comparison result is compared to one or more thresholds to determine whether the match is good.
  • step 265 when the match is good, then the advertisement is delivered to the user. Then the process may return to step 210.
  • steps 225 and 230 may be replaced by a single step 227 wherein the processor maintains a running computation of each of the plurality of low level audio descriptor values over all music that has been played to the user through the audio playback system over the particular time period.

Landscapes

  • Engineering & Computer Science (AREA)
  • Signal Processing (AREA)
  • Multimedia (AREA)
  • Social Psychology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Computer Security & Cryptography (AREA)
  • General Engineering & Computer Science (AREA)
  • Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

L'invention concerne un système (100) et un procédé (200) de caractérisation des préférences musicales d'un individu consistant à: recevoir (215) des échantillons audio de toute la musique qui est écoutée par un utilisateur et reproduite par un système de lecture audio (170); extraire (220), à partir des échantillons audio, une pluralité de caractéristiques de faible niveau pour toute la musique qui est écoutée par l'utilisateur et reproduite par le système de lecture audio (170); calculer (227), à partir des caractéristiques de faible niveau extraites, une valeur pour chacun des multiples éléments descriptifs audio de faible niveau, pour toute la musique écoutée par l'utilisateur et reproduite par le système de lecture audio (170) pendant une période déterminée; et stocker (235) en mémoire (140) les valeurs des éléments descriptifs audio de faible niveau calculées pour toute la musique qui a été écoutée par l'utilisateur et reproduite par le système de lecture audio (170) pendant cette période.
PCT/IB2007/050692 2006-03-06 2007-03-02 Système et procédé de détermination de préférences musicales personnelles WO2007102109A2 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US77951106P 2006-03-06 2006-03-06
US60/779,511 2006-03-06

Publications (2)

Publication Number Publication Date
WO2007102109A2 true WO2007102109A2 (fr) 2007-09-13
WO2007102109A3 WO2007102109A3 (fr) 2007-11-15

Family

ID=38283366

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2007/050692 WO2007102109A2 (fr) 2006-03-06 2007-03-02 Système et procédé de détermination de préférences musicales personnelles

Country Status (1)

Country Link
WO (1) WO2007102109A2 (fr)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1378912A2 (fr) * 2002-07-02 2004-01-07 Matsushita Electric Industrial Co., Ltd. Système de recherche de musique
EP1435620A1 (fr) * 2003-01-06 2004-07-07 Thomson Licensing S.A. Procédé pour la création de et l'accès à un menu de contenu audio, sans utilisation d'un écran
US20040194612A1 (en) * 2003-04-04 2004-10-07 International Business Machines Corporation Method, system and program product for automatically categorizing computer audio files
US20050038819A1 (en) * 2000-04-21 2005-02-17 Hicken Wendell T. Music Recommendation system and method
WO2005027519A1 (fr) * 2003-09-16 2005-03-24 Koninklijke Philips Electronics N.V. Utilisation des connaissances de bon sens pour caracteriser un contenu multimedia

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20050038819A1 (en) * 2000-04-21 2005-02-17 Hicken Wendell T. Music Recommendation system and method
EP1378912A2 (fr) * 2002-07-02 2004-01-07 Matsushita Electric Industrial Co., Ltd. Système de recherche de musique
EP1435620A1 (fr) * 2003-01-06 2004-07-07 Thomson Licensing S.A. Procédé pour la création de et l'accès à un menu de contenu audio, sans utilisation d'un écran
US20040194612A1 (en) * 2003-04-04 2004-10-07 International Business Machines Corporation Method, system and program product for automatically categorizing computer audio files
WO2005027519A1 (fr) * 2003-09-16 2005-03-24 Koninklijke Philips Electronics N.V. Utilisation des connaissances de bon sens pour caracteriser un contenu multimedia

Also Published As

Publication number Publication date
WO2007102109A3 (fr) 2007-11-15

Similar Documents

Publication Publication Date Title
US8407224B2 (en) Apparatus and method for generating a collection profile and for communicating based on the collection profile
US8725740B2 (en) Active playlist having dynamic media item groups
US20200201596A1 (en) Method and system for playback of audio content using wireless mobile device
US8751030B2 (en) Audio player and operating method automatically selecting music type mode according to environment noise
US8060227B2 (en) Digital media player and method for facilitating social music discovery through sampling, identification, and logging
US8732180B2 (en) Recommending media items
KR101086197B1 (ko) 미디어 플레이백 디바이스에 의한 미디어 인기도 결정을 위한 방법
US20060265349A1 (en) Sharing music essence in a recommendation system
US20070245376A1 (en) Portable media player enabled to obtain previews of media content
US9537913B2 (en) Method and system for delivery of audio content for use on wireless mobile device
WO2016053888A1 (fr) Recommandations de contenu spécifique à un pays en cas de rareté de données sur le pays
US20090171995A1 (en) Associating and presenting alternate media with a media file
JP5594532B2 (ja) 情報処理装置および方法、情報処理システム、並びに、プログラム
US20150324369A1 (en) Method and system for deep metadata population of media content
CN105824895A (zh) 一种ktv点歌方法和系统
JP2006146622A (ja) 情報提供システム,情報提供サーバ,ユーザ端末,再生機器およびコンピュータプログラム
US8375059B2 (en) Electronic device and method therefor
US11132395B2 (en) Selecting media for a social event according to specified event parameters
WO2007102109A2 (fr) Système et procédé de détermination de préférences musicales personnelles
US20070245377A1 (en) Central system providing previews to a portable media player
US20070078847A1 (en) System and method for generating a play-list
AU2014200583A1 (en) Apparatus and method for generating a collection profile and for communicating based on the collection profile

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 07735031

Country of ref document: EP

Kind code of ref document: A2

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 07735031

Country of ref document: EP

Kind code of ref document: A2