WO2007102109A2 - Système et procédé de détermination de préférences musicales personnelles - Google Patents
Système et procédé de détermination de préférences musicales personnelles Download PDFInfo
- Publication number
- WO2007102109A2 WO2007102109A2 PCT/IB2007/050692 IB2007050692W WO2007102109A2 WO 2007102109 A2 WO2007102109 A2 WO 2007102109A2 IB 2007050692 W IB2007050692 W IB 2007050692W WO 2007102109 A2 WO2007102109 A2 WO 2007102109A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- user
- low level
- music
- played
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/442—Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
- H04N21/44213—Monitoring of end-user related data
- H04N21/44222—Analytics of user selections, e.g. selection of programs or purchase activity
- H04N21/44224—Monitoring of user activity on external systems, e.g. Internet browsing
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/102—Programmed access in sequence to addressed parts of tracks of operating record carriers
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/11—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information not detectable on the record carrier
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B27/00—Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
- G11B27/10—Indexing; Addressing; Timing or synchronising; Measuring tape travel
- G11B27/19—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
- G11B27/28—Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/27—Arrangements for recording or accumulating broadcast information or broadcast-related information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/35—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
- H04H60/37—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/35—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
- H04H60/46—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for recognising users' preferences
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/56—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
- H04H60/58—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/41—Structure of client; Structure of client peripherals
- H04N21/414—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance
- H04N21/41407—Specialised client platforms, e.g. receiver in car or embedded in a mobile appliance embedded in a portable device, e.g. video client on a mobile phone, PDA, laptop
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/432—Content retrieval operation from a local storage medium, e.g. hard-disk
- H04N21/4325—Content retrieval operation from a local storage medium, e.g. hard-disk by playing back content from the storage medium
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/43—Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
- H04N21/439—Processing of audio elementary streams
- H04N21/4394—Processing of audio elementary streams involving operations for analysing the audio stream, e.g. detecting features or characteristics in audio streams
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/80—Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
- H04N21/81—Monomedia components thereof
- H04N21/8106—Monomedia components thereof involving special audio data, e.g. different tracks for different languages
- H04N21/8113—Monomedia components thereof involving special audio data, e.g. different tracks for different languages comprising music, e.g. song in MP3 format
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/16—Analogue secrecy systems; Analogue subscription systems
- H04N7/162—Authorising the user terminal, e.g. by paying; Registering the use of a subscription channel, e.g. billing
- H04N7/163—Authorising the user terminal, e.g. by paying; Registering the use of a subscription channel, e.g. billing by receiver means only
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/095—Identification code, e.g. ISWC for musical works; Identification dataset
- G10H2240/101—User identification
- G10H2240/105—User profile, i.e. data about the user, e.g. for user settings or user preferences
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B2220/00—Record carriers by type
- G11B2220/60—Solid state media
- G11B2220/61—Solid state media wherein solid state memory is used for storing A/V content
Definitions
- This invention pertains to a music playback system and method of playing back music, and more particularly, to a method of determining personal musical preferences of a user, and an audio player that determines personal musical preferences of a user.
- a user's musical preferences can be characterized based on the music to which he/she listens. It would also be desirable to provide an audio player that includes such a feature. It would be further desirable to provide a method which recommends songs to a user based on their characterized musical preferences.
- the present invention is directed to addressing one or more of the preceding concerns.
- a method of characterizing an individual's musical preferences comprises: receiving audio samples for each portion of music that is played to a user through an audio playback system; extracting from the audio samples a plurality of low level features of each portion of music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for each portion of music that has been played to the user through the audio playback system; averaging each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over a time period; and storing in memory the averaged low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over the time period.
- a system for playing music comprises: memory; an audio playback system; a user interface adapted to receive a request from a user to play music through the audio playback system; and a processor configured to execute a method comprising the steps: receiving audio samples for all music that is played to a user through the audio playback system; extracting from the audio samples a plurality of low level features for all music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for all music that is played to the user through the audio playback system over a time period; and storing in the memory the computed low level audio descriptor values for all music that has been played to the user through the audio playback system over the time period.
- a method of characterizing an individual's musical preferences comprises: receiving audio samples for all music that is played to a user through an audio playback system; extracting from the audio samples a plurality of low level features for all music that is played to the user through the audio playback system; computing, from the extracted low level features, a value for each of a plurality of low level audio descriptors for all music that is played to the user through the audio playback system over a time period; and storing in memory the computed low level audio descriptor values for all music that has been played to the user through the audio playback system over the time period.
- FIG. 1 is a functional block diagram of an audio player
- FIG. 2 is a flowchart illustrating on embodiment of a method of characterizing musical preferences of a user of an audio playback system.
- FIG. 1 is a functional block diagram of an audio player 100.
- the various "parts" shown in FIG. 1 may be physically implemented using a software-controlled microprocessor, hard- wired logic circuits, or a combination thereof. Also, while the parts are functionally segregated in FIG. 1 for explanation purposes, they may be combined in any physical implementation.
- Audio player 100 includes a processor 120, memory 140, one or more input-output ports 160, audio subsystem 170, and user interface 190.
- user interface 190 includes a graphical user interface 195.
- audio player 100 also includes video subsystem 180. It should be understood that when audio player 100 does not include graphical user interface 195, then video subsystem 180 may be omitted.
- audio player 100 may play back both audio files and audio-video files, such as music videos.
- Audio player 100 may be integrated into a portable device that includes one or more other functions, such as a portable computing device, a personal digital assistant (PDA), a mobile telephone, a digital camera, a portable e-mail terminal, etc.
- a portable computing device such as a portable computing device, a personal digital assistant (PDA), a mobile telephone, a digital camera, a portable e-mail terminal, etc.
- PDA personal digital assistant
- the processor 120, memory 140, and user interface 190 may be used to perform functions of audio player 100, and also to perform one or more additional functions such as a portable computing, personal information management, mobile telephony, a digital photography, e-mail transmission and reception, etc.
- the audio player may be provided as a software program executed by a processor of a general purpose personal computer (PC), such that there are no dedicated hardware components to the audio player.
- processor 120 described below may be performed by the microprocessor of the PC
- the functions of memory 140 may be provided by the memory of the PC
- the functions provided by audio subsystem 170 may be provided by the audio systems of the PC
- the functions of user interface 190 may be provided by a graphical user interface 195 of the PC, etc.
- Audio player 100 plays music.
- Music may include digital music files, which may be stored in memory 140.
- Such music files may include digital audio files, and may also include the audio component of digital audio-video files, such as music videos.
- Audio files played by audio player 100 may have any convenient file format, such as mp3, wma, wav, MPEG, etc.
- audio player 100 may play music that is not stored locally, and instead is streamed to audio player 100 from an external source (e.g., a file on a local area network; an Internet site; etc.) via input/output port 160.
- audio player 100 may also play some music that originates in analog format. Such music may be received via input/output port 160 (e.g., an RCA jack), or another device with which audio player 100 is integrated, such as an FM radio.
- processor 120 is configured to execute one or more software algorithms in conjunction with memory 140 and user interface 190 to provide the functionality of audio player 100.
- processor 120 includes its own memory (e.g., nonvolatile memory) for storing executable software code that allows it to perform the various functions of audio player 100.
- the executable code may be stored in designated memory locations within memory 140.
- audio player 100 may include a variety of memory components to perform a variety of functions.
- it may include: processor cache memory; non- volatile memory (e.g., ROM or Flash memory) for storing executable code of software algorithms executed by processor 120; volatile memory (e.g., SRAM/DRAM, etc.) necessary to execute the software algorithms; and additional non- volatile memory (e.g., Flash memory, hard disk drive, or optical or magnetic disc, etc.) for storing the digital music files and, beneficially, associated metadata.
- Memory 140 shown in FIG. 1 is intended to represent memory performing all of the functions described above. Memory 140 stores data in response to the processor 120. Such data may include user preferences for operation of audio player 120 by one or more users.
- Such data may also include data for organizing music files, such a playlists, directories, and other mechanisms for grouping, separating, and otherwise generally organizing music files stored in audio player 100.
- memory 140 also stores music files for playback by audio player 100.
- music files may be stored in a separate memory other than memory 140.
- Input/output port(s) 160 facilitates the transfer of music into and out of audio player 100.
- Input/output port(s) 160 may include one or more USB ports, Firewire ports, Bluetooth ports, wireless Ethernet ports, RCA jacks, etc.
- Audio subsystem 170 includes audio circuitry 172, audio speaker 176, and audio output jack(s) 178 (e.g., a headphone jack). In some cases, some or all of audio circuitry 172 may be integrated with processor 120. Also, either audio speaker 176 or audio output jack(s) 178 may be omitted.
- Video subsystem 180 includes audio circuitry 182 and video display device 184. In some cases, some or all of video circuitry 182 may be integrated with processor 120.
- User interface 190 includes objects that allow a user to control the operation of audio player 100.
- user interface 190 includes a graphical user interface 195.
- graphical user interface 195 may be omitted.
- the objects can include one or more manual buttons 192, knobs 194, click- wheels 196, etc. mounted on a case that houses audio player 100.
- objects may also include buttons, check boxes, data entry (e.g., text) boxes, pull down menus, etc, that are displayed via display screen 184.
- Such objects may be selected via click-wheel 196, or another scrolling device, such as a mouse, a trackball, a track-pointer, or by arrow buttons, etc.
- display screen 184 may be a touchscreen displaying buttons, check boxes, data entry (e.g., text) boxes, pull down menus, etc which a user selects on display screen 184 with a pointer.
- audio player 100 that characterizes one or more users' musical preferences will be described. In order to not obscure the explanation of this feature, the description of other audio playback operations and features of audio player 100 will be omitted.
- Audio player 100 plays back music in response to one or more user instructions received user interface 190.
- Music files may be retrieved from memory (e.g., memory 140), or may be provided from an external source, such a streaming music file from an external source (e.g., a file accessible through a local area network; the Internet; an external audio system; etc.) via input/output port 160.
- Music files may be played back in a variety of different playback modes. Such modes include playing a preset playlist, playing an individual music file, random playback, etc.
- audio player 100 may also play some music that originates in analog format.
- audio subsystem 170 includes an analog-to-digital converter (AJO) that digitizes the analog music to produce the series of audio samples that are to be used to characterize a user's musical preferences, as described in further detail below.
- AJO analog-to-digital converter
- processor 120 executes an algorithm to extract a plurality of low level features from the audio samples of the music that is played to the user through the audio player 100.
- Processor 120 determines, from the extracted low level features, a value for each of a plurality of low level audio descriptors for the music that is played to the user through the audio player 100.
- this set of descriptor values provides a useful characterization of the user's musical preferences.
- a feature is a distinctive characteristic of data that has some meaning.
- a descriptor is a representation of the feature.
- one descriptor for color might be a frequency spectrum plot.
- Another descriptor might be a color space diagram.
- a descriptor value is an instantiation of the descriptor for a given data set.
- Low-level audio features are those related to the signal itself and have little or no meaning to the end-user. In other words, and thinking in terms of the audio domain, these features cannot be heard.
- high-level audio features are meaningful and might be related to semantic or syntactic features of the sound. They can be used to classify sound objects into the class they belong.
- the MPEG-7 Audio Standard defines seventeen temporal and spectral low level audio descriptors that can be divided into following groups:
- An LLD can be instantiated as a single value for an audio segment or as series.
- MPEG-7 Audio two different low level descriptor (LLD) Types exist. AudioLLDScalarType is useful for scalar values, such as power or fundamental frequency. AndioLLDVectorType can be used for vector types, such as spectra. Any descriptor that is inherited from one of the two types can be instantiated. More information of MPEG-7 Audio descriptors can be found in ISO document ISO/IEC JTC 1/SC 29/WG 11 N7708 (Oct. 2005), which is incorporated herein by reference.
- the audio player of FIG. 1 can use any conveniently defined set of low level audio descriptors to characterize a user's musical preferences.
- audio player 100 characterizes the user's musical preferences based only on the actual music that is played for the user by audio player 100. That is, instead of averaging descriptor values from songs on a playlist or in a song database, audio player 100 uses the music to which the user actually listens to determine the low level audio descriptor values that are employed to characterize a user's musical preferences. So, for example, if a user only listens to part of a song or other musical selection, only that part of the musical selection that was actually played goes into the determination of the low level audio descriptor values used to characterize the user's musical preferences. Furthermore, if a user listens to a favorite portion of a musical selection repeatedly, then that portion will be weighed accordingly in determining the low level audio descriptor values used to characterize the user's musical preferences.
- processor 120 determines the low level audio descriptor values over a particular time period. That is, the low level audio descriptor values are updated when the user listens to additional music. Furthermore, either automatically or under user control, optionally processor 120 may periodically reset the low level audio descriptor values to insure that the values are not unnecessarily influenced by old, stale, data that no longer reflects the user's current musical preferences. Additionally, one or more time intervals may be excluded from the time period used to determine the low level audio descriptor values. For example, a user may provide an instruction through user interface 190 to turn-on or turn-off the musical preference characterization feature for one or more time intervals.
- audio player 100 may automatically exclude one or more time intervals from the time period used to determine the low level audio descriptor values. This may be the case when an embodiment of audio player 100 is equipped to receive and play analog music, but is not equipped to convert the analog music to a digital format to produce the audio samples which are needed for extraction of the low level audio features.
- Audio player 100 can determine the low level audio descriptor values from the audio samples in a variety of ways.
- processor 120 may compute low level audio descriptor values for each portion of music (e.g., an entire song, or only the portion of a song that is actually played) that audio player 100 plays for the user. Then, processor 120 may average each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through audio player 100 over the particular time period.
- portion of music e.g., an entire song, or only the portion of a song that is actually played
- processor 120 may average each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through audio player 100 over the particular time period.
- processor 120 may maintain a running computation of each of the plurality of low level audio descriptors over all music that is played to the user through audio player 100 over the particular time period.
- processor 120 may compute the low level descriptor values "off-line” by accessing a database which contains low level audio descriptor values for each portion of music that is played to a user. For example, in this embodiment, it is assumed that low level audio descriptor values have already been computed for each portion of music that may be played to a user, and these low level audio descriptor values are stored in a database accessible by processor 120. In that case, processor 120 may compute "off-line" the low level audio descriptor values for all portions of music that are played to a user by accessing the database.
- processor may make a "playback record" listing each portion of music that is played to a user. Then, periodically, processor 120 may use this playback record to access the corresponding low level audio descriptor values from the database, and update its computation of the low level descriptor values for all music played to the user.
- the data may be retrieved automatically from the database and the low level audio descriptor values may be updated automatically at the conclusion of each portion of music being played to the user.
- a weighting factor may be applied to the computation of the low level audio descriptors based on one or more external factors.
- the audio descriptor values for that portion of the music may be given a small or reduced weighting factor, compared to low level audio descriptor values that apply when the audio output level is very high.
- Other external factors may also be used to applying weighting to the audio low level descriptors.
- processor 120 stores the low level audio descriptor values in memory 140.
- the low level audio descriptor values stored in memory 140 can then be used to identify additional music that matches the user's musical preferences, and recommend such music to the user. Such recommendations may be provided via user interface 190. Alternatively, the recommendation may be made by simply playing the musical selection to the user and letting the user decide whether to continue listening, or to stop playing the musical selection. If a user likes the recommended musical selection, then audio player 100 may provide the user with an option to purchase and permanently download the corresponding music file to memory in audio player 100.
- audio player 100 may have access to an external or remote song database (e.g., via a local area network; via the Internet; etc.).
- processor 120 may compare values for the plurality of low level audio descriptors for a song in the song database, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140, and recommend the song to the user when the comparison result indicates that the song is a good match.
- any desired threshold(s) may be used to determine when the comparison results indicate that the song is a "good match.”
- the plurality of low level audio descriptor values for the song may have been predetermined and already stored in the song database or, alternatively, processor 120 may compute the plurality of low level audio descriptor values for the song.
- audio player 100 may be configured to periodically play advertisements to a user.
- audio player streams musical files form a remote server (e.g., the Internet), then as part of that streaming music service, the remote server may periodically provide an advertisement to be played to the user.
- the low level audio descriptor values stored in memory 140 can be used to identify advertisements that are targeted to user's having particular musical preferences.
- the advertisements are assigned low level audio descriptor values that describe musical preferences of a targeted listener.
- processor 120 may compare values for the plurality of low level audio descriptors assigned to an advertisement, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140, and deliver the advertisement to the user when the comparison result indicates that the advertisement is a good match.
- Audio player 100 may be configured to allow a plurality of users to log-in and identify themselves so that, for example, pre-stored individual playback preferences (e.g. equalizer setting; playback mode; volume) may be employed for each user.
- audio player may individually characterize the musical preferences for each user by separately determining the low level audio descriptor values for the music that audio player 100 plays for each user.
- processor 120 stores separately in memory 140 the plurality of low level audio descriptor values for each user, together with an indication of the user (e.g., a user ID) to which the low level audio descriptor values correspond.
- audio player 100 may be configured to import, export, merge, or otherwise share low level audio descriptor values for a user with another device. For example, if a user purchases a new audio player 100, the low level audio descriptor values for the user from the user's old audio player 100 may be imported into the new audio player 100 so that the accumulated history of the user's music preferences will not be lost. In another example, if a user has two audio players 100, low level audio descriptor data may be synchronized or merged between the two devices. Such updating and synchronization could be done by directly connecting audio players 100, by docking audio players 100 to a personal computer and using a computer program executed on the personal computer, etc.
- FIG. 2 shows a flowchart for a method of characterizing the musical preferences of one or more users which may be executed by an audio playback system such as audio player 100.
- a user may log-in or otherwise identify herself/himself to the audio playback system.
- the audio playback system may retrieve a set of user preferences corresponding to that particular user, including a set of low level audio descriptor values corresponding to the particular user.
- the audio playback system plays music to a user.
- this is in response to one or more instructions received from the user (e.g., selecting a playlist or a particular music file for playback.)
- a processor determines whether the current time interval should be excluded from the characterization of the user's musical preferences. If so, then the process returns to step 210 and continues playing the music. Otherwise, the process proceeds to step 215. In a step 215, audio samples for each portion of music that is played to a user through the audio playback system are received by a processor.
- the processor extracts from the audio samples a plurality of low level features of each portion of music that is played to the user through the audio playback system. Then, in a step 225, the processor computes, from the extracted low level features, a value for each of a plurality of low level audio descriptors for the current portion of music that has been played to the user through the audio playback system.
- the processor averages each of the plurality of computed low level audio descriptor values for all portions of music that have been played to the user through the audio playback system over a particular time period.
- step 235 the processor stores in memory the low level audio descriptor values that were determined in the preceding step.
- a step 240 the processor compares values for the plurality of low level audio descriptors for a song in the song database, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140.
- step 245 the comparison result is compared to one or more thresholds to determine whether the match is good.
- step 250 when the match is good, then the song is recommended to the user. Then the process may return to step 210.
- step 255 the processor compares values for the plurality of low level audio descriptors for an advertisement, with the low level audio descriptor values for music that has been played to the user through the audio playback system over the time period and stored in memory 140.
- step 260 the comparison result is compared to one or more thresholds to determine whether the match is good.
- step 265 when the match is good, then the advertisement is delivered to the user. Then the process may return to step 210.
- steps 225 and 230 may be replaced by a single step 227 wherein the processor maintains a running computation of each of the plurality of low level audio descriptor values over all music that has been played to the user through the audio playback system over the particular time period.
Landscapes
- Engineering & Computer Science (AREA)
- Signal Processing (AREA)
- Multimedia (AREA)
- Social Psychology (AREA)
- Health & Medical Sciences (AREA)
- General Health & Medical Sciences (AREA)
- Databases & Information Systems (AREA)
- Computer Networks & Wireless Communication (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Computer Security & Cryptography (AREA)
- General Engineering & Computer Science (AREA)
- Indexing, Searching, Synchronizing, And The Amount Of Synchronization Travel Of Record Carriers (AREA)
- Reverberation, Karaoke And Other Acoustics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
L'invention concerne un système (100) et un procédé (200) de caractérisation des préférences musicales d'un individu consistant à: recevoir (215) des échantillons audio de toute la musique qui est écoutée par un utilisateur et reproduite par un système de lecture audio (170); extraire (220), à partir des échantillons audio, une pluralité de caractéristiques de faible niveau pour toute la musique qui est écoutée par l'utilisateur et reproduite par le système de lecture audio (170); calculer (227), à partir des caractéristiques de faible niveau extraites, une valeur pour chacun des multiples éléments descriptifs audio de faible niveau, pour toute la musique écoutée par l'utilisateur et reproduite par le système de lecture audio (170) pendant une période déterminée; et stocker (235) en mémoire (140) les valeurs des éléments descriptifs audio de faible niveau calculées pour toute la musique qui a été écoutée par l'utilisateur et reproduite par le système de lecture audio (170) pendant cette période.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US77951106P | 2006-03-06 | 2006-03-06 | |
US60/779,511 | 2006-03-06 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007102109A2 true WO2007102109A2 (fr) | 2007-09-13 |
WO2007102109A3 WO2007102109A3 (fr) | 2007-11-15 |
Family
ID=38283366
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/IB2007/050692 WO2007102109A2 (fr) | 2006-03-06 | 2007-03-02 | Système et procédé de détermination de préférences musicales personnelles |
Country Status (1)
Country | Link |
---|---|
WO (1) | WO2007102109A2 (fr) |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1378912A2 (fr) * | 2002-07-02 | 2004-01-07 | Matsushita Electric Industrial Co., Ltd. | Système de recherche de musique |
EP1435620A1 (fr) * | 2003-01-06 | 2004-07-07 | Thomson Licensing S.A. | Procédé pour la création de et l'accès à un menu de contenu audio, sans utilisation d'un écran |
US20040194612A1 (en) * | 2003-04-04 | 2004-10-07 | International Business Machines Corporation | Method, system and program product for automatically categorizing computer audio files |
US20050038819A1 (en) * | 2000-04-21 | 2005-02-17 | Hicken Wendell T. | Music Recommendation system and method |
WO2005027519A1 (fr) * | 2003-09-16 | 2005-03-24 | Koninklijke Philips Electronics N.V. | Utilisation des connaissances de bon sens pour caracteriser un contenu multimedia |
-
2007
- 2007-03-02 WO PCT/IB2007/050692 patent/WO2007102109A2/fr active Application Filing
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050038819A1 (en) * | 2000-04-21 | 2005-02-17 | Hicken Wendell T. | Music Recommendation system and method |
EP1378912A2 (fr) * | 2002-07-02 | 2004-01-07 | Matsushita Electric Industrial Co., Ltd. | Système de recherche de musique |
EP1435620A1 (fr) * | 2003-01-06 | 2004-07-07 | Thomson Licensing S.A. | Procédé pour la création de et l'accès à un menu de contenu audio, sans utilisation d'un écran |
US20040194612A1 (en) * | 2003-04-04 | 2004-10-07 | International Business Machines Corporation | Method, system and program product for automatically categorizing computer audio files |
WO2005027519A1 (fr) * | 2003-09-16 | 2005-03-24 | Koninklijke Philips Electronics N.V. | Utilisation des connaissances de bon sens pour caracteriser un contenu multimedia |
Also Published As
Publication number | Publication date |
---|---|
WO2007102109A3 (fr) | 2007-11-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US8407224B2 (en) | Apparatus and method for generating a collection profile and for communicating based on the collection profile | |
US8725740B2 (en) | Active playlist having dynamic media item groups | |
US20200201596A1 (en) | Method and system for playback of audio content using wireless mobile device | |
US8751030B2 (en) | Audio player and operating method automatically selecting music type mode according to environment noise | |
US8060227B2 (en) | Digital media player and method for facilitating social music discovery through sampling, identification, and logging | |
US8732180B2 (en) | Recommending media items | |
KR101086197B1 (ko) | 미디어 플레이백 디바이스에 의한 미디어 인기도 결정을 위한 방법 | |
US20060265349A1 (en) | Sharing music essence in a recommendation system | |
US20070245376A1 (en) | Portable media player enabled to obtain previews of media content | |
US9537913B2 (en) | Method and system for delivery of audio content for use on wireless mobile device | |
WO2016053888A1 (fr) | Recommandations de contenu spécifique à un pays en cas de rareté de données sur le pays | |
US20090171995A1 (en) | Associating and presenting alternate media with a media file | |
JP5594532B2 (ja) | 情報処理装置および方法、情報処理システム、並びに、プログラム | |
US20150324369A1 (en) | Method and system for deep metadata population of media content | |
CN105824895A (zh) | 一种ktv点歌方法和系统 | |
JP2006146622A (ja) | 情報提供システム,情報提供サーバ,ユーザ端末,再生機器およびコンピュータプログラム | |
US8375059B2 (en) | Electronic device and method therefor | |
US11132395B2 (en) | Selecting media for a social event according to specified event parameters | |
WO2007102109A2 (fr) | Système et procédé de détermination de préférences musicales personnelles | |
US20070245377A1 (en) | Central system providing previews to a portable media player | |
US20070078847A1 (en) | System and method for generating a play-list | |
AU2014200583A1 (en) | Apparatus and method for generating a collection profile and for communicating based on the collection profile |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 07735031 Country of ref document: EP Kind code of ref document: A2 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 07735031 Country of ref document: EP Kind code of ref document: A2 |