WO2007037889A2 - Method and apparatus for audio data analysis in an audio player - Google Patents
Method and apparatus for audio data analysis in an audio player Download PDFInfo
- Publication number
- WO2007037889A2 WO2007037889A2 PCT/US2006/033666 US2006033666W WO2007037889A2 WO 2007037889 A2 WO2007037889 A2 WO 2007037889A2 US 2006033666 W US2006033666 W US 2006033666W WO 2007037889 A2 WO2007037889 A2 WO 2007037889A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- analysis
- audio
- audio data
- profile based
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R5/00—Stereophonic arrangements
- H04R5/04—Circuit arrangements, e.g. for selective connection of amplifier inputs/outputs to loudspeakers, for loudspeaker detection, or for adaptation of settings to personal preferences or hearing impairments
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/56—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
- H04H60/58—Arrangements characterised by components specially adapted for monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 of audio
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/68—Systems specially adapted for using specific information, e.g. geographical or meteorological information
- H04H60/73—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
Definitions
- the present invention relates to audio players. More specifically, the present invention relates to an audio player adapted to analyze audio data and adjust output according to the analysis.
- the present invention generally relates to an audio player adapted to analyze audio data and adjust output according to the analysis.
- One embodiment can be characterized as a method of data analysis for an audio player comprising analyzing at least a portion of audio data; selecting a sound profile based upon the analysis of the audio data; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting.
- the step of analyzing at least a portion of audio data further comprises analyzing metadata.
- the step of analyzing at least a portion of audio data further comprises analyzing sound content.
- Another embodiment can be characterized as a method of data analysis for an audio player comprising recording user interaction with an audio player, the interaction corresponding to at least a portion of audio data; selecting a sound profile based upon the user interaction; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting.
- the user interaction comprises listening to an audio track, adjusting the sound field setting or programming the sound profile by answering prompted questions.
- a subsequent embodiment includes an audio player device comprising an audio analysis circuit adapted to determine a characteristic of audio data; a profile selection circuit adapted to select a sound profile corresponding to the characteristic of audio data; and a sound field circuit adapted to adjust sound field setting according to the sound profile.
- FIG. 1 is a block diagram illustrating an audio player in accordance with one embodiment
- Fig. 2 is a flow diagram illustrating a method of analyzing audio data in accordance with one embodiment
- Fig. 3 is a flow diagram illustrating in more detail the analysis of audio data as shown in the flow diagram of Fig. 2.
- the audio player 100 includes a processor 102 with memory 104, an input interface 106, a decoder 108, a display 110 and an audio output 112,
- the processor 102 includes an audio analysis circuit 114, a sound field circuit 116 and a profile selection circuit 118.
- the audio player 100 can be one of many manufactured and sold audio players widely available, including for example, an MP3 player, a CD player, a DVD audio player, a computer, or other type of audio player.
- the audio player 100 is an electronic device that is capable, through a combination of hardware, firmware and/or software, of receiving, analyzing and outputting audio data.
- the processor 102 has memory 104 and is operably coupled to the input interface 106, the decoder 108 and the display 110.
- the audio player 100 stores audio files in the memory 104 in the form of audio data.
- the processor 102 controls reading the audio data into or out of the memory 104,
- the decoder 108 decodes the audio data and outputs the decoded audio data to the audio output 112.
- the audio output 112 outputs the audio data as an audible signal that is heard by the user of the audio player 100.
- the audio output 112 is, for example, a speaker or an audio jack for use with a headphone set.
- the memory 104 includes memory for storage of audio files,
- the memory 104 is, for example, a built-in hard disk drive, non- volatile "flash" memory, removable memory, such as a compact disk (CD), digital versatile disk (DVD), or any combination thereof. All or a portion of the memory may be in the form of one or more removable blocks, modules, or chips.
- the memory 104 need not be one physical memory device, but can include one or more separate memory devices.
- the input interface 106 includes, for example, a keypad, a touchpad, a touch screen, a mouse, or other types of devices used to interact with an electronic device.
- the user may interact with the input interface 106 of the audio player 100 to adjust the sound field in a variety of ways.
- a sound field is defined by the physical characteristics of sound waves in a region of space.
- the sound field relating to an audio player is the sound that is emitted from an audio player.
- the sound field may be adjusted when a user interacts with the input interface 106 of the audio player 100 to adjust settings of the audio player 100, for example, equalizer settings, mode settings (for example, concert hall mode or surround sound mode), bass, treble, or other settings that affect the sound field.
- the input interface 106 is adapted to record user interactions to be stored in the memory 104.
- User interactions include, by way of example only, playing an audio track at a particular sound field setting, adjusting the sound field setting while listening to a track, programming sound field settings to correspond with a particular track or genre of track, or responding to prompted questions regarding sound field settings in relation to a particular track or genre of track.
- the display 110 visually presents images corresponding to, for example, metadata, sound field settings, or other information pertinent to a user's interaction with and/or use of the audio player 100.
- the metadata includes, for example, the name of the song, the artist, the album title, the genre and the time period from when the song was created. In some embodiments, the display 110 may present questions for the user to respond to regarding sound field settings in relation to a particular track or genre of track.
- the processor 102 includes the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118.
- the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118 represent functional circuitry within the audio player 100.
- the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118 are implemented, in some embodiments, as software stored in the memory 104 and executed by the processor 102. As described herein, those skilled in the art will appreciate that circuit(s) can refer to dedicated fixed-purpose circuits and/or partially or wholly programmable platforms of various types and that these teachings are compatible with any such mode of deployment for the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118.
- the audio analysis circuit 114, sound field circuit 116 and profile selection circuit 118 are any type of executable instructions that can be implemented as, for example, hardware, firmware and/or software, or any combination thereof, which are all within the scope of the various teachings described.
- the audio analysis circuit 114 determines a characteristic of audio data.
- the audio analysis circuit can determine one or more characteristics of the audio data in a varying number of ways.
- the audio data includes both sound data (also referred to herein as sound content) and metadata.
- the audio data is stored in, for example, the memory 104.
- the audio data is streaming audio data received over a network connection (not shown) or stored in a remote memory device.
- the sound data is, for example, a song, a voice recording, or other similar type of recording.
- the metadata is data that is associated with the sound data and can be used to provide information about the sound data. For example, a song may have metadata such as artist, album, title, length, and genre, to name a few possibilities.
- the audio analysis circuit can analyze the metadata to determine a characteristic of the audio data.
- the audio analysis circuit analyzes the sound data portion of the audio data in order to determine a characteristic of the audio data.
- the sound data is made up of wave forms that can be analyzed by the processor.
- the wave form is stored, for example, as a wave file in memory.
- the wave file is analyzed, for example, using twelve tone analysis (from the low tones to the high tones).
- the twelve tone analysis provides information about the key of the music, the chord progression, beat, structure and rhythm of the music. This information can be used to infer the characteristics of the sound data.
- tempo e.g., beats per minute
- speed depends on tempo and rhythm
- dispersion variable in tempo
- major or minor type of chord
- notes per unit of time e.g., rhythm ratio
- the profile selection circuit 118 selects the sound profile corresponding to the characteristic of audio data.
- the audio data includes both sound data and metadata.
- the metadata includes, for example, genre data such as jazz, classical, rock, hip-hop, and metal.
- the profile selection circuit 118 may select a sound profile that best fits the genre that was determined by the audio analysis circuit by analyzing the metadata of the audio data.
- the profile selection circuit 118 may select a sound profile that best fits the characteristic of audio data that was determined by the audio analysis circuit by analyzing the sound data of the audio data.
- the profile selection circuit 118 may select a sound profile based on prior user interaction with the audio player 100.
- the sound profile is used by the sound field circuit 116 to adjust sound field settings.
- the sound profile selection circuit 118 is able to select a sound profile that will lead to automatic adjustments of the sound field settings such that the sound data (e.g., a song) is played back with, for example, equalizer settings, mode settings (for example, concert hall mode or surround sound mode), bass and treble that best match the song.
- the profile selection circuit 118 may be enabled to select sound field settings based upon factory set default settings, user defined preferences, preferences of a user that have been determined from previous user interactions with the audio player 100, or user interactions corresponding to a series of prompted questions the user responds to regarding sound field settings.
- the sound field circuit 116 adjusts sound field settings according to the sound profile.
- the sound profile is, for example, a file that is a collaboration of values for the sound field settings, That is, the sound profile is used by the sound field circuit 116 in order to properly set values of the different sound field settings.
- sound profiles can exist that are for a particular genre of music, for a particular person, and even for a particular audio track.
- FIG. 2 shown is a flow diagram illustrating a method of analyzing audio data on an audio player in accordance with one embodiment. The following steps can be implemented, for example, within circuitry of the audio player 200.
- the audio player retrieves the audio data.
- the audio data can be retrieved from, for example, a local music library 204, a music service 206, a local memory device of the audio player (e.g., a hard drive), or a portable memory device (e.g., a compact disk or DVD audio disk).
- the audio data can be retrieved when a users selects a song to play from the audio player or the audio player can retrieve the song prior to when the song is going to be played by the audio player.
- the audio player 200 determines if a smart sound program is enabled.
- the audio player plays back the audio data in step 216 and sound is output through an audio output (e.g., a speaker).
- the audio data that was retrieved by the audio player 200 is analyzed by the audio player in step 212.
- Fig. 3 discussed below, provides a detailed description of how the audio data is analyzed by the audio player.
- a sound profile is selected as part of the analysis of the audio data file in step 212.
- the audio player 200 adjusts sound field settings of the audio player 200 in accordance with the information contained in the sound profile that was selected in step 212.
- the audio data is output from the audio player with the adjusted sound field settings. As described above, by adjusting one or more of the various sound field settings, an improved listening experience can be obtained by the user 202 of the audio payer 200.
- a flow diagram is shown illustrating in more detail the analysis of audio data (step 212) as shown in the flow diagram of Fig. 2.
- the process begins in step 300 when the audio player determines if the audio data that was retrieved will be analyzed by looking at the metadata of the audio data. If not, the process continues at step 310. If it has been determined that the audio data should be analyzed by looking at the metadata, then the audio player, in step 302, determines whether the metadata is currently available. If the metadata is available, the process continues at step 308, If the metadata is not available, the audio player attempts to retrieve the metadata at step 304.
- the metadata can be retrieved from, for example, a remote database, a web service or a local database.
- one or more sound profiles are selected by the audio player based upon analysis of the metadata (e.g., determining a genre of the audio data). The selection can be based upon default settings, user defined preferences, or preferences of a user that have been determined from previous user interaction with the audio player.
- the audio player determines if the audio data should be analyzed by determining a characteristic of the sound data. If not, the process continues at step 316. If the audio player is going to analyze the audio data, the sound content (e.g., the wave forms or wave file of the audio content) is analyzed by the audio player in step 312. As described above, the sound data is made up of wave forms that can be analyzed by the processor of the audio player using twelve tone analysis (from the low tones to the high tones).
- the sound content e.g., the wave forms or wave file of the audio content
- the twelve tone analysis provides information about the key of the music, the chord progression, beat, structure and rhythm of the music which can be used to determine the characteristics of the sound data such as tempo (e.g., beats per minute), speed (depends on tempo and rhythm), dispersion (variance in tempo), major or minor, type of chord, notes per unit of time, and rhythm ratio.
- tempo e.g., beats per minute
- speed depends on tempo and rhythm
- dispersion variable in tempo
- major or minor type of chord
- notes per unit of time and rhythm ratio
- the twelve tone analysis provides information about the key of the music, the chord progression, beat, structure and rhythm of the music which can be used to determine the characteristics of the sound data such as tempo (e.g., beats per minute), speed (depends on tempo and rhythm), dispersion (variance in tempo), major or minor, type of chord, notes per unit of time, and rhythm ratio.
- the characteristics can then be used to select one or more sound profiles in step 314.
- step 316 the audio player determines if the audio data has been previously played by the audio player and if the audio player is going to select a sound profile based upon user interactions. If not, the process continues at step 322. If the audio data has been previously played by the audio player and if the audio player is to select a sound profile based upon user interactions, then the audio player recalls previous user interactions at step 318 during the playback of the audio file.
- the previous user interactions may be, for example, previously listening to audio data at particular sound field settings or adjusting the sound field settings during a previous playback of the audio data.
- user interaction can be a response to one or a series of prompted questions displayed to the user 202 which the user responds to by interacting with the audio player 200.
- step 320 the audio player selects one or more sound profiles based upon the user interactions with the audio player 200.
- the audio player selects the best matched sound profile with which to play back the audio data.
- the audio player may select between zero or more sound profiles. Having zero sound profiles to select from, for example, corresponds to no adjustments being made to the sound field settings. Having one sound profile to select from, for example, corresponds to adjusting the sound field settings according to the one sound profile. Having two sound profiles, for example, corresponds to the audio player selecting a sound profile from two of the three candidate profiles resulting from steps 308, 314, and 320. Having three sound profiles, for example, corresponds to the audio player selecting a sound profile from each of the three candidate profiles resulting from steps 308, 314, and 320.
- the audio player When there are a plurality of sound profiles, the audio player will select one sound profile and adjust the sound field accordingly.
- the audio player may select the one sound profile based upon factory settings or upon user interaction.
- the factory settings may establish a hierarchy of sound profile candidates such that a candidate profile based upon past user interaction with the player (step 320) trumps a candidate profile based upon metadata (step 308) which trumps a candidate profile based upon sound content (step 314).
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Stereophonic System (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
One embodiment can be characterized as a method of data analysis for an audio player comprising analyzing at least a portion of audio data; selecting a sound profile based upon the analysis of the audio data; adjusting sound field settings according to the sound profile; and outputting at least a portion of the audio data according to the sound field settings. Another embodiment can be characterized as an audio player device comprising an audio analysis circuit adapted to determine a characteristic of audio data; a profile selection circuit adapted to select a sound profile corresponding to the characteristic of audio data; and a sound field circuit adapted to adjust sound field settings according to the sound profile.
Description
METHOD AND APPARATUS FOR AUDIO DATA ANALYSIS IN AN AUDIO PLAYER
BACKGROUND OF THE INVENTION
1. Field of the Invention The present invention relates to audio players. More specifically, the present invention relates to an audio player adapted to analyze audio data and adjust output according to the analysis.
2. Discussion of the Related Art Most music players provide the capability to manually adjust the sound settings (for example, equalizer settings) that affect music playback. Many users will almost never change the sound settings because of a lack of convenience in the manner in which to adjust the sound settings. Additionally, once set, the listener rarely will re- program the sound settings as long as a similar type of music is being played back. Music players are, however, increasingly supporting the random playback of music, through functionality including, for example, song or track shuffle playback, play lists, music streaming and user-defined radio stations. This provides for much more frequent playback of dissimilar types of music during the time when a user is listening to music. This requires the user to re-program the sound settings more frequently in order to properly fit the type of music being played. For many listeners, frequently adjusting the sound settings can become annoying and degrades the overall music listening experience. Other listeners will simply stop adjusting the sound settings which also degrades the overall music listening experience.
SUMMARY OF THE INVENTION
The present invention generally relates to an audio player adapted to analyze audio data and adjust output according to the analysis. One embodiment can be characterized as a method of data analysis for an audio player comprising analyzing at least a portion of audio data; selecting a sound profile based upon the analysis of the audio data; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting. In a further embodiment, the step of analyzing at least a portion of audio data further comprises analyzing metadata. In yet another embodiment, the step of analyzing at least a portion of audio data further comprises analyzing sound content.
Another embodiment can be characterized as a method of data analysis for an audio player comprising recording user interaction with an audio player, the interaction corresponding to at least a portion of audio data; selecting a sound profile based upon the user interaction; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting. In some embodiments, the user interaction comprises listening to an audio track, adjusting the sound field setting or programming the sound profile by answering prompted questions.
A subsequent embodiment includes an audio player device comprising an audio analysis circuit adapted to determine a characteristic of audio data; a profile selection circuit adapted to select a sound profile corresponding to the characteristic of audio data; and a sound field circuit adapted to adjust sound field setting according to the sound profile.
BRIEF DESCRIPTION OF THE DRAWINGS
The above and other aspects, features and advantages of the present invention will be more apparent from the following more particular description thereof, presented in conjunction with the following drawings, wherein:
Fig. 1 is a block diagram illustrating an audio player in accordance with one embodiment;
Fig. 2 is a flow diagram illustrating a method of analyzing audio data in accordance with one embodiment; and Fig. 3 is a flow diagram illustrating in more detail the analysis of audio data as shown in the flow diagram of Fig. 2.
Corresponding reference characters indicate corresponding components throughout the several views of the drawings. Skilled artisans will appreciate that elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale. For example, the dimensions, sizing, and/or relative placement of some of the elements in the figures may be exaggerated relative to other elements to help to improve understanding of various embodiments of the present invention. Also, common but well-understood elements that are useful or necessary in a commercially feasible embodiment are often not depicted in order to facilitate a less obstructed view of these various embodiments of the present invention. It will also be understood that the terms and expressions used herein have the ordinary meaning as is usually accorded to such terms and expressions by those skilled in the corresponding respective areas of inquiry and study except where other specific meanings have otherwise been set forth herein.
DETAILED DESCRIPTION
The following description is not to be taken in a limiting sense, but is made merely for the purpose of describing the general principles of the invention. The scope of the invention should be determined with reference to the claims. The present embodiments address the problems described in the background while also addressing other additional problems as will be seen from the following detailed description.
Referring to Fig. 1, shown is a block diagram illustrating an audio player 100 in accordance with one embodiment. The audio player 100 includes a processor 102 with memory 104, an input interface 106, a decoder 108, a display 110 and an audio
output 112, The processor 102 includes an audio analysis circuit 114, a sound field circuit 116 and a profile selection circuit 118.
The audio player 100 can be one of many manufactured and sold audio players widely available, including for example, an MP3 player, a CD player, a DVD audio player, a computer, or other type of audio player. As will be described herein, the audio player 100 is an electronic device that is capable, through a combination of hardware, firmware and/or software, of receiving, analyzing and outputting audio data. The processor 102 has memory 104 and is operably coupled to the input interface 106, the decoder 108 and the display 110. The audio player 100 stores audio files in the memory 104 in the form of audio data. The processor 102 controls reading the audio data into or out of the memory 104, The decoder 108 decodes the audio data and outputs the decoded audio data to the audio output 112. The audio output 112 outputs the audio data as an audible signal that is heard by the user of the audio player 100. The audio output 112 is, for example, a speaker or an audio jack for use with a headphone set.
The memory 104 includes memory for storage of audio files, The memory 104 is, for example, a built-in hard disk drive, non- volatile "flash" memory, removable memory, such as a compact disk (CD), digital versatile disk (DVD), or any combination thereof. All or a portion of the memory may be in the form of one or more removable blocks, modules, or chips. The memory 104 need not be one physical memory device, but can include one or more separate memory devices.
The input interface 106 includes, for example, a keypad, a touchpad, a touch screen, a mouse, or other types of devices used to interact with an electronic device. During playback, the user may interact with the input interface 106 of the audio player 100 to adjust the sound field in a variety of ways. A sound field is defined by the physical characteristics of sound waves in a region of space. In the present application the sound field relating to an audio player is the sound that is emitted from an audio player. The sound field may be adjusted when a user interacts with the input interface 106 of the audio player 100 to adjust settings of the audio player 100, for example, equalizer settings, mode settings (for example, concert hall mode or surround sound mode), bass, treble, or other settings that affect the sound field. A particular arrangement
of the various settings (equalizer and mode, for example), in aggregate, will result in a complete sound field setup. Throughout this application, therefore, sound field setting(s) will be used to describe a particular arrangement of one or more of the settings of the audio player 100 that affect the sound field. In some embodiments, the input interface 106 is adapted to record user interactions to be stored in the memory 104. User interactions include, by way of example only, playing an audio track at a particular sound field setting, adjusting the sound field setting while listening to a track, programming sound field settings to correspond with a particular track or genre of track, or responding to prompted questions regarding sound field settings in relation to a particular track or genre of track.
The display 110 visually presents images corresponding to, for example, metadata, sound field settings, or other information pertinent to a user's interaction with and/or use of the audio player 100. The metadata includes, for example, the name of the song, the artist, the album title, the genre and the time period from when the song was created. In some embodiments, the display 110 may present questions for the user to respond to regarding sound field settings in relation to a particular track or genre of track. The processor 102 includes the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118. The audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118 represent functional circuitry within the audio player 100. The audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118 are implemented, in some embodiments, as software stored in the memory 104 and executed by the processor 102. As described herein, those skilled in the art will appreciate that circuit(s) can refer to dedicated fixed-purpose circuits and/or partially or wholly programmable platforms of various types and that these teachings are compatible with any such mode of deployment for the audio analysis circuit 114, the sound field circuit 116 and the profile selection circuit 118. The audio analysis circuit 114, sound field circuit 116 and profile selection circuit 118 are any type of executable instructions that can be implemented as, for example, hardware, firmware and/or software, or any combination thereof, which are all within the scope of the various teachings described.
The audio analysis circuit 114 determines a characteristic of audio data. The audio analysis circuit can determine one or more characteristics of the audio data in a varying number of ways. In one embodiment, the audio data includes both sound data (also referred to herein as sound content) and metadata. The audio data is stored in, for example, the memory 104. Alternatively, the audio data is streaming audio data received over a network connection (not shown) or stored in a remote memory device. The sound data is, for example, a song, a voice recording, or other similar type of recording. The metadata is data that is associated with the sound data and can be used to provide information about the sound data. For example, a song may have metadata such as artist, album, title, length, and genre, to name a few possibilities. The audio analysis circuit can analyze the metadata to determine a characteristic of the audio data. In another embodiment, the audio analysis circuit analyzes the sound data portion of the audio data in order to determine a characteristic of the audio data. The sound data is made up of wave forms that can be analyzed by the processor. The wave form is stored, for example, as a wave file in memory. The wave file is analyzed, for example, using twelve tone analysis (from the low tones to the high tones). The twelve tone analysis provides information about the key of the music, the chord progression, beat, structure and rhythm of the music. This information can be used to infer the characteristics of the sound data. Some of the features or characteristics of the sound data that can be extracted are tempo (e.g., beats per minute), speed (depends on tempo and rhythm), dispersion (variance in tempo), major or minor, type of chord, notes per unit of time, and rhythm ratio. By extracting different characteristics of the music, the characteristics can then be used by the profile selection circuit 118.
The profile selection circuit 118 selects the sound profile corresponding to the characteristic of audio data. As described above, in one embodiment, the audio data includes both sound data and metadata. The metadata includes, for example, genre data such as jazz, classical, rock, hip-hop, and metal. In some embodiments, the profile selection circuit 118 may select a sound profile that best fits the genre that was determined by the audio analysis circuit by analyzing the metadata of the audio data. In some embodiments, the profile selection circuit 118 may select a sound profile that best fits the characteristic of audio data that was determined by the audio analysis circuit by
analyzing the sound data of the audio data. In some embodiments, the profile selection circuit 118 may select a sound profile based on prior user interaction with the audio player 100. As will be described below, the sound profile is used by the sound field circuit 116 to adjust sound field settings. In this manner, the sound profile selection circuit 118 is able to select a sound profile that will lead to automatic adjustments of the sound field settings such that the sound data (e.g., a song) is played back with, for example, equalizer settings, mode settings (for example, concert hall mode or surround sound mode), bass and treble that best match the song. The profile selection circuit 118 may be enabled to select sound field settings based upon factory set default settings, user defined preferences, preferences of a user that have been determined from previous user interactions with the audio player 100, or user interactions corresponding to a series of prompted questions the user responds to regarding sound field settings.
The sound field circuit 116 adjusts sound field settings according to the sound profile. The sound profile is, for example, a file that is a collaboration of values for the sound field settings, That is, the sound profile is used by the sound field circuit 116 in order to properly set values of the different sound field settings. For example, sound profiles can exist that are for a particular genre of music, for a particular person, and even for a particular audio track.
Referring to Fig. 2, shown is a flow diagram illustrating a method of analyzing audio data on an audio player in accordance with one embodiment. The following steps can be implemented, for example, within circuitry of the audio player 200.
As shown, when a user 202 decides to play an audio file using the audio player (e.g., a portable audio player, a car stereo or a home stereo), in step 208, the audio player retrieves the audio data. The audio data can be retrieved from, for example, a local music library 204, a music service 206, a local memory device of the audio player (e.g., a hard drive), or a portable memory device (e.g., a compact disk or DVD audio disk). Additionally, the audio data can be retrieved when a users selects a song to play from the audio player or the audio player can retrieve the song prior to when the song is going to be played by the audio player. In step 210, the audio player 200 determines if a smart sound program is enabled. If the smart sound program is disabled, the audio player
plays back the audio data in step 216 and sound is output through an audio output (e.g., a speaker). If the smart sound program is enabled, the audio data that was retrieved by the audio player 200 is analyzed by the audio player in step 212. Fig. 3, discussed below, provides a detailed description of how the audio data is analyzed by the audio player. As will be discussed below, a sound profile is selected as part of the analysis of the audio data file in step 212. Next, in step 214, the audio player 200 adjusts sound field settings of the audio player 200 in accordance with the information contained in the sound profile that was selected in step 212. Following, in step 216, the audio data is output from the audio player with the adjusted sound field settings. As described above, by adjusting one or more of the various sound field settings, an improved listening experience can be obtained by the user 202 of the audio payer 200.
Referring to Fig. 3, a flow diagram is shown illustrating in more detail the analysis of audio data (step 212) as shown in the flow diagram of Fig. 2.
The process begins in step 300 when the audio player determines if the audio data that was retrieved will be analyzed by looking at the metadata of the audio data. If not, the process continues at step 310. If it has been determined that the audio data should be analyzed by looking at the metadata, then the audio player, in step 302, determines whether the metadata is currently available. If the metadata is available, the process continues at step 308, If the metadata is not available, the audio player attempts to retrieve the metadata at step 304. The metadata can be retrieved from, for example, a remote database, a web service or a local database. Next in step 308, one or more sound profiles are selected by the audio player based upon analysis of the metadata (e.g., determining a genre of the audio data). The selection can be based upon default settings, user defined preferences, or preferences of a user that have been determined from previous user interaction with the audio player.
Next, in step 310, the audio player determines if the audio data should be analyzed by determining a characteristic of the sound data. If not, the process continues at step 316. If the audio player is going to analyze the audio data, the sound content (e.g., the wave forms or wave file of the audio content) is analyzed by the audio player in step 312. As described above, the sound data is made up of wave forms that can be analyzed by the processor of the audio player using twelve tone analysis (from the low tones to the
high tones). The twelve tone analysis provides information about the key of the music, the chord progression, beat, structure and rhythm of the music which can be used to determine the characteristics of the sound data such as tempo (e.g., beats per minute), speed (depends on tempo and rhythm), dispersion (variance in tempo), major or minor, type of chord, notes per unit of time, and rhythm ratio. By extracting different characteristics of the music, the characteristics can then be used to select one or more sound profiles in step 314. The selection can be based upon, for example, default settings, user defined preferences, or preferences of a user that have been determined from previous user interaction with the audio player. Next, in step 316, the audio player determines if the audio data has been previously played by the audio player and if the audio player is going to select a sound profile based upon user interactions. If not, the process continues at step 322. If the audio data has been previously played by the audio player and if the audio player is to select a sound profile based upon user interactions, then the audio player recalls previous user interactions at step 318 during the playback of the audio file. The previous user interactions may be, for example, previously listening to audio data at particular sound field settings or adjusting the sound field settings during a previous playback of the audio data. In some embodiments, user interaction can be a response to one or a series of prompted questions displayed to the user 202 which the user responds to by interacting with the audio player 200. Next, in step 320, the audio player selects one or more sound profiles based upon the user interactions with the audio player 200.
Finally, in step 322, the audio player selects the best matched sound profile with which to play back the audio data. Depending upon the settings for the audio player and the flow followed in Fig. 3, the audio player may select between zero or more sound profiles. Having zero sound profiles to select from, for example, corresponds to no adjustments being made to the sound field settings. Having one sound profile to select from, for example, corresponds to adjusting the sound field settings according to the one sound profile. Having two sound profiles, for example, corresponds to the audio player selecting a sound profile from two of the three candidate profiles resulting from steps 308, 314, and 320. Having three sound profiles, for example, corresponds to the audio player selecting a sound profile from each of the three candidate profiles resulting from
steps 308, 314, and 320. When there are a plurality of sound profiles, the audio player will select one sound profile and adjust the sound field accordingly. The audio player may select the one sound profile based upon factory settings or upon user interaction. For example, the factory settings may establish a hierarchy of sound profile candidates such that a candidate profile based upon past user interaction with the player (step 320) trumps a candidate profile based upon metadata (step 308) which trumps a candidate profile based upon sound content (step 314).
While the invention herein disclosed has been described by means of specific embodiments and applications thereof, other modifications, variations, and arrangements of the present invention may be made in accordance with the above teachings other than as specifically described to practice the invention within the spirit and scope defined by the following claims.
Claims
1. A method of data analysis for an audio player comprising: analyzing at least a portion of audio data; selecting a sound profile based upon the analysis of the audio data; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting.
2. The method of claim 1 wherein the step of analyzing at least a portion of audio data further comprises analyzing metadata.
3. The method of claim 1 wherein the step of analyzing at least a portion of audio data further comprises analyzing sound content.
4. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises selecting from factory set sound profiles.
5. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises selecting from user created sound profiles.
6. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises selecting a sound profile based on an analysis of metadata.
7. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises selecting a sound profile based on an analysis of sound content.
8. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises: selecting a candidate profile based on an analysis of metadata; selecting a candidate profile based on an analysis of sound content; and selecting a best match profile from the group consisting of the candidate profile based on an analysis of metadata and the candidate profile based on an analysis of sound content.
9. The method of claim 1 wherein the step of selecting a sound profile based upon the analysis of the audio data further comprises: selecting a candidate profile based on an analysis of metadata; selecting a candidate profile based on an analysis of sound content; selecting a candidate profile based on a user interaction with an audio player, the interaction corresponding to at least a portion of audio data; and
- selecting a best match profile from the group consisting of the candidate profile based on an analysis of metadata, the candidate profile based on an analysis of sound content, and the candidate profile based on a user interaction with an audio player, the interaction corresponding to at least a portion of audio data.
10. A method of data analysis for an audio player comprising: recording user interaction with an audio player, the interaction corresponding to at least a portion of audio data; selecting a sound profile based upon the user interaction; adjusting a sound field setting according to the sound profile; and outputting at least a portion of the audio data according to the sound field setting.
11. The method of claim 10 wherein the user interaction comprises playing an audio track at a particular sound field setting.
12. The method of claim 11 further comprising adjusting the sound field setting while playing the audio track.
13. The method of claim 10 wherein the user interaction comprises programming a sound profile.
14. The method of claim 13 wherein programming the sound profile comprises responding to prompted questions from the audio player by interfacing with the audio player.
15. The method of claim 10 wherein the step of selecting a sound profile based upon the user interaction further comprises selecting from factory set sound profiles.
16. The method of claim 10 wherein the step of selecting a sound profile based upon the user interaction further comprises selecting from user created sound profiles.
17. An audio player device comprising: an audio analysis circuit adapted to determine a characteristic of audio data; a profile selection circuit adapted to select a sound profile corresponding to the characteristic of audio data; and
■ a sound field circuit adapted to adjust a sound field setting according to the sound profile.
18. The device of claim 17 wherein the audio analysis circuit is adapted to analyze metadata.
19. The device of claim 17 wherein the audio analysis circuit is adapted to analyze sound content.
20. The device of claim 17 wherein the profile selection circuit is adapted to select sound profiles from factory set sound profiles.
21. The device of claim 17 wherein the profile selection circuit is adapted to select sound profiles from user created sound profiles.
22. The device of claim 17 further comprising an input interface adapted to record user interaction with an audio player, the interaction corresponding to at least a portion of audio data.
23. The device of claim 17 further comprising a memory adapted to store audio data corresponding to user interaction with an audio player.
24. The method of claim 17 wherein the profile selection circuit is adapted to select: at least one candidate profile based on an analysis of metadata; at least one candidate profile based on an analysis of sound content; at least one candidate profile based on a user interaction with an audio player, the interaction corresponding to at least a portion of audio data; and a best match profile from the group consisting of the candidate profile based on the analysis of metadata, the candidate profile based on the analysis of sound content, and the candidate profile based on the user interaction with the audio player.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP06802551A EP1932391A4 (en) | 2005-09-16 | 2006-08-29 | Method and apparatus for audio data analysis in an audio player |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/229,298 US7774078B2 (en) | 2005-09-16 | 2005-09-16 | Method and apparatus for audio data analysis in an audio player |
US11/229,298 | 2005-09-16 |
Publications (2)
Publication Number | Publication Date |
---|---|
WO2007037889A2 true WO2007037889A2 (en) | 2007-04-05 |
WO2007037889A3 WO2007037889A3 (en) | 2007-09-27 |
Family
ID=37884142
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US2006/033666 WO2007037889A2 (en) | 2005-09-16 | 2006-08-29 | Method and apparatus for audio data analysis in an audio player |
Country Status (3)
Country | Link |
---|---|
US (2) | US7774078B2 (en) |
EP (1) | EP1932391A4 (en) |
WO (1) | WO2007037889A2 (en) |
Families Citing this family (48)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8180067B2 (en) * | 2006-04-28 | 2012-05-15 | Harman International Industries, Incorporated | System for selectively extracting components of an audio input signal |
US8036767B2 (en) | 2006-09-20 | 2011-10-11 | Harman International Industries, Incorporated | System for extracting and changing the reverberant content of an audio input signal |
KR100832360B1 (en) * | 2006-09-25 | 2008-05-26 | 삼성전자주식회사 | Method for controlling equalizer in digital media player and system thereof |
US7968787B2 (en) * | 2007-01-09 | 2011-06-28 | Yamaha Corporation | Electronic musical instrument and storage medium |
US20080175411A1 (en) * | 2007-01-19 | 2008-07-24 | Greve Jens | Player device with automatic settings |
JP2010518428A (en) | 2007-02-01 | 2010-05-27 | ミューズアミ, インコーポレイテッド | Music transcription |
JP2010518459A (en) * | 2007-02-14 | 2010-05-27 | ミューズアミ, インコーポレイテッド | Web portal for editing distributed audio files |
US7873040B2 (en) * | 2007-08-20 | 2011-01-18 | Stephen KARLSGODT | Internet radio player |
US8494257B2 (en) | 2008-02-13 | 2013-07-23 | Museami, Inc. | Music score deconstruction |
US7777122B2 (en) * | 2008-06-16 | 2010-08-17 | Tobias Hurwitz | Musical note speedometer |
US20100058048A1 (en) * | 2008-08-26 | 2010-03-04 | Advanced Micro Devices, Inc. | Profile Adjustment Module For Use With Data Processing System |
US7755526B2 (en) * | 2008-10-31 | 2010-07-13 | At&T Intellectual Property I, L.P. | System and method to modify a metadata parameter |
US20100205222A1 (en) * | 2009-02-10 | 2010-08-12 | Tom Gajdos | Music profiling |
CN102687536B (en) * | 2009-10-05 | 2017-03-08 | 哈曼国际工业有限公司 | System for the spatial extraction of audio signal |
US8515092B2 (en) * | 2009-12-18 | 2013-08-20 | Mattel, Inc. | Interactive toy for audio output |
US9990009B2 (en) * | 2009-12-22 | 2018-06-05 | Nokia Technologies Oy | Output control using gesture input |
US8350867B2 (en) | 2009-12-22 | 2013-01-08 | Ati Technologies Ulc | Image quality configuration apparatus, system and method |
US8793005B2 (en) * | 2010-09-10 | 2014-07-29 | Avid Technology, Inc. | Embedding audio device settings within audio files |
US20120294457A1 (en) * | 2011-05-17 | 2012-11-22 | Fender Musical Instruments Corporation | Audio System and Method of Using Adaptive Intelligence to Distinguish Information Content of Audio Signals and Control Signal Processing Function |
US20130053012A1 (en) * | 2011-08-23 | 2013-02-28 | Chinmay S. Dhodapkar | Methods and systems for determining a location based preference metric for a requested parameter |
US9665339B2 (en) | 2011-12-28 | 2017-05-30 | Sonos, Inc. | Methods and systems to select an audio track |
KR101945816B1 (en) * | 2012-06-08 | 2019-02-11 | 삼성전자주식회사 | Device and method for adjusting volume in terminal |
US9031244B2 (en) | 2012-06-29 | 2015-05-12 | Sonos, Inc. | Smart audio settings |
US8995687B2 (en) | 2012-08-01 | 2015-03-31 | Sonos, Inc. | Volume interactions for connected playback devices |
US10419556B2 (en) | 2012-08-11 | 2019-09-17 | Federico Fraccaroli | Method, system and apparatus for interacting with a digital work that is performed in a predetermined location |
US11184448B2 (en) | 2012-08-11 | 2021-11-23 | Federico Fraccaroli | Method, system and apparatus for interacting with a digital work |
US9473582B1 (en) * | 2012-08-11 | 2016-10-18 | Federico Fraccaroli | Method, system, and apparatus for providing a mediated sensory experience to users positioned in a shared location |
US9053710B1 (en) * | 2012-09-10 | 2015-06-09 | Amazon Technologies, Inc. | Audio content presentation using a presentation profile in a content header |
US20140369523A1 (en) * | 2013-02-15 | 2014-12-18 | Max Sound Corporation | Process for improving audio (api) |
CN104078050A (en) * | 2013-03-26 | 2014-10-01 | 杜比实验室特许公司 | Device and method for audio classification and audio processing |
US9226072B2 (en) | 2014-02-21 | 2015-12-29 | Sonos, Inc. | Media content based on playback zone awareness |
US9672213B2 (en) | 2014-06-10 | 2017-06-06 | Sonos, Inc. | Providing media items from playback history |
WO2016007899A1 (en) * | 2014-07-10 | 2016-01-14 | Rensselaer Polytechnic Institute | Interactive, expressive music accompaniment system |
US11132983B2 (en) | 2014-08-20 | 2021-09-28 | Steven Heckenlively | Music yielder with conformance to requisites |
DE102015005007B4 (en) * | 2015-04-21 | 2017-12-14 | Kronoton Gmbh | Method for improving the sound quality of an audio file |
KR20170030384A (en) * | 2015-09-09 | 2017-03-17 | 삼성전자주식회사 | Apparatus and Method for controlling sound, Apparatus and Method for learning genre recognition model |
DE102015223935A1 (en) * | 2015-12-01 | 2017-06-01 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | System for outputting audio signals and associated method and setting device |
US10750293B2 (en) | 2016-02-08 | 2020-08-18 | Hearing Instrument Manufacture Patent Partnership | Hearing augmentation systems and methods |
US10631108B2 (en) | 2016-02-08 | 2020-04-21 | K/S Himpp | Hearing augmentation systems and methods |
US10284998B2 (en) | 2016-02-08 | 2019-05-07 | K/S Himpp | Hearing augmentation systems and methods |
US10390155B2 (en) * | 2016-02-08 | 2019-08-20 | K/S Himpp | Hearing augmentation systems and methods |
US10341791B2 (en) | 2016-02-08 | 2019-07-02 | K/S Himpp | Hearing augmentation systems and methods |
EP3506255A1 (en) * | 2017-12-28 | 2019-07-03 | Spotify AB | Voice feedback for user interface of media playback device |
CN110010151A (en) * | 2018-12-31 | 2019-07-12 | 瑞声科技(新加坡)有限公司 | A kind of acoustic signal processing method and equipment, storage medium |
DE102019201615A1 (en) * | 2019-02-07 | 2020-08-13 | Volkswagen Aktiengesellschaft | Method for adjusting the sound characteristics when playing back successive audio tracks |
US11012780B2 (en) * | 2019-05-14 | 2021-05-18 | Bose Corporation | Speaker system with customized audio experiences |
US11636855B2 (en) | 2019-11-11 | 2023-04-25 | Sonos, Inc. | Media content based on operational data |
EP3889958A1 (en) * | 2020-03-31 | 2021-10-06 | Moodagent A/S | Dynamic audio playback equalization using semantic features |
Family Cites Families (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR0129989B1 (en) * | 1993-06-30 | 1998-10-01 | 김광호 | Automatic tone adjustment method and apparatus |
US5745583A (en) * | 1994-04-04 | 1998-04-28 | Honda Giken Kogyo Kabushiki Kaisha | Audio playback system |
US5530924A (en) * | 1994-07-05 | 1996-06-25 | Ford Motor Company | Radio station memory presets with stored audio effects |
US5596159A (en) * | 1995-11-22 | 1997-01-21 | Invision Interactive, Inc. | Software sound synthesis system |
US6341166B1 (en) * | 1997-03-12 | 2002-01-22 | Lsi Logic Corporation | Automatic correction of power spectral balance in audio source material |
US7096186B2 (en) * | 1998-09-01 | 2006-08-22 | Yamaha Corporation | Device and method for analyzing and representing sound signals in the musical notation |
WO2001013311A2 (en) * | 1999-08-12 | 2001-02-22 | I2Go, Inc. | Interactive audio and data player for delivery of selected content to a mobile user and obtaining a response therefrom |
US7022905B1 (en) * | 1999-10-18 | 2006-04-04 | Microsoft Corporation | Classification of information and use of classifications in searching and retrieval of information |
US20030007001A1 (en) | 2001-06-07 | 2003-01-09 | Philips Electronics North America Corporation | Automatic setting of video and audio settings for media output devices |
GB0116071D0 (en) * | 2001-06-30 | 2001-08-22 | Hewlett Packard Co | Improvements in audio reproduction |
JP2004536348A (en) | 2001-07-20 | 2004-12-02 | グレースノート インコーポレイテッド | Automatic recording identification |
KR100889438B1 (en) | 2001-09-11 | 2009-03-24 | 톰슨 라이센싱 | Method and apparatus for automatic equalization mode activation |
US20030187820A1 (en) * | 2002-03-29 | 2003-10-02 | Michael Kohut | Media management system and process |
US20060046685A1 (en) * | 2004-08-31 | 2006-03-02 | Hjelmeland Robert W | System and process for automatically adjusting the acoustic settings to best fit an audio system |
-
2005
- 2005-09-16 US US11/229,298 patent/US7774078B2/en not_active Expired - Fee Related
-
2006
- 2006-08-29 WO PCT/US2006/033666 patent/WO2007037889A2/en active Application Filing
- 2006-08-29 EP EP06802551A patent/EP1932391A4/en not_active Withdrawn
-
2010
- 2010-07-20 US US12/840,226 patent/US20100286806A1/en not_active Abandoned
Non-Patent Citations (1)
Title |
---|
See references of EP1932391A4 * |
Also Published As
Publication number | Publication date |
---|---|
US20100286806A1 (en) | 2010-11-11 |
EP1932391A4 (en) | 2011-04-20 |
EP1932391A2 (en) | 2008-06-18 |
WO2007037889A3 (en) | 2007-09-27 |
US7774078B2 (en) | 2010-08-10 |
US20070064954A1 (en) | 2007-03-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7774078B2 (en) | Method and apparatus for audio data analysis in an audio player | |
JP5318095B2 (en) | System and method for automatically beat-mixing a plurality of songs using an electronic device | |
KR101275467B1 (en) | Apparatus and method for controlling automatic equalizer of audio reproducing apparatus | |
US20080175411A1 (en) | Player device with automatic settings | |
KR20060116383A (en) | Method and apparatus for automatic setting equalizing functionality in a digital audio player | |
US20130030557A1 (en) | Audio player and operating method automatically selecting music type mode according to environment noise | |
JP2008532200A (en) | Scan shuffle to create playlist | |
CN105390144B (en) | A kind of audio-frequency processing method and apparatus for processing audio | |
CN101188132A (en) | Automatic setting method and device of balancer function of digital audio player | |
CN103208299B (en) | Recognize audio-frequence player device and the method for user | |
JP2006511845A (en) | Audio signal array | |
US20240314499A1 (en) | Techniques for audio track analysis to support audio personalization | |
US8375059B2 (en) | Electronic device and method therefor | |
WO2007060605A2 (en) | Device for and method of processing audio data items | |
KR101393714B1 (en) | Terminal and method for playing music thereof | |
CN115268828A (en) | Audio playing method, electronic equipment and readable storage medium | |
US8370356B2 (en) | Music search system, music search method, music search program and recording medium recording music search program | |
KR102265347B1 (en) | System for sound source playback changing sound sourse reproduction ptobability by user selection and method thereof | |
US20090192636A1 (en) | Media Modeling | |
JP2006048808A (en) | Audio apparatus | |
KR101082260B1 (en) | A character display method of mobile digital device | |
JP6474292B2 (en) | Karaoke equipment | |
KR100631651B1 (en) | Mobile terminal with music replay ability and method for displaying equalizer thereof | |
EP2083422A1 (en) | Media modelling | |
KR20090063453A (en) | Method for displaying words and music player using the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
NENP | Non-entry into the national phase |
Ref country code: DE |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2006802551 Country of ref document: EP |