WO2007060605A2 - Device for and method of processing audio data items - Google Patents

Device for and method of processing audio data items

Info

Publication number
WO2007060605A2
Authority
WO
WIPO (PCT)
Prior art keywords
audio data
audio
data items
transition segment
subsequent
Application number
PCT/IB2006/054352
Other languages
French (fr)
Other versions
WO2007060605A3 (en)
Inventor
Sander B. F. Van De Wijdeven
Original Assignee
Koninklijke Philips Electronics N.V.
Application filed by Koninklijke Philips Electronics N.V.
Publication of WO2007060605A2
Publication of WO2007060605A3

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00 Details of electrophonic musical instruments
    • G10H1/0033 Recording/reproducing or transmission of music for electrophonic musical instruments
    • G10H1/0041 Recording/reproducing or transmission of music for electrophonic musical instruments in coded form
    • G10H1/0058 Transmission between separate instruments or between individual components of a musical system
    • G10H2210/00 Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/071 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for rhythm pattern analysis or rhythm style recognition
    • G10H2210/101 Music Composition or musical creation; Tools or processes therefor
    • G10H2210/125 Medley, i.e. linking parts of different musical pieces in one single piece, e.g. sound collage, DJ mix
    • G10H2240/00 Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/011 Files or data streams containing coded musical information, e.g. for transmission
    • G10H2240/046 File format, i.e. specific or non-standard musical file format used in or adapted for electrophonic musical instruments, e.g. in wavetables
    • G10H2240/061 MP3, i.e. MPEG-1 or MPEG-2 Audio Layer III, lossy audio compression
    • G10H2240/121 Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
    • G10H2240/131 Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
    • G10H2250/00 Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/025 Envelope processing of music signals in, e.g. time domain, transform domain or cepstrum domain
    • G10H2250/035 Crossfade, i.e. time domain amplitude envelope control of the transition between musical sounds or melodies, obtained for musical purposes, e.g. for ADSR tone generation, articulations, medley, remix

Definitions

  • the invention relates to an audio data-processing device.
  • the invention also relates to an audio playback device.
  • the invention further relates to a method of processing audio data.
  • the invention relates to a program element.
  • the invention relates to a computer-readable medium.
  • Audio playback devices are becoming more and more important. Particularly, an increasing number of users buy hard disk-based audio players and other entertainment equipment.
  • Audio players such as portable MP3 players, which can mix songs together, are an attractive option for consumers. These players have the capability of mixing separate tracks together, in a DJ-like style, so that the transition between the tracks is more pleasant to listen to.
  • US 2003/0183064 A1 discloses a sequential playback system that is configured to select each sequential song on the basis of characteristics of an ending segment of each preceding song. Songs are selected on the basis of the characteristics of the overall theme of the selection, if any, and also on the basis of musical correspondence between songs. The correspondence may be based on the rhythm of each song, its notes and chords, and other musical characteristics of the songs. The end segment of each selected song is characterized, and the first segment of a candidate song that satisfies the overall selection criterion is compared with this characterization to determine a correspondence.
  • The audio player according to US 2003/0183064 A1 has mixing capabilities. Such an audio player can mix two songs together, because it contains software that can extract the audio parameters and apply them in mixing songs.
  • An audio data-processing device, an audio playback device, a method of processing audio data, a program element and a computer-readable medium as defined in the independent claims are provided.
  • an audio data-processing device which comprises a processing unit adapted to generate at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and a sending interface at which the at least one audio data transition segment can be provided for transmission to an audio playback device.
  • an audio playback device which comprises a receiver interface at which at least one audio data transition segment is receivable from an audio data-processing device, wherein the at least one audio data transition segment represents a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and an audio playback unit adapted to play back each audio data transition segment after the corresponding preceding one of the audio data items and before the corresponding subsequent one of the audio data items.
  • a method of processing audio data comprising the steps of generating at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and providing the at least one audio data transition segment for transmission to an audio playback device.
  • a program element which, when being executed by a processor, is adapted to control or carry out a method of processing audio data having the above-mentioned features.
  • a computer-readable medium in which a computer program is stored which, when being executed by a processor, is adapted to control or carry out a method of processing audio data having the above-mentioned features.
  • the audio processing in accordance with embodiments of the invention can be realized by a computer program, i.e. by software, or by using one or more special electronic optimization circuits, i.e. in hardware, or in a hybrid form, i.e. by means of software components and hardware components.
  • an audio data-processing device (for instance, a personal computer)
  • a processor (for instance, a microprocessor)
  • a processor of the audio data-processing device can generate a transition section (for instance, a smooth changeover between two songs) which can be provided for transmission (for instance, download) to an audio playback device (such as a portable MP3 player).
  • the apparatus generating the transition sections, on the one hand, (namely the audio data-processing device) and the apparatus playing back the songs with transition sections played back in between two songs, on the other hand, (namely the audio playback device) may be different physical entities. Therefore, a mixing feature may also be provided in a legacy audio playback device, i.e. in an audio playback device which does not have a DJ or mixing feature implemented.
  • transition portion-processing (for instance, performed on a computer)
  • mixed audio content (for instance, performed on an audio player)
  • Necessary data may be transferred in a smart manner from the computer to the audio player, i.e. only mixed portions (not the entire audio item to be played back) and playback instructions (for example, a playlist) can be transferred.
  • This may improve flexibility because individual audio content components may be transferred instead of one large pre-mixed block consisting of or comprising several audio items and mix parts. It may be advantageous that the audio player has installed capability to interpret the information transferred from the computer.
  • the audio data-processing device may be capable of finding a suitable order for the mixed audio items to be played back.
  • Such a mix may be in accordance with one of a plurality of mix options, for instance, a beat mix.
  • the previous song may be faded out, and the next song may be faded in.
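  • As an illustration of the fade-out/fade-in option, a minimal sketch of a linear crossfade follows. The function name `crossfade`, the representation of audio as plain lists of float samples and the linear gain ramp are assumptions made for illustration only; the text does not prescribe a particular fade curve.

```python
def crossfade(outgoing_tail, incoming_head):
    """Linearly fade out one segment while fading in another.

    Both arguments are equal-length lists of float samples (an assumption;
    a real implementation would operate on decoded PCM audio).
    """
    if len(outgoing_tail) != len(incoming_head):
        raise ValueError("segments must have the same length")
    n = len(outgoing_tail)
    mixed = []
    for i, (a, b) in enumerate(zip(outgoing_tail, incoming_head)):
        # Gain of the incoming song rises from 0 to 1 across the segment;
        # the gain of the outgoing song is the complement.
        gain_in = i / (n - 1) if n > 1 else 1.0
        mixed.append(a * (1.0 - gain_in) + b * gain_in)
    return mixed
```

  • The returned list would correspond to the mix portion that is stored as a separate file; the original songs are not modified by this sketch.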
  • the mix portion to be transferred may be an MP3 file, a ".wav"-file, or the like.
  • the audio player may read and interpret the transferred mix section. For mixing, the audio player may stop playback of the outgoing audio item a small time interval before its actual end (that is to say, it may skip the last seconds of the previous song), may then play back the mix section and subsequently start playback of the incoming audio item a short time interval after its actual beginning (that is to say, it may skip the first seconds of the subsequent song).
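  • The playback order described above can be sketched as a small schedule. The function name `playback_schedule`, its parameters and the tuple layout are hypothetical and only serve to illustrate the skip-play-skip sequence; all times are in seconds.

```python
def playback_schedule(len_out, len_in, skip_end, skip_start, mix_len):
    """Return (source, start_s, stop_s) entries for one transition.

    len_out / len_in: full lengths of the outgoing and incoming items.
    skip_end: seconds skipped at the end of the outgoing item.
    skip_start: seconds skipped at the beginning of the incoming item.
    mix_len: length of the transferred mix section.
    """
    if skip_end > len_out or skip_start > len_in:
        raise ValueError("cannot skip more than the track length")
    return [
        ("outgoing", 0.0, len_out - skip_end),  # stop before the actual end
        ("mix", 0.0, mix_len),                  # play the whole mix section
        ("incoming", skip_start, len_in),       # start after the actual beginning
    ]
```

  • For example, a 200-second song followed by a 180-second song with 10 and 8 seconds skipped would play the outgoing song up to 190 s, then the mix section, then the incoming song from 8 s onwards.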
  • A method is provided which allows legacy audio players, such as portable MP3 players, to play mixed playlists based on a "precooked" mix.
  • the mix transitions between tracks may be created on a PC before sending them to the audio player. It is possible that only differences between the original tracks and mixes are transferred to the audio player, wherein the original tracks may remain intact.
  • a legacy audio player may be provided, which may be capable of mixing two songs together, based on software installed on the audio data-processing device. Such software can extract the audio parameters and apply them in mixing songs. Thus, even legacy audio players which themselves lack mixing or transition section generation capabilities can be used for playing back mixed tracks as generated by the audio data-processing device. Thus, it is possible to allow legacy audio players to play mixed songs with the aid of precooked mixes. In other words, a method is provided for enabling existing audio players without mixing capabilities to play new sequence mix transitions.
  • A method is disclosed which uses a mechanism in which mixes are pre-created and then added to the playlist that is stored on the device, so that the result may sound identical or similar to that of an audio player that does have mixing capabilities.
  • the mixes may be pre-created on the PC and then uploaded with the music to the audio playback device when it is connected to the PC.
  • Mix transitions can also be provided by Internet services which have advanced methods of creating mix transitions, or have human (DJ)-created mix transitions offered to the public (for instance, for a fee).
  • Mixes may be created as follows.
  • a user can select the audio tracks for uploading in the synchronization application and place them in one or more playlists.
  • the application can then analyze them by looking at the tempo, key, chromagram, etc. of each song and determine the best or an acceptable transition between two different tracks.
  • the application can then mix the two tracks together and look at the difference between the two original tracks and the mix. It is possible to isolate the difference and store this difference as a small separate mix track.
  • the playlist may be altered by inserting the mixed tracks between the song tracks and then altering the play time of the tracks. Start time and stop time of the tracks may be adjusted in such a way that a transition "track 1-mix 12-track 2" is identical to the mix created by the application at the beginning by decreasing the end time of the first track and increasing the start time of the second track, etc.
  • the playlist and/or the tracks and/or the mixed tracks may be uploaded to the (for instance, portable) audio playback device.
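  • The step of isolating the difference between the full mix and the two original tracks can be sketched as follows. This is a simplified illustration under stated assumptions: tracks are lists of samples that can be compared exactly (lossy-coded audio would need a tolerance), and the function name `isolate_mix_segment` is hypothetical.

```python
def isolate_mix_segment(track_a, track_b, full_mix):
    """Split a full mix into the part that differs from both originals.

    Returns (end index for track A, mix segment, start index for track B),
    i.e. the adjusted stop time of the first track, the small separate
    mix track, and the adjusted start time of the second track.
    """
    # Longest prefix of the mix that is identical to the start of track A.
    prefix = 0
    while (prefix < len(track_a) and prefix < len(full_mix)
           and track_a[prefix] == full_mix[prefix]):
        prefix += 1
    # Longest suffix of the mix that is identical to the end of track B,
    # without overlapping the prefix that was already matched.
    suffix = 0
    while (suffix < len(track_b) and suffix < len(full_mix) - prefix
           and track_b[-1 - suffix] == full_mix[-1 - suffix]):
        suffix += 1
    mix_segment = full_mix[prefix:len(full_mix) - suffix]
    return prefix, mix_segment, len(track_b) - suffix
```

  • Playing track A up to the returned end index, then the mix segment, then track B from the returned start index reproduces the full mix, which corresponds to the "track 1-mix 12-track 2" transition mentioned above.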
  • the process of analyzing the songs and creating the transitions can also be performed at an earlier time (i.e. before connecting the audio playback device to the audio data-processing device), for instance, when entering music in the collections, so that the uploading process from the audio data-processing device to an audio playback device is faster.
  • specifying the start-to-stop times in a playlist might be accomplished by a modification of the audio player (for instance, of firmware of the audio player).
  • It is also possible to create all possible transitions between all the tracks that are available on the portable device and upload them to the audio player. If necessary or desired, adjustments may be carried out in the audio player (for instance, in firmware of the audio player), so that the audio player is able to insert mix tracks in the playlist instead of playing only the precooked playlists from the personal computer. This may improve the flexibility of the player, because it also allows features like shuffle play with mixes, in many cases with only a small additional amount of storage space on the device. This results from the fact that each mix usually covers only a few percent of the length of a single audio track and is correspondingly small.
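  • A minimal sketch of enumerating every transition that would have to be pre-created for shuffle play follows. The function name `all_transition_pairs` and the use of short string identifiers for tracks are assumptions for illustration.

```python
from itertools import permutations


def all_transition_pairs(track_ids):
    """Every ordered (preceding, subsequent) pair of distinct tracks.

    The pair (A, B) and the pair (B, A) are both listed, because a mix
    built from the end of A and the start of B differs from one built
    from the end of B and the start of A.
    """
    return list(permutations(track_ids, 2))
```

  • For n tracks this yields n * (n - 1) ordered pairs, so three tracks require six mix segments.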
  • a synchronization application removes the start and end part of a track when uploading it to the audio player. In this way, it may be dispensable to specify start and end times; tracks are played completely.
  • a field of application of embodiments of the invention is software for a PC that is used to communicate or upload audio files to a portable audio player or any other audio device, wherein mixed audio is desired.
  • a method is provided for performing the steps of identifying a sequence of audio tracks and determining transition boundaries between successive audio tracks of the sequence of audio tracks. Then, mixing of at least a number of successive audio tracks within transition boundaries may be carried out so as to obtain a number of mixed audio tracks. Subsequently, a playlist of the audio tracks and/or of the mixed audio tracks may be stored.
  • a playlist A-AB-B-BC-C can be changed to A-AD-D-DC-C without synchronizing the player to the personal computer just by altering the playlist on the player itself.
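  • The playlist change in this example can be sketched as follows, assuming the player stores a set of available mix segments keyed by the pair of tracks they connect. The function name `build_mixed_playlist`, the naming of a mix by concatenating the two track names, and the fallback of playing tracks unmixed when no segment is stored are all assumptions of this sketch.

```python
def build_mixed_playlist(track_order, available_mixes):
    """Interleave tracks with the mix segments stored on the player.

    track_order: list of track names in the desired playback order.
    available_mixes: set of (preceding, subsequent) pairs for which a
    mix segment is stored on the player.
    """
    playlist = []
    for i, track in enumerate(track_order):
        playlist.append(track)
        if i + 1 < len(track_order):
            pair = (track, track_order[i + 1])
            if pair in available_mixes:
                # Name the mix after the two tracks, e.g. "A" + "B" -> "AB".
                playlist.append(pair[0] + pair[1])
    return playlist


stored = {("A", "B"), ("B", "C"), ("A", "D"), ("D", "C")}
print("-".join(build_mixed_playlist(["A", "B", "C"], stored)))  # A-AB-B-BC-C
print("-".join(build_mixed_playlist(["A", "D", "C"], stored)))  # A-AD-D-DC-C
```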
  • Embodiments of the invention may have the advantage that a legacy player may be made suitable for playing music tracks with mixed transitions.
  • precooked audio mixes for legacy audio players are provided.
  • A method of creating music transitions using an external PC application may be provided by creating and storing audio mixes at music transitions based on playlists, and storing the mixes in the player memory when the audio player is docked to the PC. Then, all possible music transitions may be created and stored for the collection of music to allow shuffled playback. Furthermore, synchronization of music timings may be provided as an optional feature. Synchronization may include controlling a time correlation between different songs played back one after the other. In accordance with an embodiment, a mix segment may be stored and/or transferred to another device. Playlists may be adapted on the basis of these segments.
  • mix segments on portable players and synchronization strategies between a personal computer and a portable device may be provided.
  • The use of precooked mixes or transition pieces on portable players is possible. This may include synchronization strategies between the PC and the portable device, as well as a way to schedule mix segments in the playlist.
  • A method is provided which analyzes a playlist, creates mix transitions between the tracks in the playlist and stores these as a new file. This may leave the original tracks intact and does not necessarily create a single large track.
  • Such a mix is created by inserting the mix transition files as separate tracks to the playlist in between two subsequent songs.
  • a mix may be transferred to a player in various ways, for example, by transferring the original track in the playlist, transferring the transition tracks and/or transferring a playlist file that describes the mix.
  • a mix may be played on the player by selecting the corresponding playlist file on the player for playback.
  • One aspect is that a mix is created and transferred in components/blocks and not as a complete, static, single mix.
  • Such an approach may have the advantage that original tracks remain intact, so that they can still be played normally on the player. Only transitions between tracks have to be copied additionally, so that they can be reshuffled to create new playlists. In other words, if one has stored a number of tracks on the player and a number of transitions, one can create new playlists on the player (if the player supports this) without synchronizing with a PC. Two playlists with partly the same tracks may have a much larger overlap in size than two mixes so that storage capacity may be saved. Furthermore, in accordance with an embodiment, the system of dimension may be used for creating new mixed playlists. Only missing tracks and mix transitions have to be copied from the PC to the player, which may speed up the synchronization and operation.
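  • The statement that only missing tracks and mix transitions have to be copied can be sketched as a simple comparison between the new playlist and the files already on the player. The function name `files_to_copy` and the identification of files by short names are assumptions for illustration.

```python
def files_to_copy(playlist_entries, files_on_player):
    """Entries of the new playlist not yet stored on the player.

    The result keeps playlist order and lists each missing file once.
    """
    seen = set(files_on_player)
    missing = []
    for entry in playlist_entries:
        if entry not in seen:
            missing.append(entry)
            seen.add(entry)  # avoid copying the same file twice
    return missing
```

  • For a player that already holds A, AB, B, BC and C, synchronizing the new playlist A-AD-D-DC-C would only require copying AD, D and DC.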
  • Each audio data transition segment may be generated on the basis of an end portion of the corresponding preceding one of the audio data items and on the basis of a start portion of the corresponding subsequent one of the audio data items.
  • transitions between two songs do not always have to consist of a transition from the end of song 1 to the beginning of song 2.
  • the audio data-processing device may comprise a user interface adapted to allow a user to select the plurality of audio data items as a sub-group of a group of audio data items. For instance, a user may select only those items from all available audio items that are to be played back and/or transferred to the audio playback device. Based on this sub-group, the system may then automatically generate a playback scheme with mixed transitions between subsequent audio data items.
  • the user interface may be a graphical user interface (GUI) including a display (such as a cathode ray tube, a liquid crystal display, a plasma display device or the like) for displaying information to a human operator.
  • the user interface may comprise an input device allowing the user to input data or provide the system with control commands. Such an input device may include a keypad, a joystick, a track ball, or may even be a microphone of a voice recognition system.
  • the user interface may allow a human user to communicate with the system in a bi-directional manner
  • the processing unit of the audio data-processing device may be adapted to sort the plurality of audio data items in a succession (or in an order) so that transitions between adjacent audio data items are in accordance with at least one matching criterion.
  • This sorting may be carried out in such a manner that two subsequently played audio data items fit each other as regards a human listener's perception.
  • the matching or sorting criterion may be selected from the group consisting of tempo, key, chromagram, rhythm, notes, chords, genre (and other forms of metadata and audio-extracted features) of the adjacent audio data items. By taking this measure, it can be ensured that two subsequently played back audio items provide a nice audible experience to a user. Songs may be selected and/or combined and/or ordered on the basis of characteristics of the overall theme of the selection and/or on the basis of a musical correspondence between adjacent songs.
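  • One way to sort items so that adjacent items match is a greedy ordering on a single criterion. The sketch below uses tempo only; the greedy strategy, the function name `order_by_tempo` and the tempo values are assumptions for illustration, and the text does not specify a particular sorting algorithm.

```python
def order_by_tempo(tempos, first):
    """Greedy ordering: continue with the unplayed track closest in tempo.

    tempos: dict mapping track name -> tempo in beats per minute.
    first: name of the track that opens the playlist.
    """
    remaining = dict(tempos)
    current = first
    order = [current]
    del remaining[current]
    while remaining:
        # Ties are broken by track name so the result is deterministic.
        nxt = min(remaining,
                  key=lambda t: (abs(remaining[t] - tempos[current]), t))
        order.append(nxt)
        del remaining[nxt]
        current = nxt
    return order
```

  • Other criteria from the list above (key, chromagram, genre, etc.) could be combined into the sort key in the same manner.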
  • the processing unit may be adapted to generate a playlist which is indicative of the succession of the sorted audio data items bridged by the at least one audio data transition segment, wherein the playlist can be provided at the sending interface for transmission to the audio playback device.
  • a playlist may be a small file that can be copied from the audio data-processing device to the audio playback device so that information is communicated to the audio playback device about the sequence or other modalities of the playback of the audio data items.
  • the playlist may also be in a form which combines song sequence description (playlist parameters) and the mixes (audio data) combined to one file.
  • the processing unit may be adapted in such a manner that the at least one audio data transition segment can be provided without the corresponding audio data items at the sending interface for transmission to the audio playback device.
  • only the transition portions are transferred from the audio data-processing device to the audio playback device, wherein the main part of the audio items (which is, for instance, already stored in the audio playback device) does not have to be transmitted (again).
  • the transfer may be accelerated and the amount of transferred data may be reduced.
  • the processing unit is adapted in such a manner that the at least one audio data transition segment can be provided together with the corresponding audio data items at the sending interface for transmission to the audio playback device. Therefore, not only the transition segments are transferred, but also the main part of the audio content.
  • the processing unit may be adapted to generate a shortened preceding one of the plurality of audio data items by cutting an end portion of the preceding one of the plurality of audio data items, wherein the shortened preceding one of the plurality of audio data items can be provided at the sending interface for transmission to the audio playback device.
  • the processing unit may be adapted to generate a shortened subsequent one of the plurality of audio data items by cutting a start portion of the subsequent one of the plurality of audio data items, wherein the shortened subsequent one of the plurality of audio data items can be provided at the sending interface for transmission to the audio playback device.
  • The transition section is played directly between the two shortened parts of the two adjacent audio data items. This may reduce the storage required for storing the two main pieces and the transition portion.
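  • The shortening of the two adjacent items before upload can be sketched as follows. The function name `shorten_for_upload` and the representation of tracks as lists of samples are assumptions for illustration.

```python
def shorten_for_upload(track_a, track_b, cut_end, cut_start):
    """Cut the end of the preceding item and the start of the subsequent item.

    cut_end: number of samples removed from the end of track_a.
    cut_start: number of samples removed from the start of track_b.
    """
    if not 0 <= cut_end <= len(track_a) or not 0 <= cut_start <= len(track_b):
        raise ValueError("cut lengths must lie within the track lengths")
    # len(track_a) - cut_end is used instead of a negative index so that
    # cut_end == 0 leaves the track unchanged.
    shortened_a = track_a[:len(track_a) - cut_end]
    shortened_b = track_b[cut_start:]
    return shortened_a, shortened_b
```

  • The player can then play the shortened preceding item, the transition segment and the shortened subsequent item back to back, without needing start and stop times in the playlist.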
  • the processing unit may be further adapted to generate the at least one audio data transition segment exclusively/only in an operation state in which no audio playback device is communicatively connected to the sending interface. If the calculations carried out in the processing unit have already been carried out before actually sending this data to the audio playback device, the operation may be accelerated.
  • the processing unit may be adapted to generate the at least one audio data transition segment exclusively/only in an operation state in which the audio playback device is communicatively connected to the sending interface.
  • Bidirectional communication between the audio data-processing device and the audio playback device is then possible, which may improve the quality of processing and communication.
  • the processing unit may be adapted to generate the at least one audio data transition segment by including an audio transition piece.
  • mix transitions can be enhanced by introducing new musical elements to 'improve' the transition.
  • a synthetic drum sequence can be generated and mixed in so as to provide a smoother transition between two tracks.
  • the audio data-processing device may be adapted as a computer.
  • a computer may denote any processor-including entity that can be connected to an audio playback device.
  • a computer may be a conventional personal computer or a laptop or also a sophisticated server computer.
  • the computer may also be a mobile phone, a personal digital assistant (PDA) or any other device which allows central control of the communication with the audio playback device.
  • the audio playback device may be free of any capability of generating audio data transition segments.
  • When the audio playback device is a legacy audio player, i.e. an existing audio player without mixing capabilities, such a legacy device may provide a DJ function.
  • the audio playback unit may comprise at least one of the group consisting of a loudspeaker, an earpiece and a headset.
  • the communication between the audio playback device and such a reproduction unit may be wireless or wired.
  • the communication between the audio playback device and the audio data-processing device may be carried out in a wired manner (for instance, using a bus such as a USB connection) or in a wireless manner (for instance, via a WLAN or Bluetooth).
  • the audio playback device may be realized as a portable audio player, a DVD player, a CD player, a hard disk-based media player, an Internet radio device, a public entertainment device, an MP3 player, a vehicle entertainment device, a car entertainment device, a portable video player, a mobile phone, a medical communication system, a body-worn device, or a hearing aid device.
  • a "car entertainment device" may be a hi-fi system for an automobile.
  • Although an embodiment of the system according to the invention primarily intends to improve the playback of sound or audio data, it is also possible to apply the system to a combination of audio data and video data.
  • an embodiment of the invention may be implemented in audiovisual applications such as a video player in which a loudspeaker is used, or a home cinema system.
  • Fig. 1 shows an embodiment of an audio data-processing and playback system according to the invention.
  • Fig. 2 shows an audio data-processing scheme in accordance with an embodiment of the invention.
  • the audio data-processing and playback system 100 comprises an audio data-processing device 110 and an audio playback device 120.
  • the audio data-processing device 110 comprises a microprocessor 111 (for instance, a CPU) which is adapted to generate at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of an end portion of the corresponding preceding one of the audio data items and on the basis of a start portion of the corresponding subsequent one of the audio data items.
  • the microprocessor 111 is coupled with a sending interface 112 of the audio data-processing device 110, wherein the sending interface 112 may transmit the at least one audio data transition segment generated in the microprocessor 111 to the audio playback device 120.
  • a communication path 130 which may be a wired (for example, a USB connection) or wireless communication path (for example, a Bluetooth connection).
  • The audio data-processing device 110, which may be a personal computer (PC), comprises a storage device 113 (for instance, a hard disk, a CD, a DVD, etc.) on which audio content (like songs of a music CD or stories of an audio book) may be stored.
  • The microprocessor 111 is connected to a graphical user interface 114 via which a human user (not shown) may control the operation of the audio data-processing device 110.
  • the audio playback device 120 is a portable audio player, in more detail a legacy player that does not have a separate functionality implemented to allow mixing or other DJ-like features.
  • the audio playback device 120 comprises a receiver interface 121 that receives the audio data transition segment or segments generated by the audio data-processing device 110.
  • the receiver interface 121 may communicate with the sending interface 112 via the communication path 130.
  • the term "sending" in the component referred to as sending interface 112 and the term "receiver" in the component referred to as receiver interface 121 are only related to the transfer of audio data transition segments from the sending interface 112 to the receiver interface 121. Apart from this, it is possible that the interfaces 112, 121 communicate in a bi-directional manner, so that it is possible that data is sent from the receiver interface 121 to the sending interface 112, or vice versa.
  • the audio playback device 120 comprises a loudspeaker 122 for reproducing each audio data transition segment after the end portion of a corresponding preceding one of the audio data items and before the start portion of the corresponding subsequent one of the audio data items.
  • the audio playback device 120 comprises a microprocessor or control unit 123 which is coupled to the receiver interface 121, the loudspeaker 122, a storage device 124 (for instance, a hard disk or flash memory) and a user interface 125 via which a human operator may control the use of the audio playback device 120. For instance, via the user interface 125, it is possible for a human user to adjust loudness, equalization, start and stop, fast forward or fast reverse of playback, etc.
  • When a user connects the audio data-processing device 110 to the audio playback device 120, the user may input control information via the user interface 114. For instance, the user may select a plurality of audio pieces stored on the storage device 113 for reproduction. Furthermore, the microprocessor 111 may sort the user-defined audio data items in a particular succession so that transitions between adjacent audio data items are in accordance with particular criteria, such as a rhythm of two subsequent audio pieces.
  • the microprocessor 111 generates a playlist which is indicative of the sequence of sorted audio pieces bridged by the generated audio data transition segments. This information may be transferred from the sending interface 112 to the receiver interface 121 via the communication path 130.
  • the microprocessor 123 receives the mixed transitions and/or the playlists and/or the audio data items that may also be shortened by the microprocessor 111 so that a subsequent playback of the preceding shortened audio item, the transition portion and the subsequent audio item fit each other.
  • the microprocessor 123 may receive control commands from the user interface 125 and store received information on the hard disk 124.
  • the audio content may be played back via the loudspeaker 122 in accordance with the mixing functionality of the system.
  • Interface 130 can also be used to transfer the available (technical) capabilities of the player to the sound-processing unit, so that the processing unit can adapt its strategy of generating mix transitions to the capabilities of the player.
  • An audio data-processing scheme 100 in accordance with an embodiment of the invention will now be described with reference to Fig. 2.
  • the scheme described hereinafter may be implemented in a system 100 as shown in Fig. 1.
  • Fig. 2 shows, as an example, a first track 201, a second track 202 and a third track 203 of audio content (for instance, three songs selected by a user from a CD for playback) in a playlist 210.
  • the first to third tracks 201 to 203 are to be mixed.
  • the songs 201 to 203 are played back simply sequentially one after the other, as shown in a playlist 230.
  • a playlist 240 including a mixing feature may be generated as follows. After such a system has analyzed all the tracks 201 to 203 in the playlist 210 previously created by a user, audio data transition segments 204, 205 are determined or generated. This is indicated in the playlist 220.
  • the first track 201 and the second track 202, as well as the second track 202 and the third track 203 are mixed together so that they overlap, wherein the overlapping parts are denoted as the audio data transition segments 204, 205.
  • the parameters for the mixes are calculated and the mixes 204, 205 are created.
  • the playlist 220 shows that the audio data transition segment 204 is formed on the basis of an end portion 209 of the first track 201 and on the basis of a start portion 210 of the second track 202.
  • the audio data transition segment 205 is formed on the basis of an end portion 211 of the second track 202 and on the basis of a start portion 212 of the third track 203.
  • This adaptation is denoted as a first shortened track 206, a second shortened track 207 and a third shortened track 208.
  • After having transferred the necessary playlist 240 information, mixes 204, 205, etc. from the audio data-processing device 110 to the audio playback device 120, the audio playback device 120 reproduces audio content in accordance with the playlist 240.

Abstract

An audio data-processing device (110), comprising a processing unit (111) adapted to generate at least one audio data transition segment (204) representing a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the corresponding preceding one of the audio data items (201) and on the basis of a portion of the corresponding subsequent one of the audio data items (202), and a sending interface (112) at which the at least one audio data transition segment (204) can be provided for transmission to an audio playback device (120).

Description

Device for and method of processing audio data items
FIELD OF THE INVENTION
The invention relates to an audio data-processing device. The invention also relates to an audio playback device. The invention further relates to a method of processing audio data. Moreover, the invention relates to a program element.
Furthermore, the invention relates to a computer-readable medium.
BACKGROUND OF THE INVENTION
Audio playback devices are becoming more and more important. Particularly, an increasing number of users buy hard disk-based audio players and other entertainment equipment.
Audio players, such as portable MP3 players, which can mix songs together, are an attractive option for consumers. These players have the capability of mixing separate tracks together, in a DJ-like style, so that the transition between the tracks is more pleasant to listen to.
US 2003/0183064 A1 discloses a sequential playback system that is configured to select each sequential song on the basis of characteristics of an ending segment of each preceding song. Songs are selected on the basis of the characteristics of the overall theme of the selection, if any, and also on the basis of musical correspondence between songs. The correspondence may be based on the rhythm of each song, its notes and chords, and other musical characteristics of the songs. The end segment of each selected song is characterized, and the first segment of a candidate song that satisfies the overall selection criterion is compared with this characterization to determine a correspondence. If the first segment of the candidate song is inconsistent with the end segment of the previously selected song, another candidate song is found that satisfies the overall selection criterion and the first segment of this new candidate song is compared with the characterization. This process continues until a suitable candidate song is identified, or until a time limit is exceeded. Transition pieces are optionally provided to facilitate a smooth transition between songs. Thus, the audio player according to US 2003/0183064 A1 has mixing capabilities. Such an audio player can mix two songs together, because it contains software that can extract the audio parameters and apply them in mixing songs.
OBJECT AND SUMMARY OF THE INVENTION
It is an object of the invention to allow improved audio playback. In order to achieve the object defined above, an audio data-processing device, an audio playback device, a method of processing audio data, a program element and a computer-readable medium as defined in the independent claims are provided. In accordance with an embodiment of the invention, an audio data-processing device is provided, which comprises a processing unit adapted to generate at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and a sending interface at which the at least one audio data transition segment can be provided for transmission to an audio playback device.
In accordance with another embodiment of the invention, an audio playback device is provided, which comprises a receiver interface at which at least one audio data transition segment is receivable from an audio data-processing device, wherein the at least one audio data transition segment represents a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and an audio playback unit adapted to play back each audio data transition segment after the corresponding preceding one of the audio data items and before the corresponding subsequent one of the audio data items.
In accordance with another embodiment of the invention, a method of processing audio data is provided, the method comprising the steps of generating at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of a portion of the corresponding preceding one of the audio data items and on the basis of a portion of the corresponding subsequent one of the audio data items; and providing the at least one audio data transition segment for transmission to an audio playback device.
In accordance with yet another embodiment of the invention, a program element is provided, which, when being executed by a processor, is adapted to control or carry out a method of processing audio data having the above-mentioned features.
In accordance with a further embodiment of the invention, a computer-readable medium is provided, in which a computer program is stored which, when being executed by a processor, is adapted to control or carry out a method of processing audio data having the above-mentioned features.
The audio processing in accordance with embodiments of the invention can be realized by a computer program, i.e. by software, or by using one or more special electronic optimization circuits, i.e. in hardware, or in a hybrid form, i.e. by means of software components and hardware components.
In accordance with an embodiment of the invention, an audio data-processing device (for instance, a personal computer) is provided, which may have a sound-mixing capability of mixing subsequent items or tracks of audio data (for instance, two songs played one after the other). A processor (for instance, a microprocessor) of the audio data-processing device can generate a transition section (for instance, a smooth changeover between two songs) which can be provided for transmission (for instance, download) to an audio playback device (such as a portable MP3 player).
In other words, the apparatus generating the transition sections, on the one hand, (namely the audio data-processing device) and the apparatus playing back the songs with transition sections played back in between two songs, on the other hand, (namely the audio playback device) may be different physical entities. Therefore, a mixing feature may also be provided in a legacy audio playback device, i.e. in an audio playback device which does not have a DJ or mixing feature implemented.
In accordance with an embodiment of the invention, it is possible to separate transition portion-processing (for instance, performed on a computer) from playing back mixed audio content (for instance, performed on an audio player). Necessary data may be transferred in a smart manner from the computer to the audio player, i.e. only mixed portions (not the entire audio item to be played back) and playback instructions (for example, a playlist) can be transferred. This may improve flexibility because individual audio content components may be transferred instead of one large pre-mixed block consisting of or comprising several audio items and mix parts. It may be advantageous that the audio player has installed capability to interpret the information transferred from the computer. Furthermore, the audio data-processing device may be capable of finding a suitable order for the mixed audio items to be played back. Such a mix may be in accordance with one of a plurality of mix options, for instance, a beat mix. The previous song may be faded out, and the next song may be faded in. The mix portion to be transferred may be an MP3 file, a ".wav"-file, or the like.
The audio player may read and interpret the transferred mix section. For mixing, the audio player may stop playback of the outgoing audio item a small time interval before its actual end (that is to say, it may skip the last seconds of the previous song), may then play back the mix section and subsequently start playback of the incoming audio item a short time interval after its actual beginning (that is to say, it may skip the first seconds of the subsequent song).
In accordance with an embodiment of the invention, a method is provided, which allows legacy audio players, such as portable MP3 players, to play mixed playlists based on a "precooked" mix. The mix transitions between tracks may be created on a PC before sending them to the audio player. It is possible that only differences between the original tracks and mixes are transferred to the audio player, wherein the original tracks may remain intact.
Thus, a legacy audio player may be provided, which may be capable of mixing two songs together, based on software installed on the audio data-processing device. Such software can extract the audio parameters and apply them in mixing songs. Thus, even legacy audio players which themselves lack mixing or transition section generation capabilities can be used for playing back mixed tracks as generated by the audio data-processing device. Thus, it is possible to allow legacy audio players to play mixed songs with the aid of precooked mixes. In other words, a method is provided for enabling existing audio players without mixing capabilities to play new sequence mix transitions.
In accordance with an embodiment, a method is disclosed, using a mechanism in which mixes are pre-created and then added to the playlist that is stored on the device, so that the result may sound identical or similar to that on an audio player that does have mixing capabilities.
In order to play mixed audio tracks on a legacy player, the mixes may be pre-created on the PC and then uploaded with the music to the audio playback device when it is connected to the PC. Mix transitions can also be provided by Internet services which have advanced methods of creating mix transitions, or have human (DJ)-created mix transitions offered to the public (for instance, for a fee).
Mix transitions could also be exchanged between and/or provided by a group of people, for example friends, on bulletin boards, news groups, artist websites, etc.
Mixes may be created as follows.
When a portable audio player is connected to the personal computer, a user can select the audio tracks for uploading in the synchronization application and place them in one or more playlists. When the user has selected the tracks, the application can then analyze them by looking at the tempo, key, chromagram, etc. of each song and determine the best or an acceptable transition between two different tracks. The application can then mix the two tracks together and look at the difference between the two original tracks and the mix. It is possible to isolate the difference and store this difference as a small separate mix track.
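By way of illustration only, the creation of a small separate mix track may be sketched as follows. The sketch assumes that the tracks are available as lists of samples and that a plain linear crossfade is used; the description does not prescribe a particular mixing algorithm, and the names `make_transition` and `play_with_transition` are hypothetical.

```python
def make_transition(prev_samples, next_samples, overlap):
    """Crossfade the last `overlap` samples of the preceding track with the
    first `overlap` samples of the subsequent track (assumed linear fade)."""
    if overlap <= 0 or overlap > min(len(prev_samples), len(next_samples)):
        raise ValueError("overlap must fit inside both tracks")
    tail = prev_samples[-overlap:]   # end portion of the preceding track
    head = next_samples[:overlap]    # start portion of the subsequent track
    segment = []
    for i in range(overlap):
        fade_in = (i + 1) / (overlap + 1)  # weight of the incoming track
        segment.append(tail[i] * (1.0 - fade_in) + head[i] * fade_in)
    return segment


def play_with_transition(prev_samples, next_samples, segment):
    """Reassemble what a listener would hear: shortened preceding track,
    the transition segment, then the shortened subsequent track."""
    n = len(segment)
    return prev_samples[:-n] + segment + next_samples[n:]
```

Only the returned segment differs from the original tracks, so it is the only additional audio that would need to be stored; the two original tracks remain untouched.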
When all the transitions between the tracks have been created, the playlist may be altered by inserting the mixed tracks between the song tracks and then altering the play time of the tracks. Start time and stop time of the tracks may be adjusted in such a way that a transition "track 1-mix 12-track 2" is identical to the mix created by the application at the beginning by decreasing the end time of the first track and increasing the start time of the second track, etc. After this, the playlist and/or the tracks and/or the mixed tracks may be uploaded to the (for instance, portable) audio playback device.
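The playlist alteration described above may be sketched as follows, purely as an example. It assumes that each playlist entry is a tuple of file name, start time and stop time in seconds and that every mix covers the same overlap on both sides; the function name `build_mixed_playlist` and the mix naming scheme are hypothetical.

```python
def build_mixed_playlist(tracks, overlap):
    """tracks: list of (name, duration_seconds) in playback order.
    overlap: seconds of each track that are covered by a mix track.
    Returns a list of (name, start_seconds, stop_seconds) entries."""
    playlist = []
    last = len(tracks) - 1
    for i, (name, duration) in enumerate(tracks):
        # Increase the start time unless this is the first track.
        start = overlap if i > 0 else 0.0
        # Decrease the end time unless this is the last track.
        stop = duration - overlap if i < last else duration
        playlist.append((name, start, stop))
        if i < last:
            next_name = tracks[i + 1][0]
            # The mix track is inserted between the two song tracks
            # and is played in full.
            playlist.append((f"mix_{name}_{next_name}", 0.0, overlap))
    return playlist
```

For two tracks of 200 and 180 seconds and an overlap of 8 seconds, the first track would stop at 192 seconds, the mix would play for 8 seconds, and the second track would start at 8 seconds.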
In accordance with an embodiment, the process of analyzing the songs and creating the transitions can also be performed at an earlier time (i.e. before connecting the audio playback device to the audio data-processing device), for instance, when entering music in the collections, so that the uploading process from the audio data-processing device to an audio playback device is faster.
It is also possible that specifying the start-to-stop times in a playlist might be accomplished by a modification of the audio player (for instance, of firmware of the audio player).
In accordance with an embodiment, it is possible to create all possible transitions between all the tracks that are available on the portable device and upload them to the audio player. If necessary or desired, adjustments may be carried out in the audio player (for instance, in firmware of the audio player), so that the audio player is able to insert mix tracks in the playlist instead of playing only the precooked playlists from the personal computer. This may improve the flexibility of the player, because it also allows features such as shuffle play with mixes, in many cases with only a small additional amount of storage space on the device. This is because a mix usually covers only a few percent of the length of a single audio track and is correspondingly small in size.
If the adjustment of start and end times of tracks in the playlist is not possible or is not supported by the audio player, another embodiment of the invention also makes it possible for a synchronization application to remove the start and end part of a track when uploading it to the audio player. In this way, specifying start and end times may be dispensed with; the uploaded tracks are simply played in full.
It is also possible to upload the cut-off start and end pieces mentioned above as separate tracks, so that the original track can be reconstructed on the player when desired by playing start piece - trimmed track - end piece.
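The trimming and later reconstruction may be illustrated by the following sketch, which assumes that a track is a list of samples and that the lengths of the cut-off pieces are known; `split_track` is a hypothetical helper, not part of any existing synchronization application.

```python
def split_track(samples, start_len, end_len):
    """Cut a track into (start piece, trimmed track, end piece)."""
    if start_len < 0 or end_len < 0 or start_len + end_len > len(samples):
        raise ValueError("cut-off pieces must fit inside the track")
    cut = len(samples) - end_len
    start_piece = samples[:start_len]
    trimmed = samples[start_len:cut]
    end_piece = samples[cut:]
    return start_piece, trimmed, end_piece
```

Playing the three pieces one after the other reproduces the original track, whereas playing only the trimmed track leaves room for the mix tracks before and after it.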
A field of application of embodiments of the invention is software for a PC that is used to communicate or upload audio files to a portable audio player or any other audio device, wherein mixed audio is desired. According to the invention, a method is provided for performing the steps of identifying a sequence of audio tracks and determining transition boundaries between successive audio tracks of the sequence of audio tracks. Then, mixing of at least a number of successive audio tracks within transition boundaries may be carried out so as to obtain a number of mixed audio tracks. Subsequently, a playlist of the audio tracks and/or of the mixed audio tracks may be stored.
For example, in an audio player with tracks A, B, C, D, ... and transitions (i.e. mixed audio tracks) AB, BC, AD, DC, ..., stored on it, a playlist A-AB-B-BC-C can be changed to A-AD-D-DC-C without synchronizing the player to the personal computer just by altering the playlist on the player itself. Embodiments of the invention may have the advantage that a legacy player may be made suitable for playing music tracks with mixed transitions. Thus, in accordance with an embodiment, precooked audio mixes for legacy audio players are provided.
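The playlist change in the example above may be sketched as follows. The sketch assumes that a transition is named by concatenating the names of the two tracks it bridges, as in the example; the function name `mixed_playlist` is hypothetical, and falling back to plain sequential playback when no transition is stored is an assumption.

```python
def mixed_playlist(order, transitions):
    """order: track names in the desired sequence.
    transitions: set of transition names stored on the player, e.g. {"AB"}.
    Returns the playlist with stored transitions inserted between tracks."""
    if not order:
        return []
    result = []
    for current, following in zip(order, order[1:]):
        result.append(current)
        name = current + following
        if name in transitions:
            result.append(name)  # a precooked transition is available
        # Otherwise the two tracks are simply played one after the other.
    result.append(order[-1])
    return result


stored = {"AB", "BC", "AD", "DC"}
print(mixed_playlist(["A", "B", "C"], stored))
print(mixed_playlist(["A", "D", "C"], stored))
```

Both playlists are derived on the player from the same stored transitions, without any renewed synchronization with the personal computer.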
In a legacy audio player, a method of creating music transitions, using an external PC application may be provided by creating and storing audio mixes at music transitions, based on playlists and storing the mix to the player memory, when the audio player is docked to the PC. Then, all possible music transitions may be created and stored for the collection of music to allow shuffled playback. Furthermore, synchronization of music timings may be provided as an optional feature. Synchronization may include controlling a time correlation between different songs played back one after the other. In accordance with an embodiment, a mix segment may be stored and/or transferred to another device. Playlists may be adapted on the basis of these segments. Furthermore, the use of mix segments on portable players and synchronization strategies between a personal computer and a portable device may be provided. Thus, the use of precooked mixes or transition pieces on portable players is possible. This may include synchronization strategies between PC and portable device and it describes a way to schedule mix segments in the playlist.
According to the invention, a method is provided, which analyzes a playlist and creates mix transitions between the tracks in the playlist and stores these as a new file. This may leave the original tracks intact and does not necessarily create a single large track. Such a mix is created by inserting the mix transition files as separate tracks to the playlist in between two subsequent songs. A mix may be transferred to a player in various ways, for example, by transferring the original track in the playlist, transferring the transition tracks and/or transferring a playlist file that describes the mix. A mix may be played on the player by selecting the corresponding playlist file on the player for playback. One aspect is that a mix is created and transferred in components/blocks and not as a complete, static, single mix.
Such an approach may have the advantage that original tracks remain intact, so that they can still be played normally on the player. Only transitions between tracks have to be copied additionally, so that they can be reshuffled to create new playlists. In other words, if one has stored a number of tracks on the player and a number of transitions, one can create new playlists on the player (if the player supports this) without synchronizing with a PC. Two playlists with partly the same tracks may have a much larger overlap in size than two mixes so that storage capacity may be saved. Furthermore, in accordance with an embodiment, the system of dimension may be used for creating new mixed playlists. Only missing tracks and mix transitions have to be copied from the PC to the player, which may speed up the synchronization and operation.
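As an illustration of copying only the missing tracks and mix transitions, a synchronization step might be sketched as follows. It assumes that playlist entries and files on the player are identified by name; `files_to_copy` is a hypothetical helper.

```python
def files_to_copy(playlist, on_player):
    """Return the playlist entries (tracks and transitions) that are not
    yet stored on the player, in playlist order and without duplicates."""
    missing = []
    seen = set()
    for item in playlist:
        if item not in on_player and item not in seen:
            seen.add(item)
            missing.append(item)
    return missing
```

For a player that already holds A, B, C, AB and BC, a new playlist A-AD-D-DC-C would require only AD, D and DC to be copied.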
Further embodiments of the audio data-processing device will be described hereinafter. However, these embodiments also apply to the audio playback device, the method of processing audio data, the program element and the computer-readable medium. Each audio data transition segment may be generated on the basis of an end portion of the corresponding preceding one of the audio data items and on the basis of a start portion of the corresponding subsequent one of the audio data items. However, transitions between two songs do not always have to consist of a transition from the end of song 1 to the beginning of song 2. In some types of mixing, it is also possible to mix, for example, the chorus of song 1 with the second verse of song 2, or mix the chorus of song 1 with the third verse of song 1.
The audio data-processing device may comprise a user interface adapted to allow a user to select the plurality of audio data items as a sub-group of a group of audio data items. For instance, a user may select only those items from all available audio items that are to be played back and/or transferred to the audio playback device. Based on this sub-group, the system may then automatically generate a playback scheme with mixed transitions between subsequent audio data items. The user interface may be a graphical user interface (GUI) including a display (such as a cathode ray tube, a liquid crystal display, a plasma display device or the like) for displaying information to a human operator. Furthermore, the user interface may comprise an input device allowing the user to input data or provide the system with control commands. Such an input device may include a keypad, a joystick, a track ball, or may even be a microphone of a voice recognition system. The user interface may allow a human user to communicate with the system in a bi-directional manner.
The processing unit of the audio data-processing device may be adapted to sort the plurality of audio data items in a succession (or in an order) so that transitions between adjacent audio data items are in accordance with at least one matching criterion. This sorting may be carried out in such a manner that two subsequently played audio data items fit each other as regards a human listener's perception. Thus, the matching or sorting criterion may be selected from the group consisting of tempo, key, chromagram, rhythm, notes, chords, genre (and other forms of metadata and audio-extracted features) of the adjacent audio data items. By taking this measure, it can be ensured that two subsequently played back audio items provide a pleasant listening experience to a user. Songs may be selected and/or combined and/or ordered on the basis of characteristics of the overall theme of the selection and/or on the basis of a musical correspondence between adjacent songs.
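One conceivable way of sorting by a single matching criterion is sketched below. It assumes that the tempo of each item is already known in beats per minute and uses a simple greedy nearest-tempo ordering; both the choice of tempo as the only criterion and the greedy strategy are assumptions made for illustration, and `order_by_tempo` is a hypothetical name.

```python
def order_by_tempo(tempos):
    """tempos: dict mapping track name to tempo in beats per minute.
    Starts with the first track given and repeatedly appends the remaining
    track whose tempo is closest to that of the track just chosen."""
    remaining = dict(tempos)
    if not remaining:
        return []
    current = next(iter(remaining))
    current_bpm = remaining.pop(current)
    order = [current]
    while remaining:
        # Ties are broken by track name so that the result is deterministic.
        current = min(remaining,
                      key=lambda n: (abs(remaining[n] - current_bpm), n))
        current_bpm = remaining.pop(current)
        order.append(current)
    return order
```

Other criteria named above, such as key or genre, could be combined into the distance measure in the same manner.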
The processing unit may be adapted to generate a playlist which is indicative of the succession of the sorted audio data items bridged by the at least one audio data transition segment, wherein the playlist can be provided at the sending interface for transmission to the audio playback device. Such a playlist may be a small file that can be copied from the audio data-processing device to the audio playback device so that information is communicated to the audio playback device about the sequence or other modalities of the playback of the audio data items. The playlist may also be in a form which combines the song sequence description (playlist parameters) and the mixes (audio data) in one file.
Several different mixes can also be combined into one large (audio) file that serves as a repository for mix transitions. The file would contain all of the separate mix transitions concatenated, and when a playlist is played, only the segment of the file containing the correct transition is played.
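A repository of concatenated mix transitions may be sketched as follows. The sketch assumes that each mix is available as a byte string and that an index of offsets and lengths is kept alongside the repository; the description does not specify how segments are located, so the index and the names `build_repository` and `read_transition` are hypothetical.

```python
def build_repository(mixes):
    """mixes: dict mapping transition name to its audio data (bytes).
    Returns (blob, index), where index[name] == (offset, length)."""
    blob = bytearray()
    index = {}
    for name, data in mixes.items():
        index[name] = (len(blob), len(data))
        blob.extend(data)  # append the mix to the end of the repository
    return bytes(blob), index


def read_transition(blob, index, name):
    """Return only the segment of the repository holding one transition."""
    offset, length = index[name]
    return blob[offset:offset + length]
```

During playback of a playlist, only the slice returned by `read_transition` would be decoded and played, rather than the whole repository file.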
The processing unit may be adapted in such a manner that the at least one audio data transition segment can be provided without the corresponding audio data items at the sending interface for transmission to the audio playback device. In this embodiment, only the transition portions are transferred from the audio data-processing device to the audio playback device, wherein the main part of the audio items (which is, for instance, already stored in the audio playback device) does not have to be transmitted (again). By taking this measure, the transfer may be accelerated and the amount of transferred data may be reduced.
However, it is also possible that the processing unit is adapted in such a manner that the at least one audio data transition segment can be provided together with the corresponding audio data items at the sending interface for transmission to the audio playback device. Therefore, not only the transition segments are transferred, but also the main part of the audio content.
The processing unit may be adapted to generate a shortened preceding one of the plurality of audio data items by cutting an end portion of the preceding one of the plurality of audio data items, wherein the shortened preceding one of the plurality of audio data items can be provided at the sending interface for transmission to the audio playback device.
Additionally or alternatively, the processing unit may be adapted to generate a shortened subsequent one of the plurality of audio data items by cutting a start portion of the subsequent one of the plurality of audio data items, wherein the shortened subsequent one of the plurality of audio data items can be provided at the sending interface for transmission to the audio playback device.
By taking this measure, it is possible that the transition section is directly played between the two shortened parts of the two adjacent audio data items. This may reduce the storage required for storing the two main pieces and the transition portion.
The processing unit may be further adapted to generate the at least one audio data transition segment exclusively/only in an operation state in which no audio playback device is communicatively connected to the sending interface. If the calculations carried out in the processing unit have already been carried out before actually sending this data to the audio playback device, the operation may be accelerated.
Alternatively, the processing unit may be adapted to generate the at least one audio data transition segment exclusively/only in an operation state in which the audio playback device is communicatively connected to the sending interface. In such a scenario, a bidirectional communication between the audio data-processing device and the audio playback device is possible that may improve the quality of processing and communication.
The processing unit may be adapted to generate the at least one audio data transition segment by including an audio transition piece. Thus, mix transitions can be enhanced by introducing new musical elements to 'improve' the transition. For example, a synthetic drum sequence can be generated and mixed in so as to provide a smoother transition between two tracks.
The audio data-processing device may be adapted as a computer. In the context of this description, a computer may denote any processor-including entity that can be connected to an audio playback device. For instance, such a computer may be a conventional personal computer or a laptop or also a sophisticated server computer. In principle, the computer may also be a mobile phone, a personal digital assistant (PDA) or any other device which allows central control of the communication with the audio playback device.
Further embodiments of the audio playback device will be described hereinafter. However, these embodiments also apply to the audio data-processing device, the method of processing audio data, the program element and the computer-readable medium.
The audio playback device may be free of any capability of generating audio data transition segments. In other words, it is also possible to use an audio playback device without mixing capabilities in combination with an audio data-processing device so that the audio playback device, when operated in conjunction with the audio data-processing device, can provide a mixing capability. Thus, even in a scenario in which the audio playback device is a legacy audio player, i.e. an existing audio player without mixing capabilities, such a legacy device may provide a DJ function.
The audio playback unit may comprise at least one of the group consisting of a loudspeaker, an earpiece and a headset. The communication between the audio playback device and such a reproduction unit may be wireless or wired.
Similarly, the communication between the audio playback device and the audio data-processing device may be carried out in a wired manner (for instance, using a bus such as a USB connection) or in a wireless manner (for instance, via a WLAN or Bluetooth). The audio playback device may be realized as a portable audio player, a DVD player, a CD player, a hard disk-based media player, an Internet radio device, a public entertainment device, an MP3 player, a vehicle entertainment device, a car entertainment device, a portable video player, a mobile phone, a medical communication system, a body-worn device, or a hearing aid device. A "car entertainment device" may be a hi-fi system for an automobile.
However, although the embodiments of the system according to the invention primarily intend to improve the playback of sound or audio data, it is also possible to apply the system for a combination of audio data and video data. For instance, an embodiment of the invention may be implemented in audiovisual applications such as a video player in which a loudspeaker is used, or a home cinema system.
These and other aspects of the invention are apparent from and will be elucidated with reference to the embodiments described hereinafter.
BRIEF DESCRIPTION OF THE DRAWINGS
In the drawing,
Fig. 1 shows an embodiment of an audio data-processing and playback system according to the invention.
Fig. 2 shows an audio data-processing scheme in accordance with an embodiment of the invention.
DESCRIPTION OF EMBODIMENTS
The illustrations in the drawing are schematic. In the different Figures, similar or identical elements are denoted by the same reference signs. An embodiment of an audio data-processing and playback system 100 according to the invention will now be described with reference to Fig. 1.
The audio data-processing and playback system 100 comprises an audio data-processing device 110 and an audio playback device 120.
The audio data-processing device 110 comprises a microprocessor 111 (for instance, a CPU) which is adapted to generate at least one audio data transition segment representing a transition between a preceding one of a plurality of audio data items and a subsequent one of the plurality of audio data items, wherein each audio data transition segment is generated on the basis of an end portion of the corresponding preceding one of the audio data items and on the basis of a start portion of the corresponding subsequent one of the audio data items.
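The description does not prescribe a particular mixing algorithm for the transition segment. As a minimal sketch, assuming that each audio data item is available as a list of mono samples and that a plain linear crossfade is acceptable, the end portion of the preceding item and the start portion of the subsequent item could be combined as follows. The function name `make_transition_segment` and the `overlap` parameter are illustrative assumptions and do not appear in the source.

```python
def make_transition_segment(preceding, subsequent, overlap):
    """Return a transition segment of `overlap` samples.

    Assumption: a linear crossfade between the last `overlap` samples of
    `preceding` and the first `overlap` samples of `subsequent`. The source
    only states that both portions are used, not how they are mixed.
    """
    if overlap <= 0 or overlap > min(len(preceding), len(subsequent)):
        raise ValueError("overlap must be positive and fit inside both items")

    end_portion = preceding[-overlap:]     # end portion of the preceding item
    start_portion = subsequent[:overlap]   # start portion of the subsequent item

    segment = []
    for i in range(overlap):
        fade_in = i / overlap              # gain of the incoming item, rising from 0
        fade_out = 1.0 - fade_in           # gain of the outgoing item, falling from 1
        segment.append(fade_out * end_portion[i] + fade_in * start_portion[i])
    return segment
```

A real implementation would probably also align beats and tempo before mixing; this sketch only illustrates that the segment is derived from both portions.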
The microprocessor 111 is coupled with a sending interface 112 of the audio data-processing device 110, wherein the sending interface 112 may transmit the at least one audio data transition segment generated in the microprocessor 111 to the audio playback device 120. This may be carried out via a communication path 130 which may be a wired (for example, a USB connection) or wireless communication path (for example, a Bluetooth connection).
Furthermore, the audio data-processing device 110, which may be a personal computer (PC), comprises a storage device 113 (for instance, a hard disk, a CD, a DVD, etc.) on which audio content (like songs of a music CD or stories of an audio book) may be stored.
Furthermore, the microprocessor 111 is connected to a graphical user interface 114 via which a human user (not shown) may control the operation of the audio data-processing device 110.
The audio playback device 120 is a portable audio player, more specifically a legacy player that does not have a separate functionality implemented to allow mixing or other DJ-like features.
The audio playback device 120 comprises a receiver interface 121 that receives the audio data transition segment or segments generated by the audio data-processing device 110. For this purpose, the receiver interface 121 may communicate with the sending interface 112 via the communication path 130. The term "sending" in the component referred to as sending interface 112 and the term "receiver" in the component referred to as receiver interface 121 are only related to the transfer of audio data transition segments from the sending interface 112 to the receiver interface 121. Apart from this, it is possible that the interfaces 112, 121 communicate in a bi-directional manner, so that it is possible that data is sent from the receiver interface 121 to the sending interface 112, or vice versa.
Furthermore, the audio playback device 120 comprises a loudspeaker 122 for reproducing each audio data transition segment after the end portion of a corresponding preceding one of the audio data items and before the start portion of the corresponding subsequent one of the audio data items.
Moreover, the audio playback device 120 comprises a microprocessor or control unit 123 which is coupled to the receiver interface 121, the loudspeaker 122, a storage device 124 (for instance, a hard disk or flash memory) and a user interface 125 via which a human operator may control the use of the audio playback device 120. For instance, via the user interface 125, it is possible for a human user to adjust loudness, equalization, start and stop, fast forward or fast reverse of playback, etc.
When a user connects the audio data-processing device 110 to the audio playback device 120, he may input control information via the user interface 114. For instance, the user may select a plurality of audio pieces stored on the storage device 113 for reproduction. Furthermore, the microprocessor 111 may sort the user-defined audio data items in a particular succession so that transitions between adjacent audio data items are in accordance with particular criteria, such as a rhythm of two subsequent audio pieces.
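The sorting step can be illustrated with a small sketch. It assumes that a tempo value in beats per minute is already known for each selected audio piece and uses a greedy nearest-neighbour ordering, so that adjacent pieces have similar tempi. Both the `(title, bpm)` tuple representation and the greedy strategy are assumptions made for illustration; the source names rhythm only as one possible criterion and does not specify a sorting algorithm.

```python
def sort_by_tempo(items, first=0):
    """Order (title, bpm) tuples so that adjacent items have similar tempo.

    Assumption: greedy ordering. Start from items[first], then repeatedly
    append the remaining item whose tempo is closest to the last chosen one.
    """
    if not items:
        return []

    remaining = list(items)
    ordered = [remaining.pop(first)]
    while remaining:
        last_bpm = ordered[-1][1]
        closest = min(remaining, key=lambda item: abs(item[1] - last_bpm))
        remaining.remove(closest)
        ordered.append(closest)
    return ordered
```

For instance, the selection `[("a", 120), ("b", 90), ("c", 124), ("d", 95)]` would be ordered as a, c, d, b. A greedy ordering is simple but not necessarily optimal over the whole succession.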
As a result of the calculation, the microprocessor 111 generates a playlist which is indicative of the sequence of sorted audio pieces bridged by the generated audio data transition segments. This information may be transferred from the sending interface 112 to the receiver interface 121 via the communication path 130. The microprocessor 123 receives the mixed transitions and/or the playlists and/or the audio data items, which may also be shortened by the microprocessor 111 so that the preceding shortened audio item, the transition portion and the subsequent audio item fit each other when played back in succession. The microprocessor 123 may receive control commands from the user interface 125 and store received information on the storage device 124. Based on the user operation via the user interface 125, the audio content may be played back via the loudspeaker 122 in accordance with the mixing functionality of the system. The communication path 130 can also be used to transfer the available (technical) capabilities of the audio playback device 120 to the audio data-processing device 110, so that the audio data-processing device 110 can adapt its strategy of generating mix transitions to the capabilities of the player.
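The capability exchange mentioned above is not specified further in the description. The following sketch shows one way in which a processing device might adapt its strategy, assuming that the player reports its supported audio formats, its free storage and whether it can handle playlists. All names here (`PlayerCapabilities`, `choose_strategy`, the individual fields and the default size per transition) are hypothetical and serve only to illustrate the idea.

```python
from dataclasses import dataclass


@dataclass
class PlayerCapabilities:
    # Hypothetical capability fields; the source does not enumerate any.
    supported_formats: tuple
    free_storage_bytes: int
    supports_playlists: bool


def choose_strategy(caps, wanted_transitions, bytes_per_transition=500_000,
                    preferred_formats=("aac", "mp3", "wav")):
    """Pick an encoding format and the number of transitions to transfer."""
    # Use the first preferred format that the player can decode.
    fmt = next((f for f in preferred_formats if f in caps.supported_formats), None)
    if fmt is None:
        # No common format: no transition segments can be transferred.
        return {"format": None, "transitions": 0, "send_playlist": False}

    # Do not generate more transitions than fit into the player's storage.
    fits = caps.free_storage_bytes // bytes_per_transition
    return {
        "format": fmt,
        "transitions": min(wanted_transitions, fits),
        "send_playlist": caps.supports_playlists,
    }
```

In this sketch, a player that decodes only MP3 and WAV and has room for two transition segments would receive both segments in MP3 format.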
An audio data-processing scheme 100 in accordance with an embodiment of the invention will now be described with reference to Fig. 2. The scheme described hereinafter may be implemented in a system 100 as shown in Fig. 1.
Fig. 2 shows, as an example, a first track 201, a second track 202 and a third track 203 of audio content (for instance, three songs selected by a user from a CD for playback) in a playlist 210. The first to third tracks 201 to 203 are to be mixed. On a conventional legacy player lacking any mixing functionality, the songs 201 to 203 are played back simply sequentially one after the other, as shown in a playlist 230.
However, in accordance with an embodiment of the invention, a playlist 240 including a mixing feature may be generated as follows. After such a system has analyzed all the tracks 201 to 203 in the playlist 210 previously created by a user, audio data transition segments 204, 205 are determined or generated. This is indicated in the playlist 220.
Then, the first track 201 and the second track 202, as well as the second track 202 and the third track 203 are mixed together so that they overlap, wherein the overlapping parts are denoted as the audio data transition segments 204, 205. The parameters for the mixes are calculated and the mixes 204, 205 are created.
The playlist 220 shows that the audio data transition segment 204 is formed on the basis of an end portion 209 of the first track 201 and on the basis of a start portion 210 of the second track 202. In a similar manner, the audio data transition segment 205 is formed on the basis of an end portion 211 of the second track 202 and on the basis of a start portion 212 of the third track 203.
After this, insertion of the mixes 204, 205 between the tracks 201 and 202 and between the tracks 202 and 203, respectively, alters the original playlist 210, and the playback times of the tracks 201, 202, 203 are adapted to represent this mix. This adaptation is denoted as a first shortened track 206, a second shortened track 207 and a third shortened track 208.
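The resulting playlist structure and the adapted playback times can be sketched as follows. The sketch assumes that every mix has the same length as the portions it is formed from, so that each track is shortened by exactly the overlap at each of its mixed ends; the source does not state this explicitly. The function name `build_mixed_playlist` and the tuple layout are illustrative assumptions.

```python
def build_mixed_playlist(durations, overlaps):
    """Interleave shortened tracks with mixes.

    durations: playback time of each original track, in seconds.
    overlaps:  length of the mix between track i and track i + 1, in seconds.
    Returns a list of ("track" | "mix", index, seconds) entries.
    """
    if len(overlaps) != len(durations) - 1:
        raise ValueError("need exactly one overlap per adjacent track pair")

    playlist = []
    for i, duration in enumerate(durations):
        cut_start = overlaps[i - 1] if i > 0 else 0.0         # start portion moved into the previous mix
        cut_end = overlaps[i] if i < len(overlaps) else 0.0   # end portion moved into the next mix
        shortened = duration - cut_start - cut_end
        if shortened < 0:
            raise ValueError("overlaps exceed the length of track %d" % i)
        playlist.append(("track", i, shortened))
        if i < len(overlaps):
            playlist.append(("mix", i, overlaps[i]))
    return playlist
```

With three tracks of 200 s, 180 s and 240 s and overlaps of 10 s and 8 s, the entries are a 190 s track, a 10 s mix, a 162 s track, an 8 s mix and a 232 s track. Under the stated assumption, the total playing time is 602 s, which is the sum of the original durations minus the sum of the overlaps.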
After having transferred the necessary playlist 240 information, mixes 204, 205, etc. from the audio data-processing device 110 to the audio playback device 120, the audio playback device 120 reproduces audio content in accordance with the playlist 240.
It should be noted that use of the verb "comprise" and its conjugations does not exclude other elements or steps, and use of the article "a" or "an" does not exclude a plurality. Also elements described in association with different embodiments may be combined. It should also be noted that reference signs in the claims shall not be construed as limiting the scope of the claims.

CLAIMS:
1. An audio data-processing device (110) comprising a processing unit (111) adapted to generate at least one audio data transition segment (204) representing a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the preceding one of the audio data items (201) and on the basis of a portion of the subsequent one of the audio data items (202); a sending interface (112) at which the at least one audio data transition segment (204) can be provided for transmission to an audio playback device (120).
2. The audio data-processing device (110) according to claim 1, wherein each audio data transition segment (204) is generated on the basis of an end portion of the corresponding preceding one of the audio data items (201) and on the basis of a start portion of the corresponding subsequent one of the audio data items (202).
3. The audio data-processing device (110) according to claim 1, comprising a user interface (114) adapted to enable a user to select the plurality of audio data items (201 to 203) as a sub-group of a group of audio data items.
4. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted to sort the plurality of audio data items (201 to 203) in a succession so that transitions between adjacent audio data items (201, 202) match with at least one matching criterion.
5. The audio data-processing device (110) according to claim 4, wherein the at least one matching criterion is selected from the group consisting of tempo, key, chromogram, rhythm, notes, chords and genre of the adjacent audio data items.
6. The audio data-processing device (110) according to claim 4, wherein the processing unit (111) is adapted to generate a playlist (240) which is indicative of the succession of the sorted audio data items (201, 202) bridged by the at least one audio data transition segment (204), wherein the playlist (240) can be provided at the sending interface (112) for transmission to the audio playback device (120).
7. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted in such a manner that the at least one audio data transition segment (204) can be provided without the corresponding audio data items (201, 202) at the sending interface (112) for transmission to the audio playback device (120).
8. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted in such a manner that the at least one audio data transition segment (204) can be provided together with the corresponding audio data items (201, 202) at the sending interface (112) for transmission to the audio playback device (120).
9. The audio data-processing device (110) according to claim 2, wherein the processing unit (111) is adapted to generate a shortened preceding one of the plurality of audio data items (206) by cutting an end portion of the preceding one of the plurality of audio data items (201), wherein the shortened preceding one of the plurality of audio data items (206) can be provided at the sending interface (112) for transmission to the audio playback device (120).
10. The audio data-processing device (110) according to claim 2, wherein the processing unit (111) is adapted to generate a shortened subsequent one of the plurality of audio data items (207) by cutting a start portion of the subsequent one of the plurality of audio data items (202), wherein the shortened subsequent one of the plurality of audio data items (207) can be provided at the sending interface (112) for transmission to the audio playback device (120).
11. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted to generate the at least one audio data transition segment (204) selectively in an operation state in which no audio playback device (120) is communicatively connected to the sending interface (112).
12. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted to generate the at least one audio data transition segment (204) selectively in an operation state in which the audio playback device (120) is communicatively connected to the sending interface (112).
13. The audio data-processing device (110) according to claim 1, wherein the processing unit (111) is adapted to generate the at least one audio data transition segment (204) by including an audio transition piece.
14. The audio data-processing device (110) according to claim 1, adapted as a computer.
15. An audio playback device (120) comprising a receiver interface (121) at which at least one audio data transition segment (204) is receivable from an audio data-processing device (110), wherein the at least one audio data transition segment (204) represents a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the preceding one of the audio data items (201) and on the basis of a portion of the subsequent one of the audio data items (202); an audio playback unit (122) adapted to play back each audio data transition segment (204) after the preceding one of the audio data items (201) and before the subsequent one of the audio data items (202).
16. The audio playback device (120) according to claim 15, wherein each audio data transition segment (204) is generated on the basis of an end portion of the corresponding preceding one of the audio data items (201) and on the basis of a start portion of the corresponding subsequent one of the audio data items (202).
17. The audio playback device (120) according to claim 15, being free of a capability of generating audio data transition segments (204).
18. The audio playback device (120) according to claim 15, wherein the audio playback unit (122) comprises at least one of the group consisting of a loudspeaker, an earpiece and a headset.
19. The audio playback device (120) according to claim 15, realized as at least one of the group consisting of a portable audio player, a DVD player, a CD player, a hard disk-based media player, an Internet radio device, a public entertainment device, an MP3 player, a hi-fi system, a vehicle entertainment device, a car entertainment device, a portable video player, a mobile phone, a medical communication system, a body-worn device, and a hearing aid device.
20. A method of processing audio data, the method comprising the steps of: generating at least one audio data transition segment (204) representing a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the preceding one of the audio data items (201) and on the basis of a portion of the subsequent one of the audio data items (202); and providing the at least one audio data transition segment (204) for transmission to an audio playback device (120).
21. A program element, which, when being executed by a processor (111), is adapted to control or carry out a method of processing audio data, the method comprising the steps of: generating at least one audio data transition segment (204) representing a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the preceding one of the audio data items (201) and on the basis of a portion of the subsequent one of the audio data items (202); and providing the at least one audio data transition segment (204) for transmission to an audio playback device (120).
22. A computer-readable medium, in which a computer program is stored which, when being executed by a processor (111), is adapted to control or carry out a method of processing audio data, the method comprising the steps of: generating at least one audio data transition segment (204) representing a transition between a preceding one of a plurality of audio data items (201) and a subsequent one of the plurality of audio data items (202), wherein each audio data transition segment (204) is generated on the basis of a portion of the preceding one of the audio data items (201) and on the basis of a portion of the subsequent one of the audio data items (202); and providing the at least one audio data transition segment (204) for transmission to an audio playback device (120).
PCT/IB2006/054352 2005-11-25 2006-11-21 Device for and method of processing audio data items WO2007060605A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP05111344.7 2005-11-25
EP05111344 2005-11-25

Publications (2)

Publication Number Publication Date
WO2007060605A2 (en) 2007-05-31
WO2007060605A3 (en) 2007-08-16

Family

ID=37946377

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2006/054352 WO2007060605A2 (en) 2005-11-25 2006-11-21 Device for and method of processing audio data items

Country Status (1)

Country Link
WO (1) WO2007060605A2 (en)


* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5747716A (en) * 1996-01-23 1998-05-05 Yamaha Corporation Medley playback apparatus with adaptive editing of bridge part
US6243725B1 (en) * 1997-05-21 2001-06-05 Premier International, Ltd. List building system
US20010039872A1 (en) * 2000-05-11 2001-11-15 Cliff David Trevor Automatic compilation of songs
US20030183064A1 (en) * 2002-03-28 2003-10-02 Shteyn Eugene Media player with "DJ" mode

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8525012B1 (en) 2011-10-25 2013-09-03 Mixwolf LLC System and method for selecting measure groupings for mixing song data
US9070352B1 (en) 2011-10-25 2015-06-30 Mixwolf LLC System and method for mixing song data using measure groupings
US9111519B1 (en) 2011-10-26 2015-08-18 Mixwolf LLC System and method for generating cuepoints for mixing song data
GB2496304A (en) * 2011-10-31 2013-05-08 Omnifone Ltd Managing playback of media content
WO2013160532A1 (en) * 2012-04-27 2013-10-31 Nokia Corporation Method and apparatus for switching between presentations of two media items

Also Published As

Publication number Publication date
WO2007060605A3 (en) 2007-08-16

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application
NENP Non-entry into the national phase in: Ref country code: DE
122 EP: PCT application non-entry in European phase. Ref document number: 06821513; Country of ref document: EP; Kind code of ref document: A2