WO2015093744A1 - Multimedia apparatus, music composing method thereof, and song correcting method thereof - Google Patents

Multimedia apparatus, music composing method thereof, and song correcting method thereof Download PDF

Info

Publication number
WO2015093744A1
WO2015093744A1 PCT/KR2014/011463 KR2014011463W WO2015093744A1 WO 2015093744 A1 WO2015093744 A1 WO 2015093744A1 KR 2014011463 W KR2014011463 W KR 2014011463W WO 2015093744 A1 WO2015093744 A1 WO 2015093744A1
Authority
WO
WIPO (PCT)
Prior art keywords
midi data
user
controller
score
information
Prior art date
Application number
PCT/KR2014/011463
Other languages
French (fr)
Inventor
Sang-Bae Chon
Sung-Min Kim
Sang-Mo Son
Original Assignee
Samsung Electronics Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Samsung Electronics Co., Ltd. filed Critical Samsung Electronics Co., Ltd.
Priority to EP14872744.9A priority Critical patent/EP3066662A4/en
Publication of WO2015093744A1 publication Critical patent/WO2015093744A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10GREPRESENTATION OF MUSIC; RECORDING MUSIC IN NOTATION FORM; ACCESSORIES FOR MUSIC OR MUSICAL INSTRUMENTS NOT OTHERWISE PROVIDED FOR, e.g. SUPPORTS
    • G10G1/00Means for the representation of music
    • G10G1/04Transposing; Transcribing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/0033Recording/reproducing or transmission of music for electrophonic musical instruments
    • G10H1/0041Recording/reproducing or transmission of music for electrophonic musical instruments in coded form
    • G10H1/0058Transmission between separate instruments or between individual components of a musical system
    • G10H1/0066Transmission between separate instruments or between individual components of a musical system using a MIDI interface
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10GREPRESENTATION OF MUSIC; RECORDING MUSIC IN NOTATION FORM; ACCESSORIES FOR MUSIC OR MUSICAL INSTRUMENTS NOT OTHERWISE PROVIDED FOR, e.g. SUPPORTS
    • G10G3/00Recording music in notation form, e.g. recording the mechanical operation of a musical instrument
    • G10G3/04Recording music in notation form, e.g. recording the mechanical operation of a musical instrument using electrical means
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H1/00Details of electrophonic musical instruments
    • G10H1/0008Associated control or indicating means
    • G10H1/0025Automatic or semi-automatic music composition, e.g. producing random music, applying rules from music theory or modifying a musical piece
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/155User input interfaces for electrophonic musical instruments
    • G10H2220/211User input interfaces for electrophonic musical instruments for microphones, i.e. control of musical parameters either directly from microphone signals or by physically associated peripherals, e.g. karaoke control switches or rhythm sensing accelerometer within the microphone casing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/155User input interfaces for electrophonic musical instruments
    • G10H2220/351Environmental parameters, e.g. temperature, ambient light, atmospheric pressure, humidity, used as input for musical purposes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2220/00Input/output interfacing specifically adapted for electrophonic musical tools or instruments
    • G10H2220/155User input interfaces for electrophonic musical instruments
    • G10H2220/441Image sensing, i.e. capturing images or optical patterns for musical purposes or musical control purposes
    • G10H2220/455Camera input, e.g. analyzing pictures from a video camera and using the analysis results as control data
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2230/00General physical, ergonomic or hardware implementation of electrophonic musical tools or instruments, e.g. shape or architecture
    • G10H2230/005Device type or category
    • G10H2230/021Mobile ringtone, i.e. generation, transmission, conversion or downloading of ringing tones or other sounds for mobile telephony; Special musical data formats or protocols therefor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2240/00Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/075Musical metadata derived from musical analysis or for use in electrophonic musical instruments
    • G10H2240/085Mood, i.e. generation, detection or selection of a particular emotional content or atmosphere in a musical piece

Definitions

  • Apparatuses and methods consistent with the exemplary embodiments relate to a multimedia apparatus, a music composing method thereof, and a song correcting method thereof, and more particularly, to a multimedia apparatus capable of composing music according to a user interaction and correcting a song sung by a user, a music composing method thereof, and a song correcting method thereof.
  • MIDI musical instrument digital interface
  • a song can only be composed by using the user’s voice. That is, there are limits to composing a song using other interactions and only the user's voice can be used.
  • Exemplary embodiments address the above disadvantages and other disadvantages not described above. Also, the exemplary embodiments are not required to overcome the disadvantages described above, and exemplary embodiments may not overcome any of the problems described above.
  • An exemplary embodiment provides a multimedia apparatus capable of composing music using diverse types of user interactions and video data, and a music composing method thereof.
  • An exemplary embodiment also provides a multimedia apparatus capable of searching for a song sung by the user and correcting the song sung by the user, and a song correcting method thereof.
  • a music composing method includes setting a type of musical instrument digital interface (MIDI) data according to a user’s input, sensing a user interaction, analyzing the sensed user interaction and determining a beat and a pitch, and generating MIDI data using the set type of MIDI data and the determined beat and pitch.
  • MIDI musical instrument digital interface
  • the type of MIDI data including setting at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
  • the method may further include receiving an image, and obtaining emotion information using at least one of color information, motion information, and spatial information of the received image.
  • the MIDI data may be generated using the emotion information.
  • the method may further include sensing at least one of a weather, a temperature, a humidity, and an illumination, and generating emotion information using the sensed at least one of the weather, the temperature, the humidity, and the illumination.
  • the MIDI data may be generated using the emotion information.
  • the method may further include generating a score using the determined beat and pitch, and displaying the generated score.
  • the method may further include modifying the MIDI data using the displayed generated score.
  • the method may further include generating a previous measure of MIDI data and a subsequent measure of MIDI data of the generated MIDI data using the generated MIDI data, and generating a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data.
  • the user interaction may be one of humming by the user, a touch made by the user, and a motion made by the user.
  • the method may further include mixing and outputting the MIDI data and the humming by the user when the user interaction is the humming by the user.
  • a multimedia apparatus includes an inputter configured to receive a user command to set a type of musical instrument digital interface (MIDI) data, a sensor configured to sense a user interaction, and a controller configured to analyze the sensed user interaction and determine a beat and a pitch, and to generate MIDI data using the set type of MIDI data and the determined beat and pitch.
  • MIDI musical instrument digital interface
  • the inputter may receive a user command to set at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
  • the multimedia apparatus may further include an image inputter configured to receive an image.
  • the controller may obtain emotion information using at least one of a color information, a motion information, and a spatial information of the image received through the image inputter, and generate the MIDI data using the emotion information.
  • the multimedia apparatus may further include an environment sensor configured to sense at least one of a weather, a temperature, a humidity, and an illumination.
  • the controller may generate emotion information using at least one of the weather, the temperature, the humidity, and the illumination, and generate the MIDI data using the emotion information.
  • the multimedia apparatus may further include a display.
  • the controller may generate a score using the determined beat and pitch, and control the display to display the generated score.
  • the controller may modify the MIDI data according to a user command which is input onto the displayed score.
  • the controller may generate a previous measure MIDI data and a subsequent measure MIDI data of the generated MIDI data using the generated MIDI data, and generate a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data.
  • the user interaction may be one of humming by the user, a touch made by the user, and a motion made by the user.
  • the multimedia apparatus may further include an audio outputter.
  • the controller may control the audio outputter to mix and output the MIDI data and the humming by the user when the user interaction is the humming by the user.
  • a music composing method includes receiving video data, determining a composition parameter by analyzing the received video data, and generating musical instrument digital interface (MIDI) data using the determined composition parameter.
  • MIDI musical instrument digital interface
  • a chord progression may be determined using color information of the received video data
  • a drum pattern may be determined using screen motion information of the received video data
  • a beats per minute (BPM) may be determined using object motion information of the received video data
  • a parameter of an area of a sound image may be determined using spatial information of the received video data.
  • the method may further include executing the generated MIDI data together with the video data.
  • a song correcting method includes receiving a song sung by a user, analyzing the song and obtaining a score that matches the song, synchronizing the song and the score, and correcting the received song based on the synchronized score.
  • a pitch and a beat of the song may be analyzed, and the score that matches the song may be obtained based on the analyzed pitch and beat.
  • a virtual score may be generated based on the analyzed pitch and beat, and a score which is most similar to the virtual score among scores stored in a database may be acquired as the score that matches the song.
  • the method may further include searching for a sound source which corresponds to the song, extracting an accompaniment sound from the sound source, and mixing and outputting the corrected song and the accompaniment sound.
  • FIG. 1 is a block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment
  • FIG. 2 is a detailed block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment
  • FIG. 3 illustrates diverse modules to compose music according to an exemplary embodiment
  • FIG. 4 illustrates a user interface to set a type of MIDI data according to an exemplary embodiment
  • FIG. 5 illustrates a score generated using user interaction according to an exemplary embodiment
  • FIG. 6 is a flowchart of a method for composing music using user interaction according to an exemplary embodiment
  • FIG. 7 illustrates a plurality of modules to compose music using video data according to an exemplary embodiment
  • FIG. 8 is a flowchart of a method for composing music using video data according to another exemplary embodiment
  • FIG. 9 illustrates a plurality of modules to correct a song according to yet another exemplary embodiment.
  • FIG. 10 is a flowchart of a method for correcting a song according to yet another exemplary embodiment.
  • FIG. 1 is a block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment.
  • the multimedia apparatus 100 may include an inputter 110, a sensor 120, and a controller 130.
  • the inputter 110 receives a user command to control the overall operation of the multimedia apparatus 100.
  • the inputter 110 may receive a user command to set a type of musical instrument digital interface (MIDI) data that the user wishes to compose.
  • the type of the MIDI data may include at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
  • the sensor 120 senses a user interaction in order to compose music.
  • the sensor 120 may include at least one of a microphone to sense if the user is humming, a motion sensor to sense a motion by the user, and a touch sensor to sense a touch made by the user.
  • the controller 130 controls the multimedia apparatus 100 according to a user command input through the inputter 110.
  • the controller 130 determines a beat and a pitch by analyzing sensed user interaction, and generates MIDI data using a set type of MIDI data and the determined beat and pitch.
  • the controller 130 determines a type of MIDI data set through the inputter 110. More specifically, the controller 130 may determine at least one of a genre, an S Type, a BPM, and a complexity of MIDI data set through the inputter 110.
  • the controller 130 determines a beat and a pitch using one of the user’s humming, the user's motion, and the user's touch sensed by the sensor 120. For example, when a user hums and the humming is input through a microphone, the controller 130 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and may determine a pitch of the user’s humming using correntropy pitch detection. When the user inputs a motion through a motion sensor, the controller 130 may determine the beat using a speed of the user’s motion, and determine a pitch using the distance of the motion. When the user’s touch is input through a touch sensor, the controller 130 may determine the beat by calculating the time at which the user touches the touch sensor, and determine a pitch by calculating an amount of pressure of a user's touch.
  • HCR harmonic cepstrum regularity
  • controller 130 generates MIDI data using a type of MIDI data input through the inputter 110 and determines a beat and a pitch.
  • the controller 130 may acquire emotion information using at least one of color information, motion information, and spatial information of an image input through an image inputter (not shown), and generate MIDI data using the emotion information.
  • the emotion information is information regarding the mood of the music that the user wishes to compose, including information to determine chord progression, drum pattern, beats per minute (BPM), and spatial impression information. More specifically, the controller 130 may determine chord progression of MIDI data using color information of the input image, determine drum pattern or BPM of MIDI data using motion information of the input image, and acquire spatial impression of MIDI data using spatial information extracted from an input audio signal.
  • the controller 130 may generate emotion information using at least one of weather information, temperature information, humidity information, and illumination information sensed by an environment sensor (not shown) of the multimedia apparatus 100, and generate MIDI data using the emotion information.
  • controller 130 may generate a score using a determined beat and pitch, and display the generated score.
  • the controller 130 may correct MIDI data according to a user command which is input onto the displayed score.
  • the controller 130 may generate a previous measure of MIDI data and a subsequent measure of MIDI data using generated MIDI data, and generate a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data. More specifically, when four measures having a C-B-A-G chord composition are currently generated, measures may be extended using harmonic characteristics that a next measure is likely to have such as a chord including F-E-D-C or F-E-D-E. A chord progression of C-B-A-G is likely to appear in front of F-E-D-C.
  • the controller 130 may mix and output MIDI data and the user’s humming.
  • the controller 130 may synchronize and output the MIDI data and the video data.
  • FIG. 2 is a detailed block diagram of a configuration of a multimedia apparatus 200 according to an exemplary embodiment.
  • the multimedia apparatus 200 may include an inputter 210, an image inputter 220, an environment sensor 230, a display 240, an audio outputter 250, a sensor 260, a storage 270, a communicator 280, and a controller 290.
  • the multimedia apparatus 200 as shown in FIG. 2 is a multimedia apparatus which performs diverse functions such as a music composing function, a song correcting function, and the like. Accordingly, when other functions are added or functions change, components may be added or changed.
  • the inputter 210 receives a user command to control the multimedia apparatus 200.
  • the inputter 210 receives a user command to set a type of MIDI data.
  • the inputter 210 may receive a user command to set a type of MIDI data such as a genre, a style, a BPM, and a complexity of music that the user wishes to compose.
  • the user may select a genre of music such as rock, ballad, rap, and jazz through the inputter 210.
  • the user may select a style such as gloomy, pleasant, heavy, and dreamy through the inputter 210.
  • the user may adjust a complexity by reducing or increasing the number of instruments or tracks through the inputter 210.
  • the user may adjust the BPM, which is the number of quarter notes per minute, through the inputter 210.
  • the user may adjust the tempo, which is the rate of quarter notes, half notes, and whole notes, through the inputter 210.
  • the image inputter 220 receives image data externally. More specifically, the image inputter 220 may receive broadcast image data from an external broadcasting station, receive streaming image data from an external server, or receive image data from an external device (for example, a DVD player, etc). In addition, the image inputter 220 may receive personal content, such as home video, personally recorded by the user. In particular, when the image inputter 220 is implemented in devices such as a smart phone, the image inputter 220 may receive image data from a video library of the user stored in, for example, the smart phone or stored externally.
  • the environment sensor 230 senses an external environment. More specifically, the environment sensor 230 may acquire weather information externally, acquire temperature information of an area at which the multimedia apparatus 200 is located by using a temperature sensor, acquire humidity information of an area at which the multimedia apparatus 200 is located using a humidity sensor, or acquire illumination information of an area at which the multimedia apparatus 200 is located by using an illumination sensor. In addition, the environment sensor 230 may acquire weather and time information by linking the multimedia apparatus 200 with an internet service using the location information of the user.
  • the display 240 may be controlled by the controller 290 to display diverse types of image data.
  • the display 240 may display image data input through the image inputter 220.
  • the display 240 may display diverse types of user interfaces (UIs) to control the multimedia apparatus 200.
  • UIs user interfaces
  • the display 240 may display a UI to set a type of MIDI data as shown in FIG. 4.
  • the display 240 may display a score having a pitch and a beat which is determined according to a user interaction.
  • the display 240 may display a score as shown in FIG. 5.
  • the audio outputter 250 may output audio data.
  • the audio outputter 250 may output not only externally input audio data but also MIDI data generated by user interaction.
  • the sensor 260 senses a user interaction.
  • the sensor 260 may sense user interaction to compose music. More specifically, the sensor 260 may sense various and diverse types of user interactions to determine a beat and a pitch of music that the user wishes to compose.
  • the sensor 260 may sense whether the user is humming by using a microphone, sense whether the user is making a motion by using a motion sensor, or sense whether the user is touching the apparatus by using a touch sensor. Therefore, the sensor 260 can include, for example, a microphone, a motion sensor or a touch sensor.
  • the storage 270 stores diverse modules to drive the multimedia apparatus 200.
  • the storage 270 may include software including a base module, a sensing module, a communication module, a presentation module, a web browser module, and a service module (not shown).
  • the base module is a module that processes a signal transmitted from hardware included in the multimedia apparatus 200 and transmits the signal to an upper layer module.
  • the sensing module is a module that collects information from diverse sensors and analyzes and manages the collected information, including a face recognition module, a voice recognition module, a motion recognition module, a near field communication (NFC) recognition module, and so on.
  • NFC near field communication
  • the presentation module is a module that composes a display screen, including a multimedia module to play back and output multimedia content and a user interface (UI) rendering module to process UIs and graphics.
  • the communication module is a module that communicates with external devices.
  • the web browser module is a module that performs web browsing and accesses a web server.
  • the service module is a module including diverse applications to provide diverse services.
  • the storage 270 may store diverse modules to compose music according to a user interaction. This is described with reference to FIG. 3.
  • the modules to compose music according to user interaction may include a MIDI data type setting module 271, an interaction input module 272, an analysis module 273, a video input module 274, an emotion analysis module 275, a composed piece generation module 276, and a mixing module 277.
  • the MIDI data type setting module 271 may set a type of the MIDI data according to a user command which is input through the inputter 210. More specifically, the MIDI data type setting module 271 may set diverse types of MIDI data such as genre, BPM, style, and complexity of the MIDI data.
  • the interaction input module 272 receives a user interaction sensed by the sensor 260. More specifically, the interaction input module 272 may receive a user interaction including at least one of the user’s humming, a user's motion, and a user's touch.
  • the analysis module 273 may analyze the user interaction input through the interaction input module 272, and thus determine a pitch and a beat. For example, when a user hums and the humming is input through a microphone, the analysis module 273 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and determine a pitch of the user’s humming using correntropy pitch detection. When the user’s motion is input through a motion sensor, the analysis module 273 may determine a beat using a speed of the user’s motion, and determine a pitch using the distance of the motion. When the user’s touch is input through a touch sensor, the analysis module 273 may determine the beat by calculating a time at which the user touches the touch sensor, and determine the pitch by calculating an amount of pressure touched by the user on the touch sensor.
  • HCR harmonic cepstrum regularity
  • the video input module 274 receives video data input through the image inputter 220, and outputs the video data to the emotion analysis module 275.
  • the emotion analysis module 275 may analyze the input video data and thus determine emotion information of MIDI data.
  • the emotion information of the MIDI data is information regarding the mood of the music that the user wishes to compose, including information such as chord progression, drum pattern, BPM, and spatial impression information. More specifically, the emotion analysis module 275 may determine a chord progression of the MIDI data using color information of an input image. For example, when brightness or chroma of an image is high, the emotion analysis module 275 may determine a bright major chord progression, that is, a chord progression which gives a sense of brightness, and when brightness or chroma is low, the emotion analysis module 275 may determine a dark minor chord progression, that is a chord progression which gives a sense of darkness.
  • the emotion analysis module 275 may determine a drum pattern or BPM of the MIDI data using motion information of an input image. For example, the emotion analysis module 275 may presume a certain BPM from a degree of motion of the entire clip, and then increase the complexity of a drum pattern at a portion having a lot of motion. The emotion analysis module 275 may acquire spatial impression information of the MIDI data using spatial information of the input video so that the acquired spatial impression may be used to form a spatial impression when multichannel audio is generated.
  • the composed piece generation module 276 generates MIDI data which is a composed piece, based on a type of the MIDI data set by the MIDI data type setting module 271, a pitch and a beat determined by the analysis module 273, and emotion information determined by the emotion analysis module 275.
  • the composed piece generation module 276 may also generate a score image corresponding to the generated MIDI data.
  • the composed piece generation module 276 may generate a previous measure of MIDI data and a subsequent measure of MIDI data using the MIDI data generated according to the user’s settings. More specifically, the composed piece generation module 276 may generate a previous measure of MIDI data and a subsequent measure of MIDI data of MIDI data generated based on a general composition pattern set by the user, a type of MIDI data set by the user, a chord progression determined by the emotion analysis module 275, etc.
  • the mixing module 277 mixes an input MIDI data with the user’s humming or video data.
  • an environment information input module may be added to receive surrounding environment information sensed by the environment sensor 230.
  • the communicator 280 communicates with various types of external devices according to various types of communication methods.
  • the communicator 280 may include various communication chips such as a wireless fidelity (Wi-Fi) chip, a Bluetooth chip, a near field communication (NFC) chip, and a wireless communication chip.
  • the Wi-Fi chip, the Bluetooth chip, and the NFC chip perform communication according to a Wi-Fi method, a Bluetooth method, and an NFC method, respectively.
  • the NFC chip is a chip that operates according to the NFC method which uses a 13.56MHz band among diverse radio frequency identification (RFID) frequency bands such as 135kHz, 13.56MHz, 433MHz, 860-960MHz, and 2.45GHz.
  • RFID radio frequency identification
  • connection information such as a subsystem identification (SSID) and a session key are transmitted and received first, and then after communication is established, diverse information can be transmitted and received.
  • the wireless communication chip is a chip that performs communication according to diverse communication standards such as IEEE, Zigbee, 3rd generation (3G), 3rd generation partnership project (3GPP), and long term evolution (LTE).
  • the controller 290 may include a random-access memory (RAM) 291, a read-only memory (ROM) 292, a graphic processor 293, a main central processing unit (CPU) 294, first to Nth interfaces 295-1 to 295-N, and a bus 296 as shown in FIG. 2.
  • the RAM 291, the ROM 292, the graphic processor 293, the main CPU 294, and the first to Nth interfaces 295-1 to 295-N may be connected to one another via the bus 296.
  • the ROM 292 stores a set of commands to boot up the system.
  • the main CPU 294 copies an operating system (OS) stored in the storage 270 to the RAM 291 and executes the OS according to the commands stored in the ROM 292 so that the system can boot up.
  • OS operating system
  • the main CPU 294 copies diverse application programs stored in the storage 270 to the RAM 291, and runs the copied application programs so that various operations can be performed.
  • the graphic processor 293 generates images to be displayed on a screen on a display area of the display 240 including diverse objects such as an icon, an image, and text, using an operator (not shown) and a renderer (not shown).
  • the operator operates property values of each object, such as a coordinate value, a shape, a size and a color, according to the layout of the screen by using a control command received from the inputter 210.
  • the renderer generates an image on the screen having a diverse layout including objects based on the property values operated by the operator.
  • the screen generated by the renderer is displayed on a display area of the display 240.
  • the main CPU 294 accesses the storage 270 and boots up the system using the OS stored in the storage 270. In addition, the main CPU 294 performs various operations using different types of programs, contents, and data stored in the storage 270.
  • the first to Nth interfaces 295-1 to 295-N are connected to the aforementioned components.
  • One of the interfaces may be a network interface that is connected to an external device through a network.
  • the controller 290 may determine a beat and a pitch by analyzing a sensed user interaction, and generates MIDI data by using a type of MIDI data, which is set according to a user command input through the inputter 110, and by using the determined beat and pitch.
  • the controller 290 may control the display 240 to display a UI 400 to set a type of MIDI data, as shown in FIG. 4.
  • the controller 290 may set various types of MIDI data such as genre, style, complexity, BPM, and tempo according to a user command input through the UI 400, as shown in FIG. 4.
  • the controller 290 may analyze the user interaction and determine a pitch and a beat corresponding to the user interaction.
  • the controller 290 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and determine a pitch of the user’s humming using correntropy pitch detection.
  • the harmonic structure changes sharply at the point at which the humming first starts.
  • the controller 290 may determine a beat by determining a point on which onset of the humming occurs using the HCR method.
  • the controller 290 may determine a pitch using a signal between onsets of the humming according to correntropy pitch detection.
  • a pitch and beat can be determined according to a motion made by the user.
  • the controller 290 may determine a beat using the speed of the user’s motion, and determine a pitch using the distance of the motion. That is, as the user’s motion is faster, the controller 290 may determine that the beat is faster, and as the user’s motion becomes slower, the controller 290 may determine that beat is slower.
  • the controller 290 may determine that the pitch is lower, and as the distance of the motion of the user sensed by the motion sensor is longer, the controller 290 may determine that the pitch is higher.
  • a pitch and beat can be determined if a user touches the touch screen or touch panel, such as the display 240, of the multimedia apparatus 200.
  • the analysis module 273 may determine a beat by calculating a time at which the user touches the touch sensor, and determine a pitch by calculating a position on a touch screen touched by the user. That is, if the user touches the touch screen for a longer period of time, the controller 290 may determine that the beat is slower, and if the user touches the screen for a short period of time, the controller 290 may determine that the beat is faster.
  • the controller 290 may determine the pitch according to an area of the touch screen touched by the user.
  • the controller 290 may determine emotion information based on video data which is input or based on sensed surrounding environment information.
  • the emotion information of the MIDI data indicates information regarding the mood of music that the user wishes to compose, including information such as chord progression, a drum pattern, BPM, and spatial impression information.
  • the controller 290 may acquire emotion information using at least one of color information, motion information, and spatial information of an image input through an image inputter 220. For example, the controller 290 may determine the chord progression of MIDI data using the color information of an input image. More specifically, when the input image has many bright colors, the controller 290 may determine that the chord of the MIDI data is a major chord, and when the input image has many dark colors, the controller 290 may determine that chord of the MIDI data is a minor chord.
  • the controller 290 may determine a drum pattern or BPM of MIDI data using motion information of an input image. More specifically, when the input image has a lot of motion, the controller 290 may increase the BPM, and when the input image has a little bit of motion, the controller 290 may decrease the BPM.
  • the controller 290 may acquire spatial impression information of MIDI data using the spatial information of the input video. More specifically, the controller 290 may extract an area parameter of a sound image of a composed piece using spatial information of the input video.
  • the controller 290 may acquire emotion information based on the surrounding environment information sensed by the environment sensor 230. For example, when the weather is sunny, when the temperature is warm, or when illumination is bright, the controller 290 may determine that the chord of the MIDI data is a major chord. When the weather is dark, when the temperature is cold, or when illumination is dark, the controller 290 may determine that the chord of the MIDI data is a minor chord.
  • the controller 290 may determine a type of MIDI data using surrounding environment information or video data. For example, when the weather is sunny, the controller 290 may set a genre of the MIDI data to be dance.
  • the controller 290 may generate a score using the determined beat and pitch, and may control the display 240 to display the generated score. More specifically, the controller 290 may generate a score using a beat and a pitch determined according to a user interaction as shown in FIG. 5. With reference to FIG. 5, the score may include different icons such as icon 510, icon 520, and icon 530 to generate a music file as well as the score determined according to user interaction.
  • the diverse icons may include a first icon 510 to generate a previous measure of MIDI data in front of a currently generated MIDI data, a second icon 520 to generate a rear measure of MIDI data behind the currently generated MIDI data, and a third icon 530 to repeat the currently generated MIDI data, as shown in FIG. 5.
  • the controller 290 may generate the previous measure of MIDI data or the rear measure of MIDI data using an existing database.
  • the controller 290 may store a composition pattern of the user in the database, and predict and generate a previous measure or a rear measure of a currently generated MIDI data based on the stored composition pattern. For example, when a chord of four measures of a currently generated MIDI data is C-B-A-G, the controller 290 may set a chord of a subsequent measure to be C-D-G-C or F-E-D-C based on the database. In addition, when a chord of four measures of a currently generated MIDI data is C-D-G-C, the controller 290 may set a chord of a previous measure to be C-B-A-G based on the database.
  • the controller 290 may modify the MIDI data according to a user command which is input on a displayed score.
  • the controller 290 may modify the MIDI data using the user’s touch input to a score as shown in FIG. 5.
  • the controller 290 may modify the pitch of the touched note, and when a user command is input in which the user touches the note for more than a predetermined period of time, the controller 290 may modify the beat.
  • the controller 290 may modify diverse composition parameters using other user commands.
  • the controller 290 may control the audio outputter 250 to mix and output MIDI data and the user’s humming.
  • the controller 290 may control the audio outputter 250 and the display 240 to mix and output the input video data and the MIDI data.
  • FIG. 6 is a flowchart of a method for composing music using a user interaction according to an exemplary embodiment.
  • the multimedia apparatus 200 sets a type of the MIDI data according to the user’s input (S610).
  • the type of MIDI data may include at least one of a genre, a style, a BPM, and a complexity of the MIDI data.
  • the multimedia apparatus 200 senses a user interaction with the multimedia apparatus 200 (S620).
  • the user interaction may include at least one of the user humming into the microphone of the multimedia apparatus, touching a touch screen, and making a motion which is sensed by the multimedia apparatus.
  • the multimedia apparatus 200 analyzes the user interaction and determines a beat and a pitch (S630). More specifically, when the user’s humming is input through a microphone, the multimedia apparatus 200 may determine a beat of the user’s humming using the HCR method, and determines a pitch of the user’s humming using correntropy pitch detection. When the user’s motion is input through a motion sensor, the multimedia apparatus 200 may determine a beat using a speed of the user’s motion, and determine a pitch using the distance of the motion.
  • the multimedia apparatus 200 may determine a beat by calculating a time at which the user touches the multimedia apparatus 200, and determine a pitch by calculating an amount of pressure placed by the user on, for example, the touch sensor of the multimedia apparatus 200.
  • the multimedia apparatus 200 generates MIDI data based on the set type of the MIDI data and the determined pitch and beat (S640). At this time, the multimedia apparatus 200 may display a score of the generated MIDI data, and mix and output the generated MIDI data with the user’s humming or video data.
  • the user may easily and conveniently generate the MIDI data of music that the user wishes to compose.
  • the user’s humming is sensed using a microphone, but this is merely an exemplary embodiment. Instead, audio data in which the user’s humming is recorded may be input.
  • FIG. 7 illustrates a plurality of modules to compose music using video data according to an exemplary embodiment.
  • the storage 270 may include a video input module 710, a video information analysis module 720, a parameter determination module 730, an accompaniment generation module 740, and a mixing module 750.
  • the video input module 710 receives video data through the image inputter 220.
  • the video information analysis module 720 analyzes information regarding the input video data. More specifically, the video information analysis module 720 may analyze color information of the entire image, screen motion information according to a position of a camera, object motion information in the video, and spatial information extracted from an audio input signal.
  • the parameter determination module 730 determines a composition parameter based on the analyzed video information. More specifically, the parameter determination module 730 may determine a chord progression using the analyzed color information. For example, when analyzed color information is a bright or warm color, the parameter determination module 730 may determine that the chord progression is a major chord progression, and when the analyzed color information is a dark or cool color, the parameter determination module 730 may determine that the chord progression is a minor chord progression.
  • the parameter determination module 730 may determine a drum pattern using screen motion information. For example, when a screen motion or motion on a screen is fast, the parameter determination module 730 may determine that the drum pattern is fast, and when the motion on the screen is fixed, the parameter determination module 730 may determine that the drum pattern is slow. In addition, the parameter determination module 730 may determine BPM using the object motion information. For example, when the object motion is slow, the parameter determination module 730 may determine that the BPM is low, and when the object motion is fast, the parameter determination module 730 may determine that the BPM is high.
  • the parameter determination module 730 may adjust an area of a sound image using spatial information. For example, when a space of an audio signal is large, the parameter determination module 730 may determine that an area of a sound image is large, and when a space of an audio signal is small, the parameter determination module 730 may determine that an area of a sound image is small.
  • the accompaniment generation module 740 generates MIDI data using the composition parameter determined by the parameter determination module 730. More specifically, the accompaniment generation module 740 generates MIDI tracks of melody instruments (for example, piano, guitar, keyboard, etc), percussion instruments (for example, drum, etc), and bass rhythm instruments (for example, bass, etc) using a composition parameter determined by the parameter determination module 730. Subsequently, the accompaniment generation module 740 may generate complete MIDI data using the generated MIDI tracks of the melody instruments, percussion instruments, and bass rhythm instruments.
  • melody instruments for example, piano, guitar, keyboard, etc
  • percussion instruments for example, drum, etc
  • bass rhythm instruments for example, bass, etc
  • the mixing module 750 may mix the generated MIDI data with video data.
  • the mixing module 750 may locate a sound image to correspond to spatial information of an audio signal included in the video data, and generate space sense according to spatial information of an audio signal included in the video data using a decorrelator.
  • the controller 290 may compose music according to input video data using the modules 710 to 750 as shown in FIG. 7. More specifically, when video is input through the image inputter 220, the controller 290 may analyze the input video data, determine a composition parameter, and generate MIDI data using the determined composition parameter.
  • the composition parameter is a parameter to compose music, such as a chord progression, a drum pattern, BPM, and an area parameter.
  • the controller 290 may determine a chord progression using color information of the input video data.
  • the controller 290 may determine that the chord progression of MIDI data is a major chord progression, and when the color of the entire image of the input video is dark, the controller 290 may determine that the chord progression of the MIDI data is a minor chord progression.
  • the controller 290 may determine a drum pattern using screen motion information of the input video data. More specifically, when the motion on the screen of the input image is fast, the controller 290 may determine that the drum pattern is fast, and when the motion on the screen of the input image is fixed, the controller 290 may determine that the drum pattern is slow.
  • the controller 290 may determine BPM using object motion information of the input video data. More specifically, when the motion of a particular object in the input image is slow, the controller 290 may determine that BPM is low, and when the motion of a particular object in the input image is fast, the controller 290 may determine that BPM is high.
  • the controller 290 may adjust an area of a sound image using spatial information of an audio signal included in the input video data. More specifically, when a space of an audio signal is large, the controller 290 may determine that an area of a sound image is large, and when a space of an audio signal is small, the controller 290 may determine that an area of a sound image is small.
  • the controller 290 may generate MIDI data using a determined parameter. More specifically, the controller 290 generates a MIDI track of melody instruments (for example, piano, guitar, keyboard, etc) using a template based on a determined chord progression and genre set by the user, generates a MIDI track of percussion instruments (for example, a drum, etc) using a drum pattern, and generates a MIDI track of bass rhythm instruments (for example, bass, etc) using a chord progression, a genre, and a drum pattern. Subsequently, the controller 290 may generate complete MIDI data using the generated MIDI tracks of the melody instruments, percussion instruments, and bass rhythm instruments.
  • melody instruments for example, piano, guitar, keyboard, etc
  • a template based on a determined chord progression and genre set by the user
  • a MIDI track of percussion instruments for example, a drum, etc
  • bass rhythm instruments for example, bass, etc
  • the controller 290 may run the generated MIDI data together with the video data. In other words, the controller 290 may mix and output the generated MIDI data with the video data. At this time, the controller 290 may synchronize the MIDI data and audio signals included in the video data.
  • FIG. 8 is a flowchart of a method for composing music using video data according to another exemplary embodiment.
  • the multimedia apparatus 200 receive video data (S810).
  • the multimedia apparatus 200 may receive video data from an external device, or may receive pre-stored video data.
  • the multimedia apparatus 200 analyzes the input video data and determines a composition parameter (S820).
  • the composition parameter is a parameter to compose music, such as a chord progression, a drum pattern, BPM, and an area parameter. More specifically, the multimedia apparatus 200 may determine a chord progression using the analyzed color information. In addition, the multimedia apparatus 200 may determine a drum pattern using screen motion information of the video data. In addition, the multimedia apparatus 200 may determine BPM using object motion information of the video data. Also, the multimedia apparatus 200 may adjust an area of a sound image using spatial information.
  • the multimedia apparatus 200 generates MIDI data using the composition parameter (S830). More specifically, the multimedia apparatus 200 may generate MIDI tracks of melody instruments, percussion instruments, and bass rhythm instruments using the composition parameter, and generate MIDI data by mixing the generated MIDI tracks. In addition, the multimedia apparatus 200 may run the generated MIDI data together with the video data.
  • MIDI data is generated using video data so that the user may compose music suitable with the mood of the video data.
  • music is composed using a pitch and a beat detected based on, for example, the user’s humming, but this is merely an exemplary embodiment.
  • the pitch and beat can be detected based on a song sung by the user and the song is obtained based on the detected pitch and beat and the song sung by the user is corrected based on the obtained song.
  • FIG. 9 illustrates a plurality of modules to correct a song according to yet another exemplary embodiment.
  • the storage 270 of the multimedia apparatus 200 may include a song input module 910, a song analysis module 920, a virtual score generation module 930, a score acquisition module 940, a song and score synchronization module 950, a song correction module 960, a sound source acquisition module 970, an accompaniment separation module 980, and a mixing module 990 in order to correct a song sung by the user.
  • the song input module 910 receives a song sung by the user. At this time, the song input module 910 may receive a song input through a microphone, or a song included in audio data.
  • the song analysis module 920 analyzes a beat and a pitch of the song sung by the user. More specifically, the song analysis module 920 determines a beat of the song using an HCR method, and determines a pitch of the song using correntropy pitch detection.
  • the virtual score generation module 930 generates a virtual score based on the pitch and beat analyzed by the song analysis module 920.
  • the score acquisition module 940 acquires a score of the song sung by the user using the virtual score generation module 930.
  • the score acquisition module 940 may acquire the score by comparing a score stored in the database with the virtual score.
  • the score acquisition module 940 may acquire the score by taking a photograph of a printed score using a camera and analyzing the captured image.
  • the score acquisition module 940 may acquire the score using musical notes input by the user on manuscript paper which is displayed on the display 240.
  • the score acquisition module 940 may acquire a score by comparing the song sung by the user with a vocal track extracted from a pre-stored sound source.
  • the score acquisition module 940 may acquire a score by stochastically presuming an onset and offset pattern and dispersion of pitch based on frequency characteristics of the song which was input.
  • the score acquisition module 940 may presume a beat and a pitch from the input song using the HCR method and correntropy pitch detection, extract stochastically the most suitable BPM and chord from dispersion of the presumed beat and pitch, and thus generate a score.
  • the song and score synchronization module 950 synchronizes the song sung by the user and the score acquired by the score acquisition module 940. At this time, the song and score synchronization module 950 may synchronize the song which was sung and the score using a dynamic time warping (DTW) method.
  • the DTW method is an algorithm that finds an optimum warping path by comparing the similarity between two sequences.
  • the song correction module 960 corrects a wrong portion, for example, an off-key portion, an off-beat portion, etc, of the song sung by the user by comparing the song and the score. More specifically, the song correction module 960 may correct the song to correspond to the score by applying time stretching and a frequency shift.
  • the sound source acquisition module 970 acquires a sound source of the song sung by the user. At this time, the sound source acquisition module 970 may acquire a sound source using a score acquired by the score acquisition module 940.
  • the accompaniment separation module 980 separates a vocal track and an accompaniment track from the acquired sound source, and outputs the accompaniment track to the mixing module 990.
  • the mixing module 990 mixes and outputs the accompaniment track separated by the accompaniment separation module 980 with the song corrected by the song correction module 960.
  • the controller 290 corrects a song sung by the user using the exemplary modules as shown in FIG. 9.
  • the controller 290 analyzes the song and acquires a score that matches the song.
  • the controller 290 determines a beat of the song using an HCR method, and determines pitch of the song using correntropy pitch detection.
  • the controller 290 may generate a virtual score based on the determined beat and pitch, and acquire a score which is the most similar to the virtual score among the scores stored in the database, as a score corresponding to the song.
  • the controller 290 may acquire a score by the user’s input, acquire a score using a photographed score image, acquire a score from a vocal track separated from a pre-stored sound source, or use the virtual score as a score corresponding to the song.
  • the controller 290 synchronizes the score and the song sung by the user. At this time, the controller 290 may synchronize the score and the song using a DTW method.
  • the controller 290 corrects the song based on the synchronized score. More specifically, the controller 290 may correct a pitch and a beat of the song by applying time stretching and a frequency shift so that the song is synchronized with the score.
  • controller 290 controls the audio outputter 250 to output the corrected song.
  • the controller 290 searches for a sound source which matches the song sung by the user.
  • the controller 290 may search for the sound source using a score or according to the user’s input.
  • the controller 290 receives the sound source.
  • the found sound source may be pre-stored or may be externally downloaded through the communicator 280.
  • the controller 290 extracts an accompaniment sound from the sound source.
  • the controller 290 may control the audio outputter 250 to mix and output the corrected song and the accompaniment sound.
  • FIG. 10 is a flowchart of a method for correcting a song according to another exemplary embodiment.
  • the multimedia apparatus 200 receives a song sung by the user (S1010).
  • the multimedia apparatus 200 may receive the song through a microphone or through externally transmitted audio data.
  • the multimedia apparatus 200 analyzes the song (S1020). More specifically, the multimedia apparatus 200 may analyze a pitch and a beat of the song.
  • the multimedia apparatus 200 acquires a score which matches the song (S1030). More specifically, the multimedia apparatus 200 may acquire a virtual score using the analyzed pitch and beat, compare the virtual score with the scores stored in the database, and determine that a score which is the most similar to the virtual score is the score which matches the song.
  • the multimedia apparatus 200 then synchronizes the song and the acquired score (S1040). More specifically, the multimedia apparatus 200 may synchronize the song and the acquired score in a DTW method.
  • the multimedia apparatus 200 corrects the song based on the acquired score (S1050). More specifically, the multimedia apparatus 200 may correct a pitch and a beat of the song to correspond to the score by applying time stretching and a frequency shift.
  • the music composing method or the song correcting method according to the aforementioned exemplary embodiments may be implemented with a program, and may be provided to a display apparatus.
  • Programs including the music composing method or the song correcting method may be stored in a non-transitory computer readable medium.
  • the non-transitory computer readable medium is a medium which does not store data temporarily such as a register, cache, and memory but stores data semi-permanently and is readable by devices. More specifically, the aforementioned applications or programs may be stored in the non-transitory computer readable medium such as compact disks (CDs), digital video disks (DVDs), hard disks, Blu-ray disks, universal serial buses (USBs), memory cards, and read-only memory (ROM).
  • CDs compact disks
  • DVDs digital video disks
  • hard disks hard disks
  • Blu-ray disks Blu-ray disks
  • USBs universal serial buses
  • memory cards and read-only memory (ROM).

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Electrophonic Musical Instruments (AREA)
  • Auxiliary Devices For Music (AREA)

Abstract

A multimedia apparatus, a music composing method thereof, and a song correcting method thereof are provided. A music composing method includes setting a type of musical instrument digital interface (MIDI) data according to a user's input, sensing a user interaction, analyzing the sensed user interaction and determining a beat and a pitch of the user interaction, and generating MIDI data using the set type of MIDI data and the determined beat and pitch.

Description

MULTIMEDIA APPARATUS, MUSIC COMPOSING METHOD THEREOF, AND SONG CORRECTING METHOD THEREOF
Apparatuses and methods consistent with the exemplary embodiments relate to a multimedia apparatus, a music composing method thereof, and a song correcting method thereof, and more particularly, to a multimedia apparatus capable of composing music according to a user interaction and correcting a song sung by a user, a music composing method thereof, and a song correcting method thereof.
Recently, the music content production market of multimedia apparatuses, especially smart phones, has been rapidly growing.
Music content production methods use interfaces such as a musical instrument digital interface (MIDI). Such an interface can be difficult to use if one is not an expert. In order to produce music using the MIDI interface, users need to have both musical knowledge and knowledge about the MIDI interface.
In addition, in the related art, a song can only be composed by using the user’s voice. That is, there are limits to composing a song using other interactions and only the user's voice can be used.
Accordingly, there is a need for an easier and more convenient method for composing music using a diverse types of user interactions.
Exemplary embodiments address the above disadvantages and other disadvantages not described above. Also, the exemplary embodiments are not required to overcome the disadvantages described above, and exemplary embodiments may not overcome any of the problems described above.
An exemplary embodiment provides a multimedia apparatus capable of composing music using diverse types of user interactions and video data, and a music composing method thereof.
An exemplary embodiment also provides a multimedia apparatus capable of searching for a song sung by the user and correcting the song sung by the user, and a song correcting method thereof.
According to an aspect of an exemplary embodiment, a music composing method includes setting a type of musical instrument digital interface (MIDI) data according to a user’s input, sensing a user interaction, analyzing the sensed user interaction and determining a beat and a pitch, and generating MIDI data using the set type of MIDI data and the determined beat and pitch.
In the setting the type of MIDI data including setting at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
The method may further include receiving an image, and obtaining emotion information using at least one of color information, motion information, and spatial information of the received image. In the generating the MIDI data, the MIDI data may be generated using the emotion information.
The method may further include sensing at least one of a weather, a temperature, a humidity, and an illumination, and generating emotion information using the sensed at least one of the weather, the temperature, the humidity, and the illumination. In the generating the MIDI data, the MIDI data may be generated using the emotion information.
The method may further include generating a score using the determined beat and pitch, and displaying the generated score.
The method may further include modifying the MIDI data using the displayed generated score.
The method may further include generating a previous measure of MIDI data and a subsequent measure of MIDI data of the generated MIDI data using the generated MIDI data, and generating a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data.
The user interaction may be one of humming by the user, a touch made by the user, and a motion made by the user.
The method may further include mixing and outputting the MIDI data and the humming by the user when the user interaction is the humming by the user.
According to another aspect, a multimedia apparatus includes an inputter configured to receive a user command to set a type of musical instrument digital interface (MIDI) data, a sensor configured to sense a user interaction, and a controller configured to analyze the sensed user interaction and determine a beat and a pitch, and to generate MIDI data using the set type of MIDI data and the determined beat and pitch.
The inputter may receive a user command to set at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
The multimedia apparatus may further include an image inputter configured to receive an image. The controller may obtain emotion information using at least one of a color information, a motion information, and a spatial information of the image received through the image inputter, and generate the MIDI data using the emotion information.
The multimedia apparatus may further include an environment sensor configured to sense at least one of a weather, a temperature, a humidity, and an illumination. The controller may generate emotion information using at least one of the weather, the temperature, the humidity, and the illumination, and generate the MIDI data using the emotion information.
The multimedia apparatus may further include a display. The controller may generate a score using the determined beat and pitch, and control the display to display the generated score.
The controller may modify the MIDI data according to a user command which is input onto the displayed score.
The controller may generate a previous measure MIDI data and a subsequent measure MIDI data of the generated MIDI data using the generated MIDI data, and generate a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data.
The user interaction may be one of humming by the user, a touch made by the user, and a motion made by the user.
The multimedia apparatus may further include an audio outputter. The controller may control the audio outputter to mix and output the MIDI data and the humming by the user when the user interaction is the humming by the user.
According to another aspect, a music composing method includes receiving video data, determining a composition parameter by analyzing the received video data, and generating musical instrument digital interface (MIDI) data using the determined composition parameter.
In the determining the composition parameter, a chord progression may be determined using color information of the received video data, a drum pattern may be determined using screen motion information of the received video data, a beats per minute (BPM) may be determined using object motion information of the received video data, or a parameter of an area of a sound image may be determined using spatial information of the received video data.
The method may further include executing the generated MIDI data together with the video data.
According to another aspect, a song correcting method includes receiving a song sung by a user, analyzing the song and obtaining a score that matches the song, synchronizing the song and the score, and correcting the received song based on the synchronized score.
In the obtaining the matching score, a pitch and a beat of the song may be analyzed, and the score that matches the song may be obtained based on the analyzed pitch and beat.
A virtual score may be generated based on the analyzed pitch and beat, and a score which is most similar to the virtual score among scores stored in a database may be acquired as the score that matches the song.
The method may further include searching for a sound source which corresponds to the song, extracting an accompaniment sound from the sound source, and mixing and outputting the corrected song and the accompaniment sound.
According to the aforementioned exemplary embodiments, general users who do not have great musical knowledge and who do not sing well may generate music contents or correct their song easily and conveniently.
Additional and/or other aspects and advantages will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice.
The above and/or other aspects will be more apparent by describing certain exemplary embodiments with reference to the accompanying drawings, in which:
FIG. 1 is a block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment;
FIG. 2 is a detailed block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment;
FIG. 3 illustrates diverse modules to compose music according to an exemplary embodiment;
FIG. 4 illustrates a user interface to set a type of MIDI data according to an exemplary embodiment;
FIG. 5 illustrates a score generated using user interaction according to an exemplary embodiment;
FIG. 6 is a flowchart of a method for composing music using user interaction according to an exemplary embodiment;
FIG. 7 illustrates a plurality of modules to compose music using video data according to an exemplary embodiment;
FIG. 8 is a flowchart of a method for composing music using video data according to another exemplary embodiment;
FIG. 9 illustrates a plurality of modules to correct a song according to yet another exemplary embodiment; and
FIG. 10 is a flowchart of a method for correcting a song according to yet another exemplary embodiment.
Certain exemplary embodiments will now be described in greater detail with reference to the accompanying drawings.
In the following description, same drawing reference numerals are used for the same elements even in different drawings. The matters defined in the description, such as detailed construction and elements, are provided to assist in a comprehensive understanding of the exemplary embodiments. Thus, it is apparent that the exemplary embodiments can be carried out without those specifically defined matters. Also, well-known functions or constructions are not described in detail since they would obscure the invention with unnecessary detail.
FIG. 1 is a block diagram of a configuration of a multimedia apparatus according to an exemplary embodiment. With reference to FIG. 1, the multimedia apparatus 100 may include an inputter 110, a sensor 120, and a controller 130.
The inputter 110 receives a user command to control the overall operation of the multimedia apparatus 100. In particular, the inputter 110 may receive a user command to set a type of musical instrument digital interface (MIDI) data that the user wishes to compose. The type of the MIDI data may include at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
The sensor 120 senses a user interaction in order to compose music. The sensor 120 may include at least one of a microphone to sense if the user is humming, a motion sensor to sense a motion by the user, and a touch sensor to sense a touch made by the user.
The controller 130 controls the multimedia apparatus 100 according to a user command input through the inputter 110. In particular, the controller 130 determines a beat and a pitch by analyzing sensed user interaction, and generates MIDI data using a set type of MIDI data and the determined beat and pitch.
The controller 130 determines a type of MIDI data set through the inputter 110. More specifically, the controller 130 may determine at least one of a genre, an S Type, a BPM, and a complexity of MIDI data set through the inputter 110.
In addition, the controller 130 determines a beat and a pitch using one of the user’s humming, the user's motion, and the user's touch sensed by the sensor 120. For example, when a user hums and the humming is input through a microphone, the controller 130 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and may determine a pitch of the user’s humming using correntropy pitch detection. When the user inputs a motion through a motion sensor, the controller 130 may determine the beat using a speed of the user’s motion, and determine a pitch using the distance of the motion. When the user’s touch is input through a touch sensor, the controller 130 may determine the beat by calculating the time at which the user touches the touch sensor, and determine a pitch by calculating an amount of pressure of a user's touch.
In addition, the controller 130 generates MIDI data using a type of MIDI data input through the inputter 110 and determines a beat and a pitch.
In addition, the controller 130 may acquire emotion information using at least one of color information, motion information, and spatial information of an image input through an image inputter (not shown), and generate MIDI data using the emotion information. The emotion information is information regarding the mood of the music that the user wishes to compose, including information to determine chord progression, drum pattern, beats per minute (BPM), and spatial impression information. More specifically, the controller 130 may determine chord progression of MIDI data using color information of the input image, determine drum pattern or BPM of MIDI data using motion information of the input image, and acquire spatial impression of MIDI data using spatial information extracted from an input audio signal.
In another exemplary embodiment, the controller 130 may generate emotion information using at least one of weather information, temperature information, humidity information, and illumination information sensed by an environment sensor (not shown) of the multimedia apparatus 100, and generate MIDI data using the emotion information.
In addition, the controller 130 may generate a score using a determined beat and pitch, and display the generated score. The controller 130 may correct MIDI data according to a user command which is input onto the displayed score.
In addition, the controller 130 may generate a previous measure of MIDI data and a subsequent measure of MIDI data using generated MIDI data, and generate a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data. More specifically, when four measures having a C-B-A-G chord composition are currently generated, measures may be extended using harmonic characteristics that a next measure is likely to have such as a chord including F-E-D-C or F-E-D-E. A chord progression of C-B-A-G is likely to appear in front of F-E-D-C.
When the user interaction is the user humming, the controller 130 may mix and output MIDI data and the user’s humming. In addition, when video data is input, the controller 130 may synchronize and output the MIDI data and the video data.
By using the multimedia apparatus 100, general users who do not have extensive musical knowledge and who may not sing very well may generate music contents easily and conveniently.
FIG. 2 is a detailed block diagram of a configuration of a multimedia apparatus 200 according to an exemplary embodiment. With reference to FIG. 2, the multimedia apparatus 200 may include an inputter 210, an image inputter 220, an environment sensor 230, a display 240, an audio outputter 250, a sensor 260, a storage 270, a communicator 280, and a controller 290.
The multimedia apparatus 200 as shown in FIG. 2 is a multimedia apparatus which performs diverse functions such as a music composing function, a song correcting function, and the like. Accordingly, when other functions are added or functions change, components may be added or changed.
The inputter 210 receives a user command to control the multimedia apparatus 200. In particular, the inputter 210 receives a user command to set a type of MIDI data. More specifically, the inputter 210 may receive a user command to set a type of MIDI data such as a genre, a style, a BPM, and a complexity of music that the user wishes to compose. The user may select a genre of music such as rock, ballad, rap, and jazz through the inputter 210. In addition, the user may select a style such as gloomy, pleasant, heavy, and dreamy through the inputter 210. Also, the user may adjust a complexity by reducing or increasing the number of instruments or tracks through the inputter 210. In addition, the user may adjust the BPM, which is the number of quarter notes per minute, through the inputter 210. Also, the user may adjust the tempo, which is the rate of quarter notes, half notes, and whole notes, through the inputter 210.
The image inputter 220 receives image data externally. More specifically, the image inputter 220 may receive broadcast image data from an external broadcasting station, receive streaming image data from an external server, or receive image data from an external device (for example, a DVD player, etc). In addition, the image inputter 220 may receive personal content, such as home video, personally recorded by the user. In particular, when the image inputter 220 is implemented in devices such as a smart phone, the image inputter 220 may receive image data from a video library of the user stored in, for example, the smart phone or stored externally.
The environment sensor 230 senses an external environment. More specifically, the environment sensor 230 may acquire weather information externally, acquire temperature information of an area at which the multimedia apparatus 200 is located by using a temperature sensor, acquire humidity information of an area at which the multimedia apparatus 200 is located using a humidity sensor, or acquire illumination information of an area at which the multimedia apparatus 200 is located by using an illumination sensor. In addition, the environment sensor 230 may acquire weather and time information by linking the multimedia apparatus 200 with an internet service using the location information of the user.
The display 240 may be controlled by the controller 290 to display diverse types of image data. In particular, the display 240 may display image data input through the image inputter 220.
In addition, the display 240 may display diverse types of user interfaces (UIs) to control the multimedia apparatus 200. For example, the display 240 may display a UI to set a type of MIDI data as shown in FIG. 4.
In addition, the display 240 may display a score having a pitch and a beat which is determined according to a user interaction. For example, the display 240 may display a score as shown in FIG. 5.
The audio outputter 250 may output audio data. The audio outputter 250 may output not only externally input audio data but also MIDI data generated by user interaction.
The sensor 260 senses a user interaction. In particular, the sensor 260 may sense user interaction to compose music. More specifically, the sensor 260 may sense various and diverse types of user interactions to determine a beat and a pitch of music that the user wishes to compose. For example, the sensor 260 may sense whether the user is humming by using a microphone, sense whether the user is making a motion by using a motion sensor, or sense whether the user is touching the apparatus by using a touch sensor. Therefore, the sensor 260 can include, for example, a microphone, a motion sensor or a touch sensor.
The storage 270 stores diverse modules to drive the multimedia apparatus 200. For example, the storage 270 may include software including a base module, a sensing module, a communication module, a presentation module, a web browser module, and a service module (not shown). The base module is a module that processes a signal transmitted from hardware included in the multimedia apparatus 200 and transmits the signal to an upper layer module. The sensing module is a module that collects information from diverse sensors and analyzes and manages the collected information, including a face recognition module, a voice recognition module, a motion recognition module, a near field communication (NFC) recognition module, and so on. The presentation module is a module that composes a display screen, including a multimedia module to play back and output multimedia content and a user interface (UI) rendering module to process UIs and graphics. The communication module is a module that communicates with external devices. The web browser module is a module that performs web browsing and accesses a web server. The service module is a module including diverse applications to provide diverse services.
In addition, the storage 270 may store diverse modules to compose music according to a user interaction. This is described with reference to FIG. 3. The modules to compose music according to user interaction may include a MIDI data type setting module 271, an interaction input module 272, an analysis module 273, a video input module 274, an emotion analysis module 275, a composed piece generation module 276, and a mixing module 277.
The MIDI data type setting module 271 may set a type of the MIDI data according to a user command which is input through the inputter 210. More specifically, the MIDI data type setting module 271 may set diverse types of MIDI data such as genre, BPM, style, and complexity of the MIDI data.
The interaction input module 272 receives a user interaction sensed by the sensor 260. More specifically, the interaction input module 272 may receive a user interaction including at least one of the user’s humming, a user's motion, and a user's touch.
The analysis module 273 may analyze the user interaction input through the interaction input module 272, and thus determine a pitch and a beat. For example, when a user hums and the humming is input through a microphone, the analysis module 273 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and determine a pitch of the user’s humming using correntropy pitch detection. When the user’s motion is input through a motion sensor, the analysis module 273 may determine a beat using a speed of the user’s motion, and determine a pitch using the distance of the motion. When the user’s touch is input through a touch sensor, the analysis module 273 may determine the beat by calculating a time at which the user touches the touch sensor, and determine the pitch by calculating an amount of pressure touched by the user on the touch sensor.
The video input module 274 receives video data input through the image inputter 220, and outputs the video data to the emotion analysis module 275.
The emotion analysis module 275 may analyze the input video data and thus determine emotion information of MIDI data. The emotion information of the MIDI data is information regarding the mood of the music that the user wishes to compose, including information such as chord progression, drum pattern, BPM, and spatial impression information. More specifically, the emotion analysis module 275 may determine a chord progression of the MIDI data using color information of an input image. For example, when brightness or chroma of an image is high, the emotion analysis module 275 may determine a bright major chord progression, that is, a chord progression which gives a sense of brightness, and when brightness or chroma is low, the emotion analysis module 275 may determine a dark minor chord progression, that is a chord progression which gives a sense of darkness.
The emotion analysis module 275 may determine a drum pattern or BPM of the MIDI data using motion information of an input image. For example, the emotion analysis module 275 may presume a certain BPM from a degree of motion of the entire clip, and then increase the complexity of a drum pattern at a portion having a lot of motion. The emotion analysis module 275 may acquire spatial impression information of the MIDI data using spatial information of the input video so that the acquired spatial impression may be used to form a spatial impression when multichannel audio is generated.
The composed piece generation module 276 generates MIDI data which is a composed piece, based on a type of the MIDI data set by the MIDI data type setting module 271, a pitch and a beat determined by the analysis module 273, and emotion information determined by the emotion analysis module 275.
The composed piece generation module 276 may also generate a score image corresponding to the generated MIDI data.
In addition, the composed piece generation module 276 may generate a previous measure of MIDI data and a subsequent measure of MIDI data using the MIDI data generated according to the user’s settings. More specifically, the composed piece generation module 276 may generate a previous measure of MIDI data and a subsequent measure of MIDI data of MIDI data generated based on a general composition pattern set by the user, a type of MIDI data set by the user, a chord progression determined by the emotion analysis module 275, etc.
The mixing module 277 mixes an input MIDI data with the user’s humming or video data.
Diverse types of modules, as well as the aforementioned modules, may be added, or the aforementioned modules may be changed. For example, an environment information input module may be added to receive surrounding environment information sensed by the environment sensor 230.
Returning to FIG. 2, the communicator 280 communicates with various types of external devices according to various types of communication methods. The communicator 280 may include various communication chips such as a wireless fidelity (Wi-Fi) chip, a Bluetooth chip, a near field communication (NFC) chip, and a wireless communication chip. The Wi-Fi chip, the Bluetooth chip, and the NFC chip perform communication according to a Wi-Fi method, a Bluetooth method, and an NFC method, respectively. The NFC chip is a chip that operates according to the NFC method which uses a 13.56MHz band among diverse radio frequency identification (RFID) frequency bands such as 135kHz, 13.56MHz, 433MHz, 860-960MHz, and 2.45GHz. In the case that a Wi-Fi chip or a Bluetooth chip is used, connection information such as a subsystem identification (SSID) and a session key are transmitted and received first, and then after communication is established, diverse information can be transmitted and received. The wireless communication chip is a chip that performs communication according to diverse communication standards such as IEEE, Zigbee, 3rd generation (3G), 3rd generation partnership project (3GPP), and long term evolution (LTE).
The controller 290 may include a random-access memory (RAM) 291, a read-only memory (ROM) 292, a graphic processor 293, a main central processing unit (CPU) 294, first to Nth interfaces 295-1 to 295-N, and a bus 296 as shown in FIG. 2. The RAM 291, the ROM 292, the graphic processor 293, the main CPU 294, and the first to Nth interfaces 295-1 to 295-N may be connected to one another via the bus 296.
The ROM 292 stores a set of commands to boot up the system. When a command to turn on the multimedia apparatus 200 is input and power is supplied, the main CPU 294 copies an operating system (OS) stored in the storage 270 to the RAM 291 and executes the OS according to the commands stored in the ROM 292 so that the system can boot up. When the boot-up is complete, the main CPU 294 copies diverse application programs stored in the storage 270 to the RAM 291, and runs the copied application programs so that various operations can be performed.
The graphic processor 293 generates images to be displayed on a screen on a display area of the display 240 including diverse objects such as an icon, an image, and text, using an operator (not shown) and a renderer (not shown). The operator operates property values of each object, such as a coordinate value, a shape, a size and a color, according to the layout of the screen by using a control command received from the inputter 210. The renderer generates an image on the screen having a diverse layout including objects based on the property values operated by the operator. The screen generated by the renderer is displayed on a display area of the display 240.
The main CPU 294 accesses the storage 270 and boots up the system using the OS stored in the storage 270. In addition, the main CPU 294 performs various operations using different types of programs, contents, and data stored in the storage 270.
The first to Nth interfaces 295-1 to 295-N are connected to the aforementioned components. One of the interfaces may be a network interface that is connected to an external device through a network.
The controller 290 may determine a beat and a pitch by analyzing a sensed user interaction, and generates MIDI data by using a type of MIDI data, which is set according to a user command input through the inputter 110, and by using the determined beat and pitch.
More specifically, when a command to run a music application is input so as to compose music, the controller 290 may control the display 240 to display a UI 400 to set a type of MIDI data, as shown in FIG. 4. The controller 290 may set various types of MIDI data such as genre, style, complexity, BPM, and tempo according to a user command input through the UI 400, as shown in FIG. 4.
When the controller 290 senses a user interaction through the sensor 260 after setting a type of the MIDI data, the controller 290 may analyze the user interaction and determine a pitch and a beat corresponding to the user interaction.
More specifically, if a user hums into a microphone , the controller 290 may determine a beat of the user’s humming using a harmonic cepstrum regularity (HCR) method, and determine a pitch of the user’s humming using correntropy pitch detection. The harmonic structure changes sharply at the point at which the humming first starts. Accordingly, the controller 290 may determine a beat by determining a point on which onset of the humming occurs using the HCR method. In addition, the controller 290 may determine a pitch using a signal between onsets of the humming according to correntropy pitch detection.
As another example, a pitch and beat can be determined according to a motion made by the user. When the user’s motion is input through a motion sensor, the controller 290 may determine a beat using the speed of the user’s motion, and determine a pitch using the distance of the motion. That is, as the user’s motion is faster, the controller 290 may determine that the beat is faster, and as the user’s motion becomes slower, the controller 290 may determine that beat is slower. In addition, as the distance of the motion of the user sensed by the motion sensor is shorter, the controller 290 may determine that the pitch is lower, and as the distance of the motion of the user sensed by the motion sensor is longer, the controller 290 may determine that the pitch is higher.
As another example, a pitch and beat can be determined if a user touches the touch screen or touch panel, such as the display 240, of the multimedia apparatus 200. When the user’s touch is input through a touch sensor, the analysis module 273 may determine a beat by calculating a time at which the user touches the touch sensor, and determine a pitch by calculating a position on a touch screen touched by the user. That is, if the user touches the touch screen for a longer period of time, the controller 290 may determine that the beat is slower, and if the user touches the screen for a short period of time, the controller 290 may determine that the beat is faster. In addition, the controller 290 may determine the pitch according to an area of the touch screen touched by the user.
The controller 290 may determine emotion information based on video data which is input or based on sensed surrounding environment information. The emotion information of the MIDI data indicates information regarding the mood of music that the user wishes to compose, including information such as chord progression, a drum pattern, BPM, and spatial impression information.
More specifically, the controller 290 may acquire emotion information using at least one of color information, motion information, and spatial information of an image input through an image inputter 220. For example, the controller 290 may determine the chord progression of MIDI data using the color information of an input image. More specifically, when the input image has many bright colors, the controller 290 may determine that the chord of the MIDI data is a major chord, and when the input image has many dark colors, the controller 290 may determine that chord of the MIDI data is a minor chord.
As another example, the controller 290 may determine a drum pattern or BPM of MIDI data using motion information of an input image. More specifically, when the input image has a lot of motion, the controller 290 may increase the BPM, and when the input image has a little bit of motion, the controller 290 may decrease the BPM.
Also, in another example, the controller 290 may acquire spatial impression information of MIDI data using the spatial information of the input video. More specifically, the controller 290 may extract an area parameter of a sound image of a composed piece using spatial information of the input video.
In addition, the controller 290 may acquire emotion information based on the surrounding environment information sensed by the environment sensor 230. For example, when the weather is sunny, when the temperature is warm, or when illumination is bright, the controller 290 may determine that the chord of the MIDI data is a major chord. When the weather is dark, when the temperature is cold, or when illumination is dark, the controller 290 may determine that the chord of the MIDI data is a minor chord.
When a type of MIDI data is not set by the user, the controller 290 may determine a type of MIDI data using surrounding environment information or video data. For example, when the weather is sunny, the controller 290 may set a genre of the MIDI data to be dance.
In addition, the controller 290 may generate a score using the determined beat and pitch, and may control the display 240 to display the generated score. More specifically, the controller 290 may generate a score using a beat and a pitch determined according to a user interaction as shown in FIG. 5. With reference to FIG. 5, the score may include different icons such as icon 510, icon 520, and icon 530 to generate a music file as well as the score determined according to user interaction. For example, the diverse icons may include a first icon 510 to generate a previous measure of MIDI data in front of a currently generated MIDI data, a second icon 520 to generate a rear measure of MIDI data behind the currently generated MIDI data, and a third icon 530 to repeat the currently generated MIDI data, as shown in FIG. 5.
At this time, the controller 290 may generate the previous measure of MIDI data or the rear measure of MIDI data using an existing database. In other words, the controller 290 may store a composition pattern of the user in the database, and predict and generate a previous measure or a rear measure of a currently generated MIDI data based on the stored composition pattern. For example, when a chord of four measures of a currently generated MIDI data is C-B-A-G, the controller 290 may set a chord of a subsequent measure to be C-D-G-C or F-E-D-C based on the database. In addition, when a chord of four measures of a currently generated MIDI data is C-D-G-C, the controller 290 may set a chord of a previous measure to be C-B-A-G based on the database.
In addition, the controller 290 may modify the MIDI data according to a user command which is input on a displayed score. In particular, when the display 240 includes a touch panel or touch screen, the controller 290 may modify the MIDI data using the user’s touch input to a score as shown in FIG. 5. When a user command is input to touch and drag a musical note, the controller 290 may modify the pitch of the touched note, and when a user command is input in which the user touches the note for more than a predetermined period of time, the controller 290 may modify the beat. However, this is merely an exemplary embodiment. The controller 290 may modify diverse composition parameters using other user commands.
When the user interaction is the user humming, the controller 290 may control the audio outputter 250 to mix and output MIDI data and the user’s humming. In addition, when video data is input through the image inputter 220, the controller 290 may control the audio outputter 250 and the display 240 to mix and output the input video data and the MIDI data.
FIG. 6 is a flowchart of a method for composing music using a user interaction according to an exemplary embodiment.
First, the multimedia apparatus 200 sets a type of the MIDI data according to the user’s input (S610). The type of MIDI data may include at least one of a genre, a style, a BPM, and a complexity of the MIDI data.
Subsequently, the multimedia apparatus 200 senses a user interaction with the multimedia apparatus 200 (S620). The user interaction may include at least one of the user humming into the microphone of the multimedia apparatus, touching a touch screen, and making a motion which is sensed by the multimedia apparatus.
The multimedia apparatus 200 analyzes the user interaction and determines a beat and a pitch (S630). More specifically, when the user’s humming is input through a microphone, the multimedia apparatus 200 may determine a beat of the user’s humming using the HCR method, and determines a pitch of the user’s humming using correntropy pitch detection. When the user’s motion is input through a motion sensor, the multimedia apparatus 200 may determine a beat using a speed of the user’s motion, and determine a pitch using the distance of the motion. When the user’s touch is input through a touch sensor, the multimedia apparatus 200 may determine a beat by calculating a time at which the user touches the multimedia apparatus 200, and determine a pitch by calculating an amount of pressure placed by the user on, for example, the touch sensor of the multimedia apparatus 200.
Subsequently, the multimedia apparatus 200 generates MIDI data based on the set type of the MIDI data and the determined pitch and beat (S640). At this time, the multimedia apparatus 200 may display a score of the generated MIDI data, and mix and output the generated MIDI data with the user’s humming or video data.
By using the multimedia apparatus 200, the user may easily and conveniently generate the MIDI data of music that the user wishes to compose.
In the above exemplary embodiment, the user’s humming is sensed using a microphone, but this is merely an exemplary embodiment. Instead, audio data in which the user’s humming is recorded may be input.
In the above exemplary embodiments, a method for composing music using a user interaction has been described, but this is merely an exemplary embodiment. It is also possible to compose music using video data. This is described with reference to FIGs. 7 and 8.
FIG. 7 illustrates a plurality of modules to compose music using video data according to an exemplary embodiment. With reference to FIG. 7, in order to compose music using video data, the storage 270 may include a video input module 710, a video information analysis module 720, a parameter determination module 730, an accompaniment generation module 740, and a mixing module 750.
The video input module 710 receives video data through the image inputter 220.
The video information analysis module 720 analyzes information regarding the input video data. More specifically, the video information analysis module 720 may analyze color information of the entire image, screen motion information according to a position of a camera, object motion information in the video, and spatial information extracted from an audio input signal.
The parameter determination module 730 determines a composition parameter based on the analyzed video information. More specifically, the parameter determination module 730 may determine a chord progression using the analyzed color information. For example, when analyzed color information is a bright or warm color, the parameter determination module 730 may determine that the chord progression is a major chord progression, and when the analyzed color information is a dark or cool color, the parameter determination module 730 may determine that the chord progression is a minor chord progression.
In addition, the parameter determination module 730 may determine a drum pattern using screen motion information. For example, when a screen motion or motion on a screen is fast, the parameter determination module 730 may determine that the drum pattern is fast, and when the motion on the screen is fixed, the parameter determination module 730 may determine that the drum pattern is slow. In addition, the parameter determination module 730 may determine BPM using the object motion information. For example, when the object motion is slow, the parameter determination module 730 may determine that the BPM is low, and when the object motion is fast, the parameter determination module 730 may determine that the BPM is high.
Also, , the parameter determination module 730 may adjust an area of a sound image using spatial information. For example, when a space of an audio signal is large, the parameter determination module 730 may determine that an area of a sound image is large, and when a space of an audio signal is small, the parameter determination module 730 may determine that an area of a sound image is small.
The accompaniment generation module 740 generates MIDI data using the composition parameter determined by the parameter determination module 730. More specifically, the accompaniment generation module 740 generates MIDI tracks of melody instruments (for example, piano, guitar, keyboard, etc), percussion instruments (for example, drum, etc), and bass rhythm instruments (for example, bass, etc) using a composition parameter determined by the parameter determination module 730. Subsequently, the accompaniment generation module 740 may generate complete MIDI data using the generated MIDI tracks of the melody instruments, percussion instruments, and bass rhythm instruments.
The mixing module 750 may mix the generated MIDI data with video data. In particular, the mixing module 750 may locate a sound image to correspond to spatial information of an audio signal included in the video data, and generate space sense according to spatial information of an audio signal included in the video data using a decorrelator.
The controller 290 may compose music according to input video data using the modules 710 to 750 as shown in FIG. 7. More specifically, when video is input through the image inputter 220, the controller 290 may analyze the input video data, determine a composition parameter, and generate MIDI data using the determined composition parameter. The composition parameter is a parameter to compose music, such as a chord progression, a drum pattern, BPM, and an area parameter.
In particular, the controller 290 may determine a chord progression using color information of the input video data. When the color of the entire image of the input video is bright, the controller 290 may determine that the chord progression of MIDI data is a major chord progression, and when the color of the entire image of the input video is dark, the controller 290 may determine that the chord progression of the MIDI data is a minor chord progression.
In addition, the controller 290 may determine a drum pattern using screen motion information of the input video data. More specifically, when the motion on the screen of the input image is fast, the controller 290 may determine that the drum pattern is fast, and when the motion on the screen of the input image is fixed, the controller 290 may determine that the drum pattern is slow.
In addition, the controller 290 may determine BPM using object motion information of the input video data. More specifically, when the motion of a particular object in the input image is slow, the controller 290 may determine that BPM is low, and when the motion of a particular object in the input image is fast, the controller 290 may determine that BPM is high.
In addition, the controller 290 may adjust an area of a sound image using spatial information of an audio signal included in the input video data. More specifically, when a space of an audio signal is large, the controller 290 may determine that an area of a sound image is large, and when a space of an audio signal is small, the controller 290 may determine that an area of a sound image is small.
The controller 290 may generate MIDI data using a determined parameter. More specifically, the controller 290 generates a MIDI track of melody instruments (for example, piano, guitar, keyboard, etc) using a template based on a determined chord progression and genre set by the user, generates a MIDI track of percussion instruments (for example, a drum, etc) using a drum pattern, and generates a MIDI track of bass rhythm instruments (for example, bass, etc) using a chord progression, a genre, and a drum pattern. Subsequently, the controller 290 may generate complete MIDI data using the generated MIDI tracks of the melody instruments, percussion instruments, and bass rhythm instruments.
In addition, the controller 290 may run the generated MIDI data together with the video data. In other words, the controller 290 may mix and output the generated MIDI data with the video data. At this time, the controller 290 may synchronize the MIDI data and audio signals included in the video data.
FIG. 8 is a flowchart of a method for composing music using video data according to another exemplary embodiment.
First, the multimedia apparatus 200 receive video data (S810). The multimedia apparatus 200 may receive video data from an external device, or may receive pre-stored video data.
Subsequently, the multimedia apparatus 200 analyzes the input video data and determines a composition parameter (S820). The composition parameter is a parameter to compose music, such as a chord progression, a drum pattern, BPM, and an area parameter. More specifically, the multimedia apparatus 200 may determine a chord progression using the analyzed color information. In addition, the multimedia apparatus 200 may determine a drum pattern using screen motion information of the video data. In addition, the multimedia apparatus 200 may determine BPM using object motion information of the video data. Also, the multimedia apparatus 200 may adjust an area of a sound image using spatial information.
Subsequently, the multimedia apparatus 200 generates MIDI data using the composition parameter (S830). More specifically, the multimedia apparatus 200 may generate MIDI tracks of melody instruments, percussion instruments, and bass rhythm instruments using the composition parameter, and generate MIDI data by mixing the generated MIDI tracks. In addition, the multimedia apparatus 200 may run the generated MIDI data together with the video data.
As described above, MIDI data is generated using video data so that the user may compose music suitable with the mood of the video data.
In the exemplary embodiments, music is composed using a pitch and a beat detected based on, for example, the user’s humming, but this is merely an exemplary embodiment. In other exemplary embodiments, the pitch and beat can be detected based on a song sung by the user and the song is obtained based on the detected pitch and beat and the song sung by the user is corrected based on the obtained song.
FIG. 9 illustrates a plurality of modules to correct a song according to yet another exemplary embodiment. With reference to FIG. 9, the storage 270 of the multimedia apparatus 200 may include a song input module 910, a song analysis module 920, a virtual score generation module 930, a score acquisition module 940, a song and score synchronization module 950, a song correction module 960, a sound source acquisition module 970, an accompaniment separation module 980, and a mixing module 990 in order to correct a song sung by the user.
The song input module 910 receives a song sung by the user. At this time, the song input module 910 may receive a song input through a microphone, or a song included in audio data.
The song analysis module 920 analyzes a beat and a pitch of the song sung by the user. More specifically, the song analysis module 920 determines a beat of the song using an HCR method, and determines a pitch of the song using correntropy pitch detection.
The virtual score generation module 930 generates a virtual score based on the pitch and beat analyzed by the song analysis module 920.
The score acquisition module 940 acquires a score of the song sung by the user using the virtual score generation module 930. The score acquisition module 940 may acquire the score by comparing a score stored in the database with the virtual score. In another exemplary embodiment, the score acquisition module 940 may acquire the score by taking a photograph of a printed score using a camera and analyzing the captured image. In another exemplary embodiment, the score acquisition module 940 may acquire the score using musical notes input by the user on manuscript paper which is displayed on the display 240.
In yet another exemplary embodiment, the score acquisition module 940 may acquire a score by comparing the song sung by the user with a vocal track extracted from a pre-stored sound source. In addition, the score acquisition module 940 may acquire a score by stochastically presuming an onset and offset pattern and dispersion of pitch based on frequency characteristics of the song which was input. At this time, the score acquisition module 940 may presume a beat and a pitch from the input song using the HCR method and correntropy pitch detection, extract stochastically the most suitable BPM and chord from dispersion of the presumed beat and pitch, and thus generate a score.
The song and score synchronization module 950 synchronizes the song sung by the user and the score acquired by the score acquisition module 940. At this time, the song and score synchronization module 950 may synchronize the song which was sung and the score using a dynamic time warping (DTW) method. The DTW method is an algorithm that finds an optimum warping path by comparing the similarity between two sequences.
The song correction module 960 corrects a wrong portion, for example, an off-key portion, an off-beat portion, etc, of the song sung by the user by comparing the song and the score. More specifically, the song correction module 960 may correct the song to correspond to the score by applying time stretching and a frequency shift.
The sound source acquisition module 970 acquires a sound source of the song sung by the user. At this time, the sound source acquisition module 970 may acquire a sound source using a score acquired by the score acquisition module 940.
The accompaniment separation module 980 separates a vocal track and an accompaniment track from the acquired sound source, and outputs the accompaniment track to the mixing module 990.
The mixing module 990 mixes and outputs the accompaniment track separated by the accompaniment separation module 980 with the song corrected by the song correction module 960.
The controller 290 corrects a song sung by the user using the exemplary modules as shown in FIG. 9.
More specifically, when a song sung by the user is input, the controller 290 analyzes the song and acquires a score that matches the song. The controller 290 determines a beat of the song using an HCR method, and determines pitch of the song using correntropy pitch detection. In addition, the controller 290 may generate a virtual score based on the determined beat and pitch, and acquire a score which is the most similar to the virtual score among the scores stored in the database, as a score corresponding to the song. In another exemplary embodiment, the controller 290 may acquire a score by the user’s input, acquire a score using a photographed score image, acquire a score from a vocal track separated from a pre-stored sound source, or use the virtual score as a score corresponding to the song.
When the score is acquired, the controller 290 synchronizes the score and the song sung by the user. At this time, the controller 290 may synchronize the score and the song using a DTW method.
In addition, the controller 290 corrects the song based on the synchronized score. More specifically, the controller 290 may correct a pitch and a beat of the song by applying time stretching and a frequency shift so that the song is synchronized with the score.
In addition, the controller 290 controls the audio outputter 250 to output the corrected song.
In another exemplary embodiment, the controller 290 searches for a sound source which matches the song sung by the user. The controller 290 may search for the sound source using a score or according to the user’s input. When the sound source is found, the controller 290 receives the sound source. The found sound source may be pre-stored or may be externally downloaded through the communicator 280. In addition, the controller 290 extracts an accompaniment sound from the sound source. The controller 290 may control the audio outputter 250 to mix and output the corrected song and the accompaniment sound.
FIG. 10 is a flowchart of a method for correcting a song according to another exemplary embodiment.
First, the multimedia apparatus 200 receives a song sung by the user (S1010). The multimedia apparatus 200 may receive the song through a microphone or through externally transmitted audio data.
Subsequently, the multimedia apparatus 200 analyzes the song (S1020). More specifically, the multimedia apparatus 200 may analyze a pitch and a beat of the song.
Subsequently, the multimedia apparatus 200 acquires a score which matches the song (S1030). More specifically, the multimedia apparatus 200 may acquire a virtual score using the analyzed pitch and beat, compare the virtual score with the scores stored in the database, and determine that a score which is the most similar to the virtual score is the score which matches the song.
The multimedia apparatus 200 then synchronizes the song and the acquired score (S1040). More specifically, the multimedia apparatus 200 may synchronize the song and the acquired score in a DTW method.
Subsequently, the multimedia apparatus 200 corrects the song based on the acquired score (S1050). More specifically, the multimedia apparatus 200 may correct a pitch and a beat of the song to correspond to the score by applying time stretching and a frequency shift.
Using the aforementioned song correction method, general users who do not sing well may easily and conveniently correct their song so as to be suitable as an original song.
The music composing method or the song correcting method according to the aforementioned exemplary embodiments may be implemented with a program, and may be provided to a display apparatus. Programs including the music composing method or the song correcting method may be stored in a non-transitory computer readable medium.
The non-transitory computer readable medium is a medium which does not store data temporarily such as a register, cache, and memory but stores data semi-permanently and is readable by devices. More specifically, the aforementioned applications or programs may be stored in the non-transitory computer readable medium such as compact disks (CDs), digital video disks (DVDs), hard disks, Blu-ray disks, universal serial buses (USBs), memory cards, and read-only memory (ROM).
The foregoing exemplary embodiments are merely exemplary and are not to be construed as limiting the exemplary embodiments. The exemplary embodiments can be readily applied to other types of apparatuses. Also, the description of the exemplary embodiments is intended to be illustrative, and not to limit the scope of the claims, and many alternatives, modifications, and variations will be apparent to those skilled in the art.

Claims (15)

  1. A music composing method comprising:
    setting a type of musical instrument digital interface (MIDI) data according to a user’s input;
    sensing a user interaction;
    analyzing the sensed user interaction and determining a beat and a pitch of the sensed user interaction; and
    generating MIDI data using the set type of MIDI data and the determined beat and pitch.
  2. The method as claimed in claim 1, wherein the setting the type of MIDI data comprises setting at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
  3. The method as claimed in claim 1, further comprising:
    receiving an image; and
    obtaining emotion information using at least one of color information, motion information, and spatial information of the received image,
    wherein the generating the MIDI data comprises generating the MIDI data by using the emotion information.
  4. The method as claimed in claim 1, further comprising:
    sensing at least one of weather, a temperature, a humidity, and an illumination; and
    generating emotion information using the sensed at least one of the weather, the temperature, the humidity, and the illumination,
    wherein the generating the MIDI data comprising generating the MIDI data by using the emotion information.
  5. The method as claimed in claim 1, further comprising:
    generating a score using the determined beat and the determined pitch; and
    displaying the generated score.
  6. The method as claimed in claim 5, further comprising:
    modifying the MIDI data using the displayed generated score.
  7. The method as claimed in claim 1, further comprising:
    generating previous measure of MIDI data and a subsequent measure of MIDI data of the generated MIDI data using the generated MIDI data; and
    generating a music file using the generated MIDI data, the generated previous measure of MIDI data, and the generated subsequent measure of MIDI data.
  8. The method as claimed in claim 1, wherein the user interaction comprises one of humming by the user, a touch made by the user, and a motion made by the user.
  9. The method as claimed in claim 8, further comprising:
    mixing and outputting the MIDI data and the humming by the user when the user interaction is the humming by the user.
  10. A multimedia apparatus comprising:
    an inputter configured to receive a user command to set a type of musical instrument digital interface (MIDI) data;
    a sensor configured to sense a user interaction; and
    a controller configured to analyze the sensed user interaction and determine a beat and a pitch, and configured to generate MIDI data using the set type of MIDI data and the determined beat and pitch.
  11. The multimedia apparatus as claimed in claim 10, wherein the inputter receives a user command to set at least one of a genre, a style, a beats per minute (BPM), and a complexity of the MIDI data.
  12. The multimedia apparatus as claimed in claim 10, further comprising:
    an image inputter configured to receive an image,
    wherein the controller obtains emotion information using at least one of color information, motion information, and spatial information of the image received through the image inputter, and generates the MIDI data using the emotion information.
  13. The multimedia apparatus as claimed in claim 10, further comprising:
    an environment sensor configured to sense at least one of a weather, a temperature, a humidity, and an illumination,
    wherein the controller generates emotion information using the at least one of the weather, the temperature, the humidity, and the illumination, and generates the MIDI data using the emotion information.
  14. The multimedia apparatus as claimed in claim 10, further comprising:
    a display,
    wherein the controller generates a score using the determined beat and pitch, and controls the display to display the generated score.
  15. The multimedia apparatus as claimed in claim 14, wherein the controller modifies the MIDI data according to a user command which is input onto the displayed score.
PCT/KR2014/011463 2013-12-20 2014-11-27 Multimedia apparatus, music composing method thereof, and song correcting method thereof WO2015093744A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
EP14872744.9A EP3066662A4 (en) 2013-12-20 2014-11-27 Multimedia apparatus, music composing method thereof, and song correcting method thereof

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
KR10-2013-0159906 2013-12-20
KR1020130159906A KR20150072597A (en) 2013-12-20 2013-12-20 Multimedia apparatus, Method for composition of music, and Method for correction of song thereof

Publications (1)

Publication Number Publication Date
WO2015093744A1 true WO2015093744A1 (en) 2015-06-25

Family

ID=53400687

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/KR2014/011463 WO2015093744A1 (en) 2013-12-20 2014-11-27 Multimedia apparatus, music composing method thereof, and song correcting method thereof

Country Status (4)

Country Link
US (1) US9607594B2 (en)
EP (1) EP3066662A4 (en)
KR (1) KR20150072597A (en)
WO (1) WO2015093744A1 (en)

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20150072597A (en) * 2013-12-20 2015-06-30 삼성전자주식회사 Multimedia apparatus, Method for composition of music, and Method for correction of song thereof
US9721551B2 (en) 2015-09-29 2017-08-01 Amper Music, Inc. Machines, systems, processes for automated music composition and generation employing linguistic and/or graphical icon based musical experience descriptions
US10854180B2 (en) 2015-09-29 2020-12-01 Amper Music, Inc. Method of and system for controlling the qualities of musical energy embodied in and expressed by digital music to be automatically composed and generated by an automated music composition and generation engine
US9799312B1 (en) * 2016-06-10 2017-10-24 International Business Machines Corporation Composing music using foresight and planning
US10902829B2 (en) 2017-03-16 2021-01-26 Sony Corporation Method and system for automatically creating a soundtrack to a user-generated video
KR101970878B1 (en) * 2017-05-29 2019-04-19 한양대학교 에리카산학협력단 Automatic Composition Method Using Composition Processing History and Apparatus Therefor
KR101942814B1 (en) * 2017-08-10 2019-01-29 주식회사 쿨잼컴퍼니 Method for providing accompaniment based on user humming melody and apparatus for the same
CN107464572B (en) * 2017-08-16 2020-10-16 重庆科技学院 Multi-mode interactive music perception system and control method thereof
KR101975193B1 (en) * 2017-11-15 2019-05-07 가기환 Automatic composition apparatus and computer-executable automatic composition method
WO2019226985A1 (en) * 2018-05-24 2019-11-28 Kids Ii, Inc. Adaptive sensory outputs synchronized to input tempos for soothing effects
CN110555126B (en) 2018-06-01 2023-06-27 微软技术许可有限责任公司 Automatic generation of melodies
CN108922505B (en) * 2018-06-26 2023-11-21 联想(北京)有限公司 Information processing method and device
US10748515B2 (en) 2018-12-21 2020-08-18 Electronic Arts Inc. Enhanced real-time audio generation via cloud-based virtualized orchestra
US10790919B1 (en) 2019-03-26 2020-09-29 Electronic Arts Inc. Personalized real-time audio generation based on user physiological response
US10799795B1 (en) 2019-03-26 2020-10-13 Electronic Arts Inc. Real-time audio generation for electronic games based on personalized music preferences
US10657934B1 (en) * 2019-03-27 2020-05-19 Electronic Arts Inc. Enhancements for musical composition applications
US10643593B1 (en) 2019-06-04 2020-05-05 Electronic Arts Inc. Prediction-based communication latency elimination in a distributed virtualized orchestra
USD920277S1 (en) 2019-07-12 2021-05-25 Kids2, Inc. Audio player
US11037538B2 (en) 2019-10-15 2021-06-15 Shutterstock, Inc. Method of and system for automated musical arrangement and musical instrument performance style transformation supported within an automated music performance system
US10964299B1 (en) * 2019-10-15 2021-03-30 Shutterstock, Inc. Method of and system for automatically generating digital performances of music compositions using notes selected from virtual musical instruments based on the music-theoretic states of the music compositions
US11024275B2 (en) * 2019-10-15 2021-06-01 Shutterstock, Inc. Method of digitally performing a music composition using virtual musical instruments having performance logic executing within a virtual musical instrument (VMI) library management system
WO2021163377A1 (en) * 2020-02-11 2021-08-19 Aimi Inc. Music content generation
KR102390950B1 (en) * 2020-06-09 2022-04-27 주식회사 크리에이티브마인드 Method for generating user engagement music and apparatus therefor
KR20230078372A (en) * 2021-11-26 2023-06-02 삼성전자주식회사 An electronic apparatus and a method thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5281754A (en) * 1992-04-13 1994-01-25 International Business Machines Corporation Melody composer and arranger
US20060230910A1 (en) * 2005-04-18 2006-10-19 Lg Electronics Inc. Music composing device
US20070186750A1 (en) * 2006-01-20 2007-08-16 Samsung Electronics Co., Ltd. Apparatus and method for composing music in a portable terminal
US20090249945A1 (en) * 2004-12-14 2009-10-08 Sony Corporation Music composition data reconstruction device, music composition data reconstruction method, music content reproduction device, and music content reproduction method
US20120312145A1 (en) * 2011-06-09 2012-12-13 Ujam Inc. Music composition automation including song structure

Family Cites Families (32)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5428708A (en) 1991-06-21 1995-06-27 Ivl Technologies Ltd. Musical entertainment system
US5428707A (en) 1992-11-13 1995-06-27 Dragon Systems, Inc. Apparatus and methods for training speech recognition systems and their users and otherwise improving speech recognition performance
US5627335A (en) 1995-10-16 1997-05-06 Harmonix Music Systems, Inc. Real-time music creation system
FR2785438A1 (en) * 1998-09-24 2000-05-05 Baron Rene Louis MUSIC GENERATION METHOD AND DEVICE
JP3533974B2 (en) * 1998-11-25 2004-06-07 ヤマハ株式会社 Song data creation device and computer-readable recording medium recording song data creation program
KR100357996B1 (en) 1999-12-29 2002-10-25 발레오만도전장시스템스코리아 주식회사 stator of alternator
JP3666364B2 (en) * 2000-05-30 2005-06-29 ヤマハ株式会社 Content generation service device, system, and recording medium
KR100363027B1 (en) 2000-07-12 2002-12-05 (주) 보이스웨어 Method of Composing Song Using Voice Synchronization or Timbre Conversion
US6384310B2 (en) 2000-07-18 2002-05-07 Yamaha Corporation Automatic musical composition apparatus and method
JP2002149173A (en) 2000-11-13 2002-05-24 Dainippon Printing Co Ltd Karaoke machine, its system and recording medium
KR100412196B1 (en) 2001-05-21 2003-12-24 어뮤즈텍(주) Method and apparatus for tracking musical score
KR100658219B1 (en) * 2001-06-25 2006-12-15 어뮤즈텍(주) Method and apparatus for designating expressive performance notes with synchronization information
US7174510B2 (en) 2001-10-20 2007-02-06 Hal Christopher Salter Interactive game providing instruction in musical notation and in learning an instrument
CN1703734A (en) 2002-10-11 2005-11-30 松下电器产业株式会社 Method and apparatus for determining musical notes from sounds
JP2006084749A (en) * 2004-09-16 2006-03-30 Sony Corp Content generation device and content generation method
WO2006079813A1 (en) 2005-01-27 2006-08-03 Synchro Arts Limited Methods and apparatus for use in sound modification
JP4626376B2 (en) * 2005-04-25 2011-02-09 ソニー株式会社 Music content playback apparatus and music content playback method
EP1785891A1 (en) * 2005-11-09 2007-05-16 Sony Deutschland GmbH Music information retrieval using a 3D search algorithm
KR100658869B1 (en) * 2005-12-21 2006-12-15 엘지전자 주식회사 Music generating device and operating method thereof
KR100705176B1 (en) 2006-01-09 2007-04-06 엘지전자 주식회사 Method for generating music file in mobile terminal
US7705231B2 (en) 2007-09-07 2010-04-27 Microsoft Corporation Automatic accompaniment for vocal melodies
JP4306754B2 (en) * 2007-03-27 2009-08-05 ヤマハ株式会社 Music data automatic generation device and music playback control device
US8111241B2 (en) 2007-07-24 2012-02-07 Georgia Tech Research Corporation Gestural generation, sequencing and recording of music on mobile devices
JP2010066739A (en) 2008-09-08 2010-03-25 Shunpei Takahira Pitch conversion device
US8805854B2 (en) * 2009-06-23 2014-08-12 Gracenote, Inc. Methods and apparatus for determining a mood profile associated with media data
KR101611511B1 (en) 2009-05-12 2016-04-12 삼성전자주식회사 A method of composing music in a portable terminal having a touchscreen
KR20110107496A (en) 2010-03-25 2011-10-04 민경국 Realizing technology of electronic samulnori and electronic daegeum(traditional korean musical instruments) using input mode of touch screen(multi touch, gesture)and signal of accelerometer/processing of a sound source and simultaneous performance using wire or wireless communications between server and client /emission technology and game
KR20110121883A (en) 2010-05-03 2011-11-09 삼성전자주식회사 Apparatus and method for compensating of user voice
KR101637470B1 (en) 2010-05-13 2016-07-08 한양대학교산학협력단 Method of determining pedestrian's indoor position
US8618405B2 (en) 2010-12-09 2013-12-31 Microsoft Corp. Free-space gesture musical instrument digital interface (MIDI) controller
KR20150072597A (en) * 2013-12-20 2015-06-30 삼성전자주식회사 Multimedia apparatus, Method for composition of music, and Method for correction of song thereof
CN104834642B (en) * 2014-02-11 2019-06-18 北京三星通信技术研究有限公司 Change the method, device and equipment of music deduction style

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5281754A (en) * 1992-04-13 1994-01-25 International Business Machines Corporation Melody composer and arranger
US20090249945A1 (en) * 2004-12-14 2009-10-08 Sony Corporation Music composition data reconstruction device, music composition data reconstruction method, music content reproduction device, and music content reproduction method
US20060230910A1 (en) * 2005-04-18 2006-10-19 Lg Electronics Inc. Music composing device
US20070186750A1 (en) * 2006-01-20 2007-08-16 Samsung Electronics Co., Ltd. Apparatus and method for composing music in a portable terminal
US20120312145A1 (en) * 2011-06-09 2012-12-13 Ujam Inc. Music composition automation including song structure

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP3066662A4 *

Also Published As

Publication number Publication date
EP3066662A1 (en) 2016-09-14
US9607594B2 (en) 2017-03-28
US20150179157A1 (en) 2015-06-25
EP3066662A4 (en) 2017-07-26
KR20150072597A (en) 2015-06-30

Similar Documents

Publication Publication Date Title
WO2015093744A1 (en) Multimedia apparatus, music composing method thereof, and song correcting method thereof
EP3574395A1 (en) Display device for visualizing contents as the display is rotated and control method thereof
WO2017039142A1 (en) User terminal apparatus, system, and method for controlling the same
WO2015133782A1 (en) Contents analysis method and device
WO2010011007A1 (en) Method of managing content and electronic apparatus using the same
WO2015099276A1 (en) Display apparatus, server apparatus, display system including them, and method for providing content thereof
WO2020162709A1 (en) Electronic device for providing graphic data based on voice and operating method thereof
WO2019139301A1 (en) Electronic device and subtitle expression method thereof
WO2014104658A1 (en) Method and system for executing an application
WO2016060296A1 (en) Apparatus for recording audio information and method for controlling same
WO2014012498A1 (en) Desktop control method and apparatus, and computer storage medium
WO2019139270A1 (en) Display device and content providing method thereof
WO2017057960A1 (en) Electronic device and method for controlling the same
WO2015190781A1 (en) User terminal, method for controlling same, and multimedia system
EP3759707A1 (en) A method and system for musical synthesis using hand-drawn patterns/text on digital and non-digital surfaces
WO2019231138A1 (en) Image display apparatus and operating method of the same
WO2019050083A1 (en) Method for providing user's hummed melody-based melody recording and apparatus therefor
WO2019142988A1 (en) Electronic device, control method therefor, and computer-readable recording medium
WO2018056587A1 (en) Electronic apparatus and controlling method thereof
WO2017065394A1 (en) Display apparatus and control method therefor
WO2019031771A1 (en) Display device for visualizing contents as the display is rotated and control method thereof
WO2018016760A1 (en) Electronic device and control method thereof
WO2022239988A1 (en) Dance matching method and system
WO2019088471A1 (en) Electronic device and method for controlling same
WO2017217613A1 (en) Method for creating and sharing subtitles of video content by using machine learning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14872744

Country of ref document: EP

Kind code of ref document: A1

REEP Request for entry into the european phase

Ref document number: 2014872744

Country of ref document: EP

WWE Wipo information: entry into national phase

Ref document number: 2014872744

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: DE