WO2021115311A1

WO2021115311A1 - Song generation method, apparatus, electronic device, and storage medium

Info

Publication number: WO2021115311A1
Application number: PCT/CN2020/134835
Authority: WO
Inventors: 蒋慧军; 黄尹星; 姜凯英; 韩宝强; 肖京
Original assignee: 平安科技（深圳）有限公司
Priority date: 2020-05-29
Filing date: 2020-12-09
Publication date: 2021-06-17
Also published as: CN111680185A

Abstract

The present application relates to the technical field of data processing, and in particular to a song generation method, an apparatus, an electronic device, and a storage medium. The song generation method comprises: receiving a first confirmation instruction, and on the basis of the first confirmation instruction, acquiring first musical attribute information and a mood tag for a song; on the basis of a preset association relationship between mood tags and musical properties and on the basis of the mood tag, determining second musical attribute information for the song; causing the first musical attribute information and the second musical attribute information to serve as filter conditions to traverse a pre-constructed music database, to acquire candidate musical segments matching the filter conditions; and on the basis of a received second confirmation instruction, selecting musical segments from among the candidate musical segments, and joining the musical segments according to a corresponding sequence of measures of music to acquire a new song. Using the method provided in the present application, new songs meeting a user's requirements can be controllably generated.

Description

Music generating method, device, electronic equipment and storage medium

This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on May 29, 2020, the application number is 202010478115.7, and the invention title is "Music composition generation method, device, electronic equipment and storage medium", the entire content of which is incorporated by reference In this application.

Technical field

This application relates to the field of data processing technology, and in particular to a method, device, electronic device, and storage medium for generating music.

Background technique

For music creation, it is necessary for the creator to have the corresponding knowledge of music theory, so that many non-professionals who love music cannot create music that suits their preferences. Automatic creation of music melody, especially automatic creation of complete melody with specific style and emotion, has always been an urgent problem to be solved.

The inventor found that with the development of computer technology, many auxiliary tools have appeared to help non-professionals create music. For example, deep learning models are used to generate music, but the music created in this way is lacking in professionalism. , The generated melody has a certain degree of randomness, and the process of music generation cannot be reproduced, that is, it is impossible to controllably generate music that meets the needs of users.

technical problem

The present application provides a music generation method, electronic equipment, and storage medium, the main purpose of which is to live in a controlled manner to meet the needs of users with new music.

Technical solutions

The embodiment of the present application first provides a method for generating music, including the following steps: receiving a first confirmation instruction, and obtaining first music attribute information and emotion tags of the music according to the first confirmation instruction; the first confirmation instruction is correct The first music attribute information and the selection confirmation instruction of the emotion tag; determine the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag; combine the first music attribute information with The second music attribute information is used as a filter condition to traverse the pre-built music database to obtain candidate music pieces matching the filter condition; wherein the music database stores a plurality of music pieces pre-divided according to the music attributes; The received second confirmation instruction selects a music fragment from the candidate music fragments, and splices the music fragments in the order of the corresponding music bars to obtain a new music piece; wherein, the second confirmation instruction is to select a music fragment Confirm the instruction.

Correspondingly, an embodiment of the present application also provides a music generating device, including: a first music attribute information obtaining module, configured to receive a first confirmation instruction, and obtain first music attribute information of the music according to the first confirmation instruction and Emotion tag; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and the emotion tag; the second music attribute information module is determined to be used based on the pre-set association relationship between the emotion tag and the music attribute and the The emotion tag determines the second music attribute information of the music; the obtaining candidate music segment module is used to traverse the pre-built music database using the first music attribute information and the second music attribute information as filtering conditions, and obtain the information corresponding to the filtering conditions. A matching candidate music piece; wherein the music database stores a plurality of music pieces pre-divided according to music attributes; a new music piece module is used to select from the candidate music pieces according to the received second confirmation instruction Music fragments, the music fragments are spliced in the order of corresponding music bars to obtain a new music piece; wherein, the second confirmation instruction is a selection confirmation instruction for the music fragment.

Further, an embodiment of the present application also provides an electronic device, the electronic device includes: a memory, a processor, and a computer program stored in the memory and capable of running on the processor, and when the processor executes the program The following method is implemented: receiving a first confirmation instruction, and obtaining first music attribute information and emotion tags of a music piece according to the first confirmation instruction; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tags; Determine the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag; use the first music attribute information and the second music attribute information as filter conditions to traverse the pre-built music A database to obtain candidate music pieces matching the screening conditions; wherein the music database stores a plurality of music pieces pre-divided according to music attributes; according to the received second confirmation instruction, from the candidate music pieces The music fragments are selected, and the music fragments are spliced in the order of the corresponding music bars to obtain a new music piece; wherein the second confirmation instruction is a selection confirmation instruction for the music fragment.

In addition, in order to achieve the above object, the present application also provides a computer-readable storage medium, the computer-readable storage medium includes a music generation program, and when the music generation program is executed by a processor, the following method is implemented: Confirmation instruction, according to the first confirmation instruction to obtain the first music attribute information and emotion tag of the music; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tag; based on the preset emotion tag and The association relationship between the music attributes and the emotion tag determine the second music attribute information of the music; the first music attribute information and the second music attribute information are used as filtering conditions to traverse the pre-built music database to obtain the information related to the filtering Candidate music fragments matching the conditions; wherein the music database stores a plurality of music fragments pre-divided according to music attributes; according to the received second confirmation instruction, a music fragment is selected from the candidate music fragments, and the The music fragments are spliced in the order of the corresponding music bars to obtain a new music piece; wherein, the second confirmation instruction is a selection confirmation instruction for the music fragment.

Beneficial effect

The first confirmation instruction provided in this application is based on the user's selection of music attributes, and the second confirmation instruction is based on the user's selection of candidate music fragments. The first confirmation instruction and the second confirmation instruction are used to determine the music fragment, that is, the process of generating new music. The determination of the music attributes and music fragments is based on user selection, so the generated music is highly related to the user’s preferences, and the human-computer interaction in the music generation process is strong. Moreover, compared with deep learning and other methods, the generated music is generated by this application. The music is controllable and reproducible, that is, it can reproduce the process of music generation. The music generation method provided in this application preliminarily stores music sections according to music attribute information for a large number of music data samples to obtain a music database. In the process of generating music, the music attributes selected by the user are used to filter matching music. Fragments, music fragments include multiple music bars. Since the new music is formed based on existing music fragments, the melody of the new music conforms to the harmony trend, and the generated music is coherent.

Description of the drawings

FIG. 1 is a flowchart of a method for generating a music composition according to an embodiment of the application.

2 is a flowchart of determining second music attribute information of a music piece based on the association relationship between preset emotion tags and music attributes and the emotion tags according to an embodiment of the application.

FIG. 3 is a schematic diagram of a valence-Arousal dimensional emotion model provided by an embodiment of this application.

FIG. 4 is a schematic diagram of a melody curve provided by an embodiment of the application with a small-amplitude gyration, and the middle tone in the melody curve is adjusted in the opposite direction to the tuning inner tone.

FIG. 5 is a schematic structural diagram of a music generating device provided by an embodiment of the application.

FIG. 6 is a schematic structural diagram of an electronic device provided by an embodiment of this application.

Embodiments of the present invention

Hereinafter, embodiments of the present application will be described in more detail with reference to the accompanying drawings. Although some embodiments of the present application are shown in the drawings, it should be understood that the present application can be implemented in various forms and should not be construed as being limited to the embodiments set forth herein. On the contrary, these embodiments are provided for Have a more thorough and complete understanding of this application. It should be understood that the drawings and embodiments of the present application are only used for exemplary purposes, and are not used to limit the protection scope of the present application.

It should be understood that the steps described in the method embodiments of the present application may be executed in a different order, and/or executed in parallel. In addition, method implementations may include additional steps and/or omit to perform the illustrated steps. The scope of this application is not limited in this respect.

The term "including" and its variants as used herein are open-ended includes, that is, "including but not limited to"; the term "based on" is "based at least in part on". The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments." Related definitions of other terms will be given in the following description.

It should be noted that the modifications of "a" and "multiple" mentioned in this application are illustrative and not restrictive, and those skilled in the art should understand that unless otherwise clearly indicated in the context, they should be interpreted as "one or Multiple".

The technical solution of this application can be applied to the fields of artificial intelligence, smart city, blockchain and/or big data technology. Optionally, the data involved in this application, such as attribute information, tags, and/or music, can be stored in a database, or can be stored in a blockchain, such as distributed storage through a blockchain, which is not limited in this application.

The technical solutions of the present application and how the technical solutions of the present application solve the above technical problems will be described in detail below with specific embodiments. The following specific embodiments can be combined with each other, and the same or similar concepts or processes may not be repeated in some embodiments. The embodiments of the present application will be described below in conjunction with the accompanying drawings.

The embodiment of the application first provides a method for generating music. Figure 1 is a flowchart of the method for generating music provided by an embodiment of the application. The method can be executed by a device, and the device can be implemented by software and/or hardware. The method can be executed on the user side, and the user side includes a human-computer interaction interface.

S110: Receive a first confirmation instruction, and obtain first music attribute information and emotion tags of the music according to the first confirmation instruction; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tags.

S120: Determine the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag.

S130. Use the first music attribute information and the second music attribute information as filter conditions to traverse a pre-built music database to obtain candidate music pieces that match the filter conditions; wherein the music database stores pre-established music Multiple pieces of music divided by music attributes.

S140: Select a music fragment from the candidate music fragments according to the received second confirmation instruction, and splice the music fragments in the order of the corresponding music bars to obtain a new music piece; wherein the second confirmation instruction is right Confirm the selection of music fragments.

The music attributes provided in this application include at least the following information: rhythm type, speed, chord, key, mode, time signature, musical structure, texture, orchestration, etc. The music attribute information is divided into The first music attribute information and the second music attribute information. The first music attribute information includes: music type, structure, music length, key, time signature, and the second music attribute information includes: mode, harmony direction, speed, and Rhythm information. The overall structure of the music can be determined according to the first music attribute information, and the emotion expressed by the music can be determined according to the second music attribute information.

The server or the client receives the first confirmation instruction, the first confirmation instruction is the user's selection confirmation instruction of the first music attribute information and emotion tag, the first confirmation instruction may be input through the human-computer interaction interface and sent through the client The selection confirmation instruction for the first music attribute information and emotion tag of the generated music, the first confirmation instruction can be regarded as the user's selection of the first music attribute information and emotion tag, that is, both the first music attribute information and emotion tag can be Selected for users according to their own preferences.

The pre-built association relationship between the emotion tag and the music attribute is called, and the second music attribute information of the music piece is determined according to the determined emotion tag and the association relationship.

Pre-construct the association relationship between emotion tags and music attributes. The association relationship between emotion tags and music attributes can be reflected by matching emotions with rhythmic information or/and speed. The granularity of each music section will affect The emotional expression of the melody curve. At the same time, the same note granularity may have different emotional expressions at different speeds. Therefore, this solution associates the second music attribute information with the emotional tags when constructing the music database, for example, the speed is relatively high. Fast music often expresses cheerful emotions, while slow music often expresses melancholy and sad emotions.

The first music attribute information and the second music attribute information are used as filter conditions to traverse the pre-built music database to obtain candidate music pieces matching the filter conditions in the music database. The music database stores a plurality of pre-divided music attributes. Music fragments.

Prior to this, first construct a music database, the process is as follows: obtain a large number of music data samples, these music data samples can be selected according to the user's preferences, so that the final generated music is more in line with the user's preferences, the music data samples are divided into music subsections It is a plurality of music fragments, each music fragment includes a plurality of continuous music bars, and each music bar has the position and number of the music in which it is located. The music fragments are classified and stored according to the above-mentioned music attributes, and a music database is constructed. The music database is also called a data set dictionary. That is, the music database is a dictionary storing music attribute information and music attribute parameters, and the dictionary can be stored in a hierarchical structure of rhythm-chords.

Since a large number of music pieces are stored in the music database, and the identification information of these music pieces includes first music attribute information and second music attribute information, the first music attribute information and the second music attribute information selected by the user are used. The music database is traversed as a filter condition, and music pieces that meet the filter condition are selected, that is, music pieces that match the first music attribute information and the second music attribute information selected by the user are filtered out, and the multiple music pieces selected are candidate music pieces.

Display multiple candidate music fragments on the human-computer interaction interface, receive the user's selection confirmation instruction of the music fragment, receive and analyze the second confirmation instruction, and select the music fragment from the multiple candidate music fragments according to the analysis information of the second confirmation instruction .

A plurality of music fragments are determined according to the second confirmation instruction, and the music fragments are spliced in the order of the music bars included in the music fragment to obtain a new music piece.

In the music generation method provided by the present application, the first confirmation instruction and the second confirmation instruction are used as filter conditions to determine the music fragments of the new music, because the first confirmation instruction and the second confirmation instruction are the user’s response to the first music attribute information and the emotion tag, respectively. , The selection confirmation instructions of candidate music fragments can all reflect the user's preferences, therefore, the final new music composition conforms to the user's preferences. Moreover, the music fragments that make up the new music are extracted from the music database, and the music fragments stored in the music database are all existing music data. Therefore, the new music generated has continuity; further, it is not compatible with the use of deep learning. Compared with the box-like generation method, the path of generating new music in this scheme can be traced back, and the new music generated is controllable and reproducible.

In order to be more clear about the log information storage solution provided by this application and its technical effects, the specific implementation solution will be described in detail with a number of embodiments in the following.

In an embodiment, in step S120, the step of determining the second music attribute information of the music based on the pre-set association relationship between the emotion tag and the music attribute and the emotion tag may be implemented as shown in FIG. 2 , Including the following sub-steps.

S210: Quantify the emotion label through a pre-built emotion model.

S220: Determine candidate second music attribute information of the music piece according to the quantized emotion tag.

S230. Receive and analyze a third confirmation instruction, and confirm the second music attribute information of the music piece from the candidate second music attribute information according to the analysis information of the third confirmation instruction, wherein the third confirmation instruction is for the first 2. The selection confirmation command of the music attribute.

Analyze the above-mentioned first confirmation instruction, determine the emotion label selected by the user, and quantify the emotion label through a pre-built emotion model. The emotion model may be a valence-Arousal dimensional emotion model. A schematic diagram of the valence-Arousal dimensional emotion model is shown in Figure 3 As shown, the emotion model has two dimensions. One dimension is used to express the positive and negative of emotions, such as happiness, anger, etc., and the other dimension represents the positive degree of emotions. For example, different levels of emotions corresponding to happiness can include: Satisfaction, surprise, etc. indicate the degree of happiness. The positive and negative of the emotion and its positive degree are collectively referred to as the user’s emotional label. The relationship between the emotional label and the second attribute of music is preset. For example, the expression of happy emotion can correspond to the following second Music attribute information: tune up, bright rhythm, etc.

Based on the emotion label determined by the user in the valence-Arousal dimensional emotion model, the candidate second music attribute information is determined by combining the relationship between the emotion label and the second attribute.

Specifically, the user can select any point in the coordinate axis corresponding to the emotion model, and determine the emotion label corresponding to the coordinate of the point according to the statistical analysis of the data, and then according to the association between the preset emotion label and the second music attribute information The relationship determines candidate second music attribute information, including mode, harmony direction, speed, and rhythm information.

Specifically, first define the second music attribute corresponding to the dimensional information of the emotion model, assuming that the model has four boundary points, and then set the second music attribute to have a linear relationship with the distance from any point in the emotion model to each boundary point, namely The corresponding relationship between the distance from any point to each boundary point in the emotion model and the music attribute is defined in advance. After calculating the distance between any point selected by the user in the model space and each boundary point, the second music attribute information corresponding to the point is obtained according to the corresponding distance.

If there are multiple candidate second music attribute information, then receive and analyze the third confirmation instruction for the second music attribute information, and determine the music composition from the candidate second music attribute information according to the analysis information of the third confirmation instruction Second music attribute information.

The solution provided by this embodiment uses an emotion model to quantify the emotion label, and determines the second music attribute information of the music according to the quantized emotion label, so as to achieve the purpose of determining the second music attribute information of the music according to the emotion label. When there are multiple candidate second music attribute information determined according to the emotion tag, it includes the following situations: there are multiple candidate second music attribute information that match the emotion tag, and the multiple candidate second music attribute information needs to be filtered or filtered , Receiving and analyzing the selection confirmation instruction of the second music attribute information sent by the user terminal, that is, the third confirmation instruction. According to the analysis information of the third confirmation instruction, the second music attribute information is selected from the multiple candidate second music attribute information. For the music attribute information, since the third confirmation instruction is based on the user's selection, the filtered second music attribute information meets the user's preference, and realizes the human-computer interaction in the music generation process, and improves the user experience.

In a feasible implementation manner, the step of traversing a pre-built music database using the first music attribute information and the second music attribute information as filtering conditions in step S130 includes: A1, obtaining orchestration information of the music; A2 , Traverse a pre-built music database according to the first music attribute information, the second music attribute information, and the orchestrator information.

Among them, the orchestration information includes the allocation of musical instruments to the parts, the first music attribute information, the second music attribute information, and the orchestration information cover more music attribute information of a piece of music, based on the first music attribute information and the second music attribute information. The information and orchestration information traverses the music data dictionary to filter out a number of candidate music pieces that match the filtering conditions.

Adding orchestration information to the screening conditions, and traversing the music database by integrating the first music attribute information, the second music attribute information and the orchestration information, can reduce the number of candidate music pieces that are collected and filtered from the music database, and help simplify the process of determining music pieces , And the finally obtained music fragments are more in line with user needs.

In a feasible implementation manner, after the step of obtaining candidate music pieces that match the filtering conditions in step S130, the method further includes: B1, determining the first music attribute information and the second music attribute information according to a preset rule Matching priority of each music attribute; B2, sort the candidate music pieces according to the matching priority of each music attribute; B3, sort the candidate music pieces according to the sorted.

According to preset rules, the matching priority of each music attribute is divided, and the matching priority of the music attribute can be in order from high to low: musical structure, chord, rhythm pattern, orchestration. If there are currently several candidate music pieces for the user to choose from, you can first sort the candidate music pieces according to the music structure. In the case of the same music structure, sort the music pieces according to the chord. If the music structure and chord are the same In this case, the music clips are sorted according to the rhythm pattern, and so on. These music attributes are sorted according to their influence on the overall structure of the music. According to this method, the candidate music pieces are sorted, which is beneficial for the final generated music to be more in line with user needs.

After sorting the candidate music fragments, a music fragment is selected from the sorted candidate music fragments according to the received second confirmation instruction, and the music fragments are spliced in the order of the corresponding music bars to obtain a new music piece.

In a feasible implementation manner, according to the above method, a sorted candidate music segment is established for each position of the music, and the step of splicing the music segments in the order of the corresponding music subsections can be implemented in the following manner: The candidate music pieces with the highest ranking corresponding to the positions are spliced according to their position identifiers.

Specifically, each candidate music piece is provided with identification information, and the identification information includes the track to which the music piece belongs (e.g., represented by a track number) and location (e.g., represented by the position number of the music section in the track to which it belongs) ).

If the candidate music fragments corresponding to multiple positions all contain the same track, the sorted candidate music fragments are created for each position of the music according to the above method, and the steps of splicing the music fragments according to the sequence of the corresponding music sections include: Preferably, candidate music pieces from the same song are spliced to ensure the fluency of the new music.

In a feasible implementation manner, after the step of obtaining the new music in step S140, the method further includes: S150, obtaining the melody curve of the new music, and adjusting the melody curve according to the curve characteristics of the melody curve to obtain Optimize the music.

Furthermore, in the music composition generation solution provided by the present application, after a new music composition is generated, the melody curve is adjusted according to the curve characteristics, so that the melody of the adjusted optimized music composition is more unique.

Among them, the melody curve here may include note information, pitch information, etc. The characteristics of the curve, such as: the melody curve has a position where the melody curve has continuous upward (or downward) more than three units, the melody curve has a small-amplitude convolution, and the adjacent unit The slope of the curve is greater than the preset threshold, the pitch of the notes that exceed three consecutive units is the same, and so on.

Specifically, the step of adjusting the melody curve according to the curve characteristics includes: when it is detected that the melody curve has a small-amplitude gyration, adjusting the middle tone in the melody curve in the opposite direction to the inner tone of the tuning.

The melody curve has a small convolution, as shown in Figure 4, if the two consecutive tones are adjacent in the selected key, the middle tone is modified in the opposite direction of the tone, such as: the original upward convolution, then it is modified to the downward convolution , And vice versa, if the selected pitch composition is [do,re,do], modify the pitch composition to [do,si,do], the adjustment process is shown in the solid line box in Figure 4, where , The value on the vertical axis is the MIDI value of pitch, and the note corresponding to a MIDI value of 60 is C4.

According to the solution provided by this embodiment, the melody curve of the new music is adjusted to generate an optimized music, and the melody of the generated optimized music is more unique, coherent and beautiful.

In a feasible implementation manner, when it is detected that the melody curve has a continuous upward movement of more than three units, any position on the melody curve is randomly selected to modify the tune down; when it is detected that the melody curve has a continuous downward movement For positions exceeding three units, randomly select any position on the melody curve to modify the tune up.

In a feasible implementation manner, when it is detected that the curve slopes of adjacent units on the melody curve are greater than the preset threshold, the in-tune tones are inserted into the melody curve.

In a feasible implementation, when it is detected that more than three consecutive units of notes have the same pitch, the penultimate unit of the melody curve is moved adjacently, such as: random tuning Move up or down.

By adjusting the melody curve of the new music through the above-mentioned implementation manner, a more unique optimized music can be obtained.

On this basis, the duration of the notes can also be modified accordingly to make the adjusted optimized music more coordinated.

Further, after obtaining a new piece of music, it further includes: C1, obtaining user feedback information on the new piece of music, and adjusting at least one of the first music attribute information, the second music attribute information, and the candidate music segment based on the feedback information C2, based on at least one of the adjusted first music attribute information, adjusted second music attribute information, and adjusted candidate music segments, determine the adjusted music segment, and obtain the adjusted music segment based on the adjusted music segment Optimized music.

Obtain the user's feedback information on the new music, such as: the new music has a shorter duration and faster speed, etc., according to the feedback information, correspondingly adjust the duration and speed of the new music, the first music attribute information and the second music attribute information of the new music After any one of the factors in is changed, the obtained candidate music pieces will all change, and new music pieces generated based on different candidate music pieces will also be changed accordingly to form an optimized piece of music that is more in line with user needs.

The selection of candidate music fragments can also be adjusted according to the feedback information, that is, the selection criteria of the music fragments are adjusted, the final music fragments are adjusted based on the modified selection criteria, and the optimized music pieces more in line with the user's preferences are obtained based on the adjusted music fragments. The adjustment of the candidate music piece may be performed on the basis of adjusting the first music attribute information or/and the second music attribute information, or the candidate music piece may be adjusted separately without adjusting the first music attribute information and the second music attribute information.

The solution provided by this embodiment is based on the feedback information provided by the user, using a negative feedback mechanism to adjust the music attributes, music fragments, etc. of the new music based on the feedback information provided by the user, and generate optimized music based on the adjusted music attributes or/and music fragments. Make the adjusted music more in line with user preferences.

Correspondingly, an embodiment of the present application also provides a musical composition generating device 500. The schematic structural diagram of the musical composition generating device 500 is shown in FIG. 5. The musical composition generating device 500 includes a module 510 for obtaining first music attribute information, and a module for determining second music attribute information. 520. The candidate music segment obtaining module 530 and the new music obtaining module 540 are specifically as follows.

The first music attribute information obtaining module 510 is configured to receive a first confirmation instruction, and obtain the first music attribute information and mood tag of the music according to the first confirmation instruction; the first confirmation instruction is for the first music attribute information and Confirm the selection of emotion tags.

The determining second music attribute information module 520 is configured to determine the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag.

The candidate music segment obtaining module 530 is configured to use the first music attribute information and the second music attribute information as filtering conditions to traverse a pre-built music database to obtain candidate music pieces that match the filtering conditions; wherein, the The music database stores a plurality of music pieces pre-divided according to music attributes.

The obtaining new music module 540 is configured to select music fragments from the candidate music fragments according to the received second confirmation instruction, and splice the music fragments in the order of the corresponding music bars to obtain a new music; wherein, the The second confirmation command is a selection confirmation command for the music segment.

Regarding the music generating device in the foregoing embodiment, the specific manner of performing operations of each module therein has been described in detail in the embodiment of the method, and detailed description will not be given here.

The music generation method provided in the foregoing embodiment can be applied to an electronic device. Refer to Figure 6 for a schematic diagram of the structure.

In this embodiment, the electronic device 600 may be a terminal device with arithmetic function, such as a smart phone, a tablet computer, a portable computer, a desktop computer, and the like.

The electronic device includes a memory and a processor. The processor here may be referred to as the processing device 601 below, and the memory may include a read-only memory (ROM) 602, a random access memory (RAM) 603, and a storage device 608 below. At least one item of is as follows.

As shown in FIG. 6, the electronic device 600 may include a processing device (such as a central processing unit, a graphics processor, etc.) 601, which may be loaded into a random access device according to a program stored in a read-only memory (ROM) 602 or from a storage device 608. The program in the memory (RAM) 603 executes various appropriate actions and processing. In the RAM 603, various programs and data required for the operation of the electronic device 600 are also stored. The processing device 601, the ROM 602, and the RAM 603 are connected to each other through a bus 604. An input/output (I/O) interface 605 is also connected to the bus 604.

Generally, the following devices can be connected to the I/O interface 605: including input devices 606 such as touch screen, touch pad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, liquid crystal display (LCD), speakers, vibration An output device 607 such as a device; a storage device 608 such as a magnetic tape, a hard disk, etc.; and a communication device 609. The communication device 609 may allow the electronic device 600 to perform wireless or wired communication with other devices to exchange data. Although FIG. 3 shows an electronic device 600 having various devices, it should be understood that it is not required to implement or have all of the illustrated devices. It may be implemented alternatively or provided with more or fewer devices.

In particular, according to an embodiment of the present disclosure, the process described above with reference to the flowchart can be implemented as a computer software program. For example, an embodiment of the present disclosure includes a computer program product, which includes a computer program carried on a non-transitory computer readable medium, and the computer program contains program code for executing the method shown in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network through the communication device 609, or installed from the storage device 608, or installed from the ROM 602. When the computer program is executed by the processing device 601, the above-mentioned functions defined in the method of the embodiment of the present disclosure are executed.

It should be noted that the above-mentioned computer-readable medium in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the two. The computer-readable storage medium may be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or a combination of any of the above. More specific examples of computer-readable storage media may include, but are not limited to: electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. In the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in combination with an instruction execution system, apparatus, or device. In the present disclosure, a computer-readable signal medium may include a data signal propagated in a baseband or as a part of a carrier wave, and a computer-readable program code is carried therein. This propagated data signal can take many forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. The computer-readable signal medium may also be any computer-readable medium other than the computer-readable storage medium. The computer-readable signal medium may send, propagate, or transmit the program for use by or in combination with the instruction execution system, apparatus, or device . The program code contained on the computer-readable medium can be transmitted by any suitable medium, including but not limited to: wire, optical cable, RF (radio frequency), etc., or any suitable combination of the above.

In some embodiments, the client and server can use HTTP (HyperText Any currently known or future developed network protocol such as Hypertext Transfer Protocol for communication, and can be interconnected with any form or medium of digital data communication (for example, a communication network). Examples of communication networks include local area networks ("LAN"), wide area networks ("WAN"), the Internet (for example, the Internet), and end-to-end networks (for example, ad hoc end-to-end networks), as well as any currently known or future research and development network of.

The above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or it may exist alone without being assembled into the electronic device.

The above-mentioned computer-readable medium carries one or more programs. When the above-mentioned one or more programs are executed by the electronic device, the electronic device is caused to perform the following operations: receiving a first confirmation instruction, and obtaining music according to the first confirmation instruction The first music attribute information and the emotion tag; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and the emotion tag; based on the preset association relationship between the emotion tag and the music attribute and the emotion tag Determine the second music attribute information of the music; use the first music attribute information and the second music attribute information as filter conditions to traverse a pre-built music database to obtain candidate music pieces that match the filter conditions; wherein, the The music database stores a plurality of music fragments pre-divided according to music attributes; according to the received second confirmation instruction, a music fragment is selected from the candidate music fragments, and the music fragments are spliced in the order of the corresponding music sections, Obtain a new music piece; wherein, the second confirmation instruction is a selection confirmation instruction for a music piece.

In addition, the embodiment of the present application also proposes a computer-readable storage medium. The computer-readable medium may be a tangible medium, which may contain or store for use by the instruction execution system, apparatus, or equipment, or be used in conjunction with the instruction execution system, apparatus, or equipment. The program used in combination. The computer-readable medium may be a machine-readable signal medium or a machine-readable storage medium. The computer-readable medium may include, but is not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, device, or device, or any suitable combination of the foregoing. More specific examples of computer-readable storage media would include electrical connections based on one or more wires, portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read-only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the above. The computer-readable storage medium includes a music generation program, and when the music generation program is executed by a processor, the steps of the music generation method according to any one of the above technical solutions are implemented.

Optionally, the storage medium involved in this application, such as a computer-readable storage medium, may be non-volatile or volatile.

The specific implementation of the computer-readable storage medium of the present application is substantially the same as the specific implementation of the aforementioned music generation method and electronic device, and will not be repeated here.

The above are only the preferred embodiments of the application, and do not limit the scope of the patent for this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of the application, or directly or indirectly applied to other related technical fields , The same reason is included in the scope of patent protection of this application.

Claims

A method for generating music, which includes:

Receiving a first confirmation instruction, and obtaining first music attribute information and emotion tags of the music piece according to the first confirmation instruction; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tags;

Determining the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag;

The first music attribute information and the second music attribute information are used as filtering conditions to traverse a pre-built music database to obtain candidate music pieces that match the filtering conditions; wherein, the music database stores the music data in advance according to the music attributes. Multiple music fragments divided;

According to the received second confirmation instruction, a music fragment is selected from the candidate music fragments, and the music fragments are spliced in the order of the corresponding music sections to obtain a new music piece; wherein, the second confirmation instruction is for the music fragment To confirm the command.
The method of generating a music composition according to claim 1, wherein after the step of obtaining a new music composition, the method further comprises:

The melody curve of the new music composition is acquired, and the melody curve is adjusted according to the curve characteristics of the melody curve to obtain an optimized music composition.
The music generating method according to claim 2, wherein the step of adjusting the melody curve according to the curve characteristics comprises:

It is detected that there is a small-amplitude gyration in the melody curve, and the middle tone in the melody curve is adjusted in the opposite direction to the inner tone.
The music composition generation method according to claim 1, wherein the step of determining the second music attribute information of the music based on the pre-set association relationship between the emotion tag and the music attribute and the emotion tag comprises:

Quantifying the emotion label through a pre-built emotion model;

Determining candidate second music attribute information of the music piece according to the quantized emotion tag;

A third confirmation instruction is received and analyzed, and the second music attribute information of the music piece is determined from the candidate second music attribute information according to the analysis information of the third confirmation instruction, wherein the third confirmation instruction is for the second music Confirm the selection of attributes.
The music generation method according to claim 1, wherein the step of using the first music attribute information and the second music attribute information as filtering conditions to traverse a pre-built music database comprises:

Acquiring orchestration information of the music;

Traverse a pre-built music database according to the first music attribute information, the second music attribute information, and the orchestrator information.
2. The music generation method according to claim 1, wherein after the step of obtaining candidate music fragments matching the screening conditions, the method further comprises:

Determining the matching priority of each music attribute in the first music attribute information and the second music attribute information according to a preset rule;

Sorting the candidate music fragments according to the matching priority of each music attribute;

Obtain the sorted candidate music fragments;

The selecting a music fragment from the candidate music fragments according to the received second confirmation instruction, and splicing the music fragments in the order of corresponding music bars to obtain a new music composition, including:

According to the received second confirmation instruction, music fragments are selected from the sorted candidate music fragments, and the music fragments are spliced in the order of corresponding music bars to obtain a new music piece.
The method of generating a music composition according to claim 1, wherein after the step of obtaining a new music composition, the method further comprises:

Acquiring user feedback information for the new music, and adjusting at least one of first music attribute information, second music attribute information, and candidate music fragments based on the feedback information;

Based on at least one of the adjusted first music attribute information, the adjusted second music attribute information, and the adjusted candidate music segment, the adjusted music segment is determined, and the adjusted optimized music piece is obtained based on the adjusted music segment.
A music generating device, which includes:

The module for obtaining first music attribute information is configured to receive a first confirmation instruction, and obtain the first music attribute information and mood tag of the music according to the first confirmation instruction; the first confirmation instruction is for the first music attribute information and mood The selection confirmation command of the label;

Determining the second music attribute information module, configured to determine the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag;

The candidate music piece obtaining module is used to traverse a pre-built music database using the first music attribute information and the second music attribute information as filtering conditions to obtain candidate music pieces matching the filtering conditions; wherein, the music A plurality of music fragments pre-divided according to music attributes are stored in the database;

The acquiring new music module is used to select music fragments from the candidate music fragments according to the received second confirmation instruction, and to splice the music fragments in the order of the corresponding music bars to obtain a new music; wherein, the first The second confirmation command is the selection confirmation command for the music piece.
An electronic device includes a memory, a processor, and a computer program stored on the memory and capable of running on the processor, wherein the processor implements the following method when the program is executed:

Receiving a first confirmation instruction, and obtaining first music attribute information and emotion tags of the music piece according to the first confirmation instruction; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tags;

Determining the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag;

The first music attribute information and the second music attribute information are used as filtering conditions to traverse a pre-built music database to obtain candidate music pieces that match the filtering conditions; wherein, the music database stores the music data in advance according to the music attributes. Multiple music fragments divided;

According to the received second confirmation instruction, a music fragment is selected from the candidate music fragments, and the music fragments are spliced in the order of the corresponding music sections to obtain a new music piece; wherein, the second confirmation instruction is for the music fragment To confirm the command.
9. The electronic device according to claim 9, wherein after the step of obtaining a new music, the processor is further configured to implement:

The melody curve of the new music composition is acquired, and the melody curve is adjusted according to the curve characteristics of the melody curve to obtain an optimized music composition.
11. The electronic device according to claim 10, wherein the step of adjusting the melody curve according to the curve characteristics specifically implements:

It is detected that there is a small-amplitude gyration in the melody curve, and the middle tone in the melody curve is adjusted in the opposite direction to the inner tone.
9. The electronic device according to claim 9, wherein the step of determining the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag is specifically implemented:

Quantifying the emotion label through a pre-built emotion model;

Determining candidate second music attribute information of the music piece according to the quantized emotion tag;

A third confirmation instruction is received and analyzed, and the second music attribute information of the music piece is determined from the candidate second music attribute information according to the analysis information of the third confirmation instruction, wherein the third confirmation instruction is for the second music Confirm the selection of attributes.
9. The electronic device according to claim 9, wherein after the step of obtaining candidate music pieces that match the screening conditions, the processor is further configured to implement:

Determining the matching priority of each music attribute in the first music attribute information and the second music attribute information according to a preset rule;

Sorting the candidate music fragments according to the matching priority of each music attribute;

Obtain the sorted candidate music fragments;

When the music fragment is selected from the candidate music fragments according to the received second confirmation instruction, and the music fragments are spliced in the order of the corresponding music bars to obtain a new music piece, the specific implementation is as follows:

According to the received second confirmation instruction, music fragments are selected from the sorted candidate music fragments, and the music fragments are spliced in the order of corresponding music bars to obtain a new music piece.
9. The electronic device according to claim 9, wherein after the step of obtaining a new music, the processor is further configured to implement:

Acquiring user feedback information for the new music, and adjusting at least one of first music attribute information, second music attribute information, and candidate music fragments based on the feedback information;

Based on at least one of the adjusted first music attribute information, the adjusted second music attribute information, and the adjusted candidate music segment, the adjusted music segment is determined, and the adjusted optimized music piece is obtained based on the adjusted music segment.
A computer-readable storage medium, wherein the computer-readable storage medium includes a music generation program, and when the music generation program is executed by a processor, the following method is implemented:

Receiving a first confirmation instruction, and obtaining first music attribute information and emotion tags of the music piece according to the first confirmation instruction; the first confirmation instruction is a selection confirmation instruction for the first music attribute information and emotion tags;

Determining the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag;

The first music attribute information and the second music attribute information are used as filtering conditions to traverse a pre-built music database to obtain candidate music pieces that match the filtering conditions; wherein, the music database stores the music data in advance according to the music attributes. Multiple music fragments divided;

According to the received second confirmation instruction, a music fragment is selected from the candidate music fragments, and the music fragments are spliced in the order of the corresponding music sections to obtain a new music piece; wherein, the second confirmation instruction is for the music fragment To confirm the command.
15. The computer-readable storage medium according to claim 15, wherein after the step of obtaining a new music composition, when the music composition generation program is executed by the processor, it is further used to realize:

The melody curve of the new music composition is acquired, and the melody curve is adjusted according to the curve characteristics of the melody curve to obtain an optimized music composition.
The computer-readable storage medium according to claim 16, wherein the step of adjusting the melody curve according to the curve characteristics specifically implements:

It is detected that there is a small-amplitude gyration in the melody curve, and the middle tone in the melody curve is adjusted in the opposite direction to the inner tone.
The computer-readable storage medium according to claim 15, wherein the step of determining the second music attribute information of the music based on the preset association relationship between the emotion tag and the music attribute and the emotion tag is specifically implemented :

Quantifying the emotion label through a pre-built emotion model;

Determining candidate second music attribute information of the music piece according to the quantized emotion tag;

A third confirmation instruction is received and analyzed, and the second music attribute information of the music piece is determined from the candidate second music attribute information according to the analysis information of the third confirmation instruction, wherein the third confirmation instruction is for the second music Confirm the selection of attributes.
15. The computer-readable storage medium according to claim 15, wherein after the step of obtaining candidate music pieces matching the filtering conditions, when the music composition generation program is executed by the processor, it is further used to realize:

Determining the matching priority of each music attribute in the first music attribute information and the second music attribute information according to a preset rule;

Sorting the candidate music fragments according to the matching priority of each music attribute;

Obtain the sorted candidate music fragments;

When the music fragment is selected from the candidate music fragments according to the received second confirmation instruction, and the music fragments are spliced in the order of the corresponding music bars to obtain a new music piece, the specific implementation is as follows:

According to the received second confirmation instruction, music fragments are selected from the sorted candidate music fragments, and the music fragments are spliced in the order of corresponding music bars to obtain a new music piece.
15. The computer-readable storage medium according to claim 15, wherein after the step of obtaining a new music composition, when the music composition generation program is executed by the processor, it is further used to realize:

Acquiring user feedback information for the new music, and adjusting at least one of first music attribute information, second music attribute information, and candidate music fragments based on the feedback information;

Based on at least one of the adjusted first music attribute information, the adjusted second music attribute information, and the adjusted candidate music segment, the adjusted music segment is determined, and the adjusted optimized music piece is obtained based on the adjusted music segment.