CN109063069A - Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing - Google Patents

Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing Download PDF

Info

Publication number
CN109063069A
CN109063069A CN201810812529.1A CN201810812529A CN109063069A CN 109063069 A CN109063069 A CN 109063069A CN 201810812529 A CN201810812529 A CN 201810812529A CN 109063069 A CN109063069 A CN 109063069A
Authority
CN
China
Prior art keywords
target
song
information
sample
lyrics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810812529.1A
Other languages
Chinese (zh)
Inventor
骆延楠
吴三阳
陈国言
潘志锋
曾荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
iMusic Culture and Technology Co Ltd
Original Assignee
iMusic Culture and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by iMusic Culture and Technology Co Ltd filed Critical iMusic Culture and Technology Co Ltd
Priority to CN201810812529.1A priority Critical patent/CN109063069A/en
Publication of CN109063069A publication Critical patent/CN109063069A/en
Pending legal-status Critical Current

Links

Landscapes

  • Electrophonic Musical Instruments (AREA)

Abstract

The present invention relates to a kind of song labels to determine method, which comprises the validity feature word information in the target lyrics is obtained according to the target lyrics of target song and default song tag database;The target cadence information of the target song is obtained according to the temporal information of the target song and the target lyrics;In conjunction with the validity feature word information, the target cadence information and the default song tag database, the target labels of the target song are obtained.The present invention program song label determines method and apparatus, can carry out label for labelling to song with fast automatic, effectively reduce cost of labor required for song label for labelling and time cost, in addition, Benchmark is unified, label body system unified standard.

Description

Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing
Technical field
The present invention relates to field of computer technology, determine that method, apparatus, computer are set more particularly to a kind of song label Standby and readable storage medium storing program for executing.
Background technique
In order to classify to song, existing audio player would generally mark label to the song in library.
There are two types of types for general song label, are that song mark is corresponding one is the personnel by possessing musical expertise Label, it is independently song mark label that another kind, which is by user,.
When marking label using expert, inefficiency and labor intensive;And when independently marking label using user, then can Irregular due to user cognition field causes label system chaotic.
Summary of the invention
The purpose of the present invention is to provide a kind of song labels to determine method, apparatus computer equipment and readable storage medium Matter can carry out label for labelling to song with fast automatic, effectively reduce cost of labor required for song label for labelling and time Cost, in addition, Benchmark is unified, label body system unified standard.
The purpose of the present invention is achieved through the following technical solutions:
A kind of song label determines method, which comprises
Effective spy in the target lyrics is obtained according to the target lyrics of target song and default song tag database Levy word information;
The target rhythm letter of the target song is obtained according to the temporal information of the target song and the target lyrics Breath;
In conjunction with the validity feature word information, the target cadence information and the default song tag database, obtain Take the target labels of the target song.
The default song tag database stores the sample song of multiple sample songs in one of the embodiments, Word, sample cadence information, the corresponding standard label of each sample song, the corresponding sample song of each standard label sample The sample characteristics word information of eigen word information and sample cadence information and the corresponding all sample songs of same standard label Intersection and sample cadence information intersection.
It is described in one of the embodiments, to be obtained according to the target lyrics of target song and default song tag database Multiple validity feature words in the target lyrics include:
The target lyrics of the target song are split into multiple group of words, the target extracted in the multiple group of words is real Word information;
In conjunction with the sample characteristics word information of the target notional word information and the corresponding sample song of each standard label The validity feature word information is obtained with sample cadence information.
Target notional word information described in the combination and the corresponding sample of each standard label in one of the embodiments, The sample characteristics word information and sample cadence information of this song obtain the validity feature word information
In conjunction with the sample characteristics word information of the target notional word information and the corresponding sample song of each standard label The information gain of each target notional word in the target notional word information is obtained with sample cadence information;
All target notional words in the target notional word information are sorted according to corresponding informance gain size, information is chosen and increases The target notional word of the maximum predetermined number of benefit is as the validity feature word information.
Validity feature word information described in the combination, the target cadence information and institute in one of the embodiments, Default song tag database is stated, the target labels for obtaining the target song include:
In conjunction with the validity feature word information, the target cadence information and the corresponding all samples of same standard label The sample characteristics word information intersection and sample cadence information intersection of song calculate separately the target using default disaggregated model and sing Song belongs to the probability of each standard label;
The corresponding standard label of the target song maximum probability is chosen as the target labels.
The target cadence information includes mean tempo rate information and rhythm pause duration in one of the embodiments, Information.
It is described in one of the embodiments, that institute is obtained according to the temporal information of the target song and the target lyrics The target cadence information for stating target song includes:
The described average of the target lyrics is obtained in conjunction with the time that the length and the target lyrics of the target lyrics occupy Rhythm rate information;
The rhythm pause duration information is obtained in conjunction with lyrics pause duration in the target lyrics.
A kind of song label determining device, described device include:
Feature Words obtain module, for according to the target lyrics of target song and the acquisition of default song tag database Validity feature word information in the target lyrics;
Rhythm obtains module, for obtaining the target according to the temporal information of the target song and the target lyrics The target cadence information of song;
Label determining module, in conjunction with the validity feature word information, the target cadence information and described default Song tag database obtains the target labels of the target song.
A kind of computer equipment, including memory and processor, the memory are stored with computer program, the processing Device realizes following steps when executing the computer program:
Effective spy in the target lyrics is obtained according to the target lyrics of target song and default song tag database Levy word information;
The target rhythm letter of the target song is obtained according to the temporal information of the target song and the target lyrics Breath;
In conjunction with the validity feature word information, the target cadence information and the default song tag database, obtain Take the target labels of the target song.
A kind of computer readable storage medium, is stored thereon with computer program, and the computer program is held by processor Following steps are realized when row:
Effective spy in the target lyrics is obtained according to the target lyrics of target song and default song tag database Levy word information;
The target rhythm letter of the target song is obtained according to the temporal information of the target song and the target lyrics Breath;
In conjunction with the validity feature word information, the target cadence information and the default song tag database, obtain Take the target labels of the target song.
According to the scheme of aforementioned present invention, obtained according to the target lyrics of target song and default song tag database Validity feature word information in the target lyrics can establish unified label and obtain in conjunction with default song tag database Standard, label body system unified standard;The target song is obtained according to the temporal information of the target song and the target lyrics Bent target cadence information;In conjunction with the validity feature word information, the target cadence information and the default song label Database obtains the target labels of the target song, can carry out label for labelling to song with fast automatic, effectively reduce song Cost of labor and time cost required for label for labelling.
Detailed description of the invention
Fig. 1 is the applied environment figure that song label determines method in one embodiment;
Fig. 2 is the flow diagram that song label determines method in one embodiment;
Fig. 3 is the flow diagram that song label determines method in one embodiment;
Fig. 4 is the flow diagram that song label determines method in another embodiment;
Fig. 5 is the structural block diagram of song label determining device in one embodiment;
Fig. 6 is the structural block diagram of song label determining device in one embodiment;
Fig. 7 is the structural block diagram of song label determining device in another embodiment;
Fig. 8 is the internal structure chart of computer equipment in one embodiment;
Fig. 9 is the internal structure chart of computer equipment in another embodiment.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention more comprehensible, with reference to the accompanying drawings and embodiments, to this Invention is described in further detail.It should be appreciated that the specific embodiments described herein are only used to explain the present invention, And the scope of protection of the present invention is not limited.
Song label provided by the present application determines method, can be applied in application environment as shown in Figure 1.Wherein, it takes Business device obtains the validity feature word in the target lyrics according to the target lyrics and default song tag database of target song Information;Server obtains the target rhythm of the target song according to the temporal information and the target lyrics of the target song Information;Server in conjunction with the validity feature word information, the target cadence information and the default song tag database, Obtain the target labels of the target song.Wherein, server can be formed with the either multiple servers of independent server Server cluster realize.It will be understood by those skilled in the art that song label provided by the present application determines method, not only may be used To be applied in application environment shown in FIG. 1, it can also apply but be not limited in various computers or server.
In one embodiment, as shown in Fig. 2, providing a kind of song label determines method, it is applied to Fig. 1 in this way In server for be illustrated, comprising the following steps:
Step S102 is obtained in the target lyrics according to the target lyrics of target song and default song tag database Validity feature word information;
Specifically, by being labeled with multiple sample songs of standard tag information, default song tag database is established, so Default song tag database is combined to obtain target labels, the unifying datum in label determination process afterwards, label body system one is advised Model.
Step S104 obtains the mesh of the target song according to the temporal information of the target song and the target lyrics Mark cadence information;
Specifically, the target cadence information includes mean tempo rate information and rhythm pause duration information.
Step S106, in conjunction with the validity feature word information, the target cadence information and the default song label Database obtains the target labels of the target song.
Above-mentioned song label determines in method, is obtained according to the target lyrics of target song and default song tag database Validity feature word information in the target lyrics can establish unified label and obtain in conjunction with default song tag database Standard, label body system unified standard;The target song is obtained according to the temporal information of the target song and the target lyrics Bent target cadence information;In conjunction with the validity feature word information, the target cadence information and the default song label Database obtains the target labels of the target song, can carry out label for labelling to song with fast automatic, effectively reduce song Cost of labor and time cost required for label for labelling.
The default song tag database stores the sample song of multiple sample songs in one of the embodiments, Word, sample cadence information, the corresponding standard label of each sample song, the corresponding sample song of each standard label sample The sample characteristics word information of eigen word information and sample cadence information and the corresponding all sample songs of same standard label Intersection and sample cadence information intersection.
In one of the embodiments, as shown in figure 3, the target lyrics and default song label according to target song Database obtains multiple validity feature words in the target lyrics
The target lyrics of the target song are split into multiple group of words, extract the multiple group of words by step S1021 In target notional word information;
Specifically, the target lyrics are splitted by group of words one by one using segmenter, filters function word, modal particle, extract the lyrics Interior notional word part, i.e. target notional word information.
Step S1022, in conjunction with the corresponding all sample songs of the target notional word information and each standard tag information Bent sample characteristics word intersection obtains the validity feature word information.
In one of the embodiments, as shown in figure 4, target notional word information described in the combination and each standard mark The sample characteristics word intersection of the corresponding all sample songs of label information obtains the validity feature word information
Step S10221, in conjunction with the sample of the target notional word information and the corresponding sample song of each standard label Feature word information and sample cadence information obtain the validity feature word information.
Step S10222, by all target notional words in the target notional word information according to the big float of corresponding informance gain Sequence chooses the target notional word of the maximum predetermined number of information gain as the validity feature word information.
Specifically, the feature selecting mode based on Information Gain Method is a kind of common hand for carrying out text feature selection Method, Information Gain Method measures the information content of a system using the concept of entropy, and the information gain of some feature then refers to, Under known conditions, the variation degree of the information content of whole system, attribute gain is an important indicator of feature selecting, it is defined This feature how much information can be carried out for categorizing system synteny, and bring information is more, then illustrate that this feature is more important.
The comentropy of each target notional word in target notional word information is calculated first, and calculation formula is as follows:
In formula: wherein p (yi) chance event Y is represented as yiProbability, H (Y) is average information;Y refers to target notional word Intersection, yiRefer to every i-th of target notional word.
Then the conditional entropy of each target notional word in target notional word information is calculated, the calculation formula of conditional entropy is as follows:
In formula: under conditions of H (Y | X) refers to known variables X, another expectation of variable Y entropy to X;P (y | x) refer to x thing When part occurs, the probability of y event generation;X refers to standard label;Y is target notional word;X is the intersection of standard label;Y is feeling the pulse with the finger-tip Mark the intersection of notional word;That is in the case that H (Y | X) is known standard label, the information content of target notional word.
Then information gain value is calculated according to conditional entropy and comentropy, calculation formula is as follows:
IG (Y, X)=H (Y)-H (Y | X) (3)
In formula: IG (Y, X) representative information yield value, information gain score have measured the information content of the lyrics feature, information The amount the big, represents the importance of the lyrics feature.
Assuming that extracting four first sample songs, wherein the label of two songs is " Chinese feature ", two song labels are " feelings Song ", the target notional word information chosen in target song include " in love ", " shedding tears " and " misty rain " these three words, it is assumed that this three The number that a word occurs in four first songs is as shown in the table:
(A) in love Shed tears (B) Misty rain (C) Label
1 1 0 Love song
0 1 1 Love song
1 0 0 Chinese feature
1 0 1 Chinese feature
The comentropy of the intersection of target notional word is calculated according to upper meter:
The conditional entropy of each Feature Words is calculated according to the formula of conditional entropy:
H (Y | B)==0-0-0-0=0
Then according to information gain formula, the information gain score of each Feature Words is calculated, the information of " shedding tears " is obtained Gain is scored at 1, and the information gain of " in love " is scored at 0.31, and the information gain of " misty rain " is scored at 0.
" shedding tears " (1) > " in love " (0.31) > " misty rain " (0)
Select the target notional word of the maximum predetermined number of information gain as the validity feature word information, such as selection letter The maximum two target notional words of gain are ceased as validity feature word information, then finally obtained validity feature word information just includes " shedding tears " and " in love ".
The target cadence information includes mean tempo rate information and rhythm pause duration in one of the embodiments, Information.
In one of the embodiments, as shown in figure 3, the temporal information and the target according to the target song The target cadence information that the lyrics obtain the target song includes:
Step S1041 obtains the target lyrics in conjunction with the time of length and the target lyrics occupancy of the target lyrics The mean tempo rate information;
Step S1042 obtains the rhythm pause duration information in conjunction with lyrics pause duration in the target lyrics.
It specifically, also include the temporal information of the lyrics, according to LRC due to not only possessing lyrics information in LRC lyrics file The lyrics of format set the rhythm characteristic that following formula extracts song:
Avg(Si)=Duration (Si)/Len(Si) (4)
Avg (S in formulai) represent each word occupied time in current lyrics segment, that is, mean tempo quantity Information, Duration (Si) then represent in the lyrics segment be occupy time, Len (Si) then represent the length of the lyrics.
Specifically, the length of the lyrics refers to the number of words of the lyrics, does not include the space between a lyrics.
By taking the following lyrics as an example:
[04:59.82] you always the too soft heart of the heart is too soft
[05:05.38] all oneself shoulders all problems
[05:09.93] is in love always simply to get along too difficult
[05:15.75] be not inadequate again without being you
[05:20.47] be not inadequate again without being you
[05:25.63] be not inadequate again without being you
[05:30.90] be not inadequate again without being you
[05:35.81]
By taking this section of lyrics " [04:59.82] you always the too soft heart of the heart is too soft " as an example, the time in bracket is this section of lyrics At the beginning of T (Si), until T (S at the beginning of the next section of lyricsi+1) so the lyrics segment occupy time be then Duration(Si)=T (Si+1)-T(Si)=305s-299s=6s in seconds, and Len (Si) it is then 9, then can obtain The rhythm characteristic of the lyrics segment out:
Lyrics rhythm demarcation interval is carried out by section of 100ms, while can also be obtained by above-mentioned lyrics segment, [05: 35.81] there is the performance part of the not lyrics, belong to the transition stage of background or absolute music, the part is as song Rhythm terminates Core feature word of the feature extraction as following categorizing songs models.The flat of every first song is extracted from LRC file Equal rhythm rate and rhythm pause duration.
In one of the embodiments, as shown in figure 3, validity feature word information described in the combination, the target rhythm Information and the default song tag database, the target labels for obtaining the target song include:
Step S1061, it is corresponding in conjunction with the validity feature word information, the target cadence information and same standard label All sample songs sample characteristics word information intersection and sample cadence information intersection, calculated separately using default disaggregated model The target song belongs to the probability of each standard label;
Step S1062 chooses the corresponding standard label of the target song maximum probability as the target labels.
Specifically, disaggregated model is using Naive Bayes Classification Model, in order to verify Naive Bayes Classification Model Accuracy in computation, randomly select the song lyrics data that 5000 do not mark label and carry out test assessment, pass through song respectively Label coverage, can the coverage of song of prediction label account for the accounting and accuracy of overall test song amount, that is, The accuracy of song label for labelling is assessed, its coverage, the accuracy as the result is shown of final test are averagely respectively 80% And 85%, it was demonstrated that by this method carry out song label for labelling coverage and accuracy a reasonable acceptable use water In flat.
For example: assuming that the standard label and cadence information collection that store in default song tag database share vector table Be shown as [" in love ", " 100ms ", " misty rain ", " 300ms "], and only " love song " and " R&B (and Rhythm and Blues, save Play Bruce) " two tag along sorts, two class sample sizes i.e. validity feature word quantity are 8, are parsed with certain head song The vector space model obtained for the feature [" in love ", " misty rain ", " 300ms "] come are as follows: x=[1,0,1,1] recycles Piao Plain Bayesian model calculates classification prediction, it is assumed that the frequency statistics of sample data are as follows:
It is calculated using the model-naive Bayesian for being applied to text classification, while in order to avoid calculating probability When, there is CiUnder do not occur feature Xi, and lead to this unreasonable situation of zero probability occur, it is flat to introduce Laplce Sliding processing, carries out+1 to molecule, adds classification number N to denominator, then target labels are standard label C1Probability calculation formula are as follows:
In formula: i ∈ I, X indicate the intersection of validity feature word information x in target song;xiIndicate i-th of validity feature word, The intersection of I expression Feature Words number;p(xi|C1) indicate that ith feature word appears in standard label C1Corresponding sample characteristics word Probability in information intersection;A (i, 1) is indicated in standard label C1Ith feature word in corresponding sample characteristics word information intersection The number of appearance;B1Expression appears in standard label C1In validity feature word total quantity;The classification number of N expression standard label.
It can be calculated in conjunction with upper table and formula (5):
Equally, then ith feature word is standard label C2Probability calculation formula are as follows:
In formula: p (X | C2) indicate that ith feature word appears in standard label C2In corresponding sample characteristics word information intersection Probability;A (i, 2) is indicated in standard label C2The number that ith feature word occurs in corresponding sample characteristics word information intersection; B2Expression appears in standard label C2In validity feature word total quantity;The classification number of N expression standard label.
It can be calculated in conjunction with upper table and formula (6):
Therefore, the probability that the target labels belong to " love song " is much larger than the probability for belonging to " R&B ", shows the target song Bent target labels are " love song ".
In the remittance of wherein one embodiment, as shown in figure 5, providing a kind of song label determining device, described device packet It includes:
Feature Words obtain module 102, for according to the target lyrics of target song and the acquisition of default song tag database Validity feature word information in the target lyrics;
Rhythm obtains module 104, for according to the temporal information of the target song and target lyrics acquisition The target cadence information of target song;
Label determining module 106, in conjunction with the validity feature word information, the target cadence information and described pre- If song tag database obtains the target labels of the target song.
The default song tag database stores the sample song of multiple sample songs in one of the embodiments, Word, sample cadence information, the corresponding standard label of each sample song, the corresponding sample song of each standard label sample The sample characteristics word information of eigen word information and sample cadence information and the corresponding all sample songs of same standard label Intersection and sample cadence information intersection.
In one of the embodiments, as shown in fig. 6, Feature Words acquisition module 102 includes:
Notional word acquiring unit 1021 extracts institute for the target lyrics of the target song to be split into multiple group of words State the target notional word information in multiple group of words;
Feature Words determination unit 1022, in conjunction with the target notional word information and the corresponding sample of each standard label The sample characteristics word information and sample cadence information of this song obtain the validity feature word information.
In one of the embodiments, as shown in fig. 7, the Feature Words determination unit 1022 includes:
Information gain computing unit 10221, for corresponding in conjunction with the target notional word information and each standard label Sample song sample characteristics word information and sample cadence information obtain each target notional word in the target notional word information Information gain;
Sequencing unit 10222, for by all target notional words in the target notional word information according to corresponding informance gain Size sequence chooses the target notional word of the maximum predetermined number of information gain as the validity feature word information.
In one of the embodiments, as shown in fig. 6, the label determining module 106 includes:
Probability calculation unit 1061, in conjunction with the validity feature word information, the target cadence information and same The sample characteristics word information intersection and sample cadence information intersection of the corresponding all sample songs of standard label, using default classification Model calculates separately the probability that the target song belongs to each standard label;
Label selection unit 1062, for choosing the corresponding standard label of the target song maximum probability as the mesh Mark label.
The target cadence information includes mean tempo rate information and rhythm pause duration in one of the embodiments, Information.
In one of the embodiments, as shown in fig. 6, rhythm acquisition module 104 includes:
Rhythm rate calculation unit 1041, the time for length and target lyrics occupancy in conjunction with the target lyrics are obtained Take the mean tempo rate information of the target lyrics;
Rhythm pause computing unit 1042 stops for obtaining the rhythm in conjunction with lyrics pause duration in the target lyrics Immediately long message.
In one embodiment, a kind of computer equipment is provided, which can be server, internal junction Composition can be as shown in Figure 8.The computer equipment include the processor connected by device bus, memory, network interface and Database.Wherein, the processor of the computer equipment is for providing calculating and control ability.The memory packet of the computer equipment Include non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating device, computer program and data Library.The built-in storage provides environment for the operation of operating device and computer program in non-volatile memory medium.The calculating The database of machine equipment is used to store song label and determines the data being related to.The network interface of the computer equipment is used for and outside Terminal by network connection communication.To realize that a kind of song label determines method when the computer program is executed by processor.
In one embodiment, a kind of computer equipment is provided, which can be terminal, internal structure Figure can be as shown in Figure 9.The computer equipment includes processor, the memory, network interface, display connected by system bus Screen and input unit.Wherein, the processor of the computer equipment is for providing calculating and control ability.The computer equipment is deposited Reservoir includes non-volatile memory medium, built-in storage.The non-volatile memory medium is stored with operating system and computer journey Sequence.The built-in storage provides environment for the operation of operating system and computer program in non-volatile memory medium.The calculating The network interface of machine equipment is used to communicate with external terminal by network connection.When the computer program is executed by processor with Realize that a kind of song label determines method.The display screen of the computer equipment can be liquid crystal display or electric ink is shown Screen, the input unit of the computer equipment can be the touch layer covered on display screen, be also possible on computer equipment shell Key, trace ball or the Trackpad of setting can also be external keyboard, Trackpad or mouse etc..
It will be understood by those skilled in the art that structure shown in Fig. 8-9, only part relevant to application scheme The block diagram of structure, does not constitute the restriction for the computer equipment being applied thereon to application scheme, and specific computer is set Standby may include perhaps combining certain components or with different component layouts than more or fewer components as shown in the figure.
In one embodiment, a kind of computer equipment, including memory and processor are provided, is stored in memory Computer program, the processor perform the steps of the target lyrics according to target song and preset when executing computer program Song tag database obtains the validity feature word information in the target lyrics;According to the temporal information of the target song and The target lyrics obtain the target cadence information of the target song;In conjunction with the validity feature word information, the target section Information and the default song tag database are played, the target labels of the target song are obtained.
The default song tag database stores when processor executes computer program in one of the embodiments, The sample lyrics, sample cadence information, the corresponding standard label of each sample song, each standard mark of multiple sample songs Sign the sample characteristics word information and sample cadence information and the corresponding all samples of same standard label of corresponding sample song The sample characteristics word information intersection and sample cadence information intersection of song.
In one of the embodiments, processor execute computer program when the target lyrics according to target song and It includes: by the target of the target song that default song tag database, which obtains multiple validity feature words in the target lyrics, The lyrics split into multiple group of words, extract the target notional word information in the multiple group of words;In conjunction with the target notional word information The sample characteristics word information and sample cadence information of sample song corresponding with each standard label obtain effective spy Levy word information.
Target notional word information described in the combination and every when processor executes computer program in one of the embodiments, The sample characteristics word information and sample cadence information of the corresponding sample song of one standard label obtain the validity feature word Information includes: the sample characteristics word information in conjunction with the target notional word information and the corresponding sample song of each standard label The information gain of each target notional word in the target notional word information is obtained with sample cadence information;The target notional word is believed All target notional words in breath sort according to corresponding informance gain size, and the target for choosing the maximum predetermined number of information gain is real Word is as the validity feature word information.
Validity feature word information described in combination when processor executes computer program in one of the embodiments, The target cadence information and the default song tag database, the target labels for obtaining the target song include: knot Close the sample of the validity feature word information, the target cadence information and the corresponding all sample songs of same standard label Feature word information intersection and sample cadence information intersection, using default disaggregated model calculate separately the target song belong to it is each The probability of the standard label;The corresponding standard label of the target song maximum probability is chosen as the target labels.
It includes mean tempo that processor, which executes target cadence information when computer program, in one of the embodiments, Rate information and rhythm pause duration information.
Processor executes described according to the time of the target song letter when computer program in one of the embodiments, Ceasing the target cadence information for obtaining the target song with the target lyrics includes: the length and mesh in conjunction with the target lyrics The time that the mark lyrics occupy obtains the mean tempo rate information of the target lyrics;In conjunction with the lyrics in the target lyrics Pause duration obtains the rhythm pause duration information.
In one embodiment, a kind of computer readable storage medium is provided, computer program is stored thereon with, is calculated Machine program performs the steps of when being executed by processor to be obtained according to the target lyrics and default song tag database of target song Take the validity feature word information in the target lyrics;It is obtained according to the temporal information of the target song and the target lyrics The target cadence information of the target song;In conjunction with the validity feature word information, the target cadence information and described pre- If song tag database obtains the target labels of the target song.
Default song tag database storage when computer program is executed by processor in one of the embodiments, There are the sample lyrics, sample cadence information, the corresponding standard label of each sample song, each standard of multiple sample songs The sample characteristics word information and sample cadence information of the corresponding sample song of label and the corresponding all samples of same standard label The sample characteristics word information intersection and sample cadence information intersection of this song.
The target lyrics according to target song when computer program is executed by processor in one of the embodiments, Obtaining multiple validity feature words in the target lyrics with default song tag database includes: by the mesh of the target song The mark lyrics split into multiple group of words, extract the target notional word information in the multiple group of words;Believe in conjunction with the target notional word The sample characteristics word information and sample cadence information of breath and the corresponding sample song of each standard label obtain described effective Feature word information.
When computer program is executed by processor in one of the embodiments, target notional word information described in the combination and The sample characteristics word information and sample cadence information of the corresponding sample song of each standard label obtain the validity feature Word information includes: to believe in conjunction with the sample characteristics word of the target notional word information and the corresponding sample song of each standard label Breath and sample cadence information obtain the information gain of each target notional word in the target notional word information;By the target notional word All target notional words in information sort according to corresponding informance gain size, choose the target of the maximum predetermined number of information gain Notional word is as the validity feature word information.
Validity feature word described in the combination is believed when computer program is executed by processor in one of the embodiments, Breath, the target cadence information and the default song tag database, the target labels for obtaining the target song include: In conjunction with the sample of the validity feature word information, the target cadence information and the corresponding all sample songs of same standard label Eigen word information intersection and sample cadence information intersection, calculate separately the target song using default disaggregated model and belong to often The probability of one standard label;The corresponding standard label of the target song maximum probability is chosen as the target labels.
The target cadence information includes average section when computer program is executed by processor in one of the embodiments, Play rate information and rhythm pause duration information.
The time according to target song when computer program is executed by processor in one of the embodiments, Information and the target lyrics obtain the target song target cadence information include: in conjunction with the target lyrics length and The time that the target lyrics occupy obtains the mean tempo rate information of the target lyrics;In conjunction with being sung in the target lyrics Word pause duration obtains the rhythm pause duration information.
Those of ordinary skill in the art will appreciate that realizing all or part of the process in above-described embodiment method, being can be with Relevant hardware is instructed to complete by computer program, the computer program can be stored in a non-volatile computer In read/write memory medium, the computer program is when being executed, it may include such as the process of the embodiment of above-mentioned each method.Wherein, To any reference of memory, storage, database or other media used in each embodiment provided herein, Including non-volatile and/or volatile memory.Nonvolatile memory may include read-only memory (ROM), programming ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM) or flash memory.Volatile memory may include Random access memory (RAM) or external cache.By way of illustration and not limitation, RAM is available in many forms, Such as static state RAM (SRAM), dynamic ram (DRAM), synchronous dram (SDRAM), double data rate sdram (DDRSDRAM), enhancing Type SDRAM (ESDRAM), synchronization link (Synchlink) DRAM (SLDRAM), memory bus (Rambus) direct RAM (RDRAM), direct memory bus dynamic ram (DRDRAM) and memory bus dynamic ram (RDRAM) etc..
Each technical characteristic of above embodiments can be combined arbitrarily, for simplicity of description, not to above-described embodiment In each technical characteristic it is all possible combination be all described, as long as however, the combination of these technical characteristics be not present lance Shield all should be considered as described in this specification.
The embodiments described above only express several embodiments of the present invention, and the description thereof is more specific and detailed, but simultaneously It cannot therefore be construed as limiting the scope of the patent.It should be pointed out that coming for those of ordinary skill in the art It says, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to protection of the invention Range.Therefore, the scope of protection of the patent of the invention shall be subject to the appended claims.

Claims (10)

1. a kind of song label determines method, which is characterized in that the described method includes:
The validity feature word in the target lyrics is obtained according to the target lyrics of target song and default song tag database Information;
The target cadence information of the target song is obtained according to the temporal information of the target song and the target lyrics;
In conjunction with the validity feature word information, the target cadence information and the default song tag database, institute is obtained State the target labels of target song.
2. song label according to claim 1 determines method, which is characterized in that the default song tag database storage There are the sample lyrics, sample cadence information, the corresponding standard label of each sample song, each mark of multiple sample songs The sample characteristics word information and sample cadence information of the corresponding sample song of fiducial mark label and same standard label are corresponding all The sample characteristics word information intersection and sample cadence information intersection of sample song.
3. song label according to claim 2 determines method, which is characterized in that described to be sung according to the target of target song Word and default song tag database obtain multiple validity feature words in the target lyrics
The target lyrics of the target song are split into multiple group of words, extract the target notional word letter in the multiple group of words Breath;
In conjunction with the sample characteristics word information and sample of the target notional word information and the corresponding sample song of each standard label Validity feature word information described in this rhythm acquisition of information.
4. song label according to claim 3 determines method, which is characterized in that target notional word information described in the combination The sample characteristics word information and sample cadence information of sample song corresponding with each standard label obtain effective spy Levying word information includes:
In conjunction with the sample characteristics word information and sample of the target notional word information and the corresponding sample song of each standard label The information gain of each target notional word in target notional word information described in this rhythm acquisition of information;
All target notional words in the target notional word information are sorted according to corresponding informance gain size, choose information gain most The target notional word of big predetermined number is as the validity feature word information.
5. song label according to claim 2 determines method, which is characterized in that the letter of validity feature word described in the combination Breath, the target cadence information and the default song tag database, the target labels for obtaining the target song include:
In conjunction with the validity feature word information, the target cadence information and the corresponding all sample songs of same standard label Sample characteristics word information intersection and sample cadence information intersection, the target song category is calculated separately using default disaggregated model In the probability of each standard label;
The corresponding standard label of the target song maximum probability is chosen as the target labels.
6. song label according to claim 1 determines method, which is characterized in that the target cadence information includes average Rhythm rate information and rhythm pause duration information.
7. song label according to claim 6 determines method, which is characterized in that it is described according to the target song when Between information and the target lyrics obtain the target cadence information of the target song and include:
The mean tempo of the target lyrics is obtained in conjunction with the time that the length and the target lyrics of the target lyrics occupy Rate information;
The rhythm pause duration information is obtained in conjunction with lyrics pause duration in the target lyrics.
8. a kind of song label determining device, which is characterized in that described device includes:
Feature Words obtain module, for obtaining the target according to the target lyrics of target song and default song tag database Validity feature word information in the lyrics;
Rhythm obtains module, for obtaining the target song according to the temporal information of the target song and the target lyrics Target cadence information;
Label determining module, in conjunction with the validity feature word information, the target cadence information and the default song Tag database obtains the target labels of the target song.
9. a kind of computer equipment, including memory and processor, the memory are stored with computer program, feature exists In the step of processor realizes any one of claims 1 to 7 the method when executing the computer program.
10. a kind of computer readable storage medium, is stored thereon with computer program, which is characterized in that the computer program The step of method described in any one of claims 1 to 7 is realized when being executed by processor.
CN201810812529.1A 2018-07-23 2018-07-23 Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing Pending CN109063069A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810812529.1A CN109063069A (en) 2018-07-23 2018-07-23 Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810812529.1A CN109063069A (en) 2018-07-23 2018-07-23 Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing

Publications (1)

Publication Number Publication Date
CN109063069A true CN109063069A (en) 2018-12-21

Family

ID=64836150

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810812529.1A Pending CN109063069A (en) 2018-07-23 2018-07-23 Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing

Country Status (1)

Country Link
CN (1) CN109063069A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110633476A (en) * 2019-09-27 2019-12-31 北京百度网讯科技有限公司 Method and device for acquiring knowledge annotation information
CN110727840A (en) * 2019-09-27 2020-01-24 浙江大搜车软件技术有限公司 Vehicle inquiry tag pushing method and device, computer equipment and storage medium
CN111026908A (en) * 2019-12-10 2020-04-17 腾讯科技(深圳)有限公司 Song label determination method and device, computer equipment and storage medium
CN111859014A (en) * 2020-06-29 2020-10-30 维沃移动通信有限公司 Data labeling method and device
CN112163116A (en) * 2020-09-28 2021-01-01 广州酷狗计算机科技有限公司 Song classification method and device and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101963972A (en) * 2010-07-01 2011-02-02 深港产学研基地产业发展中心 Method and system for extracting emotional keywords
CN102654859A (en) * 2011-03-01 2012-09-05 北京彩云在线技术开发有限公司 Method and system for recommending songs
CN103488782A (en) * 2013-09-30 2014-01-01 华北电力大学 Method for recognizing musical emotion through lyrics
CN103678274A (en) * 2013-04-15 2014-03-26 南京邮电大学 Feature extraction method for text categorization based on improved mutual information and entropy
CN105404674A (en) * 2015-11-20 2016-03-16 焦点科技股份有限公司 Knowledge-dependent webpage information extraction method
CN105868372A (en) * 2016-03-31 2016-08-17 广州酷狗计算机科技有限公司 Label distribution method and device

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101963972A (en) * 2010-07-01 2011-02-02 深港产学研基地产业发展中心 Method and system for extracting emotional keywords
CN102654859A (en) * 2011-03-01 2012-09-05 北京彩云在线技术开发有限公司 Method and system for recommending songs
CN103678274A (en) * 2013-04-15 2014-03-26 南京邮电大学 Feature extraction method for text categorization based on improved mutual information and entropy
CN103488782A (en) * 2013-09-30 2014-01-01 华北电力大学 Method for recognizing musical emotion through lyrics
CN105404674A (en) * 2015-11-20 2016-03-16 焦点科技股份有限公司 Knowledge-dependent webpage information extraction method
CN105868372A (en) * 2016-03-31 2016-08-17 广州酷狗计算机科技有限公司 Label distribution method and device

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
郑亚斌等: ""中文歌词的统计特征及其检索应用"", 《中文信息学报》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110633476A (en) * 2019-09-27 2019-12-31 北京百度网讯科技有限公司 Method and device for acquiring knowledge annotation information
CN110727840A (en) * 2019-09-27 2020-01-24 浙江大搜车软件技术有限公司 Vehicle inquiry tag pushing method and device, computer equipment and storage medium
CN110727840B (en) * 2019-09-27 2022-07-05 浙江大搜车软件技术有限公司 Vehicle inquiry tag pushing method and device, computer equipment and storage medium
CN110633476B (en) * 2019-09-27 2024-04-05 北京百度网讯科技有限公司 Method and device for acquiring knowledge annotation information
CN111026908A (en) * 2019-12-10 2020-04-17 腾讯科技(深圳)有限公司 Song label determination method and device, computer equipment and storage medium
CN111026908B (en) * 2019-12-10 2023-09-08 腾讯科技(深圳)有限公司 Song label determining method, device, computer equipment and storage medium
CN111859014A (en) * 2020-06-29 2020-10-30 维沃移动通信有限公司 Data labeling method and device
CN112163116A (en) * 2020-09-28 2021-01-01 广州酷狗计算机科技有限公司 Song classification method and device and computer readable storage medium

Similar Documents

Publication Publication Date Title
CN109063069A (en) Song label determines method, apparatus, computer equipment and readable storage medium storing program for executing
Benchimol et al. Text mining methodologies with R: An application to central bank texts
Lai et al. Exploring the research in information technology implementation
CN109002492B (en) Performance point prediction method based on LightGBM
CN110362823A (en) The training method and device of text generation model are described
CN108959271A (en) Document creation method, device, computer equipment and readable storage medium storing program for executing are described
CN107038173A (en) Application query method and apparatus, similar application detection method and device
CN107194430A (en) A kind of screening sample method and device, electronic equipment
CN110457677B (en) Entity relationship identification method and device, storage medium and computer equipment
CN107818491A (en) Electronic installation, Products Show method and storage medium based on user's Internet data
CN106874314A (en) The method and apparatus of information recommendation
CN110472049B (en) Disease screening text classification method, computer device and readable storage medium
CN110276382A (en) Listener clustering method, apparatus and medium based on spectral clustering
US20070136220A1 (en) Apparatus for learning classification model and method and program thereof
CN109858009A (en) Device, method and its computer storage medium of control instruction are generated according to text
CN106294128B (en) A kind of automated testing method and device exporting report data
CN105786898B (en) A kind of construction method and device of domain body
CN110111143A (en) A kind of control method and control device for establishing mobile end subscriber portrait
CN111159167B (en) Labeling quality detection device and method
CN109783740A (en) Pay close attention to the sort method and device of the page
CN109614982A (en) Product analysis method, apparatus, computer equipment and storage medium
CN108647714A (en) Acquisition methods, terminal device and the medium of negative label weight
CN114692889A (en) Meta-feature training model for machine learning algorithm
De Crombrugghe et al. Statistical Demand Functions for Food in the USA and the Netherlands
CN110134935A (en) A kind of method, device and equipment for extracting font style characteristic

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20181221