CN104157296B - Audio evaluation method and device - Google Patents

Audio evaluation method and device

Info

Publication number
CN104157296B
Authority
CN
China
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410362595.5A
Other languages
Chinese (zh)
Other versions
CN104157296A (en)
Inventor
赵伟峰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Kugou Computer Technology Co Ltd
Original Assignee
Tencent Technology Shenzhen Co Ltd
Application filed by Tencent Technology Shenzhen Co Ltd
Priority to CN201410362595.5A
Publication of CN104157296A
Application granted
Publication of CN104157296B

Abstract

Embodiments of the present invention provide an audio evaluation method and device. The method may comprise: obtaining the order, within a target audio file, of a performance sentence to be evaluated, together with the feature sequence under test of that performance sentence; locating a reference sentence in a source audio file according to the order of the performance sentence in the target audio file, and obtaining the reference feature sequence of the reference sentence; performing a correlation operation on the reference feature sequence and the feature sequence under test to obtain a correlation coefficient sequence; and determining the evaluation score of the performance sentence according to the correlation coefficient sequence. The invention can evaluate the performance sentences contained in a target audio file, which meets application demands on audio files and makes audio file applications more intelligent.

Description

Audio evaluation method and device
Technical field
The present invention relates to the field of Internet technology, specifically to the field of audio processing, and in particular to an audio evaluation method and device.
Background
With the development of Internet technology, Internet audio libraries have come to contain large numbers of audio files such as songs and song clips, and applications built on Internet audio keep growing, for example KTV (Karaoke Television) systems and karaoke singing systems. When using an audio file, most users want to know in real time how well they are performing. For example, a user singing a song wants an evaluation score for each sentence of the song in order to understand how each sentence was sung. How to evaluate individual audio sentences in audio files such as songs has therefore become a technical problem in urgent need of a solution.
Summary of the invention
Embodiments of the present invention provide an audio evaluation method and device that can evaluate the performance sentences contained in a target audio file, which meets application demands on audio files and makes audio file applications more intelligent.
A first aspect of the embodiments of the present invention provides an audio evaluation method, which may comprise:
obtaining the order, within a target audio file, of a performance sentence to be evaluated, and the feature sequence under test of the performance sentence;
locating a reference sentence in a source audio file according to the order of the performance sentence in the target audio file, and obtaining the reference feature sequence of the reference sentence;
performing a correlation operation on the reference feature sequence and the feature sequence under test to obtain a correlation coefficient sequence;
determining the evaluation score of the performance sentence according to the correlation coefficient sequence.
A second aspect of the embodiments of the present invention provides an audio evaluation device, which may comprise:
an under-test acquisition module, configured to obtain the order, within a target audio file, of a performance sentence to be evaluated, and the feature sequence under test of the performance sentence;
a reference acquisition module, configured to locate a reference sentence in a source audio file according to the order of the performance sentence in the target audio file, and to obtain the reference feature sequence of the reference sentence;
a correlation operation module, configured to perform a correlation operation on the reference feature sequence and the feature sequence under test to obtain a correlation coefficient sequence;
an evaluation module, configured to determine the evaluation score of the performance sentence according to the correlation coefficient sequence.
Implementing the embodiments of the present invention has the following beneficial effects:
In the embodiments of the present invention, the feature sequence under test of the performance sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file are obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the performance sentence of the target audio file is evaluated on the basis of that correlation coefficient sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
Brief description of the drawings
To illustrate the technical solutions of the embodiments of the present invention or of the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are evidently only some embodiments of the present invention; a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of an audio evaluation method provided by an embodiment of the present invention;
Fig. 2 is a flowchart of an embodiment of step S103 shown in Fig. 1;
Fig. 3 is a flowchart of one embodiment of step S104 shown in Fig. 1;
Fig. 4 is a flowchart of another embodiment of step S104 shown in Fig. 1;
Fig. 5 is a schematic structural diagram of an audio evaluation device provided by an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of an embodiment of the correlation operation module shown in Fig. 5;
Fig. 7 is a schematic structural diagram of one embodiment of the evaluation module shown in Fig. 5;
Fig. 8 is a schematic structural diagram of another embodiment of the evaluation module shown in Fig. 5.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings of the embodiments. Evidently, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art on the basis of these embodiments without creative effort fall within the scope of protection of the present invention.
In the embodiments of the present invention, audio files include but are not limited to songs, song clips, and similar files. A source audio file is a file that can serve as the reference for audio evaluation, for example an original recording of a song or a clip cut from the original recording. A target audio file is a file that needs to be evaluated, for example a user's cover of the original song or a clip cut from that cover.
In the embodiments of the present invention, an audio file may consist of at least one audio sentence arranged in order; together these audio sentences describe the parts of the audio file that are to be sung. Taking song A as an example, the description of song A can be expressed as follows:
[661,860]aaaaaaaa
[1541,320]bbbbbbbb
[1871,245]cccccccc
……
In the above description of song A, "aaaaaaaa", "bbbbbbbb", and "cccccccc" each denote an audio sentence. The "[]" before each audio sentence describes the time attributes of that sentence, normally in units of ms. For example, [661,860] describes the time attributes of the audio sentence "aaaaaaaa": "661" is the start time of the sentence and "860" is its duration. Supposing song A lasts 5 minutes in total, the audio sentence "aaaaaaaa" starts being sung at 661 ms and finishes after lasting 860 ms. The order of the audio sentences in an audio file can be determined from the sequence of their start times. For example, according to the description of song A, "aaaaaaaa" is the first audio sentence and its order in song A is 1; "bbbbbbbb" is the second audio sentence and its order in song A is 2; and so on. It will be appreciated that an audio file may also contain parts that do not need to be sung, before an audio sentence starts or after it ends. For example, the 0-661 ms period of song A is such a part, and it may contain the prelude.
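As a minimal sketch of how such a description could be read, the Python below parses lines of the form `[start,duration]text` and numbers the sentences by start time. The names `Sentence` and `parse_description` are hypothetical and do not come from the patent, and the regular expression assumes that each audio sentence occupies exactly one line.

```python
import re
from typing import List, NamedTuple


class Sentence(NamedTuple):
    order: int        # 1-based position in the audio file
    start_ms: int     # start time in milliseconds
    duration_ms: int  # duration in milliseconds
    text: str         # content of the audio sentence


# Assumed line layout: "[start,duration]text"
LINE_RE = re.compile(r"^\[(\d+),\s*(\d+)\](.*)$")


def parse_description(description: str) -> List[Sentence]:
    """Parse a song description and assign orders by ascending start time."""
    rows = []
    for line in description.splitlines():
        match = LINE_RE.match(line.strip())
        if match:  # skip lines that do not describe a sentence
            rows.append((int(match.group(1)), int(match.group(2)), match.group(3)))
    rows.sort(key=lambda row: row[0])
    return [Sentence(order, *row) for order, row in enumerate(rows, start=1)]


song_a = parse_description(
    "[661,860]aaaaaaaa\n[1541,320]bbbbbbbb\n[1871,245]cccccccc"
)
for sentence in song_a:
    print(sentence.order, sentence.start_ms, sentence.duration_ms, sentence.text)
```

Sorting by start time before numbering mirrors the rule stated above that the order of each audio sentence follows from the sequence of start times.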
In the embodiments of the present invention, the source audio file consists of at least one audio sentence arranged in order, and such an audio sentence may be called a reference sentence. The target audio file consists of at least one audio sentence arranged in order, and such an audio sentence may be called a performance sentence. Unless otherwise stated, in the following embodiments "the reference sentence" refers specifically to the reference sentence located in the source audio file, and "the performance sentence" refers specifically to the performance sentence to be evaluated in the target audio file.
The audio evaluation method provided by the embodiments of the present invention is described in detail below with reference to Fig. 1 to Fig. 4.
Fig. 1 is a flowchart of an audio evaluation method provided by an embodiment of the present invention. The method may comprise the following steps S101 to S104.
S101: obtain the order, within the target audio file, of the performance sentence to be evaluated, and the feature sequence under test of the performance sentence.
A note is a symbol used to record sounds of different durations; note types include the whole note, half note, quarter note, eighth note, and so on. An audio sentence can be represented as a frame sequence made up of multiple audio frames, each of which carries a note; the notes, taken in the time order of the audio frames within the sentence, form the melody of the audio sentence. Pitch is how high or low a sound is. An audio sentence can likewise be represented as a frame sequence in which each audio frame carries a pitch; the pitches, taken in the time order of the audio frames within the sentence, form the melody of the audio sentence. In short, both the note sequence and the pitch sequence of an audio sentence reflect its melodic features.
Preferably, in the embodiments of the present invention, the performance sentence to be evaluated may be the current performance sentence, in which case the audio evaluation is a real-time evaluation. The current performance sentence is the performance sentence in the target audio file that corresponds to the current playback time. Take the case where the target audio file is song A with the description given above: suppose song A lasts 5 minutes in total and the current playback time is 1895 ms. According to the description of song A, 1895 ms falls within the period described by the time attributes of the audio sentence "cccccccc" (1871 ms to 2116 ms), so "cccccccc" is the current performance sentence and hence the performance sentence to be evaluated, and its order in the target audio file is 3.
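The lookup in this example can be sketched as follows. The function name `current_sentence_order` is hypothetical, and treating each sentence as the half-open interval [start, start + duration) is an assumption; the text does not say whether the end point is inclusive.

```python
from typing import List, Optional, Tuple

# (start_ms, duration_ms, text) for song A, listed in start-time order.
SONG_A: List[Tuple[int, int, str]] = [
    (661, 860, "aaaaaaaa"),
    (1541, 320, "bbbbbbbb"),
    (1871, 245, "cccccccc"),
]


def current_sentence_order(
    sentences: List[Tuple[int, int, str]], now_ms: int
) -> Optional[int]:
    """Return the 1-based order of the sentence being sung at now_ms, if any."""
    for order, (start, duration, _text) in enumerate(sentences, start=1):
        # Assumption: a sentence covers start <= t < start + duration.
        if start <= now_ms < start + duration:
            return order
    return None  # prelude, interlude, or other part that is not sung


print(current_sentence_order(SONG_A, 1895))  # 3
print(current_sentence_order(SONG_A, 100))   # None (prelude)
```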
In this step, the feature sequence under test of the performance sentence to be evaluated is obtained; the feature sequence under test is the note sequence or the pitch sequence of that performance sentence. Preferably, the feature sequence under test is obtained after the performance sentence to be evaluated has finished being sung.
S102: according to the order of the performance sentence in the target audio file, locate a reference sentence in the source audio file and obtain the reference feature sequence of the reference sentence.
The source audio file may consist of at least one reference sentence arranged in order. In this step, the order of the located reference sentence in the source audio file is the same as the order of the performance sentence to be evaluated in the target audio file. Continuing the example above, if song A is the target audio file, then song B, the original recording of song A as released, is the source audio file. The order of the performance sentence to be evaluated is 3, so the order of the reference sentence located in song B is also 3; that is, the third reference sentence of song B is chosen as the evaluation reference for the performance sentence to be evaluated.
In one feasible implementation of the embodiments of the present invention, the feature sequence under test is the note sequence of the performance sentence, and the reference feature sequence is the note sequence of the reference sentence. In another feasible implementation, the feature sequence under test is the pitch sequence of the performance sentence, and the reference feature sequence is the pitch sequence of the reference sentence.
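Locating the reference by order amounts to an index lookup. The sketch below assumes the source file has already been reduced to one feature sequence per reference sentence; the function name `locate_reference` and the pitch values for song B are hypothetical illustration data.

```python
from typing import List, Sequence


def locate_reference(
    source_features: Sequence[List[float]], order: int
) -> List[float]:
    """Return the feature sequence of the reference sentence with a 1-based order."""
    if not 1 <= order <= len(source_features):
        raise IndexError(f"source file has no reference sentence with order {order}")
    return source_features[order - 1]


# Hypothetical per-frame pitch sequences for the three reference sentences of song B.
song_b = [[60, 62, 64], [65, 65, 67], [67, 69, 71]]

# The performance sentence under evaluation has order 3, so the third
# reference sentence is used as the evaluation reference.
reference = locate_reference(song_b, 3)
print(reference)  # [67, 69, 71]
```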
S103: perform a correlation operation on the reference feature sequence and the feature sequence under test to obtain a correlation coefficient sequence.
The reference feature sequence characterizes the melodic features of the reference sentence located in the source audio file, and the feature sequence under test characterizes the melodic features of the performance sentence to be evaluated in the target audio file. In this step, a correlation operation between the two sequences yields the correlation coefficient sequence.
S104: determine the evaluation score of the performance sentence according to the correlation coefficient sequence.
In this step, the higher the evaluation score of the performance sentence to be evaluated, the better it was sung and the closer it is to the located reference sentence. Conversely, the lower the score, the worse it was sung and the further it departs from the located reference sentence.
In the embodiments of the present invention, the feature sequence under test of the performance sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file are obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the performance sentence of the target audio file is evaluated on the basis of that correlation coefficient sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
The steps of the audio evaluation method shown in Fig. 1 are described in detail below with reference to Fig. 2 to Fig. 4. Note that in the embodiments shown in Fig. 2 to Fig. 4, the located reference sentence comprises N audio frames and the reference feature sequence is denoted p(i), where i is an integer and 0≤i≤N-1. Specifically, if the reference feature sequence is the note sequence of the reference sentence, p(0) is the note of the first audio frame of the located reference sentence, p(1) is the note of the second audio frame, and so on, up to p(N-1), the note of the N-th audio frame. If the reference feature sequence is the pitch sequence of the reference sentence, p(0) is the pitch of the first audio frame of the located reference sentence, p(1) is the pitch of the second audio frame, and so on, up to p(N-1), the pitch of the N-th audio frame.
The performance sentence to be evaluated is likewise taken to comprise N audio frames, and the feature sequence under test is denoted s(i), where i is an integer and 0≤i≤N-1. Specifically, if the feature sequence under test is the note sequence of the performance sentence, s(0) is the note of the first audio frame of the performance sentence to be evaluated, s(1) is the note of the second audio frame, and so on, up to s(N-1), the note of the N-th audio frame. If the feature sequence under test is the pitch sequence of the performance sentence, s(0) is the pitch of the first audio frame of the performance sentence to be evaluated, s(1) is the pitch of the second audio frame, and so on, up to s(N-1), the pitch of the N-th audio frame.
Fig. 2 is a flowchart of an embodiment of step S103 shown in Fig. 1. Step S103 may comprise the following steps s1001 to s1004.
s1001: calculate the mean of the reference feature sequence and the mean of the feature sequence under test.
In this step, formula (1) below may be used to calculate the mean of the reference feature sequence p(i) and the mean of the feature sequence under test s(i):
MP = mean(p(i))
MS = mean(s(i))    (1)
In formula (1), MP is the mean of the reference feature sequence p(i), MS is the mean of the feature sequence under test s(i), and mean() is the averaging operation.
s1002: normalize the reference feature sequence using its mean, and normalize the feature sequence under test using its mean.
The purpose of normalization is to bring the reference feature sequence and the feature sequence under test onto the same baseline, so that a difference between their mean levels does not bias the calculation.
In this step, formula (2) may be used to normalize the reference feature sequence:
p2(i) = p(i) - MP    (2)
In formula (2), p2(i) is the reference feature sequence after normalization.
In this step, formula (3) may be used to normalize the feature sequence under test:
s2(i) = s(i) - MS    (3)
In formula (3), s2(i) is the feature sequence under test after normalization.
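Formulas (1) to (3) can be sketched in a few lines of Python. The function name `normalize` and the sample pitch values are hypothetical; the example is chosen so that the sequence under test is the reference melody sung a fixed interval lower, which shows why subtracting the mean puts both sequences on the same baseline.

```python
from statistics import fmean
from typing import List, Sequence


def normalize(seq: Sequence[float]) -> List[float]:
    """Subtract the sequence mean from every element (formulas (1) to (3))."""
    mean = fmean(seq)  # MP for the reference, MS for the sequence under test
    return [x - mean for x in seq]


p = [60, 62, 64, 62]  # hypothetical reference pitch sequence p(i)
s = [48, 50, 52, 50]  # hypothetical sequence under test s(i), 12 units lower

p2 = normalize(p)
s2 = normalize(s)
print(p2)  # [-2.0, 0.0, 2.0, 0.0]
print(s2)  # [-2.0, 0.0, 2.0, 0.0]
```

After normalization the two sequences coincide even though their means differ (62 versus 50), so a constant offset between singer and reference no longer affects the later correlation.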
s1003: using a preset clipping threshold, convert the normalized reference feature sequence into a reference value sequence, and convert the normalized feature sequence under test into a value sequence under test.
The preset clipping threshold can be set as needed. Preferably, it is set using formula (4):
Th_xue = max(max(abs(p2(i))), max(abs(s2(i))))    (4)
In formula (4), Th_xue is the preset clipping threshold, max() is the maximum operation, and abs() is the absolute-value operation.
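A minimal sketch of formula (4), taking the larger of the two peak absolute values. The function name `preset_clipping_threshold` and the sample sequences are hypothetical.

```python
from typing import Sequence


def preset_clipping_threshold(p2: Sequence[float], s2: Sequence[float]) -> float:
    """Formula (4): the largest absolute value found in either normalized sequence."""
    return max(max(abs(x) for x in p2), max(abs(x) for x in s2))


# Hypothetical normalized sequences p2(i) and s2(i).
th_xue = preset_clipping_threshold([-2.0, 0.0, 2.0, 0.0], [-3.0, 1.0, 1.0, 1.0])
print(th_xue)  # 3.0
```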
In this step, formula (5) may be used to convert the normalized reference feature sequence into the reference value sequence:
$$p3(i) = \begin{cases} 1, & p2(i) > \mathrm{Th\_xue} \\ -1, & p2(i) < \mathrm{Th\_xue} \\ 0, & \text{else} \end{cases} \qquad (5)$$
In formula (5), p3(i) is the reference value sequence.
In this step, formula (6) may be used to convert the normalized feature sequence under test into the value sequence under test:
$$s3(i) = \begin{cases} 1, & s2(i) > \mathrm{Th\_xue} \\ -1, & s2(i) < \mathrm{Th\_xue} \\ 0, & \text{else} \end{cases} \qquad (6)$$
In formula (6), s3(i) is the value sequence under test.
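The conversion can be sketched as below. The code follows the comparisons as they are printed in formulas (5) and (6), and takes the threshold as an argument because the text allows it to be set as needed. The function name `to_value_sequence` is hypothetical, and the threshold 0.0 used in the example is an illustrative assumption, not the preferred setting of formula (4).

```python
from typing import List, Sequence


def to_value_sequence(seq: Sequence[float], th_xue: float) -> List[int]:
    """Map each normalized value to 1, -1, or 0 by comparing it with th_xue."""
    values = []
    for x in seq:
        if x > th_xue:
            values.append(1)
        elif x < th_xue:
            values.append(-1)
        else:
            values.append(0)
    return values


# Assumption for illustration only: a threshold of 0.0.
p3 = to_value_sequence([-2.0, 0.0, 2.0, 0.0], 0.0)
print(p3)  # [-1, 0, 1, 0]
```

Reducing each frame to one of three values keeps only the direction of the melody relative to the threshold, which is what the subsequent correlation compares.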
s1004: use a cross-correlation function to perform a correlation operation on the reference value sequence and the value sequence under test, obtaining the correlation coefficient sequence.
In one feasible implementation of this step, formula (7) may be used to perform the correlation operation on the reference value sequence and the value sequence under test:
$$R(n) = \frac{1}{N} \sum_{i=0}^{N-1} p3(i) \cdot s3(i-n) \qquad (7)$$
In formula (7), R(n) is the correlation coefficient sequence; "·" denotes multiplication; s3(i-n) is the sequence formed by circularly shifting s3(i) by n, where 0≤n≤N-1.
In another feasible implementation of this step, formula (8) may be used to perform the correlation operation on the reference value sequence and the value sequence under test:
$$R(n) = \frac{1}{N} \sum_{i=0}^{N-1} p3(i-n) \cdot s3(i) \qquad (8)$$
In formula (8), R(n) is the correlation coefficient sequence; "·" denotes multiplication; p3(i-n) is the sequence formed by circularly shifting p3(i) by n, where 0≤n≤N-1.
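The circular cross-correlation of formula (7) can be sketched as follows. The function name `circular_xcorr` is hypothetical, and reading "circularly shifted by n" as taking the index (i - n) modulo N is an assumption about the intended shift direction. The sample sequence under test is the reference delayed by one frame.

```python
from typing import List, Sequence


def circular_xcorr(p3: Sequence[int], s3: Sequence[int]) -> List[float]:
    """Formula (7): R(n) = (1/N) * sum over i of p3(i) * s3((i - n) mod N)."""
    n_frames = len(p3)
    if n_frames == 0 or len(s3) != n_frames:
        raise ValueError("both sequences must have the same non-zero length N")
    return [
        sum(p3[i] * s3[(i - n) % n_frames] for i in range(n_frames)) / n_frames
        for n in range(n_frames)
    ]


p3 = [-1, 0, 1, 0]  # hypothetical reference value sequence
s3 = [0, -1, 0, 1]  # the same pattern, one frame late

r = circular_xcorr(p3, s3)
print(r)  # [0.0, -0.5, 0.0, 0.5]
```

Calling `circular_xcorr(s3, p3)` with the arguments swapped computes formula (8), since that formula shifts p3 instead of s3. Evaluating every shift n lets a performance that is slightly early or late still line up with the reference at some shift.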
Fig. 3 is a flowchart of one embodiment of step S104 shown in Fig. 1. Step S104 may comprise the following steps s2001 and s2002.
s2001: calculate the maximum of the correlation coefficient sequence.
In this step, formula (9) may be used to calculate the maximum of the correlation coefficient sequence:
RMAX = max(R(n))    (9)
In formula (9), R(n) is the correlation coefficient sequence; max() is the maximum operation; RMAX is the maximum of the correlation coefficient sequence.
s2002: take the maximum of the correlation coefficient sequence as the evaluation score of the performance sentence.
This step takes the maximum of the correlation coefficient sequence directly as the evaluation score of the performance sentence to be evaluated; that is, the evaluation score of the performance sentence is the value of RMAX.
Fig. 4 is a flowchart of another embodiment of step S104 shown in Fig. 1. Step S104 may comprise the following steps s3001 to s3003.
s3001: calculate the maximum of the correlation coefficient sequence.
For this step, see step s2001 of the embodiment shown in Fig. 3; the details are not repeated here.
s3002: map the maximum of the correlation coefficient sequence onto a preset score interval to obtain its mapped value.
The preset score interval can be set as needed; for example, it may be set to [0,10] or to [0,100]. In this step, the preset score interval is denoted [score_min, score_max]. The maximum RMAX of the correlation coefficient sequence is mapped onto this interval by a linear or nonlinear method; the resulting mapped value is denoted score, and it lies within [score_min, score_max].
s3003: take the mapped value as the evaluation score of the performance sentence.
This step takes the mapped value score as the evaluation score of the performance sentence to be evaluated; that is, the evaluation score of the performance sentence is the value of score.
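The text leaves the mapping method open, so the sketch below shows only one possible linear choice and should be read as an assumption. It relies on the observation that, when the value sequences take values in {-1, 0, 1} and the sum is divided by N, every R(n) lies in [-1, 1]; that range is stretched onto [score_min, score_max]. The function name `map_to_score` is hypothetical.

```python
from typing import Sequence


def map_to_score(rmax: float, score_min: float = 0.0, score_max: float = 100.0) -> float:
    """Linearly map RMAX from [-1, 1] onto [score_min, score_max] (assumed mapping)."""
    if not -1.0 <= rmax <= 1.0:
        raise ValueError("RMAX is expected to lie in [-1, 1]")
    return score_min + (rmax + 1.0) / 2.0 * (score_max - score_min)


r: Sequence[float] = [0.0, -0.5, 0.0, 0.5]  # hypothetical correlation coefficient sequence
rmax = max(r)                # formula (9): RMAX = max(R(n))
score = map_to_score(rmax)   # preset score interval [0, 100]
print(rmax, score)  # 0.5 75.0
```

A different design, such as clamping negative correlations to score_min or applying a nonlinear curve, would satisfy the text equally well.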
The audio evaluation device provided by the embodiments of the present invention is described in detail below with reference to Fig. 5 to Fig. 8. Note that the device described in Fig. 5 to Fig. 8 can be used to perform the method shown in Fig. 1 to Fig. 4. In practice, the audio evaluation device may run on a server, or on a terminal such as a notebook computer, mobile phone, PAD (tablet computer), or smart wearable device.
Fig. 5 is a schematic structural diagram of an audio evaluation device provided by an embodiment of the present invention. The device may comprise an under-test acquisition module 101, a reference acquisition module 102, a correlation operation module 103, and an evaluation module 104.
The under-test acquisition module 101 is configured to obtain the order, within the target audio file, of the performance sentence to be evaluated, and the feature sequence under test of the performance sentence.
A note, also called a musical note, is a symbol used to record sounds of different durations; notes include the whole note, minim, crotchet, quaver, and so on. An audio sentence can be represented as a frame sequence composed of multiple audio frames. Each audio frame carries a note, and the notes, arranged in the chronological order of the audio frames in the audio sentence, form the melody of the sentence. Pitch refers to how high or low a sound is. Likewise, each audio frame carries a pitch, and the pitches, arranged in the chronological order of the audio frames in the audio sentence, form the melody of the sentence. In summary, either the note sequence or the pitch sequence of an audio sentence can reflect the melodic features of that sentence.
In this embodiment of the present invention, the sung sentence to be evaluated is preferably the current sung sentence, in which case the audio evaluation process is a real-time audio evaluation process. The current sung sentence is the sung sentence in the target audio file that corresponds to the current playing time. Taking the song A described above as the target audio file: suppose song A lasts 5 minutes and the current playing time is 1895 ms. According to the description of song A, 1895 ms falls within the time period described by the time attribute of the audio sentence "cccccccc", so "cccccccc" can be determined to be the current sung sentence and, further, the sung sentence to be evaluated; its order in the target audio file is 3.
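The lookup of the current sung sentence from the playing time can be sketched as below. The `Sentence` type, the function `find_current_sentence`, the first two sentence texts, and all start times and durations are hypothetical values chosen only so that 1895 ms falls inside the third sentence, as in the example above; the actual time attributes of song A are not reproduced here.

```python
from dataclasses import dataclass


@dataclass
class Sentence:
    order: int        # 1-based position of the sentence in the audio file
    text: str
    start_ms: int     # start of the time period covered by the sentence
    duration_ms: int  # length of that time period


def find_current_sentence(sentences, now_ms):
    """Return the sentence whose time period contains now_ms, or None."""
    for sentence in sentences:
        if sentence.start_ms <= now_ms < sentence.start_ms + sentence.duration_ms:
            return sentence
    return None


# Hypothetical description of song A (illustrative timings only).
song_a = [
    Sentence(1, "aaaaaaaa", 0, 600),
    Sentence(2, "bbbbbbbb", 600, 900),
    Sentence(3, "cccccccc", 1500, 800),
]

current = find_current_sentence(song_a, 1895)
```

With these assumed timings, `current` is the sentence "cccccccc" with order 3, which then becomes the sung sentence to be evaluated.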
The to-be-measured acquisition module 101 can obtain the feature sequence to be measured of the sung sentence to be evaluated; this feature sequence is the note sequence or the pitch sequence of that sentence. Preferably, the to-be-measured acquisition module 101 obtains the feature sequence to be measured after the singing of the sentence to be evaluated has finished.
The reference acquisition module 102 is configured to locate a reference sentence in the source audio file according to the order of the sung sentence in the target audio file, and to obtain the reference feature sequence of the reference sentence.
The source audio file can consist of at least one reference sentence arranged in order. The order, in the source audio file, of the reference sentence located by the reference acquisition module 102 is the same as the order, in the target audio file, of the sung sentence to be evaluated. Following the example above, if song A is the target audio file, then song B, the original recording of song A as released, is the source audio file. Since the order of the sung sentence to be evaluated is 3, the order of the reference sentence located in song B is also 3; that is, the third reference sentence of song B is chosen as the evaluation benchmark for the sung sentence to be evaluated.
In one feasible implementation of this embodiment, the feature sequence to be measured is the note sequence of the sung sentence, and the reference feature sequence is the note sequence of the reference sentence. In another feasible implementation, the feature sequence to be measured is the pitch sequence of the sung sentence, and the reference feature sequence is the pitch sequence of the reference sentence.
The correlation operation module 103 is configured to perform a correlation operation on the reference feature sequence and the feature sequence to be measured, to obtain a correlation coefficient sequence.
Because the reference feature sequence characterizes the melodic features of the reference sentence located in the source audio file, and the feature sequence to be measured characterizes the melodic features of the sung sentence to be evaluated in the target audio file, the correlation operation module 103 can obtain a correlation coefficient sequence by performing a correlation operation between the two sequences.
The evaluation module 104 is configured to determine the evaluation score of the sung sentence according to the correlation coefficient sequence.
The higher the evaluation score of the sung sentence to be evaluated, the better its singing effect and the closer it is to the singing effect of the located reference sentence. Conversely, the lower the evaluation score, the poorer the singing effect and the further it departs from the singing effect of the located reference sentence.
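The cooperation of modules 101 to 104 can be sketched as a small class. This is only a structural illustration under stated assumptions: the class name `AudioEvaluationDevice`, the method `evaluate`, and the idea of passing the correlation and scoring steps in as callables are hypothetical, and each audio file is assumed to be already split into per-sentence feature sequences.

```python
class AudioEvaluationDevice:
    """Structural sketch of the four modules of Fig. 5 (names are hypothetical)."""

    def __init__(self, correlate, score):
        self.correlate = correlate  # stands in for correlation operation module 103
        self.score = score          # stands in for evaluation module 104

    def evaluate(self, target_sentences, source_sentences, order):
        # Module 101: feature sequence of the sung sentence at the given 1-based order.
        measured = target_sentences[order - 1]
        # Module 102: the reference sentence with the same order in the source file.
        reference = source_sentences[order - 1]
        # Module 103: correlation coefficient sequence of the two feature sequences.
        coefficients = self.correlate(reference, measured)
        # Module 104: evaluation score derived from the coefficient sequence.
        return self.score(coefficients)
```

Any correlation routine and any scoring rule with matching signatures can be plugged in, which mirrors the way Fig. 6 to Fig. 8 refine modules 103 and 104 separately.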
In this embodiment of the present invention, the feature sequence to be measured of the sung sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file can be obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the sung sentence of the target audio file is evaluated based on that sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
The structure and function of each module in the audio evaluation device shown in Fig. 5 are described in detail below with reference to Fig. 6 to Fig. 8. It should be noted that, in the embodiments shown in Fig. 6 to Fig. 8 below, the located reference sentence is assumed to comprise N audio frames, and the reference feature sequence can be expressed as p(i), where i is an integer and 0≤i≤N-1. Specifically, if the reference feature sequence is the note sequence of the reference sentence, then p(0) represents the note of the first audio frame in the located reference sentence, p(1) represents the note of the second audio frame, and so on, up to p(N-1), which represents the note of the N-th audio frame. If the reference feature sequence is the pitch sequence of the reference sentence, then p(0) represents the pitch of the first audio frame in the located reference sentence, p(1) represents the pitch of the second audio frame, and so on, up to p(N-1), which represents the pitch of the N-th audio frame.
The sung sentence to be evaluated is likewise assumed to comprise N audio frames, and the feature sequence to be measured can be expressed as s(i), where i is an integer and 0≤i≤N-1. Specifically, if the feature sequence to be measured is the note sequence of the sung sentence, then s(0) represents the note of the first audio frame in the sung sentence to be evaluated, s(1) represents the note of the second audio frame, and so on, up to s(N-1), which represents the note of the N-th audio frame. If the feature sequence to be measured is the pitch sequence of the sung sentence, then s(0) represents the pitch of the first audio frame in the sung sentence to be evaluated, s(1) represents the pitch of the second audio frame, and so on, up to s(N-1), which represents the pitch of the N-th audio frame.
Refer to Fig. 6, which is a schematic structural diagram of an embodiment of the correlation operation module shown in Fig. 5. The correlation operation module 103 can comprise: a mean calculation unit 1301, a normalization unit 1302, a sequence conversion unit 1303, and a correlation operation unit 1304.
The mean calculation unit 1301 is configured to calculate the mean of the reference feature sequence and the mean of the feature sequence to be measured, respectively.
The mean calculation unit 1301 can use formula (1) of the embodiment shown in Fig. 2 to calculate the mean MP of the reference feature sequence p(i) and the mean MS of the feature sequence to be measured s(i), respectively.
The normalization unit 1302 is configured to normalize the reference feature sequence using the mean of the reference feature sequence, and to normalize the feature sequence to be measured using the mean of the feature sequence to be measured.
The purpose of normalization is to bring the reference feature sequence and the feature sequence to be measured to the same baseline, so as to eliminate the calculation deviation caused by inconsistent standards in how the means of the two sequences are obtained. The normalization unit 1302 can use formula (2) of the embodiment shown in Fig. 2 to normalize the reference feature sequence, obtaining the normalized reference feature sequence p2(i), and can use formula (3) of the embodiment shown in Fig. 2 to normalize the feature sequence to be measured, obtaining the normalized feature sequence to be measured s2(i).
The sequence conversion unit 1303 is configured to use a preset clipping threshold to convert the normalized reference feature sequence into a reference numerical sequence and the normalized feature sequence to be measured into a numerical sequence to be measured.
The preset clipping threshold can be set according to actual needs; preferably, the clipping threshold Th_xue can be set using formula (4) of the embodiment shown in Fig. 2. The sequence conversion unit 1303 can use formula (5) of the embodiment shown in Fig. 2 to convert the normalized reference feature sequence into the reference numerical sequence p3(i), and can use formula (6) of the embodiment shown in Fig. 2 to convert the normalized feature sequence to be measured into the numerical sequence to be measured s3(i).
The correlation operation unit 1304 is configured to perform a correlation operation on the reference numerical sequence and the numerical sequence to be measured using a cross-correlation function, to obtain a correlation coefficient sequence.
In one feasible implementation of this embodiment, the correlation operation unit 1304 can use formula (7) of the embodiment shown in Fig. 2 to perform the correlation operation on the reference numerical sequence and the numerical sequence to be measured, obtaining the correlation coefficient sequence R(n). In another feasible implementation, the correlation operation unit 1304 can use formula (8) of the embodiment shown in Fig. 2 to perform the correlation operation on the two sequences, obtaining the correlation coefficient sequence R(n).
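The chain of units 1301 to 1304 can be illustrated with the following sketch. It does not reproduce formulas (1) to (8), which are defined in the embodiment of Fig. 2; instead it assumes mean subtraction for normalization, plain clamping to ±threshold for the clipping step, and an energy-normalized cross-correlation over all lags. All function names are hypothetical, and both sequences are assumed to have the same length N.

```python
def mean(seq):
    return sum(seq) / len(seq)


def normalize(seq):
    """Assumed normalization: subtract the sequence's own mean."""
    m = mean(seq)
    return [x - m for x in seq]


def clip(seq, threshold):
    """Assumed conversion: clamp each value into [-threshold, threshold]."""
    return [max(-threshold, min(threshold, x)) for x in seq]


def cross_correlation(p, s):
    """Energy-normalized cross-correlation for lags -(N-1) .. (N-1)."""
    n = len(p)
    energy = (sum(x * x for x in p) * sum(x * x for x in s)) ** 0.5
    if energy == 0:
        return [0.0] * (2 * n - 1)
    result = []
    for lag in range(-(n - 1), n):
        total = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                total += p[i] * s[j]
        result.append(total / energy)
    return result


def correlation_sequence(reference, measured, threshold):
    p3 = clip(normalize(reference), threshold)  # reference numerical sequence
    s3 = clip(normalize(measured), threshold)   # numerical sequence to be measured
    return cross_correlation(p3, s3)


# The same melody sung a fixed interval higher: mean removal makes the two agree.
R = correlation_sequence([60, 62, 64, 62], [72, 74, 76, 74], threshold=12)
```

Under these assumptions a constant offset between the singer and the reference does not lower the correlation, because each sequence is centered on its own mean before the comparison.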
In this embodiment of the present invention, the feature sequence to be measured of the sung sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file can be obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the sung sentence of the target audio file is evaluated based on that sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
Refer to Fig. 7, which is a schematic structural diagram of one embodiment of the evaluation module shown in Fig. 5. The evaluation module 104 can comprise: a first computing unit 1401 and a first evaluation unit 1402.
The first computing unit 1401 is configured to calculate the maximum value of the correlation coefficient sequence.
The first computing unit 1401 can use formula (9) of the embodiment shown in Fig. 3 to calculate the maximum value RMAX of the correlation coefficient sequence.
The first evaluation unit 1402 is configured to determine the maximum value of the correlation coefficient sequence as the evaluation score of the sung sentence.
The first evaluation unit 1402 can directly determine the maximum value of the correlation coefficient sequence as the evaluation score of the sung sentence to be evaluated; that is, the evaluation score of the sung sentence is the value of RMAX.
In this embodiment of the present invention, the feature sequence to be measured of the sung sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file can be obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the sung sentence of the target audio file is evaluated based on that sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
Refer to Fig. 8, which is a schematic structural diagram of another embodiment of the evaluation module shown in Fig. 5. The evaluation module 104 can comprise: a second computing unit 1411, a mapping unit 1412, and a second evaluation unit 1413.
The second computing unit 1411 is configured to calculate the maximum value of the correlation coefficient sequence.
For the computation process of the second computing unit 1411, refer to that of the first computing unit 1401 in the embodiment shown in Fig. 7; it is not repeated here.
The mapping unit 1412 is configured to map the maximum value of the correlation coefficient sequence into a preset score interval, to obtain the mapped value of the maximum value of the correlation coefficient sequence.
The preset score interval can be set according to actual needs; for example, it can be set to [0, 10] or to [0, 100]. The mapping unit 1412 can use [score_min, score_max] to denote the preset score interval and map the maximum value RMAX of the correlation coefficient sequence into this interval by a linear or nonlinear method; the resulting mapped value, denoted score, lies within [score_min, score_max].
The second evaluation unit 1413 is configured to determine the mapped value as the evaluation score of the sung sentence.
The second evaluation unit 1413 can determine the mapped value score as the evaluation score of the sung sentence to be evaluated; that is, the evaluation score of the sung sentence is the value of score.
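As one possible reading of the nonlinear option available to the mapping unit 1412, the sketch below applies a power-law curve before scaling into the score interval. The function name `map_nonlinear`, the exponent `gamma`, and the assumption that RMAX lies in [0, 1] are hypothetical; the text does not specify which nonlinear method is used.

```python
def map_nonlinear(rmax, score_min=0.0, score_max=100.0, gamma=0.5):
    """Map RMAX (assumed to lie in [0, 1]) into [score_min, score_max].

    gamma < 1 raises mid-range scores, gamma > 1 lowers them, and
    gamma == 1 reduces to a linear mapping. gamma is an illustrative parameter.
    """
    if gamma <= 0:
        raise ValueError("gamma must be positive")
    clamped = min(max(rmax, 0.0), 1.0)
    return score_min + (clamped ** gamma) * (score_max - score_min)
```

A curve of this kind keeps the endpoints of the interval fixed while changing how generously intermediate correlation peaks are scored.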
In this embodiment of the present invention, the feature sequence to be measured of the sung sentence to be evaluated in the target audio file and the reference feature sequence of the reference sentence in the source audio file can be obtained; a correlation operation on the two sequences yields a correlation coefficient sequence, and the sung sentence of the target audio file is evaluated based on that sequence. This both meets users' practical needs when using audio files and makes audio file applications more intelligent.
A person of ordinary skill in the art will understand that all or part of the processes in the above method embodiments can be implemented by a computer program instructing the relevant hardware. The program can be stored in a computer-readable storage medium and, when executed, can include the processes of the method embodiments described above. The storage medium can be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), or the like.
What is disclosed above is merely a preferred embodiment of the present invention and certainly cannot be used to limit the scope of rights of the present invention; equivalent variations made in accordance with the claims of the present invention therefore still fall within the scope covered by the present invention.

Claims (8)

1. An audio evaluation method, characterized by comprising:
obtaining the order, in a target audio file, of a sung sentence to be evaluated, and a feature sequence to be measured of the sung sentence;
locating a reference sentence in a source audio file according to the order of the sung sentence in the target audio file, and obtaining a reference feature sequence of the reference sentence;
calculating the mean of the reference feature sequence and the mean of the feature sequence to be measured, respectively;
normalizing the reference feature sequence using the mean of the reference feature sequence, and normalizing the feature sequence to be measured using the mean of the feature sequence to be measured;
converting, using a preset clipping threshold, the normalized reference feature sequence into a reference numerical sequence and the normalized feature sequence to be measured into a numerical sequence to be measured;
performing a correlation operation on the reference numerical sequence and the numerical sequence to be measured using a cross-correlation function, to obtain a correlation coefficient sequence;
determining the evaluation score of the sung sentence according to the correlation coefficient sequence.
2. The method of claim 1, characterized in that the feature sequence to be measured is the note sequence of the sung sentence, and the reference feature sequence is the note sequence of the reference sentence; or,
the feature sequence to be measured is the pitch sequence of the sung sentence, and the reference feature sequence is the pitch sequence of the reference sentence.
3. The method of claim 1 or 2, characterized in that determining the evaluation score of the sung sentence according to the correlation coefficient sequence comprises:
calculating the maximum value of the correlation coefficient sequence;
determining the maximum value of the correlation coefficient sequence as the evaluation score of the sung sentence.
4. The method of claim 1 or 2, characterized in that determining the evaluation score of the sung sentence according to the correlation coefficient sequence comprises:
calculating the maximum value of the correlation coefficient sequence;
mapping the maximum value of the correlation coefficient sequence into a preset score interval, to obtain the mapped value of the maximum value of the correlation coefficient sequence;
determining the mapped value as the evaluation score of the sung sentence.
5. An audio evaluation device, characterized by comprising:
a to-be-measured acquisition module, configured to obtain the order, in a target audio file, of a sung sentence to be evaluated, and a feature sequence to be measured of the sung sentence;
a reference acquisition module, configured to locate a reference sentence in a source audio file according to the order of the sung sentence in the target audio file, and to obtain a reference feature sequence of the reference sentence;
a correlation operation module, configured to perform a correlation operation on the reference feature sequence and the feature sequence to be measured, to obtain a correlation coefficient sequence;
an evaluation module, configured to determine the evaluation score of the sung sentence according to the correlation coefficient sequence;
wherein the correlation operation module comprises:
a mean calculation unit, configured to calculate the mean of the reference feature sequence and the mean of the feature sequence to be measured, respectively;
a normalization unit, configured to normalize the reference feature sequence using the mean of the reference feature sequence, and to normalize the feature sequence to be measured using the mean of the feature sequence to be measured;
a sequence conversion unit, configured to use a preset clipping threshold to convert the normalized reference feature sequence into a reference numerical sequence and the normalized feature sequence to be measured into a numerical sequence to be measured;
a correlation operation unit, configured to perform a correlation operation on the reference numerical sequence and the numerical sequence to be measured using a cross-correlation function, to obtain the correlation coefficient sequence.
6. The device of claim 5, characterized in that the feature sequence to be measured is the note sequence of the sung sentence, and the reference feature sequence is the note sequence of the reference sentence; or,
the feature sequence to be measured is the pitch sequence of the sung sentence, and the reference feature sequence is the pitch sequence of the reference sentence.
7. The device of claim 5 or 6, characterized in that the evaluation module comprises:
a first computing unit, configured to calculate the maximum value of the correlation coefficient sequence;
a first evaluation unit, configured to determine the maximum value of the correlation coefficient sequence as the evaluation score of the sung sentence.
8. The device of claim 5 or 6, characterized in that the evaluation module comprises:
a second computing unit, configured to calculate the maximum value of the correlation coefficient sequence;
a mapping unit, configured to map the maximum value of the correlation coefficient sequence into a preset score interval, to obtain the mapped value of the maximum value of the correlation coefficient sequence;
a second evaluation unit, configured to determine the mapped value as the evaluation score of the sung sentence.
CN201410362595.5A 2014-07-28 2014-07-28 A kind of audio frequency assessment method and device Active CN104157296B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410362595.5A CN104157296B (en) 2014-07-28 2014-07-28 A kind of audio frequency assessment method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410362595.5A CN104157296B (en) 2014-07-28 2014-07-28 A kind of audio frequency assessment method and device

Publications (2)

Publication Number Publication Date
CN104157296A CN104157296A (en) 2014-11-19
CN104157296B true CN104157296B (en) 2016-04-27

Family

ID=51882778

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410362595.5A Active CN104157296B (en) 2014-07-28 2014-07-28 A kind of audio frequency assessment method and device

Country Status (1)

Country Link
CN (1) CN104157296B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104464754A (en) * 2014-12-11 2015-03-25 北京中细软移动互联科技有限公司 Sound brand search method
CN105187936B (en) * 2015-06-15 2018-08-21 福建星网视易信息系统有限公司 Based on the method for broadcasting multimedia file and device for singing audio scoring
CN109979482B (en) * 2019-05-21 2021-12-07 科大讯飞股份有限公司 Audio evaluation method and device
CN115171729B (en) * 2022-09-02 2023-01-20 北京达佳互联信息技术有限公司 Audio quality determination method and device, electronic equipment and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101430876B (en) * 2007-11-08 2012-03-14 中国科学院声学研究所 Singing marking system and method
CN101441865A (en) * 2007-11-19 2009-05-27 盛趣信息技术(上海)有限公司 Method and system for grading sing genus game
WO2014043815A1 (en) * 2012-09-24 2014-03-27 Hitlab Inc. A method and system for assessing karaoke users

Also Published As

Publication number Publication date
CN104157296A (en) 2014-11-19


Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20161124

Address after: 510000 Room 1301, Building 2, No. 16 Keyun Road, Tianhe District, Guangzhou City, Guangdong Province

Patentee after: Guangzhou KuGou Networks Co., Ltd.

Address before: 518000 Room 403, East Block 2, SEG Science Park, Zhenxing Road, Futian District, Shenzhen City, Guangdong Province

Patentee before: Tencent Technology (Shenzhen) Co., Ltd.

CP02 Change in the address of a patent holder

Address after: No. 315, Huangpu Avenue middle, Tianhe District, Guangzhou City, Guangdong Province

Patentee after: GUANGZHOU KUGOU COMPUTER TECHNOLOGY Co.,Ltd.

Address before: 510000 room 1301, building 2, No.16 Keyun Road, Tianhe District, Guangzhou City, Guangdong Province

Patentee before: GUANGZHOU KUGOU COMPUTER TECHNOLOGY Co.,Ltd.