Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, be clearly and completely described the technical scheme in the embodiment of the present invention, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
In the embodiment of the present invention, audio file can include but not limited to: the file such as song, snatch of song.Source audio file refers to the file of the reference data that can be used for as audio frequency test and appraisal, such as: original singer's song, or the snatch of song intercepted from original singer's song etc.; Target audio file refers to the file needing to carry out audio frequency test and appraisal, such as: the song that user sings again for original singer's song, or the snatch of song intercepted from the song that user sings again etc.
In the embodiment of the present invention, audio file can be formed by least one audio frequency simple sentence order arrangement, needs the part of carrying out singing in the set description of this at least one audio frequency simple sentence audio file.For song A, the description of song A can be expressed as follows:
[661,860]aaaaaaaa
[1541,320]bbbbbbbb
[1871,245]cccccccc
……
In the description of above-mentioned song A, such as " aaaaaaaa ", " bbbbbbbb ", " cccccccc " can be respectively used to expression audio frequency simple sentence, " [] " before each audio frequency simple sentence is for describing the time attribute of corresponding audio frequency simple sentence, its unit interval is generally ms, such as: above-mentioned [661, 860] for the time attribute of description audio simple sentence " aaaaaaaa ", " 661 " wherein represent the start time of audio frequency simple sentence " aaaaaaaa ", " 860 " represent the duration of audio frequency simple sentence " aaaaaaaa ", suppose song A totally 5 minutes, audio frequency simple sentence " aaaaaaaa " is then sung from 661ms, lasting 860ms terminates to sing.According to the sequencing of start time, can determine the order of each audio frequency simple sentence that audio file comprises, such as: according to the description of above-mentioned song A, audio frequency simple sentence " aaaaaaaa " is first audio frequency simple sentence, and its order in song A is 1; Audio frequency simple sentence " bbbbbbbb " is second audio frequency simple sentence, and its order in song A is 2; By that analogy.Be understandable that, before each audio frequency simple sentence starts or also can comprise without the need to carrying out the part of singing after terminating, such as: 0-661ms time period of above-mentioned song A, this part can comprise prelude information then for without the need to carrying out the part of singing.
In the embodiment of the present invention, described source audio file is formed by least one audio frequency simple sentence order arrangement, and this audio frequency simple sentence can be described as performance simple sentence.Described target audio file is formed by least one audio frequency simple sentence order arrangement, and this audio frequency simple sentence can be described as performance simple sentence.Unless otherwise indicated, in the follow-up each embodiment of the present invention, described reference simple sentence refers in particular to the reference simple sentence of locating from described source audio file, and described performance simple sentence refers in particular to performance simple sentence to be tested and assessed in described target audio file.
Below in conjunction with Fig. 1-Fig. 4, the audio frequency assessment method that the embodiment of the present invention provides is described in detail.
Referring to Fig. 1, is the process flow diagram of a kind of audio frequency assessment method that the embodiment of the present invention provides; The method can comprise the following steps S101-step S104.
S101, obtains performance simple sentence the to be tested and assessed order in target audio file and the characteristic sequence to be measured of described performance simple sentence.
Note is note again, refers to the symbol of the carrying out of the sound for recording different length, can comprise whole note, minim, crotchet, quaver etc. kind.An audio frequency simple sentence can be expressed as a frame sequence of multiple audio frame composition, and each audio frame all carries note, and each note forms the melody of this audio frequency simple sentence according to the time order and function order of each audio frame in this audio frequency simple sentence.Pitch is pitch again, refers to the height of sound.An audio frequency simple sentence can be expressed as a frame sequence of multiple audio frame composition, and each audio frame all carries pitch, and each pitch forms the melody of this audio frequency simple sentence according to the time order and function order of each audio frame in this audio frequency simple sentence.To sum up, the sequence of notes of audio frequency simple sentence or pitch sequence all can reflect the melody characteristics of this audio frequency simple sentence.
In the embodiment of the present invention, preferably, performance simple sentence to be tested and assessed can be current performance simple sentence, and now namely audio frequency assessment process can be real-time audio assessment process.Wherein, current performance simple sentence refers to the performance simple sentence in described target audio file corresponding to the current time of playing.Target audio file is song A, example is described as: suppose song A totally 5 minutes with above-mentioned song A, the current time of playing is 1895ms, according to the description of song A, 1895ms belongs in the time period described by time attribute of audio frequency simple sentence " cccccccc ", can determine that audio frequency simple sentence " cccccccc " is for current performance simple sentence thus, further, can determine that audio frequency simple sentence " cccccccc " is for performance simple sentence to be tested and assessed, described in the order of performance simple sentence in target audio file to be tested and assessed be 3.
In this step, the characteristic sequence to be measured of this performance simple sentence to be tested and assessed can be obtained, described characteristic sequence to be measured for described in the sequence of notes of performance simple sentence to be tested and assessed or pitch sequence.Preferably, after the described performance simple sentence performance wait testing and assessing terminates, the characteristic sequence to be measured of this performance simple sentence to be tested and assessed is obtained.
S102, according to the order of described performance simple sentence in described target audio file, position reference simple sentence in source audio file, and obtain the described fixed reference feature sequence with reference to simple sentence.
Described source audio file can be formed with reference to the arrangement of simple sentence order by least one.In this step, the order of the reference simple sentence of locating in described source audio file is identical with performance simple sentence described the to be tested and assessed order in described target audio file.According to the above-mentioned example of the present embodiment, if with song A for target audio file, then the original singer song B of song A when publishing and distributing is source audio file, the order of performance simple sentence described to be tested and assessed is 3, the order of reference simple sentence that then song B locates also is 3, from song B, namely chooses the 3rd with reference to the test and appraisal benchmark of simple sentence as performance simple sentence described to be tested and assessed.
In a kind of feasible embodiment of the embodiment of the present invention, described characteristic sequence to be measured is the sequence of notes of described performance simple sentence, and described fixed reference feature sequence is the described sequence of notes with reference to simple sentence.In the embodiment that the another kind of the embodiment of the present invention is feasible, described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, and referenced characteristic sequence is the described pitch sequence with reference to simple sentence.
S103, carries out related operation to described fixed reference feature sequence and described characteristic sequence to be measured, obtains related coefficient sequence.
Because described fixed reference feature sequence can be used for the melody characteristics characterizing the reference simple sentence of locating in source audio file, described characteristic sequence to be measured can be used for the melody characteristics characterizing performance simple sentence to be tested and assessed in target audio file, this step to the related operation between described fixed reference feature sequence and described characteristic sequence to be measured, can obtain related coefficient sequence.
S104, according to described related coefficient sequence, determines the test and appraisal mark of described performance simple sentence.
In this step, described in the test and appraisal mark of performance simple sentence to be tested and assessed higher, the singing effect of performance simple sentence to be tested and assessed described in showing is better, the singing effect of its more close located reference simple sentence.Otherwise, described in the test and appraisal mark of performance simple sentence to be tested and assessed lower, the singing effect of performance simple sentence to be tested and assessed described in showing is poorer, and it more departs from the singing effect of located reference simple sentence.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Below in conjunction with accompanying drawing 2-accompanying drawing 4, each step in the audio frequency assessment method shown in accompanying drawing 1 is described in detail.It should be noted that, during subsequent figure 2-is embodiment illustrated in fig. 4, the reference simple sentence of locating comprises N number of audio frame, and described fixed reference feature sequence can be expressed as p (i); Wherein, i is integer, and 0≤i≤N-1.Particularly, if described fixed reference feature sequence is the described sequence of notes with reference to simple sentence, then p (0) represents the note of first audio frame in the reference simple sentence of locating, p (1) represents the note of second audio frame in the reference simple sentence of locating, by that analogy, p (N-1) represents the note of the N number of audio frame in the reference simple sentence of locating.If described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, then p (0) represents the pitch of first audio frame in the reference simple sentence of locating, p (1) represents the note of second audio frame in the reference simple sentence of locating, by that analogy, p (N-1) represents the note of the N number of audio frame in the reference simple sentence of locating.
Set performance simple sentence to be tested and assessed and comprise N number of audio frame, described characteristic sequence to be measured can be expressed as s (i), and wherein, i is integer, and 0≤i≤N-1.Particularly, if described characteristic sequence to be measured is the sequence of notes of described performance simple sentence, the note of first audio frame in performance simple sentence then to be tested and assessed described in s (0) expression, the note of second audio frame in performance simple sentence to be tested and assessed described in s (1) represents, by that analogy, the note of the N number of audio frame in performance simple sentence to be tested and assessed described in s (N-1) expression.If described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, the pitch of first audio frame in performance simple sentence then to be tested and assessed described in s (0) expression, the pitch of second audio frame in performance simple sentence to be tested and assessed described in s (1) represents, by that analogy, the pitch of the N number of audio frame in performance simple sentence to be tested and assessed described in s (N-1) expression.
Referring to Fig. 2, is the process flow diagram of an embodiment of the step S103 shown in Fig. 1; This step S103 can comprise the following steps s1001-step s1004.
S1001, calculates the average of described fixed reference feature sequence and the average of described characteristic sequence to be measured respectively.
In this step, following formula (1) can be adopted to calculate the average of described fixed reference feature sequence p (i) and the average of described characteristic sequence s (i) to be measured respectively, and this formula (1) is as follows:
MP=mean(p(i))
MS=mean(s(i))(1)
In above-mentioned formula (1), MP represents the average of described fixed reference feature sequence p (i), and MS represents the average of described fixed reference feature sequence p (i), and mean () is operation of averaging.
S1002, adopts the average of described fixed reference feature sequence, carries out regular process to described fixed reference feature sequence, adopts the average of described characteristic sequence to be measured, carries out regular process to described characteristic sequence to be measured.
The object of regular process is: described fixed reference feature sequence and described characteristic sequence to be measured are adjusted to same benchmark, asks for inconsistent the brought calculation deviation of standard affect to eliminate described fixed reference feature sequence and described characteristic sequence to be measured due to average.
In this step, formula (2) can be adopted to carry out regular process to described fixed reference feature sequence, and this formula (2) can be expressed as follows:
p2(i)=p(i)-MP(2)
In above-mentioned formula (2), the fixed reference feature sequence that p2 (i) obtains after representing regular process.
In this step, formula (3) can be adopted to carry out regular process to described characteristic sequence to be measured, and this formula (3) can be expressed as follows:
s2(i)=s(i)-MS(3)
In above-mentioned formula (3), the characteristic sequence to be measured that s2 (i) obtains after representing regular process.
S1003, adopts and presets clipping threshold, the described fixed reference feature sequence after regular process is converted to referential data sequence, the characteristic sequence described to be measured after regular process is converted to sequence of values to be measured.
Wherein, described default clipping threshold can set according to actual needs, and preferably, described clipping threshold can adopt formula (4) to set, and this formula (4) can be expressed as follows:
Th_xue=max(max(abs(p2(i)),max(abs(s2(i)))(4)
In above-mentioned formula (4), Th_xue represents described default clipping threshold, and max () asks for operation for maximal value, and abs () asks for operation for absolute value.
In this step, can adopt formula (5) that the described fixed reference feature sequence after regular process is converted to referential data sequence, this formula (5) can be expressed as follows:
In above-mentioned formula (5), p3 (i) represents referential data sequence.
In this step, can adopt formula (6) that the characteristic sequence described to be measured after regular process is converted to sequence of values to be measured, this formula (6) can be expressed as follows:
In above-mentioned formula (6), s3 (i) represents sequence of values to be measured.
S1004, adopts cross correlation function to carry out related operation to described referential data sequence and described sequence of values to be measured, obtains related coefficient sequence.
In a kind of feasible embodiment of this step, formula (7) can be adopted to carry out related operation to described referential data sequence and described sequence of values to be measured, and this formula (7) can be expressed as follows:
In above-mentioned formula (7), R (n) represents related coefficient sequence; " " represents multiplication operations; S3 (i-n) represents the sequence that s3 (i) loopy moving n is formed, wherein, and 0≤n≤N-1.
In the embodiment that the another kind of this step is feasible, formula (8) can be adopted to carry out related operation to described referential data sequence and described sequence of values to be measured, and this formula (8) can be expressed as follows:
In above-mentioned formula (8), R (n) represents related coefficient sequence; " " represents multiplication operations; P3 (i-n) represents the sequence that p3 (i) loopy moving n is formed, wherein, and 0≤n≤N-1.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Referring to Fig. 3, is the process flow diagram of an embodiment of the step S104 shown in Fig. 1; This step S104 can comprise the following steps s2001-step s2002.
S2001, calculates the maximal value of described related coefficient sequence.
In this step, following formula (9) can be adopted to calculate the maximal value of described related coefficient sequence, and this formula (9) can be expressed as follows:
RMAX=max(R(n))(9)
In above-mentioned formula (9), R (n) represents related coefficient sequence; Max () asks for operation for maximal value; RMAX represents the maximal value of described related coefficient sequence.
S2002, by the maximal value of described related coefficient sequence, is defined as the test and appraisal mark of described performance simple sentence.
The maximal value of described related coefficient sequence directly can be defined as the test and appraisal mark of described performance simple sentence to be tested and assessed by this step, and namely the test and appraisal mark of described performance simple sentence is the value of RMAX.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Referring to Fig. 4, is the process flow diagram of another embodiment of the step S104 shown in Fig. 1; This step S104 can comprise the following steps s3001-step s3003.
S3001, calculates the maximal value of described related coefficient sequence.
This step can the step s2001 of embodiment shown in Figure 3, is not repeated herein.
S3002, maps to preset fraction interval by the maximal value of described related coefficient sequence, obtains the mapping value of the maximal value of described related coefficient sequence.
Described preset fraction interval can set according to actual needs, such as: described preset fraction interval can be set as [0,10]; Or described preset fraction interval can be set as [0,100].In this step, [score_min can be adopted, score_max] represent that described preset fraction is interval, by linear or nonlinear method, the maximal value RMAX of described related coefficient sequence is mapped to described preset fraction interval, the mapping value obtained can represent score, this score is positioned in the preset fraction interval shown in [score_min, score_max].
S3003, is defined as the scoring of described performance simple sentence by described mapping value.
Described mapping value score can be defined as the test and appraisal mark of described performance simple sentence to be tested and assessed by this step, and namely the test and appraisal mark of described performance simple sentence is the value of score.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Below in conjunction with Fig. 5-Fig. 8, a kind of audio frequency assessment device that the embodiment of the present invention provides is described in detail.It should be noted that, the audio frequency assessment device described in Fig. 5-Fig. 8 can be applicable to perform the method shown in above-mentioned accompanying drawing 1-accompanying drawing 4.In practical application, described audio frequency assessment device can run on server end, or runs in such as notebook computer, mobile phone, PAD (panel computer), intelligent wearable device etc. terminal.
Referring to Fig. 5, is the structural representation of a kind of audio frequency assessment device that the embodiment of the present invention provides; This device can comprise: acquisition module 101 to be measured, reference acquisition module 102, related operation module 103 and test and appraisal module 104.
Acquisition module 101 to be measured, for obtaining performance simple sentence the to be tested and assessed order in target audio file and the characteristic sequence to be measured of described performance simple sentence.
Note is note again, refers to the symbol of the carrying out of the sound for recording different length, can comprise whole note, minim, crotchet, quaver etc. kind.An audio frequency simple sentence can be expressed as a frame sequence of multiple audio frame composition, and each audio frame all carries note, and each note forms the melody of this audio frequency simple sentence according to the time order and function order of each audio frame in this audio frequency simple sentence.Pitch is pitch again, refers to the height of sound.An audio frequency simple sentence can be expressed as a frame sequence of multiple audio frame composition, and each audio frame all carries pitch, and each pitch forms the melody of this audio frequency simple sentence according to the time order and function order of each audio frame in this audio frequency simple sentence.To sum up, the sequence of notes of audio frequency simple sentence or pitch sequence all can reflect the melody characteristics of this audio frequency simple sentence.
In the embodiment of the present invention, preferably, performance simple sentence to be tested and assessed can be current performance simple sentence, and now namely audio frequency assessment process can be real-time audio assessment process.Wherein, current performance simple sentence refers to the performance simple sentence in described target audio file corresponding to the current time of playing.Target audio file is song A, example is described as: suppose song A totally 5 minutes with above-mentioned song A, the current time of playing is 1895ms, according to the description of song A, 1895ms belongs in the time period described by time attribute of audio frequency simple sentence " cccccccc ", can determine that audio frequency simple sentence " cccccccc " is for current performance simple sentence thus, further, can determine that audio frequency simple sentence " cccccccc " is for performance simple sentence to be tested and assessed, described in the order of performance simple sentence in target audio file to be tested and assessed be 3.
Described acquisition module to be measured 101 can obtain the characteristic sequence to be measured of this performance simple sentence to be tested and assessed, described characteristic sequence to be measured for described in the sequence of notes of performance simple sentence to be tested and assessed or pitch sequence.Preferably, described acquisition module 101 to be measured, after the described performance simple sentence performance wait testing and assessing terminates, obtains the characteristic sequence to be measured of this performance simple sentence to be tested and assessed.
With reference to acquisition module 102, for according to the order of described performance simple sentence in described target audio file, position reference simple sentence in source audio file, and obtain the described fixed reference feature sequence with reference to simple sentence.
Described source audio file can be formed with reference to the arrangement of simple sentence order by least one.The order of reference simple sentence in described source audio file that described reference acquisition module 102 is located is identical with performance simple sentence described the to be tested and assessed order in described target audio file.According to the above-mentioned example of the present embodiment, if with song A for target audio file, then the original singer song B of song A when publishing and distributing is source audio file, the order of performance simple sentence described to be tested and assessed is 3, the order of reference simple sentence that then song B locates also is 3, from song B, namely chooses the 3rd with reference to the test and appraisal benchmark of simple sentence as performance simple sentence described to be tested and assessed.
In a kind of feasible embodiment of the embodiment of the present invention, described characteristic sequence to be measured is the sequence of notes of described performance simple sentence, and described fixed reference feature sequence is the described sequence of notes with reference to simple sentence.In the embodiment that the another kind of the embodiment of the present invention is feasible, described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, and referenced characteristic sequence is the described pitch sequence with reference to simple sentence.
Related operation module 103, for carrying out related operation to described fixed reference feature sequence and described characteristic sequence to be measured, obtains related coefficient sequence.
Because described fixed reference feature sequence can be used for the melody characteristics characterizing the reference simple sentence of locating in source audio file, described characteristic sequence to be measured can be used for the melody characteristics characterizing performance simple sentence to be tested and assessed in target audio file, described related operation module 103 to the related operation between described fixed reference feature sequence and described characteristic sequence to be measured, can obtain related coefficient sequence.
Test and appraisal module 104, for according to described related coefficient sequence, determines the test and appraisal mark of described performance simple sentence.
The test and appraisal mark of performance simple sentence described to be tested and assessed is higher, and the singing effect of performance simple sentence to be tested and assessed described in showing is better, the singing effect of its more close located reference simple sentence.Otherwise, described in the test and appraisal mark of performance simple sentence to be tested and assessed lower, the singing effect of performance simple sentence to be tested and assessed described in showing is poorer, and it more departs from the singing effect of located reference simple sentence.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Below in conjunction with accompanying drawing 6-accompanying drawing 8, the 26S Proteasome Structure and Function of each module in the audio frequency assessment device shown in accompanying drawing 5 is described in detail.It should be noted that, during subsequent figure 6-is embodiment illustrated in fig. 8, the reference simple sentence of locating comprises N number of audio frame, and described fixed reference feature sequence can be expressed as p (i); Wherein, i is integer, and 0≤i≤N-1.Particularly, if described fixed reference feature sequence is the described sequence of notes with reference to simple sentence, then p (0) represents the note of first audio frame in the reference simple sentence of locating, p (1) represents the note of second audio frame in the reference simple sentence of locating, by that analogy, p (N-1) represents the note of the N number of audio frame in the reference simple sentence of locating.If described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, then p (0) represents the pitch of first audio frame in the reference simple sentence of locating, p (1) represents the note of second audio frame in the reference simple sentence of locating, by that analogy, p (N-1) represents the note of the N number of audio frame in the reference simple sentence of locating.
Set performance simple sentence to be tested and assessed and comprise N number of audio frame, described characteristic sequence to be measured can be expressed as s (i), and wherein, i is integer, and 0≤i≤N-1.Particularly, if described characteristic sequence to be measured is the sequence of notes of described performance simple sentence, the note of first audio frame in performance simple sentence then to be tested and assessed described in s (0) expression, the note of second audio frame in performance simple sentence to be tested and assessed described in s (1) represents, by that analogy, the note of the N number of audio frame in performance simple sentence to be tested and assessed described in s (N-1) expression.If described characteristic sequence to be measured is the pitch sequence of described performance simple sentence, the pitch of first audio frame in performance simple sentence then to be tested and assessed described in s (0) expression, the pitch of second audio frame in performance simple sentence to be tested and assessed described in s (1) represents, by that analogy, the pitch of the N number of audio frame in performance simple sentence to be tested and assessed described in s (N-1) expression.
Referring to Fig. 6, is the structural representation of the embodiment of the related operation module shown in Fig. 5; Described related operation module 103 can comprise: average calculation unit 1301, regular processing unit 1302, sequence converting unit 1303 and related operation unit 1304.
Average calculation unit 1301, for the average of the average and described characteristic sequence to be measured that calculate described fixed reference feature sequence respectively.
Described average calculation unit 1301 can adopt embodiment illustrated in fig. 2 in formula (1) calculate the average MP of described fixed reference feature sequence p (i) and the average MS of described characteristic sequence s (i) to be measured respectively.
Regular processing unit 1302, for adopting the average of described fixed reference feature sequence, carrying out regular process to described fixed reference feature sequence, adopting the average of described characteristic sequence to be measured, carry out regular process to described characteristic sequence to be measured.
The object of regular process is: described fixed reference feature sequence and described characteristic sequence to be measured are adjusted to same benchmark, asks for inconsistent the brought calculation deviation of standard affect to eliminate described fixed reference feature sequence and described characteristic sequence to be measured due to average.Described regular processing unit 1302 can adopt embodiment illustrated in fig. 2 in formula (2) regular process is carried out to described fixed reference feature sequence, fixed reference feature sequence p2 (i) obtained after obtaining regular process; And can adopt embodiment illustrated in fig. 2 in formula (3) regular process is carried out to described characteristic sequence to be measured, characteristic sequence s2 (i) to be measured obtained after obtaining regular process.
Sequence converting unit 1303, for adopting default clipping threshold, being converted to referential data sequence by the described fixed reference feature sequence after regular process, the characteristic sequence described to be measured after regular process is converted to sequence of values to be measured.
Wherein, described default clipping threshold can set according to actual needs, preferably, described clipping threshold Th_xue can adopt embodiment illustrated in fig. 2 in formula (4) set.Described sequence converting unit 1303 can adopt embodiment illustrated in fig. 2 in formula (5) the described fixed reference feature sequence after regular process is converted to referential data sequence p3 (i); And can adopt embodiment illustrated in fig. 2 in formula (6) characteristic sequence described to be measured after regular process is converted to sequence of values s3 (i) to be measured.
Related operation unit 1304, for adopting cross correlation function to carry out related operation to described referential data sequence and described sequence of values to be measured, obtains related coefficient sequence.
In a kind of feasible embodiment of the present embodiment, described related operation unit 1304 can adopt embodiment illustrated in fig. 2 in formula (7) related operation is carried out to described referential data sequence and described sequence of values to be measured, obtain related coefficient sequence R (n).In the embodiment that the another kind of the present embodiment is feasible, described related operation unit 1304 can adopt embodiment illustrated in fig. 2 in formula (8) related operation is carried out to described referential data sequence and described sequence of values to be measured, obtain related coefficient sequence R (n).
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Referring to Fig. 7, is the structural representation of an embodiment of the test and appraisal module shown in Fig. 5; Described test and appraisal module 104 can comprise: the first computing unit 1401 and the first test and appraisal unit 1402.
First computing unit 1401, for calculating the maximal value of described related coefficient sequence.
Described first computing unit 1401 can adopt embodiment illustrated in fig. 3 in formula (9) calculate the maximal value RMAX of described related coefficient sequence.
First test and appraisal unit 1402, for the maximal value by described related coefficient sequence, is defined as the test and appraisal mark of described performance simple sentence.
The maximal value of described related coefficient sequence directly can be defined as the test and appraisal mark of described performance simple sentence to be tested and assessed by described first test and appraisal unit 1402, and namely the test and appraisal mark of described performance simple sentence is the value of RMAX.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
Referring to Fig. 8, is the structural representation of another embodiment of the test and appraisal module shown in Fig. 5; Described test and appraisal module 104 can comprise: the second computing unit 1411, map unit 1412 and the second test and appraisal unit 1413.
Second computing unit 1411, for calculating the maximal value of described related coefficient sequence.
The computation process of described second computing unit 1411 can the computation process of described first computing unit 1401 in embodiment shown in Figure 7, is not repeated herein.
Map unit 1412, for the maximal value of described related coefficient sequence is mapped to preset fraction interval, obtains the mapping value of the maximal value of described related coefficient sequence.
Described preset fraction interval can set according to actual needs, such as: described preset fraction interval can be set as [0,10]; Or described preset fraction interval can be set as [0,100].Described map unit 1412 can adopt [score_min, score_max] represent that described preset fraction is interval, by linear or nonlinear method, the maximal value RMAX of described related coefficient sequence is mapped to described preset fraction interval, the mapping value obtained can represent score, this score is positioned in the preset fraction interval shown in [score_min, score_max].
Second test and appraisal unit 1413, for being defined as the test and appraisal mark of described performance simple sentence by described mapping value.
Described mapping value score can be defined as the test and appraisal mark of described performance simple sentence to be tested and assessed by described second test and appraisal unit 1413, and namely the test and appraisal mark of described performance simple sentence is the value of score.
In the embodiment of the present invention, the characteristic sequence to be measured of performance simple sentence to be tested and assessed in target audio file can be obtained, and the fixed reference feature sequence of reference simple sentence in acquisition source audio file, by obtaining related coefficient sequence to the related operation of characteristic sequence to be measured and fixed reference feature sequence, the test and appraisal of the performance simple sentence to target audio file are achieved based on this related coefficient sequence, both met user for the actual demand in the application process of audio file, the application improving again audio file is intelligent.
One of ordinary skill in the art will appreciate that all or part of flow process realized in above-described embodiment method, that the hardware that can carry out instruction relevant by computer program has come, described program can be stored in a computer read/write memory medium, this program, when performing, can comprise the flow process of the embodiment as above-mentioned each side method.Wherein, described storage medium can be magnetic disc, CD, read-only store-memory body (Read-OnlyMemory, ROM) or random store-memory body (RandomAccessMemory, RAM) etc.
Above disclosedly be only present pre-ferred embodiments, certainly can not limit the interest field of the present invention with this, therefore according to the equivalent variations that the claims in the present invention are done, still belong to the scope that the present invention is contained.