CN113744764B - Method for obtaining optimal comparison path of performance time value information and score time value information - Google Patents

Method for obtaining optimal comparison path of performance time value information and score time value information Download PDF

Info

Publication number
CN113744764B
CN113744764B CN202111036222.5A CN202111036222A CN113744764B CN 113744764 B CN113744764 B CN 113744764B CN 202111036222 A CN202111036222 A CN 202111036222A CN 113744764 B CN113744764 B CN 113744764B
Authority
CN
China
Prior art keywords
matrix
path
information
time value
performance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202111036222.5A
Other languages
Chinese (zh)
Other versions
CN113744764A (en
Inventor
李惠子
曹琛
冯亚星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Average Law Technology Co ltd
Original Assignee
Shenzhen Average Law Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Average Law Technology Co ltd filed Critical Shenzhen Average Law Technology Co ltd
Priority to CN202111036222.5A priority Critical patent/CN113744764B/en
Publication of CN113744764A publication Critical patent/CN113744764A/en
Application granted granted Critical
Publication of CN113744764B publication Critical patent/CN113744764B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10HELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
    • G10H2210/00Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/076Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of timing, tempo; Beat detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • G10L2025/906Pitch tracking
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Auxiliary Devices For Music (AREA)

Abstract

The invention discloses a method for obtaining an optimal comparison path of performance time value information and score time value information, which mainly solves the problems that the prior evaluation system and method in the prior art mechanically require the matching of actual performance data and standard performance data, neglect the non-fluency of the actual performance process, lack the evaluation of fine technical points such as voice region errors, snap, drags and beats, unstable rhythm and the like, and cause lower performance evaluation. The invention obtains the playing sound information and the music score information; then, respectively performing code conversion on the performance sound information and the music score information to generate a matrix; then inputting the matrix into a corresponding comparison function to generate a comparison matrix; then planning a path according to the comparison matrix, and producing an optimal path; then, the matrix generated first is segmented according to the position index of the optimal comparison path; and finally, evaluating according to the evaluation dictionary and outputting an evaluation language. Through the scheme, the aim of detail comment can be achieved.

Description

Method for obtaining optimal comparison path of performance time value information and score time value information
Technical Field
The invention relates to the technical field of music analysis, in particular to a method for obtaining an optimal comparison path of performance time value information and score time value information.
Background
Currently, performance evaluation methods for musical instruments are generally based on systems for additionally evaluating performance states on electronic musical instruments; the method and system mainly store standard performance data in an electronic musical instrument or electronic musical instrument metadata, and perform performance evaluation by comparing the standard performance data with actual performance data.
The method has a plurality of barriers in musical instrument playing training and playing evaluation suitable for music education, and mainly comprises the following steps: highly dependent on electronic musical instruments or electronic musical instrument accessories, but most of music education adopts non-electronic musical instruments; in the process of performing performance training, a musical instrument learner needs to gradually improve the performance technology, and the requirement of standard performance data is generally difficult to directly reach, while the method generally mechanically requires the matching of actual performance data and standard performance data, neglects the non-fluency of the actual performance process, and thus causes lower performance evaluation; the core purpose of performance training of a learner of a musical instrument is to improve performance technology, and the above evaluation method generally involves only overall evaluation, lacks evaluation of fine technical points such as a voice region error, snap, a trawling, a rhythm instability, and the like, and cannot well assist performance training of the learner of a musical instrument by adopting the above technology.
Disclosure of Invention
The invention aims to provide a method for obtaining an optimal comparison path of performance time value information and score time value information, so as to solve the problem that the conventional evaluation system and method mechanically require the matching of actual performance data and standard performance data, neglect the non-fluency of the actual performance process, lack the evaluation of fine technical points such as voice region errors, snap, drags and beats, unstable rhythm and the like, and cause lower performance evaluation.
In order to solve the problems, the invention provides the following technical scheme:
The method for obtaining the optimal comparison path of the performance pitch information and the melody pitch information comprises the following steps:
(A1) Acquiring pitch information of a performance sound and pitch information of a melody;
(A2) Performing code conversion on the pitch information of the performance sound and the pitch information of the melody respectively to generate a pitch matrix;
(A3) Inputting the pitch matrix of (A2) into a pitch comparison function to generate a pitch comparison matrix;
(A4) And (3) carrying out path planning on the pitch comparison matrix of the step (A3), and generating a pitch minimum score path as an optimal path.
Specifically, the PITCH information of the melody in step (A1) is taken from the information of < NOTE < PITCH < stem > < OCTAVE > > tag in the melody file in the extensible markup format (XML/MusicXML).
Specifically, in step (A2), the pitch information is code-converted into a two-dimensional boolean matrix of 128×n, the column coordinates represent the position index of each of the played notes in the pitch sequence, the row coordinates represent the position indexes of 128 semitone notes of absolute pitches C-1 to G9 in scientific notation, the element values in the matrix are represented by 0 or 1, 0 represents not playing the corresponding notes, and 1 represents playing the corresponding notes.
Specifically, the pitch comparison function in step (A3) is a composite function including a pitch scoring function and a pitch matching function;
the pitch scoring function is:
The pitch matching function is:
wherein,
A j represents the column vector for converting the pitch information of the sound into a pitch matrix in step (A2);
x i represents the column vector of converting the cursive pitch information into a pitch matrix in step (A2);
If the expression x i=Xi*aj is satisfied between the ith column of the music spectrum pitch matrix and the jth column of the music spectrum pitch matrix, judging that the two columns are matched, assigning the (i+1, j+1) position of the scoring matrix as a matching score m 1, otherwise, assigning the (i+1, j+1) position of the scoring matrix as a matching score u 1;
g 1 represents the insertion score of g 1 for each bit of representation moved in the pitch matrix from left to right and top to bottom, introducing an insertion operation.
Specifically, the path planning in the step (A4) is minimum score path dynamic planning based on position indexes, and the specific process is that the position indexes of the music notes and the sound notes are shifted by 1 positive unit integrally through position index transformation, 0 is used for representing insertion or deletion, and a position index (r, c) sequence of the optimal path is generated according to a minimum score index function;
Wherein i is the ith column of the cursive pitch matrix; j is the j-th column of the pitch matrix of the sound; let R sequence be the set of all R; the C sequence is that a set r of all C represents a certain row coordinate or 0 in the matching path, C represents a certain column coordinate or 0 in the matching path, wherein 0 represents a placeholder and is used for representing insertion and deletion; the sequence R represents a matching sequence of sound information and consists of row coordinates and 0; sequence C represents a matching sequence of the cursive information, consisting of column coordinates and 0.
Specifically, the method for obtaining the optimal comparison path of the performance duration information and the score duration information comprises the following steps:
(B1) Acquiring performance sound duration information and melody duration information;
(B2) Performing code conversion on the (B1) performance sound time value information and the music spectrum time value information respectively to generate a time value matrix;
(B3) Inputting the timing value matrix of the step (B2) into a timing value comparison function to generate a timing value comparison matrix;
(B4) And (3) carrying out path planning according to the timing value comparison matrix of the step (B3), and generating a timing value minimum variance path as an optimal path.
Specifically, the music score value information in step (B1) is taken from the < NOTE < DURATION > > tagged information in the music score file in the extensible markup format (XML/MusicXML).
Specifically, in step (B2), the time value information is code-converted into a two-dimensional numerical matrix 1*n, the column coordinates represent the position index of each performance note in the time value sequence, and the element values in the matrix are represented by integer numerical values and represent the number of time frames.
Specifically, the value comparison function in step (B3) is:
Wherein, the matrix of the curvelet time value is Sound duration matrix is/>Wherein y i represents the ith column vector of the curvelet matrix and b j represents the jth column vector of the sound value matrix; dura_s is the duration alignment matrix.
Specifically, the path planning in the step (B4) is a minimum variance path dynamic planning based on the position index, and the specific process is that the alignment matrix of the timing values is traversed by a directed graph, and the path planning from DURA_S (0,0) to DURA_S (0,0) is acquiredCalculating the variance of each path to obtain a minimum variance path, and converting the minimum variance path into a (r, c) sequence based on a position index, wherein 0 represents insertion or deletion, and shifting the whole of the position indexes of the music notes and the sound notes by 1 positive unit through position index conversion;
Where the duration of each position index (i, j) of DURA_S path(i,j) deviates, E (DURA_S path) represents the average of the duration deviations of the entire path, and N represents the number of path positions.
A comparison method of performance sound information and music score information includes the following steps:
(C1) Obtaining an optimal comparison path of the pitch information according to an obtaining method of the optimal comparison path of the performance pitch information and the melody pitch information;
(C2) Obtaining an optimal comparison path of the time value information according to an obtaining method of the optimal comparison path of the performance time value information and the score time value information;
(C3) Obtaining an optimal comparison path according to the comparison paths in the step (C1) (C2);
(C4) Cutting the pitch matrix and the duration matrix of the step (B2) according to the position index of the optimal comparison path of the step (C3) to generate a section, phrase and paragraph matrix;
(C5) Constructing evaluation rules of different levels in advance to generate an evaluation dictionary;
(C6) And (3) generating evaluation words of different matrixes according to the different matrixes generated in the step (C4) and the evaluation dictionary in the step (C5).
Specifically, the evaluation word dictionary in the step (C5) comprises evaluation words with four evaluation levels of pitch, rhythm and fluency, notes, phrases, music pieces and music pieces; the evaluation word includes: region error, snap, drag beat, rhythm instability, temporary change sign error, pitch error, mispronounce, snap, drag beat, rhythm instability, cross-bar incoherence, phrase incoherence.
Specifically, the specific process of step (C3) is: the path sequencing value rank=p×w, P is an attribute sequence, W is a weight sequence, and the path corresponding to the rank maximum value is the optimal path; adding R, C in step (A4) to sequences R and C, respectively, wherein the eight attribute values of attribute sequence P are in turn the number proportion of element 0 in sequence R, the number proportion of element 0 in sequence C, the relative start position of element 0 in sequence R, the relative start position of element 0 in sequence C, the relative end position of element 0 in sequence R, the relative end position of element 0 in sequence C, the maximum continuous number proportion of element 0 in sequence R, the maximum continuous number proportion of element 0 in sequence C, and the weight sequence w=w 1w2w3w4w5w6w7w8.
Specifically, the specific process of step (C4) is: the method comprises the steps of splitting into position indexes of optimal comparison paths generated in step (C3) according to position indexes of a music spectrum file < measurement > tag of extensible markup (XML/musicXML), pre-constructed phrase position indexes and paragraph position indexes, generating splitting marks, and splitting matrixes in step (A2) and step (B2) into sections, phrases and paragraph matrixes respectively according to the splitting marks.
The comparison system of the performance sound information and the music score information comprises an information extraction module, a vector conversion module, a comparison matrix calculation module, a path planning module and a performance evaluation module which are connected in sequence.
Specifically, the information extraction module comprises an audio receiving module and a stored music score; the audio receiving module and the stored music score are respectively connected with the time value information extracting module and the pitch information extracting module;
the vector conversion module comprises a time value vector conversion module and a pitch vector conversion module which are respectively connected with the time value information extraction module and the pitch information extraction module;
The comparison matrix calculation module comprises a time value comparison matrix calculation module and a pitch comparison matrix calculation module which are respectively connected with the time value vector conversion module and the pitch vector conversion module;
the path planning module comprises a sequencing module, a time value minimum variance path planning module and a pitch minimum score path planning module which are respectively connected with the time value comparison matrix calculation module and the pitch comparison matrix calculation module; the pitch minimum score path planning module is sequentially connected with the time value minimum variance path planning module and the sequencing module;
The performance evaluation module comprises a segmentation module and an evaluation module which are connected with the sequencing module;
The sound value information extraction module and the sound height information extraction module are respectively used for extracting sound value information and sound height information of the music score and the sound received by the sound receiving module;
The time value vector conversion module and the sound height vector conversion module are respectively used for converting pitch information and time value information in the melody and the sound into two-dimensional matrixes with corresponding coding formats;
The time value comparison matrix calculation module and the pitch comparison matrix calculation module are respectively used for calculating a pitch comparison matrix and a time value comparison matrix according to the comparison function;
The time value minimum variance path planning module and the pitch minimum score path planning module are used for carrying out path dynamic planning and sorting on the comparison matrix to obtain an optimal path;
the segmentation module is used for segmenting the pitch matrix and the duration matrix into the section, phrase and paragraph matrix.
Specifically, the sorting value in the sorting module is the product of the attribute sequence P and the weight sequence W, where the eight attribute values of the attribute sequence P are in turn the number proportion of the elements 0 in the sequence R, the number proportion of the elements 0 in the sequence C, the relative start position of the elements 0 in the sequence R, the relative start position of the elements 0 in the sequence C, the relative end position of the elements 0 in the sequence R, the relative end position of the elements 0 in the sequence C, the maximum continuous number proportion of the elements 0 in the sequence R, the maximum continuous number proportion of the elements 0 in the sequence C, and the weight sequence w=w 1w2w3w4w5w6w7w8.
A computer device comprises a processor and a memory in which a computer program is stored which, when loaded and executed by the processor, implements the method of obtaining.
A computer readable medium has a computer program stored therein, the computer program being loaded and executed by a processor to implement the obtaining method.
Compared with the prior art, the invention has the following beneficial effects:
(1) The invention breaks away from dependence on electronic musical instruments and electronic musical instrument accessories by means of voice recognition analysis, and enlarges the range of musical instrument performance evaluation; the method fully considers the unsmooth of a musical instrument learner in the performance training process, matches the actual performance data with the standard performance data through a comparison algorithm, and has more accurate and flexible evaluation results; the evaluation word dictionary is constructed, the evaluation level is more abundant, the evaluation of the performance technology is not limited to overall evaluation, such as overall scoring, and the evaluation granularity can reach the evaluation of the performance technology point of a single performance note, so that the performance training is better assisted.
(2) The invention constructs a comparison function converted into a matrix by coding and based on column vectors (position index vectors) and a dynamic programming algorithm based on the position index on the basis of the position index, can be used for realizing d-dimensional (d is more than or equal to 2) information comparison and generating a comparison path based on the position index, and expands from one-dimensional character string comparison to multi-dimensional matrix comparison.
(3) The invention adopts the playing sound as the input source, thereby eliminating the dependence on the electronic musical instrument and the electronic musical instrument elements in the current playing evaluation system; aiming at the problem that the instrument learner frequently has unsmooth problems such as rebound, sound leakage and the like in the performance training process, the comparison concepts such as 'insert' and 'delete' are introduced, and the actual performance of the instrument learner is identified through a comparison algorithm; the invention realizes multi-level and multi-dimensional performance evaluation by constructing evaluation rules and evaluation word dictionaries of five levels of notes, bars, phrases, paragraphs, tracks and the like and different performance dimensions of pitch, time value, smoothness, strength and the like.
Drawings
Fig. 1 is a block diagram of an alignment system.
Fig. 2 is an example of a coding scheme of a pitch vector matrix.
Fig. 3 is an example of a section and phrase cut.
Fig. 4 is an example of an evaluation dictionary.
Detailed Description
The invention will now be further described with reference to the accompanying drawings and examples, embodiments of which include, but are not limited to, the following examples.
As shown in fig. 1 to 4, the audio receiving module of the present invention is configured to receive performance sounds; the invention obtains the playing sound information and the music score information; then, respectively performing code conversion on the performance sound information and the music score information to generate a matrix; then inputting the matrix into a corresponding comparison function to generate a comparison matrix; then planning a path according to the comparison matrix, and producing an optimal path; then, the first generated matrix is segmented according to the position index of the optimal comparison path to produce a bar, phrase and paragraph matrix; finally, evaluating the bar, phrase and paragraph matrix according to the evaluation dictionary and outputting an evaluation language; the specific implementation process is as follows:
The PITCH information extraction module extracts and stores PITCH information in a music spectrum, the PITCH information being taken from < NOTE < PITCH < STEPS > < OCTAVE > > tagged information in a music spectrum file in an extensible markup format (XML/musicXML).
The [ M112 ] pitch information extraction module extracts and stores pitch information in sound, and the pitch information extraction refers to the patent application number as follows: 201910669985.X, patent name: a music analysis data set construction method and an extraction method in a pitch and duration value extraction method based on the music analysis data set construction method are used for obtaining pitch information.
The [ M121 ] value information extraction module extracts and stores value information in a music spectrum, the value information being taken from information of < NOTE < DURATION > > tag in a music spectrum file in an extensible markup format (XML/MusicXML).
The [ M122 ] timing value information extraction module extracts and stores timing value information in sound, and the timing value information extraction refers to the patent application number as follows: 201910669985.X, patent name: a music analysis data set construction method and an extraction method in a pitch and duration extraction method based on the music analysis data set construction method are used for obtaining duration information.
The pitch information in [ M211 ] is converted into a (128 x n 1) two-dimensional Boolean vector matrix. Wherein the column coordinates represent the position index of each performing note in the pitch sequence, the row coordinates represent 128 semitone notes representing absolute pitches C-1 through G9 in scientific notation, the value of each position in the matrix is represented by 0/1, 0 represents not performing the corresponding note, and 1 represents performing the corresponding note.
Converting the pitch information in [ M212 ] into a (128 x n 2) two-dimensional Boolean vector matrix; the two-dimensional boolean vector matrix is represented by [ M211 ].
Converting the time value information in the M221 into a 1*n 1 two-dimensional numerical vector matrix; wherein the column coordinates represent the position index of each performance note in the pitch sequence.
Converting the time value information in the M223 into a 1*n 2 two-dimensional numerical vector matrix; the two-dimensional numerical vector matrix is represented by [ M221 ].
The PITCH score matrix pitch_s is constructed, initialized to a value of 0, and shaped ((n 1+1)*(n2 +1)).
Inputting the matrix in the M211 and the M212, and updating the scoring matrix S obtained by the M311 according to a scoring function; wherein the scoring function is described below.
Setting the pitch matrix of the cursive spectrum asThe pitch matrix of the sound is/>Where x i represents the column vector of the ith column of the spectrum pitch matrix, a j represents the column vector of the jth column of the sound pitch matrix:
The PITCH comparison matrix pitch_m is constructed, initialized to a value of 0 and shaped ((n 1+1)*(n2 +1)).
The scoring matrix of the M314 is taken as input, and the PITCH comparison matrix PITCH_M obtained by the M313 is updated according to a matching function. The matching function is described as follows:
the value comparison matrix DURA_M is constructed [ M321 ], the value is initialized to 0, and the shape is ((n 1+1)*(n2 +1)).
The matrix in the M322 is input, and the timing value comparison matrix DURA_M obtained by updating the M321 according to the comparison function is input. Wherein the alignment function is described as follows:
Setting the matrix of the curvespectral time value as Sound duration matrix is/>Where y i represents the ith column vector of the curvelet matrix and b j represents the jth column vector of the sound value matrix, then: /(I)
Calculating to obtain the best score paths in the pitch comparison matrix of [ M314 ], wherein the number of the best score paths is more than or equal to 1; the calculation rules are described as follows:
traversing elements in a PITCH comparison matrix PITCH_M, generating R and C according to iteration and assignment rules, and adding R and C into sequences R and C respectively, wherein the iteration and assignment rules are as follows, and 0 represents insertion or deletion and is used for representing multi-performance notes or few-performance notes in the actual performance process:
inputting a plurality of best score paths (M411) and a score duration matrix of (M322), calculating duration variance of each path, and selecting a path with the smallest variance as a minimum variance path, wherein the number of the minimum variance paths is more than or equal to 1:
Where the duration of each position index (i, j) of DURA_S path(i,j) deviates, E (DURA_S path) represents the average of the duration deviations of the entire path, and N represents the number of path positions.
The eight attribute values of the multiple minimum variance paths in the sequence [ M421 ] are respectively obtained and marked as P, and the eight attribute values are sequentially the number proportion of the elements 0 in the sequence R, the number proportion of the elements 0 in the sequence C, the relative starting position of the elements 0 in the sequence R, the relative starting position of the elements 0 in the sequence C, the relative ending position of the elements 0 in the sequence R, the relative ending position of the elements 0 in the sequence C, the maximum continuous number proportion of the elements 0 in the sequence R and the maximum continuous number proportion of the elements 0 in the sequence C.
Inputting eight attribute values of [ M431 ] into a sequencing model for sequencing respectively to obtain an optimal path; the sorting model is that the distribution of the element 0 in the optimal path obeys a certain rule, a path sorting value rank=p×w is calculated, and the path corresponding to the maximum rank is the optimal path; w is the weight sequence w=w iw2w3w4w5w6w7w8.
The method comprises the steps of dividing a vector matrix into section vector matrixes according to an optimal path sequence, wherein dividing rules are that section initial note position indexes are extracted from a music spectrum file < MEASURE > tag in an extensible tag format and are matched with position indexes of an optimal comparison path, section dividing tags are generated, and pitch vector matrixes and duration vector matrixes are divided into section pitch vector matrixes and section duration vector matrixes respectively according to the section dividing tags.
Dividing the vector matrix into phrase vector matrixes according to the optimal path sequence, wherein the dividing rule is that the initial note position index of a phrase is extracted from the pre-constructed phrase position index and is matched with the position index of the optimal comparison path, phrase dividing marks are generated, and the pitch vector matrixes and the duration vector matrixes are respectively divided into phrase pitch vector matrixes and phrase duration vector matrixes according to the phrase dividing marks;
the vector matrix is segmented into paragraph vector matrices according to the optimal path sequence, wherein the segmentation rule is that the paragraph starting note position index is extracted from the pre-constructed paragraph position index and matched with the position index of the optimal comparison path, a paragraph segmentation mark is generated, and the pitch vector matrix and the time value vector matrix are segmented into a paragraph pitch vector matrix and a paragraph time value vector matrix respectively according to the paragraph segmentation mark.
And (3) evaluating each data tuple of the step (M521) according to an evaluation rule, and outputting a corresponding position index.
And (M522) matching and selecting the evaluation words from a pre-constructed evaluation word dictionary, and outputting corresponding evaluation words. Wherein the evaluation word dictionary comprises evaluation words with multiple dimensions of pitch, duration, fluency and the like.
The invention acquires performance sound information and standard music score information; respectively encoding and converting the performance sound information and the standard music score information into vector matrixes; respectively inputting the vector matrix into a comparison function to generate a comparison matrix; generating an optimal comparison path by adopting a dynamic programming algorithm, a sequencing model and the like; the evaluation of the performance sound information and the score information includes: constructing an evaluation word dictionary; dividing the optimal comparison path into a multi-level evaluation sequence; and generating evaluation information of different layers of sequences according to the evaluation rule.
The specific process of path planning is that the position indexes of the music notes and the sound notes are shifted by 1 positive unit integrally through position index conversion, and 0 is used for representing insertion or deletion; if the whole is changed from 0 to 8 to 1 to 9, 0 is introduced if insertion or deletion is needed, for example 102 34 500 6 7 8 9 is the result after the change, which represents that a space needs to be inserted after the first position and 2 spaces are inserted after the 5 th position.
The invention adopts the playing sound as the input source, thereby eliminating the dependence on the electronic musical instrument and the electronic musical instrument elements in the current playing evaluation system; aiming at the problem that the instrument learner frequently has unsmooth problems such as rebound, sound leakage and the like in the performance training process, the comparison concepts such as 'insert' and 'delete' are introduced, and the actual performance of the instrument learner is identified through a comparison algorithm; the invention realizes multi-level and multi-dimensional performance evaluation by constructing evaluation rules and evaluation word dictionaries of five levels of notes, bars, phrases, paragraphs, tracks and the like and different performance dimensions of pitch, time value, smoothness, strength and the like.
The present invention can be well implemented according to the above-described embodiments. It should be noted that, based on the above structural design, even if some insubstantial modifications or color-rendering are made on the present invention, the essence of the adopted technical solution is still the same as the present invention, so it should be within the protection scope of the present invention.

Claims (5)

1. The method for obtaining the optimal comparison path of the performance time value information and the score time value information is characterized by comprising the following steps:
(B1) Acquiring performance sound duration information and melody duration information;
(B2) Performing code conversion on the (B1) performance sound time value information and the music spectrum time value information respectively to generate a time value matrix;
(B3) Inputting the timing value matrix of the step (B2) into a timing value comparison function to generate a timing value comparison matrix;
The value comparison function in step (B3) is: dura_s (i+1,j+1)=bj/yi;
Wherein, the matrix of the curvelet time value is Sound duration matrix is/>Wherein y i represents the ith column vector of the curvelet matrix, and b j represents the jth column vector of the sound value matrix; dura_s is a timing value comparison matrix;
(B4) Performing path planning according to the timing value comparison matrix of the step (B3), and generating a timing value minimum variance path as an optimal path;
The path planning in the step (B4) is the minimum variance path dynamic planning based on the position index, and the specific process is that the directed graph traversal is performed on the value comparison matrix, and the distance from DURA_S (0, 0) to DURA_S (0) is obtained Calculating the variance of each path to obtain a minimum variance path, and converting the minimum variance path into a (r, c) sequence based on a position index, wherein 0 represents insertion or deletion, and shifting the whole of the position indexes of the music notes and the sound notes by 1 positive unit through position index conversion;
wherein the duration of each position index (i, j) of DURA_S path (i, j) deviates, E (DURA_S path) represents the average of the duration deviations of the entire path, and N represents the number of path positions.
2. The obtaining method according to claim 1, wherein the music score value information in the step (B1) is taken from information of < NOTE < DURATION > > flag in a music score file in a scalable flag format.
3. The obtaining method according to claim 1, wherein the time value information is code-converted into a two-dimensional numerical matrix 1*n in step (B2), the column coordinates representing the position index of each performance note in the time value sequence, and the element values in the matrix are represented by integer numbers representing the number of time frames.
4. A computer device comprising a processor and a memory, in which a computer program is stored which, when loaded and executed by the processor, implements the obtaining method according to claims 1-3.
5. A computer readable medium, characterized in that a computer program is stored in the computer readable medium, which computer program is loaded and executed by a processor to implement the obtaining method according to claims 1 to 3.
CN202111036222.5A 2019-09-02 2019-09-02 Method for obtaining optimal comparison path of performance time value information and score time value information Active CN113744764B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111036222.5A CN113744764B (en) 2019-09-02 2019-09-02 Method for obtaining optimal comparison path of performance time value information and score time value information

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910806968.6A CN110600057B (en) 2019-09-02 2019-09-02 Method and system for comparing performance sound information with music score information
CN202111036222.5A CN113744764B (en) 2019-09-02 2019-09-02 Method for obtaining optimal comparison path of performance time value information and score time value information

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
CN201910806968.6A Division CN110600057B (en) 2019-09-02 2019-09-02 Method and system for comparing performance sound information with music score information

Publications (2)

Publication Number Publication Date
CN113744764A CN113744764A (en) 2021-12-03
CN113744764B true CN113744764B (en) 2024-04-26

Family

ID=68856284

Family Applications (3)

Application Number Title Priority Date Filing Date
CN201910806968.6A Active CN110600057B (en) 2019-09-02 2019-09-02 Method and system for comparing performance sound information with music score information
CN202111041068.0A Active CN113763991B (en) 2019-09-02 2019-09-02 Comparison method and system of performance sound information and music score information
CN202111036222.5A Active CN113744764B (en) 2019-09-02 2019-09-02 Method for obtaining optimal comparison path of performance time value information and score time value information

Family Applications Before (2)

Application Number Title Priority Date Filing Date
CN201910806968.6A Active CN110600057B (en) 2019-09-02 2019-09-02 Method and system for comparing performance sound information with music score information
CN202111041068.0A Active CN113763991B (en) 2019-09-02 2019-09-02 Comparison method and system of performance sound information and music score information

Country Status (1)

Country Link
CN (3) CN110600057B (en)

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110600057B (en) * 2019-09-02 2021-12-10 深圳市平均律科技有限公司 Method and system for comparing performance sound information with music score information
CN113643676A (en) * 2020-04-27 2021-11-12 汲趣艺术科技(上海)有限公司 Performance evaluation system
CN111554257A (en) * 2020-05-07 2020-08-18 南京邮电大学 Note comparison system of traditional Chinese national musical instrument and use method thereof
CN112258932B (en) * 2020-11-04 2022-07-19 深圳市平均律科技有限公司 Auxiliary exercise device, method and system for musical instrument playing
CN112836080B (en) * 2021-02-05 2023-09-12 小叶子(北京)科技有限公司 Method and system for searching music score through audio
CN114417915A (en) * 2021-12-29 2022-04-29 星花怒放(苏州)科技有限公司 Two-dimensional sequence similarity evaluation system for turning over spectrums
CN116070077A (en) * 2022-12-29 2023-05-05 江苏理工学院 Music composing effect evaluation method and device for automatic music composing algorithm

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5952597A (en) * 1996-10-25 1999-09-14 Timewarp Technologies, Ltd. Method and apparatus for real-time correlation of a performance to a musical score
CN103824565A (en) * 2014-02-26 2014-05-28 曾新 Humming music reading method and system based on music note and duration modeling
CN105070298A (en) * 2015-07-20 2015-11-18 科大讯飞股份有限公司 Polyphonic musical instrument scoring method and device
JP2016184112A (en) * 2015-03-26 2016-10-20 ヤマハ株式会社 Ensemble evaluation apparatus
CN106228961A (en) * 2016-07-21 2016-12-14 赵洪云 Play evaluation methodology and device
CN107424476A (en) * 2017-08-01 2017-12-01 上海破破飞网络科技有限公司 A kind of instrumental music training mate system
CN107767847A (en) * 2017-09-29 2018-03-06 小叶子(北京)科技有限公司 A kind of intelligent piano performance assessment method and system
CN110600057A (en) * 2019-09-02 2019-12-20 深圳市平均律科技有限公司 Method and system for comparing performance sound information with music score information

Family Cites Families (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020116196A1 (en) * 1998-11-12 2002-08-22 Tran Bao Q. Speech recognizer
KR100658219B1 (en) * 2001-06-25 2006-12-15 어뮤즈텍(주) Method and apparatus for designating expressive performance notes with synchronization information
GB2433349B (en) * 2004-10-22 2010-03-17 In The Chair Pty Ltd A method and system for assessing a musical performance
US7750224B1 (en) * 2007-08-09 2010-07-06 Neocraft Ltd. Musical composition user interface representation
JP5593608B2 (en) * 2008-12-05 2014-09-24 ソニー株式会社 Information processing apparatus, melody line extraction method, baseline extraction method, and program
US8575465B2 (en) * 2009-06-02 2013-11-05 Indian Institute Of Technology, Bombay System and method for scoring a singing voice
CN103871426A (en) * 2012-12-13 2014-06-18 上海八方视界网络科技有限公司 Method and system for comparing similarity between user audio frequency and original audio frequency
CN103823867B (en) * 2014-02-26 2017-02-15 深圳大学 Humming type music retrieval method and system based on note modeling
CN106228996B (en) * 2016-07-15 2019-08-02 黄河科技学院 Vocality study electron assistant articulatory system
CN106340286B (en) * 2016-09-27 2020-05-19 华中科技大学 Universal real-time musical instrument playing evaluation system
CN107818796A (en) * 2017-11-16 2018-03-20 重庆师范大学 A kind of music exam assessment method and system
CN108711336B (en) * 2018-04-27 2020-05-12 山东英才学院 Piano playing scoring method and device, computer equipment and storage medium
CN109448682A (en) * 2018-12-04 2019-03-08 无锡市思娱文化科技有限公司 A kind of system and its application method for realizing that microphone is interacted with jukebox
CN110070847B (en) * 2019-03-28 2023-09-26 深圳市芒果未来科技有限公司 Musical tone evaluation method and related products

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5952597A (en) * 1996-10-25 1999-09-14 Timewarp Technologies, Ltd. Method and apparatus for real-time correlation of a performance to a musical score
CN103824565A (en) * 2014-02-26 2014-05-28 曾新 Humming music reading method and system based on music note and duration modeling
JP2016184112A (en) * 2015-03-26 2016-10-20 ヤマハ株式会社 Ensemble evaluation apparatus
CN105070298A (en) * 2015-07-20 2015-11-18 科大讯飞股份有限公司 Polyphonic musical instrument scoring method and device
CN106228961A (en) * 2016-07-21 2016-12-14 赵洪云 Play evaluation methodology and device
CN107424476A (en) * 2017-08-01 2017-12-01 上海破破飞网络科技有限公司 A kind of instrumental music training mate system
CN107767847A (en) * 2017-09-29 2018-03-06 小叶子(北京)科技有限公司 A kind of intelligent piano performance assessment method and system
CN110600057A (en) * 2019-09-02 2019-12-20 深圳市平均律科技有限公司 Method and system for comparing performance sound information with music score information

Also Published As

Publication number Publication date
CN113763991A (en) 2021-12-07
CN110600057A (en) 2019-12-20
CN113744764A (en) 2021-12-03
CN113763991B (en) 2024-04-19
CN110600057B (en) 2021-12-10

Similar Documents

Publication Publication Date Title
CN113744764B (en) Method for obtaining optimal comparison path of performance time value information and score time value information
CN110147451B (en) Dialogue command understanding method based on knowledge graph
CN113591457B (en) Text error correction method, device, equipment and storage medium
CN107993636B (en) Recursive neural network-based music score modeling and generating method
WO2019158927A1 (en) A method of generating music data
CN113010730B (en) Music file generation method, device, equipment and storage medium
CN115293139A (en) Training method of voice transcription text error correction model and computer equipment
CN115293138A (en) Text error correction method and computer equipment
CN115545033A (en) Chinese field text named entity recognition method fusing vocabulary category representation
CN114021573B (en) Natural language processing method, device, equipment and readable storage medium
CN110956039A (en) Text similarity calculation method and device based on multi-dimensional vectorization coding
CN109102006A (en) A kind of music automark method based on the enhancing of audio frequency characteristics induction information
CN116483337A (en) API completion method based on prompt learning and data enhancement
CN114021572B (en) Natural language processing method, device, equipment and readable storage medium
CN114579763A (en) Character-level confrontation sample generation method for Chinese text classification task
CN114925197A (en) Deep learning text classification model training method based on topic attention
Miyao et al. Format of ground truth data used in the evaluation of the results of an optical music recognition system
Wai et al. Myanmar (Burmese) String Similarity Measures based on Phoneme Similarity
CN110781678B (en) Text representation method based on matrix form
JP2014191490A (en) Abbreviation generation device, abbreviation generation method and program
WO2024042650A1 (en) Training device, training method, and program
KR20190111360A (en) System, device, and method to generate polyphonic music
US20240013760A1 (en) Text providing method and text providing device
Moon et al. Concepts for Computing Patterns in 15th Century Korean Music
Rahal et al. Separated Feature Learning for Music Composition Using Memory-Based Neural Networks

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant