CN108735192B

CN108735192B - System and method for evaluating piano playing tone quality by combining music

Info

Publication number: CN108735192B
Application number: CN201810311753.2A
Authority: CN
Inventors: 韦岗; 孙启梦; 曹燕
Original assignee: South China University of Technology SCUT
Current assignee: South China University of Technology SCUT
Priority date: 2018-04-09
Filing date: 2018-04-09
Publication date: 2021-09-21
Anticipated expiration: 2038-04-09
Also published as: CN108735192A

Abstract

The present invention provides a system and method for evaluating the sound quality of piano performance combined with musical styles. The system includes a piano music library, a microphone array recording device, a high-quality microphone recording device, a music database, an expert listening evaluation module, a signal feature extraction module, a sample library, a signal feature analysis module and a sound quality evaluation module. After the system is established, the user only needs to select a tune of the preferred genre and play it on the piano to be evaluated. The system will intelligently process the collected performance sound signals and output an evaluation score as the evaluation result of the piano performance quality. The system proposed by the invention combines the elements of the genre, analyzes various signal characteristics such as air domain, time domain, frequency domain and time-frequency diagram, and outputs the evaluation results intelligently during the use process. Compared with the previous sound quality evaluation methods, Objectivity, convenience, and intelligence are greatly improved, and at the same time, it is more in line with the individual needs of users.

Description

System and method for evaluating piano playing tone quality by combining music

Technical Field

The invention relates to the cross technical field of music acoustics, psychoacoustics, signal processing, computer mode identification and the like, in particular to a piano playing tone quality evaluation system and method combining music.

Background

With the continuous development of economy and society, the living standard of people is rapidly improved, the spiritual culture is concerned and paid more and more attention, and the demand is increasingly increased. Among them, the music art is the most contacted aspect in people's daily life, and the piano as the "king of musical instrument" occupies a large part of the proportion of playing the music art. Because the piano has a complex structure and a wide range of sound, the difficulty of researching and evaluating the tone quality and tone color is high. At present, the quality of the sound quality of the piano basically depends on subjective judgment, namely, experts with certain piano music theory knowledge and playing experience listen and score according to some indexes. The evaluation method can cause the problems of fluctuation of evaluation accuracy and the like due to auditory fatigue of testers, personal preference and different on-site listening environments. Furthermore, in many real-time situations, the labor and time costs of listening tests are also relatively expensive.

With the development of computer science and the research on the combination of signal processing technology and music acoustics, people begin to analyze the characteristics of various aspects of musical instruments from a rational and scientific point of view. The music signal characteristics of musical instrument playing sound signals can be accurately extracted by applying digital signal processing technologies such as wavelet analysis and the like or machine learning knowledge such as neural networks, and most of the previous researches are applied to aspects such as musical instrument identification, playing correct and incorrect evaluation and the like by using the characteristics or simply evaluating the sound quality by directly analyzing the characteristics, so that the sound quality evaluation cannot be automatically and intelligently carried out. If the subjective priori knowledge and the objective signal characteristic analysis can be combined to find the mapping relation between the subjective priori knowledge and the objective signal characteristic analysis, then an intelligent evaluation system is designed by utilizing the technologies of neural network, fuzzy reasoning and the like which are closer to the human cognition and judgment mechanism, finally, only a section of piano playing sound signal is input to a computer, the labor cost can be saved, and the piano playing sound quality does not need to be evaluated manually.

In the aspect of music signal feature extraction and analysis, more and more attention and research are paid to the relationship between the subjective priori knowledge of piano music and the objective digital signal features. In the previous researches, most of the researches are based on the music recognition of contents, and most of the researches are to collect signals of piano playing sounds by using a single microphone, and in the researches combined with a microphone array, most of the researches are to collect signals of a spatial field, apply the signals to sound field simulation and sound field reproduction of the spatial field, rarely use the microphone array to receive signals of the piano playing sounds at different positions, namely measurement and collection of spatial signals, wherein the spatial signals are related to overall auditory perception information, and meanwhile, in the researches on the influence of sound board characteristics on the sound quality of the piano, the spatial characteristics of the spatial signals are closely related to the sound board characteristics. Therefore, considering the collection of spatial domain signals of a plurality of positions and the comprehensive analysis, the tone quality of one piano can be more comprehensively evaluated.

On the other hand, in the case of the piano tone quality study, only a single note, a double note, or a simple performance sample without a melody label is often used in the previous study in analyzing music data, and the subjective preference of the player, that is, a factor of the melody is not considered. In the social environment, the individual requirements are more emphasized by piano manufacturers and sellers, and each piano fan has own preference, so that the piano most suitable for own habits and preferences is expected to be found. And whether the beginners or the experienced experts, when players play piano music of different styles on the same piano, the tone quality of the piano is different. Therefore, the quality of a piano is evaluated, and a factor of personalized characteristics such as a music style should be considered.

Disclosure of Invention

The invention aims to overcome the defects that the analysis is not comprehensive enough due to the fact that the piano tone quality evaluation depends on subjective evaluation and the music is not considered, and solves the problem that the spatial domain characteristic analysis of piano playing tone signals is lacked due to the fact that only a single microphone is used for data acquisition, and provides a piano playing tone quality evaluation system combined with the music and a playing tone signal acquisition device combining a microphone array recording device and a high-quality microphone recording device. Meanwhile, the tone quality evaluation module is constructed by adopting the fuzzy neural network, after a playing sound signal of a tune with a specific style played on one piano by a user is received, the corresponding characteristics of the data in a domain, a time domain, a frequency domain and a time-frequency diagram are analyzed, the obtained characteristic vector and the tune label are used as the input of the fuzzy neural network, a score is finally output, and the tone quality evaluation is carried out on the played piano when the tune is selected.

In order to realize the purpose and the function, the system provided by the invention needs to be subjected to the process of system establishment before use, namely, collecting playing sound signals of different music styles on different pianos, collecting expert subjective evaluation data, extracting and analyzing signal characteristics, determining the structure of a fuzzy neural network and training; the trained fuzzy neural network can be used. The specific technical scheme of the invention is as follows.

A piano playing tone quality evaluation system combined with music comprises a piano music library, a microphone array recording device, a high-quality microphone recording device, a music database, an expert listening evaluation module, a signal characteristic extraction module, a sample library, a signal characteristic analysis module and a tone quality evaluation module;

the piano music library is provided with music labels, and the music labels refer to the serial number marking of each music on the basis of music classification; the piano music library has two functions, namely, in the system establishing process, a large number of piano music resources are provided for subsequent analysis and training of a fuzzy neural network; secondly, in the using process of the user, selectable piano music is provided for the user;

the microphone array recording device comprises microphones arranged at different spatial positions, and a plurality of microphones collect playing sound signals, namely space-domain signals, of the piano at different positions so as to realize macroscopically analyzing the playing sound quality of the piano;

the signal characteristic extraction module extracts macroscopic and microscopic signal characteristics through signal characteristic extraction of audio files formed by piano playing sounds collected by the microphone array recording device and the high-quality microphone recording device, wherein the signal characteristics comprise space domain characteristics, time domain characteristics, frequency domain characteristics and time-frequency diagram characteristics; then establishing a sample library for storing input samples of the fuzzy neural network, wherein the input samples comprise two sample types of training samples and samples to be evaluated; the sample content is a signal feature vector and comprises the extracted signal features and corresponding curved wind labels;

the signal characteristic analysis module realizes the function of establishing a fuzzy set and a fuzzy inference rule, and comprises the steps of carrying out statistical comparison analysis on the extracted signal characteristics and subjective evaluation data obtained from the expert listening evaluation module, thereby establishing a fuzzy set of the signal characteristics and the subjective evaluation data, and simultaneously establishing a fuzzy inference rule of an evaluation process;

the tone quality evaluation module realizes the function of outputting tone quality evaluation scores after the samples in the sample library are processed by the fuzzy neural network; before the tone quality evaluation module is used, the structure of a fuzzy neural network is determined by adopting a fuzzy inference rule established by the signal characteristic analysis module, all samples of the sample library are used as training samples of the fuzzy neural network, subjective evaluation data of an audio file corresponding to each sample are used as expected output, namely a supervision signal, and then the fuzzy neural network is trained; after the network training is finished, the tone quality evaluation module can be used; when the user uses the tone quality evaluation module, the function of intelligently outputting the evaluation score after obtaining the sample to be evaluated can be realized without manually obtaining subjective evaluation data.

Based on the technical scheme, the evaluation method of the piano playing sound quality evaluation system combining the music comprises two processes, namely a system establishing process and a user using process.

Before the system is used, playing sound signals of different music on different pianos need to be collected, expert subjective evaluation data are collected, signal characteristics are extracted and analyzed, and the structure of a fuzzy neural network is determined and trained.

The steps of the system setup procedure are as follows:

(1) the piano music library with the music labels is established by analyzing the representative music classification rules, and the music labels are used for numbering and marking the types of the music on the basis of music classification.

(2) The performance sound signals played on a plurality of pianos are recorded on site using a microphone array recording apparatus and a high-quality microphone recording apparatus, and the played tunes include all the tunes in a piano tune library. The performance sound signals collected by the microphone array recording device are processed to form a multi-channel audio file, and the performance sound signals collected by the high-quality microphone recording device are correspondingly processed to form a high-fidelity audio file.

(3) The high-fidelity audio file is played back in a listening room, a plurality of professionals are allowed to carry out listening experiments, the quality of the sound quality of the professional is evaluated, subjective evaluation data are collected, and the subjective evaluation data, the audio file and the music note tag which are obtained through statistics are correspondingly stored in a music database.

(4) And based on the short-time stationarity of the audio signal, carrying out pre-emphasis processing and framing processing on the multi-channel audio file and the high-fidelity audio file.

(5) Inputting all audio files in the music database into a signal feature extraction module to extract signal features, establishing a sample library, forming signal feature vectors by the extracted signal features and corresponding music labels, and correspondingly storing the signal feature vectors into the sample library to be used as training samples.

(6) Counting the extracted signal characteristics, and carrying out comparative analysis on the signal characteristics and the subjective evaluation data in the step (3) so as to establish a fuzzy set of the signal characteristics and the subjective evaluation data and establish a fuzzy inference rule of an evaluation process; the fuzzy set and fuzzy inference rules refer to conditions required for judging the sound quality and the description of judgment logic in fuzzy mathematics; as can be described as follows: if the fundamental frequency harmonic proportion is in the range of x1 and the spatial balance is in the range of y1, the sound quality evaluation score is in the range of z 1; x1, y1, z1 all belong to respective fuzzy sets X, Y, Z.

(7) And (4) determining the structure of the fuzzy neural network according to the size of the signal feature vector obtained in the step (5) and the fuzzy inference rule obtained by analysis in the step (6), taking all samples of the sample library as training samples of the fuzzy neural network, taking subjective evaluation data of the audio file corresponding to each sample as expected output, namely a supervision signal, then training the fuzzy neural network, and finishing the establishment of the tone quality evaluation module.

After the system is built, a user can use the system, and the function of intelligently outputting the evaluation score after the sample to be evaluated is obtained can be realized without manually obtaining subjective evaluation data. The steps of the user using the system are as follows:

(1) let the user select a tune in the piano tune library and play on the piano to be evaluated, the microphone array recording apparatus and the high-quality microphone recording apparatus simultaneously collect the performance tone signals.

(2) Processing the playing sound signals to form audio files, and storing the audio files and the music labels into a music database;

(3) the audio file is input into a signal feature extraction module after being preprocessed, the extracted signal features and the music wind labels form signal feature vectors together, and the signal feature vectors are stored in a sample library to serve as samples to be evaluated.

(4) The sample to be evaluated is input into the established sound quality evaluation module, and finally the sound quality evaluation module outputs an evaluation score in the range of [0,100] as the performance sound quality evaluation result of the selected piano.

Compared with the existing tone quality evaluation system, the invention has the following advantages:

(1) according to the piano playing tone quality evaluation system combined with the music, after the system is established, only a section of piano playing tone signal needs to be input into the computer, the piano playing tone quality can be evaluated without subsequent manual work, and compared with the manual subjective evaluation of experts, the piano playing tone quality evaluation system combined with the music greatly improves convenience and intelligence and reduces labor cost and time cost.

(2) The system provided by the invention fully considers subjective evaluation data and objective signal characteristics in the system establishing process, wherein the objective signal characteristics comprise space domain characteristics, time domain characteristics, frequency domain characteristics and time-frequency diagram characteristics, and the multi-characteristic analysis method greatly improves the objectivity of the evaluation system and avoids the problems of evaluation standard floating and the like caused by artificial subjective evaluation.

(3) The system provided by the invention also considers the factors of the music which have personalized characteristics, and brings the music label into the piano for playing, and after the system processing, the user can obtain the sound quality of the music which is preferred by the user on the piano to be evaluated. Therefore, the user can better assist the user in selecting the piano really suitable for the user by comparing the evaluation scores of the plurality of intended pianos on the favorite music, so that the playing effect is optimal; and according to the preference of a user, a piano manufacturer can use the system to assist in debugging various parameters of the piano when producing the personalized custom piano.

Drawings

Fig. 1 is a block diagram showing the overall configuration of a piano playing sound quality evaluation system incorporating music.

Fig. 2 is a block diagram of a microphone array recording apparatus.

FIG. 3 is a flow diagram of an example system set up.

FIG. 4 is an example user usage flow diagram.

Detailed Description

In order to make the objects, technical solutions, innovative points and advantages of the present invention more apparent, embodiments of the present invention are further described below with reference to the accompanying drawings.

The following further describes embodiments of the present invention with reference to the drawings, but the practice of the present invention is not limited thereto.

As shown in fig. 1, the system of the present invention comprises nine modules: the system comprises a piano music library, a microphone array recording device, a high-quality microphone recording device, a music database, an expert listening evaluation module, a signal characteristic extraction module, a sample library, a signal characteristic analysis module and a tone quality evaluation module.

The piano performance sound quality evaluation system combined with the music wind mainly comprises the following parts:

(1) piano music library: the piano music book comprises a plurality of piano music books with music book labels, wherein the music book labels are used for numbering and marking each music book on the basis of music book classification.

(2) Microphone array recording device: the device is used for collecting spatial signals of piano playing tones for macroscopic analysis.

(3) High-quality microphone recording device: the signal is used for acquiring the piano high-fidelity performance sound for microscopic analysis.

(4) An expert listening evaluation module: in the system establishing process, playing sound signals of all piano songs on different pianos in a piano song library are required to be collected, and then the playing sound signals are correspondingly processed to form audio files; then, subjective evaluation data is obtained through the process of audio file playback and expert listening evaluation.

(5) A music database: correspondingly storing the music style label, the audio file and the subjective evaluation data into a music database; the process of playback of audio files and expert listening evaluation is not needed during the use of the system by the user, so that the data stored in the music library only comprises the audio files and the music labels.

(6) A signal feature extraction module: after the audio files of the playing sound in the music database are preprocessed, the macroscopic and microscopic signal characteristics of the piano playing sound are extracted through the module, and the signal characteristics mainly comprise space domain characteristics, time domain characteristics, frequency domain characteristics and time-frequency diagram characteristics.

(7) Sample library: the part is used for storing input samples of the fuzzy neural network, the sample content is a signal feature vector and comprises extracted signal features and corresponding curved wind labels, and the sample types comprise training samples and samples to be evaluated.

(8) A signal characteristic analysis module: the module is mainly used for realizing the function of establishing the fuzzy inference rule in the system establishing process, and comprises the steps of carrying out statistic comparison analysis on the extracted signal characteristics and subjective evaluation data, establishing a fuzzy set of the signal characteristics and the subjective evaluation data and establishing the fuzzy inference rule in the evaluation process.

(9) And a tone quality evaluation module: the module realizes the function of outputting the tone quality evaluation score after the samples in the sample library are processed by the fuzzy neural network. Before the module is used, the fuzzy inference rule established by the signal characteristic analysis module in the step (8) is adopted to determine the structure of the fuzzy neural network, all samples in the sample library in the step (7) are used as training samples of the fuzzy neural network, subjective evaluation data of an audio file corresponding to each sample is used as expected output, namely a supervision signal, and then the fuzzy neural network is trained. After the network is trained, the tone quality evaluation module can be used. When the module is used by a user, the function of intelligently outputting the evaluation score after the sample to be evaluated is obtained can be realized without manually obtaining subjective evaluation data.

Before the system provided by the invention is used, the process of system establishment is required, namely, playing sound signals of different music on different pianos are collected, expert subjective evaluation data are collected, signal characteristics are extracted and analyzed, and the structure of a fuzzy neural network is determined and trained; the trained fuzzy neural network can be used. Thus, the system set-up process includes all nine modules, while the user usage process includes only seven of the modules: the system comprises a piano music library, a microphone array recording device, a high-quality microphone recording device, a music database, a signal characteristic extraction module, a sample library and a tone quality evaluation module.

The piano tune library in this example may include four types of styles formed by classifying the piano tune development periods: the song label is respectively numbered 1,2,3 and 4 in the baroque style, the classical style, the romantic style and the modern style; selecting three typical exercise songs from each type of style; thus, a total of 12 piano tunes were included in the tune library.

The microphone array recording device in this embodiment is shown in fig. 2, and includes 18 microphones, an interface assembly board, a USB to serial port signal transmission control circuit, an amplification band-pass circuit, an a/D module, and an ARM signal processing and storing module. The amplifying band-pass circuit, the A/D module and the ARM signal processing and storing module are integrated into a set circuit, each set circuit board receives playing sound signals collected by 3 microphones, and 6 set circuit boards in total comprise 1 host set circuit and 5 slave set circuits; the sampling frequency is 100kHz, and the quantization precision is 12 bits; the USB-to-serial port signal transmission control circuit is mainly responsible for receiving signals of starting and stopping recording at the computer end, when the computer sends a command for starting receiving, the command is transmitted to the host through the module, the host simultaneously transmits the command to each slave through a signal wire connected with the slave, and then other modules start working; data in the ARM signal processing storage module are stored on the SD storage card in a bin file mode, and are converted into multichannel wav audio files through corresponding programs.

The high-quality microphone recording device comprises a single microphone capable of collecting high-fidelity playing sound signals, and is combined with Adobe Audio software to generate a high-fidelity wav audio file, the selected sampling frequency is 96kHz, and the quantization precision is 16 bits.

The music database in the embodiment is mainly used for storing multi-channel wav audio files and high-fidelity wav audio files, the audio files, subjective evaluation data obtained after the audio files are subjected to the expert listening evaluation module, and music labels are correspondingly stored in the database, so that the music database with various styles, multiple pianos and different sound quality evaluation score labels is established.

And wherein the expert listening evaluation module mainly comprises the following components: the high-fidelity wav audio file interception part is played back in a listening room, 5 experts are invited to perform listening experiments, each intercepted audio is evaluated according to indexes such as sound stability, richness, brightness, fullness and the like, then tone quality between every two audio files is compared and scored integrally, and the collected subjective evaluation data is counted and analyzed.

The music feature extraction module in the embodiment mainly realizes the function of extracting and obtaining features in the aspects of airspace, time domain, frequency domain, time-frequency diagram and the like by taking an audio file in a music database as input; the relationship between these features and the psychoacoustic index is as follows:

1) macroscopic spatial domain signal characteristics such as spatial balance extracted based on the multi-channel wav audio file are related to the stability of sound and the naturalness of a sound area and a transition area;

2) microscopic time domain envelope characteristics such as the oscillation starting time and the single tone time value extracted based on the high-fidelity wav audio file are related to the tone brightness or low tone quality of the sound, and the tone quality is directly influenced;

3) microscopic frequency domain characteristics such as amplitude and amplitude proportion of fundamental frequency and overtone extracted based on the high-fidelity wav audio file are related to timbre expressive force of sound;

4) the time-frequency diagram shows that the overall characteristics of fundamental frequency and overtone are related to the tone fullness and harmony of sound.

The sample library in the embodiment is used for storing input samples of the tone quality evaluation module, and comprises two sample types, namely a training sample and a sample to be evaluated; the sample content is a signal feature vector comprising the extracted signal features and corresponding melody tags.

The tone quality evaluation module in the embodiment is a main module of the system, the main structure of the tone quality evaluation module is a multi-input single-output five-layer neural network, and the selection of internal logic and weight functions depends on a fuzzy set and a fuzzy inference rule established by a signal characteristic analysis module.

The music signal characteristic analysis module mainly realizes the function of establishing a fuzzy inference rule, and comprises the steps of carrying out statistic comparison analysis on the extracted signal characteristics and subjective evaluation data, thereby establishing a fuzzy set of the signal characteristics and the subjective evaluation data, and simultaneously establishing the fuzzy inference rule of an evaluation process; the fuzzy set and the fuzzy rule refer to conditions required for judging the sound quality and the description of a judgment logic in fuzzy mathematics; as can be described as follows: if the fundamental frequency harmonic proportion is in the range of x1 and the spatial balance is in the range of y1, the sound quality evaluation score is in the range of z 1. x1, y1, z1 all belong to respective fuzzy sets X, Y, Z.

Before the system is used, the system needs to be subjected to the process of system establishment, namely, playing sound signals of different music styles on different pianos are collected, expert subjective evaluation data are collected, signal characteristics are extracted and analyzed, and the structure of a fuzzy neural network is determined and trained; the trained fuzzy neural network can be used. A flow chart of the system set-up procedure is shown in figure 3.

The system establishment process is as follows:

(1) establishing a piano music library: as described above, a total of 12 piano songs of four genres are included in the library.

(2) The recording device is turned on, and the player plays the music in the music library: the performance sound signals played on a plurality of pianos are recorded on site using a microphone array recording apparatus and a high-quality microphone recording apparatus, and the played tunes include all the tunes in a piano tune library.

(3) Forming an audio file: the performance sound signals collected by the microphone array recording device are processed to form a multi-channel audio file, and the performance sound signals collected by the high-quality microphone recording device are correspondingly processed to form a high-fidelity audio file. In order to correspond to each exercise music of each style in the piano music library of this example, thus letting 3 players play on 4 distinctly different pianos, 48 hi-fi audio files and 48 16-channel audio files are obtained in total.

(4) Audio file playback, expert listening evaluation, music database establishment: the high-fidelity audio file is played back in a listening room, a plurality of professionals are allowed to carry out listening experiments, the quality of the sound quality of the professional is evaluated, subjective evaluation data are collected, and the subjective evaluation data, the audio file and the music note tag which are obtained through statistics are correspondingly stored in a music database.

(5) Preprocessing an audio file: and based on the short-time stationarity of the audio signal, carrying out pre-emphasis processing and framing processing on the multi-channel audio file and the high-fidelity audio file.

(6) Extracting signal features to form training samples: inputting all audio files in the music database into a signal feature extraction module to extract signal features, establishing a sample library, forming signal feature vectors by the extracted signal features and corresponding music labels, and correspondingly storing the signal feature vectors into the sample library to be used as training samples.

(7) Analyzing the signal characteristics and subjective evaluation data, and establishing a fuzzy set and a fuzzy inference rule: counting the extracted signal characteristics, and carrying out comparative analysis on the extracted signal characteristics and the subjective evaluation data obtained in the step (4), so as to establish a fuzzy set of the signal characteristics and the subjective evaluation data and establish a fuzzy inference rule of an evaluation process; the fuzzy set and fuzzy inference rules refer to conditions required for judging the sound quality and the description of judgment logic in fuzzy mathematics; as can be described as follows: if the fundamental frequency harmonic proportion is in the range of x1 and the spatial balance is in the range of y1, the sound quality evaluation score is in the range of z 1; x1, y1, z1 all belong to respective fuzzy sets X, Y, Z.

(8) Determining the structure of the fuzzy neural network, training the fuzzy neural network: and (4) determining the structure of the fuzzy neural network according to the size of the signal feature vector obtained in the step (6) and the fuzzy inference rule obtained by analysis in the step (7), taking all samples of the sample library as training samples of the fuzzy neural network, taking subjective evaluation data of the audio file corresponding to each sample as expected output, namely a supervision signal, and then training the fuzzy neural network.

And after the verification and verification evaluation result is verified to be optimal, the tone quality evaluation module is established. After the system is successfully established, a user can use the system, and the function of intelligently outputting evaluation scores after obtaining samples to be evaluated can be realized without manually obtaining subjective evaluation data; a flow chart of the user using the system is shown in fig. 4.

The flow of the system using process of the user is as follows:

(1) user selection of preferred music: the user selects a preferred tune in the piano tune library, for example, a romantic style tune, the tune label number is set to 3, and the selected tune is correctly played on the piano desired to obtain the sound quality evaluation.

(2) The recording device is turned on, and the user plays the tune of the selected tune on the piano to be evaluated: the microphone array recording device and the high-quality microphone recording device simultaneously acquire performance sound signals of a user.

(3) Forming an audio file, and storing the audio file in a music database by combining with a music label: and after the performance is finished, processing the collected performance sound signals to form audio files, and storing the audio files and the music labels into a music database.

(4) Preprocessing an audio file: and performing pre-emphasis processing and framing processing on the audio file based on the short-time stationarity of the audio signal.

(5) Extracting signal characteristics to form a sample to be evaluated: and inputting the preprocessed audio file into a signal feature extraction module, forming a signal feature vector by the extracted signal feature and the song label, and storing the signal feature vector into a sample library to be used as a sample to be evaluated.

(6) And (3) performing sound quality evaluation, and outputting a sound quality evaluation score: inputting the sample to be evaluated into the established sound quality evaluation module, and finally outputting an evaluation score, such as 65 scores, within the range of [0,100], by the sound quality evaluation module, wherein the evaluation score is the performance sound quality evaluation result of the played piano under the selected curved wind.

Therefore, the user can compare the evaluation scores of multiple intended pianos on the favorite music so as to assist in selecting the piano really suitable for the user and enable the playing to achieve the best effect; according to the preference of users, piano manufacturers can use the system to assist in debugging various parameters of the piano when producing personalized custom pianos, so that a large part of manual evaluation cost and time cost are reduced.

The above embodiments are intended to be preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims

1. a piano performance sound quality evaluation system in conjunction with genre, is characterized in that comprising piano music library, microphone array recording device, high-quality microphone recording device, music database, expert listening evaluation module, signal feature extraction module, sample library , signal feature analysis module and sound quality evaluation module;

The piano music library has a genre label, and the genre label refers to numbering and marking each genre on the basis of genre classification; the piano music library has two functions, one is during the system establishment process. , to provide a large number of piano music resources for subsequent analysis and training of fuzzy neural networks; the second is to provide users with optional piano music during the use process;

The microphone array recording device includes microphones placed at different spatial positions, and the plurality of microphones collects the playing sound signals of the piano at different positions, that is, the spatial signal, so as to analyze the playing sound quality of the piano on a macroscopic level;

The microphone array recording device is used to collect the spatial signal of the piano performance sound for macroscopic analysis; the high-quality microphone recording device is used to collect the signal of the piano high-fidelity performance sound for microscopic analysis; The high-quality microphone recording device contains a single microphone that can capture high-fidelity performance sound signals, and combines with Adobe Audition software to generate high-fidelity wav audio files;

The signal feature extraction module extracts the signal features of the audio files formed by the piano performance sound collected by the microphone array recording device and the high-quality microphone recording device, and extracts the macroscopic and microscopic signal features, and these signal features include spatial features. , time-domain features, frequency-domain features, and time-frequency map features; then a sample library is established to store the input samples of the fuzzy neural network. The input samples include two types of samples: training samples and samples to be evaluated; the sample content is the signal feature vector, Including the extracted signal features and corresponding genre labels;

The signal feature analysis module realizes the function of establishing fuzzy sets and fuzzy inference rules, including statistical comparative analysis between the extracted signal features and the subjective evaluation data obtained from the expert listening evaluation module, thereby establishing signal features and subjective evaluation data. Evaluate fuzzy sets of data and establish fuzzy inference rules for the evaluation process;

The sound quality evaluation module realizes the function of outputting sound quality evaluation scores after the samples in the sample library are processed by the fuzzy neural network; before the sound quality evaluation module is used, the fuzzy inference rules established by the signal feature analysis module need to be adopted. To determine the structure of the fuzzy neural network, and use all the samples in the sample library as the training samples of the fuzzy neural network, and the subjective evaluation data of the audio file corresponding to each sample as the expected output, that is, the supervision signal, and then to the fuzzy neural network. After the network is trained, the sound quality evaluation module can be used; when users use the sound quality evaluation module, they can realize the function of intelligently outputting evaluation scores after obtaining the samples to be evaluated without manually obtaining subjective evaluation data.

2. utilize the described a kind of evaluation method of the piano performance sound quality evaluation system in conjunction with the style of claim 1, it is characterized in that comprising system establishment process and user use process; Before the system is used, need to collect the performance of different styles of music on different pianos Audio signal, collect expert subjective evaluation data, extract and analyze signal features, determine the structure of fuzzy neural network and conduct training;

The steps of the system establishment process are as follows:

(1) By analyzing the representative genre classification rules, a piano music library with genre labels is established, and genre labels refer to numbering and marking each genre on the basis of genre classification;

(2) Use the microphone array recording device and the high-quality microphone recording device to record the performance sound signals played on multiple pianos on the spot, and the performed music includes all the music in the piano music library; the performance sound collected by the microphone array recording device After the signal is processed, a multi-channel audio file is formed, and the performance sound signal collected by the high-quality microphone recording device is processed to form a high-fidelity audio file;

(3) Play back the high-fidelity audio files in the listening room, let multiple professionals conduct listening experiments, evaluate the quality of their sound quality, collect subjective evaluation data, and count the subjective evaluation data, audio The files and genre labels are correspondingly stored in the music database;

(4) Pre-emphasis and framing processing are performed on multi-channel audio files and high-fidelity audio files based on the short-term stability of the audio signal;

(5) Input all the audio files in the music database into the signal feature extraction module to extract the signal features, establish a sample library, and form a signal feature vector with the extracted signal features and the corresponding genre labels, and store them in the corresponding In the sample library, as a training sample;

(6) carrying out statistics on the extracted signal features, and comparing and analyzing the subjective evaluation data in step (3), thereby establishing a fuzzy set of signal features and subjective evaluation data, and establishing fuzzy inference rules for the evaluation process; the fuzzy Set and fuzzy inference rules refer to the description of the conditions and judgment logic required to judge the quality of sound quality in fuzzy mathematics; for example, the following description can be made: if the fundamental frequency overtone ratio is in the range of x1, and the spatial balance is in the range of y1, then the sound quality The evaluation score is in the range of z1; x1, y1, z1 all belong to their respective fuzzy sets X, Y, Z;

(7) According to the size of the signal feature vector obtained in step (5) and the fuzzy inference rule obtained by analysis in step (6), determine the structure of the fuzzy neural network, and use all the samples in the sample library as the training samples of the fuzzy neural network, The subjective evaluation data of the audio file corresponding to each sample is used as the expected output, that is, the supervision signal, and then the fuzzy neural network is trained, and the sound quality evaluation module is established;

After the system is established, the user can use the system, and the function of intelligently outputting evaluation scores after obtaining the samples to be evaluated can be realized without manual acquisition of subjective evaluation data;

The steps for users to use the system are as follows:

(1) Let the user select a piece of music in the piano music library and play it on the piano to be evaluated, and the microphone array recording device and the high-quality microphone recording device simultaneously collect the performance sound signal;

(2) the performance sound signal is processed to form an audio file, and is stored in the music database together with the genre label;

(3) Input the signal feature extraction module after preprocessing the audio file, and form the signal feature vector together with the extracted signal feature and the genre label, and store it in the sample library as the sample to be evaluated;

(4) Input the samples to be evaluated into the established sound quality evaluation module, and the final sound quality evaluation module will output an evaluation score in the range of [0, 100] as the performance evaluation result of the selected piano.