CN103761965A - Method for classifying musical instrument signals - Google Patents

Method for classifying musical instrument signals Download PDF

Info

Publication number
CN103761965A
CN103761965A CN201410008533.4A CN201410008533A CN103761965A CN 103761965 A CN103761965 A CN 103761965A CN 201410008533 A CN201410008533 A CN 201410008533A CN 103761965 A CN103761965 A CN 103761965A
Authority
CN
China
Prior art keywords
centerdot
dimension
tau
phase
vector
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410008533.4A
Other languages
Chinese (zh)
Other versions
CN103761965B (en
Inventor
郭一娜
王志社
郅逍遥
王晓梅
李临生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Taiyuan University of Science and Technology
Original Assignee
Taiyuan University of Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Taiyuan University of Science and Technology filed Critical Taiyuan University of Science and Technology
Priority to CN201410008533.4A priority Critical patent/CN103761965B/en
Publication of CN103761965A publication Critical patent/CN103761965A/en
Application granted granted Critical
Publication of CN103761965B publication Critical patent/CN103761965B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention discloses a method for classifying musical instrument signals, and belongs to the technical field of electronic information. Modules adopted in the method comprise a phase-space reconstruction module, a principal component analysis module, a feature extraction module and a flexibility neural tree module. The method is characterized by comprising the step of carrying out the phase-space reconstruction on a time sequence produced by different musical instrument sample signals, the step of removing redundant information through the principal component analysis to achieve the dimensionality reduction purpose, the step of depicting the differences of different musical instruments in the phase space through the probability density function by analyzing the features of various musical instruments, and the step of utilizing a flexible neural tree model to serves as a classifier to carry out classification. The method can effectively solve the problem of the high dependency of an artificial neural network structure, and the classification accuracy of a single musical instrument can reach up to 98.7 percent.

Description

A kind of sorting technique of instrument signal
Technical field
The invention belongs to electronic information technical field, be specifically related to a kind of sorting technique of instrument signal.
Background technology
Music is part indispensable during people live, yet we understand the mode of music information can only, by real-time audition, therefrom obtain own interested content.Along with scientific and technological development, the information that people can touch, also in growth at full speed, therefore, obtains own interested music by real-time audition not easy in the data of magnanimity.In recent years, music data analysis and retrieval have become domestic and international study hotspot.
Most music relies on the performance of musical instrument and expresses, and same different instrument playing for first song brings different enjoyments also can to people's the sense of hearing.In computing machine identification field, along with the development of signal analysis and data mining technology, the multiple cross one another technology such as time frequency analysis, neural network are all applied in the middle of musical instrument classification gradually.
Effectively feature extraction and sorter selection are the committed steps in musical instrument classification always.Mel cepstrum coefficient feature is used in voice signal field conventionally, but also yields unusually brilliant results in the classification of musical genre and instrument signal.MPEG-7 is as with audio feature extraction and the standardization framework that is described as object, essential in the feature extraction of music signal, especially better to similar music signal effect.Except numerous acoustic characteristics, sorter has also obtained further investigation in audio classification widely, such as neural network, gauss hybrid models, hidden Markov model, Bayesian decision sorter and support vector machine etc.In researcher, there is people to utilize single-channel voice partition method to make two or more musical instrument separation obtain possibility by Gauss's modeling, someone uses AM/FM amplitude modulation/frequency modulation modulating characteristic instrument signal to be analyzed and identified (its accuracy rate can reach 70%), also someone uses Mel cepstrum coefficient feature, permanent q conversion and autocorrelation function carry out feature extraction, in conjunction with Bayesian decision sorter, make recognition accuracy reach 79%-84%.But in existing achievement in research, the classification accuracy of some sound signals is generally on the low side, illustrate that above-mentioned acoustic characteristic does not depict the difference between all signals well, and the parameter of most of sorter is chosen and operating process exists complicated loaded down with trivial details shortcoming.
Summary of the invention
The object of the invention is to provide a kind of sorting technique of instrument signal, can effectively overcome the shortcoming existing in prior art.
The present invention realizes like this, by employing, comprise that phase space reconfiguration module, principal component analysis (PCA) module, characteristic extracting module and flexible Neural Tree module carry out feature extraction to various instrument signal, then through sorter, classify, it is characterized in that implementation step is:
The first step, the musical instrument sample signal collecting is carried out to phase space reconfiguration, mainly determine the time delay of sample signal and embed dimension, comprise the following steps:
If the One-dimension Time Series of certain instrument signal sample is x=(x 1, x 2..., x k) t, be made as τ time delay, set up the embedded space Y of a m dimension, x is mapped in this embedded space, the vector of the phase space of reconstruct is:
Y = ( y 1 , y 2 , . . . , y n ) = x ( 1 ) x ( 2 ) · · · x ( n ) x ( 1 + τ ) x ( 2 + τ ) · · · x ( n + τ ) · · · · · · · · · x ( 1 + ( m - 1 ) τ ) x ( 2 + ( m - 1 ) τ ) · · · x ( n + ( m - 1 ) τ ) - - - ( 1 )
Wherein, K is the length of audio frequency time series x; The number n=K-of phase vector (m-1) τ in phase space (n=1,2 ..., N).
(1), Average Mutual method is determined optimum delay time τ
Choose Average Mutual method and determine best delay time T:
I ( τ ) = Σ n = 1 N P ( x n , x n + τ ) log 2 P ( x n , x n + τ ) P ( x n ) P ( x n + τ ) - - - ( 2 )
P (x wherein n, x n+ τ) be the joint distribution probability in restructuring graph, P (x n), P (x n+ τ) be marginal distribution probability.Choose the τ of first local minimum of I (τ) as optimum delay time.
(2), false Neighbor Points method is determined embedding dimension m
In m dimension phase space, to phase vector y arbitrarily n(i)=x (i), x (i+ τ) ..., x[i+ (m-1) τ] } have and face recently a y in certain distance n nN(i), and distance between two points be:
D m(i)=||y n(i)-y n NN(i)|| (3)
D m(i) be a less amount, when the embedding dimension of phase space is increased to m+1 dimension from m dimension, the distance of these two phase points will change.If neighbor distance is D in this case m+1(i), have:
D 2 m+1(i)=D 2 m(i)+||x(i+τm)-x NN(i+τm)|| (4)
By providing certain threshold value, judge which point is false Neighbor Points, order
ΔD m ( i , m ) = D m + 1 2 ( i ) - D m 2 ( i ) D m 2 ( i ) = | x ( i + τm ) - x NN ( i + τm ) | D m ( i ) - - - ( 5 )
Δ D wherein mwhat (i, m) represented m dimension faces the distance a little variable quantity while tieing up with respect to m+1 recently.
If Δ D m(i, m)>=10%, illustrates that Neighbor Points is false.Whether each phase vector in m-dimensional space is differentiated to its Neighbor Points is successively false, calculates the ratio that false Neighbor Points accounts for whole phase vectors.When the ratio of false Neighbor Points is less than 5%, think that trajectory of phase space opens completely, m is now the best dimension that embeds.
(3), phase space reconfiguration
Determine delay time T and embed after dimension m, can be by original One-dimension Time Series x=(x 1, x 2..., x k) tdelay reconstruction obtains embedding vector x m=(x t, x t-τ..., x t-(m-1) τ) t, these vectors have been determined a point on the embedded space Y of m dimension, each retardation of different time points just produced a series of such points in m-dimensional space, thereby formed the phase space reconfiguration matrix A that reflects m * n rank of primal system information;
Second step, the musical instrument sample signal after reconstruct is carried out to principal component analysis (PCA), eliminate redundancy information, reaches the effect of dimensionality reduction:
R=E (AA t), RV=V Λ, wherein A is the m * n rank matrix obtaining after phase space reconfiguration, R is the autocorrelation matrix of m component, m * m rank eigenvectors matrix that V is R, its column vector is the proper vector of the orthonomalization of R; Λ is the feature diagonal matrix of R, λ i(i=1,2 ..., m) be the element on i diagonal line; M incoherent new variables Y=V of structure ta, Y={y 1, y 2..., y m} t, to λ i(i=1,2 ..., m) by after descending sort, get p the corresponding proper vector of larger eigenwert above, obtain p * n rank vector matrix B, wherein p>=3 after dimensionality reduction;
The 3rd step, principal component analysis (PCA) is processed after resulting matrix B carry out feature extraction, it is main that what extract is the eigenwert of probability density function:
We suppose the region G of sample data be one centered by x, the minimum cube that the length of side is h, first defines a kernel function:
Required probability density function is:
p ( x ) = 1 N Σ n = 1 N 1 h m k ( x - x n h ) - - - ( 7 )
Wherein, N is sample point number, n=1, and 2 ..., N, m is data dimension, k (u) is the kernel function of definition.Obtain thus the matrix on m * n rank, after it is sampled line by line, form the characteristic set H on m * c rank, wherein c<n.
The 4th step, adopt flexible Neural Tree model as the sorter of musical instrument identification module, carry out neural network training:
The calculation process adopting in flexible Neural Tree model is as follows: first set the initial configuration of flexible Neural Tree, then according to the initial configuration optimization of the characteristic set H obtaining and setting and train a fuzzy classification model.The instruction that comprises two types in this model: flexible neuron instruction and terminal instruction.Flexible neuron instruction connects its subtree for the non-leaf node of tree construction, and terminal instruction is each input feature vector.Function instruction set and terminal instruction set are expressed as follows:
S=F∪T={+ 2,+ 3,...,+ N}∪{z 1,z 2,...z n} (8)
Wherein+ i(2,3 ..., N) represent non-leaf node instruction, this non-leaf node has i input, has i input variable; z 1, z 2... z nrepresent leaf node instruction, this leaf node is without input variable.
Adopt Gaussian function as flexible excitation function:
f ( j i , b i , z ) = e - ( z - j i b i ) 2 - - - ( 9 )
Wherein, j iand b iit is the random adjustable parameter producing in the constructive process of flexible Neural Tree.Fitness function is the problem that another need to be in advance definite except instruction set, selects root-mean-square error as fitness function.
Advantage of the present invention and good effect: the present invention adopts a kind of new musical instrument classification to classify to instrument signal, the method is carried out phase space reconfiguration by the time series that different musical instruments are produced, and discloses the particular attribute of every kind of musical instrument; By principal component analysis (PCA), remove redundant information, reach dimensionality reduction object; By analyzing various musical instrument characteristics, adopt probability density function to portray the difference of each musical instrument in phase space; Finally, adopt flexible Neural Tree model to classify as sorter, it has effectively solved artificial neural network structure's high dependency problem.The application's parameter choose and operating process simple, experimental result can make the classification accuracy of single musical instrument be up to 98.7%, and the root-mean-square error of this sorter is minimum reaches 0.045570.
Accompanying drawing explanation
Fig. 1 is the invention process process flow diagram;
Fig. 2 and Fig. 3 are the time domain waveform figure of the experiment signal that adopts, the sample signal x that Fig. 2 is circular horn 1-1time domain waveform figure, the sample signal x that Fig. 3 is piano 2-1time domain waveform figure;
Fig. 4 and Fig. 5 adopt the definite circular horn sample x of mutual information method 1-1with piano sample x 2-1delay time T;
Fig. 6 and Fig. 7 adopt the definite circular horn sample x of false Neighbor Points method 1-1with piano sample x 2-1embedding dimension m;
Fig. 8 and Fig. 9 are respectively circular horn sample x 1-1with piano sample x 2-1phase space reconfiguration figure in three dimensions;
Figure 10 and Figure 11 are respectively circular horn sample x 1-1with piano sample x 2-1the eigenwert of probability density function;
Figure 12 is the flexible Neural Tree model finally training;
Figure 13 and Figure 14 are respectively circular horn sample x 1-1with piano sample x 2-1training sample and the prediction output of test sample book and target output.
In figure: amplitude represents amplitude, N represents sample point number, and I (τ) represents Average, and τ represents time delay, and m represents to embed dimension, Δ D mrepresent that false neighbour leads, PDF represents probability density function values, and time represents the time, Value of forecast and target represents prediction and target output, and Train data is training sample, and Test data represents test sample book, forecast represents predicted value, and target represents desired value.
Embodiment
We select Matlab7.0 and Visual Studio2008 as software platform, and programming realizes the present invention program's design.Implementation process is to choose two kinds of common instrument signal: circular horn, piano are as experimental subjects.For each instrument signal, select 10 groups of actual measurement sample datas, sample data length is 2000 points.Wherein front 8 groups as training sample, other 2 groups as test sample book.
Two kinds of one of them samples of instrument signal of take describe as example, and concrete operation step is as follows:
First sample signal is carried out to phase space reconfiguration processing, then carry out principal component analysis (PCA) and reduce redundancy, extract pivot, then enter feature extraction part, finally carry out musical instrument identification, adopt flexible Neural Tree neural network training to reach identifying purpose.
In the first step, the present embodiment, the processing concrete steps of phase space reconfiguration are as follows:
(1), optimum delay time τ's determines
Choose Average Mutual method and determine best delay time T, as shown in the formula:
I ( &tau; ) = &Sigma; n = 1 N P ( x n , x n + &tau; ) log 2 P ( x n , x n + &tau; ) P ( x n ) P ( x n + &tau; ) - - - ( 2 )
Choose the τ of first local minimum of Average I (τ) as optimum delay time.Fig. 3 is for adopting the definite circular horn sample x of the method 1-1with piano sample x 2-1delay time T, τ 1-1=3, τ 2-1=6
(2), best the determining of dimension m that embed
Choose false Neighbor Points method and determine embedding dimension m, according to following formula, judge which point is false Neighbor Points:
&Delta;D m ( i , m ) = D m + 1 2 ( i ) - D m 2 ( i ) D m 2 ( i ) = | | y n + &tau;m ( i + &tau;m ) - y n + &tau;m NN ( i + &tau;m ) | | D m ( i ) - - - ( 5 )
If Δ D m(i, m)>=10%, illustrates that Neighbor Points is false, wherein τ 1-1=3, τ 2-1=6.Calculate respectively x 1-1and x 2-1false Neighbor Points account for the ratio of whole phase vectors, when the ratio of false Neighbor Points is less than 5%, think that trajectory of phase space opens completely, m is now the best dimension that embeds.Fig. 4 is for adopting the definite circular horn sample x of the method 1-1with piano sample x 2-1embedding dimension m, m 1-1=m 2-1=5.
(3), phase space reconfiguration
Determine x 1-1and x 2-1delay time T 1-1=3, τ 2-1=6 and embed dimension m 1-1=m 2-1=5, by their time series delay reconstructions separately, obtain embedding vector x 5=(x t, x t-3, x t-6, x t-9, x t-12) T and x 5=(x t, x t-6, x t-12, x t-18, x t-24) T, these vectors have been determined respectively a point on 5 dimension embedded spaces separately, each retardation of different time points has just produced a series of such points in 5 dimension spaces, thereby has formed the phase space of two 5 dimensions of reflection primal system information.In order to see intuitively the figure after reconstruct, the three-dimensional that we choose is wherein carried out imaging, and Fig. 5 is x 1-1and x 2-1phase space reconfiguration figure in three dimensions.
The processing concrete steps of principal component analysis (PCA) in second step, the present embodiment are as follows:
R=E (AA t), RV=V Λ, wherein A is x 1-15 * 1991 matrixes that obtain after phase space reconfiguration, R is the autocorrelation matrix of 5 dimension reconstruct variablees, 5 * 5 rank eigenvectors matrixs that V is R, its column vector is the proper vector of the orthonomalization of R; Λ is the feature diagonal matrix of R, λ i(i=1,2 ..., 5) be the element on i diagonal line; Construct 5 incoherent new variables Y=V tx, Y={y 1, y 2..., y 5} t, to Y={y 1, y 2..., y 5} tby after descending sort, get 3 corresponding proper vectors of larger eigenwert above, obtain 3 * 1991 rank vector matrix B after dimensionality reduction 1;
In like manner obtain x 2-1vector matrix B after principal component analysis (PCA) dimensionality reduction 2.
In the 3rd step, the present embodiment, the processing concrete steps of feature extraction part are as follows:
The matrix B that dimensionality reduction after principal component analysis (PCA) is obtained 1and B 2, extract the eigenwert of its probability density function:
p ( x ) = 1 N &Sigma; n = 1 N 1 h m k ( x - x n h ) - - - ( 7 )
Wherein, N=1991, m=3, h=0.2.Through over-sampling, form the characteristic set H of two 3 * 10 1-1and H 2-1.Fig. 6 is circular horn sample x 1-1with piano sample x 2-1the eigenwert figure of probability density function.
In like manner draw according to the method described above the characteristic set H of all the other 9 groups of sample datas of circular horn and piano 1-iand H 2-i, i=2 ..., 10.
In the 4th step, the present embodiment, adopt flexible Neural Tree to identify musical instrument as sorter, mainly on the software platform of VisualStudio2008, implement, concrete steps are as follows:
Using the circular horn data H as training sample 1-i(i=1 ..., 8) characteristic set be successively placed in the file of train.txt, all the other two groups of data are placed in the file of text.txt.First set the initial configuration of flexible Neural Tree, then according to the initial configuration optimization of the characteristic set in train.txt file and setting and train a fuzzy classification model.Then test sample book signal is input in the flexible Neural Tree model training and carries out performance verification.In like manner, to piano data H 2-i(i=1 ..., 8) carry out training and testing.
Figure 7 shows that the flexible Neural Tree model finally training.Wherein, function instruction set is F={+ 2,+ 3,+ 4, terminal instruction set is: T={x 1, x 2, x 3, x 4, x 5, x 6, x 7.Choose root-mean-square error RMSE=0.1 and judge that as threshold value whether classification results is correct, when the RMSE value of output is greater than 0.1, be regarded as the musical instrument that is identified as other; Otherwise, be considered as identification correct.The discrimination of i kind musical instrument is as follows:
R i = n N &times; 100 %
Wherein, i is sample class, and n is the number of samples that is identified as i kind musical instrument, and N is the total sample number of i kind musical instrument.The present embodiment effect is, circular horn sample x 1-1with piano sample x 2-1discrimination be respectively 84.9% and 98.7%, average root-mean-square error be respectively 0.088685 and 0.045570, Fig. 8 be circular horn sample x 1-1with piano sample x 2-1training sample and the prediction output of test sample book and target output comparison diagram, as can be seen from the figure the prediction output of test sample book and target output are basically identical, although have part to occur to depart from, very approaching generally.The classification that experiment shows to the present invention is directed to instrument signal is very effective.

Claims (1)

1. the sorting technique of an instrument signal, by employing, comprise that phase space reconfiguration module, principal component analysis (PCA) module, characteristic extracting module and flexible Neural Tree module carry out feature extraction to various instrument signal, then through sorter, classify, it is characterized in that implementation step is:
The first step, the musical instrument sample signal collecting is carried out to phase space reconfiguration, mainly determine the time delay of sample signal and embed dimension, comprise the following steps:
If the One-dimension Time Series of certain instrument signal sample is x=(x 1, x 2..., x k) t, be made as τ time delay, set up the embedded space Y of a m dimension, x is mapped in this embedded space, the vector of the phase space of reconstruct is:
Y = ( y 1 , y 2 , . . . , y n ) = x ( 1 ) x ( 2 ) &CenterDot; &CenterDot; &CenterDot; x ( n ) x ( 1 + &tau; ) x ( 2 + &tau; ) &CenterDot; &CenterDot; &CenterDot; x ( n + &tau; ) &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; x ( 1 + ( m - 1 ) &tau; ) x ( 2 + ( m - 1 ) &tau; ) &CenterDot; &CenterDot; &CenterDot; x ( n + ( m - 1 ) &tau; ) - - - ( 1 )
Wherein, K is the length of audio frequency time series x, the number n=K-of phase vector (m-1) τ in phase space (n=1,2 ..., N), (1), Average Mutual method are determined optimum delay time τ
Choose Average Mutual method and determine best delay time T:
I ( &tau; ) = &Sigma; n = 1 N P ( x n , x n + &tau; ) log 2 P ( x n , x n + &tau; ) P ( x n ) P ( x n + &tau; ) - - - ( 2 )
P (x wherein n, x n+ τ) be the joint distribution probability in restructuring graph, P (x n), P (x n+ τ) be marginal distribution probability.Choose the τ of first local minimum of I (τ) as optimum delay time;
(2), false Neighbor Points method is determined embedding dimension m
In m dimension phase space, to phase vector y arbitrarily n(i)=x (i), x (i+ τ) ..., x[i+ (m-1) τ] } have and face recently a y in certain distance n nN(i), and distance between two points be:
D m(i)=||y n(i)-y n NN(i)|| (3)
D m(i) be a less amount, when the embedding dimension of phase space is increased to m+1 dimension from m dimension, the distance of these two phase points will change; If neighbor distance is D in this case m+1(i), have:
D 2 m+1(i)=D 2 m(i)+||x(i+τm)-x NN(i+τm)|| (4)
By providing certain threshold value, judge which point is false Neighbor Points, order
&Delta;D m ( i , m ) = D m + 1 2 ( i ) - D m 2 ( i ) D m 2 ( i ) = | x ( i + &tau;m ) - x NN ( i + &tau;m ) | D m ( i ) - - - ( 5 )
Δ D wherein mwhat (i, m) represented m dimension faces the distance a little variable quantity while tieing up with respect to m+1 recently,
If Δ D m(i; m)>=10%; illustrate that Neighbor Points is false; whether each phase vector in m-dimensional space is differentiated to its Neighbor Points is successively false; calculate the ratio that false Neighbor Points accounts for whole phase vectors; when the ratio of false Neighbor Points is less than 5%, think that trajectory of phase space opens completely, m is now the best dimension that embeds;
(3), phase space reconfiguration
Determine delay time T and embed after dimension m, can be by original One-dimension Time Series x=(x 1, x 2..., x k) tdelay reconstruction obtains embedding vector x m=(x t, x t-τ..., x t-(m-1) τ) t, these vectors have been determined a point on the embedded space Y of m dimension, each retardation of different time points just produced a series of such points in m-dimensional space, thereby formed the phase space reconfiguration matrix A that reflects m * n rank of primal system information;
Second step, the musical instrument sample signal after reconstruct is carried out to principal component analysis (PCA), eliminate redundancy information, reaches the effect of dimensionality reduction:
R=E (AA t), RV=V Λ, wherein A is the m * n rank matrix obtaining after phase space reconfiguration, R is the autocorrelation matrix of m component, m * m rank eigenvectors matrix that V is R, its column vector is the proper vector of the orthonomalization of R, the feature diagonal matrix that Λ is R, λ i(i=1,2 ..., m) be the element on i diagonal line, m incoherent new variables Y=V of structure ta, Y={y 1, y 2..., y m} t, to λ i(i=1,2 ..., m) by after descending sort, get p the corresponding proper vector of larger eigenwert above, obtain p * n rank vector matrix B, wherein p>=3 after dimensionality reduction;
The 3rd step, principal component analysis (PCA) is processed after resulting matrix B carry out feature extraction, it is main that what extract is the eigenwert of probability density function:
We suppose the region G of sample data be one centered by x, the minimum cube that the length of side is h, first defines a kernel function:
Figure FDA0000454777920000031
Required probability density function is:
p ( x ) = 1 N &Sigma; n = 1 N 1 h m k ( x - x n h ) - - - ( 7 )
Wherein, N is sample point number, n=1, and 2 ..., N, m is data dimension, k (u) is the kernel function of definition, obtains thus the matrix on m * n rank, after it is sampled line by line, forms the characteristic set H on m * c rank, wherein c<n;
The 4th step, adopt flexible Neural Tree model as the sorter of musical instrument identification module, carry out neural network training:
The calculation process adopting in flexible Neural Tree model is as follows: first set the initial configuration of flexible Neural Tree, then according to the initial configuration optimization of the characteristic set H obtaining and setting and train a fuzzy classification model; The instruction that comprises two types in this model: flexible neuron instruction and terminal instruction, flexible neuron instruction connects its subtree for the non-leaf node of tree construction, and terminal instruction is each input feature vector, and function instruction set and terminal instruction set are expressed as follows:
S=F∪T={+ 2,+ 3,...,+ N}∪{z 1,z 2,...z n} (8)
Wherein+ i(2,3 ..., N) represent non-leaf node instruction, this non-leaf node has i input, has i input variable; z 1, z 2... z nrepresent leaf node instruction, this leaf node is without input variable,
Adopt Gaussian function as flexible excitation function:
f ( j i , b i , z ) = e - ( z - j i b i ) 2 - - - ( 9 )
Wherein, j iand b ibe the random adjustable parameter producing in the constructive process of flexible Neural Tree, fitness function is the problem that another need to be in advance definite except instruction set, selects root-mean-square error as fitness function.
CN201410008533.4A 2014-01-09 2014-01-09 A kind of sorting technique of instrument signal Expired - Fee Related CN103761965B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410008533.4A CN103761965B (en) 2014-01-09 2014-01-09 A kind of sorting technique of instrument signal

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410008533.4A CN103761965B (en) 2014-01-09 2014-01-09 A kind of sorting technique of instrument signal

Publications (2)

Publication Number Publication Date
CN103761965A true CN103761965A (en) 2014-04-30
CN103761965B CN103761965B (en) 2016-05-25

Family

ID=50529193

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410008533.4A Expired - Fee Related CN103761965B (en) 2014-01-09 2014-01-09 A kind of sorting technique of instrument signal

Country Status (1)

Country Link
CN (1) CN103761965B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105070301A (en) * 2015-07-14 2015-11-18 福州大学 Multiple specific musical instrument strengthening separation method in single-channel music human voice separation
CN105096987A (en) * 2015-06-01 2015-11-25 努比亚技术有限公司 Audio data processing method and terminal
CN110310666A (en) * 2019-06-27 2019-10-08 成都嗨翻屋科技有限公司 A kind of New Method for Instrument Recognition and system based on SE convolutional network
CN110657882A (en) * 2019-09-23 2020-01-07 暨南大学 Bridge real-time safety state monitoring method utilizing single-measuring-point response
WO2020006898A1 (en) * 2018-07-05 2020-01-09 平安科技(深圳)有限公司 Method and device for recognizing audio data of instrument, electronic apparatus, and storage medium
CN111461201A (en) * 2020-03-30 2020-07-28 重庆大学 Sensor data classification method based on phase space reconstruction
CN111681674A (en) * 2020-06-01 2020-09-18 中国人民大学 Method and system for identifying musical instrument types based on naive Bayes model

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110071665A (en) * 2009-12-21 2011-06-29 세종대학교산학협력단 System and method for recognizing instrument to classify signal source

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20110071665A (en) * 2009-12-21 2011-06-29 세종대학교산학협력단 System and method for recognizing instrument to classify signal source

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
MIZUKI IHARA ETC: "Instrument Identification in Monophonic Music Using Spectral Information", 《2007 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY》, 31 December 2007 (2007-12-31), pages 595 - 599, XP031234206 *
NORHALINA SENANN ETC: "A study on traditional Malay musical instruments sounds classification system", 《PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES》, 31 December 2009 (2009-12-31), pages 729 - 733 *
杭小兵: "基于音频的数字媒体中乐器识别方法研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》, no. 1, 15 December 2011 (2011-12-15) *
林玉志: "基于声学特征的乐器识别研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》, no. 1, 15 January 2013 (2013-01-15) *
邓见光 等: "基于声学特征的乐器识别综述", 《东莞理工学院学报》, vol. 19, no. 3, 30 June 2012 (2012-06-30), pages 58 - 64 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105096987A (en) * 2015-06-01 2015-11-25 努比亚技术有限公司 Audio data processing method and terminal
CN105070301A (en) * 2015-07-14 2015-11-18 福州大学 Multiple specific musical instrument strengthening separation method in single-channel music human voice separation
CN105070301B (en) * 2015-07-14 2018-11-27 福州大学 A variety of particular instrument idetified separation methods in the separation of single channel music voice
WO2020006898A1 (en) * 2018-07-05 2020-01-09 平安科技(深圳)有限公司 Method and device for recognizing audio data of instrument, electronic apparatus, and storage medium
CN110310666A (en) * 2019-06-27 2019-10-08 成都嗨翻屋科技有限公司 A kind of New Method for Instrument Recognition and system based on SE convolutional network
CN110310666B (en) * 2019-06-27 2021-07-23 成都潜在人工智能科技有限公司 Musical instrument identification method and system based on SE convolutional network
CN110657882A (en) * 2019-09-23 2020-01-07 暨南大学 Bridge real-time safety state monitoring method utilizing single-measuring-point response
CN110657882B (en) * 2019-09-23 2021-07-27 暨南大学 Bridge real-time safety state monitoring method utilizing single-measuring-point response
CN111461201A (en) * 2020-03-30 2020-07-28 重庆大学 Sensor data classification method based on phase space reconstruction
CN111461201B (en) * 2020-03-30 2023-09-19 重庆大学 Sensor data classification method based on phase space reconstruction
CN111681674A (en) * 2020-06-01 2020-09-18 中国人民大学 Method and system for identifying musical instrument types based on naive Bayes model
CN111681674B (en) * 2020-06-01 2024-03-08 中国人民大学 Musical instrument type identification method and system based on naive Bayesian model

Also Published As

Publication number Publication date
CN103761965B (en) 2016-05-25

Similar Documents

Publication Publication Date Title
CN103761965B (en) A kind of sorting technique of instrument signal
Lidy et al. CQT-based Convolutional Neural Networks for Audio Scene Classification.
CN112562741B (en) Singing voice detection method based on dot product self-attention convolution neural network
CN111444967A (en) Training method, generation method, device, equipment and medium for generating confrontation network
CN105760888A (en) Neighborhood rough set ensemble learning method based on attribute clustering
Massoudi et al. Urban sound classification using CNN
CN105609116A (en) Speech emotional dimensions region automatic recognition method
CN111462761A (en) Voiceprint data generation method and device, computer device and storage medium
Benamer et al. Database for arabic speech commands recognition
Bubashait et al. Urban sound classification using DNN, CNN & LSTM a comparative approach
Ji et al. LBP-based bird sound classification using improved feature selection algorithm
Thwe et al. Environmental sound classification based on time-frequency representation
Liu et al. Birdsong classification based on multi feature channel fusion
CN112750442B (en) Crested mill population ecological system monitoring system with wavelet transformation and method thereof
Park Classification of audio signals using Fuzzy c-Means with divergence-based Kernel
Krey et al. Music and timbre segmentation by recursive constrained K-means clustering
CN105006231A (en) Distributed large population speaker recognition method based on fuzzy clustering decision tree
Virtanen et al. Proceedings of the Detection and Classification of Acoustic Scenes and Events 2017 Workshop (DCASE2017)
Islam et al. DCNN-LSTM based audio classification combining multiple feature engineering and data augmentation techniques
Yang et al. Sound event detection in real-life audio using joint spectral and temporal features
Chuchra et al. A deep learning approach for splicing detection in digital audios
Hu et al. A lightweight multi-sensory field-based dual-feature fusion residual network for bird song recognition
Joshi et al. Comparative study of Mfcc and Mel spectrogram for raga classification using CNN
Xie et al. Acoustic features for multi-level classification of Australian frogs
Anderson Musical instrument classification utilizing a neural network

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20160525

Termination date: 20190109