CN108921233A - Raman spectrum data classification method based on an autoencoder network - Google Patents

Raman spectrum data classification method based on an autoencoder network

Info

Publication number
CN108921233A
Authority
CN
China
Prior art keywords
layer
characteristic
autoencoder
network
classification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810857856.9A
Other languages
Chinese (zh)
Other versions
CN108921233B (en)
Inventor
雷俊锋
董宇轩
沈爱国
周景龙
肖进胜
杨天
邹文涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhan University WHU
Original Assignee
Wuhan University WHU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhan University WHU filed Critical Wuhan University WHU
Priority to CN201810857856.9A priority Critical patent/CN108921233B/en
Publication of CN108921233A publication Critical patent/CN108921233A/en
Application granted granted Critical
Publication of CN108921233B publication Critical patent/CN108921233B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 - Pattern recognition
    • G06F18/20 - Analysing
    • G06F18/24 - Classification techniques
    • G06F18/241 - Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06N - COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00 - Computing arrangements based on biological models
    • G06N3/02 - Neural networks
    • G06N3/04 - Architecture, e.g. interconnection topology
    • G06N3/045 - Combinations of networks

Abstract

The invention proposes a Raman spectrum data classification method based on an autoencoder network. Raman spectra of Alzheimer's disease platelets at different disease ages are extracted with an optical-tweezers Raman system, and the preprocessed spectra are randomly divided into a training set and a test set. With the training-set samples as input data, a stacked sparse autoencoder network with two feature layers is constructed by layer-wise greedy training. The deep features of the second autoencoder serve as input data to train a Softmax classifier, and the trained classifier, as a Softmax classification layer, replaces the output layer of the stacked sparse autoencoder network with two feature layers. A preliminary classification network is constructed from the stacked sparse autoencoder network and optimized by back-propagation to obtain the optimized classification network. With the test set as input data of the optimized classification network, the neuron counts of the feature layers are optimized to obtain the neuron-count-optimized classification network. The invention improves the accuracy and stability of classification.

Description

Raman spectrum data classification method based on an autoencoder network
Technical field
The invention belongs to the technical field of artificial intelligence applications and in particular relates to a Raman spectrum data classification method based on an autoencoder network.
Background technique
Alzheimer's disease (AD) is one of the most common types of senile dementia. It is a degenerative disorder of the central nervous system and a chronic progressive disease that damages the brain; its clinical symptoms include cognitive dysfunction, memory impairment, and language disorders, which directly affect patients' normal lives. According to 2006 data, there were already more than 200 million AD patients worldwide. Moreover, as society ages, the number of AD patients keeps growing, and Alzheimer's disease has become a disease that society as a whole must face. Each AD patient places a heavy burden on one or several families and, through the associated medical care, indirectly affects every aspect of society; the treatment of this disease has therefore drawn broad public attention. Up to now, however, owing to insufficient understanding of the pathogenesis of AD, diagnosis and treatment still rely mainly on the attending physician's clinical experience, psychological tests, and medical imaging such as computed tomography (CT) and nuclear magnetic resonance imaging (NMR). These diagnostic methods are largely subjective, misdiagnosis and missed diagnosis happen from time to time, and they cannot identify early-stage AD patients. Clinical practice therefore urgently needs an objective and accurate basis for diagnosing AD, so that AD can be identified and diagnosed at different stages of the disease.
Raman spectroscopy of single cells was first reported by Puppels in 1990, and Raman spectroscopy has since been applied more and more in disease diagnosis. Being non-destructive and non-invasive, it is well suited to probing biological samples and obtaining valid data. Combined with classification algorithms and statistical analysis methods such as partial least squares, linear discriminant analysis, and support vector machines, Raman spectroscopy can be used to analyze and identify the distinguishing features of platelets at different disease ages of Alzheimer's disease.
In biology, Raman-spectroscopic studies of cytopathy are often carried out by biologists or chemists, who generally analyze and process the data with tools such as SPSS. Traditional Raman spectrum classification methods, such as principal component analysis and partial least squares, transform the raw data into a mapping onto a feature space and seek a suitable feature representation in the new feature space to achieve classification; such methods are often computationally complex and classify poorly. Shallow learning methods, such as support vector machines and linear regression analysis, instead derive a decision boundary from the labeled input data, but when some training samples differ greatly from the others, the decision boundary of the classification is strongly affected.
Summary of the invention
To solve the above problems, the invention proposes a Raman spectrum data classification method based on an autoencoder network. Building on stacked sparse autoencoders combined with a Softmax classifier, the method analyzes and identifies Raman spectra of Alzheimer's disease platelets at different disease ages.
The solution of the present invention is a Raman spectrum data classification method based on an autoencoder network, characterized by comprising the following steps:
Step 1: extract Raman spectra of Alzheimer's disease platelets at different disease ages with an optical-tweezers Raman system, apply background subtraction, baseline correction, smoothing, and averaging to the spectra, and randomly divide the processed spectra into a training set and a test set;
Step 2: take the training-set samples as input data of the first autoencoder and obtain each layer's weight coefficients and bias vectors by layer-wise greedy training; take the deep features of the first autoencoder as input data of the second autoencoder and obtain each layer's weight coefficients and bias vectors by layer-wise greedy training; construct the stacked sparse autoencoder network with two feature layers from the first and second autoencoders;
Step 3: take the deep features of the second autoencoder of step 2 as input data to train a Softmax classifier, solve the classifier by gradient descent, and use the trained Softmax classifier as a Softmax classification layer replacing the output layer of the stacked sparse autoencoder network with two feature layers;
Step 4: construct a preliminary classification network from the first feature layer, second feature layer, and Softmax classification layer of the stacked sparse autoencoder network, and optimize it by back-propagation to obtain the optimized classification network;
Step 5: take the test set as input data of the optimized classification network, construct the neuron-count-optimized first feature layer and the neuron-count-optimized second feature layer, and obtain the neuron-count-optimized classification network from these two layers together with the Softmax classification layer of the optimized classification network.
Preferably, the training set of step 1 contains m samples and the test set contains K samples.
Preferably, taking the training-set samples as input data of the first autoencoder in step 2 means forming the first input layer from the training-set samples $x^{(i)}$ of step 1, where $i \in [1, m]$ and $x^{(i)}$ is a vector of feature dimension $s_1$.
Obtaining each layer's weight coefficients and bias vectors of the first autoencoder by layer-wise greedy training in step 2 proceeds as follows:
Encoding $x^{(i)}$ yields the deep feature $y^{(i)}$ of the first autoencoder's feature layer:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the weight coefficient of the first autoencoder's feature layer, $b_1$ is its bias vector matrix, and $s(\cdot)$ is the activation function, here the sigmoid function.
Decoding the deep feature $y^{(i)}$ yields the reconstruction vector of the first autoencoder's output layer:
$$\hat{x}^{(i)} = s\left(W_1' y^{(i)} + b_1'\right)$$
where $W_1'$ is the weight coefficient of the first autoencoder's output layer, $b_1'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The cross-entropy cost model of the first autoencoder is constructed as:
$$J(W, b) = -\frac{1}{m} \sum_{i=1}^{m} \left[ x^{(i)} \log \hat{x}^{(i)} + \left(1 - x^{(i)}\right) \log\left(1 - \hat{x}^{(i)}\right) \right]$$
where m is the number of samples in the training set of step 1, W is the weight coefficient of the autoencoder network model, and b is the bias vector matrix of the autoencoder network model.
In the first autoencoder's feature layer, a sparsity constraint is imposed on each neuron to obtain a sparse representation. The average activation of neuron j is:
$$\hat{\rho}_j = \frac{1}{m} \sum_{i=1}^{m} f_\theta\left(x^{(i)}\right)$$
where $f_\theta(x^{(i)})$ is the activation of neuron j when the input is sample $x^{(i)}$. With the average-activation coefficient defined as ρ, the relative entropy between the average activation $\hat{\rho}_j$ and ρ is:
$$\mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right) = \rho \log \frac{\rho}{\hat{\rho}_j} + (1 - \rho) \log \frac{1 - \rho}{1 - \hat{\rho}_j}$$
The sparse-autoencoder loss function of the first autoencoder is:
$$L_{\mathrm{sparse}}(W, b) = J(W, b) + \beta \sum_{j=1}^{s_2} \mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right)$$
where $s_2$ is the number of neurons in the first autoencoder's feature layer, j is the neuron index, and β is the sparsity coefficient.
Minimizing this sparse-autoencoder loss function by layer-wise greedy training yields the first autoencoder's feature-layer weight coefficient $W_1$ and feature-layer bias vector matrix $b_1$, together with its output-layer weight coefficient $W_1'$ and output-layer bias vector matrix $b_1'$.
Taking the deep features of the first autoencoder as input data of the second autoencoder in step 2 and obtaining each layer's weight coefficients and bias vectors by layer-wise greedy training proceeds as follows:
After layer-wise greedy training, the deep features $y^{(i)}$ of the first autoencoder's feature layer serve as input data forming the input layer of the second autoencoder; encoding yields the deep features of the second autoencoder's feature layer:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the weight coefficient of the second autoencoder's feature layer, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
Decoding the deep feature $y'^{(i)}$ yields the reconstruction vector of the second autoencoder's output layer:
$$\hat{y}^{(i)} = s\left(W_2' y'^{(i)} + b_2'\right)$$
where $W_2'$ is the weight coefficient of the second autoencoder's output layer, $b_2'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The sparse-autoencoder loss function of the second autoencoder is constructed as in step 2, with $s_3$ the number of neurons in the second autoencoder's feature layer. Minimizing it by layer-wise greedy training yields the second autoencoder's feature-layer weight coefficient $W_2$ and feature-layer bias vector matrix $b_2$, together with its output-layer weight coefficient $W_2'$ and output-layer bias vector matrix $b_2'$.
Constructing the stacked sparse autoencoder network with two feature layers from the first and second autoencoders in step 2 proceeds as follows:
After layer-wise greedy training, the input layer of the first autoencoder serves as the input layer of the stacked sparse autoencoder network, the feature layer of the first autoencoder as its first feature layer, the feature layer of the second autoencoder as its second feature layer, and the output layer of the second autoencoder as its output layer, yielding the stacked sparse autoencoder network with two feature layers.
Preferably, in step 3 the deep features $y'^{(i)}$ of the second autoencoder of step 2 serve as input data to train the Softmax classification layer. For a k-class problem, a k-dimensional vector holding the k estimated class probabilities must be output; the hypothesis function is:
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
Solving the Softmax classifier by gradient descent in step 3 proceeds as follows:
Softmax regression is performed on the input data $y'^{(i)}$, and a Softmax regression cost-function model is established:
$$J(\theta) = -\frac{1}{m} \left[ \sum_{i=1}^{m} \sum_{j=1}^{k} 1\left\{z^{(i)} = j\right\} \log \frac{e^{\theta_j^{T} y'^{(i)}}}{\sum_{l=1}^{k} e^{\theta_l^{T} y'^{(i)}}} \right] + \frac{\gamma}{2} \sum_{j=1}^{k} \sum_{q=0}^{n} \theta_{jq}^{2}$$
where m is the number of samples in the training set of step 1, i is the sample index, k is the number of classes, j is the class index, $z^{(i)}$ is the class label of sample i, $1\{\cdot\}$ is the indicator function, θ is the full parameter set of the model (the subscript indexing a specific neuron of the Softmax classification layer), n is the feature dimension of $y'^{(i)}$, and γ is the weight-decay coefficient.
Differentiating the cost function gives:
$$\nabla_{\theta_j} J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y'^{(i)} \left( 1\left\{z^{(i)} = j\right\} - p\left(z^{(i)} = j \mid y'^{(i)}; \theta\right) \right) \right] + \gamma \theta_j$$
with the same symbols as above.
Taking the minimization of J(θ) as the optimization objective and solving by gradient descent yields the trained Softmax classifier.
Preferably, the first feature layer of the stacked sparse autoencoder network with two feature layers in step 4 is:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the first feature layer's weight coefficient, $b_1$ is the first feature layer's bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The second feature layer of the stacked sparse autoencoder network with two feature layers is:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the second feature layer's weight coefficient, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The Softmax classification layer of the stacked sparse autoencoder network with two feature layers is, for a k-class problem, the layer outputting the k-dimensional vector of the k estimated probabilities through the hypothesis function
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
Constructing the preliminary classification network in step 4 means taking the first feature layer of the stacked sparse autoencoder network as the first feature layer of the preliminary classification network, its second feature layer as the second feature layer of the preliminary classification network, and its Softmax classification layer as the Softmax classification layer of the preliminary classification network.
Optimizing the preliminary classification network by back-propagation in step 4 yields the optimized classification network:
The first feature layer of the optimized classification network is:
$$y^{(i)} = s\left(\tilde{W}_1 x^{(i)} + \tilde{b}_1\right)$$
where $\tilde{W}_1$ and $\tilde{b}_1$ are the optimized first feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The second feature layer of the optimized classification network is:
$$y'^{(i)} = s\left(\tilde{W}_2 y^{(i)} + \tilde{b}_2\right)$$
where $\tilde{W}_2$ and $\tilde{b}_2$ are the optimized second feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The Softmax classification layer of the optimized classification network is:
$$h_{\tilde{\theta}}\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\tilde{\theta}_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\tilde{\theta}_1^{T} y'^{(i)}} \\ \vdots \\ e^{\tilde{\theta}_k^{T} y'^{(i)}} \end{bmatrix}$$
where $\tilde{\theta}$, the full parameter set of the optimized model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
Preferably, constructing the neuron-count-optimized first feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the optimized classification network to obtain classification results; the neuron count of the first feature layer of the optimized classification network is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the first feature layer of the optimized classification network, yielding the neuron-count-optimized first feature layer.
Constructing the neuron-count-optimized second feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the classification network whose first-feature-layer neuron count has been optimized; the neuron count of the second feature layer is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the second feature layer, yielding the neuron-count-optimized second feature layer.
Compared with the prior art, the invention has the advantage of improving the accuracy and stability of classification.
Detailed description of the invention
Fig. 1: flow chart of the method of the invention;
Fig. 2: Raman spectra of platelets of Alzheimer's-disease model mice at different disease ages;
Fig. 3: training schematic of the first layer of the two-layer stacked autoencoder network;
Fig. 4: training schematic of the second layer of the two-layer stacked autoencoder network;
Fig. 5: schematic of the Softmax classification layer;
Fig. 6: network structure of the two-layer stacked autoencoder network combined with the Softmax classifier.
Specific embodiment
To help those of ordinary skill in the art understand and implement the present invention, it is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the embodiments described here serve only to illustrate and explain the present invention and are not intended to limit it.
Embodiments of the present invention are introduced below with reference to Figs. 1 to 6; they specifically comprise the following steps:
Step 1: extract Raman spectra of Alzheimer's disease platelets at different disease ages with an optical-tweezers Raman system, apply background subtraction, baseline correction, smoothing, and averaging to the spectra, and randomly divide the processed spectra into a training set and a test set; a sketch of such preprocessing follows below.
The training set of step 1 contains m = 328 samples and the test set contains K = 40 samples.
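Step 1 names the preprocessing operations but not the algorithms behind them. The sketch below is one plausible Python reading: polynomial baseline correction, Savitzky-Golay smoothing, and min-max scaling; the function name, polynomial degree, and filter window are illustrative assumptions (background subtraction and averaging of repeated acquisitions are assumed already done upstream).

```python
import numpy as np
from scipy.signal import savgol_filter

def preprocess(spectra, wavenumbers):
    """Step-1 style preprocessing sketch (assumed algorithms, not the patent's own).

    spectra: (n_samples, n_points) background-subtracted, averaged Raman intensities;
    wavenumbers: (n_points,) Raman shift axis.
    """
    out = []
    for s in spectra:
        # Baseline correction: remove a low-order polynomial trend.
        baseline = np.polyval(np.polyfit(wavenumbers, s, deg=3), wavenumbers)
        # Smoothing: Savitzky-Golay filtering preserves Raman peak shapes.
        out.append(savgol_filter(s - baseline, window_length=11, polyorder=3))
    out = np.asarray(out)
    # Scale to [0, 1] so the spectra can feed a sigmoid autoencoder directly.
    return (out - out.min()) / (out.max() - out.min())
```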
Step 2: take the training-set samples as input data of the first autoencoder and obtain each layer's weight coefficients and bias vectors by layer-wise greedy training; take the deep features of the first autoencoder as input data of the second autoencoder and obtain each layer's weight coefficients and bias vectors by layer-wise greedy training; construct the stacked sparse autoencoder network with two feature layers from the first and second autoencoders;
Taking the training-set samples as input data of the first autoencoder in step 2 means forming the first input layer from the training-set samples $x^{(i)}$ of step 1, where $i \in [1, 328]$ and $x^{(i)}$ is a vector of feature dimension $s_1 = 440$.
Obtaining each layer's weight coefficients and bias vectors of the first autoencoder by layer-wise greedy training in step 2 proceeds as follows:
Encoding $x^{(i)}$ yields the deep feature $y^{(i)}$ of the first autoencoder's feature layer:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the weight coefficient of the first autoencoder's feature layer, $b_1$ is its bias vector matrix, and $s(\cdot)$ is the activation function, here the sigmoid function.
Decoding the deep feature $y^{(i)}$ yields the reconstruction vector of the first autoencoder's output layer:
$$\hat{x}^{(i)} = s\left(W_1' y^{(i)} + b_1'\right)$$
where $W_1'$ is the weight coefficient of the first autoencoder's output layer, $b_1'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The cross-entropy cost model of the first autoencoder is constructed as:
$$J(W, b) = -\frac{1}{m} \sum_{i=1}^{m} \left[ x^{(i)} \log \hat{x}^{(i)} + \left(1 - x^{(i)}\right) \log\left(1 - \hat{x}^{(i)}\right) \right]$$
where m is the number of samples in the training set of step 1, W is the weight coefficient of the autoencoder network model, and b is the bias vector matrix of the autoencoder network model.
In the first autoencoder's feature layer, a sparsity constraint is imposed on each neuron to obtain a sparse representation. The average activation of neuron j is:
$$\hat{\rho}_j = \frac{1}{m} \sum_{i=1}^{m} f_\theta\left(x^{(i)}\right)$$
where $f_\theta(x^{(i)})$ is the activation of neuron j when the input is sample $x^{(i)}$. With the average-activation coefficient defined as ρ = 0.05, the relative entropy between the average activation $\hat{\rho}_j$ and ρ is:
$$\mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right) = \rho \log \frac{\rho}{\hat{\rho}_j} + (1 - \rho) \log \frac{1 - \rho}{1 - \hat{\rho}_j}$$
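As a worked illustration of this penalty (with assumed numbers): a hidden neuron whose average activation drifts to $\hat{\rho}_j = 0.2$ against the target ρ = 0.05 contributes
$$\mathrm{KL}(0.05 \,\|\, 0.2) = 0.05 \ln\frac{0.05}{0.2} + 0.95 \ln\frac{0.95}{0.8} \approx -0.069 + 0.163 = 0.094,$$
while a neuron sitting exactly at $\hat{\rho}_j = 0.05$ contributes zero; the penalty grows as the average activation moves away from ρ, which is what drives most hidden activations toward zero.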
The sparse-autoencoder loss function of the first autoencoder is:
$$L_{\mathrm{sparse}}(W, b) = J(W, b) + \beta \sum_{j=1}^{s_2} \mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right)$$
where $s_2 = 400$ is the number of neurons in the first autoencoder's feature layer, j is the neuron index, and β = 0.1 is the sparsity coefficient.
Minimizing this sparse-autoencoder loss function by layer-wise greedy training yields the first autoencoder's feature-layer weight coefficient $W_1$ and feature-layer bias vector matrix $b_1$, together with its output-layer weight coefficient $W_1'$ and output-layer bias vector matrix $b_1'$. A code sketch of this layer follows below.
Taking the deep features of the first autoencoder as input data of the second autoencoder in step 2 and obtaining each layer's weight coefficients and bias vectors by layer-wise greedy training proceeds as follows:
After layer-wise greedy training, the deep features $y^{(i)}$ of the first autoencoder's feature layer serve as input data forming the input layer of the second autoencoder; encoding yields the deep features of the second autoencoder's feature layer:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the weight coefficient of the second autoencoder's feature layer, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
Decoding the deep feature $y'^{(i)}$ yields the reconstruction vector of the second autoencoder's output layer:
$$\hat{y}^{(i)} = s\left(W_2' y'^{(i)} + b_2'\right)$$
where $W_2'$ is the weight coefficient of the second autoencoder's output layer, $b_2'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The sparse-autoencoder loss function of the second autoencoder is constructed as in step 2, with $s_3 = 60$ neurons in the second autoencoder's feature layer. Minimizing it by layer-wise greedy training yields the second autoencoder's feature-layer weight coefficient $W_2$ and feature-layer bias vector matrix $b_2$, together with its output-layer weight coefficient $W_2'$ and output-layer bias vector matrix $b_2'$.
Constructing the stacked sparse autoencoder network with two feature layers from the first and second autoencoders in step 2 proceeds as follows:
After layer-wise greedy training, the input layer of the first autoencoder serves as the input layer of the stacked sparse autoencoder network, the feature layer of the first autoencoder as its first feature layer, the feature layer of the second autoencoder as its second feature layer, and the output layer of the second autoencoder as its output layer, yielding the stacked sparse autoencoder network with two feature layers; a greedy-training sketch follows below.
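Continuing the sketch above, the layer-wise greedy training of step 2 could look as follows with the embodiment's sizes (440 input dimensions, 400 and 60 hidden neurons): each layer is trained to reconstruct its own input, and the first layer's deep features become the second layer's training data. The training loop, optimizer, epoch count, and random placeholder spectra are assumptions for illustration.

```python
import torch

def train_layer(ae, data, epochs=200, lr=1e-3):
    """Greedily train one sparse autoencoder layer on its own input."""
    opt = torch.optim.Adam(ae.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        y, x_hat = ae(data)
        ae.loss(data, y, x_hat).backward()
        opt.step()
    return ae

x_train = torch.rand(328, 440)        # placeholder for the m = 328 preprocessed spectra
ae1 = train_layer(SparseAutoencoder(440, 400), x_train)
with torch.no_grad():
    y1, _ = ae1(x_train)              # deep features of the first feature layer
ae2 = train_layer(SparseAutoencoder(400, 60), y1)
with torch.no_grad():
    y2, _ = ae2(y1)                   # deep features of the second feature layer
```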
Step 3: take the deep features of the second autoencoder of step 2 as input data to train a Softmax classifier, solve the classifier by gradient descent, and use the trained Softmax classifier as a Softmax classification layer replacing the output layer of the stacked sparse autoencoder network with two feature layers;
In step 3 the deep features $y'^{(i)}$ of the second autoencoder of step 2 serve as input data to train the Softmax classification layer. For the k = 4 class problem, a k = 4 dimensional vector holding the four estimated class probabilities must be output; the hypothesis function is:
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
Solving the Softmax classifier by gradient descent in step 3 proceeds as follows:
Softmax regression is performed on the input data $y'^{(i)}$, and a Softmax regression cost-function model is established:
$$J(\theta) = -\frac{1}{m} \left[ \sum_{i=1}^{m} \sum_{j=1}^{k} 1\left\{z^{(i)} = j\right\} \log \frac{e^{\theta_j^{T} y'^{(i)}}}{\sum_{l=1}^{k} e^{\theta_l^{T} y'^{(i)}}} \right] + \frac{\gamma}{2} \sum_{j=1}^{k} \sum_{q=0}^{n} \theta_{jq}^{2}$$
where m = 328 is the number of samples in the training set of step 1, i is the sample index, k = 4 is the number of classes, j is the class index, $z^{(i)}$ is the class label of sample i, $1\{\cdot\}$ is the indicator function, θ is the full parameter set of the model (the subscript indexing a specific neuron of the Softmax classification layer), n is the feature dimension of $y'^{(i)}$, and γ = 3×10⁻³ is the weight-decay coefficient.
Differentiating the cost function gives:
$$\nabla_{\theta_j} J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y'^{(i)} \left( 1\left\{z^{(i)} = j\right\} - p\left(z^{(i)} = j \mid y'^{(i)}; \theta\right) \right) \right] + \gamma \theta_j$$
with the same symbols as above.
Taking the minimization of J(θ) as the optimization objective and solving by gradient descent yields the trained Softmax classifier; a sketch follows below.
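Continuing the sketch, the classifier pre-training of step 3 can be rendered as a linear layer over the 60-dimensional deep features trained with cross-entropy loss, which is the Softmax regression objective above; the optimizer's L2 weight decay plays the role of the γ = 3×10⁻³ term. The label placeholders, learning rate, and iteration count are assumptions.

```python
import torch
import torch.nn as nn

labels = torch.randint(0, 4, (328,))   # placeholder disease-age labels, k = 4 classes
clf = nn.Linear(60, 4)                 # theta: k x (n+1) parameters, bias included
opt = torch.optim.SGD(clf.parameters(), lr=0.1, weight_decay=3e-3)  # gamma as L2 decay
ce = nn.CrossEntropyLoss()             # log-softmax + negative log-likelihood = J(theta)
for _ in range(500):
    opt.zero_grad()
    ce(clf(y2), labels).backward()     # gradient matches the derivative given above
    opt.step()                         # features y2 stay fixed during pre-training
```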
Step 4: construct a preliminary classification network from the first feature layer, second feature layer, and Softmax classification layer of the stacked sparse autoencoder network with two feature layers, and optimize it by back-propagation to obtain the optimized classification network;
The first feature layer of the stacked sparse autoencoder network with two feature layers in step 4 is:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the first feature layer's weight coefficient, $b_1$ is the first feature layer's bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The second feature layer of the stacked sparse autoencoder network with two feature layers is:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the second feature layer's weight coefficient, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function.
The Softmax classification layer of the stacked sparse autoencoder network with two feature layers is, for the k = 4 class problem, the layer outputting the k = 4 dimensional vector of estimated probabilities through the hypothesis function
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
Constructing the preliminary classification network in step 4 means taking the first feature layer of the stacked sparse autoencoder network as the first feature layer of the preliminary classification network, its second feature layer as the second feature layer of the preliminary classification network, and its Softmax classification layer as the Softmax classification layer of the preliminary classification network.
Optimizing the preliminary classification network by back-propagation in step 4 yields the optimized classification network:
the first feature layer of the optimized classification network is:
$$y^{(i)} = s\left(\tilde{W}_1 x^{(i)} + \tilde{b}_1\right)$$
where $\tilde{W}_1$ and $\tilde{b}_1$ are the optimized first feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the second feature layer of the optimized classification network is:
$$y'^{(i)} = s\left(\tilde{W}_2 y^{(i)} + \tilde{b}_2\right)$$
where $\tilde{W}_2$ and $\tilde{b}_2$ are the optimized second feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the Softmax classification layer of the optimized classification network is:
$$h_{\tilde{\theta}}\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\tilde{\theta}_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\tilde{\theta}_1^{T} y'^{(i)}} \\ \vdots \\ e^{\tilde{\theta}_k^{T} y'^{(i)}} \end{bmatrix}$$
where $\tilde{\theta}$, the full parameter set of the optimized model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$. A fine-tuning sketch follows below.
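The fine-tuning of step 4, continuing the sketch, amounts to stacking the two pre-trained encoders and the Softmax layer into one network and back-propagating through all parameters jointly; the module name, optimizer, and iteration count are assumptions.

```python
import torch
import torch.nn as nn

class StackedClassifier(nn.Module):
    """Preliminary classification network: two feature layers plus the Softmax layer."""

    def __init__(self, ae1, ae2, clf):
        super().__init__()
        self.enc1, self.enc2, self.clf = ae1.encoder, ae2.encoder, clf

    def forward(self, x):
        y = torch.sigmoid(self.enc1(x))    # first feature layer
        y2 = torch.sigmoid(self.enc2(y))   # second feature layer
        return self.clf(y2)                # Softmax classification layer (logits)

net = StackedClassifier(ae1, ae2, clf)
opt = torch.optim.Adam(net.parameters(), lr=1e-4)
ce = nn.CrossEntropyLoss()
for _ in range(200):                       # back-propagation now updates W1, b1, W2, b2 and theta
    opt.zero_grad()
    ce(net(x_train), labels).backward()
    opt.step()
```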
Step 5: take the test set as input data of the optimized classification network, construct the neuron-count-optimized first feature layer and the neuron-count-optimized second feature layer, and obtain the neuron-count-optimized classification network from these two layers together with the Softmax classification layer of the optimized classification network;
Constructing the neuron-count-optimized first feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the optimized classification network to obtain classification results; the neuron count of the first feature layer of the optimized classification network is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the first feature layer of the optimized classification network, yielding the neuron-count-optimized first feature layer.
Constructing the neuron-count-optimized second feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the classification network whose first-feature-layer neuron count has been optimized; the neuron count of the second feature layer is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the second feature layer, yielding the neuron-count-optimized second feature layer. A search sketch follows below.
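The neuron-count optimization of step 5 is, in effect, a one-dimensional search per feature layer: retrain with a candidate neuron count, score the predicted disease ages against the test set's actual disease ages, and keep the count with the smallest error. The candidate grids, placeholder test data, and the use of accuracy as the comparison metric below are assumptions; the patent specifies only the traverse-and-compare procedure.

```python
import torch

x_test = torch.rand(40, 440)            # placeholder for the K = 40 test spectra
z_test = torch.randint(0, 4, (40,))     # placeholder actual disease ages

def accuracy(net, x, z):
    """Fraction of samples whose predicted disease age matches the actual one."""
    with torch.no_grad():
        return (net(x).argmax(dim=1) == z).float().mean().item()

def build_and_train(n1, n2):
    """Retrain the whole pipeline with n1 / n2 neurons in the two feature layers."""
    a1 = train_layer(SparseAutoencoder(440, n1), x_train)
    with torch.no_grad():
        f1, _ = a1(x_train)
    a2 = train_layer(SparseAutoencoder(n1, n2), f1)
    net = StackedClassifier(a1, a2, torch.nn.Linear(n2, 4))
    opt = torch.optim.Adam(net.parameters(), lr=1e-4)
    ce = torch.nn.CrossEntropyLoss()
    for _ in range(200):
        opt.zero_grad()
        ce(net(x_train), labels).backward()
        opt.step()
    return net

# Traverse the first feature layer's neuron count with everything else fixed,
# then the second layer's count with the chosen first-layer count fixed.
best_n1 = max([200, 300, 400, 500], key=lambda n: accuracy(build_and_train(n, 60), x_test, z_test))
best_n2 = max([30, 60, 90, 120], key=lambda n: accuracy(build_and_train(best_n1, n), x_test, z_test))
```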
It should be understood that the above description of the preferred embodiments is relatively detailed and therefore should not be regarded as limiting the scope of patent protection of the present invention. Those of ordinary skill in the art may, under the inspiration of the present invention and without departing from the scope protected by the claims, make substitutions or variations, all of which fall within the protection scope of the present invention; the claimed scope of the present invention shall be determined by the appended claims.

Claims (6)

1. A Raman spectrum data classification method based on an autoencoder network, characterized by comprising the following steps:
Step 1: extracting Raman spectra of Alzheimer's disease platelets at different disease ages with an optical-tweezers Raman system, applying background subtraction, baseline correction, smoothing, and averaging to the spectra, and randomly dividing the processed spectra into a training set and a test set;
Step 2: taking the training-set samples as input data of the first autoencoder and obtaining each layer's weight coefficients and bias vectors by layer-wise greedy training; taking the deep features of the first autoencoder as input data of the second autoencoder and obtaining each layer's weight coefficients and bias vectors by layer-wise greedy training; constructing the stacked sparse autoencoder network with two feature layers from the first and second autoencoders;
Step 3: taking the deep features of the second autoencoder of step 2 as input data to train a Softmax classifier, solving the classifier by gradient descent, and using the trained Softmax classifier as a Softmax classification layer replacing the output layer of the stacked sparse autoencoder network with two feature layers;
Step 4: constructing a preliminary classification network from the first feature layer, second feature layer, and Softmax classification layer of the stacked sparse autoencoder network, and optimizing it by back-propagation to obtain the optimized classification network;
Step 5: taking the test set as input data of the optimized classification network, constructing the neuron-count-optimized first feature layer and the neuron-count-optimized second feature layer, and obtaining the neuron-count-optimized classification network from these two layers together with the Softmax classification layer of the optimized classification network.
2. The Raman spectrum data classification method based on an autoencoder network according to claim 1, characterized in that the training set of step 1 contains m samples and the test set contains K samples.
3. The Raman spectrum data classification method based on an autoencoder network according to claim 1, characterized in that taking the training-set samples as input data of the first autoencoder in step 2 means forming the first input layer from the training-set samples $x^{(i)}$ of step 1, where $i \in [1, m]$ and $x^{(i)}$ is a vector of feature dimension $s_1$;
obtaining each layer's weight coefficients and bias vectors of the first autoencoder by layer-wise greedy training in step 2 proceeds as follows:
encoding $x^{(i)}$ yields the deep feature $y^{(i)}$ of the first autoencoder's feature layer:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the weight coefficient of the first autoencoder's feature layer, $b_1$ is its bias vector matrix, and $s(\cdot)$ is the activation function, here the sigmoid function;
decoding the deep feature $y^{(i)}$ yields the reconstruction vector of the first autoencoder's output layer:
$$\hat{x}^{(i)} = s\left(W_1' y^{(i)} + b_1'\right)$$
where $W_1'$ is the weight coefficient of the first autoencoder's output layer, $b_1'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the cross-entropy cost model of the first autoencoder is constructed as:
$$J(W, b) = -\frac{1}{m} \sum_{i=1}^{m} \left[ x^{(i)} \log \hat{x}^{(i)} + \left(1 - x^{(i)}\right) \log\left(1 - \hat{x}^{(i)}\right) \right]$$
where m is the number of samples in the training set of step 1, W is the weight coefficient of the autoencoder network model, and b is the bias vector matrix of the autoencoder network model;
in the first autoencoder's feature layer, a sparsity constraint is imposed on each neuron to obtain a sparse representation; the average activation of neuron j is:
$$\hat{\rho}_j = \frac{1}{m} \sum_{i=1}^{m} f_\theta\left(x^{(i)}\right)$$
where $f_\theta(x^{(i)})$ is the activation of neuron j when the input is sample $x^{(i)}$; with the average-activation coefficient defined as ρ, the relative entropy between the average activation $\hat{\rho}_j$ and ρ is:
$$\mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right) = \rho \log \frac{\rho}{\hat{\rho}_j} + (1 - \rho) \log \frac{1 - \rho}{1 - \hat{\rho}_j}$$
the sparse-autoencoder loss function of the first autoencoder is:
$$L_{\mathrm{sparse}}(W, b) = J(W, b) + \beta \sum_{j=1}^{s_2} \mathrm{KL}\left(\rho \,\middle\|\, \hat{\rho}_j\right)$$
where $s_2$ is the number of neurons in the first autoencoder's feature layer, j is the neuron index, and β is the sparsity coefficient;
minimizing this sparse-autoencoder loss function by layer-wise greedy training yields the first autoencoder's feature-layer weight coefficient $W_1$ and feature-layer bias vector matrix $b_1$, together with its output-layer weight coefficient $W_1'$ and output-layer bias vector matrix $b_1'$;
taking the deep features of the first autoencoder as input data of the second autoencoder in step 2 and obtaining each layer's weight coefficients and bias vectors by layer-wise greedy training proceeds as follows:
after layer-wise greedy training, the deep features $y^{(i)}$ of the first autoencoder's feature layer serve as input data forming the input layer of the second autoencoder; encoding yields the deep features of the second autoencoder's feature layer:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the weight coefficient of the second autoencoder's feature layer, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
decoding the deep feature $y'^{(i)}$ yields the reconstruction vector of the second autoencoder's output layer:
$$\hat{y}^{(i)} = s\left(W_2' y'^{(i)} + b_2'\right)$$
where $W_2'$ is the weight coefficient of the second autoencoder's output layer, $b_2'$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the sparse-autoencoder loss function of the second autoencoder is constructed as in step 2, with $s_3$ the number of neurons in the second autoencoder's feature layer; minimizing it by layer-wise greedy training yields the second autoencoder's feature-layer weight coefficient $W_2$ and feature-layer bias vector matrix $b_2$, together with its output-layer weight coefficient $W_2'$ and output-layer bias vector matrix $b_2'$;
constructing the stacked sparse autoencoder network with two feature layers from the first and second autoencoders in step 2 proceeds as follows:
after layer-wise greedy training, the input layer of the first autoencoder serves as the input layer of the stacked sparse autoencoder network, the feature layer of the first autoencoder as its first feature layer, the feature layer of the second autoencoder as its second feature layer, and the output layer of the second autoencoder as its output layer, yielding the stacked sparse autoencoder network with two feature layers.
4. The Raman spectrum data classification method based on an autoencoder network according to claim 1, characterized in that in step 3 the deep features $y'^{(i)}$ of the second autoencoder of step 2 serve as input data to train the Softmax classification layer; for a k-class problem, a k-dimensional vector holding the k estimated class probabilities must be output, with hypothesis function:
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$;
solving the Softmax classifier by gradient descent in step 3 proceeds as follows:
Softmax regression is performed on the input data $y'^{(i)}$, and a Softmax regression cost-function model is established:
$$J(\theta) = -\frac{1}{m} \left[ \sum_{i=1}^{m} \sum_{j=1}^{k} 1\left\{z^{(i)} = j\right\} \log \frac{e^{\theta_j^{T} y'^{(i)}}}{\sum_{l=1}^{k} e^{\theta_l^{T} y'^{(i)}}} \right] + \frac{\gamma}{2} \sum_{j=1}^{k} \sum_{q=0}^{n} \theta_{jq}^{2}$$
where m is the number of samples in the training set of step 1, i is the sample index, k is the number of classes, j is the class index, $z^{(i)}$ is the class label of sample i, $1\{\cdot\}$ is the indicator function, θ is the full parameter set of the model (the subscript indexing a specific neuron of the Softmax classification layer), n is the feature dimension of $y'^{(i)}$, and γ is the weight-decay coefficient;
differentiating the cost function gives:
$$\nabla_{\theta_j} J(\theta) = -\frac{1}{m} \sum_{i=1}^{m} \left[ y'^{(i)} \left( 1\left\{z^{(i)} = j\right\} - p\left(z^{(i)} = j \mid y'^{(i)}; \theta\right) \right) \right] + \gamma \theta_j$$
with the same symbols as above;
taking the minimization of J(θ) as the optimization objective and solving by gradient descent yields the trained Softmax classifier.
5. The Raman spectrum data classification method based on an autoencoder network according to claim 1, characterized in that the first feature layer of the stacked sparse autoencoder network with two feature layers in step 4 is:
$$y^{(i)} = s\left(W_1 x^{(i)} + b_1\right)$$
where $W_1$ is the first feature layer's weight coefficient, $b_1$ is the first feature layer's bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the second feature layer of the stacked sparse autoencoder network with two feature layers is:
$$y'^{(i)} = s\left(W_2 y^{(i)} + b_2\right)$$
where $W_2$ is the second feature layer's weight coefficient, $b_2$ is its bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the Softmax classification layer of the stacked sparse autoencoder network with two feature layers is, for a k-class problem, the layer outputting the k-dimensional vector of the k estimated probabilities through the hypothesis function
$$h_\theta\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\theta_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\theta_1^{T} y'^{(i)}} \\ \vdots \\ e^{\theta_k^{T} y'^{(i)}} \end{bmatrix}$$
where θ, the full parameter set of the model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$;
constructing the preliminary classification network in step 4 means taking the first feature layer of the stacked sparse autoencoder network as the first feature layer of the preliminary classification network, its second feature layer as the second feature layer of the preliminary classification network, and its Softmax classification layer as the Softmax classification layer of the preliminary classification network;
optimizing the preliminary classification network by back-propagation in step 4 yields the optimized classification network:
the first feature layer of the optimized classification network is:
$$y^{(i)} = s\left(\tilde{W}_1 x^{(i)} + \tilde{b}_1\right)$$
where $\tilde{W}_1$ and $\tilde{b}_1$ are the optimized first feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the second feature layer of the optimized classification network is:
$$y'^{(i)} = s\left(\tilde{W}_2 y^{(i)} + \tilde{b}_2\right)$$
where $\tilde{W}_2$ and $\tilde{b}_2$ are the optimized second feature layer's weight coefficient and bias vector matrix, and $s(\cdot)$ is the sigmoid activation function;
the Softmax classification layer of the optimized classification network is:
$$h_{\tilde{\theta}}\left(y'^{(i)}\right) = \frac{1}{\sum_{j=1}^{k} e^{\tilde{\theta}_j^{T} y'^{(i)}}} \begin{bmatrix} e^{\tilde{\theta}_1^{T} y'^{(i)}} \\ \vdots \\ e^{\tilde{\theta}_k^{T} y'^{(i)}} \end{bmatrix}$$
where $\tilde{\theta}$, the full parameter set of the optimized model, is a k × (n+1) matrix, the subscript indexes a specific neuron, the superscript T denotes matrix transposition, and n is the feature dimension of $y'^{(i)}$.
6. The Raman spectrum data classification method based on an autoencoder network according to claim 1, characterized in that constructing the neuron-count-optimized first feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the optimized classification network to obtain classification results; the neuron count of the first feature layer of the optimized classification network is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the first feature layer of the optimized classification network, yielding the neuron-count-optimized first feature layer;
constructing the neuron-count-optimized second feature layer in step 5 proceeds as follows:
the test set of step 1 is input into the classification network whose first-feature-layer neuron count has been optimized; the neuron count of the second feature layer is traversed, i.e. the number of neurons in that layer is adjusted while the other parameters are held fixed; the disease-age classification results are compared with the actual disease ages of the test set of step 1; and the neuron count with the smallest error against the actual disease ages is taken as the neuron count of the second feature layer, yielding the neuron-count-optimized second feature layer.
CN201810857856.9A 2018-07-31 2018-07-31 Raman spectrum data classification method based on self-coding network Active CN108921233B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810857856.9A CN108921233B (en) 2018-07-31 2018-07-31 Raman spectrum data classification method based on self-coding network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810857856.9A CN108921233B (en) 2018-07-31 2018-07-31 Raman spectrum data classification method based on self-coding network

Publications (2)

Publication Number Publication Date
CN108921233A true CN108921233A (en) 2018-11-30
CN108921233B CN108921233B (en) 2021-06-04

Family

ID=64393277

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810857856.9A Active CN108921233B (en) 2018-07-31 2018-07-31 Raman spectrum data classification method based on self-coding network

Country Status (1)

Country Link
CN (1) CN108921233B (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109816002A (en) * 2019-01-11 2019-05-28 广东工业大学 The single sparse self-encoding encoder detection method of small target migrated certainly based on feature
CN110561192A (en) * 2019-09-11 2019-12-13 大连理工大学 Deep hole boring cutter state monitoring method based on stacking self-encoder
CN110895565A (en) * 2019-11-29 2020-03-20 国网湖南省电力有限公司 Method and system for classifying fault defect texts of power equipment
CN110974213A (en) * 2019-12-20 2020-04-10 哈尔滨理工大学 Electrocardiosignal identification method based on deep stack network
CN111325233A (en) * 2019-10-29 2020-06-23 国网辽宁省电力有限公司电力科学研究院 Transformer fault detection method and device
CN111832993A (en) * 2020-07-13 2020-10-27 深圳市今天国际物流技术股份有限公司 Predictive maintenance method for warehouse logistics system and related components
CN112348114A (en) * 2020-11-27 2021-02-09 上海交通大学烟台信息技术研究院 Self-detection classification method and system for Raman spectrum gas markers
CN114512236A (en) * 2022-04-18 2022-05-17 山东师范大学 Intelligent auxiliary diagnosis system for Alzheimer's disease

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346617A (en) * 2014-04-18 2015-02-11 南京信息工程大学 Cell detection method based on sliding window and depth structure extraction features
CN106503654A (en) * 2016-10-24 2017-03-15 中国地质大学(武汉) A kind of face emotion identification method based on the sparse autoencoder network of depth
US20170235996A1 (en) * 2015-07-28 2017-08-17 Chiman KWAN Method and system for collaborative multi-satellite remote sensing

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104346617A (en) * 2014-04-18 2015-02-11 南京信息工程大学 Cell detection method based on sliding window and depth structure extraction features
US20170235996A1 (en) * 2015-07-28 2017-08-17 Chiman KWAN Method and system for collaborative multi-satellite remote sensing
CN106503654A (en) * 2016-10-24 2017-03-15 中国地质大学(武汉) A kind of face emotion identification method based on the sparse autoencoder network of depth

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Pushkar B. et al.: "Early Diagnosis of Alzheimer's Disease: A Multi-class Deep Learning Framework with Modified k-sparse Autoencoder Classification", IEEE Image and Vision Computing New Zealand *
Sun Xilong: "Research on preprocessing and recognition algorithms for plasmon-enhanced Raman spectroscopy", China Master's Theses Full-text Database, Information Science and Technology Series *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109816002A (en) * 2019-01-11 2019-05-28 广东工业大学 The single sparse self-encoding encoder detection method of small target migrated certainly based on feature
CN109816002B (en) * 2019-01-11 2022-09-06 广东工业大学 Single sparse self-encoder weak and small target detection method based on feature self-migration
CN110561192A (en) * 2019-09-11 2019-12-13 大连理工大学 Deep hole boring cutter state monitoring method based on stacking self-encoder
CN111325233A (en) * 2019-10-29 2020-06-23 国网辽宁省电力有限公司电力科学研究院 Transformer fault detection method and device
CN111325233B (en) * 2019-10-29 2024-02-09 国网辽宁省电力有限公司电力科学研究院 Transformer fault detection method and device
CN110895565A (en) * 2019-11-29 2020-03-20 国网湖南省电力有限公司 Method and system for classifying fault defect texts of power equipment
CN110974213A (en) * 2019-12-20 2020-04-10 哈尔滨理工大学 Electrocardiosignal identification method based on deep stack network
CN111832993A (en) * 2020-07-13 2020-10-27 深圳市今天国际物流技术股份有限公司 Predictive maintenance method for warehouse logistics system and related components
CN112348114A (en) * 2020-11-27 2021-02-09 上海交通大学烟台信息技术研究院 Self-detection classification method and system for Raman spectrum gas markers
CN112348114B (en) * 2020-11-27 2023-05-30 上海交通大学烟台信息技术研究院 Self-detection classification method and system for Raman spectrum gas marker
CN114512236A (en) * 2022-04-18 2022-05-17 山东师范大学 Intelligent auxiliary diagnosis system for Alzheimer's disease

Also Published As

Publication number Publication date
CN108921233B (en) 2021-06-04

Similar Documents

Publication Publication Date Title
CN108921233A (en) A kind of Raman spectrum data classification method based on autoencoder network
CN106683081B (en) Brain glioma molecular marker nondestructive prediction method and system based on image omics
Ahmadi et al. QAIS-DSNN: tumor area segmentation of MRI image with optimized quantum matched-filter technique and deep spiking neural network
CN109528197B (en) Individual prediction method and system for mental diseases based on brain function map
CN109165667A (en) Based on the cerebral disease categorizing system from attention mechanism
Lu et al. Classification of Camellia (Theaceae) species using leaf architecture variations and pattern recognition techniques
CN107368671A (en) System and method are supported in benign gastritis pathological diagnosis based on big data deep learning
CN108511056A (en) Therapeutic scheme based on patients with cerebral apoplexy similarity analysis recommends method and system
Qu et al. An experimental study of data heterogeneity in federated learning methods for medical imaging
Ouchtati et al. Novel method for brain tumor classification based on use of image entropy and seven Hu’s invariant moments
Koziarski et al. DiagSet: a dataset for prostate cancer histopathological image classification
CN108647707A (en) Probabilistic neural network creation method, method for diagnosing faults and device, storage medium
CN116664931A (en) Knee osteoarthritis grading method based on quantum-to-classical migration learning
Abd Elaziz et al. An efficient artificial rabbits optimization based on mutation strategy for skin cancer prediction
Liu et al. Multi-LSTM networks for accurate classification of attention deficit hyperactivity disorder from resting-state fMRI data
Harun et al. Color image enhancement of acute leukemia cells in blood microscopic image for leukemia detection sample
CN116259415A (en) Patient medicine taking compliance prediction method based on machine learning
Yerukalareddy et al. Brain tumor classification based on mr images using GAN as a pre-trained model
Mamun et al. Lcdctcnn: Lung cancer diagnosis of ct scan images using cnn based model
Peng et al. Blood vessels segmentation by using cdnet
Özdem et al. A GA-based CNN model for brain tumor classification
Wei et al. Genetic U-Net: automatically designing lightweight U-shaped CNN architectures using the genetic algorithm for retinal vessel segmentation
Liu et al. A Cross-Lesion Attention Network for Accurate Diabetic Retinopathy Grading With Fundus Images
Wan et al. Depth-wise convolution with attention neural network (DWA) for pneumonia detection
Meswal et al. A weighted ensemble transfer learning approach for melanoma classification from skin lesion images

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant