CN113313059A

CN113313059A - One-dimensional spectrum classification method and system

Info

Publication number: CN113313059A
Application number: CN202110666072.XA
Authority: CN
Inventors: 王书涛; 刘诗瑜; 孔德明
Original assignee: Yanshan University
Current assignee: Yanshan University
Priority date: 2021-06-16
Filing date: 2021-06-16
Publication date: 2021-08-27
Anticipated expiration: 2041-06-16
Also published as: CN113313059B

Abstract

The invention relates to a one-dimensional spectrum classification method and a system, wherein the method comprises the following steps: acquiring near infrared spectrum original data of m samples to be detected; intercepting peak data of all single spectrum sequences in the near infrared spectrum original data, preprocessing the peak data, and then zooming to obtain one-dimensional spectrum signal data; carrying out polar coordinate coding on the zoomed one-dimensional spectral signal data; reconstructing the one-dimensional spectral signal data after polar coordinate coding to obtain a gram angle and field matrix and a gram angle difference field matrix; storing a gram angle and field matrix and the gram angle difference field matrix as an image; dividing the image to obtain a training set and a test set; training the convolutional neural network by adopting a training set; and inputting the test set into the trained convolutional neural network to obtain a classification result. The method can make up the defect of losing useful characteristic information through the double mapping relation between the one-dimensional spectrum and the two-dimensional image.

Description

One-dimensional spectrum classification method and system

Technical Field

The invention relates to the field of one-dimensional spectrum data processing, in particular to a one-dimensional spectrum classification method and system.

Background

The near infrared spectrum technology is to fully utilize the characteristic absorption of frequency doubling and frequency combination of the vibration of hydrogen-containing groups X-H (such as O-H, C-H and N-H) in a near infrared spectrum interval in a measured substance so as to realize the detection of the measured substance. The technology has the advantages of wide applicable sample range, small sample consumption, no damage and easy operation. The method has the advantages of high analysis speed, low detection cost and the like because the single-dimensional absorption spectrum signals are collected. Through the intensive research of people for many years and the continuous improvement of detection technology, the near infrared spectrum comprehensively utilizes a plurality of subjects such as mathematical statistics, chemometrics, machine learning and the like, and is practically applied in the fields of medicine, food and agriculture, environmental pollution, petroleum, chemical industry and the like, and the performance of the constructed classification recognition model is continuously improved.

The current commonly used one-dimensional spectral sequence classification methods include nearest neighbor, logistic regression, naive Bayes, decision trees, support vector machines, neural networks and the like. The machine learning methods can theoretically realize the classification task of a one-dimensional spectrum sequence, but because the near infrared spectrum has the defects of wide spectrum range, high data dimension, weak useful information intensity, more noise interference and the like, the traditional methods have to combine a large amount of spectrum preprocessing, characteristic wavelength extraction, dimensionality reduction and other preprocessing, so that the task amount is increased invisibly, the useful characteristic information is inevitably missed, and the accuracy of the classification result and the applicability of the model cannot be ensured.

At present, with the rapid development of deep learning, researchers begin to focus on the design of deep neural network structures. A deep Convolutional Neural Network (CNN) represents deep learning, and exhibits excellent performance in the fields of speech recognition, computer vision, natural language processing, and the like. In the field of one-dimensional spectra, relevant research for constructing a suitable one-dimensional CNN exists, but an established model is complex, low in precision and poor in robustness. However, to apply the advantage of CNN two-dimensional image recognition to the classification of one-dimensional near infrared spectrum sequences, the following problems need to be solved: firstly, converting one-dimensional spectrum sequence data into a two-dimensional image capable of highlighting the spectral characteristics of the one-dimensional spectrum sequence data through mathematical transformation; and secondly, a proper convolutional neural network structure is designed to realize automatic classification of the special spectrum image.

Disclosure of Invention

The invention aims to provide a one-dimensional spectrum classification method and a one-dimensional spectrum classification system, which can make up the defect of losing useful characteristic information through a double mapping relation between a one-dimensional spectrum and a two-dimensional image.

In order to achieve the purpose, the invention provides the following scheme:

a method of one-dimensional spectral classification, the method comprising:

acquiring near infrared spectrum original data of m samples to be detected, and recording the data as X ═ X₁,x₂,…,x_i,…,x_nN is the total number of wavelength points, x_iIs a scalar;

specifically, the collected near infrared spectrum data includes two categories, which are gasoline and diesel data, respectively. Gasoline corresponds to class 0 and diesel corresponds to class 1, and each group has 60 groups, namely, the total number m of samples is 120. The gasoline spectrum sampling range is 900 nm-1700 nm, and the sampling interval is 2 nm. The diesel spectrum sampling range is 750 nm-1550 nm, and the sampling interval is 2nm, that is, the number of wavelength points n is 401. The data sample matrix is denoted as X (120 × 401), and the output class matrix is Y (120 × 1). For subsequent two-dimensional CNN modeling, the class labels are converted into a one-hot coded form.

Intercepting peak data of all single spectrum sequences in the near infrared spectrum original data;

the length of the intercepted peak data is t, and the intercepted data is recorded as X_i＝{x₁,…,x_i,…,x_tAnd t is the number of the current wavelength points.

Preprocessing the peak data;

zooming the preprocessed peak data to obtain one-dimensional spectral signal data X_f；

Performing polar coordinate encoding on the scaled one-dimensional spectral signal data;

reconstructing the one-dimensional spectral signal data after the polar coordinate coding to obtain a gram angle and field matrix and a gram angle difference field matrix;

storing the gram angle sum field matrix and the gram angle difference field matrix as an image;

dividing the image to obtain a training set and a test set;

training a convolutional neural network by adopting the training set;

and inputting the test set into a trained convolutional neural network to obtain a classification result.

Optionally, after the image is divided to obtain a training set and a test set, the method further includes:

and performing data enhancement processing on the training set, wherein the data enhancement processing comprises rotation, scaling, translation, turning, shearing transformation and channel shifting.

Optionally, the scaling the preprocessed peak data specifically includes:

scaling the preprocessed peak data to [0,1], the formula is as follows:

wherein the content of the first and second substances,

is the normalized effective spectral absorbance value, x, corresponding to a wavelength i_iIs the original spectral absorbance value, X, corresponding to a wavelength i_iIs the sequence of the spectral absorbances in the peak wavelength range.

Optionally, the polar coordinate encoding of the scaled one-dimensional spectral signal data specifically adopts the following formula:

wherein the content of the first and second substances,

the normalized effective spectral absorbance value corresponding to the wavelength i, r is the radius, i ═ 1,2, … …, t is the current number of wavelength points, X_fIs a local one-dimensional spectrum absorbance sequence after being scaled in a peak wavelength range。

As can be seen from the formula, the converted angle phi has a value range of [0, pi ]]The other chord values are monotonically decreased within the range, and as the wavelength is increased, x in each Cartesian coordinate system_iUnder the mapping relation, corresponding bending occurs between different angle points on a polar coordinate circle only corresponding to a unique value in a polar coordinate system.

Optionally, the expression of the gram angle and the field matrix is as follows:

wherein I is a unit vector, X_fIs a local one-dimensional spectral absorbance sequence phi after being scaled in the peak wavelength range_iAnd phi_jRespectively the angle of point i and point j,

and

respectively, the normalized effective spectral absorbance values corresponding to the wavelengths i/j.

Optionally, an expression of the gram angle difference field matrix is as follows:

and

Optionally, the trained convolutional neural network includes: the device comprises an input layer, a first coiling layer, a second coiling layer, a first pooling layer, a second pooling layer, a first full-connection layer and a second full-connection layer, wherein the input layer, the first coiling layer, the second coiling layer, the first pooling layer, the second pooling layer, the first full-connection layer and the second full-connection layer are sequentially connected.

Optionally, the trained convolutional neural network further includes: a normalization layer and a regularization layer, the normalization layer and the regularization layer being located intermediate the second volume base layer and the first pooling layer.

Optionally, the loss function in the convolutional neural network is a cross entropy loss function, and the specific formula is as follows:

where a represents the actual output of the neuron and y represents the desired output.

The present invention additionally provides a one-dimensional spectral classification system, the system comprising:

the original data acquisition module is used for acquiring near infrared spectrum original data of the m samples to be detected, and the near infrared spectrum original data is recorded as X ═ X₁,x₂,…,x_i,…,x_nN is the total number of wavelength points, x_iIs a scalar;

the intercepting module is used for intercepting peak data of all single spectrum sequences in the near infrared spectrum original data;

the preprocessing module is used for preprocessing the peak data;

a scaling module for scaling the preprocessed peak data to obtain one-dimensional spectral signal data

The polar coordinate coding module is used for carrying out polar coordinate coding on the zoomed one-dimensional spectral signal data;

the reconstruction module is used for reconstructing the one-dimensional spectral signal data after the polar coordinate coding to obtain a gram angle and field matrix and a gram angle difference field matrix;

a format conversion module to store the gram angle and field matrix and the gram angle difference field matrix as an image;

the image dividing module is used for dividing the image to obtain a training set and a test set;

the training module is used for training the convolutional neural network by adopting the training set;

and the classification result determining module is used for inputting the test set into the trained convolutional neural network to obtain a classification result.

According to the specific embodiment provided by the invention, the invention discloses the following technical effects:

the method does not need a large amount of complex operations such as manual feature extraction, dimension reduction and the like in the traditional method, has strong interpretability of features, and can realize automatic extraction, learning and classification of spectral signals; the method realizes the bidirectional mapping from the one-dimensional spectrum signal to the two-dimensional image through the gram angular field, and does not lose any characteristics of the original one-dimensional spectrum; the Graham matrix established through mathematical operation does not destroy the dependence of spectral data on wavelength characteristics, and is favorable for improving the classification performance of the model; according to the invention, the one-dimensional spectral data is subjected to two-dimensional imaging, so that the key technical problem that a more mature two-dimensional image processing method cannot be directly utilized to perform high-performance classification and identification on the one-dimensional spectral sequence data is solved, and a new thought is provided for spectral data processing research; the one-dimensional spectrum classification and identification method based on the Graham angular field imaging and the CNN is suitable for any one-dimensional spectrum data, has strong applicability and expandability, and is beneficial to realizing the development of an industrial Internet system unit which is based on one-dimensional spectrum signals, has high accuracy, is simple to operate and can quickly detect in practical application.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings needed to be used in the embodiments will be briefly described below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without inventive exercise.

FIG. 1 is a flow chart of a one-dimensional spectral classification method according to an embodiment of the present invention;

FIG. 2 is a graph of an original near infrared spectrum of an embodiment of the present invention;

FIG. 3 is a schematic diagram of a partial near infrared spectrogram and a transformation to a polar coordinate system according to an embodiment of the present invention;

FIG. 4 is a two-dimensional gasoline image of an embodiment of the present invention;

FIG. 5 is a two-dimensional diesel image of an embodiment of the present invention;

FIG. 6 is a schematic diagram showing the first 21 images of the training set after data enhancement according to the embodiment of the present invention;

FIG. 7 is a schematic diagram of a two-dimensional convolutional neural network structure according to an embodiment of the present invention;

fig. 8 is a schematic diagram of a CNN training process according to an embodiment of the present invention;

FIG. 9 is a diagram illustrating predicted results and actual results according to an embodiment of the present invention;

fig. 10 is a schematic structural diagram of a one-dimensional spectrum classification system according to an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below.

Fig. 1 is a flowchart of a one-dimensional spectrum classification method according to an embodiment of the present invention, as shown in fig. 1, the method includes:

step 101: acquiring near infrared spectrum original data of m samples to be detected, and recording a spectrum sequence of each sample in given near infrared wavelength as X ═ X₁,x₂,…,x_i,…,x_nN is the total number of wavelength points, x_iIs a scalar quantity.

In this embodiment, the near infrared spectral data collected includes two categories, gasoline and diesel data, respectively. Gasoline corresponds to class 0 and diesel corresponds to class 1, and each group has 60 groups, namely, the total number m of samples is 120. The gasoline spectrum sampling range is 900 nm-1700 nm, and the sampling interval is 2 nm. The diesel spectrum sampling range is 750 nm-1550 nm, and the sampling interval is 2nm, that is, the number of wavelength points n is 401. The data sample matrix is denoted as X (120 × 401), and the output class matrix is Y (120 × 1). For subsequent two-dimensional CNN modeling, the class labels are converted into a one-hot coded form. The near infrared spectrum of gasoline is shown in part (a) of FIG. 2, and the near infrared spectrum of diesel is shown in part (b) of FIG. 2. The spectra of both oils have two distinct peaks and are also substantially similar in shape.

Step 102: respectively intercepting peak data of all single spectrum sequences in the near infrared spectrum original data, wherein the length of an intercepting interval is t, and the intercepted peak data is recorded as X_i＝{x₁,…,x_i,…,x_tAnd t is the number of the current wavelength points.

Step 103: preprocessing the peak data.

Specifically, the peak data is normalized.

Step 104: scaling the preprocessed peak data to [0,1]]In the interior, new one-dimensional spectral signal data is obtained and recorded as X_f。

The specific formula is as follows:

wherein the content of the first and second substances,

Step 105: the scaled new one-dimensional spectral signal data X_fPolar coordinate coding is carried out, namely, the numerical value is regarded as an angle value, the wavelength step length i is regarded as a radius, and the specific formula is as follows:

the normalized effective spectral absorbance value corresponding to the wavelength i, r is the radius, i ═ 1,2, … …, t is the current number of wavelength points, X_fIs a local one-dimensional spectral absorbance sequence after scaling in the peak wavelength range.

As can be seen from the above formula, the converted angle phi has a value range of [0, pi ]]The other chord values are monotonically decreased within the range, and as the wavelength is increased, x in each Cartesian coordinate system_iUnder the mapping relation, corresponding bending occurs between different angle points on a polar coordinate circle only corresponding to a unique value in a polar coordinate system.

In this embodiment, the truncated local spectral sequence

The spectral data corresponding to 100 wavelength points near the first peak is normalized, and then converted to a polar coordinate system by taking the numerical value as an angle value and the wavelength step length i as a radius according to a formula (2). Localized near infrared of gasoline and dieselThe spectrogram and the connecting lines of points transformed into a polar coordinate system are shown in FIG. 3, wherein part (a) in FIG. 3 is gasoline and part (b) is diesel.

Step 106: and reconstructing the one-dimensional spectral signal data after the polar coordinate coding to obtain a gram angle and field matrix and a gram angle difference field matrix.

Specifically, cosine values of angle sums and sine values of angle differences between different points are calculated, and one-dimensional data are reconstructed into a gram angle sum field matrix and a gram angle difference field matrix.

The gram angle and field calculation formula is defined as follows:

in the formula, theta₁And theta₂Representing the angles of x and y in a polar coordinate system, respectively, the gram angle and the field matrix can be obtained:

trigonometric function transformation of equation (4) can result in:

and

Similarly, the calculation formula of the available gram angle difference field matrix is shown in (6).

and

It can be seen that the main diagonal in the matrix is formed by the original spectrum sequence, and the elements in the matrix are continuously moved from left to right and from top to bottom through the phi value, so that the wavelength dependence of the spectrum signal can be maintained.

Step 107: storing the gram angle sum field matrix and the gram angle difference field matrix as an image.

In this embodiment, the gray angle and field and gray angle difference field matrices are calculated according to equations (5) and (6), all samples are converted into corresponding two-dimensional images, the white edges of the coordinate axis display and the images are removed, and the pixel size of the drawn image is 218 × 218. The gasoline sample No. 1 and the diesel sample No. 61 were selected and subjected to generation of pictures, as shown in fig. 4 and fig. 5, in which (a) part is a gram angle and field image, and (b) part is a gram angle difference field image. It can be seen that the characteristic difference between gasoline and diesel oil is amplified and the discrimination is obviously improved by two-dimensionally processing the one-dimensional spectral data. In addition, in view of computer performance, the image is scaled to 100 × 100 on the basis of ensuring clarity and saved in a designated folder in jpg format.

Step 108: and dividing the image to obtain a training set and a testing set, and performing data enhancement processing on the training sample to improve the generalization capability of the model in order to reduce the overfitting risk because the spectrum difference between different samples is not large. The image data generator module is specifically adopted to realize random rotation, random scaling, horizontal/vertical movement, horizontal/vertical overturning, shearing transformation and channel shifting of the image.

In this embodiment, the random seed is set to be 3, and 120 sample image sets are divided into training sets and testing sets at a ratio of 8:2, so as to obtain 96 training sets and 24 testing sets. Because the spectrum difference between different samples is not large, in order to reduce the overfitting risk, the training samples are subjected to data enhancement processing, so that the generalization capability of the model is improved. Random rotation (angle 10), random scaling (scale 10%), horizontal/vertical panning (scale 10%), horizontal/vertical flipping (scale 10%), shear transformation (scale 10%) and channel shift (range 10%) of the image are implemented specifically using the ImageDataGenerator module. The first 21 pictures of the data enhanced training set are shown, for example, in gram angle and field images, as shown in fig. 6. The randomness of the samples can be greatly increased, and the robustness and the generalization performance of the model can be improved.

Step 109: and training the convolutional neural network by adopting the training set.

The designed CNN model consists of an input layer, two convolutional layers, two pooling layers and two fully-connected layers, as shown in fig. 7.

The input layer is used for inputting data, and a three-dimensional neuron is input. The convolution layer has a convolution kernel structure with trainable parameters, and can realize automatic feature extraction. The pooling layer is also called as down-sampling, so that network parameters and the size of data volume can be compressed, and the automatic compression of the image can be realized on the premise of ensuring the image characteristics. The fully-connected layer mainly realizes the mapping between abstract features extracted by the convolutional layer and the pooling layer to target output.

To avoid overfitting, batch normalization and regularization layers are added to the CNN structure to optimize the network. Specifically, after two batches of normalization are respectively added into the convolution operation, the data after the convolution processing is corrected, the training data are ensured to obey normal distribution, and the expression capacity of the network is improved. And adding a regularization layer before the full-connection layer, and setting the proportion of disconnected neurons as alpha.

Furthermore, the method of model compilation is matched. An Adam optimizer is selected, the loss function is a cross entropy loss function, such as a formula (7), and the evaluation index is accuracy, such as a formula (8).

In the formula, m_iThe number of prediction classes is the same as the number of actual classes.

And setting batch size batch _ size and training round epochs, verifying 20% of the training sample set, completing a training task, and observing the descending process of the training loss and the change condition of the accuracy.

In the present embodiment, since the size of each image is 100 × 100, the input layer input image shape is (100, 1).

The convolution kernel size of the first layer convolution layer is set to (5,5), the step size is 1, and the number of convolution kernels is 16. The convolution kernel size of the second convolutional layer is set to (3,3), the step size is 1, and the number of convolution kernels is 128. The zero padding of both convolutional layers is selected by default not to cross the edge sampling, and the activation functions are both relu functions.

And (3) respectively connecting a pooling layer behind each convolution layer, wherein the pooling layers are prepared by a common maximum pooling method, and the pooling size is (2, 2).

A batch normalization layer is connected between the convolution layer and the pooling layer.

Next, there is a regularization layer that sets the Dropout parameter α to 25% of the neurons to be randomly inactivated.

All data is flattened to one dimension, the first fully-connected layer setting parameter is set to 128, and the activation function is relu. The second full connectivity layer parameter is set to 2, i.e. corresponding to two classes, the activation function is softmax.

After the model is constructed, a matching training method is needed, and the method specifically comprises the following steps: the cross entropy is used as a loss function, an Adam optimizer is adopted, the initial learning rate is set to be 0.001, and the accuracy accurve is used as an evaluation index. Setting the batch sample number batch _ size to 64 and the training round epochs to 30, inputting the enhanced training set image into the model, randomly selecting 20% of the training set for verification, and obtaining an iteration curve of the loss value and the accuracy as shown in fig. 8. With the increase of the iteration times, the training loss and the verification loss gradually approach to 0, the training accuracy and the verification accuracy gradually approach to 1, the training process is relatively stable, and the constructed model has good performance.

Step 110: and inputting the test set into a trained convolutional neural network to obtain a classification result.

The 24 test samples are tested by using the constructed model, and the obtained prediction type and real type results are shown in fig. 9. It can be obviously seen that the prediction results are all correct, and the accuracy rate reaches 100%.

Fig. 10 is a schematic structural diagram of a one-dimensional spectrum classification system according to an embodiment of the present invention, and as shown in fig. 10, the system includes:

an original data obtaining module 201, configured to obtain near infrared spectrum original data of m samples to be detected, where the original data is recorded as X ═ X₁,x₂,…,x_i,…,x_nN is the total number of wavelength points, x_iIs a scalar;

an intercepting module 202, configured to intercept peak data of all single spectrum sequences in the near infrared spectrum raw data, where the intercepted peak data is recorded as X_i＝{x₁,…,x_i,…,x_tT is the number of the current wavelength points;

a preprocessing module 203, configured to preprocess the peak data;

a scaling module 204, configured to scale the preprocessed peak data to obtainTo one-dimensional spectral signal data

A polar coordinate encoding module 205, configured to perform polar coordinate encoding on the scaled one-dimensional spectral signal data;

a reconstruction module 206, configured to reconstruct the polar coordinate encoded one-dimensional spectral signal data to obtain a gram angle and field matrix and a gram angle difference field matrix;

a format conversion module 207 for storing the gram angle sum field matrix and the gram angle difference field matrix as an image;

an image dividing module 208, configured to divide the image to obtain a training set and a test set;

a training module 209, configured to train the convolutional neural network by using the training set;

and a classification result determining module 210, configured to input the test set into a trained convolutional neural network to obtain a classification result.

In summary, the one-dimensional spectrum classification method based on the gram angular field imaging and the CNN not only solves the key technical problem that the two-dimensional image processing method cannot be directly utilized to perform high-performance classification and identification on the one-dimensional spectrum sequence data, but also avoids the complex preprocessing processes of manual feature selection, dimension reduction and the like required by the traditional one-dimensional spectrum modeling. All the characteristics of the original one-dimensional spectrum can be reserved, and the method has the advantages of better automatic characteristic extraction capability, higher identification accuracy, strong generalization capability of the model and higher practical value. The method provides a novel and efficient scheme for intelligent qualitative analysis of the one-dimensional spectrum.

The embodiments in the present description are described in a progressive manner, each embodiment focuses on differences from other embodiments, and the same and similar parts among the embodiments are referred to each other. For the system disclosed by the embodiment, the description is relatively simple because the system corresponds to the method disclosed by the embodiment, and the relevant points can be referred to the method part for description.

The principles and embodiments of the present invention have been described herein using specific examples, which are provided only to help understand the method and the core concept of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed. In view of the above, the present disclosure should not be construed as limiting the invention.

Claims

1. A method for one-dimensional spectral classification, the method comprising:

intercepting peak data X of all single spectrum sequences in the near infrared spectrum original data_f；

Preprocessing the peak data;

zooming the preprocessed peak data to obtain one-dimensional spectral signal data;

dividing the image to obtain a training set and a test set;

training a convolutional neural network by adopting the training set;

2. The one-dimensional spectral classification method of claim 1, further comprising, after the dividing the image into a training set and a test set:

and performing data enhancement processing on the training set.

3. The one-dimensional spectral classification method of claim 1, wherein scaling the preprocessed peak data specifically comprises:

scaling the preprocessed peak data to [0,1], the formula is as follows:

wherein the content of the first and second substances,

4. The one-dimensional spectral classification method of claim 1, wherein the polar coordinate encoding of the scaled one-dimensional spectral signal data specifically employs the following formula:

wherein the content of the first and second substances,

5. The method for one-dimensional spectral classification according to claim 1, characterized in that the expression of the gram angle and field matrix is as follows:

and

6. The method for one-dimensional spectral classification according to claim 1, characterized in that the gram angle difference field matrix is expressed as follows:

and

7. The one-dimensional spectral classification method of claim 1, wherein the trained convolutional neural network comprises: the device comprises an input layer, a first coiling layer, a second coiling layer, a first pooling layer, a second pooling layer, a first full-connection layer and a second full-connection layer, wherein the input layer, the first coiling layer, the second coiling layer, the first pooling layer, the second pooling layer, the first full-connection layer and the second full-connection layer are sequentially connected.

8. The one-dimensional spectral classification method of claim 7, wherein the trained convolutional neural network further comprises: a normalization layer and a regularization layer, the normalization layer and the regularization layer being located intermediate the second volume base layer and the first pooling layer.

9. The one-dimensional spectrum classification method according to claim 1, wherein the loss function in the convolutional neural network is a cross entropy loss function, and the specific formula is as follows:

10. A one-dimensional spectral classification system, the system comprising:

the intercepting module is used for intercepting peak data of all single spectrum sequences in the near infrared spectrum original data, and the intercepted peak data is recorded as X_i＝{x₁,…,x_i,…,x_tT is the number of the current wavelength points;

the preprocessing module is used for preprocessing the peak data;