CN111242201A - Stellar spectrum small sample classification method based on confrontation generation network - Google Patents
Stellar spectrum small sample classification method based on confrontation generation network Download PDFInfo
- Publication number
- CN111242201A CN111242201A CN202010014622.5A CN202010014622A CN111242201A CN 111242201 A CN111242201 A CN 111242201A CN 202010014622 A CN202010014622 A CN 202010014622A CN 111242201 A CN111242201 A CN 111242201A
- Authority
- CN
- China
- Prior art keywords
- stellar
- classification
- generation network
- data
- network
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000001228 spectrum Methods 0.000 title claims abstract description 44
- 238000000034 method Methods 0.000 title claims abstract description 27
- 235000019892 Stellar Nutrition 0.000 title claims abstract description 24
- 230000003595 spectral effect Effects 0.000 claims abstract description 22
- 238000012549 training Methods 0.000 claims abstract description 12
- 230000003042 antagnostic effect Effects 0.000 claims abstract description 4
- 230000006870 function Effects 0.000 claims description 8
- 230000004913 activation Effects 0.000 claims description 5
- 238000013507 mapping Methods 0.000 claims description 2
- 230000008485 antagonism Effects 0.000 claims 1
- 238000013145 classification model Methods 0.000 abstract 2
- 238000010801 machine learning Methods 0.000 abstract 1
- 239000011159 matrix material Substances 0.000 description 7
- 238000013528 artificial neural network Methods 0.000 description 3
- 238000007781 pre-processing Methods 0.000 description 2
- PFFIDZXUXFLSSR-UHFFFAOYSA-N 1-methyl-N-[2-(4-methylpentan-2-yl)-3-thienyl]-3-(trifluoromethyl)pyrazole-4-carboxamide Chemical compound S1C=CC(NC(=O)C=2C(=NN(C)C=2)C(F)(F)F)=C1C(C)CC(C)C PFFIDZXUXFLSSR-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000013527 convolutional neural network Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000005065 mining Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
- 238000012545 processing Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
- 238000011425 standardization method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/214—Generating training patterns; Bootstrap methods, e.g. bagging or boosting
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a semi-supervised stellar spectrum classification method based on an antagonistic generation network, which comprises the following steps: the method is based on a classification method under the condition that known training samples are few. Since the number of spectra of a specific celestial body is known to be very small, it is desirable to classify the spectra of the specific celestial body, however, using conventional machine learning and statistics based classification methods requires a large amount of known data to train the classification model. Thus, unlike the supervised approach, the proposed method can make full use of a large number of unlabelled samples, which consists of two parts: a generator that captures the data distribution and a discriminator that determines whether the sample consists of actual data. And then training the trained discriminator by using a small amount of labeled data to obtain a better classification model. The performance of the model is evaluated under the condition of using real world spectral data and limited amount of spectral data, and experimental results show that the model is superior to other methods in the aspect of classification precision.
Description
The technical field is as follows:
the invention belongs to a statistical learning classification method under the condition of small training data quantity, and particularly relates to classification of stellar spectral data.
Background art:
with the development of astronomical observational instruments, astronomy has generated more and more spectral data in recent decades, such as a sialon digital sky patrol, a global celestial physical interferometer and a large sky space multi-target optical fiber spectral telescope. Classification of stars helps us to study the overall structure and evolution of galaxy. It is also effective for detecting a specific celestial body. We can obtain spectral data that contains continuous spectral band information and a range of ultraviolet to infrared bands. The stellar spectrum has a plurality of wave bands and continuous spectrum characteristics, and is beneficial to classification and identification. The search and identification of a particular celestial body is of great significance to astronomers.
From the perspective of data processing and mining, for data with a large amount of labeled information, a traditional supervised learning method can be used for firstly training a model of a neural network and then classifying unknown observed data. However, the number of observations we obtained for distinctive celestial bodies and rare astronomical phenomena is small. That is, the observed data corresponding to the distinctive celestial body and rare astronomical phenomena belong to a small sample as compared to a large number of known celestial bodies or astronomical phenomena. Therefore, we search for characteristic celestial bodies or rare astronomical phenomena in massive observation data, which is essentially a classification problem about small samples or sample imbalance. The problem with using conventional unsupervised learning methods is that the accuracy is not high, so improving the accuracy of the classifier with a small number of spectra is a challenging problem.
MK (Morgan et al, 1973) is a widely used spectral classification method based on spectral features of small-scale standard stars. MK systems classify the stellar spectra into seven classes based on the temperature of the stars going from high to low. However, MK star classification systems are primarily done manually by comparing spectra to a small number of standard stars, which is inefficient and unreliable when the number of spectra is very large. Bailer-Jones (1997) trained an Artificial Neural Network (ANN) directly from the observed spectra using physical parameters, which can be applied to spectral classification. BailerJones (2003) tends to use pattern recognition algorithms and outlines two alternative frameworks (parallel and hierarchical) for classification, with data from the Galactic survey task GAIA. The supervised methods widely used for classification can make full use of a priori knowledge to improve the performance of the classifier. However, a large number of labeled spectra are required to train the model, and obtaining a large number of labeled spectra is time consuming and expensive. Moreover, the number of spectra for a given celestial body is known to be generally small.
The invention content is as follows:
the invention discloses a method for classifying stellar spectrum small samples based on an adversary generation network, which learns the characteristics of a stellar spectrum by utilizing the strong characteristic learning capacity of the adversary generation network. And then, a small amount of labeled data is utilized to finely adjust the network, and finally the problem of classifying different stellar spectrums under a small sample is solved.
A method for classifying stellar spectrum small samples based on a challenge generation network comprises the following steps:
1) collecting the stellar spectral data, preprocessing the stellar spectral data, and preprocessing the data by using a z-score standardization method.
2) The method includes training a generator-countermeasure network (GAN) using unlabeled real spectra and pseudo-spectra generated by a generative model, wherein the trained generative model is used to simulate the generation of one-dimensional constellations and a trained discriminative model is used to discriminate whether the constellations are generated by the generator network.
3) In step 2, when the confrontation network model is generated in the training process, the input of the generated model is random noise pz(z)=μ(-1,1)。
4) In step 2, when the confrontation network model is generated in the training process, the generation model obtains a pseudo spectrum with the same size as the real spectrum by using an up-sampling scheme.
5) In step 2, when the generated network model is trained, the generated model uses the tanh activation function to represent nonlinearity, and the last layer of the generated model is connected to the input layer of the discriminant model.
6) After the generation of the confrontation network is trained, the generation model is deleted, the discrimination model is reserved, and the dimension output by the discrimination model is increased.
7) The retained output of the discriminant model is replaced with a Softmax function, which is typically used as a classifier for multi-class problems. The modified discrimination network can classify the stellar spectra into classes.
8) A small amount of tagged spectral data is used to fine tune the modified discrimination network.
9) After retraining the network with a small amount of tagged data, the network can perform multi-classification tasks
Description of the drawings:
FIG. 1 is a framework of an SCGAN network
FIG. 2 is a generated spectral image
FIG. 3 is a confusion matrix for different methods of SDSS stellar spectral classification
FIG. 4 is a graph of classification accuracy results of different algorithms compared on SDSS
Description of reference numerals:
the label 0 indicates a category for discriminating the final output of the network
The symbol X represents random noise
The notation Z denotes spurious spectra generated by noise
The specific implementation mode is as follows:
the specific implementation mode of the invention is as follows:
the invention is composed of two parts, the first part is to generate the training of the countermeasure network (GAN), the countermeasure generating network includes the generating model (G), which is mainly used to simulate and generate the one-dimensional stellar spectrum, and the discriminating model (D), the discriminating network is used to discriminate whether the stellar spectrum is generated by the generating network. The second part is to fine-tune the trained discrimination model (D) of the first part by using a small number of labeled stellar spectra. Two sections are detailed below.
A first part:
1) we use an upsampling scheme to obtain a pseudo-spectrum of the same size as the real spectrum
2) The input being random noise pz(z)=μ(-1,1)。
3) To obtain a pseudo-spectrum (X in fig. 1) of the same size as the real spectrum, we use upsampling to generate data of the same dimension as the real signal.
4) We use the tanh activation function in G to represent the non-linearity (Nair and Hinton, 2010). The last layer of G is connected to the input layer of D.
5) We train D and G using unlabeled real spectra and pseudo-spectra generated by G. The network framework of SCGAN is shown in fig. 1. In the SCGAN training process, the task of generating the network is to generate data as real as possible to deceive the discrimination network, and the main objective of the discrimination network is to discriminate the data generated by the generation network as false, i.e. to distinguish the real data from the generated false data. Thus, the generation network and the discrimination network form a dynamic game process:
6) after the generation of the antagonistic network is trained, fig. 2 is a spectral image generated by the generation model. Then we delete G and increase the dimension of the D output.
7) We design D as a one-dimensional convolution based classifier where the non-linear mapping is by the ReLU activation function.
8) And storing the trained model.
A second part:
9) the output of D is replaced with a Softmax function, which is typically used as a classifier for multi-classification problems. In this way, D can classify the stellar spectra into classes.
10) A small amount of labeled spectral data is used to fine-tune the first portion generated D.
11) In this case, the retrained D can be used to classify three spectral types: f type spectrum, G type spectrum and K type spectrum, achieving the purpose of multi-classification. Experimental results, fig. 3 shows a confusion matrix for different methods of SDSS stellate spectral classification.
12) We evaluated classification performance of SCGAN using F1-score and confusion matrix. F1-score is a harmonic mean of precision and recall, typically using a confusion metric to evaluate the performance of the method in the classification task. FIG. 4 is a graph of classification accuracy results comparing different algorithms on an SDSS.
Claims (4)
1. A method for classifying stellar spectrum small samples based on a challenge generation network is characterized by comprising the following steps:
step 1: learning a large number of unlabeled stellar spectral features by using a generation countermeasure generation network;
step 2: after G and D of the confrontation network are generated after training, when D cannot distinguish a real spectrum from a false spectrum, storing the trained model D;
and step 3: replacing the output of the trained D with a Softmax function;
and 4, step 4: d is finely adjusted by using a small amount of labeled spectral data, and classification of multiple categories of stellar spectra can be realized by retraining D.
2. The method for stellar spectrum small sample classification based on countermeasure generation network of claim 1, wherein the input of G is random noise pz(z)=μ(-1,1)。
3. The method for stellar spectral small sample classification based on antagonism generation network of claim 1, in which G represents nonlinearity using tanh activation function and connects the last layer of G to the input layer of D.
4. The method for classifying the stellar spectrum small samples based on the antagonistic generation network as claimed in claim 1, training the antagonistic generation network to delete G and D after training, increasing the dimension of D output, replacing the last layer of D with Softmax, and designing D as a classifier based on one-dimensional convolution, wherein the nonlinear mapping is through a ReLU activation function.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010014622.5A CN111242201A (en) | 2020-01-07 | 2020-01-07 | Stellar spectrum small sample classification method based on confrontation generation network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010014622.5A CN111242201A (en) | 2020-01-07 | 2020-01-07 | Stellar spectrum small sample classification method based on confrontation generation network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111242201A true CN111242201A (en) | 2020-06-05 |
Family
ID=70866385
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010014622.5A Pending CN111242201A (en) | 2020-01-07 | 2020-01-07 | Stellar spectrum small sample classification method based on confrontation generation network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111242201A (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111797916A (en) * | 2020-06-30 | 2020-10-20 | 东华大学 | Classification method of stellar spectra |
CN113609361A (en) * | 2021-08-20 | 2021-11-05 | 东北大学 | Data classification method based on Gaia system |
CN113656754A (en) * | 2021-09-16 | 2021-11-16 | 南京邮电大学 | Method and system for enhancing stellar spectral data |
CN114492526A (en) * | 2022-01-25 | 2022-05-13 | 太原科技大学 | SPEC-Net network architecture and identification method for multi-satellite spectrum automatic identification |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108846405A (en) * | 2018-04-11 | 2018-11-20 | 东莞迪赛软件技术有限公司 | Uneven medical insurance data classification method based on SSGAN |
CN109143353A (en) * | 2018-08-20 | 2019-01-04 | 电子科技大学 | A kind of pre-stack seismic waveform classification generating confrontation network based on depth convolution |
-
2020
- 2020-01-07 CN CN202010014622.5A patent/CN111242201A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108846405A (en) * | 2018-04-11 | 2018-11-20 | 东莞迪赛软件技术有限公司 | Uneven medical insurance data classification method based on SSGAN |
CN109143353A (en) * | 2018-08-20 | 2019-01-04 | 电子科技大学 | A kind of pre-stack seismic waveform classification generating confrontation network based on depth convolution |
Non-Patent Citations (2)
Title |
---|
姜斌等: "基于反贝叶斯学习的WDMS光谱自动识别研究", 《光谱学与光谱分析》 * |
李博等: "改进的卷积神经网络关系分类方法研究", 计算机科学与探索 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111797916A (en) * | 2020-06-30 | 2020-10-20 | 东华大学 | Classification method of stellar spectra |
CN113609361A (en) * | 2021-08-20 | 2021-11-05 | 东北大学 | Data classification method based on Gaia system |
CN113609361B (en) * | 2021-08-20 | 2023-11-14 | 东北大学 | Data classification method based on Gaia system |
CN113656754A (en) * | 2021-09-16 | 2021-11-16 | 南京邮电大学 | Method and system for enhancing stellar spectral data |
CN113656754B (en) * | 2021-09-16 | 2024-02-23 | 南京邮电大学 | Fixed star spectrum data enhancement method and system |
CN114492526A (en) * | 2022-01-25 | 2022-05-13 | 太原科技大学 | SPEC-Net network architecture and identification method for multi-satellite spectrum automatic identification |
CN114492526B (en) * | 2022-01-25 | 2022-11-22 | 太原科技大学 | SPEC-Net network architecture and identification method for multi-satellite spectrum automatic identification |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Wang et al. | Heterogeneous domain generalization via domain mixup | |
CN111242201A (en) | Stellar spectrum small sample classification method based on confrontation generation network | |
Deng et al. | Deep metric learning-based feature embedding for hyperspectral image classification | |
CN105184303B (en) | A kind of image labeling method based on multi-modal deep learning | |
CN113486851A (en) | Hyperspectral image classification method based on double-branch spectrum multi-scale attention network | |
Zhong et al. | A comparative study of image classification algorithms for Foraminifera identification | |
Champ et al. | A comparative study of fine-grained classification methods in the context of the LifeCLEF plant identification challenge 2015 | |
Longo et al. | Foreword to the focus issue on machine intelligence in astronomy and astrophysics | |
CN114444600A (en) | Small sample image classification method based on memory enhanced prototype network | |
Viertel et al. | Pattern recognition methodologies for pollen grain image classification: a survey | |
Bittencourt et al. | Feature selection by using classification and regression trees (CART) | |
CN116630700A (en) | Remote sensing image classification method based on introduction channel-space attention mechanism | |
Jose et al. | Deep hashing with hash center update for efficient image retrieval | |
Teimoorinia et al. | Assessment of astronomical images using combined machine-learning models | |
Zhou et al. | Masked Spectral-Spatial Feature Prediction for Hyperspectral Image Classification | |
Shang et al. | Faster and lighter meteorological satellite image classification by a lightweight channel-dilation-concatenation net | |
CN113343801B (en) | Automatic wireless signal modulation and identification method based on lightweight convolutional neural network | |
CN111191033A (en) | Open set classification method based on classification utility | |
Faza et al. | An initial study of deep learning for mangrove classification | |
Alawi et al. | Deep residual networks model for star-galaxy classification | |
CN109145950A (en) | A kind of high spectrum image Active Learning Method based on figure signal sampling | |
Abd El-Khalek et al. | A Robust Machine Learning Algorithm for Cosmic Galaxy Images Classification Using Neutrosophic Score Features. | |
CN108898157B (en) | Classification method for radar chart representation of numerical data based on convolutional neural network | |
CN113111774B (en) | Radar signal modulation mode identification method based on active incremental fine adjustment | |
Sarowar et al. | Performance evaluation of feature extraction and dimensionality reduction techniques on various machine learning classifiers |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20200605 |
|
WD01 | Invention patent application deemed withdrawn after publication |