CN102601059B - Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer) - Google Patents

Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer) Download PDF

Info

Publication number
CN102601059B
CN102601059B CN201210049492.4A CN201210049492A CN102601059B CN 102601059 B CN102601059 B CN 102601059B CN 201210049492 A CN201210049492 A CN 201210049492A CN 102601059 B CN102601059 B CN 102601059B
Authority
CN
China
Prior art keywords
particle
classification
class
similarity
collected
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210049492.4A
Other languages
Chinese (zh)
Other versions
CN102601059A (en
Inventor
李梅
张莉
黄正旭
高伟
粘慧青
董俊国
傅忠
周振
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Hexin Instrument Co Ltd
University of Shanghai for Science and Technology
Original Assignee
GUANGZHOU HEXIN ANALYTICAL INSTRUMENT CO Ltd
University of Shanghai for Science and Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by GUANGZHOU HEXIN ANALYTICAL INSTRUMENT CO Ltd, University of Shanghai for Science and Technology filed Critical GUANGZHOU HEXIN ANALYTICAL INSTRUMENT CO Ltd
Priority to CN201210049492.4A priority Critical patent/CN102601059B/en
Publication of CN102601059A publication Critical patent/CN102601059A/en
Application granted granted Critical
Publication of CN102601059B publication Critical patent/CN102601059B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method for classifying particle aerosol collected by a single particle aerosol mass spectrometer (SPAMS). The method comprises the following steps: firstly setting a class array InWM, wherein a specific line in each class determined by M/Z has a specific valve; numbering every particle class; firstly comparing a collected particle with the particles in the numbered classes; if the collected particle and the particles in the numbered classes do not meet the logic 'and' relation, skipping over the step and conducting the next class judgment; otherwise, if the collected particle belongs to one of particles in the numbered classes, numbering the collected particle; and classifying residual particles, which do not belong to the preset particle classes, for the second time by using a particle similarity computing method. According to the method, the first classification is conducted with a clear target to eliminate the ambiguous state of particles and improve the classification accuracy.

Description

A kind of sorting technique of the aerosol particle collecting for SPAMS
Technical field
The present invention relates to a kind of sorting technique of the aerosol particle collecting for SPAMS.
Background technology
Aerosol is solid or the common heterogeneous system forming of liquid being suspended in gas.Aerosol is due to its tremendous influence to local, region and global environment, and more and more by people, paid attention to.SPAMS can identify size and the chemical composition of particle as a kind of novel aerosol analytical technology from the aspect of individual particle, have high time and spatial resolution.SPAMS has huge data volume, and this is a sizable challenge for follow-up analysis and processing.SPAMS adopts the YAADA software kit based on MATLAB to carry out data analysis, has mainly comprised data importing, the supervisor of searching, classify, map in YAADA, and user can oneself modification or write function and expand its application.Aerocolloidal classification has great importance for research aerosol.In YAADA software, have the ART-2a(adaptive neural networks algorithm carrying) particle sorting technique.ART-2a sorting algorithm will be carried out the calculating of dot product between two between particle, if end value is greater than the threshold value of setting, two particles are classified as to a class, otherwise, if be less than the threshold values of setting, do not belong to a class.But crucial is the concrete classifications of our uncertain each class, needs us according to the mass spectrogram of each class, its classification of artificial differentiation.If data volume is very large, through after ART-2a classification, may there are hundreds and thousands of kinds, next can need us to remove to differentiate its specific category in a large amount of work, be a kind of coarse, sorting technique of thering is no purpose.
Summary of the invention
The sorting technique that the object of this invention is to provide a kind of aerosol particle collecting for SPAMS.
The technical solution used in the present invention is:
A kind of sorting technique of the aerosol particle collecting for SPAMS, comprise the following steps: first set classification matrix InWM, the particular row of the class of being determined by M/Z at each has specific value, each particle type is numbered, first the particle collecting compares with above-mentioned classification particle, if do not meet logic " and " relation, skip, carry out classification judgement next time, otherwise, particle belongs to a kind of in classification particle, be numbered, the remainder particulate that does not belong to predefined classification particle utilizes the computational methods of particle similarity to carry out secondary classification.
Preset classification particle vector.
The Information in Mass Spectra matrix unit of particle will be collected, every a line respectively with classification matrix multiple, obtain similarity and the most similar classification number, if particle similarity is greater than set threshold value and keeps logic " and " relation, finally must have such other all particles simultaneously.
Program is carried out n circulation, guarantees that the remaining particle that carries out secondary classification does not belong to any one of given classification.
By carrying out the calculating of similarity through sorted remainder particulate once, if similarity is greater than set threshold value, be classified as a class, then judge its classification.
The invention has the beneficial effects as follows:
This sorting technique is classified targetedly for the first time, gets rid of the state of particle " equivocal ", improves the degree of accuracy of classification; When judgement classification, with logical data, set up " and " property, accurately position particles kind; Program has generality, adapts to various types of particles, if the classification kind of the first round is more, just fewer through the remaining particle of classifying for the first time so, follow-up judgement is just simpler.
The specific embodiment
Below in conjunction with embodiment, the present invention is described further:
The mass spectral database that following table 1 is corresponding particulate:
Figure DEST_PATH_IMAGE004
operation principle of the present invention is:
Classification matrix size is that classification is counted n*MaxMZ, particular column (M/Z determines by mass-to-charge ratio) in each definite class has specific value, when particle peak information vector and such vectorial product are not 0, can determine that such particle may belong to such, further judgement need to determine whether this particle contains all mass-to-charge ratio M/Z that class vector comprises, and must guarantee to occupy maximum (the most similar) in the product of all class vectors, after having classified for the first time, utilize particle similarity based method, classify for the second time.
embodiment 1:
Weight matrix InWM(n*500) every a line represents a class, each row represents different mass-to-charge ratio (m/z), and front 250 classify anion information as, and rear 250 classify cation information as, on the position of the corresponding mass-to-charge ratio of each class, there is numerical value, and must guarantee that the mould length of every a line is 1.Such as for rich sodium potassium (NaK) particle, meet m/z=23 and m/z=39.Suppose that we set rich NaK particle in the first row of InWM, at the 1*(250+23 of InWM) there is a numerical value A position, at 1*(250+39) position have a numerical value B, and A, B meets.In experiment, we set A=0.7071, B=0.7071.For organic carbon, need meet m/z=27 and m/z=43 andnot m/z=70, that is to say must (250+27) of certain a line and (250+43) position have numerical value, must be without numerical value in (250+70) position, in experiment we still to set its two value be all 0.7071.
Described " and " relation, a kind of particle may determine by organizing mass-to-charge ratio jointly, as met the mass-to-charge ratio of every group, just thinks that it belongs to the particle of certain type.Rich sodium potassium (NaK) particle for example, must meet m/z=23 and m/z=39, in program, must meet so (250+23) of certain a line of InWM and (250+39) position have numerical value, as only have a position to have numerical value, we are not just classified as rich NaK particle.For another example contain molybdenum (Mo) particle, and m/z=98, or m/z=96, eligible m/z=95, we are just classified as containing molybdenum particle.The meaning is that we just think to contain molybdenum particle as long as (250+95) and (250+96) there is value at least one place and (250+98) has value.
The described particle collecting first with mass spectral database in class of particles compare, it is the negative ions peak area matrix that first obtains wanted analysing particulates, then by every provisional capital unitization of matrix, the every row of this matrix is carried out to dot product calculating with the transposition of weight matrix respectively, get the maximum in gained matrix after multiplying each other, if this maximum is less than the threshold value of setting or does not meet " and " relation, automatically carry out follow-up judgement.If this threshold value setting is higher, illustrate that the similarity of particle is higher.Suppose that we require particle similarity high, we can set threshold values is 0.8 or 0.9, and in experiment, we are set as 0.7 and can meet the demands.This threshold values is not fixed, and can, according to requirement of experiment, specifically adjust.
The concept of described similarity is the peak area vector that first we obtain particle, then the vector of particle is between two carried out to the calculating of similarity, if result of calculation is greater than the threshold values of setting, we think that this two particle belongs to same class.The setting of threshold values, is determined by experiment.
Described similarity is calculated, and supposes that the area vector of two particles is respectively;
Particle carries out similarity calculating between two, and result of calculation is greater than the threshold values that we set, and we are just classified as a class, otherwise, being less than the threshold values of setting, two particles just will not belong to a class.The set basis of threshold values specifically needs oneself to set, and in experiment, we set threshold values is 0.7.
Above-described embodiment is preferably embodiment of the present invention; but embodiments of the present invention are not restricted to the described embodiments; other any do not deviate from change, the modification done under Spirit Essence of the present invention and principle, substitutes, combination, simplify; all should be equivalent substitute mode, within being included in protection scope of the present invention.

Claims (1)

1. the sorting technique of the aerosol particle collecting for SPAMS, it is characterized in that: comprise the following steps: first set classification matrix InWM, the every a line of weight matrix InWM represents a class, each row represents different mass-to-charge ratioes, each definite class and by mass-to-charge ratio M/Z, determined specificly show specific value, each particle type is numbered, first the particle collecting compares with above-mentioned classification particle, if do not meet logic " and " relation, skip, carry out secondary classification judgement, otherwise, particle belongs to a kind of in classification particle, be numbered, the remainder particulate that does not belong to predefined classification particle utilizes the computational methods of particle similarity to carry out secondary classification, preset classification particle vector, the Information in Mass Spectra matrix unit of particle will be collected, every a line respectively with classification matrix multiple, obtain similarity and the most similar classification number, if particle similarity is greater than set threshold value and keeps logic " and " relation, finally must have such other all particles simultaneously, program is carried out n circulation since a subseries, guarantees that the remaining particle that carries out secondary classification does not belong to any one of given classification, by carrying out the calculating of similarity through sorted remainder particulate once, if similarity is greater than set threshold value, be classified as a class, then judge its classification.
CN201210049492.4A 2012-02-29 2012-02-29 Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer) Active CN102601059B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210049492.4A CN102601059B (en) 2012-02-29 2012-02-29 Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210049492.4A CN102601059B (en) 2012-02-29 2012-02-29 Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer)

Publications (2)

Publication Number Publication Date
CN102601059A CN102601059A (en) 2012-07-25
CN102601059B true CN102601059B (en) 2014-04-30

Family

ID=46519044

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210049492.4A Active CN102601059B (en) 2012-02-29 2012-02-29 Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer)

Country Status (1)

Country Link
CN (1) CN102601059B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104568681A (en) * 2015-01-29 2015-04-29 暨南大学 Real-time airborne fine particulate source monitoring method
CN105021515A (en) * 2015-07-22 2015-11-04 暨南大学 Mobile surveillance car-based single particle aerosol online mass spectrum detection method

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262039A (en) * 2011-04-27 2011-11-30 上海大学 Method and device for detecting indoor heavy metal pollution by using single particle aerosol mass spectrometer (SPAMS)

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8252519B2 (en) * 2010-08-12 2012-08-28 Phage Biocontrol Research, Llc Process for continuous production of bacteriophage

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262039A (en) * 2011-04-27 2011-11-30 上海大学 Method and device for detecting indoor heavy metal pollution by using single particle aerosol mass spectrometer (SPAMS)

Non-Patent Citations (6)

* Cited by examiner, † Cited by third party
Title
《Mixing state of biomass burning particles by single particle aerosol mass spectrometer in the urban area of PRD, China》;Xinhui Bi etc;《Atmospheric Environment》;20110630;第45卷(第20期);第3447-3453页 *
《气溶胶单粒子光谱的PLS聚类分析》;张子良等;《量子电子学报》;20120131;第29卷(第1期);第106-113页 *
《运用单颗粒气溶胶质谱技术初步研究广州大气矿尘污染》;李梅等;《环境科学研究》;20110630;第24卷(第6期);第632-636页 *
XinhuiBietc.《MixingstateofbiomassburningparticlesbysingleparticleaerosolmassspectrometerintheurbanareaofPRD China》.《Atmospheric Environment》.2011
张子良等.《气溶胶单粒子光谱的PLS聚类分析》.《量子电子学报》.2012,第29卷(第1期),第106-113页.
李梅等.《运用单颗粒气溶胶质谱技术初步研究广州大气矿尘污染》.《环境科学研究》.2011,第24卷(第6期),第632-636页.

Also Published As

Publication number Publication date
CN102601059A (en) 2012-07-25

Similar Documents

Publication Publication Date Title
CN105389480B (en) Multiclass imbalance genomics data iteration Ensemble feature selection method and system
CN103617429A (en) Sorting method and system for active learning
CN110503245A (en) A kind of prediction technique of air station flight large area risk of time delay
CN103258213A (en) Vehicle model dynamic identification method used in intelligent transportation system
CN103324939B (en) Skewed popularity classification and parameter optimization method based on least square method supporting vector machine technology
US20070133855A1 (en) Similar pattern searching apparatus, method of similar pattern searching, program for similar pattern searching, and fractionation apparatus
Li et al. Intelligent anti-money laundering solution based upon novel community detection in massive transaction networks on spark
Dubey et al. A systematic review on k-means clustering techniques
Schaefer et al. Towards automatic airborne pollen monitoring: From commercial devices to operational by mitigating class-imbalance in a deep learning approach
CN103425994A (en) Feature selecting method for pattern classification
CN102601059B (en) Method for classifying particle aerosol collected by SPAMS (Single Particle Aerosol Mass Spectrometer)
Xue et al. Multi long-short term memory models for short term traffic flow prediction
CN103902706B (en) Method for classifying and predicting big data on basis of SVM (support vector machine)
CN103942415A (en) Automatic data analysis method of flow cytometer
Thibault et al. Efficient statistical/morphological cell texture characterization and classification
CN107563324A (en) A kind of hyperspectral image classification method and device of the learning machine that transfinited based on core basis
CN104615910A (en) Method for predicating helix interactive relationship of alpha transmembrane protein based on random forest
CN110706004B (en) Farmland heavy metal pollutant tracing method based on hierarchical clustering
CN110008938B (en) Space target shape recognition method
CN103400159A (en) Target classification identifying method in quick mobile context and classifier obtaining method for target classification and identification in quick mobile context
CN108830432B (en) Unmanned aerial vehicle group action scheme searching method based on small amount of prior knowledge
Turton et al. Testing space-time and more complex hyperspace geographical analysis tools
He et al. A HK clustering algorithm based on ensemble learning
Qiong et al. Application of clustering algorithm in intelligent transportation data analysis
Wang Asymmetric random subspace method for imbalanced credit risk evaluation

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C56 Change in the name or address of the patentee
CP01 Change in the name or title of a patent holder

Address after: 200444 Baoshan District Road, Shanghai, No. 99

Patentee after: Shanghai University

Patentee after: Guangzhou Hexin Instruments Co., Ltd.

Address before: 200444 Baoshan District Road, Shanghai, No. 99

Patentee before: Shanghai University

Patentee before: Guangzhou Hexin Analytical Instrument Co., Ltd.