CN110163290A - A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode - Google Patents
A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode Download PDFInfo
- Publication number
- CN110163290A CN110163290A CN201910449506.3A CN201910449506A CN110163290A CN 110163290 A CN110163290 A CN 110163290A CN 201910449506 A CN201910449506 A CN 201910449506A CN 110163290 A CN110163290 A CN 110163290A
- Authority
- CN
- China
- Prior art keywords
- pollution sources
- mass balance
- source
- pollution
- contribution rate
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 239000000126 substance Substances 0.000 title claims abstract description 20
- 238000004458 analytical method Methods 0.000 claims abstract description 10
- 238000001228 spectrum Methods 0.000 claims abstract description 10
- 238000011109 contamination Methods 0.000 claims abstract description 7
- 238000012549 training Methods 0.000 claims abstract description 4
- 238000004364 calculation method Methods 0.000 claims description 15
- 239000003344 environmental pollutant Substances 0.000 claims description 12
- 231100000719 pollutant Toxicity 0.000 claims description 12
- 239000011159 matrix material Substances 0.000 claims description 10
- 238000009792 diffusion process Methods 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 238000000556 factor analysis Methods 0.000 description 2
- 238000000491 multivariate analysis Methods 0.000 description 2
- 230000002265 prevention Effects 0.000 description 2
- 238000013459 approach Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 229910001385 heavy metal Inorganic materials 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 125000005575 polycyclic aromatic hydrocarbon group Chemical group 0.000 description 1
- 238000000513 principal component analysis Methods 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/26—Government or public services
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Business, Economics & Management (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Tourism & Hospitality (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- General Engineering & Computer Science (AREA)
- General Health & Medical Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- General Business, Economics & Management (AREA)
- Strategic Management (AREA)
- Primary Health Care (AREA)
- Marketing (AREA)
- Human Resources & Organizations (AREA)
- Economics (AREA)
- Health & Medical Sciences (AREA)
- Educational Administration (AREA)
- Development Economics (AREA)
- Other Investigation Or Analysis Of Materials By Electrical Means (AREA)
Abstract
The invention discloses a kind of methods parsed based on quick clustering and Chemical mass balance mode to pollution sources, the identification of the factor loading based on pollution source spectrum is treated as first the identification problem of multiparameter model, pollution sources are identified using fast clustering analysis, carry out the pretreatment and initialization of data first;Then training sample pair is exported;Then the source resolution of characteristic contamination is realized using the calculating of the pollution sources contribution rate of Chemical mass balance mode method realization factor load using the disaggregated model identified.The present invention can be achieved fast and accurately to parse pollution sources.
Description
Technical field
The present invention relates to pollution Source apportionment fields, are based on quick clustering and Chemical mass balance mode more particularly to one kind
The method that pollution sources are parsed.
Background technique
Source resolution (source apportionment) is the influence and effect for studying pollution sources to ambient contamination
A kind of method.Current pollution Source apportionment can substantially be divided into three kinds: inventory analysis method, diffusion model and receptor model.
Inventory analysis method is to establish list by the source emission amount of observation and simulating pollution object, discharge characteristics and discharge geographical distribution etc.
A kind of Source Apportionment of model;Diffusion model belongs to prediction type model, it is the emissions data by inputting each pollution sources
The change in time and space situation of pollutant is predicted with relevant parameter information;Receptor model is then by the chemical and micro- of acceptor sample
Analysis determines that a kind of technology of each pollution sources contribution rate, final purpose are identification to the contributive pollution sources of receptor, and fixed
Amount calculates the share rate of each pollution sources.In all kinds of Source Apportionments based on receptor model chemical method, multivariate statistics analysis application
Simply, and the fingerprint chromatogram that each pollution sources are known in advance is not needed, does not need in advance to be monitored survey region pollution sources,
Only need acceptor sample monitoring data.Positive definite matrix factorized model belongs to the multivariate statistics side in contamination sources analytic technique
Method is a kind of non-negative based on element in split-matrix, the factor-analysis approach optimized using data standard deviation.The skill
The core concept of art is principal component analysis, and traditional main Composition Factor based on least square method analyzes (PCA), due to taking base
Acceptor sample data D is standardized in row or column, this will lead to the data distortion of Factor Analysis.They think simultaneously
PCA based on least square method is hidden to assume standard deviation of the sample data there are unreality with closing, and cannot obtain so as to cause PCA
To the optimal solution of minimum variance.Carry out source resolution using positive definite matrix factorized model to study, core link is non-break a promise
Beam Factorization, and the pollution identifing source carried out using factor loading matrix.The current research for pollution identifing source is seldom,
Main pollution source discrimination is exactly to realize qualitative comparison by the figure observation to source spectrum and factor loading, or pass through calculating
Source spectrum and the deviation of factor loading realize that sxemiquantitative is compared.These methods do not account for the nonlinear characteristic of pollution source spectrum more, know
Other result cannot really reflect factor loading and pollute the corresponding relationship of source spectrum.
Summary of the invention
Based on quick clustering and Chemical mass balance mode pollution sources are parsed the object of the present invention is to provide a kind of
Method.It is that factor loading identification process is regarded as to a Nonlinear Classification process, is a multiple-factor compressive classification problem,
It is a mode identification procedure.The present invention identifies pollution sources using base fast clustering analysis, and then recycles chemical matter
Amount balance Source Apportionment technical field.
Pollution Source Apportionment of the invention the following steps are included:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, is gathered using quick
Alanysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported;
(2) the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified
Calculating, realize the source resolution of characteristic contamination.
Preferably, the fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center,
And use formula
Calculate criterion function value Ji;
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiDistribution
Criterion function value J into class m, after then calculating distribution at this time2;
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
Preferably, the applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijIt is
The concentration of i pollutant in j pollution sources;αiFor uncertain error.
Preferably, it is as follows to carry out pollution sources calculation method for the applied chemistry mass balance method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
Preferably, the calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
Preferably, the formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
Preferably, the formula of the source contribution rate uncertainty deviation calculation basis is as follows:
Compared with prior art, the advantages of the invention has the following advantages: of the invention and beneficial effect are as follows:
(1) this method can fast and accurately trace the source of pollutant, practical, there is extensive popularization and application valence
Value copes with pollution sources accident for environmental management department, control pollution risk provides reliable technical guarantee.
(2) traditional contamination sources analytic technique, which can only be provided substantially, contributes biggish pollution source category to environment receptor,
And the size that specific emission source contributes receptor cannot be provided, lack the practical directive significance to work prevention and cure of pollution.Pass through this
Invention the method, discloses discharge of pollutant sources composition characteristic comprehensively, and screening can indicate the signature identification object of pollution source.
(3) present invention provides technical support to formulate pollution source apportionment countermeasure and environmental quality improvement, makes environment pipe from now on
When reason department faces pollution problem, it can be known rapidly by system, complete Source Apportionment and corresponding data information system
Other pollution sources, to carry out effective pollution prevention.
Specific embodiment
The present invention is further elaborated combined with specific embodiments below.
Embodiment
A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode, comprising the following steps:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, is gathered using quick
Alanysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported.
The fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center,
And use formula
Calculate criterion function value Ji;
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiDistribution
Criterion function value J into class m, after then calculating distribution at this time2;
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
(2) the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified
Calculating, realize the source resolution of characteristic contamination.
The applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijIt is
The concentration of i pollutant in j pollution sources;αiFor uncertain error.
It is as follows that the applied chemistry mass balance method carries out pollution sources calculation method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
The calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
The formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
The formula of the source contribution rate uncertainty deviation calculation basis is as follows:
The method parsed using the present invention to pollution sources may be implemented to polycyclic aromatic hydrocarbons contaminated source and heavy metal pollution
Source is parsed, and testing result and PAHs pollution sources dactylogram compare, and is coincide substantially.Because polluting the uncertainty of source spectrum, mould
Type speculates former spectrum within the scope of the reasonable error of dactylogram.
Used herein a specific example illustrates the principle and implementation of the invention, and above embodiments are said
It is bright to be merely used to help understand method and its core concept of the invention;At the same time, for those skilled in the art, foundation
Thought of the invention, there will be changes in the specific implementation manner and application range.In conclusion the content of the present specification is not
It is interpreted as limitation of the present invention.
Claims (7)
1. a kind of method parsed based on quick clustering and Chemical mass balance mode to pollution sources, which is characterized in that including with
Lower step:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, utilizes quick clustering point
Analysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported;
(2) meter of the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified
It calculates, realizes the source resolution of characteristic contamination.
2. the method according to claim 1 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
Be characterized in that, the fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center, are used in combination
FormulaCalculate criterion function value Ji;
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiIt is assigned to class
Criterion function value J in m, after then calculating distribution at this time2;
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
3. the method according to claim 1 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
It is characterized in that, the applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijFor jth pollution
The concentration of i pollutant in source;αiFor uncertain error.
4. the method according to claim 3 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
It is characterized in that, it is as follows that the applied chemistry mass balance method carries out pollution sources calculation method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
5. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
It is characterized in that, the calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
6. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
It is characterized in that, the formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
7. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode,
It is characterized in that, the formula of the source contribution rate uncertainty deviation calculation basis is as follows:
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910449506.3A CN110163290A (en) | 2019-05-28 | 2019-05-28 | A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910449506.3A CN110163290A (en) | 2019-05-28 | 2019-05-28 | A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110163290A true CN110163290A (en) | 2019-08-23 |
Family
ID=67629249
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910449506.3A Pending CN110163290A (en) | 2019-05-28 | 2019-05-28 | A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110163290A (en) |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111739587A (en) * | 2020-06-19 | 2020-10-02 | 中科三清科技有限公司 | Processing method and device of particulate matter monitoring data, storage medium and terminal |
CN113311081A (en) * | 2021-05-17 | 2021-08-27 | 清华大学 | Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint |
CN113657698A (en) * | 2020-05-12 | 2021-11-16 | 中国环境科学研究院 | Basin partition pollution source identification method based on multivariate statistics and receptor model |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105158353A (en) * | 2015-07-29 | 2015-12-16 | 北华航天工业学院 | Source apportionment method for polycyclic aromatic hydrocarbon pollution in soil |
CN105631203A (en) * | 2015-12-27 | 2016-06-01 | 北华航天工业学院 | Method for recognizing heavy metal pollution source in soil |
CN105844301A (en) * | 2016-04-05 | 2016-08-10 | 北华航天工业学院 | Soil heavy metal pollution source analysis method based on Bayes source identification |
CN105868479A (en) * | 2016-04-05 | 2016-08-17 | 北华航天工业学院 | Polycyclic aromatic hydrocarbon source apportionment method |
-
2019
- 2019-05-28 CN CN201910449506.3A patent/CN110163290A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105158353A (en) * | 2015-07-29 | 2015-12-16 | 北华航天工业学院 | Source apportionment method for polycyclic aromatic hydrocarbon pollution in soil |
CN105631203A (en) * | 2015-12-27 | 2016-06-01 | 北华航天工业学院 | Method for recognizing heavy metal pollution source in soil |
CN105844301A (en) * | 2016-04-05 | 2016-08-10 | 北华航天工业学院 | Soil heavy metal pollution source analysis method based on Bayes source identification |
CN105868479A (en) * | 2016-04-05 | 2016-08-17 | 北华航天工业学院 | Polycyclic aromatic hydrocarbon source apportionment method |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113657698A (en) * | 2020-05-12 | 2021-11-16 | 中国环境科学研究院 | Basin partition pollution source identification method based on multivariate statistics and receptor model |
CN111739587A (en) * | 2020-06-19 | 2020-10-02 | 中科三清科技有限公司 | Processing method and device of particulate matter monitoring data, storage medium and terminal |
CN111739587B (en) * | 2020-06-19 | 2021-02-02 | 中科三清科技有限公司 | Processing method and device of particulate matter monitoring data, storage medium and terminal |
CN113311081A (en) * | 2021-05-17 | 2021-08-27 | 清华大学 | Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint |
CN113311081B (en) * | 2021-05-17 | 2023-08-11 | 清华大学 | Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110163290A (en) | A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode | |
Xing et al. | Nonlinear response of ozone to precursor emission changes in China: a modeling study using response surface methodology | |
Deissenboeck et al. | Software quality models: Purposes, usage scenarios and requirements | |
CN105631203A (en) | Method for recognizing heavy metal pollution source in soil | |
CN105868479A (en) | Polycyclic aromatic hydrocarbon source apportionment method | |
CN106327087A (en) | Test task assignment method | |
Gong et al. | Evolutionary generation of test data for many paths coverage based on grouping | |
CN105631600A (en) | Method and system for publishing auto insurance survey tasks | |
CN102360335B (en) | Method for quantitatively evaluating value of security level DCS (Data Communication System) application software defect for nuclear power plant | |
CN103257917A (en) | Management method for software evaluation system | |
CN113282514B (en) | Method, device, computer equipment and storage medium for processing problem data | |
CN105844301A (en) | Soil heavy metal pollution source analysis method based on Bayes source identification | |
CN113537563B (en) | Pollution emergency management and control effect evaluation method and device | |
CN101008915A (en) | Automatic verification method of network chip | |
Fox et al. | An agent-based model for estimating emissions reduction equivalence among leak detection and repair programs | |
Roshan et al. | Review of search based techniques in software testing | |
CN115357906A (en) | Intelligent auxiliary evaluation method and system for network security level protection 2.0 | |
CN104182829A (en) | Instrument development reliability management and support system | |
CN102567351B (en) | Testing method and testing device for database modification effects | |
CN117999478A (en) | Methods, media, and systems for linking chromatographic data and metadata to compliance risk | |
Doğan et al. | A v-model software development application for sustainable and smart campus analytics domain | |
CN117216558A (en) | Coal quality analysis model training method and device, electronic equipment and storage medium | |
Silva et al. | Towards making safety-critical systems safer: learning from mistakes | |
CN110533392A (en) | A kind of realization method and system confirming capital settlement attribution data unit | |
CN108982402A (en) | The prediction technique and system of a kind of similar organic pollutant to environmental hazard |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20200117 Address after: 065000 Langfang City, Hebei province Guangyang District Edmonton East Road No. 133 Applicant after: Beihua Institute of Aerospace Technology Applicant after: Langfang Shengyao Information Technology Co., Ltd Address before: 065000 Langfang City, Hebei province Guangyang District Edmonton East Road No. 133 Applicant before: Beihua Institute of Aerospace Technology |
|
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190823 |