CN110163290A - A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode - Google Patents

A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode Download PDF

Info

Publication number
CN110163290A
CN110163290A CN201910449506.3A CN201910449506A CN110163290A CN 110163290 A CN110163290 A CN 110163290A CN 201910449506 A CN201910449506 A CN 201910449506A CN 110163290 A CN110163290 A CN 110163290A
Authority
CN
China
Prior art keywords
pollution sources
mass balance
source
pollution
contribution rate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910449506.3A
Other languages
Chinese (zh)
Inventor
陈锋
张云峰
刘凤明
曹张伟
孟凡生
梅凯
刘艳梅
司秀荣
周建华
丁玎
刘晓立
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
North China Institute of Aerospace Engineering
Original Assignee
North China Institute of Aerospace Engineering
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by North China Institute of Aerospace Engineering filed Critical North China Institute of Aerospace Engineering
Priority to CN201910449506.3A priority Critical patent/CN110163290A/en
Publication of CN110163290A publication Critical patent/CN110163290A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/26Government or public services

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • General Business, Economics & Management (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Human Resources & Organizations (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Educational Administration (AREA)
  • Development Economics (AREA)
  • Other Investigation Or Analysis Of Materials By Electrical Means (AREA)

Abstract

The invention discloses a kind of methods parsed based on quick clustering and Chemical mass balance mode to pollution sources, the identification of the factor loading based on pollution source spectrum is treated as first the identification problem of multiparameter model, pollution sources are identified using fast clustering analysis, carry out the pretreatment and initialization of data first;Then training sample pair is exported;Then the source resolution of characteristic contamination is realized using the calculating of the pollution sources contribution rate of Chemical mass balance mode method realization factor load using the disaggregated model identified.The present invention can be achieved fast and accurately to parse pollution sources.

Description

A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode
Technical field
The present invention relates to pollution Source apportionment fields, are based on quick clustering and Chemical mass balance mode more particularly to one kind The method that pollution sources are parsed.
Background technique
Source resolution (source apportionment) is the influence and effect for studying pollution sources to ambient contamination A kind of method.Current pollution Source apportionment can substantially be divided into three kinds: inventory analysis method, diffusion model and receptor model. Inventory analysis method is to establish list by the source emission amount of observation and simulating pollution object, discharge characteristics and discharge geographical distribution etc. A kind of Source Apportionment of model;Diffusion model belongs to prediction type model, it is the emissions data by inputting each pollution sources The change in time and space situation of pollutant is predicted with relevant parameter information;Receptor model is then by the chemical and micro- of acceptor sample Analysis determines that a kind of technology of each pollution sources contribution rate, final purpose are identification to the contributive pollution sources of receptor, and fixed Amount calculates the share rate of each pollution sources.In all kinds of Source Apportionments based on receptor model chemical method, multivariate statistics analysis application Simply, and the fingerprint chromatogram that each pollution sources are known in advance is not needed, does not need in advance to be monitored survey region pollution sources, Only need acceptor sample monitoring data.Positive definite matrix factorized model belongs to the multivariate statistics side in contamination sources analytic technique Method is a kind of non-negative based on element in split-matrix, the factor-analysis approach optimized using data standard deviation.The skill The core concept of art is principal component analysis, and traditional main Composition Factor based on least square method analyzes (PCA), due to taking base Acceptor sample data D is standardized in row or column, this will lead to the data distortion of Factor Analysis.They think simultaneously PCA based on least square method is hidden to assume standard deviation of the sample data there are unreality with closing, and cannot obtain so as to cause PCA To the optimal solution of minimum variance.Carry out source resolution using positive definite matrix factorized model to study, core link is non-break a promise Beam Factorization, and the pollution identifing source carried out using factor loading matrix.The current research for pollution identifing source is seldom, Main pollution source discrimination is exactly to realize qualitative comparison by the figure observation to source spectrum and factor loading, or pass through calculating Source spectrum and the deviation of factor loading realize that sxemiquantitative is compared.These methods do not account for the nonlinear characteristic of pollution source spectrum more, know Other result cannot really reflect factor loading and pollute the corresponding relationship of source spectrum.
Summary of the invention
Based on quick clustering and Chemical mass balance mode pollution sources are parsed the object of the present invention is to provide a kind of Method.It is that factor loading identification process is regarded as to a Nonlinear Classification process, is a multiple-factor compressive classification problem, It is a mode identification procedure.The present invention identifies pollution sources using base fast clustering analysis, and then recycles chemical matter Amount balance Source Apportionment technical field.
Pollution Source Apportionment of the invention the following steps are included:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, is gathered using quick Alanysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported;
(2) the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified Calculating, realize the source resolution of characteristic contamination.
Preferably, the fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center, And use formula
Calculate criterion function value Ji
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiDistribution Criterion function value J into class m, after then calculating distribution at this time2
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
Preferably, the applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijIt is The concentration of i pollutant in j pollution sources;αiFor uncertain error.
Preferably, it is as follows to carry out pollution sources calculation method for the applied chemistry mass balance method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
Preferably, the calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
Preferably, the formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
Preferably, the formula of the source contribution rate uncertainty deviation calculation basis is as follows:
Compared with prior art, the advantages of the invention has the following advantages: of the invention and beneficial effect are as follows:
(1) this method can fast and accurately trace the source of pollutant, practical, there is extensive popularization and application valence Value copes with pollution sources accident for environmental management department, control pollution risk provides reliable technical guarantee.
(2) traditional contamination sources analytic technique, which can only be provided substantially, contributes biggish pollution source category to environment receptor, And the size that specific emission source contributes receptor cannot be provided, lack the practical directive significance to work prevention and cure of pollution.Pass through this Invention the method, discloses discharge of pollutant sources composition characteristic comprehensively, and screening can indicate the signature identification object of pollution source.
(3) present invention provides technical support to formulate pollution source apportionment countermeasure and environmental quality improvement, makes environment pipe from now on When reason department faces pollution problem, it can be known rapidly by system, complete Source Apportionment and corresponding data information system Other pollution sources, to carry out effective pollution prevention.
Specific embodiment
The present invention is further elaborated combined with specific embodiments below.
Embodiment
A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode, comprising the following steps:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, is gathered using quick Alanysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported.
The fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center, And use formula
Calculate criterion function value Ji
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiDistribution Criterion function value J into class m, after then calculating distribution at this time2
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
(2) the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified Calculating, realize the source resolution of characteristic contamination.
The applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijIt is The concentration of i pollutant in j pollution sources;αiFor uncertain error.
It is as follows that the applied chemistry mass balance method carries out pollution sources calculation method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
The calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
The formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
The formula of the source contribution rate uncertainty deviation calculation basis is as follows:
The method parsed using the present invention to pollution sources may be implemented to polycyclic aromatic hydrocarbons contaminated source and heavy metal pollution Source is parsed, and testing result and PAHs pollution sources dactylogram compare, and is coincide substantially.Because polluting the uncertainty of source spectrum, mould Type speculates former spectrum within the scope of the reasonable error of dactylogram.
Used herein a specific example illustrates the principle and implementation of the invention, and above embodiments are said It is bright to be merely used to help understand method and its core concept of the invention;At the same time, for those skilled in the art, foundation Thought of the invention, there will be changes in the specific implementation manner and application range.In conclusion the content of the present specification is not It is interpreted as limitation of the present invention.

Claims (7)

1. a kind of method parsed based on quick clustering and Chemical mass balance mode to pollution sources, which is characterized in that including with Lower step:
(1) the identification problem for the identification of the factor loading based on pollution source spectrum being treated as multiparameter model, utilizes quick clustering point Analysis identifies pollution sources, carries out the pretreatment and initialization of data first;Then training sample pair is exported;
(2) meter of the pollution sources contribution rate of Chemical mass balance mode method realization factor load is utilized using the disaggregated model identified It calculates, realizes the source resolution of characteristic contamination.
2. the method according to claim 1 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, Be characterized in that, the fast clustering analysis to pollution sources carry out know method for distinguishing the following steps are included:
A) k object optionally in n data object is as initial cluster center;
B) it is assigned to by nearest cluster according to it at a distance from each cluster center to remaining each object;
C) formula Z is utilizedj=∑ienjXi, wherein i=1,2 ..., n;J=1,2 ..., k recalculate each class center, are used in combination FormulaCalculate criterion function value Ji
D) new distribution formula is calculated: assuming that xiIn class n, if | | xi-zm||2< | | xi-zn||2, by sample xiIt is assigned to class Criterion function value J in m, after then calculating distribution at this time2
If e) | J1-J2| < ε stops calculating, otherwise c=c+1, repeats c), d), e) step.
3. the method according to claim 1 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, It is characterized in that, the applied chemistry mass balance method carries out the following formula of pollution sources calculation basis:
Wherein, CiFor concentration of i-th pollutant in receptor, mjIt is jth pollution sources to the contribution rate of pollutant, xijFor jth pollution The concentration of i pollutant in source;αiFor uncertain error.
4. the method according to claim 3 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, It is characterized in that, it is as follows that the applied chemistry mass balance method carries out pollution sources calculation method:
The first step, data prediction;
Second step, the calculating of source contribution rate;
Third step, the calculating of source contribution rate iteration deviation;
4th step, source contribution rate uncertainty deviation calculate.
5. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, It is characterized in that, the calculation basis formula of the source contribution rate is as follows:
Wherein: S is pollution sources finger-print;STFor its transposed matrix;For effective variance diagonal matrix;D is sample concentration.
6. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, It is characterized in that, the formula of the calculation basis of the source contribution rate iteration deviation is as follows:
Until deviation is less than 0.0001.
7. the method according to claim 4 that pollution sources are parsed based on quick clustering and Chemical mass balance mode, It is characterized in that, the formula of the source contribution rate uncertainty deviation calculation basis is as follows:
CN201910449506.3A 2019-05-28 2019-05-28 A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode Pending CN110163290A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910449506.3A CN110163290A (en) 2019-05-28 2019-05-28 A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910449506.3A CN110163290A (en) 2019-05-28 2019-05-28 A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode

Publications (1)

Publication Number Publication Date
CN110163290A true CN110163290A (en) 2019-08-23

Family

ID=67629249

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910449506.3A Pending CN110163290A (en) 2019-05-28 2019-05-28 A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode

Country Status (1)

Country Link
CN (1) CN110163290A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111739587A (en) * 2020-06-19 2020-10-02 中科三清科技有限公司 Processing method and device of particulate matter monitoring data, storage medium and terminal
CN113311081A (en) * 2021-05-17 2021-08-27 清华大学 Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint
CN113657698A (en) * 2020-05-12 2021-11-16 中国环境科学研究院 Basin partition pollution source identification method based on multivariate statistics and receptor model

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105158353A (en) * 2015-07-29 2015-12-16 北华航天工业学院 Source apportionment method for polycyclic aromatic hydrocarbon pollution in soil
CN105631203A (en) * 2015-12-27 2016-06-01 北华航天工业学院 Method for recognizing heavy metal pollution source in soil
CN105844301A (en) * 2016-04-05 2016-08-10 北华航天工业学院 Soil heavy metal pollution source analysis method based on Bayes source identification
CN105868479A (en) * 2016-04-05 2016-08-17 北华航天工业学院 Polycyclic aromatic hydrocarbon source apportionment method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105158353A (en) * 2015-07-29 2015-12-16 北华航天工业学院 Source apportionment method for polycyclic aromatic hydrocarbon pollution in soil
CN105631203A (en) * 2015-12-27 2016-06-01 北华航天工业学院 Method for recognizing heavy metal pollution source in soil
CN105844301A (en) * 2016-04-05 2016-08-10 北华航天工业学院 Soil heavy metal pollution source analysis method based on Bayes source identification
CN105868479A (en) * 2016-04-05 2016-08-17 北华航天工业学院 Polycyclic aromatic hydrocarbon source apportionment method

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113657698A (en) * 2020-05-12 2021-11-16 中国环境科学研究院 Basin partition pollution source identification method based on multivariate statistics and receptor model
CN111739587A (en) * 2020-06-19 2020-10-02 中科三清科技有限公司 Processing method and device of particulate matter monitoring data, storage medium and terminal
CN111739587B (en) * 2020-06-19 2021-02-02 中科三清科技有限公司 Processing method and device of particulate matter monitoring data, storage medium and terminal
CN113311081A (en) * 2021-05-17 2021-08-27 清华大学 Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint
CN113311081B (en) * 2021-05-17 2023-08-11 清华大学 Pollution source identification method and device based on three-dimensional liquid chromatography fingerprint

Similar Documents

Publication Publication Date Title
CN110163290A (en) A method of pollution sources are parsed based on quick clustering and Chemical mass balance mode
Xing et al. Nonlinear response of ozone to precursor emission changes in China: a modeling study using response surface methodology
Deissenboeck et al. Software quality models: Purposes, usage scenarios and requirements
CN105631203A (en) Method for recognizing heavy metal pollution source in soil
CN105868479A (en) Polycyclic aromatic hydrocarbon source apportionment method
CN106327087A (en) Test task assignment method
Gong et al. Evolutionary generation of test data for many paths coverage based on grouping
CN105631600A (en) Method and system for publishing auto insurance survey tasks
CN102360335B (en) Method for quantitatively evaluating value of security level DCS (Data Communication System) application software defect for nuclear power plant
CN103257917A (en) Management method for software evaluation system
CN113282514B (en) Method, device, computer equipment and storage medium for processing problem data
CN105844301A (en) Soil heavy metal pollution source analysis method based on Bayes source identification
CN113537563B (en) Pollution emergency management and control effect evaluation method and device
CN101008915A (en) Automatic verification method of network chip
Fox et al. An agent-based model for estimating emissions reduction equivalence among leak detection and repair programs
Roshan et al. Review of search based techniques in software testing
CN115357906A (en) Intelligent auxiliary evaluation method and system for network security level protection 2.0
CN104182829A (en) Instrument development reliability management and support system
CN102567351B (en) Testing method and testing device for database modification effects
CN117999478A (en) Methods, media, and systems for linking chromatographic data and metadata to compliance risk
Doğan et al. A v-model software development application for sustainable and smart campus analytics domain
CN117216558A (en) Coal quality analysis model training method and device, electronic equipment and storage medium
Silva et al. Towards making safety-critical systems safer: learning from mistakes
CN110533392A (en) A kind of realization method and system confirming capital settlement attribution data unit
CN108982402A (en) The prediction technique and system of a kind of similar organic pollutant to environmental hazard

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20200117

Address after: 065000 Langfang City, Hebei province Guangyang District Edmonton East Road No. 133

Applicant after: Beihua Institute of Aerospace Technology

Applicant after: Langfang Shengyao Information Technology Co., Ltd

Address before: 065000 Langfang City, Hebei province Guangyang District Edmonton East Road No. 133

Applicant before: Beihua Institute of Aerospace Technology

RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190823