CN103018177A - Spectrogram abnormal sample point detection method based on random sampling agree set - Google Patents

Spectrogram abnormal sample point detection method based on random sampling agree set Download PDF

Info

Publication number
CN103018177A
CN103018177A CN2012105191839A CN201210519183A CN103018177A CN 103018177 A CN103018177 A CN 103018177A CN 2012105191839 A CN2012105191839 A CN 2012105191839A CN 201210519183 A CN201210519183 A CN 201210519183A CN 103018177 A CN103018177 A CN 103018177A
Authority
CN
China
Prior art keywords
model
sigma
spectrogram
median
sample point
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012105191839A
Other languages
Chinese (zh)
Inventor
王海燕
刘军
姜久英
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
JIANGSU YIPU TECHNOLOGY Co Ltd
Original Assignee
JIANGSU YIPU TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIANGSU YIPU TECHNOLOGY Co Ltd filed Critical JIANGSU YIPU TECHNOLOGY Co Ltd
Priority to CN2012105191839A priority Critical patent/CN103018177A/en
Publication of CN103018177A publication Critical patent/CN103018177A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Complex Calculations (AREA)

Abstract

The invention discloses a spectrogram abnormal sample point detection method based on a random sampling agree set. The spectrogram abnormal sample point detection method comprises the following steps of: eliminating part of an abnormal sample in advance so as to obtain a correcting sample set through principal component analysis on the base of a maximum posteriori probability random sampling agree set and starting from a given spectroscopic data, carrying out random sampling, establishing multivariated correcting model and evaluating a model property, and selecting an appropriate sample subset as an inner point set through random sampling for many times. The designed spectrogram abnormal sample point detection method based on the random sampling agree set, provided by the invention, has the advantages of being rapid, effective, high in accuracy and wide in application range.

Description

Spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling
Technical field
The present invention relates to Chemical Measurement Multivariate Correction model data processing technology field, particularly a kind of spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling.
Background technology
Along with the development of Modern Analytical Instrument, detection signal has been complete spectrogram by traditional single numerical value change, or even image.For spectroscopic data, dimension is normally very high with respect to the number of samples that gathers, and proofread and correct the regression problem Very Ill-conditioned this moment, and traditional monobasic single argument bearing calibration is difficult to these data analysis be the substitute is multivariate calibration methods [1]Chemical Measurement Multivariate Correction technology is directly utilized measuring-signal, sets up quantitative model between spectral signal and the sample concentration by dimensionality reduction, feature extraction, eigentransformation and multiple regression technology, to realize quantitative test.
Yet classical multivariate calibration methods is such as multiple linear regression, principal component regression, partial least squares regression [2-3]Usually be subject to especially the impact of exceptional sample point.Usually, compare with the most of sample of data centralization, exceptional sample be exactly have nothing to do or be wrong and abnormal sample in a way.Exceptional sample is generally caused by reasons such as instrument failure, acquisition condition factor, manual operation error or data self-defects.The existence of exceptional sample can affect the quality of model, and the model that causes setting up can't reflect the true relation of data, can't be predicted the outcome accurately.Therefore, need the impact of rejecting abnormalities sample point to set up sane model [4]
For principal component regression, generally adopt sane covariance to estimate to substitute traditional data covariance matrix, thereby realize sane principal component regression.Return for offset minimum binary (PLS), different sane PLS models are suggested, as with least square regression method involved in the PLS method, partly or entirely replace to certain robustness regression method, such as iteration heavy weighted least-squares (IRLS), minimum median quadratic method (LMS) and truncation least square method (LTS) etc.; Iteration is weighting offset minimum binary (IRPLS) method heavily [5]Inclined to one side robust M homing method [6]The RSIMPLS method [7]
Also have class methods to detect exceptional sample by cross validation, as based on staying a cross validation to obtain spectrum residual error corresponding to each sample or concentration residual error, judge that then the sample that residual error exceeds certain threshold value is exceptional sample [8]Similarly, model's Caro cross validation also is used to exceptional sample and detects, the method model model Caro cross validation model, then sort according to Prediction sum squares, and add up the frequency of occurrence of each sample in different models, final anomaly-based sample judges with the frequency of occurrence difference of normal sample whether sample is unusual.
Yet, based on the exceptional sample detection method of cross validation, may produce " covering " phenomenon, cause detecting or the wrong identification exceptional sample.It is relatively poor to detect effect when sane principal component regression or partial least squares regression are more for the data centralization exceptional sample.Unanimously collect based on the maximum a posteriori probability stochastic sampling [9], carry out the Multivariate Correction exceptional sample and detect, be a kind of new method, it can pass through constantly stochastic sampling, rejects the exceptional sample in the data, yet there are no proven technique and document.Various complicated cases in the real world applications such as observation condition, operation factors etc., all can cause the appearance of exceptional sample point.Various dissimilar exceptional sample points are different to the influence degree of calibration model, and impact how effectively to eliminate these exceptional samples is a difficult problem of Chemical Measurement Multivariate Correction technology.
[1]Martens?H,Nas?T.Multivariate?calibration.Wiley,1992
[2]Wold?H.Soft?modelling?by?latent?variables:the?nonlineariterative?partial?least?squares?approach.Perspectives?in?Probability?andStatistics.London:Academic?Press,1975
[3]de?Jong?S.SIMPLS:an?alternative?approach?squares?regression?topartial?least?squares?regression.Chemometrics?and?IntelligentLaboratory?Systems,1993,18(3),251-263
[4]Liang?Y?Z,Kvalheim?O?M.Robust?methods?for?multivariate?analysis-a?tutorial?review.Chemometrics?and?Intelligent?Laboratory?Systems,1996,32(1),1-10
[5]Cummins?D?J,Andrews?C?W.Iteratively?reweighted?partial?leastsquares:A?performance?analysis?by?monte?carlo?simulation.Journal?ofChemometrics,1995,9(6),489-507
[6]Serneels?S,Croux?C,Filzmoser?P,et?al.Partial?robust?Mregression.Chemometrics?and?Intelligent?Laboratory?Systems,2005,79(1-2),55-64
[7]Hubert?M,Branden?K.Robust?methods?for?partial?least?squaresregression.Journal?of?Chemometrics,2003,17,537-549
[8]Koshoubu?J,Iwata?T,Minami?S.Elimination?of?the?uninformativecalibration?sample?subset?in?the?modified?UVE-PLS?method.AnalyticalSciences,2001,17(2),319-322
[9]Torr?P.Bayesian?Model?Estimation?and?Selection?for?EpipolarGeometry?and?Generic?Manifold?Fitting.International?Journal?of?ComputerVision,2002,50(1),35-61
Summary of the invention
Technical matters to be solved by this invention provides a kind of fast effective, accuracy spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling high with applied widely.
The present invention has designed following technical scheme in order to solve the problems of the technologies described above: the present invention has designed a kind of spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling, comprises following concrete steps:
Step (1): given spectroscopic data X is carried out sane principal component analysis (PCA), detect and elimination exceptional spectrum sample point, obtain calibration samples collection X c, note calibration samples collection X cMiddle number of samples is m c
Step (2): the calibration samples collection X in described step (1) cOn carry out stochastic sampling, obtain current training set X s
Step (3): based on the training set X in the described step (2) sSet up the Multivariate Correction model, and computation model prediction residual error E s
Step (4): utilize Multivariate Correction model and model prediction residual error E in the step (3) s, the performance of evaluation model also draws the evaluation score, and with the calibration samples collection X in the step (1) cBe defined as interior point set u c
Step (5): repeating step (2) is to step (4) N time, and wherein N is defined as natural number, estimates score thereby obtain N, and selecting wherein to estimate the corresponding calibration samples collection of the most much higher first calibration model of score is final interior point set u m
As a kind of optimization method of the present invention: described step (1) comprises following concrete steps:
Step (11): set up model X=TP T, T[t wherein 1, t 2..., t a] TBe defined as score matrix, P[p 1, p 2..., p a] TBe defined as loading matrix, a is defined as the major component number;
Step (12): utilize formula t Median=median (t 1, t 2... t a) calculating principal component scores vector t 1, t 2..., t aIntermediate value t Median
Step (13): based on the intermediate value t in the step (12) MedianAnd following formula
s mad=1.4826median(|t 1-t median|,|t 2-t median|,…|t a-t median|)
Calculate the intermediate value absolute deviation s of data Mad
Step (14): utilize formula d i=| t i-t Median| calculate the error amount d between each principal component scores data and the intermediate value i, i=1 wherein ..., m c, reject d i〉=3 * s MadSample point, the data set that obtains is calibration samples collection X c
As a kind of optimization method of the present invention: described step (2) comprises following concrete processing:
At calibration set X cOn carry out stochastic sampling, pick out randomly m=m c/ 2 samples, wherein, m is defined as positive even numbers, forms sample set
Figure BDA00002538925500051
As current training set.
As a kind of optimization method of the present invention: described step (3) comprises following concrete processing:
Set up concentration value Multivariate Correction model Y s=X sB, and utilize formula
Figure BDA00002538925500052
Calculate the prediction residual error E of all samples s, wherein, i=1 ..., m c,
Figure BDA00002538925500053
Be defined as the actual concentration value,
Figure BDA00002538925500054
Be defined as model predication value, B is defined as Optimal Parameters.
As a kind of optimization method of the present invention: described step (4) comprises following concrete processing:
Step (41): utilize formula θ * = arg max P ( θ | e , I ) = arg max P ( e | θ , I ) P ( θ | I ) P ( e | I ) Calculate the maximum a posteriori probability of Multivariate Correction model parameter θ, wherein, I is defined as the probability distribution of data, and the posterior probability of θ is expressed as: P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) , Wherein γ is defined as the prior probability of interior point, and v is defined as the size of the error space;
Step (42): maximization P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) Be equivalent to and minimize following objective function:
Figure BDA00002538925500058
Step (43): minimize above-mentioned objective function c = Σ i ρ ( e i 2 ) , ρ ( e i 2 ) = e i σ , e i / σ 2 ≤ T T σ 2 , e i / σ 2 > T , Thereby obtain current optimum model parameter θ *, and the likelihood function value of interior point, exterior point if an exterior point likelihood function value corresponding to sample is put the likelihood function value greatly in the inner, judges that then this sample is the exceptional sample point, eliminate these exceptional sample points after, determine corresponding interior point set u c
The present invention compared with prior art has following advantage:
1. the designed spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling of the present invention is simple to operate, convenient and swift, need not to set in advance parameter;
2. the designed spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling of the present invention passes through repeatedly stochastic sampling, carries out model evaluation and determines consistent sample set, and no matter whether data centralization has exceptional sample, and algorithm all can be obtained satisfied performance; Especially, when the data centralization exceptional sample was more, algorithm can be rejected these exceptional samples effectively by repeatedly screening, improves the robustness of follow-up Multivariate Correction model;
3. the designed spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling of the present invention can be processed the exceptional sample point of number of different types, has stronger practicality.
Description of drawings
Fig. 1 is the designed process flow diagram based on the consistent spectrogram exceptional sample point detecting method that collects of stochastic sampling of the present invention.
Embodiment
The present invention is described in further detail below in conjunction with accompanying drawing:
As shown in Figure 1, the present invention has designed a kind of spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling, comprises following concrete steps:
Step (1): given spectroscopic data X is carried out sane principal component analysis (PCA), detect and elimination exceptional spectrum sample point, obtain calibration samples collection X c, note calibration samples collection X cMiddle number of samples is m c
Step (2): the calibration samples collection X in described step (1) cOn carry out stochastic sampling, obtain current training set X s
Step (3): based on the training set X in the described step (2) sSet up the Multivariate Correction model, and computation model prediction residual error E s
Step (4): utilize Multivariate Correction model and model prediction residual error E in the step (3) s, the performance of evaluation model also draws the evaluation score, and with the calibration samples collection X in the step (1) cBe defined as interior point set u c
Step (5): repeating step (2) is to step (4) N time, and wherein N is defined as natural number, estimates score thereby obtain N, and selecting wherein to estimate the corresponding calibration samples collection of the most much higher first calibration model of score is final interior point set u m
As a kind of optimization method of the present invention: described step (1) comprises following concrete steps:
Step (11): set up model X=TP T, T[t wherein 1, t 2..., t a] TBe defined as score matrix, P[p 1, p 2..., p a] TBe defined as loading matrix, a is defined as the major component number;
Step (12): utilize formula t Median=median (t 1, t 2... t a) calculating principal component scores vector t 1, t 2..., t aIntermediate value t Median
Step (13): based on the intermediate value t in the step (12) MedianAnd following formula
s mad=1.4826median(|t 1-t median|,|t 2-t median|,…|t a-t median|)
Calculate the intermediate value absolute deviation s of data Mad
Step (14): utilize formula d i=| t i-t Median| calculate the error amount d between each principal component scores data and the intermediate value i, i=1 wherein ..., m c, reject d i〉=3 * s MadSample point, the data set that obtains is calibration samples collection X c
As a kind of optimization method of the present invention: described step (2) comprises following concrete processing:
At calibration set X cOn carry out stochastic sampling, pick out randomly m=m c/ 2 samples, wherein, m is defined as positive even numbers, forms sample set
Figure BDA00002538925500081
As current training set.
As a kind of optimization method of the present invention: described step (3) comprises following concrete processing:
Set up concentration value Multivariate Correction model Y s=X sB, and utilize formula
Figure BDA00002538925500082
Calculate the prediction residual error E of all samples s, wherein, i=1 ..., m c, Be defined as the actual concentration value, Be defined as model predication value, B is defined as Optimal Parameters.
As a kind of optimization method of the present invention: described step (4) comprises following concrete processing:
Step (41): utilize formula θ * = arg max P ( θ | e , I ) = arg max P ( e | θ , I ) P ( θ | I ) P ( e | I ) Calculate the maximum a posteriori probability of Multivariate Correction model parameter θ, wherein, I is defined as the probability distribution of data, and the posterior probability of θ is expressed as: P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) , Wherein γ is defined as the prior probability of interior point, and v is defined as the size of the error space;
Step (42): maximization P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) Be equivalent to and minimize following objective function:
Figure BDA00002538925500088
Step (43): minimize above-mentioned objective function c = Σ i ρ ( e i 2 ) , ρ ( e i 2 ) = e i σ , e i / σ 2 ≤ T T σ 2 , e i / σ 2 > T , Thereby obtain current optimum model parameter θ *, and the likelihood function value of interior point, exterior point if an exterior point likelihood function value corresponding to sample is put the likelihood function value greatly in the inner, judges that then this sample is the exceptional sample point, eliminate these exceptional sample points after, determine corresponding interior point set u c
Described sane principal component analysis (PCA) detects the exceptional sample point and refers to given spectroscopic data, carries out principal component analysis (PCA) and obtains the principal component scores matrix; For this principal component scores data matrix, use sane 3 σPrinciple detects the exceptional sample point.Described principal component scores matrix refers to choose the data matrix that score vector corresponding to front several major component forms.Describedly choose front several major component, the determining of major component number generally determined according to cumulative proportion in ANOVA, also can specify in advance the major component number, as choose 10 or 20 major components.
Described sane 3 σPrinciple refers at first calculate the intermediate value of principal component scores vector, and based on the intermediate value absolute deviation of these median calculation data; Then calculate the error between each principal component scores vector and the intermediate value, be the exceptional sample point if this error amount, is then judged this sample greater than 3 times of the intermediate value absolute deviation.
Described stochastic sampling refers on the calibration samples collection that ascertains the number, and half the sample of picking out randomly the calibration set number forms sample set, as current training set.
Described Multivariate Correction model can be multiple linear regression model, principal component regression model, partial least square model or other nonlinear model.The performance of described evaluation model, refer to according to model prediction residual error, based on Bayes principle, posterior probability corresponding to maximization model parameter, can be converted into and minimize an objective function, thereby obtain current optimum model parameter, and the likelihood function value of interior point, exterior point, if an exterior point likelihood function value corresponding to sample is put the likelihood function value greatly in the inner, judge that then this sample is the exceptional sample point.After eliminating these exceptional sample points, determine the corresponding consistent sample set of interior point.
N stochastic sampling of described repetition and model evaluation process select to estimate the highest model of score, and determine that corresponding sample set is final interior point set, refer to stochastic sampling, set up Multivariate Correction model, these three steps repetitions of model evaluation N time; In this N time model evaluation, selecting the solution of posterior probability values maximum (value of objective function C is minimum) is final solution, and determines that interior corresponding sample set is final interior point set.
Above-described specific embodiment; purpose of the present invention, technical scheme and beneficial effect are further described; institute is understood that; the above only is specific embodiments of the invention; be not limited to the present invention; within the spirit and principles in the present invention all, any modification of making, be equal to replacement, improvement etc., all should be included within protection scope of the present invention.

Claims (5)

1. the spectrogram exceptional sample point detecting method based on the consistent collection of stochastic sampling is characterized in that, comprises following concrete steps:
Step (1): given spectroscopic data X is carried out sane principal component analysis (PCA), detect and elimination exceptional spectrum sample point, obtain calibration samples collection X c, note calibration samples collection X cMiddle number of samples is m c
Step (2): the calibration samples collection X in described step (1) cOn carry out stochastic sampling, obtain current training set X s
Step (3): based on the training set X in the described step (2) sSet up the Multivariate Correction model, and computation model prediction residual error E s
Step (4): utilize Multivariate Correction model and model prediction residual error E in the step (3) s, the performance of evaluation model also draws the evaluation score, and with the calibration samples collection X in the step (1) cBe defined as interior point set u c
Step (5): repeating step (2) is to step (4) N time, and wherein N is defined as natural number, estimates score thereby obtain N, and selecting wherein to estimate the corresponding calibration samples collection of the most much higher first calibration model of score is final interior point set u m
2. according to claim 1 based on the consistent spectrogram exceptional sample point detecting method that collects of stochastic sampling, it is characterized in that described step (1) comprises following concrete steps:
Step (11): set up model X=TP T, T[t wherein 1, t 2..., t a] TBe defined as score matrix, P[p 1, p 2..., p a] TBe defined as loading matrix, a is defined as the major component number;
Step (12): utilize formula t Median=median (t 1, t 2... t a) calculating principal component scores vector t 1, t 2..., t aIntermediate value t Median
Step (13): based on the intermediate value t in the step (12) MedianAnd following formula
s mad=1.4826median(|t 1-t median|,|t 2-t median|,…|t a-t median|)
Calculate the intermediate value absolute deviation s of data Mad
Step (14): utilize formula d i=| t i-t Median| calculate the error amount d between each principal component scores data and the intermediate value i, i=1 wherein ..., m c, reject d i〉=3 * s MadSample point, the data set that obtains is calibration samples collection X c
3. according to claim 2 based on the consistent spectrogram exceptional sample point detecting method that collects of stochastic sampling, it is characterized in that described step (2) comprises following concrete processing:
At calibration set X cOn carry out stochastic sampling, pick out randomly m=m c/ 2 samples, wherein, m is defined as positive even numbers, forms sample set
Figure FDA00002538925400021
As current training set.
4. according to claim 3 based on the consistent spectrogram exceptional sample point detecting method that collects of stochastic sampling, it is characterized in that described step (3) comprises following concrete processing:
Set up concentration value Multivariate Correction model Y s=X sB, and utilize formula
Figure FDA00002538925400022
Calculate the prediction residual error E of all samples s, wherein, i=1 ..., m c,
Figure FDA00002538925400023
Be defined as the actual concentration value,
Figure FDA00002538925400024
Be defined as model predication value, B is defined as Optimal Parameters.
5. according to claim 4 based on the consistent spectrogram exceptional sample point detecting method that collects of stochastic sampling, it is characterized in that described step (4) comprises following concrete processing:
Step (41): utilize formula θ * = arg max P ( θ | e , I ) = arg max P ( e | θ , I ) P ( θ | I ) P ( e | I ) Calculate the maximum a posteriori probability of Multivariate Correction model parameter θ, wherein, I is defined as the probability distribution of data, and the posterior probability of θ is expressed as: P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) , Wherein γ is defined as the prior probability of interior point, and v is defined as the size of the error space;
Step (42): maximization P ( θ | e , I ) ∝ Π i = 1 m c ( γ i 1 2 πσ exp ( - e i 2 2 σ 2 ) + ( 1 - γ i ) 1 v ) P ( θ | I ) Be equivalent to and minimize following objective function:
Figure FDA00002538925400033
Step (43): minimize above-mentioned objective function c = Σ i ρ ( e i 2 ) , ρ ( e i 2 ) = e i σ , e i / σ 2 ≤ T T σ 2 , e i / σ 2 > T , Thereby obtain current optimum model parameter θ *, and the likelihood function value of interior point, exterior point if an exterior point likelihood function value corresponding to sample is put the likelihood function value greatly in the inner, judges that then this sample is the exceptional sample point, eliminate these exceptional sample points after, determine corresponding interior point set u c
CN2012105191839A 2012-12-06 2012-12-06 Spectrogram abnormal sample point detection method based on random sampling agree set Pending CN103018177A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012105191839A CN103018177A (en) 2012-12-06 2012-12-06 Spectrogram abnormal sample point detection method based on random sampling agree set

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012105191839A CN103018177A (en) 2012-12-06 2012-12-06 Spectrogram abnormal sample point detection method based on random sampling agree set

Publications (1)

Publication Number Publication Date
CN103018177A true CN103018177A (en) 2013-04-03

Family

ID=47967030

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012105191839A Pending CN103018177A (en) 2012-12-06 2012-12-06 Spectrogram abnormal sample point detection method based on random sampling agree set

Country Status (1)

Country Link
CN (1) CN103018177A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103310122A (en) * 2013-07-10 2013-09-18 北京航空航天大学 Parallel random sampling consensus method and device
CN103645284A (en) * 2013-12-13 2014-03-19 林丽君 Quick smell fingerprint detection method based on improved RANSAC (random sample consensus) theory
CN104062008A (en) * 2014-06-13 2014-09-24 武汉理工大学 Method for removing abnormal spectrums in actually measured spectrum curve with integral measurement considered
CN104198529A (en) * 2014-08-07 2014-12-10 山东东阿阿胶股份有限公司 Method for distinguishing donkey-hide gelatin by utilizing electronic nose technology
CN106018515A (en) * 2016-06-08 2016-10-12 北京科技大学 A kind of electronic tongues signal characteristic extracting methods based on manifold learning
CN106018325A (en) * 2016-04-29 2016-10-12 南京富岛信息工程有限公司 Method for evaluating credibility of gasoline property modeling prediction result
CN110417610A (en) * 2018-04-30 2019-11-05 慧与发展有限责任合伙企业 Storage system postpones Outlier Detection
CN112016249A (en) * 2020-08-31 2020-12-01 华北电力大学 SCR denitration system bad data identification method based on optimized BP neural network
CN112784373A (en) * 2021-01-19 2021-05-11 河北大学 Fault early warning method for wind turbine generator gearbox
CN113740277A (en) * 2021-10-15 2021-12-03 北方民族大学 Environment safety analysis method based on spectral multi-component analysis
CN117093841A (en) * 2023-10-18 2023-11-21 中国科学院合肥物质科学研究院 Abnormal spectrum screening model determining method, device and medium for wheat transmission spectrum

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030155092A1 (en) * 2000-04-14 2003-08-21 Raija Badenlid Method in connection with the production of pulp, paper or paperboard
CN1519557A (en) * 2003-01-07 2004-08-11 三星电子株式会社 Method of removing abnormal data and blood component spectroscopy analysis system employing same
CN102305772A (en) * 2011-07-29 2012-01-04 江苏大学 Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030155092A1 (en) * 2000-04-14 2003-08-21 Raija Badenlid Method in connection with the production of pulp, paper or paperboard
CN1519557A (en) * 2003-01-07 2004-08-11 三星电子株式会社 Method of removing abnormal data and blood component spectroscopy analysis system employing same
CN102305772A (en) * 2011-07-29 2012-01-04 江苏大学 Method for screening characteristic wavelength of near infrared spectrum features based on heredity kernel partial least square method

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
P.H.S. TORR: "Bayesian Model Estimation and Selection for Epipolar Geometry", 《INTERNATIONAL JOURNAL OF COMPUTER VISION》 *
刘蓉等: "奇异点快速检测在牛奶成分近红外光谱测量中的应用", 《光谱学与光谱分析》 *
包鑫: "稳健回归技术及其在光谱分析中的应用", 《中国博士学位论文全文数据库 工程科技Ⅱ辑》 *
史波林等: "苹果内部品质近红外光谱检测的异常样本分析", 《农业机械学报》 *
陈达等: "近红外光谱的光谱干扰消除和奇异点识别", 《光谱学与光谱分析》 *

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103310122A (en) * 2013-07-10 2013-09-18 北京航空航天大学 Parallel random sampling consensus method and device
CN103310122B (en) * 2013-07-10 2016-04-20 北京航空航天大学 A kind of parallel stochastic sampling consistent method and device thereof
CN103645284A (en) * 2013-12-13 2014-03-19 林丽君 Quick smell fingerprint detection method based on improved RANSAC (random sample consensus) theory
CN103645284B (en) * 2013-12-13 2015-09-30 阿默思(天津)科技发展有限公司 A kind of quick smell fingerprint detection method based on improving RANSAC theory
CN104062008A (en) * 2014-06-13 2014-09-24 武汉理工大学 Method for removing abnormal spectrums in actually measured spectrum curve with integral measurement considered
CN104062008B (en) * 2014-06-13 2016-04-13 武汉理工大学 A kind of elimination method considering exceptional spectrum in the measured spectra curve of overall tolerance
CN104198529A (en) * 2014-08-07 2014-12-10 山东东阿阿胶股份有限公司 Method for distinguishing donkey-hide gelatin by utilizing electronic nose technology
CN106018325A (en) * 2016-04-29 2016-10-12 南京富岛信息工程有限公司 Method for evaluating credibility of gasoline property modeling prediction result
CN106018325B (en) * 2016-04-29 2019-02-01 南京富岛信息工程有限公司 A method of evaluation gasoline property modeling and forecasting credible result degree
CN106018515A (en) * 2016-06-08 2016-10-12 北京科技大学 A kind of electronic tongues signal characteristic extracting methods based on manifold learning
CN106018515B (en) * 2016-06-08 2019-01-15 北京科技大学 A kind of electronic tongues signal characteristic extracting methods based on manifold learning
CN110417610A (en) * 2018-04-30 2019-11-05 慧与发展有限责任合伙企业 Storage system postpones Outlier Detection
CN112016249A (en) * 2020-08-31 2020-12-01 华北电力大学 SCR denitration system bad data identification method based on optimized BP neural network
CN112784373A (en) * 2021-01-19 2021-05-11 河北大学 Fault early warning method for wind turbine generator gearbox
CN112784373B (en) * 2021-01-19 2022-03-01 河北大学 Fault early warning method for wind turbine generator gearbox
CN113740277A (en) * 2021-10-15 2021-12-03 北方民族大学 Environment safety analysis method based on spectral multi-component analysis
CN117093841A (en) * 2023-10-18 2023-11-21 中国科学院合肥物质科学研究院 Abnormal spectrum screening model determining method, device and medium for wheat transmission spectrum
CN117093841B (en) * 2023-10-18 2024-02-09 中国科学院合肥物质科学研究院 Abnormal spectrum screening model determining method, device and medium for wheat transmission spectrum

Similar Documents

Publication Publication Date Title
CN103018177A (en) Spectrogram abnormal sample point detection method based on random sampling agree set
US11144576B2 (en) Target class feature model
Sun et al. Quantifying variable interactions in continuous optimization problems
CN104867150A (en) Wave band correction change detection method of remote sensing image fuzzy clustering and system thereof
Yucel et al. Impact of non-normal random effects on inference by multiple imputation: A simulation assessment
CN108829878B (en) Method and device for detecting abnormal points of industrial experimental data
White et al. Methodological tools
CN105334185A (en) Spectrum projection discrimination-based near infrared model maintenance method
CN112036426A (en) Method and system for unsupervised anomaly detection and accountability using majority voting of high dimensional sensor data
CN111143768A (en) Air quality prediction algorithm based on ARIMA-SVM combined model
CN113516228A (en) Network anomaly detection method based on deep neural network
CN108470194B (en) Feature screening method and device
CN103426004A (en) Vehicle type recognition method based on error correction output code
CN113378473A (en) Underground water arsenic risk prediction method based on machine learning model
Wagner et al. OCD: Online convergence detection for evolutionary multi-objective algorithms based on statistical testing
Bacro et al. Testing the independence of maxima: From bivariate vectors to spatial extreme fields: Asymptotic independence of extremes
Weiß et al. Non-parametric analysis of serial dependence in time series using ordinal patterns
CN106198433B (en) Infrared spectroscopy method for qualitative analysis based on LM-GA algorithm
Dette et al. A simple test for the parametric form of the variance function in nonparametric regression
Hernández-López et al. Bayesian joint inference of hydrological and generalized error models with the enforcement of Total Laws
Favenza et al. A machine learning approach to GNSS scintillation detection: Automatic soft inspection of the events
CN115047853A (en) Micro fault detection method based on recursion standard variable residual error and kernel principal component analysis
Spagnolo et al. Forensic metrology: uncertainty of measurements in forensic analysis
CN106599587A (en) Factor extraction method for dam deformation analysis
Sutton-Charani et al. Training and evaluating classifiers from evidential data: Application to E 2 M decision tree pruning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130403