KR101145397B1

KR101145397B1 - Red tide blooms prediction method using fuzzy reasoning and naive bayes classifier

Info

Publication number: KR101145397B1
Application number: KR1020110055571A
Authority: KR
Inventors: 박선; 임정수; 이성호; 김영주; 이성로
Original assignee: 목포대학교산학협력단
Priority date: 2011-06-09
Filing date: 2011-06-09
Publication date: 2012-05-15

Abstract

PURPOSE: A red tide generation predicting method which uses a naive Bayes classifier and a fuzzy inference are provided to increase the accuracy of prediction rate of a red tide by using a naive Bayes classifier. CONSTITUTION: A marine environment data is normalized as learning data which is suitable for a fuzzy inference. A fuzzy reasoning rule is generated by using learning material(230). A red tide plankton density is predicted for input data. A red tide generation is predicted by estimating probability value based on a naive Bayes method(300). The red tide plankton density is controlled based on the prediction result.

Description

Red tide blooms prediction method using Fuzzy Reasoning and Naive Bayes Classifier}

본 발명은 적조 관련 기술에 관한 것으로, 특히 적조 발생 예측 기술에 관한 것이다.The present invention relates to a red tide related technology, and more particularly to a red tide occurrence prediction technology.

적조(red tide)란 유해적조 생물의 일시적인 대량 번식으로 바닷물의 색깔이 적색이나 황색으로 변하며 수산업에 피해를 일으키는 현상이다. 국내에는 60여 종의 적조 생물들이 있으며, 이중 수산업에 피해를 미치는 유해적조 생물은 총 9종이 있다. 국내에서는 특히 coclodinium polykrikoides(coclodinium p.)에 의한 양식업의 피해가 가장 크다. 이러한 적조에 의한 수산업의 피해가 매년 발생함에 따라 적조에 대한 많은 연구가 진행되고 있다. 그러나 주로 연구되는 분야는 적조원인 생물의 생리적 특성 및 환경변화에 대한 생물의 반응으로 상당부분 연구되어 활용하는 수준에 있다. 특히, 적조의 발생 시기를 예측하여 대비를 하면, 적조에 의한 수산업의 피해를 최소화시킬 수 있을 것이다. 그러나 적조 발생의 예측에 대한 국내의 연구는 아직 미흡한 편에 있다.Red tide is a temporary mass breeding of harmful red tide organisms that causes the ocean to turn red or yellow, causing damage to fisheries. There are about 60 kinds of red tide creatures in Korea, and there are nine kinds of harmful tide creatures that damage fisheries. In Korea, aquaculture is most damaging due to coclodinium polykrikoides (coclodinium p.). As the damage of fisheries caused by red tide occurs every year, many researches on red tide have been conducted. However, most of the researched areas are at the level of being studied and utilized in large part due to the physiological characteristics of living organisms and the response of organisms to environmental changes. In particular, if the prediction of the occurrence of the red tide is prepared, it will be possible to minimize the damage of the fisheries caused by the red tide. However, domestic studies on the prediction of red tide occurrence are still inadequate.

본 발명은 해양환경자료를 이용하여 적조 발생과 그 발생 생물 밀도를 예측할 수 있는 방법을 제공함을 목적으로 한다.An object of the present invention is to provide a method for predicting the occurrence of red tide and its density by using marine environment data.

전술한 기술적 과제를 달성하기 위한 본 발명의 일 양상에 따른 나이브 베이스 분류자와 퍼지 추론을 이용한 적조 발생 예측 방법은 적조 발생시의 수온, 기온, 강수량 중 적어도 하나를 포함하는 해양환경자료를 퍼지 추론에 적합한 학습 자료로 정규화하는 전처리 단계, 상기 학습 자료를 이용하여 퍼지 추론 규칙을 생성하고, 생성된 규칙을 이용하여 입력 자료에 대해 적조 발생시의 적조생물 밀도를 예측하는 퍼지 추론 단계, 상기 학습 자료에 대한 베이스 정리를 이용하여 상기 입력 자료에 대한 분류표시의 사후 확률 값을 추정하여 적조 발생을 예측하는 나이브 베이스 분류자 단계, 및 상기 나이브 베이스 분류자 단계의 예측 결과를 기준으로 상기 퍼지 추론 단계에서 예측된 적조생물 밀도를 조정하는 후처리 단계를 포함한다.According to an aspect of the present invention, a method of predicting red tide occurrence using a naïve base classifier and fuzzy inference may include marine environment data including at least one of water temperature, temperature, and precipitation when a red tide occurs. A preprocessing step of normalizing the appropriate learning material, a fuzzy inference rule for generating fuzzy inference rules using the learning data, and a fuzzy inference step for predicting red tide density at the time of occurrence of red tide with respect to the input data using the generated rules, A naive base classifier step of predicting red tide occurrence by estimating a posterior probability value of a classification mark on the input data using a base theorem, and the fuzzy inference step estimated based on a prediction result of the naive base classifier step. A post-treatment step to adjust the red tide density.

나아가 상기 퍼지 추론 단계는 상기 학습 자료를 유사 특징 집단으로 군집하는 단계, 상기 군집된 자료에 대해 퍼지 멤버십 함수를 이용하여 멤버십을 계산하는 단계, 및 상기 계산된 멤버십에 대해 퍼지 모델을 이용하여 퍼지 규칙을 생성하고, 상기 생성된 퍼지 규칙을 상기 입력 자료에 적용하여 적조생물 밀도를 예측하는 단계를 포함한다.Further, the fuzzy inference step includes clustering the learning material into similar feature groups, calculating membership using a fuzzy membership function for the clustered data, and fuzzy rules using a fuzzy model for the calculated membership. Generating a value and applying the generated fuzzy rule to the input data to predict red tide density.

본 발명에 따른 나이브 베이스 분류자와 퍼지 추론을 이용한 적조 발생 예측 방법은 적조 발생의 예측률의 정확성을 높일 수 있는 효과를 창출하며, 나아가 발생 생물의 생물 밀도율에 대해서도 예측할 수 있는 효과를 창출한다.The red tide occurrence prediction method using the naive base classifier and the fuzzy inference according to the present invention creates an effect that can increase the accuracy of the prediction rate of the red tide occurrence, and also creates a predictable effect on the bio density of the generated organism.

도 1은 본 발명의 일 실시예에 따른 나이브 베이스 분류자와 퍼지 추론을 이용한 적조 발생 예측 설명을 위한 블록도.1 is a block diagram illustrating red tide occurrence prediction using a naive base classifier and fuzzy inference according to an embodiment of the present invention.

전술한, 그리고 추가적인 본 발명의 양상들은 첨부된 도면을 참조하여 설명되는 바람직한 실시예들을 통하여 더욱 명백해질 것이다. 이하에서는 본 발명을 이러한 실시예를 통해 당업자가 용이하게 이해하고 재현할 수 있도록 상세히 설명하기로 한다.The foregoing and further aspects of the present invention will become more apparent through the preferred embodiments described with reference to the accompanying drawings. Hereinafter, the present invention will be described in detail to enable those skilled in the art to easily understand and reproduce the present invention.

우선, 퍼지 추론(fuzzy reasoning)과 나이브 베이스 분류자(naive Bayes classifier)에 대해 설명한다. 퍼지 추론은 애매하게 표현된 지식과 정보를 사용하여 다른 정보를 찾아내는 방법을 말한다. 퍼지 이론에서 추론은 몇 개의 퍼지명제에서 연역적으로 각각 하나의 근사적인 퍼지명제를 유도하는 것을 기본으로 한다. 이 같은 이유로 퍼지 추론 또는 근사적 추론(approximate reasoning)이라 한다. 퍼지 집합(fuzzy set)은 기존의 집합을 퍼지 논리 개념을 사용해 확장한 것으로, 각 원소는 그 집합에 속하는 정도가 존재한다. 여기서 각 원소가 그 집합에 속하는 정도를 멤버십 정도 혹은 소속 정도(소속도)라 한다. 퍼지집합은 단위구간 [0, 1] 사이의 실수 값을 멤버십 정도로 취하는 원소들로 구성된 집합으로써, 수학식 1과 같이 나타낸다. 이때 원소가 집합에 완전히 속하는 경우를 1, 전혀 속하지 않는 경우를 0으로 나타낸다.First, fuzzy reasoning and naive Bayes classifier are described. Fuzzy reasoning is a method of finding other information using obfuscated knowledge and information. Inferencing in fuzzy theory is based on deriving one approximate fuzzy proposition from several fuzzy propositions. For this reason, it is called fuzzy reasoning or approximate reasoning. A fuzzy set is an extension of an existing set using the fuzzy logic concept, and each element has a degree to which it belongs. Here, the degree to which each element belongs to the set is called membership degree or degree of belonging (small speed). A fuzzy set is a set of elements that take a real value between unit intervals [0, 1] as membership, and is expressed as in Equation 1. In this case, 1 indicates that the element completely belongs to the set, and 0 indicates that the element does not belong at all.

여기서 A를 X의 퍼지부분집합 또는 퍼지집합이라 하고, μA는 전체집합 X에 대한 멤버십 함수(membership function)이며, 퍼지집합 A의 μA(x) 값은 x∈X에 있어서 멤버십 값(membership value) 또는 등급(grade)으로 원소 x가 퍼지집합 A에 속하는 정도이다. 퍼지 추론은 사실 A와 규칙 A→B(A이면 B이다)일 때, 결론 B가 사일임을 얻는 분리규칙(modus ponens)에 기초를 두며, 퍼지 조건문으로 멤버십을 이용한 하위 규칙은 수학식 2와 같다.Where A is called the fuzzy subset or fuzzy set of X, μA is the membership function for the whole set X, and the μA (x) value of fuzzy set A is the membership value for x∈X. Or grade, in which element x belongs to fuzzy set A. Fuzzy inference is in fact based on the modus ponens that concludes that B is silent when A and rule A → B (where A is B), and the subrule using membership as a fuzzy conditional statement is .

퍼지 추론의 합성규칙은 퍼지규칙 A→B는 퍼지관계로 수학식 3의 결론 B´는 퍼지집합 A´와 퍼지관계 A→B와의 합성(ο)에 의해 얻는다.The synthesis rule of fuzzy inference is the fuzzy rule A → B is a fuzzy relationship, and the conclusion B ′ of Equation 3 is obtained by combining fuzzy set A ′ and fuzzy relationship A → B (ο).

퍼지 추론을 위한 추론규칙은 IF-THEN 형식으로 기술된다. 즉, 이 추론규칙은 조건부(IF)와 결론부(THEN)의 형식을 가지며, 조건부의 조건을 만족하는 입력 값이 들어올 때 결론부의 명제를 실현시킨다.Inference rules for fuzzy inference are described in IF-THEN format. In other words, this inference rule has the form of conditional part IF and THEN, and realizes the proposition of the conclusion part when an input value that satisfies the conditional condition is entered.

나이브 베이스 분류자는 나이브 베이스 알고리즘에 기반을 두고 있다. 나이브 베이스 알고리즘은 자료를 분류하기 위하여 수학식 4와 같이 베이스 정리를 이용하여 주어진 자료에 대한 분류표시의 사후확률 값(posterior probability)을 추정하여 이루어진다.The naive base classifier is based on the naive base algorithm. In order to classify data, the naive base algorithm estimates the posterior probability of a classification mark for a given data by using a base theorem as shown in Equation 4.

여기서 Pr(c_j)는 전체 자료집합에서 임의로 추출한 자료가 분류표시 c_j에 속할 사전확률 값, Pr(d_i│c_j)는 분류표시 c_j에 속하는 자료집합에서 임의로 추출된 자료가 d_i일 확률, Pr(d_i)는 전체 자료집합에서 임의로 추출한 자료가 d_i일 확률 값을 의미한다. 수학식 4의 사후확률 값을 추정할 수 있도록 θ_NB를 추정하고 나면, 자료 분류자

는 입력된 자료에 대한 모든 분류표시에 대한 사후확률 값을 계산한 후에 가장 큰 사후확률 값을 가지는 분류표시를 반환한다. 이를 다음의 수학식 5와 같이 나타낼 수 있다.Where Pr (c _j) is a priori probability value is randomly extracted data in the entire data set belongs to a category shown _{_{c j, Pr (d i │c}} j) is the data randomly extracted from the data sets pertaining to the classification shown c _j d _i The probability, Pr (d _i ), is the probability value that d _i is randomly extracted from the entire data set. After estimating θ _NB to estimate the posterior probability value of Equation 4, the data classifier

Computes the posterior probability values for all classification markers for the data entered and returns the classification marker with the largest posterior probability value. This may be expressed as in Equation 5 below.

도 1은 본 발명의 일 실시예에 따른 나이브 베이스 분류자와 퍼지 추론을 이용한 적조 발생 예측 설명을 위한 블록도이다.1 is a block diagram illustrating red tide occurrence prediction using a naive base classifier and fuzzy inference according to an embodiment of the present invention.

도시된 바와 같이, 본 발명의 일 실시예에 따른 적조 발생 예측 과정은 전처리 단계(100), 퍼지 추론 단계(200), 나이브 베이스 분류자 단계(300), 및 후처리 단계(400)를 포함하여 구성되며, 이 적조 발생 예측 과정은 입력 데이터를 처리하여 출력 가능한 컴퓨팅 장치에 의해 수행될 수 있다. 전처리 단계(100)에서는 과거 적조 발생시의 해양환경자료를 퍼지 추론에 적합한 학습 자료로 가공 및 정규화한다. 퍼지 추론 단계(200)에서는 학습 자료를 이용하여 퍼지 추론 규칙을 생성하고, 생성된 규칙을 이용하여 적조 발생시 적조생물 밀도를 예측한다. 나이브 베이스 분류자 단계(300)에서는 적조 발생을 예측한다. 그리고 후처리 단계(400)에서는 예측된 적조생물 밀도와 발생을 조합하여 적조 발생을 예측한다. 이하, 보다 구체적으로 설명한다.As shown, the red tide occurrence prediction process according to an embodiment of the present invention includes a pre-processing step 100, a fuzzy inference step 200, naive base classifier step 300, and a post-processing step 400 The red tide occurrence prediction process may be performed by a computing device capable of processing and outputting input data. In the preprocessing step 100, the marine environment data of past red tide occurrences are processed and normalized into learning data suitable for fuzzy inference. In the fuzzy inference step 200, a fuzzy inference rule is generated using learning data, and the red tide density is predicted when red tide occurs using the generated rule. The naive base classifier step 300 predicts the occurrence of red tide. In the post-treatment step 400, red tide occurrence is predicted by combining the predicted red tide density and occurrence. Hereinafter, it demonstrates more concretely.

전처리 단계(100)에서는 적조 발생 예측을 위해서 학습 자료를 가공한다. 일 실시예에 있어서, 학습 자료로는 2002년부터 2007년까지 6년간의 통영 지역에서 발생한 적조생물인 coclodinium p.의 발생자료 및 이 지역의 수온, 기온 및 강수량 정보를 이용하였다. 이 중에서 적조 발생 정보는 국립수산과학원의 적조정보시스템으로부터 수집하였고, 수온 정보는 해양수산연구정보 포털의 연안정지관측정보로부터 수집하였으며, 기온 및 강수량 정보는 기상청 관측 자료로부터 수집하였다. 표 1은 학습자료 중 적조가 적게 발생한 2005년도의 적조 생물밀도이며, 표 2는 표 1의 적조가 발생하기 전의 10일간의 수온, 기온, 및 강수량 정보이며, 표 3은 적조 발생 건수, 적조 발생 전 수온 및 기온의 일수, 강수 횟수를 나타낸다.In the preprocessing step 100, the learning material is processed to predict the occurrence of red tide. In one embodiment, the learning data were generated from the data of coclodinium p., A red tide organism that occurred in the Tongyeong region for six years from 2002 to 2007, and the water temperature, temperature, and precipitation information of the region. Among them, red tide occurrence information was collected from the National Institute of Fisheries Science, red tide information system, water temperature information was collected from coastal stop observation information of the Ocean Fisheries Research Information Portal, and temperature and precipitation information were collected from the Meteorological Agency observation data. Table 1 shows the red tide biodensity in 2005 when less red tide occurred in the study data, and Table 2 shows the water temperature, temperature, and precipitation information for 10 days before the red tide in Table 1 occurs. It displays the total water temperature, the number of days in the temperature, and the number of precipitation.

전처리 단계(100)에서는 학습 자료를 위한 전처리로 수온 및 기온에 대해서는 수학식 6과 같이 적조 발생 전 10일간의 평균값을 계산하고, 강수량에 대해서는 수학식 7과 같이 적조 발생 전 10일간의 총 강수량을 계산한다. 이들 중에서 10일 평균수온, 10일 평균기온 및 10일 총 강수량은 입력 학습 자료이고, 표 1의 생물밀도는 출력 학습 자료이다. 그리고 표 4는 수온 10일 평균, 기온 10일 평균, 강수량 10일 총량, 생물밀도 등의 학습 자료 변수를 정의한 것이다. 분류자에 사용되는 출력함수는 최소값 0과 최대값 1을 출력하기 때문에, 최종적으로 입력 자료를 분류자에 적용하기 위해서는 0과 1 사이의 값으로 정규화시킨다.In the pre-treatment step 100, as the pre-processing for the learning data, the average value of the 10 days before the red tide occurs as shown in Equation 6 for the water temperature and the temperature, and the total precipitation for the 10 days before the red tide occurs as shown in Equation 7 for the precipitation. Calculate Among them, 10-day average water temperature, 10-day average temperature, and 10-day total precipitation are input learning data, and biodensity in Table 1 is output learning data. Table 4 defines the learning data variables, such as 10 days average water temperature, 10 days average temperature, 10 days total amount of precipitation, and biological density. Since the output function used for the classifier outputs the minimum value 0 and the maximum value 1, it is normalized to a value between 0 and 1 to finally apply the input data to the classifier.

퍼지 추론 단계(200)는 군집 단계(210)와 퍼지 멤버십 계산 단계(220) 및 퍼지 규칙 생성 단계(230)를 포함한다. 퍼지 추론을 이용하여 적조 발생을 예측하기 위해서는 입력 자료가 적조 발생인지 아닌지를 판단해야 할 규칙이 필요하다. 이를 위해, 학습 자료를 유사한 특징을 갖는 집단으로 군집하고, 군집된 자료의 퍼지 멤버십을 계산하여 규칙을 생성한다. 우선, 군집 단계(210)에서는 학습 자료를 각 특성에 맞는 집단으로 구분한다. 일 실시예에 있어서, 군집 단계(210)에서는 퍼지 추론에서 많이 사용하는 감산 군집(subtractive clustering) 방법을 이용하여 학습 자료를 각 특성에 맞는 집단으로 구분한다. 감산 군집 방법은 자료집합으로부터 유사한 특징을 갖는 집단을 추정하여 군집하는 방식이다. 일 실시예에 있어서, 군집 추정을 위해서 Chiu의 방법을 이용하며, 수학식 8을 이용하여 군집의 잠재적인 중심을 추정하여 군집에 이용한다.The fuzzy inference step 200 includes a clustering step 210, a fuzzy membership calculation step 220, and a fuzzy rule generation step 230. In order to predict the occurrence of red tide using fuzzy inference, a rule is required to determine whether the input data is the occurrence of red tide. To do this, the learning materials are clustered into groups with similar characteristics, and the rules are generated by calculating the fuzzy membership of the clustered materials. First, in the clustering step 210, the learning material is divided into groups suitable for each characteristic. In one embodiment, in the clustering step 210, the learning data is divided into groups suitable for each characteristic by using a subtractive clustering method that is frequently used in fuzzy inference. The subtractive clustering method is a method of estimating a group having similar characteristics from the data set. In one embodiment, Chiu's method is used to estimate the cluster, and Equation 8 is used to estimate the potential center of the cluster and use the cluster.

여기서 P_i는 i번째 잠재 군집의 중심이고, r은 주변 반지름으로 정의된 양의 계수이다. 그리고 x_i는 i번째 군집에 속하는 잠재적 자료이며, x_j는 j번째 학습 자료이다.Where P _i is the center of the i th latent cluster and r is a positive coefficient defined by the surrounding radius. And x _i is potential data belonging to the i th cluster, and x _j is the j th training material.

일 실시예에 있어서, 퍼지 멤버십 계산 단계(220)에서는 퍼지 멤버십 함수로 가우시안 멤버십 함수(gaussian membership function)를 사용한다. 기본적인 퍼지 멤버십 함수는 삼각형(triangular), 가우시안(gaussian), 사다리꼴형(trapezoid)이 있다. 이들 중에서 세밀한 멤버십을 표현하기 위해서 가우시안 멤버십 함수를 이용함이 바람직하다. 가우시안 멤버십 함수는 수학식 9와 같이 계산할 수 있다.In one embodiment, the fuzzy membership calculation step 220 uses a Gaussian membership function as the fuzzy membership function. Basic fuzzy membership functions are triangular, gaussian, and trapezoid. Among them, it is desirable to use Gaussian membership function to express detailed membership. The Gaussian membership function may be calculated as shown in Equation 9.

여기서 x는 학습 자료이고, σ는 가우시안 곡선의 폭을 결정하는 상수이며, c는 가우시안 곡선의 중심을 결정하는 상수이다. 일 실시예에 있어서, σ=2, c=5를 사용해서 가우시안 곡선의 폭은 2로, 곡선의 중심은 중앙에 위치하도록 초기 값을 설정하였다.Where x is the learning material, σ is a constant that determines the width of the Gaussian curve, and c is a constant that determines the center of the Gaussian curve. In one embodiment, the initial values were set so that the width of the Gaussian curve was 2 and the center of the curve was centered using σ = 2, c = 5.

일 실시예에 있어서, 퍼지 규칙 생성 단계(230)에서는 퍼지 규칙 생성을 위해 퍼지 추론에서 많이 사용하는 sugeno 퍼지 모델을 이용한다. sugeno 퍼지 모델의 출력 규칙의 최종 출력 값인 적조율은 수학식 10과 같이 계산한다.In one embodiment, the fuzzy rule generation step 230 uses a sugeno fuzzy model that is frequently used in fuzzy inference for fuzzy rule generation. The red tide rate, which is the final output value of the output rule of the sugeno fuzzy model, is calculated as in Equation 10.

여기서 w는 퍼지 규칙에 대한 가중치로 수학식 11과 같으며, z는 적조율 멤버십에 대한 군집, N은 규칙의 수이다.Where w is a weight for a fuzzy rule, as shown in Equation 11, z is a cluster for red tide membership, and N is the number of rules.

여기서 wi는 i번째 규칙의 가중치이고, f()는 수학식 9의 가우시안 멤버십 함수이다. 퍼지 규칙을 생성한 다음에 적조 예측 전 10일간의 적조 발생 지역의 환경정보를 입력하여 적조 발생 및 발생 적조생물 밀도를 예측한다.Where wi is a weight of the i th rule and f () is a Gaussian membership function of Equation (9). After generating a fuzzy rule, the environment information of the red tide occurrence area is input for 10 days before the red tide prediction to predict the red tide occurrence and the density of the red tide.

나이브 베이스 분류자 단계(300)에서는 나이브 베이스 분류자를 이용하여 적조 발생을 예측한다. 나이브 베이스 분류자의 학습을 위하여 사전 확률 값 Pr(c_j)와 Pr(di|cj)의 계산은 주어진 적조 학습 자료에 대하여 모형의 매개 변수의 추정은 MAP(maximum a posteriori hypothesis) 가설을 이용한다. Pr(c_j)의 계산은 수학식 12와 같이 학습자료 TD중에 c_j에 속하는 자료의 수로써 쉽게 계산할 수 있다.The naive base classifier step 300 predicts the occurrence of red tide using the naive base classifier. The calculation of prior probability values Pr (c _j ) and Pr (di | cj) for the learning of naive base classifiers uses the MAP (maximum a posteriori hypothesis) hypothesis. The calculation of Pr (c _j ) can be easily calculated as the number of data belonging to c _j in the learning data TD as shown in Equation 12.

후처리 단계(400)에서는 나이브 베이스 분류자 단계(300)에서 얻어진 예측 결과를 기준으로 이용하여 퍼지 추론 단계(200)에서 예측된 적조 발생 생물밀도와 예측 결과를 조정한다. 예를 들어, 나이브 베이스 분류자 단계(300)에서 적조 발생이나 퍼지 추론이 적조가 발생하지 않는 결과를 예측하면, 퍼지 추론 단계(200)의 결과를 나이브 베이스 분류자 단계(300)의 결과에 맞추어 적조 발생으로 맞추며 생물밀도를 1로 설정한다. 이와 반대의 경우는 생물밀도를 0으로 설정한다.In the post-processing step 400, the red tide occurrence biodensity and the prediction result predicted in the fuzzy inference step 200 are adjusted using the prediction result obtained in the naive base classifier step 300 as a reference. For example, when the red tide generation or the fuzzy inference predicts the result that the red tide does not occur in the naive base classifier step 300, the result of the fuzzy inference step 200 is matched with the result of the naive base classifier step 300. Set to red tide and set biodensity to 1. In the opposite case, the bio density is set to zero.

이상에서 설명한 나이브 베이스 분류자와 퍼지 추론을 이용한 적조 발생 예측 방법에 대해 국립수산과학원의 적조정보시스템으로부터 가져온 통영지역의 2008년부터 2010년 동안 발생한 coclodinium p. 적조경보 및 주의보 38건의 자료를 이용하여 그 성능을 평가하였다. 또한 같은 연도의 해양수산연구정보 포털의 연안정지관측정보로부터 121일간의 수온정보를 이용하였으며, 기상청 관측 자료로부터 수집한 121일간의 기온정보와 29건의 강수정보도 이용하였다. 다음의 표 5는 실험에 사용한 입력 자료의 속성표이다.For the prediction method of red tide occurrence using naive base classifier and fuzzy inference described above, coclodinium p. The performance of this study was evaluated using 38 data of red tide alarm and warning. In addition, 121 days of temperature information was used from coastal stop observation information of the Marine Fisheries Research Information Portal of the same year, and 121 days of temperature information and 29 precipitation information collected from meteorological office observation data were also used. Table 5 below is an attribute table of the input data used in the experiment.

표 6은 통영지역에서 3년간 발생한 적조 coclodinium p.에 대해, SVM(support vector machine), 역전파 신경망을 나타내는 BPNN(back propagation neural network), 회귀 신경망을 나타내는 GRNN(general regression neural network), 퍼지 추론(FR), 나이브 베이스 분류자(NB)를 이용한 적조발생 예측의 정확률 평가결과를 나타낸 것이다. NB 평균 정확률이 BPNN에 비해서 16.3%, GRNN에 비해서 13.7%, SVM에 비해서 5.8%, FR에 비해서 3.3% 더 높음이 확인된다.Table 6 shows the support vector machine (SVM), the back propagation neural network (BPNN) representing the back propagation neural network, the general regression neural network (GRNN) representing the regression neural network, and fuzzy inference for red tide coclodinium p. (FR) and Naive Base Classifier (NB) show the results of evaluating the accuracy of red tide occurrence prediction. The NB mean accuracy is 16.3% higher than BPNN, 13.7% higher than GRNN, 5.8% higher than SVM and 3.3% higher than FR.

표 7과 표 8은 후처리 전후의 관계로 실제 적조 발생 생물 밀도와 예측 결과 값의 일부분을 나타낸 것이다. 표 7과 표 8에서 FR은 퍼지 추론을 사용하여 예측된 적조 발생 생물 밀도로 0보다 작으면 적조가 발생하지 않은 것을 나타내고, NB는 나이브 베이스 분류자를 나타낸 것으로 0이면 적조가 발생하지 않은 것을, 1이면 적조가 발생한 것을 나타낸다. 표 7에서 6 ~ 10번은 퍼지 추론이 잘못 예측한 값이고, 4번은 나이브 베이스 분류자가 잘못 예측한 값이다. 표 7을 후처리하여 표 8과 같이 보정한다. 즉, 표 7의 4번은 0으로, 6 ~ 10번은 1로 보정하면 표 8과 같은 결과가 나온다. 표 8을 통해서 적조 발생 예측의 정확율이 90%인 발생 생물 밀도를 예측할 수 있다.Tables 7 and 8 show some of the actual red tide generated biodensities and predicted results in relation to pre and post treatment. In Table 7 and Table 8, FR indicates red tide generation biomass density predicted using fuzzy inference that is less than 0, indicating that no red tide occurs, and NB indicates naive base classifier, and zero indicates no red tide. If it indicates red tide has occurred. In Table 7, numbers 6 to 10 are incorrectly predicted by fuzzy inference, and number 4 is incorrectly predicted by naive base classifier. Post-process Table 7 and correct it as Table 8. That is, if the number 4 in Table 7 is 0 and the numbers 6 to 10 are 1, the result is shown in Table 8. From Table 8, we can predict the developmental density of the organism with a 90% accuracy rate for red tide outbreak predictions.

이제까지 본 발명에 대하여 그 바람직한 실시예들을 중심으로 살펴보았다. 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자는 본 발명이 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 변형된 형태로 구현될 수 있음을 이해할 수 있을 것이다. 그러므로 개시된 실시예들은 한정적인 관점이 아니라 설명적인 관점에서 고려되어야 한다. 본 발명의 범위는 전술한 설명이 아니라 특허청구범위에 나타나 있으며, 그와 동등한 범위 내에 있는 모든 차이점은 본 발명에 포함된 것으로 해석되어야 할 것이다.So far I looked at the center of the preferred embodiment for the present invention. Those skilled in the art will appreciate that the present invention can be implemented in a modified form without departing from the essential features of the present invention. Therefore, the disclosed embodiments should be considered in an illustrative rather than a restrictive sense. The scope of the present invention is shown in the claims rather than the foregoing description, and all differences within the scope will be construed as being included in the present invention.

100 : 전처리 단계 200 : 퍼지 추론 단계
210 : 군집 단계 220 : 퍼지 멤버십 계산 단계
230 : 퍼지 규칙 생성 단계 300 : 나이브 베이스 분류자 단계
400 : 후처리 단계100: pretreatment step 200: fuzzy inference step
210: cluster stage 220: fuzzy membership calculation stage
230: fuzzy rule generation step 300: naive base classifier step
400: post-processing step

Claims

A preprocessing step of normalizing marine environment data including at least one of water temperature, temperature, and precipitation at the time of red tide occurrence with learning data suitable for fuzzy inference;
A fuzzy inference step of generating fuzzy inference rules using the learning data and predicting red tide density when red tide occurs with respect to input data using the generated rules;
A naive base classifier step of estimating a red tide occurrence by estimating a posterior probability value of a classification mark on the input material by using a base theorem on the learning material; And
A post-processing step of adjusting the red tide density predicted in the fuzzy inference step based on the prediction result of the naive base classifier step;
Red tide occurrence prediction method using a naive base classifier and fuzzy inference, characterized in that it comprises a.

The method of claim 1, wherein the fuzzy inference step is:
Clustering the learning material into similar feature groups;
Calculating a membership of the clustered data using a fuzzy membership function; And
Generating a fuzzy rule using the fuzzy model for the calculated membership, and applying the generated fuzzy rule to the input data to predict red tide density;
Red tide occurrence prediction method using a naive base classifier and fuzzy inference, characterized in that it comprises a.