CN110781240A - Visual mining and application method of red tide data - Google Patents

Visual mining and application method of red tide data Download PDF

Info

Publication number
CN110781240A
CN110781240A CN201911064759.5A CN201911064759A CN110781240A CN 110781240 A CN110781240 A CN 110781240A CN 201911064759 A CN201911064759 A CN 201911064759A CN 110781240 A CN110781240 A CN 110781240A
Authority
CN
China
Prior art keywords
red tide
data
information
mining
dynamic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201911064759.5A
Other languages
Chinese (zh)
Inventor
林锐
商少平
吴璟瑜
戴昊
贺志刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen University
Original Assignee
Xiamen University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen University filed Critical Xiamen University
Priority to CN201911064759.5A priority Critical patent/CN110781240A/en
Publication of CN110781240A publication Critical patent/CN110781240A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

一种赤潮数据可视化的挖掘与应用方法,涉及海洋环境科学技术领域。1)对原始赤潮事件的信息进行规范化处理;2)分析赤潮事件历史记录,检验并筛选出独立赤潮事件,完成数据预处理;3)基于步骤2)预处理的数据,设计并绘制多种内容的数据透视表、透视图,识别与分析赤潮信息的构成、联系、层次与分布,选择并优化各类图表类型;4)基于步骤2)预处理的数据,设计动态时间线及三维地图的多源可视化融合方式,验证赤潮信息挖掘的可行性和有效性;5)基于步骤3)和4),构建仪表盘模型,设计较为普适的赤潮动态分析图表,用于赤潮信息挖掘分析。可有效地筛选出赤潮分区、分时期的节点,广泛应用于赤潮信息挖掘。

Figure 201911064759

A mining and application method of red tide data visualization relates to the field of marine environmental science and technology. 1) Standardize the information of the original red tide events; 2) Analyze the historical records of red tide events, check and screen out independent red tide events, and complete the data preprocessing; 3) Based on the data preprocessed in step 2), design and draw a variety of content It can identify and analyze the composition, relationship, level and distribution of red tide information, select and optimize various types of charts; 4) Based on the data preprocessed in step 2), design dynamic timelines and 3D maps Source visualization fusion method to verify the feasibility and effectiveness of red tide information mining; 5) Based on steps 3) and 4), build a dashboard model, and design a more general red tide dynamic analysis chart for red tide information mining and analysis. It can effectively screen out the nodes of red tide zones and periods, and is widely used in red tide information mining.

Figure 201911064759

Description

一种赤潮数据可视化的挖掘与应用方法A method of mining and application of red tide data visualization

技术领域technical field

本发明涉及海洋环境科学技术领域,特别涉及将Matlab编程的数据前处理及基于Excel新增功能的多种可视化融合方式相结合,形成只用“一张图表”即可对数据结构分层及可视化展示的方式,进而深入挖掘数据信息的一种赤潮数据可视化的挖掘与应用方法。The invention relates to the field of marine environment science and technology, in particular to the combination of data preprocessing programmed by Matlab and multiple visualization fusion methods based on new Excel functions to form a layered and visualized data structure with only "one chart" A kind of mining and application method of red tide data visualization, which is a method of deep mining data information.

背景技术Background technique

赤潮数据的可视化多在预测预报后采用遥感或GIS等专业领域相关的软件进行展示(徐波等,浙江大学学报(理学版),1008-9497(2004)04-471-05,31(4):471-475;陈芸芝等,福州大学学报(自然科学版),1000-2243(2013)06-1002-072013,41(6):1002-1008;杨静等,海洋预报,1003-0239(2013)01-0059-062013,30(1):59-64),而对于预测预报前期利用其他领域分析统计的软件(特别是一般办公软件EXCEL)进行专业信息的挖掘较少。不同学科或领域的交叉可以迸发新的思路和观点。The visualization of red tide data is mostly displayed by software related to professional fields such as remote sensing or GIS after forecasting (Xu Bo et al., Journal of Zhejiang University (Science Edition), 1008-9497(2004) 04-471-05, 31(4) : 471-475; Chen Yunzhi et al., Journal of Fuzhou University (Natural Science Edition), 1000-2243 (2013) 06-1002-07 2013, 41(6): 1002-1008; ) 01-0059-062013, 30(1):59-64), while the professional information mining using other fields of analysis and statistics software (especially the general office software EXCEL) in the early stage of prediction and forecasting is less. The intersection of different disciplines or fields can spark new ideas and perspectives.

中国专利CN201810966994.0公开一种基于图模型构建的赤潮数据查询方法,所述方法包含:赤潮数据图模型的构建、赤潮数据查询语言的构建;所述赤潮数据图模型RTGraph包括三种数据:点数据、边数据、赤潮边缘数据;所述赤潮边缘数据是一种点数据,由点上属性标记;所述赤潮数据查询语言包括:创建语句、查询语句、更新语句、插入语句、删除语句。将赤潮数据按照特定阶段存储在图模型中,建立赤潮的边数据,可以表示赤潮数据之间的关联,不仅可以在图模型上进行普通的点和边的查询,同时可以进行各种模型查询,提高了查询的速度和精度,可以充分利用赤潮数据进行研究。研究人员可以预测到阶段转换发生的时间和地点,针对此采取相应的措施,减少经济和生态上的损失。Chinese patent CN201810966994.0 discloses a red tide data query method based on a graph model, the method includes: the construction of a red tide data graph model and the construction of a red tide data query language; the red tide data graph model RTGraph includes three kinds of data: point Data, edge data, and red tide edge data; the red tide edge data is a kind of point data, marked by attributes on the point; the red tide data query language includes: create statement, query statement, update statement, insert statement, delete statement. The red tide data is stored in the graph model according to a specific stage, and the edge data of the red tide is established, which can represent the relationship between the red tide data. The query speed and accuracy are improved, and red tide data can be fully utilized for research. Researchers can predict when and where the phase transition will occur, and take corresponding measures to reduce economic and ecological losses.

如何紧扣“智慧海洋”工程中认知海洋、服务大众的理念,通过多样化、静态动态图表组合化等多源可视化表达,从不同的视角、途径深入挖掘赤潮信息,探索可以帮助用户分析找出数据的内在规律和使用价值的可视化方式,实现获取有效、客观且具有价值的赤潮信息,从而帮助科研及相关部门在创新和应用上提供技术支撑是现阶段的一个技术难题。How to closely follow the concept of recognizing the ocean and serving the public in the "Smart Ocean" project, through the multi-source visual expression such as diversification and combination of static and dynamic charts, from different perspectives and ways to dig deep red tide information, and exploration can help users analyze and find It is a technical problem at this stage to find out the inherent laws of data and the visualization method of use value, and to obtain effective, objective and valuable red tide information, so as to help scientific research and related departments to provide technical support in innovation and application.

发明内容SUMMARY OF THE INVENTION

本发明的目的在于提供一种赤潮数据可视化的挖掘与应用方法,结合非专业软件Excel与专业软件Matlab,设计可实现赤潮有效信息挖掘分析的较为普适的可视化表达融合方式,更有效地帮助用户分析、提取赤潮数据的内在规律,为科研、管理等相关部门提供技术支撑。The purpose of the present invention is to provide a method for mining and applying red tide data visualization, combining non-professional software Excel and professional software Matlab, to design a more general visualization expression fusion method that can realize effective information mining and analysis of red tide, and more effectively help users Analyze and extract the inherent laws of red tide data, and provide technical support for scientific research, management and other related departments.

本发明包括下列步骤:The present invention includes the following steps:

1)对原始赤潮事件的信息进行规范化处理;利用matlab编程实现文件的批量读取,然后识别数据的结构及各项目名称,根据规范的数据项目对数据进行批量修正,最后输出文件;1) Standardize the information of the original red tide event; use matlab programming to realize the batch reading of files, then identify the structure of the data and the names of each item, modify the data in batches according to the standardized data items, and finally output the file;

2)分析赤潮事件历史记录,检验并筛选出独立赤潮事件,完成数据预处理;2) Analyze the historical records of red tide events, check and filter out independent red tide events, and complete data preprocessing;

3)基于步骤2)预处理的数据,设计并绘制多种内容的数据透视表、透视图,识别与分析赤潮信息的构成、联系、层次与分布,选择并优化各类图表类型;3) Based on the data preprocessed in step 2), design and draw data pivot tables and perspective charts of various contents, identify and analyze the composition, connection, level and distribution of red tide information, and select and optimize various types of charts;

4)基于步骤2)预处理的数据,设计动态时间线及三维地图的多源可视化融合方式,验证赤潮信息挖掘的可行性和有效性;4) Based on the data preprocessed in step 2), a multi-source visualization fusion method of dynamic timeline and three-dimensional map is designed to verify the feasibility and effectiveness of red tide information mining;

5)基于步骤3)和步骤4),构建仪表盘模型,设计较为普适的赤潮动态分析图表,用于赤潮信息挖掘分析。5) Based on step 3) and step 4), build a dashboard model, and design a more general red tide dynamic analysis chart for red tide information mining and analysis.

在步骤1)中,所述对原始赤潮事件的信息进行规范化处理的具体方法可为:In step 1), the specific method for normalizing the information of the original red tide event may be:

首先利用Matlab编程批量读取数据,通过软件识别原始数据的结构和各项目名称,然后根据规范的数据项目,检验输入赤潮事件历史记录文件中各列标题名称,利用Matlab修正程序进行相应的补充和排序,批量修正后的数据通过编程批量输出;所述规范的数据项目即各列标题名称,设定为:序号、发生时间、消亡时间、持续时间、地点、经度、纬度、发生海域、具体海域、最大影响面积、生物优势种、最高密度、损失情况。First, use Matlab programming to read the data in batches, identify the structure of the original data and the names of each item through the software, and then check and input the title names of each column in the red tide event history file according to the standard data items, and use the Matlab correction program to make corresponding supplements and corrections. Sorting, batch-corrected data are output in batches through programming; the standardized data items are the title names of each column, which are set as: serial number, occurrence time, extinction time, duration, location, longitude, latitude, occurrence sea area, specific sea area , Maximum affected area, biological dominant species, highest density, loss situation.

在步骤2)中,所述数据预处理的具体方法可为:In step 2), the specific method of the data preprocessing can be:

检验并筛选出独立赤潮事件,原始数据由于统计方式不同,存在单一藻种赤潮和混合藻种赤潮两种赤潮事件,部分原始文件中,存在将一个混合藻种赤潮事件分成多个单一藻种赤潮事件,导致赤潮事件重复统计,为避免该情况发生,检验赤潮事件的起止时间及发生地点,将混合藻种的赤潮事件统一起来,删除其分支统计的重复信息。Check and screen out independent red tide events. Due to different statistical methods in the original data, there are two kinds of red tide events: single algal species red tide and mixed algal species red tide. In some original documents, there is a mixed algal species red tide event divided into multiple single algal species red tide events. In order to avoid the occurrence of red tide events, the starting and ending time and place of red tide events were checked, the red tide events of mixed algal species were unified, and the repeated information of branch statistics was deleted.

在步骤3)中,所述识别与分析赤潮信息的构成、联系、层次与分布的具体方法可为:In step 3), the specific method for identifying and analyzing the composition, connection, level and distribution of red tide information may be:

利用Excel的数据透视表增强功能,实现跨数据轻松构建复杂的模型,对数据按时间自动分组,用户根据需要查看内容,通过筛选按钮选择相应的海域、区域等进行数据统计,数据的时间分组有秒、分、小时、日、月、季度、年或这几种方式组合等;分析设计多种内容的数据透视表,将透视表作为动态数据源,选择并优化各类图表类型,生成透视图;插入透视图,可以跨时间分组和对数据中的其他层次结构进行放大和缩小,可以根据用户需要查看的内容,选择相应的海域、区域等进行图形显示。Using the enhanced function of Excel's pivot table, it is possible to easily build complex models across data, and automatically group data by time. Users can view the content according to their needs, and select the corresponding sea area, area, etc. for data statistics through the filter button. The time grouping of data includes Seconds, minutes, hours, days, months, quarters, years, or a combination of these; analyze and design pivot tables with multiple contents, use pivot tables as dynamic data sources, select and optimize various chart types, and generate pivot charts ;Insert a perspective view, which can group and zoom in and out of other hierarchical structures in the data across time, and can select the corresponding sea area, area, etc. for graphical display according to the content that the user needs to view.

在步骤4)中,所述设计动态时间线及三维地图的多源可视化融合方式,验证赤潮信息挖掘的可行性和有效性,具体步骤可为:In step 4), the multi-source visualization fusion mode of the design dynamic timeline and the three-dimensional map is described to verify the feasibility and effectiveness of red tide information mining, and the specific steps can be:

(1)运用Excel动态时间线筛选赤潮信息分时期的节点:基于Excel日程表的功能,根据用户需求,选择单独某一年份或连续多个年份的动态时间范围进行数据的统计与可视化展示。拖动动画控制工具条上的时间滑块,可以显示不同时期赤潮信息的分布情况。基于Excel动态图表的功能,选择相应的项目(如:时间节点或分区域)进行数据筛选后的可视化展示,有利于用户对数据单点项目进行拆分查看与分析;(1) Use the Excel dynamic time line to filter the nodes of red tide information in different periods: Based on the function of Excel schedule, according to user needs, select the dynamic time range of a single year or multiple consecutive years for data statistics and visual display. Drag the time slider on the animation control toolbar to display the distribution of red tide information in different periods. Based on the function of Excel dynamic charts, select the corresponding items (such as time nodes or sub-regions) for visual display after data filtering, which is beneficial for users to split, view and analyze single-point data items;

(2)运用Excel三维地图分析赤潮信息的空间位置关系及规律:对于包含位置信息的赤潮数据,通过数据空间坐标信息的获取、识别,快速将数据映射到虚拟地球上,将数据要表达的内容展示到二维平面图或三维地图等适合的地图中,对专题地图进行动态分析和三维展示,将具有空间信息的表单数据转换成专题地图,实现数据空间可视化产品,提高数据的可视性和分析效率。例如:三维地图展示赤潮藻种随时间变化的空间分布中,根据时间轴可以按时间先后顺序来演示历次赤潮藻种发生位置和频数的动态变化。(2) Use Excel 3D map to analyze the spatial relationship and law of red tide information: For red tide data containing location information, through the acquisition and identification of data spatial coordinate information, the data can be quickly mapped to the virtual earth, and the content to be expressed by the data can be quickly mapped. Display it on a suitable map such as a 2D floor plan or a 3D map, perform dynamic analysis and 3D display on thematic maps, convert form data with spatial information into thematic maps, realize data spatial visualization products, and improve data visibility and analysis. efficiency. For example, in a three-dimensional map showing the spatial distribution of red tide algae species over time, the dynamic changes in the location and frequency of previous red tide algae species can be demonstrated in chronological order according to the time axis.

(3)基于赤潮事件历史记录原始数据的实例,验证该融合方式挖掘赤潮信息的有效性和可行性。(3) Based on the example of the original data of red tide event historical records, the validity and feasibility of mining red tide information by this fusion method are verified.

在步骤5)中,构建仪表盘模型,设计较为普适的赤潮动态分析图表的具体方法是:针对不藻种、不同月份的赤潮发生情况设计动态图表,构建赤潮发生频率仪表盘模型,根据用户需求,通过将透视表筛选、迷你图展示、仪表盘模型等多种可视化设计融于一体,动态展示赤潮发生情况的分布特征,同时新增两种藻种赤潮爆发情况对比图设计,用户可以根据需求自主选择两种藻种进行对比分析,寻找不同藻种赤潮发生的规律。In step 5), a dashboard model is constructed, and the specific method for designing a more general red tide dynamic analysis chart is: design a dynamic chart for the occurrence of red tides in different algae species and different months, and build a red tide occurrence frequency dashboard model. Demand, by integrating various visual designs such as pivot table screening, mini-chart display, dashboard model, etc., to dynamically display the distribution characteristics of red tide occurrences, and at the same time, a comparison chart design for red tide outbreaks of two algae species has been added. It is necessary to independently select two algal species for comparative analysis to find the regularity of red tide occurrence of different algal species.

为更完整、少遗漏、多挖掘赤潮的价值信息,本发明利用并融合Excel动态时间线、动态图表等多源可视化功能,识别与分析赤潮信息的构成、联系、层次与分布,判别数据时间节点或空间分界等细微差异,构建仪表盘模型等,设计可实现赤潮有效信息挖掘分析的多种可视化表达融合方式,并通过实例检测该设计的有效性。In order to be more complete, less omission, and more valuable information of red tide, the present invention utilizes and integrates multi-source visualization functions such as Excel dynamic time line and dynamic chart, identifies and analyzes the composition, connection, level and distribution of red tide information, and discriminates data time nodes. Or space demarcation and other subtle differences, build a dashboard model, etc., design a variety of visual expression fusion methods that can realize effective information mining and analysis of red tide, and test the effectiveness of the design through examples.

本发明的有益效果是:The beneficial effects of the present invention are:

本发明首先利用Matlab批量实现数据预处理,然后基于Excel中几种可视化图表的设计,利用动态时间线有效地筛选出赤潮不同时期的时间变化特征的节点,再结合三维地图中赤潮爆发的空间分布的静态展示及其动态时间展示,有效地筛选出赤潮分区、分时期的节点,验证多种可视化表达融合设计对赤潮信息挖掘的可行性和有效性。之后,结合Excel中数据快速分析模块中的数据透视表、迷你图等功能,将透视图、迷你图、动态图表等多种可视化方式融合,构建仪表盘模型,设计了两种较为普适化的可视化表达方式—“一张图表”:不同藻种赤潮发生频率分布的仪表盘模型和A/B藻种对比的动态图表模型,可快速对数据结构进行分层筛选,有利于赤潮信息中细微特征的发现,为赤潮爆发的演变规律及机制探讨有重要的意义,为科研、管理等相关部门提供技术支撑。可广泛应用于赤潮信息挖掘。The invention first uses Matlab to realize data preprocessing in batches, and then, based on the design of several visual charts in Excel, uses dynamic time lines to effectively screen out nodes with time-varying characteristics of red tides in different periods, and then combines the spatial distribution of red tide outbreaks in a three-dimensional map The static display and dynamic time display of red tide can effectively screen out the nodes of red tide zones and periods, and verify the feasibility and effectiveness of the fusion design of multiple visual expressions for red tide information mining. After that, combined with the functions of pivot table and sparkline in the data quick analysis module in Excel, and integrated various visualization methods such as pivot chart, sparkline, dynamic chart, etc., to build a dashboard model, and designed two more generalized models. Visual expression - "one chart": a dashboard model for the frequency distribution of red tides of different algal species and a dynamic chart model for the comparison of A/B algal species, which can quickly screen the data structure hierarchically, which is conducive to the subtle features of red tide information The discovery is of great significance for the discussion of the evolution law and mechanism of red tide outbreaks, and provides technical support for relevant departments such as scientific research and management. It can be widely used in red tide information mining.

附图说明Description of drawings

图1为一种赤潮数据可视化的挖掘与应用方法的流程图;Fig. 1 is the flow chart of a kind of red tide data visualization mining and application method;

图2为基于2000-2016年赤潮数据透视图的层式结构放大;Figure 2 is an enlarged layered structure based on the 2000-2016 red tide data perspective;

图3为基于2000-2016年赤潮数据透视图的层式结构缩小;Fig. 3 is the layered structure reduction based on the 2000-2016 red tide data perspective;

图4为基于2000-2016年福建省赤潮信息的动态图表示意图;Figure 4 is a schematic diagram of a dynamic chart based on red tide information in Fujian Province from 2000 to 2016;

图5为Excel动态时间线展示福建省赤潮持续时间的时空分布;Figure 5 shows the temporal and spatial distribution of red tide duration in Fujian Province on the Excel dynamic timeline;

图6为三维地图展示赤潮爆发频数的空间分布;Figure 6 is a three-dimensional map showing the spatial distribution of the frequency of red tide outbreaks;

图7为三维地图展示赤潮藻种随时间变化的空间分布;Fig. 7 is a three-dimensional map showing the spatial distribution of red tide algal species over time;

图8为三维面积图展示各海区赤潮爆发最大面积分级分布;Figure 8 is a three-dimensional area map showing the grading distribution of the maximum area of red tide outbreaks in each sea area;

图9为基于多源可视化方式融合对福建近岸赤潮分区分时期爆发情况的信息挖掘;Figure 9 shows the information mining of the outbreak of red tides in the near-shore area of Fujian based on the fusion of multi-source visualization methods;

图10为基于赤潮发生频率仪表盘模型的动态图表;Figure 10 is a dynamic chart based on a dashboard model of red tide occurrence frequency;

图11为两种藻种赤潮爆发情况对比图。Figure 11 shows the comparison of red tide outbreaks for two algal species.

具体实施方式Detailed ways

以下实施例将结合附图对本发明作进一步的说明。The following embodiments will further illustrate the present invention in conjunction with the accompanying drawings.

参见图1,本发明实施例包括以下步骤:Referring to Fig. 1, the embodiment of the present invention includes the following steps:

1)对原始赤潮事件的信息进行规范化处理;1) Standardize the information of the original red tide event;

本实施例中选取2000-2016年赤潮事件历史记录数据,利用Matlab编程批量读取数据,通过软件识别原始数据的结构和分布,然后根据规范的数据项目,检验输入赤潮事件历史记录文件中各列标题名称,补充并顺序排列其列标题。规范的数据项目及排序可以包括如下:序号、发生时间、消亡时间、持续时间、地点、发生海域、具体海域、最大影响面积(平方公里)、生物优势种、最高密度、损失情况。In this embodiment, the historical record data of red tide events from 2000 to 2016 is selected, and Matlab programming is used to read the data in batches, and the structure and distribution of the original data are identified through software, and then each column in the red tide event historical record file is checked and input according to the standard data items. Header names, complement and order their column headers. The normative data items and sorting can include the following: serial number, occurrence time, extinction time, duration, location, occurrence sea area, specific sea area, maximum affected area (square kilometers), biological dominant species, highest density, and loss.

由于原始文件中的列标题“地点”一部分以地名标注,而另一部分以经纬度标注;列标题“海域”标注地名多为县级或更小地名,部分地区虽指向同一个地点,但地名却不统一;综上问题,为便于后续数据分析,对数据进行规范化处理,统一数据结构及名称,将列标题“海域”分成两列“发生海域”和“具体海域”。其中,“发生海域”标注市级以上的名称,而“具体海域”标注为具体海域名称;“地点”标注具体的经纬度。Since part of the column heading "Location" in the original file is marked with place names, while the other part is marked with latitude and longitude; the column heading "Sea Area" is marked with place names at the county level or smaller, and although some areas point to the same place, the place names are not Unification; In summary, in order to facilitate subsequent data analysis, the data is standardized, the data structure and names are unified, and the column heading "Sea Areas" is divided into two columns, "Occurrence Sea Areas" and "Specific Sea Areas". Among them, "occurrence sea area" is marked with the name above the city level, and "specific sea area" is marked with the name of the specific sea area; "location" is marked with the specific latitude and longitude.

为便于赤潮事件的发生地点的空间分布展示,设置“经度”、“纬度”两列。“发生海域”列标题名称统一后,根据市级地方结合“具体海域”的位置,确定该赤潮事件发生的中心位置,补充“经度”、“纬度”两列信息,如表1。To facilitate the display of the spatial distribution of the locations of red tide events, two columns of "Longitude" and "Latitude" are set. After the title of the column of "occurred sea area" is unified, the central location of the red tide event is determined according to the location of the city-level locality combined with the "specific sea area", and the two columns of "longitude" and "latitude" are supplemented, as shown in Table 1.

表1Table 1

Figure BDA0002258976540000051
Figure BDA0002258976540000051

利用Matlab修正程序进行相应的补充和排序等。已修正的数据通过编程批量输出。Use the Matlab correction program to supplement and sort accordingly. The corrected data is output in batches by programming.

2)分析赤潮事件历史记录,检验并筛选出独立赤潮事件,完成数据预处理。2) Analyze the historical records of red tide events, check and filter out independent red tide events, and complete data preprocessing.

检验并筛选出独立赤潮事件。原始数据由于统计方式不同,存在单一藻种赤潮和混合藻种赤潮两种赤潮事件。部分原始文件中,存在将一个混合藻种赤潮事件分成多个单一藻种赤潮事件,导致赤潮事件重复统计。为避免该情况发生,检验赤潮事件的起止时间及发生地点,将混合藻种的赤潮事件统一起来,删除其分支统计的重复信息;Test and filter out independent red tide events. Due to different statistical methods in the original data, there are two types of red tide events: single algal species red tide and mixed algal species red tide. In some original documents, a mixed algal species red tide event was divided into multiple single algal species red tide events, resulting in repeated statistics of red tide events. In order to avoid this situation, check the start and end time and place of red tide events, unify the red tide events of mixed algal species, and delete the duplicate information of their branch statistics;

3)基于步骤2)中预处理的数据,设计并绘制多种内容的数据透视表、透视图,识别与分析赤潮信息的构成、联系、层次与分布,选择并优化各类图表类型。3) Based on the data preprocessed in step 2), design and draw pivot tables and perspective charts with various contents, identify and analyze the composition, connection, level and distribution of red tide information, and select and optimize various types of charts.

首先,利用Excel 2016版的数据透视表增强功能,实现跨数据轻松构建复杂的模型,对数据按时间自动分组,用户根据需要查看内容,通过筛选按钮选择相应的海域、区域等进行数据统计。例如:单击相应年和季度即可看到年、季度、月等各种时间展示方式的分组。分析设计多种内容的数据透视表,将透视表作为动态数据源,选择并优化各类图表类型,生成透视图。例如:基于透视表,插入一个透视图,透视图右下角的“+、-”按钮,让你可以跨时间分组和对数据中的其他层次结构进行放大和缩小(图2和3)。First of all, using the enhanced function of the pivot table of Excel 2016, it is possible to easily build complex models across data, and automatically group the data by time. Users can view the content according to their needs, and select the corresponding sea area, area, etc. for data statistics through the filter button. For example: Click the corresponding year and quarter to see the grouping of various time display methods such as year, quarter, month, etc. Analyze and design pivot tables with various contents, use pivot tables as dynamic data sources, select and optimize various chart types, and generate pivot charts. For example: based on a pivot table, insert a pivot chart, the "+, -" buttons in the lower right corner of the pivot chart allow you to group across time and zoom in and out of other hierarchies in the data (Figures 2 and 3).

透视图右边两个筛选标签可以根据用户需要查看的内容,选择相应的海域、区域等进行图形显示。The two filter tabs on the right side of the perspective view can select the corresponding sea area, area, etc. for graphic display according to the content that the user needs to view.

4)基于步骤2)中预处理的数据,利用动态时间线及三维地图的多源可视化相结合的设计,验证赤潮信息挖掘的有效性和可行性。4) Based on the data preprocessed in step 2), the effectiveness and feasibility of red tide information mining is verified by the design of the combination of dynamic timeline and multi-source visualization of 3D map.

运用Excel动态时间线筛选赤潮信息分时期的节点。具体如下:基于Excel日程表的功能,根据用户需求,选择单独某一年份或连续多个年份的动态时间范围进行数据的统计与可视化展示(图4)。拖动屏幕下方动画控制工具条上的时间滑块,可以显示不同时期赤潮信息的分布情况。基于Excel动态图表的功能,选择相应的项目(如:时间节点或分区域)进行数据筛选后的可视化展示,有利于用户对数据单点项目进行拆分查看与分析。Use Excel dynamic time line to filter the nodes of red tide information in different periods. The details are as follows: Based on the function of Excel schedule, according to user needs, select a single year or a dynamic time range of multiple consecutive years for data statistics and visual display (Figure 4). Drag the time slider on the animation control toolbar at the bottom of the screen to display the distribution of red tide information in different periods. Based on the function of Excel dynamic charts, selecting the corresponding items (such as time nodes or sub-regions) for visual display after data filtering is helpful for users to split, view and analyze single-point data items.

运用Excel三维地图分析赤潮信息的空间位置关系及规律。具体如下:对于包含位置信息的赤潮数据,通过数据空间坐标信息的获取、识别,快速将数据映射到虚拟地球上,将数据要表达的内容展示到二维平面图或三维地图等适合的地图中,对专题地图进行动态分析和三维展示,将具有空间信息的表单数据转换成专题地图,实现数据空间可视化产品,提高数据的可视性和分析效率(图5~7)。例如:三维地图展示赤潮藻种随时间变化的空间分布中,根据时间轴可以按时间先后顺序来演示历次赤潮藻种发生位置和频数的动态变化。The spatial relationship and regularity of red tide information were analyzed by using Excel 3D map. The details are as follows: For red tide data containing location information, through the acquisition and identification of data space coordinate information, the data can be quickly mapped to the virtual earth, and the content to be expressed by the data can be displayed on a suitable map such as a two-dimensional plan or a three-dimensional map. Perform dynamic analysis and three-dimensional display of thematic maps, convert form data with spatial information into thematic maps, realize data spatial visualization products, and improve data visibility and analysis efficiency (Figures 5-7). For example, in a three-dimensional map showing the spatial distribution of red tide algae species over time, the dynamic changes in the location and frequency of previous red tide algae species can be demonstrated in chronological order according to the time axis.

本实施例中基于2000~2016年赤潮事件历史记录的赤潮数据,其动态时间线、三维地图的多源可视化相结合的设计,已成功挖掘出该时期赤潮分区分时期的特征。首先对赤潮分布的高发区及累计最大面积的三维地图和三维面积图设计,将爆发分布区分成A、B、C共3个区域。其中A区主要为北部,长江以北海域的福鼎、霞浦和连江;B区为中部,长江以南海域的福清、平潭、莆田和泉州;C区为南部,内湾海区,厦门和东山湾。同时,设计并绘制2000~2016年赤潮爆发次数的动态时间线和动态图表,结果显示大部分海区在2009年赤潮爆发次数显著下降,而到2010年升高明显,可能该时期为转折点。另外,A、B、C三个分区逐年分布看,B区在2010年明显上升为6次,面积也明显升高,所以分成2000~2009年和2010~2016年两个时期。基于上述多种可视化表达的信息融合,得出赤潮的爆发次数和累积最大面积呈现较明显的时间和空间差异(图8),2000~2009年和2010~2016年两个时期的赤潮爆发次数和累积最大面积在各分区均呈现相反的趋势,其中北部(A区)下降,中部(B区)上升,南部(C区)下降(图9)。In this example, based on the red tide data recorded in the history of red tide events from 2000 to 2016, the design of the combination of the dynamic time line and the multi-source visualization of the three-dimensional map has successfully excavated the characteristics of red tide zones in this period. Firstly, the three-dimensional map and three-dimensional area map of the high-incidence area of red tide distribution and the largest cumulative area are designed, and the eruption distribution area is divided into three areas: A, B, and C. Among them, Area A is mainly in the north, Fuding, Xiapu and Lianjiang in the waters north of the Yangtze River; Area B is in the middle, Fuqing, Pingtan, Putian and Quanzhou in the waters south of the Yangtze River; Area C is in the south, Neiwan Sea Area, Xiamen and Dongshan b. At the same time, the dynamic time line and dynamic chart of the number of red tide outbreaks from 2000 to 2016 were designed and drawn. The results showed that the number of red tide outbreaks in most sea areas decreased significantly in 2009, but increased significantly in 2010, which may be a turning point. In addition, looking at the distribution of the three divisions A, B, and C year by year, the B district increased significantly to 6 times in 2010, and the area also increased significantly, so it is divided into two periods: 2000-2009 and 2010-2016. Based on the information fusion of the above-mentioned various visual expressions, it is concluded that the number of red tide outbreaks and the maximum accumulated area show obvious temporal and spatial differences (Figure 8). The cumulative maximum area showed an opposite trend in each subregion, with the northern (Area) decreasing, the central (B) increasing, and the southern (C) decreasing (Figure 9).

5)基于步骤3)和步骤4),构建仪表盘模型,设计较为普适的赤潮动态分析图表,用于赤潮信息挖掘分析。5) Based on step 3) and step 4), build a dashboard model, and design a more general red tide dynamic analysis chart for red tide information mining and analysis.

针对不藻种,不同月份的赤潮发生情况设计了动态图表,构建赤潮发生频率仪表盘模型。根据用户需求,通过将透视表筛选、迷你图展示、仪表盘模型等多种设计融于一体,动态展示赤潮发生情况的分布特征(图10)。同时设计新增了两种藻种赤潮爆发情况对比图(图11),用户可以根据需求自主选择两种藻种进行对比分析,寻找不同藻种赤潮发生的规律。A dynamic chart was designed for the occurrence of red tides in different months, and a dashboard model of red tide occurrence frequency was constructed. According to user needs, through the integration of various designs such as pivot table screening, sparkline display, and dashboard model, the distribution characteristics of red tide occurrences are dynamically displayed (Figure 10). At the same time, a comparison chart of red tide outbreaks of two algae species has been added in the design (Figure 11). Users can choose two algae species for comparison and analysis according to their needs, and find the law of red tide occurrence of different algae species.

本发明基于Excel功能模块,将赤潮数据规范化处理后,结合数据透视表及透视图中跨时间分组和数据结构分层的方法,跨数据轻松构建复杂的模型,识别与分析赤潮信息的构成、联系、层次与分布,判别数据时间节点或空间分界等细微信息的差异;通过融合动态时间线、三维地图等多种可视化方式,对数据信息层层剥离,通过实例评估该融合方式的有效性;最后,将透视图、迷你图、动态图表等多种可视化方式融合,构建仪表盘模型,设计较为普适的赤潮动态分析的可视化方式——“一张图表”,用于赤潮信息挖掘。实例证明本发明的方法可有效地筛选出赤潮分区、分时期的节点,更有效地帮助用户分析、提取赤潮数据的内在规律,为科研、管理等相关部门提供技术支撑。Based on the Excel function module, the present invention normalizes the red tide data, combines the method of cross-time grouping and data structure layering in the data pivot table and the pivot chart, and easily constructs a complex model across data, and identifies and analyzes the composition and connection of red tide information. , level and distribution, to discriminate the differences in subtle information such as data time nodes or spatial boundaries; through the fusion of dynamic timelines, three-dimensional maps and other visualization methods, the data information is peeled off layer by layer, and the effectiveness of the fusion method is evaluated through examples; finally , which integrates various visualization methods such as perspective charts, sparklines, and dynamic charts to build a dashboard model, and design a more common visualization method for red tide dynamic analysis—“one chart” for red tide information mining. Examples prove that the method of the present invention can effectively screen out the nodes of red tide zones and periods, more effectively help users to analyze and extract the inherent laws of red tide data, and provide technical support for relevant departments such as scientific research and management.

Claims (7)

1. A visual mining and application method of red tide data is characterized by comprising the following steps:
1) carrying out normalized processing on the information of the original red tide event; reading files in batches by utilizing matlab programming, identifying the structure and the name of each project of data, correcting the data in batches according to the specified data project, and finally outputting the files;
2) analyzing the history of the red tide events, checking and screening out independent red tide events, and finishing data preprocessing;
3) designing and drawing a data pivot table and a perspective view of various contents based on the data preprocessed in the step 2), identifying and analyzing the composition, relation, hierarchy and distribution of red tide information, and selecting and optimizing various chart types;
4) based on the data preprocessed in the step 2), designing a multi-source visual fusion mode of a dynamic time line and a three-dimensional map, and verifying the feasibility and effectiveness of the red tide information mining;
5) based on the step 3) and the step 4), an instrument panel model is constructed, and a relatively universal red tide dynamic analysis chart is designed and used for red tide information mining analysis.
2. The method for mining and applying the visualization of red tide data as claimed in claim 1, wherein in step 1), the specific method for performing the normalization processing on the information of the original red tide event comprises:
firstly, reading data in batches by utilizing Matlab programming, identifying the structure and the item names of original data through software, then checking the title names of all columns in an input red tide event history record file according to a standard data item, correspondingly supplementing and sequencing by utilizing a Matlab modification program, and outputting the batch modified data in batches by programming.
3. The visual mining and application method of red tide data as claimed in claim 2, wherein the canonical data items, i.e. the column header names, are set as: sequence number, occurrence time, death time, duration, location, longitude, latitude, sea area of occurrence, specific sea area, maximum area of influence, species of biological dominance, highest density, loss condition.
4. The method for mining and applying the visualization of red tide data as claimed in claim 1, wherein in step 2), the specific method for preprocessing the data is: the method comprises the steps of detecting and screening independent red tide events, wherein two red tide events of single algae species red tide and mixed algae species red tide exist in original data due to different statistical modes, in part of original files, one mixed algae species red tide event is divided into a plurality of single algae species red tide events, repeated statistics of the red tide events is caused, in order to avoid the situation, the starting and ending time and the occurrence place of the red tide events are detected, the red tide events of the mixed algae species are unified, and repeated information of branch statistics is deleted.
5. The method as claimed in claim 1, wherein in step 3), the specific method for identifying and analyzing the composition, relationship, hierarchy and distribution of red tide information is as follows: the data perspective table enhancement function of Excel is utilized to realize that a complex model is easily built across data, the data are automatically grouped according to time, a user checks the content according to the requirement, selects a corresponding sea area and area through a screening button to carry out data statistics, and the time grouping of the data comprises the modes of second, minute, hour, day, month, quarter, year or the combination of the modes; analyzing and designing a data pivot table with various contents, taking the pivot table as a dynamic data source, selecting and optimizing various chart types, and generating a perspective view; inserting perspective, grouping and zooming in and out other hierarchies in the data in a time-span mode, and selecting corresponding sea areas and regions for graphical display according to contents which a user needs to view.
6. The visual mining and application method of red tide data according to claim 1, wherein in step 4), the feasibility and the effectiveness of the red tide information mining are verified by designing a multi-source visual fusion mode of a dynamic timeline and a three-dimensional map, and the specific steps are as follows:
(1) screening the nodes of the red tide information in different periods by using an Excel dynamic time line: based on the function of an Excel calendar, selecting a dynamic time range of a single year or a plurality of continuous years for data statistics and visual display according to the user requirements; dragging the time slider on the animation control toolbar can display the distribution condition of the red tide information in different periods; based on the function of the Excel dynamic chart, selecting a corresponding project for visual display after data screening, and facilitating a user to split, check and analyze a data single-point project; the corresponding project is a time node or a partition area;
(2) analyzing the spatial position relation and the rule of the red tide information by using an Excel three-dimensional map: for red tide data containing position information, the data are quickly mapped onto a virtual earth through acquisition and identification of data space coordinate information, contents to be expressed by the data are displayed in a two-dimensional plane map or a three-dimensional map, the thematic map is dynamically analyzed and three-dimensionally displayed, form data with space information are converted into the thematic map, a data space visualization product is realized, and the visibility and the analysis efficiency of the data are improved;
(3) and verifying the effectiveness and feasibility of the mining of the red tide information by the fusion mode based on the red tide event historical record original data.
7. The visual mining and application method of red tide data as claimed in claim 1, wherein in step 5), an instrument panel model is constructed, and the specific method for designing a relatively universal red tide dynamic analysis chart is as follows: the method comprises the steps of designing a dynamic chart aiming at the red tide occurrence conditions of different alga species and different months, constructing a red tide occurrence frequency instrument panel model, integrating perspective table screening, mini-graph display and multiple visual designs of the instrument panel model according to user requirements, dynamically displaying the distribution characteristics of the red tide occurrence conditions, simultaneously newly adding two alga species red tide outbreak condition comparison chart designs, and enabling a user to independently select two alga species according to requirements to perform comparison analysis and search the rules of the red tide occurrence of different alga species.
CN201911064759.5A 2019-11-04 2019-11-04 Visual mining and application method of red tide data Pending CN110781240A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911064759.5A CN110781240A (en) 2019-11-04 2019-11-04 Visual mining and application method of red tide data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911064759.5A CN110781240A (en) 2019-11-04 2019-11-04 Visual mining and application method of red tide data

Publications (1)

Publication Number Publication Date
CN110781240A true CN110781240A (en) 2020-02-11

Family

ID=69388746

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911064759.5A Pending CN110781240A (en) 2019-11-04 2019-11-04 Visual mining and application method of red tide data

Country Status (1)

Country Link
CN (1) CN110781240A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111639149A (en) * 2020-05-29 2020-09-08 山东浪潮通软信息科技有限公司 Ocean data visualization method and device
CN113312375A (en) * 2021-05-18 2021-08-27 南京中科水治理股份有限公司 Visual mining and implementing method for water ecological restoration engineering data
CN117131249B (en) * 2023-10-26 2024-01-12 湖南省不动产登记中心 Intelligent management method and system for natural resource data

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729699A (en) * 2017-11-22 2018-02-23 天津北方天力增压技术有限公司 A kind of booster turbine impeller design optimization method based on MATLAB

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107729699A (en) * 2017-11-22 2018-02-23 天津北方天力增压技术有限公司 A kind of booster turbine impeller design optimization method based on MATLAB

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
曹敏杰: "浙江近岸海域海洋生态环境时空分析及预测关鍵抆术研究", 《中国优秀博硕士学位论文全文数据库(博士) 工程科技Ⅰ辑》 *
李金花: "利用数据透视表进行多角度动态分析", 《中国管理信息化》 *
黄海波: "Excel2016三维地图的地理数据可视化与教学应用", 《中学地理教学参考》 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111639149A (en) * 2020-05-29 2020-09-08 山东浪潮通软信息科技有限公司 Ocean data visualization method and device
CN111639149B (en) * 2020-05-29 2023-07-25 浪潮通用软件有限公司 Ocean data visualization method and device
CN113312375A (en) * 2021-05-18 2021-08-27 南京中科水治理股份有限公司 Visual mining and implementing method for water ecological restoration engineering data
CN117131249B (en) * 2023-10-26 2024-01-12 湖南省不动产登记中心 Intelligent management method and system for natural resource data

Similar Documents

Publication Publication Date Title
Cobo et al. SciMAT: A new science mapping analysis software tool
US7548925B2 (en) Diagrammatic access and arrangement of data
Keim Information visualization: Scope, techniques and opportunities for geovisualization
Mansmann et al. Exploring OLAP aggregates with hierarchical visualization techniques
CN103984755A (en) Multidimensional model based oil and gas resource data key system implementation method and system
CN110781240A (en) Visual mining and application method of red tide data
Boulil et al. Multidimensional modeling and analysis of large and complex watercourse data: an OLAP-based solution
Anselin An Introduction to Spatial Data Science with GeoDa: Volume 1: Exploring Spatial Data
US9009584B2 (en) Peeking into the Z-dimensional drawer
Therón Hierarchical-temporal data visualization using a tree-ring metaphor
Milligan et al. Learning Tableau 2022: Create effective data visualizations, build interactive visual analytics, and improve your data storytelling capabilities
Visvalingam Trends and concerns in digital cartography
Nicolas Tetrapod biodiversity through the Permo-Triassic Beaufort Group (Karoo Supergroup) of South Africa
Viswanathan et al. User-centric spatial data warehousing: a survey of requirements and approaches
Wang et al. Set-stat-map: Extending parallel sets for visualizing mixed data
Renolen Concepts and methods for modelling temporal and spatiotemporal information
Militello Surface-Water Modeling system tidal constituents toolbox for ADCIRC
Hughes et al. An integrated software package for the analysis and display of hydrological or water resources time series data
Lesnichaya et al. Algorithm of an Automated Analytical Cognitive Modeling Platform for Semi-Structured Management Problems
Hirakawa et al. Situation-dependent browser to explore the information space
Mulyukova et al. Big data analysis for studying spatiotemporal trends in the sustainable development of large cities
Gencheva From data collection to analysis: Designing a relational database for archaeological research in the eastern Rhodope region
Mizushima The Rural-urban Nexus in India's Economic Transformation
CN118606433A (en) Business encyclopedia query method and tool for offshore oil and gas production objects
Schloen et al. Digital Archaeology Case Study: Tell Keisan, Israel

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200211

RJ01 Rejection of invention patent application after publication