CN108319715B

CN108319715B - Parallel coordinate improvement method for multi-dimensional integer data set

Info

Publication number: CN108319715B
Application number: CN201810131947.4A
Authority: CN
Inventors: 陈红倩; 程中娟
Original assignee: Beijing Technology and Business University
Current assignee: Dragon Totem Technology Hefei Co ltd
Priority date: 2018-02-09
Filing date: 2018-02-09
Publication date: 2020-05-22
Anticipated expiration: 2038-02-09
Also published as: CN108319715A

Abstract

The invention relates to a parallel coordinate improvement method for a multi-dimensional integer data set, and belongs to the technical field of computer graphics and visualization. The method comprises the following implementation steps: counting the number of types of data values of each integer data dimension in the data set; counting the occupation ratios of different data values in a data set, establishing a segmentation coordinate axis according to the occupation ratios, and displaying the occupation ratios of the data values through the height ratios of the segmentation of the coordinate axis; aiming at each record in the data set, continuously updating an offset value in the visual mapping process by an offset mapping method, so that the integral values of different records are mapped to different heights on a coordinate axis, and the problem of same-point mapping is solved; the invention can intuitively obtain the recording number ratio condition of each data value in each data dimension, can quickly analyze the association relation and the association strength among the data of each dimension, and improves the visual analysis capability aiming at the multi-dimensional integer data set.

Description

Parallel coordinate improvement method for multi-dimensional integer data set

Technical Field

The invention relates to a parallel coordinate improvement method for a multi-dimensional integer data set, and belongs to the technical field of computer graphics and visualization.

Technical Field

"integer data" is a common content in a data set, and is category data in the form of gender (male, female), region (province, city), pesticide toxicity (highly toxic, low toxic), etc., which is converted into discrete integer data instead of continuous data when being numerically converted for data analysis.

The parallel coordinate method is a commonly used visualization method for analyzing multidimensional data, but when the method faces multidimensional integer data, a lot of troubles are caused, for example, in the parallel coordinate visualization method, the same numerical value in the integer data is mapped to the same position on a coordinate axis in the visualization mapping process (in the invention, the same-point mapping is called), so that the visualization result cannot reflect the information such as the distribution and quantity information of the data, the correlation strength among the category data and the like, and the effectiveness of cross correlation analysis is greatly reduced.

The invention provides an improved scheme of a parallel coordinate visualization method for a multi-dimensional integer data set, introduces the concept of 'segmented coordinate axes' into the coordinate axes of a parallel coordinate system, and enables different records in integer data to be different in mapping positions on the coordinate axes even if the data values are the same by providing an 'offset mapping' method, thereby solving the 'same-point mapping' problem and improving the analysis capability and the analysis efficiency of the multi-dimensional integer data set or the data set containing integer data dimensions.

In terms of visualization methods, the parallel coordinate improvement method for multidimensional integer-valued datasets proposed in the present invention has not found similar techniques in published documents.

Disclosure of Invention

The invention aims to provide a parallel coordinate improvement method for a multidimensional integer value type data set or a data set containing integer value type data dimensions, which comprises the following steps:

step 1: and counting the number of the types of the data values of each integer data dimension in the data set, and calculating the ratio of each data value.

For one of the integer data dimensions (set to D)_i) The calculation method of (2) is as follows:

step 1.1: integrating data dimension D_iIs extracted as a vector (denoted as V)_i). If the number of data records in the data set is T, then V_iThe number of component data of the vector is T.

Step 1.2: statistics V_iNumber of types of data values in vector (denoted NV)_i)。

Step 1.3: statistics V_iAnd the record number of each data value in the vector is sorted from more records to less records. Will V_iConverting the data values in the vector from 1 to NV according to the sequence of the record number from more to less_iThe data value named conversion value j is V_ijNaming the data dimension D_iIn satisfy V_i＝V_ijIs NV_ij。

In the invention, V is_iThe converted data value of each data value in the vector is referred to as the "conversion value", V_iThe conversion value of each data value in the vector ranges from 1 to NV_i。

Step 1.4: calculating V_iAnd recording ratio corresponding to each data value in the vector. Satisfy V_i＝V_ijIs recorded in the ratio R_ijThe calculation method is as the formula (1):

where T is the total number of records in the data set described in step 1.1.

Step 2: and establishing a coordinate axis according to the data distribution of all integer data dimensions in the data set. If the non-integer data dimension exists in the data set at the same time, the coordinate axis establishing method corresponding to the non-integer data dimension remains unchanged from the traditional method.

For integer data dimension D_iThe corresponding coordinate axis establishing method comprises the following steps:

dividing coordinate axes into NV_iEach segment, called coordinate axis segment, corresponding to a data dimension D_iThe height of each coordinate axis segment is related to the proportion of the corresponding data value. The coordinate axis corresponding to the integer data dimension established by the method is formed by segmenting coordinate axes corresponding to different types of data values, and the coordinate axis is called as a segmented coordinate axis in the invention.

The calculation method of each piece of segmentation information in the coordinate axis comprises the following steps:

step 2.1: and calculating the height of the segmentation coordinate axis corresponding to each data value according to the height (expressed as height) of the coordinate axis in the parallel coordinate system in the final visualization result.

In an integer data dimension D_iFor example, the data value V_ijThe corresponding coordinate axis segmentation height calculation method is as the formula (2):

H_ij＝height*R_ij(2)

wherein R is_ijTo satisfy V as derived in step 1.4_i＝V_ijIs recorded to the ratio.

Step 2.2: the starting height and ending height of each "segmentation coordinate axis" are calculated. By a data value V_ijThe corresponding "segmentation coordinate axis" is taken as an example, and the calculation method of the starting height is as the following formula (3):

the calculation method of the ending height is as the formula (4):

and step 3: for each "coordinate axis segment" of the coordinate axis corresponding to all integer data dimensions, the offset height of a data record is calculated. By a data value V_ijCorresponding "coordinate axis segmentation" is taken as an example, and the offset height of one record is calculated according to the formula (5):

wherein H_ijFor the data value V obtained in step 2.1_ijCorresponding axis segment height, NV_ijTo satisfy V as derived in step 1.3_i＝V_ijThe number of records of (2).

And 4, step 4: and 3, calculating data value mapping basic data of all coordinate axes corresponding to the integer data dimension according to the adjacent relation of the coordinate axes established in the step 3.

The invention maps different heights aiming at the same data value in different records in the integral data dimension, thereby solving the problem of same-point mapping and effectively reducing the intersection of connecting lines. This mapping method is named as "offset mapping" method in the present invention.

In the offset mapping method, the mapping height of the data value depends on two factors. The first is the sequence of the record in the data set, and the second is the mapping height of other dimensional data values of the record on the left adjacent coordinate axis (the coordinate axis corresponding to the current data dimension is the leftmost coordinate axis, and the factor is not considered).

The specific mapping method is divided into two cases: one is that the data dimension corresponding to the left adjacent coordinate axis is an integer data dimension, in which case step 5 is continued; and the other is that the data dimension corresponding to the left adjacent coordinate axis is a non-integer data dimension or the coordinate axis corresponding to the current data dimension is the leftmost coordinate axis, and in this case, the step 6 is skipped.

And 5: in the step, under the condition that the data dimension corresponding to the left adjacent coordinate axis is a numerical data dimension, the data value mapping basic data is calculated.

For the current coordinate axis, the step of calculating the data value mapping basic data is as follows:

step 5.1: setting an integer data dimension D corresponding to an adjacent coordinate axis on the left side of the coordinate axis of the current integer data dimension_uSetting the vector extracted by the integer data dimension as V_uVector V_uThe number of kinds of medium data value is NV_u(NV_uCalculated according to step 1.2).

Step 5.2: statistical integer data dimension D_uAnd D_iI.e. for any V_uConverted values p and V of_iThe conversion value q of (a), statistics satisfy V_u＝V_upAnd V is_i＝V_iqIs named as

Wherein V_upV corresponding to conversion value p_uData value of (1), V_iqV corresponding to the conversion value q_iThe data value of (1).

Step 5.3: according to vector V_uNumber of kinds NV of medium data values_uThe data value V in the current coordinate axis is used_iqCorresponding division of coordinate axis into NV_uA "coordinate axis sub-segment".

Step 5.4: and calculating the heights of all the coordinate axis subsections in the current coordinate axis.

With V_u＝V_upAnd V is_i＝V_iqFor example, the height of the coordinate axis sub-segment corresponding thereto

The calculation method is as the formula (6):

in NV_iqTo satisfy V as derived in step 1.3_i＝V_iqThe number of records of (1), H_iqIs obtained according to step 2.1V_i＝V_iqThe corresponding axis segment height is set to be,

to satisfy V as derived in step 5.2_u＝V_upAnd V is_i＝V_iqThe number of records of (2).

Step 5.5: and calculating the starting heights of all the coordinate axis subsections in the current coordinate axis.

With V_u＝V_upAnd V is_i＝V_iqFor example, the starting height of the corresponding coordinate axis sub-segment is recorded as

The calculation method is as the formula (7):

wherein Hstart_iqIs the data value V obtained according to step 2.2_iqThe starting height of the corresponding coordinate axis segment,

to satisfy V as derived in step 5.4_u＝V_ukAnd V is_i＝V_iqThe height of the coordinate axis sub-segment.

Step 5.6: the "next mapping height" of the "coordinate axis sub-segment" is set for each "coordinate axis sub-segment" of the current coordinate axis.

With V_u＝V_upAnd V is_i＝V_iqFor example, the next mapping height of the coordinate axis sub-segment to which it corresponds

Is assigned as the starting height of the coordinate axis segment in which it is located

Jump to step 7.

Step 6: in the step, under the condition that the adjacent coordinate axis on the left side is a non-integer data dimension or the current data dimension is the coordinate axis corresponding to the leftmost data dimension, the data value mapping basic data is calculated.

Because the left side does not have a coordinate axis corresponding to the integer data dimension, the segmentation of the current coordinate axis does not need to be continuously divided into coordinate axis sub-segments, and the next mapping height of all coordinate axis segments is directly set.

With V_i＝V_iqFor example, it corresponds to the next mapping height Hnext of the coordinate axis segment_iqIs assigned to the corresponding Hstart_iqI.e. the data value V obtained according to step 2.2_iqThe starting height of the corresponding coordinate axis segment.

And 7: and calculating the mapping height of each dimension data value on the corresponding coordinate axis of each record in the data set.

For each record, if the current data dimension is a non-integer data dimension, calculating the mapping height of the data value on the corresponding coordinate axis by using a traditional method;

if the current data dimension is an integer data dimension and the coordinate axis adjacent to the left side of the corresponding coordinate axis is an integer data dimension coordinate axis, continuing to execute the step 7.1;

if the current data dimension is an integer data dimension and the corresponding coordinate axis is the leftmost coordinate axis or the left adjacent coordinate axis is a non-integer data dimension, continue to execute step 7.3.

Step 7.1: in an integer data dimension D_iData value V of_i＝V_iqFor example, a data dimension vector (named V) corresponding to the left coordinate axis_u) The data value (named V) of the record is obtained_up) I.e. the record satisfies V_u＝V_upAnd V is_i＝V_iq。

According to V_u＝V_upAnd V is_i＝V_iqIn step 5, the next mapping height of the corresponding coordinate axis sub-segment is obtained

That is, the piece of data is in the data dimension D_iThe mapping height on the corresponding coordinate axis.

Step 7.2: according to V_i＝V_iqIn step 3, the data value V is obtained_iqOne recording offset height I of the corresponding "coordinate axis segment_iqUpdate

As in the formula (8),

jump to step 8.

Step 7.3: in an integer data dimension D_iData value V of_i＝V_iqFor example, V obtained in step 6_iqNext mapping height Hnext of corresponding coordinate axis sub-segment_iq，Hnext_iqThat is, the piece of data is in the data dimension D_iThe mapping height on the corresponding coordinate axis.

Step 7.4: according to V_i＝V_iqIn step 3, the data value V is obtained_iqOne recording offset height I of the corresponding "coordinate axis segment_iqUpdate Hnext_iqAs shown in the formula (9),

Hnext_iq＝Hnext_iq+I_iq(9)

and 8: in order to distinguish each coordinate axis segment in the coordinate axis corresponding to the integer value type data dimension, different textures can be set for each coordinate axis segment, and the textures can be selected by using distinctive colors or shading.

And step 9: and drawing an improved parallel coordinate visualization result of the current data set according to the coordinate axis information obtained in the steps 1 to 8, the mapping heights of all records and the segmented textures of the coordinate axes.

Advantageous effects

By the parallel coordinate improvement method provided by the invention, the recording number ratio condition of each data value in each data dimension can be intuitively obtained through the height ratio of each segment in the coordinate axis; in the data screening interaction process, the association relationship and the association strength among all dimensional data can be rapidly obtained; the visual analysis capability for the multidimensional integer-value data set is improved.

Drawings

FIG. 1 is a flow chart of an implementation of a method for improving parallel coordinates of a multidimensional integer data set according to an embodiment of the present invention;

FIG. 2 is a parallel coordinate improvement method for a multidimensional integer value type data set, which is applied to the visualization effect of a pesticide residue detection result data set (desensitization and decryption).

FIG. 3 is a visualization result after interactive screening based on the visualization result of FIG. 2.

Detailed Description

The invention is further described below with reference to the accompanying drawings and examples.

Taking the data set of the pesticide residue detection result as an example, the data dimension comprises (area, year, month, agricultural product, pesticide), and the number of data records is 1241, wherein the first 10 data records are shown in table 1.

Table 1 pesticide residue detection result data set example data

The original data was subjected to integer data conversion as shown in table 2.

Table 2 conversion of sample data of pesticide residue detection result data set into integral value type data set

An implementation flow chart of the parallel coordinate improvement method for the multidimensional integral value type data set in the embodiment is shown in fig. 1, and specific operation processes of the method are described in combination with the pesticide residue detection result data set as follows:

For one of the integer data dimension "month" (set to D)₃) The calculation method of (2) is as follows:

step 1.1: integrating data dimension D₃Is extracted as a vector (denoted as V)₃)，V₃(11, 3, 1.., 2, 1). The number of data records in the data set is T-1241, then V₃The number of component data of the vector is T1241.

Step 1.2: data dimension D₃Vector V₃The number NV of data value types in (11, 3, 1., 2, 1)₃＝9。

Step 1.3: statistics V₃And the record number of each data value in the vector is sorted from more records to less records. Will V₃Converting the data values in the vector from 1 to NV according to the sequence of the record number from more to less₃(NV₃Calculated from step 1.2) with a conversion value of 1, 2 months (named V)_3，1) Data dimension D₃In satisfy V₃＝V_3，1Is NV_3，1＝283。

In the invention, V is₃The converted data value of each data value in the vector is referred to as the "conversion value", V₃The conversion value of each data value in the vector ranges from 1 to NV₃(calculated from step 1.2).

Step 1.4: calculating V₃The log ratio for each data value in the (11, 3, 1., 2, 1) vector. Satisfy V₃＝V_3，1Is recorded in the ratio R_3，1R is obtained by calculation according to the formula (1)_3，1The calculation method of (2) is as in formula (10):

where T1241 is the number of all records in the data set described in step 1.1.

Step 2: and establishing a coordinate axis by the data distribution of all integer data dimensions in the data set. If the non-integer data dimension exists in the data set at the same time, the coordinate axis establishing method corresponding to the non-integer data dimension remains unchanged from the traditional method.

For integer data dimension D₃The corresponding coordinate axis establishing method comprises the following steps:

dividing coordinate axes into NV₃Each segment, called coordinate axis segment, corresponding to a data dimension D₃The height of each coordinate axis segment is related to the proportion of the corresponding data value. The coordinate axis corresponding to the integer data dimension established by the method is formed by segmenting coordinate axes corresponding to different types of data values, and the coordinate axis is called as a segmented coordinate axis in the invention.

according to step 2.1: the height of the "segmentation coordinate axis" corresponding to each data value is calculated according to the height (indicated as height 520) of the coordinate axis in the "parallel coordinate system" in the final visualization result.

In an integer data dimension D₃For example, the data value V is calculated according to the formula (2)_3，1The corresponding coordinate axis segment height is calculated according to the formula (11):

wherein R is_3，1To satisfy V as derived in step 1.4₃＝V_3，1Is recorded to the ratio.

Step 2.2: the starting height and ending height of each "segmentation coordinate axis" are calculated. By a data value V_3，1Taking the corresponding "segmented coordinate axis" as an example, the calculation method for calculating the starting height according to the formula (3) is as the formula (12):

Hstart_3，1＝0(j＝1) (12)

the calculation method for calculating the ending height according to the formula (4) is as the formula (13):

according to step 3: for each "coordinate axis segment" of the coordinate axis corresponding to all integer data dimensions, the offset height of a data record is calculated. By a data value V_3，1For example, the offset height of a record is calculated according to formula (5) as shown in formula (14):

wherein

For the data value V obtained in step 2.1_3，1Corresponding axis segment height, NV_3，1To satisfy V as derived in step 1.3₃＝V_3，1The number of records of (2).

For the current coordinate axis, the step of calculating the mapping height of each record on the integral data dimension is as follows:

step 5.1: setting an integer data dimension D corresponding to an adjacent coordinate axis on the left side of the coordinate axis of the current integer data dimension₂Setting the vector extracted by the integer data dimension as V₂Vector V₂The number of kinds of medium data value is NV₂＝3(NV₂Calculated according to step 1.2).

Step 5.2: statistical integer data dimension D₂And D₃I.e. for any V₂Is 1 and V₃The conversion value q of (1) is satisfied with V₂＝V_2，1And V is₃＝V_3，1Number of records (named

The number of records

Wherein V_2，1V corresponding to 1 for conversion value p₂Data value of (1), V_3，1V corresponding to 1 for conversion value q₃The data value of (1).

Step 5.3: according to vector V₂Number of kinds NV of medium data values₂3, the data value V in the current coordinate axis_3，1Corresponding division of coordinate axis into NV₂3 "coordinate axis subsections".

With V₂＝V_2，1And V is₃＝V_3，1For example, the height of the corresponding coordinate axis sub-segment is calculated according to equation (6)

The calculation method of (2) is as in formula (15):

in NV_3，1To satisfy V as derived in step 1.3₃＝V_3，1The number of records of (1), H_3，1For V obtained according to step 2.1₃＝V_3，1The corresponding axis segment height is set to be,

to satisfy V as derived in step 5.2₂＝V_2，1And V is₃＝V_3，1The number of records of (2).

With V₂＝V_2，1And V is₃＝V_3，1For example, the starting height of the corresponding coordinate axis sub-segment is calculated according to the formula (7)

The calculation method is as the formula (16):

wherein Hstart_3，1Is the data value V obtained according to step 2.2_3，1The starting height of the corresponding coordinate axis segment.

With V₂＝V_2，1And V is₃＝V_3，1For example, the next mapping height of the coordinate axis sub-segment to which it corresponds

Jump to step 7.

With V₁＝V_1，1For example, it corresponds to the next mapping height Hnext of the coordinate axis segment_1，1Is assigned to the corresponding Hstart_1，10, i.e. the data value V obtained in step 2.2_1，1The starting height of the corresponding coordinate axis segment.

Step 7.1: in an integer data dimension D₃Data value V of₃＝V_3，1For example, a data dimension vector (named V) corresponding to the left coordinate axis₂) The data value (named V) of the record is obtained_2，1) I.e. the record satisfies V₂＝V_2，1And V is₃＝V_3，1。

According to V₂＝V_2，1And V is₃＝V_3，1In step 5, the next mapping height of the corresponding coordinate axis sub-segment is obtained

That is, the piece of data is in the data dimension D₃The mapping height on the corresponding coordinate axis.

Step 7.2: according to V₃＝V_3，1In step 3, the data value V is obtained_3，1One recording offset height I of the corresponding "coordinate axis segment_3，1Update

Calculated according to the formula (8)

The calculation formula is as (17):

jump to step 8.

Step 7.3: in an integer data dimension D₃Data value V of₃＝V_3，1For example, V obtained in step 6_3，1Next mapping height Hnext of corresponding coordinate axis sub-segment_3，1，Hnext_3，1That is, the piece of data is in the data dimension D₃The mapping height on the corresponding coordinate axis.

Step 7.4: according to V₃＝V_3，1In step 3, the data value V is obtained_3，1One recording offset height I of the corresponding "coordinate axis segment_3，1Updating Hnext according to equation (9)_3，1。

In the invention, the diagonal stripes and the cross stripes are selected as coordinate axis segmentation textures.

FIG. 2 is a parallel coordinate improvement method for a multidimensional integer value type data set, which is applied to the visualization effect of a pesticide residue detection result data set (desensitization and decryption). From the visualization result, the visual analysis conclusion of multi-dimensional comparison on the pesticide residue detection data set example data comprises the following steps:

(1) after the method is applied in the process of drawing the parallel coordinates, each coordinate axis is divided into a plurality of sections, the height of each section represents the data record number of the data value of the section, and the comparison of the data record number is realized. In the "region" dimension, the number of data records in the sunny region is the highest, and the number of data records in the mountain region is the lowest. In the "years" dimension, 2012's of data records are the most, followed by 2014 and finally 2013. The number of data records for february is the largest and the number of data records for february is the smallest in the "month" dimension. The number of records in the "day" dimension is the largest for number five and the smallest for number eighteen. The cucumber records are the most and the peach records are the least in the "agricultural products" dimension. In the "pesticide" dimension, the most pesticide was not detected, indicating that pesticide use in most agricultural products is standard.

(2) The incidence relation analysis between every two coordinate axes can be realized by adjusting the coordinate axis sequence. The "year" dimension may analyze the association with the "region" dimension, the association with the "month" dimension, or the association with the "day" dimension.

In the invention, the distribution condition of the data values of different dimensions in each data value of other dimensions can be analyzed by screening the data values of different dimensions. Fig. 3 is a data visualization result obtained by screening the sunny region based on the visualization result of fig. 2, and data records with data values of the "sunny region" in the region dimension are displayed, and the rest are not displayed. From the visualization results after screening, the analysis conclusion can be that: in the "year" dimension, the number of data records is distributed most in 2012, and the distribution of 2014 and 2013 is approximately the same; in the 'month' dimension, the number of data records is distributed to the maximum in October, and the number of data records in September is the minimum; in the "day" dimension, the number of data records 10 is the largest and the number of data records 14 is the smallest.

Claims

1. The parallel coordinate improvement method for the multidimensional integer value type data set comprises the following steps:

step 1: counting the number of types of data values of each integer data dimension in the data set, and calculating the ratio of each data value;

for one of the integer data dimensions D_iThe calculation method of (2) is as follows:

step 1.1: integrating data dimension D_iIs extracted as a vector V_iIf the number of data records in the data set is T, then V_iThe number of the component data of the vector is T;

step 1.2: statistics V_iNumber of types NV of data values in a vector_i；

Step 1.3: statistics V_iThe record number of each data value in the vector is sorted according to the record number from more to less, and V is obtained_iConverting the data values in the vector from 1 to NV according to the sequence of the record number from more to less_iThe data value named conversion value j is V_ijNaming an integer data dimension D_iIn satisfy V_i＝V_ijIs NV_ij；

Will V_iThe converted data value of each data value in the vector is referred to as the "conversion value", V_iThe conversion value of each data value in the vector ranges from 1 to NV_i；

Step 1.4: calculating V_iRecording proportion corresponding to each data value in the vector;

step 2: establishing a coordinate axis according to the data distribution of all integer data dimensions in the data set, and if non-integer data dimensions exist in the data set at the same time, keeping the coordinate axis establishing method corresponding to the non-integer data dimensions unchanged in the traditional method;

dividing coordinate axes into NV_iEach segment is called a coordinate axis segment, and each coordinate axis segment corresponds to an integer data dimension D_iThe height of each coordinate axis segment is related to the proportion of the corresponding data value;

step 2.1: calculating the height H of the coordinate axis segmentation corresponding to each data value according to the height of the coordinate axis in the parallel coordinate system in the final visualization result_ij；

Step 2.2: calculate the starting height Hstart of each "coordinate axis segment_ijAnd a finish height Hend_ij；

And step 3: calculating the offset height of a data record aiming at each coordinate axis segment corresponding to all integer data dimensions, and enabling the integer data dimensions D_iThe offset height of a data record of a coordinate axis segment with a transition value j is named I_ij；

And 4, step 4: calculating data value mapping basic data of all coordinate axes corresponding to the integer data dimension according to the adjacent relation of the coordinate axes established in the step 3;

the specific calculation method is divided into two cases: one is that the data dimension corresponding to the left adjacent coordinate axis is an integer data dimension, in which case step 5 is continued; the other is that the data dimension corresponding to the left adjacent coordinate axis is a non-integer data dimension or the coordinate axis corresponding to the current data dimension is the leftmost coordinate axis, and in this case, the step 6 is skipped;

and 5: the method comprises the following steps of calculating data value mapping basic data under the condition that data dimensions corresponding to left adjacent coordinate axes are numerical data dimensions;

for a current integer data dimension D_iAnd calculating data value mapping basic data according to the corresponding coordinate axis as follows:

step 5.1: setting the current "integer-type data dimension D_iAdjacent coordinate axes to the left of the coordinate axes "Corresponding to an integer data dimension D_uThe integer data dimension D_uThe extracted vector is V_uVector V_uThe number of kinds of medium data value is NV_uIn which NV is_uCan be calculated according to the method of step 1.2;

step 5.2: statistical integer data dimension D_uAnd an integer data dimension D_iFor an integer data dimension D_uAnd each conversion value p of the integer data dimension Di, and conforming the condition "D" in the data set_uDimension conversion value is p 'and' D_iThe number of records with a dimension conversion value of q "is named

Step 5.3: according to vector V_uNumber of kinds NV of medium data values_uThe data value V in the current coordinate axis is used_iqCorresponding division of coordinate axis into NV_u"coordinate axis subsections";

step 5.4: calculating the heights of all coordinate axis subsections in the current coordinate axis, and enabling the integral value type data dimension D_uConversion value of p and integer data dimension D_iThe height of the coordinate axis subsection with the conversion value of q is named as

Step 5.5: calculating the initial heights of all coordinate axis subsections in the current coordinate axis, and enabling the integral value type data dimension D_uConversion value of p and integer data dimension D_iThe starting height of the coordinate axis sub-segment with the conversion value q is recorded as

Step 5.6: setting the next mapping height of the coordinate axis sub-segment for each coordinate axis sub-segment of the current coordinate axis, and assigning an initial value as the initial height of the coordinate axis sub-segment;

step 6: the step is that a needle is usedInteger data dimension D_iThe left adjacent coordinate axis is a non-integer data dimension or an integer data dimension D_iA calculation method for mapping the data value to the basic data under the condition of the coordinate axis corresponding to the leftmost data dimension;

for integer data dimension D_iThe next mapping height Hnext of the coordinate axis segment with the conversion value of q_iqIs assigned as the Hstart of the coordinate axis segment_iq，Hstart_iqI.e. the data value V obtained according to step 2.2_iqThe starting height of the corresponding coordinate axis segment;

and 7: calculating the mapping height of each dimension data value on the corresponding coordinate axis of each record in the data set;

if the current data dimension is an integer data dimension and the corresponding coordinate axis is the leftmost coordinate axis or the left adjacent coordinate axis is a non-integer data dimension, continuing to execute the step 7.3;

step 7.1: integer data dimension D for the left side_uConversion value is p and current integer data dimension D_iConverting the data record with the value q, and acquiring the next mapping height of the coordinate axis sub-segment corresponding to the data record in step 5

I.e. the data in the integer data dimension D_iMapping height on corresponding coordinate axes;

step 7.2: in an integer data dimension D according to the current record_iThe conversion value q of (a) is obtained in step 3 as a recording offset height I of the corresponding "coordinate axis segment_iqUpdate

Skipping to step 8;

step 7.3: for a current integer data dimension D_iConverting the data records with value q, V obtained in step 6_iqNext mapping height Hnext of corresponding coordinate axis sub-segment_iq，Hnext_iqThat is, the piece of data is in the data dimension D_iMapping height on corresponding coordinate axes;

step 7.4: in an integer data dimension D according to the current record_iThe conversion value q of (a) is obtained in step 3 as a recording offset height I of the corresponding "coordinate axis segment_iqUpdate Hnext_iq＝Hnext_iq+I_iq；

And 8: in order to distinguish each coordinate axis segment in the coordinate axes corresponding to the integer value type data dimension, different textures can be set for each coordinate axis segment, and the textures can be selected by using distinguishable colors or shading;