CN107368599B - Visual analysis method and system for high-dimensional data - Google Patents

Visual analysis method and system for high-dimensional data Download PDF

Info

Publication number
CN107368599B
CN107368599B CN201710620143.6A CN201710620143A CN107368599B CN 107368599 B CN107368599 B CN 107368599B CN 201710620143 A CN201710620143 A CN 201710620143A CN 107368599 B CN107368599 B CN 107368599B
Authority
CN
China
Prior art keywords
subspace
point
establishing
mapping
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201710620143.6A
Other languages
Chinese (zh)
Other versions
CN107368599A (en
Inventor
夏佳志
李强
廖胜辉
奎晓燕
王建新
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Central South University
Original Assignee
Central South University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Central South University filed Critical Central South University
Priority to CN201710620143.6A priority Critical patent/CN107368599B/en
Publication of CN107368599A publication Critical patent/CN107368599A/en
Application granted granted Critical
Publication of CN107368599B publication Critical patent/CN107368599B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Probability & Statistics with Applications (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Computational Linguistics (AREA)
  • Image Analysis (AREA)

Abstract

The invention relates to a visual analysis method of high-dimensional data, which comprises the steps of establishing local subspace difference-geodesic distance projection on original high-dimensional data; establishing mapping of clustering point clusters; visual analysis views of the series of subspaces are established. The invention also discloses an analysis system for realizing the visual analysis method of the high-dimensional data. According to the invention, a series of interactive visual analysis operations are provided by establishing local subspace difference-geodesic distance projection, clustering point cluster mapping and a series of subspace visual analysis views, so that a reliable technical basis is provided for visual subspace clustering and analysis, a user can be effectively guided and helped to effectively analyze and explore high-dimensional data, the times of user tests and errors are obviously reduced in high-dimensional data processing, the redundancy of data is reduced, the interchangeability of a data analysis process is enhanced, and the reliability of results is improved.

Description

Visual analysis method and system for high-dimensional data
Technical Field
The invention particularly relates to a visual analysis method and a visual analysis system for high-dimensional data.
Background
With the development of national economic technology and the arrival of digital society, data has become an indispensable part in production and life of people. People are trading everyday with endless data, such as financial data, scientific computing data, biomedical data, and the like. Therefore, data analysis is one of the most popular areas of development today. Data mining and visual analysis technology is an important part in the field of information technology; in the data mining and analyzing process, a powerful visualization analysis method can double the effect with half the effort.
High dimensionality is one of the important features of large data. High-dimensional data often contains subspace cluster structures. Automated subspace clustering methods often generate highly redundant and difficult to understand subspace clustering results. Visual analysis is an effective method for enhancing user cognition and helping user understanding. However, most of current visual analysis methods for subspace clustering are automated method result-oriented visualization, rather than the subspace clustering task itself, and it is difficult to effectively improve the result of the subspace clustering method.
Disclosure of Invention
One of the objectives of the present invention is to provide a method for visually analyzing high-dimensional data, which can effectively guide and help users to effectively analyze and explore high-dimensional data.
It is another object of the present invention to provide an analysis system for implementing the method for visually analyzing high-dimensional data.
The invention provides a visual analysis method of high-dimensional data, which comprises the following steps:
s1, establishing local subspace difference-geodesic distance projection on original high-dimensional data;
s2, establishing mapping of a clustering point cluster in the projection of the local subspace difference-geodesic distance;
and S3, establishing a visual analysis view of the series subspace according to the local subspace difference-geodesic distance projection obtained in the step S1 and the mapping of the clustering point cluster obtained in the step S2.
Step S1, which is to establish the local subspace difference-geodesic distance projection on the original high-dimensional data, specifically, the following steps are adopted to establish the projection:
A. establishing a data point correlation measurement based on geodesic distance aiming at high-dimensional data needing projection;
B. b, establishing a local subspace difference metric according to the metric established in the step A;
C. and B, establishing local subspace difference-geodesic distance projection according to the metrics established in the steps A and B.
Step A, establishing a data point correlation measurement based on geodesic distance, specifically adopting the following steps to establish the measurement:
constructing an S-NN graph with a plurality of connected components on the basis of a high-order data set needing projection;
II, aiming at each connected component in the step I, connecting any two independent connected components;
and III, calculating the shortest distance between any two points so as to obtain the geodesic distance.
And step II, connecting any two independent connected components, specifically connecting two data points closest to each other in the two connected components.
And step III, calculating the shortest distance, specifically adopting a shortest path algorithm to calculate.
Step B, establishing the local subspace difference metric, specifically, establishing the metric by using the following steps:
1) the weights for each dimension are calculated using the following formula:
ω=[ω12,...,ωi,...,ωd],
Figure GDA0002449208560000021
where ω is a dimensional weight matrix, ωiWeight, σ, representing the ith dimensioniRepresenting the variance of the data points in the ith dimension, wherein d is the number of the dimensions;
2) calculating the weighted distance between any two points in the SNN graph by adopting the following formula:
dpq[W]=max{dpqp],dpqq]}
wherein
Figure GDA0002449208560000031
dpq[W]Is a weighted distance matrix, ω, of points p and qp=[ωp1p2,...,ωpi,...,ωpd]The feature vector, ω, representing the local subspace of the point pq=[ωq1q2,...,ωqi,...,ωqd]Local subspace representing qCharacteristic vector of diIs the Euclidean distance between a local subspace p and a point q in the ith dimension, dpqp]Weighted distance of point q with respect to point p, dpqq]Is the weighted distance of point p relative to point q;
3) based on cosine similarity, a delta matrix is established according to the following formula:
Figure GDA0002449208560000032
in the formula
Figure GDA0002449208560000033
Is a point piAnd pjDifference values based on cosine similarity; i and j are numbers of data points, the numeric range is [0, n), and n is the size of the data set.
And C, establishing a local subspace difference-geodesic distance projection, namely mapping the data points in the space to an x axis through a local subspace difference metric by using an MDS algorithm, and mapping to a y axis through a data point correlation metric of the geodesic distance.
The step S2 of establishing the mapping of the cluster of clustering points specifically includes the following steps:
extracting the data point information which is selected in the local subspace difference-geodesic distance projection established in the step S1, wherein the selection operation is completed by user interaction operation to obtain a data point set to be processed;
(II) calculating a feature vector of the data point set obtained in the step (I) by using a PCA (principal Component analysis) method;
and (III) selecting the two eigenvectors with the minimum eigenvalue in the step (II) to generate mapping transformation on a two-dimensional plane, thereby completing the mapping of the cluster point clusters.
The step S3 of establishing a visual analysis view of the series of subspaces specifically includes the following steps:
(a) performing non-parametric representation on the subspace;
(b) performing similarity measurement on the subspaces;
(c) and establishing a contrast interactive operation interface of the subspace according to the non-parametric representation and the similarity measurement of the subspace, thereby completing the visual analysis of the high-dimensional data.
Performing non-parametric representation on the subspace in the step (a), specifically performing non-parametric representation by adopting the following rules:
defining a subspace S accommodating the clusters CcIs { omega [ p ]]P ∈ C, where ω p]Is the local subspace feature vector for point p.
Performing similarity measurement on the subspace in the step (b), specifically, performing similarity measurement by adopting the following steps:
for a d-dimensional subspace housing a cluster C having n points, the eigenvector is defined by the following equation
Figure GDA0002449208560000041
Figure GDA0002449208560000042
Wherein
Figure GDA0002449208560000043
In the formula [ omega ]i1i2,...,ωij,...ωid]For clustering point p in CiThe subspace feature vector of (a);
(ii) using the eigenvectors obtained in step (i)
Figure GDA0002449208560000044
The cosine similarity of the two subspaces is measured.
Establishing a subspace comparison interactive operation interface in the step (c), specifically adopting the following rules to establish an operation interface:
hypothetical clustering of C1Is contained in subspace V1In, cluster C2Is contained in subspace V2And (3) establishing an operation interface by adopting the following rules:
subspace summation interface: subspace V1+V2Result of (A) is C1∩C2Subspace V1+V2Is V1And V2A union of active dimensions;
subspace interface finding: subspace V1∩V2Result of (A) is C1∪C2Subspace V1∩V2Is V1And V2The intersection of active dimensions of;
subspace list ordering interface: given a set of selected dimensions, the feature vectors in the selected dimensions are used to judge the similarity of one-degree subspaces, and a set of subspaces is mapped onto a 1-dimensional coordinate axis through a 1-dimensional MDS (multidimensional scaling) algorithm to form a sorted list of subspaces.
The invention also discloses an analysis system for realizing the visual analysis method of the high-dimensional data, which comprises a local subspace difference-geodesic distance projection establishing module, a mapping establishing module of a cluster point cluster and a visual analysis view establishing module of a series of subspaces which are sequentially connected in series; the local subspace difference-geodesic distance projection establishing module is used for establishing local subspace difference-geodesic distance projection on the original high-dimensional data and uploading a mapping establishing module of the clustering point cluster; the mapping establishing module of the clustering point cluster is used for establishing the mapping of the clustering point cluster according to the local subspace difference-geodesic distance projection and uploading a visual analysis view establishing module of a series of subspaces; and the visual analysis view establishing module of the series subspace is used for establishing the visual analysis view of the series subspace according to the established mapping of the local subspace difference-geodesic distance projection and the clustering point cluster.
According to the visual analysis method and the visual analysis system for the high-dimensional data, provided by the invention, a series of interactive visual analysis operations are provided by establishing local subspace difference-geodesic distance projection, clustering point cluster mapping and a series of subspace visual analysis views, so that a reliable technical basis is provided for visual subspace clustering and analysis, a user can be effectively guided and helped to effectively analyze and explore the high-dimensional data, the times of user tests and errors are obviously reduced in the high-dimensional data processing, the redundancy of the data is reduced, the interchangeability of the data analysis process is enhanced, and the reliability of results is improved.
Drawings
FIG. 1 is a process flow diagram of the process of the present invention.
FIG. 2 is a functional block diagram of the system of the present invention.
Detailed Description
FIG. 1 shows a flow chart of the method of the present invention: the invention provides a visual analysis method of high-dimensional data, which comprises the following steps:
s1, establishing a local subspace difference-geodesic distance projection on original high-dimensional data, specifically adopting the following steps to establish the projection:
A. aiming at high-dimensional data needing projection, establishing a data point correlation measurement based on geodesic distance, specifically adopting the following steps to establish the measurement:
constructing an S-NN graph with a plurality of connected components on the basis of a high-order data set needing projection; the SNN map refers to a sub-map of the K-NN map. Specifically, in the SNN graph, there is an edge between points p, q if and only if they are k neighbors;
II, aiming at each connected component in the step I, connecting any two independent connected components, specifically connecting two data points which are closest to each other in the two connected components;
calculating the shortest distance between any two points by adopting a shortest path algorithm so as to obtain the geodesic distance;
B. and B, establishing a local subspace difference metric according to the metric established in the step A, wherein the metric is established by adopting the following steps:
1) the weights for each dimension are calculated using the following formula:
ω=[ω12,...,ωi,...,ωd],
Figure GDA0002449208560000061
where ω is a dimensional weight matrix, ωiWeight, σ, representing the ith dimensioniRepresenting the variance of the data points in the ith dimension, wherein d is the number of the dimensions;
2) calculating the weighted distance between any two points in the SNN graph by adopting the following formula:
dpq[W]=max{dpqp],dpqq]}
wherein
Figure GDA0002449208560000071
dpq[W]Is a weighted distance matrix, ω, of points p and qp=[ωp1p2,...,ωpi,...,ωpd]The feature vector, ω, representing the local subspace of the point pq=[ωq1q2,...,ωqi,...,ωqd]Feature vector representing local subspace of q, diIs the Euclidean distance between a local subspace p and a point q in the ith dimension, dpqp]Weighted distance of point q with respect to point p, dpqq]Is the weighted distance of point p relative to point q;
3) based on cosine similarity, a delta matrix is established according to the following formula:
Figure GDA0002449208560000072
in the formula
Figure GDA0002449208560000073
Is a point piAnd pjDifference values based on cosine similarity; i and j are numbers of data points, the numeric range is [0, n), and n is the size of the data set;
C. b, according to the measurement established in the steps A and B, establishing a local subspace difference-geodesic distance projection, specifically, mapping data points in the space to an x axis through a local subspace difference measurement by using an MDS algorithm, and mapping to a y axis through a data point correlation measurement of the geodesic distance; the x-axis represents the local subspace difference and the y-axis represents the geodesic distance measure. In the local subspace difference-geodesic distance mapping, the clusters are distinguished;
s2, establishing mapping of the clustering point clusters in the projection of the local subspace difference-geodesic distance, specifically adopting the following steps to establish mapping:
extracting the data point information which is selected in the local subspace difference-geodesic distance projection established in the step S1, wherein the selected data points are obtained by the user through self selection, and a data point set to be processed is obtained;
(II) calculating a feature vector of the data point set obtained in the step (I) by using a PCA (principal Component analysis) method;
(III) selecting two eigenvectors with the minimum eigenvalues in the step (II) to generate mapping transformation on a two-dimensional plane, thereby completing the mapping of the cluster point clusters;
s3, establishing a visual analysis view of a series of subspaces according to the local subspace difference-geodesic distance projection obtained in the step S1 and the mapping of the clustering point cluster obtained in the step S2, specifically adopting the following steps to establish the visual analysis view:
(a) and performing non-parametric representation on the subspace, specifically performing non-parametric representation by adopting the following rules:
defining a subspace S accommodating the clusters CcIs { omega [ p ]]P ∈ C, where ω p]A local subspace feature vector for point p;
(b) and (3) carrying out similarity measurement on the subspace, specifically carrying out similarity measurement by adopting the following steps:
for a d-dimensional subspace housing a cluster C having n points, the eigenvector is defined by the following equation
Figure GDA0002449208560000081
Figure GDA0002449208560000082
Wherein
Figure GDA0002449208560000083
In the formula [ omega ]i1i2,...,ωij,...ωid]For clustering point p in CiThe subspace feature vector of (a);
(ii) using the eigenvectors obtained in step (i)
Figure GDA0002449208560000084
The cosine similarity of the two subspaces is measured;
(c) establishing a contrast interactive operation interface of the subspace according to the non-parametric representation and the similarity measurement of the subspace, thereby completing the visual analysis of the high-dimensional data; specifically, the following rules are adopted to establish an operation interface:
hypothetical clustering of C1Is contained in subspace V1In, cluster C2Is contained in subspace V2And (3) establishing an operation interface by adopting the following rules:
subspace summation interface: subspace V1+V2Result of (A) is C1∩C2Subspace V1+V2Is V1And V2A union of active dimensions;
subspace interface finding: subspace V1∩V2Result of (A) is C1∪C2Subspace V1∩V2Is V1And V2The intersection of active dimensions of;
subspace list ordering interface: given a set of selected dimensions, the feature vectors in the selected dimensions are used to judge the similarity of one-degree subspaces, and a set of subspaces is mapped onto a 1-dimensional coordinate axis through a 1-dimensional MDS (multidimensional scaling) algorithm to form a sorted list of subspaces.
FIG. 2 shows a functional block diagram of the system of the present invention: the invention also discloses an analysis system for realizing the visual analysis method of the high-dimensional data, which comprises a local subspace difference-geodesic distance projection establishing module, a mapping establishing module of a cluster point cluster and a visual analysis view establishing module of a series of subspaces which are sequentially connected in series; the local subspace difference-geodesic distance projection establishing module is used for establishing local subspace difference-geodesic distance projection on the original high-dimensional data and uploading a mapping establishing module of the clustering point cluster; the mapping establishing module of the clustering point cluster is used for establishing the mapping of the clustering point cluster according to the local subspace difference-geodesic distance projection and uploading a visual analysis view establishing module of a series of subspaces; and the visual analysis view establishing module of the series subspace is used for establishing the visual analysis view of the series subspace according to the established mapping of the local subspace difference-geodesic distance projection and the clustering point cluster.

Claims (2)

1. A visual analysis method of high-dimensional data comprises the following steps:
s1, establishing local subspace difference-geodesic distance projection on original high-dimensional data; specifically, the projection is established by adopting the following steps:
A. establishing a data point correlation measurement based on geodesic distance aiming at high-dimensional data needing projection; specifically, the following steps are adopted to establish the measurement:
constructing an S-NN graph with a plurality of connected components on the basis of a high-order data set needing projection;
II, aiming at each connected component in the step I, connecting any two independent connected components;
calculating the shortest distance between any two points to obtain the geodesic distance;
B. b, establishing a local subspace difference metric according to the metric established in the step A; specifically, the following steps are adopted to establish the measurement:
1) the weights for each dimension are calculated using the following formula:
Figure FDA0002449208550000011
where ω is a dimensional weight matrix, ωiWeight, σ, representing the ith dimensioniRepresenting the variance of the data points in the ith dimension, wherein d is the number of the dimensions;
2) calculating the weighted distance between any two points in the SNN graph by adopting the following formula:
dpq[W]=max{dpqp],dpqq]}
wherein
Figure FDA0002449208550000012
dpq[W]Is point p and pointq weighted distance matrix, ωp=[ωp1p2,...,ωpi,...,ωpd]The feature vector, ω, representing the local subspace of the point pq=[ωq1q2,...,ωqi,...,ωqd]Feature vector representing local subspace of q, diIs the Euclidean distance between a local subspace p and a point q in the ith dimension, dpqp]Weighted distance of point q with respect to point p, dpqq]Is the weighted distance of point p relative to point q;
3) based on cosine similarity, a delta matrix is established according to the following formula:
Figure FDA0002449208550000021
in the formula
Figure FDA0002449208550000022
Is a point piAnd pjDifference values based on cosine similarity; i and j are numbers of data points, the numeric range is [0, n), and n is the size of the data set;
C. according to the measurement established in the steps A and B, establishing a local subspace difference-geodesic distance projection;
s2, establishing mapping of a clustering point cluster in the projection of the local subspace difference-geodesic distance; specifically, the mapping is established by adopting the following steps:
extracting the data point information which is selected in the local subspace difference-geodesic distance projection established in the step S1, wherein the selected data point is obtained by user interaction selection, and thus a data point set to be processed is obtained;
(II) calculating to obtain a characteristic vector of the data point set in the step (I) by using a PCA method;
(III) selecting two eigenvectors with the minimum eigenvalues in the step (II) to generate mapping transformation on a two-dimensional plane, thereby completing the mapping of the cluster point clusters;
s3, establishing a visual analysis view of a series of subspaces according to the local subspace difference-geodesic distance projection obtained in the step S1 and the mapping of the clustering point clusters obtained in the step S2; specifically, the method comprises the following steps of establishing a visual analysis view:
(a) performing non-parametric representation on the subspace; specifically, the following rules are adopted for non-parametric representation:
defining a subspace S accommodating the clusters CcIs { omega [ p ]]P ∈ C, where ω p]A local subspace feature vector for point p;
(b) performing similarity measurement on the subspaces; specifically, the following steps are adopted for similarity measurement:
for a d-dimensional subspace housing a cluster C having n points, the eigenvector is defined by the following equation
Figure FDA0002449208550000023
Figure FDA0002449208550000031
Wherein
Figure FDA0002449208550000032
In the formula [ omega ]i1i2,...,ωij,...ωid]For clustering point p in CiThe subspace feature vector of (a);
(ii) using the eigenvectors obtained in step (i)
Figure FDA0002449208550000033
The cosine similarity of the two subspaces is measured;
(c) establishing a contrast interactive operation interface of the subspace according to the non-parametric representation and the similarity measurement of the subspace, thereby completing the visual analysis of the high-dimensional data; specifically, the following rules are adopted to establish an operation interface:
hypothetical clustering of C1Is contained in subspace V1In, cluster C2Is contained in subspace V2And (3) establishing an operation interface by adopting the following rules:
subspace summation interface: subspace V1+V2Result of (A) is C1∩C2Subspace V1+V2Is V1And V2A union of active dimensions;
subspace interface finding: subspace V1∩V2Result of (A) is C1∪C2Subspace V1∩V2Is V1And V2The intersection of active dimensions of;
subspace list ordering interface: and giving a group of selected dimensions, judging the similarity of one-degree subspaces by using the feature vectors on the selected dimensions, and mapping the group of subspaces to 1-dimensional coordinate axes by using a 1-dimensional MDS algorithm to form a sorted list of the subspaces.
2. An analysis system for realizing the visual analysis method of the high-dimensional data as claimed in claim 1, which is characterized by comprising a local subspace difference-geodesic distance projection establishing module, a mapping establishing module of cluster point clusters and a visual analysis view establishing module of a series of subspaces which are connected in series in sequence; the local subspace difference-geodesic distance projection establishing module is used for establishing local subspace difference-geodesic distance projection on the original high-dimensional data and uploading a mapping establishing module of the clustering point cluster; the mapping establishing module of the clustering point cluster is used for establishing the mapping of the clustering point cluster according to the local subspace difference-geodesic distance projection and uploading a visual analysis view establishing module of a series of subspaces; and the visual analysis view establishing module of the series subspace is used for establishing the visual analysis view of the series subspace according to the established mapping of the local subspace difference-geodesic distance projection and the clustering point cluster.
CN201710620143.6A 2017-07-26 2017-07-26 Visual analysis method and system for high-dimensional data Expired - Fee Related CN107368599B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710620143.6A CN107368599B (en) 2017-07-26 2017-07-26 Visual analysis method and system for high-dimensional data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710620143.6A CN107368599B (en) 2017-07-26 2017-07-26 Visual analysis method and system for high-dimensional data

Publications (2)

Publication Number Publication Date
CN107368599A CN107368599A (en) 2017-11-21
CN107368599B true CN107368599B (en) 2020-06-23

Family

ID=60307219

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710620143.6A Expired - Fee Related CN107368599B (en) 2017-07-26 2017-07-26 Visual analysis method and system for high-dimensional data

Country Status (1)

Country Link
CN (1) CN107368599B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108021664B (en) * 2017-12-04 2020-05-05 北京工商大学 Multidimensional data correlation visual analysis method and system based on dimension projection
CN108090182B (en) * 2017-12-15 2018-10-30 清华大学 A kind of distributed index method and system of extensive high dimensional data
CN111428631B (en) * 2020-03-23 2023-05-05 中南大学 Visual identification and sorting method for unmanned aerial vehicle flight control signals
CN115952426B (en) * 2023-03-10 2023-06-06 中南大学 Distributed noise data clustering method based on random sampling and user classification method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102163224A (en) * 2011-04-06 2011-08-24 中南大学 Adaptive spatial clustering method
CN105868352A (en) * 2016-03-29 2016-08-17 天津大学 High-dimensional data dimension ordering method based on dimension correlation analysis
CN106203516A (en) * 2016-07-13 2016-12-07 中南大学 A kind of subspace clustering visual analysis method based on dimension dependency

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6675103B1 (en) * 2000-03-22 2004-01-06 Tripos, Inc. Visualizing high dimensional descriptors of molecular structures

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102163224A (en) * 2011-04-06 2011-08-24 中南大学 Adaptive spatial clustering method
CN105868352A (en) * 2016-03-29 2016-08-17 天津大学 High-dimensional data dimension ordering method based on dimension correlation analysis
CN106203516A (en) * 2016-07-13 2016-12-07 中南大学 A kind of subspace clustering visual analysis method based on dimension dependency

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
一种基于子空间聚类的局部相关可视分析方法;夏佳志;《计算机辅助设计与图形学学报》;20161130;第1855-1862页 *

Also Published As

Publication number Publication date
CN107368599A (en) 2017-11-21

Similar Documents

Publication Publication Date Title
CN107368599B (en) Visual analysis method and system for high-dimensional data
Zhou et al. Variational LSTM enhanced anomaly detection for industrial big data
US11195312B1 (en) Gragnostics rendering
Liu et al. MFDNet: Collaborative poses perception and matrix Fisher distribution for head pose estimation
Granata et al. Accurate estimation of the intrinsic dimension using graph distances: Unraveling the geometric complexity of datasets
Gao et al. A robust distance measure for similarity-based classification on the SPD manifold
Murthy Bridging feature selection and extraction: compound feature generation
CN111476270A (en) Course information determining method, device, equipment and storage medium based on K-means algorithm
Lai et al. Point cloud discretization of Fokker--Planck operators for committor functions
Böhm et al. CoCo: coding cost for parameter-free outlier detection
Nie et al. Implicit weight learning for multi-view clustering
Chebbout et al. Comparative study of clustering based colour image segmentation techniques
KR20160095735A (en) Method and system for complex and multiplex emotion recognition of user face
Barbosa et al. Visualizing and interacting with kernelized data
Tsai et al. Dimensionality reduction techniques for data exploration
CN108229552B (en) Model processing method and device and storage medium
Du et al. An Improved Algorithm Based on Fast Search and Find of Density Peak Clustering for High‐Dimensional Data
CN110781802B (en) Face image recognition method based on information theory manifold
CN109658489B (en) Three-dimensional grid data processing method and system based on neural network
Chen et al. Angle-based embedding quality assessment method for manifold learning
WO2013044484A1 (en) Method and apparatus for robust similarity comparison of 3d models
CN107451238B (en) Visual analysis method and system for exploring inherent low-dimensional structure of high-dimensional data
Li et al. Kernel hierarchical agglomerative clustering-comparison of different gap statistics to estimate the number of clusters
Yin et al. A novel image retrieval method for image based localization in large-scale environment
Wu et al. Conic fitting: New easy geometric method and revisiting sampson distance

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20200623

Termination date: 20210726

CF01 Termination of patent right due to non-payment of annual fee