CN114722926A - Convolution clustering method for scale vortex time sequence diagram in towed sensor array - Google Patents
Convolution clustering method for scale vortex time sequence diagram in towed sensor array Download PDFInfo
- Publication number
- CN114722926A CN114722926A CN202210311899.3A CN202210311899A CN114722926A CN 114722926 A CN114722926 A CN 114722926A CN 202210311899 A CN202210311899 A CN 202210311899A CN 114722926 A CN114722926 A CN 114722926A
- Authority
- CN
- China
- Prior art keywords
- clustering
- time sequence
- graph
- vector
- similarity
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02A—TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
- Y02A90/00—Technologies having an indirect contribution to adaptation to climate change
- Y02A90/10—Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Artificial Intelligence (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Software Systems (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- Molecular Biology (AREA)
- General Health & Medical Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Image Analysis (AREA)
Abstract
The invention discloses a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array. The method comprises the following steps: building an integral link of a towing system; laying and recovering a towed sensor array to obtain time sequence data; calculating the similarity of time sequence segmented characteristic vectors by using a continuous bag-of-words model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the category by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting the key characteristic vector category with stage representativeness of the mesoscale vortex as a label; establishing and training a graph convolution network model, establishing an affinity graph, inputting a sample to be tested into the trained GCN, and obtaining representative clustering output of a mesoscale vortex stage; and (4) carrying out secondary clustering based on the graph convolution network to finally obtain a clustering result of the time sequence. According to the invention, a high-accuracy time sequence classification result is finally obtained through twice clustering.
Description
Technical Field
The invention relates to the technical field of marine observation and deep learning, in particular to a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array, and belongs to the technical field of time sequence classification.
Background
The ocean is the largest resource in the world, and abundant mineral resources and oil gas resources are stored. In the exploration activities of various marine resources, the marine towing system plays an important role and role, not only relates to the exploration and exploration of the marine resources, but also is very important in the fields of civilian use, fishery, military use and the like. The ocean science is used for researching the natural phenomena, properties and change rules of the ocean and developing and utilizing a knowledge system related to the ocean; the research objects are the oceans occupying 71 percent of the surface of the ball, including seawater, substances dissolved and suspended in seawater, organisms living in the oceans, seabed sediments and seabed rocky circles, and atmospheric boundary layers and estuary coastal zones on the sea surface. Thus, marine science is an important component of the earth science. The research field of marine science is very wide, and the main contents of the research field comprise basic research on physical, chemical, biological and geological processes in the sea and application research for marine resource development and utilization, marine military activities and the like. Towing systems are currently a powerful tool that can meet the above requirements. The towing system integrates multidisciplinary sensors, and the observation elements comprise temperature, salinity, pressure and conductivity. In order to guarantee the observation precision and the observation depth, the dragging system is required to have good hydrodynamic performance, the water exchange channel is smooth, and the sensor has quick response capability and high-precision measurement performance.
The Graph Convolutional neural Network (GCN) is a method for processing Graph domain information based on deep learning, combines Graph broadcasting operation and deep learning algorithm, can enable the structure information and vertex attribute information of a Graph to participate in learning, shows good effect and interpretability in the applications of vertex classification, Graph classification, link prediction and the like, and becomes a widely applied Graph analysis method.
But the GCN semi-supervised learning also has certain defects: (1) if the number of nodes with label is too small during semi-supervision, the performance of the GCN is seriously reduced. (2) The shallow GCN network cannot spread label information over a large area (the deeper the hierarchy, the larger the receptive field of the nodes) (3) the deep GCN network causes the problem of excessive smoothing.
Disclosure of Invention
The invention aims to provide a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array, which is used for solving the problems.
In order to achieve the purpose, the invention provides a specific technical scheme that:
a convolution clustering method for a scale vortex time series diagram in a towed sensor array comprises the following steps:
s1: building an integral link of a towing system;
s2, laying and recycling the towed sensor array to acquire three-dimensional section observation data of the towed sensor array to obtain time sequence data;
s3: calculating the similarity of time sequence segmented feature vectors by using a Continuous Bag-of-Words (CBOW) model in the word vectors, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by using a clustering center, dividing the segmented time sequence feature vector value represented by the feature vectors into a training sample and a sample to be tested, and selecting the category of key feature vectors with stage representativeness of the mesoscale vortex as a label;
s4: establishing and training a Graph Convolutional Network (GCN) model, inputting the characteristic vector representation of the original time sequence segmentation, ideally outputting the stage representative time sequence characteristic vector category of the life cycle of the mesoscale vortex, establishing an affinity Graph, inputting a sample to be tested to the trained GCN, and obtaining possible mesoscale vortex stage representative clustering output.
Further, in S1, the building of the overall link of the towing system includes the following steps:
the towing system consists of a deck unit, a towing chain, a fixed-depth submersible vehicle, an electrode and the like. The total length of the towing cable is L and totally comprises NsA sensor integration module, wherein the distance from the ith node to the previous node is set as Li。
In S2, the deployment and recovery of the towed sensor (towed optical, temperature, salinity, pressure sensor) array includes the following steps:
1) the towing chain laying and recovering process is implemented by a deck unit winch, an A frame and a module assembling and disassembling device. Let the navigational speed be v, the towing time be T, and the length of the cable corresponding to the ith node from the water surface be liThe fixed-depth submersible vehicle ensures that the bottom end of the towing chain is observed in a three-dimensional section within a certain depth, and the tension F (t) is tested in real time to ensure that the bottom end of the towing chain is at the bearing capacity limit F of the winchmaxWithin.
2) Based on the electromagnetic coupling principle, the towing system realizes non-contact power supply and data transmission in the same transmission link, and the overwater electric control system converts direct current into high-frequency alternating current signals, supplies power to the underwater sensor integration module and carries out data communication. Setting the time sequence of the absorbance, the fluorescence, the temperature, the conductivity and the pressure collected by the ith sensor integration module at the jth time as Ai,j(t),Fi,j(t),Ti,j(t),Ci,j(t),Di,j(t),t∈(0,T),i=1,2,…,Ns,j=1,2,…,NsThe set of multi-dimensional stereo observation time series is Si,j(t)。
Further, the S3 is specifically as follows:
1) time series data Si,j(t) dividing into m subsequences, and setting the subsequence S of the f-th sectioni,j,f(T) has a length of T ', T ∈ [0, T']The subsequence is also divided into a training subsequence and a subsequence to be detected;
2) establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequences, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting An hierarchical clustering algorithm (An iterative clustering algorithm).
Further, the classifying specifically includes: firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechettstance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein (m.gtoreq.n). The discrete Fourier distance is used for evaluating nearest neighbor connection, and the method has the advantages of high diagnosis accuracy, high speed and strong adaptability because the method simultaneously considers the sequencing of positions and time and the factor of spatial distance.
Wherein the discrete Fourier distance evaluation nearest neighbor connections are as follows:
let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,…,up) And σ (C)j)=(u1,…,uq) The following sequence point pairs L can be obtained as shown in the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ; for any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi;
CjThe length L between sequences between feature points is expressed as follows:
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
the quality of the clustering task is evaluated using the silhouette distance, which is given by the following formula for the ith (i vector distance to other points in the cluster) data point:
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
Further, the S4 includes the following:
establishing an affinity graph, and calculating a corresponding sparse symmetric adjacency matrix asN is the total number of nodes of the low-dimensional mapping of the scale vortex data in the historical observation, and D is the characteristic dimension of the data obtained by detection in the given translation window;
the GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.I is an identity matrix, whereinAngle matrixIs composed ofFlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn;Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layer; fLThe predicted affinity graph aggregates information from the neighborhood and encodes the graph structure, which is trained with a pair of features connected by the affinity graph edges as input to the classifier. The use minimizes the supervised contrast Loss (Supervised diagnostic Loss) between the predicted edge confidence and the true edge label.
The edges in the affinity graph are composed of dense connections within each mesoscale vortex propagation process cluster and sparse connections between approximate clusters. The structure retention sampling is to recombine training nodes in a cluster, the target is a subgraph formed by data characteristic vectors obtained by sampling mesoscale vortex detection, and a representative retention sample has the capability of representing the self structural characteristics of the mesoscale vortex, namely the edge connection in the cluster and the connection between near clusters. At the same time, the approximate clusters of the subgraph will also be sampled and the edges between the approximate clusters will be used as negative samples according to probability to improve performance gain. Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn selecting K at randommEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
when clustering is carried out, the method willThe attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to the GCN to predict edge scores, graph analysis is directly trimmed according to the predicted edge scores, graph refinement is carried out to calculate node intimacy, further unrelated edges are deleted, and then a clustering reasoning result is obtained. Suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrixThe mutual adjacency number of the node pair is Each element in (1)Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
through the steps, the nodes can be classified, and then the representative time sequence of the mesoscale vortex stage is classified.
The invention has the advantages and technical effects that:
the invention uses a word vector model to carry out similarity calculation on time sequence data, carries out hierarchical clustering, forms a new characteristic vector, constructs an affinity graph, and carries out secondary clustering based on a Graph Convolution Network (GCN) to finally obtain a clustering result of a time sequence. According to the invention, a high-accuracy time sequence classification result is finally obtained through twice clustering.
Drawings
FIG. 1 is a schematic overall flow chart of the present invention.
FIG. 2 is a block diagram illustrating a specific process of the present invention.
Detailed Description
In order to make the objects, embodiments and advantages of the present invention clearer, the present invention is further described in detail below by way of specific examples with reference to the accompanying drawings.
Example (b):
a convolution clustering method for a scale vortex time series diagram in a towed sensor array comprises the following steps (the specific flow is shown in figure 2):
firstly, collecting multi-dimensional stereo observation time sequence data of a towed sensor array through the arrangement and recovery of a marine test towed optical sensor array, a temperature sensor array, a salinity sensor array and a pressure sensor array, as shown in figure 1.
1. The towing system consists of a deck unit, a towing chain, a fixed-depth submersible vehicle, an electrode and the like. The total length of the towing cable is L and totally comprises NsA sensor integration module, wherein the distance from the ith node to the previous node is set as Li. The towing chain laying and recovering process is implemented by a deck unit winch, an A frame and a module assembling and disassembling device. Let the navigational speed be v, the towing time be T, and the length of the cable corresponding to the ith node from the water surface be liThe fixed-depth underwater vehicle ensures that the bottom end of the towing chain is observed in a three-dimensional section in a certain depth, and the tension F (t) is tested in real time to ensure that the bottom end of the towing chain is positioned at the bearing capacity limit F of the winchmaxWithin.
2. Setting the time sequence of the absorbance, the fluorescence, the temperature, the conductivity and the pressure collected by the ith sensor integration module at the jth time as Ai,j(t),Fi,j(t),Ti,j(t),Ci,j(t),Di,j(t),t∈(0,T),i=1,2,…,Ns,j=1,2,…,NsThe set of multi-dimensional stereo observation time series is Si,j(t)。
Secondly, calculating the similarity of the time sequence segmented characteristic vectors by using a Continuous Bag-of-Words (CBOW) model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the class by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting the class of the key characteristic vectors of which the mesoscale vortexes have stage representativeness as a label;
1. multidimensional stereo observation time sequence S for sensor integrated modulei,j(t) dividing the sequence into m subsequences, and setting the subsequence S of the f-th segmenti,j,f(T) has a length of T ', T ∈ [0, T']。
2. Establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequences, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting An hierarchical clustering algorithm (An iterative clustering algorithm). Firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechet distance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein (m.gtoreq.n). The discrete Fourier distance is used for evaluating nearest neighbor connection, and the method has the advantages of high diagnosis accuracy, high speed and strong adaptability because the method simultaneously considers the sequencing of positions and time and the factor of spatial distance. Let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,…,up) And σ (C)j)=(u1,...,uq) Can be obtained asThe following sequence point pairs L are shown as the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ. For any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi。
CjThe length L between sequences between feature points is expressed as follows:
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
the quality of the clustering task was evaluated using the silhouette distance, which is given by the following equation for the ith (i vector distance to other points in the cluster) data point
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
And step three, establishing and training a Graph Convolutional Network (GCN) model, inputting the feature vector representation of the original time sequence segment, ideally outputting the class of the phase representative time sequence feature vector of the life cycle of the mesoscale vortex, establishing an affinity Graph, inputting the sample to be detected to the trained GCN, and obtaining possible mesoscale vortex phase representative clustering output.
Computing a corresponding sparse symmetric adjacency matrix asN isAnd D is the characteristic dimension of the data obtained by detection in the given translation window.
The GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.I is an identity matrix, whereinAngle matrixIs composed ofFlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn;Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layers; fLThe predicted affinity graph aggregates information from the neighborhood and encodes the graph structure, which is trained with a pair of features connected by the affinity graph edges as input to the classifier. The use minimizes the supervised contrast Loss (Supervised diagnostic Loss) between the predicted edge confidence and the true edge label.
The edges in the affinity graph are composed of dense connections within each mesoscale vortex propagation process cluster and sparse connections between approximate clusters. The structure-preserving sampling recombines training nodes in a cluster, and the target is sampling mesoscale vortex detectionAnd (3) forming a subgraph by the obtained data feature vector, wherein the representative retained sample has the capability of representing the self structural characteristics of the mesoscale vortex, namely the edge connection in the cluster and the connection between the near clusters. At the same time, the approximate clusters of the subgraph will also be sampled and the edges between the approximate clusters will be used as negative samples according to probability to improve performance gain. Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn random selection of KmEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
during clustering reasoning, the attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to a GCN to predict edge scores, graph analysis is directly pruned according to the predicted edge scores, and graph refinement is carried out to calculate the node intimacy and further delete unrelated edges, so that a clustering reasoning result is obtained. Suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrixThe mutual adjacency number of the node pair is Each element in (1)Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
on the basis of the above embodiments, the present invention continues to describe the technical features and functions of the technical features in the present invention in detail to help those skilled in the art fully understand the technical solutions of the present invention and reproduce them.
Claims (5)
1. A convolution clustering method for a scale vortex time series diagram in a towed sensor array is characterized by comprising the following steps:
s1: building an integral link of a towing system;
s2, laying and recycling the towed sensor array to acquire three-dimensional section observation data of the towed sensor array to obtain time sequence data;
s3: calculating the similarity of time sequence segmented characteristic vectors by using a continuous bag-of-words model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the category by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting a key characteristic vector category with stage representativeness of the mesoscale vortex as a label;
s4: establishing and training a graph convolution network model, inputting the segmented characteristic vector representation of an original time sequence, ideally outputting the class of the phase representative time sequence characteristic vector of the life cycle of the mesoscale vortex, establishing an affinity graph, inputting a sample to be tested to the trained GCN, and obtaining possible mesoscale vortex phase representative clustering output.
2. The clustering method according to claim 1, wherein the S3 is specifically as follows:
1) time series dataSi,j(t) dividing the sequence into m subsequences, and setting the subsequence S of the f-th segmenti,j,f(T) has a length of T ', T ∈ [0, T']The subsequence is also divided into a training subsequence and a subsequence to be detected;
2) establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequence, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting a hierarchical clustering algorithm.
3. The clustering method according to claim 2, wherein the classifying specifically comprises: firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechet distance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein m is more than or equal to n.
4. The clustering method according to claim 3, wherein the discrete Fourier distance evaluation nearest neighbor connections are as follows:
let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,...,up) And σ (C)j)=(u1,…,uq) The following sequence point pairs L can be obtained as shown in the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ; for any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi;
CjThe length L between sequences between feature points is expressed as follows:
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
using silhouette distance to assess the quality of the clustering task, for the ith data point: distance data points of the i vector to other points in the cluster, the silhouette distance is given by the following equation
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
5. The clustering method according to claim 1, wherein the S4 comprises the following:
1) establishing an affinity graph, and calculating a corresponding sparse symmetric adjacency matrix asN is the total number of nodes of the low-dimensional mapping of the scale vortex data in the historical observation, and D is the characteristic dimension of the data obtained by detection in the given translation window;
the GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.I is an identity matrix, whereinAngle matrixIs composed ofFlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn;Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layers; fLThe predicted affinity graph gathers information from the neighborhood and encodes the graph structure, and a pair of features connected by the edges of the affinity graph are used as classifier input during training; the use minimizes the Supervised contrast Loss (Supervised contrast Loss) between the predicted edge confidence and the true edge label.
2) Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn selecting K at randommEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
3) when clustering is carried out, the method willThe attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to a GCN to predict edge scores, graph analysis is directly pruned according to the predicted edge scores, and graph refinement is carried out to calculate the node intimacy and further delete unrelated edges so as to obtain a clustering reasoning result; suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrixThe mutual adjacency number of the node pair is Each element in (1)Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
through the steps, the nodes can be classified, and then the representative time sequence of the mesoscale vortex stage is classified.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210311899.3A CN114722926A (en) | 2022-03-28 | 2022-03-28 | Convolution clustering method for scale vortex time sequence diagram in towed sensor array |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202210311899.3A CN114722926A (en) | 2022-03-28 | 2022-03-28 | Convolution clustering method for scale vortex time sequence diagram in towed sensor array |
Publications (1)
Publication Number | Publication Date |
---|---|
CN114722926A true CN114722926A (en) | 2022-07-08 |
Family
ID=82240389
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202210311899.3A Pending CN114722926A (en) | 2022-03-28 | 2022-03-28 | Convolution clustering method for scale vortex time sequence diagram in towed sensor array |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN114722926A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116361678A (en) * | 2023-05-26 | 2023-06-30 | 西南石油大学 | Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal |
WO2024036758A1 (en) * | 2022-08-15 | 2024-02-22 | 广州广电运通金融电子股份有限公司 | Automatic image annotation method and apparatus, device, and medium |
-
2022
- 2022-03-28 CN CN202210311899.3A patent/CN114722926A/en active Pending
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2024036758A1 (en) * | 2022-08-15 | 2024-02-22 | 广州广电运通金融电子股份有限公司 | Automatic image annotation method and apparatus, device, and medium |
CN116361678A (en) * | 2023-05-26 | 2023-06-30 | 西南石油大学 | Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal |
CN116361678B (en) * | 2023-05-26 | 2023-08-25 | 西南石油大学 | Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Han et al. | Marine organism detection and classification from underwater vision based on the deep CNN method | |
CN109086824B (en) | Seabed substrate sonar image classification method based on convolutional neural network | |
CN113255733B (en) | Unsupervised anomaly detection method under multi-modal data loss | |
CN110111345B (en) | Attention network-based 3D point cloud segmentation method | |
CN109871875B (en) | Building change detection method based on deep learning | |
CN114722926A (en) | Convolution clustering method for scale vortex time sequence diagram in towed sensor array | |
CN112101085A (en) | Adaptive intelligent fault diagnosis method based on importance weighted domain impedance | |
US20230134531A1 (en) | Method and system for rapid retrieval of target images based on artificial intelligence | |
CN113988147B (en) | Multi-label classification method and device for remote sensing image scene based on graph network, and multi-label retrieval method and device | |
CN112307410A (en) | Seawater temperature and salinity information time sequence prediction method based on shipborne CTD measurement data | |
CN114241208A (en) | Local and global depth contrast feature learning network construction method for inter-image change detection | |
CN111209813B (en) | Remote sensing image semantic segmentation method based on transfer learning | |
CN114660598A (en) | InSAR and CNN-AFSA-SVM fused mining subsidence basin automatic detection method | |
Wang et al. | Fused adaptive receptive field mechanism and dynamic multiscale dilated convolution for side-scan sonar image segmentation | |
Jalilzadeh et al. | Machine learning method for predicting the depth of shallow lakes using multi-band remote sensing images | |
Juliani et al. | Deep learning of terrain morphology and pattern discovery via network-based representational similarity analysis for deep-sea mineral exploration | |
CN115147727A (en) | Method and system for extracting impervious surface of remote sensing image | |
CN111242028A (en) | Remote sensing image ground object segmentation method based on U-Net | |
CN105809200B (en) | Method and device for autonomously extracting image semantic information in bioauthentication mode | |
Yasin et al. | Automatic pore structure analysis in organic-rich shale using FIB-SEM and attention U-Net | |
CN116597313B (en) | Ship optical image wake detection method based on improvement YOLOv7 | |
CN104851090B (en) | Image change detection method and device | |
Zhang et al. | Mathematical models for information classification and recognition of multi-target optical remote sensing images | |
Li et al. | Ctmu-net: an improved u-net for semantic segmentation of remote-sensing images based on the combined attention mechanism | |
Wang et al. | Mixed seabed sediment classification based on transferred convolutional neural network: A case study in the ancient river valley |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |