CN114722926A - Convolution clustering method for scale vortex time sequence diagram in towed sensor array - Google Patents

Convolution clustering method for scale vortex time sequence diagram in towed sensor array Download PDF

Info

Publication number
CN114722926A
CN114722926A CN202210311899.3A CN202210311899A CN114722926A CN 114722926 A CN114722926 A CN 114722926A CN 202210311899 A CN202210311899 A CN 202210311899A CN 114722926 A CN114722926 A CN 114722926A
Authority
CN
China
Prior art keywords
clustering
time sequence
graph
vector
similarity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210311899.3A
Other languages
Chinese (zh)
Inventor
年睿
李秋颖
何波
高爽
翟颖
张卉
都奕
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ocean University of China
Original Assignee
Ocean University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ocean University of China filed Critical Ocean University of China
Priority to CN202210311899.3A priority Critical patent/CN114722926A/en
Publication of CN114722926A publication Critical patent/CN114722926A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A90/00Technologies having an indirect contribution to adaptation to climate change
    • Y02A90/10Information and communication technologies [ICT] supporting adaptation to climate change, e.g. for weather forecasting or climate simulation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • Computing Systems (AREA)
  • Molecular Biology (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array. The method comprises the following steps: building an integral link of a towing system; laying and recovering a towed sensor array to obtain time sequence data; calculating the similarity of time sequence segmented characteristic vectors by using a continuous bag-of-words model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the category by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting the key characteristic vector category with stage representativeness of the mesoscale vortex as a label; establishing and training a graph convolution network model, establishing an affinity graph, inputting a sample to be tested into the trained GCN, and obtaining representative clustering output of a mesoscale vortex stage; and (4) carrying out secondary clustering based on the graph convolution network to finally obtain a clustering result of the time sequence. According to the invention, a high-accuracy time sequence classification result is finally obtained through twice clustering.

Description

Convolution clustering method for scale vortex time sequence diagram in towed sensor array
Technical Field
The invention relates to the technical field of marine observation and deep learning, in particular to a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array, and belongs to the technical field of time sequence classification.
Background
The ocean is the largest resource in the world, and abundant mineral resources and oil gas resources are stored. In the exploration activities of various marine resources, the marine towing system plays an important role and role, not only relates to the exploration and exploration of the marine resources, but also is very important in the fields of civilian use, fishery, military use and the like. The ocean science is used for researching the natural phenomena, properties and change rules of the ocean and developing and utilizing a knowledge system related to the ocean; the research objects are the oceans occupying 71 percent of the surface of the ball, including seawater, substances dissolved and suspended in seawater, organisms living in the oceans, seabed sediments and seabed rocky circles, and atmospheric boundary layers and estuary coastal zones on the sea surface. Thus, marine science is an important component of the earth science. The research field of marine science is very wide, and the main contents of the research field comprise basic research on physical, chemical, biological and geological processes in the sea and application research for marine resource development and utilization, marine military activities and the like. Towing systems are currently a powerful tool that can meet the above requirements. The towing system integrates multidisciplinary sensors, and the observation elements comprise temperature, salinity, pressure and conductivity. In order to guarantee the observation precision and the observation depth, the dragging system is required to have good hydrodynamic performance, the water exchange channel is smooth, and the sensor has quick response capability and high-precision measurement performance.
The Graph Convolutional neural Network (GCN) is a method for processing Graph domain information based on deep learning, combines Graph broadcasting operation and deep learning algorithm, can enable the structure information and vertex attribute information of a Graph to participate in learning, shows good effect and interpretability in the applications of vertex classification, Graph classification, link prediction and the like, and becomes a widely applied Graph analysis method.
But the GCN semi-supervised learning also has certain defects: (1) if the number of nodes with label is too small during semi-supervision, the performance of the GCN is seriously reduced. (2) The shallow GCN network cannot spread label information over a large area (the deeper the hierarchy, the larger the receptive field of the nodes) (3) the deep GCN network causes the problem of excessive smoothing.
Disclosure of Invention
The invention aims to provide a convolution clustering method for a scale vortex time sequence diagram in a towed sensor array, which is used for solving the problems.
In order to achieve the purpose, the invention provides a specific technical scheme that:
a convolution clustering method for a scale vortex time series diagram in a towed sensor array comprises the following steps:
s1: building an integral link of a towing system;
s2, laying and recycling the towed sensor array to acquire three-dimensional section observation data of the towed sensor array to obtain time sequence data;
s3: calculating the similarity of time sequence segmented feature vectors by using a Continuous Bag-of-Words (CBOW) model in the word vectors, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by using a clustering center, dividing the segmented time sequence feature vector value represented by the feature vectors into a training sample and a sample to be tested, and selecting the category of key feature vectors with stage representativeness of the mesoscale vortex as a label;
s4: establishing and training a Graph Convolutional Network (GCN) model, inputting the characteristic vector representation of the original time sequence segmentation, ideally outputting the stage representative time sequence characteristic vector category of the life cycle of the mesoscale vortex, establishing an affinity Graph, inputting a sample to be tested to the trained GCN, and obtaining possible mesoscale vortex stage representative clustering output.
Further, in S1, the building of the overall link of the towing system includes the following steps:
the towing system consists of a deck unit, a towing chain, a fixed-depth submersible vehicle, an electrode and the like. The total length of the towing cable is L and totally comprises NsA sensor integration module, wherein the distance from the ith node to the previous node is set as Li
In S2, the deployment and recovery of the towed sensor (towed optical, temperature, salinity, pressure sensor) array includes the following steps:
1) the towing chain laying and recovering process is implemented by a deck unit winch, an A frame and a module assembling and disassembling device. Let the navigational speed be v, the towing time be T, and the length of the cable corresponding to the ith node from the water surface be liThe fixed-depth submersible vehicle ensures that the bottom end of the towing chain is observed in a three-dimensional section within a certain depth, and the tension F (t) is tested in real time to ensure that the bottom end of the towing chain is at the bearing capacity limit F of the winchmaxWithin.
2) Based on the electromagnetic coupling principle, the towing system realizes non-contact power supply and data transmission in the same transmission link, and the overwater electric control system converts direct current into high-frequency alternating current signals, supplies power to the underwater sensor integration module and carries out data communication. Setting the time sequence of the absorbance, the fluorescence, the temperature, the conductivity and the pressure collected by the ith sensor integration module at the jth time as Ai,j(t),Fi,j(t),Ti,j(t),Ci,j(t),Di,j(t),t∈(0,T),i=1,2,…,Ns,j=1,2,…,NsThe set of multi-dimensional stereo observation time series is Si,j(t)。
Further, the S3 is specifically as follows:
1) time series data Si,j(t) dividing into m subsequences, and setting the subsequence S of the f-th sectioni,j,f(T) has a length of T ', T ∈ [0, T']The subsequence is also divided into a training subsequence and a subsequence to be detected;
2) establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequences, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting An hierarchical clustering algorithm (An iterative clustering algorithm).
Further, the classifying specifically includes: firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechettstance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein (m.gtoreq.n). The discrete Fourier distance is used for evaluating nearest neighbor connection, and the method has the advantages of high diagnosis accuracy, high speed and strong adaptability because the method simultaneously considers the sequencing of positions and time and the factor of spatial distance.
Wherein the discrete Fourier distance evaluation nearest neighbor connections are as follows:
let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,…,up) And σ (C)j)=(u1,…,uq) The following sequence point pairs L can be obtained as shown in the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ; for any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi
CjThe length L between sequences between feature points is expressed as follows:
Figure RE-GDA0003606855270000031
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
the quality of the clustering task is evaluated using the silhouette distance, which is given by the following formula for the ith (i vector distance to other points in the cluster) data point:
Figure RE-GDA0003606855270000032
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
Further, the S4 includes the following:
establishing an affinity graph, and calculating a corresponding sparse symmetric adjacency matrix as
Figure RE-GDA0003606855270000033
N is the total number of nodes of the low-dimensional mapping of the scale vortex data in the historical observation, and D is the characteristic dimension of the data obtained by detection in the given translation window;
the GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
Figure RE-GDA0003606855270000041
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.
Figure RE-GDA0003606855270000042
I is an identity matrix, wherein
Figure RE-GDA0003606855270000043
Angle matrix
Figure RE-GDA0003606855270000044
Is composed of
Figure RE-GDA0003606855270000045
FlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn
Figure RE-GDA0003606855270000046
Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layer; fLThe predicted affinity graph aggregates information from the neighborhood and encodes the graph structure, which is trained with a pair of features connected by the affinity graph edges as input to the classifier. The use minimizes the supervised contrast Loss (Supervised diagnostic Loss) between the predicted edge confidence and the true edge label.
The edges in the affinity graph are composed of dense connections within each mesoscale vortex propagation process cluster and sparse connections between approximate clusters. The structure retention sampling is to recombine training nodes in a cluster, the target is a subgraph formed by data characteristic vectors obtained by sampling mesoscale vortex detection, and a representative retention sample has the capability of representing the self structural characteristics of the mesoscale vortex, namely the edge connection in the cluster and the connection between near clusters. At the same time, the approximate clusters of the subgraph will also be sampled and the edges between the approximate clusters will be used as negative samples according to probability to improve performance gain. Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn selecting K at randommEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
when clustering is carried out, the method willThe attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to the GCN to predict edge scores, graph analysis is directly trimmed according to the predicted edge scores, graph refinement is carried out to calculate node intimacy, further unrelated edges are deleted, and then a clustering reasoning result is obtained. Suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrix
Figure RE-GDA0003606855270000047
The mutual adjacency number of the node pair is
Figure RE-GDA0003606855270000048
Figure RE-GDA0003606855270000049
Each element in (1)
Figure RE-GDA00036068552700000410
Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
Figure RE-GDA00036068552700000411
through the steps, the nodes can be classified, and then the representative time sequence of the mesoscale vortex stage is classified.
The invention has the advantages and technical effects that:
the invention uses a word vector model to carry out similarity calculation on time sequence data, carries out hierarchical clustering, forms a new characteristic vector, constructs an affinity graph, and carries out secondary clustering based on a Graph Convolution Network (GCN) to finally obtain a clustering result of a time sequence. According to the invention, a high-accuracy time sequence classification result is finally obtained through twice clustering.
Drawings
FIG. 1 is a schematic overall flow chart of the present invention.
FIG. 2 is a block diagram illustrating a specific process of the present invention.
Detailed Description
In order to make the objects, embodiments and advantages of the present invention clearer, the present invention is further described in detail below by way of specific examples with reference to the accompanying drawings.
Example (b):
a convolution clustering method for a scale vortex time series diagram in a towed sensor array comprises the following steps (the specific flow is shown in figure 2):
firstly, collecting multi-dimensional stereo observation time sequence data of a towed sensor array through the arrangement and recovery of a marine test towed optical sensor array, a temperature sensor array, a salinity sensor array and a pressure sensor array, as shown in figure 1.
1. The towing system consists of a deck unit, a towing chain, a fixed-depth submersible vehicle, an electrode and the like. The total length of the towing cable is L and totally comprises NsA sensor integration module, wherein the distance from the ith node to the previous node is set as Li. The towing chain laying and recovering process is implemented by a deck unit winch, an A frame and a module assembling and disassembling device. Let the navigational speed be v, the towing time be T, and the length of the cable corresponding to the ith node from the water surface be liThe fixed-depth underwater vehicle ensures that the bottom end of the towing chain is observed in a three-dimensional section in a certain depth, and the tension F (t) is tested in real time to ensure that the bottom end of the towing chain is positioned at the bearing capacity limit F of the winchmaxWithin.
2. Setting the time sequence of the absorbance, the fluorescence, the temperature, the conductivity and the pressure collected by the ith sensor integration module at the jth time as Ai,j(t),Fi,j(t),Ti,j(t),Ci,j(t),Di,j(t),t∈(0,T),i=1,2,…,Ns,j=1,2,…,NsThe set of multi-dimensional stereo observation time series is Si,j(t)。
Secondly, calculating the similarity of the time sequence segmented characteristic vectors by using a Continuous Bag-of-Words (CBOW) model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the class by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting the class of the key characteristic vectors of which the mesoscale vortexes have stage representativeness as a label;
1. multidimensional stereo observation time sequence S for sensor integrated modulei,j(t) dividing the sequence into m subsequences, and setting the subsequence S of the f-th segmenti,j,f(T) has a length of T ', T ∈ [0, T']。
2. Establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequences, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting An hierarchical clustering algorithm (An iterative clustering algorithm). Firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechet distance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein (m.gtoreq.n). The discrete Fourier distance is used for evaluating nearest neighbor connection, and the method has the advantages of high diagnosis accuracy, high speed and strong adaptability because the method simultaneously considers the sequencing of positions and time and the factor of spatial distance. Let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,…,up) And σ (C)j)=(u1,...,uq) Can be obtained asThe following sequence point pairs L are shown as the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ. For any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi
CjThe length L between sequences between feature points is expressed as follows:
Figure RE-GDA0003606855270000061
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
the quality of the clustering task was evaluated using the silhouette distance, which is given by the following equation for the ith (i vector distance to other points in the cluster) data point
Figure RE-GDA0003606855270000062
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
And step three, establishing and training a Graph Convolutional Network (GCN) model, inputting the feature vector representation of the original time sequence segment, ideally outputting the class of the phase representative time sequence feature vector of the life cycle of the mesoscale vortex, establishing an affinity Graph, inputting the sample to be detected to the trained GCN, and obtaining possible mesoscale vortex phase representative clustering output.
Computing a corresponding sparse symmetric adjacency matrix as
Figure RE-GDA0003606855270000071
N isAnd D is the characteristic dimension of the data obtained by detection in the given translation window.
The GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
Figure RE-GDA0003606855270000072
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.
Figure RE-GDA0003606855270000073
I is an identity matrix, wherein
Figure RE-GDA0003606855270000074
Angle matrix
Figure RE-GDA0003606855270000075
Is composed of
Figure RE-GDA0003606855270000076
FlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn
Figure RE-GDA0003606855270000077
Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layers; fLThe predicted affinity graph aggregates information from the neighborhood and encodes the graph structure, which is trained with a pair of features connected by the affinity graph edges as input to the classifier. The use minimizes the supervised contrast Loss (Supervised diagnostic Loss) between the predicted edge confidence and the true edge label.
The edges in the affinity graph are composed of dense connections within each mesoscale vortex propagation process cluster and sparse connections between approximate clusters. The structure-preserving sampling recombines training nodes in a cluster, and the target is sampling mesoscale vortex detectionAnd (3) forming a subgraph by the obtained data feature vector, wherein the representative retained sample has the capability of representing the self structural characteristics of the mesoscale vortex, namely the edge connection in the cluster and the connection between the near clusters. At the same time, the approximate clusters of the subgraph will also be sampled and the edges between the approximate clusters will be used as negative samples according to probability to improve performance gain. Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn random selection of KmEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
during clustering reasoning, the attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to a GCN to predict edge scores, graph analysis is directly pruned according to the predicted edge scores, and graph refinement is carried out to calculate the node intimacy and further delete unrelated edges, so that a clustering reasoning result is obtained. Suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrix
Figure RE-GDA0003606855270000078
The mutual adjacency number of the node pair is
Figure RE-GDA0003606855270000079
Figure RE-GDA00036068552700000710
Each element in (1)
Figure RE-GDA00036068552700000711
Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
Figure RE-GDA00036068552700000712
Figure RE-GDA0003606855270000081
on the basis of the above embodiments, the present invention continues to describe the technical features and functions of the technical features in the present invention in detail to help those skilled in the art fully understand the technical solutions of the present invention and reproduce them.

Claims (5)

1. A convolution clustering method for a scale vortex time series diagram in a towed sensor array is characterized by comprising the following steps:
s1: building an integral link of a towing system;
s2, laying and recycling the towed sensor array to acquire three-dimensional section observation data of the towed sensor array to obtain time sequence data;
s3: calculating the similarity of time sequence segmented characteristic vectors by using a continuous bag-of-words model in the word vectors, carrying out hierarchical clustering, uniformly representing the characteristic vectors in the category by using a clustering center, dividing the segmented time sequence characteristic vector value represented by the similarity into a training sample and a sample to be tested, and selecting a key characteristic vector category with stage representativeness of the mesoscale vortex as a label;
s4: establishing and training a graph convolution network model, inputting the segmented characteristic vector representation of an original time sequence, ideally outputting the class of the phase representative time sequence characteristic vector of the life cycle of the mesoscale vortex, establishing an affinity graph, inputting a sample to be tested to the trained GCN, and obtaining possible mesoscale vortex phase representative clustering output.
2. The clustering method according to claim 1, wherein the S3 is specifically as follows:
1) time series dataSi,j(t) dividing the sequence into m subsequences, and setting the subsequence S of the f-th segmenti,j,f(T) has a length of T ', T ∈ [0, T']The subsequence is also divided into a training subsequence and a subsequence to be detected;
2) establishing a continuous bag-of-words model CBOW in word vectors, calculating the similarity of the segmented feature vectors of the training time sequence, carrying out hierarchical clustering, uniformly representing the feature vectors in the category by a clustering center, carrying out classifier training, inputting the subsequence to be tested into the trained classifier for classification, and classifying the subsequence by adopting a hierarchical clustering algorithm.
3. The clustering method according to claim 2, wherein the classifying specifically comprises: firstly, the time series data Si,j(t) m subsequences, the f-th subsequence Si,j,f(t) input into the continuous bag of words model CBOW, for Si,j,f(t) extracting the features to obtain a feature vector C1,C2............CmAssuming that each eigenvector is a cluster class, performing similarity calculation by using a discrete Fourier distance (Frechet distance), merging two cluster classes with the highest similarity, and updating a similarity matrix; dividing the track set into a certain number of clusters by using a hierarchical clustering algorithm, and finally obtaining new characteristic vectors K after clustering1,K2,............,KnWherein m is more than or equal to n.
4. The clustering method according to claim 3, wherein the discrete Fourier distance evaluation nearest neighbor connections are as follows:
let feature vector CiIs composed of p feature points, feature vector CjThe device consists of q track points; using σ (C)i) And σ (C)j) Representing the sequential sets of two feature points, respectively, then has σ (C)i)=(u1,...,up) And σ (C)j)=(u1,…,uq) The following sequence point pairs L can be obtained as shown in the formula:
L=(ua1,vb1),(ua2,vb2),...,(uam,vbm) (1)
wherein, a1=1,b1=1,am=p,bmQ; for any i 1,2,3, ai+1=aiOr ai+1=ai+1And bi+1=bi
CjThe length L between sequences between feature points is expressed as follows:
Figure RE-FDA0003606855260000021
then its discrete frochet distance is defined as follows:
δF(Ci,Cj)=min||L|| (3)
using silhouette distance to assess the quality of the clustering task, for the ith data point: distance data points of the i vector to other points in the cluster, the silhouette distance is given by the following equation
Figure RE-FDA0003606855260000022
and a (i) is the average dissimilarity between the ith data point and all other points in the cluster to which the ith data point belongs.
5. The clustering method according to claim 1, wherein the S4 comprises the following:
1) establishing an affinity graph, and calculating a corresponding sparse symmetric adjacency matrix as
Figure RE-FDA0003606855260000023
N is the total number of nodes of the low-dimensional mapping of the scale vortex data in the historical observation, and D is the characteristic dimension of the data obtained by detection in the given translation window;
the GCN is used as a backbone network to further extract the obvious connection relation, and the L-th layer calculation process is as follows:
Figure RE-FDA0003606855260000024
adding self-loop edges (self-loops) to the adjacency matrix A, i.e.
Figure RE-FDA0003606855260000025
I is an identity matrix, wherein
Figure RE-FDA0003606855260000031
Angle matrix
Figure RE-FDA0003606855260000032
Is composed of
Figure RE-FDA0003606855260000033
FlDenotes the l-th layer embedding characteristics, F0As input feature vector K1,K2,............,Kn
Figure RE-FDA0003606855260000034
Is a learnable matrix, maps the embedded features to a new space, and σ is a nonlinear activation function ReLU; fLAn output feature map representing the L layers; fLThe predicted affinity graph gathers information from the neighborhood and encodes the graph structure, and a pair of features connected by the edges of the affinity graph are used as classifier input during training; the use minimizes the Supervised contrast Loss (Supervised contrast Loss) between the predicted edge confidence and the true edge label.
2) Firstly, randomly selecting M clusters as samples, and expanding the M clusters to N neighbor clusters to obtain a subgraph Q consisting of M multiplied by N clustersn(ii) a Introducing a cluster randomness strategy from QnIn random selection of KnIndividual clustering, and sample randomness strategy, from QnIn selecting K at randommEach node is used for reconstructing an affinity graph Q based on the sampling nodes;
3) when clustering is carried out, the method willThe attributes of the mesoscale vortex samples to be classified are represented in the same mode and are sent to a GCN to predict edge scores, graph analysis is directly pruned according to the predicted edge scores, and graph refinement is carried out to calculate the node intimacy and further delete unrelated edges so as to obtain a clustering reasoning result; suppose two nodes NiAnd NjAre each connected to niAnd njAnd the edge is provided with k common neighbor nodes, wherein C represents a clustering function, and the node affinity I can be represented as an aggregation operation:
I=C(k/ni,k/nj) (6)
setting given adjacency matrix
Figure RE-FDA0003606855260000035
The mutual adjacency number of the node pair is
Figure RE-FDA0003606855260000036
Figure RE-FDA0003606855260000037
Each element in (1)
Figure RE-FDA0003606855260000038
Represents NiAnd NjP represents a maximum function, v represents a vector (vector) function, and then the node affinity I is expressed as:
Figure RE-FDA0003606855260000039
through the steps, the nodes can be classified, and then the representative time sequence of the mesoscale vortex stage is classified.
CN202210311899.3A 2022-03-28 2022-03-28 Convolution clustering method for scale vortex time sequence diagram in towed sensor array Pending CN114722926A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210311899.3A CN114722926A (en) 2022-03-28 2022-03-28 Convolution clustering method for scale vortex time sequence diagram in towed sensor array

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210311899.3A CN114722926A (en) 2022-03-28 2022-03-28 Convolution clustering method for scale vortex time sequence diagram in towed sensor array

Publications (1)

Publication Number Publication Date
CN114722926A true CN114722926A (en) 2022-07-08

Family

ID=82240389

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210311899.3A Pending CN114722926A (en) 2022-03-28 2022-03-28 Convolution clustering method for scale vortex time sequence diagram in towed sensor array

Country Status (1)

Country Link
CN (1) CN114722926A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116361678A (en) * 2023-05-26 2023-06-30 西南石油大学 Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal
WO2024036758A1 (en) * 2022-08-15 2024-02-22 广州广电运通金融电子股份有限公司 Automatic image annotation method and apparatus, device, and medium

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2024036758A1 (en) * 2022-08-15 2024-02-22 广州广电运通金融电子股份有限公司 Automatic image annotation method and apparatus, device, and medium
CN116361678A (en) * 2023-05-26 2023-06-30 西南石油大学 Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal
CN116361678B (en) * 2023-05-26 2023-08-25 西南石油大学 Graph enhancement structure-based quasi-periodic time sequence segmentation method and terminal

Similar Documents

Publication Publication Date Title
Han et al. Marine organism detection and classification from underwater vision based on the deep CNN method
CN109086824B (en) Seabed substrate sonar image classification method based on convolutional neural network
CN113255733B (en) Unsupervised anomaly detection method under multi-modal data loss
CN110111345B (en) Attention network-based 3D point cloud segmentation method
CN109871875B (en) Building change detection method based on deep learning
CN114722926A (en) Convolution clustering method for scale vortex time sequence diagram in towed sensor array
CN112101085A (en) Adaptive intelligent fault diagnosis method based on importance weighted domain impedance
US20230134531A1 (en) Method and system for rapid retrieval of target images based on artificial intelligence
CN113988147B (en) Multi-label classification method and device for remote sensing image scene based on graph network, and multi-label retrieval method and device
CN112307410A (en) Seawater temperature and salinity information time sequence prediction method based on shipborne CTD measurement data
CN114241208A (en) Local and global depth contrast feature learning network construction method for inter-image change detection
CN111209813B (en) Remote sensing image semantic segmentation method based on transfer learning
CN114660598A (en) InSAR and CNN-AFSA-SVM fused mining subsidence basin automatic detection method
Wang et al. Fused adaptive receptive field mechanism and dynamic multiscale dilated convolution for side-scan sonar image segmentation
Jalilzadeh et al. Machine learning method for predicting the depth of shallow lakes using multi-band remote sensing images
Juliani et al. Deep learning of terrain morphology and pattern discovery via network-based representational similarity analysis for deep-sea mineral exploration
CN115147727A (en) Method and system for extracting impervious surface of remote sensing image
CN111242028A (en) Remote sensing image ground object segmentation method based on U-Net
CN105809200B (en) Method and device for autonomously extracting image semantic information in bioauthentication mode
Yasin et al. Automatic pore structure analysis in organic-rich shale using FIB-SEM and attention U-Net
CN116597313B (en) Ship optical image wake detection method based on improvement YOLOv7
CN104851090B (en) Image change detection method and device
Zhang et al. Mathematical models for information classification and recognition of multi-target optical remote sensing images
Li et al. Ctmu-net: an improved u-net for semantic segmentation of remote-sensing images based on the combined attention mechanism
Wang et al. Mixed seabed sediment classification based on transferred convolutional neural network: A case study in the ancient river valley

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination