CN113032378B - Ship behavior pattern mining method based on clustering algorithm and pattern mining - Google Patents
Ship behavior pattern mining method based on clustering algorithm and pattern mining Download PDFInfo
- Publication number
- CN113032378B CN113032378B CN202110247443.0A CN202110247443A CN113032378B CN 113032378 B CN113032378 B CN 113032378B CN 202110247443 A CN202110247443 A CN 202110247443A CN 113032378 B CN113032378 B CN 113032378B
- Authority
- CN
- China
- Prior art keywords
- track
- point
- data
- value
- clustering
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000005065 mining Methods 0.000 title claims abstract description 41
- 238000004422 calculation algorithm Methods 0.000 title claims abstract description 30
- 238000000034 method Methods 0.000 title claims abstract description 26
- 230000006835 compression Effects 0.000 claims abstract description 15
- 238000007906 compression Methods 0.000 claims abstract description 15
- 238000004140 cleaning Methods 0.000 claims abstract description 7
- 238000004364 calculation method Methods 0.000 claims abstract description 4
- 230000002159 abnormal effect Effects 0.000 claims description 6
- 238000012545 processing Methods 0.000 claims description 3
- 230000000750 progressive effect Effects 0.000 claims description 2
- 230000009466 transformation Effects 0.000 claims description 2
- 238000009412 basement excavation Methods 0.000 abstract description 4
- 230000000694 effects Effects 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 4
- 238000004458 analytical method Methods 0.000 description 3
- 238000011161 development Methods 0.000 description 3
- 238000007405 data analysis Methods 0.000 description 2
- 230000007123 defense Effects 0.000 description 2
- 238000007781 pre-processing Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000007621 cluster analysis Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007418 data mining Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 239000006185 dispersion Substances 0.000 description 1
- 238000002474 experimental method Methods 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 238000003909 pattern recognition Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/21—Design, administration or maintenance of databases
- G06F16/215—Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2458—Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
- G06F16/2465—Query processing support for facilitating data mining operations in structured databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Probability & Statistics with Applications (AREA)
- Evolutionary Computation (AREA)
- Remote Sensing (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Fuzzy Systems (AREA)
- Mathematical Physics (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- Quality & Reliability (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A ship behavior pattern mining method based on a clustering algorithm and pattern mining relates to the field of pattern mining of ships. The method mainly comprises the steps of obtaining tracks stored in a track database of a ship, performing calculation of two processes of data cleaning and track compression, clustering the processed track data to generate clusters, and taking the clusters as frequent item sets to perform frequent sequence mining. The invention optimizes the key point clusters in the data, can effectively utilize the clustering algorithm to excavate various ship data, thereby being capable of adapting to the characteristic of uneven large-scale data density of the ship and improving the quality and accuracy of behavior pattern excavation.
Description
Technical Field
Aiming at the characteristics of ship tracks, the invention designs a ship behavior pattern mining method by utilizing an improved clustering algorithm and a pattern mining algorithm, thereby realizing the mining of the behavior tracks of the ship in the course of navigation; relates to the field of mode excavation, in particular to the field of mode excavation of ships.
Background
With the development of emerging technologies such as big data, cloud service, artificial intelligence and the like in recent years, the ocean construction of China is also informationized and dataized. Meanwhile, the precision of basic technologies such as a positioning system, communication equipment, a sensor network and the like is improved, and the application is also wide. The large-scale data can be accurately collected and well stored due to the development of various technologies, the complicated large-scale data of ships is widely concerned by government, national defense units and related enterprises, and the full utilization of the data is expected through different technical methods. The ship behavior pattern mining based on the clustering algorithm and pattern mining can form a target historical behavior template by utilizing the track data of the ship through clustering and mining means, so that important auxiliary and reference functions can be provided for the application and processing of ship data.
Trajectory data is a set of moving points of a moving object in space-time, wherein the moving points comprise longitude and latitude, altitude, speed, time and other information of the object, and the trajectory data is a snapshot in the behavior activity of the moving object. Meanwhile, with the continuous development of navigation information technology, the automatic ship identification system (Automatic Identification System, AIS) is forcedly installed on the ship, so that massive AIS information data is generated, and the information data contains rich ship information. From the original track to the behavior pattern application, the original track data collected by the equipment can be filtered and processed through means such as data cleaning, data preprocessing and the like, noise, redundancy, stay points and the like, and then the data is used for pattern mining to form a relatively more accurate and reliable historical behavior track template of the moving target, and the activity rule seen by the target individual or group is displayed. By analyzing the track data through various technical means, the activity rules among the individuals and groups of the movable objects can be mined, and further support can be provided for track prediction, traffic planning and sea and air target monitoring.
Cluster analysis is an effective means of data mining, and has been widely used in fields of pattern recognition, data analysis, image processing, and the like. Frequent pattern mining is one of the directions of trace data pattern mining.
In the analysis of the clustering method, the fact that the ship track points are difficult to accurately repeat and the ship data density distribution is irregular is considered, meanwhile, the density-based clustering method is low in complexity, and the irregular-shaped data sets can be clustered according to the data density distribution characteristics. Therefore, the algorithm based on density clustering is determined to be more suitable for the characteristics of the ship data area dispersion non-shape rule and large data scale. The DBSCAN algorithm is a typical representation in a density-based spatial clustering algorithm, can divide a high-density point region into clusters, effectively filters a low-density point region, and can realize clustering of any shape in a data set containing noise. However, the DBSCAN algorithm is very sensitive to the parameters of Eps and MinPts, and incorrect values can cause poor clustering effect and even incorrect clustering effect.
According to the invention, the ship data set is clustered through the improved DBSCAN algorithm by preprocessing the ship original data set, clusters generated by clustering are used as data frequent items for pattern mining, and the pattern mining algorithm is used for mining the data frequent items, so that a behavior template sequence of the ship is mined.
Disclosure of Invention
The invention provides a ship behavior pattern mining method based on an improved clustering algorithm and a pattern mining algorithm.
The invention provides a method for mining a ship behavior pattern, and optimizes key point clusters in the behavior pattern, so that the clustering algorithm can be effectively utilized to mine various ship data, thereby being capable of adapting to the characteristic of uneven large-scale data density of ships and improving the quality and accuracy of mining the behavior pattern.
The invention adopts the following technical scheme and implementation steps:
a ship behavior pattern mining method based on a clustering algorithm and pattern mining,
Redundancy and noise are present for the original track data, which causes great inconvenience to the subsequent analysis, and therefore the original track data needs to be preprocessed to obtain track data for the subsequent analysis. And carrying out track clustering on the preprocessed track data, defining a frequent item set, and carrying out excavation of a frequent pattern.
The method is characterized by comprising the following steps of:
(1) The data cleaning and compressing method is provided for the characteristics of large space-time span and gentle steering amplitude of the original track target track:
① Cleaning of the data set is necessary because the raw data collected may have outliers. The main rules are as follows:
the time interval between the starting point and the end point of the track segment is more than one day (24 hours), and then the track segment is divided;
The maximum speed v max is defined, which is the maximum speed of the ship, and the maximum speed v max is set to be 110 km/h according to the ship overview and knowledge graph published by national defense industry press. Assuming that the locus point p (lon p,latp,tp) is the previous locus point of the locus point q (lon q,latq,tq) in the locus section, t p>tq, the speed between two points can be calculated as:
Wherein lon p is the longitude value of the track point p, lat p is the latitude value of the track point p, and t p is the time of generating the track point p; similarly, lon q is the longitude value of the track point q, lat q is the latitude value of the track point q, and t q is the time when the track point q is generated. HAVERSINE (lat p,lonp,latq,lonq) is the distance between two latitude and longitude points calculated by HAVERSINE formula. If v pq≥vmax indicates that the speed of q point generation is abnormal, defining q point as abnormal point, and deleting the point.
② Track data is generally collected in seconds, so the amount of the track data is large, but when the data analysis is performed, the operation efficiency is affected by the large amount of the track data, and a plurality of track points are unnecessary to analyze. For the convenience of calculation, the track data needs to be compressed, and a Douglas-Peucker (DP) algorithm is a commonly used track compression algorithm, and the Douglas-Peucker algorithm is used for compression, so that an offset threshold value threshold from a point to a track straight line needs to be set. Selecting a total of 102584 track points of 8 targets in the experiment, and comparing the compression ratio, the compression time and the track error after compression under different thresholds to obtain the increase of the compression ratio and the compression error of the track along with the increase of the compression threshold; the track increases with the track threshold and the compression time of the track correspondingly decreases. To achieve a balance of computation time and computation error, setting the trajectory compression threshold to 0.8km will result in a relatively accurate compression effect in a short computation time.
The main flow is as follows:
all points on the track segment are arranged in time order.
1) A straight line AB is connected between the first and the last points A and B of the track section, and the straight line is the chord of the track section
2) Obtaining a point C with the largest distance from the straight line segment on the track segment, and calculating the distance d between the point C and the AB;
3) Comparing the distance with a preset threshold value threshold, and if the distance is smaller than the threshold, the straight line segment is used as an approximation of the track segment, and the track of the segment is processed.
4) If the distance is greater than or equal to the threshold value threshold, the track section is divided into two sub track sections AC and BC by C, and the two sections are respectively subjected to 1-3 chord taking processes.
When all the sub track sections are processed, the broken lines formed by the dividing points are connected in sequence, and the broken lines can be used as approximations of the track sections.
(2) Designing an improved DBSCAN clustering method:
① Parameter Eps neighborhood: the Eps neighborhood of an object p refers to a region centered on the object p and having an Eps radius, namely:
Neps(p)={q∈D|Dist(p,q)≤Eps}; (2)
wherein D is a data set; dist (p, q) is the distance between object p and object q; n eps (p) refers to the set of points contained within the D-dimensional hypersphere region of dataset D object p with Eps as radius.
Obtaining neighborhood values Eps other than the first cluster using kernel density estimation
For a ship track data set D, n sample points x 1,x2,x3,…,xn which are independently and uniformly distributed exist in the data set, and a probability density function of the data set D is set as f (x), wherein the kernel density estimation form of f (x) is as follows:
Wherein, As a kernel function (non-negative, integral 1, mean 0), i=1, 2, …, n; is a scaling kernel function (SCALED KERNEL), whereby, I=1, 2, …, n; h is the bandwidth, also called window (h > 0), n is the number of samples.
Bandwidth is a free parameter that has a large impact on the resulting estimate. Then for the choice of h, the estimated probability density function can be determined using the integrated mean square Error (MEAN INTERGRATED Squared Error)And the true probability density function f (x), expressed as
Where E () represents the mathematical expectation of the variables in brackets.
Under weak hypothesis
Where o () represents the higher order infinitely small of the variable in brackets; AMISE is the progressive integral mean square error, and AMISE has
To minimize MISE (h), the transformation is to a pole problem
Thus optimum bandwidth
In the above expression for the bandwidth h, the second derivative f "(x) of the existence probability density function f (x) represents the degree of concavity and convexity of each point of the density estimation function, the gaussian density kernel function is taken as the kernel function to perform kernel density estimation, and the optimal choice of h (i.e., the bandwidth with minimized integral mean square error) is
Wherein,Representing the sample variance. For the data set D, the number of data set samples is used for obtaining the nuclear density estimated optimal bandwidth h, and h is used as an Eps initial value for clustering the data set.
② The parameter MinPts density threshold describes the threshold for the number of samples in a neighborhood of a certain sample distance Eps.
For the data set D, traversing the data set D, and recording the number M of the objects of each data point in the Eps neighborhood, wherein the value M can be used as the basis of the density distribution of the data set. Selecting a data point with the maximum M value in the data set D as a first core object D 1, acquiring the M value of D 1 as an initial MinPts, clustering the first cluster, selecting a data object with the maximum M value from the data objects which are not clustered after the clustering is finished as a core object, and carrying out the next clustering. Dynamic update obtains the density threshold MinPts of cluster classes other than the first cluster
Wherein M (n) is the M value of the current core object; m (max) records the M value of D 1; Representation of To the power of 2; minPts' is the pre-update density threshold. Once per cluster, the density threshold MinPts changes with the density value M of the first core object neighborhood at the beginning of each cluster class.
(3) And performing sequence mining on the frequent item sets.
Through clustering of the ship data set, frequent items of tracks represented by a cluster center and a cluster-like distance range and tracks formed by the cluster center are finally generated, and a PrefixSpan (Prefix-Projected Pattern Growth, prefix projection mode mining) algorithm is utilized to mine out frequently-occurring partial sequence rules from a large number of partial sequence phenomena, so that a behavior mode of the ship is obtained.
The invention mainly comprises the following steps:
(1) According to the invention, on the basis of a clustering algorithm and pattern mining, the behavior of the ship is subjected to pattern mining, cleaning, compression and clustering are provided, the clustering result is used as an entrance of the template mining, and the behavior of the ship is effectively mined according to the characteristics of redundancy of ship data, large data volume, gentle steering angle and difficulty in accurate repetition of ship track, so that the navigation rule of the ship can be obtained;
(2) Compared with the prior clustering algorithm, the clustering method is improved, the improved algorithm can enable the clustering process to be more fit with sample characteristics, the quality of a clustering result is higher, so that the final behavior pattern mining is more accurate, the utilization rate of ship data is higher, and more accurate reference can be provided for the utilization of ocean data.
Drawings
FIG. 1 is a schematic view of the overall structure of the present invention;
FIG. 2 is a schematic diagram of the clustering method of the present invention.
Detailed Description
According to the invention, the ship AIS data is selected as the original data, the ship behavior mode is mined by using a clustering algorithm and a mode mining algorithm, and the following technical scheme and the realization steps are adopted.
The ship behavior pattern mining method based on the clustering algorithm and pattern mining specifically comprises the following steps:
1. The original data set of the ship is cleaned and compressed, and the method comprises the following steps:
step 1: traversing the track data set, segmenting the track with the time interval between the starting point and the end point of the track segment being greater than one day (24 hours) according to the time difference requirement, and storing the segmented track data set.
Step2: traversing all sub-track segments of a track for the segmented track data set, judging that if the speed of the track segment is larger than a given speed value according to the set maximum speed v max value of 110 km/h, taking the start point of the track segment as the end point of the previous track and the end point as the start point of the next track. The abnormal points are divided into track segments with the length of 1, and the abnormal points are removed by removing the track segments with the length of 1.
Step 3: the data sets which are washed are compressed one by one according to track segments by using a Fabry-Perot algorithm, and the compressed track data sets are stored.
2. Compressing the processed track data set by using the improved DBSCAN clustering method
Step 1: setting an initial Eps neighborhood threshold:
Estimation from nuclear density
To select the optimal bandwidth h value, a method is used that minimizes the average integral squared error
AMISE has
According to the above formula, to find the minimum h value, MISE (h) needs to be minimized to find the pole
Using kernel density estimation with gaussian kernel function, the best choice for h is
And n is the number of objects in the data set, and the value of h is assigned to the neighborhood threshold Eps for clustering.
Step 2: traversing the data set D, and recording the number M of objects of each data point in the Eps neighborhood.
Step 3: selecting the data point with the maximum M value as a first core object, performing DBSCAN clustering on the first cluster by using the parameters of Eps and MinPts, and marking the clustered data points until the clustering is finished.
Step 4: selecting a data object with the maximum M value from the data objects which are not clustered as a core object, and dynamically acquiring a cluster density threshold MinPts except for the first cluster
Clustering was performed using the Eps and MinPts parameters.
Step 5: and (4) repeating the step until the rest objects cannot be used as core objects.
Step 6: the track consisting of cluster centers and the range of each cluster center and cluster are recorded.
3. And carrying out pattern mining on the clustering result.
And taking the clustering center and the clustering range as frequent items, and carrying out mode mining on tracks formed by the clustering centers by using PrefixSpan algorithm to obtain a frequent sequence with time sequence, wherein the sequence is a behavior template of the ship and is an operation rule in the ship navigation process.
Claims (1)
1. The ship behavior pattern mining method based on the clustering algorithm and pattern mining is characterized by comprising the following steps of:
(1) Data cleaning and compression:
① Cleaning the data set is necessary because the collected raw data may have outliers; the rules are:
the time interval between the starting point and the end point of the track segment is greater than 24 hours, and then the track segment is divided;
Defining a maximum speed v max, and setting the maximum speed v max to be 110 km/h; assuming that the locus point p (lon p,latp,tp) is the previous locus point of the locus point q (lon q,latq,tq) in the locus section, t p>tq, the speed between the two points is calculated as:
Wherein lon p is the longitude value of the track point p, lat p is the latitude value of the track point p, and t p is the time of generating the track point p; similarly, lon q is the longitude value of the track point q, lat q is the latitude value of the track point q, and t q is the time generated by the track point q; HAVERSINE (lat p,lonp,latq,lonq) is the distance between two latitude and longitude points calculated by HAVERSINE formula; if v pq≥vmax indicates that the speed generated by the q point is abnormal, defining the q point as an abnormal point, and deleting the point;
② Compressing the track data, and setting a track compression threshold value threshold to be 0.8km in order to balance a calculation time and a calculation error;
The flow is as follows:
arranging all points on the track section according to time sequence;
1) A straight line AB is connected between the first and the last points A and B of the track section, and the straight line is the chord of the track section
2) Obtaining a point C with the largest distance from the straight line segment on the track segment, and calculating the distance d between the point C and the AB;
3) Comparing the distance with a preset threshold value threshold, and if the distance is smaller than the threshold, taking the straight line segment as an approximation of a track segment, wherein the track segment is processed;
4) If the distance is greater than or equal to a threshold value threshold, dividing the track section into two sub track sections AC and BC by using C, and respectively carrying out 1) to 3) of processing on the two sections of strings;
When all the sub track sections are processed, the broken lines formed by all the dividing points are connected in sequence, namely, the broken lines are used as approximations of the track sections;
(2) Designing an improved DBSCAN clustering method:
① Parameter Eps neighborhood: the Eps neighborhood of an object p refers to a region centered on the object p and having an Eps radius, namely:
Neps(p)={q∈D|Dist(p,q)≤Eps}; (2)
wherein D is a data set; dist (p, q) is the distance between object p and object q; n eps (p) refers to the set of points contained within the D-dimensional hypersphere region of dataset D object p with Eps as radius;
Obtaining neighborhood values Eps other than the first cluster using kernel density estimation
For a ship track data set D, n sample points x 1,x2,x3,…,xn which are independently and uniformly distributed exist in the data set, and a probability density function of the data set D is set as f (x), wherein the kernel density estimation form of f (x) is as follows:
Wherein, As a kernel function, the kernel function is non-negative, the integral is 1, the average value is 0, i=1, 2, …, n; Is a scaling kernel function, whereby, H is the bandwidth, also called window, n is the number of samples; h >0;
For the selection of h, the estimated probability density function is determined using the integrated mean square error And the true probability density function f (x), expressed as
Where E () represents the mathematical expectation of the variables in brackets;
Under weak hypothesis
Where o () represents the higher order infinitely small of the variable in brackets; AMISE is the progressive integral mean square error, and AMISE has
To minimize MISE (h), the transformation is to a pole problem
Thus optimum bandwidth
In the expression of bandwidth h, the second derivative f' (x) of the existence probability density function f (x) represents the degree of concavity and convexity of each point of the density estimation function, the Gaussian density kernel function is taken as the kernel function to perform kernel density estimation, and the optimal choice of h is that the bandwidth with minimized integral mean square error is
Wherein,Representing the sample variance; aiming at a data set D, obtaining an optimal bandwidth h of kernel density estimation by using the number of data set samples, and taking h as an Eps initial value for clustering the data set;
② The parameter MinPts density threshold describes the threshold of the number of samples in a neighborhood with the distance of Eps of a certain sample;
Traversing the data set D aiming at the data set D, and recording the number M of objects of each data point in the Eps neighborhood, wherein the value M is used as the basis of the density distribution of the data set; selecting a data point with the maximum M value in the data set D as a first core object D 1, acquiring the M value of D 1 as an initial MinPts, clustering the first cluster, selecting a data object with the maximum M value from the data objects which are not clustered after the clustering is finished as a core object, and carrying out the next clustering; dynamic update obtains the density threshold MinPts of cluster classes other than the first cluster
Wherein M (n) is the M value of the current core object; m (max) records the M value of D 1; Representation of To the power of 2; minPts' is the pre-update density threshold; once per cluster, the density threshold MinPts changes with the density value M of the first core object neighborhood at the beginning of each cluster class;
(3) Sequence mining is carried out on frequent item sets;
and finally generating frequent items of tracks represented by the cluster centers and the cluster-like distance ranges and tracks formed by the cluster centers through clustering of the ship data sets, and mining out frequently-occurring partial sequence rules from the partial sequence phenomenon, so as to obtain the behavior mode of the ship.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110247443.0A CN113032378B (en) | 2021-03-05 | 2021-03-05 | Ship behavior pattern mining method based on clustering algorithm and pattern mining |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110247443.0A CN113032378B (en) | 2021-03-05 | 2021-03-05 | Ship behavior pattern mining method based on clustering algorithm and pattern mining |
Publications (2)
Publication Number | Publication Date |
---|---|
CN113032378A CN113032378A (en) | 2021-06-25 |
CN113032378B true CN113032378B (en) | 2024-07-19 |
Family
ID=76468504
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110247443.0A Active CN113032378B (en) | 2021-03-05 | 2021-03-05 | Ship behavior pattern mining method based on clustering algorithm and pattern mining |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113032378B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113298195A (en) * | 2021-07-27 | 2021-08-24 | 中国电子科技集团公司第十五研究所 | Method and device for generating classical trajectory of offshore target and storage medium |
CN114118243B (en) * | 2021-11-18 | 2023-07-07 | 中交疏浚技术装备国家工程研究中心有限公司 | Construction track identification method for trailing suction hopper dredger based on track data |
CN114385765B (en) * | 2021-12-03 | 2023-07-07 | 中交疏浚技术装备国家工程研究中心有限公司 | Production time prediction method of trailing suction hopper dredger based on track data |
CN115270920B (en) * | 2022-06-21 | 2023-07-18 | 中国人民解放军91977部队 | Ship target classical track generation method based on density spatial clustering |
CN115828120B (en) * | 2023-02-17 | 2023-05-16 | 山东科技大学 | Self-adaptive identification method, system and computer equipment for ship traffic behavior mode |
CN117150217B (en) * | 2023-11-01 | 2024-01-26 | 苏州爱雄斯通信技术有限公司 | Data compression processing method based on big data |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110309383A (en) * | 2019-06-17 | 2019-10-08 | 武汉科技大学 | Ship trajectory clustering analysis method based on improved DBSCAN algorithm |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2020210974A1 (en) * | 2019-04-16 | 2020-10-22 | 江南大学 | High-quality pattern mining model and method based on improved multi-objective evolutionary algorithm |
-
2021
- 2021-03-05 CN CN202110247443.0A patent/CN113032378B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110309383A (en) * | 2019-06-17 | 2019-10-08 | 武汉科技大学 | Ship trajectory clustering analysis method based on improved DBSCAN algorithm |
Also Published As
Publication number | Publication date |
---|---|
CN113032378A (en) | 2021-06-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN113032378B (en) | Ship behavior pattern mining method based on clustering algorithm and pattern mining | |
CN113780395B (en) | Mass high-dimensional AIS trajectory data clustering method | |
US11851147B2 (en) | Spatio-temporal DP method based on ship trajectory characteristic point extraction | |
CN109191922B (en) | Large-scale four-dimensional track dynamic prediction method and device | |
CN107682319B (en) | Enhanced angle anomaly factor-based data flow anomaly detection and multi-verification method | |
CN108985380B (en) | Point switch fault identification method based on cluster integration | |
CN111310833B (en) | Travel mode identification method based on Bayesian neural network | |
CN111582380B (en) | Ship track density clustering method and device based on space-time characteristics | |
CN111985529B (en) | Ship AIS track hybrid clustering method and system | |
CN109581339B (en) | Sonar identification method based on automatic adjustment self-coding network of brainstorming storm | |
CN112069310A (en) | Text classification method and system based on active learning strategy | |
CN109977546B (en) | Four-dimensional track online anomaly detection method based on unsupervised learning | |
CN110708318A (en) | Network abnormal flow prediction method based on improved radial basis function neural network algorithm | |
CN110738355A (en) | urban waterlogging prediction method based on neural network | |
CN110196962A (en) | A kind of air speed abnormality recognition method based on Density Estimator | |
CN112783938B (en) | Hydrological telemetering real-time data anomaly detection method | |
CN112363896A (en) | Log anomaly detection system | |
CN114550498A (en) | Intelligent early warning system for ship deviation channel driven by massive AIS data | |
Wang et al. | Classification of ship trajectories by using naive Bayesian algorithm | |
CN114091578A (en) | Ship track clustering method based on curve length distance | |
CN106991817B (en) | Method for determining traffic capacity of road sections of multi-level road network | |
CN115879051A (en) | Track big data anomaly detection method and system based on VAE | |
CN109447309B (en) | Method and system for mining galloping data | |
CN116881750B (en) | Track clustering method | |
Qin et al. | A trajectory abnormal detection method based on segmentation and clustering |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |