WO2020048183A1 - Vessel type identification method based on coarse-to-fine cascaded convolutional neural network - Google Patents

Vessel type identification method based on coarse-to-fine cascaded convolutional neural network Download PDF

Info

Publication number
WO2020048183A1
WO2020048183A1 PCT/CN2019/092016 CN2019092016W WO2020048183A1 WO 2020048183 A1 WO2020048183 A1 WO 2020048183A1 CN 2019092016 W CN2019092016 W CN 2019092016W WO 2020048183 A1 WO2020048183 A1 WO 2020048183A1
Authority
WO
WIPO (PCT)
Prior art keywords
ship
neural network
convolutional neural
fine
coarse
Prior art date
Application number
PCT/CN2019/092016
Other languages
French (fr)
Chinese (zh)
Inventor
陈信强
杨勇生
吴华锋
苌道方
于泽崴
张倩楠
陈晶
傅俊杰
赵建森
陈辉兴
刘卫
李军军
许波桅
鲜江峰
Original Assignee
上海海事大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海海事大学 filed Critical 上海海事大学
Priority to CA3084451A priority Critical patent/CA3084451C/en
Publication of WO2020048183A1 publication Critical patent/WO2020048183A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2413Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns

Definitions

  • the invention relates to the technical field of maritime video surveillance, and in particular to a cascaded convolutional neural network ship type recognition method from coarse to fine.
  • VTS Vessel Traffic Service
  • AIS Automatic Identification System
  • the on-board personnel on the ship report the basic information of the ship, such as the port of destination, the port of departure, and the type of ship, to the maritime supervision department through a VHF phone.
  • the AIS system will also periodically distribute static and dynamic information of the ship, including the ship's type, position, call sign, ship name, gross tonnage, ship draft and speed, etc.
  • AIS users need to manually enter static information such as ship type and ship call sign for the AIS system in advance. From the above analysis, it can be known that both VTS and AIS require human participation to obtain ship type information.
  • the purpose of the present invention is to provide a cascaded deep convolutional neural network ship type identification method for overcoming the shortcomings of the prior art described above.
  • common ships including container ships, tankers, and chemical tankers
  • Bulk carriers, general cargo ships, LNG carriers, other merchant ships Obtaining ship type information by traditional technical means is time-consuming and not conducive to improving the efficiency of maritime supervision.
  • a cascaded deep-convolution neural network ship type recognition method from coarse to fine includes the following steps:
  • S1 Input pictures of all ship types and corresponding picture tags, and perform coarse-level training on the deep convolutional neural network from coarse to fine, to obtain the setting parameters of the deep convolutional neural network from coarse to fine, and Training recognition accuracy;
  • step S2 Use the training to identify the ship type with the lowest accuracy picture to perform fine-level training on the deep to convolutional neural network from coarse to fine. If the deep convolutional neural network has not reached the preset convergence condition, return to step S1 to continue Training, otherwise execute step S3;
  • S3 Perform type recognition on the ship in the picture, and output the recognition result of the ship type.
  • the step S1 includes the following steps:
  • S11 crop the originally input ship type picture to a fixed size, match the input ship type picture and corresponding picture tag, and obtain a formatted ship image and image tag;
  • S12 using the formatted ship image and image label to train a cascaded deep convolutional neural network from coarse to fine to obtain the setting parameters of the cascaded deep convolutional neural network from coarse to fine, Extract depth characteristics of different ship types;
  • the step S12 includes the following steps:
  • the cascaded deep-to-fine convolutional neural network uses convolution layers to extract ship features;
  • the ship features include: low-level ship features, including ship textures, contours, and corner points; advanced ship features, Different types of ships are obtained by correspondingly abstracting the characteristics of low-level ships;
  • S122 The cascaded deep convolutional neural network from coarse to fine uses the pooling layer to reduce the dimension of the ship features and learn;
  • S123 The cascaded deep to convolutional neural network uses a local response normalization layer to increase the local response of the ship feature extracted by the convolution layer to randomly assign a larger response value and extract Generalize ship characteristics.
  • the step S13 includes the following steps:
  • S131 The cascaded deep-to-fine convolutional neural network uses a fully connected layer to map the generalized ship feature to a single ship feature vector, and its expression is as follows:
  • F out is a single ship feature vector output from the fully connected layer, which has a total of n 1 elements;
  • F in is the input generalized ship feature, its dimension is n 2 +1;
  • is the F in and F out 's connection matrix with dimensions n 1 ⁇ (n 2 +1);
  • S132 The cascaded deep-to-fine convolutional neural network uses a loss layer to generate a probability vector based on the single ship feature vector as an input ship image, and the elements in the vector represent the probability of the type of ship
  • the calculation expression is as follows:
  • F p is a single ship feature vector of the loss layer
  • v j is a weight corresponding to the j-th ship type when calculating a ship probability vector.
  • the calculation expression of the training recognition accuracy e1t of the ship type in step S14 is as follows:
  • N s is the total number of ship pictures to be identified
  • N er is the total number of ship pictures with incorrect type recognition.
  • the step S2 includes the following steps:
  • S21 Acquire all the training pictures of the ship type according to the pictures of the ship type with the lowest training recognition accuracy, as the input samples for the fine-level training of the coarse to fine deep convolutional neural network;
  • the coarse-to-fine deep convolutional neural network adjusted according to the refinement parameters is used to re-identify the ship type of the picture. If the training type recognition accuracy change rate is less than a preset threshold, the training process ends; if the ship type The type of training recognition accuracy change rate is greater than the preset threshold, the coarse-to-fine deep convolutional neural network completes the current fine-level training, and returns to step S1 to continue training.
  • the data enhancement includes horizontally / vertically flipping the training picture, changing a color, and / or randomly changing the size of the training picture.
  • the selective discarding method sleeps some neurons of the convolution layer with a preset probability, and all neurons of the convolution layer sleep or stop hibernation with the same probability.
  • the selective connection method is to randomly modify the weights of the neurons in the convolution layer, thereby weakening or strengthening the influence of the ship features extracted by the neurons in the layer on the accuracy of ship type recognition.
  • the calculation formula for the rate of change in training recognition accuracy P ia of the ship type is as follows:
  • a jc is the accuracy of class j ship type recognition in step S1;
  • a jf is the accuracy of class j ship type recognition in step S2.
  • the present invention has the following advantages: the method of the present invention realizes the ship type in the picture by stepwise training from rough to fine level by cascading from coarse to fine deep convolutional neural network.
  • the automatic and accurate identification of ships effectively realizes the automation and high-precision identification of ship types, and has important practical value for organizing maritime traffic order, ensuring maritime traffic safety, and improving navigation efficiency in the era of intelligent navigation.
  • FIG. 1 is a schematic diagram of an overall process of the present invention
  • FIG. 2A is a schematic flowchart of step S1 in a preferred embodiment
  • FIG. 2B is a schematic flowchart of step S12 in the preferred embodiment
  • FIG. 2C is a schematic flowchart of step S13 in the preferred embodiment
  • FIG. 2D is a schematic flowchart of step S2 in the preferred embodiment
  • FIG. 3 is a picture of a typical ship to be identified according to the present invention.
  • 4A is a deep network recognition error distribution of different batch sample sizes when different parameters are set
  • 4B is a deep network recognition error distribution with different weight value attenuation when different parameters are set
  • 4C is a deep network recognition error distribution with different learning rates when different parameters are set
  • 4D is a deep network recognition error distribution with different training times when different parameters are set
  • 5A is a recognition result of a container ship
  • Figure 5B is the identification result of the general cargo ship
  • FIG. 6 is a classification distribution of different ship types based on a cascaded deep to convolutional neural network
  • FIG. 7 is a comparison chart of ship type recognition accuracy by different methods.
  • This embodiment provides a cascade-type coarse-to-fine deep convolutional neural network ship type recognition method. Referring to the schematic diagram of the overall process shown in FIG. 1, the method includes the following steps:
  • S1 Input pictures of all ship types and corresponding picture tags, perform rough-level training on the deep convolutional neural network from coarse to fine, obtain the setting parameters of the deep convolutional neural network from coarse to fine, and obtain different ship types Training recognition accuracy;
  • step S2 Use the training to identify the ship type with the lowest accuracy picture to perform fine-level training on the deep to convolutional neural network from coarse to fine. If the deep convolutional neural network has not reached the preset convergence condition, return to step S1 to continue Training, otherwise execute step S3;
  • S3 Perform type recognition on the ship in the picture, and output the recognition result of the ship type.
  • step S1 includes the following steps:
  • S11 crop the originally input ship type picture to a fixed size, match the input ship type picture and corresponding picture tag, and obtain a formatted ship image and image tag;
  • S12 Use the formatted ship images and image tags to train the cascaded deep convolutional neural network from coarse to fine, obtain the setting parameters of the cascaded deep convolutional neural network from coarse to fine, and extract Depth characteristics of different ship types;
  • step S12 further includes the following steps:
  • a cascaded deep-to-fine convolutional neural network uses convolution layers to extract ship features.
  • the ship features here include low-level ship features and high-level ship features, where low-level ship features include ship texture, contours, and corner points.
  • the characteristics of high-level ships are high-level features of low-level ships, and different types of ships present different high-level features.
  • the ship features extracted by the convolutional layer in this step are as follows:
  • the cascaded deep-to-fine convolutional neural network uses the pooling layer to learn the above-mentioned ship features, and abandons the learning of secondary ship features, and retains the learning of important ship features.
  • the secondary ship feature refers to the feature extracted by the neurons whose pooled weight value is less than 75% of the entire layer of the network.
  • the important ship feature refers to the neurons whose pooled weight value is greater than 75% of the whole network. Extracted features.
  • the pooling layer expression here is as follows:
  • k is the dimension of the pooling kernel
  • d is the step size
  • parameters u and v are dimensions of Pool u, v ;
  • S123 The cascaded deep to convolutional neural network uses the local response normalization layer to increase the local response of the ship features extracted by the convolution layer to randomly assign larger response values to extract generalized ship features.
  • the expression of the local response normalization layer is as follows;
  • step S13 includes the following steps:
  • F out is a single ship feature vector output from the fully connected layer, which has a total of n 1 elements;
  • F in is the input generalized ship feature, its dimension is n 2 +1;
  • is the F in and
  • the connection matrix of F out the dimension is n 1 ⁇ (n 2 +1), where n2 is the output ship feature after generalization;
  • S132 The cascaded deep-to-fine convolutional neural network uses the loss layer to generate a probability vector based on the above-mentioned single ship feature vector as an input ship image, and the elements in the vector represent the probability of the type to which the ship belongs, and calculates the expression. as follows:
  • F p is a single ship feature vector of the loss layer
  • v j is a weight corresponding to the j-th ship type when calculating a ship probability vector.
  • the calculation expression of the training type recognition accuracy e 1t of the ship type in step S14 is as follows:
  • N s is the total number of ship pictures to be identified
  • N er is the total number of ship pictures with incorrect type recognition.
  • step S2 includes the following steps:
  • S21 Acquire all the training pictures of the ship type according to the pictures of the ship type with the lowest training recognition accuracy, as the input samples for the fine-level training of the coarse to fine deep convolutional neural network;
  • P Max ⁇ i1 ⁇ ⁇ ir + ⁇ i2 ⁇ ⁇ ih ⁇
  • P is the probability of choosing the i-th regularization method, the data is enhanced, and the dropout and dropconnect mechanisms are respectively labeled as 1, 2, and 3.
  • P 1s indicates that data enhancement is selected as the current regularization method
  • P 2s indicates that dropout is selected as the current regularization method
  • P 3s indicates that dropout is selected as the current regularization method
  • ⁇ ir is a random factor
  • ⁇ ih is heuristic factor
  • ⁇ i1 of representation ⁇ ir weight ⁇ i2 representative of ⁇ ih weight
  • P ia after the session rate of change of the depth of the network of the ship recognition accuracy
  • step S23 According to the setting parameters of step S1, use a random regularization mechanism to train the deep convolutional neural network from coarse to fine, and obtain the refined parameters and the adjusted deep convolutional neural network from coarse to fine;
  • the above-mentioned data enhancement mechanism specifically includes horizontal / vertical flipping, color changing, and / or randomly changing the training picture size of the original training picture.
  • the selective discarding method described above sleeps some neurons of the convolutional layer with a preset probability, and all neurons of the convolutional layer sleep or stop dormant with the same probability.
  • the above selective connection method is to randomly modify the weights of the neurons in the convolution layer, thereby weakening or strengthening the influence of the ship features extracted by the neurons in this layer on the accuracy of ship type recognition.
  • a jc is the accuracy of class j ship type recognition in step S1;
  • a jf is the accuracy of class j ship type recognition in step S2.
  • the experimental platform for ship type identification in this application example is Windows 10 operating system, 16G RAM, the main frequency of the CPU processor is 3.4GHz, and the simulation platform is MATLAB (R2016 version).
  • the experimental test ship of the present invention includes 7 types of ships, including container ships, oil tankers, chemical ships, liquefied natural gas ships (LNG), general cargo ships and bulk carriers. These 6 types of ships are common types of merchant ships. Category 7 ships are a collection of uncommon merchant ships, including timber ships, refrigerated ships and barges. In the training and test of this application example, the names of the first six ship types are recorded as the tags of various ship types, and the tag of the seventh type is "other ship types".
  • the training and test sets include 11,760 pictures, of which 2,720 are container ship pictures, 1,320 are tanker pictures, 1600 are chemical tanker ships, 1,200 are LNG ship pictures, 2,850 are general ship pictures, and 2,070 are bulk ship pictures. A total of 1,560 pictures of 7 types of ships.
  • Figure 3 shows a picture of a typical ship type.
  • the 1-type error rate is used to obtain the recognition accuracy of the ship type, and the parameter setting when the ship type has the highest recognition accuracy is set to the network, as the optimal cascade type from coarse to fine deep convolutional neural network parameter setting -
  • the calculation expression for the class error rate is as follows:
  • N s is the total number of ship images to be identified
  • N er is the total number of misidentified ship pictures.
  • N bz is the sample batch capacity
  • N ep is the number of network trainings
  • ⁇ wd is the weight decay rate
  • ⁇ lr is the learning rate
  • the parameters f cl and f pl are the convolution kernels of the convolution layer and the pooling layer, respectively. size.
  • FIG. 4A shows that the optimal size of the batch sample is 15, that is, the cascaded deep convolutional neural network of the present invention selects 15 pictures from the test set as a training set for training.
  • the 1-type error rate showed a significant downward trend.
  • the 1-type error rate increases rapidly.
  • the 1-type error rate reaches almost 40%.
  • the default batch size of the cascaded deep convolutional neural network from coarse to fine of the present invention is set to 15.
  • FIG. 4B shows the change of the ship type recognition accuracy corresponding to different weight attenuation rates.
  • the weight attenuation rate is 5 ⁇ 10 -4
  • the error rate of ship type identification is the smallest.
  • the 1-type error rate of the cascaded deep convolutional neural network of the present invention is only 10%. Therefore, the default value of the weight decay rate is set to 5 ⁇ 10 -4 .
  • FIG. 4C shows that when the learning rate decreases from 2 ⁇ 10 -1 to 2 ⁇ 10 -3 , the 1-type error rate goes from 22%. Down to 15%.
  • the optimal learning rate is set to 2 ⁇ 10 -3 .
  • FIG. 4D it can be seen that compared with the above three parameters, the change in the number of network training epochs has little effect on the accuracy of ship type recognition. In fact, when the network training number is equal to or more than 200, the recognition accuracy of the ship remains basically unchanged, so the default value of the network training number is 200.
  • FIGs 5A-5C show the recognition results of these three types of typical ships.
  • Figure 5A shows a typical container ship image and recognition results.
  • the right side of FIG. 5A shows that the cascaded deep convolutional neural network according to the present invention has a probability of thinking that the ship in the image is a container ship and the probability of being a general cargo ship is 97.8%.
  • the cascaded deep convolutional neural network of the present invention determines the ship type with the highest probability value as the recognized ship type. Therefore, it is determined that the input image is a container ship, which shows that the cascaded deep convolutional neural network of the present invention fully extracts and learns important and significant features of the container ship.
  • FIG. 5B shows that the cascaded deep convolutional neural network of the present invention tests a ship image as a general cargo ship.
  • the recognition result of the cascaded deep convolutional neural network of the present invention is as follows. 5B right sub-picture. It is obvious that the ship in the test picture was equipped with a crane. It is only possible for general cargo ships and small bulk carriers to be equipped with restraint cranes. In addition, from the perspective of the ship, bulk carriers are basically equipped with hatch covers, while general cargo ships are not equipped with such facilities. It can be seen from the left sub-picture of FIG. 5B that the ship in the picture does not have a hatch.
  • the above characteristics of the general cargo ship and the bulk cargo ship make the cascaded deep convolutional neural network of the present invention to easily distinguish between the general cargo ship and the bulk cargo ship. It can be known from FIG. 5B that the probability that the cascaded deep convolutional neural network of the present invention considers the ship to be a general cargo ship is 99.6%, and the probability that it is a bulk carrier is 0.4%. Therefore, it is determined that the ship type of the input image is a general cargo ship. This shows that the cascaded deep convolutional neural network of the present invention can correctly extract the characteristics of general cargo ships and bulk cargo ships.
  • FIG. 5C shows the recognition result of the oil tanker by the cascaded deep convolutional neural network from coarse to fine according to the present invention.
  • the appearance of tankers and chemical tankers is similar, we can perceptually recognize that the pipelines on the decks of chemical tankers are more complicated than the pipelines on tanker decks, and the quantitative complexity cannot be used to describe the complexity of the pipelines of the two types of ships.
  • the good generalization ability of the cascaded deep convolutional neural network from coarse to fine according to the present invention enables it to effectively grasp the complexity of two kinds of ship pipelines.
  • the cascaded deep convolutional neural network of the present invention considers that the probability that the ship belongs to an oil tanker is 96.4%, and the probability that it belongs to a chemical tanker is 3.6%.
  • the above-mentioned CFCCNN ship confidence level distribution also validates our analysis. Therefore, it is determined that the ship type of the input image is a tanker. This shows that the cascaded deep convolutional neural network of the present invention can correctly extract the characteristics of general cargo ships and bulk cargo ships.
  • Fig. 6 shows that the cascaded deep convolutional neural network of the present invention has the highest recognition accuracy for a category 7 ship, and its recognition accuracy is as high as 93.3%. This is because the structural characteristics of Class 7 ships are more obvious. For example, as a member of the seventh category of ships, ro-ro ships usually carry small ships. Therefore, from the perspective of the image, the shape and structure corresponding to the ro-ro ship will be significantly different from other types of ships, and the cascaded deep-convolution neural network of the invention can easily obtain the ro-ro ship's This structural texture features.
  • the cascaded deep convolutional neural network of the present invention can obtain better accuracy when identifying a type 7 ship.
  • Figure 6 shows that the cascaded deep convolutional neural network of the present invention has a recognition accuracy of 90.7% for container ships, a recognition accuracy for general cargo ships of 86%, and a tanker recognition accuracy of 84.6%.
  • the cascaded deep convolutional neural network of the present invention has lower recognition accuracy for chemical tankers and LNG tankers. This is because the cascaded deep convolutional neural network of the present invention recognizes part of the chemical tankers as tankers, and part of the liquefied natural gas ships as class 7 ships.
  • the cascaded deep convolutional neural network of the present invention does not have high recognition accuracy for chemical tankers and LNG carriers, the average recognition rate for all ship types reaches 81.4%.
  • this embodiment also uses the existing K-Nearest Neighbor (KNN), artificial neural network (ANN), random forest (RF), and traditional convolution.
  • the neural network (convolutional neural network, CNN) method compares the recognition results of different ship types.
  • the KNN algorithm and the ANN algorithm have the lowest recognition accuracy for chemical tankers, and their recognition accuracy is 29.8% and 28.1%, respectively.
  • the recognition accuracy is the lowest, and its recognition accuracy is only 41.2%.
  • the traditional CNN algorithm has a recognition accuracy of 61.3% and 63.2% for chemical tankers and LNG ships respectively, while the cascaded deep convolutional neural network of the present invention has a recognition accuracy of 65.6 for the above two ship types. % And 66.7%.

Abstract

The present invention provides a vessel type identification method based on a coarse-to-fine cascaded deep convolutional neural network. The method uses a random heuristic selection mechanism to dynamically adjust structure and parameter settings of the depth network, and the method obtains the deep convolutional neural network capable of identifying the vessel type by means of the steps of coarse-level training and fine-level training; the coarse-level training process is similar to that of a conventional deep convolutional neural network, and an input sample of the training process is a vessel image; the fine-level training process is directed at a merchant vessel image having the lowest vessel type identification precision in the coarse-level training process, and retrains the deep convolutional neural network to improve the overall precision of the vessel type identification. By means of the method of the present invention, better identification precision can be realized for different vessel types, and information support is provided for automatic vessel type identification and vessel intelligent navigation.

Description

一种级联式由粗到精的卷积神经网络船舶类型识别方法A cascaded rough-to-fine convolutional neural network ship type recognition method 技术领域Technical field
本发明涉及一种海事视频监控技术领域,尤其涉及一种级联式由粗到精的卷积神经网络船舶类型识别方法。The invention relates to the technical field of maritime video surveillance, and in particular to a cascaded convolutional neural network ship type recognition method from coarse to fine.
背景技术Background technique
目前,船舶交通服务(Vessel Traffic Service,VTS)和船舶自动识别系统(Automatic Identification System,AIS)是获取船舶类型信息的主要手段。船舶进入VTS报告线后,船上的值班人员通过甚高频电话向海事监管部门报告本船基本信息,比如目的港、出发港、船舶类型等。此外,AIS系统也会通过广播周期性地分发本船的静态和动态信息,包括本船船型、船位、呼号、船名、总吨位、船舶吃水和航速等。但AIS用户需要提前为AIS系统手动输入船舶的船舶类型、船舶呼号等静态信息。从上述分析可知VTS和AIS都需要借助人工参与才能获得船舶类型信息。随着海上交通量的快速增长和船队规模的迅速扩张,这些传统的船舶类型获取方法需要人工干预的工作越来越大。因此,利用传统的技术手段获取船舶类型信息是一项非常耗时的工作。基于可视化数据信息的船舶类型自动化识别,是无人船舶时代和智能航行时代的需要应对的重要挑战之一。At present, Vessel Traffic Service (VTS) and Automatic Identification System (AIS) are the main means to obtain ship type information. After the ship enters the VTS report line, the on-board personnel on the ship report the basic information of the ship, such as the port of destination, the port of departure, and the type of ship, to the maritime supervision department through a VHF phone. In addition, the AIS system will also periodically distribute static and dynamic information of the ship, including the ship's type, position, call sign, ship name, gross tonnage, ship draft and speed, etc. However, AIS users need to manually enter static information such as ship type and ship call sign for the AIS system in advance. From the above analysis, it can be known that both VTS and AIS require human participation to obtain ship type information. With the rapid growth of maritime traffic and the rapid expansion of fleet size, these traditional methods of acquiring ship types require more and more manual intervention. Therefore, it is a very time-consuming task to obtain the ship type information by using traditional technical means. The automatic identification of ship types based on visual data information is one of the important challenges to be addressed in the era of unmanned ships and the era of intelligent navigation.
发明的公开Disclosure of invention
本发明的目的就是为了克服上述现有技术存在的缺陷而提供一种级联式由粗到精的深度卷积神经网络船舶类型识别方法,对常见的船舶(包括集装箱船、油轮、化学品船、散货船、杂货船、液化天然气船、其他商船)进行识别。以解决传统的技术手段获取船舶类型信息非常耗时,不利于提高海事监管效率的问题。The purpose of the present invention is to provide a cascaded deep convolutional neural network ship type identification method for overcoming the shortcomings of the prior art described above. For common ships (including container ships, tankers, and chemical tankers) , Bulk carriers, general cargo ships, LNG carriers, other merchant ships). Obtaining ship type information by traditional technical means is time-consuming and not conducive to improving the efficiency of maritime supervision.
本发明的目的可以通过以下技术方案来实现:The object of the present invention can be achieved by the following technical solutions:
一种级联式由粗到精的深度卷积神经网络船舶类型识别方法,该方法包括以下步骤:A cascaded deep-convolution neural network ship type recognition method from coarse to fine, the method includes the following steps:
S1:输入所有船舶类型的图片和对应的图片标签,对由粗到精的深度卷积神经网络进行粗糙级训练,得到由粗到精的深度卷积神经网络的设置参数,及不同船舶类型的训练识别精度;S1: Input pictures of all ship types and corresponding picture tags, and perform coarse-level training on the deep convolutional neural network from coarse to fine, to obtain the setting parameters of the deep convolutional neural network from coarse to fine, and Training recognition accuracy;
S2:利用所述训练识别精度最低的船舶类型的图片,对由粗到精的深度卷积神经网络进行精细级训练,如果深度卷积神经网络尚未达到预设的收敛条件,则返回步骤S1继续训练,否则执行步骤S3;S2: Use the training to identify the ship type with the lowest accuracy picture to perform fine-level training on the deep to convolutional neural network from coarse to fine. If the deep convolutional neural network has not reached the preset convergence condition, return to step S1 to continue Training, otherwise execute step S3;
S3:对图片中的船舶进行类型识别,并输出船舶类型的识别结果。S3: Perform type recognition on the ship in the picture, and output the recognition result of the ship type.
较佳地,所述的步骤S1包括以下步骤:Preferably, the step S1 includes the following steps:
S11:将初始输入的船舶类型的图片裁剪为固定尺寸,匹配输入的船舶类型的图片及对应的图片标签,得到格式化的船舶图像和图像标签;S11: crop the originally input ship type picture to a fixed size, match the input ship type picture and corresponding picture tag, and obtain a formatted ship image and image tag;
S12:利用所述的格式化的船舶图像和图像标签,对级联式由粗到精的深度卷积神经网络进行训练,得到级联式由粗到精的深度卷积神经网络的设置参数,提取不同的船舶类型的深度特征;S12: using the formatted ship image and image label to train a cascaded deep convolutional neural network from coarse to fine to obtain the setting parameters of the cascaded deep convolutional neural network from coarse to fine, Extract depth characteristics of different ship types;
S13:根据所述深度特征,得到输入的船舶类型的图片的置信度水平分布,将置信度值最大对应的船舶类型输出为单次训练识别结果;S13: Obtain the confidence level distribution of the input ship type picture according to the depth feature, and output the ship type corresponding to the maximum confidence value as a single training recognition result;
S14:根据所述单次训练识别结果,比较图片标签对应的船舶真实所属类型,获得不同船舶类型的训练识别精度。S14: According to the recognition result of the single training, compare the true type of the ship corresponding to the picture tag to obtain the training recognition accuracy of different ship types.
较佳地,所述的步骤S12包括以下步骤:Preferably, the step S12 includes the following steps:
S121:所述的级联式由粗到精的深度卷积神经网络利用卷积层提取船舶特征;所述船舶特征包括:低级船舶特征,包括船舶纹理、轮廓和角点;高级船舶特征,根据船舶的不同类型对低级船舶特征进行相应的高度抽象而获得;S121: The cascaded deep-to-fine convolutional neural network uses convolution layers to extract ship features; the ship features include: low-level ship features, including ship textures, contours, and corner points; advanced ship features, Different types of ships are obtained by correspondingly abstracting the characteristics of low-level ships;
S122:所述的级联式由粗到精的深度卷积神经网络利用池化层对所述的船舶特征降维学习;S122: The cascaded deep convolutional neural network from coarse to fine uses the pooling layer to reduce the dimension of the ship features and learn;
S123:所述的级联式由粗到精的深度卷积神经网络利用局部响应归一化层,增加对所述的卷积层提取的船舶特征的局部响应随机赋予更大的响应值,提取泛化船舶特征。S123: The cascaded deep to convolutional neural network uses a local response normalization layer to increase the local response of the ship feature extracted by the convolution layer to randomly assign a larger response value and extract Generalize ship characteristics.
较佳地,所述的步骤S13包括以下步骤:Preferably, the step S13 includes the following steps:
S131:所述的级联式由粗到精的深度卷积神经网络利用全连接层,将所述的泛化船舶特征映射为单一船舶特征向量,其表达式如下:S131: The cascaded deep-to-fine convolutional neural network uses a fully connected layer to map the generalized ship feature to a single ship feature vector, and its expression is as follows:
F out=Θ×F in F out = Θ × F in
其中:F out是全连接层输出的单一船舶特征向量,该特征向量共有n 1个元素;F in是输入的泛化船舶特征,其维度为n 2+1;Θ是特征向量的F in和F out的连接矩阵,维度是n 1×(n 2+1); Among them: F out is a single ship feature vector output from the fully connected layer, which has a total of n 1 elements; F in is the input generalized ship feature, its dimension is n 2 +1; Θ is the F in and F out 's connection matrix with dimensions n 1 × (n 2 +1);
S132:所述的级联式由粗到精的深度卷积神经网络利用损失层,根据所述的单一船舶特征向量作为输入的船舶图像生成概率向量,该向量中的元素表示船舶所属类型的概率,计算表达式如下:S132: The cascaded deep-to-fine convolutional neural network uses a loss layer to generate a probability vector based on the single ship feature vector as an input ship image, and the elements in the vector represent the probability of the type of ship The calculation expression is as follows:
Figure PCTCN2019092016-appb-000001
Figure PCTCN2019092016-appb-000001
其中,F p是所述损失层的单一船舶特征向量;v j是计算船舶概率向量时第j类船舶类型对应的权值。 Among them, F p is a single ship feature vector of the loss layer; v j is a weight corresponding to the j-th ship type when calculating a ship probability vector.
较佳地,所述的步骤S14的船舶类型的训练识别精度e1t的计算表达式如下:Preferably, the calculation expression of the training recognition accuracy e1t of the ship type in step S14 is as follows:
Figure PCTCN2019092016-appb-000002
Figure PCTCN2019092016-appb-000002
其中:N s是待识别的船舶图片总数量;N er是类型识别错误的船舶图片总数量。 Among them: N s is the total number of ship pictures to be identified; N er is the total number of ship pictures with incorrect type recognition.
较佳地,所述步骤S2包括以下步骤:Preferably, the step S2 includes the following steps:
S21:根据所述训练识别精度最低的船舶类型的图片,获取该船舶类型的所有训练图片,作为所述的由粗到精的深度卷积神经网络的精细级训练的输入样本;S21: Acquire all the training pictures of the ship type according to the pictures of the ship type with the lowest training recognition accuracy, as the input samples for the fine-level training of the coarse to fine deep convolutional neural network;
S22:利用随机启发式选择方法,从数据增强、选择性丢弃方法和选择性连接方法中选择一种作为随机正则化机制;S22: Use a random heuristic selection method to select one of the data enhancement, selective discarding method, and selective connection method as a random regularization mechanism;
S23:根据所述设置参数,利用所述随机正则化机制对所述的由粗到精的深度卷积神经网络进行训练,并得到精细化参数及其调节后的由粗到精的深度卷积神经网络;S23: According to the setting parameters, use the random regularization mechanism to train the coarse to fine deep convolutional neural network, and obtain the refined parameters and the adjusted coarse to fine deep convolution. Neural Networks;
S24:根据所述精细化参数调节的由粗到精的深度卷积神经网络对图片的船舶类型进行重新识别,如果船舶类型的训练识别精度变化率小于预设阈值,则结束训练过程;如果船舶类型的训练识别精度变化率大于所述预设阈值,则所述的由粗到精的深度卷积神经网络完成当前的精细级的训练,并返回所述的步骤S1继续训练。S24: The coarse-to-fine deep convolutional neural network adjusted according to the refinement parameters is used to re-identify the ship type of the picture. If the training type recognition accuracy change rate is less than a preset threshold, the training process ends; if the ship type The type of training recognition accuracy change rate is greater than the preset threshold, the coarse-to-fine deep convolutional neural network completes the current fine-level training, and returns to step S1 to continue training.
较佳地,所述的数据增强包括对所述训练图片的水平/竖直翻转、颜色改变和/或随机改变所述训练图片的大小。Preferably, the data enhancement includes horizontally / vertically flipping the training picture, changing a color, and / or randomly changing the size of the training picture.
较佳地,所述的选择性丢弃方法以预设概率休眠卷积层的部分神经元,且卷积层所有神经元都以相同的概率休眠或停止休眠。Preferably, the selective discarding method sleeps some neurons of the convolution layer with a preset probability, and all neurons of the convolution layer sleep or stop hibernation with the same probability.
较佳地,所述的选择性连接方法是对卷积层神经元的权重进行随机修改,从而弱化或者强化该层神经元提取的船舶特征对船舶类型识别的精度的影响。Preferably, the selective connection method is to randomly modify the weights of the neurons in the convolution layer, thereby weakening or strengthening the influence of the ship features extracted by the neurons in the layer on the accuracy of ship type recognition.
较佳地,所述的船舶类型的训练识别精度的变化率P ia计算公式如下: Preferably, the calculation formula for the rate of change in training recognition accuracy P ia of the ship type is as follows:
Figure PCTCN2019092016-appb-000003
Figure PCTCN2019092016-appb-000003
其中:A jc是步骤S1的第j类船舶类型识别精度;A jf是步骤S2的第j类船舶类型识别精度。 Among them: A jc is the accuracy of class j ship type recognition in step S1; A jf is the accuracy of class j ship type recognition in step S2.
与现有技术相比,本发明具有以下优点:本发明方法通过对级联式由粗到精的深度卷积神经网络进行由粗糙到精细级别的逐级训练,实现了对图片中的船舶类型的自动精确识别,有效实现了对船舶类型的自动化和高精度的识别,对组织海上交通秩序,保障海上交通安全,提高智能航行时代的通航效率具有重要的现实价值。Compared with the prior art, the present invention has the following advantages: the method of the present invention realizes the ship type in the picture by stepwise training from rough to fine level by cascading from coarse to fine deep convolutional neural network. The automatic and accurate identification of ships effectively realizes the automation and high-precision identification of ship types, and has important practical value for organizing maritime traffic order, ensuring maritime traffic safety, and improving navigation efficiency in the era of intelligent navigation.
附图的简要说明Brief description of the drawings
下文将参考附图进一步描述本发明的实施例,在附图中:Embodiments of the present invention will be further described below with reference to the accompanying drawings, in which:
图1是本发明的总体流程示意图图;FIG. 1 is a schematic diagram of an overall process of the present invention;
图2A为优选实施例中步骤S1的具体流程示意图;FIG. 2A is a schematic flowchart of step S1 in a preferred embodiment; FIG.
图2B为优选实施例中步骤S12的具体流程示意图;FIG. 2B is a schematic flowchart of step S12 in the preferred embodiment; FIG.
图2C为优选实施例中步骤S13的具体流程示意图;FIG. 2C is a schematic flowchart of step S13 in the preferred embodiment; FIG.
图2D为优选实施例中步骤S2的具体流程示意图;FIG. 2D is a schematic flowchart of step S2 in the preferred embodiment; FIG.
图3是本发明的典型待识别的船舶图片;FIG. 3 is a picture of a typical ship to be identified according to the present invention;
图4A为设置不同参数时不同批样本容量的深度网络识别误差分布;4A is a deep network recognition error distribution of different batch sample sizes when different parameters are set;
图4B为设置不同参数时不同权重值衰减的深度网络识别误差分布;4B is a deep network recognition error distribution with different weight value attenuation when different parameters are set;
图4C为设置不同参数时不同学习率的深度网络识别误差分布;4C is a deep network recognition error distribution with different learning rates when different parameters are set;
图4D为设置不同参数时不同训练次数的深度网络识别误差分布;4D is a deep network recognition error distribution with different training times when different parameters are set;
图5A是集装箱船的识别结果;5A is a recognition result of a container ship;
图5B是杂货船的识别结果;Figure 5B is the identification result of the general cargo ship;
图5C是油轮的识别结果;5C is the recognition result of the tanker;
图6是基于级联式由粗到精的深度卷积神经网络的不同船舶类型识别精度分布;FIG. 6 is a classification distribution of different ship types based on a cascaded deep to convolutional neural network;
图7不同方法的船舶类型识别精度对比图。FIG. 7 is a comparison chart of ship type recognition accuracy by different methods.
实现本发明的最佳方式The best way to implement the invention
下面结合附图和具体实施例对本发明进行详细说明。本实施例以本发明技术方案为前提进行实施,给出了详细的实施方式和具体的操作过程,但本发明的保护范围不限于下述的实施例。The present invention is described in detail below with reference to the drawings and specific embodiments. This embodiment is implemented on the premise of the technical solution of the present invention, and a detailed implementation manner and a specific operation process are given, but the protection scope of the present invention is not limited to the following embodiments.
本实施例提供了一种级联式由粗到精的深度卷积神经网络船舶类型识别方法,参考附图1所示的总体流程示意图,该方法包括以下步骤:This embodiment provides a cascade-type coarse-to-fine deep convolutional neural network ship type recognition method. Referring to the schematic diagram of the overall process shown in FIG. 1, the method includes the following steps:
S1:输入所有船舶类型的图片和对应的图片标签,对由粗到精的深度卷积神经网络进行粗糙级训练,得到由粗到精的深度卷积神经网络的设置参数,以及得到不同船舶类型的训练识别精度;S1: Input pictures of all ship types and corresponding picture tags, perform rough-level training on the deep convolutional neural network from coarse to fine, obtain the setting parameters of the deep convolutional neural network from coarse to fine, and obtain different ship types Training recognition accuracy;
S2:利用所述训练识别精度最低的船舶类型的图片,对由粗到精的深度卷积神经网络进行精细级训练,如果深度卷积神经网络尚未达到预设的收敛条件,则返回步骤S1继续训练,否则执行步骤S3;S2: Use the training to identify the ship type with the lowest accuracy picture to perform fine-level training on the deep to convolutional neural network from coarse to fine. If the deep convolutional neural network has not reached the preset convergence condition, return to step S1 to continue Training, otherwise execute step S3;
S3:对图片中的船舶进行类型识别,并输出船舶类型的识别结果。S3: Perform type recognition on the ship in the picture, and output the recognition result of the ship type.
进一步参考附图2A所示,在优选的实施例中,上述步骤S1包括以下步骤:Further referring to FIG. 2A, in a preferred embodiment, the above step S1 includes the following steps:
S11:将初始输入的船舶类型的图片裁剪为固定尺寸,匹配输入的船舶类型的图片及对应的图片标签,得到格式化的船舶图像和图像标签;S11: crop the originally input ship type picture to a fixed size, match the input ship type picture and corresponding picture tag, and obtain a formatted ship image and image tag;
S12:利用得到的格式化的船舶图像和图像标签,对级联式由粗到精的深度卷积神经网络进行训练,得到级联式由粗到精的深度卷积神经网络的设置参数,提取不同的船舶类型的深度特征;S12: Use the formatted ship images and image tags to train the cascaded deep convolutional neural network from coarse to fine, obtain the setting parameters of the cascaded deep convolutional neural network from coarse to fine, and extract Depth characteristics of different ship types;
S13:根据所得到的深度特征,得到输入的船舶类型的图片的置信度水平分布,将置信度值最大对应的船舶类型输出为单次训练识别结果;S13: Obtain the confidence level distribution of the input ship type picture according to the obtained depth characteristics, and output the ship type corresponding to the maximum confidence value as a single training recognition result;
S14:根据单次训练识别结果,比较图片标签对应的船舶真实所属类型,获得不同船舶类型的训练识别精度。S14: According to the recognition result of a single training, compare the true type of the ship corresponding to the picture label to obtain the training recognition accuracy of different ship types.
其中,如图2B所示,上述的步骤S12进一步包括以下步骤:As shown in FIG. 2B, the above step S12 further includes the following steps:
S121:级联式由粗到精的深度卷积神经网络利用卷积层提取船舶特征,这里的船舶特征包括:低级船舶特征和高级船舶特征,其中低级船舶特征包括船舶纹理、轮廓和角点,而高级船舶特征是低级船舶特征高度抽象,不同类型的船舶呈现出不同的高级特征。则本步骤中卷积层所提取的船舶特征如下:S121: A cascaded deep-to-fine convolutional neural network uses convolution layers to extract ship features. The ship features here include low-level ship features and high-level ship features, where low-level ship features include ship texture, contours, and corner points. The characteristics of high-level ships are high-level features of low-level ships, and different types of ships present different high-level features. Then the ship features extracted by the convolutional layer in this step are as follows:
Figure PCTCN2019092016-appb-000004
Figure PCTCN2019092016-appb-000004
其中:
Figure PCTCN2019092016-appb-000005
是第r-1个网络层的第m个输入船舶特征映射;
Figure PCTCN2019092016-appb-000006
是第n个网络输出层的船舶特征映射和第m个输入特征映射的连接权重;
Figure PCTCN2019092016-appb-000007
是第r层卷积网络的第n个特征映射的偏置值;符号f表示激活第r层卷积网络神经元的激活函数;
Figure PCTCN2019092016-appb-000008
是第r层卷积网络的第n个输出特征映射;
among them:
Figure PCTCN2019092016-appb-000005
Is the mth input ship feature map of the r-1th network layer;
Figure PCTCN2019092016-appb-000006
Is the connection weight of the ship feature map of the n-th network output layer and the m-th input feature map;
Figure PCTCN2019092016-appb-000007
Is the bias value of the n-th feature map of the r-th layer convolutional network; the symbol f represents the activation function that activates the r-th layer of the convolutional network neuron;
Figure PCTCN2019092016-appb-000008
Is the n-th output feature map of the r-th layer convolutional network;
S122:级联式由粗到精的深度卷积神经网络利用池化层对上述的船舶特征学习,并放弃次要船舶特征的学习,保留重要船舶特征的学习。次要船舶特征是指池化后的权重值 小于整层网络75%权重值的神经元提取的特征,重要要船舶特征是指池化后的权重值大于整层网络75%权重值的神经元提取的特征。这里的池化层表达式如下:S122: The cascaded deep-to-fine convolutional neural network uses the pooling layer to learn the above-mentioned ship features, and abandons the learning of secondary ship features, and retains the learning of important ship features. The secondary ship feature refers to the feature extracted by the neurons whose pooled weight value is less than 75% of the entire layer of the network. The important ship feature refers to the neurons whose pooled weight value is greater than 75% of the whole network. Extracted features. The pooling layer expression here is as follows:
Figure PCTCN2019092016-appb-000009
Figure PCTCN2019092016-appb-000009
其中:k是池化核的维度;d代表步长;
Figure PCTCN2019092016-appb-000010
是所述的级联式由粗到精的深度卷积神经网络的卷积层生成的第n个船舶特征图;Pool u,v池化层对特征图
Figure PCTCN2019092016-appb-000011
池化得到的重要特征;参数u和v是Pool u,v的维度;
Where: k is the dimension of the pooling kernel; d is the step size;
Figure PCTCN2019092016-appb-000010
Is the n-th ship feature map generated by the cascaded convolutional layer of coarse to fine deep convolutional neural network; Pool u, v pooling layer pair feature map
Figure PCTCN2019092016-appb-000011
Important features obtained by pooling; parameters u and v are dimensions of Pool u, v ;
S123:级联式由粗到精的深度卷积神经网络利用局部响应归一化层,增加对卷积层所提取的船舶特征的局部响应随机赋予更大的响应值,提取泛化船舶特征,局部响应归一化层的表达式如下;S123: The cascaded deep to convolutional neural network uses the local response normalization layer to increase the local response of the ship features extracted by the convolution layer to randomly assign larger response values to extract generalized ship features. The expression of the local response normalization layer is as follows;
Figure PCTCN2019092016-appb-000012
Figure PCTCN2019092016-appb-000012
其中:
Figure PCTCN2019092016-appb-000013
是第r层局部响应归一化层的神经单元的第i个船舶特征响应;
Figure PCTCN2019092016-appb-000014
是第r层局部响应归一化层单元中的第i个船舶特征图的特征值;参数a、η、
Figure PCTCN2019092016-appb-000015
h为预定参数;参数U为船舶类型的标签。
among them:
Figure PCTCN2019092016-appb-000013
Is the i-th ship characteristic response of the neural unit in the r-th local response normalization layer;
Figure PCTCN2019092016-appb-000014
Is the eigenvalue of the i-th ship feature map in the r-th local response normalization layer unit; parameters a, η,
Figure PCTCN2019092016-appb-000015
h is a predetermined parameter; parameter U is a tag of a ship type.
进一步优选的实施例中,如图2C所示,上述的步骤S13包括以下步骤:In a further preferred embodiment, as shown in FIG. 2C, the above step S13 includes the following steps:
S131:级联式由粗到精的深度卷积神经网络利用全连接层,将上述的步骤S123所得到的泛化船舶特征映射为单一船舶特征向量,其表达式如下:S131: The cascaded deep to convolutional neural network uses a fully connected layer to map the generalized ship features obtained in the above step S123 to a single ship feature vector, and its expression is as follows:
F out=Θ×F in F out = Θ × F in
其中:F out是全连接层输出的单一船舶特征向量,该特征向量共有n 1个元素;F in是输入的泛化船舶特征,其维度为n 2+1;Θ是特征向量的F in和F out的连接矩阵,维度是n 1×(n 2+1),这里的n2就是泛化之后的、输出的船舶特征; Among them: F out is a single ship feature vector output from the fully connected layer, which has a total of n 1 elements; F in is the input generalized ship feature, its dimension is n 2 +1; Θ is the F in and The connection matrix of F out , the dimension is n 1 × (n 2 +1), where n2 is the output ship feature after generalization;
S132:级联式由粗到精的深度卷积神经网络利用损失层,根据上述的单一船舶特征向量作为输入的船舶图像生成概率向量,该向量中的元素表示船舶所属类型的概率,计算表达式如下:S132: The cascaded deep-to-fine convolutional neural network uses the loss layer to generate a probability vector based on the above-mentioned single ship feature vector as an input ship image, and the elements in the vector represent the probability of the type to which the ship belongs, and calculates the expression. as follows:
Figure PCTCN2019092016-appb-000016
Figure PCTCN2019092016-appb-000016
其中,F p是所述损失层的单一船舶特征向量;v j是计算船舶概率向量时第j类船舶类型对应的权值。 Among them, F p is a single ship feature vector of the loss layer; v j is a weight corresponding to the j-th ship type when calculating a ship probability vector.
在另一优选实施例中,上述的步骤S14的船舶类型的训练识别精度e 1t的计算表达式如下: In another preferred embodiment, the calculation expression of the training type recognition accuracy e 1t of the ship type in step S14 is as follows:
Figure PCTCN2019092016-appb-000017
Figure PCTCN2019092016-appb-000017
其中:N s是待识别的船舶图片总数量;N er是类型识别错误的船舶图片总数量。 Among them: N s is the total number of ship pictures to be identified; N er is the total number of ship pictures with incorrect type recognition.
在另一优选实施例中,如图2D所示,上述的步骤S2包括以下步骤:In another preferred embodiment, as shown in FIG. 2D, the above-mentioned step S2 includes the following steps:
S21:根据所述训练识别精度最低的船舶类型的图片,获取该船舶类型的所有训练图片,作为所述的由粗到精的深度卷积神经网络的精细级训练的输入样本;S21: Acquire all the training pictures of the ship type according to the pictures of the ship type with the lowest training recognition accuracy, as the input samples for the fine-level training of the coarse to fine deep convolutional neural network;
S22:利用随机启发式选择方法,从数据增强、选择性丢弃和选择性连接选择一种作为随机正则化机制,随机启发式选择方法的计算表达式如下:S22: Use a random heuristic selection method to select one as a random regularization mechanism from data enhancement, selective discarding, and selective connection. The calculation expression of the random heuristic selection method is as follows:
P is=Max{ω i1×θ iri2×θ ih} P is = Max {ω i1 × θ ir + ω i2 × θ ih }
Figure PCTCN2019092016-appb-000018
Figure PCTCN2019092016-appb-000018
其中:P is是选择第i种正则化方法的概率,数据增强,dropout和dropconnect机制分别标注为1,2,3。P 1s表示选择数据增强作为本次的正则化方法,P 2s表示选择dropout作为本次的正则化方法,P 3s表示选择dropout作为本次的正则化方法;θ ir是表示随机因子;θ ih是启发式因子;ω i1代表θ ir的权重;ω i2代表θ ih的权重;N i是待训练的第i类船舶图片数量;P ia是训练结束后,深度网络对船舶识别精度的变化率; Among them: P is the probability of choosing the i-th regularization method, the data is enhanced, and the dropout and dropconnect mechanisms are respectively labeled as 1, 2, and 3. P 1s indicates that data enhancement is selected as the current regularization method, P 2s indicates that dropout is selected as the current regularization method, and P 3s indicates that dropout is selected as the current regularization method; θ ir is a random factor; θ ih is heuristic factor; ω i1 of representation θ ir weight; ω i2 representative of θ ih weight; N i ship picture number of class i to be trained; P ia after the session, rate of change of the depth of the network of the ship recognition accuracy;
S23:根据步骤S1的设置参数,利用随机正则化机制对由粗到精的深度卷积神经网络进行训练,并得到精细化参数及其调节后的由粗到精的深度卷积神经网络;S23: According to the setting parameters of step S1, use a random regularization mechanism to train the deep convolutional neural network from coarse to fine, and obtain the refined parameters and the adjusted deep convolutional neural network from coarse to fine;
S24:根据精细化参数调节的由粗到精的深度卷积神经网络对图片的船舶类型进行重新识别,如果船舶类型的训练识别精度变化率小于预设阈值,则结束训练过程;根据多次测试结果,预设阈值设为0.01时,可获得比较好的网络性能;如果船舶类型的训练识别精度变化率大于预设阈值,则由粗到精的深度卷积神经网络完成当前的精细级的训练,并返回所述的步骤S1继续训练。S24: The coarse-to-fine deep convolutional neural network adjusted according to the refined parameters re-recognizes the ship type of the picture. If the training type ’s recognition accuracy change rate is less than a preset threshold, the training process ends; according to multiple tests As a result, a better network performance can be obtained when the preset threshold is set to 0.01; if the rate of change in training recognition accuracy of the ship type is greater than the preset threshold, the current fine-level training is completed by a coarse to fine deep convolutional neural network And return to step S1 to continue training.
其中,上述的数据增强机制具体包括对原始训练图片的水平/竖直翻转、颜色改变和/或随机改变训练图片大小。The above-mentioned data enhancement mechanism specifically includes horizontal / vertical flipping, color changing, and / or randomly changing the training picture size of the original training picture.
上述的选择性丢弃方法,所述的选择性丢弃方法以预设概率休眠卷积层的部分神经元,且卷积层所有神经元都以相同的概率休眠或停止休眠。The selective discarding method described above sleeps some neurons of the convolutional layer with a preset probability, and all neurons of the convolutional layer sleep or stop dormant with the same probability.
上述的选择性连接方法是对卷积层神经元的权重进行随机修改,从而弱化或者强化该层神经元提取的船舶特征对船舶类型识别的精度的影响。The above selective connection method is to randomly modify the weights of the neurons in the convolution layer, thereby weakening or strengthening the influence of the ship features extracted by the neurons in this layer on the accuracy of ship type recognition.
上述的船舶类型的训练识别精度的变化率P ia计算表达式如下: The calculation formula for the change rate P ia of the training recognition accuracy of the above ship types is as follows:
Figure PCTCN2019092016-appb-000019
Figure PCTCN2019092016-appb-000019
其中:A jc是步骤S1的第j类船舶类型识别精度;A jf是步骤S2的第j类船舶类型识别精度。 Among them: A jc is the accuracy of class j ship type recognition in step S1; A jf is the accuracy of class j ship type recognition in step S2.
下面对本发明的方法的一具体应用例中对船舶类型进行识别的实验及分析过程进行详细介绍:The experiment and analysis process for identifying the ship type in a specific application example of the method of the present invention are described in detail below:
本应用例中的船舶类型识别的实验平台是Windows 10操作系统,16G RAM,CPU处理器的主频是3.4GHz,仿真平台是MATLAB(R2016版)。本发明的实验测试船舶包括7 类船舶,包括集装箱船、油轮、化学品船、液化天然气船(LNG)、杂货船和散货船,这6种船舶是常见的商船类型。第7类船舶是由不常见商船构成的集合,包括木材船、冷藏船和载驳船。本应用例训练和测试将前面6种船舶类型的名字记为各种船型的标签,而第7类船型的标签为“其他船型”。训练集和测试集包括11760张图片,其中,集装箱船图片为2720张,油轮图片1320张,化学品船1600,液化天然气船图片1200张,杂货船图片2850张,散货船图片2070张,第7类船舶的图片共1560张。附图3显示了典型船舶类型的图片。The experimental platform for ship type identification in this application example is Windows 10 operating system, 16G RAM, the main frequency of the CPU processor is 3.4GHz, and the simulation platform is MATLAB (R2016 version). The experimental test ship of the present invention includes 7 types of ships, including container ships, oil tankers, chemical ships, liquefied natural gas ships (LNG), general cargo ships and bulk carriers. These 6 types of ships are common types of merchant ships. Category 7 ships are a collection of uncommon merchant ships, including timber ships, refrigerated ships and barges. In the training and test of this application example, the names of the first six ship types are recorded as the tags of various ship types, and the tag of the seventh type is "other ship types". The training and test sets include 11,760 pictures, of which 2,720 are container ship pictures, 1,320 are tanker pictures, 1600 are chemical tanker ships, 1,200 are LNG ship pictures, 2,850 are general ship pictures, and 2,070 are bulk ship pictures. A total of 1,560 pictures of 7 types of ships. Figure 3 shows a picture of a typical ship type.
不合理的参数设置会严重影响本发明提出的级联式由粗到精的深度卷积神经网络(Coarse-to-Fine Cascaded Convolutional Neural Network,CFCCNN)的船舶类型识别精度,因此,本应用例首先对级联式由粗到精的深度卷积神经网络参数进行调优设置,参数的初始化设置如表1所示。批样本容量(batch size),权值衰减率(weight decay),学习率(learning rate)和网络的训练次数(epoch)是级联式由粗到精的深度卷积神经网络的关键参数。这里使用1-类错误率来获取船舶类型的识别精度,得到船舶类型的识别精度最高时的参数设置为网络,作为最优的级联式由粗到精的深度卷积神经网络参数设置,1-类错误率的计算表达式如下所示:Improper parameter settings will seriously affect the accuracy of the ship type recognition of the cascaded coarse-to-fine deep convolutional neural network (Coarse-to-Fine Cascaded Convolutional Neural Network, CFCCNN). Therefore, this application example first Tuning the parameters of the cascaded deep convolutional neural network from coarse to fine, the initial settings of the parameters are shown in Table 1. Batch sample size (batch size), weight decay rate (learning rate), learning rate (epoch) and network training times (epoch) are the key parameters of a cascaded deep convolutional neural network from coarse to fine. Here, the 1-type error rate is used to obtain the recognition accuracy of the ship type, and the parameter setting when the ship type has the highest recognition accuracy is set to the network, as the optimal cascade type from coarse to fine deep convolutional neural network parameter setting -The calculation expression for the class error rate is as follows:
Figure PCTCN2019092016-appb-000020
Figure PCTCN2019092016-appb-000020
其中:among them:
N s是待识别的船舶图像总数量; N s is the total number of ship images to be identified;
N er是错误识别的船舶图片总数量。 N er is the total number of misidentified ship pictures.
表1级联式由粗到精的深度卷积神经网络参数初始化调优设置Table 1.Initial tuning settings for cascaded deep convolutional neural network parameters from coarse to fine
参数parameter 初始值Initial value 步长Stride
N bz* N bz * 55 55
N ep N ep 5050 5050
σ wd σ wd 1×10 -4 1 × 10 -4 1×10 -4 1 × 10 -4
σ lr σ lr 2×10 -1 2 × 10 -1 1×10 -1 1 × 10 -1
ω i1(i=1,2,3) ω i1 (i = 1, 2, 3) 0.50.5 __
ω i2(i=1,2,3) ω i2 (i = 1, 2, 3) 0.50.5 __
(i=1,2,3)(i = 1, 2, 3) 00 __
N cl N cl 200200 __
θ c θ c 0.10.1 __
N d N d 33 __
f cl f cl 3×33 × 3 __
f pl f pl 2×22 × 2 __
*:N bz是样本批容量;N ep是网络训练次数;σ wd是权值衰减率;σ lr是学习率,参数f cl和f pl分别是卷积层和池化层的卷积核的尺寸。 *: N bz is the sample batch capacity; N ep is the number of network trainings; σ wd is the weight decay rate; σ lr is the learning rate, and the parameters f cl and f pl are the convolution kernels of the convolution layer and the pooling layer, respectively. size.
附图4A~4D为设置不同的参数时,船舶类型识别的1-类错误率的分布情况。附图4A~4D显示不合理的参数设置会降低网络的识别精度。附图4A显示,批样本的最佳大小 为15,即本发明的级联式由粗到精的深度卷积神经网络每次从测试集中选择15张图片作为一个训练集进行训练。当批样本的容量从5以等步长的形式增加到15时,1-类错误率的呈现出明显的下降趋势。但是,当批样本的大小从15逐步增加到50时,1-类错误率迅速增大。当批样本的大小设置为50时,1-类错误率几乎达到了40%。基于上述分析,本发明的级联式由粗到精的深度卷积神经网络的批样本容量默认值设为15。附图4B显示了不同的权值衰减率对应的船舶类型识别精度的变化。当权值衰减率为5×10 -4,船舶类型识别错误率最小。实际上,权值衰减率为5×10 -4时,本发明的级联式由粗到精的深度卷积神经网络的1-类错误率仅为10%。因此,权值衰减率的默认值设为5×10 -4Figures 4A to 4D show the distribution of the 1-type error rate for ship type identification when different parameters are set. Figures 4A to 4D show that irrational parameter settings will reduce the recognition accuracy of the network. FIG. 4A shows that the optimal size of the batch sample is 15, that is, the cascaded deep convolutional neural network of the present invention selects 15 pictures from the test set as a training set for training. When the batch size increased from 5 to 15 in equal steps, the 1-type error rate showed a significant downward trend. However, as the batch size increased from 15 to 50, the 1-type error rate increased rapidly. When the batch size is set to 50, the 1-type error rate reaches almost 40%. Based on the above analysis, the default batch size of the cascaded deep convolutional neural network from coarse to fine of the present invention is set to 15. FIG. 4B shows the change of the ship type recognition accuracy corresponding to different weight attenuation rates. When the weight attenuation rate is 5 × 10 -4 , the error rate of ship type identification is the smallest. In fact, when the weight attenuation rate is 5 × 10 -4 , the 1-type error rate of the cascaded deep convolutional neural network of the present invention is only 10%. Therefore, the default value of the weight decay rate is set to 5 × 10 -4 .
我们将学习率从2×10 -1逐步减小到2×10 -7,附图4C显示当学习率从2×10 -1递减到2×10 -3时,1-类错误率从22%下降到15%。而学习率从2×10 -3减小到2×10 -7时,1-类错误率基本保持不变,但CFCCNN的收敛时间却显著增加。因此,最佳学习率设为2×10 -3。参考图4D所示,可知与上述三个参数相比,网络训练次数epoch的变化对船舶类型识别精度影响不大。实际上,当网络的训练次数等于或超过200时,船舶的识别精度基本保持不变,所以将网络训练次数的默认值为200。 We gradually reduce the learning rate from 2 × 10 -1 to 2 × 10 -7 . Figure 4C shows that when the learning rate decreases from 2 × 10 -1 to 2 × 10 -3 , the 1-type error rate goes from 22%. Down to 15%. When the learning rate is reduced from 2 × 10 -3 to 2 × 10 -7 , the 1-type error rate remains basically unchanged, but the convergence time of CFCCNN increases significantly. Therefore, the optimal learning rate is set to 2 × 10 -3 . Referring to FIG. 4D, it can be seen that compared with the above three parameters, the change in the number of network training epochs has little effect on the accuracy of ship type recognition. In fact, when the network training number is equal to or more than 200, the recognition accuracy of the ship remains basically unchanged, so the default value of the network training number is 200.
集装箱船、杂货船和油轮是水上运输常见的船舶类型,正确识别这三种船舶类型对智能船舶视觉感知,保障海上交通安全具有重要的现实意义。附图5A~5C显示了这三类典型船舶的识别结果。附图5A显示了典型的集装箱船图像和识别结果。附图5A右侧显示,本发明的级联式由粗到精的深度卷积神经网络的认为图像中的船舶为集装箱船的概率是97.8%,为杂货船的概率是2.2%。本发明的级联式由粗到精的深度卷积神经网络把概率值最大的船舶类型判定为识别的船舶类型。因此确定输入图像为集装箱船,这说明本发明的级联式由粗到精的深度卷积神经网络充分提取并学习了集装箱船重要的、显著的特征。Container ships, general cargo ships and oil tankers are common types of ships in water transportation. Correct identification of these three types of ships has important practical significance for the visual perception of intelligent ships and ensuring the safety of maritime traffic. Figures 5A-5C show the recognition results of these three types of typical ships. Figure 5A shows a typical container ship image and recognition results. The right side of FIG. 5A shows that the cascaded deep convolutional neural network according to the present invention has a probability of thinking that the ship in the image is a container ship and the probability of being a general cargo ship is 97.8%. The cascaded deep convolutional neural network of the present invention determines the ship type with the highest probability value as the recognized ship type. Therefore, it is determined that the input image is a container ship, which shows that the cascaded deep convolutional neural network of the present invention fully extracts and learns important and significant features of the container ship.
附图5B显示了本发明的级联式由粗到精的深度卷积神经网络测试船舶图像为杂货船,本发明的级联式由粗到精的深度卷积神经网络的识别结果如附图5B右子图所示。显而易见的是测试图片的船舶配置一台克令吊。而杂货船和小型散货船才可能配备克令吊。此外,从船舶外形看,散货船基本上都配有舱盖,而杂货船并不配备此种设施。从附图5B的左子图可以看出,图片中的船舶并没有舱盖。杂货船和散货船的上述特征使得本发明的级联式由粗到精的深度卷积神经网络很容易区分出杂货船和散货船。从附图5B中可知,本发明的级联式由粗到精的深度卷积神经网络认为该船是杂货船的概率为99.6%,为散货船的概率为0.4%。因此,确定输入图像的船舶类型为杂货船。这表明本发明的级联式由粗到精的深度卷积神经网络能够正确提取杂货船和散货船的特征。FIG. 5B shows that the cascaded deep convolutional neural network of the present invention tests a ship image as a general cargo ship. The recognition result of the cascaded deep convolutional neural network of the present invention is as follows. 5B right sub-picture. It is obvious that the ship in the test picture was equipped with a crane. It is only possible for general cargo ships and small bulk carriers to be equipped with restraint cranes. In addition, from the perspective of the ship, bulk carriers are basically equipped with hatch covers, while general cargo ships are not equipped with such facilities. It can be seen from the left sub-picture of FIG. 5B that the ship in the picture does not have a hatch. The above characteristics of the general cargo ship and the bulk cargo ship make the cascaded deep convolutional neural network of the present invention to easily distinguish between the general cargo ship and the bulk cargo ship. It can be known from FIG. 5B that the probability that the cascaded deep convolutional neural network of the present invention considers the ship to be a general cargo ship is 99.6%, and the probability that it is a bulk carrier is 0.4%. Therefore, it is determined that the ship type of the input image is a general cargo ship. This shows that the cascaded deep convolutional neural network of the present invention can correctly extract the characteristics of general cargo ships and bulk cargo ships.
附图5C的最后一个子图展示了本发明的级联式由粗到精的深度卷积神经网络对油轮的识别结果。虽然油轮和化学品船的外观相似,但我们可以感性认识到化学品船甲板上的管道比油轮甲板的管道复杂,而无法用定量的方法描述两种船舶管道的复杂度。但是,本发明的级联式由粗到精的深度卷积神经网络良好的泛化能力,使其能够有效的掌握两种船舶管道的复杂度。附图5C中,本发明的级联式由粗到精的深度卷积神经网络认为该船舶属于油轮的概率为96.4%,属于化学品船的概率为3.6%。上述CFCCNN的船舶置信度水平分布也验证了我们的分析。因此,确定输入图像的船舶类型为油轮。这表明本发明的级联式由粗到精的深度卷积神经网络能够正确提取杂货船和散货船的特征。The last sub-picture of FIG. 5C shows the recognition result of the oil tanker by the cascaded deep convolutional neural network from coarse to fine according to the present invention. Although the appearance of tankers and chemical tankers is similar, we can perceptually recognize that the pipelines on the decks of chemical tankers are more complicated than the pipelines on tanker decks, and the quantitative complexity cannot be used to describe the complexity of the pipelines of the two types of ships. However, the good generalization ability of the cascaded deep convolutional neural network from coarse to fine according to the present invention enables it to effectively grasp the complexity of two kinds of ship pipelines. In FIG. 5C, the cascaded deep convolutional neural network of the present invention considers that the probability that the ship belongs to an oil tanker is 96.4%, and the probability that it belongs to a chemical tanker is 3.6%. The above-mentioned CFCCNN ship confidence level distribution also validates our analysis. Therefore, it is determined that the ship type of the input image is a tanker. This shows that the cascaded deep convolutional neural network of the present invention can correctly extract the characteristics of general cargo ships and bulk cargo ships.
附图6显示本发明的级联式由粗到精的深度卷积神经网络对第7类船舶的识别精度最 高,其识别精度高达93.3%。这是因为第7类船舶的结构特征更明显。例如,作为第七类船舶的成员,滚装船通常会装运一些小型船舶。因此,从图像的角度出发,滚装船对应的外形和结构会明显区别于其他类型的船舶,而本发明的级联式由粗到精的深度卷积神经网络可以轻易地获取滚装船的这种结构纹理特征。实际上,第7类船舶的木船、冷藏船等其他类型的船舶的外形特征和纹理特征明显区别于其他6类船舶。因此,本发明的级联式由粗到精的深度卷积神经网络在识别第7类船舶时能够获得更好的精度。Fig. 6 shows that the cascaded deep convolutional neural network of the present invention has the highest recognition accuracy for a category 7 ship, and its recognition accuracy is as high as 93.3%. This is because the structural characteristics of Class 7 ships are more obvious. For example, as a member of the seventh category of ships, ro-ro ships usually carry small ships. Therefore, from the perspective of the image, the shape and structure corresponding to the ro-ro ship will be significantly different from other types of ships, and the cascaded deep-convolution neural network of the invention can easily obtain the ro-ro ship's This structural texture features. In fact, the shape and texture characteristics of other types of ships, such as wooden ships and refrigerated ships of the seventh type of ship, are clearly different from those of the other six types of ship. Therefore, the cascaded deep convolutional neural network of the present invention can obtain better accuracy when identifying a type 7 ship.
附图6显示本发明的级联式由粗到精的深度卷积神经网络对集装箱船识别精度为90.7%,对杂货船的识别精度为86%,油轮的识别精度为84.6%。相比于上述船舶类型,本发明的级联式由粗到精的深度卷积神经网络对化学品船和LNG船的识别精度较低。这是因为本发明的级联式由粗到精的深度卷积神经网络将部分化学品船识别为油轮,而部分液化天然气船被识别为第7类船。尽管本发明的级联式由粗到精的深度卷积神经网络对化学品船和液化天然气船的识别精度不高,但对所有船舶类型的平均识别率达到了81.4%。Figure 6 shows that the cascaded deep convolutional neural network of the present invention has a recognition accuracy of 90.7% for container ships, a recognition accuracy for general cargo ships of 86%, and a tanker recognition accuracy of 84.6%. Compared with the above ship types, the cascaded deep convolutional neural network of the present invention has lower recognition accuracy for chemical tankers and LNG tankers. This is because the cascaded deep convolutional neural network of the present invention recognizes part of the chemical tankers as tankers, and part of the liquefied natural gas ships as class 7 ships. Although the cascaded deep convolutional neural network of the present invention does not have high recognition accuracy for chemical tankers and LNG carriers, the average recognition rate for all ship types reaches 81.4%.
参考图7所示,本实施例也利用已有的K-近邻算法(K-Nearest neighbor,KNN)、人工神经网络(artificial neural network,ANN)、随机森林(random forest,RF)和传统卷积神经网络(convolutional neural network,CNN)方法对比不同船舶类型的识别结果,KNN算法和ANN算法对化学品船的识别精度最低,其识别精度分别为29.8%和28.1%,而RF方法对LNG船的识别精度最低,其识别精度仅为41.2%。而传统CNN算法对化学品船和LNG船的识别精度分别为61.3%和63.2%,而本发明的级联式由粗到精的深度卷积神经网络对上述两种船型的识别精度分别为65.6%和66.7%。上述传统方法(KNN,ANN和RF)和基于传统CNN深度学习方法的船舶类型识别精度表明,传统方法并不能很好提取不同船舶类型的特征,而本发明的级联式由粗到精的深度卷积神经网络的船舶类型识别方法可以较好地找到不同船舶类型的深度特征,获得可靠的船舶识别结果。As shown in FIG. 7, this embodiment also uses the existing K-Nearest Neighbor (KNN), artificial neural network (ANN), random forest (RF), and traditional convolution. The neural network (convolutional neural network, CNN) method compares the recognition results of different ship types. The KNN algorithm and the ANN algorithm have the lowest recognition accuracy for chemical tankers, and their recognition accuracy is 29.8% and 28.1%, respectively. The recognition accuracy is the lowest, and its recognition accuracy is only 41.2%. The traditional CNN algorithm has a recognition accuracy of 61.3% and 63.2% for chemical tankers and LNG ships respectively, while the cascaded deep convolutional neural network of the present invention has a recognition accuracy of 65.6 for the above two ship types. % And 66.7%. The above-mentioned traditional methods (KNN, ANN and RF) and the accuracy of ship type recognition based on traditional CNN deep learning methods show that the traditional method cannot well extract the characteristics of different ship types, and the cascaded depth of the present invention ranges from coarse to fine The ship type recognition method of convolutional neural network can better find the depth characteristics of different ship types and obtain reliable ship recognition results.
以上详细描述了本发明的较佳具体实施例。应当理解,本领域的普通技术人员无需创造性劳动就可以根据本发明的构思作出诸多修改和变化。因此,凡本技术领域中技术人员依本发明的构思在现有技术的基础上通过逻辑分析、推理或者有限的实验可以得到的技术方案,皆应在由权利要求书所确定的保护范围内。The preferred embodiments of the present invention have been described in detail above. It should be understood that those skilled in the art can make many modifications and changes according to the concept of the present invention without creative work. Therefore, any technical solution that can be obtained by a person skilled in the technical field based on the concept of the present invention through logic analysis, reasoning, or limited experiments based on the prior art should fall within the protection scope determined by the claims.

Claims (10)

  1. 一种级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,包括以下步骤:A cascaded deep to convolutional neural network ship type recognition method from coarse to fine, which is characterized by including the following steps:
    S1:输入所有船舶类型的图片和对应的图片标签,对由粗到精的深度卷积神经网络进行粗糙级训练,得到由粗到精的深度卷积神经网络的设置参数,及不同船舶类型的训练识别精度;S1: Input pictures of all ship types and corresponding picture tags, and perform coarse-level training on the deep convolutional neural network from coarse to fine, to obtain the setting parameters of the deep convolutional neural network from coarse to fine, and the settings of different ship types. Training recognition accuracy;
    S2:利用所述训练识别精度最低的船舶类型的图片,对由粗到精的深度卷积神经网络进行精细级训练,如果深度卷积神经网络尚未达到预设的收敛条件,则返回步骤S1继续训练,否则执行步骤S3;S2: Use the training to identify the ship type with the lowest accuracy picture to perform fine-level training on the deep to convolutional neural network from coarse to fine. If the deep convolutional neural network has not reached the preset convergence condition, return to step S1 to continue Training, otherwise execute step S3;
    S3:对图片中的船舶进行类型识别,并输出船舶类型的识别结果。S3: Perform type recognition on the ship in the picture, and output the recognition result of the ship type.
  2. 根据权利要求1所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的步骤S1包括以下步骤:The method according to claim 1, wherein the step S1 comprises the following steps:
    S11:将初始输入的船舶类型的图片裁剪为固定尺寸,匹配输入的船舶类型的图片及对应的图片标签,得到格式化的船舶图像和图像标签;S11: crop the originally input ship type picture to a fixed size, match the input ship type picture and corresponding picture tag, and obtain a formatted ship image and image tag;
    S12:利用所述的格式化的船舶图像和图像标签,对级联式由粗到精的深度卷积神经网络进行训练,得到级联式由粗到精的深度卷积神经网络的设置参数,提取不同的船舶类型的深度特征;S12: using the formatted ship image and image label to train a cascaded deep convolutional neural network from coarse to fine to obtain the setting parameters of the cascaded deep convolutional neural network from coarse to fine, Extract depth characteristics of different ship types;
    S13:根据所述深度特征,得到输入的船舶类型的图片的置信度水平分布,将置信度值最大对应的船舶类型输出为单次训练识别结果;S13: Obtain the confidence level distribution of the input ship type picture according to the depth feature, and output the ship type corresponding to the maximum confidence value as a single training recognition result;
    S14:根据所述单次训练识别结果,比较图片标签对应的船舶真实所属类型,获得不同船舶类型的训练识别精度。S14: According to the recognition result of the single training, compare the true type of the ship corresponding to the picture tag to obtain the training recognition accuracy of different ship types.
  3. 根据权利要求2所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的步骤S12包括以下步骤:The cascaded deep to convolutional neural network ship type recognition method according to claim 2, wherein the step S12 comprises the following steps:
    S121:所述的级联式由粗到精的深度卷积神经网络利用卷积层提取船舶特征;所述船舶特征包括:低级船舶特征,包括船舶纹理、轮廓和角点;高级船舶特征,根据船舶的不同类型对低级船舶特征进行相应的高度抽象而获得;S121: The cascaded deep-to-fine convolutional neural network uses convolution layers to extract ship features; the ship features include: low-level ship features, including ship textures, contours, and corner points; advanced ship features, according to Different types of ships are obtained by correspondingly abstracting the characteristics of low-level ships;
    S122:所述的级联式由粗到精的深度卷积神经网络利用池化层对所述的船舶特征降维学习;S122: The cascaded deep convolutional neural network from coarse to fine uses the pooling layer to reduce the dimension of the ship features and learn;
    S123:所述的级联式由粗到精的深度卷积神经网络利用局部响应归一化层,增加对所述的卷积层提取的船舶特征的局部响应随机赋予更大的响应值,提取泛化船舶特征。S123: The cascaded deep to convolutional neural network uses a local response normalization layer to increase the local response of the ship feature extracted by the convolution layer to randomly assign a larger response value and extract Generalize ship characteristics.
  4. 根据权利要求3所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的步骤S13包括以下步骤:The cascaded deep to convolutional neural network ship type recognition method according to claim 3, wherein the step S13 comprises the following steps:
    S131:所述的级联式由粗到精的深度卷积神经网络利用全连接层,将所述的泛化船舶特征映射为单一船舶特征向量,其表达式如下:S131: The cascaded deep-to-fine convolutional neural network uses a fully connected layer to map the generalized ship feature to a single ship feature vector, and its expression is as follows:
    F out=Θ×F in F out = Θ × F in
    其中:F out是全连接层输出的单一船舶特征向量,该特征向量共有n 1个元素;F in是 输入的泛化船舶特征,其维度为n 2+1;Θ是特征向量的F in和F out的连接矩阵,维度是n 1×(n 2+1); Among them: F out is a single ship feature vector output from the fully connected layer, which has a total of n 1 elements; F in is the input generalized ship feature, its dimension is n 2 +1; Θ is the F in and F out 's connection matrix with dimensions n 1 × (n 2 +1);
    S132:所述的级联式由粗到精的深度卷积神经网络利用损失层,根据所述的单一船舶特征向量作为输入的船舶图像生成概率向量,该向量中的元素表示船舶所属类型的概率,计算表达式如下:S132: The cascaded deep-to-fine convolutional neural network uses a loss layer to generate a probability vector based on the single ship feature vector as an input ship image, and the elements in the vector represent the probability of the type of ship The calculation expression is as follows:
    Figure PCTCN2019092016-appb-100001
    Figure PCTCN2019092016-appb-100001
    其中,F p是所述损失层的单一船舶特征向量;v j是计算船舶概率向量时第j类船舶类型对应的权值。 Among them, F p is a single ship feature vector of the loss layer; v j is a weight corresponding to the j-th ship type when calculating a ship probability vector.
  5. 根据权利要求1或2或3或4所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的步骤S14的船舶类型的训练识别精度e 1t的计算表达式如下: The method for identifying a ship type according to a cascaded deep convolutional neural network from coarse to fine according to claim 1 or 2 or 3 or 4, characterized in that, in step S14, the training type recognition accuracy e 1t The calculation expression is as follows:
    Figure PCTCN2019092016-appb-100002
    Figure PCTCN2019092016-appb-100002
    其中:N s是待识别的船舶图片总数量;N er是类型识别错误的船舶图片总数量。 Among them: N s is the total number of ship pictures to be identified; N er is the total number of ship pictures with incorrect type recognition.
  6. 根据权利要求1所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述步骤S2包括以下步骤:The method for identifying a ship type of a cascaded deep convolutional neural network from coarse to fine according to claim 1, wherein the step S2 comprises the following steps:
    S21:根据所述训练识别精度最低的船舶类型的图片,获取该船舶类型的所有训练图片,作为所述的由粗到精的深度卷积神经网络的精细级训练的输入样本;S21: Acquire all the training pictures of the ship type according to the pictures of the ship type with the lowest training recognition accuracy, as the input samples for the fine-level training of the coarse to fine deep convolutional neural network;
    S22:利用随机启发式选择方法,从数据增强、选择性丢弃方法和选择性连接方法中选择一种作为随机正则化机制;S22: Use a random heuristic selection method to select one of the data enhancement, selective discarding method, and selective connection method as a random regularization mechanism;
    S23:根据所述设置参数,利用所述随机正则化机制对所述的由粗到精的深度卷积神经网络进行训练,并得到精细化参数及其调节后的由粗到精的深度卷积神经网络;S23: According to the setting parameters, use the random regularization mechanism to train the coarse to fine deep convolutional neural network, and obtain the refined parameters and the adjusted coarse to fine deep convolution. Neural Networks;
    S24:根据所述精细化参数调节的由粗到精的深度卷积神经网络对图片的船舶类型进行重新识别,如果船舶类型的训练识别精度变化率小于预设阈值,则结束训练过程;如果船舶类型的训练识别精度变化率大于所述预设阈值,则所述的由粗到精的深度卷积神经网络完成当前的精细级的训练,并返回所述的步骤S1继续训练。S24: The coarse-to-fine deep convolutional neural network adjusted according to the refinement parameters is used to re-identify the ship type of the picture. If the training type recognition accuracy change rate is less than a preset threshold, the training process ends; if the ship type The type of training recognition accuracy change rate is greater than the preset threshold, the coarse-to-fine deep convolutional neural network completes the current fine-level training, and returns to step S1 to continue training.
  7. 根据权利要求6所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的数据增强包括对所述训练图片的水平/竖直翻转、颜色改变和/或随机改变所述训练图片的大小。The method according to claim 6, wherein the data enhancement includes horizontal / vertical flipping of the training picture, color change, and / Or randomly change the size of the training picture.
  8. 根据权利要求6所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的选择性丢弃方法以预设概率休眠卷积层的部分神经元,且卷积层所有神经元都以相同的概率休眠或停止休眠。The method according to claim 6, wherein the selective discarding method sleeps some neurons of the convolutional layer with a preset probability, and All neurons in the convolutional layer sleep or cease to sleep with the same probability.
  9. 根据权利要求6所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的选择性连接方法是对卷积层神经元的权重进行随机修改,从而弱化或者强化该层神经元提取的船舶特征对船舶类型识别的精度的影响。The method according to claim 6, wherein the selective connection method is to randomly modify the weights of the neurons in the convolutional layer, so that Weaken or strengthen the influence of ship features extracted by neurons in this layer on the accuracy of ship type recognition.
  10. 根据权利要求6所述的级联式由粗到精的深度卷积神经网络船舶类型识别方法,其特征在于,所述的船舶类型的训练识别精度的变化率P ia计算公式如下: The cascaded deep to convolutional neural network ship type recognition method according to claim 6, wherein the calculation rate of the change rate P ia of the training recognition accuracy of the ship type is as follows:
    Figure PCTCN2019092016-appb-100003
    Figure PCTCN2019092016-appb-100003
    其中:A jc是步骤S1的第j类船舶类型识别精度;A jf是步骤S2的第j类船舶类型识别精度。 Among them: A jc is the accuracy of class j ship type recognition in step S1; A jf is the accuracy of class j ship type recognition in step S2.
PCT/CN2019/092016 2018-09-04 2019-06-20 Vessel type identification method based on coarse-to-fine cascaded convolutional neural network WO2020048183A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CA3084451A CA3084451C (en) 2018-09-04 2019-06-20 Vessel type identification method using coarse-to-fine cascaded convolutional neural network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811025411.0A CN109299671A (en) 2018-09-04 2018-09-04 A kind of tandem type is by slightly to the convolutional neural networks Ship Types recognition methods of essence
CN201811025411.0 2018-09-04

Publications (1)

Publication Number Publication Date
WO2020048183A1 true WO2020048183A1 (en) 2020-03-12

Family

ID=65166226

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/092016 WO2020048183A1 (en) 2018-09-04 2019-06-20 Vessel type identification method based on coarse-to-fine cascaded convolutional neural network

Country Status (3)

Country Link
CN (1) CN109299671A (en)
CA (1) CA3084451C (en)
WO (1) WO2020048183A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112052762A (en) * 2020-08-27 2020-12-08 西安电子科技大学 Small sample ISAR image target identification method based on Gaussian prototype
CN113409325A (en) * 2020-12-24 2021-09-17 华中科技大学 Large-breadth SAR image ship target detection and identification method based on fine segmentation
CN116385806A (en) * 2023-05-29 2023-07-04 四川大学华西医院 Method, system, equipment and storage medium for classifying strabismus type of eye image

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109299671A (en) * 2018-09-04 2019-02-01 上海海事大学 A kind of tandem type is by slightly to the convolutional neural networks Ship Types recognition methods of essence
CN110060508B (en) * 2019-04-08 2020-11-20 武汉理工大学 Automatic ship detection method for inland river bridge area
CN110232319B (en) * 2019-05-07 2021-04-06 杭州电子科技大学 Ship behavior identification method based on deep learning
CN110363171A (en) * 2019-07-22 2019-10-22 北京百度网讯科技有限公司 The method of the training method and identification sky areas of sky areas prediction model
CN111950476A (en) * 2020-08-17 2020-11-17 重庆大学 Deep learning-based automatic river channel ship identification method in complex environment
CN112232269B (en) * 2020-10-29 2024-02-09 南京莱斯网信技术研究院有限公司 Ship identity intelligent recognition method and system based on twin network
CN114898308B (en) * 2022-07-14 2022-09-20 上海鹰觉科技有限公司 Ship cockpit position detection method and system based on deep convolutional neural network
CN115457388B (en) * 2022-09-06 2023-07-28 湖南经研电力设计有限公司 Power transmission and transformation remote sensing image ground object identification method and system based on deep learning optimization

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182735A (en) * 2014-08-18 2014-12-03 厦门美图之家科技有限公司 Training optimization pornographic picture or video detection method based on convolutional neural network
US9665802B2 (en) * 2014-11-13 2017-05-30 Nec Corporation Object-centric fine-grained image classification
CN106778646A (en) * 2016-12-26 2017-05-31 北京智芯原动科技有限公司 Model recognizing method and device based on convolutional neural networks
CN108399420A (en) * 2018-01-30 2018-08-14 北京理工雷科电子信息技术有限公司 A kind of visible light naval vessel false-alarm elimination method based on depth convolutional network
CN109299671A (en) * 2018-09-04 2019-02-01 上海海事大学 A kind of tandem type is by slightly to the convolutional neural networks Ship Types recognition methods of essence

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103824054B (en) * 2014-02-17 2018-08-07 北京旷视科技有限公司 A kind of face character recognition methods based on cascade deep neural network
CN104036474B (en) * 2014-06-12 2017-12-19 厦门美图之家科技有限公司 A kind of Automatic adjustment method of brightness of image and contrast
CN106682616B (en) * 2016-12-28 2020-04-21 南京邮电大学 Method for recognizing neonatal pain expression based on two-channel feature deep learning
CN107145903A (en) * 2017-04-28 2017-09-08 武汉理工大学 A kind of Ship Types recognition methods extracted based on convolutional neural networks picture feature
CN107392314A (en) * 2017-06-30 2017-11-24 天津大学 A kind of deep layer convolutional neural networks method that connection is abandoned based on certainty
CN107609601B (en) * 2017-09-28 2021-01-22 北京计算机技术及应用研究所 Ship target identification method based on multilayer convolutional neural network

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104182735A (en) * 2014-08-18 2014-12-03 厦门美图之家科技有限公司 Training optimization pornographic picture or video detection method based on convolutional neural network
US9665802B2 (en) * 2014-11-13 2017-05-30 Nec Corporation Object-centric fine-grained image classification
CN106778646A (en) * 2016-12-26 2017-05-31 北京智芯原动科技有限公司 Model recognizing method and device based on convolutional neural networks
CN108399420A (en) * 2018-01-30 2018-08-14 北京理工雷科电子信息技术有限公司 A kind of visible light naval vessel false-alarm elimination method based on depth convolutional network
CN109299671A (en) * 2018-09-04 2019-02-01 上海海事大学 A kind of tandem type is by slightly to the convolutional neural networks Ship Types recognition methods of essence

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112052762A (en) * 2020-08-27 2020-12-08 西安电子科技大学 Small sample ISAR image target identification method based on Gaussian prototype
CN113409325A (en) * 2020-12-24 2021-09-17 华中科技大学 Large-breadth SAR image ship target detection and identification method based on fine segmentation
CN113409325B (en) * 2020-12-24 2022-09-23 华中科技大学 Large-breadth SAR image ship target detection and identification method based on fine segmentation
CN116385806A (en) * 2023-05-29 2023-07-04 四川大学华西医院 Method, system, equipment and storage medium for classifying strabismus type of eye image
CN116385806B (en) * 2023-05-29 2023-09-08 四川大学华西医院 Method, system, equipment and storage medium for classifying strabismus type of eye image

Also Published As

Publication number Publication date
CA3084451C (en) 2024-02-20
CA3084451A1 (en) 2020-03-12
CN109299671A (en) 2019-02-01

Similar Documents

Publication Publication Date Title
WO2020048183A1 (en) Vessel type identification method based on coarse-to-fine cascaded convolutional neural network
CN110674714B (en) Human face and human face key point joint detection method based on transfer learning
CN109919108B (en) Remote sensing image rapid target detection method based on deep hash auxiliary network
CN110598029B (en) Fine-grained image classification method based on attention transfer mechanism
US10515275B2 (en) Intelligent digital image scene detection
CN112434672A (en) Offshore human body target detection method based on improved YOLOv3
CN113569667B (en) Inland ship target identification method and system based on lightweight neural network model
CN111967480A (en) Multi-scale self-attention target detection method based on weight sharing
US20230047131A1 (en) Contour shape recognition method
CN107704859A (en) A kind of character recognition method based on deep learning training framework
CN115035361A (en) Target detection method and system based on attention mechanism and feature cross fusion
CN114241340A (en) Image target detection method and system based on double-path depth residual error network
CN113610180A (en) Visible light image and infrared image fusion ship classification method and device based on deep learning
CN113177503A (en) Arbitrary orientation target twelve parameter detection method based on YOLOV5
CN112381030A (en) Satellite optical remote sensing image target detection method based on feature fusion
CN113205103A (en) Lightweight tattoo detection method
CN115471746A (en) Ship target identification detection method based on deep learning
Chan et al. VGGreNet: A light-weight VGGNet with reused convolutional set
Yulin et al. Wreckage target recognition in side-scan sonar images based on an improved faster r-cnn model
CN113436125B (en) Side-scan sonar simulation image generation method, device and equipment based on style migration
CN114565824A (en) Single-stage rotating ship detection method based on full convolution network
WO2022222233A1 (en) Usv-based obstacle segmentation network and method for generating same
Zhang et al. Integrate traditional hand-crafted features into modern CNN-based models to further improve SAR ship classification accuracy
WO2024032010A1 (en) Transfer learning strategy-based real-time few-shot object detection method
CN113850274A (en) Image classification method based on HOG characteristics and DMD

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19857045

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 3084451

Country of ref document: CA

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19857045

Country of ref document: EP

Kind code of ref document: A1