CN109376576A - The object detection method for training network from zero based on the intensive connection of alternately update - Google Patents

The object detection method for training network from zero based on the intensive connection of alternately update Download PDF

Info

Publication number
CN109376576A
CN109376576A CN201810951609.5A CN201810951609A CN109376576A CN 109376576 A CN109376576 A CN 109376576A CN 201810951609 A CN201810951609 A CN 201810951609A CN 109376576 A CN109376576 A CN 109376576A
Authority
CN
China
Prior art keywords
attention
training
module
longitudinal
object detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810951609.5A
Other languages
Chinese (zh)
Inventor
年睿
徐晓
张世昌
郝宝趁
孔双
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ocean University of China
Original Assignee
Ocean University of China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ocean University of China filed Critical Ocean University of China
Priority to CN201810951609.5A priority Critical patent/CN109376576A/en
Publication of CN109376576A publication Critical patent/CN109376576A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • G06V20/41Higher-level, semantic clustering, classification or understanding of video scenes, e.g. detection, labelling or Markovian modelling of sport events or news items
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

The present invention provides a kind of based on object detection method of the intensive connection from zero training network is alternately updated, and collection target image first makes data set, and is labeled;Carry out intensively being connected to again alternately update module, bounded can deformation convolution module, decouple based on the attention power module, spatial scaling feature pyramid module and variable segment that combine from the transverse and longitudinal of attention and channel attention the training of convolution module, obtain training pattern;Training image is obtained, data set is obtained, detection identification finally is carried out to target image using data set and training pattern.The present invention will also promote to realize that deformation, posture, dimensional variation is big and has the high-precision identification for blocking underwater movement objective efficiently from the development of zero training study mechanism from distorted movement target detection and follow under deep learning angle auxiliary water.

Description

The object detection method for training network from zero based on the intensive connection of alternately update
Technical field
The present invention relates to the alternatings being oriented to based on attention to update underwater distorted movement of the intensive connection from zero training network Object detection method belongs to Intelligent Information Processing and object detection and recognition technical field.
Background technique
Object detection and recognition is link important in vision system, and target detection technique is in video monitoring, intelligence machine The fields such as people's navigation, automatic Pilot, gesture recognition, Shape-memory behavior have broad application prospects.Object detection and recognition is to close Reason utilizes and protection marine resources, permanently effective multi-angle monitoring ocean, is also cultivation fishery, marine fishing and Fish behavior The offers basic data and information support such as analysis.However for underwater distorted movement target, since marine optics is at slice Part restricts, and vulnerable to illumination, visual angle, block, the factors such as form and dimensional variation influence so that pole occurs for the external appearance characteristic of target Big variation, to bring great challenge to image detection and identification.
In recent years, target detection achieves quantum jump, has benefited from deep learning --- mainly convolutional neural networks (Convolution Neural Network, CNN) and candidate region (Region Proposal) algorithm.Target detection and knowledge Other main stream approach includes: conventional target detection, deep learning target detection based on Region Proposal and based on returning The deep learning target detection of method.State-of-the-art target detection identification network is very dependent on this kind of big in Imagenet at present Classification task is trained to obtain feature extraction network and then training objective detection identification on type categorized data set in advance, but due to classification It will lead to study deviation with difference of the detection identification on loss function and classification, model fine tuning can alleviate this deviation but not This deviation can be fundamentally solved, and pre-training model is moved into the bigger detection identification neck of difference domain from classification task Domain is more difficult.The DSOD method proposed gives a very good solution thinking within 2017, utilizes the outstanding ladder of DenseNet Degree transmission mechanism can not depend on pre-training disaggregated model start from scratch training detection identification network.
Target detection is extremely important a part in computer vision field, blocks deformation, background between object Complexity, illumination variation, dimensional variation etc. are urgent problems to be solved in detection process.It is i.e. existing in the prior art mainly to ask Topic: (1) since marine optics image-forming condition restrict, and vulnerable to illumination, visual angle, block, the factors such as form and dimensional variation influence, So that the problem of greatly variation, occurs for the external appearance characteristic of target;(2) since image object lacks contextual information, in image object In the case where partial occlusion or deformation occurs, the problem of will lead to target detection mistake.
Summary of the invention
In view of the above-mentioned problems, the object of the present invention is to provide a kind of alternating based on attention guiding update intensive connection from The underwater distorted movement object detection and recognition method of zero training network, to realize for distorted movement target's feature-extraction, spy Sign refining, the unification of attention, it is intended to quickly excavate and detect in the underwater observation data low from magnanimity, high speed, value density Distorted movement target, to make up the deficiencies in the prior art.
In order to achieve the above objectives, the present invention adopts the following technical scheme:
A kind of underwater distorted movement target inspection that the intensive connection of alternating update based on attention guiding trains network from zero Survey and recognition methods, comprising the following steps:
(1) it collects target image and makes data set, and be labeled;
(2) it intensive connection alternating update module: in order to not depend on classification pre-training model from zero training, avoids classifying and examine Study deviation and data set cross-border issue caused by difference of the identification on loss function and classification are surveyed, especially for from waterborne Data set is transitioned into the problem of underwater data collection, using intensive connection alternating update module, relies on its outstanding gradient conveyer System, efficient feature extraction and feature refining effect, included attention effect realize feature extraction, feature refining, pay attention to The unification of power;
(3) bounded can deformation convolution module: in order to overcome the detection of deformation target to identify difficult problem, handing in intensive connection For use after update module bounded can deformation convolution module, can deformation convolution the limitation of receptive field is not present, there is study sense Target area can be effectively paid close attention to the modified receptive field of the variation of input picture by the effect of open country offset, and Can deformation convolution there is very strong adaptive faculty for target deformation, the detection effect of network will not restrict by target deformation;
(4) based on the attention power module combined from the transverse and longitudinal of attention and channel attention: in order to preferably optimize transmitting The intensive connection feature that alternately update module is extracted, using based on the attention combined from the transverse and longitudinal of attention and channel attention Module is laterally intended to pay close attention to the relationship between area-of-interest and hard objectives, is longitudinally intended to pay close attention to the important of different channel characteristics Property, reinforce important feature, weaken inessential feature, be characterized extraction module and convey superior feature, is drawn using attention force characteristic Lead feature transmission, dominant carry out characteristic optimization;
(5) spatial scaling feature pyramid module: accuracy of identification is detected to improve, in network header feature pyramid module It is middle to replace up-sampling using spatial scaling, the fusion between different resolution feature is carried out under the premise of not destructive characteristics, together When greatly reduce the calculation amount of network header;
(6) variable segment decouples convolution: to solve the problems, such as that the detection from zero training identifies that network convergence is slow and common Convolution is difficult to distinguish the problem of difference and class inherited in class, using the amplitude and angle of decoupling convolution, uses amplitude measure class Interior difference, angle measure class inherited, and direct study amplitude and angle is taken to replace being fitted;
(7) training image is obtained, and carries out scale amplification to it and hide at random to obtain { In, n=1,2 ..., N };
(8) the target detection identification of model training: { I is usedn, n=1,2 ..., N and trained model to target Image carries out detection identification.
Further, the intensive connection alternating update module in the step (2):
X expression characteristic pattern, k represent the number alternately updated, and i represents i-th layer in certain alternating update, and W is represented wait instruct Experienced weight, * represent convolution operation, and g represents nonlinear function, m, and l is the index of summation sign.
Further, the bounded in the step (3) can deformation convolution module:
X indicates that characteristic pattern, p indicate the p of characteristic pattern X0Position and p=p after the offset of position receptive field0+ Δ p, wherein Δ p is the variable that can learn, and q indicates the position for four integral points being located at around p, and G (q, p) is the interpolation for asking p and q, is passed through The mode of interpolation finds out the characteristic pattern after offset.
Further, in the step (4) based on the attention mould combined from the transverse and longitudinal of attention and channel attention Block, wherein lateral attention:
F (x)=Wfx
G (x)=Wgx
yi=γ oi
Wherein, x is the characteristic pattern of input, and W is weight to be trained, and f, g, h is respectively three kinds for laterally paying attention to power module Feature extraction mode, γ are the significance level of lateral attention, are a trainable variable, yiLaterally to pay attention to power module Output;
Longitudinal attention are as follows:
S=σ (W2δ(W1Z))
Wherein, Z is the feature in global average pond, and H, W are characterized the ranks size of figure, and U is to input longitudinal attention Characteristic pattern, σ, δ are nonlinear function, and W is weight to be trained, and s is longitudinal output for paying attention to power module;
Finally, lateral attention and longitudinal attention are merged:
Y=(X+yi)*(1+s)
Wherein, X is the input that transverse and longitudinal pays attention to power module, and Y is the output that transverse and longitudinal pays attention to power module.
Further, the spatial scaling feature pyramid module in the step (5):
Wherein I is characterized figure, and before LR is spatial scaling, after SR is spatial scaling, r be the step-length of conversion, x, y, and c divides Not Wei column coordinate, row coordinate, depth coordinate.
Further, the variable segment in the step (6) decouples convolution:
Wherein, x is the characteristic pattern of input, and w is weight to be trained, and β, ρ are that can train variable, and E is to seek desired value.
Further, the size 7 for the characteristic pattern that the multiple dimensioned training method in the step (7) is finally obtained using network ~10 times be used as network inputs, this is conducive to network for the target detection precision of different scale, at the same to the pixel of image into Row blocks at random, and the detection accuracy of identification of network can be improved in this in the training process, promote the entirety of network attention target and It is not a certain position.
Beneficial effects of the present invention:
The present invention from distorted movement target detection and follow under deep learning angle auxiliary water, also will promote efficiently from The development of zero training study mechanism realizes that deformation, posture, dimensional variation is big and has the high-precision for blocking underwater movement objective to know Not.It can be inclined to avoid classifying and detecting study caused by difference of the identification on loss function and classification from zero training study mechanism Difference and data set cross-border issue, and the intensive connection alternating update module proposed also achieves feature extraction, feature refines, The unification of attention.Can deformation convolution sum transverse and longitudinal pay attention to power module effectively overcome deformation target detection know Not, while using the transmission of attention guide features, play the role of further refining feature.Spatial scaling feature pyramid module The fusion of different resolution feature can be carried out under the premise of not destructive characteristics, improve the precision of detection identification.Variable segment Decoupling convolution can solve that slow from the detection identification network convergence of zero training and common convolution is difficult to distinguish difference and class in class Between difference the problem of.Finally, multiple dimensioned and hiding at random training method can be improved network for target scale and block Robustness.
Detailed description of the invention
Fig. 1 is overall flow figure of the invention.
Fig. 2 is that marine environment is to be detected in embodiment 1 and identifies image.
Fig. 3 is intensive connection of the invention alternately update module figure.
Fig. 4 is that bounded of the invention can deformation convolution module figure.
Fig. 5 is the attention module map of the invention combined from the transverse and longitudinal of attention and channel attention.
Fig. 6 is spatial scaling feature pyramid module map of the invention.
Fig. 7 is multiple dimensioned and random hiding training method schematic diagram of the invention.
Fig. 8 is the detection recognition result figure in example 1.
Specific embodiment
To keep the purpose of the present invention, embodiment and advantage relatively sharp, with reference to the accompanying drawing and pass through specific embodiment It is next that present invention be described in more detail.
Embodiment 1: being detection identification object with the dynamic of ocean underwater environment Mesichthyes.
The specific flow chart of the present embodiment is as shown in Figure 1.
Specifically using one section as shown in Fig. 2, under the marine environment that Shandong Province's aquafarm is shot in the present embodiment In the daytime fish movement video (1920*1080 pixel, 25 frame per second) is as to be detected and identification video.
Following steps should be described in detail in conjunction with attached drawing and concrete outcome, and should be general in summary of the invention The step of condition.
Step 1: a large amount of fish image making data sets that will be taken in aquafarm, mark all fishes in image Position, type;
Step 2: as shown in figure 3, alternately update module realizes feature extraction, feature refining, attention by intensively connection Unification:
X indicate characteristic pattern, k represent alternately update number, i represent certain time alternately update in i-th layer, W represent to Trained weight, * represent convolution operation, and g represents nonlinear function.
Step 3: in order to overcome the detection of deformation target to identify difficult problem, it can deformation using bounded after step 2 Convolution module, as shown in figure 4, can deformation convolution be not present receptive field limitation, have the function of learn receptive field deviate, with The modified receptive field of the variation of input picture, can effectively pay close attention to target area, and can deformation convolution for target Deformation has very strong adaptive faculty, and the detection effect of network will not be restricted by target deformation, and formula is as follows:
X indicates that characteristic pattern, p indicate the p of characteristic pattern X0Position and p=p after the offset of position receptive field0+ Δ p, wherein Δ p is the variable that can learn, and q indicates the position for four integral points being located at around p, and G (q, p) is the interpolation for asking p and q, is passed through The mode of interpolation finds out the characteristic pattern after offset.
Step 4: used after step 3 based on the attention power module combined from the transverse and longitudinal of attention and channel attention, As shown in figure 5, preferably to optimize the intensive connection feature that alternately update module is extracted of transmitting, wherein lateral attention:
F (x)=Wfx
G (x)=Wgx
yi=γ oi
Wherein x is the characteristic pattern of input, and W is weight to be trained, and f, g, h is respectively three kinds of spies for laterally paying attention to power module Extracting mode is levied, it is a trainable variable, y that γ, which is the significance level of lateral attention,iLaterally to pay attention to the defeated of power module Out.
Longitudinal attention are as follows:
S=σ (W2δ(W1Z))
Wherein, Z is the feature in global average pond, and H, W are characterized the ranks size of figure, and U is to input longitudinal attention Characteristic pattern, σ, δ are nonlinear function, and W is weight to be trained, and s is longitudinal output for paying attention to power module.
Finally, lateral attention and longitudinal attention are merged:
Y=X+s*X+yi
Wherein, X is the input that transverse and longitudinal pays attention to power module, and Y is the output that transverse and longitudinal pays attention to power module.
Step 5: three times by Module cycle of the step 2 into step 4, then using spatial scaling feature pyramid mould Block improves detection accuracy of identification, replaces up-sampling using spatial scaling in the network header feature pyramid module, is not breaking The fusion between different resolution feature is carried out under the premise of bad feature, while greatly reducing the calculation amount of network header.
Spatial scaling feature pyramid module, as shown in Figure 6:
Wherein I is characterized figure, and before LR is spatial scaling, after SR is spatial scaling, r be the step-length of conversion, x, y, and c divides Not Wei column coordinate, row coordinate, depth coordinate.
Note: all convolution are all the variable segment decoupling convolution used in the present invention, are able to solve the detection from zero training The identification slow problem of network convergence and common convolution are difficult to distinguish the problem of difference and class inherited in class, use decoupling convolution Amplitude and angle, using difference in amplitude measure class, angle measures class inherited, direct study amplitude and angle to replace It is fitted.
Step 6: obtaining training image, and carry out scale amplification to it and hide at random to obtain { In, n=1,2 ..., N }, as shown in Figure 7;
Step 7: the target detection of model training identifies: using { In, n=1,2 ..., N } to what is be oriented to based on attention It alternately updates intensive connection to be trained from zero training network to obtain training pattern, using trained model to target image Carry out detection identification.
Detection, identification and statistics result is shown in Fig. 8, the interior fish to be identified of rectangle frame, and carries out above rectangle frame The display of target category and confidence level result, verified detection recognition result is essentially identical with legitimate reading, that is, illustrates this hair It is bright that the feasibility and high accuracy of detection method are provided.

Claims (7)

1. a kind of object detection method for training network from zero based on the intensive connection of alternately update, which is characterized in that including following Step:
(1) it collects target image and makes data set, and be labeled;
(2) intensive connection alternating update module;
(3) bounded can deformation convolution module;
(4) based on the attention power module combined from the transverse and longitudinal of attention and channel attention;
(5) spatial scaling feature pyramid module;
(6) variable segment decouples convolution;
(7) training image is obtained, data set { I is obtainedn, n=1,2 ..., N }, obtain training pattern;
(8) the target detection identification of model training: { I is usedn, n=1,2 ..., N and training pattern to target image carry out Detection identification.
2. object detection method as described in claim 1, which is characterized in that the intensive connection alternating in the step (2) is more New module:
X expression characteristic pattern, k represent the number alternately updated, and i represents i-th layer in certain alternating update, and W represents to be trained Weight, * represent convolution operation, and g represents nonlinear function, m, and l is the index of summation sign.
3. object detection method as described in claim 1, which is characterized in that the bounded in the step (3) can deformation convolution Module:
X indicates that characteristic pattern, p indicate the p of characteristic pattern X0Position and p=p after the offset of position receptive field0+ Δ p, wherein Δ p be The variable that can learn, q indicate the position for four integral points being located at around p, and G (q, p) is the interpolation for asking p and q, passes through interpolation Mode finds out the characteristic pattern after offset.
4. object detection method as described in claim 1, which is characterized in that in the step (4) based on from attention and The attention power module that the transverse and longitudinal of channel attention combines, wherein lateral attention:
F (x)=Wfx
G (x)=Wgx
yi=γ oi
Wherein, x is the characteristic pattern of input, and W is weight to be trained, and f, g, h is respectively three kinds of features for laterally paying attention to power module Extracting mode, γ are the significance level of lateral attention, are a trainable variable, yiLaterally to pay attention to the defeated of power module Out;
Longitudinal attention are as follows:
S=σ (W2δ(W1Z))
Wherein, Z is the feature in global average pond, and H, W are characterized the ranks size of figure, and U is the feature for inputting longitudinal attention Figure, σ, δ are nonlinear function, and W is weight to be trained, and s is longitudinal output for paying attention to power module;
Finally, lateral attention and longitudinal attention are merged:
Y=(X+yi)*(1+s)
Wherein, X is the input that transverse and longitudinal pays attention to power module, and Y is the output that transverse and longitudinal pays attention to power module.
5. object detection method as described in claim 1, which is characterized in that the spatial scaling feature gold in the step (5) Word tower module:
Wherein I is characterized figure, and before LR is spatial scaling, after SR is spatial scaling, r be the step-length converted, x, y, and c is respectively Column coordinate, row coordinate, depth coordinate.
6. object detection method as described in claim 1, which is characterized in that the variable segment in the step (6) decouples volume Product:
Wherein, x is the characteristic pattern of input, and w is weight to be trained, and β, ρ are that can train variable, and E is to seek desired value.
7. object detection method as described in claim 1, which is characterized in that the multiple dimensioned training method in the step (7) 7~10 times of size of the characteristic pattern finally obtained using network are as network inputs.
CN201810951609.5A 2018-08-21 2018-08-21 The object detection method for training network from zero based on the intensive connection of alternately update Pending CN109376576A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810951609.5A CN109376576A (en) 2018-08-21 2018-08-21 The object detection method for training network from zero based on the intensive connection of alternately update

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810951609.5A CN109376576A (en) 2018-08-21 2018-08-21 The object detection method for training network from zero based on the intensive connection of alternately update

Publications (1)

Publication Number Publication Date
CN109376576A true CN109376576A (en) 2019-02-22

Family

ID=65403775

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810951609.5A Pending CN109376576A (en) 2018-08-21 2018-08-21 The object detection method for training network from zero based on the intensive connection of alternately update

Country Status (1)

Country Link
CN (1) CN109376576A (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110020658A (en) * 2019-03-28 2019-07-16 大连理工大学 A kind of well-marked target detection method based on multitask deep learning
CN110210571A (en) * 2019-06-10 2019-09-06 腾讯科技(深圳)有限公司 Image-recognizing method, device, computer equipment and computer readable storage medium
CN110232316A (en) * 2019-05-05 2019-09-13 杭州电子科技大学 A kind of vehicle detection and recognition method based on improved DSOD model
CN110516670A (en) * 2019-08-26 2019-11-29 广西师范大学 Suggested based on scene grade and region from the object detection method for paying attention to module
CN110619369A (en) * 2019-09-23 2019-12-27 常熟理工学院 Fine-grained image classification method based on feature pyramid and global average pooling
CN111027512A (en) * 2019-12-24 2020-04-17 北方工业大学 Remote sensing image shore-approaching ship detection and positioning method and device
CN111079604A (en) * 2019-12-06 2020-04-28 重庆市地理信息和遥感应用中心(重庆市测绘产品质量检验测试中心) Method for quickly detecting tiny target facing large-scale remote sensing image
CN111144364A (en) * 2019-12-31 2020-05-12 北京理工大学重庆创新中心 Twin network target tracking method based on channel attention updating mechanism
CN111210443A (en) * 2020-01-03 2020-05-29 吉林大学 Deformable convolution mixing task cascading semantic segmentation method based on embedding balance
CN111582225A (en) * 2020-05-19 2020-08-25 长沙理工大学 Remote sensing image scene classification method and device
CN111723829A (en) * 2019-03-18 2020-09-29 四川大学 Full-convolution target detection method based on attention mask fusion
CN111738045A (en) * 2020-01-19 2020-10-02 中国科学院上海微系统与信息技术研究所 Image detection method and device, electronic equipment and storage medium
CN111860619A (en) * 2020-07-02 2020-10-30 苏州富鑫林光电科技有限公司 Industrial detection AI intelligent model for deep learning
CN113239784A (en) * 2021-05-11 2021-08-10 广西科学院 Pedestrian re-identification system and method based on space sequence feature learning
CN113449756A (en) * 2020-03-26 2021-09-28 太原理工大学 Improved DenseNet-based multi-scale image identification method and device
CN117636078A (en) * 2024-01-25 2024-03-01 华南理工大学 Target detection method, target detection system, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101241593A (en) * 2007-02-06 2008-08-13 英特维数位科技股份有限公司 Picture layer image processing unit and its method
CN105447864A (en) * 2015-11-20 2016-03-30 小米科技有限责任公司 Image processing method, device and terminal
US20160310043A1 (en) * 2015-04-26 2016-10-27 Endochoice, Inc. Endoscopic Polyp Measurement Tool and Method for Using the Same
CN108038872A (en) * 2017-12-22 2018-05-15 中国海洋大学 One kind perceives follow method based on sound state target detection and Real Time Compression

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101241593A (en) * 2007-02-06 2008-08-13 英特维数位科技股份有限公司 Picture layer image processing unit and its method
US20160310043A1 (en) * 2015-04-26 2016-10-27 Endochoice, Inc. Endoscopic Polyp Measurement Tool and Method for Using the Same
CN105447864A (en) * 2015-11-20 2016-03-30 小米科技有限责任公司 Image processing method, device and terminal
CN108038872A (en) * 2017-12-22 2018-05-15 中国海洋大学 One kind perceives follow method based on sound state target detection and Real Time Compression

Non-Patent Citations (8)

* Cited by examiner, † Cited by third party
Title
HAN ZHANG ET AL: ""Self-Attention Generative Adversarial Networks"", 《ARXIV》 *
JIE HU ET AL: ""Squeeze-and-Excitation Networks"", 《ARXIV》 *
JIFENG DAI ET AL: ""Deformable Convolutional Networks"", 《ARXIV》 *
PENG ZHOU ET AL: ""Scale-Transferrable Object Detection"", 《CVPR》 *
WEIYANG LIU ET AL: ""Decoupled Networks"", 《ARXIV》 *
YIBO YANG ET AL: ""Convolutional Neural Networks with Alternately Updated Clique"", 《ARXIV》 *
ZHIQIANG SHEN ET AL: "DSOD: Learning Deeply Supervised Object Detectors from Scratch", 《ARXIV》 *
赵欣欣等: ""基于卷积神经网络的铁路桥梁高强螺栓缺失图像识别方法"", 《中国铁道科学》 *

Cited By (29)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111723829B (en) * 2019-03-18 2022-05-06 四川大学 Full-convolution target detection method based on attention mask fusion
CN111723829A (en) * 2019-03-18 2020-09-29 四川大学 Full-convolution target detection method based on attention mask fusion
CN110020658A (en) * 2019-03-28 2019-07-16 大连理工大学 A kind of well-marked target detection method based on multitask deep learning
CN110232316A (en) * 2019-05-05 2019-09-13 杭州电子科技大学 A kind of vehicle detection and recognition method based on improved DSOD model
CN110210571A (en) * 2019-06-10 2019-09-06 腾讯科技(深圳)有限公司 Image-recognizing method, device, computer equipment and computer readable storage medium
CN110348543A (en) * 2019-06-10 2019-10-18 腾讯医疗健康(深圳)有限公司 Eye fundus image recognition methods, device, computer equipment and storage medium
CN110348543B (en) * 2019-06-10 2023-01-06 腾讯医疗健康(深圳)有限公司 Fundus image recognition method and device, computer equipment and storage medium
CN110210571B (en) * 2019-06-10 2023-01-06 腾讯医疗健康(深圳)有限公司 Image recognition method and device, computer equipment and computer readable storage medium
CN110516670A (en) * 2019-08-26 2019-11-29 广西师范大学 Suggested based on scene grade and region from the object detection method for paying attention to module
CN110516670B (en) * 2019-08-26 2022-04-22 广西师范大学 Target detection method based on scene level and area suggestion self-attention module
CN110619369B (en) * 2019-09-23 2020-12-11 常熟理工学院 Fine-grained image classification method based on feature pyramid and global average pooling
CN110619369A (en) * 2019-09-23 2019-12-27 常熟理工学院 Fine-grained image classification method based on feature pyramid and global average pooling
CN111079604A (en) * 2019-12-06 2020-04-28 重庆市地理信息和遥感应用中心(重庆市测绘产品质量检验测试中心) Method for quickly detecting tiny target facing large-scale remote sensing image
CN111027512A (en) * 2019-12-24 2020-04-17 北方工业大学 Remote sensing image shore-approaching ship detection and positioning method and device
CN111027512B (en) * 2019-12-24 2023-04-18 北方工业大学 Remote sensing image quayside ship detection and positioning method and device
CN111144364B (en) * 2019-12-31 2022-07-26 北京理工大学重庆创新中心 Twin network target tracking method based on channel attention updating mechanism
CN111144364A (en) * 2019-12-31 2020-05-12 北京理工大学重庆创新中心 Twin network target tracking method based on channel attention updating mechanism
CN111210443A (en) * 2020-01-03 2020-05-29 吉林大学 Deformable convolution mixing task cascading semantic segmentation method based on embedding balance
CN111210443B (en) * 2020-01-03 2022-09-13 吉林大学 Deformable convolution mixing task cascading semantic segmentation method based on embedding balance
CN111738045A (en) * 2020-01-19 2020-10-02 中国科学院上海微系统与信息技术研究所 Image detection method and device, electronic equipment and storage medium
CN111738045B (en) * 2020-01-19 2024-04-19 中国科学院上海微系统与信息技术研究所 Image detection method and device, electronic equipment and storage medium
CN113449756A (en) * 2020-03-26 2021-09-28 太原理工大学 Improved DenseNet-based multi-scale image identification method and device
CN113449756B (en) * 2020-03-26 2022-08-16 太原理工大学 Improved DenseNet-based multi-scale image identification method and device
CN111582225A (en) * 2020-05-19 2020-08-25 长沙理工大学 Remote sensing image scene classification method and device
CN111582225B (en) * 2020-05-19 2023-06-20 长沙理工大学 Remote sensing image scene classification method and device
CN111860619A (en) * 2020-07-02 2020-10-30 苏州富鑫林光电科技有限公司 Industrial detection AI intelligent model for deep learning
CN113239784A (en) * 2021-05-11 2021-08-10 广西科学院 Pedestrian re-identification system and method based on space sequence feature learning
CN117636078A (en) * 2024-01-25 2024-03-01 华南理工大学 Target detection method, target detection system, computer equipment and storage medium
CN117636078B (en) * 2024-01-25 2024-04-19 华南理工大学 Target detection method, target detection system, computer equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109376576A (en) The object detection method for training network from zero based on the intensive connection of alternately update
Huang et al. Development and validation of a deep learning algorithm for the recognition of plant disease
CN109299274A (en) A kind of natural scene Method for text detection based on full convolutional neural networks
CN108805070A (en) A kind of deep learning pedestrian detection method based on built-in terminal
CN108509976A (en) The identification device and method of animal
Li et al. Semisupervised semantic segmentation of remote sensing images with consistency self-training
CN109117877A (en) A kind of Pelteobagrus fulvidraco and its intercropping kind recognition methods generating confrontation network based on depth convolution
CN110334656A (en) Multi-source Remote Sensing Images Clean water withdraw method and device based on information source probability weight
Liu et al. Two-stage underwater object detection network using swin transformer
Zhang et al. Adaptive anchor networks for multi-scale object detection in remote sensing images
Liu et al. Density-aware and background-aware network for crowd counting via multi-task learning
Ye et al. Recognition of terminal buds of densely-planted Chinese fir seedlings using improved YOLOv5 by integrating attention mechanism
Wang et al. Accurate real-time ship target detection using Yolov4
Wang et al. TBC-YOLOv7: a refined YOLOv7-based algorithm for tea bud grading detection
Peng et al. An adaptive coarse-fine semantic segmentation method for the attachment recognition on marine current turbines
Sun et al. Prediction model for the number of crucian carp hypoxia based on the fusion of fish behavior and water environment factors
Jia et al. Polar-Net: Green fruit instance segmentation in complex orchard environment
Xu et al. Multiscale edge-guided network for accurate cultivated land parcel boundary extraction from remote sensing images
Huang et al. A Stepwise Refining Image-Level Weakly Supervised Semantic Segmentation Method for Detecting Exposed Surface for Buildings (ESB) From Very High-Resolution Remote Sensing Images
CN118097709A (en) Pig posture estimation method and device
CN108230322A (en) A kind of eyeground feature detection device based on weak sample labeling
Zhao et al. Ocean ship detection and recognition algorithm based on aerial image
CN116758421A (en) Remote sensing image directed target detection method based on weak supervised learning
Mei et al. A Method Based on Knowledge Distillation for Fish School Stress State Recognition in Intensive Aquaculture.
CN115205215A (en) Corneal nerve image segmentation method and system based on Transformer

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190222