CN104217225A - A visual target detection and labeling method - Google Patents

A visual target detection and labeling method Download PDF

Info

Publication number
CN104217225A
CN104217225A CN201410442817.4A CN201410442817A CN104217225A CN 104217225 A CN104217225 A CN 104217225A CN 201410442817 A CN201410442817 A CN 201410442817A CN 104217225 A CN104217225 A CN 104217225A
Authority
CN
China
Prior art keywords
image
candidate region
training
bag
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410442817.4A
Other languages
Chinese (zh)
Other versions
CN104217225B (en
Inventor
黄凯奇
任伟强
王冲
张俊格
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Automation of Chinese Academy of Science
Original Assignee
Institute of Automation of Chinese Academy of Science
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Automation of Chinese Academy of Science filed Critical Institute of Automation of Chinese Academy of Science
Priority to CN201410442817.4A priority Critical patent/CN104217225B/en
Publication of CN104217225A publication Critical patent/CN104217225A/en
Application granted granted Critical
Publication of CN104217225B publication Critical patent/CN104217225B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The present invention discloses a visual target detection and labeling method. The method includes: an image inputting step, to input an image to be detected; a candidate region extracting step, to extract a candidate window as the candidate region from the image to be detected using selectively search algorithm; a feature description extracting step, to perform feature description on the candidate region using a pre-trained large-scale convolutional neural network and output the feature description of the candidate region; a visual target predicting step, to predict the candidate region based on the feature description of the candidate region using a pre-trained object detection module, to estimate regions having the visual target; and a position labeling step, to labeling the position of the visual target according to the estimated result. Experiments show that, compared with the mainstream week supervision visual target detection and labeling method, the present invention has a stronger ability to excavate positive samples and a more general application prospect, and is suitable for visual target detection and automatic labeling tasks on the large-scale data set.

Description

A kind of sensation target detects and mask method
Technical field
The present invention relates to object detection technical field in computer vision, particularly a kind of sensation target based on Weakly supervised study detects and mask method.
Background technology
It is basic problem of computer vision field that objects in images detects with automated location mark, is also one of the key problem that will study of this field.It is exactly given test pattern that objects in images detects, and answers what this problem somewhere.Object detection has a wide range of applications in a lot of other vision research problems, as the foreground detection under object identification, pedestrian detection, face detection, monitoring scene, motion tracking, behavior identification and analysis etc.
General object detection needs the given database that has marked object boundary rectangle, carries out model training to use based on pure object detection models that has supervision such as gradient orientation histogram (HOG), deformable member models (DPM).The high speed development of digital media technology, makes the data such as image, video occur explosive growth, and the universal of internet makes people can get more easily image, the video data of magnanimity.In the face of the view data of such magnanimity, the severe problem that current object detection and canonical algorithm need to be faced is that a large amount of data do not have available object space markup information.Mass image data is carried out to position mark, is a task that labour intensity is very high, cost is very high.
Comparatively speaking, whole image carried out to classification mark much easier, adopt to filter in advance also can realize without methods such as supervision clusterings and in the short time, construct fairly large taxonomy database.Thereby, utilize the image data base of only having classification annotation, realize and automatically carry out object Category Learning and location, realize sensation target by Weakly supervised study and detect and mark, there are important theory value and realistic meaning.
In traditional Weakly supervised learning algorithm, for the selection of candidate region, the general candidate window algorithm based on intensive collection, window number is very huge, and recall rate and registration are not very desirable.Meanwhile, conventionally adopt word bag model to be described to candidate window, the eigentransformation level of word bag model is conventionally few, and the feature obtaining can be thought middle level expression, lacks more high-rise information and allows model can automatically from image, excavate out object apparent model.
The method of current Weakly supervised object detection and mark aspect main flow comprises many learn-by-examples, topic model, condition random field etc.Traditional a lot of many learn-by-example algorithms are owing to depending on to a great extent core study or the learning framework based on distance metric, and use the very high optimized algorithms of complexity such as heuritic approach, quadratic programming, integer programming, be difficult to obtain efficient application on large-scale dataset.
Therefore, how improving and to optimize Weakly supervised learning algorithm and efficiently realize the object detection of large nuber of images and automated location mark, is the major issue that urgent need of the prior art solves.
Summary of the invention
In view of this, the sensation target that fundamental purpose of the present invention is to provide under Weakly supervised scene detects and mask method, can as class label in the situation that, automatically from image collection, locate interested target at a Given Graph, also can carry out object space automatic marking to image.
In order to achieve the above object, the invention provides following technical scheme:
A kind of sensation target detects and mask method, it is characterized in that, comprising:
Image input step, inputs image to be detected;
Candidate region extraction step, uses selective search algorithm to extract candidate window as candidate region from described image to be detected;
Feature is described extraction step, and the extensive convolutional neural networks of use training in advance carries out feature description to candidate region and exports the feature description of this candidate region;
Sensation target prediction steps, the feature based on described candidate region is described, and utilizes the object detection model of training in advance to predict candidate region, estimates to exist the region of described sensation target;
Position annotation step, marks the position of described sensation target according to described estimated result.
Preferably, the selective search algorithm in the extraction step of described candidate region further comprises:
Be predetermined space by the color space conversion of image to be detected, the over-segmentation algorithm of utilization based on Graph is to described Image Segmentation Using, constantly merge two the highest regions of similarity, obtain the stratification segmentation result of image, after multiple color spaces and multi-level cut zone set merging and duplicate removal are processed, obtain the set of candidate regions of this image.
Preferably, described predetermined color space comprises: HSV, RGI, I, Lab.
Preferably, the convolutional neural networks of described training in advance is: the convolutional neural networks of training based on object classification database ImageNet 2013.
Preferably, also comprise object detection model training step, specifically comprise:
Input is with the training set image of image category label;
Adopt selective search algorithm to extract candidate window as candidate region from training set image;
The extensive convolutional neural networks of use training in advance carries out feature description to candidate region and exports the feature description of this candidate region;
Feature based on described candidate region is described, and utilizes many examples linear SVM training object apparent model.
Preferably, described many examples of use linear SVM training object detection model, comprising:
Adopt MILinear to learn from example algorithm to the training of object detection model without constraint large-spacing, its objective function is more:
min w 1 2 | | w | | 2 + C | B | Σ i = 1 | B | ( max ( 0,1 - y i w T B I i i ) ) 2 ,
Wherein, an image I icomprise n by one ithe bag B of individual d dimension example idescribe, wherein j example is designated as be exemplified as positive sample if at least include one in a bag, so the label y of this bag ifor+1, if all examples are all negative samples, the label y of this bag so ifor-1, training set is B={ (B i, y i) | i=1,2 ..., N}, | B|=N is training set number of samples, and w is sorter coefficient, C be regular terms for controlling the punishment to mis-classification, bag B ithe index value of the example that middle prediction mark is the highest.
Preferably, adopt inter-trust domain Newton method to solve MILinear algorithm, comprising:
The optimization aim function of determining MILinear is the unconfined objective function of leading, and its first order derivative is:
g ( w ) = w + 2 C | B | Σ i ∈ I B ( w T B I i i B I i iT - y i B I i iT ) ,
Wherein, I B = { i | 1 - y i w T B I i i , i = 1,2 , . . . , | B | > 0 } It is the set that interval is less than 1 example;
By the generalized Hessian of formula calculating below
Wherein, I is unit matrix;
Mode with iteration is optimized objective function, calculates
s k = min q k ( s ) = min s ▿ f ( w k ) T s + 1 2 s T ▿ 2 f ( w k ) s = min s g ( w k ) T s + 1 2 s T H ( w k ) s , s . t . | | s | | ≤ Δ k ,
Wherein, s kto upgrade step-length, Δ kinter-trust domain, g (w k) and H (w k) be respectively first order derivative and the second derivative of MILinear objective function.
Obtain upgrading step-length s solving kafter, enough large if realistic objective function declines, so just to w kupgrade, otherwise keep w kconstant, formula is as follows:
w k + 1 = w k + s k if f ( w k + s k ) - f ( w k ) q k ( s k ) > η 0 , w k otherwise . ,
Wherein, η 0that the I of a predefined control is accepted the positive number that actual function declines.Preferably, also comprise and utilize the object detection model running bag decomposition algorithm training, adopt iterative manner to gradually reduce the blur level of positive closure, comprising:
Train the object detection model obtaining on training set image, to obtain the prediction probability to all candidate window by MILinear, according to this prediction probability, positive closure is resolved into a positive closure and a negative bag, on the data set obtaining after decomposition, train a new MILinear object detection model, described decomposable process may iteration several.
Sensation target provided by the invention detects and mask method, has several obvious advantages:
1), adopt the mode of selective search, based on the result of a large amount of over-segmentations, obtain the candidate window that target most probable occurs, the window that this mode obtains can be good at keeping the border of object, very high with real-world object coincidence factor, keep high recall rate to several thousand candidate window in the situation that at hundreds of simultaneously.
2), adopt the convolutional neural networks that training obtains on a very large Images Classification data set in advance from candidate window, to extract feature representation, can obtain the feature-rich that comprises stronger high-layer semantic information and express, allow model can automatically from image, excavate out object apparent model.
3), adopted a kind of new many examples linear SVM model, adopt a kind of optimized algorithm based on inter-trust domain Newton method to be optimized simultaneously, can on large-scale dataset, carry out efficiently the study of Weakly supervised detection model.
4), adopted new one bag decomposition algorithm, by positive sample packages being resolved into a positive sample packages and a negative sample bag, greatly reduce the ambiguity in positive sample packages, can effectively improve the performance of Weakly supervised detection model.
Brief description of the drawings
Fig. 1 is that the sensation target based on Weakly supervised study detects and mask method model training and test flow chart according to the embodiment of the present invention;
Fig. 2 is the MILinear schematic diagram with the decomposition of band bag according to embodiment of the present invention MILinear;
Fig. 3 adopts inter-trust domain Newton method to be optimized and other optimization method result contrast schematic diagram according to the embodiment of the present invention;
Fig. 4 trains the object detection model prediction mark and the sample registration that obtain to be related to schematic diagram according to the embodiment of the present invention;
Fig. 5 adopts some object classification improvement in performance schematic diagram in bag decomposition algorithm iterative process according to the embodiment of the present invention;
Fig. 6 trains the testing result schematic diagram of the object detection model obtaining on Pascal VOC2007 database according to the embodiment of the present invention.
Embodiment
For making the object, technical solutions and advantages of the present invention clearer, below in conjunction with specific embodiment, and with reference to accompanying drawing, the present invention is described in more detail.
Thought main points of the present invention are: 1) adopt the mode of selective search, based on the result of a large amount of over-segmentations, can in less candidate window situation, obtain higher target recall rate and registration; 2) the present invention adopts the convolutional neural networks that training obtains on a very large Images Classification data set in advance from candidate window, to extract feature representation, and the feature-rich that can obtain comprising stronger high-layer semantic information is expressed; 3) adopt a kind of new many examples linear SVM model, adopted a kind of optimized algorithm based on inter-trust domain Newton method to be optimized, can on large-scale dataset, carry out efficiently the study of Weakly supervised detection model; 4) the present invention has adopted new one bag decomposition algorithm, by positive sample packages being resolved into a positive sample packages and a negative sample bag, greatly reduces the ambiguity in positive sample packages, can effectively improve the performance of Weakly supervised detection model.
As shown in Figure 1, Fig. 1 the first half is that the sensation target based on Weakly supervised study detects and mask method model training process flow diagram according to the embodiment of the present invention.First, be input picture; Secondly,, by adopting selective search algorithm to carry out candidate window extraction to the image of input, obtain the candidate region of extracting; Then, by candidate region, candidate window sample order is sent into convolutional neural networks, and the feature that obtains each candidate region is described, i.e. Zonal expression; Finally, describe based on feature, the algorithm based on Weakly supervised study that uses the present invention to propose carries out the automatic learning of object apparent model, and positive sample excavates.Fig. 1 the latter half has been set forth the test process of the method.For test pattern, adopt the mode the same with training process to extract candidate window, then use degree of depth convolutional neural networks to carry out feature description to window area, finally use the object apparent model training to classify to window area above, realize target detects or mark task.The method comprises the following steps:
Extract S1, candidate region, uses selective search algorithm to extract candidate window as candidate region from training set image.
At a Given Graph as class label in the situation that, can only know the object that comprises some classification in image, such as " automobile ", " people ", but for the position of " automobile " and " people ", be ignorant, this just need to determine by algorithm the boundary rectangle of object.If extract all possible rectangle frame from image, the number of that all possible rectangle frame is very huge, and it is also unpractical dealing with.Candidate region extraction algorithm is exactly by extracting a limited number of possibility object rectangle frame, to make wherein to include as much as possible the object that will locate.Here have three indexs most important: the one, the number of candidate window, number is fewer, and efficiency of algorithm is higher; The 2nd, recall rate, is also the number that comprises real-world object in candidate window and the ratio of all objects number; The 3rd, the registration of candidate window and real-world object boundary rectangle frame.Candidate window algorithm based on intensive collection, window number is very huge, and recall rate and registration are not very desirable.
The selective search algorithm that the present invention adopts is a kind of candidate window extraction algorithm based on over-segmentation, it is by adopting different parameters to carry out over-segmentation to image, obtain different image blocks, adopt again the thought of stratification tissue to merge piecemeal, thereby find the boundary rectangle that most possibly comprises object.Concrete steps are as follows: first, original image, from RGB color space conversion to other color spaces, is comprised to HSV, RGI, I, Lab etc.; Then, use respectively the over-segmentation algorithm based on Graph to cut apart respectively respective image, then the thought of organizing by stratification constantly merges two regions that similarity is the highest, obtain the stratification segmentation result of image.By multiple color spaces, multi-level cut zone set is combined, and after carrying out duplicate removal processing, just obtains the set of candidate regions of this figure.
Selective search algorithm operational efficiency is higher, the in the situation that of hundreds of extremely thousands of candidate window, can obtain very high recall rate and registration.
The extensive convolutional neural networks of S2, use training in advance carries out feature description to each candidate region and exports this feature and describe.
After getting the candidate region that may comprise attention object, to determine whether certain candidate window is certain object by computer vision and algorithm for pattern recognition, need to first carry out feature description to this candidate region, thus can after use the sorter judgement of classifying.In Images Classification and identification field, conventional Image Description Methods comprises that the low-level image feature such as SIFT, LBP, HOG describes, and the middle level features such as word bag model are described, the popular stratification feature representation in recent years such as convolutional neural networks, degree of depth belief network.Weakly supervised object detection and mark problem, what solve is the identification problem of object level, be by eliminating Weakly supervised what object of ambiguity answer problem of these semantic hierarchies somewhere.This high-level semantic problem be not low-level image feature describe and middle level features describe can fine processing, need very abstract high-level characteristic expression.Convolutional neural networks has been obtained a series of important breakthrough in object identification field, the feature representation of its stratification, realized feature by bottom to high-rise successively abstract, normally edge of characteristic layer before it, Corner detector, along with the number of plies increases, feature below starts to describe object part, whole object gradually.By extracting the feature of characteristic layer after convolutional neural networks, can obtain description and expression to image higher level (such as object rank).It is exactly that its model capacity is very large that convolutional neural networks also has an important characteristic, and the number of plies is more, and neuron number is larger, and model complexity is more, and quantity of information that can code storage is larger.
Based on this, the present invention has trained a large-scale convolutional neural networks on the data set ImageNet 2013 of a very large image, and a large amount of general object informations is stored in this network.Preferably, carry out the training of convolutional neural networks with a large-scale general object classification database ImageNet 2013, training data comprises approximately 1,200,000 images of 1000 classes, the convolutional neural networks using comprises 5 convolutional layers, 2 full articulamentums, and 1st, after 2,5 convolutional layers, connected maximal value convergence-level, whole network packet is containing approximately 650,000 neurons.Just as the knowledge during the mankind exist in a large number contributes to differentiate object, the convolutional neural networks that this has comprised a large amount of general vision prior imformations, can be effectively for carrying out general description to object.
On S3, basis at a Given Graph as class label, use many examples linear SVM MI-SVM on the feature representation of candidate region, to train object detection model.
The present invention is by adopting selective search algorithm to get candidate window set from image, and use a good extensive convolutional neural networks of training in advance to carry out feature description to these candidate window, what next will do is exactly automatic learning object detection model on these candidate window features are described, the object detection model that-utilization trains, just can predict candidate region, find most probable to have the region of object.
Weakly supervised object detection and mark problem can be modeled as learn-by-example problem more than conventionally.An image I icomprise n by one ithe bag B of individual d dimension example idescribe, wherein j example is designated as be exemplified as positive sample if at least include one in a bag, so the label y of this bag ifor+1, if all examples are all negative samples, the label y of this bag so ifor-1.Process side-play amount for fear of explicitly below, the present invention has added one extra 1 at the end of each exemplary characteristics.Note
min w 1 2 | | w | | 2 + C Σ i = 1 | B | ξ i - - - ( 1 )
s . t . max ( w T B j i ) ≥ + 1 - ξ i , y i = + 1
max ( w T B j i ) ≤ - 1 + ξ i , y i = - 1
ξ i≥0
Training set is B={ (B i, y i) | i=1,2 ..., N}, | B|=N is training set number of samples, and w is sorter coefficient, and C is that regular terms is for controlling the punishment to mis-classification, ξ iit is slack variable.
Under many learn-by-example frameworks, what the basic markup information of image brought is the ambiguity in positive closure, only knows that at least comprising a positive sample does not but know which is positive sample.MI-SVM algorithm is by only considering prediction mark W t maximum example solves this problem, and relies on this to predict bag, as shown in Fig. 2 (a).The lineoid of MI-SVM algorithm is to be determined by the highest example of the mark of each bag, and it optimizes formula is a mixed integer programming problem, can only solve by heuritic approach, and speed is very slow.
S3.1 MILinear algorithm
Be different from the small data set of traditional many learn-by-examples issue handling, it is above and eachly include hundreds of large data problems to thousand higher-dimension examples that the present invention mainly considers to comprise 5000 bags.For better the Weakly supervised problem under large data scale being carried out to Efficient Solution, the present invention proposes a kind of newly for many examples of constraint large-spacing linear SVM algorithm, be called MILinear.Its formula is shown below:
min w 1 2 | | w | | 2 + C | B | Σ i = 1 | B | ( max ( 0,1 - y i w T B I i i ) ) 2 - - - ( 2 )
Wherein the proper vector of j example in i bag, y iit is the classification mark of i bag.Above formula Section 2 has adopted a square Hinge loss function, and max (a, b) gets a, the maximal value of b.
I i = arg max j w T B j i - - - ( 3 )
Bag B ithe index value of the example that middle prediction mark is the highest.
Optimization method based on gradient is used widely on Large-scale Optimization Problems, and the present invention has used the Hinge Loss loss function that can lead.Shown in 2 (a), MI-SVM and MILinear solve this many learn-by-examples of large scale problem by the example of selecting mark maximum.
S3.2 wraps decomposition algorithm
In the experiment of MILinear, the present invention's discovery, in a positive closure, positive sample concentrates on front 30% of mark maximum conventionally.Notice after this problem, the present invention proposes a kind of new bag decomposition algorithm, by positive closure being resolved into a positive closure and a negative bag, effectively reduce the ambiguity of positive closure.Preferably, train the model obtaining on training image, to obtain the prediction probability to all candidate window by MILinear, according to this prediction probability, positive closure is resolved into a positive closure and a negative bag, being specially 30% of maximum probability is new positive closure, and all the other samples become a new negative bag.Next, on the data set obtaining, train a new MILinear model after decomposition, as shown in Fig. 2 (b).By bag decomposition algorithm, reduce the ambiguity of sample in positive closure, thereby improved category of model performance.This decomposable process possibility iteration several, until model performance no longer improves.
S3.3 gradient optimal method
Provide the definition of MILinear algorithm above, will discuss below under large scale data set, how can carry out efficiently model learning.The optimization aim function of MILinear is the unconfined form of leading, and its first order derivative is
g ( w ) = w + 2 C | B | Σ i ∈ I B ( w T B I i i B I i iT - y i B I i iT ) - - - ( 4 )
Wherein
I B = { i | 1 - y i w T B I i i , i = 1,2 , . . . , | B | > 0 } - - - ( 5 )
It is the set that interval is less than 1 example.
After having obtained the gradient Analytical Expression of objective function, just there are a lot of methods to have carried out objective function optimization, comprise random Gradient Descent (SGD), L-BFGS, Nonlinear Conjugate Gradient Methods (CG) etc.Random gradient descent method is processed one by one to data set, and iteratively model is upgraded.L-BFGS is a kind of Newton optimization method of intending, and it avoids storing whole Hessian matrix by a kind of approximate low-rank method for solving of Hessian matrix.In general, the cost of the random every step of Gradient Descent is lower but iteration time is longer, and the every steps of second order optimization method such as L-BFGS are consuming time longer, but global convergence speed.
In order to carry out more efficiently the study of object apparent model, the present invention proposes a kind of than the more efficient many examples linear SVM optimized algorithm based on inter-trust domain Newton method of L-BFGS.Inter-trust domain Newton method is one large scale unconstrained problem method for solving very efficiently, and has obtained application in general large scale logistic recurrence and support vector machine training.For application inter-trust domain Newton Algorithm MILinear problem, use formula below to calculate generalized Hessian
Wherein I is unit matrix.
Inter-trust domain Newton method is optimized objective function in the mode of iteration, and every suboptimization attempts to solve the subproblem that comprises inter-trust domain below
s k = min q k ( s ) = min s ▿ f ( w k ) T s + 1 2 s T ▿ 2 f ( w k ) s = min s g ( w k ) T s + 1 2 s T H ( w k ) s , s . t . | | s | | ≤ Δ k - - - ( 7 )
Wherein s kto upgrade step-length, Δ kinter-trust domain, g (w k) core H (w k) be respectively first order derivative and the second derivative of MILinear objective function (formula 2).
This subproblem can adopt the method for conjugate gradient of having considered inter-trust domain to carry out Efficient Solution.
Obtain upgrading step-length s solving kafter, enough large if realistic objective function declines, so just to w kupgrade, otherwise keep w kconstant.
w k + 1 = w k + s k if f ( w k + s k ) - f ( w k ) q k ( s k ) > η 0 , w k otherwise . - - - ( 8 )
Wherein η 0be that the I of a predefined control is accepted the positive number that actual function declines, actual function declines and is greater than this value and upgrades direction and be accepted, and in an embodiment of the present invention, it is preferably set for 1e-4.
Strictly say, the objective function of MILinear is owing to having introduced max function, thereby right and wrong are protruding.This objective function neither second order can be led simultaneously.Although can not ensure globally optimal solution, under actual conditions, this algorithm can be effectively from large-scale dataset learning to object apparent model.
S4, on test pattern, extract candidate region, and make to carry out in the same way feature description, use the interested object of object detection model orientation that training obtains above.At test phase, the candidate region that first uses selective search algorithm to obtain some, then adopts the convolutional neural networks the same with the training stage to carry out feature description.Whether thereby, judge each candidate window be interested object, draw the conclusion of what object where if using afterwards the object apparent model that training obtains to classify to window feature above.The automatic detection and the mark that only utilize image tag information realization attention object are so just completed.
Fig. 3 adopts inter-trust domain Newton method to be optimized and other optimization method result contrast schematic diagram according to the embodiment of the present invention, Fig. 4 trains the object detection model prediction mark and the sample registration that obtain to be related to schematic diagram according to the embodiment of the present invention, Fig. 5 adopts some object classification improvement in performance schematic diagram in bag decomposition algorithm iterative process according to the embodiment of the present invention, and Fig. 6 trains the testing result schematic diagram of the object detection model obtaining on Pascal VOC2007 database according to the embodiment of the present invention.
In a word, the present invention proposes a kind of new sensation target based on Weakly supervised study detects and mask method, use selective search algorithm to carry out candidate window extraction, use the deep layer convolutional neural networks of pre-training in mass data as candidate window feature representation model and general priori, and use a kind of algorithm based on many examples linear SVM to carry out positive sample excavation.By adopting inter-trust domain Newton method to carry out model optimization, and utilize a kind of bag decomposition algorithm of novelty progressively to reduce the ambiguity of positive closure, the sensation target that this method has realized under Weakly supervised scene detects and automatic marking.Experiment shows that the Weakly supervised sensation target of this invention and main flow detects compared with mask method, has stronger positive sample mining ability and application prospect more generally, is suitable for sensation target detection and automatic marking task on large-scale dataset.
Above-described specific embodiment; object of the present invention, technical scheme and beneficial effect are further described; institute is understood that; the foregoing is only specific embodiments of the invention; be not limited to the present invention; within the spirit and principles in the present invention all, any amendment of making, be equal to replacement, improvement etc., within all should being included in protection scope of the present invention.

Claims (8)

1. sensation target detects and a mask method, it is characterized in that, comprising:
Image input step, inputs image to be detected;
Candidate region extraction step, uses selective search algorithm to extract candidate window as candidate region from described image to be detected;
Feature is described extraction step, and the extensive convolutional neural networks of use training in advance carries out feature description to candidate region and exports the feature description of this candidate region;
Sensation target prediction steps, the feature based on described candidate region is described, and utilizes the object detection model of training in advance to predict candidate region, estimates to exist the region of described sensation target;
Position annotation step, marks the position of described sensation target according to described estimated result.
2. method according to claim 1, is characterized in that, the selective search algorithm in the extraction step of described candidate region further comprises:
Be predetermined color space by the color space conversion of image to be detected, the over-segmentation algorithm of utilization based on Graph is to described Image Segmentation Using, constantly merge two the highest regions of similarity, obtain the stratification segmentation result of image, after multiple color spaces and multi-level cut zone set merging and duplicate removal are processed, obtain the set of candidate regions of this image.
3. method according to claim 2, is characterized in that, described predetermined color space comprises: HSV, RGI, I, Lab.
4. method according to claim 1, is characterized in that, the convolutional neural networks of described training in advance is: the convolutional neural networks of training based on object classification database ImageNet 2013.
5. method according to claim 1, is characterized in that, also comprises object detection model training step, specifically comprises:
Input is with the training set image of image category label;
Adopt selective search algorithm to extract candidate window as candidate region from training set image;
The extensive convolutional neural networks of use training in advance carries out feature description to candidate region and exports the feature description of this candidate region;
Feature based on described candidate region is described, and utilizes many examples linear SVM training object apparent model.
6. method according to claim 5, is characterized in that, described many examples of use linear SVM training object detection model, comprising:
Adopt MILinear to learn from example algorithm to the training of object detection model without constraint large-spacing, its objective function is more
min w 1 2 | | w | | 2 + C | B | Σ i = 1 | B | ( max ( 0,1 - y i w T B I i i ) ) 2 ,
Wherein, an image I icomprise n by one ithe bag B of individual d dimension example idescribe, wherein j example is designated as be exemplified as positive sample if at least include one in a bag, so the label y of this bag ifor+1, if all examples are all negative samples, the label y of this bag so ifor-1, training set is B={ (B i, y i) | i=1,2 ..., N}, | B|=N is training set number of samples, and w is sorter coefficient, C be regular terms for controlling the punishment to mis-classification, bag B ithe index value of the example that middle prediction mark is the highest.
7. method according to claim 6, is characterized in that, adopts inter-trust domain Newton method to solve MILinear algorithm, comprising:
The optimization aim function of determining MILinear is the unconfined objective function of leading, and its first order derivative is
g ( w ) = w + 2 C | B | Σ i ∈ I B ( w T B I i i B I i iT - y i B I i iT ) ,
Wherein, I B = { i | 1 - y i w T B I i i , i = 1,2 , . . . , | B | > 0 } It is the set that interval is less than 1 example;
By the generalized Hessian of formula calculating below
Wherein, I is unit matrix;
Mode with iteration is optimized objective function, calculates
s k = min q k ( s ) = min s ▿ f ( w k ) T s + 1 2 s T ▿ 2 f ( w k ) s = min s g ( w k ) T s + 1 2 s T H ( w k ) s , s . t . | | s | | ≤ Δ k ,
Wherein, k is iterations, s kto upgrade step-length, w kthe weights of iteration the k time, Δ kinter-trust domain, ▽ f (w k)=g (w k) and ▽ 2f (w k) (w k)=H (w k) be respectively first order derivative and the second derivative of MILinear objective function;
Obtain upgrading step-length s solving kafter, enough large if realistic objective function declines, so just to w kupgrade, otherwise keep w kconstant, formula is as follows:
w k + 1 = w k + s k if f ( w k + s k ) - f ( w k ) q k ( s k ) > η 0 , w k otherwise . ,
Wherein η 0that the I of a predefined control is accepted the positive number that actual function declines.
8. method according to claim 7, is characterized in that, also comprises and utilizes the object detection model running bag decomposition algorithm training, and adopts iterative manner to gradually reduce the blur level of positive closure, specifically comprises:
Train the object detection model obtaining on training set image, to obtain the prediction probability to all candidate window by MILinear, according to this prediction probability, positive closure is resolved into a positive closure and a negative bag, on the data set obtaining after decomposition, train a new MILinear object detection model, described decomposable process needs iteration for several times.
CN201410442817.4A 2014-09-02 2014-09-02 A kind of sensation target detection and mask method Active CN104217225B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410442817.4A CN104217225B (en) 2014-09-02 2014-09-02 A kind of sensation target detection and mask method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410442817.4A CN104217225B (en) 2014-09-02 2014-09-02 A kind of sensation target detection and mask method

Publications (2)

Publication Number Publication Date
CN104217225A true CN104217225A (en) 2014-12-17
CN104217225B CN104217225B (en) 2018-04-24

Family

ID=52098687

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410442817.4A Active CN104217225B (en) 2014-09-02 2014-09-02 A kind of sensation target detection and mask method

Country Status (1)

Country Link
CN (1) CN104217225B (en)

Cited By (74)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104572965A (en) * 2014-12-31 2015-04-29 南京理工大学 Search-by-image system based on convolutional neural network
CN104573669A (en) * 2015-01-27 2015-04-29 中国科学院自动化研究所 Image object detection method
CN104700118A (en) * 2015-03-18 2015-06-10 中国科学院自动化研究所 Pulmonary nodule benignity and malignancy predicting method based on convolutional neural networks
CN105069774A (en) * 2015-06-30 2015-11-18 长安大学 Object segmentation method based on multiple-instance learning and graph cuts optimization
CN105138983A (en) * 2015-08-21 2015-12-09 燕山大学 Pedestrian detection method based on weighted part model and selective search segmentation
CN105678322A (en) * 2015-12-31 2016-06-15 百度在线网络技术(北京)有限公司 Sample labeling method and apparatus
CN105868269A (en) * 2016-03-08 2016-08-17 中国石油大学(华东) Precise image searching method based on region convolutional neural network
CN105893963A (en) * 2016-03-31 2016-08-24 南京邮电大学 Method for screening out optimal easily-recognizable frame of single pedestrian target in video
CN105956563A (en) * 2016-05-06 2016-09-21 西安工程大学 Method of face marking in news image based on multiple instance learning
CN105989174A (en) * 2015-03-05 2016-10-05 欧姆龙株式会社 Extraction device and extraction method of area of interest
CN106127204A (en) * 2016-06-30 2016-11-16 华南理工大学 A kind of multi-direction meter reading Region detection algorithms of full convolutional neural networks
CN106156777A (en) * 2015-04-23 2016-11-23 华中科技大学 Textual image detection method and device
CN106203450A (en) * 2016-07-11 2016-12-07 国家新闻出版广电总局广播科学研究院 Based on degree of depth learning framework, image is carried out the object detection method of feature extraction
CN106227836A (en) * 2016-07-26 2016-12-14 上海交通大学 Associating visual concept learning system and method is supervised with the nothing of word based on image
CN106327516A (en) * 2015-06-29 2017-01-11 北京雷动云合智能技术有限公司 Learning-type visual tracking method based on appearance model
CN106326985A (en) * 2016-08-18 2017-01-11 北京旷视科技有限公司 Neural network training method, neural network training device, data processing method and data processing device
CN106326893A (en) * 2016-08-25 2017-01-11 安徽水滴科技有限责任公司 Vehicle color recognition method based on area discrimination
CN106384345A (en) * 2016-08-31 2017-02-08 上海交通大学 RCNN based image detecting and flow calculating method
CN106504233A (en) * 2016-10-18 2017-03-15 国网山东省电力公司电力科学研究院 Image electric power widget recognition methodss and system are patrolled and examined based on the unmanned plane of Faster R CNN
CN106529485A (en) * 2016-11-16 2017-03-22 北京旷视科技有限公司 Method and apparatus for obtaining training data
WO2017059576A1 (en) * 2015-10-09 2017-04-13 Beijing Sensetime Technology Development Co., Ltd Apparatus and method for pedestrian detection
CN106611177A (en) * 2015-10-27 2017-05-03 北京航天长峰科技工业集团有限公司 Big data-based image classification method
CN106815604A (en) * 2017-01-16 2017-06-09 大连理工大学 Method for viewing points detecting based on fusion of multi-layer information
CN106845430A (en) * 2017-02-06 2017-06-13 东华大学 Pedestrian detection and tracking based on acceleration region convolutional neural networks
WO2017096570A1 (en) * 2015-12-10 2017-06-15 Intel Corporation Visual recognition using deep learning attributes
CN106909887A (en) * 2017-01-19 2017-06-30 南京邮电大学盐城大数据研究院有限公司 A kind of action identification method based on CNN and SVM
CN106934346A (en) * 2017-01-24 2017-07-07 北京大学 A kind of method of target detection performance optimization
CN106934344A (en) * 2017-01-23 2017-07-07 西北大学 A kind of rapid pedestrian detection method based on neutral net
CN106991400A (en) * 2017-04-05 2017-07-28 北京中燕信息技术有限公司 A kind of fire hazard smoke detecting method and device
CN107038448A (en) * 2017-03-01 2017-08-11 中国科学院自动化研究所 Target detection model building method
WO2017139927A1 (en) * 2016-02-17 2017-08-24 Intel Corporation Region proposal for image regions that include objects of interest using feature maps from multiple layers of a convolutional neural network model
CN107123027A (en) * 2017-04-28 2017-09-01 广东工业大学 A kind of cosmetics based on deep learning recommend method and system
CN107133253A (en) * 2015-12-31 2017-09-05 达索系统公司 Recommendation based on forecast model
CN107203781A (en) * 2017-05-22 2017-09-26 浙江大学 A kind of object detection method Weakly supervised end to end instructed based on conspicuousness
CN107330449A (en) * 2017-06-13 2017-11-07 瑞达昇科技(大连)有限公司 A kind of BDR sign detection method and device
CN107562050A (en) * 2017-08-29 2018-01-09 广东工业大学 A kind of method and system of robot environment-identification
CN107609483A (en) * 2017-08-15 2018-01-19 中国科学院自动化研究所 Risk object detection method, device towards drive assist system
CN107636727A (en) * 2016-12-30 2018-01-26 深圳前海达闼云端智能科技有限公司 Target detection method and device
WO2018036454A1 (en) * 2016-08-26 2018-03-01 Huawei Technologies Co., Ltd. Method and apparatus for annotating a video stream comprising a sequence of frames
CN107918624A (en) * 2016-10-11 2018-04-17 富士通株式会社 Image retrieving apparatus and method, electronic equipment
CN107945153A (en) * 2017-11-07 2018-04-20 广东广业开元科技有限公司 A kind of road surface crack detection method based on deep learning
CN107958460A (en) * 2016-10-18 2018-04-24 奥多比公司 Instance-level semantic segmentation system
CN108062574A (en) * 2017-12-31 2018-05-22 厦门大学 A kind of Weakly supervised object detection method based on particular category space constraint
CN108229514A (en) * 2016-12-29 2018-06-29 北京市商汤科技开发有限公司 Object detecting method, device and electronic equipment
CN108303747A (en) * 2017-01-12 2018-07-20 清华大学 The method for checking equipment and detecting gun
CN108303748A (en) * 2017-01-12 2018-07-20 同方威视技术股份有限公司 The method for checking equipment and detecting the gun in luggage and articles
CN108319633A (en) * 2017-11-17 2018-07-24 腾讯科技(深圳)有限公司 A kind of image processing method, device and server, system, storage medium
CN108496185A (en) * 2016-01-18 2018-09-04 北京市商汤科技开发有限公司 System and method for object detection
CN108596223A (en) * 2018-04-11 2018-09-28 珠海博明视觉科技有限公司 A method of automatically generating object data set
CN109035370A (en) * 2018-07-23 2018-12-18 郑州云海信息技术有限公司 A kind of picture mask method and system
CN109063559A (en) * 2018-06-28 2018-12-21 东南大学 A kind of pedestrian detection method returned based on improvement region
CN109492686A (en) * 2018-11-01 2019-03-19 郑州云海信息技术有限公司 A kind of picture mask method and system
CN109492702A (en) * 2018-11-21 2019-03-19 中国科学院自动化研究所 Pedestrian based on sorting measure function recognition methods, system, device again
CN109740571A (en) * 2019-01-22 2019-05-10 南京旷云科技有限公司 The method of Image Acquisition, the method, apparatus of image procossing and electronic equipment
CN109800863A (en) * 2016-08-30 2019-05-24 中国石油大学(华东) A kind of well-log facies recognition method based on fuzzy theory and neural network
CN109857878A (en) * 2018-12-27 2019-06-07 深兰科技(上海)有限公司 Article mask method and device, electronic equipment and storage medium
CN109872333A (en) * 2019-02-20 2019-06-11 腾讯科技(深圳)有限公司 Medical image dividing method, device, computer equipment and storage medium
CN109919149A (en) * 2019-01-18 2019-06-21 平安科技(深圳)有限公司 Object mask method and relevant device based on object detection model
CN110147796A (en) * 2018-02-12 2019-08-20 杭州海康威视数字技术股份有限公司 Image matching method and device
CN110288629A (en) * 2019-06-24 2019-09-27 湖北亿咖通科技有限公司 Target detection automatic marking method and device based on moving Object Detection
CN110443366A (en) * 2019-07-30 2019-11-12 上海商汤智能科技有限公司 Optimization method and device, object detection method and the device of neural network
CN110851641A (en) * 2018-08-01 2020-02-28 杭州海康威视数字技术股份有限公司 Cross-modal retrieval method and device and readable storage medium
CN110929729A (en) * 2020-02-18 2020-03-27 北京海天瑞声科技股份有限公司 Image annotation method, image annotation device and computer storage medium
CN111311634A (en) * 2020-01-23 2020-06-19 支付宝实验室(新加坡)有限公司 Face image detection method, device and equipment
CN111460958A (en) * 2020-03-26 2020-07-28 暗物智能科技(广州)有限公司 Object detector construction method, object detection method and object detection system
CN111464647A (en) * 2020-04-02 2020-07-28 科锐特(厦门)净化科技有限公司 Smart cloud clean room control method and system
CN111488400A (en) * 2019-04-28 2020-08-04 北京京东尚科信息技术有限公司 Data classification method, device and computer readable storage medium
US10853409B2 (en) 2016-12-13 2020-12-01 Shanghai United Imaging Healthcare Co., Ltd. Systems and methods for image search
CN112215123A (en) * 2020-10-09 2021-01-12 腾讯科技(深圳)有限公司 Target detection method, device and storage medium
CN112348822A (en) * 2019-08-08 2021-02-09 佳能株式会社 Image processing apparatus and image processing method
CN112560870A (en) * 2020-12-15 2021-03-26 哈尔滨工程大学 Image target identification method used in underwater complex environment
US11157744B2 (en) 2020-01-15 2021-10-26 International Business Machines Corporation Automated detection and approximation of objects in video
CN113869714A (en) * 2021-09-26 2021-12-31 平安科技(深圳)有限公司 Attendance recording method and device, computer equipment and storage medium
CN117292283A (en) * 2023-11-24 2023-12-26 成都庆龙航空科技有限公司 Target identification method based on unmanned aerial vehicle

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100246971A1 (en) * 2007-10-30 2010-09-30 Pasco Corporation House change determining method, house change determining program, house change determining image generating method, and house change determining image
US8355569B2 (en) * 2006-08-10 2013-01-15 Nec Corporation Object region extracting device
CN103870834A (en) * 2014-04-03 2014-06-18 张琰 Method for searching for sliding window based on layered segmentation
CN103984959A (en) * 2014-05-26 2014-08-13 中国科学院自动化研究所 Data-driven and task-driven image classification method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8355569B2 (en) * 2006-08-10 2013-01-15 Nec Corporation Object region extracting device
US20100246971A1 (en) * 2007-10-30 2010-09-30 Pasco Corporation House change determining method, house change determining program, house change determining image generating method, and house change determining image
CN103870834A (en) * 2014-04-03 2014-06-18 张琰 Method for searching for sliding window based on layered segmentation
CN103984959A (en) * 2014-05-26 2014-08-13 中国科学院自动化研究所 Data-driven and task-driven image classification method

Cited By (112)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104572965A (en) * 2014-12-31 2015-04-29 南京理工大学 Search-by-image system based on convolutional neural network
CN104573669B (en) * 2015-01-27 2018-09-04 中国科学院自动化研究所 Image object detection method
CN104573669A (en) * 2015-01-27 2015-04-29 中国科学院自动化研究所 Image object detection method
CN105989174B (en) * 2015-03-05 2019-11-01 欧姆龙株式会社 Region-of-interest extraction element and region-of-interest extracting method
CN105989174A (en) * 2015-03-05 2016-10-05 欧姆龙株式会社 Extraction device and extraction method of area of interest
CN104700118A (en) * 2015-03-18 2015-06-10 中国科学院自动化研究所 Pulmonary nodule benignity and malignancy predicting method based on convolutional neural networks
CN106156777B (en) * 2015-04-23 2020-06-02 华中科技大学 Text picture detection method and device
CN106156777A (en) * 2015-04-23 2016-11-23 华中科技大学 Textual image detection method and device
CN106327516B (en) * 2015-06-29 2018-12-18 北京雷动云合智能技术有限公司 A kind of learning-oriented visual pursuit method based on display model
CN106327516A (en) * 2015-06-29 2017-01-11 北京雷动云合智能技术有限公司 Learning-type visual tracking method based on appearance model
CN105069774A (en) * 2015-06-30 2015-11-18 长安大学 Object segmentation method based on multiple-instance learning and graph cuts optimization
CN105069774B (en) * 2015-06-30 2017-11-10 长安大学 The Target Segmentation method of optimization is cut based on multi-instance learning and figure
CN105138983B (en) * 2015-08-21 2019-06-28 燕山大学 The pedestrian detection method divided based on weighting block model and selective search
CN105138983A (en) * 2015-08-21 2015-12-09 燕山大学 Pedestrian detection method based on weighted part model and selective search segmentation
WO2017059576A1 (en) * 2015-10-09 2017-04-13 Beijing Sensetime Technology Development Co., Ltd Apparatus and method for pedestrian detection
CN106611177A (en) * 2015-10-27 2017-05-03 北京航天长峰科技工业集团有限公司 Big data-based image classification method
CN108292369A (en) * 2015-12-10 2018-07-17 英特尔公司 Visual identity is carried out using deep learning attribute
US9971953B2 (en) 2015-12-10 2018-05-15 Intel Corporation Visual recognition using deep learning attributes
WO2017096570A1 (en) * 2015-12-10 2017-06-15 Intel Corporation Visual recognition using deep learning attributes
CN105678322A (en) * 2015-12-31 2016-06-15 百度在线网络技术(北京)有限公司 Sample labeling method and apparatus
CN107133253A (en) * 2015-12-31 2017-09-05 达索系统公司 Recommendation based on forecast model
CN108496185B (en) * 2016-01-18 2022-09-16 北京市商汤科技开发有限公司 System and method for object detection
CN108496185A (en) * 2016-01-18 2018-09-04 北京市商汤科技开发有限公司 System and method for object detection
WO2017139927A1 (en) * 2016-02-17 2017-08-24 Intel Corporation Region proposal for image regions that include objects of interest using feature maps from multiple layers of a convolutional neural network model
US11244191B2 (en) 2016-02-17 2022-02-08 Intel Corporation Region proposal for image regions that include objects of interest using feature maps from multiple layers of a convolutional neural network model
CN105868269A (en) * 2016-03-08 2016-08-17 中国石油大学(华东) Precise image searching method based on region convolutional neural network
CN105893963A (en) * 2016-03-31 2016-08-24 南京邮电大学 Method for screening out optimal easily-recognizable frame of single pedestrian target in video
CN105893963B (en) * 2016-03-31 2019-03-08 南京邮电大学 A kind of method of the best frame easy to identify of single pedestrian target in screening video
CN105956563B (en) * 2016-05-06 2019-04-16 西安工程大学 The method for carrying out face mark in news image based on multi-instance learning
CN105956563A (en) * 2016-05-06 2016-09-21 西安工程大学 Method of face marking in news image based on multiple instance learning
CN106127204A (en) * 2016-06-30 2016-11-16 华南理工大学 A kind of multi-direction meter reading Region detection algorithms of full convolutional neural networks
CN106127204B (en) * 2016-06-30 2019-08-09 华南理工大学 A kind of multi-direction meter reading Region detection algorithms of full convolutional neural networks
CN106203450A (en) * 2016-07-11 2016-12-07 国家新闻出版广电总局广播科学研究院 Based on degree of depth learning framework, image is carried out the object detection method of feature extraction
CN106227836A (en) * 2016-07-26 2016-12-14 上海交通大学 Associating visual concept learning system and method is supervised with the nothing of word based on image
CN106227836B (en) * 2016-07-26 2020-07-14 上海交通大学 Unsupervised joint visual concept learning system and unsupervised joint visual concept learning method based on images and characters
CN106326985A (en) * 2016-08-18 2017-01-11 北京旷视科技有限公司 Neural network training method, neural network training device, data processing method and data processing device
CN106326893A (en) * 2016-08-25 2017-01-11 安徽水滴科技有限责任公司 Vehicle color recognition method based on area discrimination
US10140508B2 (en) 2016-08-26 2018-11-27 Huawei Technologies Co. Ltd. Method and apparatus for annotating a video stream comprising a sequence of frames
WO2018036454A1 (en) * 2016-08-26 2018-03-01 Huawei Technologies Co., Ltd. Method and apparatus for annotating a video stream comprising a sequence of frames
CN109800863B (en) * 2016-08-30 2023-05-23 中国石油大学(华东) Logging phase identification method based on fuzzy theory and neural network
CN109800863A (en) * 2016-08-30 2019-05-24 中国石油大学(华东) A kind of well-log facies recognition method based on fuzzy theory and neural network
CN106384345B (en) * 2016-08-31 2019-04-02 上海交通大学 A kind of image detection and flow statistical method based on RCNN
CN106384345A (en) * 2016-08-31 2017-02-08 上海交通大学 RCNN based image detecting and flow calculating method
CN107918624A (en) * 2016-10-11 2018-04-17 富士通株式会社 Image retrieving apparatus and method, electronic equipment
CN107918624B (en) * 2016-10-11 2022-03-15 富士通株式会社 Image retrieval device and method and electronic equipment
CN107958460B (en) * 2016-10-18 2023-09-08 奥多比公司 Instance-level semantic segmentation system
CN107958460A (en) * 2016-10-18 2018-04-24 奥多比公司 Instance-level semantic segmentation system
CN106504233A (en) * 2016-10-18 2017-03-15 国网山东省电力公司电力科学研究院 Image electric power widget recognition methodss and system are patrolled and examined based on the unmanned plane of Faster R CNN
CN106529485A (en) * 2016-11-16 2017-03-22 北京旷视科技有限公司 Method and apparatus for obtaining training data
US10853409B2 (en) 2016-12-13 2020-12-01 Shanghai United Imaging Healthcare Co., Ltd. Systems and methods for image search
CN108229514A (en) * 2016-12-29 2018-06-29 北京市商汤科技开发有限公司 Object detecting method, device and electronic equipment
CN107636727A (en) * 2016-12-30 2018-01-26 深圳前海达闼云端智能科技有限公司 Target detection method and device
CN108303747B (en) * 2017-01-12 2023-03-07 清华大学 Inspection apparatus and method of detecting a gun
CN108303747A (en) * 2017-01-12 2018-07-20 清华大学 The method for checking equipment and detecting gun
CN108303748A (en) * 2017-01-12 2018-07-20 同方威视技术股份有限公司 The method for checking equipment and detecting the gun in luggage and articles
CN106815604A (en) * 2017-01-16 2017-06-09 大连理工大学 Method for viewing points detecting based on fusion of multi-layer information
CN106815604B (en) * 2017-01-16 2019-09-27 大连理工大学 Method for viewing points detecting based on fusion of multi-layer information
CN106909887A (en) * 2017-01-19 2017-06-30 南京邮电大学盐城大数据研究院有限公司 A kind of action identification method based on CNN and SVM
CN106934344B (en) * 2017-01-23 2020-01-31 西北大学 quick pedestrian detection method based on neural network
CN106934344A (en) * 2017-01-23 2017-07-07 西北大学 A kind of rapid pedestrian detection method based on neutral net
WO2018137357A1 (en) * 2017-01-24 2018-08-02 北京大学 Target detection performance optimization method
CN106934346B (en) * 2017-01-24 2019-03-15 北京大学 A kind of method of target detection performance optimization
CN106934346A (en) * 2017-01-24 2017-07-07 北京大学 A kind of method of target detection performance optimization
CN106845430A (en) * 2017-02-06 2017-06-13 东华大学 Pedestrian detection and tracking based on acceleration region convolutional neural networks
CN107038448B (en) * 2017-03-01 2020-02-28 中科视语(北京)科技有限公司 Target detection model construction method
CN107038448A (en) * 2017-03-01 2017-08-11 中国科学院自动化研究所 Target detection model building method
CN106991400A (en) * 2017-04-05 2017-07-28 北京中燕信息技术有限公司 A kind of fire hazard smoke detecting method and device
CN107123027A (en) * 2017-04-28 2017-09-01 广东工业大学 A kind of cosmetics based on deep learning recommend method and system
CN107203781A (en) * 2017-05-22 2017-09-26 浙江大学 A kind of object detection method Weakly supervised end to end instructed based on conspicuousness
CN107203781B (en) * 2017-05-22 2020-07-28 浙江大学 End-to-end weak supervision target detection method based on significance guidance
CN107330449A (en) * 2017-06-13 2017-11-07 瑞达昇科技(大连)有限公司 A kind of BDR sign detection method and device
CN107609483B (en) * 2017-08-15 2020-06-16 中国科学院自动化研究所 Dangerous target detection method and device for driving assistance system
CN107609483A (en) * 2017-08-15 2018-01-19 中国科学院自动化研究所 Risk object detection method, device towards drive assist system
CN107562050B (en) * 2017-08-29 2021-03-16 广东工业大学 Method and system for robot to recognize environment
CN107562050A (en) * 2017-08-29 2018-01-09 广东工业大学 A kind of method and system of robot environment-identification
CN107945153A (en) * 2017-11-07 2018-04-20 广东广业开元科技有限公司 A kind of road surface crack detection method based on deep learning
CN108319633B (en) * 2017-11-17 2022-02-11 腾讯科技(深圳)有限公司 Image processing method and device, server, system and storage medium
CN108319633A (en) * 2017-11-17 2018-07-24 腾讯科技(深圳)有限公司 A kind of image processing method, device and server, system, storage medium
CN108062574A (en) * 2017-12-31 2018-05-22 厦门大学 A kind of Weakly supervised object detection method based on particular category space constraint
CN108062574B (en) * 2017-12-31 2020-06-16 厦门大学 Weak supervision target detection method based on specific category space constraint
CN110147796A (en) * 2018-02-12 2019-08-20 杭州海康威视数字技术股份有限公司 Image matching method and device
CN108596223A (en) * 2018-04-11 2018-09-28 珠海博明视觉科技有限公司 A method of automatically generating object data set
CN109063559A (en) * 2018-06-28 2018-12-21 东南大学 A kind of pedestrian detection method returned based on improvement region
CN109063559B (en) * 2018-06-28 2021-05-11 东南大学 Pedestrian detection method based on improved region regression
CN109035370B (en) * 2018-07-23 2022-02-22 郑州云海信息技术有限公司 Picture labeling method and system
CN109035370A (en) * 2018-07-23 2018-12-18 郑州云海信息技术有限公司 A kind of picture mask method and system
CN110851641A (en) * 2018-08-01 2020-02-28 杭州海康威视数字技术股份有限公司 Cross-modal retrieval method and device and readable storage medium
CN109492686A (en) * 2018-11-01 2019-03-19 郑州云海信息技术有限公司 A kind of picture mask method and system
CN109492702A (en) * 2018-11-21 2019-03-19 中国科学院自动化研究所 Pedestrian based on sorting measure function recognition methods, system, device again
CN109492702B (en) * 2018-11-21 2020-09-22 中国科学院自动化研究所 Pedestrian re-identification method, system and device based on ranking measurement function
CN109857878A (en) * 2018-12-27 2019-06-07 深兰科技(上海)有限公司 Article mask method and device, electronic equipment and storage medium
CN109857878B (en) * 2018-12-27 2021-08-20 深兰科技(上海)有限公司 Article labeling method and device, electronic equipment and storage medium
CN109919149A (en) * 2019-01-18 2019-06-21 平安科技(深圳)有限公司 Object mask method and relevant device based on object detection model
CN109740571A (en) * 2019-01-22 2019-05-10 南京旷云科技有限公司 The method of Image Acquisition, the method, apparatus of image procossing and electronic equipment
CN109872333A (en) * 2019-02-20 2019-06-11 腾讯科技(深圳)有限公司 Medical image dividing method, device, computer equipment and storage medium
CN111488400A (en) * 2019-04-28 2020-08-04 北京京东尚科信息技术有限公司 Data classification method, device and computer readable storage medium
CN110288629A (en) * 2019-06-24 2019-09-27 湖北亿咖通科技有限公司 Target detection automatic marking method and device based on moving Object Detection
CN110443366A (en) * 2019-07-30 2019-11-12 上海商汤智能科技有限公司 Optimization method and device, object detection method and the device of neural network
CN112348822A (en) * 2019-08-08 2021-02-09 佳能株式会社 Image processing apparatus and image processing method
US11157744B2 (en) 2020-01-15 2021-10-26 International Business Machines Corporation Automated detection and approximation of objects in video
CN111311634A (en) * 2020-01-23 2020-06-19 支付宝实验室(新加坡)有限公司 Face image detection method, device and equipment
CN111311634B (en) * 2020-01-23 2024-02-27 支付宝实验室(新加坡)有限公司 Face image detection method, device and equipment
CN110929729A (en) * 2020-02-18 2020-03-27 北京海天瑞声科技股份有限公司 Image annotation method, image annotation device and computer storage medium
CN111460958A (en) * 2020-03-26 2020-07-28 暗物智能科技(广州)有限公司 Object detector construction method, object detection method and object detection system
CN111464647A (en) * 2020-04-02 2020-07-28 科锐特(厦门)净化科技有限公司 Smart cloud clean room control method and system
CN112215123A (en) * 2020-10-09 2021-01-12 腾讯科技(深圳)有限公司 Target detection method, device and storage medium
CN112215123B (en) * 2020-10-09 2022-10-25 腾讯科技(深圳)有限公司 Target detection method, device and storage medium
CN112560870A (en) * 2020-12-15 2021-03-26 哈尔滨工程大学 Image target identification method used in underwater complex environment
CN112560870B (en) * 2020-12-15 2022-04-29 哈尔滨工程大学 Image target identification method used in underwater complex environment
CN113869714A (en) * 2021-09-26 2021-12-31 平安科技(深圳)有限公司 Attendance recording method and device, computer equipment and storage medium
CN117292283A (en) * 2023-11-24 2023-12-26 成都庆龙航空科技有限公司 Target identification method based on unmanned aerial vehicle
CN117292283B (en) * 2023-11-24 2024-02-13 成都庆龙航空科技有限公司 Target identification method based on unmanned aerial vehicle

Also Published As

Publication number Publication date
CN104217225B (en) 2018-04-24

Similar Documents

Publication Publication Date Title
CN104217225A (en) A visual target detection and labeling method
CN110909820B (en) Image classification method and system based on self-supervision learning
Lee et al. Learning the easy things first: Self-paced visual category discovery
US11734390B2 (en) Unsupervised domain adaptation method, device, system and storage medium of semantic segmentation based on uniform clustering
EP3620980B1 (en) Learning method, learning device for detecting lane by using cnn and testing method, testing device using the same
CN106055573B (en) Shoe print image retrieval method and system under multi-instance learning framework
US11816149B2 (en) Electronic device and control method thereof
CN103984959A (en) Data-driven and task-driven image classification method
EP3596655B1 (en) Method and apparatus for analysing an image
CN113688665B (en) Remote sensing image target detection method and system based on semi-supervised iterative learning
CN112132014B (en) Target re-identification method and system based on non-supervised pyramid similarity learning
CN108734109B (en) Visual target tracking method and system for image sequence
CN103745233B (en) The hyperspectral image classification method migrated based on spatial information
CN110008365B (en) Image processing method, device and equipment and readable storage medium
CN112052818A (en) Unsupervised domain adaptive pedestrian detection method, unsupervised domain adaptive pedestrian detection system and storage medium
CN112990282B (en) Classification method and device for fine-granularity small sample images
CN112733936A (en) Recyclable garbage classification method based on image recognition
Nath et al. Single-and multi-label classification of construction objects using deep transfer learning methods.
CN112734803A (en) Single target tracking method, device, equipment and storage medium based on character description
CN115238197A (en) Expert thinking model-based field business auxiliary analysis method
CN116129286A (en) Method for classifying graphic neural network remote sensing images based on knowledge graph
CN113487610B (en) Herpes image recognition method and device, computer equipment and storage medium
Payet et al. ^ 2--Random Forest Random Field
US11829442B2 (en) Methods and systems for efficient batch active learning of a deep neural network
CN106775694A (en) A kind of hierarchy classification method of software merit rating code product

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant