CN103116893B - Digital image labeling method based on multi-exampling multi-marking learning - Google Patents

Digital image labeling method based on multi-exampling multi-marking learning Download PDF

Info

Publication number
CN103116893B
CN103116893B CN201310084956.XA CN201310084956A CN103116893B CN 103116893 B CN103116893 B CN 103116893B CN 201310084956 A CN201310084956 A CN 201310084956A CN 103116893 B CN103116893 B CN 103116893B
Authority
CN
China
Prior art keywords
mark
image
marking
many examples
model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310084956.XA
Other languages
Chinese (zh)
Other versions
CN103116893A (en
Inventor
周志华
黄圣君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN201310084956.XA priority Critical patent/CN103116893B/en
Publication of CN103116893A publication Critical patent/CN103116893A/en
Application granted granted Critical
Publication of CN103116893B publication Critical patent/CN103116893B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a digital image labeling method based on multi-exampling multi-marking learning and aims at solving the technical problems that digital images frequently have complex semantics and cannot be effectively expressed or learned through the technical method based on single exampling. The digital image labeling method includes: initializing a marking model; randomly selecting one image and a related mark of the image from a data set, and determining a representative exampling of the mark; obtaining a non-related mark arranged in the front of the related mark through random sampling, and determining a representative exampling of the non-related mark; and performing gradient descent aiming at a triad formed by the image, the related mark and the non-related mark to update the model. According to the digital image labeling method, online learning is performed by aid of a random gradient descent algorithm, time and internal memory expanse are greatly reduced, marking accuracy is ensured, and marking efficiency is improved.

Description

Based on the digital picture mask method of many examples Multi-label learning
Technical field
The present invention relates to digital picture label technology field, particularly relate to a kind of digital picture mask method based on many examples Multi-label learning.
Background technology
Popular along with the universal of digital product and all kinds of social network sites, has the digital picture of magnanimity produce and propagate every day.Will provide relevant service in so large-scale view data, one most crucial, and to be also the most difficult task be allows the semanteme of computer understanding image, and image labeling is then gordian technique wherein.
The task of automatic image annotation device carrys out the semantic marker of predicted picture.Concrete, first annotation equipment can extract visual signature to represent these images from digital picture, then based on these character representations, trains a marking model from the sets of image data of existing semantic marker.When after the character representation input marking model that will do not have a markd digital picture, model just can dope their semantic marker.
Image table is often shown as single example by current automatic image annotation technology.But image often has complicated semanteme, and comprise multiple object entity, the expression of so single example can cause information loss, cannot the semanteme of accurate description image, thus cannot Accurate Prediction image tagged.More effective ways are the input representations based on the machine learning of many examples multiple labeling (being called for short MIML), represent piece image with the set that multiple examples of features forms, wherein each example entity that then often correspondence one is fairly simple and semanteme.Have the automatic image annotation technology that minority represents based on MIML input at present, but their model complexity increases greatly and sharply along with the change of representation space, causes these technology very poor efficiency, cannot be applied in large-scale image labeling task.Therefore, a kind of efficient automatic image annotation technology that can represent based on MIML input urgently proposes.
Summary of the invention
Technical matters: often have complicated semantic for digital picture, and effective expression and study cannot be carried out to it based on the technology of single example, and the low inferior technical matters of annotating efficiency, the present invention proposes a kind of digital picture mask method based on many examples Multi-label learning.
Technical scheme: based on the digital picture mask method of many examples Multi-label learning, comprise the steps,
(1) initialization marking model;
(2) mark of correlation of Stochastic choice piece image and this image from data acquisition, and determine the representative example of this mark;
(3) obtain a uncorrelated mark come before mark of correlation by stochastic sampling, and determine the representative example of this uncorrelated mark;
(4) for image, the tlv triple that mark of correlation and uncorrelated mark are formed carries out Gradient Descent Renewal model;
(5) judge whether this model reaches requirement, if then return (2); Otherwise terminate and export marking model.
The present invention adopts technique scheme, there is following beneficial effect: the present invention is based on many examples Multi-label learning framework, to each mark of image, the representative example selecting "current" model to approve most from many examples multiple labeling machine learning input represents to represent this image, thus can make full use of the more information that the machine learning of many examples multiple labeling brings.Meanwhile, utilize stochastic gradient descent algorithm to carry out on-line study, greatly reduce time and memory cost, thus both ensure that the degree of accuracy of mark, turn improve annotating efficiency.
Accompanying drawing explanation
Fig. 1 is the process flow diagram of the training marking model of the embodiment of the present invention;
Fig. 2 is the method flow diagram of the embodiment of the present invention;
Fig. 3 is the sampled images of the embodiment of the present invention and determines the process flow diagram of representative example of its mark of correlation;
Fig. 4 is the uncorrelated mark of sampling of the embodiment of the present invention and determines that it represents the process flow diagram of example;
Fig. 5 is the process flow diagram of the renewal marking model of the embodiment of the present invention.
Embodiment
Below in conjunction with specific embodiment, illustrate the present invention further, these embodiments should be understood only be not used in for illustration of the present invention and limit the scope of the invention, after having read the present invention, the amendment of those skilled in the art to the various equivalent form of value of the present invention has all fallen within the application's claims limited range.
Fig. 1 is the process flow diagram of the digital picture automatic marking device training marking model of the embodiment of the present invention.Suppose that training image data acquisition is made up of N width image, every width image all marks.Device meets the feature of many examples Multi-label learning input to the image zooming-out in data acquisition, and every piece image is by a stack features vector representation, and each proper vector is called an example.As preferably, feature extraction can use the classical way in machine learning textbook to generate the characteristics of image be suitable for, such as, first carry out Iamge Segmentation, then extract the features such as color, texture, shape to each image block.After training out marking model with this data acquisition, device carries out same feature extraction by the unmarked image of input, and predicts by marking model, exports mark of correlation as annotation results.
Figure 2 shows that the method flow diagram of the embodiment of the present invention.Step S20 carries out initialization to marking model, mainly comprises two matrix W and V initialize.Suppose that the characteristics of image dimension extracted is d, the individual possible mark of total total L, then W is the matrix of L × 100 size, and V is the matrix of d × 100 size.Here numerical value 100 can need to change other value into according to user, and usually select larger value can improve mark accuracy, less value then can pick up speed.W and V is endowed random value and the average of each row that ensure them is 0, and standard deviation is step S21 is stochastic sampling piece image X from training data set, and suppose that its mark of correlation set is Y, uncorrelated tag set is device is Stochastic choice one mark from Y, is assumed to be y, and represents example x for y selects from set X, and detailed process is shown in Fig. 3.Step S22 is from the uncorrelated tag set of this image in one by one stochastic sampling mark, until run into one by the uncorrelated mark come before y and determine to represent example for it detailed process is shown in Fig. 4.Step S23 carries out Gradient Descent Renewal model, and detailed process is shown in Fig. 5.Whether step S24 judgment models reaches requirement, is, terminates training process, otherwise gets back to step S21.Here whether judgment models reaches standard and can to adopt in machine learning or pattern-recognition textbook conventional method, and such as iteration wheel number reaches the number of times that user specifies.
Figure 3 shows that the detailed process of step S21 in Fig. 2.Step S210 is for starting action.In step S211, in training image data acquisition, randomly draw piece image X, suppose that it comprises n example { x altogether 1..., x n, the mark of correlation set Y of image comprises m mark { y altogether 1... y m.Step S212 randomly draws a mark y from Y.The initial i of step S213 is 1, starts the sample count in X.Step S214 judges whether i is greater than n, if be greater than, the example in X has traveled through, and jumps to step S217, otherwise enters in next step S215.Step S215 is to example x icalculate the value shown in following formula:
f y(x i)=W yV Tx i(1)
Wherein W yrepresent that the y of W is capable, f y(x i) can be understood as example x iwith the degree of correlation of mark y.Counting i is added 1 by step S216, then returns step S214.The more all f calculated of step S217 y(x i) value, and the example representatively example that selective value is maximum, be assumed to be x, corresponding f yx () can be understood as this width image and the degree of correlation marking y.Whole process ends at step S218.
Figure 4 shows that the detailed process of step S23 in Fig. 2.Suppose the uncorrelated tag set of image comprise t mark altogether step S230 is origination action.Step S231 is mark y calculated value f according to formula (1) yx (), we suppose that x is the representative example of y here.Step S232 initialization i is 1, starts counting, and introduces an indieating variable Q, be initialized as 0.Step S233 judges whether that t uncorrelated mark has traveled through all, if i > is t, jumps to step S238 and terminates, otherwise enter next step.Step S234 is mark according to formula (1) calculate here we suppose for representative example.Step S235 compare two mark y and order, if then enter step S236, counting i is added and returns step S234 in the lump; If incoherent mark come before mark of correlation y, namely , then jumping to step S237, is i by Q assignment, represents the uncorrelated mark that have found violation order i-th sampling, and by the uncorrelated mark found and represent example and be recorded as respectively with , then enter step S238 and terminate.
Figure 5 shows that the detailed process by Gradient Descent Renewal model.Step S240 is origination action.Step S241 judges the mark whether having sampled violation order in step S23, and namely whether Q is greater than 0.If Q=0, represent the mark not sampling violation order, directly enter step S245 and terminate, otherwise enter step S242, estimate the rank r of mark of correlation y according to Q value, specific formula for calculation is as follows:
Wherein t is the number of middle mark, symbol represents and rounds downwards.The y line item of model variable W is a by step S243, the line item is b.It is capable that step S244 upgrades the y of W according to formula (3) respectively, upgrades the of W by formula (4) oK, and by formula (5) upgrade V.
W y = a + γ Σ i = 1 r 1 i V T x - - - ( 3 )
W y ‾ = b - γ Σ i = 1 r 1 i V T x ‾ - - - ( 4 )
V = V - γ Σ i = 1 r 1 i ( x ‾ b - xa ) - - - ( 5 )
Wherein γ is the step-length of stochastic gradient descent algorithm, and its value can be arranged according to method conventional in machine learning or pattern-recognition textbook.After having upgraded model variable, whole process ends at step S245.

Claims (5)

1., based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, comprise the steps,
(1) initialization marking model;
(2) mark of correlation of Stochastic choice piece image and this image from data acquisition, and represent based on many examples of this image the representative example determining this mark;
(3) obtain a uncorrelated mark come before mark of correlation by stochastic sampling, and represent based on many examples of selected image the representative example determining this uncorrelated mark;
(4) for selected image, the tlv triple Renewal model that mark of correlation and uncorrelated mark are formed;
(5) judge whether this model reaches requirement, if then terminate and export marking model; Otherwise return (2).
2. as claimed in claim 1 based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, the method of described marking model is trained to be, for the marking image of input, data image automatic marking device is adopted the image zooming-out in training image data acquisition to be met to the feature of many examples Multi-label learning input, every piece image is by a stack features vector representation, and each proper vector is called an example; After training out marking model, and predict by marking model, export mark of correlation as annotation results; The unmarked image of described device to input carries out same feature extraction and predictive marker process.
3. as claimed in claim 2 based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, under the input of many examples Multi-label learning represents, the representative example of each mark is determined by current marking model.
4., as claimed in claim 1 based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, utilize the tlv triple Renewal model that stochastic gradient descent algorithm is formed for image, mark of correlation and uncorrelated mark.
5. as claimed in claim 1 based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, whether judgment models reach the required standards and comprises iteration wheel number and reach the number of times that user specifies.
CN201310084956.XA 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning Active CN103116893B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310084956.XA CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310084956.XA CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Publications (2)

Publication Number Publication Date
CN103116893A CN103116893A (en) 2013-05-22
CN103116893B true CN103116893B (en) 2015-07-01

Family

ID=48415257

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310084956.XA Active CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Country Status (1)

Country Link
CN (1) CN103116893B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605667B (en) * 2013-10-28 2017-02-08 中国计量学院 Automatic image annotation algorithm
CN104318459B (en) * 2014-11-11 2017-07-11 苏州晨川通信科技有限公司 A kind of automation market analysis method based on Multimodal Learning
WO2017004803A1 (en) * 2015-07-08 2017-01-12 Xiaoou Tang An apparatus and a method for semantic image labeling
CN105117429B (en) * 2015-08-05 2018-11-23 广东工业大学 Scene image mask method based on Active Learning and multi-tag multi-instance learning
CN105894350A (en) * 2016-03-29 2016-08-24 南京大学 Method for recommending mobile game props based on multi-example multiple labeling learning
CN106022389B (en) * 2016-05-30 2019-05-14 南京航空航天大学 A kind of related feedback method actively selecting more example multiple labeling digital pictures
CN106127247B (en) * 2016-06-21 2019-07-09 广东工业大学 Image classification method based on the more example support vector machines of multitask
CN107122800B (en) * 2017-04-27 2020-09-18 南京大学 Robust digital image labeling method based on prediction result screening
CN110059696B (en) * 2019-04-26 2021-05-18 杭州智趣智能信息技术有限公司 Image annotation method and device and computer readable storage medium
CN110175657B (en) * 2019-06-05 2021-10-01 广东工业大学 Image multi-label marking method, device, equipment and readable storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101872343A (en) * 2009-04-24 2010-10-27 罗彤 Semi-supervised mass data hierarchy classification method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101872343A (en) * 2009-04-24 2010-10-27 罗彤 Semi-supervised mass data hierarchy classification method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Aurelien Lucchi等..Joint Image and Word Sense Discrimination for Image Retrieval.《Computer Vision-ECCV 2012》.2012, *
Bing Bai等..Learning to rank with (a lot of) word features.《Springer Science+Business Media,LLC 2009》.2009, *
Jason Weston等..WSABIE: Scaling Up To Large Vocabulary Image Annotation.《Proceedings of the 22nd International Joint Conference on Artificial Intelligence》.2011,第3卷 *
Koby Crammer等..A Family of Additive Online Algorithms for Category Ranking.《Journal of Machine Learning Research》.2003,第3卷 *

Also Published As

Publication number Publication date
CN103116893A (en) 2013-05-22

Similar Documents

Publication Publication Date Title
CN103116893B (en) Digital image labeling method based on multi-exampling multi-marking learning
CN105117429A (en) Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105354307A (en) Image content identification method and apparatus
CN105389584A (en) Streetscape semantic annotation method based on convolutional neural network and semantic transfer conjunctive model
CN103984959A (en) Data-driven and task-driven image classification method
CN103678431A (en) Recommendation method based on standard labels and item grades
CN103927394A (en) Multi-label active learning classification method and system based on SVM
CN105404686A (en) Method for matching place name and address in news event based on geographical feature hierarchical segmented words
CN104156433A (en) Image retrieval method based on semantic mapping space construction
CN104102696A (en) Content recommendation method and device
CN107797989A (en) Enterprise name recognition methods, electronic equipment and computer-readable recording medium
CN105678309A (en) Image multi-tag marking algorithm based on multi-example package feature learning
CN114639233B (en) Congestion state prediction method and device, electronic equipment and storage medium
CN113068131B (en) Method, device, equipment and storage medium for predicting user movement mode and track
CN110096499A (en) A kind of the user object recognition methods and system of Behavior-based control time series big data
CN104537028A (en) Webpage information processing method and device
CN102163285A (en) Cross-domain video semantic concept detection method based on active learning
CN103853792A (en) Automatic image semantic annotation method and system
CN105894038A (en) Credit card fraud prediction method based on signal transmission and link mode
CN104077288B (en) Web page contents recommend method and web page contents recommendation apparatus
CN114741732A (en) Intelligent networking automobile data training method based on privacy data protection, electronic equipment and computer readable storage medium
CN114997263A (en) Training rate analysis method, device, equipment and storage medium based on machine learning
CN104966109A (en) Medical laboratory report image classification method and apparatus
CN108647295B (en) Image labeling method based on depth collaborative hash
CN108664464A (en) A kind of the determination method and determining device of semantic relevancy

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant