CN103116893A - Digital image labeling method based on multi-exampling multi-marking learning - Google Patents

Digital image labeling method based on multi-exampling multi-marking learning Download PDF

Info

Publication number
CN103116893A
CN103116893A CN201310084956XA CN201310084956A CN103116893A CN 103116893 A CN103116893 A CN 103116893A CN 201310084956X A CN201310084956X A CN 201310084956XA CN 201310084956 A CN201310084956 A CN 201310084956A CN 103116893 A CN103116893 A CN 103116893A
Authority
CN
China
Prior art keywords
mark
image
marking
model
many examples
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310084956XA
Other languages
Chinese (zh)
Other versions
CN103116893B (en
Inventor
周志华
黄圣君
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing University
Original Assignee
Nanjing University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nanjing University filed Critical Nanjing University
Priority to CN201310084956.XA priority Critical patent/CN103116893B/en
Publication of CN103116893A publication Critical patent/CN103116893A/en
Application granted granted Critical
Publication of CN103116893B publication Critical patent/CN103116893B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Image Analysis (AREA)

Abstract

The invention discloses a digital image labeling method based on multi-exampling multi-marking learning and aims at solving the technical problems that digital images frequently have complex semantics and cannot be effectively expressed or learned through the technical method based on single exampling. The digital image labeling method includes: initializing a marking model; randomly selecting one image and a related mark of the image from a data set, and determining a representative exampling of the mark; obtaining a non-related mark arranged in the front of the related mark through random sampling, and determining a representative exampling of the non-related mark; and performing gradient descent aiming at a triad formed by the image, the related mark and the non-related mark to update the model. According to the digital image labeling method, online learning is performed by aid of a random gradient descent algorithm, time and internal memory expanse are greatly reduced, marking accuracy is ensured, and marking efficiency is improved.

Description

Digital picture mask method based on many examples Multi-label learning
Technical field
The present invention relates to digital picture label technology field, particularly relate to a kind of digital picture mask method based on many examples Multi-label learning.
Background technology
Popular along with the universal and all kinds of social network sites of digital product has the digital picture of magnanimity to produce and propagate every day.Provide relevant service on so large-scale view data, one most crucial is also that the most difficult task is to allow the semanteme of computer understanding image, and image labeling is gordian technique wherein.
The visual signature that the task of automatic image annotation device is based on digital picture comes the semantic marker of predicted picture.Concrete, at first annotation equipment can extract visual signature and represent these images from digital picture, then based on these character representations, train a marking model from the sets of image data of existing semantic marker.After the character representation input marking model that will not have markd digital picture, model just can dope their semantic marker.
Present automatic image annotation technology often is shown as image table single example.But image often has complicated semanteme, comprises a plurality of object entities, and the expression of so single example can cause information loss, semanteme that can't the accurate description image, thus can't the Accurate Prediction image tagged.More the effective ways input that is based on many examples multiple labeling machine learning (be called for short MIML) represents mode, and the set that forms with a plurality of examples of features represents piece image, wherein often fairly simple entity of correspondence and semanteme of each example.Minority is arranged at present based on the automatic image annotation technology of MIML input expression, but their model complexity increases greatly and sharply along with the change of representation space, causes very poor efficiency of these technology, can't be applied on large-scale image labeling task.Therefore, a kind ofly can demand urgently proposing based on the efficient automatic image annotation technology of MIML input expression.
Summary of the invention
Technical matters: often have complicated semantic for digital picture, and can't carry out effective expression and study to it based on the technology of single example, and technical matters such as mark inefficiency etc., the present invention proposes a kind of digital picture mask method based on many examples Multi-label learning.
Technical scheme: the digital picture mask method based on many examples Multi-label learning, comprise the steps,
(1) initialization marking model;
(2) select at random a mark of correlation of piece image and this image from data acquisition, and determine the example that represents of this mark;
(3) obtain a uncorrelated mark that comes the mark of correlation front by stochastic sampling, and determine the example that represents of this uncorrelated mark;
(4) for image, the tlv triple that mark of correlation and uncorrelated mark consist of is carried out the Gradient Descent Renewal model;
(5) judge whether this model reaches requirement, if return to (2); Otherwise finish and export marking model.
The present invention adopts technique scheme, has following beneficial effect: the present invention is based on many examples Multi-label learning framework, each mark to image, the example that represents of selecting a "current" model to approve most from many examples multiple labeling machine learning input expression represents this image, thereby can take full advantage of the more information that the machine learning of many examples multiple labeling brings.Simultaneously, utilize stochastic gradient descent algorithm to carry out on-line study, greatly reduce time and memory cost, thereby both guaranteed the degree of accuracy of mark, improved again mark efficient.
Description of drawings
Fig. 1 is the process flow diagram of the training marking model of the embodiment of the present invention;
Fig. 2 is the method flow diagram of the embodiment of the present invention;
Fig. 3 is the sampled images of the embodiment of the present invention and the process flow diagram that represents example of determining its mark of correlation;
Fig. 4 is the uncorrelated mark of the sampling of the embodiment of the present invention and determines that it represents the process flow diagram of example;
Fig. 5 is the process flow diagram of the renewal marking model of the embodiment of the present invention.
Embodiment
Below in conjunction with specific embodiment, further illustrate the present invention, should understand these embodiment only is used for explanation the present invention and is not used in and limits the scope of the invention, after having read the present invention, those skilled in the art all fall within the application's claims limited range to the modification of the various equivalent form of values of the present invention.
Fig. 1 is the process flow diagram of the digital picture automatic marking device training marking model of the embodiment of the present invention.Suppose that the training image data acquisition is comprised of N width image, every width image all marks.Device extracts the feature that meets the input of many examples Multi-label learning to the image in the data set, each width image is by a stack features vector representation, and each proper vector is called an example.As preferably, feature extraction can use the classical way in the machine learning textbook to generate applicable characteristics of image, for example first carries out image segmentation, then each image block is extracted the features such as color, texture, shape.After training out marking model with this data acquisition, device will carry out same feature extraction to the unmarked image of input, and predict with marking model, and the output mark of correlation is as annotation results.
Figure 2 shows that the method flow diagram of the embodiment of the present invention.Step S20 carries out initialization to marking model, mainly comprises two matrix W and V initialize.Suppose that the characteristics of image dimension that extracts is d, always have L possible mark, W is the matrix of a L * 100 sizes, and V is the matrix of a d * 100 sizes.The numerical value 100 here can change other value into according to user's needs, usually selects larger value can improve the mark accuracy, and less value can pick up speed.W and V are endowed random value and guarantee that the average of their each row is 0, and standard deviation is
Figure BDA00002920467600031
Step S21 is stochastic sampling piece image X from the training data set, supposes that its mark of correlation set is Y, and uncorrelated tag set is
Figure BDA00002920467600032
Device is random from Y selects a mark, is assumed to be y, and selects from set X for y and represent example x, and detailed process is seen Fig. 3.Step S22 is from the uncorrelated tag set of this image
Figure BDA00002920467600033
In one by one stochastic sampling mark, until run into the uncorrelated mark that a quilt comes the y front And determine to represent example for it
Figure BDA00002920467600035
Detailed process is seen Fig. 4.Step S23 carries out the Gradient Descent Renewal model, and detailed process is seen Fig. 5.Whether step S24 judgment models reaches requirement, is to finish training process, otherwise gets back to step S21.Here whether judgment models reaches standard and can adopt in machine learning or pattern-recognition textbook method commonly used, reaches the number of times of user's appointment such as iteration wheel number.
Figure 3 shows that the detailed process of step S21 in Fig. 2.Step S210 is for beginning action.In step S211, randomly draw piece image X in the training image data acquisition, suppose that it comprises n example { x altogether 1..., x n, the mark of correlation set Y of image comprises m mark { y altogether 1... y m.Step S212 randomly draws a mark y from Y.The initial i of step S213 is 1, begins the sample count in X.Whether step S214 judges i greater than n, and if greater than example in X traveled through, jump to step S217, otherwise enter in next step S215.Step S215 is to example x iCalculate the value shown in following formula:
f y(x i)=W yV Tx i (1)
W wherein yThe y of expression W is capable, f y(x i) can be understood as example x iDegree of correlation with mark y.Step S216 will count i and add 1, then return to step S214.Step S217 is all f that calculate relatively y(x i) value, and the example of selective value maximum example is assumed to be x as representing, corresponding f y(x) can be understood as the degree of correlation of this width image and mark y.Whole process ends at step S218.
Figure 4 shows that the detailed process of step S23 in Fig. 2.Suppose the uncorrelated tag set of image
Figure BDA00002920467600036
Comprise altogether t mark
Figure BDA00002920467600041
Step S230 is origination action.Step S231 is mark y calculated value f according to formula (1) y(x), we suppose that x is the example that represents of y here.Step S232 initialization i is 1, begins counting, and introduces an indieating variable Q, is initialized as 0.Step S233 judges whether that t uncorrelated mark all traveled through, finishes if i>t jumps to step S238, otherwise enters next step.Step S234 is mark according to formula (1)
Figure BDA00002920467600042
Calculate
Figure BDA00002920467600043
Here we suppose
Figure BDA00002920467600044
For
Figure BDA00002920467600045
Represent example.Step S235 relatively two mark y and
Figure BDA00002920467600046
Order, if
Figure BDA00002920467600047
Enter step S236, will count i and add and return in the lump step S234; If incoherent mark
Figure BDA000029204676000412
Come the front of mark of correlation y, namely
Figure BDA000029204676000413
, jump to step S237, be i with the Q assignment, be illustrated in the uncorrelated mark that a violation order has been found in the i time sampling, and the uncorrelated mark that will find and represent that example is recorded as respectively
Figure BDA000029204676000414
With
Figure BDA000029204676000415
, then enter step S238 and finish.
Figure 5 shows that the detailed process by the Gradient Descent Renewal model.Step S240 is origination action.Whether step S241 judgement has sampled the mark of violation order in step S23, namely whether Q is greater than 0.If Q=0, expression does not sample the mark of violation order, directly enters step S245 and finishes, otherwise enter step S242, estimates the rank r of mark of correlation y according to the Q value, and specific formula for calculation is as follows:
Figure BDA00002920467600048
Wherein t is
Figure BDA000029204676000416
The number of middle mark,
Figure BDA000029204676000419
Symbolic representation rounds downwards.Step S243 is a with the y line item of model variable W, the
Figure BDA000029204676000417
Line item is b.It is capable that step S244 upgrades the y of W according to formula (3) respectively, upgrades the of W by formula (4)
Figure BDA000029204676000418
OK, and by formula (5) upgrade V.
W y = a + γ Σ i = 1 r 1 i V T x - - - ( 3 )
W y ‾ = b - γ Σ i = 1 r 1 i V T x ‾ - - - ( 4 )
V = V - γ Σ i = 1 r 1 i ( x ‾ b - xa ) - - - ( 5 )
Wherein γ is the step-length of stochastic gradient descent algorithm, and its value can be according to method setting commonly used in machine learning or pattern-recognition textbook.After having upgraded model variable, whole process ends at step S245.

Claims (5)

1. based on the digital picture mask method of many examples Multi-label learning, it is characterized in that, comprise the steps,
(1) initialization marking model;
(2) select at random a mark of correlation of piece image and this image from data acquisition, and determine the example that represents of this mark;
(3) obtain a uncorrelated mark that comes the mark of correlation front by stochastic sampling, and determine the example that represents of this uncorrelated mark;
(4) for selected image, the tlv triple Renewal model that mark of correlation and uncorrelated mark consist of;
(5) judge whether this model reaches requirement, if return to (2); Otherwise finish and export marking model.
2. the digital picture mask method based on many examples Multi-label learning as claimed in claim 1, it is characterized in that, the method of described training marking model is, marking image for input, adopt data image automatic marking device the image in the training image data acquisition to be extracted the feature that meets the input of many examples Multi-label learning, each width image is by a stack features vector representation, and each proper vector is called an example; After training out marking model, and predict with marking model, the output mark of correlation is as annotation results; Described device carries out same feature extraction and predictive marker process to the unmarked image of input.
3. the digital picture mask method based on many examples Multi-label learning as claimed in claim 2, is characterized in that, under many examples Multi-label learning input expression, the example that represents of each mark is determined by current marking model.
4. the digital picture mask method based on many examples Multi-label learning as claimed in claim 1, is characterized in that, the tlv triple Renewal model that utilizes stochastic gradient descent algorithm to consist of for image, mark of correlation and uncorrelated mark.
5. the digital picture mask method based on many examples Multi-label learning as claimed in claim 1, is characterized in that, whether judgment models reach the required standards comprises that iteration wheel number reaches the number of times of user's appointment.
CN201310084956.XA 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning Active CN103116893B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310084956.XA CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310084956.XA CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Publications (2)

Publication Number Publication Date
CN103116893A true CN103116893A (en) 2013-05-22
CN103116893B CN103116893B (en) 2015-07-01

Family

ID=48415257

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310084956.XA Active CN103116893B (en) 2013-03-15 2013-03-15 Digital image labeling method based on multi-exampling multi-marking learning

Country Status (1)

Country Link
CN (1) CN103116893B (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605667A (en) * 2013-10-28 2014-02-26 中国计量学院 Automatic image annotation algorithm
CN104318459A (en) * 2014-11-11 2015-01-28 苏州晨川通信科技有限公司 Automatic market analysis method based on multi-mode learning
CN105117429A (en) * 2015-08-05 2015-12-02 广东工业大学 Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105894350A (en) * 2016-03-29 2016-08-24 南京大学 Method for recommending mobile game props based on multi-example multiple labeling learning
CN106022389A (en) * 2016-05-30 2016-10-12 南京航空航天大学 Related feedback method for actively selecting multi-instance multi-mark digital image
CN106127247A (en) * 2016-06-21 2016-11-16 广东工业大学 Image classification method based on multitask many examples support vector machine
CN107122800A (en) * 2017-04-27 2017-09-01 南京大学 A kind of Robust digital figure mask method based on the screening that predicts the outcome
CN107851174A (en) * 2015-07-08 2018-03-27 北京市商汤科技开发有限公司 The apparatus and method of linguistic indexing of pictures
CN110059696A (en) * 2019-04-26 2019-07-26 杭州智趣智能信息技术有限公司 A kind of image labeling method, device and computer readable storage medium
CN110175657A (en) * 2019-06-05 2019-08-27 广东工业大学 A kind of image multi-tag labeling method, device, equipment and readable storage medium storing program for executing

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101872343A (en) * 2009-04-24 2010-10-27 罗彤 Semi-supervised mass data hierarchy classification method

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101872343A (en) * 2009-04-24 2010-10-27 罗彤 Semi-supervised mass data hierarchy classification method

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
AURELIEN LUCCHI等。: "Joint Image and Word Sense Discrimination for Image Retrieval", 《COMPUTER VISION-ECCV 2012》 *
BING BAI等。: "Learning to rank with (a lot of) word features", 《SPRINGER SCIENCE+BUSINESS MEDIA,LLC 2009》 *
JASON WESTON等。: "WSABIE: Scaling Up To Large Vocabulary Image Annotation", 《PROCEEDINGS OF THE 22ND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE》 *
KOBY CRAMMER等。: "A Family of Additive Online Algorithms for Category Ranking", 《JOURNAL OF MACHINE LEARNING RESEARCH》 *

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103605667B (en) * 2013-10-28 2017-02-08 中国计量学院 Automatic image annotation algorithm
CN103605667A (en) * 2013-10-28 2014-02-26 中国计量学院 Automatic image annotation algorithm
CN104318459A (en) * 2014-11-11 2015-01-28 苏州晨川通信科技有限公司 Automatic market analysis method based on multi-mode learning
CN104318459B (en) * 2014-11-11 2017-07-11 苏州晨川通信科技有限公司 A kind of automation market analysis method based on Multimodal Learning
CN107851174B (en) * 2015-07-08 2021-06-01 北京市商汤科技开发有限公司 Image semantic annotation equipment and method, and generation method and system of image semantic annotation model
CN107851174A (en) * 2015-07-08 2018-03-27 北京市商汤科技开发有限公司 The apparatus and method of linguistic indexing of pictures
CN105117429A (en) * 2015-08-05 2015-12-02 广东工业大学 Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105117429B (en) * 2015-08-05 2018-11-23 广东工业大学 Scene image mask method based on Active Learning and multi-tag multi-instance learning
CN105894350A (en) * 2016-03-29 2016-08-24 南京大学 Method for recommending mobile game props based on multi-example multiple labeling learning
CN106022389B (en) * 2016-05-30 2019-05-14 南京航空航天大学 A kind of related feedback method actively selecting more example multiple labeling digital pictures
CN106022389A (en) * 2016-05-30 2016-10-12 南京航空航天大学 Related feedback method for actively selecting multi-instance multi-mark digital image
CN106127247B (en) * 2016-06-21 2019-07-09 广东工业大学 Image classification method based on the more example support vector machines of multitask
CN106127247A (en) * 2016-06-21 2016-11-16 广东工业大学 Image classification method based on multitask many examples support vector machine
CN107122800A (en) * 2017-04-27 2017-09-01 南京大学 A kind of Robust digital figure mask method based on the screening that predicts the outcome
CN107122800B (en) * 2017-04-27 2020-09-18 南京大学 Robust digital image labeling method based on prediction result screening
CN110059696A (en) * 2019-04-26 2019-07-26 杭州智趣智能信息技术有限公司 A kind of image labeling method, device and computer readable storage medium
CN110059696B (en) * 2019-04-26 2021-05-18 杭州智趣智能信息技术有限公司 Image annotation method and device and computer readable storage medium
CN110175657A (en) * 2019-06-05 2019-08-27 广东工业大学 A kind of image multi-tag labeling method, device, equipment and readable storage medium storing program for executing
CN110175657B (en) * 2019-06-05 2021-10-01 广东工业大学 Image multi-label marking method, device, equipment and readable storage medium

Also Published As

Publication number Publication date
CN103116893B (en) 2015-07-01

Similar Documents

Publication Publication Date Title
CN103116893B (en) Digital image labeling method based on multi-exampling multi-marking learning
CN102902821B (en) The image high-level semantics mark of much-talked-about topic Network Based, search method and device
CN103678431A (en) Recommendation method based on standard labels and item grades
CN104573359A (en) Method for integrating crowdsource annotation data based on task difficulty and annotator ability
CN110415521B (en) Traffic data prediction method, apparatus and computer-readable storage medium
CN105117429A (en) Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105389584A (en) Streetscape semantic annotation method based on convolutional neural network and semantic transfer conjunctive model
CN105404686A (en) Method for matching place name and address in news event based on geographical feature hierarchical segmented words
CN104102696A (en) Content recommendation method and device
CN104156433A (en) Image retrieval method based on semantic mapping space construction
CN103927394A (en) Multi-label active learning classification method and system based on SVM
CN102929894A (en) Online clustering visualization method of text
CN111210111B (en) Urban environment assessment method and system based on online learning and crowdsourcing data analysis
CN108959329A (en) A kind of file classification method, device, medium and equipment
CN107239564A (en) A kind of text label based on supervision topic model recommends method
CN110096499A (en) A kind of the user object recognition methods and system of Behavior-based control time series big data
CN102163285A (en) Cross-domain video semantic concept detection method based on active learning
CN112926399A (en) Target object detection method and device, electronic equipment and storage medium
CN105894038A (en) Credit card fraud prediction method based on signal transmission and link mode
CN115374189A (en) Block chain-based food safety tracing method, device and equipment
CN104077288B (en) Web page contents recommend method and web page contents recommendation apparatus
CN114997263A (en) Training rate analysis method, device, equipment and storage medium based on machine learning
CN104572915A (en) User event relevance calculation method based on content environment enhancement
CN110472143A (en) A kind of information-pushing method, device, readable storage medium storing program for executing and terminal device
CN106022389B (en) A kind of related feedback method actively selecting more example multiple labeling digital pictures

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant