CN106127247B - Image classification method based on the more example support vector machines of multitask - Google Patents

Image classification method based on the more example support vector machines of multitask Download PDF

Info

Publication number
CN106127247B
CN106127247B CN201610466376.0A CN201610466376A CN106127247B CN 106127247 B CN106127247 B CN 106127247B CN 201610466376 A CN201610466376 A CN 201610466376A CN 106127247 B CN106127247 B CN 106127247B
Authority
CN
China
Prior art keywords
packet
multitask
image
support vector
distance
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610466376.0A
Other languages
Chinese (zh)
Other versions
CN106127247A (en
Inventor
阮奕邦
肖燕珊
刘波
郝志峰
黎启祥
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong University of Technology
Original Assignee
Guangdong University of Technology
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong University of Technology filed Critical Guangdong University of Technology
Priority to CN201610466376.0A priority Critical patent/CN106127247B/en
Publication of CN106127247A publication Critical patent/CN106127247A/en
Application granted granted Critical
Publication of CN106127247B publication Critical patent/CN106127247B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/01Social networking

Abstract

The invention discloses a kind of image classification methods based on the more example support vector machines of multitask.This method comprises: establishing T learning tasks for T group image;The image of T learning tasks is carried out how instantiating;For one class packet of picture construction of each classification in T task;Establish the example in class packet to more example packets Euclidean distance formula;Example distance vector of the building class packet to more example packets;Establish class packet to more example packets weighted euclidean distance formula;The distance for constraining more example packets to generic is less than the distance to other classifications;Establish the optimization problem of the more example support vector machines of multitask;Converting optimization problem is traditional single task list example support vector machines problem;Solve support vector machines optimization problem.The present invention relates to a kind of methods for optimizing weighted euclidean distance formula, by establishing the more example support vector machines problems concerning study of multitask example images, so that optimal dissolve ideal weight, to improve the performance of Image Classifier.

Description

Image classification method based on the more example support vector machines of multitask
Technical field
The present invention relates to Image Classfication Technology fields, more particularly to the image based on the more example support vector machines of multitask Classification method.
Background technique
With the progress of information technology and the sustainable development of social networks, the figure of magnanimity is had existed above internet Picture, and the amount of images newly uploaded on internet daily also exponentially rises, and the scene that image is included is also more and more richer Richness, although social network sites have obtained permanent development, the picture of magnanimity is not fully utilized but on website, and Can all there are a large amount of new images to upload to above website daily, how identify not labeled image, and Accurate classification arrives It is most of Internet company all in a problem of research with preferably site for service user in corresponding classification.
On the one hand, due to that may include various background elements when shooting image, then it will lead to image It include not only a scene, if such as single example support vector machines may using traditional single example image recognition methods Lead to misclassification.For example, at the zoo photographed when, different plant species may be photographed same image simultaneously, such as The animals such as people, horse, bird all may be in same image.
On the other hand, due to the opening of internet and the diversity of capture apparatus, the photo of the same person may It appears in above different social network sites, perhaps as captured by distinct device or from different video institute editings, this A little pictures, which are mixed, to be identified, it is clear that is unreasonable;Furthermore it in order to improve the performance of Image Classifier, needs a large amount of Markd image carry out the training of classifier, if lack of training samples, will lead to the performance decline of classifier, from And influence the effect of image classification.The image classification of early stage is classified by way of handmarking, but this side The artificial of method successfully can be very high, perhaps also feasible under a small amount of image, but generates speed with the present image in internet, It is then less desirable.
Summary of the invention
Although there are many quantity in the same type of image marked face on the internet, not due to source mode Together, for example, the equipment of shooting or the social network sites of storage are different, these pictures, which are mixed, to carry out the training of classifier and is Unreasonable, but training is grouped according to source formation, then it can be potentially encountered lack of training samples so as to cause classification The problems such as accuracy decline of device, it is possible to using the form of multitask, several groups picture be trained simultaneously, and utilized The correlation of every group of picture improves the performance of every group of picture classifier.And since image contains multiple scenes, image is seen It is handled at single example, then can neglect the correlation of multiple scenes, multi-instance learning method can be used at this time, one A image regards multiple examples as.
Image classification method based on the more example support vector machines of multitask of the invention includes the following steps:
(1) image of several groups is obtained, and guarantees that the quantity of every group of image is few, as unit of group, establishes several Learning tasks, and in the form of handmarking, carry out the manual sort of image.
(2) all images of all learning tasks, more sample datas are converted to.
(3) in each multi-instance learning task, an associated more example packets are constructed for each image category, this is more Example packet in the present invention be known as class packet, and establish the example in class packet to more example packets Euclidean distance formula.
(4) building class packet to more example packets example distance vector, so that the weighting for establishing class packet to more example packets is European Range formula.
(5) constraint is established, guarantees that more example packets will be far smaller than to the distance of other classifications to the distance of generic.
(6) optimization problem of the more example support vector machines of multitask is established.
(7) the more example support vector machines optimization problems of the multitask of switch process (6) are a similar single task list example The optimization problem of support vector machines.
(8) the support vector machines optimization problem of solution procedure (7), can obtain the weight of optimization, to train one A Image Classifier based on the more example support vector machines of multitask, carries out the classification of image.
Detailed description of the invention
Fig. 1 is the flow chart of the Web page classification method of the invention based on maximum spacing multitask multi-instance learning.
Specific embodiment
Image classification method based on the more example support vector machines of multitask of the invention includes the following steps:
The first step, obtains the image of several groups, and guarantees that the quantity of every group of image is few, as unit of group, if establishing Dry learning tasks, and in the form of handmarking, carry out the manual sort of image.For example, if there is T group image, then T Image Classifier learning tasks are established, and since the amount of images of T task is all few, handmarking can be carried out.
All images of all learning tasks are converted to more sample datas by second step.Since image contains multiple fields Scape, and when classification, it is only necessary to one of key scenes, so whole image is converted to a single example at this time Classify, the correlation of multiple scenes may be neglected, classifying quality is caused to be deteriorated, so at this time can be using showing more Example learning method carries out image classification.Before multi-instance learning method, need to carry out more sample datas to image, it can With using classical image cutting method, such as the compartmentalization for Blobworld System, the Lai Jinhang image that the present invention uses, this When to each image-region carry out feature extraction, so that the image-region be made to be converted to an example.One image contains multiple Region can then be converted to multiple examples, and an image is properly termed as example packet more than one at this time.
Third step constructs an associated more example packets in each multi-instance learning task for each image category, More example packets in the present invention be known as class packet, and establish the example in class packet to more example packets Euclidean distance formula.No As traditional more exemplary methods, the present invention does not pay close attention to the distance between image and image directly, but all of each classification Image is combined, and establishes more example packets of a class rank, referred to as class packet, and the example established in class packet to showing more The Euclidean distance formula of example packet, as follows:
In above formula, exampleIt is class packet CktJ-th of example,It is more example packet BitCenter.nktIt is class packet Ckt Example number.
4th step, the example distance vector of building class packet to more example packets, to establish the weighting of class packet to more example packets Euclidean distance formula.In the third step, can in the hope of each class packet example to more example packets apart from size, with this apart from size For vector element, class packet is established to the example distance vector of more example packets, then k-th of class of t-th of task is clipped to i-th more shows The example distance vector of example packetIt is as follows:
Establish one and example distance vectorThe weight vector w of equal lengthkt, which is defined as follows:
By example distance vectorWith weight vector wktWant to multiply, then the weighting of available class packet to more example packets is European Range formula:
5th step, establish constraint, guarantee more example packets to generic distance will far smaller than arrive other classifications away from From.Establish following constraint:
In above formula, Pt(Bit) it is more example packet BitAffiliated category set, Nt(Bit) be and more example packet BitUnrelated class Do not gather,For error term, which ensure that classification n to more example packet BitDistance be greater than classification p to more example packets BitDistance.
6th step establishes the optimization problem of the more example support vector machines of multitask.In t-th of task, all categories Weight vector form a vector wt, it is as follows:
Correspondingly, one isometric vector of buildingVectorByWith-Composition, the other positions of the vector Filling 0, it is possible to the constraints conversion established in the 5th step be following form:
Based on the constraint, wtBe converted to the form of multi-task learning, i.e. wt=w0+vt, w0It is considered as that all tasks are total The public weight coefficient enjoyed, and vtIt is the weight coefficient that each task then exclusively enjoys, establishes the more example branch of a multitask thus The optimization problem of vector machine is held, as follows:
In above formula, CwFor controlling error termSize, regularization parameter γ0And γ1For controlling multi-instance learning Similitude between task.If γ0It is intended to infinity, then it is not that each multi-instance learning task, which trains the classifier come, It is relevant.Opposite, if γ1It is intended to infinity, then it is identical that all multi-instance learning tasks, which train the classifier come, Or it is similar.
7th step, the more example support vector machines optimization problems of multitask for turning the 6th step are a similar single task list example The optimization problem of support vector machines.In order to use the numerical technologies such as quadratic programming solve the more examples of the multitask support to Amount machine problem, needs the problem to be converted to the form of a similar traditional support vector machine optimization problem, therefore establishes two Vector is as follows:
According to two above vector, the more example support vector machines of multitask of the 6th step can be converted to the support of standard Vector machine optimization problem form is as follows:
8th step solves the support vector machines optimization problem of the 7th step, the weight of optimization can be obtained, to train One Image Classifier based on the more example support vector machines of multitask, carries out the classification of image.
In the case where not departing from spirit of that invention or necessary characteristic, the present invention can be embodied in other specific forms.It answers The specific embodiment various aspects are considered merely as illustrative and not restrictive.Therefore, scope of the invention such as appended claims It is shown as indicated above shown in range.Change in all equivalent meanings and range for falling in claim should be regarded as It falls in the scope of claim.

Claims (4)

1. a kind of image classification method based on the more example support vector machines of multitask, which comprises the steps of:
The first step, the image for obtaining several groups establish several learning tasks, and as unit of group with the shape of handmarking Formula carries out the manual sort of image;
Second step, all images all learning tasks, are converted to more sample datas;
Third step, in each multi-instance learning task, for each image category construct an associated more example packet set, More example packet collection are collectively referred to as class packet, and establish the example in class packet to more example packets Euclidean distance formula;
The example distance vector of 4th step, building class packet to more example packets, so that the weighting for establishing class packet to more example packets is European Range formula;
5th step establishes constraint, guarantees that more example packets are less than the distance of other classifications to the distance of generic;
6th step, the optimization problem for establishing the more example support vector machines of multitask;
7th step, convert the more example support vector machines optimization problems of multitask of the 6th step into a single task list example support to The optimization problem of amount machine;
8th step, the support vector machines optimization problem for solving the 7th step, can obtain the weight of optimization, to train one Based on the Image Classifier of the more example support vector machines of multitask, the classification of image is carried out;
In 6th step, the optimization problem of the more example support vector machines of multitask is established;In t-th of task, all classes Other weight vector forms a vector wt, it is as follows:
Correspondingly, one isometric vector of buildingVectorByWithComposition, the other positions filling of the vector 0, it is possible to the constraints conversion established in the 5th step be following form:
Based on the constraint, wtBe converted to the form of multi-task learning, i.e. wt=w0+vt, w0It is considered as all task sharings Public weight coefficient, and vtIt is the weight coefficient that each task is exclusively enjoyed, establishes the more example supporting vectors of a multitask thus The optimization problem of machine is as follows:
In above formula, T is task number, CwFor controlling error termSize, regularization parameter γ0And γ1It is more for controlling Similitude between learn-by-example task, if γ0It is intended to infinity, then each multi-instance learning task trains point come Class device is incoherent;Opposite, if γ1It is intended to infinity, then all multi-instance learning tasks train the classification come Device is same or similar;
In 7th step, the more example support vector machines optimization problems of multitask for turning the 6th step are a single task list example branch The optimization problem of vector machine is held, solves the more example supporting vectors of the multitask to use the numerical technologies such as quadratic programming Machine problem, needs the problem to be converted to the form of a similar traditional support vector machine optimization problem, thus establish two to It measures as follows:
In above formula,According to two above vector, the more example support vector machines of multitask of the 6th step can be converted It is as follows for the support vector machines optimization problem form of standard:
2. the image classification method according to claim 1 based on the more example support vector machines of multitask, which is characterized in that In third step, in each multi-instance learning task, an associated more example packets are constructed for each image category, this shows Example packet be known as class packet, and establish the example in class packet to more example packets Euclidean distance formula specifically: each classification All images are combined, and establish more example packets of a class rank, referred to as class packet, and the example established in class packet to showing more The Euclidean distance formula of example packet, as follows:
Wherein, exampleIt is class packet CktJ-th of example,It is more example packet BitCenter, nktIt is class packet CktExample Number.
3. the image classification method according to claim 2 based on the more example support vector machines of multitask, which is characterized in that In 4th step, building class packet to more example packets example distance vector, thus the weighting for establishing class packet to more example packets it is European away from From formula, in the third step, can in the hope of each class packet example to more example packets apart from size, using this apart from size as vector Element establishes class packet to the example distance vector of more example packets, then k-th of class of t-th of task is clipped to the packet of example more than i-th Example distance vectorIt is as follows:
Establish one and example distance vectorThe weight vector w of equal lengthkt, which is defined as follows:
By example distance vectorWith weight vector wktIt is multiplied, then weighted euclidean distance of the available class packet to more example packets Formula:
4. the image classification method according to claim 3 based on the more example support vector machines of multitask, which is characterized in that In 5th step, constraint is established, guarantees that more example packets are less than the distance of other classifications to the distance of generic, is established following Constraint:
In above formula, Pt(Bit) it is more example packet BitAffiliated category set, Nt(Bit) be and more example packet BitUnrelated classification collection It closes,For error term, which ensure that classification n to more example packet BitDistance be greater than classification p to more example packet Bit's Distance.
CN201610466376.0A 2016-06-21 2016-06-21 Image classification method based on the more example support vector machines of multitask Active CN106127247B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610466376.0A CN106127247B (en) 2016-06-21 2016-06-21 Image classification method based on the more example support vector machines of multitask

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610466376.0A CN106127247B (en) 2016-06-21 2016-06-21 Image classification method based on the more example support vector machines of multitask

Publications (2)

Publication Number Publication Date
CN106127247A CN106127247A (en) 2016-11-16
CN106127247B true CN106127247B (en) 2019-07-09

Family

ID=57268191

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610466376.0A Active CN106127247B (en) 2016-06-21 2016-06-21 Image classification method based on the more example support vector machines of multitask

Country Status (1)

Country Link
CN (1) CN106127247B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112183752A (en) * 2020-12-01 2021-01-05 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107392256A (en) * 2017-07-31 2017-11-24 广东工业大学 A kind of image-recognizing method and system
CN108664986B (en) * 2018-01-16 2020-09-04 北京工商大学 Based on lpNorm regularized multi-task learning image classification method and system
CN108537290A (en) * 2018-04-25 2018-09-14 攀枝花学院 Stellar spectra classification method based on data distribution characteristics and fuzzy membership function
CN109165673B (en) * 2018-07-18 2021-08-31 广东工业大学 Image classification method based on metric learning and multi-example support vector machine
CN109919165B (en) * 2019-03-18 2021-07-06 广东工业大学 Similarity-based multi-instance dictionary learning classification method and device
CN110008365B (en) * 2019-04-09 2023-02-07 广东工业大学 Image processing method, device and equipment and readable storage medium
CN110175657B (en) * 2019-06-05 2021-10-01 广东工业大学 Image multi-label marking method, device, equipment and readable storage medium
CN110414621B (en) * 2019-08-06 2022-03-22 广东工业大学 Classifier construction method and device based on multi-instance learning
CN112052840B (en) * 2020-10-10 2023-02-03 苏州科达科技股份有限公司 Picture screening method, system, equipment and storage medium
CN112766161B (en) * 2021-01-20 2022-12-02 西安电子科技大学 Hyperspectral target detection method based on integrated constraint multi-example learning

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102945372A (en) * 2012-10-18 2013-02-27 浙江大学 Classifying method based on multi-label constraint support vector machine
CN103116893A (en) * 2013-03-15 2013-05-22 南京大学 Digital image labeling method based on multi-exampling multi-marking learning
CN104778457A (en) * 2015-04-18 2015-07-15 吉林大学 Video face identification algorithm on basis of multi-instance learning
CN105046269A (en) * 2015-06-19 2015-11-11 鲁东大学 Multi-instance multi-label scene classification method based on multinuclear fusion
CN105069473A (en) * 2015-08-05 2015-11-18 广东工业大学 Multi-instance weighted packet learning method for online uncertain image recognition
CN105117429A (en) * 2015-08-05 2015-12-02 广东工业大学 Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105404877A (en) * 2015-12-08 2016-03-16 商汤集团有限公司 Human face attribute prediction method and apparatus based on deep study and multi-task study

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102945372A (en) * 2012-10-18 2013-02-27 浙江大学 Classifying method based on multi-label constraint support vector machine
CN103116893A (en) * 2013-03-15 2013-05-22 南京大学 Digital image labeling method based on multi-exampling multi-marking learning
CN104778457A (en) * 2015-04-18 2015-07-15 吉林大学 Video face identification algorithm on basis of multi-instance learning
CN105046269A (en) * 2015-06-19 2015-11-11 鲁东大学 Multi-instance multi-label scene classification method based on multinuclear fusion
CN105069473A (en) * 2015-08-05 2015-11-18 广东工业大学 Multi-instance weighted packet learning method for online uncertain image recognition
CN105117429A (en) * 2015-08-05 2015-12-02 广东工业大学 Scenario image annotation method based on active learning and multi-label multi-instance learning
CN105404877A (en) * 2015-12-08 2016-03-16 商汤集团有限公司 Human face attribute prediction method and apparatus based on deep study and multi-task study

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
Deep Convolutional Neural Networks for Multi-Instance Multi-Task Learning;Tao Zeng 等;《2015 IEEE International Conference on Data Mining》;20151231;第579-588页
基于SVM的多示例多标签主动学习;李杰龙 等;《计算机工程与设计》;20160131;第37卷(第1期);第254-258页
基于密度聚类和多示例学习的图像分类方法;陈涛 等;《吉林大学学报(工学版)》;20140731;第44卷(第4期);第1126-1134页

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112183752A (en) * 2020-12-01 2021-01-05 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection
CN112183752B (en) * 2020-12-01 2021-02-19 南京智谷人工智能研究院有限公司 End-to-end multi-example learning method based on automatic example selection

Also Published As

Publication number Publication date
CN106127247A (en) 2016-11-16

Similar Documents

Publication Publication Date Title
CN106127247B (en) Image classification method based on the more example support vector machines of multitask
Fang et al. Perceptual quality assessment of smartphone photography
CN108140032B (en) Apparatus and method for automatic video summarization
CN109558942B (en) Neural network migration method based on shallow learning
CN110147700B (en) Video classification method, device, storage medium and equipment
US8923570B2 (en) Automated memory book creation
CN110533097A (en) A kind of image definition recognition methods, device, electronic equipment and storage medium
CN106096661B (en) The zero sample image classification method based on relative priority random forest
CN109002857A (en) A kind of transformation of video style and automatic generation method and system based on deep learning
CN111444966A (en) Media information classification method and device
CN110796098B (en) Method, device, equipment and storage medium for training and auditing content auditing model
CN104112143A (en) Weighted hyper-sphere support vector machine algorithm based image classification method
CN109344738A (en) The recognition methods of crop diseases and pest crop smothering and device
CN111709477A (en) Method and tool for garbage classification based on improved MobileNet network
JP2017084340A (en) Tag processing method and tag processing device
CN111143617A (en) Automatic generation method and system for picture or video text description
CN106445977A (en) Picture pushing method and device
CN109165673A (en) Image classification method based on metric learning and more example support vector machines
CN109636764A (en) A kind of image style transfer method based on deep learning and conspicuousness detection
CN112418082A (en) Plant leaf identification system and method based on metric learning and depth feature learning
CN106250396B (en) Automatic image label generation system and method
CN110019827A (en) A kind of corpus library generating method, device, equipment and computer storage medium
WO2011096010A1 (en) Pattern recognition device
CN111444255A (en) Training method and device of data model
CN113657473A (en) Web service classification method based on transfer learning

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant