CN110297933A

CN110297933A - A kind of theme label recommended method and tool based on deep learning

Info

Publication number: CN110297933A
Application number: CN201910587052.6A
Authority: CN
Inventors: 张雨柔; 李锐; 于治楼; 段强
Original assignee: Shandong Inspur Artificial Intelligence Research Institute Co Ltd
Current assignee: Shandong Inspur Artificial Intelligence Research Institute Co Ltd
Priority date: 2019-07-01
Filing date: 2019-07-01
Publication date: 2019-10-01

Abstract

The present invention discloses a kind of theme label recommended method based on deep learning, it is related to technical field of information processing, this method is based on zero-shot learning thought, feature extraction is carried out to image or video clip using the ability in feature extraction of depth network model, the tagsort of theme label is carried out to the feature of extraction using support vector machines model, and it obtains about image or a prediction theme label of video clip, prediction theme label is extended using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm, and then it obtains and the semantic relevant K theme label of prediction theme label, prediction theme label and K theme label are labeled as the final theme label of input picture or video clip, so that annotation results are more reliable.The theme label recommendation tool based on deep learning that invention additionally discloses a kind of, remaining aforementioned theme label recommended method combine, may be used in major social media network platform.

Description

A kind of theme label recommended method and tool based on deep learning

Technical field

The present invention relates to technical field of information processing, specifically a kind of theme label recommendation side based on deep learning Method and tool.

Background technique

With the development of internet technology, occur currently on the market miscellaneous from media social application software, example Such as microblogging, trill, the Little Red Book, watermelon video, all users can upload oneself video or image on App, simultaneously Suitable theme label is marked for it, when the theme label that we are marked meets the interest of viewer, theme label has newly When newness and attraction or theme label meet the fashion trend on current network, which will be will receive more Concern amount.It is appropriate for its mark due to the magnanimity of the diversification of media information content and media information data on network Theme label is still a urgent problem to be solved.

Traditional method some directly using single machine learning algorithm to the media contents such as image or video directly into The prediction of row theme label, but this is still unable to satisfy content slightly abundant or complicated image or video subject label recommendations and asks Topic.

Currently, the theme label classification that method also is recommended is relatively more fixed, current prevalence can not be well adapted for Trend causes recommended theme label not have attraction.

Summary of the invention

The present invention is directed to the demand and shortcoming of current technology development, provides a kind of theme label based on deep learning Recommended method and tool.

Firstly, the present invention discloses a kind of theme label recommended method based on deep learning, solves above-mentioned technical problem and adopt Technical solution is as follows:

A kind of theme label recommended method based on deep learning, this method are based on zero-shot learning thought, Feature extraction is carried out to image or video clip using the ability in feature extraction of depth network model, utilizes support vector machines Model carries out the tagsort of theme label to the feature of extraction, and obtains about image or a prediction theme of video clip Label is extended prediction theme label using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm, so obtain in advance The semantic relevant K theme label of theme label is surveyed, predicts theme label and K theme label as input picture or piece of video The final theme label of section is labeled.

Specifically, using depth network model ability in feature extraction to image or video clip carry out feature extraction it Before, need the ability in feature extraction to depth network model to be trained, concrete operations are as follows:

The image with label or video clip are collected as training set；

Feature extraction is carried out using image or video clip of the depth network model to training set；

The feature that depth network model is extracted inputs support vector machines model as image feature vector；

Support vector machines model carries out the tagsort of theme label to the feature of extraction, and obtain about image or One prediction theme label of video clip；

Bluebeard compound incorporation model word2vec and k nearest neighbor algorithm obtain multiple themes after being extended to prediction theme label Label；

Judge whether the multiple theme labels for predicting to obtain after theme label and extension are related；

If related, continue to train next image or video clip；

If uncorrelated, continue to train next image or video clip after correcting.

Specifically, involved training set is divided into three data sets；

There are three depth network models, and image classification model respectively based on CNN convolutional neural networks is followed based on RNN The image classification model of ring neural network, the image classification model based on DNN deep neural network；

Three image classification models carry out feature extraction to the image or video clip of three data sets respectively；

Three kinds of features that above three different images sorter network extracts are concatenated together to form new multidimensional image spy Vector is levied, multidimensional image feature vector inputs support vector machines model, is with the original label having of image or video clip Reference generates a prediction theme label.

Specifically, involved bluebeard compound incorporation model word2vec has modeled the correlative relationship between text, bluebeard compound Text representation is converted term vector form by incorporation model word2vec, for semantic similar text representation be converted into vector it After will have a lesser distance, semantic dissimilar text representation will have biggish distance, be based on this feature, and bluebeard compound is embedding Enter model word2vec by predict theme label project to term vector space, and further using k nearest neighbor algorithm search obtain with The similar K theme label of the prediction theme label is labeled as the final theme label of input picture or video clip.

Specifically, being extended using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm to prediction theme label, have Gymnastics is made

Prediction theme label is mapped to text vector space by bluebeard compound incorporation model word2vec；

Text vector is calculated at a distance from other vectors of corpus kind using the method for cosine similarity or Euclidean distance Relationship；

It is obtained and the most similar K theme label of prediction theme label currently entered by k nearest neighbor method；

Using the prediction theme label of this K theme label and support vector machines model as image or video clip Final theme label is labeled.

Specifically, involved corpus is periodically updated, meanwhile, corpus is also to bluebeard compound incorporation model word2vec It is updated.

Secondly, invention additionally discloses a kind of theme label recommendation tool based on deep learning, which includes depth net Network model, support vector machines model, bluebeard compound incorporation model word2vec, k nearest neighbor algoritic module；

Depth network model carries out feature extraction to image or video clip；

Bluebeard compound incorporation model word2vec and k nearest neighbor algoritic module are extended prediction theme label, and obtain with The most similar K theme label of prediction theme label currently entered；

K theme label of acquisition and the prediction theme label of support vector machines model word2vec as image or The final theme label of video clip is labeled.

Specifically, theme label recommendation tool further includes the instruction being trained to the ability in feature extraction of depth network model Practice module；

Training module includes collecting submodule and judging submodule；

It collects image or video clip of the submodule collection with label and is stored in training set, training set as training sample Image or video clip sequentially input depth network model；

Depth network model carries out feature extraction to the image or video clip of training set, extracts feature as characteristics of image Vector inputs support vector machines model, and support vector machines model carries out the feature point of theme label to the feature of extraction Class, and obtain and calculated about image or the prediction theme label bluebeard compound incorporation model word2vec and k nearest neighbor of video clip Method obtains multiple theme labels after being extended to prediction theme label；

Judging submodule judges whether the multiple theme labels for predicting to obtain after theme label and extension are related, and in correlation When directly export training set in next image or video clip, when uncorrelated, first correct export again it is next in training set Image or video clip.

Specifically, involved training set is divided into three data sets；

There are three depth network models, is image classification model based on CNN convolutional neural networks respectively, is followed based on RNN The image classification model of ring neural network, the image classification model based on DNN deep neural network；

Three kinds of features that above three different images sorter network extracts are concatenated together to form new multidimensional image spy Vector is levied, multidimensional image feature vector inputs support vector machines model, then, has so that image or video clip are original Label is reference, generates a prediction theme label.

Specifically, involved bluebeard compound incorporation model word2vec and k nearest neighbor algorithm are extended prediction theme label Concrete operations include:

A kind of theme label recommended method and tool based on deep learning of the invention, has compared with prior art Beneficial effect is:

1) the present invention is based on zero-shot learning thoughts, using the ability in feature extraction of depth network model to figure Picture or video clip carry out feature extraction, carry out the feature of theme label to the feature of extraction using support vector machines model Classification, and obtain about image or a prediction theme label of video clip, utilize bluebeard compound incorporation model word2vec and K Nearest neighbor algorithm is extended prediction theme label, and then obtains K theme label relevant to prediction theme label semanteme, in advance It surveys theme label and K theme label is labeled as the final theme label of input picture or video clip, so that mark As a result more reliable；

12) present embodiments can apply to the image or videos that in major social media network platform, uploaded by user are automatic Suitable theme label is selected, when the theme label of image or video labeling and the matching degree of its content are higher, and is more accorded with When closing current popular topic, image or video are concerned degree will be higher.

Detailed description of the invention

Attached drawing 1 is the principle flow chart of the embodiment of the present invention one；

Attached drawing 2 is the structural block diagram of the embodiment of the present invention two.

Each label information indicates in attached drawing 2:

1, depth network model, 2, support vector machines model,

3, bluebeard compound incorporation model word2vec, 4, k nearest neighbor algoritic module,

5, the image classification model of CNN convolutional neural networks,

6, based on the image classification model of RNN Recognition with Recurrent Neural Network,

7, based on the image classification model of DNN deep neural network,

8, training module, 9, collection submodule, 10, judging submodule.

Specific embodiment

The technical issues of to make technical solution of the present invention, solving and technical effect are more clearly understood, below in conjunction with tool Body embodiment carries out clear, complete description to technical solution of the present invention, it is clear that described embodiment is only this hair Bright a part of the embodiment, instead of all the embodiments.Based on the embodiment of the present invention, those skilled in the art are not doing All embodiments obtained under the premise of creative work out, all within protection scope of the present invention.

Embodiment one:

In conjunction with attached drawing 1, the present embodiment proposes that a kind of theme label recommended method based on deep learning, this method are based on Zero-shot learning thought carries out image or video clip using the ability in feature extraction of three depth network models Feature extraction is carried out the tagsort of theme label to the feature of extraction using support vector machines model, and obtained about figure One prediction theme label of picture or video clip leads prediction using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm Topic label is extended, and then obtains K theme label relevant to prediction theme label semanteme, predicts theme label and K Theme label is labeled as the final theme label of input picture or video clip.

In the present embodiment, feature is carried out to image or video clip using the ability in feature extraction of depth network model to mention Before taking, the ability in feature extraction to depth network model is needed to be trained, concrete operations are as follows:

The image with label or video clip are collected as training set, training set is divided into three data sets；

Feature extraction, three depth are carried out using image or video clip of three depth network models to three data sets Network model is respectively the image classification model based on CNN convolutional neural networks, the image classification based on RNN Recognition with Recurrent Neural Network Model, the image classification model based on DNN deep neural network；

Three kinds of features that above three different images sorter network extracts are concatenated together to form new multidimensional image spy Vector is levied, multidimensional image feature vector inputs support vector machines model；

If related, continue to train next image or video clip；

If uncorrelated, continue to train next image or video clip after correcting.

In the present embodiment, involved bluebeard compound incorporation model word2vec has modeled the correlative relationship between text, Text representation is converted term vector form by bluebeard compound incorporation model word2vec, and semantic similar text representation is converted into There will be lesser distance after vector, semantic dissimilar text representation there will be biggish distance, be based on this feature, knot Closing word incorporation model word2vec will predict that theme label projects to term vector space, and further utilize k nearest neighbor algorithm search K theme label similar with the prediction theme label is obtained to carry out as the final theme label of input picture or video clip Mark.

In the present embodiment, prediction theme label is carried out using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm Extension, concrete operations include:

In the present embodiment, involved corpus is periodically updated, meanwhile, corpus is also to bluebeard compound incorporation model Word2vec is updated.

Embodiment two:

In conjunction with attached drawing 2, the present embodiment proposes a kind of theme label recommendation tool based on deep learning, which includes deep Spend network model 1, support vector machines model 2, bluebeard compound incorporation model word2vec 3, k nearest neighbor algoritic module 4.

In attached drawing 2, depth network model 1, support vector machines model 2, bluebeard compound incorporation model word2vec 3, K are close Adjacent algoritic module 4 can directly acquire image or video clip without training, and export theme label.At this time:

Depth network model 1 carries out feature extraction to image or video clip；

The features of 2 pairs of support vector machines model extractions carry out the tagsort of theme label, and obtain about image or One prediction theme label of video clip；

4 pairs of prediction theme labels of bluebeard compound incorporation model word2vec 3 and k nearest neighbor algoritic module are extended, and are obtained With the most similar K theme label of prediction theme label currently entered；

K theme label of acquisition and the prediction theme label of support vector machines model 2word2vec as image or The final theme label of video clip is labeled.

In conjunction with attached drawing 2, in order to guarantee to export the maximum correlation of theme label and image or video clip, we can be with The image for being largely labeled with label or video clip are obtained in advance as training sample.At this point, theme label recommendation tool It also needs to include the training module 8 for being trained the ability in feature extraction of depth network model 1.

Training module 8 includes collecting submodule 9 and judging submodule 10；

It collects image or video clip of the collection of submodule 9 with label and is stored in training set as training sample, training The image or video clip of collection sequentially input depth network model 1；

Depth network model 1 carries out feature extraction to the image or video clip of training set, extracts feature as image spy It levies vector and inputs support vector machines model 2, the feature of 2 pairs of support vector machines model extractions carries out the feature of theme label Classification, and obtain close with K about a prediction theme label bluebeard compound incorporation model word2vec 3 of image or video clip Adjacent algorithm obtains multiple theme labels after being extended to prediction theme label；

Judging submodule 10 judges whether the multiple theme labels for predicting to obtain after theme label and extension are related, and in phase The next image or video clip directly exported in training set when pass is first corrected under exporting in training set again when uncorrelated One image or video clip.

In training module 8 to depth network model 1, support vector machines model 2, bluebeard compound incorporation model After the completion of word2vec3, the training of k nearest neighbor algoritic module 4, then the image directly acquired or video clip sequentially input, depth Network model 1, support vector machines model 2, bluebeard compound incorporation model word2vec 3, k nearest neighbor algoritic module 4, i.e., it is exportable Multiple theme labels relevant to input picture or video clip.

In the training process, involved training set is divided into three data sets；

There are three depth network models 1, is image classification model 5 based on CNN convolutional neural networks respectively, based on RNN The image classification model 6 of Recognition with Recurrent Neural Network, the image classification model 7 based on DNN deep neural network；

Three kinds of features that above three different images sorter network extracts are concatenated together to form new multidimensional image spy Vector is levied, multidimensional image feature vector inputs support vector machines model 2, then, has so that image or video clip are original Label is reference, generates a prediction theme label.

In the present embodiment, involved bluebeard compound incorporation model word2vec 3 and k nearest neighbor algorithm are to prediction theme label The concrete operations being extended include:

Prediction theme label is mapped to text vector space by bluebeard compound incorporation model word2vec 3；

Using the prediction theme label of this K theme label and support vector machines model 2 as image or video clip Final theme label is labeled.

In summary, using a kind of theme label recommended method and tool based on deep learning of the invention, Ke Yiying It uses in major social media network platform, the image or video uploaded by user automatically selects suitable theme label.Separately Outside, when the theme label of image or video labeling and the matching degree of its content are higher, and more meet current institute it is popular if When topic, image or video are concerned degree will be higher.

Use above specific case elaborates the principle of the present invention and embodiment, these embodiments are It is used to help understand core of the invention technology contents, the protection scope being not intended to restrict the invention.Based on of the invention upper State specific embodiment, those skilled in the art without departing from the principle of the present invention, to made by the present invention Any improvement and modification, all shall fall within the protection scope of the present invention.

Claims

1. a kind of theme label recommended method based on deep learning, which is characterized in that this method is based on zero-shot Learning thought carries out feature extraction to image or video clip using the ability in feature extraction of depth network model, utilizes Support vector machines model carries out the tagsort of theme label to the feature of extraction, and obtains about image or video clip A prediction theme label, prediction theme label is expanded using bluebeard compound incorporation model word2vec and k nearest neighbor algorithm Exhibition, and then obtain and the semantic relevant K theme label of prediction theme label, prediction theme label and K theme label conduct The final theme label of input picture or video clip is labeled.

2. a kind of theme label recommended method based on deep learning according to claim 1, which is characterized in that using deeply Before the ability in feature extraction of degree network model carries out feature extraction to image or video clip, need to depth network model Ability in feature extraction is trained, concrete operations are as follows:

The image with label or video clip are collected as training set；

Support vector machines model carries out the tagsort of theme label to the feature of extraction, and obtains about image or video One prediction theme label of segment；

Bluebeard compound incorporation model word2vec and k nearest neighbor algorithm obtain multiple theme marks after being extended to prediction theme label Label；

If related, continue to train next image or video clip；

If uncorrelated, continue to train next image or video clip after correcting.

3. a kind of theme label recommended method based on deep learning according to claim 2, which is characterized in that the instruction Practice collection and is divided into three data sets；

There are three the depth network models, and image classification model respectively based on CNN convolutional neural networks is followed based on RNN The image classification model of ring neural network, the image classification model based on DNN deep neural network；

By above three different images sorter network extract three kinds of features be concatenated together to be formed new multidimensional image feature to Amount, multidimensional image feature vector input support vector machines model, are ginseng with the original label having of image or video clip According to generating a prediction theme label.

4. a kind of theme label recommended method based on deep learning according to claim 1, which is characterized in that the knot It closes word incorporation model word2vec and has modeled the correlative relationship between text, the bluebeard compound incorporation model word2vec will be literary This expression is converted into term vector form, semantic similar text representation is converted into after vector will have it is lesser away from From semantic dissimilar text representation will have biggish distance, be based on this feature, bluebeard compound incorporation model word2vec will be pre- It surveys theme label and projects to term vector space, and is further similar to the prediction theme label using the acquisition of k nearest neighbor algorithm search K theme label be labeled as the final theme label of input picture or video clip.

5. a kind of theme label recommended method based on deep learning according to claim 4, which is characterized in that utilize knot It closes word incorporation model word2vec and k nearest neighbor algorithm to be extended prediction theme label, concrete operations include:

The distance relation of text vector and other vectors of corpus kind is calculated using the method for cosine similarity or Euclidean distance；

Using the prediction theme label of this K theme label and support vector machines model as the final of image or video clip Theme label is labeled.

6. a kind of theme label recommended method based on deep learning according to claim 5, which is characterized in that institute's predicate Material library is periodically updated, meanwhile, the corpus is also updated bluebeard compound incorporation model word2vec.

7. a kind of theme label recommendation tool based on deep learning, which is characterized in that the tool includes depth network model, branch Hold vector machine SVM model, bluebeard compound incorporation model word2vec, k nearest neighbor algoritic module；

The depth network model carries out feature extraction to image or video clip；

The support vector machines model carries out the tagsort of theme label to the feature of extraction, and obtain about image or One prediction theme label of video clip；

The bluebeard compound incorporation model word2vec and the k nearest neighbor algoritic module are extended prediction theme label, and obtain It takes and the most similar K theme label of prediction theme label currently entered；

K theme label of acquisition and the prediction theme label of support vector machines model word2vec are as image or video The final theme label of segment is labeled.

8. a kind of theme label recommendation tool based on deep learning according to claim 7, which is characterized in that the tool It further include the training module being trained to the ability in feature extraction of depth network model；

The training module includes collecting submodule and judging submodule；

The collection submodule collects the image with label or video clip as training sample and is stored in training set, training set Image or video clip sequentially input depth network model；

Depth network model carries out feature extraction to the image or video clip of training set, extracts feature as image feature vector Support vector machines model is inputted, support vector machines model carries out the tagsort of theme label to the feature of extraction, and It obtains about image or the prediction theme label bluebeard compound incorporation model word2vec and k nearest neighbor algorithm of video clip to pre- It surveys after theme label is extended and obtains multiple theme labels；

The judging submodule judges whether the multiple theme labels for predicting to obtain after theme label and extension are related, and in correlation When directly export training set in next image or video clip, when uncorrelated, first correct export again it is next in training set Image or video clip.

9. a kind of theme label recommendation tool based on deep learning according to claim 8, which is characterized in that the instruction Practice collection and is divided into three data sets；

There are three the depth network models, is image classification model based on CNN convolutional neural networks respectively, is followed based on RNN The image classification model of ring neural network, the image classification model based on DNN deep neural network；

By above three different images sorter network extract three kinds of features be concatenated together to be formed new multidimensional image feature to Amount, multidimensional image feature vector inputs support vector machines model, then, with the original label having of image or video clip For reference, a prediction theme label is generated.

10. a kind of theme label recommendation tool based on deep learning according to claim 8, which is characterized in that in conjunction with Word incorporation model word2vec and k nearest neighbor algorithm include: to the concrete operations for predicting that theme label is extended