CN110619344A

CN110619344A - Microblog friend recommendation method based on SSD and time sequence model

Info

Publication number: CN110619344A
Application number: CN201910635218.7A
Authority: CN
Inventors: 马汉达; 景迪
Original assignee: Jiangsu University
Current assignee: Jiangsu University
Priority date: 2019-07-15
Filing date: 2019-07-15
Publication date: 2019-12-27
Anticipated expiration: 2039-07-15
Also published as: CN110619344B

Abstract

The invention belongs to the field of picture processing analysis, and particularly relates to a microblog friend recommendation method based on an SSD (solid State disk) and a time sequence model; according to the method, dynamic picture information issued by a blog user in a platform is collected and analyzed, the trained SSD target detection algorithm is utilized to retrieve user interest information hidden in the picture, and on the basis, a time sequence model is utilized to conduct time sequence analysis to obtain a friend recommendation list. The invention improves the scientificity of the friend recommendation strategy, improves the accuracy of the recommendation result, and enriches the friend recommendation strategy scheme, thereby improving the richness of the recommendation strategy.

Description

Microblog friend recommendation method based on SSD and time sequence model

Technical Field

The invention relates to the field of picture processing analysis, in particular to a microblog friend recommendation method based on an SSD and a time sequence model.

Background

In order to improve the quality and interpretability of the acquired user information and recommend friends to the user and improve the acceptance rate of the user on recommendation results, a friend recommendation system needs to recommend according to different user interests; in order to implement this method, when a friend is recommended, effective information of an analysis user can be obtained from different angles as consideration content of user similarity.

In the existing microblog friend recommendation algorithm, the following two recommendation modes are mainly adopted: and recommending friends based on the social relationship, and mining user interest preference based on microblog content.

Recommending friends based on the social relationship, wherein user recommendation is mainly carried out according to similar social preferences, so that the acceptance rate of users is greatly improved, a social topological graph is established by using the attention relationship of the users, and social similarity is obtained through specific algorithm processing so as to recommend the friends; such as predicting users who may be friends using Pareto optimal genetic algorithms (Pareto-optimal). There is a problem in that it is limited to the user's own buddy list information.

In a huge social network such as microblog, a user not only has a social relationship, but also contains a large amount of blog information. Potential interest preference of the user can be deeply mined through a large amount of text information based on microblog content, potential friends are recommended to a target user, and a social circle is enlarged; for example, a feature vector of a user represented by a keyword with a higher weight is extracted by using a Term Frequency-Inverse Document Frequency (TF-IDF) algorithm, and a tag may be added to each user, so that friends with similar interests are recommended to the user according to the similarity of the interest tags. The microblog friend recommendation algorithm has common interests and hobbies for friend recommendation, processing means for text information generated by a user in a microblog platform are enriched continuously, the technology is mature continuously, but processing of picture data generated by the user is omitted, and the picture is often rich in information effective for user interest analysis. Therefore, the SSD target detection algorithm is used for extracting the information of the user picture data so as to obtain the user interest and hobbies, and the time sequence model is utilized to improve the effectiveness and the accuracy of the user interest analysis by combining the characteristic that the interest of the user is gradually reduced along with the lapse of time, which is very significant.

Disclosure of Invention

The invention provides a microblog friend recommendation method based on an SSD (solid state disk) and a time sequence model, aiming at the problem that important information of picture data in friend recommendation strategies is lack of effective utilization and processing.

In order to achieve the purpose, the specific technical scheme of the invention is as follows: a microblog friend recommendation method based on an SSD and a time sequence model comprises the following steps:

1) collecting data and constructing a data table per _ info, namely acquiring picture data and corresponding time data of a user from a microblog platform, and constructing the data table per _ info, wherein one picture corresponds to one entry of the data table per _ info, and each entry comprises 4 data elements: name, date, picture and category, wherein the name represents a user name, the date represents time, the picture represents a user picture, and the category represents the category;

2) selecting a public data set as a training data set;

3) training a model;

4) taking picture data in the data table per _ info as input of the trained SSD model, and constructing a user interest tag matrix;

5) calculating a potential interest distribution matrix, wherein the time sequence calculation formula of each interest is as follows:

taking time in a time interval as a time point, wherein n represents the total number of the time points, and t represents the t-th time point;

6) calculating an interest similarity matrix of each time period between users p and q by using a probability distribution distance formula

N rows in the matrix represent interest similarity of the nth time period, and the calculation formula is as follows:

wherein D is_JSRepresenting JS divergence, D_KLRepresenting the KL divergence.

7) And sequencing the similarity according to the obtained interest similarity between the user and other users to obtain the top k friends, and recommending the top k friends to corresponding target users.

Further, in the step 3), the model training includes the following steps:

3.1) preprocessing the training data set, processing the pictures of the training data set into images of 300 × 300 pixels;

3.2) building an SSD network architecture;

3.3) setting the prior frame size and the length-width ratio, and calculating the width w of the prior frame_kAnd a height h_k；

3.4) calculating the intersection ratio IOU value, wherein the calculation formula is as follows:

wherein S is_p∩S_gRepresenting the intersection of the area of the prior frame and the area of the real target frame, S_p∪S_gRepresenting the union of the area of the prior frame and the real target frame;

3.5) matching prior frames, wherein the prior frame matching method comprises the following steps: for each real target frame in the graph, taking the prior frame with the largest IOU value as an object which is successfully matched, for the prior frame which is not successfully matched, comparing the IOU value with a threshold value, and if the IOU value is larger than the threshold value, setting the prior frame as the object which is successfully matched;

3.6) calculating the SSD detection value, including the predicted position and the classification information; the classification information calculation method is that if a certain prior frame is successfully matched with a real target frame, the real target frame is considered to be consistent with the category information of the prior frame; the predicted position is recorded as l ═ (l)^cx,l^cy,l^w,l^h) The calculation formula is as follows:

l^cx＝(b^cx-d^cx)/d^w，l^cy＝(b^cy-d^cy)/d^h

l^w＝log(b^w/d^w)，l^h＝log(b^h/d^h)

wherein d ═ d (d)^cx,d^cy,d^w,d^h) Representing a priori box position, b ═ b^cx,b^cy,b^w,b^h) Representing the position of the corresponding real target frame;

3.7) calculating the confidence error L_conf(x, c) and position error L_loc(x,l,g)；

3.8) calculating a loss function L (x, c, L, g), said loss function being calculated as follows:

wherein N is the number of positive samples of the prior frame; α is a weight (typically 1).

Further, the method for constructing the user interest tag matrix in the step 4) is as follows:

4.1) carrying out interest identification on the input pictures of the trained model, outputting the model to obtain the category of each picture, and assigning the category to the category attribute in the per _ info;

4.2) data processing, namely dividing user data into n time periods by taking T days as time intervals according to the date attribute, and constructing data with the same name attribute into a group of data sets { name, date, picture, category } in each time period;

4.3) constructing to obtain a certain time period t of the user_iH, { h ═ h_y1,h_y2,…,h_ymUser interest vector, where h_yiRepresenting the proportion of the occurrence frequency of the objects in the ith type of interest tags in the interest tags of the user to the total occurrence frequency of the objects in the t-th time period;

4.4) integrating the user interest vectors of n time periods, constructing and obtaining a user interest label matrix, and recording

Is composed of

Further, in the step 3.3), the prior frame scale linear formula is as follows:

wherein S is_kRepresents the corresponding scale size of the k prior frame, S_minRepresenting the smallest prior boxDimension, value 0.2, S_maxRepresenting the maximum prior frame scale, taking the value of 0.9, and taking m as the number of the characteristic graphs;

the aspect ratio of the prior box satisfies

Width w of the prior frame_kAnd a height h_kThe calculation formula is as follows:

wherein, W_inputAnd H_inputRespectively the width and height of the input image.

Further, in the step 3.7), the position error L is determined_loc(x, l, g) is calculated as follows:

wherein, smooth_L1The loss function calculation formula is as follows:

for the loss error of the real target, the calculation formula is:

in the formula (I), the compound is shown in the specification,respectively represent the center of the jth real target frameThe horizontal coordinate, the vertical coordinate, the width and the height of the position;respectively representing the loss errors of the abscissa, the ordinate, the width and the height of the center position of the jth real target;respectively represents the abscissa and ordinate of the central position of the ith prior frame, the width and the height of the ith prior frame.

Further, in the step 3.7), the confidence error L is determined_conf(x, c), the calculation formula is as follows:

wherein the content of the first and second substances,when the value is 1, the value represents that the ith prior frame is matched with the jth group channel, and the category is p; c is a category confidence degree predicted value; l is the predicted value of the corresponding bounding box position, and g is the position parameter of the groudtruth.

Further, said D in the above step 7)_JSThe calculation formula is as follows:

said D_KLThe calculation formula is as follows:

the method has the advantages that the picture information generated by the user is fully utilized, and the characteristic that the interest and hobbies of the user change along with the lapse of time is considered, so that the recommendation strategy is more scientific, the accuracy of the recommendation result is improved, the friend recommendation strategy scheme is enriched, and the richness of the recommendation strategy is improved.

Drawings

FIG. 1 is a data flow diagram associated with an SSD destination detection algorithm.

FIG. 2 is a specific flowchart of a microblog friend recommendation algorithm based on an SSD and a timing model.

Fig. 3 is four main components of the SSD model.

FIG. 4 is the SSD model main execution sequence.

Detailed Description

The present invention will be further described with reference to the accompanying drawings and specific embodiments, it should be noted that the technical solutions and design principles of the present invention are described in detail below only with one optimized technical solution, but the scope of the present invention is not limited thereto.

The present invention is not limited to the above-described embodiments, and any obvious improvements, substitutions or modifications can be made by those skilled in the art without departing from the spirit of the present invention.

Referring to fig. 1-4, a microblog friend recommendation algorithm based on an SSD and a timing model. The method collects relevant data published by the user from the Sina microblog. As shown in fig. 1, according to the algorithm design requirement, a training data set is used to train an SSD model, and then microblog user data is input into the SSD model to obtain classification information of each user picture, and finally a user interest tag matrix is obtained for further processing and analysis.

The invention provides a microblog friend recommendation method based on an SSD and a time sequence model, which mainly comprises the following steps as shown in FIG. 2: collecting data, constructing a data table, selecting a training data set, training a model, analyzing and processing the data, and calculating the data.

1) Collecting data and constructing a data table per _ info, namely acquiring picture data and corresponding time data of a user from a microblog platform, constructing the data table per _ info, wherein one picture corresponds to one item of the data table per _ info, each item comprises data elements such as name, date, picture, category and the like, the name represents a user name, the date represents time, the picture represents a user picture, and the category represents the category to which the category belongs;

2) selecting a public data set as a training data set, downloading a PASCAL VOC (VOC 2007) data set which is public in the specific embodiment of the invention and can be used for image classification, target detection and the like, wherein the PASCAL VOC data set comprises 9963 pictures with 20 categories, and each picture comprises position information of an object target frame in the picture, category information of the target frame and the like;

3) model training, which aims to make the result obtained by the input user picture through the SSD object detection algorithm model more accurate, four main components of the SSD model are shown in fig. 3, and the execution steps are shown in fig. 4, and include the following steps:

3.1) preprocessing a training data set, and processing pictures of the training data set into images of 300 × 300 pixels;

3.2) building an SSD network architecture, in the specific embodiment of the invention, the SSD network architecture is based on VGG16 and comprises six convolutional layers of Conv4_3, Conv7, Conv8_2, Conv9_2, Conv10_2 and Conv11_2, wherein Conv4_3 is a convolutional layer of VGG16, and Conv7 is a convolutional layer converted from a full connection layer of VGG 16;

3.3) setting and generating prior frames, wherein Conv4_3, Conv7, Conv8_2, Conv9_2, Conv10_2 and Conv11_2 are used as feature maps for detection, the sizes of the six feature maps are 38 × 38, 19 × 19,10 × 10,5 × 5,3 × 3 and 1 × 1, and the number of the prior frames generated by different feature maps is different; as the feature size decreases, the scale of the prior box should increase linearly;

the prior frame is obtained by setting the prior frame scale and the aspect ratio and calculating. First, the prior box scale linear formula is as follows:

wherein S is_kRepresents the corresponding scale size of the k prior frame, S_minRepresents the smallest prior frame size, and takes the value of 0.2, S_maxRepresents the maximum prior frame size, takes the value of 0.9, and m is specialFigure number of the figure;

second, the aspect ratio of the prior frame is generally chosenWherein alpha is_rWhen 1', itFor the feature maps Conv4_3, Conv10_2, Conv11_2, only the aspect ratio of 4 prior boxes is used, 3 andall other feature maps are in aspect ratio;

from this, the width (w) of each prior frame can be calculated_k) And high (h)_k) The calculation formula is as follows:

let W be the width and height of the input image, respectively_inputAnd H_inputThen the aspect ratio of the feature map is α_rThe prior frame width and height calculation formula is as follows:

a total of 38 × 38 × 4+19 × 19 × 6+10 × 10 × 6+5 × 5 × 6+3 × 3 × 6+1 × 1 × 4 would be generated as 8732 prior frames;

3.4) calculating an IOU value, wherein the IOU (intersection over Union) is an intersection ratio, namely calculated by intersection and union of a prior box (prior box) of the SSD and a real target box (ground route, namely the position of a target in a picture), and a calculation formula is as follows:

3.5) matching prior frames, wherein matching is mainly carried out through an IOU value; the higher the IOU value is, the higher the correlation degree between the real target and the prediction is;

the prior box matching follows the following two principles: firstly, regarding each real target frame in the graph, using a prior frame with the largest IOU value as an object which is successfully matched, and secondly, setting a threshold (generally 0.5) for the remaining prior frames which are not successfully matched, comparing the IOU value with the threshold, and if the IOU value is larger than the threshold, setting the prior frames as objects which are successfully matched;

for the prior frames successfully matched, the prior frames are positive samples, and for the prior frames not successfully matched, the prior frames are negative samples;

3.6) calculating SSD detection values (predicted position and classification information);

the classification information is mainly obtained through the step 4.5), if a certain prior frame is successfully matched with the real target frame, the real target frame is considered to be consistent with the prior frame type information, and the classification confidence is obtained;

the predicted position is a predicted position of the bounding box, and is defined as l ═ l^cx,l^cy,l^w,l^h) The code is obtained by calculation through a coding formula, and the calculation formula is as follows:

l^cx＝(b^cx-d^cx)/d^w，l^cy＝(b^cy-d^cy)/d^h

l^w＝log(b^w/d^w)，l^h＝log(b^h/d^h)

3.7) calculating a confidence error (confidence loss) and a location error (localization loss);

for position error L_loc(x, L, g) using the loss function method of Smooth L1 loss (i.e., the loss calculation method of the regression box of RPN in fastercCNN), the calculation formula is as follows:

wherein smooth used in the position error formula_L1The calculation formula is as follows:

wherein the loss error of the real target used in the position error formulaThe calculation formula is as follows:

wherein the content of the first and second substances,respectively representing the horizontal coordinate, the vertical coordinate, the width and the height of the center position of the jth real target frame;respectively representing the loss errors of the abscissa, the ordinate, the width and the height of the center position of the jth real target;respectively representing the horizontal coordinate, the vertical coordinate, the width and the height of the center position of the ith prior frame;

for confidence error L_conf(x, c) calculated using softmax loss (i.e., the loss function of softmax), the formula is defined as follows:

wherein the content of the first and second substances,when the value is 1, the value represents that the ith prior frame is matched with the jth group channel, and the category is p; c is a category confidence degree predicted value; l is the predicted value of the corresponding bounding box position, and g is the position parameter of the groudtruth;

3.8) calculating the loss function by means of the confidence error L_conf(x, c) and position error L_loc(x, L, g) and then calculating a loss function L (x, c, L, g), wherein the loss function is calculated according to the following formula:

wherein N is the number of positive samples of the prior frame; α is a weight (generally 1);

4) taking picture data in the data table per _ info as input of the trained SSD model to be used for calculating a user interest tag matrix, and the specific steps are as follows;

4.4) integrating the user interest vectors of n time periods to construct and obtain a user interest label matrix

5) And (3) further processing the user interests by combining the thought of a time sequence model, calculating a user interest label matrix according to a time sequence attenuation function, and calculating a potential interest distribution matrix, namely a time sequence calculation formula of each interest is as follows:

The interest similarity calculation formula of the nth time segment in the matrix is as follows:

wherein D is_JSThe formula is defined as follows:

wherein D is_KLThe formula is defined as follows:

wherein D is_KLRepresents the KL Divergence (Kullback-Leibler Divergence), i.e.the relative entropy, D_JSRepresenting JS divergence (Jensen-Shannon), the KL divergence formula has the divergence asymmetry problem, so the JS formula is used for solving the problem; sim (p, q)_interestThe method is used for calculating the interest similarity of the user vector p and the user vector q in each time period;

Claims

1. A microblog friend recommendation method based on an SSD and a time sequence model is characterized by comprising the following steps:

2) selecting a public data set as a training data set;

3) training a model;

taking a time interval of T days as a time period, wherein n represents the total time period, and T represents the T-th time period;

wherein D is_JSRepresenting JS divergence, D_KLRepresents the KL divergence;

2. The microblog friend recommending method based on the SSD and the time sequence model according to claim 1, wherein in the step 3), the model training comprises the following steps:

3.2) building an SSD network architecture;

l^cx＝(b^cx-d^cx)/d^w，l^cy＝(b^cy-d^cy)/d^h

l^w＝log(b^w/d^w)，l^h＝log(b^h/d^h)

3. The microblog friend recommending method based on the SSD and the time sequence model according to claim 1, wherein the user interest tag matrix constructing method in the step 4) is as follows:

4.2) data processing, dividing user data into n time periods by taking T days as time intervals according to the date attribute, and constructing the data with the same name attribute into a group of data sets in each time period

{name,date,picture,category}；

4.3) constructing a certain time period t of the user_iH, { h ═ h_y1,h_y2,…,h_ymUser interest vector, where h_yiRepresenting the proportion of the occurrence frequency of the objects in the ith type of interest tags in the interest tags of the user to the total occurrence frequency of the objects in the t-th time period;

4.4) integrating the user interest vectors of n time periods, constructing and obtaining a user interest label matrix, and recording the user interest label matrix as

4. The microblog friend recommending method based on the SSD and the time sequence model according to claim 2, wherein in the step 3.3), the prior frame scale linear formula is as follows:

wherein S is_kRepresents the corresponding scale size of the k prior frame, S_minRepresents the smallest prior frame size, and takes the value of 0.2, S_maxRepresenting the maximum prior frame scale, taking the value of 0.9, and taking m as the number of the characteristic graphs;

the aspect ratio of the prior box satisfies

5. The microblog friend recommendation method based on the SSD and the timing model according to claim 2, wherein

In the step 3.7), the position error L_loc(x, l, g) is calculated as follows:

wherein, smooth_L1Loss functionThe calculation formula is as follows:

for the loss error of the real target, the calculation formula is:

in the formula (I), the compound is shown in the specification,respectively representing the horizontal coordinate, the vertical coordinate, the width and the height of the center position of the jth real target frame;respectively representing the loss errors of the abscissa, the ordinate, the width and the height of the center position of the jth real target;respectively represents the abscissa and ordinate of the central position of the ith prior frame, the width and the height of the ith prior frame.

6. The microblog friend recommendation method based on the SSD and the timing model according to claim 2, wherein

In the step 3.7), the confidence error L_conf(x, c), the calculation formula is as follows:

7. The microblog friend recommendation method based on the SSD and the time sequence model according to claim 1, wherein the D in the step 6) is_JSThe calculation formula is as follows:

said D_KLThe calculation formula is as follows: