CN107066578B

CN107066578B - 3D picture intelligent recommendation method based on deep learning and transfer learning

Info

Publication number: CN107066578B
Application number: CN201710239554.0A
Authority: CN
Inventors: 王华珍; 潘傲寒
Original assignee: Xiamen Huiku Culture Media Co ltd; Huaqiao University
Current assignee: Xiamen Huiku Culture Media Co ltd; Huaqiao University
Priority date: 2017-04-13
Filing date: 2017-04-13
Publication date: 2020-06-26
Anticipated expiration: 2037-04-13
Also published as: CN107066578A

Abstract

The invention discloses a 3D picture intelligent recommendation method based on deep learning and transfer learning, which comprises the steps of firstly, obtaining a universal scene image classifier based on deep learning by utilizing a large open sample scene image data set; then, migration learning is carried out through a 3D scene picture data set collected by a user, and a universal image classifier is converted into a 3D scene space classifier; then, an information fingerprint library of the 3D picture design scheme recommendation gallery is constructed through a Hash perception algorithm; and finally, matching and screening the pictures of the scene shot by the user and the 3D picture design scheme library to obtain a matching candidate subset, calculating the information fingerprint Hamming distance between each picture in the subset and the picture of the user, and intelligently recommending the 3D picture with the minimum distance to the user. The method is based on deep learning and transfer learning, realizes the design of the 3D picture in a specific environment and a specific space structure, and shortens the design period of the 3D picture.

Description

3D picture intelligent recommendation method based on deep learning and transfer learning

Technical Field

The invention relates to the field of machine learning and image processing, in particular to a 3D picture intelligent recommendation method based on deep learning and transfer learning.

Background

In recent years, naked-eye 3D pictures are more and more concerned and sought after with special artistic expression, super-strong visual impact and extremely interesting interactivity, cover multiple fields such as decoration, advertisement, exhibition, home furnishing and the like, and have wide development prospects. The 3D picture is a special artistic form using the principles of anti-perspective and optical illusion, and needs to be created skillfully using the fusion of environment and spatial structure. Therefore, 3D picture design according to specific environment and space structure is time-consuming and labor-consuming, and certain requirements are also made on experience and level of painters. In the traditional mode, a painter needs to skillfully use various perspective relations in the painting and have very strong sense of space to design a good 3D painting. Some new painters cannot independently design the 3D painting due to inexperience, and the new painters become a vacancy in the 3D painting industry.

Disclosure of Invention

The invention provides a 3D picture intelligent recommendation method based on deep learning and transfer learning, which overcomes the defects of the 3D picture intelligent recommendation method based on deep learning and transfer learning in the prior art.

The technical scheme adopted by the invention for solving the technical problems is as follows: A3D picture intelligent recommendation method based on deep learning and transfer learning comprises the following steps:

s1: constructing an image classifier RCLF based on the public image dataset; the public image data set BS is an MIT computational science Places205 public data set, and an image classifier RCLF for identifying each scene in the Places205 public data set is obtained after a convolution model increment-ResNet is selected to train on the Places205 public data set;

s2: 3D scene space migration learning based on the image classifier RCLF obtained in step S1;

s21, keeping the parameters of other layers of the image classifier RCLF except the softmax layer unchanged, increasing the parameter learning rate of the softmax layer by 2 times, and decreasing the learning rate of other parameters of the full-connection layer by half;

s22, training a full connection layer by using a 3D scene data set SS collected by a user to capture 3D scene space information, and finally obtaining a 4-classification 3D scene space classifier DCLF;

s3: constructing an information fingerprint of a 3D picture design scheme recommendation gallery;

s31, collecting a large number of images of the 3D picture design scheme as a result data set to be recommended;

s32, constructing an information fingerprint FPS for each collected 3D picture image;

s33, solving the Hash perception fingerprint of each image in the step S32 by using a Hash perception algorithm;

s4: outputting a 3D picture design scheme based on a real scene;

s41, the user shoots a picture Ps of the real scene as a matching service request;

s42, transmitting the picture in the step S41 to a classifier DCLF for identification to obtain an intelligent identification result scene of the user scene;

s43, calculating the corresponding information fingerprint fp of the user scene picture Ps by adopting the method of the step S3;

s44, searching a sample subset CSS belonging to scene with scene category being scene in a 3D picture design scheme recommendation gallery;

s45, calculating a Hamming distance between the information fingerprint cp of each picture in the sample subset CSS and the information fingerprint fp of the user scene picture Ps, wherein the 3D drawing design scheme picture with the minimum Hamming distance in the sample subset CSS is the recommendation result Ds.

Further, the Places205 dataset is over two hundred and fifty thousand pictures of scenes collected by MIT computer science and artificial intelligence laboratories, for a total of 205 scene categories.

Further, the 3D painting scene data set SS includes 3D paintings of different colors, wall surfaces, ground surfaces, wall and ground surfaces, and recessed corner surfaces.

Further, the information fingerprint FPS construction process is to set the pixel value of each channel in the picture central area a to zero.

Compared with the prior art, the invention has the following beneficial effects: the method realizes intelligent design of the 3D picture based on deep learning and transfer learning, and avoids the problems of long design period and difficult design caused by the problems of personal inspiration of painters, drawing experience and the like in any environment space required by users; and the method saves training time on one hand, further shortens the period of 3D picture design, and greatly expands the application field of deep learning on the other hand.

The invention is further explained in detail with the accompanying drawings and the embodiments; however, the 3D picture intelligent recommendation method based on deep learning and transfer learning of the present invention is not limited to the embodiment.

Drawings

FIG. 1 is a block diagram of a transfer learning process of the present invention;

FIG. 2 is a block diagram of the recommendation process of the present invention.

Detailed Description

In an embodiment, please refer to fig. 1 and fig. 2, a 3D picture intelligent recommendation method based on deep learning and transfer learning of the present invention includes the following steps:

s1: constructing an image classifier based on a public image data set, and naming the image classifier as RCLF; the public image data set is named as BS, the MIT computing science Places205 public data set, and after a convolution model increment-ResNet is selected to be trained on the Places205 public data set, an image classifier RCLF for identifying each scene in the Places205 public data set is obtained; the recognizer RCLF can classify 205 scenes in the Places205 public data set according to factors such as color, structure, environment and the like;

s21, keeping the parameters of other layers of the image classifier RCLF except the softmax layer unchanged, increasing the parameter learning rate of the softmax layer by 2 times, and decreasing the learning rate of other parameters of a full connection layer by half, wherein the full connection layer is a connection medium connecting the last layer, namely the softmax layer, with the penultimate layer;

s22, training a full connection layer by using a 3D scene data set SS collected by a user to capture 3D scene space information, and finally obtaining a 4-classification 3D scene space classifier named as DCLF; the spatial classifier DCLF further classifies the 205 scenes classified by the image classifier RCLF in step S1 into 4 classes;

s32, constructing an information fingerprint library for each collected 3D picture image, and naming the fingerprint library as an FPS;

s4: outputting a 3D picture design scheme based on a real scene;

s41, a user shoots a picture of a real scene, and the picture is marked as Ps and serves as a matching service request;

s42, transmitting the picture Ps in the step S41 to a space classifier DCLF for identification to obtain an intelligent identification result of the user scene, wherein the result is expressed by scene, and the scene comprises the characteristics of the user scene such as color, composition and the like;

s43, calculating the corresponding information fingerprint of the user scene picture Ps by adopting the method of the step S3, and recording as fp;

s44, searching pictures which belong to scene categories and are closest to feature categories in scene in a 3D picture design scheme recommendation gallery to serve as a sample subset, and recording the sample subset as CSS;

s45, recording the information fingerprint of each picture in the sample subset CSS as cp and the information fingerprint fp of the user scene picture Ps to obtain the Hamming distance, and recording the 3D drawing design scheme picture with the minimum Hamming distance in the sample subset CSS as a recommendation result as Ds.

In this embodiment, the Places205 data set is more than two hundred and fifty thousand scene pictures collected by MIT computer science and artificial intelligence laboratories, and there are 205 scene categories in total; the 3D picture scene data set SS comprises 3D pictures of different colors, wall surfaces, ground surfaces, wall surfaces and concave wall corner surfaces; the FPS construction process of the information fingerprint database comprises the step of setting the pixel value of each channel in the central area a of the picture to zero, wherein the central area a of the picture is 50% of the length and width of the original picture.

The above embodiments are only used to further illustrate the intelligent 3D image recommendation method based on deep learning and migration learning, but the present invention is not limited to the embodiments, and any simple modifications, equivalent changes and modifications made to the above embodiments according to the technical spirit of the present invention fall within the scope of the technical solution of the present invention.

Claims

1. A3D picture intelligent recommendation method based on deep learning and transfer learning is characterized by comprising the following steps:

s1: constructing an image classifier RCLF based on the public image dataset; the public image data set is an MIT computational science Places205 public data set, and an image classifier RCLF for identifying each scene in the Places205 public data set is obtained after a convolution model increment-ResNet is selected to train on the Places205 public data set;

s21, keeping the parameters of other layers of the image classifier RCLF except for the softmax layer and the full connection layer unchanged, increasing the parameter learning rate of the softmax layer by 2 times, and decreasing the learning rate of other parameters of the full connection layer by half;

s31, collecting 3D picture design scheme images as a result data set to be recommended;

s4: outputting a 3D picture design scheme based on a real scene;

2. The 3D picture intelligent recommendation method based on deep learning and transfer learning according to claim 1, characterized in that: the Places205 public dataset is over two hundred and fifty thousand pictures of scenes collected by MIT computer science and artificial intelligence laboratories, for a total of 205 scene categories.

3. The 3D picture intelligent recommendation method based on deep learning and transfer learning according to claim 1, characterized in that: the 3D picture scene data set SS comprises 3D pictures of different colors, wall surfaces, ground surfaces, wall surfaces and concave wall corner surfaces.

4. The 3D picture intelligent recommendation method based on deep learning and transfer learning according to claim 1, characterized in that: the information fingerprint FPS construction process is to set the pixel value of each channel in the picture central area a to zero, and the picture central area a is 50% of the length and width of the original picture.