CN111461248A

CN111461248A - Photographic composition line matching method, device, equipment and storage medium

Info

Publication number: CN111461248A
Application number: CN202010274362.5A
Authority: CN
Inventors: 姜男; 熊鑫; 刘浏
Original assignee: Shanghai Chengshi Information Technology Co ltd
Current assignee: Shanghai Chengshi Information Technology Co ltd
Priority date: 2020-04-09
Filing date: 2020-04-09
Publication date: 2020-07-28

Abstract

The invention discloses a method, a device, equipment and a storage medium for matching a photographic composition line, and mainly relates to the technical field of computer information. The method comprises the steps of importing a classification sample set, wherein the classification sample set is a photo collection set which is classified by taking a manually calibrated construction line as a label; training the classified sample set through a convolutional neural network DenseNet, and constructing a composition model; acquiring a target image, wherein the target image is a shot picture; and identifying and classifying the target image by using the composition model to obtain a composition line of a corresponding category. The invention has the beneficial effects that: the intelligent shooting guidance system has the advantages that the network is improved and then is divided into multiple structural line types, and intelligent shooting guidance is performed.

Description

Photographic composition line matching method, device, equipment and storage medium

Technical Field

The invention relates to the technical field of computer information, in particular to a method, a device, equipment and a storage medium for matching a photographic composition line.

Background

Image recognition and object detection are important research problems in the field of computer vision, and have wide application prospects in various aspects such as face recognition, safety monitoring, dynamic tracking and the like. Image recognition refers to a technique of processing, analyzing, and understanding an image with a computer to recognize various different patterns of objects and objects. The object detection means that for any frame or continuous frame of images, a specific object is detected and identified, and the position and size information of the object is returned, for example, a bounding box surrounding the object is output.

With the continuous development and progress of smart phones in recent years, the functions of cameras carried by the smart phones are gradually increased. However, most of mobile phones are almost zero for guidance before photographing, and mainly after photographing is finished, great attention is paid to processing photos. There is no guidance as to how to make the user eject a more aesthetic photograph.

Current difficulty of guidance prior to taking photographs: 1) mainly has strong specialization, and people without related professional knowledge have difficulty in having some abstract understanding on the photos. 2) A huge correlated data set is required for training. 3) Before the neural network is not rolled, image processing is always processed by opencv and the like, and the image processing is difficult to realize due to large calculation amount. The guidance before photographing needs to be supported by a relatively professional knowledge theory. No such convolutional neural network can currently implement it.

Disclosure of Invention

The invention aims to provide a method, a device, equipment and a storage medium for matching photographic composition lines, which have the function of intelligently guiding shooting by improving a network and then dividing the network into a plurality of structural line types.

In order to achieve the purpose, the invention is realized by the following technical scheme:

a photographic composition line matching method, comprising:

importing a classification sample set, wherein the classification sample set is a photo collection which is classified by taking a manually calibrated construction line as a label;

training the classified sample set through a convolutional neural network DenseNet, and constructing a composition model;

acquiring a target image, wherein the target image is a shot picture;

and identifying and classifying the target image by using the composition model to obtain a composition line of a corresponding category.

The training of the classification sample set by the convolutional neural network DenseNet includes training with a high performance processing unit using the improved vgg under the PyTorch framework. After the model training is finished, the model is laid out on an Ali server by using a model service technology to perform running calculation.

As another aspect of the present invention, there is provided a photographic composition line matching apparatus including:

the system comprises an importing unit, a classifying unit and a processing unit, wherein the importing unit is configured to import a classified sample set, and the classified sample set is a photo collection set which is classified by taking a manually calibrated construction line as a label;

the training unit is configured and used for training the classification sample set through a convolutional neural network DenseNet and constructing a composition model;

an acquisition unit configured to acquire a target image, the target image being a photographed photograph;

and the identification unit is configured for identifying and classifying the target image by using the composition model to obtain the composition line of the corresponding category.

The acquisition unit and the recognition unit are configured in an APP of the voice communication device.

The acquiring of the target image includes:

the photo is shot through the voice communication device where the APP is located.

As another aspect of the present invention, there is provided an apparatus comprising:

one or more processors;

a memory for storing one or more programs,

the one or more programs, when executed by the one or more processors, cause the one or more processors to perform a photographic composition line matching method as described above.

As another aspect of the present invention, there is provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements a photographic composition line matching method as described above.

Compared with the prior art, the invention has the beneficial effects that:

1. the invention applies a deep learning frame to identify and classify the structure lines for the first time, and the structure lines are classified into various structure line types after the network is improved, so that the accuracy is as high as 80 percent

2. The invention can be used in the photography industry to intelligently guide the photography.

Drawings

FIG. 1 is an exemplary diagram of step one in example 1 of the present invention.

FIG. 2 is a schematic diagram of step two of example 1 of the present invention.

Detailed Description

The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and these equivalents also fall within the scope of the present application.

Before the neural network is not rolled, image processing is always processed by opencv and the like, and the image processing is difficult to realize due to large calculation amount. The guidance before photographing needs to be supported by a relatively professional knowledge theory.

Example 1:

according to the reasons, picture recognition is slightly modified, a method for recognizing picture structure lines is adopted, pictures shot by a user are calculated, and then a closer composition classification is recommended.

The details of the three steps are as follows:

the method comprises the following steps: it is necessary for a professional to mark a large number of photos and divide the structure lines into a plurality of labels, and each type of structure line comprises 1 ten thousand photos with labels. The type is shown in figure 1:

step two: and modifying the network, and training by using the processed data set, as shown in the second figure.

After processing, the image data can directly enter a convolutional neural network for training. In terms of neural network selection, we used the DenseNet at the leading edge of the current comparison. DenseNet is a convolutional neural network with dense connections. In the network, any two layers have direct connection, that is, the input of each layer of the network is the union of the outputs of all the previous layers, and the feature map learned by the layer is directly transmitted to all the next layers as input. The network has fewer parameters than a conventional convolutional network because it does not need to relearn the redundant feature map. And improves the transfer of information and gradients in the network, which makes the network easier to train. The entire data was first partitioned into 70% data as the training set and 30% as the test data set. Training in a PyTorch framework with a high performance processing unit (GPU) using improved vgg. After the model training is finished, the model is distributed on an Ali server by using a model service technology to perform the running water operation, and the calculation time of each user hand drawing is about 0.7 second. And then, according to the result, matching the corresponding composition to recommend.

Step three: implementing interactive functionality on apps

After the trained model can identify the structure line of the user photo, a functional module is developed to identify the picture structure line and make a recommendation. Firstly, a user shoots on a mobile phone and clicks pictures to upload. Secondly, the uploaded pictures are processed in a plurality of steps and then input into an identification model, structure lines are drawn, and a structure line with the highest similarity is returned for recommendation.

Claims

1. A photographic composition line matching method, comprising:

acquiring a target image, wherein the target image is a shot picture;

2. The method of claim 1, wherein said training the classified sample set through a convolutional neural network DenseNet comprises training with a high performance processing unit using improved vgg in a PyTorch framework. After the model training is finished, the model is laid out on an Ali server by using a model service technology to perform running calculation.

3. A photographic composition line matching apparatus, comprising:

4. The apparatus as claimed in claim 3, wherein the acquiring unit and the recognizing unit are disposed in APP of the voice communication apparatus.

5. The apparatus of claim 4, wherein said acquiring the target image comprises:

6. An apparatus, characterized in that the apparatus comprises:

one or more processors;

a memory for storing one or more programs,

the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the method recited in claim 1 or 2.

7. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the method of claim 1 or 2.