CN111461248A - Photographic composition line matching method, device, equipment and storage medium - Google Patents

Photographic composition line matching method, device, equipment and storage medium Download PDF

Info

Publication number
CN111461248A
CN111461248A CN202010274362.5A CN202010274362A CN111461248A CN 111461248 A CN111461248 A CN 111461248A CN 202010274362 A CN202010274362 A CN 202010274362A CN 111461248 A CN111461248 A CN 111461248A
Authority
CN
China
Prior art keywords
target image
sample set
composition
training
unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010274362.5A
Other languages
Chinese (zh)
Inventor
姜男
熊鑫
刘浏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shanghai Chengshi Information Technology Co ltd
Original Assignee
Shanghai Chengshi Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shanghai Chengshi Information Technology Co ltd filed Critical Shanghai Chengshi Information Technology Co ltd
Priority to CN202010274362.5A priority Critical patent/CN111461248A/en
Publication of CN111461248A publication Critical patent/CN111461248A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/55Clustering; Classification
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Databases & Information Systems (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a method, a device, equipment and a storage medium for matching a photographic composition line, and mainly relates to the technical field of computer information. The method comprises the steps of importing a classification sample set, wherein the classification sample set is a photo collection set which is classified by taking a manually calibrated construction line as a label; training the classified sample set through a convolutional neural network DenseNet, and constructing a composition model; acquiring a target image, wherein the target image is a shot picture; and identifying and classifying the target image by using the composition model to obtain a composition line of a corresponding category. The invention has the beneficial effects that: the intelligent shooting guidance system has the advantages that the network is improved and then is divided into multiple structural line types, and intelligent shooting guidance is performed.

Description

Photographic composition line matching method, device, equipment and storage medium
Technical Field
The invention relates to the technical field of computer information, in particular to a method, a device, equipment and a storage medium for matching a photographic composition line.
Background
Image recognition and object detection are important research problems in the field of computer vision, and have wide application prospects in various aspects such as face recognition, safety monitoring, dynamic tracking and the like. Image recognition refers to a technique of processing, analyzing, and understanding an image with a computer to recognize various different patterns of objects and objects. The object detection means that for any frame or continuous frame of images, a specific object is detected and identified, and the position and size information of the object is returned, for example, a bounding box surrounding the object is output.
With the continuous development and progress of smart phones in recent years, the functions of cameras carried by the smart phones are gradually increased. However, most of mobile phones are almost zero for guidance before photographing, and mainly after photographing is finished, great attention is paid to processing photos. There is no guidance as to how to make the user eject a more aesthetic photograph.
Current difficulty of guidance prior to taking photographs: 1) mainly has strong specialization, and people without related professional knowledge have difficulty in having some abstract understanding on the photos. 2) A huge correlated data set is required for training. 3) Before the neural network is not rolled, image processing is always processed by opencv and the like, and the image processing is difficult to realize due to large calculation amount. The guidance before photographing needs to be supported by a relatively professional knowledge theory. No such convolutional neural network can currently implement it.
Disclosure of Invention
The invention aims to provide a method, a device, equipment and a storage medium for matching photographic composition lines, which have the function of intelligently guiding shooting by improving a network and then dividing the network into a plurality of structural line types.
In order to achieve the purpose, the invention is realized by the following technical scheme:
a photographic composition line matching method, comprising:
importing a classification sample set, wherein the classification sample set is a photo collection which is classified by taking a manually calibrated construction line as a label;
training the classified sample set through a convolutional neural network DenseNet, and constructing a composition model;
acquiring a target image, wherein the target image is a shot picture;
and identifying and classifying the target image by using the composition model to obtain a composition line of a corresponding category.
The training of the classification sample set by the convolutional neural network DenseNet includes training with a high performance processing unit using the improved vgg under the PyTorch framework. After the model training is finished, the model is laid out on an Ali server by using a model service technology to perform running calculation.
As another aspect of the present invention, there is provided a photographic composition line matching apparatus including:
the system comprises an importing unit, a classifying unit and a processing unit, wherein the importing unit is configured to import a classified sample set, and the classified sample set is a photo collection set which is classified by taking a manually calibrated construction line as a label;
the training unit is configured and used for training the classification sample set through a convolutional neural network DenseNet and constructing a composition model;
an acquisition unit configured to acquire a target image, the target image being a photographed photograph;
and the identification unit is configured for identifying and classifying the target image by using the composition model to obtain the composition line of the corresponding category.
The acquisition unit and the recognition unit are configured in an APP of the voice communication device.
The acquiring of the target image includes:
the photo is shot through the voice communication device where the APP is located.
As another aspect of the present invention, there is provided an apparatus comprising:
one or more processors;
a memory for storing one or more programs,
the one or more programs, when executed by the one or more processors, cause the one or more processors to perform a photographic composition line matching method as described above.
As another aspect of the present invention, there is provided a computer-readable storage medium storing a computer program which, when executed by a processor, implements a photographic composition line matching method as described above.
Compared with the prior art, the invention has the beneficial effects that:
1. the invention applies a deep learning frame to identify and classify the structure lines for the first time, and the structure lines are classified into various structure line types after the network is improved, so that the accuracy is as high as 80 percent
2. The invention can be used in the photography industry to intelligently guide the photography.
Drawings
FIG. 1 is an exemplary diagram of step one in example 1 of the present invention.
FIG. 2 is a schematic diagram of step two of example 1 of the present invention.
Detailed Description
The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and these equivalents also fall within the scope of the present application.
Before the neural network is not rolled, image processing is always processed by opencv and the like, and the image processing is difficult to realize due to large calculation amount. The guidance before photographing needs to be supported by a relatively professional knowledge theory.
Example 1:
according to the reasons, picture recognition is slightly modified, a method for recognizing picture structure lines is adopted, pictures shot by a user are calculated, and then a closer composition classification is recommended.
The details of the three steps are as follows:
the method comprises the following steps: it is necessary for a professional to mark a large number of photos and divide the structure lines into a plurality of labels, and each type of structure line comprises 1 ten thousand photos with labels. The type is shown in figure 1:
step two: and modifying the network, and training by using the processed data set, as shown in the second figure.
After processing, the image data can directly enter a convolutional neural network for training. In terms of neural network selection, we used the DenseNet at the leading edge of the current comparison. DenseNet is a convolutional neural network with dense connections. In the network, any two layers have direct connection, that is, the input of each layer of the network is the union of the outputs of all the previous layers, and the feature map learned by the layer is directly transmitted to all the next layers as input. The network has fewer parameters than a conventional convolutional network because it does not need to relearn the redundant feature map. And improves the transfer of information and gradients in the network, which makes the network easier to train. The entire data was first partitioned into 70% data as the training set and 30% as the test data set. Training in a PyTorch framework with a high performance processing unit (GPU) using improved vgg. After the model training is finished, the model is distributed on an Ali server by using a model service technology to perform the running water operation, and the calculation time of each user hand drawing is about 0.7 second. And then, according to the result, matching the corresponding composition to recommend.
Step three: implementing interactive functionality on apps
After the trained model can identify the structure line of the user photo, a functional module is developed to identify the picture structure line and make a recommendation. Firstly, a user shoots on a mobile phone and clicks pictures to upload. Secondly, the uploaded pictures are processed in a plurality of steps and then input into an identification model, structure lines are drawn, and a structure line with the highest similarity is returned for recommendation.

Claims (7)

1. A photographic composition line matching method, comprising:
importing a classification sample set, wherein the classification sample set is a photo collection which is classified by taking a manually calibrated construction line as a label;
training the classified sample set through a convolutional neural network DenseNet, and constructing a composition model;
acquiring a target image, wherein the target image is a shot picture;
and identifying and classifying the target image by using the composition model to obtain a composition line of a corresponding category.
2. The method of claim 1, wherein said training the classified sample set through a convolutional neural network DenseNet comprises training with a high performance processing unit using improved vgg in a PyTorch framework. After the model training is finished, the model is laid out on an Ali server by using a model service technology to perform running calculation.
3. A photographic composition line matching apparatus, comprising:
the system comprises an importing unit, a classifying unit and a processing unit, wherein the importing unit is configured to import a classified sample set, and the classified sample set is a photo collection set which is classified by taking a manually calibrated construction line as a label;
the training unit is configured and used for training the classification sample set through a convolutional neural network DenseNet and constructing a composition model;
an acquisition unit configured to acquire a target image, the target image being a photographed photograph;
and the identification unit is configured for identifying and classifying the target image by using the composition model to obtain the composition line of the corresponding category.
4. The apparatus as claimed in claim 3, wherein the acquiring unit and the recognizing unit are disposed in APP of the voice communication apparatus.
5. The apparatus of claim 4, wherein said acquiring the target image comprises:
the photo is shot through the voice communication device where the APP is located.
6. An apparatus, characterized in that the apparatus comprises:
one or more processors;
a memory for storing one or more programs,
the one or more programs, when executed by the one or more processors, cause the one or more processors to perform the method recited in claim 1 or 2.
7. A computer-readable storage medium, in which a computer program is stored which, when being executed by a processor, carries out the method of claim 1 or 2.
CN202010274362.5A 2020-04-09 2020-04-09 Photographic composition line matching method, device, equipment and storage medium Pending CN111461248A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010274362.5A CN111461248A (en) 2020-04-09 2020-04-09 Photographic composition line matching method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010274362.5A CN111461248A (en) 2020-04-09 2020-04-09 Photographic composition line matching method, device, equipment and storage medium

Publications (1)

Publication Number Publication Date
CN111461248A true CN111461248A (en) 2020-07-28

Family

ID=71680606

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010274362.5A Pending CN111461248A (en) 2020-04-09 2020-04-09 Photographic composition line matching method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111461248A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112528979A (en) * 2021-02-10 2021-03-19 成都信息工程大学 Transformer substation inspection robot obstacle distinguishing method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109040605A (en) * 2018-11-05 2018-12-18 北京达佳互联信息技术有限公司 Shoot bootstrap technique, device and mobile terminal and storage medium
CN109344715A (en) * 2018-08-31 2019-02-15 北京达佳互联信息技术有限公司 Intelligent composition control method, device, electronic equipment and storage medium
CN110889428A (en) * 2019-10-21 2020-03-17 浙江大搜车软件技术有限公司 Image recognition method and device, computer equipment and storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109344715A (en) * 2018-08-31 2019-02-15 北京达佳互联信息技术有限公司 Intelligent composition control method, device, electronic equipment and storage medium
CN109040605A (en) * 2018-11-05 2018-12-18 北京达佳互联信息技术有限公司 Shoot bootstrap technique, device and mobile terminal and storage medium
CN110889428A (en) * 2019-10-21 2020-03-17 浙江大搜车软件技术有限公司 Image recognition method and device, computer equipment and storage medium

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112528979A (en) * 2021-02-10 2021-03-19 成都信息工程大学 Transformer substation inspection robot obstacle distinguishing method and system

Similar Documents

Publication Publication Date Title
CN111444908B (en) Image recognition method, device, terminal and storage medium
CN111062871B (en) Image processing method and device, computer equipment and readable storage medium
CN110659646A (en) Automatic multitask certificate image processing method, device, equipment and readable storage medium
CN110866471A (en) Face image quality evaluation method and device, computer readable medium and communication terminal
US20190205618A1 (en) Method and apparatus for generating facial feature
CN111445459B (en) Image defect detection method and system based on depth twin network
CN112287820A (en) Face detection neural network, face detection neural network training method, face detection method and storage medium
CN111340123A (en) Image score label prediction method based on deep convolutional neural network
CN103617432A (en) Method and device for recognizing scenes
CN107766864B (en) Method and device for extracting features and method and device for object recognition
CN112417947B (en) Method and device for optimizing key point detection model and detecting face key points
WO2022127814A1 (en) Method and apparatus for detecting salient object in image, and device and storage medium
CN113298018A (en) False face video detection method and device based on optical flow field and facial muscle movement
CN110363111B (en) Face living body detection method, device and storage medium based on lens distortion principle
CN113255501B (en) Method, apparatus, medium and program product for generating form recognition model
CN115410240A (en) Intelligent face pockmark and color spot analysis method and device and storage medium
CN111461248A (en) Photographic composition line matching method, device, equipment and storage medium
KR101334858B1 (en) Automatic butterfly species identification system and method, and portable terminal having automatic butterfly species identification function using the same
US10991085B2 (en) Classifying panoramic images
CN104166840A (en) Focusing realization method based on video conference system
CN111464743A (en) Photographic composition matching method and system
CN115410174B (en) Two-stage vehicle insurance anti-fraud image acquisition quality inspection method, device and system
CN114863224B (en) Training method, image quality detection method, device and medium
CN113065559B (en) Image comparison method and device, electronic equipment and storage medium
CN115909408A (en) Pedestrian re-identification method and device based on Transformer network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200728