CN112200258A

CN112200258A - Multi-dimensional vehicle logo data enhancement method

Info

Publication number: CN112200258A
Application number: CN202011114773.4A
Authority: CN
Inventors: 柯逍; 杜鹏强; 刘童安
Original assignee: Fuzhou University
Current assignee: Fuzhou University
Priority date: 2020-10-16
Filing date: 2020-10-16
Publication date: 2021-01-08
Anticipated expiration: 2040-10-16
Also published as: CN112200258B

Abstract

The invention relates to a multi-dimensional vehicle logo data enhancement method. First, a small frame is randomly selected to generate a small rectangular frame with four vertices relative to the vehicle logo vertices whose offsets are 1.5 to 3 times the width and height of the vehicle logo area. This rectangular box selects the image containing the car logo area, and intercepts it as a scale-dimension enhanced image. Then, quantitative dimension enhancement is carried out. Through the sliding cross-segmentation method, the image is scanned and intercepted every 1/4 step with a rectangular box whose size is 1/2 of the source image, and a large number of effective quantitative dimension enhanced images are generated. Through the normal distribution segmentation method, a rectangular box whose length is from the center of the car logo area to the minimum value of the left and right sides of the image, and the width is from the center of the car logo area to the minimum value of the upper and lower sides of the image is generated. The probability of , selects a part of the image containing the car logo area and intercepts it as a spatial dimension enhanced image. Finally, the brightness and contrast dimensions of all the above data are enhanced using brightness transformation and contrast transformation. The invention can effectively expand the data amount of the vehicle logo.

Description

Multi-dimensional vehicle logo data enhancement method

Technical Field

The invention relates to the field of data enhancement of pattern recognition and computer vision, in particular to a multi-dimensional car logo data enhancement method.

Background

In the deep learning field, big data is the basis for supporting object feature learning. Training the network requires a large amount of data as support in order to better extract the features of the target. If the data quality in the data set is not good enough, it is usually optimized using methods such as data equalization or data enhancement. The significance of data enhancement is that training data are converted through a certain method to generate new data, so that the network can learn features better. Through data enhancement, the original data set can be enriched and the data set can be expanded. It can prevent overfitting due to small amount of data during training. This has important significance to the recognition and detection capability of the model. Most of the data of the current car logo data base set are images which are separated from scenes and used for classification, and the data set cannot be used for detection training. The data set that can be used for detection training is too small to meet the large amount of data required for training. Therefore, data enhancement of the emblem data set is required to meet the demand.

Disclosure of Invention

In view of this, the present invention provides a method for enhancing data of a multi-dimensional emblem, which can effectively expand the data amount of the emblem.

The invention is realized by adopting the following scheme: a multi-dimensional car logo data enhancement method comprises the following steps:

step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the car logo region relative to the car logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method of selecting the car logo region image by using the small rectangular frame;

step S2: scanning an original image by using a cross sliding window method, wherein the length and the width of a window are 1/2 of the original image, the sliding step length is 1/4 of the original image, and then intercepting a window image containing a complete car logo area in each scanning as a number dimension enhanced image;

step S3: generating a rectangle with the length being the minimum value of the distance from the center of the car logo area to the left side and the right side of the image and the width being the minimum value of the distance from the center of the car logo area to the upper side and the lower side of the image by using a normal distribution segmentation method, selecting a window image containing the complete car logo area according to the probability of normal distribution, and intercepting the window image as a space dimension enhanced image;

step S4: the enhanced images generated in the integration step S1, step S2 and step S3 are further enhanced by using a luminance transformation method and a contrast transformation method, respectively.

Further, the step S1 specifically includes the following steps:

step S11: generating the offsets dx1, dy1, dx2, dy2 from the four sides of the rectangular frame to the four sides of the emblem area

Wherein dx1, dy1, dx2, dy2 are offsets of the left, upper, right and lower sides of the rectangular frame from the left, upper, right and lower sides of the emblem region, rw and rh are the width and height of the emblem region, respectively, random (1.5, 3) is a function that randomly generates uniformly distributed random floating point numbers between 1.5 and 3;

step S12: and calculating the position of the generated new image in the original image by the following method:

wherein sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image; sf _ xx2 and sf _ yy2 are positions of the dimension enhanced image at the lower right corner of the original image, and dx1, dy1, dx2 and dy2 are offsets of the left side, the upper side, the right side and the lower side of the rectangular frame from the left side, the upper side, the right side and the lower side of the logo area respectively;

step S13, calculating the new coordinate of the center of the car logo:

wherein cx is an x coordinate of the center of the new logo, cy is a y coordinate of the center of the new logo, and sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image;

step S14: generating a new car logo image by using the position of the scale dimension enhanced image calculated in the step S12, and writing the calculated car logo information into a file named by adding an sf suffix to the file name of the original image;

further, the step S2 specifically includes the following steps:

step S21: calculating the length and width of the subimages

Wherein nw and nh are respectively the width and height of the enhanced image in the number dimension, and ow and oh are respectively the width and height of the original image;

step S22: and calculating the positions of the generated number dimension enhanced images in the original image, wherein the calculation method comprises the following steps:

wherein i is 0,1, 2, j is 0,1, 2, ss _ xx1 and ss _ yy1 are positions of the upper left corner of the enhanced image in the number dimension in the original image, ss _ xx2 and ss _ yy2 are positions of the enhanced image in the lower right corner of the original image in the number dimension, and nw and nh are the width and the height of the enhanced image in the number dimension respectively;

step S23: judging whether the generated quantity dimension enhanced image contains a complete car logo area, if so, retaining the sub-image, otherwise, discarding;

step S24: a new emblem image is generated using the number dimension enhanced image positions calculated in step S22, and the calculated emblem position information is written into a file named by the original image file name plus the ss suffix.

Further, the step S3 specifically includes the following steps:

step S31: a function randn (0,1) is constructed to generate a number between 0 and 1 that should satisfy the normal distribution:

it is calculated that when μ is 0.5 and σ is 0.1938, the random number generated using the normal distribution can satisfy 0 to 1; the norm function in Matlab is used as a vector, so the randn (0,1) function is specifically constructed by using the norm function in Matlab (0.5, 0.1938);

step S32: calculating the width and height of the spatial dimension enhanced image:

wherein nw and nh are the width and height, respectively, of the spatial dimension enhanced image; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; ow and oh are the width and height of the original image, respectively;

step S33: calculating the position of the central point of the car logo area in the generated space dimension enhanced image, wherein the calculation method comprises the following steps:

wherein nx and ny are respectively an x coordinate and a y coordinate of a target center point of the spatial dimension enhanced image; ow and oh are the width and height of the original image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image; rw and rh are the width and height of the target region, respectively; randn (0,1) is a function that generates a number between 0 and 1 that satisfies a normal distribution;

step S34: and calculating the position of the generated space dimension enhanced image in the original image, wherein the calculation method comprises the following steps:

wherein mr _ xx1 and mr _ yy1 are positions of the upper left corner of the spatial dimension enhanced image in the original image; mr _ xx2 and mr _ yy2 are positions of the enhanced image in the lower right corner of the original image in the spatial dimension; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; nx and ny are the x coordinate and the y coordinate of the target center point of the spatial dimension enhanced image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image;

step S35: a new car logo image is generated using the spatial dimension enhanced image position calculated in step S34, and the calculated car logo position information is written in a file named by adding an mr suffix to the original image file name.

Further, the step S4 specifically includes the following steps:

step S41: integrating all the images generated in step S1, step S2, and step S3, copying the original image so that the ratio of the number of generated images to the number of original images is 3: 1;

step S42: performing brightness transformation processing on all the images generated in the steps S1, S2 and S3, wherein the brightness of each picture is randomly selected from 0.5 to 1.5 for transformation so as to strengthen the distribution of the data set in the brightness dimension;

step S43: performing contrast transformation processing on all the images generated in the steps S1, S2 and S3, wherein the contrast of each image is randomly selected from 0.5 to 1.5 for transformation, so as to strengthen the distribution of the data set in the contrast dimension;

step S44: all images are integrated to form a new data set.

Compared with the prior art, the invention has the following beneficial effects:

(1) the invention can effectively perform data expansion on the car logo image.

(2) The small frame random selection method provided by the invention can effectively increase the proportion of the car logo in the overall image, so that the characteristics of the car logo in the image are more prominent.

(3) The cross sliding segmentation method provided by the invention can effectively increase the number of pictures without changing the length-width ratio of the original image.

(4) The normal distribution segmentation method provided by the invention can maximally keep the information in the image on the premise of greatly increasing the diversity of the positions of the car logos.

Drawings

FIG. 1 is a flow chart of a method according to an embodiment of the present invention.

Detailed Description

The invention is further explained below with reference to the drawings and the embodiments.

It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.

It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.

As shown in fig. 1, the present embodiment provides a method for enhancing multidimensional car logo data, including the following steps:

step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the vehicle logo region relative to the vehicle logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method of selecting a vehicle logo region image by using the small rectangular frame, wherein the generated image can effectively increase the proportion of the vehicle logo region in the whole image from the size dimension, and highlights the characteristic information of the vehicle logo in the image;

In this embodiment, the step S1 specifically includes the following steps:

step S13, calculating the new coordinate of the center of the car logo:

step S14: generating a new car logo image by using the position of the scale dimension enhanced image calculated in the step S12, and writing the calculated car logo information into a file named by adding a suffix of 'sf' to the file name of the original image;

in this embodiment, the step S2 specifically includes the following steps:

step S21: calculating the length and width of the subimages

step S24: a new emblem image is generated using the number-dimension enhanced image positions calculated in step S22, and the calculated emblem position information is written in a file named after the original image file name plus the "ss" suffix.

In this embodiment, the step S3 specifically includes the following steps:

it can be found by calculation that the random number generated using the normal distribution can satisfy 0 to 1 when μ is 0.5 and σ is 0.1938. The invention uses the norm function in Matlab as a carrier, so the randn (0,1) function is constructed by using the norm function in Matlab (0.5, 0.1938).

step S35: a new car logo image is generated using the spatial dimension enhanced image position calculated in step S34, and the calculated car logo position information is written in a file named by adding the suffix "mr" to the original image file name.

In this embodiment, the step S4 specifically includes the following steps:

step S44: all images are integrated to form a new data set.

Preferably, the data enhancement can be effectively carried out on the car logo data set by the embodiment. Firstly, a small-frame random selection method is used, and the method is mainly used for solving the problem that the car logo occupies a small area in the main body image. Compared with the original image, the proportion of the car logo area can be averagely improved by 3 times by using the image generated by the method of the embodiment. And then, enhancing the image by using a cross sliding segmentation method. The aspect ratio of the image generated by the method of the embodiment is consistent with that of the original image, so that the generated image is more consistent with the original data distribution, and meanwhile, the number of effective data can be greatly increased. Normal distribution segmentation was then used to enrich the positional diversity of the data. The distribution of the positions of the car logos of the images generated by the method of the embodiment meets the normal distribution, and the distribution has important significance in probability. And finally, performing data integration enhancement, and copying original data to ensure that the quantity ratio of the generated data to the original data is 3: 1, simultaneously, brightness and contrast change is carried out on all data generated by the method, and the diversity of the data is further expanded.

The above description is only a preferred embodiment of the present invention, and all equivalent changes and modifications made in accordance with the claims of the present invention should be covered by the present invention.

Claims

1. A multidimensional car logo data enhancement method is characterized by comprising the following steps: the method comprises the following steps:

step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the car logo region relative to the car logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method for selecting the car logo region image by using the small rectangular frame;

2. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S1 specifically includes the following steps:

step S13: calculating new car logo center coordinates:

step S14: a new car logo image is generated using the position of the enhanced image in the scale dimension calculated in step S12, and the calculated car logo information is written in a file named by the original image file name plus the sf suffix.

3. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S2 specifically includes the following steps:

step S21: calculating the length and width of the subimages

4. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S3 specifically includes the following steps:

wherein nx and ny are respectively an x coordinate and a y coordinate of a target central point of the spatial dimension enhanced image; ow and oh are the width and height of the original image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image; rw and rh are the width and height of the target region, respectively; randn (0,1) is a function that generates a number between 0 and 1 that satisfies a normal distribution;

5. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S4 specifically includes the following steps:

step S41: integrating all the images generated in the steps S1, S2 and S3, and copying the original image so that the number ratio of the generated images to the original image is 3: 1;

step S44: all images are integrated to form a new data set.