CN112200258A - Multi-dimensional vehicle logo data enhancement method - Google Patents

Multi-dimensional vehicle logo data enhancement method Download PDF

Info

Publication number
CN112200258A
CN112200258A CN202011114773.4A CN202011114773A CN112200258A CN 112200258 A CN112200258 A CN 112200258A CN 202011114773 A CN202011114773 A CN 202011114773A CN 112200258 A CN112200258 A CN 112200258A
Authority
CN
China
Prior art keywords
image
dimension
original image
width
enhanced
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011114773.4A
Other languages
Chinese (zh)
Other versions
CN112200258B (en
Inventor
柯逍
杜鹏强
刘童安
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Fuzhou University
Original Assignee
Fuzhou University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Fuzhou University filed Critical Fuzhou University
Priority to CN202011114773.4A priority Critical patent/CN112200258B/en
Publication of CN112200258A publication Critical patent/CN112200258A/en
Application granted granted Critical
Publication of CN112200258B publication Critical patent/CN112200258B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/20Image preprocessing
    • G06V10/255Detecting or recognising potential candidate objects based on visual cues, e.g. shapes

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Multimedia (AREA)
  • Image Processing (AREA)
  • Traffic Control Systems (AREA)
  • Image Analysis (AREA)

Abstract

本发明涉及一种多维度车标数据增强方法,首先通过小框随机选取法,生成一个四个顶点相对车标顶点偏移量为车标区域宽和高1.5到3倍的小矩形框,用这个矩形框框选包含车标区域的图像,截取作为尺度维度增强图像。然后进行数量维度增强,通过滑动交叉分割法,用一个大小为源图像1/2的矩形框每隔1/4的步长对图像进行扫描并截取,生成大量有效的数量维度增强图像。通过正态分布分割法,生成一个长为车标区域的中心到图像左右两边的最小值,宽为车标区域的中心到图像上下两边的最小值的矩形框,用这个矩形框以正态分布的概率选取包含车标区域的一部分图像并截取作为空间维度增强图像。最后使用亮度变换和对比度变换对以上所有数据进行亮度维度和对比度维度的增强。本发明能够有效地对车标进行数据量扩充。

Figure 202011114773

The invention relates to a multi-dimensional vehicle logo data enhancement method. First, a small frame is randomly selected to generate a small rectangular frame with four vertices relative to the vehicle logo vertices whose offsets are 1.5 to 3 times the width and height of the vehicle logo area. This rectangular box selects the image containing the car logo area, and intercepts it as a scale-dimension enhanced image. Then, quantitative dimension enhancement is carried out. Through the sliding cross-segmentation method, the image is scanned and intercepted every 1/4 step with a rectangular box whose size is 1/2 of the source image, and a large number of effective quantitative dimension enhanced images are generated. Through the normal distribution segmentation method, a rectangular box whose length is from the center of the car logo area to the minimum value of the left and right sides of the image, and the width is from the center of the car logo area to the minimum value of the upper and lower sides of the image is generated. The probability of , selects a part of the image containing the car logo area and intercepts it as a spatial dimension enhanced image. Finally, the brightness and contrast dimensions of all the above data are enhanced using brightness transformation and contrast transformation. The invention can effectively expand the data amount of the vehicle logo.

Figure 202011114773

Description

Multi-dimensional vehicle logo data enhancement method
Technical Field
The invention relates to the field of data enhancement of pattern recognition and computer vision, in particular to a multi-dimensional car logo data enhancement method.
Background
In the deep learning field, big data is the basis for supporting object feature learning. Training the network requires a large amount of data as support in order to better extract the features of the target. If the data quality in the data set is not good enough, it is usually optimized using methods such as data equalization or data enhancement. The significance of data enhancement is that training data are converted through a certain method to generate new data, so that the network can learn features better. Through data enhancement, the original data set can be enriched and the data set can be expanded. It can prevent overfitting due to small amount of data during training. This has important significance to the recognition and detection capability of the model. Most of the data of the current car logo data base set are images which are separated from scenes and used for classification, and the data set cannot be used for detection training. The data set that can be used for detection training is too small to meet the large amount of data required for training. Therefore, data enhancement of the emblem data set is required to meet the demand.
Disclosure of Invention
In view of this, the present invention provides a method for enhancing data of a multi-dimensional emblem, which can effectively expand the data amount of the emblem.
The invention is realized by adopting the following scheme: a multi-dimensional car logo data enhancement method comprises the following steps:
step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the car logo region relative to the car logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method of selecting the car logo region image by using the small rectangular frame;
step S2: scanning an original image by using a cross sliding window method, wherein the length and the width of a window are 1/2 of the original image, the sliding step length is 1/4 of the original image, and then intercepting a window image containing a complete car logo area in each scanning as a number dimension enhanced image;
step S3: generating a rectangle with the length being the minimum value of the distance from the center of the car logo area to the left side and the right side of the image and the width being the minimum value of the distance from the center of the car logo area to the upper side and the lower side of the image by using a normal distribution segmentation method, selecting a window image containing the complete car logo area according to the probability of normal distribution, and intercepting the window image as a space dimension enhanced image;
step S4: the enhanced images generated in the integration step S1, step S2 and step S3 are further enhanced by using a luminance transformation method and a contrast transformation method, respectively.
Further, the step S1 specifically includes the following steps:
step S11: generating the offsets dx1, dy1, dx2, dy2 from the four sides of the rectangular frame to the four sides of the emblem area
Figure BDA0002728918000000021
Wherein dx1, dy1, dx2, dy2 are offsets of the left, upper, right and lower sides of the rectangular frame from the left, upper, right and lower sides of the emblem region, rw and rh are the width and height of the emblem region, respectively, random (1.5, 3) is a function that randomly generates uniformly distributed random floating point numbers between 1.5 and 3;
step S12: and calculating the position of the generated new image in the original image by the following method:
Figure BDA0002728918000000031
wherein sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image; sf _ xx2 and sf _ yy2 are positions of the dimension enhanced image at the lower right corner of the original image, and dx1, dy1, dx2 and dy2 are offsets of the left side, the upper side, the right side and the lower side of the rectangular frame from the left side, the upper side, the right side and the lower side of the logo area respectively;
step S13, calculating the new coordinate of the center of the car logo:
Figure BDA0002728918000000032
wherein cx is an x coordinate of the center of the new logo, cy is a y coordinate of the center of the new logo, and sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image;
step S14: generating a new car logo image by using the position of the scale dimension enhanced image calculated in the step S12, and writing the calculated car logo information into a file named by adding an sf suffix to the file name of the original image;
further, the step S2 specifically includes the following steps:
step S21: calculating the length and width of the subimages
Figure BDA0002728918000000041
Wherein nw and nh are respectively the width and height of the enhanced image in the number dimension, and ow and oh are respectively the width and height of the original image;
step S22: and calculating the positions of the generated number dimension enhanced images in the original image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000042
wherein i is 0,1, 2, j is 0,1, 2, ss _ xx1 and ss _ yy1 are positions of the upper left corner of the enhanced image in the number dimension in the original image, ss _ xx2 and ss _ yy2 are positions of the enhanced image in the lower right corner of the original image in the number dimension, and nw and nh are the width and the height of the enhanced image in the number dimension respectively;
step S23: judging whether the generated quantity dimension enhanced image contains a complete car logo area, if so, retaining the sub-image, otherwise, discarding;
step S24: a new emblem image is generated using the number dimension enhanced image positions calculated in step S22, and the calculated emblem position information is written into a file named by the original image file name plus the ss suffix.
Further, the step S3 specifically includes the following steps:
step S31: a function randn (0,1) is constructed to generate a number between 0 and 1 that should satisfy the normal distribution:
Figure BDA0002728918000000051
it is calculated that when μ is 0.5 and σ is 0.1938, the random number generated using the normal distribution can satisfy 0 to 1; the norm function in Matlab is used as a vector, so the randn (0,1) function is specifically constructed by using the norm function in Matlab (0.5, 0.1938);
step S32: calculating the width and height of the spatial dimension enhanced image:
Figure BDA0002728918000000052
wherein nw and nh are the width and height, respectively, of the spatial dimension enhanced image; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; ow and oh are the width and height of the original image, respectively;
step S33: calculating the position of the central point of the car logo area in the generated space dimension enhanced image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000053
wherein nx and ny are respectively an x coordinate and a y coordinate of a target center point of the spatial dimension enhanced image; ow and oh are the width and height of the original image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image; rw and rh are the width and height of the target region, respectively; randn (0,1) is a function that generates a number between 0 and 1 that satisfies a normal distribution;
step S34: and calculating the position of the generated space dimension enhanced image in the original image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000061
wherein mr _ xx1 and mr _ yy1 are positions of the upper left corner of the spatial dimension enhanced image in the original image; mr _ xx2 and mr _ yy2 are positions of the enhanced image in the lower right corner of the original image in the spatial dimension; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; nx and ny are the x coordinate and the y coordinate of the target center point of the spatial dimension enhanced image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image;
step S35: a new car logo image is generated using the spatial dimension enhanced image position calculated in step S34, and the calculated car logo position information is written in a file named by adding an mr suffix to the original image file name.
Further, the step S4 specifically includes the following steps:
step S41: integrating all the images generated in step S1, step S2, and step S3, copying the original image so that the ratio of the number of generated images to the number of original images is 3: 1;
step S42: performing brightness transformation processing on all the images generated in the steps S1, S2 and S3, wherein the brightness of each picture is randomly selected from 0.5 to 1.5 for transformation so as to strengthen the distribution of the data set in the brightness dimension;
step S43: performing contrast transformation processing on all the images generated in the steps S1, S2 and S3, wherein the contrast of each image is randomly selected from 0.5 to 1.5 for transformation, so as to strengthen the distribution of the data set in the contrast dimension;
step S44: all images are integrated to form a new data set.
Compared with the prior art, the invention has the following beneficial effects:
(1) the invention can effectively perform data expansion on the car logo image.
(2) The small frame random selection method provided by the invention can effectively increase the proportion of the car logo in the overall image, so that the characteristics of the car logo in the image are more prominent.
(3) The cross sliding segmentation method provided by the invention can effectively increase the number of pictures without changing the length-width ratio of the original image.
(4) The normal distribution segmentation method provided by the invention can maximally keep the information in the image on the premise of greatly increasing the diversity of the positions of the car logos.
Drawings
FIG. 1 is a flow chart of a method according to an embodiment of the present invention.
Detailed Description
The invention is further explained below with reference to the drawings and the embodiments.
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.
As shown in fig. 1, the present embodiment provides a method for enhancing multidimensional car logo data, including the following steps:
step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the vehicle logo region relative to the vehicle logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method of selecting a vehicle logo region image by using the small rectangular frame, wherein the generated image can effectively increase the proportion of the vehicle logo region in the whole image from the size dimension, and highlights the characteristic information of the vehicle logo in the image;
step S2: scanning an original image by using a cross sliding window method, wherein the length and the width of a window are 1/2 of the original image, the sliding step length is 1/4 of the original image, and then intercepting a window image containing a complete car logo area in each scanning as a number dimension enhanced image;
step S3: generating a rectangle with the length being the minimum value of the distance from the center of the car logo area to the left side and the right side of the image and the width being the minimum value of the distance from the center of the car logo area to the upper side and the lower side of the image by using a normal distribution segmentation method, selecting a window image containing the complete car logo area according to the probability of normal distribution, and intercepting the window image as a space dimension enhanced image;
step S4: the enhanced images generated in the integration step S1, step S2 and step S3 are further enhanced by using a luminance transformation method and a contrast transformation method, respectively.
In this embodiment, the step S1 specifically includes the following steps:
step S11: generating the offsets dx1, dy1, dx2, dy2 from the four sides of the rectangular frame to the four sides of the emblem area
Figure BDA0002728918000000081
Wherein dx1, dy1, dx2, dy2 are offsets of the left, upper, right and lower sides of the rectangular frame from the left, upper, right and lower sides of the emblem region, rw and rh are the width and height of the emblem region, respectively, random (1.5, 3) is a function that randomly generates uniformly distributed random floating point numbers between 1.5 and 3;
step S12: and calculating the position of the generated new image in the original image by the following method:
Figure BDA0002728918000000091
wherein sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image; sf _ xx2 and sf _ yy2 are positions of the dimension enhanced image at the lower right corner of the original image, and dx1, dy1, dx2 and dy2 are offsets of the left side, the upper side, the right side and the lower side of the rectangular frame from the left side, the upper side, the right side and the lower side of the logo area respectively;
step S13, calculating the new coordinate of the center of the car logo:
Figure BDA0002728918000000092
wherein cx is an x coordinate of the center of the new logo, cy is a y coordinate of the center of the new logo, and sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image;
step S14: generating a new car logo image by using the position of the scale dimension enhanced image calculated in the step S12, and writing the calculated car logo information into a file named by adding a suffix of 'sf' to the file name of the original image;
in this embodiment, the step S2 specifically includes the following steps:
step S21: calculating the length and width of the subimages
Figure BDA0002728918000000101
Wherein nw and nh are respectively the width and height of the enhanced image in the number dimension, and ow and oh are respectively the width and height of the original image;
step S22: and calculating the positions of the generated number dimension enhanced images in the original image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000102
wherein i is 0,1, 2, j is 0,1, 2, ss _ xx1 and ss _ yy1 are positions of the upper left corner of the enhanced image in the number dimension in the original image, ss _ xx2 and ss _ yy2 are positions of the enhanced image in the lower right corner of the original image in the number dimension, and nw and nh are the width and the height of the enhanced image in the number dimension respectively;
step S23: judging whether the generated quantity dimension enhanced image contains a complete car logo area, if so, retaining the sub-image, otherwise, discarding;
step S24: a new emblem image is generated using the number-dimension enhanced image positions calculated in step S22, and the calculated emblem position information is written in a file named after the original image file name plus the "ss" suffix.
In this embodiment, the step S3 specifically includes the following steps:
step S31: a function randn (0,1) is constructed to generate a number between 0 and 1 that should satisfy the normal distribution:
Figure BDA0002728918000000111
it can be found by calculation that the random number generated using the normal distribution can satisfy 0 to 1 when μ is 0.5 and σ is 0.1938. The invention uses the norm function in Matlab as a carrier, so the randn (0,1) function is constructed by using the norm function in Matlab (0.5, 0.1938).
Step S32: calculating the width and height of the spatial dimension enhanced image:
Figure BDA0002728918000000112
wherein nw and nh are the width and height, respectively, of the spatial dimension enhanced image; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; ow and oh are the width and height of the original image, respectively;
step S33: calculating the position of the central point of the car logo area in the generated space dimension enhanced image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000113
wherein nx and ny are respectively an x coordinate and a y coordinate of a target center point of the spatial dimension enhanced image; ow and oh are the width and height of the original image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image; rw and rh are the width and height of the target region, respectively; randn (0,1) is a function that generates a number between 0 and 1 that satisfies a normal distribution;
step S34: and calculating the position of the generated space dimension enhanced image in the original image, wherein the calculation method comprises the following steps:
Figure BDA0002728918000000121
wherein mr _ xx1 and mr _ yy1 are positions of the upper left corner of the spatial dimension enhanced image in the original image; mr _ xx2 and mr _ yy2 are positions of the enhanced image in the lower right corner of the original image in the spatial dimension; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; nx and ny are the x coordinate and the y coordinate of the target center point of the spatial dimension enhanced image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image;
step S35: a new car logo image is generated using the spatial dimension enhanced image position calculated in step S34, and the calculated car logo position information is written in a file named by adding the suffix "mr" to the original image file name.
In this embodiment, the step S4 specifically includes the following steps:
step S41: integrating all the images generated in step S1, step S2, and step S3, copying the original image so that the ratio of the number of generated images to the number of original images is 3: 1;
step S42: performing brightness transformation processing on all the images generated in the steps S1, S2 and S3, wherein the brightness of each picture is randomly selected from 0.5 to 1.5 for transformation so as to strengthen the distribution of the data set in the brightness dimension;
step S43: performing contrast transformation processing on all the images generated in the steps S1, S2 and S3, wherein the contrast of each image is randomly selected from 0.5 to 1.5 for transformation, so as to strengthen the distribution of the data set in the contrast dimension;
step S44: all images are integrated to form a new data set.
Preferably, the data enhancement can be effectively carried out on the car logo data set by the embodiment. Firstly, a small-frame random selection method is used, and the method is mainly used for solving the problem that the car logo occupies a small area in the main body image. Compared with the original image, the proportion of the car logo area can be averagely improved by 3 times by using the image generated by the method of the embodiment. And then, enhancing the image by using a cross sliding segmentation method. The aspect ratio of the image generated by the method of the embodiment is consistent with that of the original image, so that the generated image is more consistent with the original data distribution, and meanwhile, the number of effective data can be greatly increased. Normal distribution segmentation was then used to enrich the positional diversity of the data. The distribution of the positions of the car logos of the images generated by the method of the embodiment meets the normal distribution, and the distribution has important significance in probability. And finally, performing data integration enhancement, and copying original data to ensure that the quantity ratio of the generated data to the original data is 3: 1, simultaneously, brightness and contrast change is carried out on all data generated by the method, and the diversity of the data is further expanded.
The above description is only a preferred embodiment of the present invention, and all equivalent changes and modifications made in accordance with the claims of the present invention should be covered by the present invention.

Claims (5)

1. A multidimensional car logo data enhancement method is characterized by comprising the following steps: the method comprises the following steps:
step S1: generating a small rectangular frame with four vertexes having offsets of 1.5 to 3 times of the width and the height of the car logo region relative to the car logo vertexes by using a small frame random selection method, and generating a scale dimension enhanced image by using a method for selecting the car logo region image by using the small rectangular frame;
step S2: scanning an original image by using a cross sliding window method, wherein the length and the width of a window are 1/2 of the original image, the sliding step length is 1/4 of the original image, and then intercepting a window image containing a complete car logo area in each scanning as a number dimension enhanced image;
step S3: generating a rectangle with the length being the minimum value of the distance from the center of the car logo area to the left side and the right side of the image and the width being the minimum value of the distance from the center of the car logo area to the upper side and the lower side of the image by using a normal distribution segmentation method, selecting a window image containing the complete car logo area according to the probability of normal distribution, and intercepting the window image as a space dimension enhanced image;
step S4: the enhanced images generated in the integration step S1, step S2 and step S3 are further enhanced by using a luminance transformation method and a contrast transformation method, respectively.
2. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S1 specifically includes the following steps:
step S11: generating the offsets dx1, dy1, dx2, dy2 from the four sides of the rectangular frame to the four sides of the emblem area
Figure FDA0002728917990000011
Wherein dx1, dy1, dx2, dy2 are offsets of the left, upper, right and lower sides of the rectangular frame from the left, upper, right and lower sides of the emblem region, rw and rh are the width and height of the emblem region, respectively, random (1.5, 3) is a function that randomly generates uniformly distributed random floating point numbers between 1.5 and 3;
step S12: and calculating the position of the generated new image in the original image by the following method:
Figure FDA0002728917990000021
wherein sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image; sf _ xx2 and sf _ yy2 are positions of the dimension enhanced image at the lower right corner of the original image, and dx1, dy1, dx2 and dy2 are offsets of the left side, the upper side, the right side and the lower side of the rectangular frame from the left side, the upper side, the right side and the lower side of the logo area respectively;
step S13: calculating new car logo center coordinates:
Figure FDA0002728917990000022
wherein cx is an x coordinate of the center of the new logo, cy is a y coordinate of the center of the new logo, and sf _ xx1 and sf _ yy1 are positions of the upper left corner of the dimension-enhanced image in the original image;
step S14: a new car logo image is generated using the position of the enhanced image in the scale dimension calculated in step S12, and the calculated car logo information is written in a file named by the original image file name plus the sf suffix.
3. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S2 specifically includes the following steps:
step S21: calculating the length and width of the subimages
Figure FDA0002728917990000031
Wherein nw and nh are respectively the width and height of the enhanced image in the number dimension, and ow and oh are respectively the width and height of the original image;
step S22: and calculating the positions of the generated number dimension enhanced images in the original image, wherein the calculation method comprises the following steps:
Figure FDA0002728917990000032
wherein i is 0,1, 2, j is 0,1, 2, ss _ xx1 and ss _ yy1 are positions of the upper left corner of the enhanced image in the number dimension in the original image, ss _ xx2 and ss _ yy2 are positions of the enhanced image in the lower right corner of the original image in the number dimension, and nw and nh are the width and the height of the enhanced image in the number dimension respectively;
step S23: judging whether the generated quantity dimension enhanced image contains a complete car logo area, if so, retaining the sub-image, otherwise, discarding;
step S24: a new emblem image is generated using the number dimension enhanced image positions calculated in step S22, and the calculated emblem position information is written into a file named by the original image file name plus the ss suffix.
4. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S3 specifically includes the following steps:
step S31: a function randn (0,1) is constructed to generate a number between 0 and 1 that should satisfy the normal distribution:
Figure FDA0002728917990000041
it is calculated that when μ is 0.5 and σ is 0.1938, the random number generated using the normal distribution can satisfy 0 to 1; the norm function in Matlab is used as a vector, so the randn (0,1) function is specifically constructed by using the norm function in Matlab (0.5, 0.1938);
step S32: calculating the width and height of the spatial dimension enhanced image:
Figure FDA0002728917990000042
wherein nw and nh are the width and height, respectively, of the spatial dimension enhanced image; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; ow and oh are the width and height of the original image, respectively;
step S33: calculating the position of the central point of the car logo area in the generated space dimension enhanced image, wherein the calculation method comprises the following steps:
Figure FDA0002728917990000043
wherein nx and ny are respectively an x coordinate and a y coordinate of a target central point of the spatial dimension enhanced image; ow and oh are the width and height of the original image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image; rw and rh are the width and height of the target region, respectively; randn (0,1) is a function that generates a number between 0 and 1 that satisfies a normal distribution;
step S34: and calculating the position of the generated space dimension enhanced image in the original image, wherein the calculation method comprises the following steps:
Figure FDA0002728917990000051
wherein mr _ xx1 and mr _ yy1 are positions of the upper left corner of the spatial dimension enhanced image in the original image; mr _ xx2 and mr _ yy2 are positions of the enhanced image in the lower right corner of the original image in the spatial dimension; ox and oy are the x coordinate and y coordinate of the center of the target area of the original image respectively; nx and ny are the x coordinate and the y coordinate of the target center point of the spatial dimension enhanced image, respectively; nw and nh are the width and height, respectively, of the spatial dimension enhanced image;
step S35: a new car logo image is generated using the spatial dimension enhanced image position calculated in step S34, and the calculated car logo position information is written in a file named by adding an mr suffix to the original image file name.
5. The multi-dimensional emblem data enhancement method of claim 1, characterized in that: the step S4 specifically includes the following steps:
step S41: integrating all the images generated in the steps S1, S2 and S3, and copying the original image so that the number ratio of the generated images to the original image is 3: 1;
step S42: performing brightness transformation processing on all the images generated in the steps S1, S2 and S3, wherein the brightness of each picture is randomly selected from 0.5 to 1.5 for transformation so as to strengthen the distribution of the data set in the brightness dimension;
step S43: performing contrast transformation processing on all the images generated in the steps S1, S2 and S3, wherein the contrast of each image is randomly selected from 0.5 to 1.5 for transformation, so as to strengthen the distribution of the data set in the contrast dimension;
step S44: all images are integrated to form a new data set.
CN202011114773.4A 2020-10-16 2020-10-16 Multi-dimensional vehicle logo data enhancement method Active CN112200258B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011114773.4A CN112200258B (en) 2020-10-16 2020-10-16 Multi-dimensional vehicle logo data enhancement method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011114773.4A CN112200258B (en) 2020-10-16 2020-10-16 Multi-dimensional vehicle logo data enhancement method

Publications (2)

Publication Number Publication Date
CN112200258A true CN112200258A (en) 2021-01-08
CN112200258B CN112200258B (en) 2022-06-10

Family

ID=74010186

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011114773.4A Active CN112200258B (en) 2020-10-16 2020-10-16 Multi-dimensional vehicle logo data enhancement method

Country Status (1)

Country Link
CN (1) CN112200258B (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108230233A (en) * 2017-05-16 2018-06-29 北京市商汤科技开发有限公司 Data enhancing, treating method and apparatus, electronic equipment and computer storage media
CN110096962A (en) * 2019-04-04 2019-08-06 苏州千视通视觉科技股份有限公司 Vehicle Detail based on region convolutional network identifies secondary structure method and device
US20200315587A1 (en) * 2019-04-04 2020-10-08 Koninklijke Philips N.V. Data augmentation for training deep learning models with ultrasound images

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108230233A (en) * 2017-05-16 2018-06-29 北京市商汤科技开发有限公司 Data enhancing, treating method and apparatus, electronic equipment and computer storage media
CN110096962A (en) * 2019-04-04 2019-08-06 苏州千视通视觉科技股份有限公司 Vehicle Detail based on region convolutional network identifies secondary structure method and device
US20200315587A1 (en) * 2019-04-04 2020-10-08 Koninklijke Philips N.V. Data augmentation for training deep learning models with ultrasound images

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
MAHDI SALEH ET AL.: "Enhanced algorithm for autonomous parallel parking of a car-like mobile robot", 《2016 IEEE INTERNATIONAL MULTIDISCIPLINARY CONFERENCE ON ENGINEERING TECHNOLOGY (IMCET)》 *
杨昌东等: "基于AT-PGGAN的增强数据车辆型号精细识别", 《中国图象图形学报》 *
郭耀华: "基于深度学习的车辆驾驶状态识别算法研究", 《中国优秀博硕士学位论文全文数据库(硕士)工程科技Ⅱ辑》 *
陈文兵等: "基于条件生成式对抗网络的数据增强方法", 《计算机应用》 *

Also Published As

Publication number Publication date
CN112200258B (en) 2022-06-10

Similar Documents

Publication Publication Date Title
CN109003282B (en) Image processing method and device and computer storage medium
US7945114B2 (en) Image transform method for obtaining expanded image data, image processing apparatus and image display device therefore
TWI571803B (en) Method for generating color two-dimensional barcode
CN108304814B (en) Method for constructing character type detection model and computing equipment
US8687923B2 (en) Robust patch regression based on in-place self-similarity for image upscaling
US8036494B2 (en) Enhancing image resolution
US8873846B2 (en) Detecting and tracking point features with primary colors
US6686922B2 (en) Adaptive thresholding using loose gray scale template matching
US8879839B2 (en) Image processing apparatus, image processing method, and storage medium
CN111784588A (en) Image data enhancement method, apparatus, computer equipment and storage medium
KR20110086826A (en) Method, apparatus and software for determining motion vectors
US7327904B2 (en) Pattern classification and filter design for increasing image resolution
CN112200258A (en) Multi-dimensional vehicle logo data enhancement method
JP3026706B2 (en) Image processing device
US20190295310A1 (en) Uniform density cube map rendering for spherical projections
JP2000357226A (en) Method for binarizing light and shade image and recording medium where binarizing program is recorded
WO2013011797A1 (en) Degradation restoration system, degradation restoration method and program
JP4911585B2 (en) Image processing apparatus, image processing method, program, and information recording medium
JP2004133592A (en) Image processor, image processing method and image processing program for magnifying image
CN116645299B (en) Method and device for enhancing depth fake video data and computer equipment
JP2015197762A (en) Image processing apparatus, image processing method, and image processing program
CN110689601A (en) Scatter elimination algorithm suitable for AR virtual soft package synthesis
JP2006050481A (en) Image processing apparatus, image processing method and program thereof
KR100516681B1 (en) Enlarging and smoothing textual characters
CN116260974B (en) Video scaling method and system and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant