WO2020228171A1 - Data enhancement method and device, and computer readable storage medium - Google Patents

Data enhancement method and device, and computer readable storage medium

Info

Publication number
WO2020228171A1
WO2020228171A1 PCT/CN2019/102195 CN2019102195W WO2020228171A1 WO 2020228171 A1 WO2020228171 A1 WO 2020228171A1 CN 2019102195 W CN2019102195 W CN 2019102195W WO 2020228171 A1 WO2020228171 A1 WO 2020228171A1
Authority
WO
WIPO (PCT)
Prior art keywords
picture
picture set
extended
merged
processed
Prior art date
Application number
PCT/CN2019/102195
Other languages
French (fr)
Chinese (zh)
Inventor
许洋
刘鹏
王健宗
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司
Publication of WO2020228171A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/21 Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214 Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Definitions

  • This application relates to the field of computer technology, and in particular to a data enhancement method, device and computer-readable storage medium.
  • OCR services to date are essentially data-driven: they rely on large amounts of data to train deep learning models (for example, neural networks), which are then used to detect and recognize target fields. Most deep learning architectures in use, especially supervised neural networks, require a large amount of training data. Before a detection and recognition model can be trained, the position of each required field on the picture and its true text value must be annotated manually.
  • This application provides a data enhancement method, device, and computer-readable storage medium, the main purpose of which is to increase the diversity of training data so that it more closely reflects real-world scenarios.
  • To achieve the above objective, the present application provides a data enhancement method, the method including: obtaining an original picture; performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and taking the third extended picture set as the enhanced picture set.
  • In addition, to achieve the above objective, the present application also provides a data enhancement device. The device includes a memory and a processor, the memory stores a data enhancement program that can run on the processor, and when the data enhancement program is executed by the processor, the following steps are implemented: obtaining an original picture; performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and taking the third extended picture set as the enhanced picture set.
  • In addition, to achieve the above objective, the present application also provides a computer-readable storage medium storing a data enhancement program, the data enhancement program being executable by one or more processors to implement the steps of the data enhancement method described above.
  • This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and takes the third extended picture set as the enhanced picture set. This increases the diversity of the data and enhances the data so that the training samples are more sufficient.
  • FIG. 1 is a schematic flowchart of a data enhancement method provided by an embodiment of this application
  • FIG. 2 is a schematic diagram of the internal structure of a data enhancement device provided by an embodiment of the application.
  • FIG. 3 is a schematic diagram of modules of a data enhancement program in a data enhancement device provided by an embodiment of the application.
  • This application provides a data enhancement method.
  • Referring to FIG. 1, a schematic flowchart of a data enhancement method provided by an embodiment of this application is shown.
  • The method can be executed by a device, and the device can be implemented by software and/or hardware.
  • The data enhancement method includes:
  • The original pictures are pictures from the training set.
  • The data enhancement method can be executed on each training picture in the training set to obtain multiple styles of pictures corresponding to it, which increases the amount of training data and improves the accuracy of the trained model.
  • the first processing operation includes at least one of the following: blur processing and noise processing.
  • Performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
  • Different window sizes are used to mean-blur the original picture, yielding one blurred picture per window size; varying the window size thus produces different pictures and achieves data enhancement.
  • the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation.
  • Performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
  • Filling processing is performed on the fourth combined picture set to obtain the second extended picture set.
  • The coordinate translation operation is performed before the image is rotated, which avoids loss of information. After the image is rotated, many holes appear; these holes must be filled, otherwise the picture quality suffers. This filling operation is generally called interpolation.
  • the moving operation includes performing horizontal and vertical moving processing on the picture.
  • Horizontal and vertical movement means randomly selecting an offset for the x coordinate and for the y coordinate (each offset may be positive or negative, with the sign indicating the direction of movement) and then translating the x and y coordinates by those offsets.
  • This operation comes in two types. In the first, the image size does not change, so part of the original image falls outside the frame. In the second, the image size changes so that the content of the original picture is preserved.
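  • The formula, a translation in homogeneous coordinates, is as follows: $\begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} 1 & 0 & dx \\ 0 & 1 & dy \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x_0 \\ y_0 \\ 1 \end{bmatrix}$, that is, $x = x_0 + dx$ and $y = y_0 + dy$.
  • Here $[x_0\ y_0\ 1]^T$ is the original picture coordinates, and $[x\ y\ 1]^T$ is the coordinates after translation.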
  • Different values of the parameters dx and dy are set to move the image horizontally and vertically. The pictures processed in this step are merged with the previously enhanced data to achieve data enhancement.
  • the second processing operation further includes a zooming operation and a cropping operation.
  • Performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
  • The third processing operation includes at least one of the following: a contrast processing operation, a color processing operation.
  • Performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
  • This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and takes the third extended picture set as the enhanced picture set.
  • Eight data enhancement methods (blur, Gaussian noise, horizontal and vertical movement, rotation, random zoom, contrast transformation, color transformation, and random cropping) each increase the diversity of the data to a certain extent. Mixing them (for example, randomly selecting one or several methods, or stacking several methods in different orders) generates a wide variety of outputs, enhancing the data and making the training samples more sufficient.
  • the application also provides a data enhancement device.
  • Referring to FIG. 2, a schematic diagram of the internal structure of a data enhancement device provided by an embodiment of this application is shown.
  • the data enhancement device 1 may be a personal computer (Personal Computer, PC), or a terminal device such as a smart phone, a tablet computer, or a portable computer.
  • the data enhancement device 1 at least includes a memory 11, a processor 12, a communication bus 13, and a network interface 14.
  • the memory 11 includes at least one type of readable storage medium, and the readable storage medium includes flash memory, hard disk, multimedia card, card-type memory (for example, SD or DX memory, etc.), magnetic memory, magnetic disk, optical disk, etc.
  • the memory 11 may be an internal storage unit of the data enhancement device 1 in some embodiments, for example, the hard disk of the data enhancement device 1.
  • In other embodiments, the memory 11 may also be an external storage device of the data enhancement device 1, such as a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, a flash card, etc.
  • the memory 11 may also include both an internal storage unit of the data enhancement apparatus 1 and an external storage device.
  • the memory 11 can be used not only to store application software and various types of data installed in the data enhancement device 1, such as the code of the data enhancement program 01, etc., but also to temporarily store data that has been output or will be output.
  • In some embodiments, the processor 12 may be a central processing unit (CPU), controller, microcontroller, microprocessor, or other data processing chip, and is used to run the program code or process the data stored in the memory 11, for example to execute the data enhancement program 01.
  • the communication bus 13 is used to realize the connection and communication between these components.
  • the network interface 14 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is usually used to establish a communication connection between the device 1 and other electronic devices.
  • the device 1 may also include a user interface.
  • The user interface may include a display and an input unit such as a keyboard.
  • Optionally, the user interface may also include a standard wired interface and a wireless interface.
  • the display may be an LED display, a liquid crystal display, a touch liquid crystal display, an organic light-emitting diode (OLED) touch device, and the like.
  • the display can also be called a display screen or a display unit as appropriate, and is used to display the information processed in the data enhancement device 1 and to display a visualized user interface.
  • Figure 2 only shows the data enhancement device 1 with components 11-14 and the data enhancement program 01.
  • The structure shown in Figure 2 does not constitute a limitation on the data enhancement device 1, which may include fewer or more components than shown, a combination of some components, or a different arrangement of components.
  • the data enhancement program 01 is stored in the memory 11; when the processor 12 executes the data enhancement program 01 stored in the memory 11, the following steps are implemented:
  • The original pictures are pictures from the training set.
  • The data enhancement method can be executed on each training picture in the training set to obtain multiple styles of pictures corresponding to it, which increases the amount of training data and improves the accuracy of the trained model.
  • the first processing operation includes at least one of the following: blur processing and noise processing.
  • Performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
  • Different window sizes are used to mean-blur the original picture, yielding one blurred picture per window size; varying the window size thus produces different pictures and achieves data enhancement.
  • a second processing operation is performed on the first extended picture set to obtain the processed first extended picture set, and the processed first extended picture set and the first extended picture set are combined to obtain the second extended picture set.
  • the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation.
  • Performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
  • Filling processing is performed on the fourth combined picture set to obtain the second extended picture set.
  • The coordinate translation operation is performed before the image is rotated, which avoids loss of information. After the image is rotated, many holes appear; these holes must be filled, otherwise the picture quality suffers. This filling operation is generally called interpolation.
  • the moving operation includes performing horizontal and vertical moving processing on the picture.
  • Horizontal and vertical movement means randomly selecting an offset for the x coordinate and for the y coordinate (each offset may be positive or negative, with the sign indicating the direction of movement) and then translating the x and y coordinates by those offsets.
  • This operation comes in two types. In the first, the image size does not change, so part of the original image falls outside the frame. In the second, the image size changes so that the content of the original picture is preserved.
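  • The formula, a translation in homogeneous coordinates, is as follows: $\begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} 1 & 0 & dx \\ 0 & 1 & dy \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x_0 \\ y_0 \\ 1 \end{bmatrix}$, that is, $x = x_0 + dx$ and $y = y_0 + dy$.
  • Here $[x_0\ y_0\ 1]^T$ is the original picture coordinates, and $[x\ y\ 1]^T$ is the coordinates after translation.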
  • Different values of the parameters dx and dy are set to move the image horizontally and vertically. The pictures processed in this step are merged with the previously enhanced data to achieve data enhancement.
  • the second processing operation further includes a zooming operation and a cropping operation.
  • Performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
  • a third processing operation is performed on the second extended picture set to obtain the processed second extended picture set, and the processed second extended picture set and the second extended picture set are combined to obtain the third extended picture set.
  • The third processing operation includes at least one of the following: a contrast processing operation, a color processing operation.
  • Performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
  • the third extended picture set is used as an enhanced picture set.
  • This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and takes the third extended picture set as the enhanced picture set.
  • Eight data enhancement methods (blur, Gaussian noise, horizontal and vertical movement, rotation, random zoom, contrast transformation, color transformation, and random cropping) each increase the diversity of the data to a certain extent. Mixing them (for example, randomly selecting one or several methods, or stacking several methods in different orders) generates a wide variety of outputs, enhancing the data and making the training samples more sufficient.
  • The data enhancement program may also be divided into one or more modules, which are stored in the memory 11 and executed by one or more processors (in this embodiment, the processor 12) to carry out this application.
  • the module referred to in the application refers to a series of computer program instruction segments capable of completing specific functions, and is used to describe the execution process of the data enhancement program in the data enhancement device.
  • FIG. 3 is a schematic diagram of the program modules of the data enhancement program in an embodiment of the data enhancement device of this application.
  • the data enhancement program can be divided into an acquisition module 10, a processing module 20, and a determination module 30.
  • The acquisition module 10 obtains the original picture.
  • The processing module 20 performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing.
  • The processing module 20 performs a second processing operation on the first extended picture set to obtain the processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation.
  • The processing module 20 performs a third processing operation on the second extended picture set to obtain the processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain the third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation.
  • The determination module 30 uses the third extended picture set as the enhanced picture set.
  • an embodiment of the present application also proposes a computer-readable storage medium having a data enhancement program stored on the computer-readable storage medium, and the data enhancement program can be executed by one or more processors to implement the following operations:
  • the first processing operation includes at least one of the following: blur processing, noise processing;
  • the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
  • a third processing operation is performed on the second extended picture set to obtain the processed second extended picture set, and the processed second extended picture set and the second extended picture set are merged to obtain the third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
  • the third extended picture set is used as an enhanced picture set.
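
As a minimal illustration of the mean blurring and translation operations described in the embodiments above, the following Python sketch uses NumPy. The function names `mean_blur`, `translate`, and `blur_variants`, the replicated-edge padding, the zero fill value, and the example window sizes are assumptions made for this example; the application does not specify them.

```python
import numpy as np


def mean_blur(img: np.ndarray, k: int) -> np.ndarray:
    """Mean (box) blur of a 2-D grayscale image with an odd k x k window.

    Assumption: borders are handled by replicating edge pixels.
    """
    if k < 1 or k % 2 == 0:
        raise ValueError("window size k must be a positive odd integer")
    pad = k // 2
    padded = np.pad(img.astype(np.float64), pad, mode="edge")
    h, w = img.shape
    out = np.zeros((h, w), dtype=np.float64)
    # Sum the k*k shifted copies of the image, then divide by the window area.
    for oy in range(k):
        for ox in range(k):
            out += padded[oy:oy + h, ox:ox + w]
    return out / (k * k)


def blur_variants(img: np.ndarray, window_sizes=(3, 5)) -> list:
    """One blurred picture per window size, as in the first processing operation."""
    return [mean_blur(img, k) for k in window_sizes]


def translate(img: np.ndarray, dx: int, dy: int, keep_size: bool = True, fill=0) -> np.ndarray:
    """Shift a picture by (dx, dy), i.e. x = x0 + dx, y = y0 + dy.

    keep_size=True : output has the input size; content moved out of frame is lost.
    keep_size=False: the canvas grows by |dx|, |dy| so all original content is kept.
    """
    h, w = img.shape[:2]
    if keep_size:
        out = np.full_like(img, fill)
        # Source window that is still inside the frame after the shift.
        xs0, xs1 = max(0, -dx), min(w, w - dx)
        ys0, ys1 = max(0, -dy), min(h, h - dy)
        if xs0 < xs1 and ys0 < ys1:
            out[ys0 + dy:ys1 + dy, xs0 + dx:xs1 + dx] = img[ys0:ys1, xs0:xs1]
        return out
    out = np.full((h + abs(dy), w + abs(dx)) + img.shape[2:], fill, dtype=img.dtype)
    oy, ox = max(0, dy), max(0, dx)
    out[oy:oy + h, ox:ox + w] = img
    return out
```

Calling `translate` with `keep_size=True` corresponds to the first type of movement (the image size is unchanged and part of the original content is lost), and `keep_size=False` to the second type (the image size changes and the original content is preserved).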

Abstract

Disclosed is a data enhancement method. The method comprises: obtaining an original picture; performing a first processing operation on the original picture to obtain a first processed picture, and combining the first processed picture with the original picture to obtain a first augmented picture set; performing a second processing operation on the first augmented picture set to obtain a processed first augmented picture set, and combining the processed first augmented picture set with the first augmented picture set to obtain a second augmented picture set; performing a third processing operation on the second augmented picture set to obtain a processed second augmented picture set, and combining the processed second augmented picture set with the second augmented picture set to obtain a third augmented picture set; and using the third augmented picture set as an enhanced picture set. The present application also provides a data enhancement device and a computer readable storage medium. The present application improves the diversity of data and enhances the data to implement full sample training.

Description

Data enhancement method, device and computer readable storage medium
This application claims priority to Chinese patent application No. 201910416659.8, filed with the Chinese Patent Office on May 16, 2019 and entitled "Data Enhancement Method, Device and Computer-readable Storage Medium", the entire content of which is incorporated herein by reference.
Technical field
This application relates to the field of computer technology, and in particular to a data enhancement method, device and computer-readable storage medium.
Background
OCR services to date are essentially data-driven: they rely on large amounts of data to train deep learning models (for example, neural networks), which are then used to detect and recognize target fields. Most deep learning architectures in use, especially supervised neural networks, require a large amount of training data. Before a detection and recognition model can be trained, the position of each required field on the picture and its true text value must be annotated manually.
In actual business, manually collected pictures often fail to cover all scenarios, and manual annotation cannot keep pace with the project schedule, so the final amount of annotated data is insufficient for the detection and recognition models to reach their best performance. A model trained on such limited annotated data is usually not very accurate: deep learning depends on large amounts of annotated data, and without enough of it the trained model performs only moderately. To improve this situation, the existing annotated data needs to be processed to increase the total amount of data.
Summary of the invention
This application provides a data enhancement method, device, and computer-readable storage medium, the main purpose of which is to increase the diversity of training data so that it more closely reflects real-world scenarios.
To achieve the above objective, the present application provides a data enhancement method, the method including: obtaining an original picture; performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and taking the third extended picture set as the enhanced picture set.
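
The three-stage "process, then merge with the input set" structure of the method can be sketched in Python as follows. This is an illustrative sketch only: the names `expand` and `enhance` are hypothetical, and applying every listed operation to every picture at each stage is an assumption (the method only requires at least one operation per stage).

```python
from typing import Callable, List, Sequence


def expand(pictures: Sequence, ops: Sequence[Callable]) -> List:
    """Apply each operation to each picture, then merge the results with the input set."""
    processed = [op(p) for p in pictures for op in ops]
    return list(pictures) + processed


def enhance(original, first_ops, second_ops, third_ops) -> List:
    """Chain the three processing stages and return the enhanced picture set."""
    first_set = expand([original], first_ops)    # e.g. blur, noise
    second_set = expand(first_set, second_ops)   # e.g. move, rotate, fill
    third_set = expand(second_set, third_ops)    # e.g. contrast, color
    return third_set
```

Under this assumption, with a, b, and c operations in the three stages, one original picture yields (1 + a)(1 + b)(1 + c) pictures, because each stage keeps its input set and adds one processed copy per operation.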
In addition, to achieve the above objective, the present application also provides a data enhancement device. The device includes a memory and a processor, the memory stores a data enhancement program that can run on the processor, and when the data enhancement program is executed by the processor, the following steps are implemented: obtaining an original picture; performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and taking the third extended picture set as the enhanced picture set.
In addition, to achieve the above objective, the present application also provides a computer-readable storage medium storing a data enhancement program, the data enhancement program being executable by one or more processors to implement the steps of the data enhancement method described above.
This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation; and takes the third extended picture set as the enhanced picture set. This increases the diversity of the data and enhances the data so that the training samples are more sufficient.
Description of the drawings
FIG. 1 is a schematic flowchart of a data enhancement method provided by an embodiment of this application;
FIG. 2 is a schematic diagram of the internal structure of a data enhancement device provided by an embodiment of this application;
FIG. 3 is a schematic diagram of the modules of the data enhancement program in a data enhancement device provided by an embodiment of this application.
The realization of the purpose, the functional characteristics, and the advantages of this application will be further described in conjunction with the embodiments and with reference to the accompanying drawings.
Detailed description
It should be understood that the specific embodiments described here are only used to explain this application and are not intended to limit it.
This application provides a data enhancement method. FIG. 1 is a schematic flowchart of a data enhancement method provided by an embodiment of this application. The method can be executed by a device, and the device can be implemented in software and/or hardware.
In this embodiment, the data enhancement method includes:
S10. Obtain an original picture.
In this embodiment, the original picture is a picture from the training set. The data enhancement method can be executed for every picture in the training set to obtain pictures in multiple styles corresponding to each training picture, which increases the amount of training data and improves the accuracy of the trained model.
S11. Perform a first processing operation on the original picture to obtain a first processed picture, and merge the first processed picture with the original picture to obtain a first extended picture set.
In an embodiment, the first processing operation includes at least one of the following: blur processing and noise processing.
In a specific implementation, performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
performing mean blur processing on the original picture with each of several different window sizes to obtain a picture corresponding to each window size, and merging the pictures corresponding to the window sizes with the original picture to obtain a first merged picture set;
performing Gaussian blur processing on each picture in the first merged picture set to obtain a Gaussian-blurred picture corresponding to each picture in the first merged picture set, and merging the Gaussian-blurred pictures with the first merged picture set to obtain a second merged picture set;
performing Gaussian noise processing on each picture in the second merged picture set to obtain a Gaussian-noise-processed picture corresponding to each picture in the second merged picture set, and merging the Gaussian-noise-processed pictures with the second merged picture set to obtain the first extended picture set.
Mean blurring the original picture with several different window sizes yields one picture per window size; varying the window size therefore produces different pictures, thereby achieving data enhancement.
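The three sub-steps of S11 can be sketched as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: pictures are small grayscale images stored as nested lists, borders are handled by replicating edge pixels, and the names `apply_kernel`, `mean_blur`, `gaussian_blur`, `gaussian_noise`, and `first_extended_set`, as well as the default window sizes and sigma values, are hypothetical. A production pipeline would more likely rely on an image-processing library.

```python
import math
import random
from typing import List, Sequence

Picture = List[List[int]]  # grayscale picture, rows of values in 0..255


def apply_kernel(picture: Picture, kernel: List[List[float]]) -> Picture:
    """Filter with a square, odd-sized, symmetric kernel; edge pixels are replicated."""
    height, width = len(picture), len(picture[0])
    radius = len(kernel) // 2
    result = []
    for y in range(height):
        row = []
        for x in range(width):
            total = 0.0
            for ky, kernel_row in enumerate(kernel):
                for kx, weight in enumerate(kernel_row):
                    sy = min(max(y + ky - radius, 0), height - 1)
                    sx = min(max(x + kx - radius, 0), width - 1)
                    total += weight * picture[sy][sx]
            row.append(min(255, max(0, round(total))))
        result.append(row)
    return result


def mean_blur(picture: Picture, window: int) -> Picture:
    """Mean (box) blur; `window` is assumed to be odd."""
    weight = 1.0 / (window * window)
    return apply_kernel(picture, [[weight] * window for _ in range(window)])


def gaussian_blur(picture: Picture, window: int = 3, sigma: float = 1.0) -> Picture:
    """Gaussian blur with a normalised window x window kernel."""
    radius = window // 2
    raw = [[math.exp(-((i - radius) ** 2 + (j - radius) ** 2) / (2 * sigma ** 2))
            for j in range(window)] for i in range(window)]
    norm = sum(map(sum, raw))
    return apply_kernel(picture, [[v / norm for v in row] for row in raw])


def gaussian_noise(picture: Picture, sigma: float, rng: random.Random) -> Picture:
    """Add zero-mean Gaussian noise to every pixel and clamp to 0..255."""
    return [[min(255, max(0, round(v + rng.gauss(0.0, sigma)))) for v in row]
            for row in picture]


def first_extended_set(original: Picture, windows: Sequence[int] = (3, 5),
                       noise_sigma: float = 10.0, seed: int = 0) -> List[Picture]:
    rng = random.Random(seed)
    # Mean blur per window size, merged with the original: first merged picture set.
    first_merged = [original] + [mean_blur(original, w) for w in windows]
    # Gaussian blur of every picture, merged: second merged picture set.
    second_merged = first_merged + [gaussian_blur(p) for p in first_merged]
    # Gaussian noise on every picture, merged: first extended picture set.
    return second_merged + [gaussian_noise(p, noise_sigma, rng) for p in second_merged]
```

With two window sizes, one original picture grows to 3 pictures after the mean blur, 6 after the Gaussian blur, and 12 after the noise step, because every merge keeps the input set alongside the processed copies.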
S12. Perform a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merge the processed first extended picture set with the first extended picture set to obtain a second extended picture set.
In an embodiment, the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation.
In a specific implementation, performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
performing filling processing on the fourth merged picture set to obtain the second extended picture set.
In this embodiment, a coordinate translation operation is performed before the image is rotated, which avoids loss of information. After the image is rotated, many hole points appear. These hole points must be filled, otherwise the picture quality is poor; this operation is also commonly called interpolation processing.
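Rotation with filling can be sketched as below. The source does not say which interpolation is used, so this sketch swaps in a common approach as an assumption: inverse mapping with nearest-neighbour interpolation, where every output pixel looks up its source pixel so that no interior holes are left. The coordinates are shifted to the picture centre before rotating and the canvas is enlarged to the rotated bounding box so that no content is lost. The function name `rotate` and the `fill` parameter are hypothetical.

```python
import math
from typing import List

Picture = List[List[int]]  # grayscale picture, rows of values in 0..255


def rotate(picture: Picture, degrees: float, fill: int = 0) -> Picture:
    """Rotate about the picture centre (clockwise on screen for positive angles)."""
    height, width = len(picture), len(picture[0])
    theta = math.radians(degrees)
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    # Canvas large enough for the rotated picture (small epsilon guards rounding).
    out_w = math.ceil(abs(width * cos_t) + abs(height * sin_t) - 1e-9)
    out_h = math.ceil(abs(width * sin_t) + abs(height * cos_t) - 1e-9)
    cx, cy = (width - 1) / 2, (height - 1) / 2    # source centre
    ox, oy = (out_w - 1) / 2, (out_h - 1) / 2     # output centre
    result = [[fill] * out_w for _ in range(out_h)]
    for y in range(out_h):
        for x in range(out_w):
            # Translate to the centre, apply the inverse rotation, translate back.
            u, v = x - ox, y - oy
            sx = cos_t * u + sin_t * v + cx
            sy = -sin_t * u + cos_t * v + cy
            ix, iy = round(sx), round(sy)         # nearest-neighbour interpolation
            if 0 <= ix < width and 0 <= iy < height:
                result[y][x] = picture[iy][ix]
            # Otherwise the pixel lies outside the source and keeps the fill value.
    return result
```

Pixels that fall outside the source picture, such as the corners of the enlarged canvas at 45 degrees, keep the `fill` value; bilinear interpolation could replace the rounding step where smoother results are needed.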
Specifically, the moving operation includes moving the picture in the horizontal and vertical directions. Here, a value is randomly selected for the x coordinate and for the y coordinate (each value can be positive or negative, and its sign indicates the direction of movement), and the x and y coordinates are then translated accordingly. There are two variants of this operation. In the first, the image size does not change, so part of the original picture ends up outside the image. In the second, the image size changes, which preserves the full content of the original picture. The formula is as follows:
$$\begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} 1 & 0 & dx \\ 0 & 1 & dy \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x_0 \\ y_0 \\ 1 \end{bmatrix}$$
where the vector $[x_0\ y_0\ 1]^T$ holds the coordinates in the original picture and $[x\ y\ 1]^T$ holds the coordinates after translation. Different values of the parameters $dx$ and $dy$ are set to move the image in the horizontal and vertical directions. The previously enhanced data and the pictures processed in this step are merged, thereby achieving data enhancement.
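The translation and its two variants (fixed image size versus enlarged image) can be illustrated with a short sketch. It assumes integer offsets and grayscale pictures stored as nested lists; the names `translate_point` and `translate` and the `keep_size` and `fill` parameters are hypothetical.

```python
from typing import List, Tuple

Picture = List[List[int]]  # grayscale picture, rows of values in 0..255


def translate_point(x0: int, y0: int, dx: int, dy: int) -> Tuple[int, int]:
    """Multiply the homogeneous translation matrix by [x0, y0, 1]^T."""
    matrix = [[1, 0, dx], [0, 1, dy], [0, 0, 1]]
    vector = [x0, y0, 1]
    x, y, _ = (sum(m * v for m, v in zip(row, vector)) for row in matrix)
    return x, y


def translate(picture: Picture, dx: int, dy: int,
              keep_size: bool = True, fill: int = 0) -> Picture:
    """Move a picture by (dx, dy); optionally grow the canvas to keep all content."""
    height, width = len(picture), len(picture[0])
    if keep_size:
        out_w, out_h, off_x, off_y = width, height, 0, 0
    else:
        out_w, out_h = width + abs(dx), height + abs(dy)
        # Negative offsets would leave the canvas, so shift the origin instead.
        off_x, off_y = max(-dx, 0), max(-dy, 0)
    result = [[fill] * out_w for _ in range(out_h)]
    for y0 in range(height):
        for x0 in range(width):
            x, y = translate_point(x0, y0, dx, dy)
            x, y = x + off_x, y + off_y
            if 0 <= x < out_w and 0 <= y < out_h:
                result[y][x] = picture[y0][x0]
    return result
```

For the picture `[[1, 2], [3, 4]]` and `dx = 1`, `dy = 0`, the fixed-size variant loses the right-hand column, while `keep_size=False` widens the canvas by one column and keeps all four pixels.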
In an embodiment, the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
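The random scaling and random cropping steps might look like the following sketch. The scale range, the minimum crop fraction, the nearest-neighbour resampling, and all function names are assumptions made for illustration; the source only states that the scale and the crop are chosen at random.

```python
import random
from typing import List

Picture = List[List[int]]  # grayscale picture, rows of values in 0..255


def scale(picture: Picture, factor: float) -> Picture:
    """Nearest-neighbour resize by a positive factor."""
    height, width = len(picture), len(picture[0])
    out_h = max(1, round(height * factor))
    out_w = max(1, round(width * factor))
    return [[picture[y * height // out_h][x * width // out_w]
             for x in range(out_w)] for y in range(out_h)]


def random_scale(picture: Picture, rng: random.Random,
                 low: float = 0.5, high: float = 1.5) -> Picture:
    return scale(picture, rng.uniform(low, high))


def random_crop(picture: Picture, rng: random.Random,
                min_fraction: float = 0.5) -> Picture:
    """Cut out a random window covering at least min_fraction of each side."""
    height, width = len(picture), len(picture[0])
    crop_h = rng.randint(max(1, int(height * min_fraction)), height)
    crop_w = rng.randint(max(1, int(width * min_fraction)), width)
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)
    return [row[left:left + crop_w] for row in picture[top:top + crop_h]]


def scale_and_crop(fourth_merged: List[Picture], seed: int = 0) -> List[Picture]:
    rng = random.Random(seed)
    # Scaled copies merged with the input: fifth merged picture set.
    fifth_merged = fourth_merged + [random_scale(p, rng) for p in fourth_merged]
    # Cropped copies merged with the fifth set: second extended picture set.
    return fifth_merged + [random_crop(p, rng) for p in fifth_merged]
```

Passing an explicit `random.Random` instance keeps the augmentation reproducible for a given seed, which can help when comparing training runs.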
S13. Perform a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merge the processed second extended picture set with the second extended picture set to obtain a third extended picture set.
In an embodiment, the third processing operation includes at least one of the following: a contrast processing operation and a color processing operation.
In a specific implementation, performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
performing a contrast transformation operation on each picture in the second extended picture set to obtain a contrast-transformed picture corresponding to each picture in the second extended picture set, and merging the contrast-transformed pictures with the pictures in the second extended picture set to obtain a sixth merged picture set;
performing a color transformation operation on each picture in the sixth merged picture set to obtain a color-transformed picture corresponding to each picture in the sixth merged picture set, and merging the color-transformed pictures with the sixth merged picture set to obtain the third extended picture set.
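The source does not give formulas for the contrast and color transformations, so the sketch below uses two simple, commonly seen choices purely as assumptions: contrast is changed by scaling each channel's distance from mid-grey (128) by a factor `alpha`, and color is changed by multiplying each RGB channel by its own gain. Pictures are rows of `(r, g, b)` tuples, and all names and default values are hypothetical.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]
ColorPicture = List[List[Pixel]]  # rows of (r, g, b) pixels, each channel 0..255


def clamp(value: float) -> int:
    """Round to the nearest integer and limit the result to 0..255."""
    return min(255, max(0, round(value)))


def adjust_contrast(picture: ColorPicture, alpha: float) -> ColorPicture:
    """Scale each channel's distance from mid-grey by alpha (assumed formula)."""
    return [[tuple(clamp(128 + alpha * (c - 128)) for c in pixel) for pixel in row]
            for row in picture]


def adjust_color(picture: ColorPicture, gains: Sequence[float]) -> ColorPicture:
    """Multiply the R, G and B channels by separate gains (assumed formula)."""
    return [[tuple(clamp(c * g) for c, g in zip(pixel, gains)) for pixel in row]
            for row in picture]


def third_extended_set(second_extended: List[ColorPicture], alpha: float = 1.5,
                       gains: Sequence[float] = (1.1, 1.0, 0.9)) -> List[ColorPicture]:
    # Contrast-transformed copies merged with the input: sixth merged picture set.
    sixth_merged = second_extended + [adjust_contrast(p, alpha) for p in second_extended]
    # Color-transformed copies merged with the sixth set: third extended picture set.
    return sixth_merged + [adjust_color(p, gains) for p in sixth_merged]
```

Each of the two merges doubles the set, so this stage turns every picture of the second extended picture set into four pictures.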
S14. Take the third extended picture set as the enhanced picture set.
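Steps S10 to S14 share one pattern: process every picture in the current set, then merge the processed pictures with the set they came from. A minimal sketch of that skeleton follows. The helper names `expand`, `enhance`, and `add` are hypothetical, and the brightness-offset operations are placeholders standing in for the blur, geometric, and color operations of the method.

```python
from typing import Callable, List, Sequence

Picture = List[List[int]]  # grayscale picture, rows of values in 0..255
Operation = Callable[[Picture], Picture]


def expand(pictures: Sequence[Picture], operations: Sequence[Operation]) -> List[Picture]:
    """After each operation, merge the processed pictures with their input set."""
    current = list(pictures)
    for operation in operations:
        processed = [operation(p) for p in current]
        current = current + processed
    return current


def enhance(original: Picture, first_ops: Sequence[Operation],
            second_ops: Sequence[Operation],
            third_ops: Sequence[Operation]) -> List[Picture]:
    first_set = expand([original], first_ops)    # S11: first extended picture set
    second_set = expand(first_set, second_ops)   # S12: second extended picture set
    third_set = expand(second_set, third_ops)    # S13: third extended picture set
    return third_set                             # S14: enhanced picture set


def add(delta: int) -> Operation:
    """Placeholder operation (illustration only): brighten every pixel by delta."""
    def operation(picture: Picture) -> Picture:
        return [[min(255, max(0, v + delta)) for v in row] for row in picture]
    return operation


enhanced = enhance([[0, 64], [128, 255]], [add(10)], [add(20)], [add(40)])
print(len(enhanced))  # 8
```

With one operation per stage the set doubles three times, from 1 picture to 8; using several operations per stage, as in the specific implementations above, grows the set correspondingly faster.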
This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing and noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation and a color processing operation; and takes the third extended picture set as the enhanced picture set.
This application uses eight data enhancement methods: blur, Gaussian noise, horizontal and vertical movement, picture rotation, random scaling, contrast transformation, color transformation, and random cropping. These methods increase the diversity of the data to a certain extent, and mixing them (for example, randomly selecting any one or several of the methods, or stacking several methods in different orders) can generate a wide variety of outputs. The data is thereby enhanced, making the training samples more sufficient.
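Mixing the methods by random selection and random ordering can be sketched as follows. The function `random_pipeline`, the `max_ops` limit, and the stand-in operations are hypothetical; each stand-in only records its own name so that the chosen order is visible, where a real pipeline would apply the corresponding picture operation.

```python
import random
from typing import Callable, Dict, List, Tuple

METHODS = ["blur", "gaussian_noise", "shift", "rotate",
           "random_scale", "contrast", "color", "random_crop"]


def make_stand_in(name: str) -> Callable[[List[str]], List[str]]:
    """Stand-in for a real picture operation: appends its name to a trace."""
    def operation(trace: List[str]) -> List[str]:
        return trace + [name]
    return operation


OPERATIONS: Dict[str, Callable] = {name: make_stand_in(name) for name in METHODS}


def random_pipeline(operations: Dict[str, Callable], rng: random.Random,
                    max_ops: int = 3) -> Tuple[List[str], Callable]:
    """Pick 1..max_ops distinct operations and chain them in a random order."""
    count = rng.randint(1, min(max_ops, len(operations)))
    chosen = rng.sample(list(operations), count)  # random subset, random order

    def apply(picture):
        for name in chosen:
            picture = operations[name](picture)
        return picture

    return chosen, apply


names, pipeline = random_pipeline(OPERATIONS, random.Random(0))
print(names, pipeline([]))
```

Drawing a fresh pipeline for every training picture yields a different combination and order of the eight methods each time, which is one way to obtain the variety of outputs described above.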
This application also provides a data enhancement device. FIG. 2 is a schematic diagram of the internal structure of a data enhancement device provided by an embodiment of this application.
In this embodiment, the data enhancement device 1 may be a personal computer (PC), or a terminal device such as a smartphone, a tablet computer, or a portable computer. The data enhancement device 1 includes at least a memory 11, a processor 12, a communication bus 13, and a network interface 14.
The memory 11 includes at least one type of readable storage medium, such as flash memory, a hard disk, a multimedia card, card-type memory (for example, SD or DX memory), magnetic memory, a magnetic disk, or an optical disk. In some embodiments, the memory 11 may be an internal storage unit of the data enhancement device 1, for example the hard disk of the data enhancement device 1. In other embodiments, the memory 11 may be an external storage device of the data enhancement device 1, for example a plug-in hard disk, a smart media card (SMC), a secure digital (SD) card, or a flash card fitted to the data enhancement device 1. Further, the memory 11 may include both an internal storage unit and an external storage device of the data enhancement device 1. The memory 11 can be used not only to store application software installed on the data enhancement device 1 and various types of data, such as the code of the data enhancement program 01, but also to temporarily store data that has been output or is to be output.
In some embodiments, the processor 12 may be a central processing unit (CPU), a controller, a microcontroller, a microprocessor, or another data processing chip, and is used to run the program code stored in the memory 11 or to process data, for example to execute the data enhancement program 01.
The communication bus 13 is used to implement connection and communication between these components.
The network interface 14 may optionally include a standard wired interface and a wireless interface (such as a WI-FI interface), and is usually used to establish a communication connection between the device 1 and other electronic devices.
Optionally, the device 1 may also include a user interface. The user interface may include a display and an input unit such as a keyboard, and may optionally also include a standard wired interface and a wireless interface. Optionally, in some embodiments, the display may be an LED display, a liquid crystal display, a touch-sensitive liquid crystal display, an organic light-emitting diode (OLED) touch device, or the like. The display may also be referred to as a display screen or display unit, and is used to display the information processed in the data enhancement device 1 and to display a visual user interface.
FIG. 2 shows only the data enhancement device 1 with the components 11-14 and the data enhancement program 01. Those skilled in the art will understand that the structure shown in FIG. 2 does not limit the data enhancement device 1, which may include fewer or more components than shown, a combination of certain components, or a different arrangement of components.
In the embodiment of the device 1 shown in FIG. 2, the data enhancement program 01 is stored in the memory 11; when the processor 12 executes the data enhancement program 01 stored in the memory 11, the following steps are implemented:
Obtain an original picture.
In this embodiment, the original picture is a picture from the training set. The data enhancement method can be executed for every picture in the training set to obtain pictures in multiple styles corresponding to each training picture, which increases the amount of training data and improves the accuracy of the trained model.
Perform a first processing operation on the original picture to obtain a first processed picture, and merge the first processed picture with the original picture to obtain a first extended picture set.
In an embodiment, the first processing operation includes at least one of the following: blur processing and noise processing.
In a specific implementation, performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
performing mean blur processing on the original picture with each of several different window sizes to obtain a picture corresponding to each window size, and merging the pictures corresponding to the window sizes with the original picture to obtain a first merged picture set;
performing Gaussian blur processing on each picture in the first merged picture set to obtain a Gaussian-blurred picture corresponding to each picture in the first merged picture set, and merging the Gaussian-blurred pictures with the first merged picture set to obtain a second merged picture set;
performing Gaussian noise processing on each picture in the second merged picture set to obtain a Gaussian-noise-processed picture corresponding to each picture in the second merged picture set, and merging the Gaussian-noise-processed pictures with the second merged picture set to obtain the first extended picture set.
Mean blurring the original picture with several different window sizes yields one picture per window size; varying the window size therefore produces different pictures, thereby achieving data enhancement.
Perform a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merge the processed first extended picture set with the first extended picture set to obtain a second extended picture set.
In an embodiment, the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation.
In a specific implementation, performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
performing filling processing on the fourth merged picture set to obtain the second extended picture set.
In this embodiment, a coordinate translation operation is performed before the image is rotated, which avoids loss of information. After the image is rotated, many hole points appear. These hole points must be filled, otherwise the picture quality is poor; this operation is also commonly called interpolation processing.
Specifically, the moving operation includes moving the picture in the horizontal and vertical directions. Here, a value is randomly selected for the x coordinate and for the y coordinate (each value can be positive or negative, and its sign indicates the direction of movement), and the x and y coordinates are then translated accordingly. There are two variants of this operation. In the first, the image size does not change, so part of the original picture ends up outside the image. In the second, the image size changes, which preserves the full content of the original picture. The formula is as follows:
$$\begin{bmatrix} x \\ y \\ 1 \end{bmatrix} = \begin{bmatrix} 1 & 0 & dx \\ 0 & 1 & dy \\ 0 & 0 & 1 \end{bmatrix} \begin{bmatrix} x_0 \\ y_0 \\ 1 \end{bmatrix}$$
where the vector $[x_0\ y_0\ 1]^T$ holds the coordinates in the original picture and $[x\ y\ 1]^T$ holds the coordinates after translation. Different values of the parameters $dx$ and $dy$ are set to move the image in the horizontal and vertical directions. The previously enhanced data and the pictures processed in this step are merged, thereby achieving data enhancement.
In an embodiment, the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
Perform a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merge the processed second extended picture set with the second extended picture set to obtain a third extended picture set.
In an embodiment, the third processing operation includes at least one of the following: a contrast processing operation and a color processing operation.
In a specific implementation, performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
performing a contrast transformation operation on each picture in the second extended picture set to obtain a contrast-transformed picture corresponding to each picture in the second extended picture set, and merging the contrast-transformed pictures with the pictures in the second extended picture set to obtain a sixth merged picture set;
performing a color transformation operation on each picture in the sixth merged picture set to obtain a color-transformed picture corresponding to each picture in the sixth merged picture set, and merging the color-transformed pictures with the sixth merged picture set to obtain the third extended picture set.
Take the third extended picture set as the enhanced picture set.
This application obtains an original picture; performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing and noise processing; performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, and a filling operation; performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation and a color processing operation; and takes the third extended picture set as the enhanced picture set.
This application uses eight data enhancement methods: blur, Gaussian noise, horizontal and vertical movement, picture rotation, random scaling, contrast transformation, color transformation, and random cropping. These methods increase the diversity of the data to a certain extent, and mixing them (for example, randomly selecting any one or several of the methods, or stacking several methods in different orders) can generate a wide variety of outputs. The data is thereby enhanced, making the training samples more sufficient.
Optionally, in other embodiments, the data enhancement program may also be divided into one or more modules, which are stored in the memory 11 and executed by one or more processors (the processor 12 in this embodiment) to complete this application. A module in this application refers to a series of computer program instruction segments capable of performing a specific function, and is used to describe the execution process of the data enhancement program in the data enhancement device.
For example, FIG. 3 is a schematic diagram of the program modules of the data enhancement program in an embodiment of the data enhancement device of this application. In this embodiment, the data enhancement program may be divided into an acquisition module 10, a processing module 20, and a determination module 30. Exemplarily:
The acquisition module 10 obtains the original picture;
The processing module 20 performs a first processing operation on the original picture to obtain a first processed picture, and merges the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing;
The processing module 20 performs a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merges the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
The processing module 20 performs a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merges the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
The determination module 30 uses the third extended picture set as the enhanced picture set.
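The division into an acquisition module, a processing module, and a determination module might be organized as in the following sketch. It is an illustration under assumptions rather than the program of this application: the class names, method names, and the helper `run_modules` are hypothetical, pictures are grayscale nested lists, and the three toy operations (`smooth_rows`, `shift_right`, `invert`) merely stand in for one operation of each stage.

```python
class AcquisitionModule:
    """Hypothetical counterpart of acquisition module 10."""

    def __init__(self, source):
        self._source = source

    def acquire(self):
        # Return a copy so later processing cannot alter the source picture.
        return [row[:] for row in self._source]


class ProcessingModule:
    """Hypothetical counterpart of processing module 20."""

    def __init__(self, first_ops, second_ops, third_ops):
        self._stages = (first_ops, second_ops, third_ops)

    @staticmethod
    def _expand(pictures, ops):
        # After each operation, merge the processed pictures with the input set.
        for op in ops:
            pictures = pictures + [op(p) for p in pictures]
        return pictures

    def process(self, original):
        pictures = [original]
        for ops in self._stages:
            pictures = self._expand(pictures, ops)
        return pictures


class DeterminationModule:
    """Hypothetical counterpart of determination module 30."""

    def determine(self, third_extended_set):
        # The third extended picture set is used as the enhanced picture set.
        return list(third_extended_set)


def smooth_rows(pic):
    """Toy blur: average each pixel with its horizontal neighbours."""
    out = []
    for row in pic:
        new_row = []
        for i in range(len(row)):
            window = row[max(0, i - 1):i + 2]
            new_row.append(sum(window) // len(window))
        out.append(new_row)
    return out


def shift_right(pic, fill=0):
    """Toy moving operation: shift one pixel right and fill the gap."""
    return [[fill] + row[:-1] for row in pic]


def invert(pic):
    """Toy color operation: invert every pixel value."""
    return [[255 - v for v in row] for row in pic]


def run_modules(acquisition, processing, determination):
    """Chain the three modules in the order described above."""
    original = acquisition.acquire()
    return determination.determine(processing.process(original))


if __name__ == "__main__":
    enhanced = run_modules(
        AcquisitionModule([[10, 20, 30], [40, 50, 60]]),
        ProcessingModule([smooth_rows], [shift_right], [invert]),
        DeterminationModule(),
    )
    print(len(enhanced))
```

Passing the operation lists into the processing module keeps the choice of operations for each stage configurable without changing the module structure.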
The functions or operation steps implemented when the program modules such as the acquisition module 10, the processing module 20, and the determination module 30 are executed are substantially the same as those in the foregoing embodiments, and are not repeated here.
In addition, an embodiment of this application also provides a computer-readable storage medium on which a data enhancement program is stored, and the data enhancement program can be executed by one or more processors to implement the following operations:
obtaining an original picture;
performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing;
performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
using the third extended picture set as the enhanced picture set.
The specific implementation of the computer-readable storage medium of this application is basically the same as the embodiments of the data enhancement device and method described above, and is not repeated here.
It should be noted that the serial numbers of the above embodiments of this application are for description only and do not represent the relative merits of the embodiments. The terms "include", "comprise", or any other variant thereof in this document are intended to cover non-exclusive inclusion, so that a process, device, article, or method that includes a series of elements includes not only those elements but also other elements not explicitly listed, or elements inherent to that process, device, article, or method. Without further limitation, an element defined by the phrase "including a..." does not exclude the existence of other identical elements in the process, device, article, or method that includes the element.
From the description of the above embodiments, those skilled in the art can clearly understand that the methods of the above embodiments can be implemented by software plus the necessary general-purpose hardware platform, and can of course also be implemented by hardware, although in many cases the former is the better implementation. Based on this understanding, the technical solution of this application, in essence or in the part that contributes to the prior art, can be embodied in the form of a software product. The computer software product is stored in a storage medium as described above (such as a ROM/RAM, a magnetic disk, or an optical disc) and includes several instructions that cause a terminal device (which may be a mobile phone, a computer, a server, a network device, or the like) to execute the methods described in the embodiments of this application.
The above are only preferred embodiments of this application and do not thereby limit the patent scope of this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of this application, or any direct or indirect application in other related technical fields, is likewise included in the scope of patent protection of this application.

Claims (20)

  1. A data enhancement method, characterized in that the method includes:
    obtaining an original picture;
    performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing;
    performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
    performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
    using the third extended picture set as the enhanced picture set.
  2. The data enhancement method according to claim 1, characterized in that performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
    performing mean blur processing on the original picture using different window sizes respectively to obtain a picture corresponding to each window size, and merging the pictures corresponding to the window sizes with the original picture to obtain a first merged picture set;
    performing Gaussian blur processing on each picture in the first merged picture set to obtain a Gaussian-blurred picture corresponding to each picture in the first merged picture set, and merging the Gaussian-blurred pictures with the first merged picture set to obtain a second merged picture set;
    performing Gaussian noise processing on each picture in the second merged picture set to obtain a Gaussian-noise-processed picture corresponding to each picture in the second merged picture set, and merging the Gaussian-noise-processed pictures with the second merged picture set to obtain the first extended picture set.
  3. The data enhancement method according to claim 1, characterized in that performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
    performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
  4. The data enhancement method according to claim 2, characterized in that performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
    performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
  5. The data enhancement method according to claim 3, characterized in that the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
    performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
    performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
  6. The data enhancement method according to claim 4, characterized in that the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
    performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
    performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
  7. The data enhancement method according to claim 1, characterized in that performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
    performing a contrast transformation operation on each picture in the second extended picture set to obtain a contrast-transformed picture corresponding to each picture in the second extended picture set, and merging the contrast-transformed pictures with the pictures in the second extended picture set to obtain a sixth merged picture set;
    performing a color transformation operation on each picture in the sixth merged picture set to obtain a color-transformed picture corresponding to each picture in the sixth merged picture set, and merging the color-transformed pictures with the sixth merged picture set to obtain the third extended picture set.
  8. A data enhancement device, characterized in that the device includes a memory and a processor, the memory stores a data enhancement program that can run on the processor, and the data enhancement program, when executed by the processor, implements the following steps:
    obtaining an original picture;
    performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing;
    performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
    performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
    using the third extended picture set as the enhanced picture set.
  9. The data enhancement device according to claim 8, characterized in that performing the first processing operation on the original picture to obtain the first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, includes:
    performing mean blur processing on the original picture using different window sizes respectively to obtain a picture corresponding to each window size, and merging the pictures corresponding to the window sizes with the original picture to obtain a first merged picture set;
    performing Gaussian blur processing on each picture in the first merged picture set to obtain a Gaussian-blurred picture corresponding to each picture in the first merged picture set, and merging the Gaussian-blurred pictures with the first merged picture set to obtain a second merged picture set;
    performing Gaussian noise processing on each picture in the second merged picture set to obtain a Gaussian-noise-processed picture corresponding to each picture in the second merged picture set, and merging the Gaussian-noise-processed pictures with the second merged picture set to obtain the first extended picture set.
  10. The data enhancement device according to claim 8, characterized in that performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
    performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
  11. The data enhancement device according to claim 9, characterized in that performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, includes:
    performing moving processing on each picture in the first extended picture set to obtain a moved picture corresponding to each picture in the first extended picture set, and merging the moved pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture corresponding to each picture in the third merged picture set, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set;
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
  12. The data enhancement device according to claim 10, characterized in that the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
    performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
    performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
  13. The data enhancement device according to claim 11, characterized in that the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further includes:
    performing a random scaling operation on each picture in the fourth merged picture set to obtain a scaled picture corresponding to each picture in the fourth merged picture set, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set;
    performing a random cropping operation on each picture in the fifth merged picture set to obtain a cropped picture corresponding to each picture in the fifth merged picture set, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
  14. The data enhancement device according to claim 8, characterized in that performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, includes:
    performing a contrast transformation operation on each picture in the second extended picture set to obtain a contrast-transformed picture corresponding to each picture in the second extended picture set, and merging the contrast-transformed pictures with the pictures in the second extended picture set to obtain a sixth merged picture set;
    performing a color transformation operation on each picture in the sixth merged picture set to obtain a color-transformed picture corresponding to each picture in the sixth merged picture set, and merging the color-transformed pictures with the sixth merged picture set to obtain the third extended picture set.
  15. A computer-readable storage medium, characterized in that a data enhancement program is stored on the computer-readable storage medium, and the data enhancement program can be executed by one or more processors to implement the following steps:
    obtaining an original picture;
    performing a first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain a first extended picture set, wherein the first processing operation includes at least one of the following: blur processing, noise processing;
    performing a second processing operation on the first extended picture set to obtain a processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain a second extended picture set, wherein the second processing operation includes at least one of the following: a moving operation, a rotating operation, a filling operation;
    performing a third processing operation on the second extended picture set to obtain a processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain a third extended picture set, wherein the third processing operation includes at least one of the following: a contrast processing operation, a color processing operation;
    using the third extended picture set as the enhanced picture set.
  16. The computer-readable storage medium according to claim 15, wherein performing the first processing operation on the original picture to obtain a first processed picture, and merging the first processed picture with the original picture to obtain the first extended picture set, comprises:
    applying mean blurring to the original picture with each of several different window sizes to obtain a picture for each window size, and merging the pictures for the respective window sizes with the original picture to obtain a first merged picture set;
    applying Gaussian blurring to each picture in the first merged picture set to obtain a Gaussian-blurred picture for each picture, and merging the Gaussian-blurred pictures with the first merged picture set to obtain a second merged picture set; and
    applying Gaussian noise to each picture in the second merged picture set to obtain a noise-processed picture for each picture, and merging the noise-processed pictures with the second merged picture set to obtain the first extended picture set.
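The three steps of claim 16 can be sketched in plain Python as below. The window sizes, the Gaussian kernel size and sigma, the noise level, the border handling (clamping), and all function names are assumptions chosen for illustration; the claim itself does not fix any of them.

```python
import math
import random
from typing import List, Sequence

Image = List[List[float]]  # assumed: grayscale picture, values in 0..255


def _convolve(img: Image, kernel: List[List[float]]) -> Image:
    """Convolve with an odd-sized square kernel, clamping coordinates at the borders."""
    h, w, r = len(img), len(img[0]), len(kernel) // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    acc += img[yy][xx] * kernel[dy + r][dx + r]
            out[y][x] = acc
    return out


def mean_blur(img: Image, size: int) -> Image:
    """Mean (box) blur with an odd window size."""
    return _convolve(img, [[1.0 / (size * size)] * size for _ in range(size)])


def gaussian_blur(img: Image, size: int = 3, sigma: float = 1.0) -> Image:
    """Gaussian blur with a normalised size x size kernel."""
    r = size // 2
    k = [[math.exp(-(dx * dx + dy * dy) / (2 * sigma * sigma))
          for dx in range(-r, r + 1)] for dy in range(-r, r + 1)]
    total = sum(map(sum, k))
    return _convolve(img, [[v / total for v in row] for row in k])


def gaussian_noise(img: Image, rng: random.Random, sigma: float = 10.0) -> Image:
    """Add zero-mean Gaussian noise and clip to the valid pixel range."""
    return [[min(255.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in row]
            for row in img]


def first_extended_set(original: Image, window_sizes: Sequence[int] = (3, 5),
                       seed: int = 0) -> List[Image]:
    rng = random.Random(seed)
    merged1 = [original] + [mean_blur(original, s) for s in window_sizes]
    merged2 = merged1 + [gaussian_blur(p) for p in merged1]
    return merged2 + [gaussian_noise(p, rng) for p in merged2]
```

With k window sizes, the first merged set holds k + 1 pictures, the second 2(k + 1), and the first extended set 4(k + 1); the two default window sizes in this sketch therefore yield 12 pictures from one original.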
  17. The computer-readable storage medium according to claim 15, wherein performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, comprises:
    shifting each picture in the first extended picture set to obtain a shifted picture for each picture, and merging the shifted pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture for each picture, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set; and
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
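The shift, rotate, and fill sequence can be sketched as follows. The claim does not say what the filling step fills; this sketch assumes it assigns a constant value to the pixels left empty by shifting and rotating, and marks those pixels with `None` until then. The offsets, the angle, the fill value, nearest-neighbour sampling, and the function names are likewise illustrative assumptions.

```python
import math
from typing import List, Optional

# Assumed representation: None marks a pixel vacated by a shift or rotation.
Image = List[List[Optional[float]]]


def shift(img: Image, dx: int, dy: int) -> Image:
    """Move the content dx pixels right and dy pixels down, leaving gaps as None."""
    h, w = len(img), len(img[0])
    out: Image = [[None] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sx, sy = x - dx, y - dy
            if 0 <= sx < w and 0 <= sy < h:
                out[y][x] = img[sy][sx]
    return out


def rotate(img: Image, degrees: float) -> Image:
    """Rotate about the picture centre using nearest-neighbour inverse mapping."""
    h, w = len(img), len(img[0])
    cx, cy = (w - 1) / 2.0, (h - 1) / 2.0
    c, s = math.cos(math.radians(degrees)), math.sin(math.radians(degrees))
    out: Image = [[None] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            # Map each output pixel back to the source pixel it comes from.
            sx = round(c * (x - cx) + s * (y - cy) + cx)
            sy = round(-s * (x - cx) + c * (y - cy) + cy)
            if 0 <= sx < w and 0 <= sy < h:
                out[y][x] = img[sy][sx]
    return out


def fill(img: Image, value: float = 255.0) -> Image:
    """Replace every vacated pixel with a constant fill value."""
    return [[value if p is None else p for p in row] for row in img]


def second_extended_set(first_set: List[Image], dx: int = 2, dy: int = 1,
                        degrees: float = 5.0) -> List[Image]:
    merged3 = list(first_set) + [shift(p, dx, dy) for p in first_set]
    merged4 = merged3 + [rotate(p, degrees) for p in merged3]
    return [fill(p) for p in merged4]
```

Note that, as claimed, the filling step is applied to the fourth merged picture set without a further merge, so in this sketch the set grows by a factor of four rather than eight.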
  18. The computer-readable storage medium according to claim 16, wherein performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, comprises:
    shifting each picture in the first extended picture set to obtain a shifted picture for each picture, and merging the shifted pictures with the pictures in the first extended picture set to obtain a third merged picture set;
    rotating each picture in the third merged picture set by a preset angle to obtain a rotated picture for each picture, and merging the rotated pictures with the third merged picture set to obtain a fourth merged picture set; and
    performing filling processing on the fourth merged picture set to obtain the second extended picture set.
  19. The computer-readable storage medium according to claim 17 or 18, wherein the second processing operation further includes a scaling operation and a cropping operation, and performing the second processing operation on the first extended picture set to obtain the processed first extended picture set, and merging the processed first extended picture set with the first extended picture set to obtain the second extended picture set, further comprises:
    randomly scaling each picture in the fourth merged picture set to obtain a scaled picture for each picture, and merging the scaled pictures with the fourth merged picture set to obtain a fifth merged picture set; and
    randomly cropping each picture in the fifth merged picture set to obtain a cropped picture for each picture, and merging the cropped pictures with the fifth merged picture set to obtain the second extended picture set.
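A minimal sketch of the random scaling and random cropping steps is given below. The scale range, the minimum crop fraction, nearest-neighbour resampling, the seeded generator, and the function names are assumptions for illustration only.

```python
import random
from typing import List

Image = List[List[float]]  # assumed: grayscale picture as rows of pixel values


def random_scale(img: Image, rng: random.Random,
                 low: float = 0.8, high: float = 1.2) -> Image:
    """Resize by a random factor in [low, high] using nearest-neighbour sampling."""
    h, w = len(img), len(img[0])
    f = rng.uniform(low, high)
    nh, nw = max(1, round(h * f)), max(1, round(w * f))
    return [[img[min(h - 1, int(y * h / nh))][min(w - 1, int(x * w / nw))]
             for x in range(nw)] for y in range(nh)]


def random_crop(img: Image, rng: random.Random, min_frac: float = 0.8) -> Image:
    """Cut out a random window covering at least min_frac of each dimension."""
    h, w = len(img), len(img[0])
    ch = rng.randint(max(1, int(h * min_frac)), h)
    cw = rng.randint(max(1, int(w * min_frac)), w)
    top, left = rng.randint(0, h - ch), rng.randint(0, w - cw)
    return [row[left:left + cw] for row in img[top:top + ch]]


def scale_and_crop(merged4: List[Image], seed: int = 0) -> List[Image]:
    rng = random.Random(seed)
    merged5 = list(merged4) + [random_scale(p, rng) for p in merged4]
    return merged5 + [random_crop(p, rng) for p in merged5]
```

Because both steps merge their output with their input, the set again grows fourfold. If the pictures carry field-position annotations for OCR training, those coordinates would presumably need the same scale and crop offsets applied; the sketch does not cover that.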
  20. The computer-readable storage medium according to claim 16, wherein performing the third processing operation on the second extended picture set to obtain the processed second extended picture set, and merging the processed second extended picture set with the second extended picture set to obtain the third extended picture set, comprises:
    applying a contrast transformation to each picture in the second extended picture set to obtain a contrast-transformed picture for each picture, and merging the contrast-transformed pictures with the pictures in the second extended picture set to obtain a sixth merged picture set; and
    applying a color transformation to each picture in the sixth merged picture set to obtain a color-transformed picture for each picture, and merging the color-transformed pictures with the sixth merged picture set to obtain the third extended picture set.
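The contrast and color steps can be sketched as below. The claim does not specify either transformation, so this example assumes a linear contrast stretch around mid-gray and a per-channel gain as the color change; the factor, the gains, the RGB-tuple representation, and the function names are all hypothetical.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]  # assumed: 8-bit RGB
Image = List[List[Pixel]]


def _clip(v: float) -> int:
    """Round to the nearest integer and clamp to the 0..255 range."""
    return max(0, min(255, int(round(v))))


def adjust_contrast(img: Image, factor: float = 1.5) -> Image:
    """Scale each channel's distance from mid-gray (128) by factor."""
    return [[tuple(_clip(128 + factor * (c - 128)) for c in px) for px in row]
            for row in img]


def shift_color(img: Image, gains: Sequence[float] = (1.1, 1.0, 0.9)) -> Image:
    """Multiply the R, G and B channels by separate gains."""
    return [[tuple(_clip(c * g) for c, g in zip(px, gains)) for px in row]
            for row in img]


def third_extended_set(second_set: List[Image]) -> List[Image]:
    merged6 = list(second_set) + [adjust_contrast(p) for p in second_set]
    return merged6 + [shift_color(p) for p in merged6]
```

Under these assumptions the third stage multiplies the size of the second extended picture set by four.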
PCT/CN2019/102195 2019-05-16 2019-08-23 Data enhancement method and device, and computer readable storage medium WO2020228171A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201910416659.8A CN110287988B (en) 2019-05-16 2019-05-16 Data enhancement method, device and computer readable storage medium
CN201910416659.8 2019-05-16

Publications (1)

Publication Number Publication Date
WO2020228171A1 (en) 2020-11-19

Family

ID=68002361

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/102195 WO2020228171A1 (en) 2019-05-16 2019-08-23 Data enhancement method and device, and computer readable storage medium

Country Status (2)

Country Link
CN (1) CN110287988B (en)
WO (1) WO2020228171A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113807335A (en) * 2021-01-05 2021-12-17 北京沃东天骏信息技术有限公司 Method, device and storage medium for data enhancement

Families Citing this family (1)

Publication number Priority date Publication date Assignee Title
CN111860387B (en) * 2020-07-27 2023-08-25 平安科技(深圳)有限公司 Method, device and computer equipment for expanding data

Citations (5)

Publication number Priority date Publication date Assignee Title
CN101615245A (en) * 2009-07-30 2009-12-30 上海交通大学 Expression recognition method based on AVR and enhancing LBP
WO2014062508A1 (en) * 2012-10-15 2014-04-24 Google Inc. Near duplicate images
CN107133943A (en) * 2017-04-26 2017-09-05 贵州电网有限责任公司输电运行检修分公司 A kind of visible detection method of stockbridge damper defects detection
CN108388833A (en) * 2018-01-15 2018-08-10 阿里巴巴集团控股有限公司 A kind of image-recognizing method, device and equipment
CN109325532A (en) * 2018-09-18 2019-02-12 成都网阔信息技术股份有限公司 The image processing method of EDS extended data set under a kind of small sample

Family Cites Families (4)

Publication number Priority date Publication date Assignee Title
JPH08190626A (en) * 1995-01-12 1996-07-23 Hitachi Ltd Data processor
US20060013503A1 (en) * 2004-07-16 2006-01-19 Samsung Electronics Co., Ltd. Methods of preventing noise boost in image contrast enhancement
US9412162B2 (en) * 2013-08-21 2016-08-09 Sectra Ab Methods, systems and circuits for generating magnification-dependent images suitable for whole slide images
CN109583302B (en) * 2018-10-29 2020-09-18 杭州电子科技大学 Mobile robot feasible area training data set expansion method

Also Published As

Publication number Publication date
CN110287988B (en) 2024-03-22
CN110287988A (en) 2019-09-27

Similar Documents

Publication Publication Date Title
US11645826B2 (en) Generating searchable text for documents portrayed in a repository of digital images utilizing orientation and text prediction neural networks
WO2020199468A1 (en) Image classification method and device, and computer readable storage medium
US9767359B2 (en) Method for recognizing a specific object inside an image and electronic device thereof
WO2018205467A1 (en) Automobile damage part recognition method, system and electronic device and storage medium
US10929597B2 (en) Techniques and systems for storing and protecting signatures and images in electronic documents
US9104261B2 (en) Method and apparatus for notification of input environment
US9183452B2 (en) Text recognition for textually sparse images
CN110136198B (en) Image processing method, apparatus, device and storage medium thereof
WO2021012494A1 (en) Deep learning-based face recognition method and apparatus, and computer-readable storage medium
WO2021147219A1 (en) Image-based text recognition method and apparatus, electronic device, and storage medium
WO2023035531A1 (en) Super-resolution reconstruction method for text image and related device thereof
US10839244B2 (en) Method and device for generating image
US11341605B1 (en) Document rectification via homography recovery using machine learning
US20190155883A1 (en) Apparatus, method and computer program product for recovering editable slide
WO2020228171A1 (en) Data enhancement method and device, and computer readable storage medium
CN110751146A (en) Text region detection method, text region detection device, electronic terminal and computer-readable storage medium
WO2017107855A1 (en) Picture searching method and device
JP7389824B2 (en) Object identification method and device, electronic equipment and storage medium
CN112749695A (en) Text recognition method and device
US10049268B2 (en) Selective, user-mediated content recognition using mobile devices
CN113741763A (en) Electronic book display method and device and electronic equipment
CN112686243A (en) Method and device for intelligently identifying picture characters, computer equipment and storage medium
CN113012075A (en) Image correction method and device, computer equipment and storage medium
US9665963B1 (en) Dynamic collage layout generation
CN110210505A (en) Generation method, device and the electronic equipment of sample data

Legal Events

Date Code Title Description
121 Ep: the EPO has been informed by WIPO that EP was designated in this application
    Ref document number: 19928715; Country of ref document: EP; Kind code of ref document: A1

NENP Non-entry into the national phase
    Ref country code: DE

122 Ep: PCT application non-entry in European phase
    Ref document number: 19928715; Country of ref document: EP; Kind code of ref document: A1