CN113744167A

CN113744167A - Image data conversion method and device

Info

Publication number: CN113744167A
Application number: CN202111027726.0A
Authority: CN
Inventors: 陈扬; 周铭柯; 李志阳; 李启东; 邹嘉伟
Original assignee: Xiamen Meitu Technology Co Ltd
Current assignee: Xiamen Meitu Technology Co Ltd
Priority date: 2021-09-02
Filing date: 2021-09-02
Publication date: 2021-12-03
Anticipated expiration: 2041-09-02
Also published as: CN113744167B

Abstract

The invention provides an image data conversion method and device, relating to the technical field of image processing, wherein the method comprises the following steps: acquiring image data to be processed; the image data to be processed is data in a Raw format; preprocessing image data to be processed; inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model; the method and the device have the advantages that the image data conversion model is obtained based on sample image data training, the image data after conversion and enhancement are converted into the universal color standard image, the conversion from the Raw format image data to the sRGB image is completed, the image effect is automatically enhanced in the conversion process, the method and the device can adapt to the Raw format image data acquired by various digital sensor devices, effective image conversion and effect enhancement are completed, and the contrast, the saturation, the exposure effect and the like of the image are greatly improved compared with the traditional method.

Description

Image data conversion method and device

Technical Field

The present invention relates to the field of image processing technologies, and in particular, to a method and an apparatus for converting image data.

Background

Raw Image Format (Raw Image Format) images are Raw data obtained when an Image is captured by a photographing apparatus such as a single lens reflex camera, and Raw information of a digital sensor in the apparatus and metadata generated when the Image is captured are recorded. Different digital sensor devices have different schemes for converting Raw format data into universal color standard (sRGB) type data, but the conversion effect is unstable, and only Raw data can be converted, and images are not enhanced.

Various kinds of photographic application software (App) on the market currently have Internet Service Provider (ISP) processing flows capable of simulating digital sensors to convert the Raw format images, but there is no method capable of converting the Raw format images into sRGB images and enhancing the image effect at the same time on 16-bit data accuracy.

Therefore, it is an important issue to be solved in the industry at present that various Raw format data can be automatically converted and enhanced.

Disclosure of Invention

In view of the above, the present invention provides an image data conversion method and an image data conversion apparatus, so as to solve the defect in the prior art that various Raw format data are automatically converted and enhanced, achieve effective image conversion and effect enhancement, and greatly improve the contrast, saturation, exposure effect, and the like of an image.

Based on the above object, the present invention provides an image data conversion method, comprising the steps of:

acquiring image data to be processed; the image data to be processed is data in a Raw format;

preprocessing the image data to be processed;

inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model; the image data conversion model is obtained based on sample image data training, and the image data after conversion enhancement is a universal color standard image.

Optionally, the preprocessing the image data to be processed specifically includes the following steps:

acquiring a normalization threshold value of a pixel value of each pixel point in the image data to be processed;

obtaining a gray value corresponding to each pixel point according to the normalization threshold; the gray value of the pixel point with the normalized threshold value higher than the preset threshold value is a first gray value and is used as a first mask, and the gray value of the pixel point with the normalized threshold value not higher than the preset threshold value is a second gray value and is used as a second mask;

and reserving the first mask to obtain a single-channel mask image with image highlight region information.

Optionally, the image data conversion model is obtained by training through the following steps:

acquiring the sample image data;

preprocessing the sample image data;

performing gain processing on the preprocessed sample image data; wherein the gain processing comprises one or more of turning, rotating, translating, affine transformation, exposure, contrast adjustment, saturation adjustment and blurring;

and taking the sample image data after the gain processing as input data used for training, and training by adopting a deep learning mode to obtain the image data conversion model for generating the conversion-enhanced image data of the image data to be processed.

Optionally, an input channel of the image data conversion model is five channels, and includes original sample image data, preprocessed sample image data, and a three-channel color standard random color noise map.

Optionally, the three-channel color standard random color noise map is obtained by the following steps:

converting the sample image data to obtain an eight-bit universal color standard image;

carrying out image enhancement on the eight-bit universal color standard image, and adjusting the contrast, exposure and saturation of the image to obtain a three-channel universal color standard image;

and removing color noise from the three-channel general color standard image to obtain a three-channel color standard random color noise image.

Optionally, the image data conversion model adopts a full convolution network model.

The present invention also provides an image data conversion apparatus comprising:

the acquisition module is used for acquiring image data to be processed; the image data to be processed is data in a Raw format;

the preprocessing module is used for preprocessing the image data to be processed;

the conversion enhancement module is used for inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model; the image data conversion model is obtained based on sample image data training, and the image data after conversion enhancement is a universal color standard image.

The invention also provides an electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor implements the steps of the image data conversion method as described in any one of the above when executing the program.

The invention also provides a non-transitory computer-readable storage medium having stored thereon a computer program which, when executed by a processor, carries out the steps of the image data conversion method as any one of the above.

The invention also provides a computer program product comprising a computer program which, when executed by a processor, carries out the steps of the image data conversion method as described in any one of the above.

As can be seen from the above, the image data conversion method and apparatus provided by the present invention complete the conversion from Raw format image data to sRGB image based on Full Convolutional Network (FCN), automatically enhance the image effect during the conversion process, can adapt to Raw format image data acquired by various digital sensor devices, complete effective image conversion and effect enhancement, and greatly improve the contrast, saturation, exposure effect, etc. of the image, which are different from the conventional methods.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.

FIG. 1 is a flow chart illustrating an image data conversion method according to the present invention;

FIG. 2 is a flowchart illustrating a step S200 of the image data conversion method according to the present invention;

FIG. 3 is a schematic flow chart of an image data conversion model training process in the image data conversion method according to the present invention;

FIG. 4 is a flowchart illustrating a step A400 of the image data conversion method according to the present invention;

FIG. 5 is a diagram illustrating unprocessed image data to be processed according to the image data conversion method of the present invention;

FIG. 6 is a diagram of a single-channel mask image with highlight region information for an image according to the image data conversion method of the present invention;

FIG. 7 is a schematic diagram of three-channel RGB random color noise map in the image data conversion method of the present invention;

FIG. 8 is a schematic structural diagram of an image data conversion apparatus according to the present invention;

FIG. 9 is a schematic diagram of a specific structure of a pre-processing module in the image data conversion apparatus according to the present invention;

FIG. 10 is a schematic structural diagram of an image data transformation model training process in the image data transformation method according to the present invention;

FIG. 11 is a schematic diagram illustrating a specific structure of a training module in the image data conversion apparatus according to the present invention;

fig. 12 is a schematic structural diagram of an electronic device provided in the present invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is described in further detail below with reference to specific embodiments and the accompanying drawings.

It is to be noted that technical terms or scientific terms used in the embodiments of the present invention should have the ordinary meanings as understood by those having ordinary skill in the art to which the present disclosure belongs, unless otherwise defined. The use of "first," "second," and similar terms in this disclosure is not intended to indicate any order, quantity, or importance, but rather is used to distinguish one element from another. The word "comprising" or "comprises", and the like, means that the element or item listed before the word covers the element or item listed after the word and its equivalents, but does not exclude other elements or items. The terms "connected" or "coupled" and the like are not restricted to physical or mechanical connections, but may include electrical connections, whether direct or indirect. "upper", "lower", "left", "right", and the like are used merely to indicate relative positional relationships, and when the absolute position of the object being described is changed, the relative positional relationships may also be changed accordingly.

As a preferred embodiment of the present invention, there is provided an image data conversion method including the steps of:

preprocessing the image data to be processed;

By the image data conversion method and the image data conversion device, the conversion from the Raw format image data to the sRGB image is completed based on the FCN, the image effect is automatically enhanced in the conversion process, the method and the device can adapt to the Raw format image data acquired by various digital sensor devices, the effective image conversion and the effect enhancement are completed, and the contrast, the saturation, the exposure effect and the like of the image different from the traditional method are greatly improved.

The following describes preferred embodiments of the image data conversion method and apparatus according to the present invention with reference to the accompanying drawings.

Referring to fig. 1, the present invention provides an image data conversion method, which includes the following steps:

s100, shooting images through various digital sensors, and collecting image data to be processed. The image data to be processed is data in Raw format, including but not limited to CR2, ARW, NEF data.

S200, preprocessing image data to be processed.

In step S200, the purpose of the preprocessing is to obtain a single-channel mask (mask) image highlight region with image highlight region information to protect the details of the highlight region in the original image captured by the data-transfer sensor.

S300, inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model. The image data conversion model is obtained based on sample image data training, and the image data after conversion and enhancement is an sRGB image.

Through step S300, Raw format image data collected by the digital sensor may be converted into an sRGB image, and the saturation, contrast, exposure value, and the like of the image may be enhanced. In this embodiment, the FCN model is used as the image data conversion model, and because a full convolution network model is used, no scaling is required for the input and output images.

According to the traditional method for converting the Raw format image into the eight-bit sRGB image, the finally obtained image has a large amount of color noise, the protection on the highlight area of the image is poor, and the effect of the image is not improved and enhanced. The method is based on FCN to complete the conversion from the Raw format image data to the sRGB image, automatically enhances the image effect in the conversion process, can adapt to the Raw format image data acquired by various digital sensor devices, completes effective image conversion and effect enhancement, and greatly improves the contrast, saturation, exposure effect and the like of the image different from the traditional method.

Referring to fig. 2, step S200 specifically includes the following steps:

s210, acquiring a normalized threshold value of the pixel value of each pixel point in the image data to be processed. It should be noted that the Raw format image data is single-channel grayscale data, and no color information is presented, so the unprocessed image data is a single-channel grayscale map, and the normalized threshold of the single-channel grayscale map without conversion and enhancement processing is (-1, 1).

S220, obtaining a gray value corresponding to each pixel point according to the normalized threshold; the gray value of the pixel point with the normalized threshold value higher than the preset threshold value is a first gray value and is used as a first mask, and the gray value of the pixel point with the normalized threshold value not higher than the preset threshold value is a second gray value and is used as a second mask. In this embodiment, the preset threshold is 0.96, 1 is assigned to the gray value of the pixel point whose normalized threshold is higher than 0.96, and the pixel point is used as the first mask; and assigning 0 to the gray value of the pixel point with the normalized threshold not higher than 0.96, and taking the pixel point as a second mask. In this method, a first mask is used to represent a highlight region in an image.

And S230, reserving the first mask to obtain a single-channel mask image with image highlight region information. The normalized threshold range for a single channel mask image with image highlight region information is (0, 1).

Referring to fig. 3, the image data conversion model is obtained by training the following steps:

and A100, acquiring sample image data. In the step a100, the acquisition mode is synchronized in the step S100, and the sample image data in different scenes can be acquired in consideration of the diversity requirement of the sample image data.

And A200, preprocessing sample image data. Likewise, in the present embodiment, the preprocessing manner in step a200 coincides with the preprocessing manner in step S200.

And A300, in order to obtain a network model with stronger robustness, gain processing is carried out on the preprocessed sample image data so as to add gain to the sample image data. The gain processing comprises one or more of turning, rotating, translating, affine transformation, exposure, contrast adjustment, saturation adjustment and blurring;

and A400, taking the sample image data after the gain processing as input data used for training, and training by adopting a deep learning mode to obtain an image data conversion model for generating the image data to be processed after the conversion enhancement.

The input channel of the image data conversion model is five channels, and the image data conversion model comprises original sample image data, preprocessed sample image data and a three-channel color standard random color noise map.

Referring to fig. 4 to 7, a three-channel color standard random color noise map is obtained by the following steps:

and A410, converting the sample image data to obtain an eight-bit sRGB image.

And A420, performing image enhancement on the eight-bit sRGB image through Photoshop and other software, and adjusting the contrast, exposure, saturation and the like of the image to obtain a three-channel sRGB image.

The image enhancement algorithm needs to interpolate the single-channel image into a three-channel RGB image through a demosaicing algorithm, and the sRGB image is obtained after processing in step S520.

And A430, removing color noise from the three-channel sRGB image to obtain a three-channel RGB random color noise image. The normalized threshold for the three channel RGB random color noise map is in the range (-1, 1).

The three-channel RGB random color noise map is used to improve the effect of the image data conversion model on removing color noise, and the shape of the noise map is as shown in fig. 7, because when the digital sensor captures an image, due to the photosensitive element of the digital sensor, the sensitivity of the digital sensor to a scene is different, color noise with different degrees is generated, and if the color noise is not removed through step a530, the final imaging quality of the image is affected.

Specifically, the image data conversion model is a combination of FCN-Generated Adaptive Networks (GAN) models, as shown in table 1, the image data conversion model adopts a coding and decoding structure, the upsampling of the decoding portion adopts a combination of nearest neighbor upsampling and a convolutional layer, the activation function of an output layer is Tanh, the size of an input image is 512 for example, the negetive slope of LeakyReLU is 0.2, and the discrimination network adopts the Discriminator of multi _ scale to discriminate true and false images with different resolutions respectively. In this embodiment, 3-scale discriminators are used to discriminate 512x512, 256x256, and 128x128 resolution images, and images with different resolutions are directly down-sampled by a Pooling Layer (Pooling Layer), and the optimization algorithm uses Adam to generate a learning rate of 0.0002 and discriminate a learning rate of 0.0001 for the network.

TABLE 1 detailed construction of image data conversion model

The total loss function of the image data transformation model training process can be expressed as:

where α, β, γ, μ, and σ represent weights corresponding to loss functions in the image data conversion model, where α is 1, β is 0.5, γ is 0.5, σ is 0.5, μ is 0.5, and L is L_PercRepresenting the perceptual loss function.

Wherein, tanh is hyperbolic tangent processing, Output represents a network Output image, that is, an sRGB image Output after Raw format image data is enhanced, and groudtruth is a target image.

An L1_ loss function is selected to ensure the color brightness of the image, an L1_ loss function is a linear loss function, and in order to ensure that the details of the highlight area of the Raw image cannot be lost, Tanh operation is respectively carried out on the Output image Output and the GroudTruth before the L1_ loss function is calculated, so that the image can be ensured to have higher weight in the highlight area.

In order to ensure the image perception similarity, the perception loss Perceptial _ loss L based on VGG19 is introduced_Perc。

To ensure image authenticity, the image data conversion model works against the Loss Gan _ Loss function to minimize the distance between the color estimate of the real image and the image color distribution that generates the network output.

Represents three different divisionsResolution is the loss function of the output of the discriminator at 512x512, 256x256, 128x128 resolution.

Where D represents the above-mentioned discriminator, x represents real data (i.e., GroudTrut), z represents network-generated data, px represents the distribution of real data samples, pz represents the distribution of network-generated data samples, E represents expectation, and L represents_advRepresenting the penalty function.

In order to ensure that the color of the highlight area is excessively harmonious with the normal image area while the highlight area is repaired and inhibited after the image is output, the image data conversion model also uses L_MatConverting the color space of the input and output images into Lab space, calculating the mean and variance of the channels a and b of the input and output images, respectively calculating the L2_ loss function of the mean and L2_ loss functions of the variance of the input and output images, and summing to obtain L_Mat，L_MatRepresenting a statistical loss function in order to ensure excessive harmony of colors in the exposed area.

The image data conversion apparatus provided by the present invention is described below, and the image data conversion apparatus described below and the image data conversion method described above may be referred to in correspondence with each other.

Referring to fig. 8, the present invention provides an image data converting apparatus, including:

the acquisition module 100 is used for shooting images through various digital sensors and acquiring image data to be processed. The image data to be processed is data in Raw format, including but not limited to CR2, ARW, NEF data.

The preprocessing module 200 is configured to preprocess the image data to be processed.

In the preprocessing module 200, the purpose of preprocessing is to obtain a single-channel mask (mask) image highlight region with image highlight region information to protect the details of the highlight region in the original data-transfer sensor acquired image.

The conversion enhancing module 300 is configured to input the preprocessed image data to be processed into the image data conversion model, so as to obtain the conversion enhanced image data output by the image data conversion model. The image data conversion model is obtained based on sample image data training, and the image data after conversion and enhancement is an sRGB image.

By the conversion enhancement module 300, Raw format image data collected by the digital sensor can be converted into an sRGB image, and the saturation, contrast, exposure value, etc. of the image are enhanced. In this embodiment, the FCN model is used as the image data conversion model, and because a full convolution network model is used, no scaling is required for the input and output images.

According to the traditional device for converting an image in a Raw format into an eight-bit sRGB image, a large amount of color noise exists in the finally obtained image, the protection on a highlight area of the image is poor, and the effect of the image is not improved and enhanced. The device completes the conversion from the Raw format image data to the sRGB image based on the FCN, automatically enhances the image effect in the conversion process, can adapt to the Raw format image data acquired by various digital sensor devices, completes effective image conversion and effect enhancement, and greatly improves the contrast, saturation, exposure effect and the like which are different from the traditional device image.

Referring to fig. 9, the preprocessing module 200 specifically includes:

the first obtaining unit 210 is configured to obtain a normalized threshold of a pixel value of each pixel in the image data to be processed. It should be noted that the Raw format image data is single-channel grayscale data, and no color information is presented, so the unprocessed image data is a single-channel grayscale map, and the normalized threshold of the single-channel grayscale map without conversion and enhancement processing is (-1, 1).

The first obtaining unit 220 is configured to obtain a gray value corresponding to each pixel point according to the normalized threshold; the gray value of the pixel point with the normalized threshold value higher than the preset threshold value is a first gray value and is used as a first mask, and the gray value of the pixel point with the normalized threshold value not higher than the preset threshold value is a second gray value and is used as a second mask. In this embodiment, the preset threshold is 0.96, 1 is assigned to the gray value of the pixel point whose normalized threshold is higher than 0.96, and the pixel point is used as the first mask; and assigning 0 to the gray value of the pixel point with the normalized threshold not higher than 0.96, and taking the pixel point as a second mask. In this device a first mask is used to represent the highlight areas in the image.

And a retaining unit 230 configured to retain the first mask to obtain a single-channel mask image with highlight region information of the image. The normalized threshold range for a single channel mask image with image highlight region information is (0, 1).

Referring to fig. 10, the image data conversion model is trained by the following modules:

a sample acquisition module 400 for acquiring sample image data. The sample acquisition module 400 is the same as the acquisition module 100 in acquisition mode, and can acquire sample image data in different scenes in consideration of the diversity requirement of the sample image data.

The sample preprocessing module 500 is configured to preprocess sample image data. Also, in the present embodiment, the preprocessing mode in the sample preprocessing module 500 is the same as that of the preprocessing module 200.

And a gain module 600, wherein in order to obtain a network model with higher robustness, the gain module 600 is configured to perform gain processing on the preprocessed sample image data to add a gain to the sample image data. The gain processing comprises one or more of turning, rotating, translating, affine transformation, exposure, contrast adjustment, saturation adjustment and blurring;

the training module 700 is configured to train the gain-processed sample image data as input data for training in a deep learning manner, so as to obtain an image data conversion model for generating conversion-enhanced image data of the image data to be processed.

Referring to fig. 11, a three-channel color standard random color noise map is obtained by:

the conversion unit 710 performs conversion processing on the sample image data to obtain an eight-bit sRGB image.

The enhancement unit 720 performs image enhancement on the eight-bit sRGB image through Photoshop and other software, and adjusts the contrast, exposure, saturation and the like of the image to obtain a three-channel sRGB image.

And the denoising unit 730 is used for removing color noise from the sRGB image of the three channels to obtain an RGB random color noise map of the three channels. The normalized threshold for the three channel RGB random color noise map is in the range (-1, 1).

The three-channel RGB random color noise image is used for improving the effect of the image data conversion model on removing color noise, because when the digital sensor shoots an image, due to the fact that the sensitivity of the digital sensor to a scene is different, color noise with different degrees can be generated, and if the color noise is not removed through the denoising unit 730, the final imaging quality of the image can be influenced.

Fig. 12 illustrates a physical structure diagram of an electronic device, which may include, as shown in fig. 12: a processor (processor)810, a communication Interface 820, a memory 830 and a communication bus 840, wherein the processor 810, the communication Interface 820 and the memory 830 communicate with each other via the communication bus 840. The processor 810 may call logic instructions in the memory 830 to perform an image data conversion method comprising the steps of:

s100, collecting image data to be processed; the image data to be processed is data in a Raw format;

s200, preprocessing the image data to be processed;

s300, inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model; the image data conversion model is obtained based on sample image data training, and the image data after conversion and enhancement is an sRGB image.

In addition, the logic instructions in the memory 830 may be implemented in software functional units and stored in a computer readable storage medium when the logic instructions are sold or used as independent products. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

In another aspect, the present invention also provides a computer program product, the computer program product comprising a computer program, the computer program being storable on a non-transitory computer-readable storage medium, the computer program, when executed by a processor, being capable of executing the image data conversion method provided by the above methods, the method comprising the steps of:

s200, preprocessing the image data to be processed;

In yet another aspect, the present invention also provides a non-transitory computer-readable storage medium having stored thereon a computer program which, when executed by a processor, is implemented to perform the image data conversion method provided by the above methods, the method comprising the steps of:

s200, preprocessing the image data to be processed;

The above-described embodiments of the apparatus are merely illustrative, and the units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of the present embodiment. One of ordinary skill in the art can understand and implement it without inventive effort.

Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods described in the embodiments or some parts of the embodiments.

Those of ordinary skill in the art will understand that: the discussion of any embodiment above is meant to be exemplary only, and is not intended to intimate that the scope of the disclosure, including the claims, is limited to these examples; within the idea of the invention, also features in the above embodiments or in different embodiments may be combined, steps may be implemented in any order, and there are many other variations of the different aspects of the invention as described above, which are not provided in detail for the sake of brevity.

The embodiments of the invention are intended to embrace all such alternatives, modifications and variances that fall within the broad scope of the appended claims. Therefore, any omissions, modifications, substitutions, improvements and the like that may be made without departing from the spirit and principles of the invention are intended to be included within the scope of the invention.

Claims

1. An image data conversion method, characterized by comprising the steps of:

preprocessing the image data to be processed;

2. The image data conversion method according to claim 1, wherein the preprocessing the image data to be processed specifically includes the following steps:

3. The image data conversion method according to claim 2, wherein the image data conversion model is trained by:

acquiring the sample image data;

preprocessing the sample image data;

4. The image data conversion method according to claim 3, wherein the input channels of the image data conversion model are five channels including original sample image data, preprocessed sample image data, and three-channel color standard stochastic color noise maps.

5. The image data conversion method of claim 4, wherein the three-channel color standard random color noise map is obtained by:

6. The image data conversion method according to claim 3, wherein the image data conversion model employs a full convolution network model.

7. An image data conversion apparatus characterized by comprising:

an acquisition module (100) for acquiring image data to be processed; the image data to be processed is data in a Raw format;

a preprocessing module (200) for preprocessing the image data to be processed;

the conversion enhancement module (300) is used for inputting the preprocessed image data to be processed into an image data conversion model to obtain the conversion enhanced image data output by the image data conversion model; the image data conversion model is obtained based on sample image data training, and the image data after conversion enhancement is a universal color standard image.

8. An electronic device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the steps of the image data conversion method according to any of claims 1 to 6 are implemented when the processor executes the program.

9. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the steps of the image data conversion method according to any one of claims 1 to 6.

10. A computer program product comprising a computer program, characterized in that the computer program realizes the steps of the image data conversion method according to any one of claims 1 to 6 when executed by a processor.