CN110602494A

CN110602494A - Image coding and decoding system and method based on deep learning

Info

Publication number: CN110602494A
Application number: CN201910705904.7A
Authority: CN
Inventors: 王培�; 其他发明人请求不公开姓名
Original assignee: Hangzhou Pico Pico Technology Co ltd
Current assignee: Hangzhou Pico Pico Technology Co ltd
Priority date: 2019-08-01
Filing date: 2019-08-01
Publication date: 2019-12-20

Abstract

The invention discloses an image coding and decoding system and a coding and decoding method based on deep learning, wherein the coding system comprises: the forward transformation network module based on deep learning, the conditional probability super-prior analysis module based on deep learning and the entropy coding module; the forward conversion network module is used for obtaining a characteristic coefficient; the super-first-check analysis module is used for obtaining a super-first-check characteristic value; the entropy coding module is used for entropy coding. The decoding system includes: the device comprises an entropy decoding module, a deep learning-based reconstruction module and a deep learning-based inverse transformation network module; the entropy decoding module is used for entropy decoding; the reconstruction module is used for obtaining a conditional probability model; the inverse transform network module is used for reconstructing image pixel values. By adopting the invention, the performance of the codec obtained by training exceeds various traditional coding standards in an unsupervised mode.

Description

Image coding and decoding system and method based on deep learning

Technical Field

The invention relates to the technical field of image coding, in particular to an image coding and decoding system and a coding and decoding method based on deep learning.

Background

With the rapid development of multimedia technology and network communication technology, image multimedia applications have covered various aspects of human life. The large number of image applications creates a huge amount of data that would be difficult to apply for practical storage and transmission if not compressed. The image compression coding technology can effectively remove redundant information in the data, and realize the quick transmission and off-line storage of the image data in the Internet. Therefore, image compression encoding technology is a key technology in video applications.

In the past decades, a series of image coding standards have been widely used. There are many existing standards for image compression, including JPEG and JPEG2000, as set forth by the Joint Picture Experts Group (Joint Picture Experts Group), PNG, as developed by the Unisys corporation and promulgated by the International Organization for standardization (ISO)/International Electrotechnical Commission (IEC), WebP, as promulgated by Google, and Fabry Bellard, created in 2014. Although conventional coding standards are numerous and continue to advance, the coding framework has not changed significantly. For example, image coding standards basically follow the framework of transform coding (transform coding), and the development trend of conventional coding standards is to exchange finer and more complex algorithms for higher coding performance. The more difficult it is to further gain performance that image coding standards iterate to date.

In recent years, deep learning techniques have made a major breakthrough in multiple image processing and machine vision tasks, and have received extensive attention from researchers. The deep learning technique can learn data prior knowledge and adaptive transformation operation from a large amount of data, which is also suitable for the image coding task. Research using deep learning for image coding began with a recurrent neural network-based image coding method published in google in 2015. Recently, several studies have shown that deep learning based image coding methods have achieved performance exceeding many conventional image coding techniques. Although such methods have not been developed for a long time, they have achieved performance comparable to the best current conventional coding techniques (BPG HEVC-based intra coding is the best current image coding). These results all show that the image coding technology based on deep learning has great potential, and it is possible to achieve coding performance which is fully superior to that of the traditional method. In addition, compared with the traditional method which depends on expert knowledge and characteristic engineering, the deep learning technology has strong adaptivity, and can be trained according to specific data in practical application to obtain higher coding efficiency. The establishment and release of a new generation of traditional video coding standard often requires 10 years, so that through the research on deep learning-based image coding, the coding performance is expected to be remarkably improved, and the method has very important academic exploration and practical application values.

However, the image coding method based on the recurrent neural network disclosed in google mentioned above is too computationally expensive, which hinders practical use. Therefore, it is urgently needed to provide an image encoding method with high computational efficiency and excellent encoding performance.

Disclosure of Invention

The invention provides an image coding and decoding system and a coding and decoding method based on deep learning aiming at the problems in the prior art, provides a set of training strategy which enables the whole coding network to carry out end-to-end optimization, and adopts an unsupervised mode, so that the performance of a coder obtained by training exceeds various traditional coding standards.

In order to solve the technical problems, the invention is realized by the following technical scheme:

the invention provides an image coding system based on deep learning, which comprises:

the forward transformation network module is used for enabling the image to pass through a forward transformation network to obtain a characteristic coefficient representing image information;

the system comprises a condition probability super-prior analysis module based on deep learning, a condition probability super-prior analysis module and a feature coefficient analysis module, wherein the condition probability super-prior analysis module is used for analyzing the feature coefficients to obtain a super-prior feature value representing the condition probability of the feature coefficients;

and the entropy coding module is used for entropy coding the quantized feature coefficients to obtain a feature coefficient code stream under the guidance of the super-prior conditional probability, and is also used for entropy coding the quantized super-prior feature values by a conditional probability model counted on a training set to obtain the super-prior feature value code stream.

Preferably, the entropy coding module is further configured to perform bypass entropy coding on the image meta-information to obtain an image meta-information code stream;

wherein the image meta information includes: the length and width of the image, and the model number used by the image.

Preferably, the forward transform network module is constructed based on a deep convolutional neural network;

the forward conversion network module comprises N convolutional layers and N-1 normalization layers, wherein the forward conversion module starts from the convolutional layers, and the convolutional layers and the normalization layers are alternately distributed.

Preferably, the super-prior analysis module is constructed based on a deep convolutional neural network;

the analysis network of the super-prior analysis module comprises six layers; the first layer is an absolute value operation layer, the second layer is a convolution layer, the third layer is an activation layer, the fourth layer is a convolution layer, the fifth layer is an activation layer, and the sixth layer is a convolution layer.

Preferably, the quantization in the entropy coding module is adding random uniform noise approximation quantization.

The present invention also provides an image decoding system based on deep learning, which is an image decoding system corresponding to the above image encoding system, and includes:

the entropy decoding module is used for carrying out entropy decoding on the super-prior-check eigenvalue code stream to obtain a reconstructed super-prior-check eigenvalue matrix;

the deep learning-based reconstruction module is used for training according to the super-prior eigenvalue matrix to obtain a conditional probability model of which the eigenvalue is based on Laplace distribution super-prior; the entropy decoding module is further used for performing entropy decoding on the characteristic coefficient code stream according to the conditional probability model to obtain a reconstructed characteristic coefficient matrix;

and the deep learning-based inverse transformation network module is used for enabling the reconstructed characteristic coefficient matrix to reconstruct the image pixel value through an inverse transformation network.

Preferably, the entropy decoding module is further configured to perform entropy decoding on the image meta-information code stream to obtain image meta-information;

Preferably, the inverse transformation network module is constructed based on a deep convolutional neural network;

the inverse transformation network module and the forward transformation network module are in a symmetrical structure;

the inverse transformation network module comprises N layers of deconvolution layers and N-1 layers of inverse normalization layers, the inverse transformation module starts with the inverse convolution layers, and the inverse convolution layers and the inverse normalization layers are alternately distributed.

Preferably, the super-prior reconstruction module is constructed based on a deep convolutional neural network;

the super-prior reconstruction module and the super-prior analysis module are in a symmetrical structure;

the reconstruction network of the super-prior reconstruction module also comprises six layers, wherein the first layer is an deconvolution layer, the second layer is an activation layer, the third layer is an deconvolution layer, the fourth layer is an activation layer, the fifth layer is an deconvolution layer, and the sixth layer is an exponential function output layer for each input characteristic value.

The invention also provides an image coding method based on deep learning, which comprises the following steps:

s101: carrying out forward transformation on an input image to obtain a characteristic coefficient matrix representing image information;

s102: inputting the characteristic coefficient matrix into a super-prior analysis module, and outputting to obtain a super-prior eigenvalue matrix representing the probability of the characteristic coefficient;

s103: quantizing the super prior eigenvalue matrix in the S102, and entropy coding the quantized super prior eigenvalue matrix to obtain a super prior eigenvalue code stream;

s104: training according to the quantized super-prior eigenvalue matrix in the S103 to obtain a conditional probability model of which the eigenvalue is based on Laplace distribution super-prior;

s105: quantizing the characteristic coefficient matrix in the S101, and performing entropy coding on the quantized characteristic coefficient matrix by using the conditional probability model in the S104 to obtain a characteristic coefficient code stream;

s106: the code stream of the output image of the packing includes: the code stream of the super-prior eigenvalue in S103 and the code stream of the eigenvalue in S105.

Preferably, between S105 and S106, further comprising:

s111: carrying out bypass entropy coding on the image meta-information to obtain an image meta-information code stream, wherein the image meta-information comprises: the length and width of the image, and the model serial number adopted by the image; further, the air conditioner is provided with a fan,

the encoding code stream of the output image in S106 further includes: the image meta information code stream in S101.

Preferably, the quantization in S103 and/or S105 is approximate quantization, and the approximate quantization is performed by adding random uniform noise.

Preferably, the value range of the random uniform noise is [ -0.5,0.5 ].

Preferably, the S104 includes:

taking a minimized loss function J which is R + lambda D as a target, adopting MS-SSIM or PSNR as a measurement index, and approximating by using information entropy; wherein:

the information entropy is obtained according to a conditional probability function of the characteristic coefficient, namely n is sum (-plog2 (p));

the conditional probability density is modeled based on Laplace distribution, the mean value is assumed to be 0, and the variance is the super-prior conditional probability model obtained by training;

where R is the code rate, D is the distortion, n is the information entropy, and p is the conditional probability function.

The present invention also provides an image decoding method based on deep learning, which is an image decoding method corresponding to the above image encoding method, and which includes the steps of:

s141: entropy decoding to obtain image meta-information, including the length and width of an image and a model sequence number adopted by the image;

s142: decoding the code stream of the super-prior eigenvalue to obtain a super-prior eigenvalue matrix by using a corresponding super-prior eigenvalue entropy coding model according to the serial number of the model; constructing and initializing a corresponding network model according to the model serial number;

s143: sending the super-prior eigenvalue matrix obtained by decoding in the 142 into a super-prior reconstruction module, and outputting the conditional probability of the obtained eigen coefficients;

s144: decoding the characteristic coefficient code stream to obtain a characteristic coefficient matrix of the image by using the conditional probability of the characteristic coefficient in the step 143;

s145: and sending the characteristic coefficient matrix in the step S144 to an inverse transformation network module, and reconstructing a pixel value.

The invention also provides an image coding terminal, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor can be used for executing the image coding method based on deep learning when executing the program.

The invention also provides an image decoding terminal, which comprises a memory, a processor and a computer program which is stored on the memory and can run on the processor, wherein the processor can be used for executing the image coding and decoding method based on the deep learning when executing the program.

The present invention also provides a computer-readable storage medium having stored thereon a computer program which, when executed by a processor, is operable to perform the above-described deep learning-based image encoding method.

The present invention also provides a computer readable storage medium, on which a computer program is stored, which, when being executed by a processor, is operable to perform the above-mentioned deep learning-based image coding and decoding method.

Compared with the prior art, the invention has the following advantages:

(1) the image coding and decoding system and the coding and decoding method based on deep learning are constructed based on a neural network, network parameters need to be trained, and a set of training strategy which enables the whole coding network to be optimized end to end is provided; the calculation efficiency is high based on the neural network; in an unsupervised manner, the trained encoder performance exceeds a variety of conventional encoding standards, such as: JPEG, JPEG2000, etc.;

(2) in the image coding and decoding system and the image coding and decoding method based on deep learning, in the training stage, conditional probability modeling based on Laplace distribution super-prior is carried out on the characteristic coefficient of an image, the conditional probability modeling is a differentiable modeling, so that a code rate loss term can be expressed by using a continuously-derivable function, and thus, network parameters can be updated by using gradient reverse conduction;

(3) the image coding and decoding system and the coding and decoding method based on deep learning approximate quantization operation by adding random uniform noise in the training stage, so that the coding and decoding process becomes conductive.

Of course, it is not necessary for any product that implements the invention to achieve all of the above-described advantages at the same time.

Drawings

Embodiments of the invention are further described below with reference to the accompanying drawings:

FIG. 1 is a flowchart of an image coding method based on deep learning according to an embodiment of the present invention;

FIG. 2 is a block diagram of an image coding method based on deep learning according to an embodiment of the present invention;

FIG. 3 is a diagram illustrating an output code stream of an image coding method based on deep learning according to an embodiment of the present invention;

fig. 4 is a flowchart of an image decoding method based on deep learning according to an embodiment of the present invention.

Detailed Description

The following examples are described in detail, which are carried out on the premise of the technical solution of the present invention, and detailed embodiments and specific procedures are provided, but the scope of the present invention is not limited to the following examples.

Example 1:

the deep learning-based image encoding system of the present embodiment includes: the forward transformation network module based on deep learning, the conditional probability prior analysis module based on deep learning and the entropy coding module. The forward transformation network module is used for enabling the image to pass through a forward transformation network to obtain a characteristic coefficient representing image information; the conditional probability super-prior analysis module is used for analyzing the characteristic coefficient to obtain a super-prior characteristic value representing the conditional probability of the characteristic coefficient; the entropy coding module is used for entropy coding the quantized feature coefficients to obtain feature coefficient code streams under the guidance of the super-prior conditional probability, and is also used for entropy coding the quantized super-prior feature values by a conditional probability model counted on a training set to obtain the super-prior feature value code streams.

In a preferred embodiment, the entropy coding module is further configured to perform bypass entropy coding on the image meta-information to obtain an image meta-information code stream; wherein the image meta information includes: the length and width of the image, the model number used by the image (used to determine the network parameters of the deep learning network used).

In a preferred embodiment, the forward transform network module is constructed based on a deep convolutional neural network; the forward conversion network module starts with the convolution layer, and the convolution layer and the normalization layer are distributed alternately. In one embodiment, the convolution kernels of each convolution layer are all 5 × 5 in size, the number of convolution kernels is 192, and the spatial length and width of the feature coefficients after convolution are both reduced to half of the original size. The normalization layer uses a normal generalized division normalization operation, which is proposed by Ball et al to be a function of local gain control.

In a preferred embodiment, the super-a-priori analysis module is constructed based on a deep convolutional neural network. In one embodiment, the analysis network of the super-first analysis module comprises six layers; the first layer is an operation of taking absolute values, namely a point operation; the second layer is convolution operation, the size of a convolution kernel is 3 multiplied by 3, the number of the convolution kernel is 192, and the space size of the feature passing through the layer is unchanged; the third layer is an active layer and adopts a Leaky relu function; the fourth layer is a convolution layer, the size of a convolution kernel is 5 multiplied by 5, the number of the convolution kernels is 192, and the length and the width of a space after the characteristics are convoluted are reduced to half of the original length and width; the fifth layer is an active layer and adopts a Leaky relu function; the last layer is a convolution layer, the size of a convolution kernel is 5 multiplied by 5, the number of the convolution kernels is 192, and the length and the width of a space after the characteristics are convoluted are reduced to half of the original space.

In a preferred embodiment, the quantization in the entropy coding module is the addition of random uniform noise approximation quantization, making the codec process scalable. In one embodiment, the value range of the uniform noise is [ -0.5,0.5 ].

In a preferred embodiment, the quantization used in the entropy coding module is scalar quantization, and the quantization function is y round (x), i.e. the input is rounded and quantized, and the output is the nearest integer.

Example 2:

the image decoding system based on deep learning of the present embodiment corresponds to the image encoding system of the above-described embodiment, and includes: the device comprises an entropy decoding module, a deep learning-based reconstruction module and a deep learning-based inverse transformation network module. The entropy decoding module is used for performing entropy decoding on the super-prior eigenvalue code stream to obtain a reconstructed super-prior eigenvalue matrix; the deep learning-based reconstruction module is used for training according to a super-prior eigenvalue matrix to obtain a conditional probability model of which the eigenvalue is based on Laplace distribution super-prior; the entropy decoding module is also used for carrying out entropy decoding on the characteristic coefficient code stream according to the conditional probability model to obtain a reconstructed characteristic coefficient matrix; and the deep learning-based inverse transformation network module is used for reconstructing the pixel values of the reconstructed characteristic coefficient matrix through an inverse transformation network.

In a preferred embodiment, the entropy decoding module is further configured to perform entropy decoding on the image meta-information code stream to obtain image meta-information; wherein the image meta information includes: the length and width of the image, the model sequence number used by the image.

In a preferred embodiment, the inverse transformation network module is constructed based on a deep convolutional neural network; the inverse transformation network module and the forward transformation network module are in a symmetrical structure and comprise N layers of deconvolution layers and N-1 layers of inverse normalization layers, the inverse transformation module starts from the deconvolution layers, and the deconvolution layers and the inverse normalization layers are alternately distributed. In one embodiment, the convolution kernels of each deconvolution layer are all 5 × 5 in size, the number of convolution kernels is 192, and the length and width of the space of the feature coefficient after deconvolution are both reduced to twice of the original length and width. The inverse normalization layer adopts inverse generalized division normalization operation.

In a preferred embodiment, the super-prior reconstruction module is constructed based on a deep convolutional neural network, and the super-prior reconstruction module and the super-prior analysis module are in a symmetrical structure. In one embodiment, the reconstruction network of the superma reconstruction module includes six layers, the first layer is an deconvolution layer, the size of a convolution kernel is 5 × 5, the number of the convolution kernels is 192, and the space length and the width of the feature are reduced to two times of the original space length and width after convolution. The second layer is an active layer and adopts a Leaky relu function; the third layer is an deconvolution layer, the size of a convolution kernel is 5 multiplied by 5, the number of the convolution kernels is 192, and the length and the width of a space after the characteristics are convoluted are reduced to two times of the original length and width; the fourth layer is an active layer and adopts a Leaky relu function; the fifth layer is an deconvolution layer, the size of a convolution kernel is 3 multiplied by 3, the number of the convolution kernels is 192, and the space length and the space width of the characteristic after convolution are kept unchanged; the last layer is the output of the function of taking the index of each characteristic value input.

Example 3:

the present invention also provides an image coding method based on deep learning, which is a flowchart as shown in fig. 1, and a frame diagram as shown in fig. 2, and includes the following steps:

s102: inputting the characteristic coefficient matrix into a super-first-check analysis module, and outputting a super-first-check characteristic value matrix representing the probability of the characteristic coefficient;

s103: quantizing the super prior eigenvalue matrix in the step S102, and entropy coding the quantized super prior eigenvalue matrix to obtain a super prior eigenvalue code stream (namely an auxiliary information code stream);

s104: training according to the quantized super-prior eigenvalue matrix in the step S103 to obtain a conditional probability model of which the eigenvalue is based on Laplace distribution super-prior;

s105: quantizing the characteristic coefficient matrix in the step S101, and entropy coding the quantized characteristic coefficient matrix by using the conditional probability model in the step S104 to obtain a characteristic coefficient code stream;

s106: the code stream of the output image of the packing includes: the code stream of the super-prior feature value in step S103 and the code stream of the feature coefficient in step S105.

In a preferred embodiment, the step S105 and the step S106 further include:

s111: carrying out bypass entropy coding on the image meta-information to obtain an image meta-information code stream, wherein the image meta-information comprises: the length and width of the image, and the model serial number adopted by the image; further, the outputting of the encoded code stream of the image in step S106 further includes: the image meta-information code stream in step S111 is a structural diagram of the output code stream of this embodiment as shown in fig. 3.

In the preferred embodiment, the quantization in step S103 and/or step S105 is approximate quantization, and a random uniform noise is added to perform approximate quantization. In one embodiment, the random uniform noise has a value range of [ -0.5,0.5 ].

In a preferred embodiment, step S104 specifically includes: taking a minimized loss function J which is R + lambda D as a target, adopting MS-SSIM or PSNR as a measurement index, and approximating by using information entropy; the information entropy is obtained and is related to a conditional probability function of the characteristic coefficient, the conditional probability density is modeled based on Laplace distribution, the mean value is assumed to be 0, and the variance is a super-prior conditional probability model obtained by training; where R is the code rate and D is the distortion.

Example 4:

the present invention also provides an image decoding method based on deep learning, a flowchart of which is shown in fig. 4, and the image decoding method corresponds to the image encoding method, and includes the following steps:

s142: decoding the code stream of the super-prior eigenvalue to obtain a super-prior eigenvalue matrix by using a corresponding super-prior eigenvalue entropy coding model according to the serial number of the model; according to the model serial number, constructing and initializing a corresponding network model (including network parameters of the adopted deep learning network);

s143: sending the super-prior eigenvalue matrix obtained by decoding in the step 142 into a super-prior reconstruction module, and outputting the conditional probability of the obtained characteristic coefficient;

s145: and (4) sending the characteristic coefficient matrix in the step (S144) to an inverse transformation network module, and reconstructing a pixel value.

Example 5:

an image coding terminal comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor can be used for executing the image coding method based on deep learning in the embodiment 3 when executing the program.

Example 6:

an image decoding terminal comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the processor can be used for executing the image coding and decoding method based on deep learning in the embodiment 4 when executing the program.

Example 7:

a computer-readable storage medium, having stored thereon a computer program which, when executed by a processor, is operable to execute the deep learning-based image encoding method of embodiment 3 described above.

Example 8:

a computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, is operable to perform the deep learning-based image coding and decoding method of embodiment 4 described above.

Through the embodiments of the invention, an unsupervised mode is adopted, the performance of the trained coder-decoder exceeds various traditional coding standards, and the calculation efficiency is high.

The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, and not to limit the invention. Any modifications and variations within the scope of the description, which may occur to those skilled in the art, are intended to be within the scope of the invention.

Claims

1. An image coding system based on deep learning, comprising:

and the entropy coding module is used for entropy coding the quantized feature coefficients to obtain a feature coefficient code stream under the guidance of the super-prior conditional probability, and is also used for entropy coding the quantized super-prior feature values by a conditional probability model counted on a training set to obtain a super-prior feature value code stream.

2. The deep learning based image coding system of claim 1, wherein the entropy coding module is further configured to perform bypass entropy coding on the image meta-information to obtain an image meta-information code stream;

3. The deep learning based image coding system of claim 1, wherein the forward transform network module is constructed based on a deep convolutional neural network;

4. The deep learning based image coding system according to claim 1, wherein the super-prior analysis module is constructed based on a deep convolutional neural network;

5. The deep learning based image coding system of claim 1, wherein the quantization in the entropy coding module is adding random uniform noise approximation quantization.

6. An image decoding system based on deep learning, which is an image decoding system corresponding to the image encoding system of any one of claims 1 to 5, comprising:

the deep learning-based reconstruction module is used for training according to the super-prior eigenvalue matrix to obtain a conditional probability model of which the eigenvalue is based on Laplace distribution super-prior; the entropy decoding module is also used for carrying out entropy decoding on the characteristic coefficient code stream according to the conditional probability model to obtain a reconstructed characteristic coefficient matrix;

7. The deep learning based image decoding system of claim 6, wherein the entropy decoding module is further configured to perform entropy decoding on the image meta-information code stream to obtain image meta-information;

8. The deep learning based image decoding system of claim 6, wherein the inverse transform network module is constructed based on a deep convolutional neural network;

the inverse transformation network module comprises N layers of deconvolution layers and N-1 layers of inverse normalization layers, the inverse transformation module starts with the inverse convolution layers, and the inverse convolution layers and the inverse normalization layers are distributed alternately.

9. The deep learning based image decoding system of claim 6, wherein the super-prior reconstruction module is constructed based on a deep convolutional neural network;

10. An image coding method based on deep learning, comprising:

s102: inputting the characteristic coefficient matrix into a super-prior analysis module, and outputting a super-prior eigenvalue matrix representing the probability of the characteristic coefficient;

s106: the code stream of the output image of the packing includes: the code stream of the super-prior eigenvalue in S103 and the code stream of the eigen coefficient in S105.

11. The method for coding an image based on deep learning of claim 10, wherein between S105 and S106 further comprises:

the encoding code stream of the output image in S106 further includes: and the image meta-information code stream in the S101.

12. The method according to claim 10, wherein the quantization in S103 and/or S105 is approximate quantization, and the approximate quantization is performed by a method of adding random uniform noise.

13. The deep learning based image coding method according to claim 10, wherein the S104 comprises:

14. An image decoding method based on deep learning, which is an image decoding method corresponding to the image encoding method according to any one of claims 10 to 13, comprising:

s141: entropy decoding to obtain image meta-information, including the length and width of an image and the model serial number adopted by the image;

15. An image encoding terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor is operable to execute the image encoding method based on deep learning according to any one of claims 10 to 13 when executing the program.

16. An image decoding terminal comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor is operable to execute the method of claim 14 when executing the program.

17. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, is adapted to carry out the method for deep learning based image encoding according to any one of claims 10 to 13.

18. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, is adapted to perform the method for deep learning based image coding and decoding as set forth in claim 14.