RU2788314C1

RU2788314C1 - Method for detecting and localizing a falsified area in jpeg images

Info

Publication number: RU2788314C1
Application number: RU2022114078A
Authority: RU
Inventors: Никита Викторович Арлазаров; Ирина Андреевна Кунина; Дмитрий Валерьевич Полевой; Александр Васильевич Чуйко
Original assignee: Общество с ограниченной ответственностью "СМАРТ ЭНДЖИНС СЕРВИС"
Filing date: 2022-05-25
Publication date: 2023-01-17

Abstract

FIELD: information technology.

SUBSTANCE: invention relates to the field of information technology, namely to verifying the authenticity of images of documents of various types. The method for detecting and localizing the falsified area in JPEG images consists in the fact that the image is considered in the RGB color space, which was subjected to JPEG compression, after which, with the help of software, falsifications were performed and the modified image was saved in a lossless format, while the detection of the falsified area is based on the search for inconsistencies in structure of DCT coefficients of JPEG images, to do this, the DCT coefficients of the image are calculated with saving the information about the position of the blocks corresponding to the coefficients of 8 by 8 pixels, the quantization matrix is evaluated according to the calculated DCT coefficients, while the image is divided into several blocks, then the quantization step is estimated for each of the 64 frequencies in each block, the JPEG compression grid position is searched, the image is searched for inconsistencies of DCT coefficients to the quantization matrix, the falsified areas are localized.

EFFECT: increase in the accuracy of determining the falsified area in the image.

1 cl, 7 dwg

Description

Заявленное техническое решение относится к области проверки подлинности изображений документов различных видов.The claimed technical solution relates to the field of authentication of images of documents of various types.

Из уровня техники известны различные способы детектирования и локализации фальсифицированной области в JPEG-изображениях (см. например, 1. S. Ye, Q. Sun and Е. Chang, "Detecting Digital Image Forgeries by Measuring Inconsistencies of Blocking Artifact, "2007 IEEE International Conference on Multimedia and Expo, 2007, pp. 12-15, doi: 10.1109/ICME.2007.4284574 [1], T. Bianchi, A. De Rosa and A. Piva, "Improved DCT coefficient analysis for forgery localization in JPEG images, "2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, pp.2444-2447, doi: 10.1109/ICASSP.2011.5946978 [2], Weihai Li, Yuan Yuan, Nenghai Yu, Passive detection of doctored JPEG image via block artifact grid extraction, Signal Processing, Volume 89, Issue 9, 2009, Pages 1821-1829, ISSN 0165-1684, https://doi.org/10.1016/j.sigpro.2009.03.025 [3]).Various methods are known in the art for detecting and localizing a fake area in JPEG images (see, for example, 1. S. Ye, Q. Sun and E. Chang, "Detecting Digital Image Forgeries by Measuring Inconsistencies of Blocking Artifact," 2007 IEEE International Conference on Multimedia and Expo, 2007, pp. 12-15, doi: 10.1109/ICME.2007.4284574 [1], T. Bianchi, A. De Rosa and A. Piva, "Improved DCT coefficient analysis for forgery localization in JPEG images, "2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, pp.2444-2447, doi: 10.1109/ICASSP.2011.5946978 [2], Weihai Li, Yuan Yuan, Nenghai Yu, Passive detection of doctored JPEG image via block artifact grid extraction, Signal Processing, Volume 89, Issue 9, 2009, Pages 1821-1829, ISSN 0165-1684, https://doi.org/10.1016/j.sigpro.2009.03.025 [3]).

Однако, в работе [1] по изображению вычисляется мера blocking artifact measure (ВАМ). В данной работе для каждого блока 8 на 8 вычисляется доля несогласованных частот. Такой подход позволяет регистрировать несогласованные блоки даже при высоком качестве JPEG-сжатия истинного изображения. Кроме того, в работе [1] использовался другой метод оценивания матрицы квантования, метод, предложенный в настоящей заявке дает возможность надежно оценивать матрицу квантования не опасаясь влияния фальсифицированных областей.However, in [1], the blocking artifact measure (BAM) is calculated from the image. In this paper, for each 8 by 8 block, the proportion of mismatched frequencies is calculated. This approach makes it possible to register inconsistent blocks even at high quality JPEG compression of the true image. In addition, in [1] another method for estimating the quantization matrix was used, the method proposed in this application makes it possible to reliably estimate the quantization matrix without fear of the influence of falsified regions.

В работе [2] предполагается, что исходные области будут подвергаться двойному сжатию, в то время как поддельные будут иметь только одно сжатие. Для поиска поддельных областей строиться статистическая модель, позволяющая оценивать вероятность, что область подвергалась двойному сжатию. В настоящей заявке предполагается, что фальсифицированное изображение сохранялось в формате без потерь, но также метод работает в случае сохранения фальсифицированного изображения в формате JPEG с высоким качеством. В отличие от работы [2], в заявленном способе каждый блок 8 на 8 определяется поддельным независимо, что позволяет выявлять даже небольшие области фальсификаций, на что, обычно, не способны статистические методы.In [2], it is assumed that the original regions will be subjected to double compression, while the fake ones will have only one compression. To search for fake areas, a statistical model is built to estimate the probability that the area was subjected to double compression. This application assumes that the spoofed image was saved in a lossless format, but the method also works if the spoofed image is saved in high quality JPEG format. In contrast to [2], in the claimed method, each 8 by 8 block is determined to be fake independently, which makes it possible to detect even small areas of falsification, which statistical methods are usually not capable of.

В работе [3] исследуются артефакты блоков JPEG-сжатия. Так как дискретное косинусное преобразование проходит отдельно в каждом блоке 8 на 8, границы между такими блоками заметны. Вставленная область будет иметь по предположению несогласованную сетку. Предлагаемый в настоящей заявке способ использует математическую модель DCT-коэффициентов, что позволяет с высокой вероятностью определять фальсифицированные участи не зависимо от того, что на них изображено и применим при высоком качестве сжатия истинного изображения, когда артефакты блоков слабо заметны.In [3], artifacts of JPEG compression blocks are studied. Since the discrete cosine transform runs separately in each 8 by 8 block, the boundaries between such blocks are noticeable. The pasted area will have an inconsistent mesh by assumption. The method proposed in this application uses a mathematical model of DCT coefficients, which makes it possible to determine falsified fates with a high probability, regardless of what is depicted on them, and is applicable at high compression quality of the true image, when block artifacts are hardly noticeable.

Задачей заявленного изобретения является устранение недостатков известного уровня техники. Технический результат заключается в обеспечении способа детектирования и локализации фальсифицированной области в JPEG-изображениях, который позволяет выявлять с высокой вероятностью, точностью и надежностью даже небольшие области фальсификаций, независимо от того, что на них изображено и применим при высоком качестве сжатия истинного изображения, когда артефакты блоков слабо заметны, а также обеспечивает возможность надежно оценивать матрицу квантования, не опасаясь влияния фальсифицированных областей,The objective of the claimed invention is to eliminate the shortcomings of the prior art. The technical result consists in providing a method for detecting and localizing a falsified area in JPEG images, which makes it possible to detect with high probability, accuracy and reliability even small areas of falsification, regardless of what is depicted on them and is applicable at a high compression quality of a true image, when artifacts blocks are barely noticeable, and also provides the ability to reliably evaluate the quantization matrix without fear of the influence of falsified areas,

Поставленная задача решается, а заявленный технический результат достигается посредством заявленного способа детектирования и локализации фальсифицированной области в JPEG-изображениях.The task is solved, and the claimed technical result is achieved by means of the claimed method for detecting and localizing a falsified area in JPEG images.

Заявленный способ применим в том случае, когда дано изображение и необходимо определить, производились ли манипуляции меняющие содержимое (контент) изображения (операции копирования-вставки или сращивания) и локализировать фальсифицированные области. Например, обнаружить фальсификации на изображении документа и, тем самым, опровергнуть подлинность данных документа.The claimed method is applicable in the case when an image is given and it is necessary to determine whether manipulations were performed to change the content (content) of the image (copy-paste or splicing operations) and to localize falsified areas. For example, to detect falsifications in the image of a document and, thereby, refute the authenticity of the document data.

Заявленный способ детектирования и локализации фальсифицированной области в JPEG-изображениях заключается в следующем:The claimed method for detecting and localizing a falsified area in JPEG images is as follows:

Рассматривается изображение в цветовом пространстве RGB. Предполагается, что истинное изображение было подвержено JPEG-сжатию, после чего, при помощи программного обеспечения, были выполнены фальсификации и измененное изображение было сохранено в формате без потерь.An image in the RGB color space is considered. It is assumed that the true image was subjected to JPEG compression, after which, with the help of software, falsifications were performed and the modified image was saved in a lossless format.

Обнаружение фальсифицированной области основано на поиске несоответствий в структуре DCT-коэффициентов JPEG-изображения. Если изображение подвергалось JPEG-сжатию, то его DCT-коэффициенты при фиксированной частоте (вычисленные при том же положении сетки 8 на 8 пикселей, при котором производилось JPEG-сжатие) будут распределены возле значений кратных шагу квантования q, (числу стоящему в соответствующей частоте матрицы квантования при которой производилось JPEG-сжатие).The falsified area detection is based on the search for inconsistencies in the structure of the DCT coefficients of a JPEG image. If the image has been JPEG-compressed, then its DCT coefficients at a fixed frequency (calculated at the same grid position of 8 by 8 pixels at which JPEG compression was performed) will be distributed around values that are multiples of the quantization step q, (the number in the corresponding matrix frequency quantization at which JPEG compression was performed).

d∈[kq-1, kq+1], k∈Zd∈[kq-1, kq+1], k∈Z

С большой вероятностью DCT-коэффициенты в фальсифицированной области не будут удовлетворять этому свойству. Так, например, при копировании-вставке при несовпадении внутренней сетки JPEG-сжатия вставленной области с внешней сеткой, DCT-коэффициенты вставленной области не будут иметь нетривиальных периодов, несмотря на то, что эта область сжималась с тем же качеством, что и истинное изображение. Вероятность несовпадения внешней сетки и внутренней 63/64. Иллюстрация истинного изображения подвергавшегося JPEG-сжатию с показателем качества 90 представлена на фиг. 1, на фиг. 2 - иллюстрация этого изображения с фальсификацией копирования-вставки, которая выполнена в программе paint.net с последующим сохранением в формате PNG (без потерь).With a high probability, the DCT coefficients in the falsified region will not satisfy this property. So, for example, when copy-pasting, if the internal grid of the JPEG compression of the pasted area does not match the external grid, the DCT coefficients of the pasted area will not have non-trivial periods, despite the fact that this area was compressed with the same quality as the true image. The probability of mismatch between the outer grid and the inner one is 63/64. An illustration of a true JPEG-compressed image with a quality score of 90 is shown in FIG. 1 in FIG. 2 is an illustration of this image with a fake copy-paste done in paint.net and then saved as PNG (lossless).

Вычисление DCT-коэффициентов изображенияComputing the DCT Coefficients of an Image

Для последующих шагов алгоритма необходимо вычислить DCT-коэффициенты изображения с сохранением информации о положении соответствующих коэффициентам блоков 8 на 8 пикселей. Это достигается следующими действиями.For the subsequent steps of the algorithm, it is necessary to calculate the DCT coefficients of the image while maintaining information about the position of the 8 by 8 pixel blocks corresponding to the coefficients. This is achieved by the following steps.

1. Дополнение сторон изображения до кратности 8 черными пикселями.1. Complementing the sides of the image up to a factor of 8 with black pixels.

2. Трансформация цветового пространства из RGB в YCbCr.2. Color space transformation from RGB to YCbCr.

3. Разбиение канала яркости в цветовом пространстве YCbCr (Y) на блоки 8 на 8 пикселей.3. Dividing the brightness channel in the YCbCr (Y) color space into blocks of 8 by 8 pixels.

4. Применение дискретного косинусного преобразования для каждого блока 8 на 8 пикселей.4. Applying a discrete cosine transform to each block of 8 by 8 pixels.

5. Отбрасывание из рассмотрения блоков, в которых присутствуют пиксели с насыщенными значениями (0 или 255) в любом канале RGB и блоков, в которых яркость (Y) в YCbCr в момент перехода из RBG в YCbCr выходила за границы диапазона [0, 1]. Такие пиксели назовем насыщенными.5. Rejection from consideration of blocks in which there are pixels with saturated values (0 or 255) in any RGB channel and blocks in which the brightness (Y) in YCbCr at the moment of transition from RBG to YCbCr went beyond the range [0, 1] . We call such pixels saturated.

Пункт 5 необходим, так как DCT-коэффициенты в таких блоках могут вести себя непредсказуемым образом и не принадлежать диапазону [kq-1, kq+1], где k ∈ Z, q - шаг квантования.Point 5 is necessary, since the DCT coefficients in such blocks can behave unpredictably and do not belong to the range [kq-1, kq+1], where k ∈ Z, q is the quantization step.

Оценка матрицы квантованияEstimation of the quantization matrix

По вычисленным DCT-коэффициентам оценивается матрица квантования. Чтобы избежать влияния фальсифицированной области на оценку матрицы, изображение разбивается на несколько блоков. Далее для каждой из 64 частот в каждом блоке оценивается шаг квантования. Он состоит в следующих действиях.Based on the calculated DCT coefficients, the quantization matrix is estimated. To avoid the influence of the falsified area on the matrix estimation, the image is divided into several blocks. Next, for each of the 64 frequencies in each block, the quantization step is estimated. It consists of the following steps.

1. Вычисление факторной гистограммы h. h(x) - количество DCT - коэффициентов, которые делятся на х без остатка.1. Calculation of the factorial histogram h. h(x) - the number of DCT - coefficients that are divided by x without a remainder.

2. Нормализация h на общее количество DCT-коэффициентов.2. Normalization of h by the total number of DCT coefficients.

3. Оценка шага квантования, как

, для некоторого порога Т₁.3. Estimation of the quantization step as

, for some threshold Т ₁ .

Если

конец оценивания.If

end of evaluation.

4. В случае

нахождение у=max{x | h(x)≥Т₂] (для некоторого Т₂). Если у<4 конец оценивания.4. In case

finding y=max{x | h(x)≥Т ₂ ] (for some Т ₂ ). If y<4 end of evaluation.

5. Построение модифицированной факторной гистограммы

-количество DCT-коэффициентов, которые делятся на х без остатка и не принадлежат множеству М={ky±1, k∈Z}.5. Construction of a modified factorial histogram

-number of DCT coefficients that are divisible by x without remainder and do not belong to the set М={ky±1, k∈Z}.

6. Нормализация

на общее количество DCT-коэффициентов не принадлежащих множеству М.6. Normalization

on the total number of DCT coefficients that do not belong to the set M.

7. Вычисление

для некоторого порога Т₃. Если

шаг квантования оценивается как

иначе как

. В пунктах 4-7 производится попытка устранить излишний шум в гистограмме DCT-коэффициентов и произвести оценку заново. Если получается найти достаточно большой период (>3) он принимается как истинный.7. Calculation

for some threshold T ₃ . If

the quantization step is estimated as

otherwise

. In steps 4-7, an attempt is made to eliminate excessive noise in the histogram of DCT coefficients and re-estimate. If it turns out to find a sufficiently large period (> 3) it is accepted as true.

Для каждой частоты берется мода шага квантования по всем блокам, где оценка шага не равна 1.For each frequency, the quantization step mode is taken over all blocks where the step estimate is not equal to 1.

Где

- оценка шага квантования по всему изображению для i-ой частоты,Where

- estimation of the quantization step over the entire image for the i-th frequency,

- оценка шага квантования для

-ой частоты

-го блока не равная 1, М₀ - оператор взятия моды.

- estimation of the quantization step for

th frequency

th block is not equal to 1, M ₀ is the mode-taking operator.

Если

, то шаг квантования для i-ой частоты полагается равным 1 и не участвует при поиске несоответствий в пункте 2.4, так как множество диапазонов

в этом случае заполняет почти всю числовую ось. В случае, когда истинный шаг квантования равен 1, несмотря на то, что мода берется по неединичным значениям, полученная оценка будет меньше пяти в силу фундаментального распределения DCT-коэффициентов и не будет участвовать в дальнейшем.If

, then the quantization step for the i-th frequency is assumed to be 1 and is not involved in the search for inconsistencies in paragraph 2.4, since the set of ranges

in this case fills almost the entire number line. In the case when the true quantization step is equal to 1, despite the fact that the mode is taken over non-unit values, the resulting estimate will be less than five due to the fundamental distribution of DCT coefficients and will not participate further.

Это позволяет оценивать матрицу квантования устойчиво к фальсифицированным областям, даже если они занимают больше половины изображения.This makes it possible to evaluate the quantization matrix robustly against falsified areas, even if they occupy more than half of the image.

Поиск положения сетки JPEG-сжатияFinding the Position of the JPEG Compression Grid

В случае, если оцененная матрица квантования состоит из большого количества единиц (малое количество частот с найденными нетривиальными периодами в гистограмме DCT-коэффициентов), можно предположить, что либо истинное изображение было сжато с высоким показателем качества, либо, что разбиение изображения на блоки 8 на 8 пикселей (сетка сжатия), которое мы выбрали, не совпадает с разбиением, которое применялось при JPEG-сжатии.If the estimated quantization matrix consists of a large number of ones (a small number of frequencies with found nontrivial periods in the histogram of DCT coefficients), it can be assumed that either the true image was compressed with a high quality index, or that the image was divided into blocks of 8 into The 8px (compression mesh) we chose is not the same as the split that was applied in the JPEG compression.

Чтобы исключить второй вариант, применяется алгоритм поиска положения сетки JPEG-сжатия. Берется область некоторого размера на изображении. Она выбирается так, чтобы гистограммы DCT-коэффициентов из этой области были достаточно содержательны. Для этого исключаются слабо текстурированные области и области в которых много насыщенных пикселей. Для положения сетки JPEG-сжатия существует 64 варианта. Для каждого из возможных положений делается оценка матрицы квантования, как описано ранее, но только для взятой области, а не всего изображения. Принимается первое положение, в оцененной матрице которого достаточное количество нетривиальных значений.To exclude the second option, a JPEG compression grid position search algorithm is applied. An area of some size is taken from the image. It is chosen so that the histograms of DCT coefficients from this region are sufficiently meaningful. To do this, weakly textured areas and areas in which there are a lot of saturated pixels are excluded. There are 64 options for the position of the JPEG compression grid. For each of the possible positions, a quantization matrix is estimated as described earlier, but only for the area taken, not the entire image. The first position is accepted, in the estimated matrix of which there is a sufficient number of non-trivial values.

Стоит отметить, что искать положение сетки JPEG-сжатия таким алгоритмом изначально (до выполнения вышеуказанных пунктов) не резонно с точки зрения вычислительных затрат. С большой вероятностью положение сетки совпадает с наивным разбиением указанном ранее.It should be noted that it is not reasonable from the point of view of computational costs to search for the position of the JPEG compression grid with such an algorithm initially (before completing the above steps). With a high probability, the position of the grid coincides with the naive partition indicated earlier.

Поиск несоответствий DCT-коэффициентов матрице квантованияFinding discrepancies between DCT coefficients and quantization matrix

Строиться два вспомогательных изображения I₁ и I₂ из значений следующим образом. Каждому блоку 8 на 8 из разбиения при выбранном положении сетки JPEG-сжатия сопоставляется пиксель с теми же координатами в изображении I₁ и пиксель в изображении I₂. Размеры I₁ и I₂ будут совпадать. В каждый пиксель I₁ записывается количество частот, при которых соответствующий этому пикселю-блоку DCT-коэффициент не принадлежит ни одному диапазону из множества

- шаг квантования из оцененной матрицы квантования, соответствующий частоте.Two auxiliary images I ₁ and I ₂ are built from the values as follows. Each 8 by 8 block from the partition at the selected position of the JPEG compression grid is assigned a pixel with the same coordinates in the image I ₁ and a pixel in the image I ₂ . Dimensions I ₁ and I ₂ will match. Each pixel I ₁ contains the number of frequencies at which the DCT coefficient corresponding to this pixel-block does not belong to any range from the set

is the quantization step from the estimated quantization matrix corresponding to the frequency.

Вводится множество М₂ таких частот, что хотя бы один блок при этой частоте имел DCT-коэффициент не принадлежит ни одному диапазону из множества

- шаг квантования из оцененной матрицы квантования, соответствующий частоте. Другими словами каждая частота из М₂ дала вклад хотя бы в один пиксель I₁. В каждый пиксель I₂ записывается количество частот из множества М₂, при которых соответствующий этому пикселю-блоку DCT-коэффициент имеет абсолютное значение больше единицы. Формируется вспомогательное изображение значений I₃ с теми же размерами, что у I₁ и I₂, по следующей формуле:A set M _{2 of} such frequencies is introduced such that at least one block at this frequency had a DCT coefficient that does not belong to any range from the set

is the quantization step from the estimated quantization matrix corresponding to the frequency. In other words, each frequency from M ₂ contributed to at least one pixel of I ₁ . Each pixel I ₂ contains the number of frequencies from the set M ₂ at which the DCT coefficient corresponding to this pixel-block has an absolute value greater than one. An auxiliary image of the values of I ₃ is formed with the same dimensions as those of I ₁ and I ₂ , according to the following formula:

где t подбирается экспериментально.

where t is chosen experimentally.

Таким образом получается изображение из значений в диапазоне [0, 1]. Значение в каждом пикселе можно интерпретировать как долю частот, при которых соответствующий блок является несогласованным, в смысле DCT-коэффициентов, среди тех частот при которых есть несогласованные, (множество М2).Thus, an image is obtained from values in the range [0, 1]. The value in each pixel can be interpreted as the proportion of frequencies at which the corresponding block is inconsistent, in the sense of the DCT coefficients, among those frequencies at which there are inconsistencies (M2 set).

Контрастированные изображения I₁, I₂, I₃ представлены на фиг. 3, фиг. 4, фиг. 5 соответственно.Contrasted images I ₁ , I ₂ , I ₃ are shown in FIG. 3, fig. 4, fig. 5 respectively.

Локализация фальсифицированных областейLocalization of falsified areas

По изображению I₃ формируется вспомогательное изображение I₄, на котором дополнительно применяются морфологическое открытие и размытие для устранения паразитных элементов, возникающих вследствие случайных отклонений DCT-коэффициентов. По изображению I₄ вычисляется максимум - оценка вероятности наличия манипуляций. По изображению I₄ строится изображение I₅ с размерами исходного изображения. Значение пикселей внутри каждого блока 8 на 8 изображения I₅ равно значению пикселя изображения I₄ соответствующего этому блоку.An auxiliary image I ₄ is formed from the I ₃ image, on which morphological opening and blurring are additionally applied to eliminate parasitic elements that arise due to random deviations of the DCT coefficients. The image I ₄ calculates the maximum - an estimate of the probability of the presence of manipulations. Image I ₄ is used to construct image I ₅ with the dimensions of the original image. The pixel value within each 8 by 8 block of image I ₅ is equal to the pixel value of image I ₄ corresponding to that block.

Оценка вероятности наличия манипуляций и изображение I₅ - результат работы алгоритма. Для данного примера (фиг. 2) оценка вероятности наличия манипуляций равна 0.68302. Контрастированные изображения I₄, I₅ представлены на фиг. 6 и фиг. 7 соответственно.Estimation of the probability of the presence of manipulations and the image I ₅ - the result of the algorithm. For this example (Fig. 2), the estimate of the probability of having manipulations is 0.68302. Contrasted images I ₄ , I ₅ are shown in FIG. 6 and FIG. 7 respectively.

Таким образом, вышеизложенный метод детектирования и локализации фальсифицированной области в JPEG-изображениях позволяет выявлять некоторые случаи фальсификации данных на изображении без привязки к его содержимому и обладает потенциальной возможностью применения для проверки подлинности изображений документов различных видов.Thus, the above method for detecting and localizing a falsified area in JPEG images makes it possible to detect some cases of falsification of data in an image without reference to its content and has the potential to be used to verify the authenticity of images of documents of various types.

Claims

A method for detecting and localizing a falsified area in JPEG images, which consists in considering an image in the RGB color space that has been subjected to JPEG compression, after which falsifications were performed using software and the modified image was saved in a lossless format , characterized in that the detection of a falsified area is based on the search for inconsistencies in the structure of the DCT coefficients of a JPEG image, for this:

- calculate the DCT coefficients of the image with saving information about the position of the blocks corresponding to the coefficients of 8 by 8 pixels:

complement the sides of the image up to a multiplicity of 8 black pixels;

transform the color space from RGB to YCbCr;

produce a partition of the brightness channel in the color space YCbCr (Y) into blocks of 8 by 8 pixels;

applying a discrete cosine transform to each block of 8 by 8 pixels;

blocks are discarded from consideration, in which there are pixels with saturated values (0 or 255) in any RGB channel and blocks in which the brightness (Y) in YCbCr at the moment of transition from RBG to YCbCr went beyond the range [0, 1];

- the quantization matrix is estimated by the calculated DCT coefficients, while the image is divided into several blocks, then for each of the 64 frequencies in each block the quantization step is estimated:

calculate the factorial histogram h, h(x) - the number of DCT coefficients that are divided by x without a remainder;

normalize h by the total number of DCT coefficients;