EP1971962A2 - Method for identifying marked images based at least in part on frequency domain coefficient differences - Google Patents
Method for identifying marked images based at least in part on frequency domain coefficient differencesInfo
- Publication number
- EP1971962A2 EP1971962A2 EP06718464A EP06718464A EP1971962A2 EP 1971962 A2 EP1971962 A2 EP 1971962A2 EP 06718464 A EP06718464 A EP 06718464A EP 06718464 A EP06718464 A EP 06718464A EP 1971962 A2 EP1971962 A2 EP 1971962A2
- Authority
- EP
- European Patent Office
- Prior art keywords
- coefficient difference
- image
- analysis
- array
- trained
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N1/32101—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N1/32144—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title embedded in the image data, i.e. enclosed or integrated in the image, e.g. watermark, super-imposed logo or stamp
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/80—Recognising image objects characterised by unique random patterns
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N1/00—Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
- H04N1/32—Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
- H04N1/32101—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
- H04N1/32144—Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title embedded in the image data, i.e. enclosed or integrated in the image, e.g. watermark, super-imposed logo or stamp
- H04N1/32149—Methods relating to embedding, encoding, decoding, detection or retrieval operations
- H04N1/32154—Transform domain methods
Definitions
- This application is related to classifying or identifying content, such as marked images, for example.
- FIG. 1 is a schematic diagram illustrating one embodiment of a portion of a frequency domain coefficient 2-D array
- FIG. 2A-D are schematic diagrams illustrating one embodiment of a technique to generate frequency domain coefficient array differences
- FIGs. 3A and B are plots illustrating the distribution of coefficient array differences for a set of images
- FIG. 4 is a schematic diagram illustrating an embodiment for forming a one-step transition probability matrix, such as to characterize a Markov process
- FIG. 5 is a block diagram illustrating one embodiment of generating features.
- one embodiment described herein includes a method based at least in part on statistical moments derived at least in part from an image 2-D array and a JPEG 2-D array.
- a first order histogram and/or a second order histogram may be employed, although claimed subject matter is not limited in scope in this respect.
- higher order histograms may be utilized in other embodiments, for example.
- moments of 2-D characteristic functions are also used, although, again, other embodiments are not limited in this respect. For example, higher order moments may be employed.
- OutGuess embeds the to-be-hidden data using redundancy of the cover image.
- the cover image refers to the content without the hidden data embedded.
- OutGuess attempts to preserve statistics based at least in part on the BDCT histogram.
- OutGuess identifies redundant BDCT coefficients and embeds data into these coefficients to reduce effects from data embedding. Furthermore, it adjusts coefficients in which data has not been embedded to attempt to preserve the original BDCT histogram.
- F5 developed from Jsteg, F3, and F4, employs the following techniques: straddling and matrix coding. Straddling scatters the message as uniformly distributed as possible over a cover image. Matrix coding tends to improve embedding efficiency (defined here as the number of embedded bits per change of the BDCT coefficient). MB embedding tries to make the embedded data correlated to the cover medium.
- This is implemented by splitting the cover medium into two parts, modeling the parameter of the distribution of the second part given the first part, encoding the second part by using the model and to-be-embedded message, and then combining the two parts to form the stego medium.
- Cauchy distribution is used to model the JPEG BDCT mode histogram and the embedding attempts to keep the lower precision histogram of the BDCT modes unchanged.
- Farid A universal steganalysis method using higher order statistics has been proposed by Farid. See H. Farid, "Detecting hidden messages using higher-order statis-tical models", International Conference on Image Processing, Rochester, NY, USA, 2002. (hereinafter, "Farid")
- Quadrature mirror filters are used to decompose a test image into wavelet subbands.
- the higher order statistics are calculated from wavelet coefficients of high-frequency subbands to form a group of features.
- Another group of features is similarly.formulated from the prediction errors of wavelet coefficients of high-frequency subband.
- this method uses a Markov chain along a horizontal direction and, thus, this approach does not necessarily reflect the 2-D nature of a digital image.
- JPEG 2-D arrays are formed based at least in part on JPEG quantized block DCT coefficients.
- difference JPEG 2-D arrays may be formed along horizontal, vertical and diagonal directions for this particular embodiment and a Markov process may be applied to model these difference JPEG 2-D arrays so as to utilize second order statistics for steganalysis.
- a thresholding technique may be applied to reduce the dimensionality of transition probability matrices, thus making the computational complexity of the scheme more manageable.
- steganalysis is considered as a task of two-class pattern recognition. That is, a given image may be classified as either a stego image (with hidden data) or as a non-stego image (without hidden data).
- a JPEG 2-D array is formed.
- a difference JPEG 2-D array along different directions is formed.
- a transition probability matrix may be constructed to characterize the Markov process. Features may then be derived from this transition probability matrix.
- the so- called one-step transition probability matrix is employed here for reduced computational complexity, although claimed subject matter is not limited in scope in this respect. For example, more complex transition probability matrices may be employed in other embodiments.
- a thresholding technique is also applied, as described in more detail below.
- features are to be generated from a block DCT representation of an image; however, claimed subject matter is not limited in scope in this respect.
- other frequency domain representations of an image may be employed. Nonetheless, for this particular embodiment, it is desirable to examine the properties of JPEG BDCT coefficients.
- this 2-D array For a given image, consider a 2-D array comprising 8x8 block DCT coefficients which have been quantized with a JPEG quantization table, but not zig-zag scanned, run-length coded and Huffman coded. That is, this 2-D array has the same size as the given image with a given 8x8 block filled up with the corresponding JPEG quantized 8x8 block DCT coefficients.
- this resultant 2-D array is referred to as a JPEG 2-D array.
- the features for this particular embodiment are to be formed from a JPEG 2-D array.
- JPEG BDCT quantized coefficients may be either positive, or negative, or zero.
- BDCT coefficients in general do not obey a Gaussian distribution; however, these coefficients are not statistically independent of each other necessary.
- the magnitude of the non-zero BDCT coefficients may be correlated along the zigzag scan order, for example.
- a correlation may exist among absolute values of the BDCT coefficients along horizontal, vertical and diagonal directions.
- Fig. 3 shown below That is, the difference of the absolute values of two immediately (horizontally in Figure 3) neighboring BDCT coefficients are highly concentrated around 0, having a Laplacian-like distribution.
- a similar observation may be made along vertical and diagonal directions.
- this particular embodiment may exploit this aspect of the coefficients, although, of course, claimed subject matter is not limited in scope in this respect.
- a disturbance introduced by data embedding manifests itself more apparently in a prediction-error image than in an original image.
- difference arrays may be generated as follows:
- the distribution of elements of the above- described difference arrays may be Laplacian-like. Most of the difference values are close to zero.
- an image set comprising 7560 JPEG images with quality factors ranging from 70 to 90 was accumulated.
- the arithmetic average of the histograms of the horizontal difference JPEG 2-D arrays generated from this JPEG image set and the histogram of the horizontal difference JPEG 2-D array generated from a randomly selected image from this set of images are shown in Figure 3 (a) and (b), respectively. From this figure, most elements in the horizontal difference JPEG 2-D arrays fall into the interval [-T, T] as long as T is large enough.
- * 91.99% is the mean, meaning that on statistic average 91.99% of the elements of horizontal difference arrays generated from the image set fall into the range [-4, 4].
- the standard deviation is 2.836%.
- a difference JPEG 2-D array is characterized by using a Markov random process.
- a transition probability matrix may be used to characterize the Markov process.
- a one-step transition probability matrix is employed for this embodiment, as shown in Fig. 4, although claimed subject matter is not limited in scope in this respect.
- a thresholding technique may also be employed, although claimed subject matter is not limited in scope in this respect.
- a threshold value here T.
- T a threshold value
- those elements in a difference JPEG 2-D array whose value falls into ⁇ - T, -T+1 , ..., -1 , 0, 1 , ... , T- 1 , T ⁇ is considered. If an element has a value either larger than T or smaller than -T, it will be represented by T or -T correspondingly.
- This procedure results a transition probability matrix of dimensionality (2T+1) ⁇ (2T+1).
- a threshold level may vary. Nonetheless, for this embodiment, the elements of these four matrixes associated with horizontal, vertical, main diagonal and minor diagonal difference JPEG 2-D arrays are given by:
- (2T+1) ⁇ (2T+1) elements are obtained for a transition probability matrix.
- 4 ⁇ (2T+1) ⁇ (2T+1) elements are produced.
- these may be employed as features for steganalysis.
- 4 ⁇ (2T+1) ⁇ (2T+1) feature vectors have been produced for steganaysis for this particular embodiment.
- T in this example is set to 4, although claimed subject matter is not limited in scope in this respect.
- this element if an element has an absolute value larger than 4, this element is reassigned an absolute value 4 without sign change.
- Feature construction for this particular embodiment is illustrated by a block diagram shown in Fig. 5.
- analysis of variance process refers to processes or techniques that may be applied so that differences attributable to statistical variation are sufficiently distinguished from differences attributable to non-statistical variation to correlate, segment, classify, analyze or otherwise characterize data based at least in part on application of such processes or techniques.
- artificial intelligence techniques and processes including pattern recognition; neutral networks; genetic processes; heuristics; and support vector machines (SVM).
- SVM may, for example, be employed to handle linear and non-linear cases or situations.
- an SVM classifier may be applied to search for a hyper-plane that separates a positive pattern from a negative pattern.
- a support vector machine (SVM) is used as a classifier.
- SVM is based at least in part on the idea of hyperplane classifier. It uses Lagrangian multipliers to find a separation hyperplane which distinguishes the positive pattern from the negative pattern. If the feature vectors are one-dimensional (1-D), the separation hyperplane reduces to a point on the number axis. SVM can handle both linear separable and no-linear separable cases.
- SVM support vector machine
- a selection ⁇ from the data may be classified using w
- a "learning machine” may map input feature vectors to a higher dimensional space in which a linear hyper-plane may potentially be located.
- a transformation from nonlinear feature space to linear higher dimensional space may be performed using a kernel function.
- kernels include: linear, polynomial, radial basis function and sigmoid.
- a linear kernel may be employed in connection with a linear SVM process, for example.
- other kernels may be employed in connection with a non-linear SVM process.
- a polynomial kernel was employed.
- An image database comprising 7,560 JPEG images with quality factors ranging from 70 to 90 was employed.
- One third of these images were an essentially random set of pictures taken at different times and places with different digital cameras. The other two thirds were downloaded from the Internet.
- Each image was cropped (central portion) to the size of either 768x512 or 512x768.
- chrominance components of the images are set to be zero while luminance coefficients are unaltered before data embedding.
- one embodiment may comprise one or more articles, such as a
- This storage media such as, one or more CD-ROMs and/or disks, for example, may have stored thereon instructions, that when executed by a system, such as a computer system, computing platform, or other system, for example, may result in an embodiment of a method in accordance with claimed subject matter being executed, such as
- a computing platform may include one or more processing units or processors, one or more input/output devices, such as a display, a keyboard and/or a mouse, and/or one or more memories, such as static random access memory, dynamic random access memory, flash memory, and/or a hard drive.
- a display may be employed to display one or more queries, such as those that may be interrelated, and or one or more tree expressions, although, again, claimed subject matter is not limited in scope to this example.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Image Processing (AREA)
- Editing Of Facsimile Originals (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/US2006/001393 WO2007086833A2 (en) | 2006-01-13 | 2006-01-13 | Method for identifying marked images based at least in part on frequency domain coefficient differences |
Publications (2)
Publication Number | Publication Date |
---|---|
EP1971962A2 true EP1971962A2 (en) | 2008-09-24 |
EP1971962A4 EP1971962A4 (en) | 2011-04-06 |
Family
ID=38309629
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP06718464A Withdrawn EP1971962A4 (en) | 2006-01-13 | 2006-01-13 | Method for identifying marked images based at least in part on frequency domain coefficient differences |
Country Status (3)
Country | Link |
---|---|
EP (1) | EP1971962A4 (en) |
JP (1) | JP4920045B2 (en) |
WO (1) | WO2007086833A2 (en) |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR0132894B1 (en) * | 1992-03-13 | 1998-10-01 | 강진구 | Image compression coding and decoding method and apparatus |
US20020183984A1 (en) * | 2001-06-05 | 2002-12-05 | Yining Deng | Modular intelligent multimedia analysis system |
-
2006
- 2006-01-13 WO PCT/US2006/001393 patent/WO2007086833A2/en active Application Filing
- 2006-01-13 JP JP2008550284A patent/JP4920045B2/en not_active Expired - Fee Related
- 2006-01-13 EP EP06718464A patent/EP1971962A4/en not_active Withdrawn
Non-Patent Citations (4)
Title |
---|
FARID H ET AL: "How Realistic is Photorealistic?", IEEE TRANSACTIONS ON SIGNAL PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, US, vol. 53, no. 2, 1 February 2005 (2005-02-01), pages 845-850, XP011125221, ISSN: 1053-587X, DOI: DOI:10.1109/TSP.2004.839896 * |
FARID H: "Detecting steganographic messages in digital images", 20010101, 1 January 2001 (2001-01-01), pages 1-9, XP002468134, * |
GUO-SHIANG LIN ET AL: "Data hiding domain classification for blind image steganalysis", 2004 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME): JUNE 27 - 30, 2004, TAIPEI, TAIWAN, PISCATAWAY, NJ : IEEE OPERATIONS CENTER, US, vol. 2, 27 June 2004 (2004-06-27), pages 907-910, XP010770967, DOI: DOI:10.1109/ICME.2004.1394348 ISBN: 978-0-7803-8603-7 * |
See also references of WO2007086833A2 * |
Also Published As
Publication number | Publication date |
---|---|
WO2007086833A2 (en) | 2007-08-02 |
EP1971962A4 (en) | 2011-04-06 |
JP2009527930A (en) | 2009-07-30 |
JP4920045B2 (en) | 2012-04-18 |
WO2007086833A3 (en) | 2009-05-28 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7925080B2 (en) | Method for identifying marked images based at least in part on frequency domain coefficient differences | |
US8103054B2 (en) | Method for identifying marked images using statistical moments based at least in part on a JPEG array | |
Zhang et al. | Robust invisible video watermarking with attention | |
Shi et al. | A Markov process based approach to effective attacking JPEG steganography | |
Zou et al. | Steganalysis based on Markov model of thresholded prediction-error image | |
Fridrich et al. | Practical steganalysis of digital images: state of the art | |
Luo et al. | DVMark: a deep multiscale framework for video watermarking | |
Fu et al. | JPEG steganalysis using empirical transition matrix in block DCT domain | |
US8224017B2 (en) | Method for identifying marked content | |
WO2008089377A2 (en) | A method and apparatus for steganalysis for texture images | |
Chen et al. | Statistical moments based universal steganalysis using JPEG 2-D array and 2-D characteristic function | |
Tzschoppe et al. | Steganographic system based on higher-order statistics | |
Shankar et al. | Result Analysis of Cross-Validation on low embedding Feature-based Blind Steganalysis of 25 percent on JPEG images using SVM | |
Lafferty et al. | Texture-based steganalysis: results for color images | |
Abolghasemi et al. | LSB data hiding detection based on gray level co-occurrence matrix (GLCM) | |
Malik | Steganalysis of qim steganography using irregularity measure | |
WO2007086833A2 (en) | Method for identifying marked images based at least in part on frequency domain coefficient differences | |
Lakshmi | A novel steganalytic algorithm based on III level DWT with energy as feature | |
JP4610653B2 (en) | Method for identifying marked images using statistical moments based at least in part on JPEG sequences | |
Malik et al. | Nonparametric steganalysis of QIM data hiding using approximate entropy | |
Quan | JPEG Steganalysis Based on Local Dimension Estimation | |
Xu et al. | Passive steganalysis using image quality metrics and multi-class support vector machine | |
Nesakumari et al. | Integrated Normalized Content System for Efficient Watermarking | |
Davidson et al. | Double-compressed JPEG detection in a steganalysis system | |
Sadr et al. | Robustness enhancement of content-based watermarks using entropy masking effect |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20080717 |
|
AK | Designated contracting states |
Kind code of ref document: A2 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LI LT LU LV MC NL PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA HR MK YU |
|
R17D | Deferred search report published (corrected) |
Effective date: 20090528 |
|
A4 | Supplementary search report drawn up and despatched |
Effective date: 20110303 |
|
17Q | First examination report despatched |
Effective date: 20110316 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
|
DAX | Request for extension of the european patent (deleted) | ||
18D | Application deemed to be withdrawn |
Effective date: 20120221 |