GB2509056A - Encoding Metadata Describing Preceding Frames / Fields by Modifying Data Defining Pixel Values of a Current Frame / Field - Google Patents

Info

Publication number
GB2509056A
Authority
GB
United Kingdom
Prior art keywords
field
frame
metadata
pixel
spatial
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB1222397.0A
Other versions
GB201222397D0 (en)
GB2509056B (en)
Inventor
Jonathan Diggins
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Snell Advanced Media Ltd
Original Assignee
Snell Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Snell Ltd
Priority to GB1222397.0A (GB2509056B)
Publication of GB201222397D0
Priority to US14/103,068 (US9330428B2)
Publication of GB2509056A
Priority to US15/091,704 (US9852489B2)
Application granted
Publication of GB2509056B
Expired - Fee Related
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/46 Embedding additional information in the video signal during the compression process
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00 General purpose image data processing
    • G06T1/0021 Image watermarking
    • G06T1/005 Robust watermarking, e.g. average attack or collusion attack resistant
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T1/00 General purpose image data processing
    • G06T1/0021 Image watermarking
    • G06T1/0028 Adaptive watermarking, e.g. Human Visual System [HVS]-based watermarking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2201/00 General purpose image data processing
    • G06T2201/005 Image watermarking
    • G06T2201/0061 Embedding of the watermark in each block of the image, e.g. segmented watermarking
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2201/00 General purpose image data processing
    • G06T2201/005 Image watermarking
    • G06T2201/0065 Extraction of an embedded watermark; Reliable detection

Abstract

A video data stream is modified to encode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame by modification of data defining the pixel values of that current field or frame. An encoder (e.g. watermark encoder) receives a current video image together with current metadata associated with the current image. A metadata delay also makes available to the encoder delayed metadata associated respectively with four or more of the preceding fields or frames. The encoder then modifies pixel values (e.g. LSBs) of the current image to encode not only the current metadata but also the delayed metadata. At a decoder, if metadata for the current image is corrupted or missing, it can be recovered from one of the succeeding images.

Description

METHOD AND APPARATUS FOR MODIFYING A VIDEO STREAM
FIELD OF INVENTION
This invention concerns the transmission of metadata related to an image sequence.
BACKGROUND OF THE INVENTION
It is very often necessary to associate some data with a motion-image stream. In television so-called 'metadata' is frequently associated with one or more fields or frames of analogue or digital video. Typically the data associated with a field or frame describes parameters of that field or frame. Examples of parameters that can be carried by metadata include: identification of the video; timecode or frame number; the intended display aspect-ratio; or, a 'signature' of the field or frame that enables comparison with fields or frames in another video stream.
The association between the metadata and its respective field or frame is frequently achieved by carrying the metadata in the blanking intervals of television signals, but, because these intervals are discarded by some video processing equipment, the metadata can be lost, or delayed differently from the video itself. This difficulty can be avoided by encoding the metadata within the active area of the video frames; this technique is known as 'watermarking'. Typical watermarks are intended to be imperceptible to a viewer; known watermarking algorithms include methods that modify the frequency spectrum of the video, and methods that add low amplitude data signals to the video signal.
Although including the metadata within the image frame ensures that the association between the metadata and the video data cannot be lost, modern video processing techniques, especially those which combine spatially-separated and/or temporally-separated pixels, can distort the metadata. As typical metadata carried by watermarking only causes small changes to the values of pixels (because of the requirement for low visibility of the watermark) it may be impossible to decode the metadata after the video has been edited or re-formatted for a different distribution channel.
There is thus a need to carry metadata associated with fields or frames of a motion-image stream in a robust manner that prevents the metadata from being lost or damaged by video processing.
SUMMARY OF THE INVENTION
The invention consists in a method and apparatus for modifying a video data stream to encode a plurality and preferably at least four sets of metadata that respectively describe a plurality and preferably at least four video fields or frames preceding a current field or frame by modification of data defining the pixel values of that current field or frame.
In a preferred embodiment every field or frame is modified to encode metadata associated with itself and at least four immediately preceding fields or frames.
Advantageously, pixel activity is measured preferably by rectification of the output of a spatial filter and pixel values are only modified when activity measured within a region surrounding the pixel to be modified exceeds a threshold.
Suitably, identical modifications are applied to (preferably the low-significance bits defining the values of) all the pixels of a contiguous set of pixels comprising a spatial region within a field or frame so as to encode one or more bits of metadata within that spatial region of that field or frame.
In a preferred embodiment the two least-significant bits of a ten-bit value are modified.
In some embodiments low-significance bits defining the values of a contiguous set of pixels in a spatial region within a field or frame are modified so as to encode a fixed data value within that region that aids the recognition of that fixed data value when encoded in a different spatial region within the said field or frame.
Advantageously, identical data is encoded in a plurality of spatial regions within the same field or frame.
In a further aspect the invention consists in a method and apparatus for processing a video data stream to decode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame in which decoded metadata describing a previous field or frame is decoded from a later field or frame and associated with a stored or delayed copy of the said preceding field or frame.
Preferably the said decoding comprises the steps of:
* measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a first spatial region within a field or frame
* measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a second spatial region within the said field or frame
* determining an encoded metadata parameter from a comparison of the frequencies of occurrence of combinations of low-significance pixel-value bits in the said first and second spatial regions of the said field or frame.
BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows a block diagram of a metadata encoding process according to an example of the invention.
Figure 2 shows a block diagram of a metadata decoding process according to an example of the invention.
Figure 3 shows a block diagram of a system for encoding metadata in the pixels of an image.
Figure 4 shows the spatial division of an image into tiles and quadrants.
Figure 5 shows an allocation of data values to tiles within an image quadrant.
Figure 6 shows a block diagram of a system for decoding metadata encoded according to the system of Figure 3.
DETAILED DESCRIPTION OF THE INVENTION
A common video process that is particularly damaging to watermarks is standards conversion, particularly frame-rate conversion. A typical frame-rate converter generates new fields or frames from input fields or frames by means of a temporal filter. Usually the filter forms each new field or frame from a weighted sum of input fields or frames. The weights are determined according to a filter aperture function that defines the relationship between the weight to be applied, and the time of the output field or frame with respect to the timing of the contributing input fields or frames. This time relationship is usually expressed as a temporal phase value. Typical frame-rate converters working between interlaced video formats combine four interlaced input fields to generate an output field. That is to say, the filter aperture is four fields wide.
The inventor has appreciated that, in a field-rate conversion process, the temporal phase of the conversion filter will vary cyclically, so that, at regular intervals, one input field or frame will provide the dominant contribution to the converter's output. For example, when the time of a particular input interlaced field coincides with the time of an output interlaced field there will be substantially zero contribution to the output from earlier or later input fields. This means that the impairment of a watermark due to standards conversion will vary cyclically, and substantially unimpaired data will be available at regular intervals.
The time period of the cyclic variation corresponds to the frequency difference between input and output field rates. For example, when converting between nominal 60 Hz and nominal 50 Hz field rates, the temporal phase falls to a minimum value every six 60 Hz fields, or every five 50 Hz fields. And, when converting between 24 Hz frames and 25 Hz frames, the temporal phase falls to a minimum value every twenty-four 24 Hz frames, or every twenty-five 25 Hz frames. Typically the input and output rates of a video conversion process are not precisely 'locked' together and so the minimum temporal phase value that occurs at the difference frequency will itself vary. There is no certainty that phase coincidence between input and output will occur. However, lower minimum values will occur when the frequency difference is small.
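The cycle lengths quoted above follow from the ratio of the two rates. The following is a minimal sketch, assuming exactly locked nominal rates (which, as noted, real converters do not guarantee); the function name `phase_cycle` is purely illustrative.

```python
from fractions import Fraction


def phase_cycle(input_rate_hz, output_rate_hz):
    """Return (input_count, output_count): the numbers of input and output
    fields or frames that span one cycle of the temporal phase.

    Assumes both rates are exact and locked; pass integers or Fractions.
    """
    ratio = Fraction(input_rate_hz) / Fraction(output_rate_hz)
    # ratio = p/q in lowest terms: p input periods last as long as q output periods
    return ratio.numerator, ratio.denominator


print(phase_cycle(60, 50))
print(phase_cycle(24, 25))
```

For 60 Hz to 50 Hz this gives six input fields per five output fields, and for 24 Hz to 25 Hz it gives twenty-four input frames per twenty-five output frames, matching the figures in the text.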
Figure 1 shows a metadata encoding system according to an example of the invention.
Video fields (1) and associated metadata packets (2), comprising metadata applicable to or describing the respective video field, are input to a watermark encoder (3). The metadata packets (2) may be carried in the blanking intervals of the video fields (1), or the association between respective fields and metadata packets may be achieved by some other means, for example a defined time relationship. The metadata packets (2) are also input to an N-stage delay (4) that provides a set of N delayed metadata outputs (5) which correspond to the N metadata packets respectively associated with the N previous fields. As each new video field (1) is input to the system, a corresponding current-field metadata packet (2), and set of N delayed metadata packets (5) is input to the watermark encoder (3).
The watermark encoder (3) combines the N + 1 metadata packets with the video fields (1) to generate watermarked video fields (6), which are output for storage or distribution.
The combination process in the watermark encoder (3) makes minimal change to the subjective appearance of the video material so that the watermark is substantially invisible when the video fields are displayed. A suitable combination process is described below with reference to Figure 3.
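The N-stage delay (4) of Figure 1 can be modelled in software as a fixed-length queue. The sketch below is one possible model, not the described hardware; the class name `MetadataDelay` and the use of `None` for fields before the start of the stream are assumptions made for illustration.

```python
from collections import deque


class MetadataDelay:
    """N-stage metadata delay: supplies the packets of the N previous fields."""

    def __init__(self, n_stages):
        # Newest packet sits at index 0; None marks fields before the stream began.
        self.history = deque([None] * n_stages, maxlen=n_stages)

    def step(self, current_packet):
        """Return the N + 1 packets to encode in the current field, ordered
        [current, previous, ..., Nth previous], then store the current packet."""
        packets = [current_packet] + list(self.history)
        self.history.appendleft(current_packet)  # oldest packet drops off the far end
        return packets


delay = MetadataDelay(4)
for t in range(6):
    print(t, delay.step(f"m{t}"))
```

Each call to `step` corresponds to one new video field arriving at the watermark encoder (3).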
An exemplary method of extracting the metadata from the watermarked video fields (6) will now be described with reference to Figure 2. Input watermarked video fields (20) are passed to a watermark decoder (21) and an N-field video delay (22). The watermark decoder (21) extracts the N + 1 metadata packets from the watermarked video fields (20) and inputs them, in parallel, into an (N + 1)-stage shift register (23). Output metadata packets from the system are delivered serially from the shift register's (N + 1)th output (28).
The watermark decoder (21) also outputs a Field Sync signal (24), which is asserted once per input field; and, a Valid Data Flag (25), which is asserted when all of the N + 1 metadata packets decoded from the current input field are considered to be valid. Note that, as explained above, valid data may only be obtained from a regularly-occurring subset of the input fields (20).
When the Valid Data Flag (25) is asserted, the decoded metadata packet (26) corresponding to the current field is loaded into the first stage of the shift-register (23) (which is furthest from its serial output (28)). And, the N metadata packets (27) corresponding to earlier fields (but decoded from the current field) are loaded into the remaining stages of the shift-register (23) so that the packet corresponding to the earliest field is loaded into the (N + 1)th (output) stage, and the metadata packet corresponding to the field immediately preceding the current field is loaded into the second stage. Once per input field, after the loading process (if any), the contents of the shift register (23) are shifted one stage towards the (N + 1)th (output) stage; and, the metadata packet corresponding to the Nth field before the current field is output at the serial output (28) from the shift-register (23).
Provided that a set of (N + 1) metadata packets can be decoded at least once every (N + 1) fields, a metadata packet (28) will be output from the shift register (23) at the same time as the corresponding video field (29) is output from the N-field video delay (22). Thus metadata for all fields more than (N + 1) fields before the current field is available, even if data can only be decoded intermittently.
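The load-and-shift behaviour of the shift register (23) can be checked with a short simulation. This is a sketch under stated assumptions: the function name `run_decoder`, the list representation of the register and the use of `None` for an invalid field are illustrative choices, not part of the described apparatus.

```python
def run_decoder(decoded_fields, n):
    """Simulate the (N + 1)-stage shift register of Figure 2.

    decoded_fields: one entry per input field; either None (Valid Data Flag
    not asserted) or a list of N + 1 packets [current, previous, ..., Nth previous].
    Returns the packet seen at the serial output for each input field.
    """
    register = [None] * (n + 1)  # register[0] = first stage, register[n] = output stage
    outputs = []
    for packets in decoded_fields:
        if packets is not None:           # parallel load only when the data is valid
            register = list(packets)
        outputs.append(register[n])       # packet for the Nth field before the current one
        register = [None] + register[:n]  # shift one stage towards the output
    return outputs


N = 4
# Assume valid data can be decoded only from every fifth field (fields 4, 9, ...).
fields = [[f"m{t - k}" for k in range(N + 1)] if t % 5 == 4 else None
          for t in range(12)]
print(run_decoder(fields, N))
```

In this run a valid set of packets arrives once every N + 1 fields, and from field N onwards the output at field t is the packet for field t - N, which is co-timed with the output of the N-field video delay (22).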
The delayed metadata (28) at the output of the shift register (23) is associated with the video fields (29) at the output of the video delay (22) by virtue of co-timing. If necessary the metadata packets can be inserted into the blanking intervals of the respective co-timed video fields, or some other method of association may be used to maintain the association in subsequent processing.
A suitable method for carrying metadata as a watermark within an image field represented by a set of pixel values will now be described. The values of low-significance bits of the pixel values are modified in dependence on the metadata. The image area is divided into a regular array of 'tiles' and within each tile all the pixels are modified in the same way. Preferably however, the modification is restricted to detailed areas so as to reduce the visibility of the watermark.
A system according to these principles is illustrated in Figure 3. Input video (300), which comprises pixel values, and timing references that enable the spatial positions of pixels within each frame to be determined, is input to the system. Areas within each field that are suitable for watermark coding are identified by measuring an 'activity value' dependent on local spatial-frequency energy. Spatial filters (301) and (302) isolate the lower-middle horizontal frequency components, and the lower-middle vertical frequency components, respectively. The respective filter outputs are rectified in rectifiers (303) and (304) to produce a horizontal activity measure, and a vertical activity measure; the greater of these two measures is selected by a maximum value selector (305), and passed to a threshold comparator (306). If the selected activity measure is above a threshold, an enabling signal (307) is generated which indicates that the current pixel is suitable for modification by watermark data. Typically the filters (301) and (302) have a band-pass response centred around one quarter of the respective Nyquist frequency for the image horizontal and vertical sampling frequencies. Coefficient values for a suitable FIR filter are:

Sample position:  -5  -4  -3  -2  -1   0  +1  +2  +3  +4  +5
Weight:           -1  -2  -1   0   2   4   2   0  -1  -2  -1

A suitable activity threshold value for processing a luminance signal is around one fortieth of the peak white level.
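The activity test of blocks (301) to (306) can be sketched in software as follows. Several details here are assumptions, since the text does not specify them: the filter output is normalised by the sum of the positive weights (8), edge samples are repeated at the picture boundary, and the threshold default takes 940 as the 10-bit peak white code. The function names are illustrative.

```python
FIR_WEIGHTS = [-1, -2, -1, 0, 2, 4, 2, 0, -1, -2, -1]  # sample positions -5 .. +5


def _filtered(samples, centre):
    """Apply the FIR filter at index `centre`, repeating edge samples (assumption)."""
    last = len(samples) - 1
    total = 0
    for offset, weight in zip(range(-5, 6), FIR_WEIGHTS):
        index = min(max(centre + offset, 0), last)
        total += weight * samples[index]
    return total / 8.0  # assumed normalisation: sum of the positive weights


def is_suitable(image, x, y, threshold=940 / 40):
    """True if the pixel at column x, row y may carry watermark data.

    image is a list of rows of luminance values; the default threshold is
    one fortieth of an assumed peak white code of 940.
    """
    row = image[y]
    column = [r[x] for r in image]
    horizontal = abs(_filtered(row, x))    # rectified horizontal filter output
    vertical = abs(_filtered(column, y))   # rectified vertical filter output
    return max(horizontal, vertical) > threshold


# A vertical edge: left half black, right half bright.
image = [[0] * 8 + [800] * 8 for _ in range(16)]
print(is_suitable(image, 8, 8), is_suitable(image, 0, 8))
```

Because the weights sum to zero, flat areas produce no activity and are left unmodified, while pixels near the edge exceed the threshold.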
As the skilled person will appreciate, the filter outputs need to be co-timed with the current pixel, and suitable compensating delays can be used to achieve this. For clarity, such delays are not shown in the Figure.
A timing reference decoder (308) extracts the timing information from the video input (300) and passes it to a tile address generator (309). The video fields are divided into spatial regions, or tiles, each of which will carry one or more bits of metadata. The position of each tile within the field is identified by a tile address. The tile address generator (309) passes the address of the tile in which the current pixel is situated to a metadata bit selector (310). Metadata (311) to be carried in the current field is input to the metadata bit selector (310), which selects the bit or bits of metadata to be carried in the current tile, and passes them to an LSB modification block (312).
The LSB modification block (312) receives the input video (300) and, if the enabling signal (307) is active, modifies low-significance bits of the value of the current pixel of the video (300) in dependence upon the metadata bit or bits selected by the metadata bit selector (310). If the enabling signal (307) is inactive, the modification block (312) makes no modification to the value of the current pixel. The modification block (312) outputs the selectively modified video data as a watermarked video output (313).
A suitable watermarking technique for 10-bit video data is to modify the two least-significant bits of the pixel value to carry one metadata bit. In this example, the modification block (312) sets the least-significant bit to zero and sets the next-most-significant bit equal to the metadata bit.
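As a minimal sketch of this bit manipulation, assuming pixel values are held as Python integers in the range 0 to 1023 (the function name `modify_pixel` is illustrative):

```python
def modify_pixel(value, metadata_bit, enabled):
    """Embed one metadata bit in the two least-significant bits of a 10-bit value."""
    if not enabled:
        return value                      # inactive enabling signal: pixel unchanged
    cleared = value & ~0b11 & 0x3FF       # zero both low-significance bits
    return cleared | (metadata_bit << 1)  # LSB stays 0; the next bit carries the metadata


print(modify_pixel(1023, 0, True), modify_pixel(1023, 1, True), modify_pixel(513, 1, False))
```

A modified pixel therefore always ends in the two-bit pattern 00 or 10, which is what the decoder's histogram analysis later looks for.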
As explained above, the image is divided into tiles, and all the qualifying pixels in a tile carry the same metadata bit (or bits in other examples). A suitable arrangement of tiles is shown in Figure 4. The image (40) is divided into 1024 equally-sized tiles, excluding the image edge regions (41), (42), (43) and (44). The horizontal and vertical tile dimensions are chosen to be a convenient multiple of the respective pixel pitches, so that every tile contains the same number of pixels. The tiles are grouped into four equally-sized quadrants (45), (46), (47) and (48). Each quadrant carries the same data so as to increase the likelihood that suitable pixels are available for coding, and to provide redundancy.
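The mapping from a pixel position to a tile can be sketched as below. The text states only that there are 1024 equal tiles in four equal quadrants; the 32 x 32 grid, the quadrant numbering and all parameter names are assumptions of this sketch.

```python
def tile_address(x, y, left, top, tile_w, tile_h, tiles_per_side=32):
    """Map pixel (x, y) to (quadrant, row, col), or None in the unused edge regions.

    left/top: pixel offsets of the tiled area; tile_w/tile_h: tile size in pixels.
    The 32 x 32 grid (1024 tiles) is an assumption of this sketch.
    """
    col, row = (x - left) // tile_w, (y - top) // tile_h
    if x < left or y < top or col >= tiles_per_side or row >= tiles_per_side:
        return None
    half = tiles_per_side // 2
    # Assumed numbering: 0 = top left, 1 = top right, 2 = bottom left, 3 = bottom right
    quadrant = (row // half) * 2 + (col // half)
    return quadrant, row % half, col % half


# Hypothetical geometry: 56 x 32 pixel tiles with a 64-pixel left and 28-line top margin.
print(tile_address(64, 28, 64, 28, 56, 32))
print(tile_address(0, 0, 64, 28, 56, 32))
```

Because the row and column within the quadrant are returned separately from the quadrant index, the same metadata bit can be looked up for corresponding tiles in all four quadrants.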
Figure 5 shows how the metadata bits are allocated to the tiles. It shows an expanded view of the top left quadrant (45) of Figure 4, and a few adjacent tiles in the top right and bottom left quadrants. Of the 256 tiles in each quadrant, 192 tiles carry data bits d1 to d192 respectively. In the top left quadrant (50), the leftmost tile in the second row (51) carries data bit d1, and the rightmost tile in the bottom row (52) carries data bit d192. The remaining 64 tiles are divided into two groups of 32 tiles; one group always carries the value one, and the other group always carries the value zero. These 64 tiles are arranged in a regular quincunx pattern of ones and zeros; see, for example, the 'one' tile (53) and the 'zero' tile (54). These tiles with constant encoded data values are used as 'reference' tiles to aid the decoding of the data, as will be explained below.
As explained above, all four quadrants of each field carry the same data. For example, as shown in Figure 5, the first tile (55) in the second row of the lower left quadrant carries metadata bit d1 equal to the data carried by the first tile (51) in the second row of the top left quadrant; and, the first tile (56) in the first row of the upper right quadrant is a reference tile encoded with the value zero which is also the case for the tile (54) in the first row of the upper left quadrant.
The data decoding process is illustrated in Figure 6. Watermarked video (60) is input to a data-carrying-pixel identification process (61), which operates to detect activity in the same way as the blocks (301) to (306) of Figure 3, and thus identifies pixels whose values have been modified to carry data. These pixels are allocated to tiles by an allocator (62) which sends the low-significance bits (the bits which can carry metadata) of the pixel values to the appropriate one of 770 pixel-value-frequency analysers (63). The allocator (62) operates in the same way as the blocks (308) and (309) of Figure 3 to determine the address of the tile in which the current pixel is situated. The values of data-carrying pixels within 'zero' reference tiles are input to a 'zero' pixel-value-frequency analyser (64); and, the values for data-carrying pixels within 'one' reference tiles are input to a 'one' pixel-value-frequency analyser (65). Pixel values for 'data' tiles are input to a respective one of 768 'data' analysers (66), so that every 'data' tile in every quadrant has a unique pixel-value-frequency analyser that analyses data-carrying pixels within that tile.
Each analyser in the set of pixel-value-frequency analysers (63) counts the occurrences of the particular combinations of values of the low-significance bits of the pixel values that it receives, and forms a histogram of the result. In the present example two low-significance bits are used to carry metadata, and so a four-bin frequency histogram is formed, representing the number of occurrences in the current field of each of the four possible two-bit values. The histograms are normalised by dividing the count values by the number of pixels contributing to the respective histogram.
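One pixel-value-frequency analyser can be sketched as a small function. The name `lsb_histogram` and the handling of a tile with no data-carrying pixels (an all-zero histogram) are assumptions for illustration.

```python
def lsb_histogram(pixel_values, n_bits=2):
    """Normalised histogram of the n_bits low-significance bits of the given pixels."""
    mask = (1 << n_bits) - 1
    bins = [0] * (1 << n_bits)
    for value in pixel_values:
        bins[value & mask] += 1
    count = len(pixel_values)
    if count == 0:
        return bins  # no data-carrying pixels: all-zero histogram (assumption)
    return [b / count for b in bins]


print(lsb_histogram([4, 8, 6, 2]))
```

With two low-significance bits the result has four bins, one for each possible two-bit value, and the bins sum to one whenever at least one pixel contributes.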
The histograms for the 'data' tiles are compared with the histograms for the 'reference' tiles in comparators (67) and (68). In each case the output from the comparison is a weighted sum of bin-value difference magnitudes, where a high weight is given to the most frequently occurring bin in the respective reference tile histogram. The difference $\Delta_{DR}$ between the histogram $D$ for a data tile and the histogram $R$ for one of the sets of reference tiles is given by the expression:

$$\Delta_{DR} = \sum_{j} W_j \times \left| D_j - R_j \right|$$

Where: $j$ is the histogram bin index; the summation is over all bins $j$ (four bins in the present example); and, $W_j$ is a weighting factor that is increased when $j$ is equal to the index of the maximum-value bin of $R$. Suitable values for the weighting factor $W_j$ are: unity for the maximum-value bin of $R$; and, 1/16 for other bins.
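The weighted difference can be sketched directly from this description. The function name is illustrative, and taking the first bin when the reference histogram has two equal maxima is an assumption, since the text does not cover ties.

```python
def histogram_difference(data_hist, ref_hist, peak_weight=1.0, other_weight=1 / 16):
    """Weighted sum of bin-value difference magnitudes between a data-tile
    histogram D and a reference histogram R."""
    # Index of the maximum-value bin of R (first one wins on a tie: assumption)
    peak_bin = max(range(len(ref_hist)), key=lambda j: ref_hist[j])
    total = 0.0
    for j, (d, r) in enumerate(zip(data_hist, ref_hist)):
        weight = peak_weight if j == peak_bin else other_weight
        total += weight * abs(d - r)
    return total


zero_reference = [1.0, 0.0, 0.0, 0.0]  # idealised 'zero' tiles: all pixels end in 00
print(histogram_difference([1.0, 0.0, 0.0, 0.0], zero_reference))
print(histogram_difference([0.0, 0.0, 1.0, 0.0], zero_reference))
```

A data tile whose pixels match the reference pattern gives a difference of zero, while a tile carrying the opposite value gives a large difference dominated by the heavily weighted peak bin.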
The two difference values for each of the 768 'data' tiles are compared in a bit decoder (69) to obtain a binary bit value and an associated 'error' value for each of the metadata bits d1 to d192. The logic is as follows: If the difference from the 'zero' reference histogram is less than the difference from the 'one' reference histogram, then the bit value is zero, and the error value is equal to the difference from the 'zero' reference histogram.
If the difference from the 'one' reference histogram is less than or equal to the difference from the 'zero' reference histogram, then the bit value is one, and the error value is equal to the difference from the 'one' reference histogram.
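The decision made by the bit decoder (69) for a single data tile can be sketched as follows, reading the rule as "the closer reference wins" and taking a tie as one. The function name is illustrative.

```python
def decode_bit(diff_from_zero, diff_from_one):
    """Return (bit, error) for one data tile from its two histogram differences."""
    if diff_from_zero < diff_from_one:
        return 0, diff_from_zero  # closer to the 'zero' reference tiles
    return 1, diff_from_one       # closer to the 'one' reference tiles; ties give one


print(decode_bit(0.1, 0.9), decode_bit(0.9, 0.2))
```

The error value returned alongside the bit is the smaller of the two differences, so a low error indicates a tile that closely matches one of the reference patterns.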
The final stage of the data decoding process is to combine (70) the data from the four quadrants to obtain a value for each of the 192 metadata bits. This is done by a majority vote of error-weighted bit values. For example, if the sum of the error values for a decoded value of zero is less than the sum of the error values for a decoded value of one, then the bit value is set to zero. The result of this process is a set of 192 bit values for the metadata decoded from the current field. These are output at terminal (72).
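The text gives only one example of the error-weighted vote, so the sketch below is one literal reading of that example rather than a definitive rule. Treating a unanimous vote as decided without comparison is an added assumption, and the function name is illustrative.

```python
def combine_quadrants(votes):
    """Combine per-quadrant decisions for one metadata bit.

    votes: list of (bit, error) pairs, one per quadrant.
    """
    if not votes:
        raise ValueError("at least one quadrant decision is required")
    zero_errors = [error for bit, error in votes if bit == 0]
    one_errors = [error for bit, error in votes if bit == 1]
    if not one_errors:
        return 0  # unanimous zero (assumption: no comparison needed)
    if not zero_errors:
        return 1  # unanimous one (assumption)
    # Literal reading of the example: the value with the smaller summed error wins
    return 0 if sum(zero_errors) < sum(one_errors) else 1


print(combine_quadrants([(0, 0.1), (0, 0.1), (0, 0.1), (1, 0.9)]))
```

Other weightings are possible, for example giving each vote a weight that decreases with its error; the text leaves this choice open.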
It will often be helpful for the metadata to be protected from errors by means of an error detecting and correcting code such as one of the forms of the well-known Hamming code. For example, the 192 data bits can be organised as 16 12-bit data words, each comprising seven data bits and five error detection and correction bits, giving a net data rate of 112 bits (or 14 bytes) per video field.
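The capacity figures can be checked with a few lines of arithmetic. This sketch only counts bits and does not implement the error-correcting code itself; the function name is illustrative.

```python
def net_payload_bits(total_bits=192, word_bits=12, data_bits_per_word=7):
    """Return (words, net_bits) when total_bits are split into protected code words."""
    words, remainder = divmod(total_bits, word_bits)
    if remainder:
        raise ValueError("total_bits must be a whole number of code words")
    return words, words * data_bits_per_word


words, net_bits = net_payload_bits()
print(words, net_bits, net_bits // 8)
```

With the default values this reproduces the 16 words, 112 net bits and 14 bytes per field given in the text.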
Other variations in the implementation of the invention will be apparent to the skilled person from the above description. For example the video may be organised as frames rather than interlaced fields, so that pixels are allocated to tiles that are spatial regions within each frame. Different tests may be used to determine whether pixels are modified to carry metadata, or all pixels may carry data. Fewer or more than two low-significance bits of the pixel values may be modified, and any number of the modified data bits may be set equal to metadata bits.
The number of fields or frames of metadata, associated with previous fields or frames, that are encoded into a current frame may be chosen independently of any anticipated standards conversion process.
Methods different from those described with reference to Figures 3 to 6 may be used to encode metadata associated with a previous field or frame into a current field or frame.
For example the compression coding parameters of different regions within an image may be modified in dependence on metadata and that metadata may be recovered in conjunction with a compression decoding process.
Examples of embodiments of the invention have been described in terms of streaming sequential processes operating in real time. As the skilled person will appreciate, equivalent processes may be performed using other techniques, including the use of programmable devices or computers, and these processes can operate at rates unrelated to the rate of display or acquisition of the pixels of the video images.
The principles of the invention are equally applicable to stored video data sets or video files where the concepts of 'stream' or 'sequence' are represented by storage address sequence or position within a file.

Claims (20)

  1. A method of modifying a video data stream to encode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame by modification of data defining the pixel values of that current field or frame.
  2. A method according to Claim 1 in which every field or frame is modified to encode metadata associated with itself and at least four immediately preceding fields or frames.
  3. A method according to Claim 1 or Claim 2 in which pixel activity is measured by rectification of the output of a spatial filter and pixel values are only modified when activity measured within a region surrounding the pixel to be modified exceeds a threshold.
  4. A method according to any of Claims 1 to 3 in which identical modifications are applied to the low-significance bits defining the values of all the pixels of a contiguous set of pixels comprising a spatial region within a field or frame so as to encode one or more bits of metadata within that spatial region of that field or frame.
  5. A method according to Claim 4 in which the two least-significant bits of a ten-bit value are modified.
  6. A method according to Claim 4 or Claim 5 in which low-significance bits defining the values of a contiguous set of pixels in a spatial region within a field or frame are modified so as to encode a fixed data value within that region that aids the recognition of that fixed data value when encoded in a different spatial region within the said field or frame.
  7. A method according to any of Claims 4 to 6 in which identical data is encoded in a plurality of spatial regions within the same field or frame.
  8. A method of processing a video data stream to decode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame in which decoded metadata describing a previous field or frame is decoded from a later field or frame and associated with a stored or delayed copy of the said preceding field or frame.
  9. A method according to Claim 8 comprising the steps of:
     * measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a first spatial region within a field or frame
     * measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a second spatial region within the said field or frame
     * determining an encoded metadata parameter from a comparison of the frequencies of occurrence of combinations of low-significance pixel-value bits in the said first and second spatial regions of the said field or frame.
  10. Apparatus that modifies a video data stream to encode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame by modification of data defining the pixel values of that current field or frame.
  11. Apparatus according to Claim 10 in which every field or frame is modified to encode metadata associated with itself and at least four immediately preceding fields or frames.
  12. Apparatus according to Claim 10 or Claim 11 in which pixel activity is measured by rectification of the output of a spatial filter and pixel values are only modified when activity measured within a region surrounding the pixel to be modified exceeds a threshold.
  13. Apparatus according to any of Claims 10 to 12 in which identical modifications are applied to the low-significance bits defining the values of all the pixels of a contiguous set of pixels comprising a spatial region within a field or frame so as to encode one or more bits of metadata within that spatial region of that field or frame.
  14. Apparatus according to Claim 13 in which the two least-significant bits of a ten-bit value are modified.
  15. Apparatus according to Claim 13 or Claim 14 in which low-significance bits defining the values of a contiguous set of pixels in a spatial region within a field or frame are modified so as to encode a fixed data value within that region that aids the recognition of that fixed data value when encoded in a different spatial region within the said field or frame.
  16. Apparatus according to any of Claims 13 to 15 in which identical data is encoded in a plurality of spatial regions within the same field or frame.
  17. Apparatus that processes a video data stream to decode at least four sets of metadata that respectively describe at least four video fields or frames preceding a current field or frame in which decoded metadata describing a previous field or frame is decoded from a later field or frame and associated with a stored or delayed copy of the said preceding field or frame.
  18. Apparatus according to Claim 17 comprising the steps of:
     * measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a first spatial region within a field or frame
     * measuring the frequencies of occurrence of at least two combinations of values of low-significance pixel-value bits within a second spatial region within the said field or frame
     * determining an encoded metadata parameter from a comparison of the frequencies of occurrence of combinations of low-significance pixel-value bits in the said first and second spatial regions of the said field or frame.
  19. Programmable apparatus programmed to implement a method according to any one of Claims 1 to 9.
  20. A computer program product adapted to cause programmable apparatus to implement a method according to any one of Claims 1 to 9.
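
The encoding and decoding steps recited in Claims 12 to 15 and in Claims 9 and 18 can be illustrated with a minimal Python sketch. It is not the patented implementation, and everything not stated in the claims is an assumption made for illustration: the filter kernel (-1, 2, -1) applied to the pixel's immediate horizontal neighbours, the threshold of 8, the two-bit patterns `PAT_A` and `PAT_B`, the rectangular region layout, and all function names. The reference region is assumed to carry a fixed pattern, and the data region carries the same pattern for a 0 bit or the other pattern for a 1 bit.

```python
from collections import Counter

LSB_MASK = 0b11  # two least-significant bits of a ten-bit pixel value (Claim 14)
PAT_A = 0b01     # assumed pattern: fixed reference value, also used for a 0 bit
PAT_B = 0b10     # assumed pattern: used for a 1 bit


def activity(row, x):
    """Rectified (absolute) output of an assumed 1-D high-pass filter (-1, 2, -1)."""
    left = row[max(x - 1, 0)]
    right = row[min(x + 1, len(row) - 1)]
    return abs(2 * row[x] - left - right)


def encode_region(frame, region, pattern, threshold=8):
    """Return a copy of frame with `pattern` written into the two LSBs of every
    pixel in region (x0, y0, x1, y1) whose activity exceeds the threshold."""
    x0, y0, x1, y1 = region
    out = [list(row) for row in frame]
    for y in range(y0, y1):
        for x in range(x0, x1):
            if activity(frame[y], x) > threshold:
                out[y][x] = (frame[y][x] & ~LSB_MASK) | pattern
    return out


def encode_bit(frame, ref_region, data_region, bit):
    """Write the fixed pattern into the reference region and one metadata bit
    into the data region."""
    out = encode_region(frame, ref_region, PAT_A)
    return encode_region(out, data_region, PAT_B if bit else PAT_A)


def lsb_counts(frame, region):
    """Frequencies of occurrence of each two-LSB combination within a region."""
    x0, y0, x1, y1 = region
    return Counter(frame[y][x] & LSB_MASK
                   for y in range(y0, y1) for x in range(x0, x1))


def decode_bit(frame, ref_region, data_region):
    """Recover the bit by comparing which pattern is more frequent in each region."""
    ref = lsb_counts(frame, ref_region)
    data = lsb_counts(frame, data_region)
    ref_prefers_a = ref[PAT_A] >= ref[PAT_B]
    data_prefers_a = data[PAT_A] >= data[PAT_B]
    return 0 if ref_prefers_a == data_prefers_a else 1
```

Because low-activity pixels are left untouched, the decoder in this sketch relies on counts over a whole region rather than on any single pixel value.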
GB1222397.0A 2012-12-12 2012-12-12 Method and apparatus for modifying a video stream Expired - Fee Related GB2509056B (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
GB1222397.0A GB2509056B (en) 2012-12-12 2012-12-12 Method and apparatus for modifying a video stream
US14/103,068 US9330428B2 (en) 2012-12-12 2013-12-11 Method and apparatus for modifying a video stream to encode metadata
US15/091,704 US9852489B2 (en) 2012-12-12 2016-04-06 Method and apparatus for modifying a video stream to encode metadata

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
GB1222397.0A GB2509056B (en) 2012-12-12 2012-12-12 Method and apparatus for modifying a video stream

Publications (3)

Publication Number Publication Date
GB201222397D0 GB201222397D0 (en) 2013-01-23
GB2509056A GB2509056A (en) 2014-06-25
GB2509056B GB2509056B (en) 2019-06-19

Family

ID=47602489

Family Applications (1)

Application Number Title Priority Date Filing Date
GB1222397.0A Expired - Fee Related GB2509056B (en) 2012-12-12 2012-12-12 Method and apparatus for modifying a video stream

Country Status (2)

Country Link
US (2) US9330428B2 (en)
GB (1) GB2509056B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9842387B2 (en) * 2015-06-18 2017-12-12 Vmware, Inc. Pixel perturbation for image quality measurement
US9866724B2 (en) 2015-06-18 2018-01-09 Vmware, Inc. Pixel perturbation for transmission of meta-information
US11570558B2 (en) * 2021-01-28 2023-01-31 Sonova Ag Stereo rendering systems and methods for a microphone assembly with dynamic tracking

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20100124353A1 (en) * 2008-11-18 2010-05-20 Cohen Benjamin M Robust Encoding of Metadata in Lossy Encoded Images

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2356509B (en) * 1999-11-16 2004-02-11 Sony Uk Ltd Video data formatting and storage
GB2356508B (en) * 1999-11-16 2004-03-17 Sony Uk Ltd Data processor and data processing method
US20040220926A1 (en) * 2000-01-03 2004-11-04 Interactual Technologies, Inc., A California Cpr[P Personalization services for entities from multiple sources
EP2352120B1 (en) * 2000-01-13 2016-03-30 Digimarc Corporation Network-based access to auxiliary data based on steganographic information
US7222163B1 (en) * 2000-04-07 2007-05-22 Virage, Inc. System and method for hosting of video content over a network
JP4140202B2 (en) * 2001-02-28 2008-08-27 三菱電機株式会社 Moving object detection device
US6947598B2 (en) * 2001-04-20 2005-09-20 Front Porch Digital Inc. Methods and apparatus for generating, including and using information relating to archived audio/video data
US7035468B2 (en) * 2001-04-20 2006-04-25 Front Porch Digital Inc. Methods and apparatus for archiving, indexing and accessing audio and video data
WO2005074278A2 (en) * 2004-01-23 2005-08-11 Sarnoff Corporation Method and apparatus for providing dentable encoding and encapsulation
US20050169544A1 (en) * 2004-02-02 2005-08-04 Clark Adam L. System and method for encoding and decoding video
JP4304108B2 (en) * 2004-03-31 2009-07-29 株式会社東芝 METADATA DISTRIBUTION DEVICE, VIDEO REPRODUCTION DEVICE, AND VIDEO REPRODUCTION SYSTEM
EP1815796A4 (en) * 2004-11-17 2009-10-28 Hitachi Medical Corp Ultrasonograph and ultrasonic image display method
GB2425906B (en) * 2005-05-05 2011-04-06 Sony Uk Ltd Data processing apparatus and method
KR20070011092A (en) * 2005-07-20 2007-01-24 삼성전자주식회사 Method and apparatus for encoding multimedia contents and method and system for applying encoded multimedia contents
US20070074265A1 (en) * 2005-09-26 2007-03-29 Bennett James D Video processor operable to produce motion picture expert group (MPEG) standard compliant video stream(s) from video data and metadata
US8443276B2 (en) * 2006-03-28 2013-05-14 Hewlett-Packard Development Company, L.P. System and data model for shared viewing and editing of time-based media
JP2007312126A (en) * 2006-05-18 2007-11-29 Toshiba Corp Image processing circuit
US8019167B2 (en) * 2007-01-03 2011-09-13 Human Monitoring Ltd. Compressing high resolution images in a low resolution video
US7995800B2 (en) * 2007-02-28 2011-08-09 Imec System and method for motion detection and the use thereof in video coding
US9549197B2 (en) * 2010-08-16 2017-01-17 Dolby Laboratories Licensing Corporation Visual dynamic range timestamp to enhance data coherency and potential of metadata using delay information

Also Published As

Publication number Publication date
GB201222397D0 (en) 2013-01-23
US20160217544A1 (en) 2016-07-28
US9330428B2 (en) 2016-05-03
US9852489B2 (en) 2017-12-26
GB2509056B (en) 2019-06-19
US20140161304A1 (en) 2014-06-12

Similar Documents

Publication Publication Date Title
CN100369488C (en) Data block noise detector and data block noise eliminator
US9536545B2 (en) Audio visual signature, method of deriving a signature, and method of comparing audio-visual data background
JP5553553B2 (en) Video processing apparatus and video processing apparatus control method
US8860883B2 (en) Method and apparatus for providing signatures of audio/video signals and for making use thereof
US7940333B2 (en) Gradation control apparatus and gradation control method
KR100683415B1 (en) Method for processing dead pixel
JPH0775106A (en) Adaptive quantization coding device and its method
US9852489B2 (en) Method and apparatus for modifying a video stream to encode metadata
WO2007075065A1 (en) Device of processing dead pixel
GB2437337A (en) Measuring block artefacts in video data using an auto-correlation function
US6744818B2 (en) Method and apparatus for visual perception encoding
US8023765B2 (en) Block noise removal device
JP4749377B2 (en) Block noise removal device
US6766063B2 (en) Generation adaptive filtering for subsampling component video as input to a nonlinear editing system
WO2005004489A1 (en) Block distortion detection device, block distortion detection method, and video signal processing device
US7956932B2 (en) Image signal processing apparatus, method of controlling the same, and television signal receiving apparatus
CN1476234A (en) Device used for detecting frequency characteristics of signal and method thereof
JP2005012641A (en) Block noise detecting device and block noise eliminating device using the same
JP6614771B2 (en) Digital watermark embedding device and program thereof, and digital watermark detection device and program thereof
GB2428924A (en) Automatic detection of still frames in video content
CA2650944C (en) Video processing system providing interlaced video field inversion detection features and related methods
JP2007312369A (en) Block noise removal device
GB2487499A (en) Audio-Visual Signature, Method of Deriving a Signature, and Method of Comparing Audio-Visual Data
Lee et al. A Pixel Value Ordering Predictor for High-Capacity Reversible Data Hiding
CA2686869C (en) Method and apparatus for providing signatures of audio/video signals and for making use thereof

Legal Events

Date Code Title Description
PCNP Patent ceased through non-payment of renewal fee

Effective date: 20201212